CN111241077A - Financial fraud behavior identification method based on internet data - Google Patents

Financial fraud behavior identification method based on internet data Download PDF

Info

Publication number
CN111241077A
CN111241077A CN202010003646.0A CN202010003646A CN111241077A CN 111241077 A CN111241077 A CN 111241077A CN 202010003646 A CN202010003646 A CN 202010003646A CN 111241077 A CN111241077 A CN 111241077A
Authority
CN
China
Prior art keywords
data
internet
public
financial
article
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010003646.0A
Other languages
Chinese (zh)
Other versions
CN111241077B (en
Inventor
翟恩荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan XW Bank Co Ltd
Original Assignee
Sichuan XW Bank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan XW Bank Co Ltd filed Critical Sichuan XW Bank Co Ltd
Priority to CN202010003646.0A priority Critical patent/CN111241077B/en
Publication of CN111241077A publication Critical patent/CN111241077A/en
Application granted granted Critical
Publication of CN111241077B publication Critical patent/CN111241077B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Finance (AREA)
  • Data Mining & Analysis (AREA)
  • Accounting & Taxation (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a financial fraud behavior identification method based on internet data, which comprises the following steps: A. collecting data on the Internet in real time, wherein the data at least comprises data of a news portal website, a financial forum and a financial community; B. cleaning the acquired data, and carrying out normalization processing on heterogeneous multi-source dirty data to obtain structured data; C. identifying negative public sentiments in the structured data through an emotion analysis method based on deep learning; D. calculating a public sentiment index according to the configuration information; E. and identifying financial fraud behaviors on the Internet according to the public opinion index, and performing early warning. The invention can monitor the public websites, communities, forums and the like on the Internet in real time on the premise of not optimizing intelligent configuration, can early warn financial fraudulent behaviors in the public websites, communities, forums and the like at the first time, and can monitor different fields and different platforms and identify the financial fraudulent behaviors by configuring the monitored contents.

Description

Financial fraud behavior identification method based on internet data
Technical Field
The invention relates to a financial fraud identification method, in particular to a financial fraud identification method based on internet data.
Background
In the current period of compliance supervision, new opportunities and challenges are faced by consuming financial institutions, including how to achieve smooth transition from the wild expansion phase to the robust development phase, how to face normal customer consumption degradation and the credit customers are seriously more, and the like, which are all difficult problems that are not avoidable by consuming financial institutions. Based on the industry background, bank risks need to be comprehensively grasped, and therefore an effective early warning system is constructed, risks are reduced, and crisis is avoided.
The existing financial fraud identification method generally processes fraud, so that the fraud cannot be prevented, and the reaction is slow after the fraud occurs, which often causes significant loss. When financial fraud occurs, the fraud behavior can be identified effectively by identifying the identity and identifying information such as fingerprints of the user. However, both of the above two methods cannot identify the sudden black product of wool type, and when a sudden problem or other hole leakage occurs in the system, a lawless person attacks the system by means of a system leak, which often causes great loss.
Disclosure of Invention
The invention provides a financial fraud behavior recognition method based on internet data, which can be used for timely early warning agency attacks, black product attacks and the like in network communities or forums by monitoring fraud behaviors on the internet.
The invention relates to a financial fraud behavior recognition method based on internet data, which comprises the following steps:
A. collecting data on the Internet in real time, wherein the data at least comprises data of a news portal website, a financial forum and a financial community;
B. cleaning the acquired data, and carrying out normalization processing on heterogeneous multi-source dirty data to obtain structured data; the structured data is relational model data.
C. And identifying negative public sentiments in the structured data through an emotion analysis method based on deep learning. The traditional emotion analysis technology is used for analyzing text emotion according to manually labeled emotion characteristics by using traditional machine learning algorithms such as SVM (support vector machine), CRF (learning reference number), and the like, but supervised learning depends on a large amount of manually labeled data, so that a system based on the supervised learning needs to pay high labeling cost. The emotion analysis method based on deep learning adopts a recurrent neural network to find characteristics related to tasks, avoids artificial characteristic design depending on specific tasks, and introduces an emotion polarity transfer model to strengthen capture of text relevance according to the front and back relevance between sentence words. The method based on deep learning is equivalent to the current method for manually marking the emotional characteristics in performance, but saves a large amount of workload of manual marking. In the prior art, a great deal of publications are available for emotion analysis methods based on deep learning, for example, patent application numbers 201711417352.7, 201811617266.5, 201810290094.9 and other published patent application documents, and deep learning is also a mature emotion analysis method, which is not the innovation point of the present invention and is not described in detail herein.
D. Public sentiment indexes are calculated according to the configuration information, so that different types of public sentiment indexes are calculated, such as fraud attack indexes, platform thunderstorm indexes, black-yield dynamic indexes and the like;
E. and identifying financial fraud behaviors on the Internet according to the public opinion index, carrying out early warning, and informing relevant management personnel of the financial fraud behaviors, so that appropriate measures are taken to stop the fraud behaviors in time.
The identification method disclosed by the invention is used for monitoring the public internet data crawled from the outside in real time, generating the corresponding public opinion index, and carrying out real-time early warning notification when the public opinion index is abnormal, so that the functions of identifying the black birth attack, monitoring the thunderstorm public opinion, calculating the fraud attack index and the like are realized. The risk can be predicted in advance through real-time early warning, measures can be taken in advance, strategies are adjusted, and loss is reduced to the minimum.
Further, when data on the internet is collected in real time in the step A, distributed capture is firstly carried out on the dynamic webpage on the internet, and when the data are captured, a master node in a distributed architecture is responsible for scheduling, and a slave node is responsible for capturing; and then, carrying out structured data extraction on the captured HTML codes, and converting the semi-structured HTML codes into required structured data. The semi-structured data is data of a non-relational model and has a basic fixed structure mode. The capture speed can be increased through a distributed architecture, and horizontal expansion is supported, so that field information of different websites can be quickly positioned, different information can be extracted according to different sources, and the information is finally stored in a relational database.
Furthermore, the dynamic webpage is subjected to distributed grabbing through the JS engine, so that the HTML code of the finally displayed page is obtained.
Further, in the step B, when data cleaning is performed, the unstructured data is converted into structured data, and data deduplication and data cleaning are performed. Unstructured data refers to data without fixed patterns, such as WORD, PDF, PPT, EXL, pictures in various formats, video, etc.
Specifically, the data deduplication is realized by a Bloom Filter (a binary vector data structure) -based data structure, and is based on a URL; the data cleaning is based on configuration, and json format data is converted into formatted data. The json data is a data structure assembled according to a specific format, the data can be well minimized through the data structure, useful data can be compressed as much as possible, but the data needs to be formatted to view the json data format, and the data can be viewed after the data is formatted.
Further, step D includes:
D1. setting a keyword list related to financial fraud, and marking an article corresponding to the data when the collected data has the same keywords as the keywords in the keyword list;
D2. calculating the public sentiment index of the current article: c, the public sentiment index of the current article is an intercept + score, the intercept is the sentiment index obtained by the current article through the sentiment analysis method based on deep learning in the step C, and the score is a numerical value obtained by carrying out predetermined calculation on various attributes of the current article;
D3. computing platform public opinion index: calculating the public sentiment index A of n articles searched by the platform in a set time rangeiI is article number, i is less than or equal to n, and public sentiment index AiThe highest article is set with weight n and public sentiment meansNumber AiThe lowest article is set to weight 1, and the platform public sentiment index is sum ([ n-top (a))i)]×Ai) /(1+2+ … + n), where top (A)i) Public opinion index A of the current ith articleiThe rank value of (c).
On the basis, in the step E, when the public sentiment index of the single article or the public sentiment index of the platform reaches a set threshold value, a related manager is notified in a short message and/or mail mode.
The method for identifying the financial fraudulent conduct based on the internet data can monitor open websites, communities, forums and the like on the internet in real time on the premise of not optimizing intelligent configuration, early warn the financial fraudulent conduct in the internet at the first time, and monitor different fields and different platforms and identify the financial fraudulent conduct by configuring the monitored content.
The present invention will be described in further detail with reference to the following examples. This should not be understood as limiting the scope of the above-described subject matter of the present invention to the following examples. Various substitutions and alterations according to the general knowledge and conventional practice in the art are intended to be included within the scope of the present invention without departing from the technical spirit of the present invention as described above.
Drawings
FIG. 1 is a flow chart of the method for identifying financial fraud based on Internet data according to the present invention.
Fig. 2 is a flow chart of real-time data acquisition over the internet.
Detailed Description
The method for identifying financial fraud based on internet data of the invention as shown in fig. 1 comprises:
A. as shown in fig. 2, data on the internet, including at least data of news portal sites, financial forums, and financial communities, are collected in real time by a web crawler algorithm. When data are collected, the dynamic webpage is rendered through the JS engine to perform distributed capture on the dynamic webpage on the Internet, and therefore the HTML code of the finally displayed page is obtained. During fetching, the master node in the distributed architecture is responsible for scheduling, and the slave nodes are responsible for fetching. And then, extracting structured data of the captured HTML codes, and converting the semi-structured HTML codes into the required structured data to obtain a Web page source file and a corresponding URL (uniform resource locator). The capture speed can be increased through a distributed architecture, and horizontal expansion is supported, so that field information of different websites can be quickly positioned, different information can be extracted according to different sources, and the information is finally stored in a relational database.
B. The collected data are cleaned, heterogeneous multi-source dirty data are normalized, unstructured data are converted into structured data, and data de-duplication and data cleaning are performed. Wherein the data deduplication is realized by a data structure based on BloomFilter (a binary vector data structure) and based on URL. Data cleansing is based on configuration to convert json formatted data to formatted data.
C. In the field of financial wind control, the most important application scene for natural language processing is public opinion analysis. The text is analyzed and mined through technologies such as text clustering and emotion analysis, and the discovery and tracking of negative public sentiment are achieved. The negative public opinion analysis identification needs to consider two aspects of scale and negative degree, and needs to find out the negative public opinion which rises faster in a period of time or participates in larger scale. The scale can be judged by the number of related web pages after the text clustering, and the negative degree is identified by the emotion analysis technology of the text.
And identifying negative public sentiments in the structured data through an emotion analysis method based on deep learning. The emotion analysis method based on deep learning adopts a recurrent neural network to find characteristics related to tasks, avoids artificial characteristic design depending on specific tasks, and introduces an emotion polarity transfer model to strengthen capture of text relevance according to the front and back relevance between sentence words. The method based on deep learning is equivalent to the current method for manually marking the emotional characteristics in performance, but saves a large amount of workload of manual marking. In the prior art, a great deal of published documents such as patent application numbers 201711417352.7, 201811617266.5, 201810290094.9 and the like exist in emotion analysis methods based on deep learning, and deep learning is also a mature emotion analysis method, which is not the innovation point of the present invention and is not described in detail herein.
D. And calculating the public sentiment indexes according to the configuration information, thereby realizing the calculation of different types of public sentiment indexes, such as fraud attack indexes, platform thunderstorm indexes, black yield dynamic indexes and the like. The method specifically comprises the following steps:
D1. and setting a keyword list related to financial fraud, and marking an article corresponding to the data when the collected data has the same keywords as the keywords in the keyword list. The keyword list is for example:
' good loan | | XX bank | | nationwide third internet bank | | | drip credit | | this day head bar | | | borrow | | pay treasure | | particle credit | | interest shop large amount | to stage | | | the profit | good work loan | | (oral sub | | | roll-over | | | money large amount borrow | | | theft | | | use | | | | pay off | | | | | | fraud | | | | profit | | | | | | | pay off | | | | | | | fraudulent use | pay off | | share | | | transfer | BUG | | | lose | | fraud | |, (i) lose | fraud | fraudulent use | in | | fraud | fraudulent use | the identity | lose | Code [ password cracking [ library ], code [ cipher code ] and code [ library ].
D2. Calculating the public sentiment index of the current article: and D, scoring the public sentiment index of the current article, wherein the intercept is the sentiment index obtained by the current article through the sentiment analysis method based on deep learning in the step C, and the score is a numerical value obtained by performing a preset calculation mode on each attribute of the current article. Wherein the attributes include: reading amount, comment amount, type (original creation or reprinting), days until the article is released, and the number of keywords contained in the article. Calculating the public sentiment index of the current article as follows:
a. for example, the number of keywords "openning" included in this document is n, and when n <1, the term "openning" is scored as 0; n is 1, and the score of the term "Kouzi" is 20; n > -2, and "kouzi" term score-100;
b. the number of whether the keyword roll is included is n, when n > is 1, the term roll is 100; n is less than 1, and the score of the term of stripping is 0.
c. The visit amount of the article is n, and when n is less than 10, the score of the article is 5 points; 100> n > -10, which score 20; 1000> n > -100, with a score of 50; n > 1000, the term score 100;
d. the evaluation amount of the article is n, and when n is less than 10, the score is 5; n > is 10, and the score of the item is 100;
e. the number of days of release of the article is n, and when n is less than 2, the score is 100; 7> n > -2, the term score 70; n > is 7, and the score of the term is 20.
In the calculation:
(1) a plurality of keywords can be defined, and each keyword is defined, the number of the keywords contained in the current article needs to be counted.
(2) The intercept is limited to (0, 30) minutes, and the maximum is 30 minutes;
(3) the score limit is (0, 70) points, with a maximum of 70 points;
D3. computing platform public opinion index: a platform public opinion index is generated every hour every day. Calculating the respective public sentiment indexes A of n articles searched by the platform in each houriI is article number, i is less than or equal to n, and public sentiment index AiThe highest article is set with weight n and public sentiment index AiThe lowest article is set to weight 1, and the platform public sentiment index is sum ([ n-top (a))i)]× Ai) /(1+2+ … + n), where top (A)i) Public opinion index A of the current ith articleiThe rank value of (c).
E. Identifying financial fraud behaviors on the Internet according to the public sentiment index, when one of the public sentiment index, the score and the sentiment index of a single article reaches a set threshold, or the public sentiment index of a platform reaches the threshold, or the public sentiment index of the platform is abnormal, such as: the current public sentiment index of the platform is larger than the maximum value of the public sentiment index of the platform at yesterday by 1.5; the public sentiment index of the current platform is larger than the average value of the public sentiment index of the platform at the previous 7 days multiplied by 1.5, and relevant managers are informed in a short message and/or mail mode, so that appropriate measures are taken to stop the occurrence of fraud behaviors in time.
For a single article, the alert content includes: time, alarm content and article links, for example: [ xx Bank ] 2019-01-1812: 39:40] [ financial fraud attack hint: the financial fraud attack index is 80 points https:// xx.
For the platform, the alarm content comprises: time, content, links ranking the top three of a single article, for example: [ xx Bank ] 2019-01-1812: 39:40] [ financial fraud attack hint: the financial fraud attack index is 80 points ] https:// xx.cc.com, https:// pp.mm.com, https:// gg.uu.com.

Claims (7)

1. The identification method of financial fraud based on internet data is characterized by comprising the following steps:
A. collecting data on the Internet in real time, wherein the data at least comprises data of a news portal website, a financial forum and a financial community;
B. cleaning the acquired data, and carrying out normalization processing on heterogeneous multi-source dirty data to obtain structured data;
C. identifying negative public sentiments in the structured data through an emotion analysis method based on deep learning;
D. calculating a public sentiment index according to the configuration information;
E. and identifying financial fraud behaviors on the Internet according to the public opinion index, and performing early warning.
2. A method for identifying internet data based financial fraud as claimed in claim 1, characterized by: in the step A, when data on the Internet are acquired in real time, distributed capture is firstly carried out on dynamic webpages on the Internet, and during capture, a master node in a distributed architecture is responsible for scheduling, and a slave node is responsible for capturing; and then, extracting the structural data of the captured HTML codes, and converting the semi-structured HTML codes into the required structural data.
3. A method for identifying internet data based financial fraud as claimed in claim 2, characterized by: the dynamic webpage distributed grabbing is that the dynamic webpage is rendered through the JS engine, so that HTML codes of the finally displayed page are obtained.
4. A method for identifying internet data based financial fraud as claimed in claim 1, characterized by: and B, converting the unstructured data into structured data during data cleaning, and performing data deduplication and data cleaning.
5. A method for identifying internet data based financial fraud as claimed in claim 4, characterized by: the data deduplication is realized by a data structure based on Bloom Filter based on URL; the data cleaning is based on configuration, and json format data is converted into formatted data.
6. A method for identifying internet data based financial fraud as claimed in claim 1, characterized by: the step D comprises the following steps:
D1. setting a keyword list related to financial fraud, and marking an article corresponding to the data when the collected data has the same keywords as the keywords in the keyword list;
D2. calculating the public sentiment index of the current article: c, the public sentiment index of the current article is an intercept + score, the intercept is the sentiment index obtained by the current article through the sentiment analysis method based on deep learning in the step C, and the score is a numerical value obtained by carrying out predetermined calculation on various attributes of the current article;
D3. computing platform public opinion index: calculating the public sentiment index A of n articles searched by the platform in a set time rangeiI is article number, i is less than or equal to n, and public sentiment index AiThe highest article is set with weight n and public sentiment index AiThe lowest article is set to weight 1, and the platform public sentiment index is sum ([ n [ ])-top(Ai)]×Ai) /(1+2+ … + n), where top (A)i) Public opinion index A of the current ith articleiThe rank value of (c).
7. A method for identifying internet data based financial fraud as claimed in claim 6, characterized by: and E, when the public sentiment index of the single article or the public sentiment index of the platform reaches a set threshold value, informing relevant managers in a short message and/or mail mode.
CN202010003646.0A 2020-01-03 2020-01-03 Identification method of financial fraud based on internet data Active CN111241077B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010003646.0A CN111241077B (en) 2020-01-03 2020-01-03 Identification method of financial fraud based on internet data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010003646.0A CN111241077B (en) 2020-01-03 2020-01-03 Identification method of financial fraud based on internet data

Publications (2)

Publication Number Publication Date
CN111241077A true CN111241077A (en) 2020-06-05
CN111241077B CN111241077B (en) 2023-06-09

Family

ID=70874312

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010003646.0A Active CN111241077B (en) 2020-01-03 2020-01-03 Identification method of financial fraud based on internet data

Country Status (1)

Country Link
CN (1) CN111241077B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113821794A (en) * 2021-09-14 2021-12-21 北京八分量信息科技有限公司 Distributed trusted computing system and method
CN114021039A (en) * 2021-11-10 2022-02-08 深圳安巽科技有限公司 Anti-fraud joint defense early warning method, system and storage medium

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544255A (en) * 2013-10-15 2014-01-29 常州大学 Text semantic relativity based network public opinion information analysis method
US9514133B1 (en) * 2013-06-25 2016-12-06 Jpmorgan Chase Bank, N.A. System and method for customized sentiment signal generation through machine learning based streaming text analytics
CN106960358A (en) * 2017-01-13 2017-07-18 重庆小富农康农业科技服务有限公司 A kind of financial fraud behavior based on rural area electronic commerce big data deep learning quantifies detecting system
CN108647823A (en) * 2018-05-10 2018-10-12 北京航空航天大学 Stock certificate data analysis method based on deep learning and device
CN108932291A (en) * 2018-05-23 2018-12-04 福建亿榕信息技术有限公司 Power grid public sentiment evaluation method, storage medium and computer
CN108959383A (en) * 2018-05-31 2018-12-07 平安科技(深圳)有限公司 Analysis method, device and the computer readable storage medium of network public-opinion
CN109241199A (en) * 2018-08-08 2019-01-18 广州初星科技有限公司 A method of it is found towards financial knowledge mapping
US20190026840A1 (en) * 2017-07-19 2019-01-24 Zhouyi TANG Method and System for Providing Real-Time Visual Information Based on Financial Flow Data
CN109471965A (en) * 2018-10-26 2019-03-15 四川才子软件信息网络有限公司 A kind of network public-opinion data sampling and processing method and monitoring platform based on big data
CN109684460A (en) * 2018-12-28 2019-04-26 四川新网银行股份有限公司 A kind of calculation method and system of the negative network public-opinion index based on deep learning
CN110008336A (en) * 2019-01-14 2019-07-12 阿里巴巴集团控股有限公司 A kind of public sentiment method for early warning and system based on deep learning
CN110110156A (en) * 2019-04-04 2019-08-09 平安科技(深圳)有限公司 Industry public sentiment monitoring method, device, computer equipment and storage medium
CN110134849A (en) * 2019-05-20 2019-08-16 瑞森网安(福建)信息科技有限公司 A kind of network public-opinion monitoring method and system
CN110232109A (en) * 2019-05-17 2019-09-13 深圳市兴海物联科技有限公司 A kind of Internet public opinion analysis method and system
CN110276679A (en) * 2019-05-23 2019-09-24 武汉大学 A kind of network individual credit fraud detection method towards deep learning
CN110297870A (en) * 2019-05-30 2019-10-01 南京邮电大学 A kind of financial field Chinese headline sensibility classification method
CN110413863A (en) * 2019-08-01 2019-11-05 信雅达系统工程股份有限公司 A kind of public sentiment news duplicate removal and method for pushing based on deep learning
CN110633373A (en) * 2018-06-20 2019-12-31 上海财经大学 Automobile public opinion analysis method based on knowledge graph and deep learning

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9514133B1 (en) * 2013-06-25 2016-12-06 Jpmorgan Chase Bank, N.A. System and method for customized sentiment signal generation through machine learning based streaming text analytics
CN103544255A (en) * 2013-10-15 2014-01-29 常州大学 Text semantic relativity based network public opinion information analysis method
CN106960358A (en) * 2017-01-13 2017-07-18 重庆小富农康农业科技服务有限公司 A kind of financial fraud behavior based on rural area electronic commerce big data deep learning quantifies detecting system
US20190026840A1 (en) * 2017-07-19 2019-01-24 Zhouyi TANG Method and System for Providing Real-Time Visual Information Based on Financial Flow Data
CN108647823A (en) * 2018-05-10 2018-10-12 北京航空航天大学 Stock certificate data analysis method based on deep learning and device
CN108932291A (en) * 2018-05-23 2018-12-04 福建亿榕信息技术有限公司 Power grid public sentiment evaluation method, storage medium and computer
CN108959383A (en) * 2018-05-31 2018-12-07 平安科技(深圳)有限公司 Analysis method, device and the computer readable storage medium of network public-opinion
CN110633373A (en) * 2018-06-20 2019-12-31 上海财经大学 Automobile public opinion analysis method based on knowledge graph and deep learning
CN109241199A (en) * 2018-08-08 2019-01-18 广州初星科技有限公司 A method of it is found towards financial knowledge mapping
CN109471965A (en) * 2018-10-26 2019-03-15 四川才子软件信息网络有限公司 A kind of network public-opinion data sampling and processing method and monitoring platform based on big data
CN109684460A (en) * 2018-12-28 2019-04-26 四川新网银行股份有限公司 A kind of calculation method and system of the negative network public-opinion index based on deep learning
CN110008336A (en) * 2019-01-14 2019-07-12 阿里巴巴集团控股有限公司 A kind of public sentiment method for early warning and system based on deep learning
CN110110156A (en) * 2019-04-04 2019-08-09 平安科技(深圳)有限公司 Industry public sentiment monitoring method, device, computer equipment and storage medium
CN110232109A (en) * 2019-05-17 2019-09-13 深圳市兴海物联科技有限公司 A kind of Internet public opinion analysis method and system
CN110134849A (en) * 2019-05-20 2019-08-16 瑞森网安(福建)信息科技有限公司 A kind of network public-opinion monitoring method and system
CN110276679A (en) * 2019-05-23 2019-09-24 武汉大学 A kind of network individual credit fraud detection method towards deep learning
CN110297870A (en) * 2019-05-30 2019-10-01 南京邮电大学 A kind of financial field Chinese headline sensibility classification method
CN110413863A (en) * 2019-08-01 2019-11-05 信雅达系统工程股份有限公司 A kind of public sentiment news duplicate removal and method for pushing based on deep learning

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
全颖等: "基于AI模式的互联网金融创新创业探究", 《科技创新与生产力》 *
周欣欣: "基于深度学习的网络交易欺诈检测模型的研究", 《中国优秀硕士学位论文全文数据库》 *
张庆庆等: "基于深度信念网络的文本情感分类研究", 《西北工业大学学报(社会科学版)》 *
蒋知义;马王荣;邹凯;李黎;: "基于情感倾向性分析的网络舆情情感演化特征研究" *
邢云菲;王晰巍;王铎;韦雅楠;: "基于信息熵的新媒体环境下负面网络舆情监测指标体系研究" *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113821794A (en) * 2021-09-14 2021-12-21 北京八分量信息科技有限公司 Distributed trusted computing system and method
CN113821794B (en) * 2021-09-14 2023-08-18 北京八分量信息科技有限公司 Distributed trusted computing system and method
CN114021039A (en) * 2021-11-10 2022-02-08 深圳安巽科技有限公司 Anti-fraud joint defense early warning method, system and storage medium
CN114021039B (en) * 2021-11-10 2022-08-23 深圳安巽科技有限公司 Anti-fraud joint defense early warning method, system and storage medium

Also Published As

Publication number Publication date
CN111241077B (en) 2023-06-09

Similar Documents

Publication Publication Date Title
Batra et al. Integrating StockTwits with sentiment analysis for better prediction of stock price movement
US9990356B2 (en) Device and method for analyzing reputation for objects by data mining
AU2022201654A1 (en) System and engine for seeded clustering of news events
KR101907041B1 (en) Apparatus and method for managing risk based on prediction on social web media
US8010524B2 (en) Method of monitoring electronic media
Tanwar et al. Unravelling unstructured data: A wealth of information in big data
CN111914087B (en) Public opinion analysis method
Ghankutkar et al. Modelling machine learning for analysing crime news
CN111241077A (en) Financial fraud behavior identification method based on internet data
CN113723737A (en) Enterprise portrait-based policy matching method, device, equipment and medium
Sood et al. Review the role of artificial intelligence in detecting and preventing financial fraud using natural language processing
Konagala et al. Fake news detection using deep learning: supervised fake news detection analysis in social media with semantic similarity method
CN115187066A (en) Risk identification method and device, electronic equipment and storage medium
CN114529383A (en) Method and system for realizing tax payment tracking and tax loss early warning
Jishag et al. Automated review analyzing system using sentiment analysis
Lee et al. Novel methods for resolving false positives during the detection of fraudulent activities on stock market financial discussion boards
Guha Exploring the Field of Text Mining
CN115345401A (en) Six-dimensional analysis method for finding enterprise financial risk
Ferdous et al. Assortment of bangladeshi e-commerce site reviews using machine learning approaches
Mascii et al. Sentiment analysis for performance evaluation of maintenance in healthcare
Li et al. Incorporation of Social Media Data into Macroeconomic Forecast Systems: A Mixed Frequency Modelling Approach.
CN117389998B (en) Data storage method and device based on large model
CN113706207B (en) Order success rate analysis method, device, equipment and medium based on semantic analysis
KR102580512B1 (en) Automated rpa learning apparatus and method for automatic sentence clustering deep learning model
KR20190085795A (en) On-line Risk Response Monitoring System through Analysis of Negative Buzz Media Transition and Amount of Conformity

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant