CN105279691A - Financial transaction detection method and equipment based on random forest model - Google Patents

Financial transaction detection method and equipment based on random forest model

Info

Publication number
CN105279691A
Authority
CN
China
Prior art keywords
decision
variable
transaction
random forest
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410361193.3A
Other languages
Chinese (zh)
Inventor
赵金涛
邱雪涛
杨鸿超
王骏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd
Priority to CN201410361193.3A
Publication of CN105279691A
Legal status: Pending

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a financial transaction detection method based on a random forest model. The method includes: (a) obtaining a historical transaction table and a fraudulent transaction table; (b) using the historical transaction table and the fraudulent transaction table to construct a sample data set that includes sample feature variables; (c) randomly drawing a plurality of samples from the sample data set with replacement; and (d) randomly selecting the same number of feature variables for each of the plurality of samples, so as to generate a decision tree model corresponding to that sample and thereby generate a random forest model. The invention also discloses financial transaction detection equipment based on the random forest model.

Description

Financial transaction detection method and equipment based on random forest model
Technical field
The present invention relates to the field of financial transaction fraud detection, and in particular to a financial transaction detection method and equipment based on a random forest model.
Background
In conventional methods for detecting fraudulent bank card transactions, the decision tree model has advantages such as a relatively small computational cost and classification rules that are easy to understand, and can to some extent meet the needs of fraud detection. However, a single decision tree model overfits easily, its classification rules tend to become complex, and its classification results are unstable. Moreover, when trained on imbalanced data, a decision tree model is clearly biased toward the majority class, which easily leads to inaccurate classification results.
Summary of the invention
To solve the above problems, according to one aspect of the present invention, a financial transaction detection method based on a random forest model is provided. The method comprises: (a) obtaining a historical transaction table and a fraudulent transaction table; (b) constructing a sample data set from the historical transaction table and the fraudulent transaction table, the sample data set comprising sample feature variables; (c) randomly drawing a plurality of samples from the sample data set with replacement; (d) randomly selecting the same number of feature variables for each of the plurality of samples, so as to generate a decision tree model corresponding to that sample and thereby generate a random forest model; (e) training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model; (f) determining the voting weight $q_i$ of each decision tree model based on the accuracy; and (g) using the response outputs $y_i$ of the decision tree models in the random forest model to the input financial transaction data, together with the voting weights $q_i$, obtaining the voting result RF according to the following formula and judging whether the financial transaction is fraudulent:
$$RF = \sum_{i=1}^{l} y_i \, q_i$$
where l is the number of decision tree models.
Training samples are generated by random sampling with replacement; for each sample the same number of randomly selected feature variables takes part in training to generate one decision tree model; and the class of a transaction is finally decided by a vote among the decision tree models. The detection method and equipment overcome the drawbacks of a single decision tree model, namely complex classification rules, a tendency to overfit and unstable classification accuracy, and also adapt well to the imbalance of bank card transaction data.
The above method may further comprise: (h) when the financial transaction is judged to be fraudulent, adding the transaction to a fraud detection result set; (i) confirming the transactions in the fraud detection result set and adding the transactions confirmed as fraud to the fraudulent transaction table; and (j) re-executing steps (a) and (b).
In the above method, step (e) further comprises: (e1) calculating the Gini coefficient of every split of every variable at every value in the training sample; (e2) taking the split with the smallest Gini coefficient as the first best split point; and (e3) dividing the training sample at the first best split point, and repeating (e1) and (e2) on each of the resulting training samples to determine a second best split point.
In the above method, the sample feature variables comprise original variables, context variables and statistical variables.
In the above method, the original variables include, but are not limited to, the transaction amount and transaction time obtained directly from the fraudulent transaction table and the historical transaction table.
In the above method, the context variables include, but are not limited to, whether transactions took place in the same region and whether transactions took place at the same merchant.
In the above method, the statistical variables include, but are not limited to, statistics over a period of time for a given card number or for the merchant at which that card number transacts.
In the above method, the number of selected feature variables is n, and the relationship between n and the total number N of feature variables is as follows:
In the above method, when the accuracy of a decision tree model is below a threshold, that decision tree model is discarded from the random forest model. Evaluating the individual decision trees removes those whose classification accuracy is too low, while each decision tree model is given its own voting weight, which improves the accuracy of the random forest model.
According to another aspect of the present invention, financial transaction detection equipment based on a random forest model is provided, comprising: means for obtaining a historical transaction table and a fraudulent transaction table; means for constructing a sample data set from the historical transaction table and the fraudulent transaction table, the sample data set comprising sample feature variables; means for randomly drawing a plurality of samples from the sample data set with replacement; means for randomly selecting the same number of feature variables for each of the plurality of samples, so as to generate a decision tree model corresponding to that sample and thereby generate a random forest model; means for training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model; means for determining the voting weight $q_i$ of each decision tree model based on the accuracy; and means for using the response outputs $y_i$ of the decision tree models in the random forest model to the input financial transaction data, together with the voting weights $q_i$, to obtain the voting result RF according to the following formula and to judge whether the financial transaction is fraudulent:
$$RF = \sum_{i=1}^{l} y_i \, q_i$$
where l is the number of decision tree models.
The above equipment may further comprise: means for adding the transaction to a fraud detection result set when the financial transaction is judged to be fraudulent; and means for confirming the transactions in the fraud detection result set and adding the transactions confirmed as fraud to the fraudulent transaction table.
In the above equipment, the means for training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model is configured to perform the following steps: (e1) calculating the Gini coefficient of every split of every variable at every value in the training sample; (e2) taking the split with the smallest Gini coefficient as the first best split point; and (e3) dividing the training sample at the first best split point, and repeating (e1) and (e2) on each of the resulting training samples to determine a second best split point.
In the above equipment, the sample feature variables comprise original variables, context variables and statistical variables.
In the above equipment, the original variables include, but are not limited to, the transaction amount and transaction time obtained directly from the fraudulent transaction table and the historical transaction table.
In the above equipment, the context variables include, but are not limited to, whether transactions took place in the same region and whether transactions took place at the same merchant.
In the above equipment, the statistical variables include, but are not limited to, statistics over a period of time for a given card number or for the merchant at which that card number transacts.
In the above equipment, the number of selected feature variables is n, and the relationship between n and the total number N of feature variables is as follows:
The above equipment may further comprise: means for discarding a decision tree model from the random forest model when the accuracy of that decision tree model is below a threshold.
Brief description of the drawings
After reading the specific embodiments of the present invention with reference to the accompanying drawings, those skilled in the art will understand the various aspects of the invention more clearly. Those skilled in the art should understand that the drawings serve only to explain the technical solution of the invention in conjunction with the embodiments and are not intended to limit the scope of protection of the invention.
Fig. 1 and Fig. 2 are schematic flowcharts of a financial transaction detection method based on a random forest model according to embodiments of the present application.
Detailed description of the embodiments
Described below are some of the many possible embodiments of the invention. They are intended to provide a basic understanding of the invention, not to identify its key or decisive elements or to limit the claimed scope. It is readily understood that, following the technical solution of the invention and without changing its substance, a person of ordinary skill in the art can propose other interchangeable implementations. The following embodiments and drawings are therefore merely illustrative of the technical solution of the invention and should not be regarded as the whole of the invention or as restricting or limiting its technical solution.
In general, the present application provides a financial transaction detection method and equipment based on a random forest. Training samples are generated by random sampling with replacement; for each sample the same number of randomly selected feature variables takes part in training to generate one decision tree model; and the class of a transaction is finally decided by a vote among the decision tree models.
The flow of the random-forest-based transaction detection method of the present application is shown in Fig. 1. The method comprises the steps of data sampling, feature extraction, data preprocessing, sample data generation, model training to generate the random forest, and detection of production transactions. Each step is described below:
1) Data sampling
All fraud data is extracted from the fraudulent transaction table, transaction data is extracted from the historical transaction table by card number, and the transaction records are labeled fraud and normal respectively. Because the historical transaction table itself contains fraudulent transaction data, card numbers that appear in the fraudulent transaction table must be excluded from the transactions extracted from the historical transaction table. Because fraudulent transactions account for only a very small share of production transactions, the ratio between fraudulent and normal transactions in the constructed sample data can take the empirical value 200 (to be adjusted with reference to the actual production ratio).
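The sampling step can be sketched in a few lines. This is a minimal illustration, not the patent's implementation: the dictionary keys `card` and `label`, the function name `build_sample`, and the reading of the empirical value 200 as "up to 200 normal rows per fraud row" are all assumptions made for the example.

```python
import random

def build_sample(fraud_rows, history_rows, ratio=200, seed=0):
    """Label fraud rows, exclude history rows whose card number appears in the
    fraud table, and draw up to `ratio` normal rows per fraud row.
    The meaning given to `ratio` here is an assumption, not stated in the source."""
    rng = random.Random(seed)
    fraud_cards = {row["card"] for row in fraud_rows}
    # Cards seen in the fraud table are removed from the normal pool.
    normal_pool = [r for r in history_rows if r["card"] not in fraud_cards]
    n_normal = min(len(normal_pool), ratio * len(fraud_rows))
    normal = rng.sample(normal_pool, n_normal)
    labeled = [dict(r, label="fraud") for r in fraud_rows]
    labeled += [dict(r, label="normal") for r in normal]
    return labeled
```

Excluding by card number rather than by transaction follows the text above: every transaction of a card that appears in the fraud table is kept out of the normal class.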
2) Feature extraction
Sample feature variables are divided into original variables, context variables and statistical variables. Original variables are obtained directly from the fraudulent transaction table and the historical transaction table without calculation, for example transaction amount and transaction time. Context variables are derived from other transactions of the same card number and require some calculation or judgment, for example whether the transactions took place in the same region or at the same merchant. Statistical variables are statistics of the card number, or of the merchant at which the card transacts, over a period of time, for example the average amount per transaction of the card within 30 days or the average number of transactions of the card per day.
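As an illustration of the three variable types, the sketch below derives one feature row for a transaction from the earlier transactions of the same card. The field names (`time`, `amount`, `region`, `merchant`), the choice of the immediately preceding transaction as the context, and the 30-day window are assumptions chosen to match the examples in the text.

```python
from datetime import datetime, timedelta

def context_and_stat_features(txn, card_history, window_days=30):
    """txn and each entry of card_history are dicts with 'time' (datetime),
    'amount', 'region' and 'merchant'. card_history holds earlier transactions
    of the same card, ordered by time (hypothetical layout)."""
    prev = card_history[-1] if card_history else None
    start = txn["time"] - timedelta(days=window_days)
    recent = [t for t in card_history if start <= t["time"] < txn["time"]]
    total = sum(t["amount"] for t in recent)
    return {
        # Original variables: copied straight from the record.
        "amount": txn["amount"],
        "hour": txn["time"].hour,
        # Context variables: compare with the previous transaction of the card.
        "same_region": prev is not None and prev["region"] == txn["region"],
        "same_merchant": prev is not None and prev["merchant"] == txn["merchant"],
        # Statistical variables: aggregates over the window.
        "avg_amount_30d": total / len(recent) if recent else 0.0,
        "avg_count_per_day_30d": len(recent) / window_days,
    }
```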
3) Data preprocessing
A. Compute the feature variables of the sampled data;
B. Discretize variables: continuous variables in the sample data, such as transaction amount, are discretized;
C. Randomly shuffle the sample data.
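Steps B and C can be sketched as follows. The source does not say how the bins are chosen, so the cut points passed as `amount_edges`, the `amount_bin` field and both function names are hypothetical.

```python
import bisect
import random

def discretize(value, edges):
    """Map a continuous value to a bin index. `edges` are ascending cut points;
    a value equal to a cut point falls into the higher bin."""
    return bisect.bisect_right(edges, value)

def preprocess(samples, amount_edges, seed=0):
    """Add a discretized amount to each sample, then shuffle the samples."""
    out = [dict(s, amount_bin=discretize(s["amount"], amount_edges)) for s in samples]
    random.Random(seed).shuffle(out)
    return out
```

With cut points `[100, 1000]`, amounts below 100 fall in bin 0, amounts from 100 up to but not including 1000 in bin 1, and larger amounts in bin 2.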
4) Generating sample data and thereby the random forest model
The flowchart for generating the random forest model is shown in Fig. 2. The model is trained in the following steps:
A. Random sampling
Let S be the preprocessed sample. S is randomly sampled k times with replacement, each draw having two thirds the size of S, giving the sampled set $\{s_1, \ldots, s_k\}$.
B. Selecting feature variables
Let N be the total number of feature variables. For each sample, n feature variables are chosen, and the variables chosen for different samples should differ as far as possible.
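Steps A and B amount to drawing, for each of the k trees, a list of row indices with replacement and a set of n distinct feature indices. The sketch below takes n as a caller-supplied parameter. Plain uniform sampling is used for the features, which does not by itself enforce that the feature sets of different samples differ as far as possible; that simplification, like the function name, is an assumption of this example.

```python
import random

def draw_bootstrap_samples(S, k, n_features_total, n, seed=0):
    """Return k (row_indices, feature_indices) plans, one per decision tree."""
    rng = random.Random(seed)
    size = (2 * len(S)) // 3  # each draw is two thirds the size of S
    plans = []
    for _ in range(k):
        # Row indices are drawn with replacement, so repeats are allowed.
        rows = [rng.randrange(len(S)) for _ in range(size)]
        # Feature indices are drawn without replacement within one tree.
        feats = sorted(rng.sample(range(n_features_total), n))
        plans.append((rows, feats))
    return plans
```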
C. Training the decision tree models
Each sampled set is used to train one decision tree model, giving k decision tree models in total. Let T be one sampled set, $T = s_i$, $i = 1, \ldots, k$. T contains two classes, normal transactions and fraudulent transactions, described by the class variable Y with values normal and fraud. The number of training samples is Num(T), the number of normal transactions is Num(normal), and the number of fraudulent transactions is Num(fraud).
● Compute the Gini coefficient of the training sample. The Gini coefficient Gini(T) of the training sample is calculated as:
$$\mathrm{Gini}(T) = 1 - p_{normal}(T)^2 - p_{fraud}(T)^2$$
where $p_{normal}(T) = \mathrm{Num}(normal) / \mathrm{Num}(T)$ is the probability of a normal transaction in the training sample T, and
$p_{fraud}(T) = \mathrm{Num}(fraud) / \mathrm{Num}(T)$ is the probability of a fraudulent transaction in the training sample T.
● Determine the split node. Let the set $X = \{X_1, \ldots, X_n\}$ denote the original variables and context variables of the training sample, where each variable takes the values $X_i = \{c_{i1}, \ldots, c_{im}\}$. Suppose a value $X_i = c$, $c \in \{c_{i1}, \ldots, c_{im}\}$, divides the sample T into two subsets $T(X_i \le c)$ and $T(X_i > c)$. The Gini coefficient $\mathrm{Gini}(T_{X_i = c})$ of this split is:
$$\mathrm{Gini}(T_{X_i = c}) = \frac{\mathrm{Num}(T(X_i \le c))}{\mathrm{Num}(T)} \, \mathrm{Gini}(T(X_i \le c)) + \frac{\mathrm{Num}(T(X_i > c))}{\mathrm{Num}(T)} \, \mathrm{Gini}(T(X_i > c))$$
where $\mathrm{Num}(T(X_i \le c))$ and $\mathrm{Num}(T(X_i > c))$ are the sample counts of the subsets $T(X_i \le c)$ and $T(X_i > c)$ respectively. The Gini coefficient is computed for every split of every variable at every value, and the split with the smallest Gini coefficient is taken as the best split node.
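The two Gini formulas and the exhaustive search for the best split translate directly into code. The sketch assumes rows are dictionaries keyed by feature name and labels are the strings `"normal"` and `"fraud"`; the function names are illustrative.

```python
def gini(labels):
    """Gini(T) = 1 - p_normal^2 - p_fraud^2 for a two-class label list."""
    total = len(labels)
    if total == 0:
        return 0.0  # an empty subset contributes nothing to the weighted sum
    p_fraud = sum(1 for y in labels if y == "fraud") / total
    p_normal = 1.0 - p_fraud
    return 1.0 - p_normal ** 2 - p_fraud ** 2

def split_gini(rows, labels, feature, c):
    """Weighted Gini of splitting on feature <= c versus feature > c."""
    left = [y for x, y in zip(rows, labels) if x[feature] <= c]
    right = [y for x, y in zip(rows, labels) if x[feature] > c]
    n = len(labels)
    return len(left) / n * gini(left) + len(right) / n * gini(right)

def best_split(rows, labels, features):
    """Try every value of every feature; return (gini, feature, c) of the minimum."""
    best = None
    for f in features:
        for c in sorted({x[f] for x in rows}):
            g = split_gini(rows, labels, f, c)
            if best is None or g < best[0]:
                best = (g, f, c)
    return best
```

For four rows with values 1, 2, 3, 4 of a feature `a` and labels normal, normal, fraud, fraud, the unsplit Gini coefficient is 0.5 and the best split is `a <= 2`, which separates the classes completely and gives a split Gini coefficient of 0.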
● Repeat the previous step. Suppose the Gini coefficient $\mathrm{Gini}(T_{X_i = c})$ is the minimum; then $X_i = c$ is the best split node of the previous step, and the previous step is repeated on each of the subsets $T(X_i \le c)$ and $T(X_i > c)$ to determine the next best split node. Splitting stops when any one of the following conditions is met: all samples in the subset $T(X_i \le c)$ or $T(X_i > c)$ have the same value of the class variable Y; the set X has no values left to split on; the Gini coefficient of the split no longer decreases; or the number of samples in the subset $T(X_i \le c)$ or $T(X_i > c)$ is less than a threshold α.
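The four stopping conditions can be collected into a single predicate that a recursive tree builder would call before splitting a subset further. This is a sketch under assumptions: the argument names are hypothetical, and "no longer decreases" is interpreted as the best child Gini coefficient being no smaller than the Gini coefficient of the subset itself.

```python
def should_stop(labels, candidate_values, parent_gini, best_child_gini, alpha):
    """Return True when any of the four stopping conditions holds."""
    if len(set(labels)) <= 1:
        return True  # the subset contains a single class
    if not candidate_values:
        return True  # no values of X are left to split on
    if best_child_gini >= parent_gini:
        return True  # splitting no longer reduces the Gini coefficient
    if len(labels) < alpha:
        return True  # the subset is smaller than the threshold alpha
    return False
```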
D. Evaluating the decision tree models
Let T′ be the test sample, which is the remaining one third of S, that is, T′ = S − T. Each decision tree model is tested on its corresponding test sample, and the classification accuracy $r_i$, $i = 1, \ldots, k$, of each decision tree model is recorded. If $r_i$ is below a threshold β, the decision tree model is discarded. Let l be the number of qualified decision tree models, with accuracies $r_i$, $i = 1, \ldots, l$.
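The evaluation step can be sketched with each tree represented as a plain callable that maps a feature row to a label. That representation, and the names `accuracy` and `keep_qualified`, are assumptions for illustration; the source does not prescribe a data structure.

```python
def accuracy(predict, test_rows, test_labels):
    """Fraction of test rows for which the tree's prediction matches the label."""
    correct = sum(1 for x, y in zip(test_rows, test_labels) if predict(x) == y)
    return correct / len(test_labels)

def keep_qualified(trees, test_sets, beta):
    """trees: list of predict callables; test_sets: one (rows, labels) pair per tree.
    Trees whose accuracy is below beta are discarded; the rest are returned
    together with their accuracy r_i."""
    kept = []
    for predict, (rows, labels) in zip(trees, test_sets):
        r = accuracy(predict, rows, labels)
        if r >= beta:
            kept.append((predict, r))
    return kept
```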
E. Determining the voting weight of each decision tree
The classification result of the random forest is decided jointly by the votes of the individual decision trees. Let the output of each decision tree be $y_i$, $i = 1, \ldots, l$, and the weight of each decision tree be $q_i$, $i = 1, \ldots, l$, where $q_i$ is calculated as:
$$q_i = \frac{r_i}{\sum_{j=1}^{l} r_j}, \quad i = 1, \ldots, l$$
The classification result of the random forest for each sample, that is, the voting result RF of the decision trees, is:
$$RF = \sum_{i=1}^{l} y_i \, q_i$$
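The two formulas are short enough to check numerically. In the sketch below, the numeric encoding of the tree outputs (1 for fraud, 0 for normal) is an assumption; the source does not state how $y_i$ is encoded or which value of RF counts as fraud.

```python
def vote_weights(accuracies):
    """q_i = r_i / sum_j r_j, so the weights of the qualified trees sum to 1."""
    total = sum(accuracies)
    return [r / total for r in accuracies]

def forest_score(outputs, weights):
    """RF = sum_i y_i * q_i for one transaction."""
    return sum(y * q for y, q in zip(outputs, weights))
```

With accuracies 0.9, 0.8 and 0.7 the weights are 0.375, about 0.333 and about 0.292. If the first and third trees output 1 and the second outputs 0, RF is (0.9 + 0.7) / 2.4, that is, two thirds.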
5) Fraud detection on production transactions
The trained and evaluated random forest model performs fraud detection on production transactions and produces a fraud detection result set. The transactions in the fraud detection result set are confirmed manually, and those confirmed as fraud are added to the fraudulent transaction table. The model is retrained and updated at regular intervals so that it can recognize the latest fraud patterns.
The financial transaction detection method based on a random forest of the present invention has been described above in detail with reference to Fig. 1 and Fig. 2. Those skilled in the art will understand that, without departing from the spirit and scope of the invention, the method can also be implemented with corresponding hardware equipment, a computer program or other means. Such changes and substitutions are to be understood as falling within the scope defined by the claims of the invention.

Claims (18)

1. A financial transaction detection method based on a random forest model, comprising:
(a) obtaining a historical transaction table and a fraudulent transaction table;
(b) constructing a sample data set from the historical transaction table and the fraudulent transaction table, the sample data set comprising sample feature variables;
(c) randomly drawing a plurality of samples from the sample data set with replacement;
(d) randomly selecting the same number of feature variables for each of the plurality of samples, so as to generate a decision tree model corresponding to that sample and thereby generate a random forest model;
(e) training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model;
(f) determining the voting weight $q_i$ of each decision tree model based on the accuracy; and
(g) using the response outputs $y_i$ of the decision tree models in the random forest model to the input financial transaction data, together with the voting weights $q_i$, obtaining the voting result RF according to the following formula and judging whether the financial transaction is fraudulent:
$$RF = \sum_{i=1}^{l} y_i \, q_i$$
where l is the number of decision tree models.
2. The method of claim 1, further comprising:
(h) when the financial transaction is judged to be fraudulent, adding the transaction to a fraud detection result set;
(i) confirming the transactions in the fraud detection result set, and adding the transactions confirmed as fraud to the fraudulent transaction table; and
(j) re-executing steps (a) and (b).
3. The method of claim 1, wherein step (e) further comprises:
(e1) calculating the Gini coefficient of every split of every variable at every value in the training sample;
(e2) taking the split with the smallest Gini coefficient as the first best split point; and
(e3) dividing the training sample at the first best split point, and repeating (e1) and (e2) on each of the resulting training samples to determine a second best split point.
4. The method of claim 1, wherein the sample feature variables comprise original variables, context variables and statistical variables.
5. The method of claim 4, wherein the original variables are the transaction amount and transaction time obtained directly from the fraudulent transaction table and the historical transaction table.
6. The method of claim 4, wherein the context variables are whether transactions took place in the same region and whether transactions took place at the same merchant.
7. The method of claim 4, wherein the statistical variables are statistics over a period of time for a given card number or for the merchant at which that card number transacts.
8. The method of claim 1, wherein the number of selected feature variables is n, and the relationship between n and the total number N of feature variables is as follows:
9. The method of claim 1, wherein, when the accuracy of a decision tree model is below a threshold, that decision tree model is discarded from the random forest model.
10. Financial transaction detection equipment based on a random forest model, comprising:
means for obtaining a historical transaction table and a fraudulent transaction table;
means for constructing a sample data set from the historical transaction table and the fraudulent transaction table, the sample data set comprising sample feature variables;
means for randomly drawing a plurality of samples from the sample data set with replacement;
means for randomly selecting the same number of feature variables for each of the plurality of samples, so as to generate a decision tree model corresponding to that sample and thereby generate a random forest model;
means for training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model;
means for determining the voting weight $q_i$ of each decision tree model based on the accuracy; and
means for using the response outputs $y_i$ of the decision tree models in the random forest model to the input financial transaction data, together with the voting weights $q_i$, to obtain the voting result RF according to the following formula and to judge whether the financial transaction is fraudulent:
$$RF = \sum_{i=1}^{l} y_i \, q_i$$
where l is the number of decision tree models.
11. The equipment of claim 10, further comprising:
means for adding the transaction to a fraud detection result set when the financial transaction is judged to be fraudulent; and
means for confirming the transactions in the fraud detection result set and adding the transactions confirmed as fraud to the fraudulent transaction table.
12. The equipment of claim 10, wherein the means for training and evaluating each of the decision tree models in the random forest model to obtain the accuracy of each decision tree model is configured to perform the following steps:
(e1) calculating the Gini coefficient of every split of every variable at every value in the training sample;
(e2) taking the split with the smallest Gini coefficient as the first best split point; and
(e3) dividing the training sample at the first best split point, and repeating (e1) and (e2) on each of the resulting training samples to determine a second best split point.
13. The equipment of claim 10, wherein the sample feature variables comprise original variables, context variables and statistical variables.
14. The equipment of claim 10, wherein the original variables are the transaction amount and transaction time obtained directly from the fraudulent transaction table and the historical transaction table.
15. The equipment of claim 10, wherein the context variables are whether transactions took place in the same region and whether transactions took place at the same merchant.
16. The equipment of claim 10, wherein the statistical variables are statistics over a period of time for a given card number or for the merchant at which that card number transacts.
17. The equipment of claim 10, wherein the number of selected feature variables is n, and the relationship between n and the total number N of feature variables is as follows:
18. The equipment of claim 10, further comprising: means for discarding a decision tree model from the random forest model when the accuracy of that decision tree model is below a threshold.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410361193.3A CN105279691A (en) 2014-07-25 2014-07-25 Financial transaction detection method and equipment based on random forest model

Publications (1)

Publication Number Publication Date
CN105279691A true CN105279691A (en) 2016-01-27

Family

ID=55148646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410361193.3A Pending CN105279691A (en) 2014-07-25 2014-07-25 Financial transaction detection method and equipment based on random forest model

Country Status (1)

Country Link
CN (1) CN105279691A (en)

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529960A (en) * 2016-11-07 2017-03-22 中国银联股份有限公司 Fraud transaction detection method for electronic transaction
CN106789912A (en) * 2016-11-22 2017-05-31 清华大学 Router data plane anomaly detection method based on classification regression tree
CN106897931A (en) * 2016-06-12 2017-06-27 阿里巴巴集团控股有限公司 A kind of recognition methods of abnormal transaction data and device
CN107330785A (en) * 2017-07-10 2017-11-07 广州市触通软件科技股份有限公司 A kind of petty load system and method based on the intelligent air control of big data
CN107423871A (en) * 2017-04-24 2017-12-01 成都知数科技有限公司 Financial air control field multiple features fusion extracting method
CN107481019A (en) * 2017-07-28 2017-12-15 上海携程商务有限公司 Order fraud recognition methods, system, storage medium and electronic equipment
CN107563645A (en) * 2017-09-04 2018-01-09 杭州云算信达数据技术有限公司 A kind of Financial Risk Analysis method based on big data
CN108074179A (en) * 2017-12-07 2018-05-25 深圳乐信软件技术有限公司 Financial air control tactics configuring method, system, server and storage medium
CN108305166A (en) * 2018-04-04 2018-07-20 淮阴师范学院 A kind of negative dealing fraud method towards financial field
CN108510096A (en) * 2017-02-24 2018-09-07 百度在线网络技术(北京)有限公司 Trade company's attrition prediction method, apparatus, equipment and storage medium
WO2018166457A1 (en) * 2017-03-15 2018-09-20 阿里巴巴集团控股有限公司 Neural network model training method and device, transaction behavior risk identification method and device
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN108964951A (en) * 2017-05-19 2018-12-07 腾讯科技(深圳)有限公司 A kind of method and server of warning information acquisition
CN109472610A (en) * 2018-11-09 2019-03-15 福建省农村信用社联合社 A kind of bank transaction is counter to cheat method and system, equipment and storage medium
CN109767314A (en) * 2018-12-14 2019-05-17 深圳壹账通智能科技有限公司 Trade company's risk management and control method, device, computer equipment and storage medium
CN109791679A (en) * 2016-09-26 2019-05-21 哈曼国际工业有限公司 The system and method for prediction for automobile guarantee fraud
CN110264342A (en) * 2019-06-19 2019-09-20 深圳前海微众银行股份有限公司 A kind of business audit method and device based on machine learning
CN110390526A (en) * 2018-04-18 2019-10-29 苏宁易购集团股份有限公司 A kind of network trading analysis method and system
CN110414845A (en) * 2019-07-31 2019-11-05 阿里巴巴集团控股有限公司 For the methods of risk assessment and device of target transaction
CN110634067A (en) * 2019-09-25 2019-12-31 上海应用技术大学 Bank account abnormal transaction identification method
TWI684151B (en) * 2016-10-21 2020-02-01 大陸商中國銀聯股份有限公司 Method and device for detecting illegal transaction
CN111401906A (en) * 2020-03-05 2020-07-10 中国工商银行股份有限公司 Transfer risk detection method and system
CN111553685A (en) * 2020-04-28 2020-08-18 中国工商银行股份有限公司 Method, device, electronic equipment and storage medium for determining transaction routing channel
CN112308466A (en) * 2020-11-26 2021-02-02 东莞市盟大塑化科技有限公司 Enterprise qualification auditing method and device, computer equipment and storage medium
CN112949954A (en) * 2019-11-22 2021-06-11 张捷 Method for establishing financial fraud recognition model based on recognition learning
CN113056753A (en) * 2018-11-21 2021-06-29 贝宝公司 Machine learning based on post-transaction data
CN118247046A (en) * 2024-05-28 2024-06-25 上海冰鉴信息科技有限公司 Behavior fraud prediction method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102890803A (en) * 2011-07-21 2013-01-23 阿里巴巴集团控股有限公司 Method and device for determining abnormal transaction process of electronic commodity
CN103473231A (en) * 2012-06-06 2013-12-25 深圳先进技术研究院 Classifier building method and system
CN103530540A (en) * 2013-09-27 2014-01-22 西安交通大学 User identity attribute detection method based on man-machine interaction behavior characteristics
CN103678659A (en) * 2013-12-24 2014-03-26 焦点科技股份有限公司 E-commerce website cheat user identification method and system based on random forest algorithm

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
J. BUREZ ET AL: "Handling class imbalance in customer churn prediction", 《EXPERT SYSTEMS WITH APPLICATIONS》 *
刘建丽等: "基于决策树的税务数据分析", 《现代计算机(专业版)》 *
周丽峰: "基于非平衡数据分类的贷款违约预测研究", 《中国优秀硕士学位论文全文数据库 经济与管理科学辑》 *
方匡南: "《随机森林组合预测理论及其在金融中的应用》", 31 May 2012, 厦门:厦门大学出版社 *
方匡南等: "随机森林方法研究综述", 《统计与信息论坛》 *
解明恩等: "《云南短期气候预测方法与模型》", 31 December 2000 *

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106897931A (en) * 2016-06-12 2017-06-27 阿里巴巴集团控股有限公司 A kind of recognition methods of abnormal transaction data and device
CN109791679A (en) * 2016-09-26 2019-05-21 哈曼国际工业有限公司 The system and method for prediction for automobile guarantee fraud
TWI684151B (en) * 2016-10-21 2020-02-01 大陸商中國銀聯股份有限公司 Method and device for detecting illegal transaction
CN106529960A (en) * 2016-11-07 2017-03-22 中国银联股份有限公司 Fraud transaction detection method for electronic transaction
CN106789912A (en) * 2016-11-22 2017-05-31 清华大学 Router data plane anomaly detection method based on classification regression tree
CN106789912B (en) * 2016-11-22 2020-02-21 清华大学 Router data plane abnormal behavior detection method based on classification regression decision tree
CN108510096A (en) * 2017-02-24 2018-09-07 百度在线网络技术(北京)有限公司 Trade company's attrition prediction method, apparatus, equipment and storage medium
CN108629413B (en) * 2017-03-15 2020-06-16 创新先进技术有限公司 Neural network model training and transaction behavior risk identification method and device
TWI689874B (en) * 2017-03-15 2020-04-01 香港商阿里巴巴集團服務有限公司 Method and device for neural network model training and transaction behavior risk identification
WO2018166457A1 (en) * 2017-03-15 2018-09-20 阿里巴巴集团控股有限公司 Neural network model training method and device, transaction behavior risk identification method and device
CN108629413A (en) * 2017-03-15 2018-10-09 阿里巴巴集团控股有限公司 Neural network model training, trading activity Risk Identification Method and device
CN107423871A (en) * 2017-04-24 2017-12-01 成都知数科技有限公司 Financial air control field multiple features fusion extracting method
CN108964951A (en) * 2017-05-19 2018-12-07 腾讯科技(深圳)有限公司 A kind of method and server of warning information acquisition
CN108964951B (en) * 2017-05-19 2020-12-29 腾讯科技(深圳)有限公司 Method for acquiring alarm information and server
CN107330785A (en) * 2017-07-10 2017-11-07 广州市触通软件科技股份有限公司 A kind of petty load system and method based on the intelligent air control of big data
CN107481019A (en) * 2017-07-28 2017-12-15 上海携程商务有限公司 Order fraud recognition methods, system, storage medium and electronic equipment
CN107563645A (en) * 2017-09-04 2018-01-09 杭州云算信达数据技术有限公司 A kind of Financial Risk Analysis method based on big data
CN108074179A (en) * 2017-12-07 2018-05-25 深圳乐信软件技术有限公司 Financial air control tactics configuring method, system, server and storage medium
CN108305166A (en) * 2018-04-04 2018-07-20 淮阴师范学院 A kind of negative dealing fraud method towards financial field
CN110390526A (en) * 2018-04-18 2019-10-29 苏宁易购集团股份有限公司 A kind of network trading analysis method and system
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN109472610A (en) * 2018-11-09 2019-03-15 福建省农村信用社联合社 A kind of bank transaction is counter to cheat method and system, equipment and storage medium
CN113056753A (en) * 2018-11-21 2021-06-29 贝宝公司 Machine learning based on post-transaction data
CN109767314A (en) * 2018-12-14 2019-05-17 深圳壹账通智能科技有限公司 Trade company's risk management and control method, device, computer equipment and storage medium
CN110264342A (en) * 2019-06-19 2019-09-20 深圳前海微众银行股份有限公司 A kind of business audit method and device based on machine learning
CN110264342B (en) * 2019-06-19 2024-06-28 深圳前海微众银行股份有限公司 Business auditing method and device based on machine learning
CN110414845A (en) * 2019-07-31 2019-11-05 阿里巴巴集团控股有限公司 For the methods of risk assessment and device of target transaction
CN110414845B (en) * 2019-07-31 2023-09-19 创新先进技术有限公司 Risk assessment method and device for target transaction
CN110634067A (en) * 2019-09-25 2019-12-31 上海应用技术大学 Bank account abnormal transaction identification method
CN112949954A (en) * 2019-11-22 2021-06-11 张捷 Method for establishing financial fraud recognition model based on recognition learning
CN112949954B (en) * 2019-11-22 2023-11-07 张捷 Method for establishing financial fraud recognition model based on recognition learning
CN111401906A (en) * 2020-03-05 2020-07-10 中国工商银行股份有限公司 Transfer risk detection method and system
CN111553685A (en) * 2020-04-28 2020-08-18 中国工商银行股份有限公司 Method, device, electronic equipment and storage medium for determining transaction routing channel
CN112308466A (en) * 2020-11-26 2021-02-02 东莞市盟大塑化科技有限公司 Enterprise qualification auditing method and device, computer equipment and storage medium
CN118247046A (en) * 2024-05-28 2024-06-25 上海冰鉴信息科技有限公司 Behavior fraud prediction method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN105279691A (en) Financial transaction detection method and equipment based on random forest model
Martin et al. Estimating the gravity model when zero trade flows are frequent and economically determined
CN102567391B (en) Method and device for building classification forecasting mixed model
CN108364106A (en) A kind of expense report Risk Forecast Method, device, terminal device and storage medium
Dumitrescu et al. Backtesting value-at-risk: from dynamic quantile to dynamic binary tests
CN110599336B (en) Financial product purchase prediction method and system
CN100595780C (en) Handwriting digital automatic identification method based on module neural network SN9701 rectangular array
CN108475393A (en) The system and method that decision tree is predicted are promoted by composite character and gradient
CN104572449A (en) Automatic test method based on case library
CN109635010B (en) User characteristic and characteristic factor extraction and query method and system
Plakandaras et al. Do leading indicators forecast US recessions? A nonlinear re‐evaluation using historical data
Dbouk et al. Towards a machine learning approach for earnings manipulation detection
CN105528465A (en) Credit status assessment method and device
CN110659961A (en) Method and device for identifying off-line commercial tenant
CN109508807A (en) Lottery user liveness prediction technique, system and terminal device, storage medium
CN112434862B (en) Method and device for predicting financial dilemma of marketing enterprises
Zhang et al. Research on personal credit scoring model based on multi-source data
CN114139931A (en) Enterprise data evaluation method and device, computer equipment and storage medium
CN111160929A (en) Method and device for determining client type
CN104462215B (en) A kind of scientific and technical literature based on time series is cited number Forecasting Methodology
CN111046947A (en) Training system and method of classifier and identification method of abnormal sample
CN109992592A (en) Impoverished College Studentss recognition methods based on campus consumption card pipelined data
Su et al. Detection of tax arrears based on ensemble learning model
Akinci et al. Comparison of iron and steel production defects using classification algorithms
CN111612626A (en) Method and device for preprocessing bond evaluation data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20160127)