CN114912549B

CN114912549B - Training method of risk transaction identification model, and risk transaction identification method and device

Info

Publication number: CN114912549B
Application number: CN202210807503.4A
Authority: CN
Inventors: 王宁涛; 傅幸; 王维强
Original assignee: Alipay Hangzhou Information Technology Co Ltd
Current assignee: Alipay Hangzhou Information Technology Co Ltd
Priority date: 2022-07-11
Filing date: 2022-07-11
Publication date: 2022-12-13
Anticipated expiration: 2042-07-11
Also published as: CN114912549A

Abstract

Embodiments of this specification describe a training method for a risk transaction identification model, a risk transaction identification method, and corresponding devices. When the risk transaction identification model is trained, the classification labels of the acquired black data samples and white data samples are known. Each data sample is identified using the currently trained risk transaction identification model to obtain its identification result, a loss function is then determined from these results, and model training continues using that loss function. The determined loss function increases the learning weight of the black data samples, so that when the black data samples used for model learning are fewer than the white data samples, the tendency of the learning task to lean toward the classification label of the white data samples is weakened and the accuracy of the model in identifying risk transactions is improved.

Description

Training method of risk transaction identification model, and risk transaction identification method and device

Technical Field

One or more embodiments of the present disclosure relate to the field of artificial intelligence, and in particular, to a method for training a risk transaction recognition model, a risk transaction recognition method, and an apparatus for the same.

Background

In the field of risk prevention and control, a deep learning network is trained on black samples and white samples so that the trained model can be used to perform risk identification on accounts.

However, the numbers of black and white samples in the field of risk control often differ widely. For example, the ratio of black and white samples may be 1. Although black samples are generally of greater concern for risk prevention and control, a model trained on samples in such a ratio pays more attention to the information of the white samples, so the information of the black samples may be weakened or even ignored, and the resulting identification model is often less accurate when performing risk identification.

Disclosure of Invention

One or more embodiments of the present specification describe a training method of a risk transaction identification model, a risk transaction identification method, and an apparatus, which can improve accuracy of risk transaction identification.

According to a first aspect, there is provided a method of training a risk transaction recognition model, comprising:

acquiring a black data sample and a white data sample; the classification label of the black data sample is risk transaction, and the classification label of the white data sample is non-risk transaction;

inputting the black data samples and the white data samples into a currently trained risk transaction identification model to obtain identification results of the black data samples and the white data samples;

determining a loss function according to the recognition result of each black data sample and each white data sample; wherein the loss function is capable of increasing learning weights for the black data samples;

and continuing to train the risk transaction identification model by using the loss function.

In one possible implementation manner, the recognition result includes: a probability value that the label of the data sample is risk transaction;

the determining a loss function according to the recognition result of each black data sample and each white data sample includes:

determining a first learning weight for the white data sample; and,

determining a second learning weight for the black data sample; wherein the second learning weight is greater than the first learning weight, and the second learning weight satisfies: sorting the probability values of the risk transactions obtained by the risk transaction identification model from high to low to obtain N data samples corresponding to the first N probability values, wherein the ratio of the number of black data samples with classification labels as risk transactions contained in the N data samples to the number of all black data samples input into the risk transaction identification model is greater than a first preset threshold value;

determining the loss function according to the first learning weight and the second learning weight.

In one possible implementation, the determining the first learning weight of the white data sample includes:

acquiring the probability value, output by the currently trained risk transaction identification model, that the classification label is risk transaction;

determining the first learning weight according to the probability value of the risk transaction.

In one possible implementation, the determining the loss function according to the first learning weight and the second learning weight includes:

taking the first learning weight as a weight value for training the white data sample to obtain a white sample loss item;

taking the second learning weight as a weight value for training the black data sample to obtain a black sample loss item;

and calculating the sum of the black sample loss term and the white sample loss term to obtain the loss function.

In a possible implementation manner, the determining a loss function according to the recognition result of each black data sample and each white data sample includes:

the loss function is calculated using the following calculation:

$$L = -y \log(p) - (1 - y)\, p^{\gamma} \log(1 - p)$$

wherein $L$ is used for characterizing the loss function,

$y$ is the label value corresponding to the classification label of the data sample,

$p$ is the probability value, output by the currently trained risk identification model, that the classification label is risk transaction,

$\gamma$ is a parameter for balancing the degree of attention that the risk transaction identification model pays to the black data sample and the white data sample.

In one possible implementation, $\gamma$ is not greater than the first parameter; the first parameter is the base-10 logarithm of the ratio of the number of the white data samples to the number of the black data samples.

In a possible implementation manner, a ratio value of the number of the black data samples to the number of the white data samples is not greater than a second preset threshold.

According to a second aspect, there is provided a risk transaction identification method comprising:

acquiring transaction data to be identified, wherein the transaction data is to be risk identified;

inputting the transaction data to be identified into the risk transaction identification model to obtain an identification result output by the risk transaction identification model; the risk transaction identification model is obtained by training by using the risk transaction identification model training method according to any embodiment of the first aspect.

According to a third aspect, there is provided a training apparatus for a risk transaction recognition model, comprising: an acquisition module, an input module, a determination module and a training module;

the acquisition module is configured to acquire black data samples and white data samples; the classification label of the black data sample is risk transaction, and the classification label of the white data sample is non-risk transaction;

the input module is configured to input the black data samples and the white data samples acquired by the acquisition module into a currently trained risk transaction identification model to obtain identification results of the black data samples and the white data samples;

the determining module is configured to determine a loss function according to the recognition result of each black data sample and each white data sample obtained by the input module; wherein the loss function is capable of increasing learning weights for the black data samples;

the training module is configured to continue training the risk transaction identification model by using the loss function determined by the determination module.

According to a fourth aspect, there is provided a risk transaction identification device comprising: a transaction to be identified acquisition module and an identification module;

the transaction to be identified acquisition module is configured to acquire transaction data to be identified, wherein the transaction data to be identified is to be risk identified;

the identification module is configured to input the transaction data to be identified, which is acquired by the transaction to be identified acquisition module, into the risk transaction identification model to obtain an identification result output by the risk transaction identification model; wherein the risk transaction identification model is trained by using the training device of the risk transaction identification model of the third aspect.

According to a fifth aspect, there is provided a computing device comprising: a memory having executable code stored therein, and a processor that, when executing the executable code, implements the method of any of the first and second aspects described above.

According to the method and the device provided by the embodiment of the specification, when a risk transaction identification model is trained, the classification labels of the acquired black data samples and white data samples are known. The data samples are identified by using the currently trained risk transaction identification model to obtain respective identification results, so that a loss function can be determined, and model training is continued by using the loss function. The determined loss function can improve the learning weight of the black data samples, so that when the black data samples used for model learning are less than the white data samples, the problem that the learning task inclines to the classification labels of the white data samples can be weakened, and the accuracy of the model for identifying the risk transactions is improved.

Drawings

In order to more clearly illustrate the embodiments of the present specification or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present specification, and other drawings can be obtained by those skilled in the art without creative efforts.

FIG. 1 is a flow chart of a method for training a risk transaction identification model provided in one embodiment of the present description;

FIG. 2 is a flow chart of a method for determining a loss function provided in one embodiment of the present description;

FIG. 3 is a flow chart of a risk transaction identification method provided by one embodiment of the present description;

FIG. 4 is a schematic diagram of a training apparatus for risk transaction identification models according to an embodiment of the present disclosure;

FIG. 5 is a schematic diagram of a risk transaction identification device provided in one embodiment of the present description.

Detailed Description

As previously mentioned, in the field of risk prevention and control, the numbers of black and white samples used to train a model typically differ greatly. This causes the estimates produced by the softmax layer in the deep learning network to be skewed towards the dominant labels in the multi-classification task. For example, the ratio of the black sample to the white sample is 1. The estimates produced by the softmax layer will therefore be skewed towards the white sample label, weakening or even ignoring the information of the black samples. As a result, the model cannot adequately learn the information of the black samples, and often fails to produce an accurate risk identification result when performing risk prediction.

Based on this, the present scheme determines a loss function capable of increasing the learning weight of the black data samples, which compensates for the black data samples being fewer than the white data samples in model training and thereby improves the accuracy of model prediction.

As shown in fig. 1, embodiments of the present specification provide a method for training a risk transaction recognition model, which may include the following steps:

step 101: acquiring a black data sample and a white data sample; the classification label of the black data sample is risk transaction, and the classification label of the white data sample is non-risk transaction;

step 103: inputting the black data samples and the white data samples into a currently trained risk transaction identification model to obtain identification results of the black data samples and the white data samples;

step 105: determining a loss function according to the recognition result of each black data sample and each white data sample; the loss function can improve the learning weight of the black data sample;

step 107: continuing to train the risk transaction recognition model by using the loss function.

In this embodiment, when the risk transaction identification model is trained, the classification labels of the obtained black data samples and white data samples are known. Each data sample is identified using the currently trained risk transaction identification model to obtain its identification result, a loss function is then determined from these results, and model training continues using that loss function. The determined loss function increases the learning weight of the black data samples, so that when the black data samples learned by the model are fewer than the white data samples, the tendency of the learning task to lean toward the classification label of the white data samples is weakened and the accuracy of risk transaction identification by the model is improved.

The steps in FIG. 1 are described below with reference to specific examples.

First in step 101, black and white data samples are obtained.

The training samples used for training the risk transaction identification model comprise black data samples with classification labels of risk transactions and white data samples with classification labels of non-risk transactions. For example, for some transaction data, it can be determined that some transactions are illegal transactions through reporting of users, manual analysis, and the like, and the data corresponding to the transactions are black data samples with classification labels as risk transactions. Similarly, the data which does not contain the illegal transaction is determined to be a white data sample with a classification label of non-risk transaction through the report of the user, manual analysis and the like.

It is worth pointing out that in practical applications, the black data samples of risk transactions are usually far fewer than the white data samples of non-risk transactions. On the one hand, the sample size of one type of label is then too small for deep learning to be effective. On the other hand, the estimates produced by the softmax layer in the deep learning network lean toward the labels with more sample data in the multi-classification task, and this skew directly affects the accuracy of the multi-classification task. The scheme is intended to solve this problem of unbalanced classification labels; therefore, in a possible implementation manner, the ratio value of the number of the black data samples to the number of the white data samples for training the risk transaction recognition model is not greater than a second preset threshold value.

For example, if the second preset threshold is 1. For example, the proportional value of the number of black data samples and white data samples may be 1.

Then, in step 103, the black data samples and the white data samples are input into the currently trained risk transaction recognition model, and recognition results of the black data samples and the white data samples are obtained.

In this step, the black data samples and the white data samples are input into the currently trained risk transaction recognition model, and the estimated values of the data samples are output from the output layer of the deep learning network for optimizing the loss function.

Further in step 105, a loss function is determined based on the recognition results of each of the black data samples and white data samples.

In this step, the loss function is determined from the recognition results of the black data samples and the white data samples so as to increase the learning weight of the black data samples, thereby offsetting the classification label imbalance between the black data samples and the white data samples. For example, in one possible implementation, the recognition result may include a probability value that the label of the data sample is a risk transaction. Then, as shown in FIG. 2, when determining the loss function according to the recognition result of each black data sample and each white data sample, step 105 may be implemented by the following steps:

step 201: determining a first learning weight for the white data sample; and,

step 203: determining a second learning weight for the black data sample; wherein the second learning weight is greater than the first learning weight, and the second learning weight satisfies: sorting the probability values of the risk transactions obtained by the risk transaction identification model from high to low to obtain N data samples corresponding to the first N probability values, wherein the ratio of the number of black data samples with classification labels as risk transactions contained in the N data samples to the number of all black data samples input into the risk transaction identification model is greater than a first preset threshold value;

step 205: a loss function is determined based on the first learning weight and the second learning weight.

In the present embodiment, when determining the loss function, the first learning weight of the white data sample and the second learning weight of the black data sample may be determined first. The loss function is then determined according to the first learning weight and the second learning weight. It is worth noting that the determined second learning weight of the black data sample is larger than the first learning weight of the white data sample, so that the learning attention paid to the black data samples during training is increased, which offsets the scarcity of black data sample labels.

In addition, after the probability values of the risk transactions obtained by the risk transaction identification model are ranked from high to low, and N data samples corresponding to the first N probability values are obtained, the second learning weight further satisfies that the ratio of the number of black data samples with classification labels as risk transactions included in the N data samples to the number of all black data samples input into the risk transaction identification model is greater than a first preset threshold. That is to say, when the model training is performed through the loss function determined by the scheme, when the trained model identifies each data sample, the identification result can have a higher coverage degree on the risk transaction label of the black data sample, so that the accuracy of risk prediction performed by the model can be improved.

For example, among 10000 data samples, the number of black data samples with the classification label of risk transaction is 80, and the number of white data samples with the classification label of non-risk transaction is 9920. After the 10000 data samples are identified using the currently trained risk transaction identification model, the probability values that the classification label is risk transaction are sorted from high to low, and the data samples corresponding to the first 100 probability values are selected. The number of black data samples contained in these 100 data samples can then be counted. If the 100 data samples include all 80 black data samples whose actual classification labels are risk transactions, the coverage degree of the black data samples in the recognition result is 100%, which indicates a high coverage degree. Conversely, suppose many of the 80 black data samples labeled as risk transactions are not included in the 100 data samples. For example, if only 50 black data samples whose actual classification labels are risk transactions are contained therein, the coverage degree is 50/80 = 62.5%, and the coverage of the black data samples by the recognition result is clearly low. In this embodiment, by setting the first preset threshold, the ratio of the number of black data samples contained in the first N data samples to the number of all black data samples input into the risk transaction identification model is required to be greater than the first preset threshold, which ensures that the identification result has a high coverage degree of the black data samples and thus improves the reliability of the model for risk identification.
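The coverage check in this example can be sketched in a few lines of Python. This is a minimal illustration rather than part of the described method: the function name `black_coverage_at_n` and the synthetic scores are assumptions, with label 1 standing for a black data sample and label 0 for a white data sample.

```python
def black_coverage_at_n(probs, labels, n):
    """Share of all black samples (label 1) found among the n highest-scored samples."""
    total_black = sum(labels)
    if total_black == 0:
        raise ValueError("at least one black data sample is required")
    # Sort samples by predicted risk probability, highest first.
    ranked = sorted(zip(probs, labels), key=lambda pair: pair[0], reverse=True)
    hits = sum(label for _, label in ranked[:n])
    return hits / total_black


# Synthetic data mirroring the example: 10000 samples, 80 of them black,
# with 50 black samples placed among the 100 highest scores.
labels = [1] * 50 + [0] * 50 + [1] * 30 + [0] * 9870
probs = [1.0 - i / 10000 for i in range(10000)]  # strictly decreasing scores

coverage = black_coverage_at_n(probs, labels, 100)
print(f"coverage at N=100: {coverage:.1%}")
```

Comparing the returned value with the first preset threshold would then indicate whether the coverage condition on the second learning weight is met for a given N.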

Step 201 is explained below.

Step 201 may use the estimate from the currently trained risk transaction identification model to determine the first learning weight for the white data sample. For example, the probability value, output by the currently trained risk transaction recognition model, that the classification label is risk transaction may be obtained first, and the first learning weight may then be determined according to this probability value.

For example, following the weight determination method of the focal loss function, a hyper-parameter may be determined from empirical values and used as the exponent of the probability value, output by the risk transaction identification model, that the classification label is risk transaction; the result is the first learning weight. Of course, in a possible implementation, the hyper-parameter may also be obtained through learning and continual optimization by a neural network.

Step 203 is explained below.

The number of black data samples is less than the number of white data samples. Therefore, in this step, the second learning weight is greater than the first learning weight, that is, a higher learning weight is given to the black data samples, so that the model pays more attention to the black data samples during learning, which offsets the classification label imbalance between the black data samples and the white data samples.

Of course, in a possible implementation manner, the second learning weight may be set to 1, and is not determined according to the recognition result of the current risk transaction recognition model. That is, for the label value of the black data sample, the probability that the recognition result of the current risk transaction recognition model is the risk transaction is considered to be 100%. Therefore, the learning weight of the black data sample can be improved to the maximum extent, and the accuracy of the risk transaction identification by the risk transaction identification model is further improved.

Step 205 is explained below.

After determining the first learning weight of the white data sample and the second learning weight of the black data sample, a loss function is further determined according to the first learning weight and the second learning weight. For example, in one possible implementation, the second learning weight is used as the weight value for training the black data sample to obtain a black sample loss term, and the first learning weight is used as the weight value for training the white data sample to obtain a white sample loss term.

Then, the sum of the obtained black sample loss term and white sample loss term is calculated to obtain a loss function. Although the number of labels of the white data samples is larger, the second learning weight of the black data samples is larger. Therefore, when the model learns the sample data, the proportion of the black sample data in the optimization learning can be improved, so that the trained model is more reliable, and the accuracy of the model in risk identification is higher.
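The weighted sum of the two loss terms can be sketched as follows. This is an illustration under stated assumptions, not a definitive implementation: the names `white_weight` and `sample_loss` are hypothetical, the first (white) learning weight is assumed to take the focal-style form of the predicted risk probability raised to a hyper-parameter `gamma`, and the second (black) learning weight defaults to 1.

```python
import math


def white_weight(p, gamma):
    """First learning weight (assumed focal-style form): p raised to the power gamma."""
    return p ** gamma


def sample_loss(y, p, gamma, black_weight=1.0):
    """Sum of the black-sample and white-sample loss terms for one data sample.

    y is 1 for a black data sample and 0 for a white one; p is the predicted
    probability of risk transaction and must lie strictly between 0 and 1.
    """
    black_term = -black_weight * y * math.log(p)
    white_term = -white_weight(p, gamma) * (1 - y) * math.log(1.0 - p)
    return black_term + white_term


# A black sample and a white sample that both receive a score of 0.3, with gamma = 2.
print(sample_loss(1, 0.3, 2.0))  # only the black term is non-zero
print(sample_loss(0, 0.3, 2.0))  # only the white term is non-zero
```

For the same predicted probability, the black sample contributes a much larger loss than the white sample, which is how the larger second learning weight raises the share of black data samples in the optimization.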

In one possible implementation, the value of the second learning weight may be set to 1, so that the loss function may be obtained by using the following calculation:

$$L = -y \log(p) - (1 - y)\, p^{\gamma} \log(1 - p)$$

wherein:

$L$ is used for characterizing the loss function;

$y$ is the label value corresponding to the classification label of the data sample: if the data sample is a black data sample, the label value $y = 1$; if the data sample is a white data sample, the label value $y = 0$;

$p$ is the probability value, output by the currently trained risk identification model, that the classification label is risk transaction; that is, a data sample used for model training is input into the currently trained risk identification model, and $p$ is the probability, in the identification result output by the model, that the classification label of the input data sample is risk transaction;

$\gamma$ is a parameter for balancing the degree of attention that the risk transaction identification model pays to the black data samples and the white data samples.

$\gamma$ is a hyper-parameter and can be obtained from empirical or experimental values. In one possible implementation, $\gamma$ is not greater than a first parameter, which is the base-10 logarithm of the ratio of the number of white data samples to the number of black data samples. For example, if the ratio of black data samples to white data samples used to train the risk transaction recognition model is 1:1000, the first parameter is $\log_{10}(1000) = 3$, i.e., $\gamma$ should be no greater than 3.
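The upper bound on the balancing hyper-parameter can be computed directly from the sample counts. The sketch below is illustrative only; the names `gamma_upper_bound` and `clip_gamma` are assumptions introduced here, as is the idea of clipping a candidate value to the bound.

```python
import math


def gamma_upper_bound(num_white, num_black):
    """First parameter: base-10 logarithm of the white-to-black sample count ratio."""
    if num_white <= 0 or num_black <= 0:
        raise ValueError("sample counts must be positive")
    return math.log10(num_white / num_black)


def clip_gamma(gamma, num_white, num_black):
    """Keep a candidate hyper-parameter value at or below the bound."""
    return min(gamma, gamma_upper_bound(num_white, num_black))


print(gamma_upper_bound(1000, 1))   # white:black = 1000:1
print(clip_gamma(5.0, 9920, 80))    # candidate 5.0 exceeds the bound for 9920:80
```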

In the conventional cross entropy loss function, the loss term of the black data sample and the loss term of the white data sample both determine their learning weights from the identification result of the sample. For example, in the focal loss function, the learning weight of the black sample may be $(1 - p)^{\gamma}$, where $p$ is the probability, output by the model, that the classification label of the sample data is risk transaction, with a value between 0 and 1.

A multiplier less than 1 therefore reduces the proportion of the black samples in the optimization target. In this embodiment, the second learning weight in the black sample loss term $-y \log(p)$ of the black data sample is set to 1. Thus, for the label value of the black data sample, the probability that the identification result of the current risk transaction identification model is risk transaction is treated as 100%, so that the learning weight of the black data sample is increased to the maximum extent and the accuracy of the risk transaction identification model in predicting risk transactions is improved.

For example, for a black data sample, suppose the recognition result of the risk transaction recognition model is a risk transaction probability of 0.8 and the parameter $\gamma$ is 2. The loss based on the conventional focal loss function is then $-(1 - 0.8)^{2} \log(0.8) = -0.04 \log(0.8)$, whereas the loss obtained from the loss function provided by this scheme is $-\log(0.8)$. The loss function of this scheme clearly pays more attention to the black data sample, and can therefore offset the label imbalance between the black data samples and the white data samples.
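The comparison in this example can be checked numerically. The sketch below is a hedged illustration: the function names are hypothetical, and the natural logarithm is assumed for the loss terms.

```python
import math


def focal_black_loss(p, gamma):
    """Black-sample term of the conventional focal loss: weight (1 - p) ** gamma."""
    return -((1.0 - p) ** gamma) * math.log(p)


def unit_weight_black_loss(p):
    """Black-sample term with the second learning weight fixed at 1."""
    return -math.log(p)


p, gamma = 0.8, 2.0
focal = focal_black_loss(p, gamma)
proposed = unit_weight_black_loss(p)
print(f"focal: {focal:.4f}, unit weight: {proposed:.4f}, ratio: {proposed / focal:.1f}")
```

With these inputs the focal-loss weight is 0.04, so the unit-weight term is 25 times larger for the same black data sample.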

Finally, in step 107, the risk transaction recognition model continues to be trained using the loss function.

The process of training the risk transaction identification model is targeted to minimize the above-mentioned loss function. Specifically, on the basis of the loss function, in each iteration process, the value of the loss function is used for back propagation, and the model parameters of the risk transaction identification model are updated until an iteration stop condition is reached. Where the iteration stop condition may be, for example, a loss function convergence, a number of iterations reaching a preset number threshold, etc.
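The iteration described above can be sketched with a deliberately small model. Everything here is an assumption made for illustration: a one-feature logistic model stands in for the deep learning network, the loss is taken as `-y*log(p) - (1-y)*p**gamma*log(1-p)` (second learning weight 1, first learning weight `p**gamma`), gradients are written out by hand instead of using a framework's back propagation, and the toy data, learning rate and tolerance are arbitrary.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def loss_and_grad(y, z, gamma):
    """Loss -y*log(p) - (1-y)*p**gamma*log(1-p) and its derivative w.r.t. the logit z."""
    p = min(max(sigmoid(z), 1e-9), 1.0 - 1e-9)  # keep log() finite
    loss = -y * math.log(p) - (1 - y) * p ** gamma * math.log(1.0 - p)
    dloss_dp = -y / p - (1 - y) * (
        gamma * p ** (gamma - 1) * math.log(1.0 - p) - p ** gamma / (1.0 - p)
    )
    return loss, dloss_dp * p * (1.0 - p)  # chain rule: dp/dz = p * (1 - p)


def train(xs, ys, gamma=2.0, lr=0.1, max_iters=2000, tol=1e-7):
    """Full-batch gradient descent on a one-feature logistic model."""
    w, b, history = 0.0, 0.0, []
    n = len(xs)
    for _ in range(max_iters):  # stop condition 1: iteration count threshold
        total, grad_w, grad_b = 0.0, 0.0, 0.0
        for x, y in zip(xs, ys):
            loss, dz = loss_and_grad(y, w * x + b, gamma)
            total += loss
            grad_w += dz * x
            grad_b += dz
        history.append(total / n)
        # Stop condition 2: the mean loss has converged.
        if len(history) > 1 and abs(history[-2] - history[-1]) < tol:
            break
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b, history


# Toy imbalanced data (hypothetical): 200 white samples, 4 black samples.
xs = [-3.0 + 4.0 * i / 199 for i in range(200)] + [1.5, 2.0, 2.5, 3.0]
ys = [0] * 200 + [1] * 4
w, b, history = train(xs, ys)
print(f"iterations: {len(history)}, loss: {history[0]:.4f} -> {history[-1]:.4f}")
```

In practice a deep learning framework would compute the gradients automatically; the hand-written derivative is used here only to keep the sketch free of dependencies.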

As shown in fig. 3, an embodiment of the present specification provides a risk transaction identification method, which may include the following steps:

step 301: acquiring transaction data to be identified, wherein the transaction data is to be risk identified;

step 303: inputting transaction data to be identified into a risk transaction identification model to obtain an identification result output by the risk transaction identification model; the risk transaction identification model is obtained by training by using a training method of the risk transaction identification model provided by any embodiment of the specification.

Because the risk transaction identification model is trained with a loss function that increases the learning weight of the black data samples, it accounts for the tendency of the learned information to lean toward the label value of the white data samples when the black data samples are far fewer than the white data samples, and the accuracy of identifying risk transactions can therefore be improved.

As shown in fig. 4, an embodiment of the present specification provides a training apparatus for a risk transaction identification model, including: an acquisition module 401, an input module 402, a determination module 403 and a training module 404;

an acquisition module 401 configured to acquire black data samples and white data samples; the classification label of the black data sample is risk transaction, and the classification label of the white data sample is non-risk transaction;

an input module 402, configured to input the black data samples and the white data samples acquired by the acquisition module 401 into the currently trained risk transaction identification model, so as to obtain identification results of the black data samples and the white data samples;

a determining module 403, configured to determine a loss function according to the recognition result of each black data sample and white data sample obtained by the input module 402; the loss function can improve the learning weight of the black data sample;

a training module 404 configured to continue training the risk transaction identification model using the loss function determined by the determination module 403.

In one possible implementation, the recognition result includes: a probability value that the label of the data sample is risk transaction;

the determining module 403, when determining the loss function according to the recognition result of each black data sample and white data sample, is configured to perform the following operations:

a loss function is determined based on the first learning weight and the second learning weight.

In one possible implementation, the determining module 403, when determining the first learning weight for the white data samples, is configured to perform the following operations:

a first learning weight is determined based on the probability value of the risk transaction.

In one possible implementation, the determining module 403, when determining the loss function according to the first learning weight and the second learning weight, is configured to perform the following operations:

taking the second learning weight as the weight value for training the black data samples to obtain a black sample loss term;

taking the first learning weight as the weight value for training the white data samples to obtain a white sample loss term;

and calculating the sum of the black sample loss term and the white sample loss term to obtain a loss function.
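The composition of the loss from a black sample term and a white sample term can be illustrated as follows. The calculation formula itself is not reproduced in this text, so the sketch rests on assumptions: the black weight is fixed at 1 (as the claims state), while the white weight is taken to be the predicted risk probability raised to the power γ, a focal-loss-style form chosen only because the text says the white weight is determined from the probability value. The function name `weighted_loss` and the `eps` guard are hypothetical.

```python
import math

def weighted_loss(probs, labels, gamma=1.0, eps=1e-12):
    """Sum of a black sample loss term and a white sample loss term.

    probs  -- predicted probability of risk transaction for each sample
    labels -- 1 for a black data sample, 0 for a white data sample
    gamma  -- balance parameter (assumed to act as an exponent)
    """
    black_term = 0.0
    white_term = 0.0
    for p, y in zip(probs, labels):
        if y == 1:
            black_weight = 1.0  # stated in the claims
            black_term -= black_weight * math.log(p + eps)
        else:
            white_weight = p ** gamma  # assumed form, not from the source
            white_term -= white_weight * math.log(1.0 - p + eps)
    return black_term + white_term

# A confidently correct white sample (p = 0.1) contributes far less
# when gamma = 2 than under plain cross-entropy (gamma = 0).
easy_white_weighted = weighted_loss([0.1], [0], gamma=2.0)
easy_white_plain = weighted_loss([0.1], [0], gamma=0.0)
```

Under this assumed form every white weight lies between 0 and 1, so the black samples, whose weight stays at 1, carry relatively more of the total loss even when they are far fewer.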

In one possible implementation, the determining module 403, when determining the loss function according to the recognition result of each black data sample and white data sample, is configured to calculate the loss function by using the following calculation formula:

wherein L characterizes the loss function; y characterizes the label value corresponding to the classification label of the black data sample; the probability term in the formula characterizes the probability value, output by the currently trained risk identification model, that the classification label is risk transaction; and γ characterizes a parameter for balancing the attention of the risk transaction identification model between the black data samples and the white data samples.

In one possible implementation, in the loss function determined by the determining module 403, the value of γ is not greater than a first parameter; the first parameter is the base-10 logarithm of the ratio of the number of white data samples to the number of black data samples.

In one possible implementation, the ratio of the number of the black data samples to the number of the white data samples acquired by the acquisition module 401 is not greater than a second preset threshold.
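The two numerical constraints above, the upper bound on γ and the cap on the black-to-white ratio, can be checked with a short helper. The function names, the sample counts and the threshold value 0.01 are hypothetical and chosen only for illustration.

```python
import math

def gamma_upper_bound(n_white, n_black):
    """First parameter: base-10 logarithm of the white-to-black sample ratio."""
    if n_white <= 0 or n_black <= 0:
        raise ValueError("sample counts must be positive")
    return math.log10(n_white / n_black)

def ratio_within_threshold(n_black, n_white, threshold):
    """True if the black-to-white ratio does not exceed the preset threshold."""
    return n_black / n_white <= threshold

# Example: 100 black samples against 100,000 white samples.
bound = gamma_upper_bound(100_000, 100)             # about 3.0
gamma = min(2.0, bound)                             # any gamma <= bound is allowed
ok = ratio_within_threshold(100, 100_000, 0.01)     # 0.001 <= 0.01
```

With a 1000:1 imbalance the bound is about 3, so γ may take any value up to roughly 3; a milder imbalance gives a smaller bound.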

As shown in fig. 5, an embodiment of the present specification further provides a risk transaction identification apparatus, including: a to-be-identified transaction acquisition module 501 and an identification module 502;

the to-be-identified transaction acquisition module 501 is configured to acquire the transaction data on which risk identification is to be performed;

the identification module 502 is configured to input the transaction data to be identified, acquired by the to-be-identified transaction acquisition module 501, into the risk transaction identification model to obtain an identification result output by the risk transaction identification model; the risk transaction identification model is trained by using the training apparatus for the risk transaction identification model provided in any embodiment of the specification.
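The two-module identification apparatus can be pictured as follows. This is a hedged sketch only: `RiskTransactionModel` is a hypothetical stand-in for a trained model, the feature fields `amount` and `is_night`, the fixed weights and the 0.5 decision threshold are all assumptions, and the specification does not prescribe any of them.

```python
import math

class RiskTransactionModel:
    """Hypothetical stand-in for a trained risk transaction identification model."""

    def __init__(self, weights, bias):
        self.weights = weights
        self.bias = bias

    def predict_proba(self, features):
        # Logistic score: probability that the transaction is a risk transaction.
        z = sum(w * x for w, x in zip(self.weights, features)) + self.bias
        return 1.0 / (1.0 + math.exp(-z))

def acquire_transaction(raw):
    """Module 501: turn a raw transaction record into model features."""
    return [float(raw["amount"]) / 1000.0, float(raw["is_night"])]

def identify_transaction(model, features, threshold=0.5):
    """Module 502: feed the features to the model and return the result."""
    p = model.predict_proba(features)
    return {"risk_probability": p, "is_risk": p >= threshold}

model = RiskTransactionModel(weights=[1.0, 2.0], bias=-3.0)
high = identify_transaction(model, acquire_transaction({"amount": 5000, "is_night": 1}))
low = identify_transaction(model, acquire_transaction({"amount": 100, "is_night": 0}))
```

Keeping acquisition and identification in separate functions mirrors the split between modules 501 and 502, so the model can be swapped without touching how transaction data is gathered.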

The present specification also provides a computer-readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of any of the embodiments of the specification.

The present specification also provides a computing device comprising a memory having stored therein executable code and a processor that, when executing the executable code, implements the method of any of the embodiments of the specification.

It is to be understood that the structures illustrated in the embodiments of this specification do not constitute a specific limitation on the training device of the risk transaction identification model or on the risk transaction identification device. In other embodiments of the specification, the training device of the risk transaction identification model and the risk transaction identification device may include more or fewer components than shown, may combine some components, may split some components, or may arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

Because the information interaction and execution processes between the units in the above apparatuses are based on the same concept as the method embodiments of this specification, reference may be made to the description of the method embodiments for the specific details, which are not repeated here.

Those skilled in the art will recognize that, in one or more of the examples described above, the functions described in this specification can be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, the functions may be stored on a computer-readable medium, or transmitted over it, as one or more instructions or code.

The specific embodiments above describe in further detail the objectives, technical solutions, and advantages of this specification. It should be understood that they are only specific embodiments of the present invention and are not intended to limit its scope; any modifications, equivalent substitutions, improvements, and the like made on the basis of the technical solutions of the present invention shall fall within the scope of the present invention.

Claims

1. A training method for a risk transaction identification model, comprising:

continuing to train the risk transaction identification model by using the loss function;

the loss function is calculated using the following calculation:

wherein L characterizes the loss function; y characterizes the label value corresponding to the classification label of the black data sample; the probability term in the formula characterizes the probability value, output by the currently trained risk identification model, that the classification label is risk transaction; and γ characterizes a parameter for balancing the attention of the risk transaction identification model between the black data samples and the white data samples;

wherein the learning weight of the black data sample is 1;

if the data sample is a black data sample, then the label value y =1; if the data sample is a white data sample, then the label value y =0.

2. The method of claim 1, wherein the recognition result comprises: the label of the data sample is a probability value of risk transaction;

3. The method of claim 2, wherein the determining a first learning weight for the white data sample comprises:

4. The method of claim 2, wherein the determining the loss function from the first learning weight and the second learning weight comprises:

5. The method of claim 1, wherein the value of γ is not greater than a first parameter; the first parameter is the base-10 logarithm of the ratio of the number of the white data samples to the number of the black data samples.

6. The method according to any one of claims 1 to 5, wherein a value of a ratio of the number of black data samples to the number of white data samples is not greater than a second preset threshold.

7. A method of risk transaction identification, comprising:

inputting the transaction data to be identified into the risk transaction identification model to obtain an identification result output by the risk transaction identification model; wherein the risk transaction identification model is trained using the method of any one of claims 1 to 6.

8. A training device for a risk transaction identification model, comprising: an acquisition module, an input module, a determining module, and a training module;

the training module is configured to continue training the risk transaction identification model by using the loss function determined by the determining module;

the determining module, when determining the loss function according to the recognition result of each black data sample and white data sample, is configured to calculate the loss function using the following calculation formula:

wherein L characterizes the loss function; y characterizes the label value corresponding to the classification label of the black data sample; the probability term in the formula characterizes the probability value, output by the currently trained risk identification model, that the classification label is risk transaction; and γ characterizes a parameter for balancing the attention of the risk transaction identification model between the black data samples and the white data samples;

wherein the learning weight of the black data sample is 1;

9. A risk transaction identification apparatus, comprising: a to-be-identified transaction acquisition module and an identification module;

the identification module is configured to input the transaction data to be identified, acquired by the to-be-identified transaction acquisition module, into the risk transaction identification model to obtain an identification result output by the risk transaction identification model; wherein the risk transaction identification model is trained by using the training device of the risk transaction identification model according to claim 8.

10. A computing device comprising a memory having executable code stored therein and a processor that, when executing the executable code, implements the method of any of claims 1-7.