CN112101488B - Training method and device for machine learning model and storage medium - Google Patents

Training method and device for machine learning model and storage medium

Info

Publication number
CN112101488B
CN112101488B
Authority
CN
China
Prior art keywords
value
label
training image
training
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011291240.3A
Other languages
Chinese (zh)
Other versions
CN112101488A (en)
Inventor
白亚龙
张炜
梅涛
周伯文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Wodong Tianjun Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Wodong Tianjun Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Wodong Tianjun Information Technology Co Ltd filed Critical Beijing Jingdong Century Trading Co Ltd
Priority to CN202011291240.3A priority Critical patent/CN112101488B/en
Publication of CN112101488A publication Critical patent/CN112101488A/en
Application granted granted Critical
Publication of CN112101488B publication Critical patent/CN112101488B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Abstract

The disclosure provides a training method and a training device for a machine learning model, and a computer-readable storage medium, and relates to the technical field of artificial intelligence. The training method of the machine learning model comprises the following steps: inputting a training image into the machine learning model to obtain a prediction confidence of the training image; inputting the prediction confidence and the true label value of the training image into an evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, and the evaluation index is represented by the prediction confidence and the true label value of the training image; and adjusting each parameter variable in the machine learning model according to the loss function value. The evaluation index of the machine learning model can thereby be effectively improved.

Description

Training method and device for machine learning model and storage medium
Technical Field
The present disclosure relates to the field of artificial intelligence technologies, and in particular, to a training method and a training apparatus for a machine learning model, and a computer-readable storage medium.
Background
The loss function is a non-negative real-valued function that measures the discrepancy between the predicted values and the true values of a machine learning model. The smaller the loss function value, the better the robustness of the machine learning model. Generally, the more appropriately the loss function is selected, the better the performance of the machine learning model. Therefore, the loss function is strongly related to the working performance of the machine learning model and to the effect of the machine learning task.
Disclosure of Invention
The technical problem solved by the present disclosure is how to effectively improve the evaluation index of the machine learning model.
According to one aspect of the present disclosure, there is provided a training method of a machine learning model, including: inputting a training image into the machine learning model to obtain a prediction confidence of the training image; inputting the prediction confidence and the true label value of the training image into an evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, and the evaluation index is represented by the prediction confidence and the true label value of the training image; and adjusting each parameter variable in the machine learning model according to the loss function value.
In some embodiments, the training method further comprises: constructing a derivable evaluation index loss function with the prediction confidence and the true label value of the training image as independent variables. In some embodiments, the constructing comprises: representing quantized values corresponding to different image labeling situations by using the prediction confidence and the true label value of the training image, wherein each image labeling situation comprises a different image label combination, and an image label combination comprises the true label value and the predicted label value of the training image; expressing the evaluation index by using the quantized values corresponding to the different image labeling situations; and constructing the derivable evaluation index loss function by using the evaluation index.
In some embodiments, the different image labeling situations include, in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive; the true label value is negative and the predicted label value is positive; the true label value is positive and the predicted label value is negative; and the true label value is negative and the predicted label value is negative.
In some embodiments, representing the quantized values corresponding to the different image labeling situations by using the prediction confidence and the true label value of the training images comprises:

TP = \sum_{x} I(x) \cdot y(x)

where TP represents the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, representing the quantized values corresponding to the different image labeling situations by using the prediction confidence and the true label value of the training images comprises:

FN = \sum_{x} (1 - I(x)) \cdot y(x)

where FN represents the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is negative; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, representing the quantized values corresponding to the different image labeling situations by using the prediction confidence and the true label value of the training images comprises:

FP = \sum_{x} I(x) \cdot (1 - y(x))

where FP represents the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is positive; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, representing the quantized values corresponding to the different image labeling situations by using the prediction confidence and the true label value of the training images comprises:

TN = \sum_{x} (1 - I(x)) \cdot (1 - y(x))

where TN represents the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is negative; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, adjusting the respective parameter variables in the machine learning model according to the loss function values comprises: training the machine learning model by adopting a cross entropy loss function so as to make the machine learning model converge; and training the converged machine learning model by adopting the evaluation index loss function so as to adjust each parameter variable in the machine learning model according to the loss function value of the evaluation index loss function.
In some embodiments, the evaluation metrics include accuracy, precision, recall, F-score.
According to another aspect of the present disclosure, there is provided a training apparatus of a machine learning model, including: a confidence obtaining module configured to input a training image into the machine learning model and obtain a prediction confidence of the training image; a function value obtaining module configured to input the prediction confidence and the true label value of the training image into an evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, and the evaluation index is represented by the prediction confidence and the true label value of the training image; and a variable adjustment module configured to adjust each parameter variable in the machine learning model according to the loss function value.
In some embodiments, the training apparatus further comprises a function building module configured to: and constructing a derivable evaluation index loss function by taking the prediction confidence coefficient and the label true value of the training image as independent variables.
In some embodiments, the function building module is configured to: represent quantized values corresponding to different image labeling situations by using the prediction confidence and the true label value of the training image, wherein each image labeling situation comprises a different image label combination, and an image label combination comprises the true label value and the predicted label value of the training image; express the evaluation index by using the quantized values corresponding to the different image labeling situations; and construct a derivable evaluation index loss function by using the evaluation index.
In some embodiments, the different image labeling situations include, in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive; the true label value is negative and the predicted label value is positive; the true label value is positive and the predicted label value is negative; and the true label value is negative and the predicted label value is negative.
In some embodiments, the function building module is configured to express the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive in the following way:

TP = \sum_{x} I(x) \cdot y(x)

where TP represents the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module is configured to express the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is negative in the following way:

FN = \sum_{x} (1 - I(x)) \cdot y(x)

where FN represents the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is negative; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module is configured to express the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is positive in the following way:

FP = \sum_{x} I(x) \cdot (1 - y(x))

where FP represents the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is positive; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module is configured to express the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is negative in the following way:

TN = \sum_{x} (1 - I(x)) \cdot (1 - y(x))

where TN represents the quantized value corresponding to the case where the true label value of the training image is negative and the predicted label value is negative; x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the variable adjustment module is configured to: training the machine learning model by adopting a cross entropy loss function so as to make the machine learning model converge; and training the converged machine learning model by adopting the evaluation index loss function so as to adjust each parameter variable in the machine learning model according to the loss function value of the evaluation index loss function.
In some embodiments, the evaluation metrics include accuracy, precision, recall, F-score.
According to yet another aspect of the present disclosure, there is provided a training apparatus for a machine learning model, including: a memory; and a processor coupled to the memory, the processor configured to perform the aforementioned method of training a machine learning model based on instructions stored in the memory.
According to yet another aspect of the present disclosure, a computer-readable storage medium is provided, wherein the computer-readable storage medium stores computer instructions, which when executed by a processor, implement the aforementioned training method of a machine learning model.
The evaluation index of the machine learning model can be effectively improved.
Other features of the present disclosure and advantages thereof will become apparent from the following detailed description of exemplary embodiments thereof, which proceeds with reference to the accompanying drawings.
Drawings
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the related art, the drawings required for the description of the embodiments or the related art are briefly introduced below. The drawings in the following description are only some embodiments of the present disclosure; for those skilled in the art, other drawings may be obtained from them without inventive effort.
Figure 1 shows the quantitative relationship between accuracy and the loss function values of the cross-entropy loss function.
Fig. 2 illustrates a flow diagram of a method of training a machine learning model of some embodiments of the present disclosure.
FIG. 3 illustrates some embodiments of constructing a derivable evaluation index loss function.
Figure 4 shows the magnitude relation between accuracy and the loss function values of the accuracy loss function.
Fig. 5 shows a schematic structural diagram of a training apparatus of a machine learning model according to some embodiments of the present disclosure.
Fig. 6 is a schematic structural diagram of a training apparatus for machine learning model according to further embodiments of the present disclosure.
Detailed Description
The technical solutions in the embodiments of the present disclosure will be described clearly and completely below with reference to the drawings in the embodiments of the present disclosure. The described embodiments are only a part of the embodiments of the present disclosure, not all of them. The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application, or uses. All other embodiments obtained by a person skilled in the art from the embodiments disclosed herein without creative effort shall fall within the protection scope of the present disclosure.
The evaluation index is an index used for measuring the performance of the machine learning model in machine learning and is also a final evaluation standard of the machine learning task. Common evaluation metrics include accuracy, precision, recall, and F-score (F-score).
The logarithmic, quadratic, and hinge loss functions are all conventional loss functions. The inventors have found through research that, with these conventional loss functions, the loss function value is not monotonically related to the evaluation index of the machine learning model. That is, a smaller loss function value does not necessarily mean a higher evaluation index. For example, the cross-entropy loss function is often chosen in classification tasks. Assuming that p is the probability distribution of the desired output, q is the probability distribution of the actual output, H(p, q) represents the cross entropy, and x represents the input data, there is formula (1):
H(p, q) = -\sum_{x} p(x) \log q(x)    (1)
The evaluation index of a classification task generally adopts the accuracy. Figure 1 shows the magnitude relationship between the accuracy and the loss function value of the cross-entropy loss function. By enumerating a number of discretized instances, as shown in FIG. 1, there is no monotonically decreasing relationship between the accuracy and the loss function value of the cross-entropy loss function. That is, when the machine learning model is optimized so that the loss function value becomes lower, the classification accuracy of the machine learning model may nevertheless become lower. In view of this, the present disclosure provides a training method for a machine learning model, which can effectively improve the evaluation index of the machine learning model.
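This non-monotonicity can be reproduced with a small numerical example. The sketch below uses hypothetical confidences (not data from the patent): the second set of predictions attains a lower mean binary cross-entropy while its thresholded accuracy is also lower.

```python
import math

def cross_entropy(conf, labels):
    # mean binary cross-entropy over a batch of confidences in (0, 1)
    return -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for p, y in zip(conf, labels)) / len(labels)

def accuracy(conf, labels):
    # hard accuracy: threshold each confidence at 0.5
    preds = [1 if p >= 0.5 else 0 for p in conf]
    return sum(int(p == y) for p, y in zip(preds, labels)) / len(labels)

labels = [1, 1, 0]
model_a = [0.6, 0.6, 0.4]     # every prediction correct, but not confident
model_b = [0.95, 0.95, 0.55]  # one prediction wrong, the others very confident

# model B attains a lower cross-entropy loss yet also a lower accuracy
assert cross_entropy(model_b, labels) < cross_entropy(model_a, labels)
assert accuracy(model_b, labels) < accuracy(model_a, labels)
```

So driving the cross-entropy loss down is not guaranteed to drive the accuracy up, which is exactly the gap the evaluation index loss function is meant to close.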
Some embodiments of the training method of the machine learning model of the present disclosure are first described in conjunction with fig. 2.
Fig. 2 illustrates a flow diagram of a method of training a machine learning model of some embodiments of the present disclosure. As shown in fig. 2, the method includes steps S201 to S205.
In step S201, a training image is input to the machine learning model, and a prediction confidence of the training image is obtained.
In step S203, the prediction confidence and the label true value of the training image are input to the evaluation index loss function, and a loss function value is obtained.
The loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, and the evaluation index is represented by the prediction confidence and the true label value of the training image. The evaluation index may be, for example, the accuracy, precision, recall, or F-score.
In step S205, each parameter variable in the machine learning model is adjusted according to the loss function value.
In some embodiments, in order to adjust each parameter variable in the machine learning model more quickly and thereby make the machine learning model converge quickly, the machine learning model may first be trained with a loss function such as cross entropy, adjusting each parameter variable accordingly. After the machine learning model has converged, it is then trained with the evaluation index loss function, and each parameter variable is fine-tuned according to the loss function value of the evaluation index loss function.
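A minimal sketch of this two-stage schedule on a toy logistic-regression model (the model, data, and hyperparameters are illustrative assumptions; the patent does not prescribe a concrete architecture): cross-entropy gradients drive the warm-up stage, then gradients of the binary soft-accuracy loss L = 1 - (1/N) * sum(I*y + (1 - I)*(1 - y)) fine-tune the parameters.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(xs, ys, warmup_epochs=200, finetune_epochs=200, lr=0.5):
    """Stage 1: cross-entropy warm-up; stage 2: fine-tune with the soft
    accuracy loss L = 1 - (1/N) * sum(I*y + (1 - I)*(1 - y))."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(warmup_epochs):          # stage 1: reach rough convergence
        for x, y in zip(xs, ys):
            p = sigmoid(w * x + b)
            g = p - y                       # d(cross-entropy)/d(logit)
            w -= lr * g * x
            b -= lr * g
    for _ in range(finetune_epochs):        # stage 2: metric-driven fine-tuning
        for x, y in zip(xs, ys):
            p = sigmoid(w * x + b)
            # dL/dI = -(2y - 1)/N, and dI/d(logit) = p * (1 - p)
            g = -(2 * y - 1) / n * p * (1 - p)
            w -= lr * g * x
            b -= lr * g
    return w, b

xs = [-2.0, -1.0, 1.0, 2.0]                 # toy 1-D features
ys = [0, 0, 1, 1]                           # separable binary labels
w, b = train(xs, ys)
acc = sum(int((sigmoid(w * x + b) >= 0.5) == bool(y))
          for x, y in zip(xs, ys)) / len(xs)
```

The warm-up stage supplies strong gradients early on; the fine-tuning stage then optimizes the quantity actually reported at evaluation time.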
In this embodiment, since the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, training the machine learning model with the evaluation index loss function can effectively improve the evaluation index of the machine learning model, that is, the performance of the trained machine learning model on that evaluation index.
In some embodiments, the training method further comprises step S202. In step S202, a derivable evaluation index loss function is constructed with the prediction confidence and the true label value of the training image as arguments.
The inventors also studied the conventional loss functions further. Assuming that the prediction confidence of the machine learning model for a training image is I, the conventional training method usually converts I into 0 or 1 by comparing it with a preset threshold or by softmax logistic regression, and then compares the result with the true label value. Therefore, if the evaluation index is calculated directly in the conventional way, its calculation formula is not derivable, and consequently the evaluation index loss function is not derivable either. A machine learning model based on the back-propagation algorithm usually needs to compute the gradient of the loss function, so the evaluation index loss function is required to be a derivable function, and the conventional way of calculating the evaluation index needs to be adjusted. The detailed implementation of step S202 is described below with reference to fig. 3.
FIG. 3 illustrates some embodiments of constructing a derivable evaluation index loss function. As shown in fig. 3, step S202 specifically includes steps S3021 to S3023.
In step S3021, the prediction confidence and the true label value of the training image are used to represent the quantization values corresponding to different image labeling situations.
Wherein each image labeling situation comprises a different image label combination, and an image label combination comprises the true label value and the predicted label value of the training image. For example, the different image labeling situations include, in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive (prediction correct); the true label value is negative and the predicted label value is positive (prediction incorrect); the true label value is positive and the predicted label value is negative (prediction incorrect); and the true label value is negative and the predicted label value is negative (prediction correct).
In order to make the calculation formula of the evaluation index derivable without changing the meaning of the evaluation index, the discretized processing in the calculation formula needs to be converted into continuous processing. That is, I is not directly converted to 0 or 1; instead, I is regarded as the probability of the label being 1 and (1 - I) as the probability of the label being 0, so that the evaluation index is converted from discrete to continuous by means of probability decomposition. Suppose that TP represents the quantized value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive, FN represents the quantized value for a positive true label value and a negative predicted label value, FP represents the quantized value for a negative true label value and a positive predicted label value, and TN represents the quantized value for a negative true label value and a negative predicted label value. Then there are the following formulas (2) to (5):
TP = \sum_{x} I(x) \cdot y(x)    (2)

FN = \sum_{x} (1 - I(x)) \cdot y(x)    (3)

FP = \sum_{x} I(x) \cdot (1 - y(x))    (4)

TN = \sum_{x} (1 - I(x)) \cdot (1 - y(x))    (5)
where x represents each sample training image in a batch of training images; I(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
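Formulas (2) to (5) amount to a soft confusion matrix: each confidence contributes fractionally to the "predicted positive" and "predicted negative" counts instead of being thresholded. A short sketch (function name and sample values are illustrative):

```python
def soft_confusion_matrix(conf, labels):
    """Soft (derivable) confusion-matrix entries in the spirit of (2)-(5):
    each confidence I contributes I to the predicted-positive counts and
    (1 - I) to the predicted-negative counts, with no thresholding."""
    tp = sum(p * y for p, y in zip(conf, labels))
    fn = sum((1 - p) * y for p, y in zip(conf, labels))
    fp = sum(p * (1 - y) for p, y in zip(conf, labels))
    tn = sum((1 - p) * (1 - y) for p, y in zip(conf, labels))
    return tp, fn, fp, tn
```

Because every entry is a polynomial in the confidences, each is differentiable with respect to I(x), and the four entries always sum to the batch size.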
In step S3022, the evaluation index is expressed by using the quantized values corresponding to the different image labeling situations.
For example, when the evaluation index is the accuracy Acc, there is formula (6):

Acc = (TP + TN) / (TP + FN + FP + TN)    (6)
as yet another example of this type of device,the evaluation index is the accuracy
Figure 682925DEST_PATH_IMAGE020
When, there is formula (7):
Figure 314763DEST_PATH_IMAGE021
(7)
as another example, the evaluation index is recall
Figure 183362DEST_PATH_IMAGE022
When, there is formula (8):
Figure 610189DEST_PATH_IMAGE023
(8)
also, for example, the evaluation index is F-score
Figure 273251DEST_PATH_IMAGE024
When, there is formula (9):
Figure 779669DEST_PATH_IMAGE025
(9)
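The four soft metrics can be computed together from the soft confusion-matrix entries. A self-contained sketch (the function name and test values are illustrative assumptions):

```python
def soft_metrics(conf, labels):
    # soft confusion-matrix entries, in the spirit of formulas (2)-(5)
    tp = sum(p * y for p, y in zip(conf, labels))
    fn = sum((1 - p) * y for p, y in zip(conf, labels))
    fp = sum(p * (1 - y) for p, y in zip(conf, labels))
    tn = sum((1 - p) * (1 - y) for p, y in zip(conf, labels))
    acc = (tp + tn) / (tp + fn + fp + tn)                    # formula (6)
    precision = tp / (tp + fp)                               # formula (7)
    recall = tp / (tp + fn)                                  # formula (8)
    f_score = 2 * precision * recall / (precision + recall)  # formula (9)
    return acc, precision, recall, f_score
```

When every confidence is exactly 0 or 1, these soft metrics coincide with the usual thresholded definitions; in between, they interpolate smoothly, which is what makes them usable as loss terms.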
in step S3023, a derivable evaluation index loss function is constructed using the evaluation index.
For example, after obtaining the continuously derivable accuracy calculation by formula (6), the continuously derivable accuracy loss function L_Acc can be obtained by subtracting the accuracy from 1, i.e. formula (10):

L_Acc = 1 - (TP + TN) / (TP + FN + FP + TN)    (10)
the formula (10) provides that under the common two-classification scene, the calculation of the accuracy is adjusted to be in a continuously-derivable form, so that the accuracy loss function is optimized as the target loss function. Based on similar principle, under multi-classification scene, continuously derivable accuracy loss function
Figure 301677DEST_PATH_IMAGE028
As shown in formula (11):
Figure 95058DEST_PATH_IMAGE029
(11)
wherein x represents each sample training image in a batch of training images; n represents the number of samples in a batch of training images;
Figure 633356DEST_PATH_IMAGE003
representing the prediction confidence of a single training image, and the value interval is [0,1 ]];
Figure 537506DEST_PATH_IMAGE004
And the true value of the label of a single training image is represented, and the value is 0 or 1. Those skilled in the art will understand that the calculation formula given in the above embodiments is only an example, and those skilled in the art can flexibly adjust the calculation formula according to actual needs.
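A multi-class soft accuracy loss of this shape reduces to 1 minus the mean confidence assigned to each sample's true class. A brief sketch (function name and values are illustrative; the exact per-class form is an assumption consistent with the surrounding definitions):

```python
def soft_accuracy_loss(conf, labels):
    """Continuously derivable accuracy loss in the spirit of formula (11):
    conf[i][c] is the predicted confidence of sample i for class c, and
    labels[i][c] is the one-hot true label. The loss is 1 minus the mean
    confidence assigned to the true class, so it decreases monotonically
    as the (soft) accuracy increases."""
    n = len(conf)
    soft_acc = sum(sum(p * y for p, y in zip(row, one_hot))
                   for row, one_hot in zip(conf, labels)) / n
    return 1.0 - soft_acc
```

Since the loss is linear in the confidences, its gradient with respect to each I_c(x) is simply -y_c(x)/N, which back-propagation can use directly.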
Figure 4 shows the magnitude relationship between the accuracy and the loss function value of the accuracy loss function. As can be seen from fig. 4, after the derivable evaluation index is represented by the prediction confidence and the true label value of the training image and the derivable evaluation index loss function is constructed from it, the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases. Evaluation results on object recognition data sets such as FGVC-iMET2020, AliProduct and ImageNet show that, after the machine learning model is trained with the training method provided by the present disclosure, evaluation indexes such as the accuracy of image recognition by the machine learning model can be improved by at least 0.3%-0.5%.
Some embodiments of the training apparatus of the machine learning model of the present disclosure are described below in conjunction with fig. 5.
Fig. 5 shows a schematic structural diagram of a training apparatus of a machine learning model according to some embodiments of the present disclosure. As shown in fig. 5, the training apparatus 50 for a machine learning model includes: a confidence obtaining module 501 configured to input a training image into the machine learning model to obtain a prediction confidence of the training image; a function value obtaining module 503 configured to input the prediction confidence and the true label value of the training image into an evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, and the evaluation index is represented by the prediction confidence and the true label value of the training image; and a variable adjustment module 505 configured to adjust each parameter variable in the machine learning model according to the loss function value.
In this embodiment, since the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases, training the machine learning model with the evaluation index loss function can effectively improve the evaluation index of the machine learning model, that is, the performance of the trained machine learning model on that evaluation index.
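As a hedged illustration of how the three modules interact, the toy sketch below replaces the machine learning model with a one-parameter logistic predictor; the function names, learning rate, and step count are assumptions for illustration only, not details from the patent.

```python
import math

# Toy sketch of the three-module flow: confidence obtaining (501) ->
# function value obtaining (503) -> variable adjustment (505).

def predict_confidence(w, x):
    """Confidence obtaining module: map an input to a confidence in [0, 1]."""
    return 1.0 / (1.0 + math.exp(-w * x))

def loss_value(p, y):
    """Function value obtaining module: 1 - soft accuracy for one sample."""
    return 1.0 - (p * y + (1.0 - p) * (1.0 - y))

def adjust_parameter(w, x, y, lr=0.5):
    """Variable adjustment module: one gradient-descent step on w."""
    p = predict_confidence(w, x)
    dloss_dp = -(2.0 * y - 1.0)   # derivative of the loss w.r.t. p
    dp_dw = p * (1.0 - p) * x     # logistic derivative w.r.t. w
    return w - lr * dloss_dp * dp_dw

w = 0.0
for _ in range(50):               # repeated updates drive the loss down
    w = adjust_parameter(w, x=1.0, y=1)
assert loss_value(predict_confidence(w, 1.0), 1) < 0.1
```

In the apparatus of fig. 5 the same loop runs over batches of training images, with the gradient computed through all parameter variables of the machine learning model rather than a single scalar weight.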
In some embodiments, the training apparatus further comprises a function building module 502 configured to construct a derivable evaluation index loss function with the prediction confidence and the true label value of the training image as independent variables.
In some embodiments, the function building module 502 is configured to: represent the quantization values corresponding to different image marking conditions with the prediction confidence and the true label value of the training image, where each image marking condition comprises a different image marking combination, and each image marking combination comprises the true label value and the predicted label value of the training image; express the evaluation index with the quantization values corresponding to the different image marking conditions; and construct a derivable evaluation index loss function from the evaluation index.
In some embodiments, the different image marking conditions include, in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive; the true label value is negative and the predicted label value is positive; the true label value is positive and the predicted label value is negative; and the true label value is negative and the predicted label value is negative.
In some embodiments, the function building module 502 is configured to express, in the following manner, the quantization value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive:

TP = Σ_{x∈X} p(x)·y(x)    (2)

wherein TP represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module 502 is configured to express, in the following manner, the quantization value corresponding to the case where the true label value of the training image is positive and the predicted label value is negative:

FN = Σ_{x∈X} (1 − p(x))·y(x)    (3)

wherein FN represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module 502 is configured to express, in the following manner, the quantization value corresponding to the case where the true label value of the training image is negative and the predicted label value is positive:

FP = Σ_{x∈X} p(x)·(1 − y(x))    (4)

wherein FP represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
In some embodiments, the function building module 502 is configured to express, in the following manner, the quantization value corresponding to the case where the true label value of the training image is negative and the predicted label value is negative:

TN = Σ_{x∈X} (1 − p(x))·(1 − y(x))    (5)

wherein TN represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
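Reading equations (2)-(5) as soft confusion-matrix counts over a batch, the four quantization values can be sketched as follows; the function name and the list-based batch representation are illustrative assumptions.

```python
# Hedged sketch of the four quantization values of equations (2)-(5),
# read as soft confusion-matrix counts over a batch of training images.

def soft_counts(p, y):
    """p: prediction confidences in [0, 1]; y: true label values in {0, 1}."""
    tp = sum(pi * yi for pi, yi in zip(p, y))                # label +, prediction +
    fn = sum((1 - pi) * yi for pi, yi in zip(p, y))          # label +, prediction -
    fp = sum(pi * (1 - yi) for pi, yi in zip(p, y))          # label -, prediction +
    tn = sum((1 - pi) * (1 - yi) for pi, yi in zip(p, y))    # label -, prediction -
    return tp, fn, fp, tn

# The four soft counts always partition the batch size.
tp, fn, fp, tn = soft_counts([0.8, 0.3, 0.6], [1, 0, 1])
assert abs((tp + fn + fp + tn) - 3) < 1e-9
```

Unlike hard confusion-matrix counts, each quantity here is a smooth function of the confidences, which is what makes the evaluation index built from them derivable.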
In some embodiments, the evaluation indexes include accuracy, precision, recall, and F-score.
In the above embodiments, the derivable evaluation index function is represented by the prediction confidence and the true label value of the training image, and after the derivable evaluation index loss function is constructed from the derivable evaluation index function, the loss function value of the evaluation index loss function decreases monotonically as the evaluation index of the machine learning model increases.
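Under the same soft-count reading, the listed evaluation indexes, and a loss that falls monotonically as each index rises, can be sketched as below; the epsilon guard against division by zero is an illustrative addition not stated in the disclosure.

```python
# Hedged sketch: evaluation indexes written over soft confusion-matrix
# counts (tp, fn, fp, tn), so each index -- and the loss 1 - index -- is
# differentiable in the prediction confidences.

EPS = 1e-9  # illustrative guard for degenerate batches

def soft_metrics(tp, fn, fp, tn):
    accuracy  = (tp + tn) / (tp + fn + fp + tn + EPS)
    precision = tp / (tp + fp + EPS)
    recall    = tp / (tp + fn + EPS)
    f_score   = 2 * precision * recall / (precision + recall + EPS)
    return accuracy, precision, recall, f_score

def evaluation_index_loss(tp, fn, fp, tn, index="f_score"):
    """Loss value decreases monotonically as the chosen index increases."""
    names = ("accuracy", "precision", "recall", "f_score")
    return 1.0 - dict(zip(names, soft_metrics(tp, fn, fp, tn)))[index]

# Perfect soft counts give a near-zero loss for every index.
for name in ("accuracy", "precision", "recall", "f_score"):
    assert evaluation_index_loss(2.0, 0.0, 0.0, 2.0, index=name) < 1e-6
```

Minimizing `1 - index` is the simplest loss with the monotonic property described above; the patent's concrete loss construction may differ in form while sharing that property.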
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units may be a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
Further embodiments of the training apparatus of the machine learning model of the present disclosure are described below in conjunction with FIG. 6.
Fig. 6 is a schematic structural diagram of a training apparatus for machine learning model according to further embodiments of the present disclosure. As shown in fig. 6, the training device 60 for machine learning model includes: a memory 610 and a processor 620 coupled to the memory 610, the processor 620 being configured to perform the method of training a machine learning model in any of the foregoing embodiments based on instructions stored in the memory 610.
Memory 610 may include, for example, system memory, fixed non-volatile storage media, and the like. The system memory stores, for example, an operating system, an application program, a Boot Loader (Boot Loader), and other programs.
The training apparatus 60 for machine learning model may further include an input/output interface 630, a network interface 640, a storage interface 650, and the like. These interfaces 630, 640, 650 and the connections between the memory 610 and the processor 620 may be through a bus 660, for example. The input/output interface 630 provides a connection interface for input/output devices such as a display, a mouse, a keyboard, and a touch screen. The network interface 640 provides a connection interface for various networking devices. The storage interface 650 provides a connection interface for external storage devices such as an SD card and a usb disk.
The present disclosure also includes a computer-readable storage medium having stored thereon computer instructions that, when executed by a processor, implement a method of training a machine learning model in any of the foregoing embodiments.
The aforementioned integrated units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The present disclosure is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances, such that the embodiments of the application described herein may be implemented in orders other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
The above description is only exemplary of the present disclosure and is not intended to limit the present disclosure, so that any modification, equivalent replacement, or improvement made within the spirit and principle of the present disclosure should be included in the scope of the present disclosure.

Claims (14)

1. A method of training a machine learning model, comprising:
inputting the training image into a machine learning model to obtain the prediction confidence of the training image;
processing the prediction confidence and the true label value of the training image to obtain quantization values corresponding to different image marking conditions, wherein each image marking condition comprises a different image marking combination, and each image marking combination comprises the true label value and the predicted label value of the training image;
processing the quantitative values corresponding to different image marking conditions to obtain evaluation indexes of the machine learning model, wherein the evaluation indexes comprise accuracy, precision, recall rate and F score;
constructing a derivable evaluation index loss function by using the evaluation index;
inputting the prediction confidence and the true label value of the training image into the evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index increases;
adjusting various parameter variables in the machine learning model according to the loss function values, including: training the machine learning model by adopting a cross entropy loss function so as to make the machine learning model converge; and training the converged machine learning model by adopting the evaluation index loss function so as to adjust each parameter variable in the machine learning model according to the loss function value of the evaluation index loss function.
2. The training method of claim 1, wherein the different image labeling scenarios comprise:
in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive; the true label value is negative and the predicted label value is positive; the true label value is positive and the predicted label value is negative; and the true label value is negative and the predicted label value is negative.
3. The training method according to claim 2, wherein the processing the prediction confidence and the label true value of the training image to obtain the quantization values corresponding to different image labeling situations comprises:
TP = Σ_{x∈X} p(x)·y(x)

wherein TP represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
4. The training method according to claim 2, wherein the processing the prediction confidence and the label true value of the training image to obtain the quantization values corresponding to different image labeling situations comprises:
FN = Σ_{x∈X} (1 − p(x))·y(x)

wherein FN represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
5. The training method according to claim 2, wherein the processing the prediction confidence and the label true value of the training image to obtain the quantization values corresponding to different image labeling situations comprises:
FP = Σ_{x∈X} p(x)·(1 − y(x))

wherein FP represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
6. The training method according to claim 2, wherein the processing the prediction confidence and the label true value of the training image to obtain the quantization values corresponding to different image labeling situations comprises:
TN = Σ_{x∈X} (1 − p(x))·(1 − y(x))

wherein TN represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
7. A training apparatus for a machine learning model, comprising:
the confidence coefficient acquisition module is configured to input the training image into the machine learning model and obtain the prediction confidence coefficient of the training image;
a function building module configured to: process the prediction confidence and the true label value of the training image to obtain quantization values corresponding to different image marking conditions, wherein each image marking condition comprises a different image marking combination, and each image marking combination comprises the true label value and the predicted label value of the training image; process the quantization values corresponding to the different image marking conditions to obtain evaluation indexes of the machine learning model, wherein the evaluation indexes comprise accuracy, precision, recall rate and F score; and construct a derivable evaluation index loss function from the evaluation indexes;
a function value obtaining module configured to input the prediction confidence and the true label value of the training image into the evaluation index loss function to obtain a loss function value, wherein the loss function value of the evaluation index loss function decreases monotonically as the evaluation index increases;
a variable adjustment module configured to adjust respective parameter variables in the machine learning model according to the loss function values, comprising: training the machine learning model by adopting a cross entropy loss function so as to make the machine learning model converge; and training the converged machine learning model by adopting the evaluation index loss function so as to adjust each parameter variable in the machine learning model according to the loss function value of the evaluation index loss function.
8. The training apparatus of claim 7, wherein the different image tagging conditions comprise:
in a binary image labeling scenario: the true label value of the training image is positive and the predicted label value is positive; the true label value is negative and the predicted label value is positive; the true label value is positive and the predicted label value is negative; and the true label value is negative and the predicted label value is negative.
9. The training apparatus of claim 8, wherein the function building module is configured to:
obtain, in the following manner, the quantization value corresponding to the case where the true label value of the training image is positive and the predicted label value is positive:

TP = Σ_{x∈X} p(x)·y(x)

wherein TP represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
10. The training apparatus of claim 8, wherein the function building module is configured to:
obtain, in the following manner, the quantization value corresponding to the case where the true label value of the training image is positive and the predicted label value is negative:

FN = Σ_{x∈X} (1 − p(x))·y(x)

wherein FN represents the quantization value corresponding to the true label value of the training image being positive and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
11. The training apparatus of claim 8, wherein the function building module is configured to:
obtain, in the following manner, the quantization value corresponding to the case where the true label value of the training image is negative and the predicted label value is positive:

FP = Σ_{x∈X} p(x)·(1 − y(x))

wherein FP represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being positive; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
12. The training apparatus of claim 8, wherein the function building module is configured to:
obtain, in the following manner, the quantization value corresponding to the case where the true label value of the training image is negative and the predicted label value is negative:

TN = Σ_{x∈X} (1 − p(x))·(1 − y(x))

wherein TN represents the quantization value corresponding to the true label value of the training image being negative and the predicted label value being negative; x represents each sample training image in a batch X of training images; p(x) represents the prediction confidence of a single training image, with value interval [0, 1]; and y(x) represents the true label value of a single training image, taking the value 0 or 1.
13. A training apparatus for a machine learning model, comprising:
a memory; and
a processor coupled to the memory, the processor configured to perform the method of training of a machine learning model of any of claims 1-6 based on instructions stored in the memory.
14. A non-transitory computer readable storage medium, wherein the non-transitory computer readable storage medium stores computer instructions that, when executed by a processor, implement a method of training a machine learning model according to any of claims 1 to 6.
CN202011291240.3A 2020-11-18 2020-11-18 Training method and device for machine learning model and storage medium Active CN112101488B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011291240.3A CN112101488B (en) 2020-11-18 2020-11-18 Training method and device for machine learning model and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011291240.3A CN112101488B (en) 2020-11-18 2020-11-18 Training method and device for machine learning model and storage medium

Publications (2)

Publication Number Publication Date
CN112101488A CN112101488A (en) 2020-12-18
CN112101488B true CN112101488B (en) 2021-06-25

Family

ID=73785255

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011291240.3A Active CN112101488B (en) 2020-11-18 2020-11-18 Training method and device for machine learning model and storage medium

Country Status (1)

Country Link
CN (1) CN112101488B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111729B (en) * 2021-03-23 2023-08-18 广州大学 Training method, recognition method, system, device and medium for personnel recognition model
CN113194320B (en) * 2021-04-30 2022-11-22 北京达佳互联信息技术有限公司 Parameter prediction model training method and device and parameter prediction method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108009526A (en) * 2017-12-25 2018-05-08 西北工业大学 A kind of vehicle identification and detection method based on convolutional neural networks
CN110378909A (en) * 2019-06-24 2019-10-25 南京林业大学 Single wooden dividing method towards laser point cloud based on Faster R-CNN
CN110837818A (en) * 2019-11-18 2020-02-25 汕头大学 Chinese white dolphin dorsal fin identification method based on convolutional neural network
WO2020063715A1 (en) * 2018-09-26 2020-04-02 Huawei Technologies Co., Ltd. Method and system for training binary quantized weight and activation function for deep neural networks
CN111611981A (en) * 2020-06-28 2020-09-01 腾讯科技(深圳)有限公司 Information identification method and device and information identification neural network training method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108009526A (en) * 2017-12-25 2018-05-08 西北工业大学 A kind of vehicle identification and detection method based on convolutional neural networks
WO2020063715A1 (en) * 2018-09-26 2020-04-02 Huawei Technologies Co., Ltd. Method and system for training binary quantized weight and activation function for deep neural networks
CN110378909A (en) * 2019-06-24 2019-10-25 南京林业大学 Single wooden dividing method towards laser point cloud based on Faster R-CNN
CN110837818A (en) * 2019-11-18 2020-02-25 汕头大学 Chinese white dolphin dorsal fin identification method based on convolutional neural network
CN111611981A (en) * 2020-06-28 2020-09-01 腾讯科技(深圳)有限公司 Information identification method and device and information identification neural network training method and device

Also Published As

Publication number Publication date
CN112101488A (en) 2020-12-18

Similar Documents

Publication Publication Date Title
CN110196908A (en) Data classification method, device, computer installation and storage medium
CN112101488B (en) Training method and device for machine learning model and storage medium
CN108304373B (en) Semantic dictionary construction method and device, storage medium and electronic device
CN109684476B (en) Text classification method, text classification device and terminal equipment
CN111985229A (en) Sequence labeling method and device and computer equipment
CN107451596B (en) Network node classification method and device
CN110378438A (en) Training method, device and the relevant device of Image Segmentation Model under label is fault-tolerant
CN111783767B (en) Character recognition method, character recognition device, electronic equipment and storage medium
CN111831826B (en) Training method, classification method and device of cross-domain text classification model
CN111881671A (en) Attribute word extraction method
CN111178537A (en) Feature extraction model training method and device
CN110929119A (en) Data annotation method, device, equipment and computer storage medium
CN115455300A (en) Data pushing method and system based on artificial intelligence and cloud platform
CN114187486A (en) Model training method and related equipment
Zhang et al. Bayesian classification
CN113408282B (en) Method, device, equipment and storage medium for topic model training and topic prediction
CN115204142A (en) Open relationship extraction method, device and storage medium
CN112801006A (en) Training method of expression representation model, and facial expression representation method and device
CN114238622A (en) Key information extraction method and device, storage medium and electronic device
WO2020215581A1 (en) Chinese coding method and apparatus based on bidirectional long short-term memory network model
CN112287215A (en) Intelligent employment recommendation method and device
CN110929033A (en) Long text classification method and device, computer equipment and storage medium
Lu et al. A novel method for Chinese named entity recognition based on character vector
CN105468657B (en) A kind of method and system of the important knowledge point in acquisition field
CN117573956B (en) Metadata management method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant