CN110245723B

CN110245723B - Safe and reliable image classification semi-supervised machine learning method and device

Info

Publication number: CN110245723B
Application number: CN201910565453.1A
Authority: CN
Inventors: 李宇峰; 郭兰哲; 周志华
Original assignee: Nanjing University
Current assignee: Nanjing University
Priority date: 2019-06-27
Filing date: 2019-06-27
Publication date: 2023-06-09
Anticipated expiration: 2039-06-27
Also published as: CN110245723A

Abstract

The invention discloses a safe and reliable image classification semi-supervised machine learning method and device, wherein the method comprises the following steps: acquiring a target image dataset; constructing a small number of verification image datasets; assigning values to unlabeled data in the target data set and training the unlabeled data on the target data set according to a machine learning algorithm to obtain a machine learning model; calculating the predicted performance and safety of the model on the verification data set; updating the assignment strategy of the unlabeled data to ensure that the prediction performance and the safety of the model obtained by training on the verification data set are continuously optimized until convergence. The resulting model is finally trained to determine a machine learning model of the target image dataset. The method is suitable for the data analysis condition of more data and less marks which are common in the image classification task.

Description

Safe and reliable image classification semi-supervised machine learning method and device

Technical Field

The invention relates to a safe and reliable image classification semi-supervised machine learning method and device, and belongs to the technical field of machine learning of image classification.

Background

With the development of big data and high performance computing, artificial intelligence techniques have received a great deal of attention, with machine learning being the core technology of artificial intelligence. By means of calculation, a large amount of data is analyzed to improve the performance of the system. In computer systems, "experience" typically exists in the form of "data". The "model" can be derived from empirical data by a machine learning algorithm. The model gives corresponding predictions when faced with new data.

Image classification is one of the most common tasks of data analysis. Images provide vivid and easily understood information, and are an important source for people to forward and exchange information. The image classification is to divide images of different categories according to semantic information of the images, is an important basic problem in data analysis, and is also the basis of important tasks such as image detection, image segmentation, behavior analysis and the like. Image classification has wide application in many fields, including face recognition in security and protection fields, traffic scene recognition in traffic fields, content-based image retrieval in the internet field, image picture recognition in medical fields, and the like.

Currently, the existing image classification machine learning technology mainly comprises three types of supervised machine learning, unsupervised machine learning and semi-supervised machine learning. Wherein supervised machine learning requires model training with labeled data, which can be understood as data of known inputs and outputs. The non-supervision machine learning is used for finding hidden data relations in unlabeled sample data to train a model, and the semi-supervision machine learning is used for training the model according to a large amount of unlabeled data in the sample data and with the assistance of a small amount of labeled data. Because in many cases, the sample data has a difficult way of obtaining marked data, the number of marked data is far less than that of unmarked data. For example, a large number of medical images can be acquired, but it is difficult for a doctor to label lesions of a large number of images, so that the medical images are classified with a lot of data, but with few labels. In this case, training of a model for subsequent classification and recognition by semi-supervised machine learning based on the characteristics of the semi-supervised machine learning becomes a major approach in the field.

However, unlabeled data is not helpful in any case, as algorithms often require assumptions about the structure of the data labels. For example, a low density hypothesis considers that the true classification boundary should pass through a region of low density; manifold hypotheses consider adjacent data to have similar labels, etc. If in practical application the true distribution of the data markers and the assumptions of the model are not the same, unlabeled data can have a negative effect on model training, and semi-supervised machine learning can be unsafe and unreliable.

Disclosure of Invention

The invention aims to: aiming at the problems and the shortcomings in the prior art, the invention provides a semi-supervised machine learning method for realizing safe and reliable image classification, and the main purpose is to realize the function of safe and reliable semi-supervised image classification, thereby solving the problem that the existing semi-supervised learning can not be safe and reliable in solving the task of image classification.

The technical scheme is as follows: a safe and reliable image classification semi-supervised machine learning method specifically comprises the following steps:

1) Acquiring a target image dataset, wherein part of the image data in the target image dataset has marks;

2) Constructing a verification image data set, wherein the prediction performance and the safety on the verification image data set are used as indexes for evaluating the marking quality of the target image data set;

3) And performing label assignment on unlabeled image data in the target image data set according to a machine learning algorithm, performing model training on the target image data set, and performing prediction on the verification image data set to obtain a prediction result on the verification image data set.

4) Updating the unlabeled data assignment strategy according to the predicted performance and the safety on the verification image data set, and retraining the model until the performance reaches the optimal value.

5) The final model is acted upon a machine learning model on the target image dataset.

Optionally, the method for acquiring the verification image dataset includes, but is not limited to:

1) The domain expert gives a small amount of marked image data for verification

2) Constructing verification image data set by self-help sampling method in probability statistics

The self-service sampling method is a uniform sampling with a replacement, that is to say, each time a sample is selected, it is replaced in the sampling pool, possibly again.

The performance of the verification image dataset comprises 2 parts. Predicted performance: refers to the gap between the predicted mark value and the true mark value on the verification image data. Safety: refers to the gap between the predictive performance of the model and the predictive performance of the underlying model (i.e., supervised learning practices without unlabeled data) on the validated image data.

Optionally, the assignment policy of the unlabeled image data is a matrix Q, where Q _ij The probability that the training sample image i has a label j is represented, and the final label of the image is the label j corresponding to the maximum value of the probability.

Optionally, the adjusting the assignment policy of the unlabeled image record data according to the prediction performance and the security on the verification image data set is a double-layer optimization problem. The inner layer is optimized to train a machine learning model according to training image data and a label assignment strategy and a corresponding machine learning algorithm, the outer layer is optimized to minimize the prediction performance of the model on verification image data and maximize the safety of the model, and the unmarked data assignment strategy which enables the best prediction performance and the highest safety on the verification image data can be obtained by solving the optimization problem.

On the other hand, the invention provides a semi-supervised machine learning device for realizing safe and reliable image classification, which specifically comprises the following components:

1) An acquisition unit configured to acquire a target image dataset, wherein part of data in the target image dataset has a marker;

2) A first determining unit for constructing a verification image dataset;

3) And the assignment unit is used for carrying out label assignment on the unlabeled images in the target image data set.

4) A training unit for training the machine learning model on the target image dataset.

5) And the second determining unit is used for calculating the prediction performance and the security of the model on the verification image data and optimizing the mark assignment of the unlabeled data by adopting a corresponding method.

Optionally, the first determining unit includes:

1) An acquisition module for acquiring labeled image data given for verification by a domain expert as a verification image dataset;

2) The construction module is used for constructing a plurality of verification image data sets based on a self-help sampling method.

Optionally, the assigning unit includes constructing a matrix Q, where Q _ij The probability that the training sample image i has a label j is represented, and the label of the final image is the label j corresponding to the maximum value of the probability.

Optionally, the second determining unit includes:

1) The evaluation module is used for calculating the performance and the safety of the model obtained by training on the verification image data;

2) The determining module is used for determining updating of the assignment strategy matrix Q, which is a double-layer optimization problem, and when the inner layer optimization is a convex optimization problem, replacing the inner layer optimization by adopting a corresponding KKT condition, and converting the inner layer optimization into a single-layer optimization problem for solving; when the inner layer optimization is a non-convex optimization problem, a multi-step gradient descent method is adopted to solve the inner layer optimization, the multi-step gradient descent process is regarded as a plurality of constraint conditions, and a Lagrange multiplier method is adopted to solve.

By means of the technical scheme, the safe and reliable image classification semi-supervised machine learning method and device can acquire the target image data set, construct the verification data set which is distributed with the target data set, label and assign the unlabeled image, train the unlabeled image by adopting a machine learning algorithm to obtain a model, use the obtained model on the image data verification set to obtain a prediction result on the image data verification set, calculate the prediction performance and safety of the model on the verification set, and continuously update label and assign values of the unlabeled image to perform performance tuning until convergence.

Drawings

FIG. 1 is a flow chart of a method for implementing safe and reliable semi-supervised machine learning for image classification according to an embodiment of the present invention;

FIG. 2 is a process diagram of updating an unlabeled image label assignment strategy in accordance with an embodiment of the present invention;

FIG. 3 is a process diagram of another exemplary updating an unlabeled image label assignment strategy in accordance with an embodiment of the present invention;

fig. 4 is a block diagram of a semi-supervised machine learning apparatus for image classification with safety and reliability according to an embodiment of the present invention.

Detailed Description

The present invention is further illustrated below in conjunction with specific embodiments, it being understood that these embodiments are meant to be illustrative of the invention only and not limiting the scope of the invention, and that modifications of the invention, which are equivalent to those skilled in the art to which the invention pertains, will fall within the scope of the invention as defined in the claims appended hereto.

With the advent of massive data, artificial intelligence technology has evolved rapidly, and in order to mine bid values from massive data, a great deal of accurate data annotation information is required. However, in practical applications, the sample data has a difficulty in acquiring marked data, so that the number of marked data is far less than that of unmarked data. For example, hospitals can collect a large number of medical images, but it is impractical for doctors to label lesions of these images, so there is much data in medical image classification, but there is little labeling. Analysis of such data, semi-supervised learning techniques are one of the most effective methods.

However, existing semi-supervised machine learning models often require assumptions about the structure of the data markers. If in practical applications the actual distribution of the data markers and the assumptions of the model are not the same, unsafe and reliable situations can occur with unlabeled data, resulting in reduced or very limited performance improvement.

Therefore, the invention hopes to solve the problems by technical means, ensure the safety of unmarked data utilization and promote the performance of semi-supervised learning. In the process, a plurality of technical problems such as insufficient marked samples and missing training data are involved, and the problem of operation efficiency of massive data is also required to be solved.

The embodiment of the invention provides a safe and reliable image classification semi-supervised machine learning method. The specific steps of the method are shown in fig. 1, and mainly comprise:

101. a target image dataset and a verification image dataset are acquired.

Wherein a portion of the sample data in the target data image set has a marker. The target data set may be any image data set such as a natural scene picture data set or a medical image data set. The plurality of sample data included in the data set includes marked data and unmarked data, the marked data may be understood as data of a known classification result, and the unmarked data may be understood as data of an unknown classification result.

The process of acquiring the target data set can be performed according to any existing acquisition mode, for example, an interface dedicated to inputting the target data set can be provided, and the target data set can be acquired through the interface.

102. A verification image dataset is acquired.

Here, the verification image data set refers to data distributed identically to the target image data set. Although in practical application, the cost of labeling a large number of images one by one is relatively high, manual labeling by selecting a small number of verification images can be implemented, so that a small number of verification image data can be obtained. On the basis, in order to reduce the paranoid of the verification image, M verification data sets can be constructed by adopting a self-help sampling method in probability statistics

Further improving reliability.

103. And carrying out mark assignment on the unlabeled images in the target image data set.

Assigning a matrix Q by a label, wherein Q _ij The probability that the training sample i has a label j is represented, and the final label is the label j corresponding to the maximum value of the probability.

104. Training on the target image dataset by adopting a machine learning algorithm to obtain a model.

The unlabeled image can be trained to obtain a model according to a corresponding machine learning algorithm after being assigned, and the model is as follows:

wherein θ represents a model obtained by training, θ represents a value space of θ, β represents any value in θ, n represents the number of training data, k represents the total number of classes to which the image belongs, i represents an index of training samples, j represents a value of enumeration class, and Q _ij Representing the probability that training sample i has a label j, l represents a loss function characterizing performance, l (x _i J, β) represents the loss function value when the i-th image marker in the training set is j and the model parameter is β.

105. The computational model verifies the predictive performance and security of the data set.

Validating the data as

wherein x_j Represented as the j-th image, y in the verification data _j Representing image x _j Class labels, n _v Representing the total number of images in the verification data, the loss function value of the model on the verification data is as follows:

representing the error value between the predicted value of the model and the actual signature.

Wherein the model verifies the security of the data set as follows:

wherein ,θ₀ Representing a floor model (supervised learning model without unlabeled data), M represents the total number of validation sets,

representing the ith verification set. The model represents a semi-supervised learning model theta and a ground line model theta ₀ Gap in predicted performance. The larger the gap, the safer the model obtained by semi-supervised learning.

106. And continuously optimizing the marking assignment strategy of the unlabeled image in the target data set, and repeating the steps until convergence.

The adjustment of the label assignment strategy for unlabeled images in the target dataset based on performance on the validation set is a two-layer optimization problem, as follows:

/>

the inner layer is optimized to train to obtain a model theta according to training data and a mark assignment strategy, the meaning of corresponding parameters is the same as that of the formula (1), and the outer layer is optimized to minimize the loss of the model on verification data and maximize the safety of the model by adjusting a mark assignment matrix Q. For ease of description, we will represent the loss on the validation data as L (θ) and the loss on the training data as E (θ, Q).

When the inner layer optimization is a convex optimization problem, the convex optimization problem can be converted into a corresponding KKT condition, so that the double-layer optimization problem is converted into a single-layer optimization problem, wherein the KKT condition is a condition when the inner layer optimization obtains an optimal solution, and the conditions are as follows:

solving the formula to obtain a function theta (Q) of theta and Q, and carrying theta (Q) into the outer layer optimization to obtain the gradient of the final objective function and Q

Furthermore, the assignment strategy Q of the unlabeled data can be updated according to the gradient optimization algorithm, so that the prediction performance and the safety reliability of the model are improved. Fig. 2 shows the optimization algorithm flow for this problem: setting optimized number of rounds T and initial value Q of label assignment strategy ₀ In the k-th round of optimization, the derivative of the loss function L (θ) on the validation set, Q, is first calculated>

Then updating Q by adopting a gradient optimization algorithm, < ->

η is the step size of the gradient optimization algorithm.

When the inner layer optimization is a non-convex optimization problem, a gradient optimization method is adopted to solve the target-type inner layer optimization problem, and the target-type inner layer optimization problem is expressed as a T-step dynamic process, and the method is specifically as follows:

wherein θ_t Representing the model at step t, θ _t-1 Represents the model at step t-1, eta represents the step size of the gradient optimization algorithm, E (theta, Q) represents the loss on training data when the model assigns a strategy Q to the theta mark,

representing the derivative of E (θ, Q) with θ.

Representing the dynamic process as θ _t ＝Φ(θ _t-1,Q), wherein

The target can be written as:

min _Q L(θ _T )(7)

s.t.θ _t ＝Φ(θ _t-1 ,Q),t＝1,…T

the problem is an optimization problem with constraint, and the gradient of the objective function to Q can be obtained by adopting a Lagrange multiplier method, so that the solution is carried out by adopting a gradient optimization method.

The Lagrange multiplier method can obtain a Lagrange dual formula:

wherein λ_t As a lagrangian multiplier, it can be finally obtained by biasing it, and the derivative of the variable Q of equation (8) is:

wherein ,

so that a gradient-dependent optimization method can be used to solve Q. Fig. 3 shows the optimization algorithm flow for this problem: first initialize a tag assignment policy Q ₀ The number of inner optimized rounds T and the number of outer optimized rounds outT are determined. On the outer layer optimized kth round, θ is first calculated from t=1, …, T _t ＝Φ(θ _t-1 Q) and then calculate the derivative of the validation set loss with θ:

the derivative g of the validation set loss to the tag assignment policy Q is then initialized _Q =0, then g is calculated from t=t, …,1 _Q ＝g _Q +α _t+1 B _t+1 ,α _t ＝α _t+1 A _t+1 Calculate g _Q Thereafter, Q is updated using a gradient algorithm _k ＝Q _k-1 -ηg _Q η is the step size of the gradient optimization algorithm.

107. The final model is taken as a machine learning model on the target image dataset.

After the label assignment strategy is determined, training of the machine learning model can be performed according to label assignment of the unlabeled data, so that a final machine learning model is obtained.

As an implementation of the above-mentioned method for implementing safe and reliable image classification semi-supervised machine learning, the embodiment of the invention provides a device for implementing safe and reliable image classification semi-supervised machine learning, which is mainly used for implementing a function of implementing safe semi-supervised machine learning on a target data set, and solves the problem of unsafe and unreliable situations in the semi-supervised machine learning process. For convenience of reading, the details of the foregoing method embodiment are not repeated one by one, but it should be clear that the apparatus in this embodiment can correspondingly implement all the details of the foregoing method embodiment. The device is shown in fig. 4, and specifically comprises:

an acquisition unit 31 for acquiring a target image dataset, part of sample data in the target dataset having markers;

a first determining unit 32 for constructing image verification data;

and an assignment unit 33, configured to perform label assignment on the unlabeled image in the target data set.

The training unit 34 trains the target data set according to a machine learning algorithm to obtain a machine learning model.

The second determining unit 35 calculates the predicted performance and security of the model on the verification data, and adjusts the label assignment policy of the unlabeled image in the target dataset according to the result.

The first determination unit 32 includes:

an acquisition module 321 for acquiring verification image data to a domain expert.

A construction module 322 is configured to construct a plurality of verification data sets according to a self-sampling method.

The second determination unit 35 includes:

the evaluation module 351 computes the predicted performance and security of the model on the verification data.

And the updating module 352 optimizes the assignment strategy of the unlabeled images in the target data set by adopting an optimization method according to the calculation result.

In summary, the method and the device for realizing safe and reliable image classification semi-supervised machine learning provided by the embodiments of the present invention can acquire a target image dataset, construct verification image data, perform label assignment on unlabeled images in the target dataset, apply a machine learning algorithm to the target dataset to obtain a machine learning model, and then continuously optimize label assignment strategies of unlabeled data according to performance of the machine learning model on the verification data. The method can realize the function of semi-supervised machine learning of safe and reliable image classification. Compared with the condition that the performance is reduced or the performance improvement is limited in the existing method, the method provided by the invention explicitly optimizes the prediction performance and the safety of the model according to the verification set, and effectively avoids the problem that semi-supervised machine learning is unsafe and unreliable.

It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.

Claims

1. A safe and reliable image classification semi-supervised machine learning method is characterized by comprising the following steps:

1) Acquiring a target image dataset, wherein part of image data in the target image dataset is provided with marks;

3) Performing label assignment on unlabeled images of the target data set;

4) Model training is carried out on the target image data set and prediction is carried out on the verification image data set according to the machine learning algorithm and an unlabeled data assignment strategy corresponding to the machine learning algorithm, so that a prediction result on the verification image data set is obtained;

5) Updating the unlabeled data assignment strategy according to the predicted performance and the safety on the verification image data set, and retraining the model until the performance reaches the optimum;

and after the unlabeled image is assigned, training according to a corresponding machine learning algorithm to obtain a model, wherein the model is as follows:

wherein θ represents a model obtained by training, θ represents a value space of θ, β represents any value in θ, n represents the number of training data, k represents the total number of classes to which the image belongs, i represents an index of training samples, j represents a value of enumeration class, and Q _ij Representing the probability that training sample i has a label j, l represents a loss function characterizing performance, l (x _i J, beta) represents the loss function when the ith image mark in the training set is j and the model parameter is betaNumerical value;

validating the data as

representing an error value between the predicted value of the model and the actual signature;

wherein the model verifies the security of the data set as follows:

wherein ,θ₀ Representing a floor model, i.e., a supervised learning model that does not utilize unlabeled data, M represents the total number of validation sets,

representing an ith verification set; the semi-supervised learning model theta and the ground line model theta are represented by the formula (3) ₀ A gap in predicted performance; the larger the gap is, the safer the model obtained by semi-supervised learning is;

the method comprises the steps that an inner layer is optimized to train to obtain a model theta according to training data and a mark assignment strategy, the meaning of corresponding parameters is the same as that of a formula (1), an outer layer is optimized to minimize loss of the model on verification data and maximize safety of the model by adjusting a mark assignment matrix Q, the loss on the verification data is expressed as L (theta), and the loss on the training data is expressed as E (theta, Q);

when the inner layer optimization is a convex optimization problem, the convex optimization problem is converted into a corresponding KKT condition, so that the double-layer optimization problem is converted into a single-layer optimization problem, wherein the KKT condition is a condition when the inner layer optimization obtains an optimal solution, and the conditions are as follows:

/>

solving the equation to obtain a function theta (Q) of theta and Q, and carrying the theta (Q) into the outer layer to optimize to obtain the gradient of the final objective function on Q

Furthermore, the assignment strategy Q of the unlabeled data is updated according to the gradient optimization algorithm, so that the prediction performance and the safety reliability of the model are improved; optimizing the algorithm flow: setting optimized number of rounds T and initial value Q of label assignment strategy ₀ In the k-th round of optimization, the derivative of the loss function L (θ) on the validation set, Q, is first calculated>

Then updating Q by adopting a gradient optimization algorithm, < ->

η is the step length of the gradient optimization algorithm;

when the inner layer optimization is a non-convex optimization problem, a gradient optimization method is adopted to solve the inner layer optimization problem, and the inner layer optimization problem is expressed as a T-step dynamic process, and the method is specifically as follows:

representing the derivative of E (θ, Q) with respect to θ;

representing the dynamic process as θ _t ＝Φ(θ _t-1,Q), wherein

The target write is:

min _Q L(θ _T )(7)

obtaining the gradient of the objective function to Q by adopting a Lagrangian multiplier method, and solving by adopting a gradient optimization method;

wherein, the Lagrange dual equation is obtained by Lagrange multiplier method:

wherein λ_t The derivative of equation (8) with respect to variable Q, being the lagrange multiplier, is:

wherein ,

therefore, a gradient-related optimization method is adopted to solve Q; optimizing the algorithm flow: first initialize a tag assignment policy Q ₀ Determining the number of inner optimized rounds T and outer optimizedIs the number of rounds of outT; on the outer layer optimized kth round, θ is first calculated from t=1, …, T _t ＝Φ(θ _t-1 Q) and then calculate the derivative of the validation set loss with θ: />

The derivative g of the validation set loss to the tag assignment policy Q is then initialized _Q =0, then g is calculated from t=t, …,1 _Q ＝g _Q +α _t+1 B _t+1 ，α _t ＝α _t+1 A _t+1 Calculate g _Q Thereafter, Q is updated using a gradient algorithm _k ＝Q _k-1 -ηg _Q η is the step length of the gradient optimization algorithm;

6) The final model is acted upon a machine learning model on the target image dataset.

2. A secure and reliable image classification semi-supervised machine learning method as recited in claim 1, wherein constructing a validation image dataset includes:

1) Marked image data for verification given by a domain expert;

2) Constructing a verification image data set by a self-help sampling method in probability statistics;

the self-help sampling method refers to a uniform sampling method with replacement.

3. A secure and reliable image classification semi-supervised machine learning method as recited in claim 1, wherein the performance of the validation image dataset includes 2 parts; predicted performance: refers to the gap between the predicted mark value and the true mark value on the verification image data; safety: refers to the gap between the predicted performance of the model and the predicted performance of the underlying model in validating the image data.

4. The method of claim 1, wherein the adjusting of assignment strategies for unlabeled image registration data based on predictive performance and security on the validated image dataset is a two-layer optimization model; the inner layer is optimized to train a machine learning model according to training image data and a label assignment strategy and a corresponding machine learning algorithm, and the outer layer is optimized to minimize the predictive performance of the model on verification image data and maximize the safety of the model.

5. The safe and reliable image classification semi-supervised machine learning method of claim 4, wherein when the inner layer is optimized as a convex optimization problem, the corresponding KKT condition is adopted to replace the inner layer optimization problem, and the inner layer optimization problem is converted into a single-layer optimization problem for solving; when the inner layer optimization is a non-convex problem, a multi-step gradient descent method is adopted to solve the inner layer optimization problem, the multi-step gradient descent process is regarded as a plurality of constraint conditions, and a Lagrange multiplier method is adopted to solve the inner layer optimization problem.

6. A safe and reliable image classification semi-supervised machine learning apparatus, the apparatus comprising:

1) An acquisition unit configured to acquire a target image dataset, part of data in the target image dataset having a marker;

2) A first determining unit for constructing a verification image dataset;

3) The assignment unit is used for carrying out label assignment on the unlabeled images in the target image data set;

4) A training unit for training a machine learning model on the target image dataset;

5) The second determining unit is used for calculating the prediction performance and the safety of the model on the verification image data and optimizing the mark assignment of the unlabeled data by adopting a corresponding method;

wherein θ represents the training resultModel, wherein Θ is the value space of θ, β is any value in Θ, n is the number of training data, k is the total number of classes to which the image belongs, i is the index of the training sample, j is the value of enumeration class, Q _ij Representing the probability that training sample i has a label j, l represents a loss function characterizing performance, l (x _i J, beta) represents the loss function value when the i-th image mark in the training set is j and the model parameter is beta;

validating the data as

wherein the model verifies the security of the data set as follows:

/>

Then updating Q by adopting a gradient optimization algorithm, < ->

η is the step length of the gradient optimization algorithm;

representing the derivative of E (θ, Q) with respect to θ;

representing the dynamic process as θ _t ＝Φ(θ _t-1,Q), wherein

The target write is:

min _Q L(θ _T ) (7)

s.t.θ _t ＝Φ(θ _t-1 ，Q)，t＝1，…T

wherein, the Lagrange dual equation is obtained by Lagrange multiplier method:

wherein ,

therefore, a gradient-related optimization method is adopted to solve Q; optimizing the algorithm flow: first initialize a tag assignment policy Q ₀ Determining the number T of the inner optimized wheels and the number outT of the outer optimized wheels; on the outer layer optimized kth round, θ is first calculated from t=1, …, T _t ＝Φ(θ _t-1 Q) and then calculate the derivative of the validation set loss with θ: />

The derivative g of the validation set loss to the tag assignment policy Q is then initialized _Q =0, then g is calculated from t=t, …,1 _Q ＝g _Q +α _t+1 B _t+1 ，α _t ＝α _t+1 A _t+1 Calculate g _Q Thereafter, Q is updated using a gradient algorithm _k ＝Q _k-1 -ηg _Q η is the step size of the gradient optimization algorithm.

7. The safe and reliable image classification semi-supervised machine learning apparatus of claim 6, wherein the first determination unit includes:

1) An acquisition module for acquiring labeled image data given for verification by a domain expert as verification image data;

8. The safe and reliable image classification semi-supervised machine learning apparatus of claim 7, wherein the second determination unit includes: