CN114187452A - Robust depth image classification model training method based on active labeling - Google Patents
- Publication number
- CN114187452A (application number CN202210135383.8A)
- Authority
- CN
- China
- Prior art keywords
- image
- model
- labeled
- training
- classification model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
Abstract
The invention discloses a robust depth image classification model training method based on active labeling. The method comprises the following steps: step 1, collecting a large unlabeled image set U and a small labeled training image data set L; step 2, adding a noise perturbation to each image in the labeled set L to obtain a noisy labeled image set L̃; step 3, taking the noisy labeled image set L̃ as the training set and initializing an image classification model f; step 4, perturbing each image in the unlabeled set U multiple times and computing a value score S for each unlabeled image; step 5, ranking the scores S and querying the user for labels in descending order of score to obtain the corresponding user feedback; step 6, updating the labeled image set L and the unlabeled image set U, and updating the prediction model; step 7, returning to step 4, or terminating and outputting the prediction model f. By automatically selecting high-utility images for annotation through active learning, the invention improves model robustness while minimizing the user's labeling cost.
Description
Technical Field
The invention belongs to the technical field of automatic digital image labeling, and in particular relates to a robust depth image classification model training method based on active labeling.
Background
At present, deep models achieve high accuracy in image classification; in real application scenarios, however, they are often disturbed by noise, which causes severe performance degradation. For example, in autonomous driving tasks, image and video recognition models are routinely disturbed by fog, frost, snow, sandstorms, and similar weather, and then struggle to recognize road signs accurately. Improving model robustness has therefore become an important task in machine learning. Recent research shows that training on images with added noise perturbations effectively improves the robustness of deep models. However, this training process typically requires a large number of labeled images, and in many practical applications accurately labeling each image is costly and difficult, especially in areas that demand expert knowledge. Active learning is the main approach to reducing annotation cost: by actively selecting the most valuable images for annotation, it minimizes query cost while improving model performance. Traditional active labeling methods, however, only consider an image's potential utility for improving model accuracy (for example, measuring the classification model's uncertainty on an unlabeled image as a utility estimate), and thus find it difficult to directly improve model robustness. How to design an effective active annotation strategy that improves model robustness is therefore an urgent problem of significant practical importance.
Disclosure of Invention
The purpose of the invention is as follows: in order to solve the problems that labeled target-domain data are difficult to obtain in real tasks and that model robustness is difficult to improve, the invention provides a robust depth image classification model training method based on active labeling.
The technical scheme is as follows: in order to achieve the above purpose, the invention adopts the following technical scheme:
a robust depth image classification model training method based on active labeling comprises the following steps:
step 1, collecting a large unlabeled image set U and a small labeled training image data set L;
step 2, adding a noise perturbation to each image in the labeled set L to obtain a noisy labeled image set L̃;
step 3, taking the noisy labeled image set L̃ as the training set and initializing an image classification model f;
step 4, perturbing each image in the unlabeled set U multiple times, and computing a value score S_i for each unlabeled image based on the predictions of the model f on the clean image and on its perturbed versions;
step 5, sorting the scores S obtained in step 4 and, within the labeling budget, querying the user for the label information of the images in descending order of score to obtain the corresponding user feedback;
step 6, updating the labeled image set L and the unlabeled image set U according to the user feedback on image categories obtained in step 5, obtaining a noisy labeled set L̃ according to the method of step 2, and updating the prediction model f;
step 7, returning to step 4, or terminating and outputting the prediction model f.
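The seven steps above can be sketched end to end in a minimal, self-contained Python illustration. A toy nearest-centroid classifier stands in for the deep model f, and synthetic 2-D points stand in for images; all names and hyper-parameter values here (perturb, value_scores, NearestCentroid, sigma, batch size) are illustrative assumptions, not part of the invention.

```python
import numpy as np

rng = np.random.default_rng(0)

def perturb(x, sigma, rng):
    """Step 2: add Gaussian noise drawn from N(0, sigma^2)."""
    return x + rng.normal(0.0, sigma, size=x.shape)

class NearestCentroid:
    """Toy stand-in for the deep classification model f."""
    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.stack([X[y == c].mean(axis=0) for c in self.classes_])
        return self
    def predict(self, X):
        d = ((X[:, None, :] - self.centroids_[None]) ** 2).sum(-1)
        return self.classes_[d.argmin(axis=1)]

def value_scores(model, U, m, sigma, rng):
    """Step 4: fraction of m perturbed copies whose predicted class
    differs from the prediction on the clean input."""
    clean = model.predict(U)
    flips = np.zeros(len(U))
    for _ in range(m):
        flips += model.predict(perturb(U, sigma, rng)) != clean
    return flips / m

# Synthetic "images": 2-D points from two classes (step 1).
X = np.concatenate([rng.normal(-2, 1, (100, 2)), rng.normal(2, 1, (100, 2))])
y = np.concatenate([np.zeros(100, int), np.ones(100, int)])
labeled = list(range(0, 200, 50))            # small initial labeled set L
unlabeled = [i for i in range(200) if i not in labeled]

budget, batch, sigma, m = 20, 5, 0.5, 10
model = NearestCentroid().fit(perturb(X[labeled], sigma, rng), y[labeled])  # step 3
while budget > 0:
    scores = value_scores(model, X[unlabeled], m, sigma, rng)    # step 4
    order = np.argsort(-scores)[:batch]                          # step 5: top scores
    queried = [unlabeled[i] for i in order]
    labeled += queried                                           # step 6: oracle labels
    unlabeled = [i for i in unlabeled if i not in queried]
    model.fit(perturb(X[labeled], sigma, rng), y[labeled])       # retrain on noisy L
    budget -= batch                                              # step 7: loop
```

Each round queries the batch of unlabeled points whose predictions flip most often under noise, which is the selection criterion of steps 4 and 5.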
Further, step 2 obtains the noisy labeled image set L̃ as follows:
For each image x in L, a perturbation value ε randomly drawn from a Gaussian distribution N(0, σ²) is added to obtain the corresponding noisy image x̃. The expression is specifically: x̃ = x + ε.
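As a concrete sketch of this perturbation step (the tensor shape and the value of σ below are illustrative assumptions, not values fixed by the invention):

```python
import numpy as np

rng = np.random.default_rng(42)
sigma = 0.1                                   # assumed perturbation scale

x = rng.random((3, 32, 32))                   # a toy image tensor in [0, 1)
eps = rng.normal(0.0, sigma, size=x.shape)    # eps drawn from N(0, sigma^2)
x_noisy = x + eps                             # the noisy image: x~ = x + eps
```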
Further, the specific method for initializing the image classification model f in step 3 is as follows:
The prediction model f is used to predict the category of each image in the noisy labeled set L̃, where θ denotes the parameters of the prediction model. Let p(x̃_i) denote the output of model f on the i-th image x̃_i, where p_j(x̃_i) is the probability that image x̃_i is predicted to belong to the j-th class and C is the total number of image categories. Let y_i denote the true label of the i-th image x̃_i, in one-hot encoded form. The loss of the model on each noisy image is calculated as the cross-entropy:
ℓ(x̃_i, y_i; θ) = −Σ_{j=1}^{C} y_{ij} log p_j(x̃_i)
The model is optimized by minimizing the total loss on the noisy labeled image set L̃; the specific formula is:
θ* = argmin_θ Σ_{(x̃_i, y_i)∈L̃} ℓ(x̃_i, y_i; θ)
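The cross-entropy objective and its minimization can be illustrated with a softmax (multinomial logistic) model trained by gradient descent on synthetic features; this is a hedged sketch of the objective above with a linear model standing in for the deep architecture, and the class means and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)       # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(P, Y):
    """Mean over the batch of -sum_j y_ij * log p_j(x_i)."""
    return float(-(Y * np.log(P + 1e-12)).sum(axis=1).mean())

# A toy noisy labeled set: 2-D features around three class means,
# with the Gaussian noise of step 2 already folded into the features.
n, d, C = 150, 2, 3
means = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 3.0]])
labels = rng.integers(0, C, size=n)
X = means[labels] + rng.normal(0.0, 0.5, size=(n, d))
Y = np.eye(C)[labels]                          # one-hot true labels y_i

Xb = np.hstack([X, np.ones((n, 1))])           # append a constant bias feature
W = np.zeros((d + 1, C))                       # model parameters theta
lr = 0.1
loss_before = cross_entropy(softmax(Xb @ W), Y)
for _ in range(500):                           # minimize the loss by gradient descent
    P = softmax(Xb @ W)
    W -= lr * Xb.T @ (P - Y) / n               # gradient of the mean cross-entropy
loss_after = cross_entropy(softmax(Xb @ W), Y)
```

The update `Xb.T @ (P - Y) / n` is the exact gradient of the mean cross-entropy for a softmax model, so the loss decreases steadily toward the minimizer θ*.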
Further, step 4 calculates the value score S_i of each unlabeled image as follows:
For each unlabeled image x_i, m perturbations are added to obtain the corresponding perturbed image set {x_i^(1), …, x_i^(m)}, where x_i^(k) = x_i + ε_k and ε_k is a perturbation value randomly drawn from a Gaussian distribution N(0, σ²); the number of perturbations m is a hyper-parameter.
The probability that the model's prediction on a perturbed image is inconsistent with its prediction on the clean image x_i is calculated as:
d_i = (1/m) Σ_{k=1}^{m} 1[f(x_i^(k)) ≠ f(x_i)]
where 1[·] is the indicator function: its output is 1 when the input is true and 0 when the input is false.
The value score of each image x_i in the set U that has received no user feedback, with respect to the classification model f, is then taken as this inconsistency probability: S_i = d_i.
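A minimal sketch of this score, using a deliberately simple stand-in classifier; the threshold rule in `predict` and the two synthetic inputs are hypothetical, chosen only to make prediction flips easy to provoke.

```python
import numpy as np

rng = np.random.default_rng(1)

def predict(x):
    """Hypothetical stand-in for the classifier f: class 1 if the mean
    pixel value is positive, class 0 otherwise."""
    return int(x.mean() > 0)

def value_score(x, m=20, sigma=0.3, rng=rng):
    """S_i = (1/m) * sum_k 1[f(x_i^(k)) != f(x_i)]: the fraction of the m
    Gaussian-perturbed copies whose predicted class flips."""
    clean = predict(x)
    flips = sum(predict(x + rng.normal(0.0, sigma, size=x.shape)) != clean
                for _ in range(m))
    return flips / m

borderline = np.full((8, 8), 0.01)   # mean barely above the decision threshold
confident = np.full((8, 8), 5.0)     # mean far from the threshold
s_border = value_score(borderline)
s_conf = value_score(confident)
```

An input near the decision boundary flips under many of the m perturbations and earns a high score S_i, while an input far from the boundary earns a score near zero; ranking by S_i therefore surfaces exactly the unstable images that step 5 queries first.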
Further, step 6 updates the labeled image set L and the unlabeled image set U according to the user feedback as follows:
The user provides category label information for each queried image, and the image is moved from the unlabeled data set U to the labeled image data set L.
Advantageous effects: the invention provides a robust depth image classification model training method based on active labeling, which applies active learning to the training of robust deep models and, by actively selecting the most valuable images, effectively improves the robustness of a depth image classification model at minimal labeling cost. Specifically, each round selects for query the batch of images most helpful for improving model robustness, so that the user can provide their category information. In general, the predictions of a robust model are stable: when a small perturbation is added to an input image, the model's output should remain consistent. Under the same degree of perturbation, however, the prediction stability of the model differs across images; for some images the prediction becomes highly unstable once noise is encountered, and adding such images to the labeled set for training effectively improves model robustness. When selecting images, the invention therefore proposes an inconsistency-based active labeling method: a series of perturbed versions of each unlabeled image is generated, the disagreement among the model's predictions on these versions measures the image's potential utility for improving robustness, and the images with the largest inconsistency values are selected for training the deep model. During training, the method adds noise perturbations to the training images, so that the model's robustness to noise gradually improves as it fits the noisy images.
Drawings
FIG. 1 is a flow chart of the mechanism of the present invention;
FIG. 2 is a flow chart of calculating an example score;
FIG. 3 is a flow diagram of updating an annotation model.
Detailed Description
The present invention will be further described with reference to the accompanying drawings.
Examples
Fig. 1 shows the overall flow chart of the invention. Assume that initially there is a data set L consisting of a small number of labeled images and a data set U consisting of a large number of unlabeled images. First, a noise perturbation is added to each image in the labeled set L to construct a noisy labeled image set L̃, and a base prediction model f is trained on L̃. The model then predicts the images in the unlabeled data set U, yielding a prediction result for each unlabeled image, and a utility score is computed for each image from the model output. The images are sorted by utility score, and their label information is queried from the user in order from high to low. Next, the user provides label information for the queried images, which are added to the training set L; likewise, a noise perturbation is added to each image in the updated labeled set L to obtain the noisy labeled image set L̃. Finally, the model is updated using the noisy labeled image set L̃. The query process loops until the labeling overhead reaches the budget.
Fig. 2 shows the flow of calculating an example's utility score. First, each unlabeled image x_i is perturbed m times to obtain the corresponding perturbed image set {x_i^(1), …, x_i^(m)}, where x_i^(k) = x_i + ε_k and ε_k is a perturbation value randomly drawn from a Gaussian distribution N(0, σ²). Then, the probability that the model's prediction on a perturbed image is inconsistent with its prediction on the clean image x_i is calculated as:
d_i = (1/m) Σ_{k=1}^{m} 1[f(x_i^(k)) ≠ f(x_i)]
Finally, the value score of each image x_i in the unqueried set U with respect to the annotation model f is computed as S_i = d_i.
Fig. 3 shows the flow of updating the annotation model. In each training round, the user-labeled images are added to the training set L. Then, for each image x in L, a perturbation value ε randomly drawn from a Gaussian distribution N(0, σ²) is added; specifically, x̃ = x + ε. This yields the noisy labeled image set L̃ = {(x̃_i, y_i)}_{i=1}^{n}, where n is the number of labeled images. Subsequently, the prediction model f is used to predict the category of each image in L̃; p_j(x̃_i) denotes the probability in the output of model f that the i-th image x̃_i is predicted to belong to the j-th class, and C denotes the total number of image categories. y_i denotes the true label of the i-th image x̃_i, in one-hot form. The loss of the model on each noisy image is calculated as the cross-entropy:
ℓ(x̃_i, y_i; θ) = −Σ_{j=1}^{C} y_{ij} log p_j(x̃_i)
The model is then trained by minimizing the total loss on the noisy labeled image set L̃:
θ* = argmin_θ Σ_{(x̃_i, y_i)∈L̃} ℓ(x̃_i, y_i; θ)
Finally, the model parameters θ are updated by a gradient descent algorithm. The above training procedure loops until the model converges or the maximum number of iterations is reached.
Claims (5)
1. A robust depth image classification model training method based on active labeling is characterized by comprising the following steps:
step 1, collecting a large unlabeled image set U and a small labeled training image data set L;
step 2, adding a noise perturbation to each image in the labeled set L to obtain a noisy labeled image set L̃;
step 3, taking the noisy labeled image set L̃ as the training set and initializing an image classification model f;
step 4, perturbing each image in the unlabeled set U multiple times, and computing a value score S_i for each unlabeled image based on the predictions of the model f on the clean image and on its perturbed versions;
step 5, sorting the scores S obtained in step 4 and, within the labeling budget, querying the user for the label information of the images in descending order of score to obtain the corresponding user feedback;
step 6, updating the labeled image set L and the unlabeled image set U according to the user feedback on image categories obtained in step 5, obtaining a noisy labeled set L̃ according to the method of step 2, and updating the prediction model f;
step 7, returning to step 4, or terminating and outputting the prediction model f.
2. The robust depth image classification model training method based on active labeling according to claim 1, wherein step 2 obtains the noisy labeled image set L̃ as follows: for each image x in L, a perturbation value ε randomly drawn from a Gaussian distribution N(0, σ²) is added; the specific expression is: x̃ = x + ε.
3. The robust depth image classification model training method based on active labeling according to claim 1, wherein the specific method for initializing the image classification model f in step 3 is as follows:
step 3.1: using the prediction model f, with parameters θ, to predict the category of each image in the noisy labeled set L̃; p_j(x̃_i) denotes the probability in the output of model f that the i-th image x̃_i is predicted to belong to the j-th class, and C denotes the total number of image categories; y_i denotes the true label of the i-th image x̃_i, in one-hot encoded form; the loss of the model on each noisy image is calculated as the cross-entropy:
ℓ(x̃_i, y_i; θ) = −Σ_{j=1}^{C} y_{ij} log p_j(x̃_i)
step 3.2: optimizing the model by minimizing the total loss on the noisy labeled image set L̃; the specific formula is:
θ* = argmin_θ Σ_{(x̃_i, y_i)∈L̃} ℓ(x̃_i, y_i; θ)
4. The robust depth image classification model training method based on active labeling according to claim 1, wherein step 4 calculates the value score S_i of each unlabeled image as follows:
step 4.1: for each unlabeled image x_i, adding m perturbations to obtain the corresponding perturbed image set {x_i^(1), …, x_i^(m)}, where x_i^(k) = x_i + ε_k and ε_k is a perturbation value randomly drawn from a Gaussian distribution N(0, σ²); the number of perturbations m is a hyper-parameter;
step 4.2: calculating the probability that the prediction of the model f on a perturbed image is inconsistent with its prediction on the clean image x_i:
d_i = (1/m) Σ_{k=1}^{m} 1[f(x_i^(k)) ≠ f(x_i)]
where 1[·] is the indicator function, whose output is 1 when the input is true and 0 when the input is false;
step 4.3: taking this inconsistency probability as the value score of each image x_i in the unqueried set U with respect to the classification model f: S_i = d_i.
5. The robust depth image classification model training method based on active labeling according to claim 1, wherein step 6 updates the labeled image set L and the unlabeled image set U according to the user feedback as follows: the user provides category label information for each queried image, and the image is moved from the unlabeled data set U to the labeled image data set L.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210135383.8A CN114187452A (en) | 2022-02-15 | 2022-02-15 | Robust depth image classification model training method based on active labeling |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114187452A true CN114187452A (en) | 2022-03-15 |
Family
ID=80545908
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113313166A (en) * | 2021-05-28 | 2021-08-27 | 华南理工大学 | Ship target automatic labeling method based on feature consistency learning |
CN113313178A (en) * | 2021-06-03 | 2021-08-27 | 南京航空航天大学 | Cross-domain image example-level active labeling method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
Application publication date: 20220315 |