CN114283287A - Robust field adaptive image learning method based on self-training noise label correction - Google Patents
- Publication number: CN114283287A
- Application number: CN202210221128.5A
- Authority
- CN
- China
- Prior art keywords
- source domain
- domain
- pseudo
- samples
- prediction
- Prior art date
- Legal status: Granted (the legal status is an assumption and is not a legal conclusion)
Landscapes
- Image Analysis (AREA)
Abstract
The invention discloses a robust domain-adaptive image learning method based on self-training noise label correction, comprising the following steps: acquire the source domain image set, the target domain image set, and the source domain low-quality labels; initialize the parameters; build the model and loss functions; feed the source domain and target domain image sets into two label classifiers in turn; before each training iteration, each label classifier detects noise for the other, re-predicts pseudo-labels for the noisy source domain samples and the target domain samples, and performs rebalanced sampling so that these samples participate in the next iteration; train the target-domain-specific network on the target domain pseudo-label set; after training, use the target-domain-specific classifier to perform class prediction on target domain images. To address the inconsistency between source and target class distributions, the method uses a rebalanced pseudo-label sampling mechanism that keeps the per-class sampling proportion consistent across the two domains, improving the accuracy of the deep learning model on the target domain.
Description
Technical Field
The invention relates to a robust domain-adaptive image learning method based on self-training noise label correction.
Background
Traditional supervised learning requires large quantities of images with accurate annotation; in practice, however, collecting large numbers of accurate labels is extremely costly, so the labels often contain substantial noise. Unsupervised domain adaptation transfers a model trained on an accurately labeled data set to another data set whose distribution is different but related, where the accurately labeled data set is the source domain and the unlabeled data set is the target domain. Although conventional domain adaptation solves the lack of supervision on the target domain, it ignores the considerable cost of acquiring source domain labels, so its performance degrades severely when the source domain label information is noisy.
Disclosure of Invention
The invention provides a robust domain-adaptive image learning method based on self-training noise label correction, aiming to further improve accuracy on the target domain when the label information is noisy and the source and target class distributions are inconsistent (class distribution shift).
In order to achieve the purpose, the invention adopts the following technical scheme:
A robust domain-adaptive image learning method based on self-training noise label correction comprises the following steps:
Step 1, acquire the source domain raw data set D_s and the target domain data set D_t;
where D_s is the source domain raw data set composed of source domain raw images x_si acquired from a network platform and their corresponding source domain low-quality labels, and N_s is the total number of samples in D_s;
D_t denotes the target domain data set composed only of target domain original images x_ti; N_t denotes the number of target domain samples in D_t;
Step 2, initialize various parameters, including the iteration count t = 0, the round-t pseudo-label threshold γ_t, and the pre-training parameter N_warm;
Step 3, build a deep learning model and loss functions, comprising: a feature extractor G, two label classifiers C1, C2, a target-domain-specific classifier C_t, a cross-entropy loss function L_ce, and a consistency loss function L_sp;
Step 4, input the source domain raw images x_si of the source domain raw data set D_s into the feature extractor G to extract image features f_si = G(x_si); then send the extracted source domain features f_si and the low-quality labels corresponding to the source domain raw images into the two label classifiers C1, C2 for warm-up training, lasting N_warm rounds;
Step 5, in the detection stage, the two label classifiers C1, C2 each detect the label noise in the source domain for the other, partitioning the source domain raw data set D_s into clean source domain samples and noisy source domain samples;
Step 6, the two label classifiers C1, C2 in turn perform class prediction on each noisy source domain sample and each sample of the target domain data set D_t, taking the predicted class as the pseudo-label of each such sample;
Step 7, according to γ_t, rebalance-sample each pseudo-labeled sample of the noisy source domain set and of the target domain data set D_t from step 6, with a per-class sampling proportion of γ_t / K, to obtain the pseudo-label set D_sp;
where K denotes the number of classes, and D_sp consists of the source domain pseudo-label set and the target domain pseudo-label set;
Step 8, in the training stage, input the images of the clean source domain samples and of the pseudo-label set D_sp into the feature extractor G to extract features, and feed the extracted features with their corresponding labels into the two label classifiers C1, C2 in turn for supervised training;
for the clean source domain samples, optimize the cross-entropy loss function L_ce; for the pseudo-label set D_sp, optimize the consistency loss function L_sp; thereby updating the feature extractor G and the two label classifiers C1, C2;
Step 9, input the images of the target domain pseudo-label set into the feature extractor G to extract features, feed the extracted features with their corresponding pseudo-labels into the target-domain-specific classification network C_t, and optimize the cross-entropy loss function L_ce to update G and C_t;
Step 10, judge whether the current iteration count t has reached the maximum number of iterations T;
if the current iteration count t has not reached the maximum number of iterations T, return to step 5 to continue self-training, set t = t + 1, and update γ_t = γ_0 + 0.05·t, where γ_0 denotes the initialized pseudo-label threshold; otherwise, go to step 11;
Step 11, after model training finishes, perform the classification prediction task: first use the feature extractor G to extract features from the target domain image, then input the extracted features into C_t for class prediction.
The invention has the following advantages:
As described above, the invention is a robust domain-adaptive image learning method based on self-training noise detection and pseudo-label rebalancing. It alternately trains a label network and a target-domain-specific network. The label network effectively filters label noise in the source domain and rebalance-samples the pseudo-labeled samples to address class distribution shift (label distribution shift). The target domain pseudo-labeled samples are used to train the target-domain-specific network, which acquires classification ability on the target domain and realizes knowledge transfer from the source domain to the target domain. At the same time, rebalanced pseudo-label sampling resolves the inconsistency between the source and target class distributions, further improving the robustness of domain adaptation under noisy conditions.
Drawings
FIG. 1 is a flow chart of a robust domain adaptive image learning method based on self-training noise label correction in an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an integral model in an embodiment of the invention;
FIG. 3 is a flow chart illustrating the filtering of source domain marker noise according to an embodiment of the present invention;
FIG. 4 is a flow chart illustrating the generation and rebalancing of sampling pseudo labels according to an embodiment of the present invention.
Detailed Description
Definitions of terms:
weak enhancement means that a picture is simply turned over and translated; the strong enhancement means that two kinds of strong transformation (translation, clipping, rotation, turning, image compression and the like) and disturbance (overexposure, contrast enhancement, sharpening, black and white processing, tone separation, Gaussian blur and the like) with different degrees are randomly added to one picture so as to cause the picture to be seriously distorted.
The basic idea of the invention is as follows. In noisy domain adaptation, the source domain labels are obtained through a low-cost annotation platform, so some labels are wrong. To handle the label noise in the source domain, the method exploits the property that a deep neural network fits clean samples before fitting noisy samples: by analyzing each sample's loss, samples with small loss are treated as clean, dividing the data set into clean source domain samples and noisy source domain samples. To handle the inconsistent marginal distributions of source and target samples (covariate shift), the invention borrows the idea of self-training: the model is first pre-trained on accurately labeled source domain samples, then the label network gradually assigns pseudo-labels to target domain samples, which join the training of the whole model. In addition, the invention trains a target-domain-specific classifier, trained only on target domain pseudo-labeled samples, to capture target-specific discriminative features, gradually realizing the transfer of knowledge from the source domain to the target domain.
The invention also considers label distribution shift: the per-class sample counts of the source and target domains may differ, and severe class imbalance may exist within a domain. The pseudo-labeled samples are therefore rebalance-sampled so that training on the source and target domains stays consistent across classes. Concretely, two label classifiers are built to filter the label noise in the source domain. Exploiting the property that a deep neural network fits clean samples before noisy ones, the per-sample losses are clustered: clean samples produce small cross-entropy loss and noisy samples produce large loss, so the sample losses can be fitted by a Gaussian mixture composed of a relatively high-loss distribution and a relatively low-loss distribution, and the probability that a sample's loss belongs to the low-loss component is the probability that the sample is clean. To reduce the continual accumulation of a single model's own errors, the two label classifiers divide the clean and noisy samples for each other. In each iteration, the noisy source domain samples and the target domain samples are used together, and reliable pseudo-labels are selected for them by judging whether the predictions on strong and weak enhancements agree, constructing a pseudo-label data set. To address the inconsistent class distributions of the two domains, the pseudo-labels are rebalance-sampled.
In this way, the method effectively handles both noisy domain adaptation and class distribution shift (manifested as class imbalance within each domain, with the imbalance affecting the source and target samples differently on each class), further improving the robustness of domain adaptation under noisy conditions.
The invention is described in further detail below with reference to the following figures and detailed description:
as shown in fig. 1, the robust domain adaptive image learning method based on self-training noise label correction includes the following steps:
Step 1, acquire the source domain raw data set D_s and the target domain data set D_t. Here, D_s is the source domain raw data set composed of source domain raw images x_si acquired from a network platform and their corresponding source domain low-quality labels, and N_s is the total number of samples in D_s.
D_t denotes the target domain data set composed only of target domain original images x_ti; N_t denotes the number of target domain samples in D_t.
A batch of high-quality source and target domain images can easily be acquired through an internet platform.
The annotations for the source domain image set can be obtained through a public online annotation platform, such as a crowdsourcing platform; labels obtained at such low cost are not completely accurate and therefore contain many noisy labels.
When the true labels of the images are class-imbalanced, the collected labels are imbalanced as well, so the source domain class distribution and the target domain class distribution may be inconsistent.
In such cases, realizing knowledge transfer from the source domain to the target domain is extremely challenging.
Step 2, initialize various parameters, including the iteration count t = 0, the round-t pseudo-label threshold γ_t, and the pre-training parameter N_warm. Here, γ_t is a manually set hyperparameter specifying the upper limit on the number of pseudo-labeled samples used in each round of iterative training.
Step 3, build a deep learning model and loss functions, comprising: a feature extractor G, two label classifiers C1, C2, a target-domain-specific classifier C_t, a cross-entropy loss function L_ce, and a consistency loss function L_sp.
As shown in fig. 2, the entire framework comprises four parts: a feature extractor (encoder) G composed of a deep convolutional network, and classifiers C1, C2, C_t, each composed of three fully connected layers and a BN (Batch Normalization) layer.
Here, G is a deep convolutional neural network used to extract sample features, mapping images into a high-dimensional feature space; its remaining inputs are the clean source domain samples and the source domain and target domain pseudo-label sets usable after screening.
The two label classifiers C1, C2 in this embodiment serve the following roles:
1. In the detection stage, they mainly detect and filter the noisy labels in the source domain raw data set D_s, partitioning D_s into clean source domain samples and noisy source domain samples; for the detected noisy source domain samples and the target domain samples D_t, they predict pseudo-labels, which are rebalance-sampled to participate in the next round of self-training.
C_t is the target-domain-specific classifier, trained using only the target domain pseudo-label set so that it captures target-specific discriminative features without interference from source domain features; the finally obtained C_t has good classification performance on the target domain, realizing the transfer of knowledge from the source domain to the target domain.
For a single image x, G first maps it into a high-dimensional deep feature space; then C1, C2, and C_t predict probabilities over the mapped features, mapping them to K-dimensional probability vectors P_model, where K is the number of classes.
Step 4, input the source domain raw images x_si of the source domain raw data set D_s into the feature extractor G to extract image features f_si = G(x_si); then send the source domain features f_si extracted from each source domain raw image x_si, together with the low-quality labels corresponding to the source domain raw images, into the two label classifiers C1, C2 for warm-up training, lasting N_warm rounds.
Warm-up training means that, before formal self-training, the model is updated on the source domain raw data set D_s by simply optimizing the cross-entropy loss function L_ce. Because a deep neural network fits clean samples before fitting noisy ones, this initial training lets the model fit the clean labels without fitting the noisy labels; it serves as pre-training for the subsequent self-training (steps 5 to 10) and initializes the network parameters of the whole model.
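A minimal sketch of the warm-up phase, substituting a numpy softmax classifier for the deep networks G, C1, C2 (the toy features, learning rate, and round count are assumptions for illustration):

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def warm_up(features, labels, num_classes, n_warm=50, lr=0.5):
    """Warm-up: optimize the cross-entropy loss L_ce on the raw source
    set for n_warm rounds, initializing the classifier weights."""
    n, d = features.shape
    W = np.zeros((d, num_classes))
    for _ in range(n_warm):
        p = softmax(features @ W)
        onehot = np.eye(num_classes)[labels]
        grad = features.T @ (p - onehot) / n   # gradient of mean cross-entropy
        W -= lr * grad
    return W

# toy "source domain features": two separable blobs
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (50, 2)), rng.normal(2, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)
W = warm_up(X, y, num_classes=2)
acc = (softmax(X @ W).argmax(1) == y).mean()
```

The patent's warm-up differs only in scale: the same cross-entropy objective is minimized, but over deep features f_si = G(x_si) and two separate classifier heads.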
Step 5, in the detection stage, the two label classifiers C1, C2 each detect the label noise in the source domain raw data set for the other, dividing D_s into clean source domain samples and noisy source domain samples.
Step 5.1, initialize the noise filtering threshold τ = 0.6.
Here, τ is a manually set hyperparameter representing the decision boundary, at each noise detection, for judging whether a sample is clean.
Step 5.2, input the source domain raw images x_si of D_s into the feature extractor G to extract features, obtaining f_si_t = G(x_si).
Here, f_si_t is the feature obtained after the source domain raw image x_si is input into the feature extractor G during the t-th round of self-training; the features f_si_t extracted in each round of training are all different.
Step 5.3, input the features f_si_t extracted from all source domain raw images x_si in step 5.2 into the two label classifiers C1, C2 in turn, and use C1, C2 to perform class prediction on the features f_si_t of each source domain raw image x_si in sequence, obtaining the corresponding class predictions C1(f_si_t), C2(f_si_t).
Here, C1(f_si_t) is the result of class prediction by C1 on the features f_si_t of each source domain raw image x_si, and C2(f_si_t) is the corresponding result of class prediction by C2.
Using the cross-entropy loss function L_ce, compute the cross-entropy between the class predictions C1(f_si_t), C2(f_si_t) and the source domain low-quality label corresponding to each source domain raw image x_si, obtaining the cross-entropy losses {l_i1, l_i2}, i = 1, …, N_s.
Here, l_i1 is the cross-entropy loss between C1(f_si_t) and the source domain low-quality label corresponding to x_si, and l_i2 is the cross-entropy loss between C2(f_si_t) and that same label.
Step 5.4, using a Gaussian Mixture Model (GMM), fit a Gaussian mixture distribution to the cross-entropy losses {l_i1, l_i2} of all source domain raw images, obtaining the mixtures p(g | l_i1), p(g | l_i2).
Each Gaussian mixture consists of two Gaussian components, representing the low-loss distribution and the high-loss distribution respectively.
Here, p(g | l_i1), p(g | l_i2) are the probabilities, computed by the two label classifiers C1, C2 for each source domain raw image x_si ∈ D_s, that the cross-entropy loss between the class prediction and the corresponding source domain low-quality label belongs to the low-loss component.
Fitting principle of Gaussian mixture distribution:
Because a deep neural network fits clean samples before fitting noisy ones, the sample losses cluster: clean samples produce small cross-entropy loss and noisy samples produce large loss, so the losses of all samples can be fitted by a Gaussian mixture composed of a relatively high-loss distribution and a relatively low-loss distribution. Based on the property that the deep neural network preferentially fits the clean labels, a source domain raw image with smaller loss is more likely to be a clean sample.
Step 5.5, take the source domain raw images whose probabilities p(g | l_i1), p(g | l_i2) are greater than the noise filtering threshold τ as clean source domain samples, and the remaining source domain raw images as noisy source domain samples.
To reduce the continual accumulation of each model's own errors, before the next round of self-training the two label classifiers C1, C2 divide the clean and noisy source domain samples for each other: the data partition produced by C1 is used by C2, and the partition produced by C2 is used by C1.
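The clean/noisy division of steps 5.3 to 5.5 can be sketched with scikit-learn's GaussianMixture; the per-sample losses below are simulated rather than produced by real classifiers:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def divide_clean_noisy(losses, tau=0.6):
    """Fit a 2-component GMM to per-sample cross-entropy losses and
    return a boolean mask: True where the posterior probability of the
    low-loss (clean) component exceeds the threshold tau."""
    gmm = GaussianMixture(n_components=2, random_state=0)
    gmm.fit(losses.reshape(-1, 1))
    clean_comp = gmm.means_.argmin()   # component with the smaller mean loss
    prob_clean = gmm.predict_proba(losses.reshape(-1, 1))[:, clean_comp]
    return prob_clean > tau

rng = np.random.default_rng(0)
# simulated losses: 80 clean samples cluster low, 20 noisy samples cluster high
losses_c1 = np.concatenate([rng.normal(0.1, 0.05, 80), rng.normal(2.0, 0.3, 20)])
losses_c2 = np.concatenate([rng.normal(0.12, 0.05, 80), rng.normal(1.9, 0.3, 20)])
# co-divide: the split computed from C1's losses is used by C2 and vice versa
clean_for_c2 = divide_clean_noisy(losses_c1)
clean_for_c1 = divide_clean_noisy(losses_c2)
```

The cross-use of the two masks mirrors the patent's co-division: each classifier trains on the partition produced by its peer, so a single model's fitting errors do not feed back into its own noise detection.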
FIG. 3 shows how the two label classifiers C1, C2 filter the label noise in the source domain.
Step 6, the two label classifiers C1, C2 in turn perform class prediction on each noisy source domain sample and each sample of the target domain data set D_t, taking the predicted class as the pseudo-label of each such sample.
To overcome the problem of inconsistent source and target class distributions, the method rebalance-samples, according to γ_t, each pseudo-labeled sample of the noisy source domain set and of D_t from step 6, with a per-class sampling proportion of γ_t / K, obtaining the pseudo-label set D_sp.
Here, K denotes the number of classes; D_sp consists of the source domain pseudo-label set and the target domain pseudo-label set.
FIG. 4 is a flow chart illustrating the generation and rebalancing of sampling pseudo labels in an embodiment of the present invention.
Step 6.1, the original labels of the noisy source domain samples are no longer trusted; together with the target domain samples D_t (themselves unlabeled), they form an unlabeled sample set {x_b}, where x_b denotes a sample in the unlabeled sample set.
Step 6.2, input the strongly enhanced version A(x_b) and the weakly enhanced version α(x_b) of each sample x_b into the two label classifiers C1, C2. Here, A(x_b) applies transformations and perturbations of different degrees to the sample x_b so that the image is severely distorted, and α(x_b) is obtained from x_b by vertical flipping and translation.
Step 6.3, the two label classifiers C1, C2 perform class prediction on the weakly enhanced version α(x_b) and the strongly enhanced version A(x_b), giving four predictions p1_α, p2_α, p1_A, p2_A. Here, p1_α is C1's prediction on α(x_b); p2_α is C2's prediction on α(x_b); p1_A is C1's prediction on A(x_b); and p2_A is C2's prediction on A(x_b). The predictions p1_α, p2_α are integrated by adding the probability prediction vectors, finally obtaining p_α; the predictions p1_A, p2_A are likewise integrated, finally obtaining p_A. Here, p_α and p_A are the combined predictions of C1, C2 on α(x_b) and A(x_b) respectively, i.e. the confidence on each class.
Step 6.4, for p_α and p_A, compute the predicted class with maximum probability over all K classes.
These are the pseudo-labels jointly predicted by the two label classifiers C1, C2 for the weakly enhanced version α(x_b) and for the strongly enhanced version A(x_b), respectively.
If the pseudo-labels predicted from the weak and strong versions are equal, the prediction is taken as a preliminarily reliable pseudo-label; the samples are then ranked from high to low by the prediction probability, i.e. the confidence on the predicted class. According to this confidence ranking, from each class the γ_t / K pseudo-labeled samples with the highest confidence are sampled in equal proportion, obtaining the source domain pseudo-label set and the target domain pseudo-label set.
For example, for all pseudo-labeled samples labeled class k, the method ranks them from high to low by their prediction confidence on class k, and then samples the top proportion γ_t / K of pseudo-labeled samples of class k.
Here, the prediction probability vectors of the label network for all images have size N × K.
For a single unlabeled sample, the class with the highest prediction probability is selected as its pseudo-label.
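A sketch of the selection in steps 6.3 and 6.4 combined with the rebalanced sampling: the arrays p_alpha, p_A stand in for the integrated weak-view and strong-view probability vectors, and the per-class quota plays the role of γ_t / K (the simulated data are assumptions):

```python
import numpy as np

def rebalanced_pseudo_labels(p_alpha, p_A, n_per_class):
    """p_alpha, p_A: (N, K) integrated probability vectors from C1 + C2
    on the weak and strong views. Keeps, per class, at most n_per_class
    agreeing samples with the highest weak-view confidence."""
    y_alpha = p_alpha.argmax(1)
    y_A = p_A.argmax(1)
    agree = y_alpha == y_A          # preliminarily reliable pseudo-labels
    conf = p_alpha.max(1)
    K = p_alpha.shape[1]
    selected = []
    for k in range(K):
        idx = np.where(agree & (y_alpha == k))[0]
        idx = idx[np.argsort(-conf[idx])][:n_per_class]  # top confidence first
        selected.extend(idx.tolist())
    return np.array(sorted(selected)), y_alpha

rng = np.random.default_rng(0)
logits = rng.normal(size=(40, 4))
p_alpha = np.exp(logits) / np.exp(logits).sum(1, keepdims=True)
p_A = p_alpha + rng.normal(0, 0.01, p_alpha.shape)  # nearly consistent views
sel, pseudo = rebalanced_pseudo_labels(p_alpha, p_A, n_per_class=3)
```

Because every class contributes at most the same number of samples, the resulting pseudo-label set has a balanced class distribution regardless of the imbalance in the raw predictions, which is the point of the γ_t / K quota.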
Because the model in the first round of iterative training is not yet mature and cannot predict accurate pseudo-labels for unlabeled samples, the invention uses the source domain pseudo-label set as the pseudo-labeled data set for training the target-domain-specific classifier C_t in the first iteration.
Step 8, in the training stage, input the images of the clean source domain samples and of the pseudo-label set D_sp into the feature extractor G to extract features, and feed the extracted features with their corresponding labels into the two label classifiers C1, C2 in turn for supervised training.
For the clean source domain samples, the cross-entropy loss function L_ce is optimized; for the pseudo-label set D_sp, the consistency loss function L_sp is optimized, updating the feature extractor G and the two label classifiers C1, C2.
Updating the two label classifiers C1, C2 uses the clean source domain samples (x_i, y_i); with the classifiers' prediction probability p(y_i | x_i) for each clean sample, the cross-entropy loss function L_ce takes the concrete form:

L_ce = −(1/B) Σ_{i=1}^{B} log p(y_i | x_i)
where B denotes the number of clean source domain samples per mini-batch, x_i denotes a source domain raw image of the clean source domain samples, and y_i denotes the label corresponding to that image.
Each mini-batch of clean source domain samples refers to evenly dividing all clean source domain samples into subsets of equal size and selecting each mini-batch in turn to feed into the network for training.
When the data volume is too large, all the data cannot be fed into the network for training at once; instead, during each iteration of training all samples are divided into subsets of equal size, which are selected in turn and fed into the network, so that each iteration computes the loss over a mini-batch rather than over all the data.
This accelerates both the training speed and the convergence of the model.
The two label classifiers C1, C2 re-predict pseudo-labels for the noisy source domain samples and the target domain samples, forming pseudo-labeled sample pairs (x_b, ŷ_b); the consistency loss function L_sp used to optimize the model takes the concrete form:

L_sp = (1/B) Σ_{b=1}^{B} H(ŷ_b, P_model(A(x_b)))
where A(x_b) is the strongly enhanced version of each sample x_b, and P_model(A(x_b)) denotes the class prediction of the label classifier C1 or C2 on A(x_b). H(ŷ_b, P_model(A(x_b))) is the cross-entropy between the pseudo-label and that prediction:

H(ŷ_b, p) = − Σ_{k=1}^{K} 1[ŷ_b = k] log p_k
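A numpy sketch of the consistency loss L_sp — the mean cross-entropy between each pseudo-label and the model's prediction on the strongly enhanced view (the probability array below is simulated, not produced by a real classifier):

```python
import numpy as np

def consistency_loss(pseudo_labels, p_strong):
    """L_sp: mean cross-entropy H(y_hat_b, P_model(A(x_b))) between each
    hard pseudo-label and the prediction on the strong view of the sample."""
    B = len(pseudo_labels)
    eps = 1e-12  # numerical guard for log(0)
    return -np.mean(np.log(p_strong[np.arange(B), pseudo_labels] + eps))

# simulated predictions on strongly enhanced views of a mini-batch of 3 samples
p_strong = np.array([[0.9, 0.05, 0.05],
                     [0.1, 0.8, 0.1],
                     [0.2, 0.2, 0.6]])
pseudo = np.array([0, 1, 2])
loss = consistency_loss(pseudo, p_strong)
```

The loss is small when the strong-view prediction agrees confidently with the pseudo-label obtained from the weak view, which is what pushes the model toward augmentation-consistent predictions.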
Step 9, input the images of the target domain pseudo-label set into the feature extractor G to extract features, feed the extracted features with their corresponding pseudo-labels into the target-domain-specific classification network C_t, and optimize the cross-entropy loss function L_ce to update G and C_t.
In the training phase of the target-domain-specific classifier C_t, the loss function used in this embodiment is the cross-entropy loss L_ce; L_ce is optimized to update the target-domain-specific classifier C_t.
Updating the target-domain-specific classifier C_t uses only the target domain pseudo-labeled samples (x_i, ŷ_i) for training; with C_t's prediction probability p(ŷ_i | x_i), the cross-entropy loss function L_ce takes the concrete form:

L_ce = −(1/B) Σ_{i=1}^{B} log p(ŷ_i | x_i)
where x_i denotes an original target domain image of the target domain pseudo-label set, ŷ_i denotes the pseudo-label added for it by the two label classifiers C1, C2, and B denotes the number of target domain pseudo-labeled samples per mini-batch.
Each mini-batch of target domain pseudo-labeled samples is obtained in the same way: all samples of the target domain pseudo-label set are evenly divided into subsets of equal size, and each mini-batch is selected in turn and fed into the network for training, which likewise accelerates the training speed and the convergence of the model.
Step 10, judge whether the current iteration count t has reached the maximum number of iterations T; if t has not reached T, return to step 5 to continue self-training, set t = t + 1, and update γ_t = γ_0 + 0.05·t; otherwise, go to step 11.
wherein γ_0 denotes the initialized pseudo-label threshold.
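The threshold update rule can be illustrated numerically (γ_0 = 0.2 is an assumed value for demonstration; the patent only specifies the increment of 0.05 per round):

```python
# Pseudo-label threshold schedule gamma_t = gamma_0 + 0.05 * t.
gamma_0 = 0.2  # assumed initial threshold, for illustration only
schedule = [round(gamma_0 + 0.05 * t, 2) for t in range(5)]
```

The threshold grows linearly with the round index, so later self-training rounds admit a larger fraction of pseudo-labeled samples.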
Step 11: after model training is completed, a feature extractor G capable of extracting reliable features on the target domain and a target-domain-specific classifier C_t capable of reliably classifying target domain samples are obtained.
To perform the final classification prediction task, the method of the invention uses the feature extractor G to extract features from a target domain image and inputs the extracted features into the target-domain-specific classification network C_t for category prediction.
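The final prediction pipeline, extract features then classify, can be sketched with C_t reduced to a single linear layer (a deliberate simplification; the patent's C_t uses three fully connected layers and a normalization layer, and all weights here are assumed toy values):

```python
import numpy as np

def predict(features, W, b):
    # Target-domain-specific classifier C_t as one linear layer followed by
    # argmax over the K class logits (illustrative stand-in only).
    logits = features @ W + b
    return logits.argmax(axis=1)

# Toy 2-dimensional features and a 3-class weight matrix.
W = np.array([[2.0, 0.0, 0.0],
              [0.0, 3.0, 0.0]])
b = np.zeros(3)
preds = predict(np.eye(2), W, b)
```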
It should be understood, however, that the description herein of specific embodiments is not intended to limit the invention to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention as defined by the appended claims.
Claims (6)
1. A robust domain adaptive image learning method based on self-training noise label correction, characterized by comprising the following steps:
Step 1: acquire the source domain and target domain image sets and the source domain low-quality labels;
wherein D_s denotes the source domain raw data set composed of the source domain original images x_si acquired from a network platform and their corresponding source domain low-quality labels, and N_s denotes the total number of samples in the source domain raw data set D_s;
D_t denotes the target domain data set composed only of target domain original images x_ti, and N_t denotes the number of target domain samples in D_t;
Step 2: initialize parameters, including the iteration count t = 0, the t-th round pseudo-label threshold γ_t, and the pre-training parameter N_warm;
Step 3: build the deep learning model and loss functions, including a feature extractor G, two label classifiers C_1 and C_2, a target-domain-specific classifier C_t, a cross-entropy loss function L_ce, and a consistency loss function L_sp;
Step 4: input the source domain original images x_si of the source domain raw data set D_s into the feature extractor G to extract image features f_si = G(x_si), then feed the extracted source domain features f_si and the low-quality labels corresponding to the source domain original images into the two label classifiers C_1 and C_2 for warm-up training of N_warm rounds;
Step 5: in the detection stage, use the two label classifiers C_1 and C_2 to detect label noise in the source domain for each other in turn, partitioning the source domain raw data set D_s into clean source domain samples and noise source domain samples;
Step 6: use the two label classifiers C_1 and C_2 in turn to perform category prediction on each sample of the noise source domain samples and the target domain data set D_t, taking the predicted category as the pseudo label of each sample in the noise source domain samples and in D_t;
Step 7: according to γ_t, sample the pseudo-labeled noise source domain samples obtained in step 6 and the pseudo-labeled samples of the target domain data set D_t, with a sampling ratio of γ_t / K for each category, to obtain the pseudo-label set D_sp;
wherein K denotes the number of categories, and D_sp consists of the source domain pseudo-label set and the target domain pseudo-label set;
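The per-category rebalanced sampling can be sketched as follows; this is a sketch under the assumption that γ_t / K is interpreted as each class's share of the total pool, and the function and variable names are illustrative:

```python
import numpy as np

def rebalance_sample(pseudo_labels, confidences, gamma_t, K):
    """Keep the same number of highest-confidence samples from every class,
    so class frequencies in the sampled pseudo-label set are uniform."""
    n_per_class = int(np.ceil(gamma_t / K * len(pseudo_labels)))
    keep = []
    for k in range(K):
        idx = np.where(pseudo_labels == k)[0]
        ranked = idx[np.argsort(-confidences[idx])]  # high confidence first
        keep.extend(ranked[:n_per_class].tolist())
    return sorted(keep)

# Toy pool: four class-0 samples and two class-1 samples.
labels = np.array([0, 0, 0, 0, 1, 1])
conf = np.array([0.9, 0.5, 0.8, 0.3, 0.95, 0.6])
kept = rebalance_sample(labels, conf, gamma_t=0.6, K=2)
```

Both classes contribute the same number of samples despite the imbalanced pool, which is the rebalancing effect the abstract describes.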
Step 8: in the training stage, input the images of the clean source domain samples and of the pseudo-label set D_sp into the feature extractor G to extract features, and feed the extracted features and their corresponding labels into the two label classifiers C_1 and C_2 in turn for supervised training;
for the clean source domain samples, optimize the cross-entropy loss function L_ce; for the pseudo-label set D_sp, optimize the consistency loss function L_sp, so as to update the feature extractor G and the two label classifiers C_1 and C_2;
Step 9: input the images of the target domain pseudo-label set into the feature extractor G to extract features, input the extracted features and their corresponding pseudo labels into the target-domain-specific classification network C_t, and optimize the cross-entropy loss function L_ce to update G and C_t;
Step 10: judge whether the current iteration count t has reached the maximum number of iterations T;
if t has not reached T, return to step 5 to continue self-training, set t = t + 1, and update γ_t = γ_0 + 0.05 * t, wherein γ_0 denotes the initialized pseudo-label threshold; otherwise, go to step 11;
Step 11: after model training is completed, perform the classification prediction task: first use the feature extractor G to extract features from the target domain image, then input the extracted features into C_t for category prediction.
2. The robust domain adaptive image learning method as recited in claim 1,
the step 5 specifically comprises the following steps:
Step 5.1: initialize the noise filtering threshold τ = 0.6;
Step 5.2: input the source domain original images x_si in D_s to the feature extractor G to extract features, obtaining f_si_t = G(x_si);
wherein f_si_t is the feature obtained after inputting the source domain original image x_si into the feature extractor G during the t-th round of self-training;
Step 5.3: input the features f_si_t extracted from all source domain original images x_si in step 5.2 into the two label classifiers C_1 and C_2 in turn, and use C_1 and C_2 to perform category prediction on the features f_si_t of each source domain original image x_si in sequence, obtaining the corresponding category prediction results C_1(f_si_t) and C_2(f_si_t);
wherein C_1(f_si_t) is the result of category prediction by C_1 on the features f_si_t of each source domain original image x_si, and C_2(f_si_t) is the corresponding result of category prediction by C_2;
use the cross-entropy loss function L_ce to compute, for each source domain original image x_si, the loss between the category prediction results C_1(f_si_t), C_2(f_si_t) and the corresponding source domain low-quality label, obtaining the cross-entropy losses {l_i1, l_i2}, i = 1, ..., N_s;
wherein l_i1 denotes the cross-entropy loss between C_1(f_si_t) and the source domain low-quality label corresponding to x_si, and l_i2 denotes the cross-entropy loss between C_2(f_si_t) and that label;
Step 5.4: fit, by means of Gaussian mixture models, a Gaussian mixture distribution to the cross-entropy losses {l_i1, l_i2}, i = 1, ..., N_s, of all source domain original images x_si, obtaining the Gaussian mixture distributions p(g | l_i1) and p(g | l_i2);
each Gaussian mixture distribution consists of two Gaussian distributions, representing a low-loss distribution and a high-loss distribution respectively;
wherein p(g | l_i1) and p(g | l_i2) are, respectively, the probabilities of a low loss for each source domain original image x_si ∈ D_s, computed from the cross-entropy between the category predictions of the two label classifiers C_1, C_2 and the corresponding source domain low-quality label;
Step 5.5: take the source domain original images whose probability under the Gaussian mixture distributions p(g | l_i1), p(g | l_i2) is greater than the noise filtering threshold τ as clean source domain samples, and the remaining source domain original images as noise source domain samples;
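Steps 5.1 to 5.5 amount to fitting a two-component Gaussian mixture to per-sample losses and thresholding the posterior of the low-loss component. A minimal NumPy sketch with a hand-rolled EM loop follows; the synthetic losses and all numeric values are assumptions for demonstration:

```python
import numpy as np

def clean_probability(losses, iters=50):
    # Fit a two-component 1-D Gaussian mixture to per-sample losses with EM
    # and return the posterior of the low-mean (low-loss) component.
    losses = np.asarray(losses, dtype=float)
    mu = np.array([losses.min(), losses.max()])
    sigma = np.array([losses.std() + 1e-6] * 2)
    pi = np.array([0.5, 0.5])
    for _ in range(iters):
        # E-step: component responsibilities for every sample.
        pdf = (pi / (sigma * np.sqrt(2 * np.pi))
               * np.exp(-0.5 * ((losses[:, None] - mu) / sigma) ** 2))
        resp = pdf / (pdf.sum(axis=1, keepdims=True) + 1e-300)
        # M-step: re-estimate weights, means, and standard deviations.
        nk = resp.sum(axis=0) + 1e-12
        mu = (resp * losses[:, None]).sum(axis=0) / nk
        sigma = np.sqrt((resp * (losses[:, None] - mu) ** 2).sum(axis=0) / nk) + 1e-6
        pi = nk / len(losses)
    low = int(np.argmin(mu))
    return resp[:, low]

rng = np.random.default_rng(0)
losses = np.concatenate([rng.normal(0.2, 0.05, 80),   # clean: low loss
                         rng.normal(2.0, 0.30, 20)])  # noisy: high loss
p_clean = clean_probability(losses)
clean_mask = p_clean > 0.6  # noise filtering threshold tau = 0.6
```

In practice a library implementation (e.g. a two-component `GaussianMixture`) would replace the hand-rolled EM; the sketch only makes the low-loss/high-loss split of step 5.4 concrete.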
3. The robust domain adaptive image learning method as recited in claim 2,
in step 6, the specific steps of generating the pseudo labels are as follows:
Step 6.1: combine the noise source domain samples and the target domain samples D_t into an unlabeled sample set {x_b};
wherein x_b denotes a sample in the unlabeled sample set;
Step 6.2: input the strongly enhanced version A(x_b) and the weakly enhanced version α(x_b) of each sample x_b into C_1 and C_2 respectively;
wherein A(x_b) applies transformations and perturbations of different degrees to the sample x_b so as to severely distort the picture;
α(x_b) is obtained from the sample x_b by vertical flipping and translation;
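The weak and strong enhancements can be sketched on a NumPy image array; the specific translation offset and the noise-based strong transform are assumed stand-ins, since the patent does not fix the exact transforms:

```python
import numpy as np

def weak_aug(img, shift=1):
    # alpha(x_b): vertical flip followed by a translation (implemented here
    # as a horizontal roll by `shift` pixels; the exact translation scheme
    # is not specified in the text).
    return np.roll(np.flipud(img), shift, axis=1)

def strong_aug(img, rng):
    # A(x_b): a heavier, picture-distorting transform; additive Gaussian
    # noise stands in for the unspecified strong augmentations.
    return img + rng.normal(0.0, 0.5, size=img.shape)

img = np.arange(9, dtype=float).reshape(3, 3)
weak = weak_aug(img)
strong = strong_aug(img, np.random.default_rng(0))
```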
Step 6.3: the two label classifiers C_1 and C_2 perform classification prediction on the weakly enhanced version α(x_b) and the strongly enhanced version A(x_b) respectively, obtaining four prediction results: p_1^α, p_2^α, p_1^A, p_2^A;
wherein p_1^α is the prediction result of classifier C_1 on the weakly enhanced version α(x_b), and p_2^α is the prediction result of classifier C_2 on the weakly enhanced version α(x_b);
p_1^A is the prediction result of classifier C_1 on the strongly enhanced version A(x_b), and p_2^A is the prediction result of classifier C_2 on the strongly enhanced version A(x_b);
the prediction results p_1^α and p_2^α are integrated, i.e., the probability prediction vectors are added, finally obtaining p^α;
the prediction results p_1^A and p_2^A are integrated, i.e., the probability prediction vectors are added, finally obtaining p^A;
wherein p^α and p^A denote the overall predicted class results of the two label classifiers C_1, C_2 for the weakly enhanced version α(x_b) and the strongly enhanced version A(x_b) respectively, i.e., the confidence in each class;
wherein the predicted category is the category with the maximum probability among all K categories;
the weak-view pseudo label denotes the pseudo label comprehensively predicted by the two label classifiers C_1, C_2 for the weakly enhanced version α(x_b);
the strong-view pseudo label denotes the pseudo label comprehensively predicted by the two label classifiers C_1, C_2 for the strongly enhanced version A(x_b);
the predicted pseudo labels for which the weak-view and strong-view predictions are equal are taken as preliminarily reliable pseudo labels, and are ranked from high to low by the prediction probability, i.e., the confidence on the predicted category; according to this confidence ranking, the pseudo-labeled samples with the highest confidence are sampled from each category at the equal proportion γ_t / K, obtaining the source domain pseudo-label set and the target domain pseudo-label set.
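The integration and agreement rule of step 6.3 and onward (add the probability vectors of C_1 and C_2, then keep samples whose weak-view and strong-view predicted categories coincide) can be sketched as follows; the toy probability values are assumptions:

```python
import numpy as np

def ensemble_pseudo_labels(p1_w, p2_w, p1_s, p2_s):
    # Integrate the two classifiers by adding their probability vectors
    # (p_alpha and p_A), then keep only samples where the weak-view and
    # strong-view predicted categories agree.
    p_w = p1_w + p2_w
    p_s = p1_s + p2_s
    y_w = p_w.argmax(axis=1)
    agree = y_w == p_s.argmax(axis=1)
    conf = p_w.max(axis=1) / 2.0  # average confidence on the predicted class
    return y_w, agree, conf

# Two toy samples, two classes: sample 0 agrees across views, sample 1 does not.
p1_w = np.array([[0.6, 0.4], [0.2, 0.8]])
p2_w = np.array([[0.7, 0.3], [0.6, 0.4]])
p1_s = np.array([[0.8, 0.2], [0.9, 0.1]])
p2_s = np.array([[0.6, 0.4], [0.7, 0.3]])
labels, agree, conf = ensemble_pseudo_labels(p1_w, p2_w, p1_s, p2_s)
```

The surviving labels would then be ranked by `conf` within each class before the γ_t / K sampling described above.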
4. The robust domain adaptive image learning method as recited in claim 1,
the feature extractor G is composed of a deep convolutional network; the two label classifiers C_1, C_2 and the target-domain-specific classifier C_t are each composed of three fully connected layers and a normalization layer.
5. The robust domain adaptive image learning method as recited in claim 3,
in the training phase, L_ce and L_sp are optimized to update the two label classifiers C_1, C_2;
when updating the two label classifiers C_1 and C_2, the prediction probabilities of C_1 and C_2 on the images in the clean source domain samples are used, and the cross-entropy loss function L_ce takes the following concrete form:
wherein B denotes the number of clean source domain samples per mini-batch and K denotes the total number of categories;
each mini-batch of clean source domain samples is obtained by evenly dividing all clean source domain samples into subsets of equal size; the mini-batches are then selected in turn and fed into the network for training;
the remaining symbols denote, respectively, the source domain original image in a clean source domain sample and the label corresponding to that source domain image;
the two label classifiers C_1 and C_2 re-predict pseudo labels for the noise source domain samples and the target domain samples, forming a pseudo-labeled sample set of pairs (x_b, pseudo label); the consistency loss function L_sp used to optimize the model takes the following concrete form:
wherein A(x_b) is the strongly enhanced version of each sample x_b;
P_model(A(x_b)) denotes the category prediction of label classifier C_1 or C_2 on the strongly enhanced version A(x_b) of the sample x_b;
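Assuming L_sp takes the standard cross-entropy form between the strong-view prediction P_model(A(x_b)) and the re-predicted pseudo label (the patent gives the exact expression only as a formula image, so this is an assumed reconstruction), a minimal sketch:

```python
import numpy as np

def consistency_loss(p_strong, pseudo_labels):
    # Assumed form of L_sp: mean cross-entropy between the model prediction
    # on the strongly enhanced view A(x_b) and the re-predicted pseudo label.
    B = len(pseudo_labels)
    return -np.log(p_strong[np.arange(B), pseudo_labels] + 1e-12).mean()

# Toy strong-view probabilities for two samples, both pseudo-labeled class 0.
p_strong = np.array([[0.5, 0.5],
                     [0.9, 0.1]])
pseudo = np.array([0, 0])
loss = consistency_loss(p_strong, pseudo)
```

Minimizing this loss pushes the prediction on the heavily distorted view toward the pseudo label obtained from the weakly distorted view, which is the consistency idea behind L_sp.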
6. The robust domain adaptive image learning method as recited in claim 3,
in the training phase, L_ce is optimized to update the target-domain-specific classifier;
when updating the target-domain-specific classifier C_t, only the target domain pseudo-labeled samples are used for training; given the prediction probability of the target-domain-specific classifier C_t on each image, the cross-entropy loss function L_ce takes the following concrete form:
wherein the sample in the loss is the target domain original image in the target domain pseudo-label set, its pseudo label is the one added by the two label classifiers C_1 and C_2, B denotes the number of target domain pseudo-labeled samples per mini-batch, and K denotes the total number of categories;
each mini-batch of target domain pseudo-labeled samples is obtained by evenly dividing all samples of the target domain pseudo-label set into subsets of equal size; the mini-batches are then selected in turn and fed into the network for training.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210221128.5A CN114283287B (en) | 2022-03-09 | 2022-03-09 | Robust field adaptive image learning method based on self-training noise label correction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210221128.5A CN114283287B (en) | 2022-03-09 | 2022-03-09 | Robust field adaptive image learning method based on self-training noise label correction |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114283287A true CN114283287A (en) | 2022-04-05 |
CN114283287B CN114283287B (en) | 2022-05-06 |
Family
ID=80882312
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210221128.5A Active CN114283287B (en) | 2022-03-09 | 2022-03-09 | Robust field adaptive image learning method based on self-training noise label correction |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114283287B (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114723994A (en) * | 2022-04-18 | 2022-07-08 | 中国矿业大学 | Hyperspectral image classification method based on dual-classifier confrontation enhancement network |
CN114998602A (en) * | 2022-08-08 | 2022-09-02 | 中国科学技术大学 | Domain adaptive learning method and system based on low confidence sample contrast loss |
CN115331065A (en) * | 2022-10-13 | 2022-11-11 | 南京航空航天大学 | Robust noise multi-label image learning method based on decoder iterative screening |
CN115331088A (en) * | 2022-10-13 | 2022-11-11 | 南京航空航天大学 | Robust learning method based on class labels with noise and imbalance |
CN115496972A (en) * | 2022-11-15 | 2022-12-20 | 杭州涿溪脑与智能研究所 | Industrial field self-adaption method based on data mixing |
CN117132841A (en) * | 2023-10-26 | 2023-11-28 | 之江实验室 | Domain self-adaptive image classification method and device for conservation and progression |
CN117848588A (en) * | 2024-03-07 | 2024-04-09 | 青岛天一红旗软控科技有限公司 | Mechanical balance self-adaptive test method |
CN118097319A (en) * | 2024-04-29 | 2024-05-28 | 南京航空航天大学 | Image classification method with unseen class and noise labels in online stream data |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
US20200130177A1 (en) * | 2018-10-29 | 2020-04-30 | Hrl Laboratories, Llc | Systems and methods for few-shot transfer learning |
CN111368977A (en) * | 2020-02-28 | 2020-07-03 | 交叉信息核心技术研究院(西安)有限公司 | Enhanced data enhancement method for improving accuracy and robustness of convolutional neural network |
WO2020159638A1 (en) * | 2019-01-30 | 2020-08-06 | Hrl Laboratories, Llc | System and method for unsupervised domain adaptation via sliced-wasserstein distance |
CN112232241A (en) * | 2020-10-22 | 2021-01-15 | 华中科技大学 | Pedestrian re-identification method and device, electronic equipment and readable storage medium |
CN112287994A (en) * | 2020-10-26 | 2021-01-29 | 北京嘀嘀无限科技发展有限公司 | Pseudo label processing method, device, equipment and computer readable storage medium |
CN112906606A (en) * | 2021-03-05 | 2021-06-04 | 南京航空航天大学 | Domain-adaptive pedestrian re-identification method based on mutual divergence learning |
US20210217405A1 (en) * | 2020-01-10 | 2021-07-15 | International Business Machines Corporation | Implementing a domain adaptive semantic role labeler |
CN113344044A (en) * | 2021-05-21 | 2021-09-03 | 北京工业大学 | Cross-species medical image classification method based on domain self-adaptation |
CN113378981A (en) * | 2021-07-02 | 2021-09-10 | 湖南大学 | Noise scene image classification method and system based on domain adaptation |
CN113807420A (en) * | 2021-09-06 | 2021-12-17 | 湖南大学 | Domain self-adaptive target detection method and system considering category semantic matching |
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200130177A1 (en) * | 2018-10-29 | 2020-04-30 | Hrl Laboratories, Llc | Systems and methods for few-shot transfer learning |
WO2020159638A1 (en) * | 2019-01-30 | 2020-08-06 | Hrl Laboratories, Llc | System and method for unsupervised domain adaptation via sliced-wasserstein distance |
CN113316790A (en) * | 2019-01-30 | 2021-08-27 | 赫尔实验室有限公司 | System and method for unsupervised domain adaptation via SLICED-WASSERSTEIN distance |
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
US20210217405A1 (en) * | 2020-01-10 | 2021-07-15 | International Business Machines Corporation | Implementing a domain adaptive semantic role labeler |
CN111368977A (en) * | 2020-02-28 | 2020-07-03 | 交叉信息核心技术研究院(西安)有限公司 | Enhanced data enhancement method for improving accuracy and robustness of convolutional neural network |
CN112232241A (en) * | 2020-10-22 | 2021-01-15 | 华中科技大学 | Pedestrian re-identification method and device, electronic equipment and readable storage medium |
CN112287994A (en) * | 2020-10-26 | 2021-01-29 | 北京嘀嘀无限科技发展有限公司 | Pseudo label processing method, device, equipment and computer readable storage medium |
CN112906606A (en) * | 2021-03-05 | 2021-06-04 | 南京航空航天大学 | Domain-adaptive pedestrian re-identification method based on mutual divergence learning |
CN113344044A (en) * | 2021-05-21 | 2021-09-03 | 北京工业大学 | Cross-species medical image classification method based on domain self-adaptation |
CN113378981A (en) * | 2021-07-02 | 2021-09-10 | 湖南大学 | Noise scene image classification method and system based on domain adaptation |
CN113807420A (en) * | 2021-09-06 | 2021-12-17 | 湖南大学 | Domain self-adaptive target detection method and system considering category semantic matching |
Non-Patent Citations (8)
Title |
---|
CHEN WEI et al.: "CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning", 《COMPUTER VISION AND PATTERN RECOGNITION》 *
QIANGGUO JIN et al.: "Domain adaptation based self-correction model for COVID-19 infection segmentation in CT images", 《ARXIV》 *
XIYU YU et al.: "Label-Noise Robust Domain Adaptation", 《PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING》 *
YITING CHENG et al.: "Dual Path Learning for Domain Adaptation of Semantic Segmentation", 《COMPUTER VISION FOUNDATION》 *
FU JIAHUI: "Research on Deep Transfer Learning Algorithms and Their Applications", 《China Masters' Theses Full-Text Database, Information Science and Technology》 *
WU TAO: "Research and Application of Domain Adaptation Methods in Image Classification", 《Wanfang Data》 *
YANG YI et al.: "A Self-Training-Based Label Noise Correction Algorithm for Crowdsourced Labeling", 《Acta Automatica Sinica》 *
CHENG KANGMING et al.: "A Triple-Selection Semi-Supervised Regression Algorithm under a Self-Training Framework", 《CAAI Transactions on Intelligent Systems》 *
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114723994A (en) * | 2022-04-18 | 2022-07-08 | 中国矿业大学 | Hyperspectral image classification method based on dual-classifier confrontation enhancement network |
CN114998602A (en) * | 2022-08-08 | 2022-09-02 | 中国科学技术大学 | Domain adaptive learning method and system based on low confidence sample contrast loss |
CN114998602B (en) * | 2022-08-08 | 2022-12-30 | 中国科学技术大学 | Domain adaptive learning method and system based on low confidence sample contrast loss |
CN115331065A (en) * | 2022-10-13 | 2022-11-11 | 南京航空航天大学 | Robust noise multi-label image learning method based on decoder iterative screening |
CN115331088A (en) * | 2022-10-13 | 2022-11-11 | 南京航空航天大学 | Robust learning method based on class labels with noise and imbalance |
CN115496972A (en) * | 2022-11-15 | 2022-12-20 | 杭州涿溪脑与智能研究所 | Industrial field self-adaption method based on data mixing |
CN115496972B (en) * | 2022-11-15 | 2023-04-07 | 杭州涿溪脑与智能研究所 | Industrial field self-adaption method based on data mixing |
CN117132841A (en) * | 2023-10-26 | 2023-11-28 | 之江实验室 | Domain self-adaptive image classification method and device for conservation and progression |
CN117132841B (en) * | 2023-10-26 | 2024-03-29 | 之江实验室 | Domain self-adaptive image classification method and device for conservation and progression |
CN117848588A (en) * | 2024-03-07 | 2024-04-09 | 青岛天一红旗软控科技有限公司 | Mechanical balance self-adaptive test method |
CN117848588B (en) * | 2024-03-07 | 2024-06-04 | 青岛天一红旗软控科技有限公司 | Mechanical balance self-adaptive test method |
CN118097319A (en) * | 2024-04-29 | 2024-05-28 | 南京航空航天大学 | Image classification method with unseen class and noise labels in online stream data |
Also Published As
Publication number | Publication date |
---|---|
CN114283287B (en) | 2022-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114283287B (en) | Robust field adaptive image learning method based on self-training noise label correction | |
WO2021134871A1 (en) | Forensics method for synthesized face image based on local binary pattern and deep learning | |
CN110443143B (en) | Multi-branch convolutional neural network fused remote sensing image scene classification method | |
CN108648191B (en) | Pest image recognition method based on Bayesian width residual error neural network | |
CN111160553B (en) | Novel field self-adaptive learning method | |
CN112733965B (en) | Label-free image classification method based on small sample learning | |
CN108416382B (en) | Web image training convolutional neural network method based on iterative sampling and one-to-many label correction | |
CN110781921A (en) | Depth residual error network and transfer learning-based muscarinic image identification method and device | |
CN113076994B (en) | Open-set domain self-adaptive image classification method and system | |
CN113361566B (en) | Method for migrating generative confrontation network by using confrontation learning and discriminant learning | |
CN115331088B (en) | Robust learning method based on class labels with noise and imbalance | |
CN114842267A (en) | Image classification method and system based on label noise domain self-adaption | |
CN113780242A (en) | Cross-scene underwater sound target classification method based on model transfer learning | |
CN113297988A (en) | Object attitude estimation method based on domain migration and depth completion | |
CN114139616A (en) | Unsupervised domain adaptive target detection method based on uncertainty perception | |
CN111028203A (en) | CNN blind image quality evaluation method based on significance | |
CN114943965A (en) | Unsupervised domain self-adaptive remote sensing image semantic segmentation method based on course learning | |
CN113869463B (en) | Long tail noise learning method based on cross enhancement matching | |
CN112883931A (en) | Real-time true and false motion judgment method based on long and short term memory network | |
CN113989256A (en) | Detection model optimization method, detection method and detection device for remote sensing image building | |
CN111310820A (en) | Foundation meteorological cloud chart classification method based on cross validation depth CNN feature integration | |
CN115331065B (en) | Robust noise multi-label image learning method based on decoder iterative screening | |
CN115205275A (en) | Surface defect detection method based on deep learning algorithm | |
CN115223033A (en) | Synthetic aperture sonar image target classification method and system | |
CN113553917B (en) | Office equipment identification method based on pulse transfer learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||