Self-supervised domain adaptation method based on meta-learning
Technical Field
The invention belongs to the field of computer vision and image processing, and particularly relates to an unsupervised domain adaptation method based on meta-learning and image reconstruction. The method makes the parameter update direction that the source domain image classification task imposes on the feature extraction network consistent with the parameter update direction imposed by the target domain image reconstruction task, which exploits intrinsic information of the target domain images such as the spatial relationships of objects and illumination. As a result, the features produced by the feature extraction network are domain-invariant, realizing domain adaptation between the source domain and the target domain.
Background
In recent years, supervised learning based on deep neural networks has been widely applied in fields such as image classification, object detection, semantic segmentation, and natural language processing, greatly advancing the integration of artificial intelligence technology into real life. However, supervised learning methods usually assume that the training set samples and the test set samples obey the same probability distribution; moreover, to achieve good generalization and avoid overfitting, supervised training usually requires a large number of labeled training samples. With the advent of the big data era and the ever-growing scale of data, problems such as differing statistical properties across data sets and the high labor cost of data annotation have become increasingly apparent. To address these problems, unsupervised domain adaptation methods have been widely studied. Unsupervised domain adaptation resolves the distribution difference between a source domain and a target domain by learning generalizable knowledge from labeled source domain data and applying it to tasks on unlabeled target domain data, thereby improving the performance of the network on the target domain.
At present, most unsupervised domain adaptation methods focus on aligning the feature probability distributions of the source domain data and the target domain data through distance-metric-based or adversarial-learning-based approaches, thereby reducing inter-domain differences so that the network can learn a domain-invariant feature space. However, these methods only align the distributions of the source and target domain data as a whole, ignoring the influence of the intrinsic characteristics of the target domain data on knowledge transfer during domain adaptation. Self-supervised domain adaptation methods based on image reconstruction therefore jointly train the network by setting self-supervised tasks on the unlabeled target domain data, such as image reconstruction, image rotation angle prediction, and image inpainting, in cooperation with the labeled source domain data; that is, they assist the network in transferring knowledge learned from the source domain data to the target domain by mining the intrinsic characteristics of the target domain data. In addition, because image-reconstruction-based methods preserve the integrity of the data while extracting transferable features, the information in the target domain data that improves performance on the specific task is not damaged, and the original distribution of the target domain data is preserved as much as possible during knowledge transfer, so the knowledge learned from the source domain data can be better applied to target domain tasks.
However, many existing unsupervised domain adaptation methods simply update the parameters of the feature extraction network separately through the self-supervised task and the specific task such as image classification, without considering whether the update directions of the two types of tasks on the feature extraction network parameters are consistent; the self-supervised task may therefore negatively affect feature learning for the specific task such as image classification. Through meta-learning, the self-supervised task and the specific task such as image classification can serve as a trainer and a tester respectively, and the parameters of the tester network are updated through the loss function of the trainer, so that the update directions of the two types of tasks on the network parameters tend to be consistent.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a self-supervised domain adaptation method based on meta-learning (Meta-learning).
A self-supervised domain adaptation method based on meta-learning comprises the following steps:
step 1, setting a trainer and a tester:
The reconstruction process of the target domain samples serves as the trainer in meta-learning, and the classification process of the source domain samples serves as the tester in meta-learning.
Step 2, performing an image reconstruction task by using the target domain sample and calculating reconstruction loss:
The unlabeled target domain samples are input into the feature extraction network to obtain the target domain sample features; these features are then input into the image reconstruction network to reconstruct the images, and the reconstruction loss is calculated.
Step 3, updating parameters of the feature extraction network:
Due to weight sharing, the parameters of the feature extraction network in the tester are updated together with the parameters of the feature extraction network in the trainer; that is, the parameter update direction of the network in the tester tends toward the parameter update direction of the network in the trainer.
Step 4, performing a classification task by using the source domain samples and calculating a classification loss:
The labeled source domain data are input into the feature extraction network with updated parameters to obtain the source domain data features, which are then input into a classification network to perform the image classification task and calculate the classification loss.
Step 5, calculating a total loss function and updating parameters of all networks:
The total loss function is calculated, and the parameters of the feature extraction network, the reconstruction network, and the classification network in the trainer and the tester are updated.
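The five steps above can be sketched as a toy numeric example. This is a minimal sketch under strong assumptions: scalar linear stand-ins replace ResNet-50, the decoder, and the classifier; gradients are taken numerically; and all data and helper names (`recon_loss`, `class_loss`, `total_loss`) are illustrative, not the actual networks of the invention.

```python
# Toy numeric sketch of steps 1-5 (illustrative only): a scalar "feature
# extractor" w_g, "decoder" w_d, and logistic "classifier" w_c stand in for
# ResNet-50, the reconstruction network D, and the classification network C.
import math

# Unlabeled target samples and labeled source samples (made-up 1-D data).
xt = [1.0, 2.0, -1.0]
xs = [(1.0, 1), (-1.0, 0), (2.0, 1)]

alpha, beta, lam = 0.1, 0.05, 1.0   # inner lr, outer lr, loss weight

def recon_loss(wg, wd):
    # L_r: mean squared error of reconstructing x from D(G(x))
    return sum((x - wd * (wg * x)) ** 2 for x in xt) / len(xt)

def class_loss(wg, wc):
    # L_c: cross-entropy of a logistic classifier on features G(x_s)
    total = 0.0
    for x, y in xs:
        p = 1.0 / (1.0 + math.exp(-wc * (wg * x)))
        p = min(max(p, 1e-12), 1.0 - 1e-12)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(xs)

def grad(f, args, i, eps=1e-6):
    # Central-difference gradient of f with respect to its i-th argument.
    a = list(args)
    a[i] += eps
    hi = f(*a)
    a[i] -= 2 * eps
    lo = f(*a)
    return (hi - lo) / (2 * eps)

def total_loss(wg, wd, wc):
    # Steps 2-3 (meta-train): inner update of the shared extractor on L_r.
    wg_inner = wg - alpha * grad(recon_loss, (wg, wd), 0)
    # Step 4 (meta-test) and step 5: L = lam * L_r + L_c at the updated wg.
    return lam * recon_loss(wg, wd) + class_loss(wg_inner, wc)

w_g, w_d, w_c = 0.5, 0.5, 0.0
for step in range(200):  # step 5: outer update of all parameters
    g = [grad(total_loss, (w_g, w_d, w_c), i) for i in range(3)]
    w_g, w_d, w_c = w_g - beta * g[0], w_d - beta * g[1], w_c - beta * g[2]

print(recon_loss(w_g, w_d), class_loss(w_g, w_c))  # both losses shrink
```

Because the classification loss is evaluated at the inner-updated extractor, its gradient with respect to the extractor is pulled toward the reconstruction task's update direction, which is the point of the meta-learning arrangement.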
The invention has the following beneficial effects:
(1) Target domain image reconstruction is used as the self-supervised task in the domain adaptation process; the supervision information is the target domain image itself, so no additional annotation of the target domain images is needed, which saves a large amount of manual labeling cost. In addition, the reconstruction process of the target domain images lets the network learn richer high-level semantic information from the target domain images, so the intrinsic characteristics of the target domain data can assist the network in transferring the knowledge learned from the source domain data to the target domain, improving the performance of the domain adaptation method.
(2) By introducing the meta-learning strategy into self-supervised domain adaptation, the directions in which the target domain self-supervised task and specific tasks such as source domain classification update the network parameters tend to be consistent, so the network can better extract domain-invariant features; this alleviates the negative transfer caused by inconsistent parameter update directions between the domain adaptation task and the specific task, and improves domain adaptation performance.
Drawings
Fig. 1 is a flowchart of the meta-learning-based self-supervised domain adaptation method according to the present invention.
Fig. 2 is a network structure diagram of the meta-learning-based self-supervised domain adaptation method according to the present invention.
Fig. 3 is a schematic diagram of basic units of a ResNet network.
Fig. 4 is a schematic view of a fully connected layer structure.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
As shown in fig. 1, a self-supervised domain adaptation method based on meta-learning includes the following steps:
Step 1, setting a trainer and a tester: as shown in fig. 2, the reconstruction process of the target domain samples x_t serves as the meta-trainer (Meta-train) in meta-learning, and the classification process of the source domain samples x_s serves as the meta-tester (Meta-test). The labeled source domain is denoted S = {X_s, Y_s}, where x_s ∈ X_s and y_s ∈ Y_s denote the source domain samples and their corresponding labels; the unlabeled target domain is denoted T = {X_t}, where x_t ∈ X_t denotes a target domain sample.
Step 2, performing an image reconstruction task by using the target domain sample and calculating reconstruction loss:
The unlabeled target domain samples x_t are input into the feature extraction network G to obtain the target domain sample features f_t; the features f_t are then input into the image reconstruction network D to reconstruct the image, producing the target domain reconstructed sample x̂_t, and the reconstruction loss L_r is calculated. The feature extraction network G adopts a ResNet-50 structure; the basic unit of ResNet is shown in fig. 3: the output of the previous layer is added through a skip connection to the output computed by the current layer, and the sum is input into the activation function as the output of the current layer. The feature extraction process of ResNet-50 yields the target domain sample features f_t = G(x_t). The image reconstruction network D adopts a decoder structure and, through a series of upsampling operations, restores the target domain sample features f_t to the original image size, i.e. x̂_t = D(f_t) = D(G(x_t)). The reconstruction loss is:

L_r(θ_G, θ_D) = (1/N_t) Σ_{j=1}^{N_t} || x̂_t^j − x_t^j ||²

where N_t is the number of target domain samples and j indexes the j-th target domain sample.
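As a numeric check of the reconstruction loss above, the following minimal sketch computes L_r as the mean squared reconstruction error over N_t samples; the two "images" and their "reconstructions" are made-up flat vectors, and x_hat stands in for D(G(x_t)).

```python
# Minimal sketch: L_r = (1/N_t) * sum_j ||x_hat_j - x_j||^2 over flattened
# target-domain samples (values are illustrative, not real image data).
def reconstruction_loss(samples, recons):
    n_t = len(samples)
    return sum(
        sum((xi - ri) ** 2 for xi, ri in zip(x, r))
        for x, r in zip(samples, recons)
    ) / n_t

x_t = [[1.0, 2.0], [3.0, 4.0]]      # two target-domain "images", flattened
x_hat = [[1.0, 1.0], [3.0, 5.0]]    # their reconstructions D(G(x_t))
print(reconstruction_loss(x_t, x_hat))  # → 1.0
```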
Step 3, updating parameters of the feature extraction network:
The reconstruction loss L_r in the trainer is used to update the parameters θ_G of the feature extraction network G, namely:

θ'_G = θ_G − α ∇_{θ_G} L_r(θ_G, θ_D)

where θ_G denotes the current parameters of the feature extraction network, θ'_G the updated parameters, α the learning rate, and θ_D the parameters of the decoder D; ∇_{θ_G} denotes the gradient with respect to θ_G, and stochastic gradient descent is adopted for the gradient update. Due to weight sharing, the parameters θ_G of the feature extraction network G in the tester are updated together with those in the trainer; that is, the direction in which the specific task in the tester, such as image classification, updates the network parameters is forced toward the direction in which the self-supervised image reconstruction task in the trainer updates them.
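The inner update can be checked numerically with a toy loss; here L(θ) = θ² stands in for L_r, so its gradient is 2θ (all values below are illustrative).

```python
# Sketch of the inner update theta' = theta - alpha * grad(L_r): with the toy
# loss L(theta) = theta**2, the gradient function is 2 * theta.
def inner_update(theta, alpha, grad_fn):
    return theta - alpha * grad_fn(theta)

theta = 1.0          # current extractor parameter (toy scalar)
alpha = 0.1          # learning rate
theta_prime = inner_update(theta, alpha, lambda t: 2.0 * t)
print(theta_prime)   # → 0.8
```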
Step 4, performing a classification task by using the source domain samples and calculating a classification loss: the labeled source domain data x_s are input into the feature extraction network G with updated parameters θ'_G to obtain the source domain data features f_s = G(x_s); the features f_s are then input into the classification network C to perform the image classification task and calculate the classification loss. The classification network C adopts a structure of multiple fully connected layers followed by a softmax layer; a schematic of the fully connected layer structure is shown in fig. 4. Each node of the classification network C is connected to all nodes of the previous layer, integrating the extracted features, and the softmax layer outputs the predicted image label ŷ_s, i.e. ŷ_s = C(f_s) = C(G(x_s)). The classification loss is the cross-entropy:

L_c(θ'_G, θ_C) = −(1/N_s) Σ_{k=1}^{N_s} y_s^k log ŷ_s^k

where N_s is the number of source domain samples and k indexes the k-th source domain sample.
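The classification loss above can be sketched as a softmax over class logits followed by cross-entropy against integer class labels; the logits and labels below are made up for illustration.

```python
import math

def softmax(logits):
    # Numerically stable softmax over one sample's class logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def classification_loss(logits_batch, labels):
    # L_c = -(1/N_s) * sum_k log p(y_k): cross-entropy of softmax outputs.
    n_s = len(labels)
    return -sum(
        math.log(softmax(z)[y]) for z, y in zip(logits_batch, labels)
    ) / n_s

logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3]]  # two source samples, 3 classes
labels = [0, 1]                              # ground-truth class indices
loss = classification_loss(logits, labels)
print(loss)
```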
Step 5, calculating a total loss function and updating parameters of all networks:
The total loss function L is calculated, and the parameters of the feature extraction network G, the reconstruction network D, and the classification network C in the trainer and the tester are updated, namely:

{θ_G, θ_D, θ_C}_{t+1} = {θ_G, θ_D, θ_C}_t − β ∇L

where β is the learning rate and {θ_G, θ_D, θ_C}_t are the parameters of the networks at the current step. The total loss function is:

L = λ L_r(θ_G, θ_D) + L_c(θ'_G, θ_C)

where θ_C denotes the parameters of the classification network, λ is a hyperparameter controlling the influence of the image reconstruction task and the image classification task on the network parameter update, L_r(θ_G, θ_D) is the image reconstruction loss computed with network parameters θ_G and θ_D, and L_c(θ'_G, θ_C) is the image classification loss computed with network parameters θ'_G and θ_C.
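As a numeric sketch of this outer step: the total loss combines the two losses with weight λ, and one gradient step with rate β updates all three parameter groups at once. The scalar parameters and gradient values below are made up for illustration.

```python
# Sketch: L = lam * L_r + L_c, followed by the joint update
# {theta}_{t+1} = {theta}_t - beta * grad(L) for all three networks.
def total_loss(l_r, l_c, lam):
    return lam * l_r + l_c

def outer_update(params, grads, beta):
    # params/grads map each network ("G", "D", "C") to a toy scalar value.
    return {k: params[k] - beta * grads[k] for k in params}

lam, beta = 0.5, 0.1
L = total_loss(1.2, 0.7, lam)                  # 0.5 * 1.2 + 0.7 = 1.3
params = {"G": 0.5, "D": -0.2, "C": 1.0}
grads = {"G": 0.4, "D": -0.1, "C": 0.2}
new_params = outer_update(params, grads, beta)
print(L, new_params)
```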