CN110378366A

CN110378366A - A kind of cross-domain image classification method based on coupling knowledge migration

Info

Publication number: CN110378366A
Application number: CN201910482559.5A
Authority: CN
Inventors: 孟敏; 兰孟城; 武继刚
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2019-06-04
Filing date: 2019-06-04
Publication date: 2019-10-25
Anticipated expiration: 2039-06-04
Also published as: CN110378366B

Abstract

The invention discloses a kind of cross-domain image classification methods based on coupling knowledge migration to eliminate the difference of source domain and the distribution of aiming field data edges and the distribution of class condition by finding source domain and the common lower-dimensional subspace of aiming field based on Largest Mean difference criterion；According to the respective adjacent map of pseudo label information architecture of the label information of source domain data and target numeric field data, structural integrity of the data from luv space to lower-dimensional subspace is kept, while can dynamically adjust the structure of adjacent map, promotes the positive transfer of knowledge in domain；Using the source domain data training nearest neighbor classifier of tape label information in lower-dimensional subspace, continuous iteration optimization is carried out to the pseudo label information of target numeric field data and completes cross-domain image classification to obtain the final label information of target numeric field data；In addition the different confidence level of the pseudo label that the method for the present invention assigns target area image by design sample weight weighted strategy, is effectively reduced the negative transfer of knowledge in domain, improves the precision of cross-domain image classification.

Description

A kind of cross-domain image classification method based on coupling knowledge migration

Technical field

The present invention relates to computer visual image sorting technique field more particularly to it is a kind of based on coupling knowledge migration across Area image classification method.

Background technique

Traditional machine learning algorithm usually requires that the data sample of a large amount of tape label, and require training sample with Test sample obeys independent and identical distribution.Direct method is by the data set migration of existing tape label to newly unknown On data set, i.e., aiming field is moved to from source domain.But the difference of distribution is certainly existed between different data collection, " is obeyed independent The hypothesis of same distribution " is often invalid.Therefore, traditional image classification method is difficult to obtain preferable classification performance.

In order to overcome the difference of sample distribution between source domain and aiming field, the generalization ability of sorting algorithm is promoted, at present master It to adapt to be commonly divided into based on feature as the means for handling the knowledge migration problem in cross-cutting image classification by field The method of transformation, the method for Case-based Reasoning and the method based on neural network migration.However current cross-cutting study remains Many needs solve the problems, such as and perfect, for example, under source domain and the biggish situation of aiming field distributional difference, based on feature turn The classification performance of the field adaptive method changed is far from reaching the initial expectation of people；The method of Case-based Reasoning needs to calculate additional Sample weights, computation complexity is high, cannot carry out quickly cross-cutting study to large data sets；Method neural network based It is required that the source domain sample of a large amount of tape labels carries out training for a long time, limited in actual application larger.

Summary of the invention

The present invention is to solve existing cross-domain image classification method to calculate the problems such as time is long, classification accuracy is insufficient, is mentioned A kind of cross-domain image classification method based on coupling knowledge migration is supplied.

To realize the above goal of the invention, and the technological means used is:

A kind of cross-domain image classification method based on coupling knowledge migration, comprising the following steps:

S1. source domain image and target area image are obtained and carries out feature extraction, respectively obtains source domain eigenmatrix as source Numeric field data obtains target domain characterization matrix as target numeric field data；

S2. the adjacent map of source domain data and the adjoining of target numeric field data are constructed respectively according to the label information of source domain data Figure, the side right by calculating adjacent map respectively obtain the weight matrix of source domain and the weight matrix of aiming field again；

S3. data distribution is aligned between carrying out domain in the common lower-dimensional subspace of the source domain and aiming field, is utilized simultaneously The weight matrix of source domain and the weight matrix of aiming field carry out the knowledge migration of data in domain；

S4. it in the lower-dimensional subspace, using the source domain data training nearest neighbor classifier of tape label information, and is used for Predict the classification information of target numeric field data；

S5. the label information for the target numeric field data predicted is weighted again；

S6. judge whether to reach maximum number of iterations, if not return step S2, if then exporting predicted aiming field figure The label information of picture.

In above scheme, by the common lower-dimensional subspace of searching source domain and target source, the difference of sample distribution is reduced, together Shi Liyong adjacent map carries out knowledge positive transfer in domain, keeps leading to from luv space to the structural integrity of common lower-dimensional subspace The difference for eliminating source domain and the distribution of aiming field data edges and the distribution of class condition is crossed, so that data distribution is aligned, thus in low-dimensional The label information of target numeric field data is predicted in subspace using the source domain data training nearest neighbor classifier of tape label information, most It is weighted again by the label information to the target numeric field data predicted afterwards and inhibits knowledge negative transfer, to realize cross-domain image Accurate classification.

Preferably, the specific steps of the step S1 are as follows:

S1.1. n is inputted_sOpen the source domain image and n of tape label_tOpen the target area image without label；Wherein n_s、n_tIt is positive whole Number；

S1.2. feature extraction is carried out to the source domain image and target area image respectively, it will be to the source domain image zooming-out Obtained feature vector obtains source domain data characteristics matrix by column arrangementWherein m is characterized The dimension of vector, n_sFor the number of feature vector；

The feature vector obtained to the aiming field image zooming-out is obtained into aiming field data characteristics matrix by column arrangementWherein m is the dimension of feature vector, n_tFor the number of feature vector；

The feature vector of the source domain image indicates the source domain data of source domain image, the feature vector of the target area image Indicate the target numeric field data of target area image.

Preferably, feature extraction described in step S1 extracts the source specifically by the neural network of pre-training respectively The feature vector of area image and target area image.In this preferred embodiment, VGG16, GoogLeNet is can be used in neural network, ResNet50 etc..

Preferably, feature extraction described in step S1 is specially and carries out respectively to the source domain image and target area image After gray processing, the gray value of the source domain image is arranged in feature vector and arranges the gray value of the target area image At feature vector.

Preferably, the S2 specific steps are as follows:

S2.1. first time iteration is judged whether it is, if otherwise directly carrying out step S2.2, if constructing arest neighbors classification Device simultaneously trains the nearest neighbor classifier using the source domain data, for predicting the pseudo label information of target numeric field data

S2.2. according to the label information of the source domain data, the adjacent map of source domain data is constructed, i.e., in source domain data Adjacent side is established between homogeneous data, and calculates the weight of adjacent side, obtains weight matrix

Wherein W^sFor the weight matrix of source domain, it is calculate by the following formula:

WhereinFor the feature vector of i-th Zhang Yuan's area image,For the corresponding label information of i-th Zhang Yuan's area image, σ is The bandwidth of Gaussian kernel, | | | | indicate two norm of vector；

According to the pseudo label information of the target numeric field data, the adjacent map of target numeric field data is constructed, in target numeric field data Homogeneous data between establish adjacent side, and calculate the weight of adjacent side, obtain weight matrixWherein W^tFor target The weight matrix in domain is simultaneously calculate by the following formula:

WhereinFor the feature vector of i-th target area image,For the corresponding pseudo label of i-th target area image, β_i ForWeight, initial value is 1.

In this preferred embodiment, by respectively in source domain and aiming field construct adjacent map method, data can be kept Structural integrity from luv space to the invariant subspace constructs adjacent map using the semantic information of source domain data tag information, The structure of adjacent map can be dynamically adjusted, to excavate the more essential partial structurtes of data, promote in domain knowledge just Migration.

Preferably, the S3 specific steps are as follows:

S3.1. it in lower-dimensional subspace common in the source domain and aiming field, is carried out using Largest Mean difference criterion Data distribution is aligned between domain including edge distribution and conditional probability distribution, specifically by the following first object letter of minimum Number:

Wherein C indicates the class number of label information, x^s,cAnd x^t,cRespectively indicate the classification category of source domain data tag information Belong to the set of classification c in the set of classification c and the classification of aiming field data tag information,WithRespectively indicate source domain number Belong to the number of the data of the classification c of label information according to the number and target numeric field data of the data for the classification c for belonging to label information,For projection matrix, wherein d < < m, projects to d n-dimensional subspace n for tieing up data from m；

S3.2. weight matrix W is utilized^sAnd W^tThe knowledge migration for carrying out data in domain, specifically by below minimizing second Objective function:

S3.3. the first object function and the second objective function are combined and a regularization is addedObtain with Lower third objective function:

s.t.P^TXHX^TP=I

Wherein λ and α is balance factor, λ > 0 and α > 0, X=[X_s,X_t], change matrix centered on H:I is unit matrix,Complete 1 column vector for being 1 for value；

Projection matrix P is solved by generalized eigen decomposition, obtains source domain data and target numeric field data in low-dimensional Expression in space is respectively as follows:With

In this preferred embodiment, by found based on Largest Mean difference criterion source domain and aiming field it is common low-dimensional son The difference of source domain and the distribution of aiming field data edges and the distribution of class condition is eliminated in space.

Preferably, the specific steps of the step S4 are as follows:

According to expression of the source domain data in lower-dimensional subspace, the source of the tape label information in lower-dimensional subspace is obtained Numeric field dataFor training the nearest neighbor classifier, and predict the target numeric field data in lower dimensional space's Classification information

In this preferred embodiment, classified in lower-dimensional subspace using the source domain data training arest neighbors of tape label information Device carries out continuous iteration optimization to the pseudo label information of target numeric field data, to obtain the final label information of target numeric field data.

Preferably, the specific steps of the step S5 include:

S5.1. to the target numeric field data in lower dimensional spaceA new adjacent map is constructed, i.e., will Each aiming field data point in lower dimensional spaceThe k aiming field number nearest with its Euclidean distance respectively Adjacent side is established at strong point；

S5.2. based on adjacent map described in step S5.1, for i-th of aiming field data point on authorized adjacent mapIf withK connected aiming field data point all withA classification is belonged to, then aiming field data pointLabel Confidence level β_i=1, otherwise β_i=0, wherein k is positive integer.

In this preferred embodiment, based on sample weight weighted strategy, assign aiming field sample different confidence levels, thus effectively Ground inhibits the negative transfer of knowledge in domain.

Compared with prior art, the beneficial effect of technical solution of the present invention is:

The method of the present invention by carried out using Largest Mean difference criterion include edge distribution and conditional probability distribution domain Between data distribution be aligned, find source domain and the common lower-dimensional subspace of aiming field, eliminate source domain and aiming field data edges are distributed The difference being distributed with the distribution of class condition, so that data distribution is aligned；

According to the respective adjacent map of pseudo label information architecture of the label information of source domain data and target numeric field data, number is kept According to the structural integrity from luv space to lower-dimensional subspace, while the structure of adjacent map can be dynamically adjusted, promote to know in domain The positive transfer of knowledge；Using the source domain data training nearest neighbor classifier of tape label information in lower-dimensional subspace, to aiming field number According to pseudo label information carry out continuous iteration optimization and complete cross-domain figure to obtain the final label information of target numeric field data As classification；

In addition the different confidence of the pseudo label that the method for the present invention assigns target area image by design sample weight weighted strategy Degree, is effectively reduced the negative transfer of knowledge in domain, improves the precision of cross-domain image classification.

The present invention is based on the method for iteration, data distribution alignment can be effectively combined, promote in domain knowledge positive transfer and Inhibit knowledge negative transfer in domain, to achieve the purpose that couple knowledge migration, greatly improves the performance of cross-domain image classification. It solves existing cross-domain image classification method and calculates the problems such as time is long, classification accuracy is insufficient.

Detailed description of the invention

Fig. 1 is the flow chart of the method for the present invention.

Fig. 2 is the effect picture between carrying out domain in the present embodiment 2 after data distribution alignment.

Fig. 3 is the label information category distribution effect picture in the present embodiment 2 to aiming field in lower dimensional space.

Specific embodiment

The attached figures are only used for illustrative purposes and cannot be understood as limitating the patent；

In order to better illustrate this embodiment, the certain components of attached drawing have omission, zoom in or out, and do not represent actual product Size；

To those skilled in the art, it is to be understood that certain known features and its explanation, which may be omitted, in attached drawing 's.

The following further describes the technical solution of the present invention with reference to the accompanying drawings and examples.

Embodiment 1

A kind of cross-domain image classification method based on coupling knowledge migration, as shown in Figure 1, comprising the following steps:

S1. source domain image and target area image are obtained and carries out feature extraction, respectively obtains source domain eigenmatrix as source Numeric field data obtains target domain characterization matrix as target numeric field data:

S1.2. feature extraction is carried out to the source domain image and target area image respectively, by the VGG16 of pre-training, Any one neural network of GoogLeNe, ResNet50 extracts the feature vector of the source domain image and target area image respectively, will be right The feature vector that the source domain image zooming-out obtains obtains source domain data characteristics matrix by column arrangement Wherein m is the dimension of feature vector, n_sFor the number of feature vector；

S2. the adjacent map of source domain data and the adjoining of target numeric field data are constructed respectively according to the label information of source domain data Figure, the side right by calculating adjacent map respectively obtain the weight matrix of source domain and the weight matrix of aiming field again:

According to the pseudo label information of the target numeric field data, the adjacent map of target numeric field data is constructed, in target numeric field data Homogeneous data between establish adjacent side, and calculate the weight of adjacent side, obtain weight matrix

Wherein W^tFor aiming field weight matrix and be calculate by the following formula:

S3. data distribution is aligned between carrying out domain in the common lower-dimensional subspace of the source domain and aiming field, is utilized simultaneously The weight matrix of source domain and the weight matrix of aiming field carry out the knowledge migration of data in domain:

s.t.P^TXHX^TP=I

S4. it in the lower-dimensional subspace, using the source domain data training nearest neighbor classifier of tape label information, and is used for It predicts the classification information of target numeric field data: according to expression of the source domain data in lower-dimensional subspace, obtaining empty in low-dimensional Between middle tape label information source domain dataFor training the nearest neighbor classifier, and predict in lower dimensional space Target numeric field dataClassification information

S5. the label information for the target numeric field data predicted is weighted again:

S6. judge whether to reach maximum number of iterations, if not return step S2, if then exporting predicted aiming field figure The label information of picture

Embodiment 2

In the present embodiment 2, the facial image composition of randomly selected 10 people from CMU PIE data set is selected 980 face images, every image are cut into 32X32 resolution ratio.Further, using C05 subset totally 490 images as Source domain data set, using the C27 subset aiming field data set that 490 images are classified as needs in total.

S1. the target area image in the source domain image and aiming field data set in source domain data set is obtained, to the source domain After image and target area image carry out gray processing respectively, the gray value of the source domain image is arranged in feature vector and by institute The gray value for stating target area image is arranged in feature vector, in the present embodiment 2, the gray value of every image be arranged in feature to Amount, i.e. every image is expressed as the column vector of one 1024 dimension, corresponding, and it is special to respectively obtain source domain data by column arrangement Levy matrixWith target domain characterization matrix

S2.1. first time iteration is judged whether it is, if otherwise directly carrying out step S2.2, if constructing arest neighbors classification Device and the source domain data for utilizing tape label informationTraining nearest neighbor classifier, for predicting target numeric field dataPseudo- classification information

S2.2. according to the label information y of source domain data^sThe adjacent map for constructing source domain data, i.e., to same in source domain data Adjacent side is established between class data, and calculates the weight of adjacent side, obtains weight matrix

WhereinFor the feature vector of i-th Zhang Yuan's area image,For the corresponding label information of i-th Zhang Yuan's area image, σ is The bandwidth of Gaussian kernel is set as 1 in the present embodiment 1, | | | | indicate two norm of vector；

According to the pseudo label information of the target numeric field dataThe adjacent map for constructing target numeric field data, to target numeric field data In homogeneous data between establish adjacent side, and calculate the weight of adjacent side, obtain weight matrixWherein W^tFor mesh It marks the weight matrix in domain and is calculate by the following formula:

By minimizing following objective function:

s.t.P^TXHX^TP=I

Wherein λ=0.1 and α=0.01 are balance factor.X=[X_s,X_t], change matrix centered on H: I₉₈₀For unit matrix, 1₉₈₀For complete 1 column vector.Above formula, which is solved, by generalized eigen decomposition obtains projection matrix The dimension that lower-dimensional subspace is arranged in the present embodiment 2 is d=100.Source domain data and target numeric field data are finally obtained in low-dimensional Expression in space:With

S4. the expression according to the source domain data in lower-dimensional subspace, obtains the tape label information in lower-dimensional subspace Source domain dataFor training the nearest neighbor classifier, and predict the target numeric field data in lower dimensional spaceClassification information

S5.1. to the target numeric field data in lower dimensional spaceA new adjacent map is constructed, i.e., will Each aiming field data point in lower dimensional spaceThe k=5 target nearest with its Euclidean distance respectively Numeric field data point establishes adjacent side；

S5.2. based on adjacent map described in step S5.1, for i-th of aiming field data point on authorized adjacent mapIf with5 connected aiming field data points all withA classification is belonged to, then aiming field data pointLabel Confidence level β_i=1, otherwise β_i=0.

S6. judge whether to reach maximum number of iterations 10, if not return step S2, if then exporting predicted aiming field The label information of imageTo obtain classification results.

In the present embodiment 2, experiment porch is the MATLAB R2017a software in WIN10 system, the model of CPU Intel i7-6700K@4.00GHz.The experimental result schematic diagram of the present embodiment 2 is as shown in Figure 2,3, wherein in Fig. 2, it is round What is indicated is source domain data, and what five-pointed star indicated is target numeric field data, and the left subgraph of Fig. 2 is the original of source domain data and target numeric field data Begin to be distributed, it can be clearly seen that the distribution of source domain data and target numeric field data has very big otherness；Fig. 2 right subgraph is this reality It applies 2 methods and is carrying out the data point between domain after data distribution alignment in lower-dimensional subspace to source domain and data aiming field data Cloth, it can be clearly seen that the otherness of source domain data and aiming field data distribution greatly reduces.It is of different shapes in Fig. 3 Mark indicates different label information classifications, a total of ten label information classifications.Wherein the left subgraph of Fig. 3 is original aiming field The distribution of each label information of data, category distribution are more chaotic；Fig. 3 right subgraph is target numeric field data in 2 side of the present embodiment Distribution in the obtained lower-dimensional subspace of method, it can be clearly seen that the category distribution of target numeric field data is relatively clear, Ge Geyou The data of same label information are respectively at cluster.This illustrates the high efficiency of the method for the present invention.

The terms describing the positional relationship in the drawings are only for illustration, should not be understood as the limitation to this patent；

Obviously, the above embodiment of the present invention be only to clearly illustrate example of the present invention, and not be pair The restriction of embodiments of the present invention.For those of ordinary skill in the art, may be used also on the basis of the above description To make other variations or changes in different ways.There is no necessity and possibility to exhaust all the enbodiments.It is all this Made any modifications, equivalent replacements, and improvements etc., should be included in the claims in the present invention within the spirit and principle of invention Protection scope within.

Claims

1. a kind of cross-domain image classification method based on coupling knowledge migration, which comprises the following steps:

S1. source domain image and target area image are obtained and carries out feature extraction, respectively obtains source domain eigenmatrix as source domain number According to obtaining target domain characterization matrix as target numeric field data；

S2. the adjacent map of source domain data and the adjacent map of target numeric field data are constructed respectively according to the label information of source domain data, is led to The side right for crossing calculating adjacent map respectively obtains the weight matrix of source domain and the weight matrix of aiming field again；

S3. data distribution is aligned between carrying out domain in the common lower-dimensional subspace of the source domain and aiming field, while utilizing source domain Weight matrix and aiming field weight matrix carry out domain in data knowledge migration；

S4. in the lower-dimensional subspace, using the source domain data training nearest neighbor classifier of tape label information, and for predicting The label information of target numeric field data；

S6. judge whether to reach maximum number of iterations, if not return step S2, if then exporting predicted target area image Label information, to obtain classification results.

2. the cross-domain image classification method according to claim 1 based on coupling knowledge migration, which is characterized in that the step The specific steps of rapid S1 are as follows:

S1.1. n is inputted_sOpen the source domain image and n of tape label_tOpen the target area image without label；Wherein n_s、n_tFor positive integer；

S1.2. feature extraction is carried out to the source domain image and target area image respectively, the source domain image zooming-out will be obtained Feature vector by column arrangement obtain source domain data characteristics matrixWherein m is feature vector Dimension, n_sFor the number of feature vector；

The feature vector of the source domain image indicates that the source domain data of source domain image, the feature vector of the target area image indicate The target numeric field data of target area image.

3. the cross-domain image classification method according to claim 2 based on coupling knowledge migration, which is characterized in that step S1 Described in feature extraction extract the source domain image and target area image respectively specifically by the neural network of pre-training Feature vector.

4. the cross-domain image classification method according to claim 2 based on coupling knowledge migration, which is characterized in that step S1 Described in feature extraction be specially gray processing is carried out respectively to the source domain image and target area image after, by the source domain figure The gray value of picture is arranged in feature vector and the gray value of the target area image is arranged in feature vector.

5. the cross-domain image classification method according to claim 3 or 4 based on coupling knowledge migration, which is characterized in that institute The S2 specific steps stated are as follows:

S2.1. first time iteration is judged whether it is, if step S2.2 is otherwise directly carried out, if constructing nearest neighbor classifier simultaneously Using the source domain data training nearest neighbor classifier, for predicting the pseudo label information of target numeric field data

S2.2. according to the label information of the source domain data, the adjacent map of source domain data is constructed, i.e., to similar in source domain data Adjacent side is established between data, and calculates the weight of adjacent side, obtains weight matrix

WhereinFor the feature vector of i-th Zhang Yuan's area image,For the corresponding label information of i-th Zhang Yuan's area image, σ is Gaussian kernel Bandwidth, | | | | indicate two norm of vector；

According to the pseudo label information of the target numeric field data, the adjacent map of target numeric field data is constructed, to same in target numeric field data Adjacent side is established between class data, and calculates the weight of adjacent side, obtains weight matrix

WhereinFor the feature vector of i-th target area image,For the corresponding pseudo label of i-th target area image, β_iFor Weight, initial value is 1.

6. the cross-domain image classification method according to claim 5 based on coupling knowledge migration, which is characterized in that described S3 specific steps are as follows:

S3.1. in lower-dimensional subspace common in the source domain and aiming field, included using Largest Mean difference criterion Data distribution is aligned between edge distribution and the domain of conditional probability distribution, specifically by the following first object function of minimum:

Wherein C indicates the class number of label information, x^s,cAnd x^t,cThe classification for respectively indicating source domain data tag information belongs to class The set of other c and the classification of aiming field data tag information belong to the set of classification c,WithSource domain data are respectively indicated to belong to The number and target numeric field data of the data of the classification c of label information belong to the number of the data of the classification c of label information,For projection matrix, wherein d < < m, projects to d n-dimensional subspace n for tieing up data from m；

S3.2. weight matrix W is utilized^sAnd W^tThe knowledge migration for carrying out data in domain, specifically by the second target below minimizing Function:

S3.3. the first object function and the second objective function are combined and a regularization is addedObtain following Three objective functions:

s.t.P^TXHX^TP=I

Wherein λ and α is balance factor, λ > 0 and α > 0, X=[X_s,X_t], change matrix centered on H: I is unit matrix,Complete 1 column vector for being 1 for value；

Projection matrix P is solved by generalized eigen decomposition, obtains source domain data and target numeric field data in the lower-dimensional subspace In expression be respectively as follows:With

7. the cross-domain image classification method according to claim 6 based on coupling knowledge migration, which is characterized in that the step The specific steps of rapid S4 are as follows:

According to expression of the source domain data in lower-dimensional subspace, the source domain number of the tape label information in lower-dimensional subspace is obtained According toFor training the nearest neighbor classifier, and predict the target numeric field data in lower dimensional spaceClassification Information

8. the cross-domain image classification method according to claim 7 based on coupling knowledge migration, which is characterized in that the step Suddenly the specific steps of S5 include:

S5.1. to the target numeric field data in lower dimensional spaceConstruct a new adjacent map, i.e., it will be each Aiming field data point in lower dimensional spaceThe k aiming field data point nearest with its Euclidean distance respectively Establish adjacent side；

S5.2. based on adjacent map described in step S5.1, for i-th of aiming field data point on authorized adjacent mapIf WithK connected aiming field data point all withA classification is belonged to, then aiming field data pointLabelConfidence level β_i=1, otherwise β_i=0, wherein k is positive integer.