CN110533193A

CN110533193A - Feature and example combine transfer learning method under semi-supervised scene

Info

Publication number: CN110533193A
Application number: CN201910770868.2A
Authority: CN
Inventors: 黄浩然; 文江辉; 邓兵; 肖新平; 饶从军
Original assignee: Wuhan University of Technology WUT
Current assignee: Wuhan University of Technology WUT
Priority date: 2019-08-20
Filing date: 2019-08-20
Publication date: 2019-12-03

Abstract

The invention discloses features under a kind of semi-supervised scene and example to combine transfer learning method, when certain domain classification model trainings aiming at the problem that tape label data deficiencies, and introduce other field data and the field without label data supplemental training, while considering the difference of data distribution between field, for a small amount of tape label data existing in aiming field, there is situation largely without label data again, the present invention proposes mixed balanced distribution adaptive method and self study instance migration method；And construction feature and example combine transfer learning method FSJT based on this.

Description

Feature and example combine transfer learning method under semi-supervised scene

Technical field

The present invention relates to transfer learning technical fields in machine learning, in particular to feature and reality under a kind of semi-supervised scene Example joint transfer learning method.

Background technique

It is formal in the bulletin of US Department of Defense Advanced Research Projects Agency (DARPA) Information Processing Technology Office in 2005 Transfer learning one definition is provided, i.e., the knowledge and skills acquired in other tasks are applied to the ability of new task.With more Business study is compared, transfer learning more concerned be goal task, rather than learn all originating task and goal task simultaneously.Source Role of the task and goal task in transfer learning is no longer symmetrical.The task space for needing to study in transfer learning claims For aiming field (Target domain), and task space associated before is known as source domain (Source domain).Existing rank The development of section transfer learning mainly has following direction, and the transfer learning of Case-based Reasoning, the transfer learning based on feature are based on The migration etc. of parameter.

The migration of Case-based Reasoning is more intuitive, main thought are as follows: although source domain data cannot be utilized directly, data certain A little parts still can be used together with the existing training data in aiming field.How the source domain data that can be used are screened As the emphasis of instance migration study, such as classical instance migration method TrAdaBoost algorithm.In addition from the ratio of distribution Angle carries out correlative study, proposes transmitting transfer learning method (Transitive Transfer Learning, TTL) etc..

Migration based on feature, which refers to, migrates the data field by way of eigentransformation, to reduce source domain With the gap of aiming field.Component analyzing method (Transfer Component Analysis, TCA) is migrated as classical spy Shifting method of relocating residents from locations to be used for construction of new buildings or factories is proposed that this method utilizes Largest Mean difference (Maximum Mean Discrepancy, MMD) by Pan et al. As the measurement criterion of probability distribution between field, target is to minimize distributional difference.

Currently, there is scholar to make exploration in the moving method for combining example and feature.Long et al. proposition is minimizing While distribution distance, migration joint matching (Tranfer Joint Matching, TJM) method of example selection is added, it will Example and feature transfer learning method have carried out organic combination.But current transfer learning method mainly utilizes in aiming field A small amount of tape label data, or directly utilize no label data.And traditional semi-supervised learning utilizes in a field simultaneously Therefore two class data if considering semi-supervised scene in transfer learning, can learn to more target domains are existing to know Know.

Summary of the invention

The purpose of the present invention is to solve above-mentioned background technique there are the problem of, and propose a kind of semi-supervised scene under Feature and example joint transfer learning method (Feature and Sample Jointed Transfer, abbreviation FSJT), for Aiming field has a small amount of tape label data and largely without the situation of label data, and the situation that source domain and aiming field differ greatly, with Phase promotes the accuracy rate classified on aiming field.

To achieve the above object, feature and example combine transfer learning method under the semi-supervised scene designed by the present invention, It is characterized in that, the described method comprises the following steps:

Step 1: data in Definition Model: giving a field, it is known that each data generic uses " 0 " in the field Or " 1 " indicates, is denoted as source domain D_s, the n sample eigenmatrix and class label vector form for including be expressed as It is abbreviated as { x_s,y_s}；Another field is given, aiming field D is denoted as_t, wherein m sample composition tape label data set is denoted asIt is denoted as without label data collectionTwo class data sets are denoted as altogetherInclude m+ K training sample, and assumeMiddle sample size is not enough to one reliable classifier of training；

Assuming that feature space X_s=X_t, i.e., the feature type in two fields is identical as quantity, classification space Y_s=Y_t, but side Fate cloth P_s(x_s)≠P_t(x_t), condition is distributed P_s(y_s|x_s)≠P_t(y_t|x_t), target is to utilize source domain D_sMiddle data set { x_s,y_s} With aiming field D_tMiddle data setUnlabeled data in learning objective domainClassificationWherein r= 1 ..., l, n, m, k, l are the natural number greater than 1；

Step 2: balance parameters μ is utilized, objective function is constructed and carries out abbreviation and solve to obtain:

Wherein objective function minimum value is sought in min expression, and s.t. indicates bound for objective function.Matrix is sought in tr () expression Mark, the input matrix that X is made of source domain and aiming field data set, and meet X=[x_s,x_t], wherein Mapping function ψ:By kernel function K=ψ (X)^Tψ (X) is provided, and λ is regularization coefficient,Indicate the flat of F norm Side, balance factor μ, A indicate that transition matrix, H=I- (1/n) 1 are center matrix, I ∈ R^{(n+m+k+l)×(n+m+k+l)}For unit square Battle array, M₀, M_cFor MMD matrix, make is as follows:

Wherein { 0,1 } c ∈,WithIt is illustrated respectively in the number for belonging to the sample of c class in source domain and aiming field, n_c Indicate source domain data set { x_s,y_sIn belong to c class sample number；m_cIndicate aiming field tape label data setIn Belong to the number of the sample of c class；k_cIndicate aiming field without label data collectionIn belong to c class sample number；l_cTable Show aiming field test data set { x_testIn belong to c class sample number；

Step 3: being solved using method of Lagrange multipliers, set Lagrangian as Φ=(φ₁,φ₂,...,φ_d), d For the minimal eigenvalue of A, the local derviation that transition matrix A is sought after Lagrangian is derived:

Step 4: calculating transformed matrix A using Matlab^Tψ(X)；

Step 5: be directed to transformed source domain and target numeric field data, carry out instance migration study, with filter out in source domain with The sample that aiming field differs greatly, specifically:

5.1: data preparation, transformed source domain training dataset { z_s,y_s, aiming field tape label training datasetAiming field not tape label training datasetWith the test data set { x of aiming field_test}；

5.2: initialization weight vectorsWherein 1 to n weight is 1/n；The weight of n+1 to n+m is 1/m, settingN is the number of iterations；

5.3: iterative calculation updates sample weights vector w^v+1, wherein source domain D_sOn weight be updated to Wherein i=1 ..., n, aiming field D_tOn weight be updated toWherein i=n+1 ..., n+m；

Confidence level is calculated using such as following formulaScreen the δ that confidence level is greater than γ_uA sample, wherein γ indicates confidence Threshold value is spent,It is addedData set, wherein u=1 ..., U, U indicate the number of iterations, update m=m+ δ_u, and fromMiddle these samples of deletion；

5.5: step 5.2~5.4 are repeated, until u=U；

5.6: it is each primary using the data set repetition step 5.2 and 5.3 that update completion, calculate { z_testPrediction label h_f (z_test)。

Preferably, the step 4 includes following sub-step:

4.1: data preparation, source domain training dataset { x_s,y_s, aiming field training dataset dataWith Aiming field test data set { x_test}；

4.2: data set merges, by source domain eigenmatrix x_sWith target domain characterization matrixIt is combined into square Battle array X=[x_s,x_t]；

4.3: utilizing existing tag setCalculate initial m_c, with source domain data { x_s,y_sOne classifier of training, in advance It surveys in aiming field without label data collectionWith test data set { x_testInitial pseudo label collectionWithAnd it calculates Initial k_cAnd l_c, calculate initial M₀And M_c；

4.4: iteration T times, T are the natural number greater than 1, update M_cMatrix, until iteration terminates；

4.5: using the formula calculating matrix A of step 3, obtaining transformed matrix { A^Tψ(x_s),y_s, training is same on it The classifier of one type, prediction aiming field have label data collectionPseudo label collectionAccording to On accuracy rate repeat step 3, using PSO optimization algorithm adjusting parameter μ, d, and obtain optimal parameter combination；

4.6 repeat step 3 using parameter after updating, and calculate A^Tψ (X) matrix, and be decomposed intoShape Formula.

Preferably, the calculating step of each iteration includes: in the step 5.3

(1) it calculatesThe component of weight is set to add up to 1, v=1 ..., N；

(2) sample weights ω is utilized^vTraining base classifier h_v, predict in aiming field without label training datasetMark Label, are denoted asAnd calculate the error ε in aiming field tape label data_v:

(3) β is set_v=ε_v/(1-ε_v), and update sample weights vector w^v+1, source domain D_sOn weight be updated toWherein i=1 ..., n, aiming field D_tOn weight be updated toWherein i=n +1,...,n+m。

Preferably, in the step 4.4 each iteration the following steps are included:

(1) setup parameter λ, kernel function K are fixed value, and the initial value of setup parameter μ, d utilize function eigs solution matrix A；

(2) in { A^Tψ(x_s),y_sOn the same classifier of training, prediction{ A^T(x_test) tally setWithAnd update k_cAnd l_c, calculateUpper classification accuracy η；

(3) M is updated_cMatrix, until iteration terminates.

Preferably, it is calculated in the step 5.3The prediction label of middle dataFormula are as follows:

Preferably, confidence level is calculated in the step 5.3Formula are as follows:

The present invention introduces other field when certain domain classification model trainings aiming at the problem that tape label data deficiencies Data and the field without label data supplemental training, while considering the difference of data distribution between field, propose a kind of half Supervise feature and example joint transfer learning model (Feature and Sample Jointed Transfer, abbreviation under scene FSJT).In terms of novelty, for a small amount of tape label data existing in aiming field, and there is situation largely without label data, mention The balanced distribution adaptive method and self study instance migration method mixed out；And construction feature and example are combined and are moved based on this Move learning method FSJT.

Compared with the existing technology, of the invention to have the advantages that

1, the present invention is directed to the feature space of source domain and aiming field data set, carries out migration feature transformation, to reduce two necks The difference in characteristic of field space, the situation to differ greatly suitable for source domain and aiming field；

2, the present invention considers in semi-supervised scene the marginal probability distribution of data and conditional probability distribution and prison in aiming field The difference in scene is superintended and directed, in order to minimize the statistical distribution difference of data between field, is translated into and solves objective function most The problem of small value；Using balance parameters by distance and conditional distribution function between the edge feature distribution function of two FIELD Datas Between distance introduce objective function simultaneously；

3, present invention building method of the learning objective domain without label data information, will introduce instance migration without label data Algorithm is practised, to be suitable for semi-supervised learning scene

4, the present invention expands instance migration method using self study thought, and model is made to be suitable for semi-supervised learning field Scape is conducive to the example weight of target classification task by improving in training, reduces the example for being unfavorable for target classification task Weight removes the sample to differ greatly in source domain.

Detailed description of the invention

Fig. 1 is the flow chart that feature and example combine transfer learning method under the semi-supervised scene of the present invention.

Fig. 2 to realize the present invention under semi-supervised scene feature and example joint transfer learning method computer model structure Schematic diagram.

Specific embodiment

Present invention will now be described in further detail with reference to the embodiments and the accompanying drawings, but embodiments of the present invention are unlimited In this.

Basic procedure of the invention is as shown in Figure 1, feature and reality under a kind of semi-supervised scene provided in an embodiment of the present invention Example joint transfer learning method, comprising the following steps:

Step 1: data are as follows in Definition Model:

Give a field, it is known that each data generic is indicated with " 0 " or " 1 " in the field, is denoted as source domain D_s。 The n sample eigenmatrix and class label vector form for including are expressed asIt is abbreviated as { x_s,y_s}.It is given another One field, is denoted as aiming field D_t.Wherein m sample composition tape label data set is denoted asWithout label data collection It is denoted asWherein q=1 ..., k.Two class data sets are denoted as altogetherComprising m+k training sample, and assumeMiddle sample size is not enough to one reliable classifier of training.

Assuming that feature space X_s=X_t, i.e., the feature type in two fields is identical as quantity, classification space Y_s=Y_t, but side Fate cloth P_s(x_s)≠P_t(x_t), condition is distributed P_s(y_s|x_s)≠P_t(y_t|x_t).Institute is consideration is that utilize source domain D_sMiddle data Collect { x_s,y_sAnd aiming field D_tMiddle data setUnlabeled data in learning objective domainClassificationIts Middle r=1 ..., l.

Step 2: utilizing balance parameters μ, using balance parameters μ, construct objective function and carry out abbreviation solution.

2.1: utilizing balance parameters μ, while considering distance and condition between the edge feature distribution function of two FIELD Datas Distance between distribution function.It converts the difference for making aiming field and source domain minimum to and solves objective function minimum problems, structure The objective function built is as follows:

Wherein, as a=1 ..., m,WithIt respectively indicatesWithWork as a=m+1 ..., m When+k,It indicatesPseudo label,WithIt respectively indicatesWith It indicatesPseudo- mark Label.As a=m+k+1 ..., m+k+l,It indicatesPseudo label,WithIt respectively indicatesWith It indicatesPseudo label.Expression condition is distributed P (y_t|x_t) estimated value.

2.2: using MMD distance metric by objective function abbreviation in step 2 be numerical expression form:

Wherein, H indicates reproducing kernel Hilbert space (reproducing kernel Hilbert space abbreviation RKHS)。Indicate square of the norm (abbreviation H norm) on reproducing kernel Hilbert space.C ∈ { 0,1 },WithPoint The number of the sample of c class Biao Shi not be belonged in source domain and aiming field.n_cIndicate source domain data set { x_s,y_sIn belong to the sample of c class This number；m_cIndicate aiming field tape label data setIn belong to c class sample number；k_cIndicate aiming field not Tape label data setIn belong to c class sample number；l_cIndicate aiming field test data set { x_testIn belong to c class The number of sample.

2.3: utilize matrix skill and regularization by its abbreviation:

Wherein objective function minimum value is sought in min expression, and s.t. indicates bound for objective function.Matrix is sought in tr () expression Mark,.The input matrix that X is made of source domain and aiming field data set, and meet X=[x_s,x_t], wherein Mapping function ψ:By kernel function K=ψ (X)^Tψ (X) is provided.λ is regularization coefficient,Indicate the flat of F norm Side, balance factor μ.Constraint condition ensures the data A after conversion^TThe built-in attribute of ψ (X) reservation initial data.Wherein A is indicated Transition matrix, H=I- (1/n) 1 are center matrix, I ∈ R^{(n+m+k+l)×(n+m+k+l)}For unit matrix.M₀, M_cFor MMD matrix, construction Mode is as follows:

Step 3: being solved using method of Lagrange multipliers, set Lagrangian as Φ=(φ₁,φ₂,...,φ_d), The local derviation of A is asked to obtain following result after deriving Lagrangian:

The formula is generalized eigenvalue problem by solving the available matrix A of d minimal eigenvalue of A, and then acquires change Data A after changing^Tψ(X)。

Step 4: calculating transformed matrix A using Matlab^Tψ (X) calculates A using parameter after update^Tψ (X) matrix, and It is decomposed intoForm.

Specifically:

4.1: data preparation, source domain training dataset { x_s,y_s, aiming field training dataset dataWith Aiming field test data set { x_test}。

4.2: data set merges, by source domain eigenmatrix x_sWith target domain characterization matrixIt is combined into square Battle array X=[x_s,x_t]。

4.3: utilizing existing tag setCalculate initial m_c, with source domain data { x_s,y_sOne classifier of training, in advance It surveys in aiming field without label data collectionWith test data set { x_testInitial pseudo label collectionWithAnd it calculates Initial k_cAnd l_c, utilize M in step 4₀And M_cExpression formula calculates initial M₀And M_c。

4.4: iteration T times, each operation are as follows:

1) setup parameter λ, kernel function K are fixed value, and the initial value of setup parameter μ, d utilize function eigs solution matrix A.

2) is in { A^Tψ(x_s),y_sOn the same classifier of training, prediction{ A^T(x_test) tally set WithAnd update k_cAnd l_c, calculateUpper classification accuracy η.

3) updates M_cMatrix, until iteration terminates.

4.5: using the calculated matrix A of the formula of step 3, transformed matrix { A can be obtained^Tψ(x_s),y_s, on it The same type of classifier of training, prediction aiming field have label data collectionPseudo label collectionAccording toOn accuracy rate repeat step (4), using PSO optimization algorithm adjusting parameter μ, d, and obtain optimal parameter group It closes.

4.6: repeating step (4) using parameter after updating, calculate A^Tψ (X) matrix, and be decomposed into's Form.

5.1: data preparation, transformed source domain training dataset { z_s,y_s, aiming field tape label training datasetAiming field not tape label training datasetWith the test data set { x of aiming field_test}。

5.2: initialization weight vectorsWherein 1 to n weight is 1/n；The weight of n+1 to n+m is 1/m.SettingWherein N is the number of iterations.

5.3: iterative calculation updates sample weights vector w^v+1, wherein source domain D_sOn weight be updated to Wherein i=1 ..., n, aiming field D_tOn weight be updated toWherein i=n+1 ..., n+m.

Need to calculate following several steps (v=1 ..., N) for each iteration:

1) is calculatedThe component of weight is set to add up to 1.

2) utilizes sample weights ω^vTraining base classifier h_v, predict in aiming field without label training datasetMark Label, are denoted asAnd the error ε in aiming field tape label data is calculated using such as following formula_v。

3) β is arranged in_v=ε_v/(1-ε_v), and update sample weights vector w^v+1.Source domain D_sOn weight be updated toWherein i=1 ..., n.Aiming field D_tOn weight be updated toWherein i=n+ 1,...,n+m。

5.4: being calculated using such as following formulaThe prediction label of middle data

Confidence level is calculated using such as following formulaScreen the δ that confidence level is greater than γ_uA sample, wherein γ indicates confidence Spend threshold value.It is addedData set, wherein u=1 ..., U, U indicate the number of iterations.Update m=m+ δ_u, and fromMiddle these samples of deletion.

5.5: step 5.2~5.4 are repeated, until u=U；

By above-mentioned process, can to after eigentransformation source domain and aiming field training data carry out self study example and move It moves, and predicts the label of aiming field data to be tested.

Instance analysis:

1. data set prepares:

Integrated using transfer learning general data as OFFICE, it is made of the data in three fields, be followed successively by Amazon, Webcam and DSLR.There are 985,295,295 pictures in each field respectively.The discrete feature for turning to 800 dimensions of each picture is constituted Vector.These three fields are abbreviated as A, W, D respectively.

2. data prediction:

The selection of source domain and aiming field sample.Altogether there are four types of to experimental program, being followed successively by A is source domain sample, and D is aiming field Sample；A is source domain sample, and W is aiming field sample；D is source domain sample, and W is aiming field sample；W is source domain sample, and D is target Domain sample.In each experimental program, aiming field training sample is 90% data randomly selected, and 10% data, which are used as, to be tested Card.

3. the setting of parameter:

Basic classification device is set as k nearest neighbor classifier, and parameter k is set as 8.The number of iterations T is set as 10, and parameter lambda is excellent In change problemCoefficient, i.e. the regular parameter of F norm is set as λ=0.01, Selection of kernel function gaussian kernel function here.μ It is that tuning is needed in algorithm with d, μ balances M in solution procedure₀、M_cThe importance of matrix, further μ is used to balance side The shared weight of fate cloth and condition distribution, and obtained matrix A is made of d dimensional feature vector.Initial μ and d are set first Parameter adjusts the two parameters by the accuracy rate on aiming field tape label training data.After setting parameter, input is each The classification accuracy in test data can be obtained in the data of scheme, operation program.

4. the precision of different models compares:

An existing transfer learning method is selected, distribution adaptive method (BDA) algorithm of balance is compared, but due to BDA is applicable in unsupervised scene, when training BDA aiming field only choose that MBDA uses without label training data.

While in order to verify the performance of proposed method difference between process field, with the sorting algorithm in three kinds of machine learning It compares.Here support vector machines (SVM), three kinds of logistic regression (LR), decision tree (DT) algorithms are selected.Every kind of algorithm uses Data it is consistent with FSJT model, utilize the training of data to practice, the prediction of another data set.

Since there are three fields for OFFICE data set, several groups of different fields are provided respectively as trained and test data Model as a result, as shown in table 1 when collection.

The classifying quality comparing result of each model of table 1

The result from table, can it is found that migration effect of the FSJT and BDA method in field A to other two fields is all poor It larger can be caused by the gap in A data set and other two fields.But the transfer learning effect between the field W and D is preferable, FSJT Reach 88.41% and 89.27%, and is above the accuracy rate 85.37% and 86.02% of D → W and W → D in BDA method.By A small amount of tape label data on aiming field are utilized in FSJT, therefore may learn more target domain characterizations, there is precision It is improved.Accuracy rate highest of the SVM model in four kinds of data in the result of three kinds of machine learning methods is utilizing W data collection The precision of training, the test of D data set has reached 77.71%, followed by the 72.33% of LR model and DT model.And comprehensive five classes Method as a result, two class transfer learning methods are all higher than three kinds of machine learning algorithms in the result of four kinds of situations.It may be due to machine Device study needs to guarantee training data and the same distributional assumption of test data, and transfer learning method can handle the difference of two class data It is different, so that the generalization ability of model has obtained certain promotion.To sum up, performance of the FSJT method on normal data demonstrates the mould The validity of type.

It should be understood that it is above-mentioned for the more detailed of preferred embodiment, can not therefore it be considered to the present invention The limitation of scope of patent protection, those skilled in the art under the inspiration of the present invention, want not departing from right of the present invention It asks under protected ambit, replacement or deformation can also be made, fallen within the protection scope of the present invention, of the invention asks Ask that the scope of protection shall be subject to the appended claims.

Claims

1. feature and example combine transfer learning method under a kind of semi-supervised scene, it is characterised in that: the method includes following Step:

Step 1: data in Definition Model: giving a field, it is known that each data generic uses " 0 " or " 1 " in the field It indicates, is denoted as source domain D_s, the n sample eigenmatrix and class label vector form for including be expressed asIt is abbreviated as {x_s,y_s}；Another field is given, aiming field D is denoted as_t, wherein m sample composition tape label data set is denoted asNo Tape label data set is denoted asQ=1 ..., k, two class data sets are denoted as altogetherInclude m+k trained sample This, and assumeMiddle sample size is not enough to one reliable classifier of training；

Assuming that feature space X_s=X_t, i.e., the feature type in two fields is identical as quantity, classification space Y_s=Y_t, but edge distribution P_s(x_s)≠P_t(x_t), condition is distributed P_s(y_s|x_s)≠P_t(y_t|x_t), target is to utilize source domain D_sMiddle data set { x_s,y_sAnd target Domain D_tMiddle data setUnlabeled data in learning objective domainClassificationWherein r=1 ..., l, N, m, k, l are the natural number greater than 1；

Wherein objective function minimum value is sought in min expression, and tr () indicates to seek the mark of matrix, and X is by source domain and aiming field data set group At input matrix, and meet X=[x_s,x_t], whereinMapping function ψ:By kernel function K=ψ (X)^Tψ (X) is provided, and λ is regularization coefficient,Indicate that square of F norm, balance factor μ, A indicate transition matrix, H =I- (1/n) 1 is center matrix, I ∈ R^{(n+m+k+l)×(n+m+k+l)}For unit matrix, M₀, M_cFor MMD matrix, make is as follows:

Wherein { 0,1 } c ∈,WithIt is illustrated respectively in the number for belonging to the sample of c class in source domain and aiming field, n_cExpression source Numeric field data collection { x_s,y_sIn belong to c class sample number；m_cIndicate aiming field tape label data setIn belong to c The number of the sample of class；k_cIndicate aiming field without label data collectionIn belong to c class sample number；l_cIndicate target Domain test data set { x_testIn belong to c class sample number；

Step 3: being solved using method of Lagrange multipliers, set Lagrangian as Φ=(φ₁,φ₂,...,φ_d), d A Minimal eigenvalue, derive the local derviation that transition matrix A is sought after Lagrangian:

Step 4: calculating transformed matrix A using Matlab^Tψ(X)；

Step 5: be directed to transformed source domain and target numeric field data, carry out instance migration study, with filter out in source domain with target The sample that domain differs greatly, specifically:

5.1: data preparation, transformed source domain training dataset { z_s,y_s, aiming field tape label training dataset Aiming field not tape label training datasetWith the test data set { x of aiming field_test}；

5.3: iterative calculation updates sample weights vector w^v+1, wherein source domain D_sOn weight be updated toIts Middle i=1 ..., n, aiming field D_tOn weight be updated toWherein i=n+1 ..., n+m；

5.4: being calculated using following formulaThe prediction label of middle data

Confidence level is calculated using such as following formulaScreen the δ that confidence level is greater than γ_uA sample, wherein γ indicates confidence level threshold Value,It is addedData set, wherein u=1 ..., U, U indicate the number of iterations, update m=m+ δ_u, and FromMiddle these samples of deletion,

5.5: step 5.2~5.4 are repeated, until u=U；

2. feature and example combine transfer learning method under semi-supervised scene according to claim 1, it is characterised in that: institute Stating step 4 includes following sub-step:

4.1: data preparation, source domain training dataset { x_s,y_s, aiming field training dataset dataAnd aiming field Test data set { x_test}；

4.2: data set merges, by source domain eigenmatrix x_sWith target domain characterization matrixIt is combined into matrix X =[x_s,x_t]；

4.3: utilizing existing tag setCalculate initial m_c, with source domain data { x_s,y_sOne classifier of training, predict mesh It marks in domain without label data collectionWith test data set { x_testInitial pseudo label collectionWithAnd it calculates initial k_cAnd l_c, calculate initial M₀And M_c；

4.5: using the formula calculating matrix A of step 3, obtaining transformed matrix { A^Tψ(x_s),y_s, same class is trained on it The classifier of type, prediction aiming field have label data collectionPseudo label collectionAccording toOn Accuracy rate repeats step 3, using PSO optimization algorithm adjusting parameter μ, d, and obtains optimal parameter combination；

4.6 repeat step 3 using parameter after updating, and calculate A^Tψ (X) matrix, and be decomposed intoForm.

3. feature and example combine transfer learning method under semi-supervised scene according to claim 1, it is characterised in that: institute The calculating step for stating each iteration in step 5.3 includes:

(1) it calculatesThe component of weight is set to add up to 1, v=1 ..., N；

(2) sample weights ω is utilized^vTraining base classifier h_v, predict in aiming field without label training datasetLabel, note ForAnd calculate the error ε in aiming field tape label data_v:

(3) β is set_v=ε_v/(1-ε_v), and update sample weights vector w^v+1, source domain D_sOn weight be updated to Wherein i=1 ..., n, aiming field D_tOn weight be updated toWherein i=n+1 ..., n+m.

4. feature and example combine transfer learning method under semi-supervised scene according to claim 1, it is characterised in that: institute State each iteration in step 4.4 the following steps are included:

(3) M is updated_cMatrix, until iteration terminates.

5. feature and example combine transfer learning method under semi-supervised scene according to claim 1, it is characterised in that: institute It states in step 5.3 and calculatesThe prediction label of middle dataFormula are as follows:

6. feature and example combine transfer learning method under semi-supervised scene according to claim 1, it is characterised in that: institute It states and calculates confidence level in step 5.3Formula are as follows: