CN103177114B - Cross-data-domain transfer learning classification method based on identification manifold - Google Patents

Cross-data-domain transfer learning classification method based on identification manifold

Info

Publication number
CN103177114B
CN103177114B CN201310113911.0A
Authority
CN
China
Prior art keywords
data
domain
target
Prior art date
Legal status
Expired - Fee Related
Application number
CN201310113911.0A
Other languages
Chinese (zh)
Other versions
CN103177114A (en)
Inventor
方正
张仲非
Current Assignee
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201310113911.0A priority Critical patent/CN103177114B/en
Publication of CN103177114A publication Critical patent/CN103177114A/en
Application granted granted Critical
Publication of CN103177114B publication Critical patent/CN103177114B/en

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a cross-data-domain transfer learning classification method based on an identification manifold, comprising the following steps: inputting the data of each data domain and the label data used for training, and establishing adjacency graphs on the data for spectral-graph geometric adjustment; combining the input data, the label information, and the established adjacency graphs with the optimization objectives to establish a unified mathematical model; deriving update formulas for the variables from the established model, and updating the hidden factors of each dimension of each data domain, the relational structure shared between domains, and the regression coefficients in an alternating iteration mode until convergence; and using the obtained parameters to perform class-label prediction on the target-domain data, obtaining the predicted class labels of the target-domain data. The invention learns a discriminative data manifold space in which the new representation factors carry a highly discriminative structure that benefits classification while preserving the original cluster manifold structure of the data.

Description

Cross-data-domain transfer learning classification method based on identification manifold
Technical Field
The invention belongs to the technical field of data processing, and particularly relates to a cross-data-domain transfer learning classification method based on identification manifold.
Background
In the information age of massive big data, data of all kinds grows explosively, and mining the latent value of that data has become a focus of attention and research. In the internet, mobile communication, and finance sectors, daily activity continuously generates large volumes of data, and classification is one of the most effective techniques for mining potentially useful knowledge from it. For example, internet users send and receive large numbers of e-mails every day; sorting those e-mails for users and automatically identifying spam requires accurate and effective classification technology. As another example, effectively classifying and inspecting data streams at a network router node, so that anomalies and trojan-virus traffic are discovered in time, plays a major role in keeping a network secure and stable. In finance, monitoring and classifying user transaction behavior helps identify malicious fraudulent transactions and thereby avoid the major economic losses they cause.
On the other hand, practical data mining classification problems usually require reliable labeled data as training samples, and obtaining such training data costs considerable manpower, material resources, and time. As a result, only a limited amount of manually classified labeled data is typically available in the domain under study to train a model. However, if a related, similar data domain contains a certain amount of reliably classified data, the target-domain data can still be modeled and accurately classified despite the shortage of training data, by exploiting the relationship between the domains to transfer knowledge. Taking the internet as an example: even if the data at one point in time carries sufficient labels, the data will evolve over time, and a model trained on earlier data may no longer suit future data objects; readjusting or retraining it brings heavy manpower and time costs. How to reuse the information and knowledge in earlier training data, and so reduce the investment that retraining demands, is therefore of great significance for classification problems over data domains at different times. Among the many existing advanced techniques, the most representative is transfer learning, which addresses exactly this knowledge mining problem: how to use the labels and useful information of other data domains to assist clustering, classification, and similar tasks on the target data domain.
In existing transfer learning text mining algorithms, many researchers have proposed mining latent data representation factors and using the relational structure between the hidden factors of the data dimension and those of the feature dimension as the physical quantity shared among multiple domains. The inter-domain relationship established through this shared hidden-factor structure transfers knowledge between data domains to a certain extent, and allows training and classification with the auxiliary domain's labeled data when the target domain has only a few training samples. However, in most hidden-factor mining algorithms of this kind, the learned hidden factors lack the discriminative characteristics needed for accurate classification. Because most hidden factors are obtained through a framework that combines matrix factorization with clustering, the mining of a discriminative structure is neglected while the internal clustering structure of the data is preserved, forfeiting the ability to further improve class prediction accuracy. Moreover, although the latent relationship between the hidden factors of each dimension of the target and auxiliary domains is shared during transfer learning, distribution gaps between domains remain among the finally learned hidden factors. In particular, when the target and auxiliary data domains use the same classification decision function, the classifier may classify the auxiliary domain accurately and yet, because of the inter-domain deviation in data distribution, still fail to achieve the desired classification performance in the target domain.
In view of these defects and shortcomings of existing transfer learning classification methods based on hidden-factor mining, the transfer learning classification technique provided by the invention mines a discriminative structure that benefits classification while keeping a good clustering structure of the data, and greatly reduces the inter-domain deviation of the final hidden factors by adjusting the Maximum Mean Discrepancy (MMD) distance between the different data domains. The problem of transfer learning classification across data domains is thereby effectively solved. Compared with existing transfer learning classification techniques based on hidden-factor mining, the provided classifier is greatly improved in both accuracy and stability.
Disclosure of Invention
In order to solve the above problems, an object of the present invention is to provide a cross-data-domain transfer learning classification method based on an identification manifold, which, while performing cross-data-domain transfer learning classification, learns a discriminative data manifold space through the unified combination of joint matrix factorization and a regression discrimination model under certain constraints. The new representation factors of the data in this manifold space carry a highly discriminative structure that benefits classification, while the original clustering manifold structure of the data is maintained. By minimizing the inter-domain data distribution distance MMD (Maximum Mean Discrepancy), the inter-domain difference of the hidden factors learned across the data domains is greatly reduced, further improving the accuracy and stability of the cross-data-domain transfer learning classifier.
In order to achieve the purpose, the technical scheme of the invention is as follows:
a cross-data-domain transfer learning classification method based on identification manifold comprises the following steps:
S10, inputting the data of each data domain and the label data used for training, and establishing adjacency graphs on the data for spectral-graph geometric adjustment;
S20, for the input data, the label information, and the established adjacency graphs, combining the optimization objectives, including a cross-data-domain joint matrix factorization model, an identification regression model, cross-data-domain distance adjustment, and manifold geometry adjustment, to establish a unified mathematical model;
s30, deriving an updating formula of the variables according to the established mathematical model, and updating hidden factors, inter-domain shared relational structures and regression coefficients of each dimensionality of each data domain in an alternating iteration mode until convergence;
S40, performing class-label prediction on the target-domain data by using the obtained parameters, to obtain the predicted class labels of the target-domain data.
Preferably, S10 specifically includes the following steps:
S101, inputting an auxiliary data domain $D_s$ and a target data domain $D_t$, including the labeled data of the auxiliary data domain and the corresponding label information matrix, as well as the data of the target domain; when the target domain has a small amount of labeled data, also inputting the label-indicator matrix $P_t$, which indicates which target-domain data are labeled, together with the label information of the target-domain data. The set $I = \{s, t\}$ supplies the subscripts of the different data domains; when the data domain referred to is $\pi \in I$, the other data domain corresponding to it is denoted by the complementary subscript.
S102, respectively constructing from the input data an adjacency graph of the data dimension of the auxiliary domain and an adjacency graph of the feature dimension, with edge weights between the points of each adjacency graph as follows:
[edge-weight formulas for the two adjacency graphs are rendered as images in the original]
where $N_p(x)$ denotes the p-nearest-neighborhood of data point x, taking p = 5.
Similarly, a data-dimension adjacency graph and a feature-dimension adjacency graph are constructed for the target domain, with edge weights between the points of each adjacency graph as follows:
[edge-weight formulas for the two adjacency graphs are rendered as images in the original]
where $N_p(x)$ denotes the p-nearest-neighborhood of data point x, taking p = 5.
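As an illustration of step S102, the following sketch builds a symmetric p-nearest-neighbor adjacency graph in Python. Since the edge-weight formulas are rendered as images in the original patent, binary 0/1 weights are an assumption here, and all function and variable names are illustrative.

```python
import numpy as np

def knn_adjacency(X, p=5):
    """Build a symmetric p-nearest-neighbor adjacency graph.

    X: (n_points, n_features) array, one row per point.
    Binary 0/1 edge weights are an assumption; the patent's
    weight formulas are given only as images.
    """
    n = X.shape[0]
    # pairwise squared Euclidean distances via ||a-b||^2 = ||a||^2 + ||b||^2 - 2ab
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    np.fill_diagonal(d2, np.inf)          # exclude self-loops
    W = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(d2[i])[:p]      # indices of the p nearest neighbors
        W[i, nbrs] = 1.0
    return np.maximum(W, W.T)             # symmetrize

# example: a graph over 20 random points, p = 5 as in the patent
W = knn_adjacency(np.random.rand(20, 8), p=5)
```

The same helper serves both the data-dimension graph (rows are documents) and the feature-dimension graph (rows are feature/word vectors).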
Preferably, S20 specifically includes the following steps:
S201, establishing a joint matrix factorization model across data domains:

$\min_{U^\pi, H, V^\pi \ge 0} \sum_{\pi \in I} \| X^\pi - U^\pi H V^\pi \|^2$

The matrix factorization model decomposes the data of the target data domain and the auxiliary data domain into low-dimensional data representations simultaneously, and retains the common knowledge structure between the two domains. Here $U^\pi$ represents the low-dimensional clustering structure of the features of data domain $D^\pi$, with $k_m$ the number of clusters in the feature dimension; $V^\pi$ represents the low-dimensional clustering structure of the data of $D^\pi$ and is also the low-dimensional hidden representation factor of the data, with $k_n$ the number of data clusters; $H$ represents the relational structure between the feature classes and the data classes of $D^\pi$, and the target data domain and the auxiliary data domain share this stable relational structure.
S202, fusing in an identification regression model and imposing a supervision constraint on the low-dimensional hidden representation factor of the data:

$\min_{V^\pi, U^\pi, H, A} \sum_{\pi \in I} \left( \| X^\pi - U^\pi H V^\pi \|^2 + \beta \| Y^\pi P^\pi - A V^\pi P^\pi \|^2 \right) + \alpha \| A \|^2$

where $A$ is the regression coefficient acting on the data hidden factors, and the label-indicator matrix $P_t$ is diagonal: $P^\pi_{ii} = 1$ when the $i$-th data point of data domain $D^\pi$ is labeled and takes part in the supervised regression discrimination constraint, and $P^\pi_{ii} = 0$ otherwise.
S203, reducing the difference between the target data domain and the auxiliary data domain by introducing adjustment of the Maximum Mean Discrepancy (MMD) distance;
the inter-domain difference distance in the data dimension is defined as follows:
the inter-domain difference distance in feature dimensions is defined as follows:
In order to reduce the difference between the target data domain and the auxiliary data domain, the learned data hidden representation factors and the low-dimensional feature clustering structure factors are expected to make the inter-domain difference distance in each dimension as small as possible. The two distance functions are therefore fused, as adjustment terms to be minimized, into the model obtained in step S202, giving the following:
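The MMD adjustment of S203 can be sketched as follows. The patent's exact inter-domain distance formulas are given as images, so this sketch uses the standard empirical (linear-kernel) MMD, the squared distance between the domain means of the hidden factors; the function name and the column-per-data-point layout are assumptions.

```python
import numpy as np

def mmd_distance(Vs, Vt):
    """Empirical (linear-kernel) MMD between two domains' hidden factors.

    Vs, Vt: (k, n_s) and (k, n_t) arrays whose columns are the
    low-dimensional representations of individual data points.
    Returns the squared distance between the two domain means.
    """
    return float(np.sum((Vs.mean(axis=1) - Vt.mean(axis=1)) ** 2))
```

Identical distributions of hidden factors give a distance of zero, which is exactly what the adjustment term drives the model toward.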
S204, preserving the low-dimensional manifold structure of the data: according to spectral-graph geometric theory, the data-dimension adjacency graph of the auxiliary domain obtained in step S102 is used to establish a measure of the smoothness of the data mapping function along geodesics in the low-dimensional manifold space:
where $D_s^v = \mathrm{diag}\big(\textstyle\sum_i (W_s^v)_{ij}\big)$.
A measure of the smoothness of the data feature mapping function along geodesics in the low-dimensional manifold space is likewise established using the feature-dimension adjacency graph of the auxiliary domain obtained in step S102:
where $D_s^u = \mathrm{diag}\big(\textstyle\sum_i (W_s^u)_{ij}\big)$.
Similarly, the data-dimension adjacency graph of the target domain obtained in step S102 is used to establish, on the data dimension of the target domain, a measure of the smoothness of the data mapping function along geodesics in the low-dimensional manifold space:
where $D_t^v = \mathrm{diag}\big(\textstyle\sum_i (W_t^v)_{ij}\big)$.
A measure of the smoothness of the data feature mapping function along geodesics in the low-dimensional manifold space is established on the feature dimension, using the feature-dimension adjacency graph of the target domain obtained in step S102:
where $D_t^u = \mathrm{diag}\big(\textstyle\sum_i (W_t^u)_{ij}\big)$.
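The smoothness measures of S204 are the usual graph-regularization traces built from the Laplacian L = D - W, with the degree matrices defined as above. A minimal sketch (names are illustrative):

```python
import numpy as np

def smoothness(F, W):
    """Graph-smoothness measure tr(F L F^T) with L = D - W.

    F: (k, n) factor matrix whose columns correspond to graph nodes.
    W: (n, n) symmetric adjacency weights.
    For symmetric W this equals 0.5 * sum_ij W_ij * ||F[:,i] - F[:,j]||^2,
    so smooth mappings (similar factors on adjacent nodes) score low.
    """
    D = np.diag(W.sum(axis=0))   # degree matrix, D_jj = sum_i W_ij
    L = D - W                    # unnormalized graph Laplacian
    return float(np.trace(F @ L @ F.T))
```

For the feature-dimension factors, whose rows (not columns) correspond to graph nodes, the same measure applies to the transpose.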
s205: the cross-data-domain transfer learning classification model based on the identification manifold is established as follows:
s.t. $V_s, V_t, U_s, U_t, H \ge 0$
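Putting S201-S205 together, the unified objective can be sketched as below. The relative weights of the MMD and manifold terms (mu and lam here) and the exact form of the feature-dimension MMD are assumptions, since those formulas appear as images in the original; the function and parameter names are illustrative.

```python
import numpy as np

def tlcdm_objective(X, Y, P, V, U, Wv, Wu, H, A,
                    alpha=1.0, beta=1.0, mu=1.0, lam=1.0):
    """Unified objective of the model in S205 (a sketch; the weights
    mu, lam on the MMD and manifold terms are assumptions).

    X, Y, P, V, U, Wv, Wu are dicts keyed by domain 's' / 't':
    X[d] (m x n_d) data, Y[d] (c x n_d) labels, P[d] (n_d x n_d)
    diagonal label indicator, V[d] (k_n x n_d) data hidden factors,
    U[d] (m x k_m) feature factors, Wv/Wu adjacency graphs.
    """
    def laplacian(W):
        return np.diag(W.sum(axis=0)) - W

    obj = alpha * np.sum(A ** 2)                       # regression regularizer
    for d in ('s', 't'):
        obj += np.sum((X[d] - U[d] @ H @ V[d]) ** 2)   # joint factorization
        obj += beta * np.sum((Y[d] @ P[d] - A @ V[d] @ P[d]) ** 2)  # discrimination
        obj += lam * np.trace(V[d] @ laplacian(Wv[d]) @ V[d].T)     # data manifold
        obj += lam * np.trace(U[d].T @ laplacian(Wu[d]) @ U[d])     # feature manifold
    # MMD adjustment on the data and feature dimensions (linear-kernel form assumed)
    obj += mu * np.sum((V['s'].mean(axis=1) - V['t'].mean(axis=1)) ** 2)
    obj += mu * np.sum((U['s'].mean(axis=0) - U['t'].mean(axis=0)) ** 2)
    return float(obj)
```

Every term is a sum of squares or a positive-semidefinite trace, so the objective is nonnegative and decreases monotonically under suitable alternating updates.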
preferably, the alternating iteration in S30 specifically includes the following steps:
S301, updating the auxiliary-domain data hidden factor $V_s$:
where $B_s = A^T Y_s P_s P_s^T$, $B_s^+ = (|B_s| + B_s)/2$, $B_s^- = (|B_s| - B_s)/2$, $E_s = A^T A V_s P_s P_s^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$.
S302, updating the target-domain data hidden factor $V_t$:
where $B_t = A^T Y_t P_t P_t^T$, $B_t^+ = (|B_t| + B_t)/2$, $B_t^- = (|B_t| - B_t)/2$, $E_t = A^T A V_t P_t P_t^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$.
S303, updating the feature-dimension low-dimensional factor $U_s$ of the auxiliary domain:
S304, updating the feature-dimension low-dimensional factor $U_t$ of the target domain:
S305, updating the factor shared between the auxiliary domain and the target domain, i.e. the relational structure between the hidden factors of the data dimension and the hidden factors of the feature dimension, according to the following formula:
wherein
S306, updating the regression coefficient A:
where $\gamma = \alpha / \beta$.
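The positive and negative parts used throughout S301-S306 (e.g. $B^+$, $B^-$, $R^+$, $R^-$) split a matrix into two nonnegative matrices whose difference recovers the original; this is what keeps the multiplicative updates nonnegative. A minimal sketch of that split (the helper name is illustrative):

```python
import numpy as np

def pos_neg_split(M):
    """Split a matrix into nonnegative parts, M = M_plus - M_minus,
    using M_plus = (|M| + M)/2 and M_minus = (|M| - M)/2 as in S301."""
    return (np.abs(M) + M) / 2.0, (np.abs(M) - M) / 2.0

# example: split a matrix with mixed signs
B = np.array([[1.0, -2.0], [0.0, 3.0]])
Bp, Bm = pos_neg_split(B)   # B == Bp - Bm, both parts nonnegative
```

Placing the positive parts in the numerator and the negative parts in the denominator of a multiplicative ratio ensures the updated factors stay nonnegative, as the constraints in S205 require.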
Preferably, S40 further includes the steps of:
S401, using the obtained regression coefficient $A$ and the target-domain document hidden factor $V_t$ to perform class-label prediction on the target-domain documents, obtaining the predicted class labels of the target-domain news documents:
$\tilde{Y}_t = A V_t$;
S402, the class of each datum is determined by the row index of the largest element in the corresponding column of $\tilde{Y}_t$.
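Steps S401-S402 amount to a matrix product followed by a column-wise argmax; a minimal sketch (names are illustrative):

```python
import numpy as np

def predict_labels(A, Vt):
    """Class-label prediction for the target domain (S401-S402).

    A:  (classes x k_n) regression coefficient matrix.
    Vt: (k_n x documents) target-domain hidden factors.
    Y_t = A @ Vt is a (classes x documents) score matrix; each
    document's class is the row index of its largest score.
    """
    Yt = A @ Vt                   # S401: score matrix
    return np.argmax(Yt, axis=0)  # S402: argmax down each column
```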
Compared with the prior art, the invention has the following beneficial effects:
(1) the classifier provided by the embodiment of the invention introduces an identification regression model into the hidden-factor mining algorithm of transfer learning, so that the learned data hidden factors carry a discriminative structure that benefits classification, improving the discrimination power and classification accuracy of the classifier;
(2) the embodiment of the invention mines potentially useful structures of the data while minimizing the inter-domain difference of the learned hidden factors through the Maximum Mean Discrepancy (MMD) distance, reducing the deviation caused by data distribution drift between domains; together with the inter-domain sharing of the feature-dimension relation matrix and the data-dimension clustering structure, this resolves a difficult problem of traditional transfer learning algorithms;
(3) the embodiment of the invention performs joint matrix factorization on the data of the auxiliary domain and the target domain, and through spectral-graph geometric adjustment retains the inherent manifold structure of the data in the subspace of the mined hidden factors, so that the learned hidden factors possess a discriminative classification structure while preserving the clustering structure of the original data, improving the noise resistance and robustness of the classifier;
(4) the embodiment of the invention provides a classifier (TLCDM) for cross-data-domain transfer learning based on identification manifold, and innovatively provides a set of effective iterative parameter-update methods for training the classifier.
Drawings
FIG. 1 is a flowchart illustrating the steps of a cross-data-domain transfer learning classification method based on identification manifold according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
On the contrary, the invention is intended to cover alternatives, modifications, equivalents and alternatives which may be included within the spirit and scope of the invention as defined by the appended claims. Furthermore, in the following detailed description of the present invention, certain specific details are set forth in order to provide a better understanding of the present invention. It will be apparent to one skilled in the art that the present invention may be practiced without these specific details.
The embodiment of the present invention provides a classifier (TLCDM) for cross-data-domain transfer learning based on identification manifold. In this embodiment the input data is news text data, and topic classification of the news data is taken as the example; the classification method of the embodiment can equally be applied to various cross-domain data classification problems, such as video data in the target domain with picture data in the auxiliary domain for video classification, or e-mail data of different users as the target and auxiliary domains for spam classification.
Referring to fig. 1, a flowchart illustrating steps of a cross-data domain transfer learning classification method based on identification manifold according to an embodiment of the present invention is shown, which includes the following steps:
S10, inputting the data of each data domain and the class-label data used for training, and establishing adjacency graphs on the data for spectral-graph geometric adjustment. Specifically, the method comprises steps S101 to S102:
S101, inputting an auxiliary data domain $D_s$ and a target data domain $D_t$, including the labeled data of the auxiliary data domain and the corresponding label information matrix, as well as the data of the target domain; when the target domain has a small amount of class-labeled data, also inputting the class-label indicator matrix $P_t$, which indicates which target-domain data are labeled, together with the class-label information of the target-domain data.
S102, for news data, the data dimension is each news document and the feature dimension is the text words in the news. A document adjacency graph and a text-word adjacency graph of the auxiliary domain are constructed respectively, with edge weights between the points of each adjacency graph as follows:
[edge-weight formulas for the two adjacency graphs are rendered as images in the original]
where $N_p(x)$ denotes the p-nearest-neighborhood of object x, taking p = 5.
Similarly, a document adjacency graph and a text-word adjacency graph are constructed for the target domain, with edge weights between the points of each adjacency graph as follows:
[edge-weight formulas for the two adjacency graphs are rendered as images in the original]
where $N_p(x)$ denotes the p-nearest-neighborhood of object x, taking p = 5.
S20, for the input data, the label information, and the created adjacency graphs, combining the optimization objectives, including the cross-data-domain joint matrix factorization model, the identification regression model, the cross-data-domain distance adjustment, and the manifold geometry adjustment, to create a unified mathematical model. Specifically this comprises steps S201 to S205:
s201, establishing a joint matrix decomposition model across data domains:
wherein, for ease of discussion and of expressing the modeling, the set $I = \{s, t\}$ supplies the subscripts of the different data domains; when the data domain referred to is $\pi \in I$, the other data domain corresponding to it is denoted by the complementary subscript.
The matrix factorization model decomposes the documents and text words of the target data domain and the auxiliary data domain into low-dimensional data representations simultaneously, and retains the common knowledge structure between the two data domains. Here $U^\pi$ represents the low-dimensional clustering structure of the text words of data domain $D^\pi$, with $k_m$ the number of text-word clusters; $V^\pi$ represents the low-dimensional clustering structure of the documents and is also the low-dimensional hidden representation factor of the documents, with $k_n$ the number of document clusters; $H$ represents the relational structure between the text-word classes and the document classes of $D^\pi$. Experience shows that the target data domain and the auxiliary data domain share this stable relational structure.
S202, fusing and identifying a regression model, and carrying out supervision constraint on a low-dimensional hidden representation factor of the document:
where $A$ is the regression coefficient acting on the data hidden factors, and the class-label indicator matrix $P_t$ is diagonal: $P^\pi_{ii} = 1$ when the $i$-th document of data domain $D^\pi$ is labeled and takes part in the supervised regression discrimination constraint, and $P^\pi_{ii} = 0$ otherwise.
S203, reducing the difference between the target data domain and the auxiliary data domain by introducing adjustment of the Maximum Mean Discrepancy (MMD) distance.
The inter-domain difference distance in the data dimension is defined as follows:
the inter-domain difference distance in feature dimensions is defined as follows:
In order to reduce the difference between the target data domain and the auxiliary data domain, the inter-domain difference distance defined on the document hidden factors, and that defined on the low-dimensional representation factors of the text words, should both be as small as possible. The two distance functions are therefore fused, as adjustment terms to be minimized, into the model obtained in step S202, giving the following:
S204, maintaining the low-dimensional manifold structure of the data. According to spectral-graph geometric theory, the document adjacency graph of the auxiliary domain obtained in step S102 is used to establish a measure of the smoothness of the document mapping function along geodesics in the low-dimensional manifold space:
where $D_s^v = \mathrm{diag}\big(\textstyle\sum_i (W_s^v)_{ij}\big)$.
A measure of the smoothness of the text-word mapping function along geodesics in the low-dimensional manifold space is established using the text-word adjacency graph of the auxiliary domain obtained in step S102:
where $D_s^u = \mathrm{diag}\big(\textstyle\sum_i (W_s^u)_{ij}\big)$.
Similarly, the document adjacency graph of the target domain obtained in step S102 is used to establish, on the document dimension, a measure of the smoothness of the document mapping function along geodesics in the low-dimensional manifold space:
where $D_t^v = \mathrm{diag}\big(\textstyle\sum_i (W_t^v)_{ij}\big)$.
A measure of the smoothness of the text-word mapping function along geodesics in the low-dimensional manifold space is established on the text-word dimension, using the text-word adjacency graph of the target domain obtained in step S102:
where $D_t^u = \mathrm{diag}\big(\textstyle\sum_i (W_t^u)_{ij}\big)$.
and S205, establishing a cross-data-domain transfer learning classification model based on the identification manifold.
In order to preserve the inherent original structure of the data in the manifold space of each dimension (especially the spatial smoothness of the data) in the target domain and the auxiliary domain, the function smoothness measures of each dimension in the two domains are taken as constraint adjustments of the matrix factorization model and fused into a unified mathematical model. Considering also the non-negativity of the obtained low-dimensional representation factors of all dimensions and of the relational structure matrix, the following cross-data-domain transfer learning classification model based on the identification manifold is finally obtained:
s.t. $V_s, V_t, U_s, U_t, H \ge 0$
The joint matrix factorization model mines the hidden factors; the identification regression model improves their discriminability; the cross-data-domain distance adjustment reduces the distribution difference of the hidden factors of the different data domains; and the manifold geometry adjustment preserves the local clustering structure of the original data. The learned hidden factors thus carry a discriminative classification structure while keeping the clustering structure of the original data, improving the noise resistance and robustness of the classifier.
S30, deriving the update formulas of the variables according to the mathematical model established in S20, and updating the hidden factors of the document and text-word dimensions of each data domain, the inter-domain shared relational structure, and the regression coefficients in an alternating iteration mode until convergence. Each iteration specifically comprises steps S301 to S306:
S301, updating the auxiliary-domain document hidden factor $V_s$:
where $B_s = A^T Y_s P_s P_s^T$, $B_s^+ = (|B_s| + B_s)/2$, $B_s^- = (|B_s| - B_s)/2$, $E_s = A^T A V_s P_s P_s^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$.
S302, updating the target-domain document hidden factor $V_t$:
where $B_t = A^T Y_t P_t P_t^T$, $B_t^+ = (|B_t| + B_t)/2$, $B_t^- = (|B_t| - B_t)/2$, $E_t = A^T A V_t P_t P_t^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$.
S303, updating the auxiliary-domain text-word low-dimensional representation factor $U_s$:
S304, updating the target-domain text-word low-dimensional representation factor $U_t$:
S305, updating the shared structural factor between the auxiliary domain and the target domain, i.e. the relational factor between the clustering structure of documents and the clustering structure of text words. The update formula is as follows:
wherein
S306, updating the regression coefficient A:
where $\gamma = \alpha / \beta$.
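The alternating iteration of S301-S306 can be sketched as a generic driver loop. The concrete multiplicative update formulas are given in the patent as images, so they are represented here by caller-supplied step functions, and the convergence test (maximum absolute change below a tolerance) is an assumption.

```python
import numpy as np

def alternating_fit(params, update_steps, tol=1e-4, max_iter=200):
    """Alternating-iteration driver for S301-S306 (a structural sketch).

    params: dict of current variables (e.g. Vs, Vt, Us, Ut, H, A).
    update_steps: list of functions, each mapping params -> params,
                  applied in order once per iteration (S301..S306).
    Stops when the largest elementwise change across all variables
    falls below tol, or after max_iter iterations.
    """
    for _ in range(max_iter):
        old = {k: v.copy() for k, v in params.items()}
        for step in update_steps:
            params = step(params)
        change = max(np.max(np.abs(params[k] - old[k])) for k in params)
        if change < tol:       # all variables stable: converged
            break
    return params
```

Each `update_steps` entry would implement one of the multiplicative updates S301-S306 on its variable while holding the others fixed.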
S40, performing class-label prediction on the target-domain data using the obtained parameters, to obtain the predicted class labels of the target-domain data.
Specifically, the method comprises the following steps.
S401, using the regression coefficient $A$ and the target-domain document hidden factor $V_t$ obtained in S30 to perform class-label prediction on the target-domain documents, obtaining the predicted class labels of the target-domain news documents:
$\tilde{Y}_t = A V_t$.
S402, the class of each datum is determined by the row index of the largest element in the corresponding column of $\tilde{Y}_t$.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (2)

1. A cross-data-domain transfer learning classification method based on identification manifold is characterized by comprising the following steps:
S10, inputting the data of each data domain and the label data used for training, and establishing adjacency graphs on the data for spectral-graph geometric adjustment;
s20, establishing a unified mathematical model for the input data, label information and the established adjacency graph by combining with an optimization target, wherein the optimization target comprises a cross-data-domain joint matrix decomposition model, an identification regression model, cross-data-domain distance adjustment and manifold geometry adjustment;
s30, deriving an updating formula of the variables according to the established mathematical model, and updating hidden factors, inter-domain shared relational structures and regression coefficients of each dimensionality of each data domain in an alternating iteration mode until convergence;
S40, performing class-label prediction on the data of the target domain by using the obtained parameters, to obtain the predicted class labels of the target-domain data;
wherein, S10 specifically comprises the following steps:
S101, inputting an auxiliary data domain $D_s$ and a target data domain $D_t$, including the labeled data of the auxiliary data domain and the corresponding label information matrix, as well as the data of the target domain; when the target domain has a small amount of labeled data, also inputting the label-indicator matrix $P_t$, a matrix indicating which target-domain data are labeled, together with the label information of the target-domain data; the set $I = \{s, t\}$ supplies the subscripts of the different data domains, and when the data domain referred to is $\pi \in I$, the other data domain corresponding to it is denoted by the complementary subscript;
S102, respectively constructing, from the input data, the adjacency graph $W^v_s$ over the data dimension of the auxiliary domain and the adjacency graph $W^u_s$ over its feature dimension, with the edge weights between the points of the adjacency graphs given by:
[edge-weight formulas shown as images in the original publication]
wherein $N_p(x)$ denotes the $p$ nearest neighbors of the data point $x$, taking $p = 5$,
constructing for the target domain the data-dimension adjacency graph $W^v_t$ and the feature-dimension adjacency graph $W^u_t$, whose edge weights between points are respectively:
[edge-weight formulas shown as images in the original publication]
wherein $N_p(x)$ denotes the $p$ nearest neighbors of the data point $x$, taking $p = 5$;
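As an illustration of step S102, a symmetric 0/1 $p$-nearest-neighbor adjacency graph can be built in a few lines of numpy. This is a hypothetical sketch, not code from the patent: the patent's edge-weight formulas are rendered as images, so the binary-indicator weighting below is only an assumption consistent with the text's $N_p(x)$ (the $p$ nearest neighbors of $x$, $p = 5$), and the function name `knn_adjacency` is invented.

```python
import numpy as np

def knn_adjacency(X, p=5):
    """Symmetric 0/1 p-nearest-neighbor adjacency graph.

    X: (d, n) matrix, one sample per column (data-dimension graphs are built
    over columns; feature-dimension graphs over rows).
    Assumption: W_ij = 1 iff x_i is among the p nearest neighbors of x_j
    or vice versa -- the patent's exact weight formula is an image.
    """
    n = X.shape[1]
    # pairwise squared Euclidean distances between columns
    sq = np.sum(X ** 2, axis=0)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X.T @ X)
    np.fill_diagonal(d2, np.inf)          # a point is not its own neighbor
    W = np.zeros((n, n))
    for j in range(n):
        nbrs = np.argsort(d2[:, j])[:p]   # indices of the p nearest columns
        W[nbrs, j] = 1.0
    return np.maximum(W, W.T)             # symmetrize: OR of the two directions
```

The feature-dimension graph of the same step is obtained by applying the routine to the transposed data matrix.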
S20 specifically comprises the following steps:
S201, establishing the cross-data-domain joint matrix factorization model:
$$\min_{U^\pi,\,H,\,V^\pi \ge 0}\ \sum_{\pi \in I} \left\| X^\pi - U^\pi H V^\pi \right\|^2$$
the matrix factorization model simultaneously decomposes the data of the target data domain and the auxiliary data domain into low-dimensional data representations while preserving the common knowledge structure between the two data domains, wherein $U^\pi \in \mathbb{R}^{m_\pi \times k_m}$ represents the low-dimensional clustering of the features of the $\pi$-th data domain $D_\pi$, $k_m$ being the number of clusters of the feature dimension; $V^\pi \in \mathbb{R}^{k_n \times n_\pi}$ represents the low-dimensional hidden representation factor of the data of $D_\pi$, $k_n$ being the number of clusters of the data; and $H \in \mathbb{R}^{k_m \times k_n}$ represents the relational structure between the feature classes and the data classes of $D_\pi$, this stable relational structure being shared by the target data domain and the auxiliary data domain;
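For illustration, the joint factorization objective of step S201 can be evaluated directly with numpy. This is only a sketch, not the patent's implementation; the function name and the matrix shapes, inferred from $X^\pi \approx U^\pi H V^\pi$, are assumptions.

```python
import numpy as np

def joint_factorization_loss(Xs, Xt, Us, Ut, H, Vs, Vt):
    """Cross-domain joint tri-factorization loss
    sum_pi ||X^pi - U^pi H V^pi||^2 (squared Frobenius norm),
    with the relation structure H shared by the auxiliary (s)
    and target (t) domains.

    Assumed shapes: X^pi is m_pi x n_pi, U^pi is m_pi x k_m,
    H is k_m x k_n, V^pi is k_n x n_pi.
    """
    loss = 0.0
    for X, U, V in ((Xs, Us, Vs), (Xt, Ut, Vt)):
        R = X - U @ H @ V          # residual of the tri-factorization
        loss += np.sum(R * R)      # Frobenius norm squared
    return loss
```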
S202, fusing a discriminative regression model to impose supervision constraints on the low-dimensional hidden representation factors of the data:
$$\min_{V^\pi,\,U^\pi,\,H,\,A}\ \sum_{\pi \in I} \left( \left\| X^\pi - U^\pi H V^\pi \right\|^2 + \beta \left\| Y^\pi P^\pi - A V^\pi P^\pi \right\|^2 \right) + \alpha \| A \|^2$$
wherein $A$ is the regression-coefficient matrix acting on the data hidden factors; the label-indicator matrix $P^\pi$ is diagonal, with $P^\pi_{ii} = 1$ when the $i$-th datum of the $\pi$-th data domain $D_\pi$ is labeled and thus used in the supervised regression discrimination constraint, and $P^\pi_{ii} = 0$ otherwise;
S203, reducing the difference between the target data domain and the auxiliary data domain by introducing Maximum Mean Discrepancy (MMD) distance regularization;
the inter-domain difference distance in the data dimension is defined as follows:
$$\mathrm{Dist}_v(D_s, D_t) = \left\| \frac{1}{n_s} \sum_{i=1}^{n_s} v^s_{\cdot i} - \frac{1}{n_t} \sum_{j=1}^{n_t} v^t_{\cdot j} \right\|^2;$$
the inter-domain difference distance in feature dimensions is defined as follows:
$$\mathrm{Dist}_u(D_s, D_t) = \left\| \frac{1}{m_s} \sum_{i=1}^{m_s} u^s_{i\cdot} - \frac{1}{m_t} \sum_{j=1}^{m_t} u^t_{j\cdot} \right\|^2;$$
in order to reduce the difference between the target data domain and the auxiliary data domain, the data hidden representation factors and the low-dimensional feature clustering factors are expected to make the inter-domain difference distance in each dimension as small as possible; the two distance functions are therefore fused into the model obtained in step S202 as regularization terms to be minimized, giving:
$$\min_{V^s, V^t, U^s, U^t, H, A}\ \sum_{\pi \in I} \left( \left\| X^\pi - U^\pi H V^\pi \right\|^2 + \beta \left\| Y^\pi P^\pi - A V^\pi P^\pi \right\|^2 \right) + \alpha \| A \|^2 + \left\| \frac{1}{m_s} \mathbf{1}_{m_s}^T U^s - \frac{1}{m_t} \mathbf{1}_{m_t}^T U^t \right\|^2 + \left\| \frac{1}{n_s} V^s \mathbf{1}_{n_s} - \frac{1}{n_t} V^t \mathbf{1}_{n_t} \right\|^2$$
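The two MMD regularization terms of step S203 are simply squared distances between per-domain means of the hidden factors, which makes them cheap to compute. A hypothetical numpy sketch (the function name is invented):

```python
import numpy as np

def mmd_terms(Us, Ut, Vs, Vt):
    """The two Maximum Mean Discrepancy adjustment terms of step S203:
    data dimension:    || mean_i v^s_{.i} - mean_j v^t_{.j} ||^2
    feature dimension: || mean_i u^s_{i.} - mean_j u^t_{j.} ||^2
    i.e. squared Euclidean distances between the domain means of the
    hidden factors. V^pi: (k_n, n_pi); U^pi: (m_pi, k_m)."""
    dist_v = np.sum((Vs.mean(axis=1) - Vt.mean(axis=1)) ** 2)  # mean over columns
    dist_u = np.sum((Us.mean(axis=0) - Ut.mean(axis=0)) ** 2)  # mean over rows
    return dist_v, dist_u
```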
S204, preserving the low-dimensional manifold structure of the data: according to spectral graph theory, using the data-dimension adjacency graph $W^v_s$ of the auxiliary domain obtained in step S102 to establish a measure of the smoothness of the data mapping function along the geodesics of the low-dimensional manifold space:
$$R^v_s = \frac{1}{2} \sum_{ij} \left\| v^s_{\cdot i} - v^s_{\cdot j} \right\|^2 (W^v_s)_{ij} = \sum_i \mathrm{tr}\!\left( v^s_{\cdot i} (v^s_{\cdot i})^T \right) (D^v_s)_{ii} - \sum_{ij} \mathrm{tr}\!\left( v^s_{\cdot i} (v^s_{\cdot j})^T \right) (W^v_s)_{ij} = \mathrm{tr}\!\left( V^s (D^v_s - W^v_s) (V^s)^T \right)$$
wherein $D^v_s = \mathrm{diag}\!\left( \sum_j (W^v_s)_{ij} \right)$,
and using the feature-dimension adjacency graph $W^u_s$ of the auxiliary domain obtained in step S102 to establish the measure of the smoothness of the data feature mapping function along the geodesics of the low-dimensional manifold space:
$$R^u_s = \frac{1}{2} \sum_{ij} \left\| u^s_{i\cdot} - u^s_{j\cdot} \right\|^2 (W^u_s)_{ij} = \sum_i \mathrm{tr}\!\left( (u^s_{i\cdot})^T u^s_{i\cdot} \right) (D^u_s)_{ii} - \sum_{ij} \mathrm{tr}\!\left( (u^s_{i\cdot})^T u^s_{j\cdot} \right) (W^u_s)_{ij} = \mathrm{tr}\!\left( (U^s)^T (D^u_s - W^u_s) U^s \right)$$
wherein $D^u_s = \mathrm{diag}\!\left( \sum_j (W^u_s)_{ij} \right)$;
similarly, using the data-dimension adjacency graph $W^v_t$ of the target domain $D_t$ obtained in step S102, establishing on the data dimension of $D_t$ a measure of the smoothness of the data mapping function along the geodesics of the low-dimensional manifold space:
$$R^v_t = \frac{1}{2} \sum_{ij} \left\| v^t_{\cdot i} - v^t_{\cdot j} \right\|^2 (W^v_t)_{ij} = \sum_i \mathrm{tr}\!\left( v^t_{\cdot i} (v^t_{\cdot i})^T \right) (D^v_t)_{ii} - \sum_{ij} \mathrm{tr}\!\left( v^t_{\cdot i} (v^t_{\cdot j})^T \right) (W^v_t)_{ij} = \mathrm{tr}\!\left( V^t (D^v_t - W^v_t) (V^t)^T \right)$$
wherein $D^v_t = \mathrm{diag}\!\left( \sum_j (W^v_t)_{ij} \right)$,
and using the feature-dimension adjacency graph $W^u_t$ of the target domain obtained in step S102 to establish, on the feature dimension, the measure of the smoothness of the data feature mapping function along the geodesics of the low-dimensional manifold space:
$$R^u_t = \frac{1}{2} \sum_{ij} \left\| u^t_{i\cdot} - u^t_{j\cdot} \right\|^2 (W^u_t)_{ij} = \sum_i \mathrm{tr}\!\left( (u^t_{i\cdot})^T u^t_{i\cdot} \right) (D^u_t)_{ii} - \sum_{ij} \mathrm{tr}\!\left( (u^t_{i\cdot})^T u^t_{j\cdot} \right) (W^u_t)_{ij} = \mathrm{tr}\!\left( (U^t)^T (D^u_t - W^u_t) U^t \right)$$
wherein $D^u_t = \mathrm{diag}\!\left( \sum_j (W^u_t)_{ij} \right)$;
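Each of the four smoothness measures of step S204 reduces to a quadratic form in the graph Laplacian, $\mathrm{tr}(V(D - W)V^T)$. An illustrative sketch (not from the patent; the function name is invented):

```python
import numpy as np

def smoothness(V, W):
    """Graph-smoothness measure R = tr(V (D - W) V^T) of step S204,
    where D = diag(sum_j W_ij). For symmetric W this equals
    (1/2) * sum_ij ||v_{.i} - v_{.j}||^2 W_ij.

    V: (k, n) hidden factors, one column per graph node; W: (n, n) weights.
    """
    D = np.diag(W.sum(axis=1))
    L = D - W                      # combinatorial graph Laplacian
    return np.trace(V @ L @ V.T)

# the feature-dimension version R^u = tr(U^T (D - W) U) is smoothness(U.T, W)
```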
S205, establishing the cross-data-domain transfer learning classification model based on the discriminative manifold as follows:
$$\min_{V^s, V^t, U^s, U^t, H, A}\ \sum_{\pi \in I} \left( \left\| X^\pi - U^\pi H V^\pi \right\|^2 + \beta \left\| Y^\pi P^\pi - A V^\pi P^\pi \right\|^2 \right) + \alpha \| A \|^2 + \sum_{\pi \in I} \lambda \left( R^u_\pi + R^v_\pi \right) + \left\| \frac{1}{m_s} \mathbf{1}_{m_s}^T U^s - \frac{1}{m_t} \mathbf{1}_{m_t}^T U^t \right\|^2 + \left\| \frac{1}{n_s} V^s \mathbf{1}_{n_s} - \frac{1}{n_t} V^t \mathbf{1}_{n_t} \right\|^2$$
$$\text{s.t. } V^s, V^t, U^s, U^t, H \ge 0$$
the alternating iteration in S30 specifically includes the following steps:
S301, updating the auxiliary-domain data hidden factor $V^s$ according to the corresponding update formula (rendered as an image in the original publication),
wherein $B_s = A^T Y^s P^s (P^s)^T$, $B_s^+ = (|B_s| + B_s)/2$, $B_s^- = (|B_s| - B_s)/2$, $E_s = A^T A V^s P^s (P^s)^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$;
S302, updating the target-domain data hidden factor $V^t$ according to the corresponding update formula (rendered as an image in the original publication),
wherein $B_t = A^T Y^t P^t (P^t)^T$, $B_t^+ = (|B_t| + B_t)/2$, $B_t^- = (|B_t| - B_t)/2$, $E_t = A^T A V^t P^t (P^t)^T$, $R = A^T A$, $R^+ = (|R| + R)/2$, $R^- = (|R| - R)/2$;
S303, updating the feature-dimension low-dimensional factor $U^s$ of the auxiliary domain (update formula rendered as an image in the original publication);
S304, updating the feature-dimension low-dimensional factor $U^t$ of the target domain (update formula rendered as an image in the original publication);
S305, updating the factor shared between the auxiliary domain and the target domain: updating the relational structure $H$ between the data-dimension hidden factors and the feature-dimension hidden factors according to the corresponding formula (rendered as an image in the original publication), where $I = \{s, t\}$;
S306, updating the regression coefficient A:
$$A = \left( \sum_{\pi \in I} Y^\pi P^\pi (V^\pi P^\pi)^T \right) \left( \sum_{\pi \in I} V^\pi P^\pi (V^\pi P^\pi)^T + \gamma I \right)^{-1}, \quad \text{where } I = \{s, t\} \text{ and } \gamma = \frac{\alpha}{\beta}.$$
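The S306 update of $A$ is a closed-form ridge-regression solve. It can be sketched as follows (illustrative only; the function name and argument order are invented):

```python
import numpy as np

def update_A(Ys, Ps, Vs, Yt, Pt, Vt, alpha, beta):
    """Closed-form regression-coefficient update of step S306:
    A = (sum_pi Y^pi P^pi (V^pi P^pi)^T)
        (sum_pi V^pi P^pi (V^pi P^pi)^T + gamma I)^{-1},  gamma = alpha/beta.
    P^pi is the diagonal 0/1 label-indicator matrix, so only labeled
    columns contribute to the sums.
    """
    gamma = alpha / beta
    k = Vs.shape[0]
    num = np.zeros((Ys.shape[0], k))
    den = gamma * np.eye(k)
    for Y, P, V in ((Ys, Ps, Vs), (Yt, Pt, Vt)):
        VP = V @ P
        num += Y @ P @ VP.T
        den += VP @ VP.T
    # A = num @ den^{-1}, computed via a linear solve instead of an inverse
    return np.linalg.solve(den.T, num.T).T
```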
2. The cross-data-domain transfer learning classification method based on the discriminative manifold according to claim 1, wherein S40 further comprises the following steps:
S401, performing class-label prediction on the target-domain documents with the obtained regression coefficient $A$ and the target-domain document hidden factor $V^t$, obtaining the predicted class labels of the target-domain documents:

$$\tilde{Y}^t = A V^t;$$
S402, determining the class of each datum as the row index of the largest element in the corresponding column of $\tilde{Y}^t$.
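The prediction rule of steps S401–S402 amounts to a matrix product followed by a column-wise argmax. A minimal illustrative sketch (function name invented):

```python
import numpy as np

def predict_labels(A, Vt):
    """Step S40 prediction: compute Y_hat = A V^t, then assign each datum
    (column) the class given by the row index of its largest entry."""
    Yt_hat = A @ Vt
    return np.argmax(Yt_hat, axis=0)
```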
CN201310113911.0A 2013-04-02 2013-04-02 Transfer learning classification method across data domains based on a discriminative manifold Expired - Fee Related CN103177114B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310113911.0A CN103177114B (en) 2013-04-02 2013-04-02 Transfer learning classification method across data domains based on a discriminative manifold


Publications (2)

Publication Number Publication Date
CN103177114A CN103177114A (en) 2013-06-26
CN103177114B true CN103177114B (en) 2016-01-27

Family

ID=48636975

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310113911.0A Expired - Fee Related CN103177114B (en) 2013-04-02 2013-04-02 Transfer learning classification method across data domains based on a discriminative manifold

Country Status (1)

Country Link
CN (1) CN103177114B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103473366B (en) * 2013-09-27 2017-01-04 浙江大学 A kind of various visual angles are across the sorting technique of data field picture material identification and device
CN103678580B (en) * 2013-12-07 2017-08-08 浙江大学 A kind of multitask machine learning method and its device for text classification
US11062792B2 (en) 2017-07-18 2021-07-13 Analytics For Life Inc. Discovering genomes to use in machine learning techniques
US11139048B2 (en) 2017-07-18 2021-10-05 Analytics For Life Inc. Discovering novel features to use in machine learning techniques, such as machine learning techniques for diagnosing medical conditions
CN107563452B (en) * 2017-09-18 2020-03-27 天津师范大学 Cross-domain foundation cloud picture classification method based on discriminant measure learning
CN109492094A (en) * 2018-10-15 2019-03-19 上海电力学院 A kind of mixing multidimensional property data processing method based on density
CN110411724B (en) * 2019-07-30 2021-07-06 广东工业大学 Rotary machine fault diagnosis method, device and system and readable storage medium
CN110928916B (en) * 2019-10-18 2022-03-25 平安科技(深圳)有限公司 Data monitoring method and device based on manifold space and storage medium
CN116538996B (en) * 2023-07-04 2023-09-29 云南超图地理信息有限公司 Laser radar-based topographic mapping system and method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100011025A1 (en) * 2008-07-09 2010-01-14 Yahoo! Inc. Transfer learning methods and apparatuses for establishing additive models for related-task ranking
US20110320387A1 (en) * 2010-06-28 2011-12-29 International Business Machines Corporation Graph-based transfer learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Transfer Learning with Graph Co-Regularization"; Long Mingsheng et al.; Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence; 2012-07-26; from page 2, right column, penultimate paragraph, to page 4, left column, "Algorithm 1" *


Similar Documents

Publication Publication Date Title
CN103177114B (en) Transfer learning classification method across data domains based on a discriminative manifold
CN110532542B (en) Invoice false invoice identification method and system based on positive case and unmarked learning
WO2021088499A1 (en) False invoice issuing identification method and system based on dynamic network representation
CN109461025B (en) Electric energy substitution potential customer prediction method based on machine learning
Kumar et al. Crime prediction using K-nearest neighboring algorithm
CN106778832B (en) The semi-supervised Ensemble classifier method of high dimensional data based on multiple-objection optimization
CN109962909B (en) Network intrusion anomaly detection method based on machine learning
CN109034205A (en) Image classification method based on the semi-supervised deep learning of direct-push
Li et al. Integrating ensemble-urban cellular automata model with an uncertainty map to improve the performance of a single model
CN110990718B (en) Social network model building module of company image lifting system
CN113590698A (en) Artificial intelligence technology-based data asset classification modeling and hierarchical protection method
CN109447110A (en) The method of the multi-tag classification of comprehensive neighbours' label correlative character and sample characteristics
Zhao et al. A review of macroscopic carbon emission prediction model based on machine learning
CN109951499A (en) A kind of method for detecting abnormality based on network structure feature
Chu et al. Co-training based on semi-supervised ensemble classification approach for multi-label data stream
Zhang Financial data anomaly detection method based on decision tree and random forest algorithm
CN103473366A (en) Classification method and device for content identification of multi-view cross data field image
CN103559642A (en) Financial data mining method based on cloud computing
CN112668633B (en) Adaptive graph migration learning method based on fine granularity field
Zhang et al. End‐to‐end generation of structural topology for complex architectural layouts with graph neural networks
Liu et al. Learning a similarity metric discriminatively with application to ancient character recognition
CN115099504B (en) Cultural relic security risk element identification method based on knowledge graph completion model
Wang et al. A Novel Multi‐Input AlexNet Prediction Model for Oil and Gas Production
CN116541594A (en) Journal recommendation method based on multi-granularity heterogeneous attribute graph comparison learning
Tang et al. Association Analysis of Abnormal Behavior of Electronic Invoice Based on K-Means and Skip-Gram

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160127

Termination date: 20200402