CN107958287A - Adversarial transfer learning method and system for cross-domain big data analysis - Google Patents
Adversarial transfer learning method and system for cross-domain big data analysis
- Publication number
- CN107958287A CN107958287A CN201711183974.8A CN201711183974A CN107958287A CN 107958287 A CN107958287 A CN 107958287A CN 201711183974 A CN201711183974 A CN 201711183974A CN 107958287 A CN107958287 A CN 107958287A
- Authority
- CN
- China
- Prior art keywords
- data
- tensor
- loss function
- preset depth
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Machine Translation (AREA)
Abstract
The present invention provides an adversarial transfer learning method and system for cross-domain big data analysis. The method includes: substituting the random multilinear fusion representation of each tensor in the tensor sets corresponding to the unlabeled data sets of the source domain and the target domain into the primary loss function of a discriminator, obtaining the current loss function of the discriminator, and adjusting the parameters of the discriminator by backpropagation so as to minimize the current loss function, which then serves as the current optimal loss function of the discriminator. Each tensor in a tensor set is the tensor product of the data vectors of all data layers in a preset data-layer set of a preset deep neural network. The parameters of the preset deep neural network are updated based on the current optimal loss function, and the next round of parameter updating is entered, until the parameters converge. In the preset deep neural network obtained by the present invention, the shift of the joint distribution over the multiple data layers in the preset data-layer set is reduced, so the network performs well when applied to the target domain.
Description
Technical field
The present invention relates to the field of data analysis technology, and in particular to an adversarial transfer learning method and system for cross-domain big data analysis.
Background art
In many machine learning tasks, deep neural network methods currently achieve the best results. However, a deep neural network attains good task performance through supervised training only when sufficiently abundant labeled data is available. To obtain a deep neural network that still performs well on the target task when the target domain has little labeled data, cross-domain learning is generally used: the abundant labeled data of a source domain is used to obtain the deep neural network for the target domain. However, under a deep neural network obtained from the source domain's abundant labeled data, there is a distribution shift between the source-domain data and the target-domain data, so the network performs poorly when it is applied to the target domain to complete the target task.
To address this problem, transfer learning methods are generally used: a discriminator is trained and used to adjust the parameters of the deep neural network so that, under the adjusted network, the distribution shift between the source-domain data and the target-domain data is reduced, and the network performs well when applied to the target domain to complete the target task. Among these, adversarial transfer learning is currently one of the most effective transfer learning methods. It builds the loss function of the discriminator from the data vector of a single data layer in the middle of the deep neural network, minimizes that loss function to obtain the parameters of the discriminator, then fixes those parameters and minimizes the difference between the loss function of the deep neural network and the loss function of the discriminator to obtain new parameters of the deep neural network, and repeats this adjustment of the deep neural network until convergence.
With adversarial transfer learning, however, a data-distribution shift between the source-domain data and the target-domain data may still remain at some data layers near the top of the adjusted network, so the network may still perform poorly when applied to the target domain to complete the target task. In particular, when the data distributions of the source domain and the target domain exhibit a complex multi-modal structure, a discriminator loss function built from the data vector of a single middle data layer may leave the adjusted network unable to capture the intricate distribution characteristics and align the distributions at a fine granularity, so that the distribution shift between the source domain and the target domain remains large and the network performs poorly on the target task.
Summary of the invention
The present invention provides an adversarial transfer learning method and system for cross-domain big data analysis, to overcome the problems of existing adversarial transfer learning methods: a data-distribution shift may still remain at some top data layers of the obtained deep neural network, and, when the data distributions of the source domain and the target domain exhibit a complex multi-modal structure, the obtained network may be unable to capture the intricate distribution characteristics and align the distributions at a fine granularity, so that the distribution shift remains large and performance on the target task is poor.
According to a first aspect of the present invention, an adversarial transfer learning method for cross-domain big data analysis is provided. The method includes the following steps. Step 1: input the unlabeled data sets of the source domain and the target domain into a preset deep neural network and perform forward propagation, obtaining the tensor sets corresponding to the two unlabeled data sets; each tensor in a tensor set is, with the corresponding unlabeled datum as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the preset deep neural network. Step 2: substitute the random multilinear fusion representation of each tensor in the two tensor sets into the primary loss function of a discriminator, obtaining the current loss function of the discriminator, and adjust the parameters of the discriminator by backpropagation so as to minimize the current loss function, which then serves as the current optimal loss function of the discriminator. Step 3: using backpropagation, minimize the loss function of the preset deep neural network on the source domain minus the product of a balance parameter and the current optimal loss function, obtaining new parameters of the preset deep neural network; update the network with the new parameters, perform forward propagation again, and update the parameters again, until the parameters converge. The balance parameter trades off the network's loss function on the source domain against the current optimal loss function.
Specifically, step 1 includes the following sub-steps. Step 11: input each unlabeled datum of the source-domain and target-domain unlabeled data sets in turn into the preset deep neural network and perform forward propagation, obtaining, for each unlabeled datum, the data vector of each data layer in the preset data-layer set of the network. Step 12: compute the tensor product of these data vectors and take it as the tensor corresponding to that unlabeled datum. Step 13: from the tensors corresponding to the unlabeled data of the source domain, obtain the tensor set corresponding to the source domain's unlabeled data set, and likewise, from the tensors corresponding to the unlabeled data of the target domain, obtain the tensor set corresponding to the target domain's unlabeled data set.
In step 1, the preset data-layer set is composed of some of the data layers at the top and in the middle of the preset deep neural network.
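Steps 11 to 13 above amount to forming an outer product of the per-layer activation vectors. A minimal NumPy sketch, with small toy vectors standing in for the data vectors of the preset data-layer set:

```python
import numpy as np

def layer_tensor(vectors):
    """Tensor (outer) product of the data vectors of all layers
    in the preset data-layer set, taken in order."""
    t = vectors[0]
    for v in vectors[1:]:
        t = np.tensordot(t, v, axes=0)  # outer product raises the tensor order by 1
    return t

# Two layer vectors of dimensions 3 and 4 give an order-2 tensor of shape (3, 4)
z1 = np.array([1.0, 2.0, 3.0])
z2 = np.array([1.0, 0.0, -1.0, 2.0])
t = layer_tensor([z1, z2])
print(t.shape)  # (3, 4)
```

Entry (i, j) of the result is z1[i] * z2[j]; with more layers in the set, the tensor order grows accordingly, which is why the patent replaces the raw tensor with a random multilinear fusion before it reaches the discriminator.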
In step 2, the discriminator is a fully connected deep neural network discriminator whose input is a d-dimensional vector and whose output lies in the interval [0, 1]. The primary loss function of the discriminator is set as:

L_D = -(1/n_s) Σ_{i=1..n_s} log D(R(z_i^s)) - (1/n_t) Σ_{i=1..n_t} log(1 - D(R(z_i^t)))

where X̃_s and X̃_t are the tensor sets corresponding to the unlabeled data sets of the source domain and the target domain respectively; z_i^s and z_i^t are the tensors of the i-th unlabeled datum of the source domain and the target domain respectively; n_s and n_t are the numbers of unlabeled data in the source-domain and target-domain unlabeled data sets respectively; and R(z_i^s) and R(z_i^t) are the random multilinear fusion representations of those tensors. With L the preset data-layer set, the fusion is

R(z) = (1/√d) ⊙_{l∈L} R_l z_l

where z is a tensor, ⊙ is the Hadamard product, R_l is a random matrix of dimension d × d_l, d_l is the dimension of data layer l, d is the dimension of the discriminator's input vector, and z_l is the data vector of data layer l.
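The random multilinear fusion R(z) = (1/√d) ⊙_{l∈L} R_l z_l can be sketched as follows. Gaussian random matrices are used here (one of the element distributions listed in the embodiment); the layer dimensions and d are purely illustrative:

```python
import numpy as np

def make_fusion(layer_dims, d, rng):
    """Build a random multilinear fusion map R(.) for the preset layer set.
    Each R_l is a d x d_l random matrix drawn once and then frozen."""
    mats = [rng.standard_normal((d, dl)) for dl in layer_dims]
    def fuse(layer_vectors):
        out = np.ones(d)
        for R_l, z_l in zip(mats, layer_vectors):
            out *= R_l @ z_l          # Hadamard product across layers
        return out / np.sqrt(d)      # R(z) = (1/sqrt d) * prod_l (R_l z_l)
    return fuse

rng = np.random.default_rng(0)
fuse = make_fusion([256, 10], d=64, rng=rng)
v = fuse([np.ones(256), np.ones(10)])
print(v.shape)  # (64,)
```

The fused vector lives in a fixed d-dimensional space regardless of how many layers are in the set, whereas the explicit tensor product would have 256 × 10 entries here; this is the computational saving the patent refers to.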
In step 3, the loss function of the preset deep neural network on the source domain is the average of the cross-entropy losses of all labeled data in the source domain's labeled data set.
The loss function of the preset deep neural network on the source domain is set as:

L_s(F) = (1/n_s) Σ_{i=1..n_s} J(F(x_i^s), y_i^s)

where n_s is the number of labeled data in the source domain's labeled data set, x_i^s is the feature vector of the i-th labeled datum of the source domain, F(·) is the decision function of the preset deep neural network, y_i^s is the label of the i-th labeled datum of the source domain, and J(·) is the cross-entropy loss function.
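The source-domain loss L_s is simply the mean cross-entropy over the labeled source set. A small sketch, assuming the decision function F(x) outputs per-class probabilities:

```python
import numpy as np

def source_loss(probs, labels):
    """Average cross-entropy J over the labeled source-domain set:
    L_s = (1/n_s) * sum_i J(F(x_i), y_i)."""
    n = len(labels)
    return -np.mean(np.log(probs[np.arange(n), labels]))

# Two source samples, 3 classes; rows are F(x_i) as class probabilities
p = np.array([[0.7, 0.2, 0.1],
              [0.1, 0.8, 0.1]])
y = np.array([0, 1])
print(round(source_loss(p, y), 4))  # 0.2899
```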
According to a second aspect of the present invention, an adversarial transfer learning system for cross-domain big data analysis is provided, comprising a tensor-set acquisition module, a module for acquiring the current optimal loss function of the discriminator, and an update module. The tensor-set acquisition module inputs the unlabeled data sets of the source domain and the target domain into a preset deep neural network and performs forward propagation, obtaining the tensor sets corresponding to the two unlabeled data sets; each tensor in a tensor set is, with the corresponding unlabeled datum as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the network. The module for acquiring the current optimal loss function of the discriminator substitutes the random multilinear fusion representation of each tensor in the two tensor sets into the primary loss function of the discriminator, obtaining the discriminator's current loss function, and adjusts the discriminator's parameters by backpropagation so as to minimize the current loss function, which then serves as the discriminator's current optimal loss function. The update module, using backpropagation, minimizes the difference between the network's loss function on the source domain and the weighted current optimal loss function, obtaining new network parameters; it updates the network with the new parameters, performs forward propagation again, and updates the parameters again, until the parameters converge. The weighted current optimal loss function is the current optimal loss function multiplied by the balance parameter between the network's source-domain loss function and the current optimal loss function.
According to a third aspect of the present invention, a computer program product is provided. The computer program product includes a computer program stored on a non-transitory computer-readable storage medium; the computer program includes program instructions which, when executed by a computer, cause the computer to perform the method of the first aspect.
According to a fourth aspect of the present invention, a non-transitory computer-readable storage medium is provided. The non-transitory computer-readable storage medium stores computer instructions which cause a computer to perform the method of the first aspect.
In the adversarial transfer learning method and system for cross-domain big data analysis proposed by the present invention, the tensor product of the data vectors of all data layers in the preset data-layer set of the preset deep neural network is taken as the tensor corresponding to an input datum; the random multilinear fusion representation of each tensor in the tensor sets of the source-domain and target-domain unlabeled data sets is substituted into the primary loss function of the discriminator to obtain the discriminator's current loss function; and the parameters of the preset deep neural network are then updated based on that loss function. As a result, the shift of the joint distribution over the multiple data layers in the preset data-layer set is reduced in the obtained network, which therefore performs well when applied to the target domain to complete the target task. At the same time, representing tensors by random multilinear fusion greatly reduces the subsequent amount of computation and accelerates the adversarial transfer learning process. In addition, the invention avoids the data-distribution shift that may remain at some top data layers of networks obtained by existing transfer learning methods, and it also avoids the situation in which, when the data distributions of the source domain and the target domain exhibit a complex multi-modal structure, such networks cannot capture the intricate distribution characteristics and align the distributions at a fine granularity, leaving a large distribution shift and poor performance on the target task.
Brief description of the drawings
Fig. 1 is a flowchart of an adversarial transfer learning method for cross-domain big data analysis according to an embodiment of the present invention;
Fig. 2 is a flowchart of obtaining the current loss function of the discriminator under the unlabeled data sets of the source domain and the target domain according to an embodiment of the present invention;
Fig. 3 is a flowchart of an adversarial transfer learning system for cross-domain big data analysis according to an embodiment of the present invention.
Detailed description of the embodiments
The embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples. The following examples are intended to illustrate the present invention but not to limit its scope.
As shown in Fig. 1, an embodiment of the present invention provides an adversarial transfer learning method for cross-domain big data analysis, the method comprising:

Step 1: input the unlabeled data sets of the source domain and the target domain into a preset deep neural network and perform forward propagation, obtaining the tensor sets corresponding to the two unlabeled data sets; each tensor in a tensor set is, with the corresponding unlabeled datum as input, the tensor product of the data vectors of all data layers in the preset data-layer set of the preset deep neural network.
In this embodiment, the preset deep neural network can be applied to many machine learning tasks, such as image recognition and speech recognition. Taking an unsupervised image-recognition transfer learning task as an example, the preset deep neural network can be AlexNet or ResNet implemented under the Caffe deep learning framework; to improve its performance, the network may first be pre-trained on the labeled ImageNet data set before transfer learning is carried out. The unlabeled data set of the target domain is the set of feature vectors of the images to be classified.
Specifically, in this embodiment, each unlabeled datum of the source-domain and target-domain unlabeled data sets is input in turn into the preset deep neural network and forward-propagated, obtaining the data vector of each data layer in the preset data-layer set of the network. The tensor product of these data vectors is computed and taken as the tensor corresponding to the input unlabeled datum. From the tensors corresponding to the unlabeled data of the source domain, the tensor set corresponding to the source domain's unlabeled data set is obtained, and likewise for the target domain. The preset data-layer set is composed of some of the data layers at the top and in the middle of the preset deep neural network; for example, when the preset deep neural network is AlexNet, the preset data-layer set is a fixed selection of its layers, and likewise when the preset deep neural network is ResNet.
The process of inputting the unlabeled data sets of the source domain and the target domain into the preset deep neural network, forward-propagating, and obtaining the data vectors of the two unlabeled data sets under each data layer of the preset data-layer set is shown in Fig. 2, where X_s is the unlabeled data set of the source domain, X_t is the unlabeled data set of the target domain, Z_si is the data vector of the source domain's unlabeled data set under the i-th data layer of the preset data-layer set, and Z_ti is the corresponding data vector for the target domain.
Step 2: substitute the random multilinear fusion representation of each tensor in the two tensor sets into the primary loss function of a discriminator, obtaining the current loss function of the discriminator, and adjust the parameters of the discriminator by backpropagation so as to minimize the current loss function, which then serves as the current optimal loss function of the discriminator.
In this embodiment, the discriminator is a fully connected deep neural network discriminator whose input is a d-dimensional vector and whose output lies in the interval [0, 1]. The primary loss function of the discriminator is set as:

L_D = -(1/n_s) Σ_{i=1..n_s} log D(R(z_i^s)) - (1/n_t) Σ_{i=1..n_t} log(1 - D(R(z_i^t)))

where X̃_s and X̃_t are the tensor sets corresponding to the unlabeled data sets of the source domain and the target domain respectively; z_i^s and z_i^t are the tensors of the i-th unlabeled datum of the source domain and the target domain respectively; n_s and n_t are the numbers of unlabeled data in the source-domain and target-domain unlabeled data sets respectively; R(z_i^s) and R(z_i^t) are the random multilinear fusion representations of those tensors; L is the preset data-layer set, and R(z) = (1/√d) ⊙_{l∈L} R_l z_l, where z is a tensor, ⊙ is the Hadamard product, R_l is a random matrix of dimension d × d_l, d_l is the dimension of data layer l, d is the dimension of the discriminator's input vector, and z_l is the data vector of data layer l. Each element of the random matrix R_l must follow a distribution satisfying the required moment condition; usable element distributions include the Gaussian distribution, the Bernoulli distribution, and the uniform distribution.
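Under the standard binary adversarial reading of a [0, 1]-output discriminator (source fused features labeled 1, target fused features labeled 0), the discriminator loss above can be sketched as follows; the tiny linear discriminator and 4-dimensional fused vectors are only illustrations:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def discriminator_loss(D, fused_src, fused_tgt):
    """Binary adversarial loss over fused source/target features:
    L_D = -(1/n_s) sum log D(R(z_s)) - (1/n_t) sum log(1 - D(R(z_t)))."""
    ps = np.array([D(v) for v in fused_src])
    pt = np.array([D(v) for v in fused_tgt])
    return -np.mean(np.log(ps)) - np.mean(np.log(1.0 - pt))

# Toy linear discriminator on 4-dim fused vectors
w = np.array([0.5, -0.25, 0.1, 0.0])
D = lambda v: sigmoid(w @ v)
src = [np.ones(4), np.zeros(4)]
tgt = [-np.ones(4)]
loss = discriminator_loss(D, src, tgt)
print(loss > 0)  # True
```

In step 2 the discriminator's parameters (here, w) are adjusted to minimize this loss; in step 3 the network is adjusted in the opposite direction, which is what aligns the fused source and target distributions.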
The process of substituting the tensors of the source-domain and target-domain unlabeled data sets under each data layer of the preset data-layer set into the primary loss function of the discriminator and obtaining the discriminator's current loss function is shown in Fig. 2, where R_i is the random matrix corresponding to the i-th data layer of the preset data-layer set, ⊙ is the Hadamard product, D is the current loss function of the discriminator, Y_s is the output set corresponding to the source domain's unlabeled data set, and Y_t is the output set corresponding to the target domain's unlabeled data set.
In this embodiment, after the current loss function of the discriminator is obtained, the parameters of the discriminator are adjusted by backpropagation so as to minimize the current loss function, which then serves as the discriminator's current optimal loss function; mini-batch stochastic gradient descent is used, with the momentum set to 0.9. In each iteration of stochastic gradient descent, the learning rate is η_p = η_0 / (1 + α·p)^β, where the parameter p rises linearly from 0 to 1 with the iteration number, η_0 = 0.01, α = 10, and β = 0.75.
Step 3: using backpropagation, minimize the loss function of the preset deep neural network on the source domain minus the product of the balance parameter and the current optimal loss function, obtaining new parameters of the preset deep neural network; update the network with the new parameters and perform forward propagation again to update the parameters again, until the parameters converge. The balance parameter trades off the network's loss function on the source domain against the current optimal loss function.
In this embodiment, the loss function of the preset deep neural network on the source domain is the average of the cross-entropy losses of all labeled data in the source domain's labeled data set, and is set as:

L_s(F) = (1/n_s) Σ_{i=1..n_s} J(F(x_i^s), y_i^s)

where n_s is the number of labeled data in the source domain's labeled data set, x_i^s is the feature vector of the i-th labeled datum of the source domain, F(·) is the decision function of the preset deep neural network, y_i^s is the label of the i-th labeled datum of the source domain, and J(·) is the cross-entropy loss function.
In this embodiment, the loss function of the preset deep neural network on the source domain minus the product of the balance parameter and the current optimal loss function is:

L_s(F) − λ·L_D*

where λ is the balance parameter between the network's loss function on the source domain and the current optimal loss function, and L_D* is the current optimal loss function.
In this embodiment, while minimizing, by backpropagation, the difference between the network's loss function on the source domain and the weighted current optimal loss function, mini-batch stochastic gradient descent is used with the momentum set to 0.9. In each iteration of stochastic gradient descent, the learning rate is η_p = η_0 / (1 + α·p)^β, where the parameter p rises linearly from 0 to 1 with the iteration number, η_0 = 0.01, α = 10, and β = 0.75. The balance parameter λ is in turn set as a gradually changing parameter, λ_p = 2 / (1 + exp(−δ·p)) − 1, where δ = 10. In this process, all convolutional data layers, pooling data layers, and classifier data layers of the preset deep neural network F(x) are adjusted, and the learning rate of the classifier data layer is 10 times that of the other data layers.
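The gradual balance parameter λ_p and the step-3 objective can be sketched together. The sigmoid-style ramp below is a common schedule consistent with δ = 10 and with λ rising from 0 at the start of training; the objective function is the one stated above, with the discriminator term entering with a minus sign:

```python
import math

def balance_lambda(p, delta=10.0):
    """Gradual trade-off weight: lambda_p = 2 / (1 + exp(-delta * p)) - 1,
    rising from 0 at the start of training toward 1 at the end."""
    return 2.0 / (1.0 + math.exp(-delta * p)) - 1.0

def network_objective(source_loss, disc_loss, p):
    """Objective minimized over the network parameters in step 3:
    L_s(F) - lambda_p * L_D*. Subtracting the discriminator term means that
    lowering the objective pushes the fused source/target features together."""
    return source_loss - balance_lambda(p) * disc_loss

print(balance_lambda(0.0))            # 0.0: source loss dominates early on
print(round(balance_lambda(1.0), 4))  # close to 1 near the end of training
```

Ramping λ from 0 lets the network first fit the labeled source data before the adversarial term starts to dominate the update.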
In the adversarial transfer learning method for cross-domain big data analysis proposed by the present invention, the tensor product of the data vectors of all data layers in the preset data-layer set of the preset deep neural network is taken as the tensor corresponding to an input datum; the random multilinear fusion representation of each tensor in the tensor sets of the source-domain and target-domain unlabeled data sets is substituted into the primary loss function of the discriminator to obtain the discriminator's current loss function; and the parameters of the preset deep neural network are then updated based on that loss function. As a result, the shift of the joint distribution over the multiple data layers in the preset data-layer set is reduced in the obtained network, which therefore performs well when applied to the target domain to complete the target task. At the same time, representing tensors by random multilinear fusion greatly reduces the subsequent amount of computation and accelerates the adversarial transfer learning process. In addition, the invention avoids the data-distribution shift that may remain at some top data layers of networks obtained by existing transfer learning methods, and it also avoids the situation in which, when the data distributions of the source domain and the target domain exhibit a complex multi-modal structure, such networks cannot capture the intricate distribution characteristics and align the distributions at a fine granularity, leaving a large distribution shift and poor performance on the target task.
As shown in Fig. 3, an embodiment of the present invention provides an adversarial transfer learning system for cross-domain big data analysis, comprising: a tensor-set acquisition module, a module for acquiring the current optimal loss function of the discriminator, and an update module. The tensor-set acquisition module inputs the unlabeled data sets of the source domain and the target domain into a preset deep neural network and performs forward propagation, obtaining the tensor sets corresponding to the two unlabeled data sets; each tensor in a tensor set is, with the corresponding unlabeled datum as input, the tensor product of the data vectors of all data layers in the preset data-layer set of the network. The module for acquiring the current optimal loss function of the discriminator substitutes the random multilinear fusion representation of each tensor in the two tensor sets into the primary loss function of the discriminator, obtaining the discriminator's current loss function, and adjusts the discriminator's parameters by backpropagation so as to minimize the current loss function, which then serves as the discriminator's current optimal loss function. The update module, using backpropagation, minimizes the difference between the network's loss function on the source domain and the weighted current optimal loss function, obtaining new network parameters; it updates the network with the new parameters, performs forward propagation again, and updates the parameters again, until the parameters converge. The weighted current optimal loss function is the current optimal loss function multiplied by the balance parameter between the network's source-domain loss function and the current optimal loss function.
In the adversarial transfer learning system for cross-domain big data analysis proposed by the present invention, the tensor product of the data vectors of all data layers in the preset data-layer set of the preset deep neural network is taken as the tensor corresponding to an input datum; the random multilinear fusion representation of each tensor in the tensor sets of the source-domain and target-domain unlabeled data sets is substituted into the primary loss function of the discriminator to obtain the discriminator's current loss function; and the parameters of the preset deep neural network are then updated based on that loss function. As a result, the shift of the joint distribution over the multiple data layers in the preset data-layer set is reduced in the obtained network, which therefore performs well when applied to the target domain to complete the target task. At the same time, representing tensors by random multilinear fusion greatly reduces the subsequent amount of computation and accelerates the adversarial transfer learning process. In addition, the invention avoids the data-distribution shift that may remain at some top data layers of networks obtained by existing transfer learning methods, and it also avoids the situation in which, when the data distributions of the source domain and the target domain exhibit a complex multi-modal structure, such networks cannot capture the intricate distribution characteristics and align the distributions at a fine granularity, leaving a large distribution shift and poor performance on the target task.
An embodiment of the present invention provides a computer program product. The computer program product comprises a computer program stored on a non-transitory computer-readable storage medium; the computer program comprises program instructions which, when executed by a computer, enable the computer to perform the method provided by the above method embodiments, for example comprising: inputting the unlabeled data set of the source domain and the unlabeled data set of the target domain into a preset deep neural network and forward-propagating them, to obtain the tensor set corresponding to the source-domain unlabeled data set and the tensor set corresponding to the target-domain unlabeled data set, where each tensor in a tensor set is, for the corresponding unlabeled datum taken as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the preset deep neural network; substituting the random multilinear fusion representation of each tensor in the two tensor sets into the original loss function of a discriminator to obtain the current loss function of the discriminator, and using backpropagation to adjust the parameters of the discriminator so as to minimize the current loss function, yielding the current best loss function of the discriminator; and, using backpropagation, minimizing the difference between the loss function of the preset deep neural network on the source domain and the trade-off term of the current best loss function, to obtain new parameters of the preset deep neural network, updating the parameters of the preset deep neural network with the new parameters, and performing the forward propagation again to update the parameters again, until the parameters converge. The trade-off term of the current best loss function is the balance parameter between the network's source-domain loss function and the current best loss function, multiplied by the current best loss function.
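The alternating objective described in this paragraph can be sketched numerically. This is a hedged illustration, not the patent's code: it assumes the discriminator loss is the usual binary cross-entropy, with source fused tensors pushed toward output 1 and target toward 0, and `lam` standing in for the balance parameter; the function names are invented for the example.

```python
import numpy as np

def discriminator_loss(d_src, d_tgt, eps=1e-12):
    """Binary cross-entropy the discriminator minimizes: drive its outputs
    on source-domain fused tensors toward 1 and on target-domain fused
    tensors toward 0 (assumed convention)."""
    return -np.mean(np.log(d_src + eps)) - np.mean(np.log(1.0 - d_tgt + eps))

def network_objective(source_task_loss, best_disc_loss, lam):
    """The feature network minimizes its source-domain task loss minus
    lam times the discriminator's current best loss, i.e. it is rewarded
    for making the two domains indistinguishable."""
    return source_task_loss - lam * best_disc_loss
```

In each round, the discriminator's parameters are adjusted to minimize `discriminator_loss`, and the network's parameters are then adjusted to minimize `network_objective`; the forward pass is repeated until the parameters converge.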
An embodiment of the present invention provides a non-transitory computer-readable storage medium. The non-transitory computer-readable storage medium stores computer instructions that cause the computer to perform the method provided by the above method embodiments, for example comprising: inputting the unlabeled data set of the source domain and the unlabeled data set of the target domain into a preset deep neural network and forward-propagating them, to obtain the tensor set corresponding to the source-domain unlabeled data set and the tensor set corresponding to the target-domain unlabeled data set, where each tensor in a tensor set is, for the corresponding unlabeled datum taken as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the preset deep neural network; substituting the random multilinear fusion representation of each tensor in the two tensor sets into the original loss function of a discriminator to obtain the current loss function of the discriminator, and using backpropagation to adjust the parameters of the discriminator so as to minimize the current loss function, yielding the current best loss function of the discriminator; and, using backpropagation, minimizing the difference between the loss function of the preset deep neural network on the source domain and the trade-off term of the current best loss function, to obtain new parameters of the preset deep neural network, updating the parameters of the preset deep neural network with the new parameters, and performing the forward propagation again to update the parameters again, until the parameters converge. The trade-off term of the current best loss function is the balance parameter between the network's source-domain loss function and the current best loss function, multiplied by the current best loss function.
Those of ordinary skill in the art will appreciate that all or part of the steps of the above method embodiments may be completed by hardware driven by program instructions. The aforementioned program may be stored in a computer-readable storage medium; when executed, the program performs the steps of the above method embodiments. The aforementioned storage medium includes various media capable of storing program code, such as ROM, RAM, magnetic disks, or optical disks.
From the above description of the embodiments, those skilled in the art can clearly understand that each embodiment may be realized by software plus a necessary general-purpose hardware platform, or, of course, by hardware. Based on this understanding, the above technical solution, or in other words the part of it that contributes beyond the prior art, may be embodied in the form of a software product. The computer software product may be stored in a computer-readable storage medium, such as ROM/RAM, a magnetic disk, or an optical disk, and includes instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform the method described in each embodiment, or in some parts of an embodiment.
Finally, it should be noted that the above embodiments merely illustrate the technical solutions of the present invention and do not limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.
Claims (9)
- 1. An adversarial transfer learning method for cross-domain big data analysis, characterized by comprising:

Step 1: inputting the respective unlabeled data sets of a source domain and a target domain into a preset deep neural network and forward-propagating them, to obtain the tensor sets corresponding to the respective unlabeled data sets of the source domain and the target domain; each tensor in a tensor set is, for the corresponding unlabeled datum taken as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the preset deep neural network;

Step 2: substituting the random multilinear fusion representation of each tensor in the tensor sets corresponding to the respective unlabeled data sets of the source domain and the target domain into the original loss function of a discriminator, to obtain the current loss function of the discriminator, and using backpropagation to adjust the parameters of the discriminator so as to minimize the current loss function, yielding the current best loss function of the discriminator;

Step 3: using backpropagation, minimizing the loss function of the preset deep neural network on the source domain minus the product of a balance parameter and the current best loss function, to obtain new parameters of the preset deep neural network; updating the parameters of the preset deep neural network with the new parameters and performing the forward propagation again to update the parameters again, until the parameters converge; the balance parameter is the balance parameter between the source-domain loss function of the preset deep neural network and the current best loss function.
- 2. The method according to claim 1, characterized in that step 1 specifically comprises:

Step 11: sequentially inputting each unlabeled datum in the respective unlabeled data sets of the source domain and the target domain into the preset deep neural network and forward-propagating it, to obtain, for each unlabeled datum, the data vector of each data layer in the preset data-layer set of the preset deep neural network;

Step 12: computing the tensor product of those data vectors, and taking the tensor product as the tensor corresponding to that unlabeled datum;

Step 13: collecting the tensors corresponding to the data in the source-domain unlabeled data set to obtain the tensor set corresponding to the source-domain unlabeled data set, and collecting the tensors corresponding to the data in the target-domain unlabeled data set to obtain the tensor set corresponding to the target-domain unlabeled data set.
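Steps 11 and 12 of claim 2 amount to taking the outer (tensor) product of the selected layers' data vectors. A minimal sketch, with names invented for illustration rather than taken from the patent:

```python
from functools import reduce
import numpy as np

def layer_tensor(layer_vectors):
    """Tensor product of the data vectors of all layers in the preset
    data-layer set: for vectors of dimensions d1, d2, ..., the result
    is an array of shape (d1, d2, ...)."""
    return reduce(np.multiply.outer, layer_vectors)
```

For two layers of dimensions 2 and 3 the tensor has shape (2, 3); the size grows multiplicatively with the number of fused layers, which is why the later claims fall back on the random multilinear fusion representation to keep computation tractable.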
- 3. The method according to claim 1, characterized in that, in step 1, the preset data-layer set consists of some data layers in the top layers and intermediate layers of the preset deep neural network.
- 4. The method according to claim 1, characterized in that, in step 2, the discriminator is a fully connected deep-neural-network discriminator whose input is a $d$-dimensional vector and whose output lies in the interval $[0, 1]$;

the original loss function of the discriminator is set as

$$\min_{D}\; -\frac{1}{n_s}\sum_{i=1}^{n_s}\log\!\big[D\big(T_\odot(z_i^s)\big)\big] \;-\; \frac{1}{n_t}\sum_{i=1}^{n_t}\log\!\big[1 - D\big(T_\odot(z_i^t)\big)\big],$$

wherein $\mathcal{Z}^s$ and $\mathcal{Z}^t$ are the tensor sets corresponding to the unlabeled data sets of the source domain and the target domain respectively, $z_i^s$ and $z_i^t$ are the tensors of the $i$-th unlabeled datum of the source domain and the target domain respectively, $n_s$ and $n_t$ are the numbers of unlabeled data in the unlabeled data sets of the source domain and the target domain respectively, and $T_\odot(z_i^s)$ and $T_\odot(z_i^t)$ are the random multilinear fusion representations of the tensors of the $i$-th unlabeled datum of the source domain and the target domain respectively; with $\mathcal{L}$ the preset data-layer set,

$$T_\odot(z) = \frac{1}{\sqrt{d}}\; \bigodot_{l\in\mathcal{L}} \big(R_l z_l\big),$$

where $z$ is a tensor, $\odot$ is the Hadamard product, $R_l$ is a random matrix of dimension $d \times d_l$, $d_l$ is the dimension of data layer $l$, $d$ is the dimension of the discriminator's input vector, and $z_l$ is the data vector of data layer $l$.
- 5. The method according to claim 1, characterized in that, in step 3, the loss function of the preset deep neural network on the source domain is the mean of the cross-entropy losses of all labeled data in the labeled data set of the source domain.
- 6. The method according to claim 5, characterized in that the loss function of the preset deep neural network on the source domain is set as

$$\frac{1}{n_s}\sum_{i=1}^{n_s} J\big(F(x_i^s),\, y_i^s\big),$$

wherein $n_s$ is the number of labeled data in the labeled data set of the source domain, $x_i^s$ is the feature vector of the $i$-th labeled datum of the source domain, $F(\cdot)$ is the decision function of the preset deep neural network, $y_i^s$ is the label of the $i$-th labeled datum of the source domain, and $J(\cdot)$ is the cross-entropy loss function.
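The source-domain loss of claim 6 is the mean cross-entropy over the labeled source set. A short sketch, under the assumption that the decision function outputs a class-probability vector per sample; the function name and array layout are illustrative:

```python
import numpy as np

def source_domain_loss(probs, labels, eps=1e-12):
    """(1/n_s) * sum_i J(F(x_i^s), y_i^s), with J the cross-entropy:
    probs[i] is the predicted class distribution F(x_i^s) and
    labels[i] is the integer label y_i^s."""
    n_s = len(labels)
    # Pick out the predicted probability of each sample's true class,
    # then average the negative log-likelihoods.
    return -np.mean(np.log(probs[np.arange(n_s), labels] + eps))
```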
- 7. An adversarial transfer learning system for cross-domain big data analysis, characterized by comprising: a tensor-set acquisition module, a discriminator current-best-loss-function acquisition module, and an update module;

the tensor-set acquisition module is configured to input the unlabeled data set of a source domain and the unlabeled data set of a target domain into a preset deep neural network and forward-propagate them, to obtain the tensor set corresponding to the source-domain unlabeled data set and the tensor set corresponding to the target-domain unlabeled data set; each tensor in a tensor set is, for the corresponding unlabeled datum taken as input, the tensor product of the data vectors of all data layers in a preset data-layer set of the preset deep neural network;

the discriminator current-best-loss-function acquisition module is configured to substitute the random multilinear fusion representation of each tensor in the two tensor sets into the original loss function of a discriminator, to obtain the current loss function of the discriminator, and to use backpropagation to adjust the parameters of the discriminator so as to minimize the current loss function, yielding the current best loss function of the discriminator;

the update module is configured to use the backpropagation to minimize the difference between the loss function of the preset deep neural network on the source domain and the trade-off term of the current best loss function, to obtain new parameters of the preset deep neural network, to update the parameters of the preset deep neural network with the new parameters, and to perform the forward propagation again to update the parameters again, until the parameters converge; the trade-off term of the current best loss function is the balance parameter between the source-domain loss function of the preset deep neural network and the current best loss function, multiplied by the current best loss function.
- 8. A computer program product, characterized in that the computer program product comprises a computer program stored on a non-transitory computer-readable storage medium, the computer program comprising program instructions which, when executed by a computer, cause the computer to perform the method according to any one of claims 1 to 6.
- 9. A non-transitory computer-readable storage medium, characterized in that the non-transitory computer-readable storage medium stores computer instructions that cause the computer to perform the method according to any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711183974.8A CN107958287A (en) | 2017-11-23 | 2017-11-23 | Towards the confrontation transfer learning method and system of big data analysis transboundary |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711183974.8A CN107958287A (en) | 2017-11-23 | 2017-11-23 | Towards the confrontation transfer learning method and system of big data analysis transboundary |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107958287A true CN107958287A (en) | 2018-04-24 |
Family
ID=61961754
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711183974.8A Pending CN107958287A (en) | 2017-11-23 | 2017-11-23 | Towards the confrontation transfer learning method and system of big data analysis transboundary |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107958287A (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109165562A (en) * | 2018-07-27 | 2019-01-08 | 深圳市商汤科技有限公司 | Training method, crosswise joint method, apparatus, equipment and the medium of neural network |
CN109948645A (en) * | 2019-01-23 | 2019-06-28 | 西安交通大学 | A kind of enterprise based on depth confrontation transfer learning evades the tax recognition methods |
CN110234085A (en) * | 2019-05-23 | 2019-09-13 | 深圳大学 | Based on the indoor location fingerprint to anti-migration network drawing generating method and system |
CN110489661A (en) * | 2019-07-24 | 2019-11-22 | 武汉大学 | A kind of social networks prediction technique based on generation confrontation network and transfer learning |
WO2019228358A1 (en) * | 2018-05-31 | 2019-12-05 | 华为技术有限公司 | Deep neural network training method and apparatus |
CN110751207A (en) * | 2019-10-18 | 2020-02-04 | 四川大学 | Fault diagnosis method for anti-migration learning based on deep convolution domain |
CN110827258A (en) * | 2019-10-31 | 2020-02-21 | 西安交通大学 | Screening device and method for diabetic retinopathy based on semi-supervised antagonistic learning |
CN111488979A (en) * | 2019-01-28 | 2020-08-04 | 斯特拉德视觉公司 | Method and apparatus for continuously learning neural network for analyzing input data on device |
CN112313044A (en) * | 2018-06-15 | 2021-02-02 | 谷歌有限责任公司 | Deep reinforcement learning for robotic manipulation |
CN113869342A (en) * | 2020-06-30 | 2021-12-31 | 微软技术许可有限责任公司 | Mark offset detection and adjustment in predictive modeling |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107958287A (en) | Towards the confrontation transfer learning method and system of big data analysis transboundary | |
CN109960759B (en) | Recommendation system click rate prediction method based on deep neural network | |
US11551098B2 (en) | Generative adversarial network-based optimization method and application | |
EP3570222B1 (en) | Information processing device and method, and computer readable storage medium | |
Pietrzak et al. | Framework of Strategic Learning: The PDCA Cycle. | |
Chandra et al. | DIVACE: Diverse and accurate ensemble learning algorithm | |
CN107766794A (en) | The image, semantic dividing method that a kind of Fusion Features coefficient can learn | |
CN104933428B (en) | A kind of face identification method and device based on tensor description | |
CN110168572A (en) | Information processing method, information processing unit, computer readable storage medium | |
CN106203628A (en) | A kind of optimization method strengthening degree of depth learning algorithm robustness and system | |
US11526733B2 (en) | Neural network execution block and transfer learning | |
CN112560991A (en) | Personalized federal learning method based on hybrid expert model | |
CN107665339A (en) | A kind of method changed by neural fusion face character | |
Pal et al. | Selection of optimal set of weights in a layered network using genetic algorithms | |
Whitelam et al. | Training neural networks using Metropolis Monte Carlo and an adaptive variant | |
Liu et al. | Adaptive transfer learning for PINN | |
CN109948589A (en) | Facial expression recognizing method based on quantum deepness belief network | |
Nam et al. | Predicting airline passenger volume | |
CN109117946A (en) | Neural computing handles model | |
CN112884160A (en) | Meta learning method and related device | |
Lacko | From perceptrons to deep neural networks | |
Didandeh et al. | AVLR-EBP: a variable step size approach to speed-up the convergence of error back-propagation algorithm | |
Toro-Zapata et al. | Conditions on the energy market diversification from adaptive dynamics | |
Siu | Automatic induction of neural network decision tree algorithms | |
Abrahamson | A nested decomposition approach for solving staircase structured linear programs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180424 |
RJ01 | Rejection of invention patent application after publication |