CN108229692A

CN108229692A - A kind of machine learning recognition methods based on double contrast's study

Info

Publication number: CN108229692A
Application number: CN201810128018.8A
Authority: CN
Inventors: 徐传运; 许洲; 张杨
Original assignee: Chongqing University of Technology
Current assignee: Chongqing Maoqiao Technology Co.,Ltd.
Priority date: 2018-02-08
Filing date: 2018-02-08
Publication date: 2018-06-29
Anticipated expiration: 2038-02-08
Also published as: CN108229692B

Abstract

The present invention provides a kind of machine learning recognition methods based on double contrast's study, and a certain amount of multi-medium data sample of known class can be utilized to put in order using different contrast sample's inputs to machine learning model f₁The learning training of multiple differentiated is carried out, to carry out multi-medium data classification identifying processing, machine learning model f₁It is designed as the built-up pattern framework of convolutional neural networks model or/and full Connection Neural Network model, significantly reduce the dependence to magnanimity training sample, and it can easily extend to carrying out classification identification without the multi-medium data classification of learning training, existing multimedia data classification machine learning recognition methods is solved well because the dependence to a large amount of training samples and because leading to practical application when directly can not carry out Classification and Identification to the classification without learning training, the problem of versatility is limited, it more can effectively be applied to extensively in more specific multimedia data classification use occasions.

Description

A kind of machine learning recognition methods based on double contrast's study

Technical field

The present invention relates to multimedia data processing and machine learning techniques field more particularly to one kind based on dual right Than the machine learning recognition methods of study.

Background technology

Multimedia (Multimedia) is the synthesis of media, in computer systems, multimedia refer to combination two kinds or A kind of man-machine interactive information interchange of two or more media and communications media, the media used include word, picture, photo, Interaction function that sound, animation and film and formula are provided etc..

With the arrival in big data epoch, the classification of mass multimedia data and digging technology are particularly important.In sea It measures in data mining, how to utilize the information classified and excavated from data with existing to instruct the classification of new data and excavation A new research hotspot is become.Particularly when the sample size of certain tasks is less, can have using multi-task learning The reduction mass data classification of effect and the time cost excavated simultaneously improve acquisition of information accuracy.For example, in face of being known based on face Other access control system of residential community development task, if the facial image of each owner is respectively divided into an independent image data class Not, it is necessary to which the Classification and Identification to facial image is realized in system processing, judges that collected facial image is to belong at current gate inhibition Which who in the face (judging image data classification belonged to) of owner, and then judge whether to release gate inhibition.

A kind of effective, robust information classification approach is proved to be based on deep learning method in practice.Depth nerve Network (such as depth convolutional neural networks) is most representative machine learning method.Deep learning model usually has tens of layers The data analysis layer that can learn, have it is hundreds thousand of, even millions of can be with learning parameter.Since quantity of parameters composition is extremely huge Studying space, optimal model parameter in order to obtain, it usually needs a large amount of training data.But in order to train depth Practise model, it is necessary to which structure possesses the training dataset of great amount of samples, and usual training samples number is more than tens thousand of.However, structure Such training set is extremely difficult in practical applications, and costs dearly.For example, in face of based on the small of recognition of face Area's access control system development task, if the facial image of each owner is respectively divided into an independent image data classification, When carrying out Classification and Identification training to machine learning model, if necessary to acquire ten hundreds of facial images for each owner Training sample is very unpractical.Deep learning method is caused many to the greediness of big data which results in depth model Field is all difficult to obtain concrete application, is difficult with reliable technology realizability in other words.

When deep learning method is used for classification task, traditional deep learning method requirement disaggregated model contrast sample's Class must be identical with producing the class of sample, i.e. model can only classify the class learnt, if there is the sample of new class needs to classify, Must re -training machine learning model or machine learning model is done some adaptability training study.For example, in face of base In the access control system of residential community development task of recognition of face, if the facial image of each owner is respectively divided into an independent figure As data category, using current deep learning method, it is required for carrying out study instruction to the facial image of each current owner Practice；When there is a new owner to occur, even if the facial image of the new owner is added directly into identification contrast sample's data In library, due to not carrying out learning training to the facial image of the new owner before machine learning model, at gate inhibition again It is secondary collect the new owner facial image when, machine learning model still can not be based on contrast sample's database in the new owner Face image data and Direct Classification identifies the new owner.This also results in the machine learning mould based on deep learning method The training of type needs to consume a large amount of training computing resource and longer training learning time, limits it in practical application field Ease of use and versatility in conjunction.

Invention content

For above-mentioned deficiency in the prior art, present invention solves the technical problem that being that how to provide one kind is based on The machine learning recognition methods of double contrast's study needs to solve existing multimedia data classification machine learning recognition methods The problem of relying on a large amount of training sample and leading to practical application, further solves existing multimedia data classification machine The problem of device study recognition methods directly can not carry out Classification and Identification to the classification without learning training and versatility is caused to be limited.

In order to solve the above technical problems, present invention employs following technological means：

Based on the machine learning recognition methods of double contrast's study, among the multi-medium data of multiple and different known class Target identification sample and contrast sample are chosen, as a machine learning model f₁Input, to machine learning model f₁It is learned Training is practised, and then utilizes the machine learning model f after learning training₁Classification identification is carried out to multi-medium data to be identified；It is described Machine learning model f₁Including the first sub- learning model f_DPWith the second sub- learning model f_DE, the first sub- learning model f_DPFor Convolutional neural networks model or full Connection Neural Network model, the second sub- learning model f_DEFor convolutional neural networks model or Full Connection Neural Network model；Selected contrast sample includes more than two different classes of multiple multi-medium datas, and Setting contrast sample is input to machine learning model f₁Input put in order, and according to contrast sample input put in order, will Target identification sample is combined with contrast sample with presetting rule of combination, is consequently formed and is remained with contrast sample input row Each multi-medium data sample group cooperation is learnt mould by multiple data samples combination of row Cahn-Ingold-Prelog sequence rule for the described second son respectively Type f_DEInput, and by each corresponding second sub- learning model f_DEOutput put in order rule according to contrast sample input It then sorts and forms a data vector, as the described first sub- learning model f_DPInput vector, and it is described first son study mould Type f_DPResult vector of the output vector as the machine learning model；Learn from there through training so that learning training institute The machine learning model f obtained₁Each result vector element in the result vector of output to characterize target identification sample with The correlation between contrast sample's generic on corresponding arrangement ordinal position, so as to utilize the more of known class Media data sample is put in order using different contrast sample's inputs to the machine learning model f₁Carry out multiple study Training.

In the above-mentioned machine learning recognition methods based on double contrast's study, preferably, as machine learning mould Type f₁The target identification sample of input is one or more, and belongs to same category；

If as machine learning model f₁The target identification sample of input is one, by target identification sample and comparative sample When this is combined with presetting rule of combination, the presetting rule of combination is one kind among following manner：

Rule of combination mode is 1.：The target identification sample is established into combinations of pairs between each contrast sample respectively Relationship carries out combinations of pairs respectively；

Rule of combination mode is 2.：Each contrast sample is first subjected to category division, then by the target identification sample Syntagmatic is established between the contrast sample of each classification respectively, is respectively combined；

If as machine learning model f₁The target identification sample of input is multiple, by target identification sample and comparative sample When this is combined with presetting rule of combination, the presetting rule of combination is one kind among following manner：

Rule of combination mode a：Each target identification sample is established into matched group between each contrast sample respectively Conjunction relationship carries out combinations of pairs respectively；

Rule of combination mode b：Each contrast sample is first subjected to category division, then by each target identification sample Syntagmatic is established between the contrast sample of each classification respectively, is respectively combined；

Rule of combination mode c：Target complete identification sample is established between each contrast sample respectively as a whole Combinations of pairs relationship carries out combinations of pairs respectively；

Rule of combination mode d：Each contrast sample is first subjected to category division, then makees target complete identification sample Syntagmatic is established between the contrast sample of each classification respectively to be whole, is respectively combined.

In the above-mentioned machine learning recognition methods based on double contrast's study, preferably, to machine learning mould Type f₁During carrying out learning training, the target identification sample and contrast sample are from preset multi-medium data sample database It is chosen, chooses the multi-medium data conduct of a part of known class included in the multi-medium data sample database every time Target identification sample and contrast sample are to machine learning model f₁Learning training is carried out, and several times from the multi-medium data sample Target identification sample and contrast sample are chosen in this library to machine learning model f₁Learning training is carried out, to ensure target identification sample The selection of this and contrast sample traverse each multi-medium data classification included in the multi-medium data sample database, and are directed to Each multi-medium data classification in multi-medium data sample database has been performed both by contrast sample's selection operation of at least H times, H be with Frequency threshold value is chosen in the training of setting.

It is preferably, described to utilize study instruction in the above-mentioned machine learning recognition methods based on double contrast's study Machine learning model f after white silk₁To multi-medium data to be identified carry out classification identification concrete mode be：

The multi-medium data as object to be identified is obtained as sample to be identified and from multiple and different known class The contrast sample chosen among multi-medium data, as the machine learning model f after learning training₁Input, selected pair More than two different classes of multiple multi-medium datas are included, and contrast sample is set to be input to machine learning model than sample f₁Input put in order, and put in order according to contrast sample's input, by sample to be identified and contrast sample with presetting Rule of combination is combined, and is consequently formed and is remained with multiple data samples combination that contrast sample's input puts in order regular, point It is the described second sub- learning model f not by each multi-medium data sample group cooperation_DEInput, and by it is each it is corresponding second son learn Practise model f_DEOutput form a data vector according to the contrast sample input rule compositor that puts in order, as described the One sub- learning model f_DPInput vector, and the first sub- learning model f_DPOutput vector as the machine learning mould The result vector of type；In classification identification process, the machine learning model f₁Each result in the result vector of output to Secondary element is related between sample to be identified and contrast sample's generic on corresponding arrangement ordinal position to characterize Property, so as to determine the generic of sample to be identified according to the correlation.

In the above-mentioned machine learning recognition methods based on double contrast's study, preferably, acquired is to be identified Sample is one or more, and belongs to same category；

If it is input to machine learning model f₁Sample to be identified for one, by sample to be identified with contrast sample with pre- When the rule of combination of setting is combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode is 1.：The sample to be identified is established combinations of pairs respectively to close between each contrast sample System, carries out combinations of pairs respectively；

Rule of combination mode is 2.：Each contrast sample is first subjected to category division, then by the sample to be identified point Syntagmatic is not established between the contrast sample of each classification, is respectively combined；

If it is input to machine learning model f₁Sample to be identified to be multiple, by sample to be identified with contrast sample with pre- When the rule of combination of setting is combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode a：Each sample to be identified is established into combinations of pairs between each contrast sample respectively Relationship carries out combinations of pairs respectively；

Rule of combination mode b：Each contrast sample is first subjected to category division, then by each sample to be identified point Syntagmatic is not established between the contrast sample of each classification, is respectively combined；

Rule of combination mode c：All samples to be identified between each contrast sample are established respectively as a whole and are matched To syntagmatic, combinations of pairs is carried out respectively；

Rule of combination mode d：Each contrast sample is first subjected to category division, it then will all sample conducts to be identified It is whole to establish syntagmatic between the contrast sample of each classification respectively, it is respectively combined.

If the sample to be identified obtained is multiple, it may be used and be input to the machine learning model f in batches₁It carries out Identifying processing is input to machine learning model f in batches₁Concrete mode be one kind among following manner：

Input mode is 1. in batches：Whole contrast samples are formed a sample with each sample to be identified respectively to input Set；Multiple sample input set are consequently formed, by several times as the machine learning model f₁Input；

Input mode is 2. in batches：Each contrast sample is first subjected to category division；Then it is selected from each classification A contrast sample is taken, then chooses a sample to be identified, forms a sample input set；Multiple sample inputs are consequently formed Set, by several times as the machine learning model f₁Input；

Input mode is 3. in batches：Each contrast sample is first subjected to category division；Then it is selected from each classification A contrast sample is taken, together with all samples to be identified, forms a sample input set；Multiple sample inputs are consequently formed Set, by several times as the machine learning model f₁Input；

Input mode is 4. in batches：Whole contrast samples are formed a sample input with all samples to be identified to gather, As the machine learning model f₁Input.

In the above-mentioned machine learning recognition methods based on double contrast's study, preferably, according to machine learning mould Type f₁The concrete mode that the result vector repeatedly exported carries out classification identifying processing is one kind among following manner：

Repeatedly export classification identification method 1.：Each result vector element in the result vector of each output of Statistical Comparison, The highest result vector element of degree of correlation that correlation is embodied is found out, by the arrangement corresponding to the result vector element Contrast sample's generic on ordinal position is determined as sample generic to be identified；

Repeatedly export classification identification method 2.：By machine learning model f₁The result vector of each output adds up, and obtains To accumulation result vector, so as to the correlation that each result vector element is characterized in accumulation result vector described in Statistical Comparison, look for Go out the highest result vector element of degree of correlation that correlation is embodied, the arrangement corresponding to the result vector element is suitable Contrast sample's generic that tagmeme is put is determined as sample generic to be identified.

It is above-mentioned based on double contrast study machine learning recognition methods in, preferably, the contrast sample from Chosen in preset multi-medium data sample database, choose every time to as machine learning model f₁The comparative sample of input This categorical measure L is less than the categorical measure S of the multi-medium data of known class included in the multi-medium data sample database, L and S is the integer more than 1, and needs repeatedly to choose contrast sample from the multi-medium data sample database respectively as machine Device learning model f₁Input, carry out multiple classification identifying processing to the sample identified, with ensure the selection of contrast sample traverse Each multi-medium data classification included in the multi-medium data sample database, and for every in multi-medium data sample database A multi-medium data classification has been performed both by contrast sample's selection operation of at least K times, and K is that number threshold is chosen in the identification with setting Value；Then, Statistical Comparison machine learning model f₁Each result vector in the result vector that each secondary classification identifying processing is exported Element finds out the highest result vector element of degree of correlation that correlation is embodied, will be corresponding to the result vector element Put in order on position contrast sample's generic be determined as sample generic to be identified.

Compared with the prior art, the invention has the advantages that：

1st, the present invention is based on the machine learning recognition methods of double contrast's study, can utilize a certain amount of of known class Multi-medium data sample is put in order using different contrast sample's inputs to machine learning model f₁Carry out multiple differentiated Learning training can carry out a large amount of learning training to reach expected using a small amount of training sample to machine learning model Classification recognition effect so as to significantly reduce the dependence to magnanimity training sample, solves existing multimedia data classification machine The problem of device study recognition methods needs to rely on a large amount of training sample and lead to practical application.

2nd, the present invention is based on the machine learning recognition methods of double contrast's study, even for some multi-medium data class Not without learning training, as long as the multi-medium data sample of the multi-medium data classification is added to identification contrast sample's data In library, when sample to be identified is the multi-medium data of the category, machine learning model f₁The result vector of output still can Embody the otherness between the sample to be identified and other different classes of contrast samples and the comparative sample with the same category Correlation between this, so as to can still determine the generic of sample to be identified according to the correlation, so as to convenient Extension to carrying out classification identification without the multi-medium data classification of learning training, can solve because can not be directly to the non-study of Confucian classics The problem of practising the classification progress Classification and Identification of training and versatility caused to be limited.

3rd, the present invention is based on the machine learning recognition methods of double contrast's study, in the process for carrying out classification identifying processing In, may be used the classification identifying processing mode that data are locally chosen allow choose every time to as machine learning model f₁It is defeated The contrast sample's categorical measure L entered is less than the multi-medium data of known class included in the multi-medium data sample database Categorical measure S, then by repeatedly choosing contrast sample respectively as machine learning model f₁Input to the sample identified into The mode of the multiple classification identifying processing of row, to reduce machine learning model f₁Each classification identification processing procedure performs data operation The data volume of processing avoids machine learning model f₁The problem for the treatment of effeciency is too low or can not perform effectively processing.

4th, the present invention is based on the machine learning recognition methods of double contrast's study, existing multi-medium data is solved well Sorting machine learns recognition methods because of the dependence to a large amount of training samples and because can not be directly to the classification without learning training The problem of carrying out Classification and Identification and causing practical application, versatility limited more can effectively be applied to more extensively In specific multimedia data classification use occasion, there is wide technology application and promotion prospect.

Description of the drawings

Fig. 1 is the flow diagram of machine learning training process in machine learning recognition methods of the present invention.

Fig. 2 is the flow diagram of another machine learning training process in machine learning recognition methods of the present invention.

Fig. 3 is the flow diagram of multi-medium data classification identification process in machine learning recognition methods of the present invention.

Specific embodiment

It needs to rely on a large amount of training sample for existing multimedia data classification machine learning recognition methods and cause The problem of practical application, needs the recognition principle from existing machine learning recognition methods to be analyzed, and discovery causes to ask The reason of topic occurs.Existing sorting machine learns recognition methods, typically by sample to be identified and the comparative sample of known class This progress individually compares, and calculates the similarity between sample to be identified and contrast sample or calculates sample to be identified and comparison Difference distance value between sample, to judge whether sample to be identified belongs to same category with contrast sample, so as to fulfill treating Identify the classification identification of sample.Such machine learning recognition methods, applies the application scenarios identified in multimedia data classification In, easily limited by technology application：

On the one hand, because in multi-medium data, all there may be larger data differences for same category of data sample；Example Such as, for the access control system of residential community development task based on recognition of face, if the facial image of each owner is respectively divided The image data classification independent for one needs system processing to realize the Classification and Identification to facial image, but even same position The facial image of owner, also easily because ambient light, shooting angle, owner itself dressing dressing situations such as difference and exist Image difference, and the training sample under each ambient light, shooting angle, dressing dressing cond, only for identical Sample Similarity to be identified or difference distance value under cond calculate and identification judges there is direct help, which results in Need owner's facial image largely under the conds such as varying environment light, different shooting angles, different dressing dressing As training sample and identification correction data, learning training is carried out to machine learning model, just can guarantee preferable recognition of face Effect so as to increase the operation difficulty for training model learning in practical application, causes to limit to technology application.

On the other hand, in existing sorting machine study recognition methods, it is different from sample generic to be identified it is each its His class training sample is difficult to symbolize otherness for the learning training influence of the specimen discerning result to be identified；For example, it faces For access control system of residential community development task based on recognition of face, if the facial image of each owner is respectively divided into one solely Vertical image data classification is identified or during learning training, contrast sample's database for the facial image of an owner In other arbitrary owners facial image, for the owner to be identified facial image identification or training result be similarity not Foot or difference distance are larger；Therefore a large amount of non-similar contrast sample is for the identification of sample to be identified or learning training result Significant differentiated can not be brought to influence, this also results in and is only capable of for the identification of sample to be identified or learning training indirectly Enough dependent on generic contrast sample, dependence of the machine learning recognition methods to magnanimity training sample is increased.

Correspondingly, based on above-mentioned both sides limitation reason, this also results in another as a result, being directed to multimedia number According in Classification and Identification application scenarios, for not passing through the data category of learning training, existing machine learning recognition methods pair The sample to be identified of the category carries out effective classification identification.

Above-mentioned analysis result is directed to, based on the technological thought to solve the above problems, the present invention proposes a kind of based on double The machine learning recognition methods of study is compared again, which employs the learning training mode different from the prior art to machine Device learning model is trained, as shown in Figure 1, choosing target identification sample among the multi-medium data of multiple and different known class This R and contrast sample a, as a machine learning model f₁Input, to machine learning model f₁Learning training is carried out, and then Utilize the machine learning model f after learning training₁Classification identification is carried out to multi-medium data to be identified；The machine learning model f₁Including the first sub- learning model f_DPWith the second sub- learning model f_DE, the first sub- learning model f_DPFor convolutional neural networks Model or full Connection Neural Network model, the second sub- learning model f_DEConnect for convolutional neural networks model or full nerve net Network model；Selected contrast sample includes more than two different classes of multiple multi-medium datas, and sets contrast sample It is input to machine learning model f₁Input put in order, such as in Fig. 1, by contrast sample input put in order list it is multiple Contrast sample is respectively labeled as a₁、a₂、…、a_n, n is denoted as machine learning model f₁Contrast sample's quantity of input, and according to Contrast sample's input puts in order, by target identification sample R and contrast sample a₁、a₂、…、a_nWith presetting rule of combination into Row combination, such as in Fig. 1 simple example be by target identification sample R by contrast sample input put in order respectively with comparative sample This₁、a₂、…、a_nIt is combined, is consequently formed and remains with multiple data sample groups that contrast sample inputs the rule that puts in order It closes, is respectively the described second sub- learning model f by each multi-medium data sample group cooperation_DEInput, and by each corresponding Two sub- learning model f_DEOutput DE₁、DE₂、…、DE_nThe rule compositor that puts in order is inputted according to the contrast sample and forms one Data vector, as the described first sub- learning model f_DPInput vector A, and the first sub- learning model f_DPOutput to Measure the result vector C as the machine learning model；Learn from there through training so that the machine learning obtained by learning training Model f₁Each result vector element c in the result vector C of output_i∈ C (i ∈ 1,2 ..., n }) know to characterize target Very originally and accordingly arrange a contrast sample a on ordinal position_iCorrelation between (i ∈ { 1,2 ..., n }) generic Property, so as to be put in order using the multi-medium data sample of known class using different contrast sample's inputs to the machine Device learning model f₁Carry out multiple learning training.For example, as shown in Fig. 2, using n contrast sample identical in Fig. 1, but it is logical It crosses setting change contrast sample's input to put in order so that a originally₁The contrast sample of ordinal position is adjusted to a₄Ordinal position, So as to machine learning model f₁Carry out different learning trainings.

The present invention is based on double contrast study machine learning recognition methods, compared with the prior art for, employ not With the technical implementation way of thinking, target identification sample and contrast sample are used as by the multi-medium data for choosing known class, It is input to machine learning model f together₁Learning training is carried out, selected contrast sample is needed comprising more than two inhomogeneities Other multiple multi-medium datas, the difference between different classes of multiple contrast samples on input puts in order can be embodied The opposite sex.Simultaneously as machine learning model f₁Learning training target be so that learning training obtained by machine learning model f₁It is defeated Each result vector element in the result vector gone out is arranged with corresponding on ordinal position to characterize target identification sample Correlation between one contrast sample's generic, also, due to machine learning model f₁It is designed as including the first son study Model f_DPWith the second sub- learning model f_DEBuilt-up pattern framework, the first sub- learning model f therein_DPWith the second son study mould Type f_DECan residual error can be selected with selected as convolutional neural networks model or full Connection Neural Network model, convolutional neural networks Neural network model (Residual Neural Network, be abbreviated as ResNet), intensive convolutional network model (DenseConvolutional Network, be abbreviated as DenseNet) etc., full Connection Neural Network can then select this field The more common neural network model with full articulamentum of technical staff；Using including the first sub- learning model f_DPWith the second son Learning model f_DEBuilt-up pattern framework form machine learning model f₁, during learning training, by target identification sample After (classification identifying processing phase should be sample to be identified) is combined with contrast sample, formation remains with contrast sample and inputs row Multiple data samples combination of row Cahn-Ingold-Prelog sequence rule is input to the second sub- learning model f_DE, obtained output is according to the comparative sample This input put in order rule compositor form a data vector, as the described first sub- learning model f_DPInput vector, from And by the first sub- learning model f_DPOutput is as machine learning model f₁Result vector, thereby guarantee that machine learning model f₁It is defeated Pair for remaining and being inputted with contrast sample between putting in order that put in order of each result vector element in the result vector gone out It should be related to, and due to the first sub- learning model f_DPWith the second sub- learning model f_DEIt can be with selected as convolutional neural networks model Or full Connection Neural Network model, therefore each result vector element is suitable by contrast sample's input arrangement in result vector The influence of sequence, so that the machine learning model f of training gained₁Each result vector element in the result vector of output The correlation characterized, all putting in order with contrast sample's input, there are relevance influences.Since in this way, in contrast sample with Target identification sample belongs to difference of the multi-medium data of the same category on contrast sample's input puts in order, can be to engineering Practise model f₁Learning training result generate different influence, therefore, belong to each of the same category with target identification sample Multi-medium data is in sample as a comparison in use, can be by adjusting its sequence in contrast sample's input puts in order Position, to machine learning model f₁Carry out the learning training of multiple differentiated.Meanwhile in contrast sample with target identification sample Difference of the multi-medium data to belong to a different category on contrast sample's input puts in order, also can be to machine learning model f₁'s Learning training result generates different influences, therefore, each multi-medium data to belong to a different category with target identification sample In sample as a comparison in use, can also by adjusting its contrast sample input put in order in ordinal position, repeatedly It participates in machine learning model f₁Differentiated learning training.A certain amount of multimedia of known class just can be utilized as a result, Data sample is put in order using different contrast sample's inputs to machine learning model f₁Carry out the study instruction of multiple differentiated Practice, a large amount of learning training can be carried out to machine learning model using a small amount of training sample and known to reach expected classification Other effect so as to significantly reduce the dependence to magnanimity training sample, solves existing multimedia data classification machine learning The problem of recognition methods needs to rely on a large amount of training sample and lead to practical application.

In specific application, in the machine learning recognition methods using the present invention to machine learning model f₁Carry out study instruction In experienced process, target identification sample and contrast sample are chosen from preset multi-medium data sample database, are chosen every time The multi-medium data of a part of known class included in the multi-medium data sample database is as target identification sample and right Than sample to machine learning model f₁Learning training is carried out, and chooses target from the multi-medium data sample database several times and knows Very this and contrast sample are to machine learning model f₁Learning training is carried out, to ensure the choosing of target identification sample and contrast sample Each multi-medium data classification included in the traversal multi-medium data sample database is taken, and for multi-medium data sample database In each multi-medium data classification be performed both by contrast sample's selection operation of at least H times, H is that the training with setting is chosen time Number threshold value.The multi-medium data for choosing a part of known class included in multi-medium data sample database every time is known as target Very this and contrast sample are to machine learning model f₁Learning training is carried out, is a kind of learning training processing side locally chosen Formula.It is performed because if the multi-medium data overall situation of classifications whole included in multi-medium data sample database is chosen to machine Learning model f₁Learning training processing, be easy to cause comparison operational data amount it is huge, operation efficiency is too low, also, if machine Device learning model f₁Neural network level it is excessive, be also easy to lead to machine learning model f₁There can not be so a large amount of data to hold The effective calculation process of row.Therefore, more matchmakers of a part of known class included in multi-medium data sample database are chosen every time Volume data is as target identification sample and contrast sample to machine learning model f₁Learning training is carried out, then by repeatedly choosing Target identification sample and contrast sample are taken to machine learning model f₁The mode of learning training is carried out, to reduce machine learning model f₁Each learning training processing procedure performs the data volume of data operation processing, avoids machine learning model f₁Treatment effeciency is too low Or the problem of processing can not be performed effectively；But this classification identifying processing mode locally chosen, it is possible to multimedia number occur According to the multi-medium data training included in sample database be underutilized during learning training the problem of, in view of this, And due to machine learning model f₁Each result vector element in the result vector of output is inputted by contrast sample The influence to put in order, thus the other contrast sample of same class on different contrast sample's inputs put in order for target The interdependence effects of identification sample can have differences, and may influence to machine learning model f₁Learning training as a result, being therefore Ensure as far as possible to machine learning model f₁Learning training effect, learning training process carry out ensure target identification sample and The selection of contrast sample traverses each multi-medium data classification included in the multi-medium data sample database, and for more matchmakers Each multi-medium data classification in volume data sample database has been performed both by contrast sample's selection operation of at least H times, and H is and setting Training choose frequency threshold value, the specific value of H can determine according to practical application experience.

Utilize the machine learning model f obtained by the machine learning training program learning training₁, just can be used in multimedia Data carry out classification identification.Specifically, utilize the machine learning model f after learning training₁To multi-medium data to be identified into Row classification identification concrete mode be：As shown in figure 3, the multi-medium data as object to be identified is obtained as sample to be identified R_xAnd the contrast sample a chosen among the multi-medium data of multiple and different known class, as the machine after learning training Learning model f₁Input, selected contrast sample includes more than two different classes of multiple multi-medium datas, and sets It puts contrast sample and is input to machine learning model f₁Input put in order, such as in Fig. 3, by contrast sample input put in order The multiple contrast samples listed are respectively labeled as a₁、a₂、…、a_n, n is denoted as machine learning model f₁The contrast sample of input Quantity, and put in order according to contrast sample's input, by sample R to be identified_xWith contrast sample a₁、a₂、…、a_nWith presetting Rule of combination is combined, such as simple example is by sample R to be identified in Fig. 3_xIt puts in order point by contrast sample's input Not with contrast sample a₁、a₂、…、a_nIt is combined, is consequently formed and remains with multiple numbers that contrast sample inputs the rule that puts in order It is combined according to sample, is respectively the described second sub- learning model f by each multi-medium data sample group cooperation_DEInput, and will be each Corresponding second sub- learning model f_DEOutput DE₁、DE₂、…、DE_nThe rule compositor that puts in order is inputted according to the contrast sample A data vector is formed, as the described first sub- learning model f_DPInput vector A, and the first sub- learning model f_DP Result vector C of the output vector as the machine learning model；In classification identification process, the machine learning model f₁It is defeated Each result vector element c in the result vector C gone out_i∈ C (i ∈ { 1,2 ..., n }) are characterizing sample R to be identified_xWith A contrast sample a on corresponding arrangement ordinal position_iCorrelation between (i ∈ { 1,2 ..., n }) generic, so as to To determine sample R to be identified according to the correlation_xGeneric；If for example, machine learning model f obtained by learning training₁Its is defeated Result vector element c in the result vector C gone out_iValue it is smaller show with accordingly arrange ordinal position on contrast sample belonging to Degree of correlation between classification is higher, then as shown in figure 3, determining sample R to be identified in identification_xGeneric y_xWhen, result vector Contrast sample a on the corresponding arrangement ordinal position of one result vector element of C intermediate values minimum_iAffiliated classification y_i, it is possible to It is determined as being sample R to be identified_xGeneric, i.e.,i∈{1,2,…,n}。

The above-mentioned machine in application, in the machine learning recognition methods that the present embodiment is learnt based on double contrast is embodied Learning training process and multi-medium data classification identification processing procedure can be loaded into machine by being configured after computer programming In the processor for learning identification device so that its processor is configured as performing the machine learning of above-mentioned machine learning training flow Training program or the multi-medium data classification recognizer for performing above-mentioned multi-medium data classification identifying processing flow.Based on this Machine learning classification identification device designed by the machine learning recognition methods of invention has common technical characterstic and skill naturally Art advantage.

In machine learning recognition methods of the present invention and its specific implementation of device, to cause the machine obtained by learning training Learning model f₁Each result vector element in the result vector of output arranges to characterize target identification sample with corresponding The correlation between contrast sample's generic on ordinal position when specific training operates, is easily accomplished；Example Such as, in the training process, whether be directed to target identification sample with inputting a contrast sample on putting in order is same class Not, in machine learning model f₁The result vector element to put in order accordingly in the result vector of output on position assigns Preset correlation desired value, such as the same category imparting positive correlation desired value (such as being assigned a value of " 0 "), and different classes of tax Give negative correlation desired value (such as being assigned a value of " 1 "), then trained by machine learning, machine learning model f₁It just being capable of acquistion For whether the correlation of the same category is distinguished between target identification sample and contrast sample, in the result vector by its output Each result vector element characterize belonging to target identification sample and a contrast sample on corresponding arrangement ordinal position Correlation between classification.Thus after training, when performing the classification identifying processing to multi-medium data, machine learning mould Type f₁Each result vector element in the result vector of output just can be good at differentiated symbolize sample to be identified with The correlation between contrast sample's generic on corresponding arrangement ordinal position, wherein closer to positive correlation desired value Result vector element, corresponding to the element sorting position in result vector contrast sample input puts in order on position Contrast sample's generic, it is possible to be determined to be the generic of sample to be identified.For example, shown in FIG. 1 It practises in training flow, target identification sample R and contrast sample input put in order the contrast sample a of the 1st₁Belong to mutually similar Not, therefore the result vector element c of the 1st of putting in order in result vector is assigned₁Value for " 0 " represent positive correlation it is expected Value, the corresponding result vector element of remaining different classes of contrast sample's ordinal position are assigned a value of " 1 " and represent negative correlation desired value； And in learning training flow shown in Fig. 2, target identification sample R and contrast sample input put in order the comparative sample of the 4th This₄Belong to the same category, therefore assign the result vector element c of the 1st of putting in order in result vector₄Value be " 0 " represent Positive correlation desired value, the corresponding result vector element of remaining different classes of contrast sample's ordinal position are assigned a value of " 1 " and represent negative Correlation desired value.

It should be noted that in the machine learning recognition methods of the present invention, the target identification sample that is obtained in learning training Originally it can be one, may be multiple, and need to belong to same category；Equally, it is targeted in classification identifying processing Sample to be identified may be one, may be multiple, but be equally also required to belong to same category.And it is directed to target knowledge Very this (or sample to be identified) is one or more different situations, and machine learning recognition methods of the invention is in concrete application In implementation, it is related to the factor of several respects, it is also desirable to which a point different situation is illustrated.

Wherein, the factor of first aspect, in learning training process or classification identification processing procedure, by target identification sample (mutually should be sample to be identified in classification identification processing procedure) is input to machine learning model f with contrast sample₁When, it needs with pre- The rule of combination of setting is combined, combined treatment as progress, is on the one hand to enable to the multiple data to be formed Sample combination remains with contrast sample and inputs the rule that puts in order, and is on the other hand to just establish target in data input level Identification sample (or sample to be identified) inputs being associated with and contact between putting in order contrast sample between and with contrast sample, This is also the important technology difference of machine learning recognition methods of the present invention compared with the prior art.In data input level just Establish target identification sample (or sample to be identified) between contrast sample and with contrast sample input put in order between Association contact after, pass through machine learning model f₁Processing output, output result vector in each result to Secondary element just no longer only has with target identification sample (or sample to be identified) and the similitude of a contrast sample therebetween It closes, pass of the target identification sample (or sample to be identified) among also being combined with each data sample of composition between contrast sample Connection property is related, and passes through machine learning model f₁Full connection calculation process effect so that each result vector element also with It is related that each data sample as input combines retained contrast sample's input rule that puts in order, so as to preferably ensure that Machine learning model f₁Each result vector element in the result vector of output to characterize target identification sample with it is corresponding Correlation between the contrast sample's generic to put in order on position.

Exactly by means of by target identification sample (mutually should be sample to be identified in classification identification processing procedure) with comparison Machine learning model f is input to after sample combination₁Carry out the specially treated mode of learning training or classification identifying processing so that this The machine learning recognition methods that invention is learnt based on double contrast can also have the data category for not passing through learning training Standby classification recognition capability.Because the machine learning model f obtained after being trained using the method for the present invention₁To multi-medium data to be identified When carrying out classification identification, machine learning model f₁A result vector element in the result vector of output not only with sample to be identified This is related with the similitude of a contrast sample therebetween, the target among also more being combined with each data sample of composition Identification sample (or sample to be identified) relevance between contrast sample and combine institute with each data sample as input Reservation contrast sample input put in order rule it is related, therefore, even for some multi-medium data classification without Training is practised, as long as the multi-medium data sample of the multi-medium data classification is added in identification contrast sample's database, when treating When identifying the multi-medium data that sample is the category, machine learning model f₁The result vector of output can still embody this and treat Identify the otherness between the sample contrast sample different classes of with other and the phase between the contrast sample of the same category Guan Xing, so as to can still determine the generic of sample to be identified according to the correlation.Therefore, the present invention is based on double contrasts The machine learning recognition methods of study can be easily extended to carrying out classification without the multi-medium data classification of learning training Identification can solve to cause what versatility was limited to ask because directly can not carry out Classification and Identification to the classification without learning training due to Topic.

Meanwhile just because of need to establish target identification sample (or sample to be identified) between contrast sample and with it is right Association contact between putting in order than sample input, thus the quantity of target identification sample (or sample to be identified) for one or For under multiple different situations, specific combination also can different from.

If as machine learning model f₁The target identification sample (or sample to be identified) of input is one, is known by target When very this (or sample to be identified) is combined with contrast sample with presetting rule of combination, the presetting combination rule It is then one kind among following manner：

Rule of combination mode is 1.：By the target identification sample (or sample to be identified) respectively with each contrast sample it Between establish combinations of pairs relationship, carry out combinations of pairs respectively；

Rule of combination mode is 2.：Each contrast sample is first subjected to category division, then by the target identification sample (or sample to be identified) establishes syntagmatic between the contrast sample of each classification respectively, is respectively combined.

If as machine learning model f₁The target identification sample (or sample to be identified) of input is multiple, is known by target When very this (or sample to be identified) is combined with contrast sample with presetting rule of combination, the presetting combination rule It is then one kind among following manner：

Rule of combination mode a：By each target identification sample (or sample to be identified) respectively with each contrast sample Between establish combinations of pairs relationship, carry out combinations of pairs respectively；

Rule of combination mode b：Each contrast sample is first subjected to category division, then by each target identification sample (or sample to be identified) establishes syntagmatic between the contrast sample of each classification respectively, is respectively combined；

Rule of combination mode c：Target complete identification sample (or sample to be identified) is right with each respectively as a whole Than establishing combinations of pairs relationship between sample, combinations of pairs is carried out respectively；

Rule of combination mode d：Each contrast sample is first subjected to category division, target complete is then identified into sample (or sample to be identified) establishes syntagmatic between the contrast sample of each classification respectively as a whole, carries out group respectively It closes.

Combinations of the above regular fashion is 1. and rule of combination mode a is by each target identification sample (or sample to be identified This) combinations of pairs relationship is established between each contrast sample respectively, combinations of pairs is carried out respectively, and such rule of combination can To form the data sample combination as much as possible for remaining with contrast sample's input and putting in order regular, for learning training processing For flow, quantity data sample combination as much as possible is conducive to put in order by changing different contrast sample's inputs And be used for execution differentiated different more times and think learning training, for hoisting machine learning model f₁Training effect have centainly It helps.And rule of combination mode 2. and processing rule b, c, d be then by target complete identification sample (or sample to be identified) work , for an entirety (belonging to a classification) or using each contrast sample of each classification as an entirety then carry out respectively again Combination, the multiple data samples combination being consequently formed can not only retain contrast sample and input the rule that puts in order, and will be complete Each comparison of the portion's target identification sample (or sample to be identified) as an entirety (belonging to a classification) or by each classification Sample closed as one data sample group of an overall structure in component part, a data sample group being consequently formed close into Enter machine learning model f₁During middle progress calculation process, calculation process process is equivalent to the data for incorporating respective classes sample Common feature, therefore have certain help for common feature Division identification rate of the raising between different classes of.

The factor of second aspect in classification identification processing procedure, is getting multiple same category of samples to be identified It needs to carry out classification identifying processing, while in the case that contrast sample is there is also multiple classifications and multiple quantity, may be used point Batch is input to machine learning model f₁Mode processing is identified；During concrete operations, it is input to machine learning mould in batches Type f₁Concrete mode one kind among following manner may be used：

Correspondingly, using machine learning model f is input in batches₁Carry out the processing mode of classification identification, each batch Input will all obtain a result vector, therefore according to machine learning model f₁The result vector repeatedly exported carries out classification knowledge Other places reason concrete mode can also in the following way among one kind：

Wherein, repeatedly output classification identification method be 1. directly according to each time export result vector in whole results to Secondary element carries out correlation statistics comparison, finds out highest one of degree of correlation to determine sample generic to be identified.It is and more 2. the secondary classification identification method that exports is then just to compare the result vector of each output with the correlation statistics that add up later again, find out Highest one of degree of correlation determines sample generic to be identified.In comparison, classification identification method 2. phase is repeatedly exported When then to being input to machine learning model f in batches₁Each secondary result vector output after classification identifying processing is carried out to carry out The comprehensive consideration of cumulative mean for classification identification method is repeatedly exported 1., is more conducive to avoid because of accidental error The wrong situation of sample class identification to be identified, helps to ensure that better recognition accuracy caused by and.

The factor of the third aspect, during classification identifying processing is carried out using machine learning recognition methods of the present invention, Contrast sample can be chosen, and in specific application from preset multi-medium data sample database, can be operated every time Choose to as machine learning model f₁Contrast sample's categorical measure L of input is less than in the multi-medium data sample database Comprising the categorical measure S, L and S of multi-medium data of known class be integer more than 1, and need from more matchmakers Contrast sample is repeatedly chosen in volume data sample database respectively as machine learning model f₁Input, to the sample identified carry out it is more Secondary classification identifying processing, to ensure that the selection of contrast sample traverses each more matchmakers included in the multi-medium data sample database Volume data classification, and it has been performed both by at least comparison of K times for each multi-medium data classification in multi-medium data sample database Sample selection operation, K are that frequency threshold value is chosen in the identification with setting.Allow choose every time to as machine learning model f₁It is defeated The contrast sample's categorical measure L entered is less than the multi-medium data of known class included in the multi-medium data sample database Categorical measure S is a kind of classification identifying processing mode locally chosen.Because if included in multi-medium data sample database The multi-medium data overall situations of whole classifications choose and perform classification identifying processing to the sample identified, be easy to cause comparison operation Data volume is huge, and operation efficiency is too low, also, if machine learning model f₁Neural network level it is excessive, be also easy to cause Machine learning model f₁There can not be so a large amount of data to perform effective calculation process.Therefore, allow choose every time making For machine learning model f₁Contrast sample's categorical measure L of input is less than known included in the multi-medium data sample database The categorical measure S of the multi-medium data of classification, then by repeatedly choosing contrast sample respectively as machine learning model f₁'s Input carries out the mode of multiple classification identifying processing to the sample identified, to reduce machine learning model f₁At each classification identification Reason process performs the data volume of data operation processing, avoids machine learning model f₁Treatment effeciency is too low or can not perform effectively place The problem of reason；But this classification identifying processing mode locally chosen, may cause to be not present in the contrast sample of single pick Classification belonging to sample to be identified and cause to cannot get effective classification recognition result, and due to machine learning model f₁Output Result vector in each result vector element put in order and influenced, therefore same class by contrast sample's input Other contrast sample influences to deposit on different contrast sample's inputs puts in order for the correlation identification of sample to be identified In difference, classification recognition result to the sample identified may be influenced, therefore in order to ensure classification to the sample identified as far as possible The accuracy of identification needs to ensure that the selection of contrast sample traverses each more matchmakers included in the multi-medium data sample database Volume data classification, and it has been performed both by at least comparison of K times for each multi-medium data classification in multi-medium data sample database Sample selection operation, K are the selection frequency threshold value with setting, and the specific value of K can be determined according to practical application experience.Then, Statistical Comparison machine learning model f₁Each result vector element in the result vector that each secondary classification identifying processing is exported, finds out The highest result vector element of degree of correlation that correlation is embodied, by putting in order corresponding to the result vector element Contrast sample's generic on position is determined as sample generic to be identified.

Recognition effect comparative example：

The machine that the present embodiment will use in the machine learning identification device provided by the present invention for multimedia data classification Device learns recognition methods, and the recognition methods of prior art machine learning model is used with some, is carried out using identical data set Recognition effect contrast experiment, verifying the machine learning identification side used in machine learning identification device provided by the present invention The feasibility and validity of method.

In the present embodiment, the method for the present invention is labeled as " LCNN ", and prior art machine learning mould as a comparison Type includes document " Lake, B.M., Salakhutdinov, R.＆Tenenbaum, J.B.Human-level concept learning through probabilistic program induction Supplementary Be mentioned in Material.Science 350,1332-1338 (2015) " BPL (BayesianProgram Learning, Bayes plans learning algorithm) model (being labeled as " BPL [Lake2015] "), document " Vinyals, O., Blundell, C., Lillicrap,T.,Wierstra,D.&others.Matching networks for one shot learning.in It is mentioned in Advances in Neural Information Processing Systems 3630-3638 (2016) " Matching Nets (matching network) model (being labeled as " Matching Nets [Vinyals2016] ") and document “Koch,G.,Richard Zemel&Ruslan Salakhutdinov.Siamese neural networks for one- The Convolutional being mentioned in shot image recognition.in (University of Toronto, 2015) " Siamese Net (convolution connection network) model (is labeled as " Convolutional Siamese Net [Kock2015] ").

The present embodiment is based on Omniglot data sets, choose 30 respectively in the training set provided in Omniglot data sets, 60th, for the sample of 136,156,964 classifications as training set, each classification has 20 samples, and the model for participating in comparison is distinguished It is trained；Then, using document " Koch, G., Richard Zemel＆Ruslan Salakhutdinov.Siamese neural networks for one-shot image recognition.in(University of Toronto, 2015) 400 test samples (each classification has 20 samples) of 20 classifications provided in " carry out the 20 single samples for selecting 1 The identification test of (20-way) classification, the classification identification test for each model carry out 100 times, count respective identification essence respectively Degree.

In the present embodiment, using the first sub- learning model f in the method for the present invention scheme_DPThe full connection god of selected as individual layer Through network, the second sub- learning model f_DEThe residual error neural network (ResNet) of 121 layers of selected as.The present embodiment is to as a comparison The accuracy of identification statistical data that prior art machine learning model carries out classification identification test is as shown in table 1, to the method for the present invention The accuracy of identification statistical data that each embodiment scheme carries out classification identification test is as shown in table 2.

Table 1

Table 2

It can see by above-mentioned Tables 1 and 2, machine learning recognition methods of the present invention is due to can be based on identical training Sample set is to machine learning model f₁Carry out more differentiated learning training, thus in identical training sample classification book and Under conditions of training samples number, the accuracy of identification of machine learning recognition methods of the present invention leads over the existing skill for participating in comparison The accuracy of identification of art machine learning model, it is sufficient to illustrate that the machine learning recognition methods of the present invention is directed to multi-medium data classification Identification has good feasibility and validity.

In conclusion the present invention is based on the machine learning recognition methods of double contrast's study, known class can be utilized A certain amount of multi-medium data sample is put in order using different contrast sample's inputs to machine learning model f₁It carries out multiple The learning training of differentiated can carry out a large amount of learning training to reach using a small amount of training sample to machine learning model To expected classification recognition effect, so as to significantly reduce the dependence to magnanimity training sample, solves existing multimedia number The problem of needing to rely on a large amount of training sample according to sorting machine study recognition methods and leading to practical application；Meanwhile that is, Just for some multi-medium data classification without learning training, as long as the multi-medium data sample by the multi-medium data classification Originally it is added in identification contrast sample's database, when sample to be identified is the multi-medium data of the category, machine learning model f₁The result vector of output can still embody the difference between the sample to be identified and other different classes of contrast samples Property and the correlation between the contrast sample of the same category, so as to can still determine sample to be identified according to the correlation Generic, so as to easily extend to without learning training multi-medium data classification carry out classification identification, energy It is enough to solve the problems, such as versatility to be caused to be limited because directly carry out Classification and Identification to the classification without learning training due to；In addition, During classification identifying processing is carried out, the classification identifying processing mode that data are locally chosen may be used and allow what is chosen every time To as machine learning model f₁Contrast sample's categorical measure L of input is less than included in the multi-medium data sample database Known class multi-medium data categorical measure S, then by repeatedly choosing contrast sample respectively as machine learning mould Type f₁Input carry out the mode of multiple classification identifying processing to the sample identified, to reduce machine learning model f₁Each classification Identification processing procedure performs the data volume of data operation processing, avoids machine learning model f₁Treatment effeciency is too low or can not be effective The problem of performing processing.It can be seen that the present invention is based on the machine learning recognition methods of double contrast's study, solve well Existing multimedia data classification machine learning recognition methods because of the dependence to a large amount of training samples and because can not directly to without The problem of classification of learning training carries out Classification and Identification and causes practical application, versatility limited, can be more effective extensively Be applied in more specific multimedia data classification use occasions, there is wide technology application and promotion prospect.

Finally illustrate, the above embodiments are merely illustrative of the technical solutions of the present invention and it is unrestricted, although with reference to reality Example is applied the present invention is described in detail, it will be understood by those of ordinary skill in the art that, it can be to the technical side of the present invention Case is modified or replaced equivalently, and without departing from the objective and range of technical solution of the present invention, should all be covered in the present invention Right in.

Claims

1. a kind of machine learning recognition methods based on double contrast's study, which is characterized in that in multiple and different known class Target identification sample and contrast sample are chosen among multi-medium data, as a machine learning model f₁Input, to machine Learning model f₁Learning training is carried out, and then utilizes the machine learning model f after learning training₁To multi-medium data to be identified into Row classification identifies；The machine learning model f₁Including the first sub- learning model f_DPWith the second sub- learning model f_DE, described first Sub- learning model f_DPFor convolutional neural networks model or full Connection Neural Network model, the second sub- learning model f_DEFor volume Product neural network model or full Connection Neural Network model；Selected contrast sample includes more than two different classes of more A multi-medium data, and contrast sample is set to be input to machine learning model f₁Input put in order, and according to contrast sample Input puts in order, and target identification sample with contrast sample with presetting rule of combination is combined, reservation is consequently formed There is multiple data samples combination that contrast sample's input puts in order regular, be by each multi-medium data sample group cooperation respectively The second sub- learning model f_DEInput, and by each corresponding second sub- learning model f_DEOutput according to the comparative sample This input put in order rule compositor form a data vector, as the described first sub- learning model f_DPInput vector, and The first sub- learning model f_DPResult vector of the output vector as the machine learning model；It is learned from there through training It practises so that the machine learning model f obtained by learning training₁Each result vector element in the result vector of output is to table The correlation between target identification sample and contrast sample's generic on corresponding arrangement ordinal position is levied, so as to It is put in order using the multi-medium data sample of known class using different contrast sample's inputs to the machine learning model f₁Carry out multiple learning training.

2. the machine learning recognition methods according to claim 1 based on double contrast's study, which is characterized in that as machine Learning model f₁The target identification sample of input is one or more, and belongs to same category；

If as machine learning model f₁The target identification sample of input is one, by target identification sample and contrast sample with When presetting rule of combination is combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode is 1.：The target identification sample is established combinations of pairs respectively to close between each contrast sample System, carries out combinations of pairs respectively；

Rule of combination mode is 2.：Each contrast sample is first subjected to category division, then distinguishes the target identification sample Syntagmatic is established between the contrast sample of each classification, is respectively combined；

If as machine learning model f₁The target identification sample of input to be multiple, by target identification sample and contrast sample with When presetting rule of combination is combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode a：Each target identification sample is established combinations of pairs respectively to close between each contrast sample System, carries out combinations of pairs respectively；

Rule of combination mode b：Each contrast sample is first subjected to category division, then distinguishes each target identification sample Syntagmatic is established between the contrast sample of each classification, is respectively combined；

Rule of combination mode c：Target complete identification sample is established into pairing between each contrast sample respectively as a whole Syntagmatic carries out combinations of pairs respectively；

Rule of combination mode d：Each contrast sample is first subjected to category division, then identifies sample as whole target complete Body establishes syntagmatic between the contrast sample of each classification respectively, is respectively combined.

3. the machine learning recognition methods according to claim 1 based on double contrast's study, which is characterized in that machine Learning model f₁During carrying out learning training, the target identification sample and contrast sample are from preset multi-medium data sample It is chosen in this library, chooses the multimedia number of a part of known class included in the multi-medium data sample database every time According to as target identification sample and contrast sample to machine learning model f₁Learning training is carried out, and several times from the multimedia Target identification sample and contrast sample are chosen in data sample library to machine learning model f₁Learning training is carried out, to ensure target Identify that the selection of sample and contrast sample traverse each multi-medium data classification included in the multi-medium data sample database, And the contrast sample that at least H times has been performed both by for each multi-medium data classification in multi-medium data sample database chooses behaviour Make, H is that frequency threshold value is chosen in the training with setting.

4. the machine learning recognition methods according to claim 1 based on double contrast's study, which is characterized in that the utilization Machine learning model f after learning training₁To multi-medium data to be identified carry out classification identification concrete mode be：

Obtain more matchmakers of the multi-medium data as sample to be identified and from multiple and different known class as object to be identified The contrast sample chosen among volume data, as the machine learning model f after learning training₁Input, selected comparative sample This includes more than two different classes of multiple multi-medium datas, and contrast sample is set to be input to machine learning model f₁'s Input puts in order, and is put in order according to contrast sample's input, by sample to be identified and contrast sample with presetting combination Rule is combined, and is consequently formed and is remained with multiple data samples combination that contrast sample's input puts in order regular, respectively will Each multi-medium data sample group cooperation is the described second sub- learning model f_DEInput, and by it is each it is corresponding second son study mould Type f_DEOutput according to the contrast sample input put in order rule compositor form a data vector, as described first son Learning model f_DPInput vector, and the first sub- learning model f_DPOutput vector as the machine learning model Result vector；In classification identification process, the machine learning model f₁Each result vector member in the result vector of output Element to characterize the correlation between contrast sample's generic on sample to be identified and corresponding arrangement ordinal position, from And the generic of sample to be identified is determined according to the correlation.

5. the machine learning recognition methods according to claim 4 based on double contrast's study, which is characterized in that acquired Sample to be identified is one or more, and belongs to same category；

If it is input to machine learning model f₁Sample to be identified for one, by sample to be identified with contrast sample with presetting Rule of combination when being combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode is 1.：The sample to be identified is established into combinations of pairs relationship between each contrast sample respectively, Combinations of pairs is carried out respectively；

Rule of combination mode is 2.：First by each contrast sample carry out category division, then by the sample to be identified respectively with Syntagmatic is established between the contrast sample of each classification, is respectively combined；

If it is input to machine learning model f₁Sample to be identified to be multiple, by sample to be identified with contrast sample with presetting Rule of combination when being combined, the presetting rule of combination is one kind among following manner：

Rule of combination mode a：Each sample to be identified is established into combinations of pairs relationship between each contrast sample respectively, Combinations of pairs is carried out respectively；

Rule of combination mode b：First by each contrast sample carry out category division, then by each sample to be identified respectively with Syntagmatic is established between the contrast sample of each classification, is respectively combined；

Rule of combination mode c：All samples to be identified are established into matched group between each contrast sample respectively as a whole Conjunction relationship carries out combinations of pairs respectively；

Rule of combination mode d：Each contrast sample is first subjected to category division, then by all samples to be identified as a whole Syntagmatic is established between the contrast sample of each classification respectively, is respectively combined.

6. the machine learning recognition methods learnt according to right wants 4 based on double contrast, which is characterized in that acquired treats Sample is identified as one or more, and belongs to same category；

If the sample to be identified obtained is multiple, it may be used and be input to the machine learning model f in batches₁It is identified Processing, is input to machine learning model f in batches₁Concrete mode be one kind among following manner：

Input mode is 1. in batches：Whole contrast samples are formed into a sample input set with each sample to be identified respectively It closes；Multiple sample input set are consequently formed, by several times as the machine learning model f₁Input；

Input mode is 2. in batches：Each contrast sample is first subjected to category division；Then one is chosen from each classification A contrast sample, then a sample to be identified is chosen, form a sample input set；Multiple sample input sets are consequently formed It closes, by several times as the machine learning model f₁Input；

Input mode is 3. in batches：Each contrast sample is first subjected to category division；Then one is chosen from each classification A contrast sample together with all samples to be identified, forms a sample input set；Multiple sample input sets are consequently formed It closes, by several times as the machine learning model f₁Input；

7. the machine learning recognition methods according to claim 6 based on double contrast's study, which is characterized in that according to machine Learning model f₁The concrete mode that the result vector repeatedly exported carries out classification identifying processing is one kind among following manner：

Repeatedly export classification identification method 1.：Each result vector element in the result vector of each output of Statistical Comparison, finds out The highest result vector element of degree of correlation that correlation is embodied, by putting in order corresponding to the result vector element Contrast sample's generic on position is determined as sample generic to be identified；

Repeatedly export classification identification method 2.：By machine learning model f₁The result vector of each output adds up, and is added up Result vector so as to the correlation that each result vector element is characterized in accumulation result vector described in Statistical Comparison, finds out correlation The highest result vector element of degree of correlation that property is embodied, by the position that puts in order corresponding to the result vector element On contrast sample's generic be determined as sample generic to be identified.

8. the machine learning recognition methods according to claim 4 based on double contrast's study, which is characterized in that the comparison Sample is chosen from preset multi-medium data sample database, choose every time to as machine learning model f₁Input Contrast sample's categorical measure L is less than the classification of the multi-medium data of known class included in the multi-medium data sample database Quantity S, L and S are the integer more than 1, and need repeatedly to choose contrast sample's difference from the multi-medium data sample database As machine learning model f₁Input, multiple classification identifying processing is carried out to the sample identified, to ensure the choosing of contrast sample Each multi-medium data classification included in the traversal multi-medium data sample database is taken, and for multi-medium data sample database In each multi-medium data classification be performed both by contrast sample's selection operation of at least K times, K is that the identification with setting is chosen time Number threshold value；

Then, Statistical Comparison machine learning model f₁Each result vector in the result vector that each secondary classification identifying processing is exported Element finds out the highest result vector element of degree of correlation that correlation is embodied, will be corresponding to the result vector element Put in order on position contrast sample's generic be determined as sample generic to be identified.