CN110084108A - Pedestrian re-identification system and method based on GAN neural network - Google Patents
- Publication number: CN110084108A (application CN201910208515.3A)
- Authority: CN (China)
- Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06N3/045 — Combinations of networks (G—Physics; G06—Computing; G06N—Computing arrangements based on specific computational models; G06N3/00—Computing arrangements based on biological models; G06N3/02—Neural networks; G06N3/04—Architecture, e.g. interconnection topology)
- G06V10/462 — Salient features, e.g. scale invariant feature transforms [SIFT] (G06V—Image or video recognition or understanding; G06V10/40—Extraction of image or video features; G06V10/46—Descriptors for shape, contour or point-related descriptors)
- G06V10/56 — Extraction of image or video features relating to colour
- G06V40/10 — Human or animal bodies, e.g. vehicle occupants or pedestrians; body parts, e.g. hands (G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data)
Abstract
The invention relates to a pedestrian re-identification system and method based on a GAN neural network in the technical field of pedestrian re-identification, divided into two stages: image reconstruction and enhancement, and pedestrian re-identification. In the image reconstruction and enhancement stage, the generation network in the GAN neural network first generates a high-resolution image from a low-resolution image; the discrimination network then judges whether its input is a generated image or an original image. If it is a generated image, generation is repeated until the discrimination network can no longer tell the difference, completing the image reconstruction; the image is then enhanced with the Retinex algorithm. In the pedestrian re-identification stage, on the basis of image reconstruction and enhancement, color features are first extracted with an HSV histogram and texture features with SILTP, the LOMO features of the image are extracted, and finally the feature space is dimensionality-reduced with the XQDA method and pedestrian re-identification is carried out with a distance metric. The method has the advantages of high accuracy, good robustness and good generalization, and is suitable for applications such as video pedestrian monitoring and video pedestrian information retrieval.
Description
Technical field
The present invention relates to the technical field of pedestrian re-identification, and in particular to a pedestrian re-identification system and method based on a GAN neural network.
Background art
Pedestrian re-identification (person re-identification) is the task of using computer vision techniques to determine whether a specific pedestrian appears in an image or video sequence. It involves two key steps. First, feature extraction: extracting the features of the target pedestrian and of the candidate pedestrians. Second, metric learning: computing the feature similarity between the target pedestrian and each candidate to judge whether the candidate is the target being sought. Early researchers applied mature methods from the fields of image processing and pattern recognition, focusing on hand-crafted pedestrian features and simple classification algorithms. Since 2010, training datasets for pedestrian re-identification have grown large-scale, and deep learning frameworks have been widely adopted for training and inference.
Chinese patent CN108520226A proposes a pedestrian re-identification method based on body decomposition and saliency detection. A deep decomposition network first decomposes the pedestrian image into semantic regions, separating the pedestrian from the cluttered background. The image is then divided into small blocks, and effective image regions are selected automatically according to background inference and salient regions. Finally, LOMO features are extracted from the selected regions, while PHOG, HSV histogram and SIFT features are extracted from the whole image.
Chinese patent CN105718882A proposes a resolution-adaptive feature extraction and fusion method for pedestrian re-identification. It fuses biometric features and appearance features, identifying pedestrians by combining face features with appearance features to enhance feature discriminability, and compares different pedestrian features at different image scales according to how each feature performs at each scale. A filtering scheme first screens candidates with a fusion of color features and contour features, then supplements the result with face features; texture features, maximally stable color region features and weighted color features are then extracted from the screened pedestrians, and the extracted global and local features are fused with adaptive weighting to obtain the final fusion feature.
US patent US9396412B2 proposes a pedestrian re-identification method based on semantic color names. Person images are described with semantic color names combined with traditional color features, computing the probability distribution over 11 basic colors as an image descriptor. The method provides a mapping from RGB values to the probability distribution over the 11 colors and builds a semantic histogram as the image descriptor. To better combine semantic color names with other widely used features (including color histograms, texture histograms and covariance matrices), the method defines local descriptors based on an appearance affinity model and forms a linear combination through similarity measures. Unlike conventional methods, it uses the raw feature vector with absolute difference as the feature pool; the weight of each similarity measure is then learned with the RankBoost algorithm. US patent US10108850B1 concerns a pedestrian re-identification method based on the AlexNet neural network: it first extracts various physical features of pedestrians in different pictures, such as the torso and feet, then pre-trains a convolutional neural network and trains AlexNet, and finally realizes re-identification by feature comparison.
Summary of the invention
In view of the drawbacks of the prior art, the object of the present invention is to provide a pedestrian re-identification system and method based on a GAN neural network. The invention establishes a GAN-based neural network structure that, through image reconstruction and enhancement, addresses the problem of image clarity across different cameras and viewing angles, and reduces the sensitivity of images to different illumination with the Retinex algorithm. Pedestrian re-identification is then realized by combining LOMO features with the XQDA method. The approach offers high accuracy, good robustness and good generalization, and is suitable for practical applications such as video pedestrian monitoring and video pedestrian information retrieval.
The invention provides a pedestrian re-identification system based on a GAN neural network, comprising an image reconstruction and enhancement module and a pedestrian re-identification module.
The image reconstruction and enhancement module builds a generation network, a discrimination network and a loss-function calculation, and enhances the reconstructed images.
The pedestrian re-identification module performs pedestrian re-identification on the enhanced pictures produced by the image reconstruction and enhancement module.
Preferably, the generation network comprises multiple residual blocks. Each residual block contains two 3×3 convolutional layers, each followed by a batch-normalization layer, with PReLU as the activation function; two sub-pixel convolutional layers are then connected to increase the feature size.
Preferably, the discrimination network comprises 8 convolutional layers. As the network deepens, the number of feature maps increases continuously while the feature size shrinks. LeakyReLU is used as the activation function, and two fully connected layers followed by a sigmoid activation finally output the probability that the input is predicted to be an original image.
Preferably, the discrimination network judges whether its input is a generated image or an original image; if it is a generated image, generation is repeated until the discrimination network can no longer tell the difference, at which point image reconstruction is complete.
Preferably, the loss-function calculation weights the content loss and the adversarial loss with different weights. The content loss is the Euclidean distance between the generated image and the original image; the adversarial loss drives the generator to produce a data distribution the discriminator cannot distinguish.
Preferably, the content loss comprises the minimum mean-square error in pixel space (MSE loss) and the Euclidean distance between the generated-image features and the original-image features computed on a ReLU activation layer of a pre-trained 19-layer VGG network (VGG loss).
Preferably, the image reconstruction and enhancement module performs image enhancement using the Retinex algorithm.
Preferably, the Retinex algorithm first decomposes the input color image into three images according to the R, G and B components of each pixel, representing the intensities of reflected light at different wavelengths in the scene. It then computes the relative light-dark relationships between pixels within the long-, medium- and short-wave bands to determine the color of each pixel, and finally linearly maps the colors from the Retinex chromaticity space back to RGB space to obtain the enhanced image.
Preferably, the pedestrian re-identification module first extracts color features with an HSV histogram and texture features with SILTP on the basis of image reconstruction and enhancement, then extracts the LOMO features of the image, reduces the dimensionality of the feature space with the XQDA method, and performs pedestrian re-identification using a distance metric.
A pedestrian re-identification method based on a GAN neural network, characterized by comprising the following steps:
Step 1, build the generation network:
The generation network comprises multiple residual blocks. Each residual block contains two 3×3 convolutional layers, each followed by a batch-normalization layer, with PReLU as the activation function; two sub-pixel convolutional layers are then connected to increase the feature size.
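The sub-pixel convolutional layers grow the feature size by rearranging channel blocks into spatial positions (the pixel-shuffle step). A minimal NumPy sketch of that rearrangement and of the PReLU activation, with an assumed upscale factor r = 2 and channel-first layout:

```python
import numpy as np

def pixel_shuffle(x, r):
    """Rearrange a (C*r*r, H, W) feature map into (C, H*r, W*r),
    the upscaling step performed by sub-pixel convolution."""
    c2, h, w = x.shape
    assert c2 % (r * r) == 0, "channels must be divisible by r*r"
    c = c2 // (r * r)
    # split channels into (C, r, r), then interleave into the spatial dims
    x = x.reshape(c, r, r, h, w)
    x = x.transpose(0, 3, 1, 4, 2)        # (C, H, r, W, r)
    return x.reshape(c, h * r, w * r)

def prelu(x, alpha=0.25):
    """PReLU activation used after each convolution in the residual blocks;
    alpha is learned per channel in practice, fixed here for illustration."""
    return np.where(x >= 0, x, alpha * x)

feat = np.arange(16.0).reshape(4, 2, 2)   # 4 channels, 2x2 -> 1 channel, 4x4
up = pixel_shuffle(feat, 2)
print(up.shape)  # (1, 4, 4)
```

PyTorch's `nn.PixelShuffle` performs the same rearrangement; it is spelled out here to make the channel-to-space mapping explicit.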
Step 2, build the discrimination network:
The discrimination network comprises 8 convolutional layers. As the network deepens, the number of feature maps increases continuously while the feature size shrinks. LeakyReLU is used as the activation function, and two fully connected layers followed by a sigmoid activation finally output the probability that the input is predicted to be an original image.
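The classification head at the end of the discrimination network can be sketched in NumPy; the layer sizes (512 and 64) are illustrative assumptions, not values fixed by the patent:

```python
import numpy as np

def leaky_relu(x, alpha=0.2):
    """LeakyReLU activation used throughout the discrimination network."""
    return np.where(x >= 0, x, alpha * x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def discriminator_head(features, w1, b1, w2, b2):
    """Two fully connected layers + sigmoid: the probability that the
    input is an original (real) image rather than a generated one."""
    h = leaky_relu(features @ w1 + b1)
    return sigmoid(h @ w2 + b2)

rng = np.random.default_rng(0)
feats = rng.standard_normal(512)            # flattened conv features (stand-in)
p = discriminator_head(feats,
                       rng.standard_normal((512, 64)), np.zeros(64),
                       rng.standard_normal((64, 1)), np.zeros(1))
print(p[0])  # a probability in (0, 1)
```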
Step 3, loss-function calculation:
The loss comprises two parts: the content loss $l_X^{SR}$ of the super-resolution image (SR), combined in a weighted sum with the adversarial loss $l_{Gen}^{SR}$ of the generator (Gen):

$$l^{SR} = l_X^{SR} + \lambda\, l_{Gen}^{SR} \qquad (1)$$

where X denotes a set of high-resolution and low-resolution images and $\lambda$ is the weighting coefficient of the adversarial term.

The content loss expresses the Euclidean distance between the generated image and the original image, and comprises the MSE loss and the VGG loss.

The MSE loss is the minimum mean-square error in pixel space:

$$l_{MSE}^{SR} = \frac{1}{r^2 W H}\sum_{x=1}^{rW}\sum_{y=1}^{rH}\left(I_{x,y}^{HR} - G_{\theta_G}(I^{LR})_{x,y}\right)^2 \qquad (2)$$

where x, y are image coordinates; r is the decimation factor; W and H are the width and height of the low-resolution (LR) image $I^{LR}$, so rW and rH are the width and height of the r-times enlarged low-resolution image; $I^{HR}$ is the high-resolution image, i.e. the original image; $G_{\theta_G}(I^{LR})_{x,y}$ is the high-resolution image synthesized by the generation network; $G_{\theta_G}$ denotes the generator, with $\theta_G$ the weights and biases of the L-layer deep network.

The VGG loss is based on a ReLU activation layer of a pre-trained 19-layer VGG network and measures the Euclidean distance between the features of the generated image and those of the original image: the feature map of a chosen layer of the trained VGG is extracted, and the feature map of the generated image at that layer is compared with the corresponding feature map of the original image:

$$l_{VGG/i,j}^{SR} = \frac{1}{W_{i,j} H_{i,j}}\sum_{x=1}^{W_{i,j}}\sum_{y=1}^{H_{i,j}}\left(\phi_{i,j}(I^{HR})_{x,y} - \phi_{i,j}\bigl(G_{\theta_G}(I^{LR})\bigr)_{x,y}\right)^2 \qquad (3)$$

where $W_{i,j}$ and $H_{i,j}$ describe the dimensions of the respective feature map within the VGG network; i, j index the j-th convolution after the i-th max-pooling layer; $\phi_{i,j}$ is the feature map output by that convolutional layer after the activation function, so $\phi_{i,j}(I^{HR})_{x,y}$ is the feature map of the high-resolution image and $\phi_{i,j}(G_{\theta_G}(I^{LR}))_{x,y}$ that of the image synthesized by the generation network. This loss function reflects errors at a higher perceptual level, whereas the mean-square-error term only reflects low-level errors between pixels; the VGG loss term is therefore also called the perceptual loss.

The adversarial loss $l_{Gen}^{SR}$ is used to generate a data distribution the discriminator cannot distinguish:

$$l_{Gen}^{SR} = \sum_{n=1}^{N} -\log D_{\theta_D}\bigl(G_{\theta_G}(I^{LR})\bigr) \qquad (4)$$

where $D_{\theta_D}$ is the discriminator and $\theta_D$ its weights, estimating the probability that a candidate sample comes from the real data distribution; $D_{\theta_D}(G_{\theta_G}(I^{LR}))$ is the probability that the discriminator predicts the generated image to be an original image; N is the number of samples.
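A minimal NumPy sketch of the MSE and adversarial terms of this step, assuming the images are arrays in [0, 1] and `d_prob` holds the discriminator's outputs on the generated batch; the VGG term requires the pre-trained network and is omitted, and the weight `lam` is an illustrative assumption:

```python
import numpy as np

def mse_loss(hr, sr):
    """Pixel-space minimum mean-square error between the original
    high-resolution image hr and the generated image sr."""
    return np.mean((hr - sr) ** 2)

def adversarial_loss(d_prob):
    """Sum of -log D(G(I_LR)) over the batch: small when the
    discriminator believes the generated images are originals."""
    eps = 1e-12                      # avoid log(0)
    return np.sum(-np.log(d_prob + eps))

def total_loss(hr, sr, d_prob, lam=1e-3):
    """Weighted sum of content loss and adversarial loss."""
    return mse_loss(hr, sr) + lam * adversarial_loss(d_prob)

hr = np.ones((4, 4))
sr = np.full((4, 4), 0.5)
print(mse_loss(hr, sr))                            # 0.25
print(total_loss(hr, sr, np.array([0.5, 0.5])))    # slightly above 0.25
```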
Step 4, image enhancement using the Retinex algorithm:
The input color image is first decomposed into three images according to the R, G and B components of each pixel, representing the intensities of reflected light at different wavelengths in the scene. The relative light-dark relationships between pixels within the long-, medium- and short-wave bands are then computed to determine the color of each pixel. Finally, the colors in the Retinex chromaticity space are linearly mapped back to RGB space to obtain the enhanced image.
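The patent does not fix a particular Retinex variant; as an illustration, a single-scale Retinex applied per band estimates illumination with a local mean and keeps the log-domain reflectance, then linearly rescales back to a displayable range (the box-filter size is an assumed parameter):

```python
import numpy as np

def box_blur(channel, k=3):
    """Crude local illumination estimate: k x k box filter
    (same-size output via edge padding)."""
    pad = k // 2
    p = np.pad(channel, pad, mode="edge")
    out = np.zeros_like(channel, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += p[dy:dy + channel.shape[0], dx:dx + channel.shape[1]]
    return out / (k * k)

def single_scale_retinex(img, k=3):
    """Per-band log(image) - log(illumination), then linearly
    rescaled to [0, 1]; img has shape (H, W, 3) with values in (0, 1]."""
    out = np.empty_like(img, dtype=float)
    for c in range(3):                      # R, G, B bands separately
        ch = img[..., c]
        r = np.log(ch + 1e-6) - np.log(box_blur(ch, k) + 1e-6)
        lo, hi = r.min(), r.max()
        out[..., c] = (r - lo) / (hi - lo + 1e-12)   # linear map back
    return out

rng = np.random.default_rng(1)
img = rng.uniform(0.1, 1.0, size=(8, 8, 3))
enh = single_scale_retinex(img)
print(enh.shape)  # (8, 8, 3)
```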
Step 5, feature extraction:
On the basis of the image reconstruction and enhancement of steps 1-4, color features are extracted with an HSV histogram and texture features are extracted with SILTP.
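SILTP compares each pixel against its neighbors under a scale-invariant tolerance. A minimal NumPy sketch, assuming 4 neighbors at radius 1 and a tolerance τ = 0.3 (both illustrative choices — the patent does not fix these parameters):

```python
import numpy as np

def siltp4(img, tau=0.3):
    """Scale Invariant Local Ternary Pattern with 4 neighbors at radius 1.
    Each neighbor contributes a ternary digit: 2 if neighbor > (1+tau)*center,
    1 if neighbor < (1-tau)*center, else 0; the digits are combined base-3
    into one code per interior pixel."""
    c = img[1:-1, 1:-1]
    neighbors = [img[1:-1, 2:], img[:-2, 1:-1],   # right, up
                 img[1:-1, :-2], img[2:, 1:-1]]   # left, down
    code = np.zeros_like(c, dtype=np.int32)
    for n in neighbors:
        digit = np.where(n > (1 + tau) * c, 2,
                np.where(n < (1 - tau) * c, 1, 0))
        code = code * 3 + digit
    return code

patch = np.array([[1., 1., 1.],
                  [1., 1., 2.],
                  [1., 1., 1.]])
print(siltp4(patch))   # [[54]] -- only the right neighbor exceeds 1.3
```

Because the comparison threshold scales with the center intensity, the code is invariant to a global intensity scaling, which is what makes SILTP robust to illumination changes.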
Step 6, LOMO feature extraction:
The original image is downsampled twice with 2×2 average pooling, LOMO features are extracted from all three images, and the features of the three images are concatenated into one feature vector. Finally, extremely large values in the feature vector are suppressed with a log transformation, after which the vector is renormalized to unit length.
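The three-scale pooling pyramid and the final log-suppression/renormalization can be sketched as follows; `extract_features` is a hypothetical placeholder standing in for the LOMO descriptor of one scale:

```python
import numpy as np

def avg_pool_2x2(img):
    """2x2 average pooling; assumes even height and width."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def extract_features(img):
    """Placeholder for the per-scale LOMO descriptor
    (here just a small intensity histogram for illustration)."""
    hist, _ = np.histogram(img, bins=8, range=(0.0, 1.0))
    return hist.astype(float)

def multi_scale_descriptor(img):
    """Descriptors at 3 scales, concatenated, log-suppressed,
    then renormalized to unit length."""
    scales = [img, avg_pool_2x2(img), avg_pool_2x2(avg_pool_2x2(img))]
    v = np.concatenate([extract_features(s) for s in scales])
    v = np.log1p(v)                       # suppress extremely large values
    return v / (np.linalg.norm(v) + 1e-12)

rng = np.random.default_rng(2)
img = rng.uniform(0.0, 1.0, size=(16, 16))
d = multi_scale_descriptor(img)
print(d.shape)  # (24,) -- 8 bins per scale, unit norm
```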
Step 7, dimensionality reduction and distance metric with the XQDA method:
Let Δ = x_i − x_j denote the feature difference between two samples, and let P be a Gaussian distribution: P(Δ | Ω_I) is the intra-class Gaussian followed by the difference Δ between same-class samples, and P(Δ | Ω_E) is the inter-class Gaussian followed by the difference Δ between different-class samples; both have zero mean:

$$P(\Delta \mid \Omega) = \frac{1}{(2\pi)^{d/2}\,\lvert\Sigma\rvert^{1/2}}\exp\left(-\tfrac{1}{2}\,\Delta^{T}\Sigma^{-1}\Delta\right) \qquad (5)$$

Taking the log-likelihood ratio of the two distributions gives the distance function

$$f(\Delta) = d(x_i, x_j) = \Delta^{T}\left(\Sigma_I^{-1} - \Sigma_E^{-1}\right)\Delta \qquad (6)$$

where T denotes matrix transposition, d(x_i, x_j) is the distance function, and Σ_I and Σ_E are the covariance matrices of the similar-pair set and the dissimilar-pair set, respectively. f(Δ) expresses the distance between the two samples: if it is greater than 0, the samples are with high probability not similar.

The XQDA method performs principal component analysis of the features and similarity learning simultaneously: by learning a mapping matrix W ∈ R^{d×r} (r < d), the original features x_i, x_j ∈ R^d are mapped into a lower-dimensional subspace, realizing feature dimensionality reduction and improving the accuracy of feature matching. The matrix W is composed of the eigenvectors corresponding to the r largest eigenvalues of the principal component analysis, and d denotes the feature dimension. The distance function defined in formula (6) therefore transforms into

$$f(\Delta) = (x - z)^{T}\,W\left(\Sigma_I'^{-1} - \Sigma_E'^{-1}\right)W^{T}(x - z) \qquad (7)$$

where T is matrix transposition, $\Sigma_I' = W^{T}\Sigma_I W$ is the covariance matrix of the similar-pair set in the subspace and $\Sigma_E' = W^{T}\Sigma_E W$ that of the dissimilar-pair set; x is a sample point from one camera view and z a sample point from the other view.
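A sketch of the subspace distance of formula (7) in NumPy, assuming the projection W and the two pair-set covariances have already been learned (random stand-ins here, just to exercise the formula):

```python
import numpy as np

def xqda_distance(x, z, W, sigma_I, sigma_E):
    """f(delta) = (x-z)^T W (S_I'^{-1} - S_E'^{-1}) W^T (x-z), where
    S_I' = W^T sigma_I W and S_E' = W^T sigma_E W are the projected
    intra-class and inter-class covariances."""
    s_i = W.T @ sigma_I @ W
    s_e = W.T @ sigma_E @ W
    M = W @ (np.linalg.inv(s_i) - np.linalg.inv(s_e)) @ W.T
    d = x - z
    return float(d @ M @ d)

rng = np.random.default_rng(3)
dim, r = 6, 3
W = rng.standard_normal((dim, r))           # learned projection (stand-in)
A = rng.standard_normal((dim, dim)); sigma_I = A @ A.T + np.eye(dim) * 0.1
B = rng.standard_normal((dim, dim)); sigma_E = B @ B.T + np.eye(dim) * 0.1
x, z = rng.standard_normal(dim), rng.standard_normal(dim)
print(xqda_distance(x, z, W, sigma_I, sigma_E))
print(xqda_distance(x, x, W, sigma_I, sigma_E))  # 0.0 for identical samples
```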
Step 8, output the recognition result.
Compared with the prior art, the present invention has the following beneficial effects:
1. The invention establishes a GAN-based neural network structure that, through image reconstruction and enhancement, addresses the problem of image clarity across different cameras and viewing angles, and reduces the sensitivity of images to different illumination with the Retinex algorithm.
2. The invention realizes pedestrian re-identification by combining LOMO features with the XQDA method, offering high accuracy, good robustness and good generalization, and is suitable for practical applications such as video pedestrian monitoring and video pedestrian information retrieval.
Brief description of the drawings
Other features, objects and advantages of the invention will become more apparent upon reading the detailed description of non-limiting embodiments with reference to the following drawings:
Fig. 1 is a flow diagram of the pedestrian re-identification method based on a GAN neural network.
Specific embodiment
The present invention is described in detail below in combination with specific embodiments. The following embodiments will help those skilled in the art to further understand the invention, but do not limit it in any way. It should be pointed out that, for those of ordinary skill in the art, various modifications and improvements can be made without departing from the inventive concept; these all belong to the protection scope of the present invention.
Embodiment
In view of the above demand, the present invention relates to a pedestrian re-identification method based on a GAN neural network, divided into two stages: image reconstruction and enhancement, and pedestrian re-identification. In the image reconstruction and enhancement stage, the generation network in the GAN neural network first generates a high-resolution image from the low-resolution image. The discrimination network then judges whether its input is a generated image or an original image. If it is a generated image, generation is repeated until the discrimination network can no longer tell the difference, at which point image reconstruction is complete; the image is then enhanced with the Retinex algorithm. In the pedestrian re-identification stage, on the basis of image reconstruction and enhancement, color features are first extracted with an HSV histogram and texture features with SILTP, then the LOMO features of the image are extracted; finally, the feature space is dimensionality-reduced with the XQDA method and pedestrian re-identification is carried out with a distance metric.
As shown in Fig. 1, the pedestrian re-identification method based on a GAN neural network according to the present invention is divided into two stages: an image reconstruction and enhancement stage and a pedestrian re-identification stage.
The steps of the image reconstruction and enhancement stage are as follows:
Step 1: build the generation network
The generation network comprises multiple residual blocks. Each residual block contains two 3×3 convolutional layers, each followed by a batch-normalization layer, with PReLU as the activation function; two sub-pixel convolutional layers are then connected to increase the feature size.
Step 2: build the discrimination network
The discrimination network comprises 8 convolutional layers. As the network deepens, the number of feature maps increases continuously while the feature size shrinks. LeakyReLU is used as the activation function, and two fully connected layers followed by a sigmoid activation finally output the probability that the input is predicted to be an original image.
Step 3: loss-function calculation
The loss comprises two parts: the content loss $l_X^{SR}$ of the super-resolution image (SR), combined in a weighted sum with the adversarial loss $l_{Gen}^{SR}$ of the generator (Gen):

$$l^{SR} = l_X^{SR} + \lambda\, l_{Gen}^{SR} \qquad (1)$$

where X denotes a set of high-resolution and low-resolution images and $\lambda$ is the weighting coefficient of the adversarial term.

The content loss expresses the Euclidean distance between the generated image and the original image, and comprises the MSE loss and the VGG loss.

The MSE loss is the minimum mean-square error in pixel space:

$$l_{MSE}^{SR} = \frac{1}{r^2 W H}\sum_{x=1}^{rW}\sum_{y=1}^{rH}\left(I_{x,y}^{HR} - G_{\theta_G}(I^{LR})_{x,y}\right)^2 \qquad (2)$$

where x, y are image coordinates; r is the decimation factor; W and H are the width and height of the low-resolution (LR) image $I^{LR}$, so rW and rH are the width and height of the r-times enlarged low-resolution image; $I^{HR}$ is the high-resolution image, i.e. the original image; $G_{\theta_G}(I^{LR})_{x,y}$ is the high-resolution image synthesized by the generation network; $G_{\theta_G}$ denotes the generator, with $\theta_G$ the weights and biases of the L-layer deep network.

The VGG loss is based on a ReLU activation layer of a pre-trained 19-layer VGG network and measures the Euclidean distance between the features of the generated image and those of the original image: the feature map of a chosen layer of the trained VGG is extracted, and the feature map of the generated image at that layer is compared with the corresponding feature map of the original image:

$$l_{VGG/i,j}^{SR} = \frac{1}{W_{i,j} H_{i,j}}\sum_{x=1}^{W_{i,j}}\sum_{y=1}^{H_{i,j}}\left(\phi_{i,j}(I^{HR})_{x,y} - \phi_{i,j}\bigl(G_{\theta_G}(I^{LR})\bigr)_{x,y}\right)^2 \qquad (3)$$

where $W_{i,j}$ and $H_{i,j}$ describe the dimensions of the respective feature map within the VGG network; i, j index the j-th convolution after the i-th max-pooling layer; $\phi_{i,j}$ is the feature map output by that convolutional layer after the activation function, so $\phi_{i,j}(I^{HR})_{x,y}$ is the feature map of the high-resolution image and $\phi_{i,j}(G_{\theta_G}(I^{LR}))_{x,y}$ that of the image synthesized by the generation network. This loss function reflects errors at a higher perceptual level, whereas the mean-square-error term only reflects low-level errors between pixels; the VGG loss term is therefore also called the perceptual loss.

The adversarial loss $l_{Gen}^{SR}$ is used to generate a data distribution the discriminator cannot distinguish:

$$l_{Gen}^{SR} = \sum_{n=1}^{N} -\log D_{\theta_D}\bigl(G_{\theta_G}(I^{LR})\bigr) \qquad (4)$$

where $D_{\theta_D}$ is the discriminator and $\theta_D$ its weights, estimating the probability that a candidate sample comes from the real data distribution; $D_{\theta_D}(G_{\theta_G}(I^{LR}))$ is the probability that the discriminator predicts the generated image to be an original image; N is the number of samples.
Step 4: image enhancement using the Retinex algorithm
The input color image is first decomposed into three images according to the R, G and B components of each pixel, representing the intensities of reflected light at different wavelengths in the scene. The relative light-dark relationships between pixels within the long-, medium- and short-wave bands are then computed to determine the color of each pixel. Finally, the colors in the Retinex chromaticity space are linearly mapped back to RGB space to obtain the enhanced image.
The steps of the pedestrian re-identification stage are as follows:
Step 1: feature extraction
On the basis of image reconstruction and enhancement, color features are extracted with an HSV histogram and texture features with SILTP. To take multi-scale information into account, the original image is downsampled twice with 2×2 average pooling, LOMO features are extracted from all three images, and the features of the three images are concatenated into one feature vector. Finally, extremely large values in the feature vector are suppressed with a log transformation, after which the vector is renormalized to unit length.
Step 2: dimensionality reduction and distance metric with the XQDA method
Let Δ = x_i − x_j denote the feature difference between two samples, and let P be a Gaussian distribution: P(Δ | Ω_I) is the intra-class Gaussian followed by the difference Δ between same-class samples, and P(Δ | Ω_E) is the inter-class Gaussian followed by the difference Δ between different-class samples; both have zero mean:

$$P(\Delta \mid \Omega) = \frac{1}{(2\pi)^{d/2}\,\lvert\Sigma\rvert^{1/2}}\exp\left(-\tfrac{1}{2}\,\Delta^{T}\Sigma^{-1}\Delta\right) \qquad (5)$$

Taking the log-likelihood ratio of the two distributions gives the distance function

$$f(\Delta) = d(x_i, x_j) = \Delta^{T}\left(\Sigma_I^{-1} - \Sigma_E^{-1}\right)\Delta \qquad (6)$$

where T denotes matrix transposition, d(x_i, x_j) is the distance function, and Σ_I and Σ_E are the covariance matrices of the similar-pair set and the dissimilar-pair set, respectively. f(Δ) expresses the distance between the two samples: if it is greater than 0, the samples are with high probability not similar.

The XQDA method performs principal component analysis of the features and similarity learning simultaneously: by learning a mapping matrix W ∈ R^{d×r} (r < d), the original features x_i, x_j ∈ R^d are mapped into a lower-dimensional subspace, realizing feature dimensionality reduction and improving the accuracy of feature matching. The matrix W is composed of the eigenvectors corresponding to the r largest eigenvalues of the principal component analysis, and d denotes the feature dimension. The distance function defined in formula (6) therefore transforms into

$$f(\Delta) = (x - z)^{T}\,W\left(\Sigma_I'^{-1} - \Sigma_E'^{-1}\right)W^{T}(x - z) \qquad (7)$$

where T is matrix transposition, $\Sigma_I' = W^{T}\Sigma_I W$ is the covariance matrix of the similar-pair set in the subspace and $\Sigma_E' = W^{T}\Sigma_E W$ that of the dissimilar-pair set; x is a sample point from one camera view and z a sample point from the other view.
Step 3: output the recognition result.
In conclusion, the present invention establishes a GAN-based neural network structure that, through image reconstruction and enhancement, addresses the problem of image clarity across different cameras and viewing angles, and reduces the sensitivity of images to different illumination with the Retinex algorithm. Pedestrian re-identification is realized by combining LOMO features with the XQDA method, offering high accuracy, good robustness and good generalization, and is suitable for practical applications such as video pedestrian monitoring and video pedestrian information retrieval.
Specific embodiments of the present invention have been described above. It is to be understood that the invention is not limited to the particular embodiments described; those skilled in the art can make various changes or modifications within the scope of the claims without affecting the substantive content of the invention. In the absence of conflict, the features in the embodiments of this application may be combined with one another in any way.
Claims (10)
1. it is a kind of based on GAN neural network pedestrian weight identifying system, which is characterized in that including image reconstruction and enhancing module,
Pedestrian's weight identification module;
Described image, which is rebuild, can generate network with enhancing module, differentiate network and the figure that loss function calculates and counterweight is built up
As being enhanced;
Pedestrian's weight identification module can rebuild the enhancing picture progress pedestrian obtained with enhancing module to described image to be known again
Not.
2. The pedestrian re-identification system based on a GAN neural network according to claim 1, characterized in that the generator network comprises a plurality of residual blocks; each residual block contains two 3×3 convolutional layers, each convolutional layer is followed by a batch-normalization layer, and PReLU is chosen as the activation function; two sub-pixel convolutional layers are then connected to increase the feature size.
3. The pedestrian re-identification system based on a GAN neural network according to claim 1, characterized in that the discriminator network comprises 8 convolutional layers; as the network deepens, the number of feature maps increases steadily while the feature size shrinks; LeakyReLU is chosen as the activation function, and the probability that the input is an original image is finally obtained through two fully connected layers and a sigmoid activation function.
4. The pedestrian re-identification system based on a GAN neural network according to claim 3, characterized in that the discriminator network judges whether its input is a generated image or an original image; if it is a generated image, generation is repeated until the discriminator can no longer tell the two apart, at which point image reconstruction is complete.
5. The pedestrian re-identification system based on a GAN neural network according to claim 1, characterized in that the loss-function calculation combines a content loss and an adversarial loss with different weights; the content loss is the Euclidean distance between the generated image and the original image, and the adversarial loss drives the generator towards a data distribution the discriminator cannot distinguish.
6. The pedestrian re-identification system based on a GAN neural network according to claim 5, characterized in that the content loss comprises the pixel-space minimum mean-square error Mseloss and the Vggloss, i.e. the Euclidean distance between the features of the generated image and of the original image computed on a ReLU activation layer of a pre-trained 19-layer VGG network.
7. The pedestrian re-identification system based on a GAN neural network according to claim 1, characterized in that the image reconstruction and enhancement module performs image enhancement using the Retinex algorithm.
8. The pedestrian re-identification system based on a GAN neural network according to claim 7, characterized in that the Retinex algorithm first decomposes the input color image into three images according to the R, G and B components of each pixel, representing the intensities of reflected light of different wavelengths in the scene; it then computes the relative light-dark relations between pixels within the long-, medium- and short-wave bands to determine the color of each pixel; finally, the colors in the Retinex chromaticity space are linearly mapped to RGB space to obtain the enhanced image.
9. The pedestrian re-identification system based on a GAN neural network according to claim 1, characterized in that the pedestrian re-identification module, on the basis of image reconstruction and enhancement, first extracts color features with HSV histograms and texture features with SILTP, then extracts the LOMO feature of the image, and finally applies the XQDA method to reduce the dimensionality of the feature space and performs pedestrian re-identification with a distance metric.
10. A pedestrian re-identification method based on a GAN neural network, characterized by comprising the following steps:
Step 1, build the generator network:
the generator part comprises a plurality of residual blocks; each residual block contains two 3×3 convolutional layers, each followed by a batch-normalization layer, with PReLU chosen as the activation function; two sub-pixel convolutional layers are then connected to increase the feature size;
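The sub-pixel convolutional layers of step 1 enlarge the feature size by rearranging channels into spatial positions. A minimal NumPy sketch of that rearrangement (the `pixel_shuffle` helper and all shapes are illustrative assumptions, not taken from the patent):

```python
import numpy as np

def pixel_shuffle(x: np.ndarray, r: int) -> np.ndarray:
    """Rearrange a (C*r*r, H, W) feature tensor into (C, H*r, W*r).

    A sub-pixel convolution first expands the channel count by r*r with
    an ordinary convolution; this shuffle then trades those channels
    for an r-times larger spatial grid.
    """
    c_r2, h, w = x.shape
    c = c_r2 // (r * r)
    assert c * r * r == c_r2, "channel count must be divisible by r*r"
    x = x.reshape(c, r, r, h, w)        # split channels into (C, r, r)
    x = x.transpose(0, 3, 1, 4, 2)      # interleave: (C, H, r, W, r)
    return x.reshape(c, h * r, w * r)

# One x2 stage; the claim connects two such stages in sequence.
feat = np.random.rand(64 * 4, 24, 24)   # conv output with 4x channels
up1 = pixel_shuffle(feat, 2)            # shape (64, 48, 48)
```

Applied twice, a 24×24 feature map grows to 96×96, i.e. an overall 4× upscaling.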
Step 2, build the discriminator network:
the discriminator part comprises 8 convolutional layers; as the network deepens, the number of feature maps increases steadily while the feature size shrinks; LeakyReLU is chosen as the activation function, and the probability that the input is an original image is finally obtained through two fully connected layers and a sigmoid activation function;
Step 3, compute the loss function:
the loss consists of two parts, the content loss l^SR_X of the super-resolution (SR) image and the weighted adversarial loss l^SR_Gen of the generator (Gen), i.e. l^SR = l^SR_X + λ·l^SR_Gen for a weighting factor λ, where X denotes a pair of high-resolution and low-resolution images;
the content loss is the Euclidean distance between the generated image and the original image and comprises the Mse loss and the Vgg loss;
the Mse loss is the minimum mean-square error in pixel space:
l^SR_Mse = 1/(r²WH) · Σ_{x=1..rW} Σ_{y=1..rH} (I^HR_{x,y} − G_θG(I^LR)_{x,y})²
where x, y are image coordinates; r is the decimation factor; W and H are the width and height of the low-resolution (LR) image I^LR, so rW and rH are r times the width and height of the low-resolution image; I^HR is the high-resolution image, i.e. the original image; G_θG(I^LR)_{x,y} is the high-resolution image synthesized by the generator network; G_θG denotes the generator, and θG the weights and biases of the L-layer deep network;
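Under the definitions above, the Mse content loss can be sketched directly in NumPy (a plain restatement of the formula; the generator G_θG itself is assumed to exist elsewhere and is not implemented here):

```python
import numpy as np

def mse_content_loss(i_hr: np.ndarray, g_lr: np.ndarray, r: int) -> float:
    """Pixel-space minimum mean-square error between the original
    high-resolution image I^HR and the generated image G_thetaG(I^LR):
    squared differences summed over the rW x rH grid, divided by
    r^2 * W * H, where W and H are the low-resolution dimensions.
    """
    rH, rW = i_hr.shape[:2]
    W, H = rW // r, rH // r            # width/height of the LR image I^LR
    return float(np.sum((i_hr - g_lr) ** 2) / (r * r * W * H))
```

Since r²WH equals rW·rH, this is exactly the per-pixel mean of the squared error, which is a useful sanity check.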
the Vgg loss is computed on a ReLU activation layer of the pre-trained 19-layer Vgg network as the Euclidean distance between the features of the generated image and of the original image: the feature map of a chosen layer of the trained Vgg is extracted, and the feature map of the generated image at that layer is compared with the corresponding feature map of the original image:
l^SR_Vgg/i,j = 1/(W_{i,j}H_{i,j}) · Σ_{x=1..W_{i,j}} Σ_{y=1..H_{i,j}} (φ_{i,j}(I^HR)_{x,y} − φ_{i,j}(G_θG(I^LR))_{x,y})²
where W_{i,j} and H_{i,j} describe the size of each feature map in the VGG network; i, j index the j-th convolution after the i-th max-pooling layer; φ is the feature map output by a given convolutional layer of the VGG network after the activation function, φ_{i,j}(I^HR)_{x,y} is the feature map of the high-resolution image, and φ_{i,j}(G_θG(I^LR))_{x,y} is the feature map of the image synthesized by the generator network; this loss function reflects errors at a higher perceptual level, whereas the mean-square-error term only captures low-level per-pixel errors, so the VGG loss term is also called the perceptual loss term;
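The perceptual term can be sketched once the feature maps φ_{i,j} are available; extracting them requires the pre-trained 19-layer VGG network (via a deep-learning framework), which this NumPy fragment deliberately leaves out — it only evaluates the distance between two given feature maps:

```python
import numpy as np

def feature_loss(phi_hr: np.ndarray, phi_sr: np.ndarray) -> float:
    """VGG-style perceptual loss on already-extracted feature maps
    phi_{i,j}(I^HR) and phi_{i,j}(G(I^LR)) of shape (H_ij, W_ij, C):
    squared Euclidean distance averaged over the W_ij x H_ij grid.
    The feature extraction itself (pre-trained VGG19, ReLU layer i,j)
    is assumed to happen elsewhere.
    """
    h, w = phi_hr.shape[:2]
    return float(np.sum((phi_hr - phi_sr) ** 2) / (w * h))
```

The same distance computed on raw pixels would reduce to the Mse term; computing it on deep features is what makes it "perceptual".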
the adversarial loss l^SR_Gen is used to drive the generator towards a data distribution the discriminator cannot distinguish:
l^SR_Gen = Σ_{n=1..N} −log D_θD(G_θG(I^LR))
where D_θD is the discriminator and θD its weights, which determine the probability that a candidate sample comes from the data distribution; D_θD(G_θG(I^LR)) is the probability that the discriminator predicts the generated image to be an original image; N is the number of samples;
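A short NumPy sketch of the generator's adversarial loss, taking the discriminator's output probabilities as given (the small eps guard against log(0) is an implementation assumption, not part of the formula):

```python
import numpy as np

def adversarial_loss(d_on_generated: np.ndarray) -> float:
    """Generator adversarial loss: sum over N samples of
    -log D_thetaD(G_thetaG(I^LR)), where d_on_generated holds the
    discriminator's probability, per sample, that the generated image
    is an original. Minimising this pushes the generator towards a
    distribution the discriminator cannot tell from real images.
    """
    eps = 1e-12                       # guard against log(0)
    return float(-np.sum(np.log(d_on_generated + eps)))
```

When the discriminator is fully fooled (probabilities near 1) the loss approaches 0; confident rejections (probabilities near 0) are penalised heavily.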
Step 4, perform image enhancement with the Retinex algorithm:
first decompose the input color image into three images according to the R, G and B components of each pixel, representing the intensities of reflected light of different wavelengths in the scene; then compute the relative light-dark relations between pixels within the long-, medium- and short-wave bands to determine the color of each pixel; finally, linearly map the colors in the Retinex chromaticity space to RGB space to obtain the enhanced image;
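A single-scale Retinex sketch in NumPy along these lines (the Gaussian illumination estimate, the sigma value and the final min-max mapping back to RGB range are illustrative assumptions; the patent does not fix these details):

```python
import numpy as np

def gaussian_kernel(sigma: float, radius: int) -> np.ndarray:
    t = np.arange(-radius, radius + 1)
    k = np.exp(-t ** 2 / (2 * sigma ** 2))
    return k / k.sum()

def blur_1d(img: np.ndarray, k: np.ndarray, axis: int) -> np.ndarray:
    """Separable Gaussian blur along one axis with edge padding."""
    pad = len(k) // 2
    img = np.moveaxis(img, axis, 0)
    padded = np.pad(img, [(pad, pad)] + [(0, 0)] * (img.ndim - 1),
                    mode="edge")
    out = sum(w * padded[i:i + img.shape[0]] for i, w in enumerate(k))
    return np.moveaxis(out, 0, axis)

def single_scale_retinex(rgb: np.ndarray, sigma: float = 15.0) -> np.ndarray:
    """Sketch of single-scale Retinex on an (H, W, 3) image in [0, 1]:
    split into the R, G, B planes, estimate illumination with a Gaussian
    blur per plane, keep log(image) - log(illumination) as reflectance,
    and linearly map the result back to display range.
    """
    eps = 1e-6
    k = gaussian_kernel(sigma, radius=int(3 * sigma))
    out = np.empty_like(rgb, dtype=float)
    for c in range(3):                      # one plane per R/G/B component
        plane = rgb[..., c].astype(float) + eps
        illum = blur_1d(blur_1d(plane, k, 0), k, 1) + eps
        refl = np.log(plane) - np.log(illum)
        # linear map of the Retinex output back into [0, 1]
        out[..., c] = (refl - refl.min()) / (refl.max() - refl.min() + eps)
    return out
```

Production Retinex variants (multi-scale, with color restoration) differ mainly in how many blur scales they combine.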
Step 5, feature extraction:
on the basis of the image reconstruction and enhancement of steps 1-4, extract color features with HSV histograms and texture features with SILTP;
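The two descriptors of step 5 can be sketched as follows; the joint HSV histogram is standard, while the SILTP variant below is a simplified 4-neighbour illustration (the published SILTP descriptor uses 8 neighbours; channel ranges are assumed pre-normalised to [0, 1]):

```python
import numpy as np

def hsv_histogram(hsv: np.ndarray, bins=(8, 8, 8)) -> np.ndarray:
    """Joint H/S/V color histogram of an (H, W, 3) image whose channels
    are already scaled to [0, 1]; flattened to one feature vector and
    normalised so the entries sum to 1."""
    hist, _ = np.histogramdd(hsv.reshape(-1, 3), bins=bins,
                             range=[(0, 1)] * 3)
    return hist.ravel() / hsv.shape[0] / hsv.shape[1]

def siltp_codes(gray: np.ndarray, tau: float = 0.3) -> np.ndarray:
    """Simplified SILTP sketch with 4 neighbours: each neighbour is
    encoded 2 if it exceeds (1+tau)*center, 1 if it falls below
    (1-tau)*center, else 0; the four ternary digits are packed into
    one integer per interior pixel. The scale tolerance tau is what
    makes the pattern robust to illumination changes."""
    c = gray[1:-1, 1:-1]
    neighbours = [gray[:-2, 1:-1], gray[2:, 1:-1],
                  gray[1:-1, :-2], gray[1:-1, 2:]]
    code = np.zeros_like(c, dtype=int)
    for n in neighbours:
        digit = np.where(n > (1 + tau) * c, 2,
                         np.where(n < (1 - tau) * c, 1, 0))
        code = code * 3 + digit
    return code
```

A histogram over the SILTP codes, taken per local region, would then serve as the texture feature.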
Step 6, extract the LOMO feature:
down-sample the original image twice with 2×2 average pooling, extract the LOMO feature from all three resulting images, and concatenate the features of the three images into one feature vector; then suppress especially large values in the feature vector with a log transform, and finally re-normalize the vector to unit length;
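A NumPy sketch of this three-scale pipeline, with a stand-in `extract` callable in place of the full LOMO descriptor (which pools maximal local HSV/SILTP occurrences and is not reproduced here):

```python
import numpy as np

def avg_pool2(img: np.ndarray) -> np.ndarray:
    """2x2 average pooling of a grayscale image (even height/width)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def lomo_style_vector(img: np.ndarray, extract) -> np.ndarray:
    """Step-6 pipeline: pool the image twice with 2x2 average pooling,
    run the same feature extractor on all three scales, concatenate,
    damp especially large entries with a log transform, and
    re-normalise the result to unit length."""
    scales = [img, avg_pool2(img), avg_pool2(avg_pool2(img))]
    feat = np.concatenate([extract(s) for s in scales])
    feat = np.log(1.0 + feat)          # suppress especially large values
    return feat / (np.linalg.norm(feat) + 1e-12)

# Illustrative stand-in extractor: a 16-bin intensity histogram.
hist16 = lambda s: np.histogram(s, bins=16, range=(0, 1))[0].astype(float)
```

Any per-scale descriptor with non-negative entries can be dropped in for `extract`; the log/renormalise steps are unchanged.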
Step 7, use the XQDA method for dimensionality reduction and distance metric learning on the feature space:
let Δ = xi − xj denote the feature difference between two samples; the difference Δ between similar samples follows the zero-mean intra-class Gaussian distribution P(Δ|Ω_I), and the difference Δ between dissimilar samples follows the zero-mean inter-class Gaussian distribution P(Δ|Ω_E):
f(Δ) = Δᵀ(Σ_I⁻¹ − Σ_E⁻¹)Δ (6)
where T is the matrix transposition, d(xi, xj) is the distance function, and Σ_I and Σ_E are the covariance matrices of the similar and dissimilar sample-pair sets, respectively; f(Δ) measures the distance between the two samples: if f(Δ) > 0, the two are most likely not similar;
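A NumPy sketch of the covariance estimates and the distance of formula (6); the toy covariances in the usage below are illustrative:

```python
import numpy as np

def pairset_covariance(diffs: np.ndarray) -> np.ndarray:
    """Zero-mean covariance of a set of sample-pair differences Delta,
    one difference x_i - x_j per row (the distributions above have
    mean 0, so no centering is applied)."""
    return diffs.T @ diffs / len(diffs)

def xqda_distance(delta: np.ndarray, sigma_i: np.ndarray,
                  sigma_e: np.ndarray) -> float:
    """f(Delta) = Delta^T (Sigma_I^-1 - Sigma_E^-1) Delta, the
    likelihood-ratio distance of formula (6): smaller values rank a
    pair as more likely to be the same pedestrian."""
    m = np.linalg.inv(sigma_i) - np.linalg.inv(sigma_e)
    return float(delta @ m @ delta)
```

With a tight intra-class covariance and a broad inter-class one, small differences score lower than large ones, which is what the ranking relies on.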
the XQDA method performs principal component analysis and similarity learning simultaneously: by learning a mapping matrix W ∈ R^{d×r} (r < d), the original features xi, xj ∈ R^d are mapped to a low-dimensional subspace, achieving feature dimensionality reduction and improving the accuracy of feature matching; the matrix W consists of the eigenvectors corresponding to the r largest eigenvalues of the principal component analysis, and d is the feature dimension; the distance function defined in formula (6) therefore becomes:
f(Δ) = (x−z)ᵀW(Σ′_I⁻¹ − Σ′_E⁻¹)Wᵀ(x−z) (7)
where T is the matrix transposition; Σ′_I = WᵀΣ_IW and Σ′_E = WᵀΣ_EW are the projected covariance matrices of the similar and dissimilar sample-pair sets, respectively; x is a sample point from one view and z is a sample point from the other view;
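Formula (7) with the mapping matrix W can be sketched as follows (full XQDA learns W from a generalised eigenproblem; the PCA-based construction below follows the text's own simplified description, and the sample data is illustrative):

```python
import numpy as np

def pca_projection(samples: np.ndarray, r: int) -> np.ndarray:
    """W in R^{d x r}: eigenvectors of the data covariance belonging to
    the r largest eigenvalues, as the text specifies."""
    cov = np.cov(samples.T)
    vals, vecs = np.linalg.eigh(cov)           # eigenvalues ascending
    return vecs[:, np.argsort(vals)[::-1][:r]]

def projected_distance(x: np.ndarray, z: np.ndarray, w: np.ndarray,
                       sigma_i: np.ndarray, sigma_e: np.ndarray) -> float:
    """Formula (7):
    f(Delta) = (x-z)^T W (Sigma'_I^-1 - Sigma'_E^-1) W^T (x-z),
    with Sigma'_I = W^T Sigma_I W and Sigma'_E = W^T Sigma_E W, so the
    inversions happen in the cheap r-dimensional subspace."""
    si = np.linalg.inv(w.T @ sigma_i @ w)
    se = np.linalg.inv(w.T @ sigma_e @ w)
    d = x - z
    return float(d @ w @ (si - se) @ w.T @ d)
```

Because the quadratic form lives in the r-dimensional subspace, both matrix inversions shrink from d×d to r×r, which is the practical payoff of the dimensionality reduction.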
Step 8: output recognition result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910208515.3A CN110084108A (en) | 2019-03-19 | 2019-03-19 | Pedestrian re-identification system and method based on GAN neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110084108A true CN110084108A (en) | 2019-08-02 |
Family
ID=67413310
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110084108A (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108520199A (en) * | 2018-03-04 | 2018-09-11 | 天津大学 | Based on radar image and the human action opener recognition methods for generating confrontation model |
Non-Patent Citations (4)
Title |
---|
LEDIG C等: "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network", 《ARXIV:1609.04802V5 [CS.CV] 25 MAY 2017》 * |
SHENGCAI LIAO等: "Person Re-identification by Local Maximal Occurrence Representation and Metric Learning", 《2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 * |
LI XUEMING: "Image Enhancement Algorithm Based on Retinex Theory", 《Wanfang Academic Journal Database》 * |
JIN CUI et al.: "Pedestrian Re-identification Method Based on Random-Erasing Pedestrian Alignment Network", 《Journal of Shandong University (Engineering Science)》 * |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110688966B (en) * | 2019-09-30 | 2024-01-09 | 华东师范大学 | Semantic guidance pedestrian re-recognition method |
CN110688966A (en) * | 2019-09-30 | 2020-01-14 | 华东师范大学 | Semantic-guided pedestrian re-identification method |
CN110728238A (en) * | 2019-10-12 | 2020-01-24 | 安徽工程大学 | Personnel re-detection method of fusion type neural network |
CN111178178A (en) * | 2019-12-16 | 2020-05-19 | 汇纳科技股份有限公司 | Multi-scale pedestrian re-identification method, system, medium and terminal combined with region distribution |
CN111178178B (en) * | 2019-12-16 | 2023-10-10 | 汇纳科技股份有限公司 | Multi-scale pedestrian re-identification method, system, medium and terminal combined with region distribution |
CN111275694B (en) * | 2020-02-06 | 2020-10-23 | 电子科技大学 | Attention mechanism guided progressive human body division analysis system and method |
CN111275694A (en) * | 2020-02-06 | 2020-06-12 | 电子科技大学 | Attention mechanism guided progressive division human body analytic model and method |
CN113344798A (en) * | 2020-02-18 | 2021-09-03 | 济南浪潮高新科技投资发展有限公司 | Retinex-based dark image enhancement method |
CN111340700A (en) * | 2020-02-21 | 2020-06-26 | 北京中科虹霸科技有限公司 | Model generation method, resolution improvement method, image identification method and device |
CN111340700B (en) * | 2020-02-21 | 2023-04-25 | 北京中科虹霸科技有限公司 | Model generation method, resolution improvement method, image recognition method and device |
CN111460891A (en) * | 2020-03-01 | 2020-07-28 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Automatic driving-oriented vehicle-road cooperative pedestrian re-identification method and system |
CN111476353B (en) * | 2020-04-07 | 2022-07-15 | 中国科学院重庆绿色智能技术研究院 | Super-resolution method of GAN image introducing significance |
CN111476353A (en) * | 2020-04-07 | 2020-07-31 | 中国科学院重庆绿色智能技术研究院 | Super-resolution method of GAN image introducing significance |
CN111582126B (en) * | 2020-04-30 | 2024-02-27 | 浙江工商大学 | Pedestrian re-recognition method based on multi-scale pedestrian contour segmentation fusion |
CN111582126A (en) * | 2020-04-30 | 2020-08-25 | 浙江工商大学 | Pedestrian re-identification method based on multi-scale pedestrian contour segmentation fusion |
CN111539382A (en) * | 2020-05-22 | 2020-08-14 | 支付宝(杭州)信息技术有限公司 | Image recognition model privacy risk assessment method and device and electronic equipment |
CN111402143A (en) * | 2020-06-03 | 2020-07-10 | 腾讯科技(深圳)有限公司 | Image processing method, device, equipment and computer readable storage medium |
CN111402143B (en) * | 2020-06-03 | 2020-09-04 | 腾讯科技(深圳)有限公司 | Image processing method, device, equipment and computer readable storage medium |
CN112016402B (en) * | 2020-08-04 | 2024-05-17 | 杰创智能科技股份有限公司 | Self-adaptive method and device for pedestrian re-recognition field based on unsupervised learning |
CN112016402A (en) * | 2020-08-04 | 2020-12-01 | 杰创智能科技股份有限公司 | Unsupervised learning-based pedestrian re-identification field self-adaption method and device |
CN112200106A (en) * | 2020-10-16 | 2021-01-08 | 中国计量大学 | Cross-camera pedestrian re-identification and tracking method |
CN112348766B (en) * | 2020-11-06 | 2023-04-18 | 天津大学 | Progressive feature stream depth fusion network for surveillance video enhancement |
CN112348766A (en) * | 2020-11-06 | 2021-02-09 | 天津大学 | Progressive feature stream depth fusion network for surveillance video enhancement |
CN112507617A (en) * | 2020-12-03 | 2021-03-16 | 青岛海纳云科技控股有限公司 | Training method of SRFlow super-resolution model and face recognition method |
CN112446361A (en) * | 2020-12-16 | 2021-03-05 | 上海芯翌智能科技有限公司 | Method and equipment for cleaning training data |
CN112906614A (en) * | 2021-03-08 | 2021-06-04 | 中南大学 | Pedestrian re-identification method and device based on attention guidance and storage medium |
CN113128441B (en) * | 2021-04-28 | 2022-10-14 | 安徽大学 | System and method for identifying vehicle weight by embedding structure of attribute and state guidance |
CN113128441A (en) * | 2021-04-28 | 2021-07-16 | 安徽大学 | System and method for identifying vehicle weight by embedding structure of attribute and state guidance |
CN113536995B (en) * | 2021-06-30 | 2022-11-18 | 河南大学 | Pedestrian re-identification method based on feature mapping space and sample judgment |
CN113536995A (en) * | 2021-06-30 | 2021-10-22 | 河南大学 | Pedestrian re-identification method based on feature mapping space and sample judgment |
Legal Events
Date | Code | Title | Description
---|---|---|---
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20190802