CN107885764A

CN107885764A - Based on the quick Hash vehicle retrieval method of multitask deep learning

Info

Publication number: CN107885764A
Application number: CN201710857318.5A
Authority: CN
Inventors: 汤平; 汤一平; 温晓岳; 柳展; 张文广; 樊锦祥
Original assignee: Enjoyor Co Ltd
Current assignee: Yinjiang Technology Co.,Ltd.
Priority date: 2017-09-21
Filing date: 2017-09-21
Publication date: 2018-04-06
Anticipated expiration: 2037-09-21
Also published as: CN107885764B

Abstract

One kind be based on the quick Hash vehicle retrieval method of multitask deep learning, including for deep learning and the multitask depth convolutional neural networks of training identification, the Feature fusion of the compact Hash codes of the segmentation for improving retrieval precision and search method practicality and example aspects, retrieve for being lifted the local sensitivity Hash of performance sort algorithm and cross-module state search method for lifting search engine robustness and accuracy again.First, propose a kind of multitask depth convolutional network speced learning Hash codes method, image, semantic and graphical representation are combined, retrieval precision and refined image feature are improved using the contact between inter-related task, while the vehicle characteristics for arriving study using Image Coding is minimized have more robustness；Secondly, the example aspects of vehicle image are extracted again from feature pyramid network；Then, using local sensitivity Hash, sort method is retrieved to the feature extracted again；Finally, to the special circumstances of enquiring vehicle target image can not be obtained using cross-module state auxiliary vehicle retrieval method.

Description

Based on the quick Hash vehicle retrieval method of multitask deep learning

Technical field

Led the present invention relates to artificial intelligence, Digital Image Processing, convolutional neural networks and computer vision in public safety Application in terms of domain, belongs to intelligent transportation field.

Background technology

In today that smart city, intelligent transportation develop rapidly, large-scale image monitoring and video in public safety system The demand sharp increase of the vehicle identification, vehicle retrieval of database.

In the prior art, for the license board information of the search method of vehicle, mainly extraction target vehicle.Then root then, Motor vehicles to be retrieved are retrieved according to license board information.General specific practice is that the license plate number of vehicle is identified from monitoring image Code, then identify whether the motor vehicles with the number-plate number in other monitoring images again.It is this to rely solely on license plate number Although the method retrieved easily is realized, the motor vehicles for license plate information can not be got, such as false-trademark fake-licensed car Then can not effectively it be retrieved.

Vehicle retrieval technology based on external appearance characteristic not only can overcome the disadvantages that the limitation and deficiency of traditional vehicle board recognition methods, in intelligence Can vehicle retrieval, especially in inspection violating the regulations, accident chase, suspicion of crime Vehicle locking, fake license plate vehicle differentiates, and accelerates criminal investigation Efficiency of solving a case and speed all have very important realistic meaning and very wide application prospect.

Existing vehicle retrieval method is substantially to utilize the whole of sift, surf, dog scheduling algorithm extraction target vehicle image Width characteristics of image, using this feature as target signature, utilize the view picture of each vehicle image in identical algorithm extraction database Characteristics of image, using this feature as feature to be matched, the Euclidean distance between calculating target signature and each feature to be matched will Vehicle is as target vehicle corresponding to the nearest feature to be matched of Euclidean distance.

The vehicle of specific objective is found out in vehicle retrieval requirement in a series of similar vehicle of profiles, and this allows for work change Challenge must be had more；Moreover, it is contemplated that actual conditions, monitors environment, weather condition and illumination condition etc. The influence of factor.

Recent years, deep learning in the technology of computer vision field obtained rapid development, and deep learning can utilize Substantial amounts of training sample and hidden layer successively in depth learn the abstracted information of image, more comprehensively directly obtain characteristics of image. Digital picture is described with matrix, and convolutional neural networks describe the whole of image preferably from local message block Body structure, therefore solve problem using convolutional neural networks mostly in computer vision field, deep learning method.Around Improve accuracy of detection and detection time, depth convolutional neural networks technology is from R-CNN, FasterR-CNN to Fasterer R- CNN.Be embodied in further precision improvement, acceleration, end-to-end and more practical, almost cover from be categorized into detection, point Cut, position every field.It will be that the research for having very much actual application value is led that depth learning technology, which applies to vehicle retrieval, Domain.

It is the technology for the lifting retrieval performance commonly used in image retrieval technologies to sort again, for example, can be by between image pair Visual signature matching relationship initial retrieval result is sorted again.However, the effect that sorts again is greatly dependent on used Whether visual signature can sufficiently effective express image.

In similar vehicle search, due to many vehicles, often profile is much like, and the visual signature extracted can also compare It is similar, different automobile types are cannot be distinguished by, so as to cause this sort method again for directly using the matching relationship between image pair not Similar vehicle can preferably be retrieved.

Query expansion is to be used for improving the common method of recall rate and accuracy rate in retrieval technique.Query expansion technology be by Originally inquire about the method that sentence increases new keyword to inquire about again, such as the inquiry sentence elder generation that search engine can input user Primary retrieval is done, according to the file being retrieved, select suitable keyword, be added to inquiry sentence and retrieve again, look for whereby Go out more associated documents.It can be seen that query expansion can effectively improve the recall rate of information retrieval, but do not have also in the prior art There is this special object of the vehicle proposition being directed in image targetedly enquiry expanding method.

" a kind of vehicle inspection based on similarity study disclosed in the Chinese patent application of Application No. 201510744990.4 Suo Fangfa ", relative to vehicle retrieval method of the tradition based on the number-plate number, the method for proposition not only effectively prevent to be known to car plate The dependence of the other degree of accuracy, title can be retrieved to false-trademark car and fake-licensed car.But the technology still fall within before deep learning when The computer vision technique in generation.

A kind of vehicle retrieval side based on big data disclosed in the Chinese patent application of Application No. 201610671729.0 Method and device, this method include：Extract the brand identity of target vehicle in target vehicle image；Determine every in target vehicle image Individual pixel corresponds to the probability of each mark, wherein, mark includes the one or more in annual test mark, goods of furniture for display rather than for use, ornament；Root According to each pixel in target vehicle image correspond to each mark probability and each mark corresponding to probability threshold value, really Fixed position of each mark in target vehicle image；It is each according to position extraction of each mark in target vehicle image The characteristics of image of individual mark；It is special according to the brand of the characteristics of image of each mark and target vehicle in target vehicle image Sign searched targets vehicle in multiple vehicle images to be retrieved.Although the technology employs depth learning technology, but still belong to In the depth learning technology of single task；But vehicle retrieval belongs to a kind of depth learning technology of typical multitask.

The inquiry that the Chinese patent application of Application No. 201410381577.1 is disclosed in a kind of similar vehicle retrieval is expanded Method and device is opened up, wherein methods described includes：According to the image to be checked including vehicle, the car of the image to be checked is determined Type information；The multiple samples for meeting preparatory condition are chosen in vehicle ATL corresponding to vehicle information from the image to be checked Image；The sample image is formed into query expansion image collection, so that the sample image in the query expansion image collection Inquired about instead of the image to be checked in target database；Wherein, the vehicle ATL includes：Multiple different illumination Under the conditions of vehicle sample image, the vehicle sample image of the vehicle sample image of different shooting angles and different scenes, by upper The method of stating can improve the recall rate and accuracy rate of vehicle image retrieval.The figure in deep learning epoch before this method is still fallen within fact As retrieval technique.

Application No. 201410652730.X Chinese patent application discloses a kind of motor vehicles retrieval based on image Method and device.Methods described includes：Obtain the first image for including motor vehicle information to be retrieved；From described first image Determine the first appearance profile of the motor vehicles to be retrieved；Image in first appearance profile is divided into multiple areas Domain, the characteristics of image of regional is extracted using different step-lengths；Merge the characteristics of image of regional, obtain to be retrieved motor-driven The general image feature of vehicle；By the general image feature of the motor vehicles to be retrieved and the subject vehicle that in advance extracts General image feature be compared, obtain comparison result.The image inspection in deep learning epoch before this method is still fallen within fact Rope technology.

The content of the invention

For how efficiently to utilize caused massive video data in public safety field, the vehicle in big data epoch is lifted The problems such as recall precision, the present invention propose one kind and are based on the quick Hash search method of multitask deep learning, effectively utilize inspection Relevance, the diversity of bayonet vehicle essential information surveyed between identification mission realize the purpose of real-time retrieval；It is final to provide A kind of quick Hash vehicle retrieval method of multitask deep learning that retrieval precision is high, robustness is good.

The technical solution adopted for the present invention to solve the technical problems is：

One kind is based on the quick Hash vehicle retrieval method of multitask deep learning, and the vehicle retrieval method includes following step Suddenly：

The first step, build the multitask depth convolutional neural networks for deep learning and training identification；

Second step, using the Feature fusion for being segmented compact Hash codes and example aspects；

3rd step, using local sensitivity Hash sort algorithm again；

4th step, using cross-module state search method, realize vehicle retrieval.

Further, in the first step, using facilities networks of the Faster R-CNN as multitask convolutional neural networks；Net Network foremost is 3 × 3 convolutional networks, referred to as conv1, is named as conv2_x followed by 4 stacking convolution modules and arrives Conv5_x, each module contain { 2,3,3,3 } individual unit respectively, and conv1 to conv4_3 is as shared network；It is followed by RPN, i.e. region suggest network, and RPN networks are using any scalogram picture as input, the collection of output rectangular target Suggestion box Close, each frame includes 4 position coordinates variables and a score；For formation zone Suggestion box, in last shared volume Small network is slided in the convolution Feature Mapping of lamination output, this network is connected to n × n of input convolution Feature Mapping sky entirely Between on window；Each sliding window is mapped on a low-dimensional vector, and a sliding window of each Feature Mapping is corresponding one Numerical value；This vector exports the layer of the full connection at the same level to two；

Described RPN；RPN networks export the set of rectangular target Suggestion box using any scalogram picture as inputting, Each frame includes 4 position coordinates variables and a score；The target of described rectangular target Suggestion box refers to Vehicle Object；

It is the estimated probability of target/non-targeted to each Suggestion box, is the classification realized with the softmax layers of two classification Layer；K Suggestion box is parameterized by the corresponding k Suggestion box for being referred to as anchor；

Each anchor is centered on current sliding window mouth center, and a kind of corresponding yardstick and length-width ratio, uses 3 kinds of yardsticks With 3 kinds of length-width ratios, so just there is k=9 anchor in each sliding position；

In order to train RPN networks, a binary label is distributed to each anchor, is to mark the anchor with this It is not target；Then distribute positive label and give this two class anchor：(I) with some real target bounding box, i.e. Ground Truth, GT has the ratio between highest IoU, i.e. Interse-ction-over-Union, common factor union, overlapping anchor；(II) it is and any GT bounding boxs have the overlapping anchor of the IoU more than 0.7；Notice that a GT bounding box may give multiple anchor distribution positive mark Label；The IoU ratios that the negative label of distribution gives all GT bounding boxs are below 0.3 anchor；Anon-normal non-negative anchor is to instruction Practicing target does not have any effect, then abandons；

The multitask loss in Faster R-CNN is followed, minimizes object function；The loss function of one image is determined Justice is：

Here, i is anchor index, p_iIt is the prediction probability that anchor is the i-th target, if anchor is Just, GT labelsIt is exactly 1, if anchor is negative,It is exactly 0；t_iIt is a vector, represents 4 parameters of the bounding box of prediction Change coordinate,It is the coordinate vector of GT bounding boxs corresponding with positive anchor；λ is a balance weight, N_clsIt is the normalizing of cls items Change value be mini-batch size, N_regBe reg items normalized value be anchor positions quantity, Classification Loss function L_cls Two classifications, i.e. the logarithm loss of motor vehicles target vs. road backgrounds：

For returning loss function L_reg, defined to minor function：

In formula, L_regTo return loss function, R is the loss function of robust, and smooth L are calculated with formula (4)₁；

In formula, smooth_L1For smooth L₁Loss function, x are variable.

Further, in the multitask depth convolutional neural networks, it is heavy to closing to design a kind of multitask object function Want；Multitask object function is represented with formula (5)；

In formula,It is input feature valueWith weight parameter w^tMapping function, L () is loss function, Φ (w^t) be weight parameter regularization value, T is general assignment number, is designated as the training data of t-th of taskWherein t ∈ (1, T), i ∈ (1, N), N are total number of training,The respectively characteristic vector of the i-th sample and mark label；

For loss function, coordinate log-likelihood cost function to train the feature of last layer using softmax, realize more Task image is classified, and softmax loss functions are defined with formula (6),

In formula, x_iFor the i-th depth characteristic, W_jFor the jth row of weight in last full articulamentum, b is bias term, and m, n divide Sample size and classification number Wei not handled.

Further, in the second step, the Feature fusion process for being segmented compact Hash codes and example aspects is as follows：

In vehicle image feature extraction phases, first, by softmax activation primitives by threshold restriction between [0,1]； Then, promote the output of binary system Hash codes by fragmentation threshold function, Hash is reduced using speced learning and coding strategy The redundancy of intersymbol improves feature robustness；Finally, by way of Fusion Features by speced learning come Hash codes carry out Fusion finally gives vehicle characteristics and is segmented compact Hash codes；

For the example aspects of vehicle, implementation method is：It is each that stacking convolution is shared from conv2_x to conv5_x respectively The output of last unit combination RPN network of module, pyramid pond layer and vectorial flat layer are added to adapt to different chis Very little convolution characteristic pattern input, while one-dimensional characteristic vector, the reality of the vector referred to as vehicle are turned to by convolution three-dimensional feature is flat Example feature；

Finally, the compact feature and example aspects of the compact Hash codes of vehicle subsection are merged to obtain the spy for retrieval again Sign vector.

Described vehicle characteristics are segmented compact Hash codes and are realized by the following method；If share T task, Mei Geren C be present under business^tIndividual classification, uses m^tThe full connection output vector of each task is represented, makes to connect entirely using softmax activation primitives Layer output is calculated between [0,1] with formula (7)；

Wherein, θ represents random hyperplane, m^tRepresent the full connection output vector of each task, c^tRepresent to deposit under each task Classification, q^tRepresent full articulamentum output；

To encourage the output of the binary system of segment encoding module, threshold segmentation function binaryzation is reused：

Wherein, q^tRepresent full articulamentum output, H^tRepresent the binary system output of excitation segment encoding module；

Finally by H^tFusion turns into the compact Hash codes vector f of vehicle subsection_A：

f_A=[α¹H¹；α²H²；...；α^tH^t] (9)

Wherein, f_ARepresent the compact Hash code vector of vehicle subsection, α^tCoefficient is represented, is calculated with formula (10), H^tRepresent Encourage the binary system output of segment encoding module, t ∈ (1, T)：

Wherein, α^tCoefficient is represented, factor alpha is multiplied by before H vectors^tTo make up between different task caused by inequality of classifying Error.

In 3rd step, the characteristic vector for retrieval is by the compact feature and example of the compact Hash codes of vehicle subsection Feature is merged what is obtained, and process is as follows：

{ 4 are selected respectively for conv2_x to conv5_x bottommost layer²,8²,16²,16²Output chi as characteristic pattern It is very little；Be h × w for the size for giving input picture I, convolution convx_x is activated as three-dimensional vector T, size for h ' × w ' × D, contain a series of two dimensional character figure S=S { S_n, n ∈ (1, d), S_nSize correspond to n-th of channel characteristics for h ' × w ' The size of figure；T is sent into pyramid pond layer again and obtains three-dimensional vector T ', size is l × l × d, still comprising series of features Scheme S '=S ' { S '_n, n ∈ (1, d), S '_nSize be l × l, to each S '_nIt is that k × k sliding windows are traveled through using size Choose maximum, S '_nIt is changed into l/k × l/k, then the S ' to each passage_nMerged to obtain one-dimensional vector, successively to d passage Carry out same operation, the personal characteristics vector f finally obtained_BSize is (1, l/k × d)；Final searching characteristic vector f, meter Shown in calculation method such as formula (11)；

F=[f_A；f_B] (11)

In formula, f is used for the characteristic vector of vehicle retrieval, f_BExample aspects vector, i.e. personal characteristics vector, f_ARepresent vehicle It is segmented compact Hash code vector.

In 3rd step, similar sample is mapped in the same same bucket with high probability；Local sensitivity The hash function h () of Hash meets following condition：

s{h(f_Aq)=h (f_A)=sim (f_Aq,f_A) (12)

In formula, sim (f_Aq,f_A) represent f_AqWith f_ASimilarity, h (f_A) represent f_AHash function, h (f_Aq) represent f_Aq's Hash function, similarity measurement therein and a distance function σ direct correlation, are calculated with formula (13),

The typical classification of local sensitivity hash function is provided by accidental projection and threshold value, is calculated with formula (14),

h(f_A)=sign (Wf_A+b) (14)

In formula, W is a random hyperplane vector, and b is a random intercept.

In 3rd step, when query image recycles image by being segmented after compact Hash codes are mapped in similar bucket Example aspects combination formula (15) image returned in bucket is sorted again；The computational methods that sort again such as formula (15) institute Show：

In formula, k represents k-th of image in bucket,Represent penalty factor andCos represents COS distance formula, y tables Show the f before mapping_AqWithIt is whether equal；Y is 1 if equal, is otherwise 0,Represent that k-th of image vehicle subsection is compact Hash code vector, f_AqRepresent the compact Hash code vector of vehicle subsection after sorting againRetrieval result error image is allowed to be looked into input It is farther to ask the distance between image；Dis is smaller to show that similarity is higher.

In 4th step, cross-module state search method is by building one group of deep neural network by image and text data Common semantic space is mapped to by way of feature learning, to realize that the semanteme to different modalities data couples；Using depth Degree convolutional neural networks extract the semantic feature of image modalities directly from input picture, and text is represented by the way of term vector This, the semantic feature of middle extraction text modality is represented with one-dimensional convolutional neural networks from term vector；First, depth convolution god is passed through The compact Hash codes f of segmentation of vehicle is generated through network dynamic_A；Then, by text generation searching characteristic vector, so both are each The characteristic vector of self-generating using same searching system with regard to can be retrieved.

The semantic feature of described text modality is that characteristic vector is extracted from text, first as the extraction algorithm first step First need to split text；Entry of the characteristic vector of text from text, specific steps：

Input：Text O；Output：One group of rough similar image；

STEP1：Initialization：(1) text file analysis is vectorial into entry；(2) small word, repetitor are removed；(3) entry is checked Ensure the correctness of parsing；

STEP2：The minimum vectorial R=(r of entry of random combine are taken out from O₁,r₂,...,r_n)；

STEP3：To R and f_AOrder and the compact Hash codes of segmentation are integrated, and obtain text attribute featureNow f_ATxtDimension is less than R dimension；

STEP4：The hash algorithm that sorted again using local sensitivity is retrieved；

STEP5：Return to similar image group I；

Wherein, text attribute characteristic functionIt is expressed as with formula (16)：

In formula, A^TBe expressed as the transposed matrix of the compact Hash codes of vehicle subsection, R be expressed as the entry minimum of random combine to Amount,For text attribute characteristic function, it is sign function that sign, which is represented,；

In formula,Diagonal matrix is sought in diag expressions,Expression is that characteristic vector is extracted from text, car The compact Hash codes A of segmentation^TIt is initialized as complete 1 vector of (1 × c).

The present invention technical concept be：First, it is proposed that a kind of multitask depth convolutional network speced learning Hash codes side Method, image, semantic and graphical representation are combined, improve retrieval precision using the contact between inter-related task and refined image is special Sign, while the vehicle characteristics for arriving study using Image Coding is minimized have more robustness；Secondly, from feature pyramid network Network extracts the example aspects of vehicle image again；Then, using local sensitivity Hash, sort method is carried out to the feature extracted again Retrieval；Finally, to the special circumstances of enquiring vehicle target image can not be obtained using cross-module state auxiliary vehicle retrieval method.

Text generation searching characteristic vector is identical with the compact Hash code vector of the segmentation that convolutional network generates so that without warp Extra training is crossed, the characteristic vector that both generate can be retrieved using same searching system.

Depth convolutional neural networks model constructed by this patent, as shown in figure 1, be an end-to-end learning system, The model is jointly whole by Text Representation, characteristics of image study, text feature study, the retrieval of cross-module state and the task such as sort again Close among same learning framework.

Beneficial effects of the present invention are mainly manifested in：

1) a kind of contour of the vehicle identification framework of multitask deep learning is devised.It is parallel using the correlation between task Weights in processing procedure share the generalization ability of raising system, weaken influence of the over-fitting to neutral net, so solve because Sample deficiency and caused by grader generalization ability it is not strong the problem of, and attempted heterogeneous networks structure, finally will be interrelated Task merged, make network parameter is shared to reach maximization.

2) segmented method study Hash codes are used to reduce between binary system Hash codes with reference to multitask network structure Redundancy.Each task is responsible for learning a part of Hash codes and connectionless between each other, then by set forth herein Vector Fusion Method obtains accurately each vehicle image feature representation, and this feature is referred to as to the compact feature of segmentation of vehicle；Using shared heap Folded convolutional layer, pyramid pond layer and vectorial flat layer, i.e. Vector flat layer, multiple layer combination construction feature it is golden The example aspects of word tower network acquisition image, enter finally by by the graphical representation of get two kinds of different characteristic dimensional informations The searching characteristic vector that row vector merges to the end again.

3) proposing local sensitivity Hash, ordering searching method carries out Rapid matching to meet to the retrieval character got again The practical application request of intelligent transportation.This search method will inquire about image in storehouse first by the compact Hash codes of segmentation and be mapped to respectively In individual " bucket ", then the minor sort again of image in bucket is filtered out most by vehicle different characteristic dimension using example aspects vector TopK similar image, and using the mapping of coding vector so as to avoiding image from contrasting one by one to reach quick real-time retrieval Effect.

4) for can not obtain enquiring vehicle image information, night camera view is fuzzy or daylight is too strong, The special circumstances such as camera dead angle, the present invention propose cross-module state assisted retrieval mode to meet the actual requirement of varying environment；Root Vehicle characteristics are summed up according to artificial judgement, and then changes into text data feeding retrieval network and realizes assisted retrieval.

Brief description of the drawings

Fig. 1 is that the quick Hash of multitask depth convolutional neural networks retrieves overall network framework；

Fig. 2 is the schematic diagram that sorts again；

Fig. 3 is Text eigenvector generating process explanation figure；

Fig. 4 is RPN network structures；

Fig. 5 is multitask Faster R-CNN depth convolutional network structure charts.

Embodiment

The invention will be further described below in conjunction with the accompanying drawings.

1~Fig. 5 of reference picture, one kind are based on the quick Hash vehicle retrieval method of multitask deep learning, including：

3rd step, using local sensitivity Hash sort algorithm again；

4th step, using cross-module state search method, realize vehicle retrieval.

In the described first step, for deep learning and the multitask depth convolutional neural networks of training identification, such as Fig. 1 institutes Show；Using facilities networks of the Faster R-CNN as multitask convolutional neural networks；Network foremost is 3 × 3 convolutional networks, Referred to as conv1, stack convolution modules followed by 4 and be named as conv2_x to conv5_x, each module contain respectively 2, 3,3,3 } individual unit, conv1 to conv4_3 is as shared network；It is followed by RPN, i.e. network is suggested in region, and RPN networks are by one Individual any scalogram picture exports the set of rectangular target Suggestion box, each frame includes 4 position coordinates variables and one as input Individual score；For formation zone Suggestion box, small net is slided in the convolution Feature Mapping of last shared convolutional layer output Network, this network are connected in n × n of input convolution Feature Mapping spatial window entirely；Each sliding window is mapped to one On low-dimensional vector, the corresponding numerical value of a sliding window of each Feature Mapping；This vector exports at the same level to two complete The layer of connection；

Described RPN, as shown in Figure 4；RPN networks are built any scalogram picture as input, output rectangular target The set of frame is discussed, each frame includes 4 position coordinates variables and a score；What the target of described rectangular target Suggestion box referred to It is Vehicle Object；

In order to train RPN networks, a binary label is distributed to each anchor, is to mark the anchor with this It is not target；Then distribute positive label and give this two class anchor：(I) with some real target bounding box, i.e. GroundTruth, GT has the ratio between highest IoU, i.e. Interse-ction-over-Union, common factor union, overlapping anchor；(II) it is and any GT bounding boxs have the overlapping anchor of the IoU more than 0.7；Notice that a GT bounding box may give multiple anchor distribution positive mark Label；The IoU ratios that the negative label of distribution gives all GT bounding boxs are below 0.3 anchor；Anon-normal non-negative anchor is to instruction Practicing target does not have any effect, then abandons；

There are these to define, it then follows the multitask loss in Faster R-CNN, to minimize object function；To an image Loss function be defined as：

Here, i is anchor index, p_iIt is the prediction probability that anchor is the i-th target, if anchor is Just, GT labelsIt is exactly 1, if anchor is negative,It is exactly 0；t_iIt is a vector, represents 4 parameters of the bounding box of prediction Change coordinate,It is the coordinate vector of GT bounding boxs corresponding with positive anchor；λ is a balance weight, here λ=10, N_clsIt is The normalized value of cls items is mini-batch size, here N_cls=256, N_regThe normalized value for being reg items is anchor positions The quantity put, N_reg=2,400, Classification Loss function L_clsIt is the logarithm of two classifications, i.e. motor vehicles target vs. road backgrounds Loss：

For returning loss function L_reg, defined to minor function：

In formula, smooth_L1For smooth L₁Loss function, x are variable.

The multitask depth convolutional neural networks, as shown in Figure 5；Learnt in order to which multiple-task is merged And training, it is vital to design a kind of multitask object function；Multitask object function is represented with formula (5)；

The described compact Hash codes of segmentation and the Feature fusion of example aspects, as shown in Figure 1；On the one hand, in vehicle The image characteristics extraction stage, first, by softmax activation primitives by threshold restriction between [0,1]；Then, segmentation is passed through Threshold function table promotes the output of binary system Hash codes, uses speced learning and coding strategy reduces the redundancy of Hash intersymbol and come Improve feature robustness；Finally, by way of Fusion Features by speced learning come Hash codes carry out fusion and finally give car Compact Hash codes of Feature Segmentation；

On the other hand, the example aspects on vehicle；Inspired by image pyramid technology, the vehicle of convolutional layer extraction is real Example feature further merges with the compact feature extracted in multitask deep learning vehicle retrieval network so that retrieval result is more It is precisely reliable；Implementation method is：Last unit for stacking each module of convolution is shared from conv2_x to conv5_x respectively With reference to the output of RPN networks, add pyramid pond layer and vectorial flat layer and inputted with adapting to various sizes of convolution characteristic pattern, Simultaneously by convolution three-dimensional feature it is flat turn to one-dimensional characteristic vector, the vector referred to as vehicle example aspects；

f_A=[α¹H¹；α²H²；...；α^tH^t] (9)

The described characteristic vector for being used to retrieve is to enter the compact feature and example aspects of the compact Hash codes of vehicle subsection Row fusion obtains, and concrete methods of realizing is：

F=[f_A；f_B] (11)

The described local sensitivity Hash for being used to being lifted retrieval performance sort algorithm again, algorithm idea as shown in Fig. 2 be by Similar sample is mapped in the same same bucket with high probability；The hash function h () of local sensitivity Hash meet with Lower condition：

s{h(f_Aq)=h (f_A)=sim (f_Aq,f_A) (12)

h(f_A)=sign (Wf_A+b) (14)

In formula, W is a random hyperplane vector, and b is a random intercept.

The Feature fusion of the compact Hash codes of described segmentation and example aspects, in order that similar image closer to, The example aspects combination formula of image is recycled after query image is mapped in similar bucket by the compact Hash codes of segmentation (15) image returned in bucket is sorted again；Shown in the computational methods that sort again such as formula (15)：

In formula, k represents k-th of image in bucket,Represent penalty factor andCos represents COS distance formula, y tables Show the f before mapping_AqWithIt is whether equal；Y is 1 if equal, is otherwise 0,Represent that k-th of image vehicle subsection is compact Hash code vector, f_AqRepresent the compact Hash code vector of vehicle subsection after sorting again；

Add coefficientPurpose is in order to ensure the correctness of LSH mappings, i.e., in the case of identical is segmented compact Hash codes Ability calculated examples characteristic vector similarity, when being mapped into the compact Hash codes of different segmentations in same bucket, use penalty factorAllow Retrieval result error image and the distance between input inquiry image are farther；Dis is smaller to show that similarity is higher.

Described cross-module state search method is that image and text data are passed through into spy by building one group of deep neural network The mode of sign study maps to common semantic space, to realize that the semanteme to different modalities data couples；Using depth convolution Neutral net extracts the semantic feature of image modalities directly from input picture, text is represented by the way of term vector, with one Tie up the semantic feature that convolutional neural networks extract text modality from term vector expression；First, depth convolutional neural networks are passed through The dynamic generation compact Hash codes f of segmentation of vehicle_A；Then, by text generation searching characteristic vector, so both each self-generatings Characteristic vector can just be retrieved using same searching system, specific implementation process it is as shown in Figure 3.

Input：Text O；Output：One group of rough similar image；

STEP5：Return to similar image group I；

The foregoing is only the preferable implementation example of the present invention, be not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent substitution and improvements made etc., it should be included in the scope of the protection.

Claims

1. one kind is based on the quick Hash vehicle retrieval method of multitask deep learning, it is characterised in that：The vehicle retrieval method Comprise the following steps：

3rd step, using local sensitivity Hash sort algorithm again；

4th step, using cross-module state search method, realize vehicle retrieval.

2. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 1, it is characterised in that：It is described In the first step, using facilities networks of the Faster R-CNN as multitask convolutional neural networks；Network foremost is volume 3 × 3 Product network, referred to as conv1, stack convolution module followed by 4 and be named as conv2_x to conv5_x, each module is wrapped respectively { 2,3,3,3 } individual unit is contained, conv1 to conv4_3 is as shared network；RPN is followed by, i.e. network, RPN nets are suggested in region Network exports the set of rectangular target Suggestion box, each frame includes 4 position coordinateses and become using any scalogram picture as input Amount and a score；It is sliding in the convolution Feature Mapping of last shared convolutional layer output for formation zone Suggestion box Small network is moved, this network is connected in n × n of input convolution Feature Mapping spatial window entirely；Each sliding window mapping Onto a low-dimensional vector, the corresponding numerical value of a sliding window of each Feature Mapping；This vector exports same to two The layer of the full connection of level；

Described RPN；RPN networks export the set of rectangular target Suggestion box, each using any scalogram picture as input Frame includes 4 position coordinates variables and a score；The target of described rectangular target Suggestion box refers to Vehicle Object；

It is the estimated probability of target/non-targeted to each Suggestion box, is the classification layer realized with the softmax layers of two classification；K Suggestion box is parameterized by the corresponding k Suggestion box for being referred to as anchor；

Each anchor is centered on current sliding window mouth center, and a kind of corresponding yardstick and length-width ratio, uses 3 kinds of yardsticks and 3 Kind length-width ratio, so just has k=9 anchor in each sliding position；

In order to train RPN networks, a binary label is distributed to each anchor, is to mark the anchor with this Target；Then distribute positive label and give this two class anchor：(I) have with some real target bounding box, i.e. Ground Truth, GT The ratio between highest IoU, i.e. Interse-ction-over-Union, common factor union, overlapping anchor；(II) with any GT bags Enclosing box has the overlapping anchor of the IoU more than 0.7；Notice that a GT bounding box may distribute positive label to multiple anchor； The IoU ratios that the negative label of distribution gives all GT bounding boxs are below 0.3 anchor；Anon-normal non-negative anchor is to training mesh No any effect is marked, then is abandoned；

The multitask loss in Faster R-CNN is followed, minimizes object function；The loss function of one image is defined as：

<mrow> <mi>L</mi> <mrow> <mo>(</mo> <mo>{</mo> <msub> <mi>p</mi> <mi>i</mi> </msub> <mo>}</mo> <mo>,</mo> <mo>{</mo> <msub> <mi>t</mi> <mi>i</mi> </msub> <mo>}</mo> <mo>)</mo> </mrow> <mo>=</mo> <mfrac> <mn>1</mn> <msub> <mi>N</mi> <mrow> <mi>c</mi> <mi>l</mi> <mi>s</mi> </mrow> </msub> </mfrac> <munder> <mo>&Sigma;</mo> <mi>i</mi> </munder> <msub> <mi>L</mi> <mrow> <mi>c</mi> <mi>l</mi> <mi>s</mi> </mrow> </msub> <mrow> <mo>(</mo> <msub> <mi>p</mi> <mi>i</mi> </msub> <mo>,</mo> <msubsup> <mi>p</mi> <mi>i</mi> <mo>*</mo> </msubsup> <mo>)</mo> </mrow> <mo>+</mo> <mi>&lambda;</mi> <mfrac> <mn>1</mn> <msub> <mi>N</mi> <mrow> <mi>r</mi> <mi>e</mi> <mi>g</mi> </mrow> </msub> </mfrac> <munder> <mo>&Sigma;</mo> <mi>i</mi> </munder> <msubsup> <mi>p</mi> <mi>i</mi> <mo>*</mo> </msubsup> <msub> <mi>L</mi> <mrow> <mi>r</mi> <mi>e</mi> <mi>g</mi> </mrow> </msub> <mrow> <mo>(</mo> <msub> <mi>t</mi> <mi>i</mi> </msub> <mo>,</mo> <msubsup> <mi>t</mi> <mi>i</mi> <mo>*</mo> </msubsup> <mo>)</mo> </mrow> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>1</mn> <mo>)</mo> </mrow> </mrow>

Here, i is anchor index, p_iIt is the prediction probability that anchor is the i-th target, if anchor is just, GT marks LabelIt is exactly 1, if anchor is negative,It is exactly 0；t_iIt is a vector, represents 4 parametrization coordinates of the bounding box of prediction,It is the coordinate vector of GT bounding boxs corresponding with positive anchor；λ is a balance weight, N_clsThe normalized value for being cls items is Mini-batch size, N_regBe reg items normalized value be anchor positions quantity, Classification Loss function L_clsIt is two The logarithm loss of classification, i.e. motor vehicles target vs. road backgrounds：

For returning loss function L_reg, defined to minor function：

<mrow> <msub> <mi>smooth</mi> <mrow> <mi>L</mi> <mn>1</mn> </mrow> </msub> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>=</mo> <mfenced open = "{" close = ""> <mtable> <mtr> <mtd> <mrow> <mn>0.5</mn> <msup> <mi>x</mi> <mn>2</mn> </msup> </mrow> </mtd> <mtd> <mtable> <mtr> <mtd> <mrow> <mi>i</mi> <mi>f</mi> </mrow> </mtd> <mtd> <mrow> <mo>|</mo> <mi>x</mi> <mo>|</mo> <mo><</mo> <mn>1</mn> </mrow> </mtd> </mtr> </mtable> </mtd> </mtr> <mtr> <mtd> <mrow> <mo>|</mo> <mi>x</mi> <mo>|</mo> <mo>-</mo> <mn>0.5</mn> </mrow> </mtd> <mtd> <mrow> <mi>o</mi> <mi>t</mi> <mi>h</mi> <mi>e</mi> <mi>r</mi> <mi>w</mi> <mi>i</mi> <mi>s</mi> <mi>e</mi> <mo>,</mo> </mrow> </mtd> </mtr> </mtable> </mfenced> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>4</mn> <mo>)</mo> </mrow> </mrow>

In formula, smooth_L1For smooth L₁Loss function, x are variable.

3. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 1 or 2, it is characterised in that： In the multitask depth convolutional neural networks, it is vital to design a kind of multitask object function；Multitask target letter Several formula (5) represent；

<mrow> <munder> <mrow> <mi>arg</mi> <mi> </mi> <mi>min</mi> </mrow> <msubsup> <mrow> <mo>{</mo> <msup> <mi>w</mi> <mi>t</mi> </msup> <mo>}</mo> </mrow> <mrow> <mi>t</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>T</mi> </msubsup> </munder> <munderover> <mo>&Sigma;</mo> <mrow> <mi>t</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>T</mi> </munderover> <munderover> <mo>&Sigma;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>N</mi> </munderover> <mi>L</mi> <mrow> <mo>(</mo> <msubsup> <mi>y</mi> <mi>i</mi> <mi>t</mi> </msubsup> <mo>,</mo> <mi>f</mi> <mo>(</mo> <mrow> <msubsup> <mi>x</mi> <mi>i</mi> <mi>t</mi> </msubsup> <mo>;</mo> <msup> <mi>w</mi> <mi>t</mi> </msup> </mrow> <mo>)</mo> <mo>)</mo> </mrow> <mo>+</mo> <mi>&Phi;</mi> <mrow> <mo>(</mo> <msup> <mi>w</mi> <mi>t</mi> </msup> <mo>)</mo> </mrow> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>5</mn> <mo>)</mo> </mrow> </mrow>

In formula,It is input feature valueWith weight parameter w^tMapping function, L () is loss function, Φ (w^t) be The regularization value of weight parameter, T are general assignment number, are designated as the training data of t-th of taskWherein t ∈ (1, T), I ∈ (1, N), N are total number of training,The respectively characteristic vector of the i-th sample and mark label；

For loss function, coordinate log-likelihood cost function to train the feature of last layer using softmax, realize multitask Image classification, softmax loss functions are defined with formula (6),

<mrow> <msub> <mi>L</mi> <mi>S</mi> </msub> <mo>=</mo> <mo>-</mo> <munderover> <mo>&Sigma;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>m</mi> </munderover> <mi>log</mi> <mfrac> <msup> <mi>e</mi> <mrow> <msubsup> <mi>W</mi> <mrow> <mi>j</mi> <mi>i</mi> </mrow> <mi>T</mi> </msubsup> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>+</mo> <msub> <mi>b</mi> <mrow> <mi>y</mi> <mi>i</mi> </mrow> </msub> </mrow> </msup> <mrow> <munderover> <mo>&Sigma;</mo> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>n</mi> </munderover> <msup> <mi>e</mi> <mrow> <msubsup> <mi>W</mi> <mi>j</mi> <mi>T</mi> </msubsup> <msub> <mi>x</mi> <mi>j</mi> </msub> <mo>+</mo> <msub> <mi>b</mi> <mi>j</mi> </msub> </mrow> </msup> </mrow> </mfrac> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>6</mn> <mo>)</mo> </mrow> </mrow>

In formula, x_iFor the i-th depth characteristic, W_jFor the jth row of weight in last full articulamentum, b is bias term, and m, n are respectively Handle sample size and classification number.

4. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 1 or 2, it is characterised in that： In the second step, the Feature fusion process for being segmented compact Hash codes and example aspects is as follows：

In vehicle image feature extraction phases, first, by softmax activation primitives by threshold restriction between [0,1]；So Afterwards, promote the output of binary system Hash codes by fragmentation threshold function, Hash codes are reduced using speced learning and coding strategy Between redundancy improve feature robustness；Finally, by way of Fusion Features by speced learning come Hash codes melted Conjunction finally gives vehicle characteristics and is segmented compact Hash codes；

For the example aspects of vehicle, implementation method is：Shared respectively from conv2_x to conv5_x and stack each module of convolution Last unit combination RPN network output, add pyramid pond layer and vectorial flat layer it is various sizes of to adapt to Convolution characteristic pattern input, while by convolution three-dimensional feature it is flat turn to one-dimensional characteristic vector, the vector be referred to as vehicle example spy Sign；

Finally, by the compact feature and example aspects of the compact Hash codes of vehicle subsection merge to obtain again feature for retrieval to Amount.

5. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 4, it is characterised in that：It is described Vehicle characteristics be segmented compact Hash codes and be realized by the following method；If sharing T task, c be present under each task^tIt is individual Classification, use m^tRepresent the full connection output vector of each task, using softmax activation primitives make the output of full articulamentum [0, 1] between, calculated with formula (7)；

<mrow> <msup> <mi>q</mi> <mi>t</mi> </msup> <mo>=</mo> <mi>s</mi> <mi>o</mi> <mi>f</mi> <mi>t</mi> <mi>max</mi> <mrow> <mo>(</mo> <msup> <mi>m</mi> <mi>t</mi> </msup> <mo>)</mo> </mrow> <mo>=</mo> <mfrac> <msup> <mi>e</mi> <mrow> <msup> <mi>&theta;</mi> <mi>T</mi> </msup> <msup> <mi>m</mi> <mi>t</mi> </msup> </mrow> </msup> <mrow> <msubsup> <mo>&Sigma;</mo> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <msup> <mi>c</mi> <mi>t</mi> </msup> </msubsup> <msup> <mi>e</mi> <mrow> <msup> <mi>&theta;</mi> <mi>T</mi> </msup> <msubsup> <mi>m</mi> <mi>i</mi> <mi>t</mi> </msubsup> </mrow> </msup> </mrow> </mfrac> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>7</mn> <mo>)</mo> </mrow> </mrow>

Wherein, θ represents random hyperplane, m^tRepresent the full connection output vector of each task, c^tRepresent existing under each task Classification, q^tRepresent full articulamentum output；

f_A=[α¹H¹；α²H²；...；α^tH^t] (9)

Wherein, f_ARepresent the compact Hash code vector of vehicle subsection, α^tCoefficient is represented, is calculated with formula (10), H^tRepresent excitation The binary system output of segment encoding module, t ∈ (1, T)：

<mrow> <msup> <mi>&alpha;</mi> <mi>t</mi> </msup> <mo>=</mo> <mn>1</mn> <mo>-</mo> <mfrac> <mrow> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <msup> <mi>c</mi> <mi>t</mi> </msup> </msubsup> <msubsup> <mi>c</mi> <mi>j</mi> <mi>t</mi> </msubsup> </mrow> <mrow> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>T</mi> </msubsup> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <msup> <mi>c</mi> <mi>t</mi> </msup> </msubsup> <msubsup> <mi>c</mi> <mi>j</mi> <mi>i</mi> </msubsup> </mrow> </mfrac> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>10</mn> <mo>)</mo> </mrow> </mrow>

Wherein, α^tCoefficient is represented, factor alpha is multiplied by before H vectors^tFor make up between different task because classify it is uneven caused by by mistake Difference.

6. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 5, it is characterised in that：It is described In 3rd step, the characteristic vector for retrieval is to be merged the compact feature and example aspects of the compact Hash codes of vehicle subsection Obtain, process is as follows：

{ 4 are selected respectively for conv2_x to conv5_x bottommost layer²,8²,16²,16²Output Size as characteristic pattern；It is right It is h × w in given input picture I size, convolution convx_x is activated as three-dimensional vector T, and size is h ' × w ' × d, comprising A series of two dimensional character figure S=S { S_n, n ∈ (1, d), S_nSize correspond to the big of n-th channel characteristics figure for h ' × w ' It is small；T to be sent into pyramid pond layer again and obtains three-dimensional vector T ', size is l × l × d, still comprising series of features figure S '= S′{S′_n, n ∈ (1, d), S '_nSize be l × l, to each S '_nIt is that k × k sliding windows carry out traversal selection most using size Big value, S '_nIt is changed into l/k × l/k, then the S ' to each passage_nMerged to obtain one-dimensional vector, phase is carried out to d passage successively Biconditional operation, the personal characteristics vector f finally obtained_BSize is (1, l/k × d)；Final searching characteristic vector f, computational methods As shown in formula (11)；

F=[f_A；f_B] (11)

In formula, f is used for the characteristic vector of vehicle retrieval, f_BExample aspects vector, i.e. personal characteristics vector, f_ARepresent vehicle subsection Compact Hash code vector.

7. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 6, it is characterised in that：It is described In 3rd step, similar sample is mapped in the same same bucket with high probability；The Hash letter of local sensitivity Hash Number h () meets following condition：

s{h(f_Aq)=h (f_A)=sim (f_Aq,f_A) (12)

In formula, sim (f_Aq,f_A) represent f_AqWith f_ASimilarity, h (f_A) represent f_AHash function, h (f_Aq) represent f_AqHash Function, similarity measurement therein and a distance function σ direct correlation, are calculated with formula (13),

<mrow> <mi>s</mi> <mi>i</mi> <mi>m</mi> <mrow> <mo>(</mo> <msub> <mi>f</mi> <mrow> <mi>A</mi> <mi>q</mi> </mrow> </msub> <mo>,</mo> <msub> <mi>f</mi> <mi>A</mi> </msub> <mo>)</mo> </mrow> <mo>=</mo> <mi>exp</mi> <mo>{</mo> <mo>-</mo> <mfrac> <mrow> <mo>|</mo> <mo>|</mo> <msub> <mi>f</mi> <mrow> <mi>A</mi> <mi>q</mi> </mrow> </msub> <mo>-</mo> <msub> <mi>f</mi> <mi>A</mi> </msub> <mo>|</mo> <mo>|</mo> </mrow> <msup> <mi>&sigma;</mi> <mn>2</mn> </msup> </mfrac> <mo>}</mo> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>13</mn> <mo>)</mo> </mrow> </mrow>

h(f_A)=sign (Wf_A+b) (14)

In formula, W is a random hyperplane vector, and b is a random intercept.

8. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 7, it is characterised in that：It is described In 3rd step, the example aspects knot of image is recycled after query image is mapped in similar bucket by the compact Hash codes of segmentation Formula (15) is closed to sort to the image returned in bucket again；Shown in the computational methods that sort again such as formula (15)：

In formula, k represents k-th of image in bucket,Represent penalty factor andCos represents COS distance formula, and y represents to reflect F before penetrating_AqWithIt is whether equal；Y is 1 if equal, is otherwise 0,Represent k-th of compact Hash of image vehicle subsection Code vector, f_AqRepresent the compact Hash code vector of vehicle subsection after sorting againAllow retrieval result error image and input inquiry figure The distance between picture is farther；Dis is smaller to show that similarity is higher.

9. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 1 or 2, it is characterised in that： In 4th step, cross-module state search method is that image and text data are passed through into feature by building one group of deep neural network The mode of study maps to common semantic space, to realize that the semanteme to different modalities data couples；Using depth convolution god The semantic feature of image modalities is extracted directly from input picture through network, represents text by the way of term vector, use is one-dimensional Convolutional neural networks extract the semantic feature of text modality from term vector expression；First, moved by depth convolutional neural networks State generates the compact Hash codes f of segmentation of vehicle_A；Then, by text generation searching characteristic vector, so both each self-generatings Characteristic vector using same searching system with regard to can be retrieved.

10. it is based on the quick Hash vehicle retrieval method of multitask deep learning as claimed in claim 9, it is characterised in that：Institute The semantic feature for the text modality stated is that characteristic vector is extracted from text, as the extraction algorithm first step firstly the need of fractionation Text；Entry of the characteristic vector of text from text, specific steps：

Input：Text O；Output：One group of rough similar image；

STEP1：Initialization：(1) text file analysis is vectorial into entry；(2) small word, repetitor are removed；(3) check that entry ensures The correctness of parsing；

STEP3：To R and f_AOrder and the compact Hash codes of segmentation are integrated, and obtain text attribute featureF now_ATxt Dimension is less than R dimension；

STEP5：Return to similar image group I；

In formula, A^TThe transposed matrix of the compact Hash codes of vehicle subsection is expressed as, R is expressed as the minimum vector of entry of random combine,For text attribute characteristic function, it is sign function that sign, which is represented,；

In formula,Diagonal matrix is sought in diag expressions,Expression is that characteristic vector is extracted from text, vehicle point The compact Hash codes A of section^TIt is initialized as complete 1 vector of (1 × c).