CN110134803A

CN110134803A - Image data method for quickly retrieving based on Hash study

Info

Publication number: CN110134803A
Application number: CN201910415146.5A
Authority: CN
Inventors: 王红滨; 纪斯佳; 张毅; 周连科; 王念滨; 童鹏鹏; 崔琎
Original assignee: Harbin Engineering University
Current assignee: Harbin Engineering University
Priority date: 2019-05-17
Filing date: 2019-05-17
Publication date: 2019-08-16
Anticipated expiration: 2039-05-17
Also published as: CN110134803B

Abstract

Based on the image data method for quickly retrieving of Hash study, it is related to image data method for quickly retrieving, belongs to data retrieval technology field.In order to solve the problems, such as that existing model can make model deviation occur in training stage negative feedback process in Hash codes generation phase using multiple relaxation.Depth Hash model of the invention includes five convolution-pond layer, two full articulamentums, characteristic layer, Hash layer and output layer；And be trained based on triple constraint, after obtaining trained depth Hash model, using depth Hash model foundation sample database, sample database is made of image pattern and corresponding Hash codes；For query image, the Hash codes of query image are generated using trained depth Hash model；It is retrieved using the Hash codes and image pattern library of query image.The present invention is suitable for image data retrieval.

Description

Image data method for quickly retrieving based on Hash study

Technical field

The present invention relates to image data method for quickly retrieving, belong to data retrieval technology field.

Background technique

The fast development of internet in recent years, high dimensional data show exponential form development, how using these data at For the focus of every profession and trade.Researchers propose many methods to large-scale data retrieval in time in the past, and hash method has Its efficient storage and computational efficiency and be widely used (LI WuJun, ZHOU ZhiHua. big data Hash study: status With trend [J] Science Bulletin, 2015,60 (Z1): 485-490.).Traditional hash method includes that local sensitivity Hash and spectrum are breathed out It is uncommon, certain achievement is obtained on image retrieval, but there are still a distances away from practical application.The quick hair of deep learning Exhibition promotes the progress of hash method, combines convolutional neural networks to propose convolution mind for the first time by Pan Yan and Yan Shuicheng within 2014 Through network Hash model (Convolutional Neural Network Hashing, CNNH) (R.Xia, Y.Pan, H.Lai, et al.Supervised hashing for image retrieval via image representation Learning [C] .AAAI Conference on Artificial Intelligence, 2014.), compare more traditional Hash Method achieves better effect.CNNH is divided into two stages and is trained to Hash codes, and the first step is by similar matrix S points Solution, whether the sample image of each of matrix S element representation element row and column is similar, each row of matrix H is The approximate Hash codes of training data.The characteristics of image that image in model shows in the training process cannot react on Hash The generation of code, can not Hamming distance between dynamic regulation Hash codes, the advantages of also can not just using convolutional neural networks Cause the hash function suboptimum learnt.On this basis, Li W J, Wang S, Kang W C et al. is in " Feature Learning based Deep Supervised Hashing with Pairwise Labels " in propose depth nerve Network Hash model (Deep Neural Network Hashing, DNNH), H.Liu, R.Wang, S.Shan et al. exist Depth is proposed in " Deep Supervised Hashing for Fast Image Retrieval " supervision Hash model (Deep Supervised Hashing, DSH).Two kinds of models overcome CNNH spy using model end to end in itself The problem of sign is extracted and Hash coding separates, allowable loss function is for generating Hash codes from different perspectives.But both moulds Type is carrying out Hash codes generation phase using repeatedly relaxation, and model can be made deviation occur in training stage negative feedback process, caused It is not accurate enough that image data retrieval is carried out using the trained model of generation.

Summary of the invention

The present invention can make model in the training stage to solve existing model in Hash codes generation phase using multiple relaxation There is the problem of deviation in negative feedback process, provides the image data method for quickly retrieving based on Hash study.

Image data method for quickly retrieving of the present invention based on Hash study, comprising the following steps:

Step 1 establishes depth Hash model:

Depth Hash model includes five convolution-pond layer, two full articulamentums, characteristic layer, Hash layer and output layer；

Step 2, training depth Hash model:

Training data is a series of data set { (p with label₁, w₁), (p₂, w₂), (p₃, w₃) ... (p_n, w_n), Middle p_iFor sample image, w_iIt is the label of correspondence image sample；

Input is triple label { p_i, p_j, p_k, wherein p_iAnd p_jFor same category, p_iAnd p_kIt is mutually similar to be different classes of Similarity distance between not is less than the similitude between different classes of；

After obtaining trained depth Hash model, using depth Hash model foundation sample database, sample database is by image sample This and corresponding Hash codes are constituted；

Step 3 is directed to query image, and the Hash codes of query image are generated using trained depth Hash model；

Step 4 is retrieved using the Hash codes and image pattern library of query image.

Further, the process that the Hash codes using query image are retrieved with image pattern library includes following Step:

If Hash codes corresponding to image are p in sample database_i={ h_{I, 1}, h_{I, 2}, h_{I, 3}..., h_{I, m}, query image is corresponding Hash codes are p_query={ h_{Query, 1}, h_{Query, 2}, h_{Query, 3}..., h_{Query, m}, then in Hamming space, the ε neighbour of query image It is expressed as NN (p_query, ε)=p | | | p_query-p_i||₂< ε }；

Pass through | p_query-p_i||₂< ε obtains ε neighbour's set p of query sample；

Count the bit S of " 0 " or " 1 " large percentage in each in all Hash codes in K-NN search sample set p ={ S₁,S₂,S₃,…,S_m},S_i∈{0,1}；

Count the probability P (S in K-NN search sample set p in all Hash codes in each bit for " 0 "₀) ={ P (S_1,0),P(S_2,0),P(S_3,0),…,P(S_m,0), wherein P (S_i,0)∈[0,1]；

P(S_i,0)=∑ (S_i=0, NN (p_query,ε))/count(NN(p_query, ε)), count is to meet certain in Hash codes It is the quantity of " 0 " in a bit；

Count the probability P (S in K-NN search sample set p in all Hash codes in each bit for " 1 "₁) ={ P (S_1,1),P(S_2,1),P(S_3,1),…,P(S_m,1), wherein P (S_i,1)∈[0,1]；

P(S_i,1)=∑ (S_i=1, NN (p_query,ε))/count(NN(p_query, ε)), count is to meet certain in Hash codes It is the quantity of " 1 " in a bit；

Pass through ω_i=1+max (P (S_i,0),P(S_i,1))/m determines everybody weight ω={ ω of Hash codes₁,ω₂, ω₃,…,ω_m}；

Then Hash codes p corresponding to image in everybody weight computing sample database of Hash codes is utilized_iWith query image pair The Hash codes p answered_queryWeighting Hamming distanceBy weighting Hamming distance From the image inquired in database and sequence for determining query image.

Further, ε=2 in the ε neighbour.

Further, the Hash code length that the characteristic layer length is 4 times.

Further, the Hash layer is used to constraint as constraint condition, and what which inputted is the feature of characteristic layer Vector, i.e. input are the { p to constraint_i,p_j,w_ij}；w_ijThe sample for indicating that two feature vectors represent when=1 is similar, w_ij The sample for indicating that two feature vectors represent when=0 is inhomogeneous；It is F by the feature vector that characteristic layer generates_i, F_j∈R^d, Being mapped to output on hash space is bi, bj ∈ { -1,1 }^m, then dist_H(b_i,b_j) Hamming space between bi and bj；Damage It is as follows to lose function:

The feature vector of sample image passes through tanh function before generating Hash codes after by Hash layer, last Hash codes are by being only final b after slack variable_iAnd b_j, the value before carrying out relaxation is u_iAnd u_j, u_i, u_j∈R^m； Variable u before relaxation is used in the calculating of loss function_iAnd u_jInstead of Hash codes b_iAnd b_j, loss function are as follows:

In formula, m is the length of Hash codes, and α is preference norm weight.

Further, on-line training, creation are carried out using small-scale data during the trained depth Hash model Small-scale triple is to follow following rule: (1) the selection sample number of different labels is determined from small lot, selection is minimum Exemplar number；(2) a certain label is shuffled at random, selects the anchor p of i and i+1 as triple in sample_iWith Positive example p_j；(3) other exemplars i is randomly choosed as triple p_iNegative example p_k；(4) whole labels and whole are recycled Sample generates containing anchor, positive example and bears exemplary random combine.

Present invention feature the most prominent and significant beneficial effect are:

The present invention verifies depth proposed by the present invention by the comparative experiments of completion depth Hash network structure and shuffle algorithm There is degree Hash network structure preferable superiority and shuffle algorithm to have more preferably for the image retrieval based on Hash codes Visual effect.Usually all it is to compare similitude by comparing Hamming distance mode in the research of previous hash function, works as number According to it is larger when Hamming distance discrimination can be insufficient, the present invention, as indexing, further discriminates between identical Hamming by Hash codes The similarity returned the result apart from size, obtains that similarity is higher to be returned the result.Our experiments show that the present invention is based on Hash The method of study compares other methods better performance on CIFAR-10 and NUS-WIDE.

Detailed description of the invention

Fig. 1 is depth Hash network structure；

Fig. 2 is the different classifications schematic diagram of the Sino-Kazakhstan uncommon code of neighbouring set；

Fig. 3 is CIFAR-10 comparative result figure；

Fig. 4 is NUS-WIDE comparative result figure；

Fig. 5 is visualized experiment result；

Fig. 6 is to reset Algorithm Demo experimental result；

Fig. 7 is 24bit accuracy rate.

Specific embodiment

Specific embodiment 1:

The image data method for quickly retrieving based on Hash study of present embodiment, specifically includes the following steps:

1, depth Hash model is established:

Depth Hash model includes five convolution-pond layer, two full articulamentums, characteristic layer, Hash layer and output layer；It is special Levy the feature vector of layer output certain length；Then feature vector is mapped to Hash codes by Hash layer.Model structure such as Fig. 1 It is shown, shown in design parameter table 1.

1 model parameter of table

In structure each full articulamentum by 500 × 1 neuron of single layer and activation primitive form.Wherein full articulamentum is made With being each feature for connecting intermediate features layer, the relationship between extraction feature corresponds to Hash codes difference by Hash layer Bit on, so that different sample images be made to generate the biggish Hash codes of Hamming distance.

2, training depth Hash model:

Training data is a series of data set { (p with label₁, w₁), (p₂, w₂), (p₃, w₃) ... (p_n, w_n), Middle p_iSample image, w_iIt is the label of correspondence image sample；

Triple label { p_i, p_j, p_kIndicate ternary constraint, relationship is closed between representative sample, under a certain kind measurement p_iAnd p_jDistance be less than p_iAnd p_kThe distance between；It is more preferable that ternary constrains in classifying quality in hands-on of the invention, for The adaptability of model is also more preferable.Input is triple label { p_i, p_j, p_k, wherein p_iAnd p_jFor same category, p_iAnd p_kFor not Generic, the similarity distance between the same category is less than the similitude between different classes of；

In general it can select to carry out all training datas whole combinations, but the training effectiveness of model is very inefficient, Furthermore error sample can mislead the generation of model in training sample.To ensure that model can quickly converge on ternary constraint condition, The present invention carries out high-volume on-line training using small-scale data, 40 small-scale sample images is such as chosen every time, to these samples This establishes triple, this kind of method advantage is that the Sample Refreshment model parameter of each batch can be used, and prevents model Over-fitting.Creating small-scale triple is to follow following rule: (1) the selection sample of different labels is determined from small lot Number, selects least exemplar number；(2) a certain label is shuffled at random, selects i in sample and i+1 as ternary The anchor p of group_iWith positive example p_j；(3) other exemplars i is randomly choosed as triple p_iNegative example p_k；(4) circulation is whole Label and whole samples, generate containing anchor, positive example and bear exemplary random combine.Under the aid of the rule, sample ensure that Being evenly distributed for notebook data, increases randomness.

Characteristic layer: the condition of convergence of depth Hash network is that training data in the feature vector that characteristic layer exports meets ternary Group constraint condition, what which can make model extracts more expressive feature.Triple constraint is applied to spy It is exactly to be less than the Euclidean distance between the feature vector of similar sample between foreign peoples's sample that sign, which is extracted, and formula is as follows:

In formulaIndicate anchor sample,Indicate positive example sample,Indicate that negative example sample, f are the mappings by learning to obtain Sample (is mapped to feature vector from sample image) by function, and threshold indicates specific threshold for controlling positive negative sample Distance, | | | | indicate the Euclidean distance between feature vector.In formula, error when meeting inter- object distance less than between class distance It is 0, indicates to be indicated using "+" in formula there are error when being unsatisfactory for.

It is got over hour in the value of training stage threshold, loss function Loss^tripletBe easier to be intended to 0, anchor with just The distance between example will not be too close, and distance again will not be too far between anchor and negative example, but the model at this time obtained is more difficult Convergence.Make model further the distance between anchor and positive example when threshold is larger, zooms out between anchor and negative example Distance, so that the loss function Loss of model^tripletBe maintained at a biggish value, thus reasonable threshold value for The training of model seems particularly critical.Depth Hash network has used ternary loss function to be constrained in characteristic layer, that is, passes through Minimize Loss^tripl_eT carries out negative sense feedback network, and parameter obtains the feature with more expression power in regulating networks.

Hash layer: using to constraint in Hash layer as constraint condition, and what which inputted is the feature vector of characteristic layer, I.e. input is the { p to constraint_i,p_j,w_ij}；w_ijThe sample for indicating that two feature vectors represent when=1 is similar, w_ijWhen=0 The sample for indicating that two feature vectors represent is inhomogeneous.The feature vector F generated by characteristic layer_i, F_j∈R^d, it is mapped to Kazakhstan Uncommon spatially output is bi, bj ∈ { -1,1 }^m, then dist_H(b_i,b_j) Hamming space between bi and bj；Loss function is such as Under:

Wherein, m is the length of Hash codes；

Loss function can be controlled between zero and one in loss function divided by m, and it is unrelated with Hash length.If not yet Have divided by m, will cause that Hash code length is longer, and loss will be bigger, result can be made more accurate in this way.

Work as w_ijWhen=1, to Loss^pairDerivation can minimize b when doing gradient decline_iAnd b_jBetween Hamming distance, with drop Low Loss^pairValue, work as w_ijWhen=0, the Hamming distance between bi and bj will increase.Use the loss function as constraint condition When, keep Hamming distance between generic sample Hash codes generated closer, the Chinese between different classes of the generated Hash codes of sample Prescribed distance is compared farther out, is optimal by the Hash codes that this kind of method obtains.

Dist in formula_H(b_i, b_j) function is discretization, since its gradient can not lead problem, conventional method can not be passed through Stochastic gradient descent is carried out, that is, can not carry out reversely adjusting model parameter.For solve loss function can not derivation ask The advantages of topic, the feature vector of sample image pass through tanh function after by Hash layer before generating Hash codes, tanh It is to be compressed in real number value between (- 1,1), it is larger for gradient value when being worth around 0, value can be made to be distributed in -1 and 1 as far as possible Around, be conducive to the generation of Hash codes.Known by the process, last Hash codes are final by being only after slack variable b_iAnd b_j, so the value before carrying out relaxation is u_iAnd u_j, u_i, u_j∈R^m.In order to make function can during being trained It leads, variable u before relaxation is used in the calculating of loss function_iAnd u_jInstead of Hash codes b_iAnd b_j, to prevent model from training Occur the generalization ability that over-fitting improves model in journey, increases regular terms after loss function.It is used during hands-on Loss function are as follows:

α is preference norm weight in formula, and as α → 0, model is easy to appear over-fitting, as α → ∞, model meeting There is poor fitting, so suitable α value is equally most important for the training of model.

3, it is directed to query image, the Hash codes of query image are generated using trained depth Hash model；

4, it is retrieved using the Hash codes of query image and image pattern library.

Specific embodiment 2:

Described in present embodiment using the process that the Hash codes of query image and image pattern library are retrieved include with Lower step:

The hash function that depth Hash model obtains can make each sample image in sample database have unique Hash Code { h₁,h₂,…,h_m},h_i∈{0,1}.Hamming when wanting the similar image of retrieval and inquisition sample q, with image in sample database Distance calculation formula are as follows:

In formula, dist_H(h_i, h_j) it is Hamming distance, m is the length of Hash codes.It is able to know that by formula, Hash codes In each effect it is all identical, and carry out Hash codes generating process in, each Hash codes is all single feature or more The combination of a feature shows, can be ignored when being retrieved using Hamming distance.Other than it can not carry out performance characteristic, In the retrieval for carrying out image, the search result of identical Hamming distance, which can not carry out further division, keeps search result not quasi- enough Really.Therefore, the present invention is that each Hash codes assigns its specific feature weight ω_i, used when carrying out Hamming distance calculating Weighting Hamming distance is calculated, and retrieval can be made to return the result with the similarity between refined queries sample and sample database data There is higher similarity with query sample.In the present embodiment, its specific power can be assigned for each of Hash codes Value, it is assumed that in certain a kind of Sino-Kazakhstan everybody weight of code of wishing for ω={ ω₁,ω₂,ω₃,…,ω_m, then it is fixed to weight Hamming distance Justice is as follows:

The discreteness of Hamming distance is compared, weighting Hamming distance has smaller similarity measurement granularity, can be further Divide the similitude between identical Hamming distance.The method of weighting for weighting Hamming distance is numerous, and the present invention proposes that one kind is completely new and adds Power method describes in the follow-up process for each weight of Hash codes in detail to the design of weight.

If Hash codes corresponding to image are p in sample database_i={ h_{I, 1}, h_{I, 2}, h_{I, 3}..., h_{I, m}, query image is corresponding Hash codes are p_query={ h_{Query, 1}, h_{Query, 2}, h_{Query, 3}..., h_{Query, m}, then in Hamming space, the ε neighbour of query image It is expressed as

NN(p_query, ε)=p | | | p_query-p_i||₂< ε }

Hamming distance is simple and highly efficient for image retrieval, so to retain when being designed Hamming distance weight It is simple and efficient.It is that Hamming is being weighted based on ε neighbour in Hamming space for feature weight proposed by the invention Distance first passes through Hamming distance and retrieves ε neighbour sample set p before calculating, in set p the Hash codes of all samples with look into The Hamming distance ask between the Hash codes of sample is both less than ε, but Hash codes are different in the set, as shown in Figure 2.Such as What determines the weight of different bit Hash codes, first counts samples all in set p, counts each bit The probability of upper " 0 " and " 1 ", then using the mode of probability in the calculating query sample and sample database on the sample set Weight Hamming distance.

In all Hash codes generated for the set sample data, if P (" 1 ")_iIt is " 1 " for Hash codes ith bit position Probability, if P (" 0 ")_iIt is the probability of " 0 " for Hash codes ith bit position, then has following relationship:

P("1")_i+P("0")_i=1

Know that feature is assembled more obvious in sample set p by the relationship, most of Hash codes in sample are a certain It can be partial to determination on position.Such as sample cat is with " ear " when mainly being differentiated, when cat class data have ear in sample database Can show in a certain bits of coded when feature has high consistency.When carrying out weight design, which compares other positions It is more important.When carrying out each weight computing of Hash codes, weight is updated according to its " significance level ".

The calculating process of weight ω is as follows.

(2) process effect is in order to embody bit feature expressive forces.Separation for a certain bit in all Hash codes Degree is bigger, illustrates that the bit feature expressive force is stronger.Such as now with 10 length be 12 Hash codes, for all Hash First of code, has 9 ' 1 ' and 1 ' 0 ', second has 5 ' 1 ' and 5 ' 0 ', then the weight of first Hash codes is just Second can be higher than.

By the calculating process of weight ω it is found that weight, which mainly spends differentiation, has identical Hamming distance sample, fundamentally Remain the relationship between Hamming distance and similitude.It is as follows for the relationship between Hamming distance and weighting Hamming distance:

In formula,To weight Hamming distance, weighting Hamming distance is not destroying Hamming distance high efficiency On the basis of further refined divide rule, overcome to a certain extent with same distance sequence the problem of.

Embodiment

Utilize CIFAR-10 (A.Krizhevsky, G.Hinton.Learning Multiple Layers of Features from Tiny Images [J] .2012.) and NUS-WIDE (Zhang P, Zhang W, Li W J, et Al.Supervised hashing with latent factor models [M] .2014.) data set tested, guarantees Effective reliability of Experimental comparison.Experiment extracts 600 image patterns as experiment from one kind every in CIFAR-10 data set Data, wherein 500 image patterns are as training data, other 100 image patterns are as test data.Due to NUS- WIDE data set is multi-tag data set, thinks that they are similar samples if two sample images have a same label Data.In an experiment, using with other same calculation method, take it is preceding 5000 return sample average mAP as last Correlation data.By result as can be seen that FastH, CNNH, NINH conventional method that compares in conjunction with deep neural network have Better accuracy.In CNNH, the Hash codes by deep neural network for fitting compare other Hash learning methods Obtained Hash codes are suboptimums.By comparative experiments it can be seen that the depth hash method that this project proposes has preferably Experiment effect, with the increase of Hash code length, the module mAP of data is higher and higher.As shown in table 2, the present invention is mentioned Depth Hash model out compares other methods, increases to a certain extent.The traditional hash method of comparison, such as LSH, It is obvious that SH, ITQ promote effect.Compare other Hash learning methods, such as FastH, CNNH and NINH are in CIFAR-10 data It is all promoted on collection and NUS-WIDE data set, demonstrates Optimality of the depth Hash model of the present invention on Hash coding Energy.

Table 2 data set retrieval accuracy (mAP) Comparative result

As can be seen from Table 2, the depth Hash network model experimental result comparison that this project is proposed, CIFAR-10 data It concentrates promotion more obvious, promotes respectively 3.8%, 3.5%, 5.0% and 5.1% promotion in different bit Hash codes.? Different bit Hash codes promotions are 5%, 6.8%, 5.4% and 6.8% respectively in NUS-WIDE data set.Pass through comparative experiments It is found that the Hash codes in the different length of different data collection suffer from a degree of raising.

Feature extraction extracts the feature of image using ternary loss function, and the length for extracting feature in this experiment is same Sample is the key factor for influencing Hash codes and generating, and characteristic length is shorter to be easy over-fitting, longer characteristic length meeting in Hash layer Interference characteristic is extracted, the generation of Hash codes is influenced.The value of optimal characteristic layer length in order to obtain, by comparing different spies Levy influence of the length for final mAP result.The train length chosen in an experiment by with the Hash code length that ultimately generates It is associated, chooses " L ", " 2*L ", " 3*L ", " 4*L " and " 5*L " length respectively in an experiment and be compared, it is " L " therein It is the length for ultimately generating Hash codes, the broken line in comparison diagram respectively represents the Hash codes result of different length.

The line chart (Fig. 3 and Fig. 4) of two datasets is analyzed, it, can when characteristic layer length is 1 times of Hash codes Hash codes are obtained directly to pass through processing in the layer, but result is general.With the increase of characteristic layer length, when characteristic layer length When for 4 times of Hash code length, effect is preferable, and when being increased, partial data mAP, which will appear, slightly declines.So logical Crossing test is optimal characteristic layer length in this experiment.

Image retrieval is mainly carried out by CIFAR-10 data set in visualized experiment, which is single label data collection And the information that each sample image includes is less, can be more accurate represent certain a kind of feature, retrieval is returned As a result there is more intuitive display.Experimental principle is to return and the smallest TOP-K sample of sample retrieval Hash codes Hamming distance This return, from every a line, first is sample retrieval, returns to most similar 10 sample graphs between Hamming distance and sample retrieval Picture.As can be seen that the feature of depth Hash model extraction is different classes of compared with that can show from the sample image that retrieval returns, Generate Hash progress image retrieval based on depth Hash network model has preferably accurately from the perspective of objective classification Degree, but it is general from the similarity between the result and sample retrieval that subjective angle analysis returns, and retrieval image is partial to manage By upper similar sample, as shown in Figure 5.

In this experiment mainly for contrast verification depth Hash returns the result rearrangement, pass through the depth based on triple first Hash network model is spent for its corresponding Hash codes of CIFAR-10 data set generation, is then reset using based on feature weight Algorithm returns to search result.In this experiment exist a key parameter ε, in Hamming space distance be less than ε's as a result, Here be arranged ε=2, indicate with Hamming distance less than 2 in the range of using based on feature weight shuffle algorithm to return the result into Rearrangement returns.As shown, the result after being reset with visual angle analysis has more obvious phase with sample retrieval Same feature, it is more reasonable as returning the result.The algorithm have the characteristics that it is obvious be to discriminate between with Hamming distance and Hash codes not With return the result, so from return the result it can be seen that reset after returning the result with also non-rearranged result have it is identical Return the result.By comparing the TOP-K result that directly returns of Hamming distance and being returned the result after resetting, at first 10 The quantity of similar sample increases in returning the result, while increasing that is to say, accuracy rate is illustrated.Comparing result it can be found that There is better similarity on subjective vision, as shown in Figure 6.

After carrying out subjective determination, the accuracy rate of different K values in TOP-K returned the result would also vary from, By the accuracy rate of Experimental comparison's different K values, rule can be summed up.The K value result of TOP-K is smaller, and accuracy rate is got over after rearrangement Height, with gradually increasing for K value, exact value and the preceding exact value gap of rearrangement are progressively smaller until identical after rearrangement.From another party Face demonstrate based on quantization Hash shuffle algorithm can distinguish with identical Hash codes return the result it is similar to sample retrieval Degree comparison, accuracy rate variation are as shown in Figure 7.

By completing the comparative experiments of depth Hash network structure and shuffle algorithm, depth Hash proposed by the present invention is verified There is network structure preferable superiority and shuffle algorithm to have better vision for the image retrieval based on Hash codes Effect.Usually all it is to compare similitude by comparing Hamming distance mode in the research of previous hash function, works as data scale Hamming distance discrimination can be insufficient when larger, and the present invention, as indexing, it is big to further discriminate between identical Hamming distance by Hash codes The small similarity returned the result, obtains that similarity is higher to be returned the result.

Claims

1. the image data method for quickly retrieving based on Hash study, which comprises the following steps:

Step 1 establishes depth Hash model:

Step 2, training depth Hash model:

Training data is a series of data set { (p with label₁, w₁), (p₂, w₂), (p₃, w₃) ... (p_n, w_n), wherein p_i For sample image, w_iIt is the label of correspondence image sample；

Input is triple label { p_i, p_j, p_k, wherein p_iAnd p_jFor same category, p_iAnd p_kTo be different classes of, the same category it Between similarity distance be less than it is different classes of between similitude；

After obtaining trained depth Hash model, using depth Hash model foundation sample database, sample database by image pattern and Corresponding Hash codes are constituted；

2. the image data method for quickly retrieving according to claim 1 based on Hash study, which is characterized in that the benefit The process retrieved with the Hash codes of query image and image pattern library the following steps are included:

If Hash codes corresponding to image are p in sample database_i={ h_{I, 1}, h_{I, 2}, h_{I, 3}..., h_{I, m}, the corresponding Hash of query image Code is p_query={ h_{Query, 1}, h_{Query, 2}, h_{Query, 3}..., h_{Query, m}, then in Hamming space, the ε neighbour of query image is indicated For NN (p_query, ε)=p | | | p_query-p_i||₂< ε }；

Count the bit S={ S of " 0 " or " 1 " large percentage in each in all Hash codes in K-NN search sample set p₁, S₂,S₃,…,S_m},S_i∈{0,1}；

Count the probability P (S in K-NN search sample set p in all Hash codes in each bit for " 0 "₀)={ P (S_1,0),P(S_2,0),P(S_3,0),…,P(S_m,0), wherein P (S_i,0)∈[0,1]；

P(S_i,0)=∑ (S_i=0, NN (p_query,ε))/count(NN(p_query, ε)), count is to meet some in Hash codes to compare It is the quantity of " 0 " in special position；

Count the probability P (S in K-NN search sample set p in all Hash codes in each bit for " 1 "₁)={ P (S_1,1),P(S_2,1),P(S_3,1),…,P(S_m,1), wherein P (S_i,1)∈[0,1]；

P(S_i,1)=∑ (S_i=1, NN (p_query,ε))/count(NN(p_query, ε)), count is to meet some in Hash codes to compare It is the quantity of " 1 " in special position；

Pass through ω_i=1+max (P (S_i,0),P(S_i,1))/m determines everybody weight ω={ ω of Hash codes₁,ω₂,ω₃,…, ω_m}；

Then Hash codes p corresponding to image in everybody weight computing sample database of Hash codes is utilized_iKazakhstan corresponding with query image Uncommon code p_queryWeighting Hamming distanceIt is determined by weighting Hamming distance The image inquired in database and sequence of query image.

3. the image data method for quickly retrieving according to claim 2 based on Hash study, which is characterized in that the ε is close ε=2 in neighbour.

4. the image data method for quickly retrieving according to claim 3 based on Hash study, which is characterized in that the feature The Hash code length that layer length is 4 times.

5. according to claim 1,2, the 3 or 4 image data method for quickly retrieving based on Hash study, which is characterized in that The Hash layer is used to constraint as constraint condition, and what which inputted is the feature vector of characteristic layer, i.e., input is to about { the p of beam_i,p_j,w_ij}；w_ijThe sample for indicating that two feature vectors represent when=1 is similar, w_ijTwo features are indicated when=0 The sample that vector represents is inhomogeneous；It is F by the feature vector that characteristic layer generates_i, F_j∈R^d, it is mapped to defeated on hash space It is out bi, bj ∈ { -1,1 }^m, then dist_H(b_i,b_j) Hamming space between bi and bj；Loss function is as follows:

The feature vector of sample image passes through tanh function, last Hash after by Hash layer before generating Hash codes Code is by being only final b after slack variable_iAnd b_j, the value before carrying out relaxation is u_iAnd u_j, u_i, u_j∈R^m；It is damaging It loses in the calculating of function using variable u before relaxation_iAnd u_jInstead of Hash codes b_iAnd b_j, loss function are as follows:

In formula, m is the length of Hash codes, and α is preference norm weight.

6. the image data method for quickly retrieving according to claim 5 based on Hash study, which is characterized in that the training During depth Hash model using small-scale data carry out on-line training, create small-scale triple be follow it is following Rule: (1) it from small lot determines the selection sample number of different labels, selects least exemplar number；(2) by a certain label into Row is shuffled at random, selects the anchor p of i and i+1 as triple in sample_iWith positive example p_j；(3) other label samples are randomly choosed This i is used as triple p_iNegative example p_k；(4) whole labels and whole samples are recycled, generates and contains anchor, positive example and negative example Random combine.