CN110059198A - A kind of discrete Hash search method across modal data kept based on similitude - Google Patents

A kind of discrete Hash search method across modal data kept based on similitude Download PDF

Info

Publication number
CN110059198A
CN110059198A CN201910277146.3A CN201910277146A CN110059198A CN 110059198 A CN110059198 A CN 110059198A CN 201910277146 A CN201910277146 A CN 201910277146A CN 110059198 A CN110059198 A CN 110059198A
Authority
CN
China
Prior art keywords
sample
matrix
mode
indicate
hash codes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910277146.3A
Other languages
Chinese (zh)
Other versions
CN110059198B (en
Inventor
孔祥维
李明阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University of Technology
Zhejiang University ZJU
Original Assignee
Dalian University of Technology
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University of Technology, Zhejiang University ZJU filed Critical Dalian University of Technology
Priority to CN201910277146.3A priority Critical patent/CN110059198B/en
Publication of CN110059198A publication Critical patent/CN110059198A/en
Application granted granted Critical
Publication of CN110059198B publication Critical patent/CN110059198B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of discrete Hash search methods across modal data kept based on similitude.The cross-module state retrieval data set being made of the sample comprising two mode is established, training set and test set are divided into;The objective function for establishing similitude in similitude and mode between keeping mode, solve to objective function and obtains Hash codes matrix by a kind of discrete optimizing method;Learn the hash function of each mode according to Hash codes matrix;The Hash codes of all samples in training set and test set are calculated using hash function;One mould measurement integrates as query set, another mode training set is retrieved set, calculates in query set that Hamming distance, sequence are used as search result between the Hash codes of sample and the Hash codes of sample in retrieved set.The present invention can effectively keep the similitude in similitude and mode between mode, and consider the discrete feature of Hash codes, be solved using a kind of method of discrete optimization to objective function, to improve the accuracy of cross-module state retrieval.

Description

A kind of discrete Hash search method across modal data kept based on similitude
Technical field
The present invention relates to a kind of a kind of cross-module state search methods in multimedia search technology field, more particularly, to one kind The discrete Hash search method across modal data kept based on similitude.
Background technique
With the fast development of Internet information technique, the multimedia messages of various mode are in explosive growth on network. Development trend is complied with, the retrieval of cross-module state becomes a most important problem, attracted the attention of many researchers.Across Mode retrieval typical scene is exactly the query sample of a given mode, retrieves other similar mode.But due to The presence of isomery wide gap can not directly measure the similitude between different modalities.Further, since the explosive increase of data, concern The carrying cost and efficiency retrieved on a large scale are necessary.Hash method is that popular method, target exist in recent years In data to be mapped as to compact binary code.By Hash, lower memory space can be used to save data, and pass through Hamming distance measures the similitude between different modalities, and Hamming distance can quickly be counted by the xor operation of bit It calculates.
In recent years, many cross-module state hash methods have been proposed in researcher.Most of cross-module state hash method it is main Thinking is to learn hash function using training data, by the Feature Mapping in luv space to a public Hamming space, And hash function should keep the semantic dependency in original feature space.Next it simply introduces some typical and relatively new Across Modal Method.CVH is expanded by the spectrum Hash of single mode, by the distance minimization of Weight.IMH is by keeping mode Between and mode in consistency learn linear hash function.CMFH uses Harmonious Matrix Factorization, to the difference of a sample Mode learns unified Hash codes.SMFH is to be based on confederate matrix Factorization, while keeping local Geometrical consistency and mark Consistency is signed to learn unified Hash codes.By keeping mode to learn hash function based on the mixing similitude of figure.
Similitude holding is one extremely important problem of cross-module state hash method.It is similar between most of method concern mode Property, that is to say, that if an image pattern and a samples of text are semantically being mutually related, they should have Similar Hash codes.In addition, similitude is also critically important in mode.Similitude is intended to keep the local geometric knot of each mode in mode Structure.Certain methods keep similitude in mode using figure Laplce's regular terms, however only focus in belonging within k neighbour Sample, the weight of the sample in relational matrix except k neighbour are arranged to 0, such as SMFH.In this way, in original feature space Similar sample will obtain similar Hash codes, but the Hash codes of dissimilar sample are not necessarily dissimilar because they not by To limitation.In addition, Hash codes are binary codes, study binary code is a discrete optimization problems of device, which is usually NP tired Difficult problem.The strategy that most of existing cross-module state hash methods use is to loosen original discrete constraint for continuous constraint, Then the successive value of acquisition is quantified as binary code by re-optimization objective function.However, this strategy that loosens will affect accessibility Energy.
Summary of the invention
In view of the above-mentioned deficiencies in the prior art, it is an object of the present invention to provide it is a kind of based on similitude keep across modal data Discrete Hash search method.
The technical solution adopted by the present invention the following steps are included:
1) the cross-module state retrieval data set being made of the sample comprising two mode, sample are established in the database of server This two mode are respectively image modalities and text modality, and data set is divided into training set and test set;
2) objective function of similitude in similitude and mode between keeping mode is established, and passes through a kind of discrete optimizing method Objective function is solved, Hash codes matrix is obtained;
3) the Hash codes matrix succeeded in school according to step 2), learns the hash function of each mode;
4) Hash codes of all samples in training set and test set are calculated using hash function;
5) using the test set of a mode as query set, using the training set of another mode as retrieved set, according to step Rapid 4) mode obtains Hash codes, calculates the Hamming distance in query set in the Hash codes of sample and retrieved set between the Hash codes of sample From being ranked up according to the sequence of Hamming distance from small to large to sample in retrieved set, the forward sample that sorts will be by as inspection Hitch fruit.
The step 1) specifically:
Image and text are collected from webpage, and the identical piece image of corresponding meaning and a text are constituted into an image Text pair, meaning identical finger describes same thing, such as the image and a text for describing people's surfing of width people's surfing With regard to constituting an image text pair;To retrieve data set, an image text pair to building cross-module state by each image text Characteristics of image and text feature constitute a sample;The training set of cross-module state retrieval data set has n sample, each sample packet Feature containing two mode of image and text, X(1)Indicate the image modalities matrix that the feature of n image modalities is constituted,Each arrange represents the feature of the image modalities of a sample,Indicate the figure of p-th of sample As the feature of mode, the as pth of image modalities matrix is arranged,Wherein d1Indicate the dimension of the feature of image modalities Degree, R indicate set of real numbers;X(2)Indicate the text modality matrix that the feature of n text modality is constituted,Each arrange represents the feature of the text modality of a sample,Indicate p-th of sample The feature of text modality, as the pth column of text modality matrix,Wherein d2Indicate the dimension of the feature of text modality Degree;By the corresponding feature of two modeWithConstitute sample characteristics;
Y={ y1,y2,…,ynIndicate label matrix, Y ∈ { 0,1 }c×n, wherein c indicates classification sum, ypIndicate pth The label vector of sample, i.e. the pth column of label matrix, yp={ y1p,y2p,…,yip,…,ynp, yipIndicate that pth sample exists The label of i-th class classification;If p-th of sample belongs to the i-th class, the element y of the i-th row pth column in label matrix Yip=1, Otherwise yip=0.
Data set is divided into training set and test set by the present invention, is extracted feature respectively to image and text, is wrapped in training set Containing the characteristics of image and text feature for being used as training, include characteristics of image and text feature as test in test set.
The step 2) specifically includes:
2.1) for two kinds of different modalities of same sample, identical Hash codes are arrived in study, are able to maintain similar between mode Property.Similarity matrix S is first constructed according to label matrix Y cosine similarity, the element of pth row q column is S in Spq=yp· yq/(||yp||2||yq||2), wherein p and q is the ordinal number of sample, yp·yqIndicate the label vector y of p-th of samplepWith q The label vector y of a sampleqBetween inner product, | | yp||2With | | yq||2Respectively indicate the label vector y of p-th of samplepWith q The label vector y of a sampleqTwo norms;
Then, the loss function of similitude between keeping mode below is established:WhereinIt is F Square of norm, B indicate the Hash codes matrix that the Hash codes of all samples are constituted, B ∈ { -1,1 }k×n, wherein k is Hash codes Length;
2.2) it is directed to a mode, it is desirable to the local geometry that sample can be kept, i.e., in original feature space In similar sample, it is desirable to after being mapped to Hamming space, their Hash codes are also similar.Mould is kept by using figure regular terms Similitude in state establishes following holding mould for m-th of mode (m=1 indicates image modalities, and m=2 indicates text modality) The loss function of similitude in state:
Wherein, bpAnd bqIt is the pth column and q column of Hash codes matrix B, W respectively(m)It is the weight matrix of m-th of mode,It is weight matrix W(m)Pth row q column element, LmIt is the Laplacian Matrix of m-th of mode, D(m)It is m-th of mould The diagonal matrix of state,Indicate diagonal matrix D(m)Pth row q column element,Lm=D(m)-W(m);tr The mark of () representing matrix,Indicate square of 2 norms;
This method not only considers the sample close apart from a certain sample, it is also considered that the sample far apart from a certain sample, on State weight matrix W(m)In elementSpecifically, more different Hash codes can be obtained in this way:
Wherein, e is natural constant,Indicate in m-th of mode with sample characteristicsApart from nearest k1A sample The set that feature is constituted,Indicate in m-th of mode with sample characteristicsApart from farthest k2A sample characteristics are constituted Set, μ be tradeoff parameter, the value of σ takes maximum
GatheringIn, with sample characteristicsDistance is closer, and weight is arranged bigger;GatheringIn, With sample characteristicsDistance is remoter, and the absolute value of weight is arranged bigger.Weight is set according to above-mentioned formula, can both make phase As Hash codes distance after sample mapping it is close, and the Hash codes distance after dissimilar sample mapping can be made remote.
2.3) loss function of similitude between keeping mode is combinedWith the loss function for keeping similitude in modeEstablish the overall goal function of following study Hash codes are as follows:
s.t.B∈{-1,1}k×n
Wherein, α indicate keep mode between similitude loss function tradeoff parameter, β1Indicate the mould of holding image modalities The tradeoff parameter of the loss function of similitude, β in state2Indicate the power of the loss function of similitude in the mode of holding text modality Weigh parameter, T representing matrix transposition;
2.4) due to the presence of Hash codes discrete constraint, solution procedure 2.1) objective function be a np problem, use A kind of discrete optimizing method carries out solving overall goal function, specifically:
2.4.1) random initializtion Hash codes matrix B(0)∈{-1,1}k×n, B(0)Indicate initial Hash codes matrix B;Breathe out The uncommon initial random generation of code matrix B, element therein are selected as -1 or 1.
2.4.2 solution) is iterated using following procedure:
First seek overall goal functionGradient:
Then iterative processing, the discrete Hash codes matrix B obtained using following formula according to iteration j(j)It handles To the Hash codes matrix B of (j+1) secondary iteration(j+1):
Wherein, λ is learning rate;B(j)Indicate the Hash codes matrix that iteration j obtains, B(j+1)Indicate that (j+1) is secondary repeatedly The Hash codes matrix B that generation obtains(j+1)
Hash codes matrix is updated according to above-mentioned iterative formula and completes optimization process, obtains optimal Hash codes matrix B.
In the step 3), hash function uses simple Linear Mapping h1(x(1))=sign (P1 Tx(1)),Learning hash function is to learn two mapping matrix P1And P2, wherein P1Indicate image modalities Mapping matrix, P2Indicate the mapping matrix of text modality, x(1)Indicate the feature of image modalities in sample, x(2)Indicate sample Chinese The feature of this mode;
It solves following formula and obtains mapping matrix P1And P2:
Wherein,Indicate that the loss function of study mapping matrix, γ are tradeoff parameters;
Pass through orderIt solves to calculate and obtains P1: P1=(X(1)X(1)T+γI)-1X(1)BT, I expression unit matrix;
Pass through orderIt solves to calculate and obtains P2: P2=(X(2)X(2)T+γI)-1X(2)BT
In the step 4), the Hash codes formula h of training set and test set image modalities1(x(1))=sign (P1 Tx(1)) It calculates, wherein x(1)Indicate the feature of the image modalities of sample, h1(x(1)) indicate by the sample image mode feature x(1)Meter The Hash codes of calculating;The Hash codes formula of training set and test set text modalityIt calculates, wherein x(2)Indicate the feature of sample text mode, h2(x(2)) indicate by the sample text mode feature x(2)Calculated Hash codes.
Present invention reserved mapping matrix P after step 2) and step 3) training1,P2And it abandons step 2) and succeeds in school Training set Hash codes.
The beneficial effects of the present invention are:
Similitude in similitude and mode between the mode that the present invention can effectively be kept simultaneously, and it is not concerned only with similar sample This Hash codes, have also paid close attention to the Hash codes of dissimilar sample, the method for the present invention makes sample dissimilar in original feature space This obtains dissimilar Hash codes after being mapped to Hamming space, and similar sample also obtains similar Hash codes, and Hash codes are more Having any different property solves the problems, such as to learn the data retrieval of discrete Hash codes.
The present invention considers the discrete feature of Hash codes, is asked using a kind of method of discrete optimization objective function Solution, to improve the accuracy of cross-module state data retrieval.
Detailed description of the invention
Fig. 1 is the implementation steps of the invention flow chart.
Fig. 2 is an example schematic of the image retrieval text on cross-module state data set Wiki.
Fig. 3 is an example schematic of the text retrieval image on cross-module state data set Wiki.
Specific embodiment
The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments.
As shown in Figure 1, specific embodiments of the present invention situation is as follows:
Specific implementation is to be described further with flow chart and cross-module state data set Wiki to technical solution of the present invention;Its In, cross-module state data set Wiki derives from wikipedia, includes 2866 images collected from wikipedia article Text pair.Image with 128 dimension SIFT feature indicate, text with 10 tie up LDA character representation.The image text of Wiki data set This is to being divided into 10 semantic classes, each image text is to belonging to one kind therein.It randomly chooses 2173 samples and constitutes instruction Practice collection, remaining 693 sample constitutes test set.
1) the cross-module state retrieval data set being made of the sample comprising two mode, sample are established in the database of server This two mode are respectively image modalities and text modality, and data set is divided into training set and test set.
Image and text are collected from webpage, and the identical piece image of corresponding meaning and a text are constituted into an image Text pair, meaning identical finger describes same thing, such as the image and a text for describing people's surfing of width people's surfing With regard to constituting an image text pair;To retrieve data set, an image text pair to building cross-module state by each image text Characteristics of image and text feature constitute a sample.
The training set of cross-module state retrieval data set has n sample, and each sample includes the spy of two mode of image and text Sign, X(1)Indicate the image modalities matrix that the feature of n image modalities is constituted, Indicate pth The feature of the image modalities of a sample,Wherein d1Indicate the dimension of the feature of image modalities, R indicates set of real numbers; X(2)Indicate the text modality matrix that the feature of n text modality is constituted, It indicates p-th The feature of the text modality of sample,Wherein d2Indicate the dimension of the feature of text modality;It is corresponding by two mode FeatureWithConstitute sample characteristics;Y={ y1,y2,…,ynIndicate label matrix, Y ∈ { 0,1 }c×n, wherein c table Show classification sum, ypIndicate the label vector of p-th of sample, yp={ y1p,y2p,…,yip,…,ynp, yipIndicate p-th of sample In the label of the i-th class classification;If p-th of sample belongs to the i-th class, the element y of the i-th row pth column in label matrix Yip= 1, otherwise yip=0.
2) objective function of similitude in similitude and mode between keeping mode is established, and passes through a kind of discrete optimizing method Objective function is solved, Hash codes matrix is obtained, learns Hash codes for training set.
2.1) similarity matrix S is first constructed according to label matrix Y cosine similarity, the element of pth row q column is in S Spq=yp·yq/(||yp||2||yq||2), wherein p and q is the ordinal number of sample, yp·yqIndicate the label of p-th of sample to Measure ypWith the label vector y of q-th of sampleqBetween inner product, | | yp||2With | | yq||2Respectively indicate the label of p-th of sample to Measure ypWith the label vector y of q-th of sampleqTwo norms;
Then, the loss function of similitude between keeping mode below is established:WhereinIt is F Square of norm, B indicate the Hash codes matrix that the Hash codes of all samples are constituted, B ∈ { -1,1 }k×n, wherein k is Hash codes Length;
2.2) similitude in mode is kept by using figure regular terms, (m=1 indicates image mould for m-th of mode State, m=2 indicate text modality), establish the following loss function for keeping similitude in mode:
Above-mentioned weight matrix W(m)In elementSpecifically:
2.3) loss function of similitude between keeping mode is combinedWith the loss function for keeping similitude in modeEstablish the overall goal function of following study Hash codes are as follows:
s.t.B∈{-1,1}k×n
2.4) it carries out solving overall goal function using a kind of discrete optimizing method, specifically:
2.4.1) random initializtion Hash codes matrix B(0)∈{-1,1}k×n, B(0)Indicate initial Hash codes matrix B;Breathe out The uncommon initial random generation of code matrix B, element therein are selected as -1 or 1.
2.4.2 solution) is iterated using following procedure:
First seek overall goal functionGradient:
Then iterative processing, the discrete Hash codes matrix B obtained using following formula according to iteration j(j)It handles To the Hash codes matrix B of (j+1) secondary iteration(j+1):
Wherein, λ is learning rate;B(j)Indicate the Hash codes matrix that iteration j obtains, B(j+1)Indicate that (j+1) is secondary repeatedly The Hash codes matrix B that generation obtains(j+1)
Hash codes matrix is updated according to above-mentioned iterative formula and completes optimization process, obtains optimal Hash codes matrix B.
3) the Hash codes matrix succeeded in school according to step 2), learns the hash function of each mode.
Hash function uses simple Linear Mapping h1(x(1))=sign (P1 Tx(1)),It learns Practising hash function is to learn two mapping matrix P1And P2, wherein P1Indicate the mapping matrix of image modalities, P2Indicate text mould The mapping matrix of state, x(1)Indicate the feature of image modalities in a certain sample, x(2)Indicate the feature of text modality in a certain sample;
It solves following formula and obtains mapping matrix P1And P2:
Pass through orderIt solves to calculate and obtains P1: P1=(X(1)X(1)T+γI)-1X(1)BT
Pass through orderIt solves to calculate and obtains P2: P2=(X(2)X(2)T+γI)-1X(2)BT
4) Hash codes of all samples in training set and test set are calculated using hash function.
The Hash codes formula h of training set and test set image modalities1(x(1))=sign (P1 Tx(1)) calculate, wherein x(1) Indicate the feature of the image modalities of sample, h1(x(1)) indicate by the sample image mode feature x(1)Calculated Hash codes; The Hash codes formula of training set and test set text modalityIt calculates, wherein x(2)Indicate sample text The feature of this mode, h2(x(2)) indicate by the sample text mode feature x(2)Calculated Hash codes.
5) using the test set of a mode as query set, using the training set of another mode as retrieved set, according to step Rapid 4) mode obtains Hash codes, in the Hamming in the Hash codes of sample in calculating query set and retrieved set between the Hash codes of sample Distance is ranked up sample in retrieved set according to the sequence of Hamming distance from small to large, and the forward sample that sorts will be by conduct Search result.
The present embodiment is used as evaluation criterion using mAP (mean Average Precision), and mAP value is bigger, the side of explanation The cross-module state retrieval performance of method is better.With CMFH (referring to document Tang J, Wang K, Shao on cross-module state data set Wiki L.Supervised matrix factorization hashing for cross-modal retrieval[J].IEEE Transactions on Image Processing,2016,25(7):3157-3166)、 SMFH(Ding G,Guo Y, Zhou J.Collective matrix factorization hashing for multimodal data[C] .Proceedings of the IEEE conference on computer vision and pattern recognition.2014:2075-2082.)、FSH(Liu H,Ji R,Wu Y,et al. Cross-modality binary code learning via fusion similarity hashing[C].Proceedings of CVPR.2017:6345- 6353.) three kinds of cross-module state hash methods are compared, and preceding 100 samples are returned when Hash code length is 16 bit MAP value is as shown in table 1.
MAP value on 1 Wiki data set of table
Method Image retrieval text Text retrieval image
CMFH 0.2295 0.3479
SMFH 0.2411 0.3658
FSH 0.2408 0.3871
The present invention 0.2455 0.4086
As it can be seen from table 1 the method for the present invention achieves highest mAP value, cross-module state compared with three kinds of control methods Retrieval performance is best.
Fig. 2 gives an example of an image retrieval text on cross-module state data set Wiki, and return is sequence In preceding 6 text, affiliated semantic classes is given above image and text.Query image belongs to geography class, solid line Frame indicates the text retrieved and query image belongs to same semantic classes, and dotted line frame indicates the text retrieved and query image It is not belonging to same semantic classes.As can be seen that method provided by the invention is better than to analogy from the search result of this example Method.
Fig. 3 gives an example of a text retrieval image on cross-module state data set Wiki, and return is sequence In preceding 6 image, affiliated semantic classes is given above image and text.Query text belongs to literature class, figure The rimless image for indicating to retrieve and query text belong to same semantic classes outside piece, have dotted line frame expression to retrieve outside picture Image and query text are not belonging to same semantic classes.It is from the search result of this example as can be seen that provided by the invention Method is better than control methods.
In conclusion the method for the present invention can effectively keep the similitude in similitude and mode between mode, and not The Hash codes of similar sample have been concerned only with, the Hash codes of dissimilar sample have also been paid close attention to, have been conducive to the more different Kazakhstan of study Uncommon code, and solve the problems, such as to learn discrete Hash codes using a kind of discrete optimizing method, to improve the retrieval of cross-module state Accuracy.

Claims (5)

1. a kind of discrete Hash search method across modal data kept based on similitude, it is characterised in that: method includes such as Lower step:
1) the cross-module state retrieval data set being made of the sample comprising two mode is established in the database of server, sample Two mode are respectively image modalities and text modality, and data set is divided into training set and test set;
2) objective function of similitude in similitude and mode between keeping mode is established, and by a kind of discrete optimizing method to mesh Scalar functions are solved, and Hash codes matrix is obtained;
3) the Hash codes matrix succeeded in school according to step 2), learns the hash function of each mode;
4) Hash codes of all samples in training set and test set are calculated using hash function;
5) using the test set of a mode as query set, using the training set of another mode as retrieved set, according to step 4) Mode obtains Hash codes, calculates the Hamming distance in query set in the Hash codes of sample and retrieved set between the Hash codes of sample, Sample in retrieved set is ranked up according to the sequence of Hamming distance from small to large, the forward sample that sorts will be tied as retrieval Fruit.
2. a kind of discrete Hash search method across modal data kept based on similitude according to claim 1, It is characterized in that: the step 1) specifically:
Image and text are collected from webpage, and the identical piece image of corresponding meaning and a text are constituted into an image text Right, to retrieve data set to building cross-module state by each image text, the characteristics of image and text of image text pair are special Sign constitutes a sample;The training set of cross-module state retrieval data set has n sample, and each sample includes two moulds of image and text The feature of state, X(1)Indicate the image modalities matrix that the feature of n image modalities is constituted, Table Show the feature of the image modalities of p-th of sample,Wherein d1Indicate the dimension of the feature of image modalities, R indicates real Manifold;X(2)Indicate the text modality matrix that the feature of n text modality is constituted, It indicates The feature of the text modality of p-th of sample,Wherein d2Indicate the dimension of the feature of text modality;By two moulds The corresponding feature of stateWithConstitute sample characteristics;Y={ y1,y2,…,ynIndicate label matrix, Y ∈ { 0,1 }c×n, Middle c indicates classification sum, ypIndicate the label vector of p-th of sample, yp={ y1p,y2p,…,yip,…,ynp, yipIndicate pth Label of a sample in the i-th class classification.
3. a kind of discrete Hash search method across modal data kept based on similitude according to claim 1, Be characterized in that: the step 2) specifically includes:
2.1) similarity matrix S is first constructed according to label matrix Y cosine similarity, the element of pth row q column is S in Spq= yp·yq/(||yp||2||yq||2), wherein p and q is the ordinal number of sample, yp·yqIndicate the label vector y of p-th of samplep With the label vector y of q-th of sampleqBetween inner product, | | yp||2With | | yq||2Respectively indicate the label vector y of p-th of samplep With the label vector y of q-th of sampleqTwo norms;
Then, the loss function of similitude between keeping mode below is established:WhereinIt is F norm Square, B indicates the Hash codes matrix that the Hash codes of all samples are constituted, B ∈ { -1,1 }k×n, wherein k is the length of Hash codes;
2.2) for m-th of mode (m=1 indicates image modalities, and m=2 indicates text modality), foundation is following to keep phase in mode Like the loss function of property:
Wherein, bpAnd bqIt is the pth column and q column of Hash codes matrix B, W respectively(m)It is the weight matrix of m-th of mode,It is Weight matrix W(m)Pth row q column element, LmIt is the Laplacian Matrix of m-th of mode, D(m)It is pair of m-th of mode Angle battle array,Indicate diagonal matrix D(m)Pth row q column element,Lm=D(m)-W(m);Tr () table Show the mark of matrix,Indicate square of 2 norms;
Above-mentioned weight matrix W(m)In elementSpecifically:
Wherein, e is natural constant,Indicate in m-th of mode with sample characteristicsApart from nearest k1A sample is special The set constituted is levied,Indicate in m-th of mode with sample characteristicsApart from farthest k2What a sample characteristics were constituted Set, μ are tradeoff parameters, and the value of σ takes maximum
2.3) loss function of similitude between keeping mode is combinedWith the loss function for keeping similitude in modeEstablish the overall goal function of following study Hash codes are as follows:
s.t.B∈{-1,1}k×n
Wherein, α indicate keep mode between similitude loss function tradeoff parameter, β1It indicates to keep in the mode of image modalities The tradeoff parameter of the loss function of similitude, β2Indicate to keep the tradeoff of the loss function of similitude in the mode of text modality to join Number, T representing matrix transposition;
2.4) it carries out solving overall goal function using a kind of discrete optimizing method, specifically:
2.4.1) random initializtion Hash codes matrix B(0)∈{-1,1}k×n, B(0)Indicate initial Hash codes matrix B;
2.4.2 solution) is iterated using following procedure:
First seek overall goal functionGradient:
Then iterative processing, the discrete Hash codes matrix B obtained using following formula according to iteration j(j)Processing obtains the (j+1) the Hash codes matrix B of secondary iteration(j+1):
Wherein, λ is learning rate;B(j)Indicate the Hash codes matrix that iteration j obtains, B(j+1)Indicate that (j+1) secondary iteration obtains Hash codes matrix B(j+1)
Hash codes matrix is updated according to above-mentioned iterative formula and completes optimization process, obtains optimal Hash codes matrix B.
4. a kind of discrete Hash search method across modal data kept based on similitude according to claim 1, Be characterized in that: in the step 3), hash function uses simple Linear Mapping h1(x(1))=sign (P1 Tx(1)), h2(x(2)) =sign (P2 Tx(2)), study hash function is to learn two mapping matrix P1And P2, wherein P1Indicate the mapping of image modalities Matrix, P2Indicate the mapping matrix of text modality, x(1)Indicate the feature of image modalities in sample, x(2)Indicate text mould in sample The feature of state;
It solves following formula and obtains mapping matrix P1And P2:
Wherein,Indicate that the loss function of study mapping matrix, γ are tradeoff parameters;
Pass through orderIt solves to calculate and obtains P1: P1=(X(1)X(1)T+γI)-1X(1)BT, I expression unit matrix;
Pass through orderIt solves to calculate and obtains P2: P2=(X(2)X(2)T+γI)-1X(2)BT
5. a kind of discrete Hash search method across modal data kept based on similitude according to claim 1, It is characterized in that: in the step 4), the Hash codes formula h of training set and test set image modalities1(x(1))=sign (P1 Tx(1)) calculate, wherein x(1)Indicate the feature of the image modalities of sample, h1(x(1)) indicate by the sample image mode feature x(1) Calculated Hash codes;The Hash codes formula of training set and test set text modalityIt calculates, Middle x(2)Indicate the feature of sample text mode, h2(x(2)) indicate by the sample text mode feature x(2)Calculated Hash Code.
CN201910277146.3A 2019-04-08 2019-04-08 Discrete hash retrieval method of cross-modal data based on similarity maintenance Active CN110059198B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910277146.3A CN110059198B (en) 2019-04-08 2019-04-08 Discrete hash retrieval method of cross-modal data based on similarity maintenance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910277146.3A CN110059198B (en) 2019-04-08 2019-04-08 Discrete hash retrieval method of cross-modal data based on similarity maintenance

Publications (2)

Publication Number Publication Date
CN110059198A true CN110059198A (en) 2019-07-26
CN110059198B CN110059198B (en) 2021-04-13

Family

ID=67318463

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910277146.3A Active CN110059198B (en) 2019-04-08 2019-04-08 Discrete hash retrieval method of cross-modal data based on similarity maintenance

Country Status (1)

Country Link
CN (1) CN110059198B (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059154A (en) * 2019-04-10 2019-07-26 山东师范大学 It is a kind of that Hash search method is migrated based on the cross-module state for inheriting mapping
CN110569387A (en) * 2019-08-20 2019-12-13 清华大学 radar-image cross-modal retrieval method based on depth hash algorithm
CN110609914A (en) * 2019-08-06 2019-12-24 厦门大学 Online Hash learning image retrieval method based on rapid category updating
CN110674323A (en) * 2019-09-02 2020-01-10 山东师范大学 Unsupervised cross-modal Hash retrieval method and system based on virtual label regression
CN110704659A (en) * 2019-09-30 2020-01-17 腾讯科技(深圳)有限公司 Image list sorting method and device, storage medium and electronic device
CN111078952A (en) * 2019-11-20 2020-04-28 重庆邮电大学 Cross-modal variable-length Hash retrieval method based on hierarchical structure
CN111125457A (en) * 2019-12-13 2020-05-08 山东浪潮人工智能研究院有限公司 Deep cross-modal Hash retrieval method and device
CN111209415A (en) * 2020-01-10 2020-05-29 重庆邮电大学 Image-text cross-modal Hash retrieval method based on mass training
CN111522903A (en) * 2020-04-01 2020-08-11 济南浪潮高新科技投资发展有限公司 Deep hash retrieval method, equipment and medium
CN111639197A (en) * 2020-05-28 2020-09-08 山东大学 Cross-modal multimedia data retrieval method and system with label embedded online hash
CN112199531A (en) * 2020-11-05 2021-01-08 广州杰赛科技股份有限公司 Cross-modal retrieval method and device based on Hash algorithm and neighborhood map
CN112559810A (en) * 2020-12-23 2021-03-26 上海大学 Method and device for generating hash code by utilizing multi-layer feature fusion
CN112732976A (en) * 2021-01-13 2021-04-30 天津大学 Short video multi-label rapid classification method based on deep hash coding
CN112925962A (en) * 2021-01-20 2021-06-08 同济大学 Hash coding-based cross-modal data retrieval method, system, device and medium
CN113434671A (en) * 2021-06-23 2021-09-24 平安国际智慧城市科技股份有限公司 Data processing method and device, computer equipment and storage medium
CN115080801A (en) * 2022-07-22 2022-09-20 山东大学 Cross-modal retrieval method and system based on federal learning and data binary representation
CN115374165A (en) * 2022-10-24 2022-11-22 山东建筑大学 Data retrieval method, system and equipment based on triple matrix decomposition
CN115687571A (en) * 2022-10-28 2023-02-03 重庆师范大学 Depth unsupervised cross-modal retrieval method based on modal fusion reconstruction hash
CN116244484A (en) * 2023-05-11 2023-06-09 山东大学 Federal cross-modal retrieval method and system for unbalanced data
WO2024045866A1 (en) * 2022-08-31 2024-03-07 Huawei Technologies Co., Ltd. System and method for cross-modal interaction based on pre-trained model
CN113434671B (en) * 2021-06-23 2024-06-07 平安国际智慧城市科技股份有限公司 Data processing method, device, computer equipment and storage medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346440A (en) * 2014-10-10 2015-02-11 浙江大学 Neural-network-based cross-media Hash indexing method
CN106777318A (en) * 2017-01-05 2017-05-31 西安电子科技大学 Matrix decomposition cross-module state Hash search method based on coorinated training
CN107256271A (en) * 2017-06-27 2017-10-17 鲁东大学 Cross-module state Hash search method based on mapping dictionary learning
CN107273505A (en) * 2017-06-20 2017-10-20 西安电子科技大学 Supervision cross-module state Hash search method based on nonparametric Bayes model
CN107402993A (en) * 2017-07-17 2017-11-28 山东师范大学 The cross-module state search method for maximizing Hash is associated based on identification
CN108170755A (en) * 2017-12-22 2018-06-15 西安电子科技大学 Cross-module state Hash search method based on triple depth network
CN108334574A (en) * 2018-01-23 2018-07-27 南京邮电大学 A kind of cross-module state search method decomposed based on Harmonious Matrix
CN108510559A (en) * 2017-07-19 2018-09-07 哈尔滨工业大学深圳研究生院 It is a kind of based on have supervision various visual angles discretization multimedia binary-coding method
CN108595688A (en) * 2018-05-08 2018-09-28 鲁东大学 Across the media Hash search methods of potential applications based on on-line study
CN109299216A (en) * 2018-10-29 2019-02-01 山东师范大学 A kind of cross-module state Hash search method and system merging supervision message

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346440A (en) * 2014-10-10 2015-02-11 浙江大学 Neural-network-based cross-media Hash indexing method
CN106777318A (en) * 2017-01-05 2017-05-31 西安电子科技大学 Matrix decomposition cross-module state Hash search method based on coorinated training
CN107273505A (en) * 2017-06-20 2017-10-20 西安电子科技大学 Supervision cross-module state Hash search method based on nonparametric Bayes model
CN107256271A (en) * 2017-06-27 2017-10-17 鲁东大学 Cross-module state Hash search method based on mapping dictionary learning
CN107402993A (en) * 2017-07-17 2017-11-28 山东师范大学 The cross-module state search method for maximizing Hash is associated based on identification
CN108510559A (en) * 2017-07-19 2018-09-07 哈尔滨工业大学深圳研究生院 It is a kind of based on have supervision various visual angles discretization multimedia binary-coding method
CN108170755A (en) * 2017-12-22 2018-06-15 西安电子科技大学 Cross-module state Hash search method based on triple depth network
CN108334574A (en) * 2018-01-23 2018-07-27 南京邮电大学 A kind of cross-module state search method decomposed based on Harmonious Matrix
CN108595688A (en) * 2018-05-08 2018-09-28 鲁东大学 Across the media Hash search methods of potential applications based on on-line study
CN109299216A (en) * 2018-10-29 2019-02-01 山东师范大学 A kind of cross-module state Hash search method and system merging supervision message

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
LI MINGYANG: ""Discrete similarity preserving hashing for cross-modal retrival"", 《ICAIS2019:ARTIFICAL INTELLIGENCE AND SECURITY》 *

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059154A (en) * 2019-04-10 2019-07-26 山东师范大学 It is a kind of that Hash search method is migrated based on the cross-module state for inheriting mapping
CN110059154B (en) * 2019-04-10 2022-04-15 山东师范大学 Cross-modal migration hash retrieval method based on inheritance mapping
CN110609914A (en) * 2019-08-06 2019-12-24 厦门大学 Online Hash learning image retrieval method based on rapid category updating
CN110609914B (en) * 2019-08-06 2021-08-17 厦门大学 Online Hash learning image retrieval method based on rapid category updating
CN110569387A (en) * 2019-08-20 2019-12-13 清华大学 radar-image cross-modal retrieval method based on depth hash algorithm
CN110569387B (en) * 2019-08-20 2020-12-11 清华大学 Radar-image cross-modal retrieval method based on depth hash algorithm
CN110674323B (en) * 2019-09-02 2020-06-30 山东师范大学 Unsupervised cross-modal Hash retrieval method and system based on virtual label regression
CN110674323A (en) * 2019-09-02 2020-01-10 山东师范大学 Unsupervised cross-modal Hash retrieval method and system based on virtual label regression
CN110704659A (en) * 2019-09-30 2020-01-17 腾讯科技(深圳)有限公司 Image list sorting method and device, storage medium and electronic device
CN110704659B (en) * 2019-09-30 2023-09-26 腾讯科技(深圳)有限公司 Image list ordering method and device, storage medium and electronic device
CN111078952B (en) * 2019-11-20 2023-07-21 重庆邮电大学 Cross-modal variable-length hash retrieval method based on hierarchical structure
CN111078952A (en) * 2019-11-20 2020-04-28 重庆邮电大学 Cross-modal variable-length Hash retrieval method based on hierarchical structure
CN111125457A (en) * 2019-12-13 2020-05-08 山东浪潮人工智能研究院有限公司 Deep cross-modal Hash retrieval method and device
CN111209415A (en) * 2020-01-10 2020-05-29 重庆邮电大学 Image-text cross-modal Hash retrieval method based on mass training
CN111209415B (en) * 2020-01-10 2022-09-23 重庆邮电大学 Image-text cross-modal Hash retrieval method based on mass training
CN111522903A (en) * 2020-04-01 2020-08-11 济南浪潮高新科技投资发展有限公司 Deep hash retrieval method, equipment and medium
CN111639197A (en) * 2020-05-28 2020-09-08 山东大学 Cross-modal multimedia data retrieval method and system with label embedded online hash
CN112199531A (en) * 2020-11-05 2021-01-08 广州杰赛科技股份有限公司 Cross-modal retrieval method and device based on Hash algorithm and neighborhood map
CN112199531B (en) * 2020-11-05 2024-05-17 广州杰赛科技股份有限公司 Cross-modal retrieval method and device based on hash algorithm and neighborhood graph
CN112559810A (en) * 2020-12-23 2021-03-26 上海大学 Method and device for generating hash code by utilizing multi-layer feature fusion
CN112732976A (en) * 2021-01-13 2021-04-30 天津大学 Short video multi-label rapid classification method based on deep hash coding
CN112925962A (en) * 2021-01-20 2021-06-08 同济大学 Hash coding-based cross-modal data retrieval method, system, device and medium
CN112925962B (en) * 2021-01-20 2022-09-27 同济大学 Hash coding-based cross-modal data retrieval method, system, device and medium
CN113434671A (en) * 2021-06-23 2021-09-24 平安国际智慧城市科技股份有限公司 Data processing method and device, computer equipment and storage medium
CN113434671B (en) * 2021-06-23 2024-06-07 平安国际智慧城市科技股份有限公司 Data processing method, device, computer equipment and storage medium
CN115080801A (en) * 2022-07-22 2022-09-20 山东大学 Cross-modal retrieval method and system based on federal learning and data binary representation
WO2024045866A1 (en) * 2022-08-31 2024-03-07 Huawei Technologies Co., Ltd. System and method for cross-modal interaction based on pre-trained model
CN115374165B (en) * 2022-10-24 2023-03-24 山东建筑大学 Data retrieval method, system and equipment based on triple matrix decomposition
CN115374165A (en) * 2022-10-24 2022-11-22 山东建筑大学 Data retrieval method, system and equipment based on triple matrix decomposition
CN115687571B (en) * 2022-10-28 2024-01-26 重庆师范大学 Depth unsupervised cross-modal retrieval method based on modal fusion reconstruction hash
CN115687571A (en) * 2022-10-28 2023-02-03 重庆师范大学 Depth unsupervised cross-modal retrieval method based on modal fusion reconstruction hash
CN116244484A (en) * 2023-05-11 2023-06-09 山东大学 Federal cross-modal retrieval method and system for unbalanced data
CN116244484B (en) * 2023-05-11 2023-08-08 山东大学 Federal cross-modal retrieval method and system for unbalanced data

Also Published As

Publication number Publication date
CN110059198B (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN110059198A (en) A kind of discrete Hash search method across modal data kept based on similitude
Li et al. Recent developments of content-based image retrieval (CBIR)
Wang et al. Richpedia: a large-scale, comprehensive multi-modal knowledge graph
CN108132968B (en) Weak supervision learning method for associated semantic elements in web texts and images
CN106095893B (en) A kind of cross-media retrieval method
Wang et al. Learning common and specific features for RGB-D semantic segmentation with deconvolutional networks
Li et al. Region-wise deep feature representation for remote sensing images
Huang et al. Identifying disaster related social media for rapid response: a visual-textual fused CNN architecture
CN109299341A (en) One kind confrontation cross-module state search method dictionary-based learning and system
CN110674407A (en) Hybrid recommendation method based on graph convolution neural network
Xie et al. Fast and accurate near-duplicate image search with affinity propagation on the ImageWeb
Tang et al. Large-scale remote sensing image retrieval based on semi-supervised adversarial hashing
Zhang et al. Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval.
Li et al. Online hashing for scalable remote sensing image retrieval
Xu et al. Instance-level coupled subspace learning for fine-grained sketch-based image retrieval
Abdul-Rashid et al. Shrec’18 track: 2d image-based 3d scene retrieval
CN109284414A (en) The cross-module state content search method and system kept based on semanteme
Cheng et al. Deep attentional fine-grained similarity network with adversarial learning for cross-modal retrieval
Guo Intelligent sports video classification based on deep neural network (DNN) algorithm and transfer learning
Zhu et al. Real-time data filling and automatic retrieval algorithm of road traffic based on deep-learning method
Deng Content-based image collection summarization and comparison using self-organizing maps
Benhabiles et al. Convolutional neural network for pottery retrieval
Ma et al. Dual modality collaborative learning for cross-source remote sensing retrieval
Moumtzidou et al. Verge in vbs 2017
Feng et al. Scene semantic recognition based on probability topic model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant