CN103559191A - Cross-media sorting method based on hidden space learning and two-way sorting learning - Google Patents

Cross-media sorting method based on hidden space learning and two-way sorting learning Download PDF

Info

Publication number
CN103559191A
CN103559191A CN201310410565.2A CN201310410565A CN103559191A CN 103559191 A CN103559191 A CN 103559191A CN 201310410565 A CN201310410565 A CN 201310410565A CN 103559191 A CN103559191 A CN 103559191A
Authority
CN
China
Prior art keywords
image
text
media
retrieval
query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310410565.2A
Other languages
Chinese (zh)
Other versions
CN103559191B (en
Inventor
吴飞
汤斯亮
卢鑫炎
邵健
庄越挺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201310410565.2A priority Critical patent/CN103559191B/en
Publication of CN103559191A publication Critical patent/CN103559191A/en
Application granted granted Critical
Publication of CN103559191B publication Critical patent/CN103559191B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data

Abstract

The invention discloses a cross-media sorting method based on hidden space learning and two-way sorting learning. The method includes 1, centrally constructing sorting samples of text retrieval images and sorting samples of image retrieval texts into a training sample; 2, performing cross-media sorting learning based on the hidden space learning and the two-way sorting learning on the constructed training sample, and acquiring a multimedia semantic space and a cross-media sorting model; 3, using the learned cross-media sorting model to performing cross-media sorting. The method can be applied in the text retrieval images and the image retrieval texts, and modeling is performed on two retrieval directions simultaneously, the acquired semantic understanding capacity of a retrieval model is stronger, and retrieval accuracy is higher as compared with the method considering one-way sorting learning only.

Description

Based on the study of hidden space learning and Bidirectional sort across media sort method
Technical field
The present invention design is across media retrieval, relate in particular to a kind of based on hidden space learning and Bidirectional sort study across media sort method.
Background technology
Image is current very common file type, and it has certain semanteme.In general, image is comprised of pixel one by one, and computing machine can not directly be understood the semantic information that image contains.Along with the development of multimedia technology and network technology, increasing image emerges.Retrieval technique can help user's fast finding in the data of magnanimity to own interested content, to become field more and more important in Computer Applied Technology.No matter traditional retrieval technique, be retrieval or the content-based retrieval based on keyword, all can not meet well user and wish the demand with text retrieval image or image retrieval text.In searching system based on keyword, need in advance image to be marked.But because the amount of images existing is at present huge, therefore mark process engineering amount vast and numerous, and due to the impact of the marked content person's subjective factor that inevitably can be subject to mark, for same image, different mark persons may mark different keywords, so keyword often can not objectively respond whole semantemes that image contains.Content-based retrieval system does not need image to mark, user submits to a retrieval sample to retrieve image, but there are two weakness in traditional content-based retrieval technology: the one, and user can only retrieve and the media object of inquiring about the identical mode of example, can only pass through image retrieval image; The 2nd, the low-level image feature of image and high-level semantic exist semantic gap so retrieval performance to be restricted.In order to cross over the semantic gap between different modalities data, understand better semantic information of multimedia, simultaneously in order to meet user across the demand of Media Inquiries, seek a kind of quite meaningful across media sort method based on semanteme.
Summary of the invention
The object of the invention is to overcome the deficiencies in the prior art, provide a kind of based on the study of hidden space learning and Bidirectional sort across media sort method.
Based on the study of hidden space learning and Bidirectional sort across media sort method, comprise the steps:
1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;
2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;
3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
Described step 1) comprising:
1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R m, wherein m is the dimension in text space;
2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R n, the dimension that wherein n is image space;
3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple wherein N is training sample number, t ifor retrieval text, p ifor image collection,
Figure BDA0000380142000000022
be the sequence on image collection, y represents whole Sorting space;
4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple
Figure BDA0000380142000000023
m is training sample number, p ifor retrieving images, t jtext document set,
Figure BDA0000380142000000024
it is the sequence in text document set;
5) Query List on both direction is combined and obtains unified training sample.
Described step 2) comprising:
1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:
min U , V , ξ 1 , ξ 2 λ 2 | | U | | F 2 + λ 2 | | V | | F 2 + 1 N Σ i = 1 N ξ 1 , i + 1 M Σ j = N + 1 N + M ξ 2 , j s . t . ∀ i ∈ { 1 , . . . , N } , ∀ y ∈ y : δF ( t i , p i , y ) ≥ Δ ( y i * , y ) - ξ 1 , i ∀ j ∈ { N + 1 , . . . , N + M } , ∀ y ∈ y : δF ( p j , t j , y ) ≥ Δ ( y j * , y ) - ξ 2 , j . - - - ( 1 )
Wherein,
Figure BDA0000380142000000032
the mapping matrix to hidden space by text mapping,
Figure BDA0000380142000000033
be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ 1, iand ξ 2, jit is slack variable.The function F of definition is as follows:
F ( t , p , y ) = Σ i ∈ p + Σ j ∈ p - y ij ( Ut ) T V ( p i - p j ) | p + | · | p - | - - - ( 2 )
δF ( t i , p i , y ) = F ( t i , p i , y i * ) - F ( t i , p i , y ) - - - ( 3 )
F ( p , t , y ) = Σ i ∈ t + Σ j ∈ t - y ij ( Vp ) T U ( t i - t j ) | t + | · | t - | - - - ( 4 )
δF ( p j , t j , y ) = F ( p j , t j , y j * ) - F ( p j , t j , y ) - - - ( 5 )
Wherein, p +and p -represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t +and t -represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y ij=1, otherwise y ij=-1.In addition, definition loss function is Δ (y *, y)=1-MAP (y *, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;
2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.
Described step 3) comprising:
1) to being input as in the situation of text query sample t, to all image p iaccording to following formula, calculate the similarity of itself and query sample: f (t, p i)=(Ut) tvp i, then by similarity, from big to small image is sorted;
2) to being input as in the situation of image querying sample p, to all text document t iaccording to following formula, calculate the similarity of itself and query sample: f (t i, p)=(Ut i) tvp, then sorts to text document from big to small by similarity.
The present invention compares with background technology, and the useful effect having is:
The present invention is directed to Bidirectional sort training sample and proposed a set of new search method based on semantic content.Because the method has merged hidden space learning and two kinds of mechanism of Bidirectional sort study, take full advantage of Bidirectional sort training sample, for sequence performance, directly optimize simultaneously, therefore there is better sequence performance.
Accompanying drawing explanation
Fig. 1 be based on the study of hidden space learning and Bidirectional sort across media sort method schematic diagram;
Fig. 2 is the example of Query Result of the present invention.
Embodiment
The present invention learns multimedia document to carry out semantic understanding by merging hidden space learning and Bidirectional sort, all multimedia documents (text document, image) are mapped in a hidden space of unified semantic information of multimedia, thereby realize across media sequence retrieval.
Based on the study of hidden space learning and Bidirectional sort across media sort method, comprise the steps:
1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;
2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;
3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
Described step 1) comprising:
1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R m, wherein m is the dimension in text space;
2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R n, the dimension that wherein n is image space;
3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple
Figure BDA0000380142000000051
wherein N is training sample number, t ifor retrieval text, p ifor image collection,
Figure BDA0000380142000000052
be the sequence on image collection, y represents whole Sorting space;
4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple
Figure BDA0000380142000000053
m is training sample number, p jfor retrieving images, t jtext document set,
Figure BDA0000380142000000054
it is the sequence in text document set;
5) Query List on both direction is combined and obtains unified training sample.
Described step 2) comprising:
1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:
min U , V , ξ 1 , ξ 2 λ 2 | | U | | F 2 + λ 2 | | V | | F 2 + 1 N Σ i = 1 N ξ 1 , i + 1 M Σ j = N + 1 N + M ξ 2 , j s . t . ∀ i ∈ { 1 , . . . , N } , ∀ y ∈ y : δF ( t i , p i , y ) ≥ Δ ( y i * , y ) - ξ 1 , i ∀ j ∈ { N + 1 , . . . , N + M } , ∀ y ∈ y : δF ( p j , t j , y ) ≥ Δ ( y j * , y ) - ξ 2 , j . - - - ( 6 )
Wherein,
Figure BDA0000380142000000061
the mapping matrix to hidden space by text mapping,
Figure BDA0000380142000000062
be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ 1, iand ξ 2, jit is slack variable.The function F of definition is as follows:
F ( t , p , y ) = Σ i ∈ p + Σ j ∈ p - y ij ( Ut ) T V ( p i - p j ) | p + | · | p - | - - - ( 7 )
δF ( t i , p i , y ) = F ( t i , p i , y i * ) - F ( t i , p i , y ) - - - ( 8 )
F ( p , t , y ) = Σ i ∈ t + Σ j ∈ t - y ij ( Vp ) T U ( t i - t j ) | t + | · | t - | - - - ( 9 )
δF ( p j , t j , y ) = F ( p j , t j , y j * ) - F ( p j , t j , y ) - - - ( 10 )
Wherein, p +and p -represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t +and t -represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y ij=1, otherwise y ij=-1.In addition, definition loss function is Δ (y *, y)=1-MAP (y *, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;
2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.Concrete derivation algorithm is as follows:
Figure BDA0000380142000000067
Figure BDA0000380142000000071
To the optimum y of the searching in step 3 and step 5, can use SVMMAP method to solve.Finally solve linear mapping function that U that obtains and V are respectively Dao Yin space, text space and image space to the linear mapping function in hidden space.
Described step 3) comprising:
1) to being input as in the situation of text query sample t, to all image p iaccording to following formula, calculate the similarity of itself and query sample: f (t, p i)=(ut) tvp i, then by similarity, from big to small image is sorted;
2) to being input as in the situation of image querying sample p, to all text document t iaccording to following formula, calculate the similarity of itself and query sample: f (t i, p)=(Ut i) tvp, then sorts to text document from big to small by similarity.
Embodiment
In order to verify effect of the present invention, from the webpage of " figure wikipedia-every day one ", capture approximately 2900 webpages, be divided into 10 large classes, each webpage has comprised an image and several sections of relevant description texts, usings that this tests as data set.If image and text all belong to the class in 10 large classes, think image and text-dependent, otherwise uncorrelated.Data set is divided into training set and test set, and the present invention trains on training set, then on test set, carries out independent assessment.For feature extraction, according to the said step of the present invention, carry out, wherein remove common word and uncommon word after text space be set as 5000 dimensions, image space is set as 1000 dimensions.In order to evaluate objectively the performance of algorithm of the present invention, inventor uses Average Accuracy (Mean Average Precision, MAP) to evaluate the present invention.The result of MAP is as shown in table 1:
? MAP@50 MAP@all
Text query image 0.3981 0.2123
Image querying text 0.2599 0.2528
Table 1
Wherein MAP@50 is first 50 and returns results the MAP value calculating, and MAP@all is all MAP values that calculate that return results.
In order to represent better the present invention in the result across in media retrieval, in Fig. 2, presented the example of some Query Results.Fig. 2 is twice result for retrieval of the present invention, is respectively text retrieval image and image retrieval text.Wherein, when showing image retrieval text, the text returning has been used its corresponding image as displaying.No matter from the result presenting, can see, be with image querying text, or with text query image, and method of the present invention all has good effect, can return traditional single mode retrieval irrealizable semantically close result.

Claims (4)

  1. Based on the study of hidden space learning and Bidirectional sort across a media sort method, it is characterized in that comprising the steps:
    1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;
    2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;
    3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
  2. According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 1) comprising:
    1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R m, wherein m is the dimension in text space;
    2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R n, the dimension that wherein n is image space;
    3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple
    Figure FDA0000380141990000011
    wherein N is training sample number, t ifor retrieval text, p ifor image collection, be the sequence on image collection, y represents whole Sorting space;
    4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple
    Figure FDA0000380141990000022
    m is training sample number, p jfor retrieving images, t jtext document set,
    Figure FDA0000380141990000023
    it is the sequence in text document set;
    5) Query List on both direction is combined and obtains unified training sample.
  3. According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 2) comprising:
    1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:
    min U , V , ξ 1 , ξ 2 λ 2 | | U | | F 2 + λ 2 | | V | | F 2 + 1 N Σ i = 1 N ξ 1 , i + 1 M Σ j = N + 1 N + M ξ 2 , j s . t . ∀ i ∈ { 1 , . . . , N } , ∀ y ∈ y : δF ( t i , p i , y ) ≥ Δ ( y i * , y ) - ξ 1 , i ∀ j ∈ { N + 1 , . . . , N + M } , ∀ y ∈ y : δF ( p j , t j , y ) ≥ Δ ( y j * , y ) - ξ 2 , j . - - - ( 1 )
    Wherein,
    Figure FDA0000380141990000029
    the mapping matrix to hidden space by text mapping,
    Figure FDA00003801419900000210
    be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ 1, iand ξ 2, jit is slack variable.The function F of definition is as follows:
    F ( t , p , y ) = Σ i ∈ p + Σ j ∈ p - y ij ( Ut ) T V ( p i - p j ) | p + | · | p - | - - - ( 2 )
    δF ( t i , p i , y ) = F ( t i , p i , y i * ) - F ( t i , p i , y ) - - - ( 3 )
    F ( p , t , y ) = Σ i ∈ t + Σ j ∈ t - y ij ( Vp ) T U ( t i - t j ) | t + | · | t - | - - - ( 4 )
    δF ( p j , t j , y ) = F ( p j , t j , y j * ) - F ( p j , t j , y ) - - - ( 5 )
    Wherein, p +and p -represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t +and t -represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y ij=1, otherwise y ij=-1.In addition, definition loss function is Δ (y *, y)=1-MAP (y *, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;
    2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.
  4. According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 3) comprising:
    1) to being input as in the situation of text query sample t, to all image p iaccording to following formula, calculate the similarity of itself and query sample: f (t, p i)=(Ut) tvp i, then by similarity, from big to small image is sorted;
    2) to being input as in the situation of image querying sample p, to all text document t iaccording to following formula, calculate the similarity of itself and query sample: f (t i, p)=(Ut i) tvp, then sorts to text document from big to small by similarity.
CN201310410565.2A 2013-09-10 2013-09-10 Based on latent space study and Bidirectional sort study across media sort method Expired - Fee Related CN103559191B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310410565.2A CN103559191B (en) 2013-09-10 2013-09-10 Based on latent space study and Bidirectional sort study across media sort method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310410565.2A CN103559191B (en) 2013-09-10 2013-09-10 Based on latent space study and Bidirectional sort study across media sort method

Publications (2)

Publication Number Publication Date
CN103559191A true CN103559191A (en) 2014-02-05
CN103559191B CN103559191B (en) 2016-09-14

Family

ID=50013438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310410565.2A Expired - Fee Related CN103559191B (en) 2013-09-10 2013-09-10 Based on latent space study and Bidirectional sort study across media sort method

Country Status (1)

Country Link
CN (1) CN103559191B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104317834A (en) * 2014-10-10 2015-01-28 浙江大学 Cross-media sorting method based on deep neural network
CN104346450A (en) * 2014-10-29 2015-02-11 浙江大学 Cross-media ordering method based on multi-modal implicit coupling expression
CN104346440A (en) * 2014-10-10 2015-02-11 浙江大学 Neural-network-based cross-media Hash indexing method
WO2015192655A1 (en) * 2014-06-20 2015-12-23 华为技术有限公司 Method and device for establishing and using user recommendation model in social network
CN105701227A (en) * 2016-01-15 2016-06-22 北京大学 Cross-media similarity measure method and search method based on local association graph
CN106095829A (en) * 2016-06-01 2016-11-09 华侨大学 Cross-media retrieval method based on degree of depth study with the study of concordance expression of space
CN106537379A (en) * 2014-06-20 2017-03-22 谷歌公司 Fine-grained image similarity
CN106529583A (en) * 2016-11-01 2017-03-22 哈尔滨工程大学 Bag-of-visual-word-model-based indoor scene cognitive method
CN107357884A (en) * 2017-07-10 2017-11-17 中国人民解放军国防科学技术大学 A kind of different distance measure across media based on two-way study sequence
CN107562812A (en) * 2017-08-11 2018-01-09 北京大学 A kind of cross-module state similarity-based learning method based on the modeling of modality-specific semantic space
CN107657008A (en) * 2017-09-25 2018-02-02 中国科学院计算技术研究所 Across media training and search method based on depth discrimination sequence study
CN108228757A (en) * 2017-12-21 2018-06-29 北京市商汤科技开发有限公司 Image search method and device, electronic equipment, storage medium, program
CN108829847A (en) * 2018-06-20 2018-11-16 山东大学 Commodity search method and system based on multi-modal shopping preferences

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11893818B2 (en) 2021-07-21 2024-02-06 Abbyy Development Inc. Optimization and use of codebooks for document analysis

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1920818A (en) * 2006-09-14 2007-02-28 浙江大学 Transmedia search method based on multi-mode information convergence analysis

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1920818A (en) * 2006-09-14 2007-02-28 浙江大学 Transmedia search method based on multi-mode information convergence analysis

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
XINYAN LU, FEI WU ET AL.: "A Low Rank Structural Large Margin Method for Cross-Modal Ranking", 《PROCEEDINGS OF THE 36TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL》, 1 August 2013 (2013-08-01), pages 433 - 442 *
YI YANG, YUE-TING ZHUANG ET AL.: "Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval", 《IEEE TRANSACTIONS ON MULTIMEDIA》, vol. 10, no. 3, 30 April 2008 (2008-04-30), pages 437 - 446, XP011346504, DOI: doi:10.1109/TMM.2008.917359 *
YUE-TING ZHUANG, YI YANG ET AL.: "Mining Semantic Correlation of Heterogeneous Multimedia Data for Cross-Media Retrieval", 《IEEE TRANSACTION ON MULTIMEDIA》, vol. 10, no. 2, 29 February 2008 (2008-02-29), pages 221 - 229, XP011200215, DOI: doi:10.1109/TMM.2007.911822 *
吴飞,庄越挺: "互联网跨媒体分析与检索:理论与算法", 《计算机辅助设计与图形学学报》, vol. 22, no. 1, 31 January 2010 (2010-01-31), pages 1 - 9 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015192655A1 (en) * 2014-06-20 2015-12-23 华为技术有限公司 Method and device for establishing and using user recommendation model in social network
CN105446973A (en) * 2014-06-20 2016-03-30 华为技术有限公司 User recommend model establishment and application method and device in social network
CN105446973B (en) * 2014-06-20 2019-02-26 华为技术有限公司 The foundation of user's recommended models and application method and device in social networks
CN106537379A (en) * 2014-06-20 2017-03-22 谷歌公司 Fine-grained image similarity
CN104346440A (en) * 2014-10-10 2015-02-11 浙江大学 Neural-network-based cross-media Hash indexing method
CN104317834A (en) * 2014-10-10 2015-01-28 浙江大学 Cross-media sorting method based on deep neural network
CN104317834B (en) * 2014-10-10 2017-09-29 浙江大学 A kind of across media sort methods based on deep neural network
CN104346440B (en) * 2014-10-10 2017-06-23 浙江大学 A kind of across media hash indexing methods based on neutral net
CN104346450B (en) * 2014-10-29 2017-06-23 浙江大学 A kind of across media sort methods based on multi-modal recessive coupling expression
CN104346450A (en) * 2014-10-29 2015-02-11 浙江大学 Cross-media ordering method based on multi-modal implicit coupling expression
CN105701227A (en) * 2016-01-15 2016-06-22 北京大学 Cross-media similarity measure method and search method based on local association graph
CN105701227B (en) * 2016-01-15 2019-02-01 北京大学 A kind of across media method for measuring similarity and search method based on local association figure
CN106095829A (en) * 2016-06-01 2016-11-09 华侨大学 Cross-media retrieval method based on degree of depth study with the study of concordance expression of space
CN106095829B (en) * 2016-06-01 2019-08-06 华侨大学 Cross-media retrieval method based on deep learning and the study of consistency expression of space
CN106529583A (en) * 2016-11-01 2017-03-22 哈尔滨工程大学 Bag-of-visual-word-model-based indoor scene cognitive method
CN107357884A (en) * 2017-07-10 2017-11-17 中国人民解放军国防科学技术大学 A kind of different distance measure across media based on two-way study sequence
CN107562812A (en) * 2017-08-11 2018-01-09 北京大学 A kind of cross-module state similarity-based learning method based on the modeling of modality-specific semantic space
CN107562812B (en) * 2017-08-11 2021-01-15 北京大学 Cross-modal similarity learning method based on specific modal semantic space modeling
CN107657008A (en) * 2017-09-25 2018-02-02 中国科学院计算技术研究所 Across media training and search method based on depth discrimination sequence study
CN107657008B (en) * 2017-09-25 2020-11-03 中国科学院计算技术研究所 Cross-media training and retrieval method based on deep discrimination ranking learning
CN108228757A (en) * 2017-12-21 2018-06-29 北京市商汤科技开发有限公司 Image search method and device, electronic equipment, storage medium, program
CN108829847A (en) * 2018-06-20 2018-11-16 山东大学 Commodity search method and system based on multi-modal shopping preferences
CN108829847B (en) * 2018-06-20 2020-11-17 山东大学 Multi-modal modeling method based on translation and application thereof in commodity retrieval

Also Published As

Publication number Publication date
CN103559191B (en) 2016-09-14

Similar Documents

Publication Publication Date Title
CN103559191B (en) Based on latent space study and Bidirectional sort study across media sort method
US9589208B2 (en) Retrieval of similar images to a query image
CN104317834B (en) A kind of across media sort methods based on deep neural network
CN111914558B (en) Course knowledge relation extraction method and system based on sentence bag attention remote supervision
Lv et al. Learning to model relatedness for news recommendation
US20160034512A1 (en) Context-based metadata generation and automatic annotation of electronic media in a computer network
CN108280114B (en) Deep learning-based user literature reading interest analysis method
US20170357878A1 (en) Multi-dimensional realization of visual content of an image collection
US8788503B1 (en) Content identification
CN105205096A (en) Text modal and image modal crossing type data retrieval method
US20120166439A1 (en) Method and system for classifying web sites using query-based web site models
CN105426529A (en) Image retrieval method and system based on user search intention positioning
Noel et al. Applicability of Latent Dirichlet Allocation to multi-disk search
CN103440262A (en) Image searching system and image searching method basing on relevance feedback and Bag-of-Features
US20120117090A1 (en) System and method for managing digital contents
Chen et al. Fine-grained product categorization in e-commerce
CN112131453A (en) Method, device and storage medium for detecting network bad short text based on BERT
Gialampoukidis et al. A hybrid graph-based and non-linear late fusion approach for multimedia retrieval
Wang et al. Constructing a comprehensive events database from the web
Li et al. User-curated image collections: Modeling and recommendation
CN111259223B (en) News recommendation and text classification method based on emotion analysis model
TW201243627A (en) Multi-label text categorization based on fuzzy similarity and k nearest neighbors
CN107423294A (en) A kind of community image search method and system
KR101544142B1 (en) Searching method and system based on topic
Singhal et al. Leveraging the web for automating tag expansion for low-content items

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160914

Termination date: 20180910

CF01 Termination of patent right due to non-payment of annual fee