CN103559191A - Cross-media sorting method based on hidden space learning and two-way sorting learning - Google Patents
Cross-media sorting method based on hidden space learning and two-way sorting learning Download PDFInfo
- Publication number
- CN103559191A CN103559191A CN201310410565.2A CN201310410565A CN103559191A CN 103559191 A CN103559191 A CN 103559191A CN 201310410565 A CN201310410565 A CN 201310410565A CN 103559191 A CN103559191 A CN 103559191A
- Authority
- CN
- China
- Prior art keywords
- image
- text
- media
- retrieval
- query
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
Abstract
The invention discloses a cross-media sorting method based on hidden space learning and two-way sorting learning. The method includes 1, centrally constructing sorting samples of text retrieval images and sorting samples of image retrieval texts into a training sample; 2, performing cross-media sorting learning based on the hidden space learning and the two-way sorting learning on the constructed training sample, and acquiring a multimedia semantic space and a cross-media sorting model; 3, using the learned cross-media sorting model to performing cross-media sorting. The method can be applied in the text retrieval images and the image retrieval texts, and modeling is performed on two retrieval directions simultaneously, the acquired semantic understanding capacity of a retrieval model is stronger, and retrieval accuracy is higher as compared with the method considering one-way sorting learning only.
Description
Technical field
The present invention design is across media retrieval, relate in particular to a kind of based on hidden space learning and Bidirectional sort study across media sort method.
Background technology
Image is current very common file type, and it has certain semanteme.In general, image is comprised of pixel one by one, and computing machine can not directly be understood the semantic information that image contains.Along with the development of multimedia technology and network technology, increasing image emerges.Retrieval technique can help user's fast finding in the data of magnanimity to own interested content, to become field more and more important in Computer Applied Technology.No matter traditional retrieval technique, be retrieval or the content-based retrieval based on keyword, all can not meet well user and wish the demand with text retrieval image or image retrieval text.In searching system based on keyword, need in advance image to be marked.But because the amount of images existing is at present huge, therefore mark process engineering amount vast and numerous, and due to the impact of the marked content person's subjective factor that inevitably can be subject to mark, for same image, different mark persons may mark different keywords, so keyword often can not objectively respond whole semantemes that image contains.Content-based retrieval system does not need image to mark, user submits to a retrieval sample to retrieve image, but there are two weakness in traditional content-based retrieval technology: the one, and user can only retrieve and the media object of inquiring about the identical mode of example, can only pass through image retrieval image; The 2nd, the low-level image feature of image and high-level semantic exist semantic gap so retrieval performance to be restricted.In order to cross over the semantic gap between different modalities data, understand better semantic information of multimedia, simultaneously in order to meet user across the demand of Media Inquiries, seek a kind of quite meaningful across media sort method based on semanteme.
Summary of the invention
The object of the invention is to overcome the deficiencies in the prior art, provide a kind of based on the study of hidden space learning and Bidirectional sort across media sort method.
Based on the study of hidden space learning and Bidirectional sort across media sort method, comprise the steps:
1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;
2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;
3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
Described step 1) comprising:
1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R
m, wherein m is the dimension in text space;
2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R
n, the dimension that wherein n is image space;
3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple
wherein N is training sample number, t
ifor retrieval text, p
ifor image collection,
be the sequence on image collection, y represents whole Sorting space;
4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple
m is training sample number, p
ifor retrieving images, t
jtext document set,
it is the sequence in text document set;
5) Query List on both direction is combined and obtains unified training sample.
Described step 2) comprising:
1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:
Wherein,
the mapping matrix to hidden space by text mapping,
be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ
1, iand ξ
2, jit is slack variable.The function F of definition is as follows:
Wherein, p
+and p
-represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t
+and t
-represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y
ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y
ij=1, otherwise y
ij=-1.In addition, definition loss function is Δ (y
*, y)=1-MAP (y
*, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;
2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.
Described step 3) comprising:
1) to being input as in the situation of text query sample t, to all image p
iaccording to following formula, calculate the similarity of itself and query sample: f (t, p
i)=(Ut)
tvp
i, then by similarity, from big to small image is sorted;
2) to being input as in the situation of image querying sample p, to all text document t
iaccording to following formula, calculate the similarity of itself and query sample: f (t
i, p)=(Ut
i)
tvp, then sorts to text document from big to small by similarity.
The present invention compares with background technology, and the useful effect having is:
The present invention is directed to Bidirectional sort training sample and proposed a set of new search method based on semantic content.Because the method has merged hidden space learning and two kinds of mechanism of Bidirectional sort study, take full advantage of Bidirectional sort training sample, for sequence performance, directly optimize simultaneously, therefore there is better sequence performance.
Accompanying drawing explanation
Fig. 1 be based on the study of hidden space learning and Bidirectional sort across media sort method schematic diagram;
Fig. 2 is the example of Query Result of the present invention.
Embodiment
The present invention learns multimedia document to carry out semantic understanding by merging hidden space learning and Bidirectional sort, all multimedia documents (text document, image) are mapped in a hidden space of unified semantic information of multimedia, thereby realize across media sequence retrieval.
Based on the study of hidden space learning and Bidirectional sort across media sort method, comprise the steps:
1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;
2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;
3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
Described step 1) comprising:
1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R
m, wherein m is the dimension in text space;
2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R
n, the dimension that wherein n is image space;
3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple
wherein N is training sample number, t
ifor retrieval text, p
ifor image collection,
be the sequence on image collection, y represents whole Sorting space;
4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple
m is training sample number, p
jfor retrieving images, t
jtext document set,
it is the sequence in text document set;
5) Query List on both direction is combined and obtains unified training sample.
Described step 2) comprising:
1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:
Wherein,
the mapping matrix to hidden space by text mapping,
be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ
1, iand ξ
2, jit is slack variable.The function F of definition is as follows:
Wherein, p
+and p
-represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t
+and t
-represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y
ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y
ij=1, otherwise y
ij=-1.In addition, definition loss function is Δ (y
*, y)=1-MAP (y
*, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;
2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.Concrete derivation algorithm is as follows:
To the optimum y of the searching in step 3 and step 5, can use SVMMAP method to solve.Finally solve linear mapping function that U that obtains and V are respectively Dao Yin space, text space and image space to the linear mapping function in hidden space.
Described step 3) comprising:
1) to being input as in the situation of text query sample t, to all image p
iaccording to following formula, calculate the similarity of itself and query sample: f (t, p
i)=(ut)
tvp
i, then by similarity, from big to small image is sorted;
2) to being input as in the situation of image querying sample p, to all text document t
iaccording to following formula, calculate the similarity of itself and query sample: f (t
i, p)=(Ut
i)
tvp, then sorts to text document from big to small by similarity.
Embodiment
In order to verify effect of the present invention, from the webpage of " figure wikipedia-every day one ", capture approximately 2900 webpages, be divided into 10 large classes, each webpage has comprised an image and several sections of relevant description texts, usings that this tests as data set.If image and text all belong to the class in 10 large classes, think image and text-dependent, otherwise uncorrelated.Data set is divided into training set and test set, and the present invention trains on training set, then on test set, carries out independent assessment.For feature extraction, according to the said step of the present invention, carry out, wherein remove common word and uncommon word after text space be set as 5000 dimensions, image space is set as 1000 dimensions.In order to evaluate objectively the performance of algorithm of the present invention, inventor uses Average Accuracy (Mean Average Precision, MAP) to evaluate the present invention.The result of MAP is as shown in table 1:
? | MAP@50 | MAP@all |
Text query image | 0.3981 | 0.2123 |
Image querying text | 0.2599 | 0.2528 |
Table 1
Wherein MAP@50 is first 50 and returns results the MAP value calculating, and MAP@all is all MAP values that calculate that return results.
In order to represent better the present invention in the result across in media retrieval, in Fig. 2, presented the example of some Query Results.Fig. 2 is twice result for retrieval of the present invention, is respectively text retrieval image and image retrieval text.Wherein, when showing image retrieval text, the text returning has been used its corresponding image as displaying.No matter from the result presenting, can see, be with image querying text, or with text query image, and method of the present invention all has good effect, can return traditional single mode retrieval irrealizable semantically close result.
Claims (4)
- Based on the study of hidden space learning and Bidirectional sort across a media sort method, it is characterized in that comprising the steps:1) the ordered samples unification of the ordered samples of text retrieval image and image retrieval text is configured to training sample;2) to build the training sample obtain carry out based on hidden space learning and Bidirectional sort study across media sequence study, obtain semantic information of multimedia space and across media order models;3) what use study obtained carries out sorting across media across media order models: after submit queries example, first find this inquiry example at the coordinate in semantic information of multimedia space, then according to the coordinate in semantic information of multimedia space across media object, calculate inquiry example and other all across media object the similarity in semantic information of multimedia space, and according to this similarity, to all, across media object, sort.
- According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 1) comprising:1) to all text documents in training sample, utilize word bag model to carry out feature representation, and utilize TF-IDF method to be weighted each word, text is finally represented as t ∈ R m, wherein m is the dimension in text space;2) all image documents in training sample are extracted to SIFT local feature point, and these local feature points are carried out to K-Means cluster, with cluster centre, build code book and vision word.Then to every pictures, each the local feature point that calculates this picture by Euclidean distance arest neighbors should belong to which the vision word in code book, last the same with the processing to text document, utilize word bag model and TF-IDF method to carry out feature representation, image is finally represented as p ∈ R n, the dimension that wherein n is image space;3) for text retrieval image direction, to each query text, build the sorted lists of an image, to be wherein marked as query semantics relevant or semantic uncorrelated for the image in list, so the training sample of each text retrieval image is represented as tlv triple wherein N is training sample number, t ifor retrieval text, p ifor image collection, be the sequence on image collection, y represents whole Sorting space;4) for image retrieval text orientation, to each query image, build the sorted lists of a text document, to be wherein marked as query semantics relevant or semantic uncorrelated for the text document in list, and the training sample of each image retrieval text is represented as tlv triple m is training sample number, p jfor retrieving images, t jtext document set, it is the sequence in text document set;5) Query List on both direction is combined and obtains unified training sample.
- According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 2) comprising:1) use structural support vector machine to build an optimization problem, its objective function is to make mapping function between structure risk and empiric risk, obtain compromise:Wherein, the mapping matrix to hidden space by text mapping, be the mapping matrix to hidden space by image mapped, k is the dimension in hidden space, ξ 1, iand ξ 2, jit is slack variable.The function F of definition is as follows:Wherein, p +and p -represent respectively the image collection relevant to query text t and with the incoherent image collection of query text t, t +and t -represent respectively the text collection relevant to query image p, with the incoherent text collection of query image p.Y ijvalue according to sequence y, decide: if document i is more forward than the sequence of document j, y ij=1, otherwise y ij=-1.In addition, definition loss function is Δ (y *, y)=1-MAP (y *, y), MAP is Mean Average Precision, conventional performance measurement standard in a kind of information retrieval, and MAP value is larger, and sequence performance is better, and the value of loss function is just less;2) input two-way ordered samples as the training sample of optimization problem, solve and obtain parameter U and V.
- According to claim 1 a kind of based on the study of hidden space learning and Bidirectional sort across media sort method, it is characterized in that described step 3) comprising:1) to being input as in the situation of text query sample t, to all image p iaccording to following formula, calculate the similarity of itself and query sample: f (t, p i)=(Ut) tvp i, then by similarity, from big to small image is sorted;2) to being input as in the situation of image querying sample p, to all text document t iaccording to following formula, calculate the similarity of itself and query sample: f (t i, p)=(Ut i) tvp, then sorts to text document from big to small by similarity.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310410565.2A CN103559191B (en) | 2013-09-10 | 2013-09-10 | Based on latent space study and Bidirectional sort study across media sort method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310410565.2A CN103559191B (en) | 2013-09-10 | 2013-09-10 | Based on latent space study and Bidirectional sort study across media sort method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103559191A true CN103559191A (en) | 2014-02-05 |
CN103559191B CN103559191B (en) | 2016-09-14 |
Family
ID=50013438
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310410565.2A Expired - Fee Related CN103559191B (en) | 2013-09-10 | 2013-09-10 | Based on latent space study and Bidirectional sort study across media sort method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103559191B (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104317834A (en) * | 2014-10-10 | 2015-01-28 | 浙江大学 | Cross-media sorting method based on deep neural network |
CN104346450A (en) * | 2014-10-29 | 2015-02-11 | 浙江大学 | Cross-media ordering method based on multi-modal implicit coupling expression |
CN104346440A (en) * | 2014-10-10 | 2015-02-11 | 浙江大学 | Neural-network-based cross-media Hash indexing method |
WO2015192655A1 (en) * | 2014-06-20 | 2015-12-23 | 华为技术有限公司 | Method and device for establishing and using user recommendation model in social network |
CN105701227A (en) * | 2016-01-15 | 2016-06-22 | 北京大学 | Cross-media similarity measure method and search method based on local association graph |
CN106095829A (en) * | 2016-06-01 | 2016-11-09 | 华侨大学 | Cross-media retrieval method based on degree of depth study with the study of concordance expression of space |
CN106537379A (en) * | 2014-06-20 | 2017-03-22 | 谷歌公司 | Fine-grained image similarity |
CN106529583A (en) * | 2016-11-01 | 2017-03-22 | 哈尔滨工程大学 | Bag-of-visual-word-model-based indoor scene cognitive method |
CN107357884A (en) * | 2017-07-10 | 2017-11-17 | 中国人民解放军国防科学技术大学 | A kind of different distance measure across media based on two-way study sequence |
CN107562812A (en) * | 2017-08-11 | 2018-01-09 | 北京大学 | A kind of cross-module state similarity-based learning method based on the modeling of modality-specific semantic space |
CN107657008A (en) * | 2017-09-25 | 2018-02-02 | 中国科学院计算技术研究所 | Across media training and search method based on depth discrimination sequence study |
CN108228757A (en) * | 2017-12-21 | 2018-06-29 | 北京市商汤科技开发有限公司 | Image search method and device, electronic equipment, storage medium, program |
CN108829847A (en) * | 2018-06-20 | 2018-11-16 | 山东大学 | Commodity search method and system based on multi-modal shopping preferences |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11893818B2 (en) | 2021-07-21 | 2024-02-06 | Abbyy Development Inc. | Optimization and use of codebooks for document analysis |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
-
2013
- 2013-09-10 CN CN201310410565.2A patent/CN103559191B/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
Non-Patent Citations (4)
Title |
---|
XINYAN LU, FEI WU ET AL.: "A Low Rank Structural Large Margin Method for Cross-Modal Ranking", 《PROCEEDINGS OF THE 36TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL》, 1 August 2013 (2013-08-01), pages 433 - 442 * |
YI YANG, YUE-TING ZHUANG ET AL.: "Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval", 《IEEE TRANSACTIONS ON MULTIMEDIA》, vol. 10, no. 3, 30 April 2008 (2008-04-30), pages 437 - 446, XP011346504, DOI: doi:10.1109/TMM.2008.917359 * |
YUE-TING ZHUANG, YI YANG ET AL.: "Mining Semantic Correlation of Heterogeneous Multimedia Data for Cross-Media Retrieval", 《IEEE TRANSACTION ON MULTIMEDIA》, vol. 10, no. 2, 29 February 2008 (2008-02-29), pages 221 - 229, XP011200215, DOI: doi:10.1109/TMM.2007.911822 * |
吴飞,庄越挺: "互联网跨媒体分析与检索:理论与算法", 《计算机辅助设计与图形学学报》, vol. 22, no. 1, 31 January 2010 (2010-01-31), pages 1 - 9 * |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015192655A1 (en) * | 2014-06-20 | 2015-12-23 | 华为技术有限公司 | Method and device for establishing and using user recommendation model in social network |
CN105446973A (en) * | 2014-06-20 | 2016-03-30 | 华为技术有限公司 | User recommend model establishment and application method and device in social network |
CN105446973B (en) * | 2014-06-20 | 2019-02-26 | 华为技术有限公司 | The foundation of user's recommended models and application method and device in social networks |
CN106537379A (en) * | 2014-06-20 | 2017-03-22 | 谷歌公司 | Fine-grained image similarity |
CN104346440A (en) * | 2014-10-10 | 2015-02-11 | 浙江大学 | Neural-network-based cross-media Hash indexing method |
CN104317834A (en) * | 2014-10-10 | 2015-01-28 | 浙江大学 | Cross-media sorting method based on deep neural network |
CN104317834B (en) * | 2014-10-10 | 2017-09-29 | 浙江大学 | A kind of across media sort methods based on deep neural network |
CN104346440B (en) * | 2014-10-10 | 2017-06-23 | 浙江大学 | A kind of across media hash indexing methods based on neutral net |
CN104346450B (en) * | 2014-10-29 | 2017-06-23 | 浙江大学 | A kind of across media sort methods based on multi-modal recessive coupling expression |
CN104346450A (en) * | 2014-10-29 | 2015-02-11 | 浙江大学 | Cross-media ordering method based on multi-modal implicit coupling expression |
CN105701227A (en) * | 2016-01-15 | 2016-06-22 | 北京大学 | Cross-media similarity measure method and search method based on local association graph |
CN105701227B (en) * | 2016-01-15 | 2019-02-01 | 北京大学 | A kind of across media method for measuring similarity and search method based on local association figure |
CN106095829A (en) * | 2016-06-01 | 2016-11-09 | 华侨大学 | Cross-media retrieval method based on degree of depth study with the study of concordance expression of space |
CN106095829B (en) * | 2016-06-01 | 2019-08-06 | 华侨大学 | Cross-media retrieval method based on deep learning and the study of consistency expression of space |
CN106529583A (en) * | 2016-11-01 | 2017-03-22 | 哈尔滨工程大学 | Bag-of-visual-word-model-based indoor scene cognitive method |
CN107357884A (en) * | 2017-07-10 | 2017-11-17 | 中国人民解放军国防科学技术大学 | A kind of different distance measure across media based on two-way study sequence |
CN107562812A (en) * | 2017-08-11 | 2018-01-09 | 北京大学 | A kind of cross-module state similarity-based learning method based on the modeling of modality-specific semantic space |
CN107562812B (en) * | 2017-08-11 | 2021-01-15 | 北京大学 | Cross-modal similarity learning method based on specific modal semantic space modeling |
CN107657008A (en) * | 2017-09-25 | 2018-02-02 | 中国科学院计算技术研究所 | Across media training and search method based on depth discrimination sequence study |
CN107657008B (en) * | 2017-09-25 | 2020-11-03 | 中国科学院计算技术研究所 | Cross-media training and retrieval method based on deep discrimination ranking learning |
CN108228757A (en) * | 2017-12-21 | 2018-06-29 | 北京市商汤科技开发有限公司 | Image search method and device, electronic equipment, storage medium, program |
CN108829847A (en) * | 2018-06-20 | 2018-11-16 | 山东大学 | Commodity search method and system based on multi-modal shopping preferences |
CN108829847B (en) * | 2018-06-20 | 2020-11-17 | 山东大学 | Multi-modal modeling method based on translation and application thereof in commodity retrieval |
Also Published As
Publication number | Publication date |
---|---|
CN103559191B (en) | 2016-09-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103559191B (en) | Based on latent space study and Bidirectional sort study across media sort method | |
US9589208B2 (en) | Retrieval of similar images to a query image | |
CN104317834B (en) | A kind of across media sort methods based on deep neural network | |
CN111914558B (en) | Course knowledge relation extraction method and system based on sentence bag attention remote supervision | |
Lv et al. | Learning to model relatedness for news recommendation | |
US20160034512A1 (en) | Context-based metadata generation and automatic annotation of electronic media in a computer network | |
CN108280114B (en) | Deep learning-based user literature reading interest analysis method | |
US20170357878A1 (en) | Multi-dimensional realization of visual content of an image collection | |
US8788503B1 (en) | Content identification | |
CN105205096A (en) | Text modal and image modal crossing type data retrieval method | |
US20120166439A1 (en) | Method and system for classifying web sites using query-based web site models | |
CN105426529A (en) | Image retrieval method and system based on user search intention positioning | |
Noel et al. | Applicability of Latent Dirichlet Allocation to multi-disk search | |
CN103440262A (en) | Image searching system and image searching method basing on relevance feedback and Bag-of-Features | |
US20120117090A1 (en) | System and method for managing digital contents | |
Chen et al. | Fine-grained product categorization in e-commerce | |
CN112131453A (en) | Method, device and storage medium for detecting network bad short text based on BERT | |
Gialampoukidis et al. | A hybrid graph-based and non-linear late fusion approach for multimedia retrieval | |
Wang et al. | Constructing a comprehensive events database from the web | |
Li et al. | User-curated image collections: Modeling and recommendation | |
CN111259223B (en) | News recommendation and text classification method based on emotion analysis model | |
TW201243627A (en) | Multi-label text categorization based on fuzzy similarity and k nearest neighbors | |
CN107423294A (en) | A kind of community image search method and system | |
KR101544142B1 (en) | Searching method and system based on topic | |
Singhal et al. | Leveraging the web for automating tag expansion for low-content items |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160914 Termination date: 20180910 |
|
CF01 | Termination of patent right due to non-payment of annual fee |