CN104794210A - Image retrieval method combining visual saliency and phrases - Google Patents

Info

Publication number
CN104794210A
Authority
CN
China
Prior art keywords
image
saliency
super
estimated
conducted
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510202152.4A
Other languages
Chinese (zh)
Inventor
乔小燕
宫召华
孔凡秋
于永胜
刘重阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Technology and Business University
Original Assignee
Shandong Technology and Business University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2015-04-23
Filing date
2015-04-23
Publication date
2015-07-22
Application filed by Shandong Technology and Business University
Priority to CN201510202152.4A
Publication of CN104794210A

Abstract

The invention discloses an image retrieval method combining visual saliency and phrases. The method comprises the following steps: a query image is input and decoded into the YUV color space, and K-means clustering is performed on the pixels in the YUV color space so that the image is divided into multiple superpixel units; a likelihood computation is performed on each superpixel unit of the divided image, and the four resulting likelihoods are fused to obtain a saliency map at superpixel precision; bilateral Gaussian filtering is applied to obtain an image saliency map at pixel precision; adaptive threshold segmentation is performed on the image saliency map to obtain a binary image in which the target portion stands out; a dictionary is built, the visual words of the salient region of the image are extracted, and an image description is generated; and the similarity between the query image and each image in an image library is calculated. The method reflects the salient region of an image more accurately, combines visual saliency with phrases effectively, and gives good retrieval results.
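The first step of the abstract (decoding to YUV and clustering pixels into superpixel units) can be sketched as follows. This is a minimal illustration and not the patented implementation: the function names are hypothetical, the BT.601 conversion constants are an assumed choice of YUV definition, and appending pixel coordinates to the colour features and using farthest-point initialisation are assumptions made so that clusters stay spatially compact and the result is deterministic.

```python
def rgb_to_yuv(r, g, b):
    """Convert one RGB pixel (0..255) to YUV using BT.601 weights (assumed)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    u = 0.492 * (b - y)
    v = 0.877 * (r - y)
    return y, u, v


def sq_dist(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_superpixels(image, k, iterations=10, spatial_weight=1.0):
    """Cluster pixels on (Y, U, V, x, y) features and return a label map.

    image is a list of rows, each row a list of (r, g, b) tuples.
    The spatial terms are an assumption; the source only names YUV pixels.
    """
    height, width = len(image), len(image[0])
    features = []
    for row in range(height):
        for col in range(width):
            y, u, v = rgb_to_yuv(*image[row][col])
            features.append((y, u, v, spatial_weight * col, spatial_weight * row))

    # Farthest-point initialisation (assumed): deterministic and well spread.
    centres = [features[0]]
    while len(centres) < k:
        centres.append(max(features, key=lambda f: min(sq_dist(f, c) for c in centres)))

    labels = [0] * len(features)
    for _ in range(iterations):
        # Assignment step: attach every pixel to its nearest centre.
        for i, f in enumerate(features):
            labels[i] = min(range(k), key=lambda c: sq_dist(f, centres[c]))
        # Update step: move each centre to the mean of its members.
        for c in range(k):
            members = [f for f, lab in zip(features, labels) if lab == c]
            if members:
                centres[c] = tuple(sum(dim) / len(members) for dim in zip(*members))

    # Reshape the flat label list back into image rows.
    return [labels[r * width:(r + 1) * width] for r in range(height)]
```

Each distinct label in the returned map would correspond to one superpixel unit on which the later measures are computed. A production system would more likely rely on an optimised library routine than on this pure-Python loop.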

Description

Image retrieval method combining visual saliency and phrases
Technical field
The invention belongs to the field of image processing, and specifically relates to an image retrieval method combining visual saliency and phrases.
Background art
With the rapid development and application of computer, network, and multimedia technology, the number of digital images is growing at an astonishing rate, and finding the images people need quickly and efficiently in massive digital image collections has become a problem in urgent need of a solution. Image retrieval technology emerged in response and has developed considerably, from the earliest retrieval based on manual image annotation to today's content-based image retrieval. The precision and efficiency of image retrieval have both improved significantly, but they still cannot meet people's needs. The key problem is that there is currently no method that lets a computer understand image semantics as fully as a person does. If the real meaning of an image could be mined further and expressed accurately in a computer, the effectiveness of image retrieval would certainly improve.
Summary of the invention
To solve the above problem, the invention provides an image retrieval method combining visual saliency and phrases.
To achieve the above object, the technical solution adopted by the invention is as follows:
An image retrieval method combining visual saliency and phrases, comprising the following steps:
S1. Input a query image, decode it into the YUV color space, and perform K-means clustering on the pixels in the YUV color space, thereby dividing the image into a number of superpixel units;
S2. Perform a measure (likelihood) computation on each superpixel unit of the segmented image to obtain the values of the different measures for the query image; the computation covers the color independence measure, the color spatial distribution measure, the motion independence measure, and the spatial distribution measure;
S3. Linearly normalize each measure obtained in step S2 to the range [0, 1], and fuse the four measures for each superpixel unit to obtain a saliency map at superpixel precision;
S4. Apply bilateral Gaussian filtering to the superpixel-precision saliency map obtained in step S3 to obtain an image saliency map at pixel precision;
S5. Apply adaptive threshold segmentation, using the maximum between-class variance method (Otsu's method), to the image saliency map obtained in step S4 to obtain a binary map in which the salient target portion stands out;
S6. Use the SIFT algorithm to extract SIFT feature points from images of different categories in the query image library, gather all feature point vectors together, merge similar SIFT feature points with the K-means clustering algorithm, and construct a dictionary containing a number of words;
S7. Extract the visual words of the salient region of the image, count the visual words in the image saliency map, construct visual phrases, and generate the image description of the image;
S8. Calculate the similarity between the query image and each image in the image library, sort all images in the library by similarity value, and return the relevant images as the query result as required.
Wherein the decision threshold ranges from 0.15 to 0.35.
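Steps S3 and S5 can be illustrated with a short sketch. It rests on stated assumptions: the text does not give the fusion rule, so a plain average of the normalized measures is used here purely as a placeholder, and the function names are hypothetical. The thresholding routine follows the standard maximum between-class variance criterion on a histogram of saliency values in [0, 1].

```python
def normalize(values):
    """Linearly rescale a list of numbers to the range [0, 1] (step S3)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def fuse(measures):
    """Fuse per-superpixel measures by averaging (assumed rule, not from the source)."""
    normalized = [normalize(m) for m in measures]
    return [sum(column) / len(column) for column in zip(*normalized)]


def otsu_threshold(values, bins=256):
    """Return the threshold in [0, 1] that maximises between-class variance (step S5)."""
    hist = [0] * bins
    for v in values:
        hist[min(int(v * bins), bins - 1)] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_bin, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(bins):
        w0 += hist[t]            # number of samples in the low class
        sum0 += t * hist[t]      # sum of bin indices in the low class
        if w0 == 0:
            continue
        w1 = total - w0          # number of samples in the high class
        if w1 == 0:
            break
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_bin = var, t
    # Values in bins up to best_bin form the background class.
    return (best_bin + 1) / bins


def binarize(saliency, threshold):
    """Mark values at or above the threshold as salient (1), others as 0."""
    return [1 if v >= threshold else 0 for v in saliency]
```

For example, `binarize(s, otsu_threshold(s))` applied to a flattened saliency map `s` yields the binary mask used to select the salient region in the later steps.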
The invention has the following beneficial effects:
It reflects the salient region of the query image more accurately, combines visual saliency with phrases effectively, and gives good retrieval results.
Brief description of the drawings
Fig. 1 is a flowchart of an image retrieval method combining visual saliency and phrases according to an embodiment of the invention.
Detailed description of the embodiments
To make the objects and advantages of the invention clearer, the invention is described in further detail below with reference to an embodiment. It should be understood that the specific embodiment described here serves only to explain the invention and is not intended to limit it.
As shown in Fig. 1, an embodiment of the invention provides an image retrieval method combining visual saliency and phrases, comprising the following steps:
S1. Input a query image, decode it into the YUV color space, and perform K-means clustering on the pixels in the YUV color space, thereby dividing the query image into a number of superpixel units;
S2. Perform a measure (likelihood) computation on each superpixel unit of the segmented image to obtain the values of the different measures for the query image; the computation covers the color independence measure, the color spatial distribution measure, the motion independence measure, and the spatial distribution measure;
S3. Linearly normalize each measure obtained in step S2 to the range [0, 1], and fuse the four measures for each superpixel unit to obtain a saliency map at superpixel precision;
S4. Apply bilateral Gaussian filtering to the superpixel-precision saliency map obtained in step S3 to obtain an image saliency map at pixel precision;
S5. Apply adaptive threshold segmentation, using the maximum between-class variance method (Otsu's method), to the image saliency map obtained in step S4 to obtain a binary map in which the salient target portion stands out;
S6. Use the SIFT algorithm to extract SIFT feature points from images of different categories in the query image library, gather all feature point vectors together, merge similar SIFT feature points with the K-means clustering algorithm, and construct a dictionary containing a number of words;
S7. Extract the visual words of the salient region of the image, count the visual words in the image saliency map, construct visual phrases, and generate the image description of the image;
S8. Calculate the similarity between the query image and each image in the image library, sort all images in the library by similarity value, and return the relevant images as the query result as required.
The decision threshold ranges from 0.15 to 0.35.
The above is only a preferred embodiment of the invention. It should be noted that those skilled in the art may make improvements and modifications without departing from the principles of the invention, and such improvements and modifications shall also be regarded as falling within the scope of protection of the invention.
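Steps S7 and S8 can be sketched as below, under several assumptions. SIFT keypoints and descriptors are taken as already extracted, the dictionary is taken as a list of cluster centres from step S6, the visual-phrase construction is omitted, and cosine similarity between visual-word histograms stands in for the similarity measure, which the text does not specify. All function names are hypothetical.

```python
import math


def nearest_word(descriptor, dictionary):
    """Index of the dictionary centre closest to a descriptor."""
    return min(
        range(len(dictionary)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(descriptor, dictionary[i])),
    )


def salient_histogram(keypoints, descriptors, mask, dictionary):
    """Count visual words whose keypoints lie inside the salient mask (step S7).

    keypoints are (x, y) integer positions; mask is the binary map from S5.
    """
    hist = [0] * len(dictionary)
    for (x, y), desc in zip(keypoints, descriptors):
        if mask[y][x]:
            hist[nearest_word(desc, dictionary)] += 1
    return hist


def cosine_similarity(a, b):
    """Cosine similarity of two histograms; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_images(query_hist, library):
    """Sort library image names by similarity to the query, best first (step S8).

    library maps an image name to its precomputed visual-word histogram.
    """
    return sorted(
        library,
        key=lambda name: cosine_similarity(query_hist, library[name]),
        reverse=True,
    )
```

Restricting the histogram to keypoints inside the mask is what ties the saliency stage to the retrieval stage: background features do not contribute to the image description.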

Claims (2)

1. An image retrieval method combining visual saliency and phrases, characterized by comprising the following steps:
S1. Input a query image, decode it into the YUV color space, and perform K-means clustering on the pixels in the YUV color space, thereby dividing the query image into a number of superpixel units;
S2. Perform a measure (likelihood) computation on each superpixel unit of the segmented image to obtain the values of the different measures for the query image; the computation covers the color independence measure, the color spatial distribution measure, the motion independence measure, and the spatial distribution measure;
S3. Linearly normalize each measure obtained in step S2 to the range [0, 1], and fuse the four measures for each superpixel unit to obtain a saliency map at superpixel precision;
S4. Apply bilateral Gaussian filtering to the superpixel-precision saliency map obtained in step S3 to obtain an image saliency map at pixel precision;
S5. Apply adaptive threshold segmentation, using the maximum between-class variance method (Otsu's method), to the image saliency map obtained in step S4 to obtain a binary map in which the salient target portion stands out;
S6. Use the SIFT algorithm to extract SIFT feature points from images of different categories in the query image library, gather all feature point vectors together, merge similar SIFT feature points with the K-means clustering algorithm, and construct a dictionary containing a number of words;
S7. Extract the visual words of the salient region of the image, count the visual words in the image saliency map, construct visual phrases, and generate the image description of the image;
S8. Calculate the similarity between the query image and each image in the image library, sort all images in the library by similarity value, and return the relevant images as the query result as required.
2. The image retrieval method combining visual saliency and phrases according to claim 1, characterized in that the decision threshold ranges from 0.15 to 0.35.
CN201510202152.4A 2015-04-23 2015-04-23 Image retrieval method combining visual saliency and phrases Pending CN104794210A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510202152.4A CN104794210A (en) 2015-04-23 2015-04-23 Image retrieval method combining visual saliency and phrases

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510202152.4A CN104794210A (en) 2015-04-23 2015-04-23 Image retrieval method combining visual saliency and phrases

Publications (1)

Publication Number Publication Date
CN104794210A (en) 2015-07-22

Family

ID=53559002

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510202152.4A Pending CN104794210A (en) 2015-04-23 2015-04-23 Image retrieval method combining visual saliency and phrases

Country Status (1)

Country Link
CN (1) CN104794210A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105631037A (en) * 2015-12-31 2016-06-01 北京恒冠网络数据处理有限公司 Image retrieval method
CN105825238A (en) * 2016-03-30 2016-08-03 江苏大学 Visual saliency object detection method
CN108022263A (en) * 2017-12-05 2018-05-11 新疆工程学院 A kind of SIFT feature inspection optimization method based on the notable parameter index of regional area

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100074528A1 (en) * 2008-09-23 2010-03-25 Microsoft Corporation Coherent phrase model for efficient image near-duplicate retrieval
CN101894372A (en) * 2010-08-03 2010-11-24 新疆大学 New noise-containing remote sensing image segmentation method
CN103020992A (en) * 2012-11-12 2013-04-03 华中科技大学 Video image significance detection method based on dynamic color association
CN103747240A (en) * 2013-12-25 2014-04-23 浙江大学 Fusion color and motion information vision saliency filtering method
CN103838864A (en) * 2014-03-20 2014-06-04 北京工业大学 Visual saliency and visual phrase combined image retrieval method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
全红艳,曹桂涛: "《数字图像处理原理与实现方法》", 31 January 2014, 机械工业出版社 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105631037A (en) * 2015-12-31 2016-06-01 北京恒冠网络数据处理有限公司 Image retrieval method
CN105631037B (en) * 2015-12-31 2019-02-22 北京恒冠网络数据处理有限公司 A kind of image search method
CN105825238A (en) * 2016-03-30 2016-08-03 江苏大学 Visual saliency object detection method
CN105825238B (en) * 2016-03-30 2019-04-30 江苏大学 A kind of vision significance mesh object detection method
CN108022263A (en) * 2017-12-05 2018-05-11 新疆工程学院 A kind of SIFT feature inspection optimization method based on the notable parameter index of regional area

Similar Documents

Publication Publication Date Title
Wang et al. RGB-D salient object detection via minimum barrier distance transform and saliency fusion
WO2021238062A1 (en) Vehicle tracking method and apparatus, and electronic device
Liu et al. Robustly extracting captions in videos based on stroke-like edges and spatio-temporal analysis
JP2015506045A (en) Image indexing based on similarity of image features
WO2021082168A1 (en) Method for matching specific target object in scene image
Wang et al. Video object co-segmentation via subspace clustering and quadratic pseudo-boolean optimization in an mrf framework
Iwamura et al. Recognition of multiple characters in a scene image using arrangement of local features
Zuo et al. Multi-strategy tracking based text detection in scene videos
Hu et al. Markov random fields for sketch based video retrieval
CN105335469A (en) Method and device for image matching and retrieving
CN103955952A (en) Extraction and description method for garment image color features
Peng et al. Superpixel optimization using higher order energy
CN103942778A (en) Fast video key frame extraction method of principal component characteristic curve analysis
CN109697240B (en) Image retrieval method and device based on features
CN104794210A (en) Image retrieval method combining visual saliency and phrases
CN103761736A (en) Image segmentation method based on Bayes harmonious degree
Wang et al. Semantic annotation for complex video street views based on 2D–3D multi-feature fusion and aggregated boosting decision forests
CN113780276A (en) Text detection and identification method and system combined with text classification
Gupta et al. A learning-based approach for automatic image and video colorization
Yan et al. Inferring occluded features for fast object detection
CN104199950A (en) Method of searching for academic papers on basis of fast matching of image similarities
Alajel et al. Face detection based on skin color modeling and modified Hausdorff distance
Li et al. Saliency segmentation and foreground extraction of underwater image based on localization
Hou et al. A multiple features video copy detection algorithm based on a SURF descriptor
CN107678655B (en) Image element extraction method and image element extraction system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by SIPO to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150722
