CN104794210A - Image retrieval method combining visual saliency and phrases - Google Patents
Image retrieval method combining visual saliency and phrases Download PDFInfo
- Publication number
- CN104794210A CN104794210A CN201510202152.4A CN201510202152A CN104794210A CN 104794210 A CN104794210 A CN 104794210A CN 201510202152 A CN201510202152 A CN 201510202152A CN 104794210 A CN104794210 A CN 104794210A
- Authority
- CN
- China
- Prior art keywords
- image
- saliency
- super
- estimated
- conducted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses an image retrieval method combining visual saliency and phrases. The image retrieval method comprises the following steps that a search image is input and decoded into a YUV color space, and K-means clustering is conducted on pixels of the YUV color space, so that the image is divided into multiple superpixel units; likelihood computation is conducted on each superpixel unit of the divided image, and four obtained likelihoods are fused, and then the saliency map of the superpixel precision is obtained; bilateral Gaussian filtering is conducted, an image saliency map of the pixel precision is obtained; self-adaptive threshold segmentation is conducted on the image saliency map, and a binary image with a prominent target portion is obtained; a dictionary is established, vision words of an image salient region are extracted, and image description is generated; the image similarity between the search image and each image in a image base is calculated. By the adoption of the image retrieval method, the salient region in the image can be more accurately reflected, the visual saliency and the phases are well combined, and the retrieval effect is good.
Description
Technical field
Image processing field of the present invention, is specifically related to a kind of image search method in conjunction with vision significance and phrase.
Background technology
Developing rapidly and applying along with computing machine, network and multimedia technology, the quantity of digital picture increases just with surprising rapidity, and the image how to find people to need from mass digital image collection quickly and efficiently becomes a problem demanding prompt solution.For this reason, image retrieval technologies is arisen at the historic moment and is achieved very large development, from the earliest based on the retrieval that image manually marks, develops into the retrieval of present image content-based, precision and the efficiency of image retrieval are also all significantly increased, but still cannot meet the demand of people.The key of its problem is also do not have a kind of method can make the understanding image, semantic of computing machine completely as people at present.If the real meaning of image can be excavated further, and accurately expresses in a computer, the effect of image retrieval will certainly be promoted.
Summary of the invention
For solving the problem, the invention provides a kind of image search method in conjunction with vision significance and phrase.
For achieving the above object, the technical scheme that the present invention takes is:
In conjunction with an image search method for vision significance and phrase, comprise the steps:
Image decoding is YUV color space by S1, input inquiry image, and by carrying out K mean cluster to the pixel of YUV color space thus be some super-pixel unit by Iamge Segmentation;
S2, Likelihood Computation is carried out to each super-pixel unit of image after segmentation, obtain the measure value of this query image different parameters, Likelihood Computation comprise color independence is estimated, Color-spatial distribution is estimated, sports independence is estimated and the calculating of the space measure of spread;
S3, each for step S2 gained parameter estimated within linear normalization to [0,1] scope; Four kinds of fusions estimated are carried out to each super-pixel unit, obtains the Saliency maps of super-pixel precision;
S4, bilateral gaussian filtering is carried out to the Saliency maps of step S3 gained super-pixel precision, obtain the saliency figure of pixel precision;
S5, according to maximum variance between clusters, adaptive threshold fuzziness is carried out to the saliency figure of step S4 gained, obtain the binary map with outstanding significant target part;
S6, utilize SIFT algorithm to extract SIFT feature point in different classes of image from inquiry picture library, all unique point vector sets are incorporated into one piece, utilize K-Means clustering algorithm to merge similar SIFT feature point, construct the dictionary that comprises several vocabulary;
The visual word in S7, extraction saliency region, the number of visual word in statistical picture Saliency maps, structure visual phrase, the iamge description of synthetic image;
In S8, calculating query image and picture library, the image similarity of every width image, sorts to all images in picture library according to Similarity value, and returns associated picture on request as Query Result.
Wherein, the span of described judgment threshold is 0.15 ~ 0.35.
The present invention has following beneficial effect:
The salient region in query image can be reflected more exactly, vision significance is well combined with phrase, there is good retrieval effectiveness.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of a kind of image search method in conjunction with vision significance and phrase of the embodiment of the present invention.
Embodiment
In order to make objects and advantages of the present invention clearly understand, below in conjunction with embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.
As shown in Figure 1, embodiments provide a kind of image search method in conjunction with vision significance and phrase, comprise the steps:
Image decoding is YUV color space by S1, input inquiry image, and by carrying out K mean cluster to the pixel of YUV color space thus query image being divided into some super-pixel unit;
S2, Likelihood Computation is carried out to each super-pixel unit of image after segmentation, obtain the measure value of this query image different parameters, Likelihood Computation comprise color independence is estimated, Color-spatial distribution is estimated, sports independence is estimated and the calculating of the space measure of spread;
S3, each for step S2 gained parameter estimated within linear normalization to [0,1] scope; Four kinds of fusions estimated are carried out to each super-pixel unit, obtains the Saliency maps of super-pixel precision;
S4, bilateral gaussian filtering is carried out to the Saliency maps of step S3 gained super-pixel precision, obtain the saliency figure of pixel precision;
S5, according to maximum variance between clusters, adaptive threshold fuzziness is carried out to the saliency figure of step S4 gained, obtain the binary map with outstanding significant target part;
S6, utilize SIFT algorithm to extract SIFT feature point in different classes of image from inquiry picture library, all unique point vector sets are incorporated into one piece, utilize K-Means clustering algorithm to merge similar SIFT feature point, construct the dictionary that comprises several vocabulary;
The visual word in S7, extraction saliency region, the number of visual word in statistical picture Saliency maps, structure visual phrase, the iamge description of synthetic image;
In S8, calculating query image and picture library, the image similarity of every width image, sorts to all images in picture library according to Similarity value, and returns associated picture on request as Query Result.
The span of described judgment threshold is 0.15 ~ 0.35.
The above is only the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.
Claims (2)
1., in conjunction with an image search method for vision significance and phrase, it is characterized in that, comprise the steps:
Image decoding is YUV color space by S1, input inquiry image, and by carrying out K mean cluster to the pixel of YUV color space thus query image being divided into some super-pixel unit;
S2, Likelihood Computation is carried out to each super-pixel unit of image after segmentation, obtain the measure value of this query image different parameters, Likelihood Computation comprise color independence is estimated, Color-spatial distribution is estimated, sports independence is estimated and the calculating of the space measure of spread;
S3, each for step S2 gained parameter estimated within linear normalization to [0,1] scope; Four kinds of fusions estimated are carried out to each super-pixel unit, obtains the Saliency maps of super-pixel precision;
S4, bilateral gaussian filtering is carried out to the Saliency maps of step S3 gained super-pixel precision, obtain the saliency figure of pixel precision;
S5, according to maximum variance between clusters, adaptive threshold fuzziness is carried out to the saliency figure of step S4 gained, obtain the binary map with outstanding significant target part;
S6, utilize SIFT algorithm to extract SIFT feature point in different classes of image from inquiry picture library, all unique point vector sets are incorporated into one piece, utilize K-Means clustering algorithm to merge similar SIFT feature point, construct the dictionary that comprises several vocabulary;
The visual word in S7, extraction saliency region, the number of visual word in statistical picture Saliency maps, structure visual phrase, the iamge description of synthetic image;
In S8, calculating query image and picture library, the image similarity of every width image, sorts to all images in picture library according to Similarity value, and returns associated picture on request as Query Result.
2. a kind of image search method in conjunction with vision significance and phrase according to claim 1, it is characterized in that, the span of described judgment threshold is 0.15 ~ 0.35.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510202152.4A CN104794210A (en) | 2015-04-23 | 2015-04-23 | Image retrieval method combining visual saliency and phrases |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510202152.4A CN104794210A (en) | 2015-04-23 | 2015-04-23 | Image retrieval method combining visual saliency and phrases |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104794210A true CN104794210A (en) | 2015-07-22 |
Family
ID=53559002
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510202152.4A Pending CN104794210A (en) | 2015-04-23 | 2015-04-23 | Image retrieval method combining visual saliency and phrases |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104794210A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105631037A (en) * | 2015-12-31 | 2016-06-01 | 北京恒冠网络数据处理有限公司 | Image retrieval method |
CN105825238A (en) * | 2016-03-30 | 2016-08-03 | 江苏大学 | Visual saliency object detection method |
CN108022263A (en) * | 2017-12-05 | 2018-05-11 | 新疆工程学院 | A kind of SIFT feature inspection optimization method based on the notable parameter index of regional area |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100074528A1 (en) * | 2008-09-23 | 2010-03-25 | Microsoft Corporation | Coherent phrase model for efficient image near-duplicate retrieval |
CN101894372A (en) * | 2010-08-03 | 2010-11-24 | 新疆大学 | New noise-containing remote sensing image segmentation method |
CN103020992A (en) * | 2012-11-12 | 2013-04-03 | 华中科技大学 | Video image significance detection method based on dynamic color association |
CN103747240A (en) * | 2013-12-25 | 2014-04-23 | 浙江大学 | Fusion color and motion information vision saliency filtering method |
CN103838864A (en) * | 2014-03-20 | 2014-06-04 | 北京工业大学 | Visual saliency and visual phrase combined image retrieval method |
-
2015
- 2015-04-23 CN CN201510202152.4A patent/CN104794210A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100074528A1 (en) * | 2008-09-23 | 2010-03-25 | Microsoft Corporation | Coherent phrase model for efficient image near-duplicate retrieval |
CN101894372A (en) * | 2010-08-03 | 2010-11-24 | 新疆大学 | New noise-containing remote sensing image segmentation method |
CN103020992A (en) * | 2012-11-12 | 2013-04-03 | 华中科技大学 | Video image significance detection method based on dynamic color association |
CN103747240A (en) * | 2013-12-25 | 2014-04-23 | 浙江大学 | Fusion color and motion information vision saliency filtering method |
CN103838864A (en) * | 2014-03-20 | 2014-06-04 | 北京工业大学 | Visual saliency and visual phrase combined image retrieval method |
Non-Patent Citations (1)
Title |
---|
全红艳,曹桂涛: "《数字图像处理原理与实现方法》", 31 January 2014, 机械工业出版社 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105631037A (en) * | 2015-12-31 | 2016-06-01 | 北京恒冠网络数据处理有限公司 | Image retrieval method |
CN105631037B (en) * | 2015-12-31 | 2019-02-22 | 北京恒冠网络数据处理有限公司 | A kind of image search method |
CN105825238A (en) * | 2016-03-30 | 2016-08-03 | 江苏大学 | Visual saliency object detection method |
CN105825238B (en) * | 2016-03-30 | 2019-04-30 | 江苏大学 | A kind of vision significance mesh object detection method |
CN108022263A (en) * | 2017-12-05 | 2018-05-11 | 新疆工程学院 | A kind of SIFT feature inspection optimization method based on the notable parameter index of regional area |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | RGB-D salient object detection via minimum barrier distance transform and saliency fusion | |
WO2021238062A1 (en) | Vehicle tracking method and apparatus, and electronic device | |
Liu et al. | Robustly extracting captions in videos based on stroke-like edges and spatio-temporal analysis | |
JP2015506045A (en) | Image indexing based on similarity of image features | |
WO2021082168A1 (en) | Method for matching specific target object in scene image | |
Wang et al. | Video object co-segmentation via subspace clustering and quadratic pseudo-boolean optimization in an mrf framework | |
Iwamura et al. | Recognition of multiple characters in a scene image using arrangement of local features | |
Zuo et al. | Multi-strategy tracking based text detection in scene videos | |
Hu et al. | Markov random fields for sketch based video retrieval | |
CN105335469A (en) | Method and device for image matching and retrieving | |
CN103955952A (en) | Extraction and description method for garment image color features | |
Peng et al. | Superpixel optimization using higher order energy | |
CN103942778A (en) | Fast video key frame extraction method of principal component characteristic curve analysis | |
CN109697240B (en) | Image retrieval method and device based on features | |
CN104794210A (en) | Image retrieval method combining visual saliency and phrases | |
CN103761736A (en) | Image segmentation method based on Bayes harmonious degree | |
Wang et al. | Semantic annotation for complex video street views based on 2D–3D multi-feature fusion and aggregated boosting decision forests | |
CN113780276A (en) | Text detection and identification method and system combined with text classification | |
Gupta et al. | A learning-based approach for automatic image and video colorization | |
Yan et al. | Inferring occluded features for fast object detection | |
CN104199950A (en) | Method of searching for academic papers on basis of fast matching of image similarities | |
Alajel et al. | Face detection based on skin color modeling and modified Hausdorff distance | |
Li et al. | Saliency segmentation and foreground extraction of underwater image based on localization | |
Hou et al. | A multiple features video copy detection algorithm based on a SURF descriptor | |
CN107678655B (en) | Image element extraction method and image element extraction system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150722 |
|
RJ01 | Rejection of invention patent application after publication |