CN105512155A - Device and method for multi-layer semantic image retrieval - Google Patents

Device and method for multi-layer semantic image retrieval Download PDF

Info

Publication number
CN105512155A
CN105512155A CN201410558869.8A CN201410558869A CN105512155A CN 105512155 A CN105512155 A CN 105512155A CN 201410558869 A CN201410558869 A CN 201410558869A CN 105512155 A CN105512155 A CN 105512155A
Authority
CN
China
Prior art keywords
module
image
semantic
target
low
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410558869.8A
Other languages
Chinese (zh)
Inventor
张文萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Linglong Technology Co Ltd
Original Assignee
Tianjin Linglong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Linglong Technology Co Ltd filed Critical Tianjin Linglong Technology Co Ltd
Priority to CN201410558869.8A priority Critical patent/CN105512155A/en
Publication of CN105512155A publication Critical patent/CN105512155A/en
Pending legal-status Critical Current

Links

Abstract

The invention provides a device and method for multi-layer semantic image retrieval. The device comprises an information input module, an image retrieval module, a result output module and a display device, wherein the information input module, the image retrieval module and the result output module are connected in succession; the image retrieval module comprises a network communication module, a low-layer characteristic extraction module, a target semantic extraction module and a scene semantic extraction module, and the low-layer characteristic extraction module is connected to the network communication module; the target semantic extraction module is connected to the rear end of the low-layer characteristic extraction module; and the scene semantic extraction module is connected to the rear end of the target semantic extraction module, and retrieval results are displayed by the display device. The device and method provided by the invention have the beneficial effect that users' satisfaction with the image retrieval can be enhanced greatly.

Description

A kind of Multi-level semantic image indexing unit and method
Technical field
The invention belongs to searching system field, especially relate to a kind of Multi-level semantic image indexing unit and method.
Background technology
Along with computing machine, the developing rapidly of multimedia and internet, internet there is the Digital image information resource of magnanimity.Picture search be in recent years growth rate the fastest classified search application, the picture search number of times of global a few large search engine is all doubled and redoubled.Therefore image retrieval technologies has become the focus of research both at home and abroad, and how efficiently tissue, the large-scale image data base of management and retrieval, also become the gordian technique in the major projects such as Future Information highway, digital library.Image retrieval is also a chief component of multimedia information retrieval technology, is one of theoretical foundation of Video Information Retrieval Techniques:, occupies very important status in information retrieval field.
Earlier picture retrieval, mainly through carrying out artificial textual annotation to image, utilizes text retrieval to realize searching characteristics of image.The mode that this employing manually marks keyword descriptor wastes time and energy, and subjectivity inevitably with a guy and uncertainty.The search engines such as current Baidu, Google automatically gather based on info web and label technology carries out text marking and retrieval to image.The image identification that this automatic marking gathers is very coarse, and accuracy is not high, sometimes or even inaccurate.Early 1990s, researcher proposes CBIR thought, achieves the leap of the retrievals of image vision content characteristic such as using color, texture, shape and region and the search modes of " to scheme to look for figure ".But people judge that the similarity of image is not only based upon on the similarity basis of these low-level features.CBIR only relate to the surface characteristics of image, the semantic meaning of the image that is beyond expression.
Enter 21 century, image retrieval launches around this focus of image, semantic, and its target is the understanding level making the ability of computer search image reach people.Ideally, user mainly carries out image retrieval according to the implication of image, instead of retrieves according to the low-level feature of image.
Image, semantic is broadly divided into Feature Semantics, Object Semanteme, Scene Semantics, behavior semanteme and emotional semantic etc., in order to be described the picture material of different levels.There is research that picture material is divided into three levels further: ground floor is primitive character layer, comprise the visual signature of Description Image, as color, texture, shape etc., reflection be the content that some of image have objective statistical characteristic, corresponding to the Feature Semantics of image; The second layer, for deriving attribute layer, relates to and being derived by Low Level Vision feature and the attribute that obtains, in order to the object (as " sun ", " basketball " etc.) described in recognition image, corresponding to the Object Semanteme of image; Third layer is abstract attribute layer, comprise the abstract attribute (as " sunrise ", " Basketball Match " etc.) that object and scene are carried out more high-rise reasoning and obtained, semantic and the emotional semantic of the Scene Semantics of correspondence image, behavior etc. generally the gap between ground floor and the second layer is called " semantic gap ", whether image retrieval really employs semanteme and is mainly reflected in the picture material whether obtaining the second layer.
Research in recent years based on the Images Classification of image, semantic, mark and retrieval mostly needs to utilize the strategy of machine learning to obtain semanteme, but does not occur a kind of gratifying system general efficiently yet.
Summary of the invention
The object of this invention is to provide a kind of Multi-level semantic image indexing unit and method, greatly can improve the satisfaction that user carries out image retrieval.
For achieving the above object, the technical solution used in the present invention is: a kind of Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, it is characterized in that: described MIM message input module, described image retrieval module, described result output module connects successively, described image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target, and Scene Semantics extraction module, wherein, described low-level feature abstract module is connected with network communication module, and the low-level feature information of network image is extracted via network communication module, the semantic extraction module of described target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information, described Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, described display device display result for retrieval.
Preferably, described image retrieval module, based on the distance function of light stream histogram matrix, matches the video lens being similar to given video lens, retrieves matching image from video database.
Preferably, described display device is LED display.
The advantage that the present invention has and good effect are: by extracting target semantic information and the Scene Semantics information of network image, thus achieving the hommization image retrieval being different from prior art, substantially increasing the satisfaction of image retrieval.
Accompanying drawing explanation
Fig. 1 is structural representation of the present invention.
Embodiment
Below in conjunction with drawings and Examples, the present invention will be further described.
As shown in Figure 1, the invention provides a kind of Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, MIM message input module, image retrieval module, result output module connect successively, image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target and Scene Semantics extraction module, wherein, low-level feature abstract module is connected with network communication module, and extracts the low-level feature information of network image via network communication module; The semantic extraction module of target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information; Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, display device display result for retrieval.
Preferably, image retrieval module, based on the distance function of light stream histogram matrix, matches the video lens being similar to given video lens, retrieves matching image from video database.
Preferably, described display device is LED display.
In the present invention, multiple keyword can be adopted to carry out image retrieval.Such as, for the network image preserving each regional aim semantic vector and integral image target semantic vector, when adopting scene keyword to retrieve, can directly according to the characters matching result in Scene Semantics layer, return image containing same scene semanteme in network image as result for retrieval.Or, when adopting target keyword to retrieve, according to the overall goals of network image semanteme return to should the image of target keyword as result for retrieval.Or, when the illustration adopting user manually to select is retrieved, select to there is the image of the target semantic vector matched as result for retrieval in network image according to the semantic proper vector of the target of this illustration.
The advantage that the present invention has and good effect are: by extracting target semantic information and the Scene Semantics information of network image, thus achieving the hommization image retrieval being different from prior art, substantially increasing the satisfaction of image retrieval.
Above embodiments of the invention have been described in detail, but described content being only preferred embodiment of the present invention, can not being considered to for limiting practical range of the present invention.All equalizations done according to the present patent application scope change and improve, and all should still belong within patent covering scope of the present invention.

Claims (3)

1. a Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, it is characterized in that: described MIM message input module, described image retrieval module, described result output module connect successively, described image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target and Scene Semantics extraction module, wherein, described low-level feature abstract module is connected with network communication module, and extracts the low-level feature information of network image via network communication module; The semantic extraction module of described target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information; Described Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, described display device display result for retrieval.
2. a kind of Multi-level semantic image indexing unit according to claim 1 and method, it is characterized in that: described image retrieval module is based on the distance function of light stream histogram matrix, from video database, match the video lens being similar to given video lens, retrieve matching image.
3. a kind of Multi-level semantic image indexing unit according to claim 1 and method, is characterized in that: described display device is LED display.
CN201410558869.8A 2014-10-18 2014-10-18 Device and method for multi-layer semantic image retrieval Pending CN105512155A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410558869.8A CN105512155A (en) 2014-10-18 2014-10-18 Device and method for multi-layer semantic image retrieval

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410558869.8A CN105512155A (en) 2014-10-18 2014-10-18 Device and method for multi-layer semantic image retrieval

Publications (1)

Publication Number Publication Date
CN105512155A true CN105512155A (en) 2016-04-20

Family

ID=55720139

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410558869.8A Pending CN105512155A (en) 2014-10-18 2014-10-18 Device and method for multi-layer semantic image retrieval

Country Status (1)

Country Link
CN (1) CN105512155A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107491514A (en) * 2017-08-07 2017-12-19 王庆军 Image search method based on information security
CN109783749A (en) * 2018-12-10 2019-05-21 深圳变设龙信息科技有限公司 A kind of Material for design intelligent recommendation method, apparatus and terminal device
CN112651413A (en) * 2019-10-10 2021-04-13 百度在线网络技术(北京)有限公司 Integrated learning classification method, device, equipment and storage medium for vulgar graphs

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107491514A (en) * 2017-08-07 2017-12-19 王庆军 Image search method based on information security
CN109783749A (en) * 2018-12-10 2019-05-21 深圳变设龙信息科技有限公司 A kind of Material for design intelligent recommendation method, apparatus and terminal device
CN112651413A (en) * 2019-10-10 2021-04-13 百度在线网络技术(北京)有限公司 Integrated learning classification method, device, equipment and storage medium for vulgar graphs
CN112651413B (en) * 2019-10-10 2023-10-17 百度在线网络技术(北京)有限公司 Integrated learning classification method, device, equipment and storage medium for hypo-custom graph

Similar Documents

Publication Publication Date Title
Snoek et al. The challenge problem for automated detection of 101 semantic concepts in multimedia
CN106560810B (en) Searching using specific attributes found in images
EP2402867B1 (en) A computer-implemented method, a computer program product and a computer system for image processing
US20100325138A1 (en) System and method for performing video search on web
CN101739428B (en) Method for establishing index for multimedia
CN103793697A (en) Identity labeling method of face images and face identity recognition method of face images
CN103384883B (en) Top-K process is utilized to make semantic abundant
CN102262670A (en) Cross-media information retrieval system and method based on mobile visual equipment
Dang-Nguyen et al. Multimodal retrieval with diversification and relevance feedback for tourist attraction images
CN102236714A (en) Extensible markup language (XML)-based interactive application multimedia information retrieval method
CN109446399A (en) A kind of video display entity search method
Kaneko et al. Visual event mining from geo-tweet photos
Fiallos et al. Detecting topics and locations on Instagram photos
CN113596601A (en) Video picture positioning method, related device, equipment and storage medium
Tsikrika et al. Image annotation using clickthrough data
CN105512155A (en) Device and method for multi-layer semantic image retrieval
CN103942274A (en) Labeling system and method for biological medical treatment image on basis of LDA
Min et al. Multimodal spatio-temporal theme modeling for landmark analysis
CN109241327A (en) Image search method and device
Tsai et al. Qualitative evaluation of automatic assignment of keywords to images
Boteanu et al. Hierarchical clustering pseudo-relevance feedback for social image search result diversification
Mylonas et al. Context modelling for multimedia analysis
Brenner et al. Multimodal detection, retrieval and classification of social events in web photo collections
CN201654783U (en) Multi-level semantic image retrieval system
Jones et al. Automated annotation of landmark images using community contributed datasets and web resources

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160420

WD01 Invention patent application deemed withdrawn after publication