CN105512155A

CN105512155A - Device and method for multi-layer semantic image retrieval

Info

Publication number: CN105512155A
Application number: CN201410558869.8A
Authority: CN
Inventors: 张文萍
Original assignee: Tianjin Linglong Technology Co Ltd
Current assignee: Tianjin Linglong Technology Co Ltd
Priority date: 2014-10-18
Filing date: 2014-10-18
Publication date: 2016-04-20

Abstract

The invention provides a device and method for multi-layer semantic image retrieval. The device comprises an information input module, an image retrieval module, a result output module and a display device, wherein the information input module, the image retrieval module and the result output module are connected in succession; the image retrieval module comprises a network communication module, a low-layer characteristic extraction module, a target semantic extraction module and a scene semantic extraction module, and the low-layer characteristic extraction module is connected to the network communication module; the target semantic extraction module is connected to the rear end of the low-layer characteristic extraction module; and the scene semantic extraction module is connected to the rear end of the target semantic extraction module, and retrieval results are displayed by the display device. The device and method provided by the invention have the beneficial effect that users' satisfaction with the image retrieval can be enhanced greatly.

Description

A kind of Multi-level semantic image indexing unit and method

Technical field

The invention belongs to searching system field, especially relate to a kind of Multi-level semantic image indexing unit and method.

Background technology

Along with computing machine, the developing rapidly of multimedia and internet, internet there is the Digital image information resource of magnanimity.Picture search be in recent years growth rate the fastest classified search application, the picture search number of times of global a few large search engine is all doubled and redoubled.Therefore image retrieval technologies has become the focus of research both at home and abroad, and how efficiently tissue, the large-scale image data base of management and retrieval, also become the gordian technique in the major projects such as Future Information highway, digital library.Image retrieval is also a chief component of multimedia information retrieval technology, is one of theoretical foundation of Video Information Retrieval Techniques:, occupies very important status in information retrieval field.

Earlier picture retrieval, mainly through carrying out artificial textual annotation to image, utilizes text retrieval to realize searching characteristics of image.The mode that this employing manually marks keyword descriptor wastes time and energy, and subjectivity inevitably with a guy and uncertainty.The search engines such as current Baidu, Google automatically gather based on info web and label technology carries out text marking and retrieval to image.The image identification that this automatic marking gathers is very coarse, and accuracy is not high, sometimes or even inaccurate.Early 1990s, researcher proposes CBIR thought, achieves the leap of the retrievals of image vision content characteristic such as using color, texture, shape and region and the search modes of " to scheme to look for figure ".But people judge that the similarity of image is not only based upon on the similarity basis of these low-level features.CBIR only relate to the surface characteristics of image, the semantic meaning of the image that is beyond expression.

Enter 21 century, image retrieval launches around this focus of image, semantic, and its target is the understanding level making the ability of computer search image reach people.Ideally, user mainly carries out image retrieval according to the implication of image, instead of retrieves according to the low-level feature of image.

Image, semantic is broadly divided into Feature Semantics, Object Semanteme, Scene Semantics, behavior semanteme and emotional semantic etc., in order to be described the picture material of different levels.There is research that picture material is divided into three levels further: ground floor is primitive character layer, comprise the visual signature of Description Image, as color, texture, shape etc., reflection be the content that some of image have objective statistical characteristic, corresponding to the Feature Semantics of image; The second layer, for deriving attribute layer, relates to and being derived by Low Level Vision feature and the attribute that obtains, in order to the object (as " sun ", " basketball " etc.) described in recognition image, corresponding to the Object Semanteme of image; Third layer is abstract attribute layer, comprise the abstract attribute (as " sunrise ", " Basketball Match " etc.) that object and scene are carried out more high-rise reasoning and obtained, semantic and the emotional semantic of the Scene Semantics of correspondence image, behavior etc. generally the gap between ground floor and the second layer is called " semantic gap ", whether image retrieval really employs semanteme and is mainly reflected in the picture material whether obtaining the second layer.

Research in recent years based on the Images Classification of image, semantic, mark and retrieval mostly needs to utilize the strategy of machine learning to obtain semanteme, but does not occur a kind of gratifying system general efficiently yet.

Summary of the invention

The object of this invention is to provide a kind of Multi-level semantic image indexing unit and method, greatly can improve the satisfaction that user carries out image retrieval.

For achieving the above object, the technical solution used in the present invention is: a kind of Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, it is characterized in that: described MIM message input module, described image retrieval module, described result output module connects successively, described image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target, and Scene Semantics extraction module, wherein, described low-level feature abstract module is connected with network communication module, and the low-level feature information of network image is extracted via network communication module, the semantic extraction module of described target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information, described Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, described display device display result for retrieval.

Preferably, described image retrieval module, based on the distance function of light stream histogram matrix, matches the video lens being similar to given video lens, retrieves matching image from video database.

Preferably, described display device is LED display.

The advantage that the present invention has and good effect are: by extracting target semantic information and the Scene Semantics information of network image, thus achieving the hommization image retrieval being different from prior art, substantially increasing the satisfaction of image retrieval.

Accompanying drawing explanation

Fig. 1 is structural representation of the present invention.

Embodiment

Below in conjunction with drawings and Examples, the present invention will be further described.

As shown in Figure 1, the invention provides a kind of Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, MIM message input module, image retrieval module, result output module connect successively, image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target and Scene Semantics extraction module, wherein, low-level feature abstract module is connected with network communication module, and extracts the low-level feature information of network image via network communication module; The semantic extraction module of target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information; Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, display device display result for retrieval.

Preferably, image retrieval module, based on the distance function of light stream histogram matrix, matches the video lens being similar to given video lens, retrieves matching image from video database.

Preferably, described display device is LED display.

In the present invention, multiple keyword can be adopted to carry out image retrieval.Such as, for the network image preserving each regional aim semantic vector and integral image target semantic vector, when adopting scene keyword to retrieve, can directly according to the characters matching result in Scene Semantics layer, return image containing same scene semanteme in network image as result for retrieval.Or, when adopting target keyword to retrieve, according to the overall goals of network image semanteme return to should the image of target keyword as result for retrieval.Or, when the illustration adopting user manually to select is retrieved, select to there is the image of the target semantic vector matched as result for retrieval in network image according to the semantic proper vector of the target of this illustration.

Above embodiments of the invention have been described in detail, but described content being only preferred embodiment of the present invention, can not being considered to for limiting practical range of the present invention.All equalizations done according to the present patent application scope change and improve, and all should still belong within patent covering scope of the present invention.

Claims

1. a Multi-level semantic image indexing unit and method, comprise MIM message input module, image retrieval module, result output module, display device, it is characterized in that: described MIM message input module, described image retrieval module, described result output module connect successively, described image retrieval module comprises network communication module, low-level feature abstract module, the semantic extraction module of target and Scene Semantics extraction module, wherein, described low-level feature abstract module is connected with network communication module, and extracts the low-level feature information of network image via network communication module; The semantic extraction module of described target is connected to the rear end of low-level feature abstract module, and in low-level feature information, extracts target semantic information according to predetermined probabilistic information; Described Scene Semantics extraction module is connected to the rear end of the semantic extraction module of target, and in target semantic information, extract Scene Semantics information according to predetermined Scene Semantics template, again image corresponding for this Scene Semantics information is sent to the result output module that this Scene Semantics extraction module rear end connects, described display device display result for retrieval.

2. a kind of Multi-level semantic image indexing unit according to claim 1 and method, it is characterized in that: described image retrieval module is based on the distance function of light stream histogram matrix, from video database, match the video lens being similar to given video lens, retrieve matching image.

3. a kind of Multi-level semantic image indexing unit according to claim 1 and method, is characterized in that: described display device is LED display.