CN109857884B

CN109857884B - Automatic image semantic description method

Info

Publication number: CN109857884B
Application number: CN201811564965.8A
Authority: CN
Inventors: 李祖贺; 张涛; 钱晓亮; 曾黎; 金保华; 于泽琦; 田二林; 于源
Original assignee: Zhengzhou University of Light Industry
Current assignee: Zhengzhou University of Light Industry
Priority date: 2018-12-20
Filing date: 2018-12-20
Publication date: 2023-02-07
Anticipated expiration: 2038-12-20
Also published as: CN109857884A

Abstract

The invention discloses an automatic image semantic description method, which comprises the following steps of; clustering the image set according to the visual characteristics by using a clustering algorithm; dividing the clustered image set into a plurality of categories; using CNN to perform image pre-description processing; labeling the image with a plurality of categories, and calling the pre-description of the first layer as the category labeling of the image; constructing a classifier for the images of each category by using the SVM; determining whether to add such a description to the image using a classifier; an MBRM model marking algorithm is utilized; and obtaining the image semantics through the combination of the image regions obtained by the related training set. The invention provides an automatic image semantic description method, which can effectively fuse the bottom-layer characteristics of an image and image semantic description high-level semantic information, has the characteristics of high precision, high accuracy, definition, formalization, shareability, conceptualization and the like, can be widely applied to a plurality of fields including information retrieval, information extraction, semantic network and knowledge management, and has strong applicability.

Description

Automatic image semantic description method

Technical Field

The invention relates to the technical field of image semantic description, in particular to an automatic image semantic description method.

Background

Image content automatic description (image capturing), namely, the content of an image is automatically described by natural language, because the image content automatic description has wide application prospects, such as a man-machine interaction and blind guiding system, the image content automatic description is recently a new focus in the fields of computer vision and artificial intelligence, and is different from image classification or object detection, the image automatic description takes the comprehensive description of objects, scenes and relations thereof as a target, and relates to visual scene analysis, content semantic understanding and natural language processing, and is the integrated design of a tip technology in a mixed task;

in the prior art, image features extracted by a CNN are used as input of a Recurrent Neural Network (RNN), image semantic description information is used as output of the RNN, and an image semantic description problem is regarded as a translation process from an image to a semantic description, so that an automatic image semantic description model based on the CNN and the RNN is constructed.

Disclosure of Invention

The invention aims to provide an automatic image semantic description method to solve the problems in the background technology.

In order to achieve the purpose, the invention provides the following technical scheme: an automatic image semantic description method comprises the following steps;

step 1, clustering an image set according to visual characteristics by using a clustering algorithm;

step 2, dividing the clustered image set into a plurality of categories, wherein each category is divided into a plurality of images;

step 3, using CNN to perform image pre-description processing, and marking the pre-description purpose;

step 4, marking a plurality of categories on the image, and calling the pre-description of the first layer as the category marking of the image;

step 5, constructing a classifier for the image of each category by using an SVM;

step 6, judging whether to add description of the type to the image by using a classifier;

step 7, carrying out detailed annotation on semantic keywords of the test image annotation by using an MBRM model annotation algorithm;

8, labeling the label of the second layer of the image according to the type of the test image to obtain a corresponding image;

and 9, taking the images as training sets of the detailed labeling stage together, and obtaining image semantics through combination of image areas obtained by the related training sets.

Preferably, in the step 1, the algorithm of the clustering algorithm is a K-means image clustering algorithm.

Preferably, in step 1, the algorithm of the clustering algorithm is an image clustering algorithm of isodata.

Preferably, in step 3, the image pre-description processing process includes: vectorization, attribute establishment, projection transformation and data formatting conversion.

Preferably, in step 6, the classifier is used to guide the generation of the visual analysis and prediction stage category, so as to realize the parameter optimization of the semantic classifier.

Preferably, in step 4, the class labeling of the image is optimized by inverse propagation of the loss by mapping to a class space based on the features of each node and calculating a classification loss.

Compared with the prior art, the invention has the beneficial effects that: the invention provides an automatic image semantic description method, which comprises the following steps that 1, the bottom-layer characteristics of an image and image semantic description high-level semantic information can be effectively fused, the precision and the accuracy are high, the high semantic description precision can be achieved by using fewer parameters, and the requirements of practical application can be well met;

2. the method describes semantics through the relation between concepts, has the characteristics of definition, formalization, sharing, conceptualization and the like, can be widely applied to a plurality of fields including information retrieval, information extraction, semantic network and knowledge management, and has strong applicability.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Example 1:

the invention provides a technical scheme that: an automatic image semantic description method comprises the following steps;

step 1, clustering an image set according to visual characteristics by using a clustering algorithm, wherein the algorithm of the clustering algorithm is a K-means image clustering algorithm, so that the clustering accuracy can be improved;

step 3, using the CNN to perform image pre-description processing, marking the purpose of pre-description, wherein the image pre-description processing process comprises the following steps: vectorization, attribute establishment, projection transformation and data formatting conversion;

step 4, labeling the image with a plurality of categories, wherein the pre-description of the first layer is called the category labeling of the image, the category labeling of the image is mapped to a category space and calculates the classification loss based on the characteristics of each node, and the optimization is realized through loss reverse transmission;

step 6, judging whether to add description of the type to the image by using a classifier, and guiding visual analysis and generation of prediction stage categories by using the classifier so as to realize parameter optimization of a semantic classifier;

and 9, taking the images as training sets of the detailed labeling stage together, and obtaining image semantics through the combination of image areas obtained by the related training sets.

Example 2:

step 1, clustering an image set according to visual characteristics by using a clustering algorithm, wherein the algorithm of the clustering algorithm is an image clustering algorithm of isodata, the number of categories can be automatically increased or decreased in the clustering process, and the efficiency is accelerated;

step 5, constructing a classifier for the images of each category by using an SVM;

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. An automatic image semantic description method is characterized by comprising the following steps;

step 4, labeling the image into a plurality of categories, and calling the pre-description of the first layer as the category labeling of the image;

step 6, judging whether to add the description to the image or not by using a classifier;

step 7, marking the semantic keywords marked by the test image in detail by using an MBRM model marking algorithm;

2. The automatic image semantic description method according to claim 1, characterized in that: in the step 1, the algorithm of the clustering algorithm is a K-means image clustering algorithm.

3. The automatic image semantic description method according to claim 1, characterized in that: in step 1, the algorithm of the clustering algorithm is an image clustering algorithm of isodata.

4. The automatic image semantic description method according to claim 1, characterized in that: in step 3, the image pre-description processing process comprises: vectorization, attribute establishment, projection transformation and data formatting conversion.

5. The automatic image semantic description method according to claim 1, characterized in that: and step 6, guiding visual analysis and generation of prediction stage categories through a classifier, and further realizing parameter optimization of the semantic classifier.

6. The automatic image semantic description method according to claim 1, characterized in that: in step 4, the class labeling of the image is optimized by mapping to a class space based on the features of each node and calculating the classification loss through loss reverse transfer.