CN109657684A

CN109657684A - An image semantic parsing method based on weakly supervised learning

Info

Publication number: CN109657684A
Application number: CN201811577772.6A
Authority: CN
Inventors: 王凤琴; 李祖贺; 陈燕; 陈启强; 金保华; 江楠; 于源
Original assignee: Zhengzhou University of Light Industry
Current assignee: Zhengzhou University of Light Industry
Priority date: 2018-12-20
Filing date: 2018-12-20
Publication date: 2019-04-19

Abstract

The invention discloses an image semantic parsing method based on weakly supervised learning, comprising the following steps. Step 1: divide the image into 45-55 moderately sized blocks and extract discriminative region features. Step 2: use a linear classifier as the discriminative clustering model. Step 3: cluster the region features of the segmented images, dividing the clustered image set into sub-regions of several categories. Step 4: gather sub-regions of the same category into one class to obtain a result. Step 5: output: merge semantically consistent image regions to obtain the final parsing result of the image. In this method, the image sub-regions obtained by segmentation are clustered, and a weakly supervised learning model that minimizes error is built from the relationship between image-level annotations and image sub-region labels, so that a semantic label is assigned to each image sub-region with high precision and accuracy.

Description

An image semantic parsing method based on weakly supervised learning

Technical field

The present invention relates to the technical field of automatic analysis and understanding of multimedia content, and specifically to an image semantic parsing method based on weakly supervised learning.

Background art

Image semantic parsing is a task that combines image segmentation with region annotation. It is a higher-level image understanding technique: it not only adds semantic labels to an image but also attaches those labels to the corresponding regions in the image, achieving finer-grained semantic understanding of the image.

As the Internet entered the Web 2.0 era, more and more users annotate web images with semantic tags and share them on photo-sharing websites such as Flickr and Picasa. This image data is growing explosively, which poses a huge challenge to image indexing and retrieval. Fast and effective automatic image annotation has therefore become a hot research topic. Image segmentation and region annotation are inseparable and mutually reinforcing: accurate image segmentation provides accurate visual feature representations for region annotation, and conversely, good region annotation results can improve image segmentation, because pixels with the same semantic label belong to the same object.

Image semantic parsing is a fine-scale image annotation technique. It must indicate not only "what" is in the image but also "where", that is, map semantic labels onto the corresponding image regions, so as to achieve a more detailed and accurate annotation. Most existing image semantic parsing methods rely on precisely annotated training data, namely training images manually labeled at the pixel level. However, web image content in the big-data era is highly varied and its semantics are diverse, so labor-intensive manual annotation increasingly cannot meet demand.

Summary of the invention

The purpose of the present invention is to provide an image semantic parsing method based on weakly supervised learning, so as to solve the problems raised in the background art above.

To achieve the above purpose, the present invention provides the following technical solution: an image semantic parsing method based on weakly supervised learning, comprising the following steps:

Step 1: divide the image into 45-55 moderately sized blocks and extract discriminative region features;

Step 2: use a linear classifier as the discriminative clustering model, and constrain the classifier with a norm;

Step 3: cluster the region features of the segmented images, dividing the clustered image set into sub-regions of several categories;

Step 4: gather sub-regions of the same category into one class to obtain a result;

S1: establish the objective model from the constraint relationship between image-level labels and image sub-region labels;

S2: assign semantic labels to the cluster sets of image sub-regions;

S3: input images with semantic labels, and cluster sub-regions of the same category using the SHDC method;

Step 5: output: merge semantically consistent image regions to obtain the final parsing result of the image.
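Sub-steps S1 and S2 tie image-level labels to clusters of sub-regions. As a rough illustration only, the sketch below replaces the error-minimizing objective model, which the text does not spell out, with a simple majority vote: each cluster takes the image-level label that occurs most often among the images contributing regions to it. The function name `assign_cluster_labels` and the input layout are hypothetical and are not taken from the patent.

```python
from collections import Counter, defaultdict


def assign_cluster_labels(region_clusters, image_tags):
    """Assign one semantic label per cluster by majority vote (assumed heuristic).

    region_clusters: {image_id: [cluster_id of each sub-region in that image]}
    image_tags:      {image_id: set of image-level labels}
    Returns {cluster_id: label}.
    """
    votes = defaultdict(Counter)
    for image_id, clusters in region_clusters.items():
        # Each image votes once per cluster it contains, with all of its tags.
        for cluster_id in set(clusters):
            # Sorting makes tie-breaking deterministic (alphabetical order).
            votes[cluster_id].update(sorted(image_tags[image_id]))
    return {cid: counter.most_common(1)[0][0] for cid, counter in votes.items()}


if __name__ == "__main__":
    clusters = {"img1": [0, 0, 1], "img2": [0, 2], "img3": [1, 3]}
    tags = {
        "img1": {"sky", "grass"},
        "img2": {"sky", "sea"},
        "img3": {"grass", "cow"},
    }
    print(assign_cluster_labels(clusters, tags))
```

A vote of this kind only approximates the constraint between image-level and sub-region labels; clusters seen in a single image remain ambiguous and are resolved here by an arbitrary alphabetical tie-break.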

Preferably, in Step 1, region features may be extracted from color, texture and shape.

Preferably, in Step 1, region features may be extracted from salient points in the image.

Preferably, in Step 1, the visual similarity between sub-regions is computed by norm reconstruction, and the norm term is optimized using the CCCP optimization method together with the iterative update procedure of the non-negative multiplier method.

Preferably, in Step 1, each image is segmented using the SLIC segmentation method.

Preferably, when image regions are merged, whether to merge image region A and image region B is decided according to the color histogram distance between them.
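The merge test based on color histogram distance can be sketched as follows. The text does not specify the histogram layout, the distance measure or the threshold, so the joint RGB histogram with 4 bins per channel, the halved L1 distance and the threshold of 0.3 are all assumptions chosen for illustration, as are the function names.

```python
def color_histogram(pixels, bins=4):
    """Normalized joint RGB histogram of a region.

    pixels: iterable of (r, g, b) tuples with channel values in 0..255.
    """
    hist = [0.0] * (bins ** 3)
    count = 0
    for r, g, b in pixels:
        # Map each channel value to a bin index in 0..bins-1.
        idx = (r * bins // 256) * bins * bins + (g * bins // 256) * bins + (b * bins // 256)
        hist[idx] += 1.0
        count += 1
    if count == 0:
        raise ValueError("region has no pixels")
    return [h / count for h in hist]


def histogram_distance(h1, h2):
    """Halved L1 distance between two normalized histograms, in [0, 1]."""
    return 0.5 * sum(abs(a - b) for a, b in zip(h1, h2))


def should_merge(pixels_a, pixels_b, threshold=0.3, bins=4):
    """Merge region A and region B when their histograms are close enough."""
    dist = histogram_distance(color_histogram(pixels_a, bins),
                              color_histogram(pixels_b, bins))
    return dist <= threshold
```

Two mostly red regions fall into the same bins and are merged, whereas a red region and a blue region have disjoint histograms, giving the maximum distance of 1 and no merge.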

Compared with the prior art, the beneficial effects of the present invention are as follows. The present invention provides an image semantic parsing method based on weakly supervised learning:

1. The image sub-regions obtained by the segmentation method are clustered, and a weakly supervised learning model that minimizes error is built from the relationship between image-level annotations and image sub-region labels. A semantic label is assigned to each image sub-region with high precision and accuracy, and a high semantic description precision is reached using fewer parameters;

2. The contextual relationships between regions are fully exploited, reducing information loss. Sub-regions of the same category are clustered using the SHDC method, which gives a better clustering result. Object-class samples and background-class samples can be measured separately, which strengthens the discrimination between object classes and background classes. The method meets the needs of practical applications well and is widely applicable.

Specific embodiment

The technical solutions in the embodiments of the present invention are described clearly and completely below. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of the embodiments of the present invention, without creative effort, fall within the protection scope of the present invention.

Embodiment 1:

The present invention provides a technical solution: an image semantic parsing method based on weakly supervised learning, comprising the following steps:

Step 1: divide the image into 48 moderately sized blocks, segmenting each image with the SLIC segmentation method, and extract discriminative region features. The visual similarity between sub-regions is computed by norm reconstruction, which is less sensitive to noise, implicitly reflects spatial information, and is better suited to classification tasks. The norm term is optimized using the CCCP optimization method together with the iterative update procedure of the non-negative multiplier method;
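To give a feel for what "48 moderately sized blocks" means in SLIC, the sketch below computes only the initial seed grid: SLIC starts from cluster centers spaced roughly sqrt(N / K) pixels apart for N pixels and K requested superpixels, and then refines them by local clustering in color and position. The refinement is not shown, and the function name `slic_seed_grid` is hypothetical. In practice an existing implementation such as `skimage.segmentation.slic` with its `n_segments` parameter would normally be used instead.

```python
import math


def slic_seed_grid(height, width, n_segments):
    """Return the SLIC grid interval and initial seed positions (row, col).

    Only the initialization step of SLIC is sketched here.
    """
    # Grid interval S = sqrt(N / K) for N pixels and K superpixels.
    step = math.sqrt(height * width / n_segments)
    rows = max(1, round(height / step))
    cols = max(1, round(width / step))
    seeds = [
        (int((i + 0.5) * height / rows), int((j + 0.5) * width / cols))
        for i in range(rows)
        for j in range(cols)
    ]
    return step, seeds


if __name__ == "__main__":
    step, seeds = slic_seed_grid(240, 320, 48)
    print(step, len(seeds))
```

For a 240 x 320 image and 48 blocks this gives a grid interval of 40 pixels and a 6 x 8 grid of seeds, so each block covers about 1600 pixels before refinement.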

Region features may be extracted from color, texture and shape. Color is the most basic, intuitive and obvious physical feature of an image; texture is the repeated distribution of certain approximate shapes; shape is the outer boundary of an object. Features extracted from color, texture and shape are simple to compute and intuitive to express;
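A minimal region descriptor along these lines might look like the following. The specific measurements are assumptions made for illustration: mean RGB stands in for the color feature, gray-level variance is a crude proxy for texture, and the bounding-box aspect ratio and fill ratio stand in for shape. The text does not state which descriptors are actually used, and the function name `region_descriptor` is hypothetical.

```python
def region_descriptor(pixels):
    """Compute simple color, texture and shape features for one region.

    pixels: list of (row, col, (r, g, b)) entries belonging to the region.
    """
    n = len(pixels)
    if n == 0:
        raise ValueError("region has no pixels")

    # Color: mean of each RGB channel.
    mean_rgb = tuple(sum(p[2][k] for p in pixels) / n for k in range(3))

    # Texture proxy: variance of the gray level inside the region.
    gray = [sum(p[2]) / 3.0 for p in pixels]
    mean_gray = sum(gray) / n
    gray_variance = sum((g - mean_gray) ** 2 for g in gray) / n

    # Shape: bounding-box aspect ratio and how much of the box is filled.
    rows = [p[0] for p in pixels]
    cols = [p[1] for p in pixels]
    box_h = max(rows) - min(rows) + 1
    box_w = max(cols) - min(cols) + 1

    return {
        "mean_rgb": mean_rgb,
        "gray_variance": gray_variance,
        "aspect_ratio": box_w / box_h,
        "extent": n / (box_h * box_w),
    }
```

The resulting dictionary can be flattened into a feature vector per sub-region before clustering.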

Step 4: gather sub-regions of the same category into one class to obtain a result;

S2: assign semantic labels to the cluster sets of image sub-regions;

S3: input images with semantic labels, and cluster sub-regions of the same category using the SHDC method; using the SHDC method in place of traditional spectral clustering achieves a better clustering result;

Step 5: output: merge semantically consistent image regions to obtain the final parsing result of the image. When image regions are merged, whether to merge image region A and image region B is decided according to the color histogram distance.
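Merging semantically consistent regions in Step 5 can be sketched with a union-find structure over a region adjacency list: neighboring sub-regions that carry the same semantic label are joined into one output region. The adjacency input, the function name `merge_consistent_regions` and the choice of union-find are assumptions for illustration, and the color-histogram test is left out to keep the sketch short.

```python
def merge_consistent_regions(labels, adjacency):
    """Merge adjacent regions that share a semantic label.

    labels:    {region_id: semantic label}
    adjacency: iterable of (region_a, region_b) pairs of neighboring regions
    Returns {region_id: id of the merged region it belongs to}.
    """
    parent = {region: region for region in labels}

    def find(region):
        # Follow parent links to the root, compressing the path on the way.
        while parent[region] != region:
            parent[region] = parent[parent[region]]
            region = parent[region]
        return region

    for a, b in adjacency:
        if labels[a] == labels[b]:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                # Keep the smaller id as the representative of the merged region.
                parent[max(root_a, root_b)] = min(root_a, root_b)

    return {region: find(region) for region in labels}


if __name__ == "__main__":
    region_labels = {0: "sky", 1: "sky", 2: "grass", 3: "grass", 4: "sky"}
    neighbors = [(0, 1), (1, 2), (2, 3), (3, 4)]
    print(merge_consistent_regions(region_labels, neighbors))
```

In this example regions 0 and 1 merge, regions 2 and 3 merge, and region 4 stays separate because it does not touch another region with the same label.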

Embodiment 2:

Step 1: divide the image into 50 moderately sized blocks, segmenting each image with the SLIC segmentation method, and extract discriminative region features. The visual similarity between sub-regions is computed by norm reconstruction, which is less sensitive to noise, implicitly reflects spatial information, and is better suited to classification tasks. The norm term is optimized using the CCCP optimization method together with the iterative update procedure of the non-negative multiplier method;

Region features may be extracted from salient points in the image, which can flexibly describe the local information and detailed content of the image;

Step 4: gather sub-regions of the same category into one class to obtain a result;

S2: assign semantic labels to the cluster sets of image sub-regions;

Although embodiments of the present invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, substitutions and variations may be made to these embodiments without departing from the principles and spirit of the present invention; the scope of the present invention is defined by the appended claims.

Claims

1. An image semantic parsing method based on weakly supervised learning, characterized by comprising the following steps:

Step 3: cluster the region features of the segmented images, dividing the clustered image set into sub-regions of several categories;

Step 4: gather sub-regions of the same category into one class to obtain a result;

S2: assign semantic labels to the cluster sets of image sub-regions;

2. The image semantic parsing method based on weakly supervised learning according to claim 1, characterized in that: in Step 1, region features may be extracted from color, texture and shape.

3. The image semantic parsing method based on weakly supervised learning according to claim 1, characterized in that: in Step 1, region features may be extracted from salient points in the image.

4. The image semantic parsing method based on weakly supervised learning according to claim 1, characterized in that: in Step 1, the visual similarity between sub-regions is computed by norm reconstruction, and the norm term is optimized using the CCCP optimization method together with the iterative update procedure of the non-negative multiplier method.

5. The image semantic parsing method based on weakly supervised learning according to claim 1, characterized in that: in Step 1, each image is segmented using the SLIC segmentation method.

6. The image semantic parsing method based on weakly supervised learning according to claim 1, characterized in that: in Step 5, when image regions are merged, whether to merge image region A and image region B is decided according to the color histogram distance.