CN110288045A

CN110288045A - A kind of semantic visual dictionary optimization method based on Pearson correlation coefficient

Info

Publication number: CN110288045A
Application number: CN201910586981.5A
Authority: CN
Inventors: 唐朝晖; 刘亦玲; 高小亮; 范影; 唐励雍; 李耀国
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2019-07-02
Filing date: 2019-07-02
Publication date: 2019-09-27
Anticipated expiration: 2039-07-02
Also published as: CN110288045B

Abstract

The invention discloses a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient extracts the behavioral characteristics of image by the low-level image feature that extracts the color of image, shape and texture and with SURF algorithm first；Using E²LSH clustering algorithm clusters the image behavioral characteristics of acquisition, extracts associated description visual phrase, constructs original visual dictionary；It introduces Pearson correlation coefficient and seeks the degree of correlation between behavioral characteristics and behavioral characteristics and between low-level image feature and behavioral characteristics to the optimization of original visual dictionary, obtain final semantic visual dictionary.The present invention problem complicated for visual phrase redundancy, calculating in semantic visual dictionary, carries out the classification performance that classification improves image to image, reduces the complexity of operation, shorten operation time, while improving the accuracy of classification.

Description

A kind of semantic visual dictionary optimization method based on Pearson correlation coefficient

Technical field

The present invention relates to visual dictionary optimisation technique fields, more particularly to a kind of semanteme based on Pearson correlation coefficient Visual dictionary optimization method.

Background technique

By the concern of lot of domestic and foreign scholar, market and social application are worth and also obtain Image Classfication Technology all the time The affirmative of people is arrived.Unlike text information, image has bigger information content with video information and is more difficult to understand Content, to make computer go to understand that piece image and one section of video also have biggish difficulty as the mankind.In reality In life, the information that people go acquisition image to be included by eyes, the information that then will acquire is located by brain Reason, human brain can get rid of those noises or useless information, retain corresponding image information.It is of the same race when running into next time When the image of classification, brain will make corresponding reaction, accurately be identified to image.In artificial intelligence field, to one Width image is identified and is classified, and will be trained accordingly by computer to it.In the training process, the first step is exactly The image information that will acquire is input in computer；Then computer can carry out image information as human brain corresponding Processing, when there is the image of classification of the same race to input again next time, computer will be identified it according to priori knowledge.So And the complexity of human brain is that people is unthinkable, allowing computer to remove simulation human brain, there are also quite long roads to walk.

In image classification, visual phrase is obtained, must carry out the extraction of feature to image first, then by mentioning The characteristics of image got carries out the processing, such as clustering algorithm, gauss hybrid models etc. of respective algorithms, finally can just obtain vision Phrase.However excessive visual dictionary can make the time complexity of image classification increase, and classify to piece image Perhaps identification inevitably has some picture noises and has an impact to classification or recognition efficiency.These picture noises can not only make The nicety of grading of image declines, and the scale of visual dictionary can be made to become larger, and excessive visual dictionary will increase computer The time cost of classification.

The present invention is mainly based upon machine vision and digital image processing techniques, by extract the color of image, shape and Textural characteristics, and using accelerating robust features algorithm (SURF) to extract image behavioral characteristics, it is breathed out using accurate European local sensitivity Uncommon clustering algorithm (E²LSH) behavioral characteristics of acquisition are clustered, Pearson correlation coefficient is introduced and optimizes visual dictionary.Structure after optimization The semantic visual dictionary built carries out the classification performance that classification improves image to image, reduces the complexity of operation, shortens fortune Evaluation time, while improving the accuracy of classification.

Summary of the invention

The semantic visual dictionary creation method based on Pearson correlation coefficient that the object of the present invention is to provide a kind of.The present invention It is then to be led to by extracting color, shape and the textural characteristics of image and extract with SURF algorithm the behavioral characteristics of image first Cross E²LSH clustering algorithm clusters the image behavioral characteristics of acquisition, associated description visual phrase is extracted, to construct original visual Dictionary, reference Pearson correlation coefficient optimize visual dictionary.

A kind of semantic visual dictionary creation method based on Pearson correlation coefficient, it is characterised in that the following steps are included:

Step A, image behavioral characteristics R={ r is extracted using acceleration robust features algorithm₁, r₂..., r_i..., r_N-1, r_N, The characteristics of the underlying image collection S={ s based on color of image, shape and texture is extracted simultaneously₁, s₂, s₃, s₁, s₂, s₃Respectively image Color, shape and textural characteristics；

Step B, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, obtains original Beginning visual dictionary；

Step C, degree of correlation size between visual phrase is sought in original visual dictionary using the Pearson correlation coefficient degree of correlation | ρ_n|, given threshold ratio₁If | ρ_n| < ratio₁, this visual phrase is added to newly-built visual phrase set；

If step D, | ρ_n|≥ratio₁, then by the corresponding behavioral characteristics of this visual phrase and characteristics of the underlying image --- face Color, shape, textural characteristics seek the Pearson came degree of correlation again | ρ '_n|, given threshold ratio₂If | ρ '_n| > ratio₂, leave this Visual phrase set is added in visual phrase, optimizes the visual phrase in original visual dictionary, constructs new semantic visual dictionary.

The step B includes:

Step B1, image behavioral characteristics are extracted with acceleration robust features algorithm and constitutes behavioral characteristics collection R={ r₁, r₂..., r_i..., r_N-1, r_N, wherein r_iIt is a behavioral characteristics of image, N is characterized Characteristic Number in collection R；

Step B2, from E²A position sensing function g is randomly selected in collection of functions G in LSH clustering algorithm；

Step B3, characteristics of image r in image behavioral characteristics collection R is sought with position sensing function g_iCorresponding k dimensional vector g (r_i)；

Step B4, r is calculated_iMain cryptographic Hash h₁(g(r_i)) and time cryptographic Hash h₂(g(r_i)), all by primary and secondary cryptographic Hash in R Identical behavioral characteristics are put into the same bucket b_kIn, b_kIt is visual phrase；

Step B5, the bucket b of all image behavioral characteristics is sought_kConstitute original visual dictionary T_g={ b₁, b₂..., b_k..., b_Z}。

The step C includes:

Step C1, to original visual dictionary T_g={ b₁, b₂..., b_k..., b_ZIn any one visual phrase b_k, according to Formula Calculate the vision Related coefficient in phrase and original visual dictionary between other visual phrases | ρ_n|, obtain correlation matrix h_k=[| ρ₁ | ..., | ρ_n| ..., | ρ_z-1|]；

Step C2, to correlation matrix h_kDescending sort；

Step C3, given threshold ratio₁, to the correlation matrix h after descending_k, visual phrase is searched for, if | ρ_n| < ratio₁, then the visual phrase is added to newly-built visual phrase set B_g={ b₁, b₂..., b_k..., b_MIn.

Given threshold ratio described in step C₁Value be (0.6,0.7).

Given threshold ratio described in step D₂Value be (0.5,0.7).

Compared with prior art, the present invention considers the behavioral characteristics that image is considered using SURF algorithm, utilizes Pierre Inferior related coefficient has measured the degree of correlation size between behavioral characteristics, calculates the dimension that image indicates to reduce, more can be accurate Indicate the space distribution information of characteristics of image；The low-level image feature of image and behavioral characteristics are used into Pearson correlation coefficient degree simultaneously Amount, avoids being left out some important features, and dimension is small but describes more accurately semantic visual dictionary for building, solves tradition Semantic gap and the problem of redundancy in semantic dictionary, the visual dictionary after optimization can reduce the complexity of subsequent image sort operation Degree shortens operation time, while improving the accuracy of classification.

Detailed description of the invention

Fig. 1 is the semantic visual dictionary creation method flow diagram based on Pearson correlation coefficient.

Specific embodiment

The following further describes the present invention with reference to the drawings.

1, assume that training image collection is D=[d₁, d₂..., d_i..., d_N], wherein d_iIndicate the i-th width image.

2, image behavioral characteristics collection R={ r is extracted using SURF algorithm₁, r₂..., r_i..., r_N-1, r_N, wherein r_iIt is figure One behavioral characteristics of picture, N are Characteristic Number in behavioral characteristics collection R；

3, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, generates Hash table T_g={ b₁, b₂..., b_k..., b_Z, wherein b_kK-th barrel is indicated in Hash table, and Z indicates the total number of bucket in Hash table, Hash Table T_gComplete a particular division to image behavioral characteristics, Hash table T_g={ b₁, b₂..., b_k..., b_ZIt is exactly original visual Dictionary.Specific step is as follows:

A, from E²A position sensing function g is randomly selected in LSH clustering algorithm in collection of functions G；

B, behavioral characteristics r in image behavioral characteristics collection R is sought with position sensing function g_iCorresponding k dimensional vector g (r_i)；

C, r is calculated_iMain cryptographic Hash h₁(g(r_i)) and time cryptographic Hash h₂(g(r_i)), primary and secondary cryptographic Hash in R is all identical Behavioral characteristics are put into the same bucket b_kIn, b_kIt is visual phrase；

D, the bucket b of all image behavioral characteristics is sought_kConstitute original visual dictionary T_g={ b₁, b₂..., b_k..., b_z}。

4, sentenced using the Pearson correlation coefficient degree of correlation as degree of correlation size between visual phrase in original visual dictionary Determine mode, takes original visual dictionary T_gIn any one visual phrase b_k, seek in the visual phrase and original visual dictionary Related coefficient between other visual phrases, in Pearson correlation coefficient formula (1),

Wherein, the meaning that ρ (X, Y) is represented is the linearly related power of two different image behavioral characteristics vector X and Y Degree, wherein indicating two feature vector nonlinear correlations if ρ (X, Y)=0 comprising -1≤ρ (X, Y)≤1, ρ's (X, Y) is exhausted Show that value, correlation is stronger more greatly, i.e., the degree of correlation is bigger.Set a threshold value ratio₁, as judgment basis, obtain correlation Other visual phrases smaller than threshold value are spent, visual phrase set is created.Specific step is as follows:

A, to original visual dictionary T_g={ b₁, b₂..., b_k..., b_ZIn any one visual phrase b_k, according to formula (1) related coefficient in this feature and visual phrase set between other local features is calculated | ρ_n|, obtain correlation matrix h_k =[| ρ₁| ..., | ρ_n| ..., | ρ_Z-₁|]；

B, to correlation matrix h_kDescending sort；

C, given threshold ratio₁=0.65, to the correlation matrix h after descending_k, visual phrase set is searched for, if | ρ_n | < ratio₁, then the visual phrase is added to newly-built visual phrase set B_g={ b₁, b₂..., b_k..., b_MIn；

5, to avoid leaving out some important features, if | ρ_n|≥ratio₁, then the corresponding dynamic of this visual phrase is special Levy and characteristics of the underlying image --- color, shape, textural characteristics seek the Pearson came degree of correlation again | ρ '_n|, given threshold ratio₂, If | ρ '_n| > ratio₂, leave this visual phrase and visual phrase set be added, optimize the visual phrase in original visual dictionary, structure Build new semantic visual dictionary.Specific step is as follows:

A, characteristics of the underlying image collection S={ s is found out₁, s₂, s₃, s₁, s₂, s₃Respectively color of image, shape and texture are special Sign calculates r_iWith the Pearson correlation coefficient between low-level image feature collection S | ρ '_n|=[| ρ '₁|, | ρ '₂|, | ρ '₃|]；

B, given threshold ratio₂=0.6, if | ρ '_n| > ratio₂, then visual phrase set B is added in the visual phrase_g In, to obtain the semantic visual dictionary after final optimization pass.

Claims

1. a kind of semantic visual dictionary creation method based on Pearson correlation coefficient, it is characterised in that the following steps are included:

Step A, image behavioral characteristics R={ r is extracted using acceleration robust features algorithm₁, r₂..., r_i..., r_N-1, r_N, it mentions simultaneously Take the characteristics of the underlying image collection S={ s based on color of image, shape and texture₁, s₂, s₃, s₁, s₂, s₃Respectively color of image, Shape and textural characteristics；

Step B, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, obtains original view Feel dictionary；

If step D, | ρ_n|≥ratio₁, then by the corresponding behavioral characteristics of this visual phrase and characteristics of the underlying image --- color, shape Shape, textural characteristics seek the Pearson came degree of correlation again | ρ '_n|, given threshold ratio₂If | ρ '_n| > ratio₂, then this vision is short Visual phrase set is added in language, optimizes the visual phrase in original visual dictionary, constructs new semantic visual dictionary.

2. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, the step B includes:

Step B4, r is calculated_iMain cryptographic Hash h₁(g(r_i)) and time cryptographic Hash h₂(g(r_i)), primary and secondary cryptographic Hash in R is all identical Behavioral characteristics be put into the same bucket b_kIn, b_kIt is visual phrase；

3. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 2, feature It is, the step C includes:

Step C1, to original visual dictionary T_g={ b₁, b₂..., b_k..., b_ZIn any one visual phrase b_k, according to formula Calculate the visual phrase With the related coefficient between visual phrases other in original visual dictionary | ρ_n|, obtain correlation matrix h_k=[| ρ₁| ..., | ρ_n| ..., | ρ_Z-1|]；

Step C2, to correlation matrix h_kDescending sort；

Step C3, given threshold ratio₁, to the correlation matrix h after descending_k, visual phrase is searched for, if | ρ_n| < ratio₁, The visual phrase is then added to newly-built visual phrase set B_g={ b₁, b₂..., b_k..., b_MIn.

4. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, given threshold ratio described in step C₁Value be (0.6,0.7).

5. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, given threshold ratio described in step D₂Value be (0.5,0.7).