CN110288045A - A kind of semantic visual dictionary optimization method based on Pearson correlation coefficient - Google Patents

A kind of semantic visual dictionary optimization method based on Pearson correlation coefficient Download PDF

Info

Publication number
CN110288045A
CN110288045A CN201910586981.5A CN201910586981A CN110288045A CN 110288045 A CN110288045 A CN 110288045A CN 201910586981 A CN201910586981 A CN 201910586981A CN 110288045 A CN110288045 A CN 110288045A
Authority
CN
China
Prior art keywords
visual
image
behavioral characteristics
phrase
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910586981.5A
Other languages
Chinese (zh)
Other versions
CN110288045B (en
Inventor
唐朝晖
刘亦玲
高小亮
范影
唐励雍
李耀国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN201910586981.5A priority Critical patent/CN110288045B/en
Publication of CN110288045A publication Critical patent/CN110288045A/en
Application granted granted Critical
Publication of CN110288045B publication Critical patent/CN110288045B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient extracts the behavioral characteristics of image by the low-level image feature that extracts the color of image, shape and texture and with SURF algorithm first;Using E2LSH clustering algorithm clusters the image behavioral characteristics of acquisition, extracts associated description visual phrase, constructs original visual dictionary;It introduces Pearson correlation coefficient and seeks the degree of correlation between behavioral characteristics and behavioral characteristics and between low-level image feature and behavioral characteristics to the optimization of original visual dictionary, obtain final semantic visual dictionary.The present invention problem complicated for visual phrase redundancy, calculating in semantic visual dictionary, carries out the classification performance that classification improves image to image, reduces the complexity of operation, shorten operation time, while improving the accuracy of classification.

Description

A kind of semantic visual dictionary optimization method based on Pearson correlation coefficient
Technical field
The present invention relates to visual dictionary optimisation technique fields, more particularly to a kind of semanteme based on Pearson correlation coefficient Visual dictionary optimization method.
Background technique
By the concern of lot of domestic and foreign scholar, market and social application are worth and also obtain Image Classfication Technology all the time The affirmative of people is arrived.Unlike text information, image has bigger information content with video information and is more difficult to understand Content, to make computer go to understand that piece image and one section of video also have biggish difficulty as the mankind.In reality In life, the information that people go acquisition image to be included by eyes, the information that then will acquire is located by brain Reason, human brain can get rid of those noises or useless information, retain corresponding image information.It is of the same race when running into next time When the image of classification, brain will make corresponding reaction, accurately be identified to image.In artificial intelligence field, to one Width image is identified and is classified, and will be trained accordingly by computer to it.In the training process, the first step is exactly The image information that will acquire is input in computer;Then computer can carry out image information as human brain corresponding Processing, when there is the image of classification of the same race to input again next time, computer will be identified it according to priori knowledge.So And the complexity of human brain is that people is unthinkable, allowing computer to remove simulation human brain, there are also quite long roads to walk.
In image classification, visual phrase is obtained, must carry out the extraction of feature to image first, then by mentioning The characteristics of image got carries out the processing, such as clustering algorithm, gauss hybrid models etc. of respective algorithms, finally can just obtain vision Phrase.However excessive visual dictionary can make the time complexity of image classification increase, and classify to piece image Perhaps identification inevitably has some picture noises and has an impact to classification or recognition efficiency.These picture noises can not only make The nicety of grading of image declines, and the scale of visual dictionary can be made to become larger, and excessive visual dictionary will increase computer The time cost of classification.
The present invention is mainly based upon machine vision and digital image processing techniques, by extract the color of image, shape and Textural characteristics, and using accelerating robust features algorithm (SURF) to extract image behavioral characteristics, it is breathed out using accurate European local sensitivity Uncommon clustering algorithm (E2LSH) behavioral characteristics of acquisition are clustered, Pearson correlation coefficient is introduced and optimizes visual dictionary.Structure after optimization The semantic visual dictionary built carries out the classification performance that classification improves image to image, reduces the complexity of operation, shortens fortune Evaluation time, while improving the accuracy of classification.
Summary of the invention
The semantic visual dictionary creation method based on Pearson correlation coefficient that the object of the present invention is to provide a kind of.The present invention It is then to be led to by extracting color, shape and the textural characteristics of image and extract with SURF algorithm the behavioral characteristics of image first Cross E2LSH clustering algorithm clusters the image behavioral characteristics of acquisition, associated description visual phrase is extracted, to construct original visual Dictionary, reference Pearson correlation coefficient optimize visual dictionary.
A kind of semantic visual dictionary creation method based on Pearson correlation coefficient, it is characterised in that the following steps are included:
Step A, image behavioral characteristics R={ r is extracted using acceleration robust features algorithm1, r2..., ri..., rN-1, rN, The characteristics of the underlying image collection S={ s based on color of image, shape and texture is extracted simultaneously1, s2, s3, s1, s2, s3Respectively image Color, shape and textural characteristics;
Step B, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, obtains original Beginning visual dictionary;
Step C, degree of correlation size between visual phrase is sought in original visual dictionary using the Pearson correlation coefficient degree of correlation | ρn|, given threshold ratio1If | ρn| < ratio1, this visual phrase is added to newly-built visual phrase set;
If step D, | ρn|≥ratio1, then by the corresponding behavioral characteristics of this visual phrase and characteristics of the underlying image --- face Color, shape, textural characteristics seek the Pearson came degree of correlation again | ρ 'n|, given threshold ratio2If | ρ 'n| > ratio2, leave this Visual phrase set is added in visual phrase, optimizes the visual phrase in original visual dictionary, constructs new semantic visual dictionary.
The step B includes:
Step B1, image behavioral characteristics are extracted with acceleration robust features algorithm and constitutes behavioral characteristics collection R={ r1, r2..., ri..., rN-1, rN, wherein riIt is a behavioral characteristics of image, N is characterized Characteristic Number in collection R;
Step B2, from E2A position sensing function g is randomly selected in collection of functions G in LSH clustering algorithm;
Step B3, characteristics of image r in image behavioral characteristics collection R is sought with position sensing function giCorresponding k dimensional vector g (ri);
Step B4, r is calculatediMain cryptographic Hash h1(g(ri)) and time cryptographic Hash h2(g(ri)), all by primary and secondary cryptographic Hash in R Identical behavioral characteristics are put into the same bucket bkIn, bkIt is visual phrase;
Step B5, the bucket b of all image behavioral characteristics is soughtkConstitute original visual dictionary Tg={ b1, b2..., bk..., bZ}。
The step C includes:
Step C1, to original visual dictionary Tg={ b1, b2..., bk..., bZIn any one visual phrase bk, according to Formula Calculate the vision Related coefficient in phrase and original visual dictionary between other visual phrases | ρn|, obtain correlation matrix hk=[| ρ1 | ..., | ρn| ..., | ρz-1|];
Step C2, to correlation matrix hkDescending sort;
Step C3, given threshold ratio1, to the correlation matrix h after descendingk, visual phrase is searched for, if | ρn| < ratio1, then the visual phrase is added to newly-built visual phrase set Bg={ b1, b2..., bk..., bMIn.
Given threshold ratio described in step C1Value be (0.6,0.7).
Given threshold ratio described in step D2Value be (0.5,0.7).
Compared with prior art, the present invention considers the behavioral characteristics that image is considered using SURF algorithm, utilizes Pierre Inferior related coefficient has measured the degree of correlation size between behavioral characteristics, calculates the dimension that image indicates to reduce, more can be accurate Indicate the space distribution information of characteristics of image;The low-level image feature of image and behavioral characteristics are used into Pearson correlation coefficient degree simultaneously Amount, avoids being left out some important features, and dimension is small but describes more accurately semantic visual dictionary for building, solves tradition Semantic gap and the problem of redundancy in semantic dictionary, the visual dictionary after optimization can reduce the complexity of subsequent image sort operation Degree shortens operation time, while improving the accuracy of classification.
Detailed description of the invention
Fig. 1 is the semantic visual dictionary creation method flow diagram based on Pearson correlation coefficient.
Specific embodiment
The following further describes the present invention with reference to the drawings.
1, assume that training image collection is D=[d1, d2..., di..., dN], wherein diIndicate the i-th width image.
2, image behavioral characteristics collection R={ r is extracted using SURF algorithm1, r2..., ri..., rN-1, rN, wherein riIt is figure One behavioral characteristics of picture, N are Characteristic Number in behavioral characteristics collection R;
3, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, generates Hash table Tg={ b1, b2..., bk..., bZ, wherein bkK-th barrel is indicated in Hash table, and Z indicates the total number of bucket in Hash table, Hash Table TgComplete a particular division to image behavioral characteristics, Hash table Tg={ b1, b2..., bk..., bZIt is exactly original visual Dictionary.Specific step is as follows:
A, from E2A position sensing function g is randomly selected in LSH clustering algorithm in collection of functions G;
B, behavioral characteristics r in image behavioral characteristics collection R is sought with position sensing function giCorresponding k dimensional vector g (ri);
C, r is calculatediMain cryptographic Hash h1(g(ri)) and time cryptographic Hash h2(g(ri)), primary and secondary cryptographic Hash in R is all identical Behavioral characteristics are put into the same bucket bkIn, bkIt is visual phrase;
D, the bucket b of all image behavioral characteristics is soughtkConstitute original visual dictionary Tg={ b1, b2..., bk..., bz}。
4, sentenced using the Pearson correlation coefficient degree of correlation as degree of correlation size between visual phrase in original visual dictionary Determine mode, takes original visual dictionary TgIn any one visual phrase bk, seek in the visual phrase and original visual dictionary Related coefficient between other visual phrases, in Pearson correlation coefficient formula (1),
Wherein, the meaning that ρ (X, Y) is represented is the linearly related power of two different image behavioral characteristics vector X and Y Degree, wherein indicating two feature vector nonlinear correlations if ρ (X, Y)=0 comprising -1≤ρ (X, Y)≤1, ρ's (X, Y) is exhausted Show that value, correlation is stronger more greatly, i.e., the degree of correlation is bigger.Set a threshold value ratio1, as judgment basis, obtain correlation Other visual phrases smaller than threshold value are spent, visual phrase set is created.Specific step is as follows:
A, to original visual dictionary Tg={ b1, b2..., bk..., bZIn any one visual phrase bk, according to formula (1) related coefficient in this feature and visual phrase set between other local features is calculated | ρn|, obtain correlation matrix hk =[| ρ1| ..., | ρn| ..., | ρZ-1|];
B, to correlation matrix hkDescending sort;
C, given threshold ratio1=0.65, to the correlation matrix h after descendingk, visual phrase set is searched for, if | ρn | < ratio1, then the visual phrase is added to newly-built visual phrase set Bg={ b1, b2..., bk..., bMIn;
5, to avoid leaving out some important features, if | ρn|≥ratio1, then the corresponding dynamic of this visual phrase is special Levy and characteristics of the underlying image --- color, shape, textural characteristics seek the Pearson came degree of correlation again | ρ 'n|, given threshold ratio2, If | ρ 'n| > ratio2, leave this visual phrase and visual phrase set be added, optimize the visual phrase in original visual dictionary, structure Build new semantic visual dictionary.Specific step is as follows:
A, characteristics of the underlying image collection S={ s is found out1, s2, s3, s1, s2, s3Respectively color of image, shape and texture are special Sign calculates riWith the Pearson correlation coefficient between low-level image feature collection S | ρ 'n|=[| ρ '1|, | ρ '2|, | ρ '3|];
B, given threshold ratio2=0.6, if | ρ 'n| > ratio2, then visual phrase set B is added in the visual phraseg In, to obtain the semantic visual dictionary after final optimization pass.

Claims (5)

1. a kind of semantic visual dictionary creation method based on Pearson correlation coefficient, it is characterised in that the following steps are included:
Step A, image behavioral characteristics R={ r is extracted using acceleration robust features algorithm1, r2..., ri..., rN-1, rN, it mentions simultaneously Take the characteristics of the underlying image collection S={ s based on color of image, shape and texture1, s2, s3, s1, s2, s3Respectively color of image, Shape and textural characteristics;
Step B, it is clustered using image behavioral characteristics of the accurate European local sensitivity Hash clustering algorithm to acquisition, obtains original view Feel dictionary;
Step C, degree of correlation size between visual phrase is sought in original visual dictionary using the Pearson correlation coefficient degree of correlation | ρn|, Given threshold ratio1If | ρn| < ratio1, this visual phrase is added to newly-built visual phrase set;
If step D, | ρn|≥ratio1, then by the corresponding behavioral characteristics of this visual phrase and characteristics of the underlying image --- color, shape Shape, textural characteristics seek the Pearson came degree of correlation again | ρ 'n|, given threshold ratio2If | ρ 'n| > ratio2, then this vision is short Visual phrase set is added in language, optimizes the visual phrase in original visual dictionary, constructs new semantic visual dictionary.
2. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, the step B includes:
Step B1, image behavioral characteristics are extracted with acceleration robust features algorithm and constitutes behavioral characteristics collection R={ r1, r2..., ri..., rN-1, rN, wherein riIt is a behavioral characteristics of image, N is characterized Characteristic Number in collection R;
Step B2, from E2A position sensing function g is randomly selected in collection of functions G in LSH clustering algorithm;
Step B3, characteristics of image r in image behavioral characteristics collection R is sought with position sensing function giCorresponding k dimensional vector g (ri);
Step B4, r is calculatediMain cryptographic Hash h1(g(ri)) and time cryptographic Hash h2(g(ri)), primary and secondary cryptographic Hash in R is all identical Behavioral characteristics be put into the same bucket bkIn, bkIt is visual phrase;
Step B5, the bucket b of all image behavioral characteristics is soughtkConstitute original visual dictionary Tg={ b1, b2..., bk..., bZ}。
3. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 2, feature It is, the step C includes:
Step C1, to original visual dictionary Tg={ b1, b2..., bk..., bZIn any one visual phrase bk, according to formula Calculate the visual phrase With the related coefficient between visual phrases other in original visual dictionary | ρn|, obtain correlation matrix hk=[| ρ1| ..., | ρn| ..., | ρZ-1|];
Step C2, to correlation matrix hkDescending sort;
Step C3, given threshold ratio1, to the correlation matrix h after descendingk, visual phrase is searched for, if | ρn| < ratio1, The visual phrase is then added to newly-built visual phrase set Bg={ b1, b2..., bk..., bMIn.
4. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, given threshold ratio described in step C1Value be (0.6,0.7).
5. a kind of semantic visual dictionary optimization method based on Pearson correlation coefficient according to claim 1, feature It is, given threshold ratio described in step D2Value be (0.5,0.7).
CN201910586981.5A 2019-07-02 2019-07-02 Semantic visual dictionary optimization method based on Pearson correlation coefficient Active CN110288045B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910586981.5A CN110288045B (en) 2019-07-02 2019-07-02 Semantic visual dictionary optimization method based on Pearson correlation coefficient

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910586981.5A CN110288045B (en) 2019-07-02 2019-07-02 Semantic visual dictionary optimization method based on Pearson correlation coefficient

Publications (2)

Publication Number Publication Date
CN110288045A true CN110288045A (en) 2019-09-27
CN110288045B CN110288045B (en) 2023-03-24

Family

ID=68021541

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910586981.5A Active CN110288045B (en) 2019-07-02 2019-07-02 Semantic visual dictionary optimization method based on Pearson correlation coefficient

Country Status (1)

Country Link
CN (1) CN110288045B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102693311A (en) * 2012-05-28 2012-09-26 中国人民解放军信息工程大学 Target retrieval method based on group of randomized visual vocabularies and context semantic information
US20150269191A1 (en) * 2014-03-20 2015-09-24 Beijing University Of Technology Method for retrieving similar image based on visual saliencies and visual phrases
CN107480718A (en) * 2017-08-17 2017-12-15 南京信息工程大学 A kind of high-resolution remote sensing image sorting technique of view-based access control model bag of words

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102693311A (en) * 2012-05-28 2012-09-26 中国人民解放军信息工程大学 Target retrieval method based on group of randomized visual vocabularies and context semantic information
US20150269191A1 (en) * 2014-03-20 2015-09-24 Beijing University Of Technology Method for retrieving similar image based on visual saliencies and visual phrases
CN107480718A (en) * 2017-08-17 2017-12-15 南京信息工程大学 A kind of high-resolution remote sensing image sorting technique of view-based access control model bag of words

Also Published As

Publication number Publication date
CN110288045B (en) 2023-03-24

Similar Documents

Publication Publication Date Title
CN111523462B (en) Video sequence expression recognition system and method based on self-attention enhanced CNN
CN101315663B (en) Nature scene image classification method based on area dormant semantic characteristic
CN109101938B (en) Multi-label age estimation method based on convolutional neural network
CN110097060B (en) Open set identification method for trunk image
CN102385592B (en) Image concept detection method and device
CN110415071B (en) Automobile competitive product comparison method based on viewpoint mining analysis
CN110751027B (en) Pedestrian re-identification method based on deep multi-instance learning
CN109886161A (en) A kind of road traffic index identification method based on possibility cluster and convolutional neural networks
CN106997379B (en) Method for merging similar texts based on click volumes of image texts
CN111325237B (en) Image recognition method based on attention interaction mechanism
CN110472652A (en) A small amount of sample classification method based on semanteme guidance
CN110414513A (en) Vision significance detection method based on semantically enhancement convolutional neural networks
CN112784929A (en) Small sample image classification method and device based on double-element group expansion
CN115309860B (en) False news detection method based on pseudo twin network
CN112883931A (en) Real-time true and false motion judgment method based on long and short term memory network
CN110765285A (en) Multimedia information content control method and system based on visual characteristics
CN111860278A (en) Human behavior recognition algorithm based on deep learning
CN115130538A (en) Training method of text classification model, text processing method, equipment and medium
Cosovic et al. Classification methods in cultural heritage
Zhang et al. Cross-dataset learning for age estimation
CN113779283B (en) Fine-grained cross-media retrieval method with deep supervision and feature fusion
CN114625908A (en) Text expression package emotion analysis method and system based on multi-channel attention mechanism
CN114662586A (en) Method for detecting false information based on common attention multi-mode fusion mechanism
CN112200260B (en) Figure attribute identification method based on discarding loss function
CN113535960A (en) Text classification method, device and equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant