CN103064941A

CN103064941A - Image retrieval method and device

Info

Publication number: CN103064941A
Application number: CN2012105723718A
Authority: CN
Inventors: 陈世峰; 曹琛
Original assignee: Shenzhen Institute of Advanced Technology of CAS
Current assignee: Shenzhen Institute of Advanced Technology of CAS
Priority date: 2012-12-25
Filing date: 2012-12-25
Publication date: 2013-04-24
Anticipated expiration: 2032-12-25
Also published as: CN103064941B

Abstract

The invention provides an image retrieval method and device. The method includes acquiring a retrieval key word and performing screening in a database according to the retrieval key word to obtain an image set; establishing a first spectrogram model of the image set according to image characteristics to obtain a similarity relation of each two images in the image set; establishing a semi-supervised learning model according to the similarity relation; subjecting the image set to denoising according to the semi-supervised learning model to obtain a denoising image set; and returning the denoising image set to be used as a retrieval result corresponding to the retrieval key word. According to the image retrieval method and device, by means of the establishing of the spectrogram model of the image set, the semi-supervised learning model is established, the image set is subjected to denoising according to the semi-supervised learning model, the image set which is subjected to denoising is returned to be used as the retrieval result corresponding to the retrieval key word, overall denoising is performed on the retrieved image set, and the accuracy of image retrieval is improved.

Description

Image search method and device

Technical field

The present invention relates to image retrieval technologies, particularly relate to a kind of image search method and device.

Background technology

Image retrieval technologies based on keyword is the image retrieval technologies of current main-stream, yet because the existence of error label and the ambiguity of search key language obtain image by the image retrieval technologies retrieval based on keyword usually not accurate enough.

Summary of the invention

Based on this, be necessary for the not accurate enough problem of the result for retrieval of conventional images retrieval technique, a kind of image search method and device that can improve retrieval precision is provided.

A kind of image search method comprises the steps:

Obtain search key, and screen from database according to search key and to obtain image collection;

Set up the first spectrogram model of image collection according to characteristics of image, obtain the in twos similarity relation between the image in the image collection;

Set up the semi-supervised learning model according to similarity relation;

According to the semi-supervised learning model image collection is carried out denoising, obtain the denoising image collection;

Return the denoising image collection as the corresponding result for retrieval of search key.

A kind of image retrieving apparatus comprises:

Acquisition module is used for obtaining search key, and screens from database according to search key and to obtain image collection;

MBM, the first spectrogram model for set up image collection according to characteristics of image obtains the in twos similarity relation between the image in the image collection;

Study module is used for setting up the semi-supervised learning model according to similarity relation;

The denoising module is used for according to the semi-supervised learning model image collection being carried out denoising, obtains the denoising image collection;

Sending module is used for returning the denoising image collection as the corresponding result for retrieval of search key.Above-mentioned image search method and device, by obtaining search key, and screen from database according to search key and to obtain image collection, set up the first spectrogram model of image collection according to characteristics of image, obtain the in twos similarity relation between the image in the image collection, set up the semi-supervised learning model according to similarity relation, according to the semi-supervised learning model image collection is carried out denoising, obtain the denoising image collection, return the denoising image collection as the corresponding result for retrieval of search key, by the image collection that retrieves is carried out overall denoising, improved the degree of accuracy of image retrieval.

Description of drawings

Fig. 1 is image search method schematic flow sheet among the embodiment;

Fig. 2 is image retrieving apparatus structural representation among the embodiment;

Fig. 3 is the schematic flow sheet of image search method in another embodiment.

Embodiment

Be described in detail below in conjunction with specific embodiment and the accompanying drawing technical scheme to image search method and device, so that it is clearer.

As shown in Figure 1, in one embodiment, a kind of image search method comprises the steps:

S110 obtains search key, and screens from database according to search key and to obtain image collection.

In the present embodiment, obtain the words that is used for retrieving images that the user inputs as key word in search engine, according to this key word from image data base or other include in the database of figure and screen image, can be according to the text based retrieval technology such as content to the description of image, image name, image place webpage.

S130 sets up the first spectrogram model of image collection according to characteristics of image, obtain the in twos similarity relation between the image in the image collection.

In the present embodiment, obtain first the image feature value that obtains in the image collection, such as the characteristics of image such as rgb value (intensity levels of three colors of RGB), brightness value, tone, saturation degree or the figure number of plies of image.Set up the Characteristic of Image vector according to this image feature value, this proper vector is multi-C vector, and every one dimension is by a kind of characteristics of image value representation.According to proper vector image collection is gathered χ={ x by first node ₁..., x _nExpression, wherein x _nA multi-C vector, each x _nRepresent an image, x _nA dimension represent an eigenwert, image by a vector representation in the node set, is convenient to subsequent calculations.

According to first node set opening relationships matrix W, when i ≠ j, w _Ij=exp (|| x _i-x _j|| ²/ σ ²), when i=j, w _Ij=0, represent that by relational matrix W the similarity between each image concerns.Further, W does normalized to similar matrix, the similarity between the image is concerned get first normalization limit matrix S=D by the data representation that passes through between 0 and 1 ^-1/2WD ^-1/2, wherein D is that diagonal element is

Diagonal matrix, namely this diagonal element is w _IjThe all elements of column and.Normalization limit matrix is based on the in twos foundation of the mutual relationship between the node, can be used for excavating the inner structure of node set.

S150 sets up the semi-supervised learning model according to similarity relation.

In the present embodiment, obtain first in the first node set before p node, be positive sample with p node demarcation, for example, for the set χ of n node={ x ₁..., x _p, x _P+1..., x _n, its front p node is demarcated is positive sample, p can be preset value, also can be by spectral clustering to node in conjunction with calculating.Definition query vector y, y is a multi-C vector, for demarcating node, y=y _i=1 (i≤p), for not demarcating node, y=y _u=0 (p+1≤u≤n), y _iPerhaps y _uIt is the value of a dimension among the multi-C vector y.Definition prediction label vector f, wherein f _i(the expression of 1≤i≤n) node x _iThe prediction label, f is multi-C vector.

Further, y and the f according to definition sets up the energy function of predicting the label vector f

E (f) = Σ_{i, j = 1}^{n} w_{ij} {(\frac{f_{i}}{\sqrt{d_{ii}}} - \frac{f_{i}}{\sqrt{d_{jj}}})}^{2} + μ Σ_{i = 1}^{n} {(f_{i} - y_{i})}^{2},

Wherein μ is balance factor, can be preset value,

Level and smooth risk item, if x _iAnd x _jLarger w is arranged _Ij, then keep f _iAnd f _iMore approaching;

Be the empiric risk item, keep f to compare with original demarcation y changing little.

At last, the Global optimal solution f=(1-α) (I-α S) that according to this energy function E (f) f is differentiated and can get E (f) ^-1Y, the semi-supervised learning model that namely obtains, wherein α=1/ (1+ μ), I is unit matrix, S=D ^-1/2WD ^-1/2, wherein D is that diagonal element is

Diagonal matrix.

Come the image of front p position in the image collection that retrieval is obtained as positive sample, it is 1 that its label is set, the label of other images is set to 0 in the image collection, the image collection that sets label is formed binary vector y, calculate by the semi-supervised learning model, can be in the hope of rearrangement mark f.To whenever sorting from big to small with dimension value of f kind, can get sequence node, can resequence to image collection, the node in the node of standing out and the positive sample puts in order more approaching.

S170 carries out denoising according to the semi-supervised learning model to image collection, obtains the denoising image collection.

In one embodiment, above-mentioned steps S170 specifically may further comprise the steps:

Obtain the label of single node prediction according to the semi-supervised learning model, obtain label matrix F ^*=(I-α S) ^-1[y ¹. ..., y ⁱ..., y ⁿ]=(I-α S) ^-1Wherein,

Be based on the query vector y that single node is demarcated ^jAnd to x _iThe prediction label;

Carry out the spectral clustering analysis according to label matrix, obtain a plurality of class group;

Leading mark according to label matrix and the described node of class group definition is

According to inequality

Judge noise class group, wherein

Expression is averaged the data among the c of class group,

Expression is averaged k class group, and β is preset value;

Remove the noise class and roll into a ball corresponding noise image set, obtain the denoising image collection.

In the present embodiment, the node in the semi-supervised model can form main class group usually, and node corresponding to noise image can dilute the density of class group, the node that is positioned at same geometric configuration can be used as same class group, and noise then is the exceptional value that disperses.Specifically can be by study mapping g () with luv space

Twist into new space So that all exceptional values can form a new class group, and all class groups are separated from each other, and are convenient to noise remove like this.

Be based on the query vector y that single node is demarcated ^jAnd to x _iThe prediction label, if x _iAnd x _jBelong to same class group,

Value should be larger, and

With

At each dimension k=1 ..., the value of n is more close, and abnormal nodes should be less in the value of nearly all dimension.

Definition mapping g: χ → R ⁿ,

Then based on χ ^*=g (χ) sets up spectrogram, obtains normalization limit matrix S ^*With normalization figure Laplce L ^*=I-S ^*, order

Be L ^*Eigenwert and proper vector pair, and λ _i≤ ... ≤ λ _nL ^*Be block diagonal matrix, the element between the same class group has larger absolute value.L ^*Less partial feature value characteristic of correspondence vector is keeping same block structure, makes it form U _k=[v ₁, v ₂..., v _k], wherein k is χ ^*The quantity that middle class is rolled into a ball can be by L ^*The eigenwert of arranging from small to large in k the largest interval value occurs and determine with k+1.Then will with the K averaging method Gather into the k class, comprising the class that is formed by discrete noise node.If to F ^*The summation of every row, row corresponding to noise itself and less.

According to label matrix and the definition x of class group _iLeading mark be

Use simultaneously c ∈ 1 ..., the label of k} representation class group.Then can be according to inequality

Judge noise class group, wherein

Expression is averaged the data among the c of class group,

Expression is averaged k class group, and β is the threshold value factor, can be preset value, and removal noise class namely obtains the denoising image collection after rolling into a ball corresponding noise image set.

S190 returns the denoising image collection as the corresponding result for retrieval of search key.

In the present embodiment, the image collection after the denoising is returned to search engine, as the corresponding result for retrieval of search key, namely finish the retrieval of image.

Above-mentioned image search method, by obtaining search key, and screen from database according to search key and to obtain image collection, set up the first spectrogram model of image collection according to characteristics of image, obtain the in twos similarity relation between the image in the image collection, set up the semi-supervised learning model according to similarity relation, according to the semi-supervised learning model image collection is carried out denoising, obtain the denoising image collection, return the denoising image collection as the corresponding result for retrieval of search key, by the image collection that retrieves is carried out overall denoising, improved the degree of accuracy of image retrieval.

In one embodiment, above-mentioned steps S190 specifically may further comprise the steps:

The denoising image collection is set up the second spectrogram model, obtain the corresponding Section Point of denoising image collection set χ ' and based on the second normalization limit matrix S of χ ' ';

Set up the maximization function according to the spectrogram model

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Wherein, M=(I-α S') ^-1, m _Ii=0, γ is preset value, M ^{P * p}The capable p row of front p of M;

Obtain positive sample by alternative manner solution maximization function;

By positive sample training semi-supervised learning model, so that the denoising image collection is resequenced, obtain the image collection that reorders;

Return the result for retrieval of image collection as search key that reorder.

In the present embodiment, as described in Figure 2, on the basis of denoising image collection, set up the spectrogram model, carry out the spectral clustering analysis, obtain leading class, to obtain positive sample, be used for training semi-supervised learning model and then the denoising image collection sorted, obtain final result for retrieval.With key word since with in terms of content with key word the relevant ratio of image in the image of standing out so set up spectrogram based on the image set of standing out, select to dominate class usually above the ratio in whole image set.Leading class, the i.e. many and high class group of density of node in the spectral clustering.

Concrete, make that χ ' is the denoising image collection, S' is the normalization limit matrix based on χ '.

Set up matrix M=(I-α S') ^-1And m _Ii=0.Owing to more may occupy larger proportion in the image that positive sample is stood out to form leading class group in image collection, only consider the front p dimension of M here, p can preset value.

The query vector y of definition p * 1 _pExpression comes the demarcation information of front p width of cloth image.In order to make y _pDemarcation information accurate, set up the maximization function

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Wherein γ is balance factor, M ^{P * p}The capable p row of front p of M,

Be the density item, weigh by y _pThe density of the block structure that middle nominal data forms,

Be the yardstick item, guarantee that leading class group has larger size, It is the demarcation query vector after the purification of requirement.

For above-mentioned maximization function, adopt alternative manner to find the solution.At first, at all dimension initialization y _p=1, for each iteration, the value of certain one dimension is become 0 from 1, so that

Amplification is maximum.When

In the time of can't increasing by this mode, iteration stopping.Remaining 1 corresponding image corresponding to expression is positive sample.After demarcating good positive sample, the processing of resequencing of image that can the denoising image collection is arranged on the denoising image collection of positive sample by the semi-supervised learning model being applied in demarcate.The result that will resequence at last returns as the result for retrieval of search key and connects, and namely gets the result for retrieval that improves retrieval accuracy.

As shown in Figure 3, in one embodiment, a kind of image retrieving apparatus comprises acquisition module 110, MBM 130, study module 150, denoising module 170 and sending module 190.

Acquisition module 110 is used for obtaining search key, and screens from database according to search key and to obtain image collection.

In the present embodiment, acquisition module 110 obtains the words that is used for retrieving images that the user inputs as key word in search engine, according to this key word from image data base or other include in the database of figure and screen image, can be according to the text based retrieval technology such as content to the description of image, image name, image place webpage.

MBM 130, the first spectrogram model for set up image collection according to characteristics of image obtains the in twos similarity relation between the image in the image collection.

Study module 150 is used for setting up the semi-supervised learning model according to similarity relation.

E (f) = Σ_{i, j = 1}^{n} w_{ij} {(\frac{f_{i}}{\sqrt{d_{ii}}} - \frac{f_{i}}{\sqrt{d_{jj}}})}^{2} + μ Σ_{i = 1}^{n} {(f_{i} - y_{i})}^{2},

Wherein μ is balance factor, can be preset value,

Level and smooth risk item, if x _iAnd x _jLarger w is arranged _Ij, then keep f _iAnd f _jMore approaching;

At last, the Global optimal solution f=(1-α) (I-α S) that according to this energy function E (f) f is differentiated and can get E (f) ^-1Y, the semi-supervised learning model that namely obtains, wherein α=1/ (1+ μ), I is unit matrix, S=D ^-1/2WD ^-1/2, wherein D is that diagonal element is Diagonal matrix.

Denoising module 170 is used for according to the semi-supervised learning model image collection being carried out denoising, obtains the denoising image collection.

In one embodiment, above-mentioned denoising module 170 also is used for obtaining according to the semi-supervised learning model label of single node prediction, obtains label matrix F ^*=(I-α S) ^-1[y ¹. ..., y ⁱ..., y ⁿ]=(I-α S) ^-1Wherein,

Be based on the query vector y that single node is demarcated ^jAnd to x _iThe prediction label, carry out the spectral clustering analysis according to label matrix, obtain a plurality of class group, according to the leading mark of label matrix and class group defined node be

According to inequality Judge noise class group, wherein

Expression is averaged the data among the c of class group, Expression is averaged k class group, and β is preset value, removes described noise class and rolls into a ball corresponding noise image set, obtains the denoising image collection.

Twist into new space

So that all exceptional values can form a new class group, and all class groups are separated from each other, and are convenient to noise remove like this.

Value should be larger, and With

Definition mapping g: χ → R ⁿ,

Then based on x ^*=g (χ) sets up spectrogram, obtains normalization limit matrix S ^*With normalization figure Laplce L ^*=I-S ^*, order

Be L ^*Eigenwert and proper vector pair, and λ _i≤ ... ≤ λ _nL ^*Be block diagonal matrix, the element between the same class group has larger absolute value.L ^*Less partial feature value characteristic of correspondence vector is keeping same block structure, makes it form U ^k=[v ₁, v ₂..., v _k], wherein k is χ ^*The quantity that middle class is rolled into a ball can be by L ^*The eigenwert of arranging from small to large in k the largest interval value occurs and determine with k+1.Then will with the K averaging method

Gather into the k class, comprising the class that is formed by discrete noise node.If to F ^*The summation of every row, row corresponding to noise itself and less.

According to label matrix and the definition x of class group _iLeading mark be

Judge noise class group, wherein

Expression is averaged the data among the c of class group,

Sending module 190 is used for returning the denoising image collection as the corresponding result for retrieval of search key.

Above-mentioned image collator, by obtaining search key, and screen from database according to search key and to obtain image collection, set up the first spectrogram model of image collection according to characteristics of image, obtain the in twos similarity relation between the image in the image collection, set up the semi-supervised learning model according to similarity relation, according to the semi-supervised learning model image collection is carried out denoising, obtain the denoising image collection, return the denoising image collection as the corresponding result for retrieval of search key, by the image collection that retrieves is carried out overall denoising, improved the degree of accuracy of image retrieval.

In one embodiment, above-mentioned sending module 190 also is used for the denoising image collection is set up the second spectrogram model, obtain the corresponding Section Point of denoising image collection set χ ' and based on the second normalization limit matrix S of χ ' ', set up the maximization function according to the spectrogram model

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Wherein, M=(I-α S') ^-1, m _Ii=0, γ is preset value, M ^{P * p}The capable p row of front p of M, obtain positive sample by alternative manner solution maximization function, by positive sample training semi-supervised learning model, so that the denoising image collection is resequenced, obtain the image collection that reorders, return the result for retrieval of image collection as search key that reorder.

In the present embodiment, on the basis of denoising image collection, set up the spectrogram model, carry out the spectral clustering analysis, obtain leading class, to obtain positive sample, be used for training semi-supervised learning model and then the denoising image collection sorted, obtain final result for retrieval.With key word since with in terms of content with key word the relevant ratio of image in the image of standing out so set up spectrogram based on the image set of standing out, select to dominate class usually above the ratio in whole image set.Leading class, the i.e. many and high class group of density of node in the spectral clustering.

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Wherein γ is balance factor, M ^{P * p}The capable p row of front p of M,

For above-mentioned maximization function, adopt alternative manner to find the solution, at first, at all dimension initialization y _p=1, for each iteration, the value of certain one dimension is become 0 from 1, so that Amplification is maximum.When

One of ordinary skill in the art will appreciate that all or part of flow process that realizes in above-described embodiment method, to come the relevant hardware of instruction to finish by computer program, described program can be stored in the computer read/write memory medium, this program can comprise the flow process such as the embodiment of above-mentioned each side method when carrying out.Wherein, described storage medium can be magnetic disc, CD, read-only store-memory body (Read-Only Memory, ROM) or random store-memory body (Random Access Memory, RAM) etc.

The above embodiment has only expressed several embodiment of the present invention, and it describes comparatively concrete and detailed, but can not therefore be interpreted as the restriction to claim of the present invention.Should be pointed out that for the person of ordinary skill of the art, without departing from the inventive concept of the premise, can also make some distortion and improvement, these all belong to protection scope of the present invention.Therefore, the protection domain of patent of the present invention should be as the criterion with claims.

Claims

1. an image search method comprises the steps:

Obtain search key, and screen from database according to described search key and to obtain image collection;

Set up the first spectrogram model of described image collection according to characteristics of image, obtain the in twos similarity relation between the image in the described image collection;

Set up the semi-supervised learning model according to described similarity relation;

According to described semi-supervised learning model described image collection is carried out denoising, obtain the denoising image collection;

Return described denoising image collection as the corresponding result for retrieval of described search key.

2. image search method according to claim 1 is characterized in that, described the first spectrogram model of setting up described image collection according to characteristics of image, and the in twos step of the similarity relation between the image that obtains in the described image collection comprises:

Obtain image feature value, set up the Characteristic of Image vector;

According to described proper vector described image collection is gathered χ={ x by first node ₁..., x _nExpression, wherein x _nA multi-C vector, x _nA dimension represent an eigenwert;

According to described first node set opening relationships matrix W, wherein, when i ≠ j, w _Ij=exp (|| x _i-x _j|| ²/ σ ²); When i=j, w _Ij=0;

Described similar matrix W is done normalized get first normalization limit matrix S=D ^-1/2WD ^-1/2, wherein D is that diagonal element is

Diagonal matrix.

3. image search method according to claim 2 is characterized in that, the described step of setting up the semi-supervised learning model according to described similarity relation comprises:

P node before obtaining in the described first node set, it is positive sample that described p node demarcated;

Definition query vector y is wherein for demarcating node, y=y _i=1 (i≤p), for not demarcating node, y=y _u=0 (p+1≤u≤n);

Definition prediction label vector f, wherein f _i(1≤i≤n) expression node xi predicts label;

Set up the energy function of described prediction label vector f

E (f) = Σ_{i, j = 1}^{n} w_{ij} {(\frac{f_{i}}{\sqrt{d_{ii}}} - \frac{f_{i}}{\sqrt{d_{jj}}})}^{2} + μ Σ_{i = 1}^{n} {(f_{i} - y_{i})}^{2};

According to described energy function f is differentiated and to get semi-supervised learning model f=(1-α) (I-α S) ^-1Y, wherein α=1/ (1+ μ), I is unit matrix.

4. image search method according to claim 3 is characterized in that, describedly according to described semi-supervised learning model described image collection is carried out denoising, and the step that obtains the denoising image collection is:

Obtain the label of single node prediction according to described semi-supervised learning model, obtain label matrix F ^*=(I-α S) ^-1[y ¹. ..., y ⁱ..., y ⁿ]=(I-α S) ^-1Wherein,

Carry out the spectral clustering analysis according to described label matrix, obtain a plurality of class group;

Leading mark according to described label matrix and the described node of class group definition is

According to inequality

Judge noise class group, wherein

Expression is averaged the data among the c of class group, Expression is averaged k class group, and β is preset value;

Remove described noise class and roll into a ball corresponding noise image set, obtain the denoising image collection.

5. according to claim 4 described image search methods, it is characterized in that, describedly return described denoising image collection and as the step of the corresponding result for retrieval of described search key be:

Described denoising image collection is set up the second spectrogram model, obtain the corresponding Section Point of described denoising image collection set χ ' and based on the second normalization limit matrix S of χ ' ';

Set up the maximization function according to described the second spectrogram model

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Obtain positive sample by the described maximization function of alternative manner solution;

By the described semi-supervised learning model of described positive sample training, so that described denoising image collection is resequenced, obtain the image collection that reorders;

Return the described result for retrieval of image collection as described search key that reorder.

6. image retrieving apparatus comprises:

Acquisition module is used for obtaining search key, and screens from database according to described search key and to obtain image collection;

MBM, the first spectrogram model for set up described image collection according to characteristics of image obtains the in twos similarity relation between the image in the described image collection;

Study module is used for setting up the semi-supervised learning model according to described similarity relation;

The denoising module is used for according to described semi-supervised learning model described image collection being carried out denoising, obtains the denoising image collection;

Sending module is used for returning described denoising image collection as the corresponding result for retrieval of described search key.

7. image retrieving apparatus according to claim 6 is characterized in that, described MBM also is used for obtaining image feature value, sets up the Characteristic of Image vector, according to described proper vector described image collection is gathered x={x by first node ₁..., x _nExpression, wherein x _nA multi-C vector, x _nA dimension represent an eigenwert, according to described first node set opening relationships matrix W, wherein, when i ≠ j, w _Ij=exp (|| x _i-x _j|| ²/ σ ²); When i=j, w _Ij=0, described similar matrix W is done normalized get first normalization limit matrix S=D ^-1/2WD ^-1/2, wherein D is that diagonal element is

Diagonal matrix.

8. image retrieving apparatus according to claim 7 is characterized in that, describedly also is used for obtaining p node before the described first node set according to described study module, it is positive sample that described p node demarcated, definition query vector y is wherein for demarcating node, y=y _i=1 (i≤p), for not demarcating node, y=y _u=0 (p+1≤u≤n), definition prediction label vector f, wherein f _i(the expression of 1≤i≤n) node x _iPredict label, set up the energy function of described prediction label vector f

E (f) = Σ_{i, j = 1}^{n} w_{ij} {(\frac{f_{i}}{\sqrt{d_{ii}}} - \frac{f_{i}}{\sqrt{d_{jj}}})}^{2} + μ Σ_{i = 1}^{n} {(f_{i} - y_{i})}^{2},

9. image retrieval rotary device according to claim 8 is characterized in that, the described label that also is used for obtaining according to described semi-supervised learning model the single node prediction according to described denoising module obtains label matrix F ^*=(I-α S) ^-1[y ¹. ..., y ⁱ..., y ⁿ]=(I-α S) ^-1Wherein,

Be based on the query vector y that single node is demarcated ^jAnd to x _iThe prediction label, carry out the spectral clustering analysis according to described label matrix, obtain a plurality of class group, according to the leading mark of described label matrix and the described node of class group definition be

According to inequality

Judge noise class group, wherein

Expression is averaged the data among the c of class group,

Expression is averaged k class group, and β is preset value, removes described noise class and rolls into a ball corresponding noise image set, obtains the denoising image collection.

10. according to claim 9 described image retrieving apparatus, it is characterized in that, described sending module also is used for described denoising image collection is set up the second spectrogram model, obtain the corresponding Section Point of described denoising image collection set χ ' and based on the second normalization limit matrix S of χ ' ', set up the maximization function according to described the second spectrogram model

y_{p}^{*} = \arg \max (\frac{y_{p}^{T} M^{p \times p} y_{p}}{{(Σ_{i = 1}^{n} (y_{p})_{i})}^{2}} - γ \frac{1}{{(Σ_{i = 1}^{n} {(y_{p})}_{i})}^{2}}),

Wherein, M=(I-α S') ^-1, m _Ii=0, γ is preset value, M ^{P * p}The capable p row of front p of M, obtain positive sample by the described maximization function of alternative manner solution, by the described semi-supervised learning model of described positive sample training, so that described denoising image collection is resequenced, obtain the image collection that reorders, return the described result for retrieval of image collection as described search key that reorder.