CN106650629A

CN106650629A - Kernel sparse representation-based fast remote sensing target detection and recognition method

Info

Publication number: CN106650629A
Application number: CN201611039128.4A
Authority: CN
Inventors: 高红民; 陈玲慧; 陆迎曙; 李臣明; 杨耀; 樊悦; 张振; 谢科伟; 黄昌运
Original assignee: Hohai University HHU
Current assignee: Hohai University HHU
Priority date: 2016-11-21
Filing date: 2016-11-21
Publication date: 2017-05-10

Abstract

The invention discloses a kernel sparse representation-based fast remote sensing target detection and recognition method. The method includes the following steps that: S1, four RGB characteristic channels are created; S2, the four-phase Fourier transformation of the four characteristic channels of a given image is calculated, a phase spectrum is extracted, the images of the four characteristic channels are reestablished through inverse Fourier transformation, and a saliency map can be generated; S3, binaryzation division is performed on the saliency map obtained in the step S2, and candidate regions of interest are extracted; S4, a search box is scanned through an effective sub-window search algorithm, so that image blocks to be detected are obtained, so that a remote sensing target image block training set is obtained; S5, SIFT features are extracted from the remote sensing target image block training set, and a sparse dictionary is generated; S6, a spatial pyramid is adopted to map the SIFT features; S7, kernel sparse representation is obtained; S8, the kernel sparse representation is solved; S9, the space pyramid vector representation of a target is performed; and S10, a linear support vector machine classification algorithm is used in combination to complete a recognition task.

Description

A kind of rapid remote sensing target detection recognition methods based on nuclear sparse expression

Technical field

The present invention relates to remote sensing images analysis field, more particularly to a kind of rapid remote sensing target based on nuclear sparse expression Detection recognition method.

Background technology

Due to the complexity of remote sensing image, detect that multiclass interesting target thing is one in the remote sensing image of view picture large format Individual more difficult subject study direction.In computer vision field, BOVW models have been extensively studied and have been applied to image point In class and pattern recognition task.In recent years, BOVW models are also introduced in remote sensing target detection identification field, and are achieved Good performance.But the method for being proposed at present is both without the remote sensing image before examinations recognizer, to large format Pre-processed the prediction i.e. to targets of interest region to process, also fail to fully capture institute's extraction feature in the feature coding stage Between similarity.This means that respectively a large amount of insignificant redundant computations lose the negative effect to detecting recognition performance with information. More importantly it is that these methods or model are both for what a certain specific object was proposed, and this is just to a great extent Limit their range of application.At the same time, existing document points out that the method for notable figure can be used to predict that potential candidate is emerging The regional location of interesting object, this is because targets of interest thing generally has significantly different on vision mechanism from background.Together When, input data, by kernel function, stealthyly is mapped to high dimensional feature empty by the method that existing document proposes nuclear sparse expression Between, while ensureing to become more sparse in the sparse of input data of feature space, this sparse property is sparse for correcting The defect of expression, the distinctive and lifting classification performance that improve between data have effect well.

However, above-mentioned method carries out pre- place both without before examinations recognizer to the remote sensing image of large format Reason is processed the prediction in targets of interest region, also fails to fully to be captured in the feature coding stage similar between institute's extraction feature Degree.This means that respectively a large amount of insignificant redundant computations lose the negative effect to detecting recognition performance with information.More increase Want, the method or model of the above are both for what a certain specific object was proposed, and this is just limited to a great extent Their range of application.

The content of the invention

Goal of the invention：It is an object of the invention to provide a kind of can solve the problem that the dilute based on core of defect present in prior art Dredge the rapid remote sensing target detection recognition methods for representing.

Technical scheme：Rapid remote sensing target detection recognition methods based on nuclear sparse expression of the present invention includes following Step：

S1：Create four RGB feature passages；

S2：Four phase Fourier transformations of four feature passages of given image are calculated, phase spectrum is extracted, by inverse Fourier The image of four feature passages of transform reconstruction, so as to generate notable figure；

S3：The notable figure obtained to step S2 carries out binaryzation division, extracts candidate's interest region；

S4：Testing image block is obtained by effective sub- window searching algorithm scanning search frame, remote sensing target image block instruction is formed Practice collection；

S5：SIFT feature extraction is carried out to remote sensing target image block training set, sparse dictionary is generated；

S6：SIFT feature is mapped using spatial pyramid；

S7：Nuclear sparse expression；

S8：Solve nuclear sparse expression；

S9：The spatial pyramid vector representation of target；

S10：Identification mission is completed with reference to linear support vector cassification algorithm.

Further, in step S7 shown in nuclear sparse expression such as formula (1)：

Wherein v=[v₁,v₂,v₃,…,v_N]∈R^N×KFor sparse coding coefficient, ξ is reconstructed residual,K (■, ■) be mercer kernel functions, x=[x₁,x₂,x₃,…,x_N]∈R^N×DFor The SIFT feature of training image collection,For the sparse dictionary after mapping,For the feature after mapping Collection, U=[u₁,u₂,u₃,…,u_K]∈R^K×DFor visual dictionary.

Further, in step S8, using based on the orthogonal matching pursuit algorithm of principal component analysis nuclear sparse expression is solved： Setting 0<ξ ＜＜ 1,0<ω ＜＜ K, 0<H≤K, K are degree of rarefication, and h is that principal component analyses the dimension after dimensionality reduction, calculates sparse dictionary Nuclear matrix Q ∈ R^K×KCorresponding regularization characteristic vector, for each regularization characteristic vector, calculates its core vector Y ∈ R¹ ^×K, then dimensionality reduction is carried out to Q and Y by matrix B, as a result respectively A=QB^T=(a₁,a₂,a₃,…,a_K)∈R^K×hAnd y=YB^T∈ R^1×h, then using the new matrix A for producing as dictionary, vectorial y generates sparse coefficient v ∈ as input feature vector with OMP algorithms R^1×KRepresent as image vector.

Further, in step S3, the method for extracting candidate's interest region is：Binaryzation notable figure, will be wherein positive Pixel number exceedes the region of the threshold value of setting as candidate's interest region.

Further, the threshold value is set according to formula (2)：

Threshold=Mean+n × Variance, (2)

Wherein Mean is the average of image intensity value, and Variance is the variance of image intensity value, and n is artificial setting value.

Further, in step S5, SIFT feature is calculated using K-means++ clustering algorithms, generates sparse dictionary.

Further, in step S9, the spatial pyramid vector representation of target is dilute to core using maximum assemblage method Thin coding carries out final convergence, shown in maximum assemblage method therein such as formula (3)：

Max:R=max (| v₁|,|v₁|,|v₂|,|v₃|,…,|v_M|) (3) wherein v_iIt is V=that step S8 is obtained (v₁,v₂,v₃,…,v_M)∈R^M×(21K)In element, 1≤i≤M, M for SIFT feature number, K is degree of rarefication, r ∈ R^1×(21K) For the final coding vector of the image block.

Beneficial effect：Compared with prior art, the present invention has following beneficial effect：

(1) present invention has been extracted with predictive candidate's interest region, is accelerated the speed of target detection identification, is carried High target identification precision；

(2) view-based access control model dictionary of the present invention reduces redundant computation, accelerates with nuclear sparse expression image feature representation model Iamge description speed；

(3) present invention accelerates nuclear sparse expression.

Description of the drawings

Fig. 1 is the method schematic diagram of the specific embodiment of the invention；

The schematic diagram that Fig. 2 maps for the spatial pyramid of the specific embodiment of the invention.

Specific embodiment

This specific embodiment discloses a kind of rapid remote sensing target detection recognition methods based on nuclear sparse expression, such as schemes Shown in 1, comprise the following steps：

S1：Four RGB features passage RG (t), BY (t), I (t) and M (t) are created, as shown in formula (1)-(4)；

RG (t)=R (t)-G (t) (1)

BY (t)=R (t)-G (t) (2)

M (t)=| I (t)-I (t- τ) | (4)

Wherein, the RGB triple channels of r (t), g (t), b (t) difference representative image, R (t), G (t) and B (t) such as formulas (5)- (7) shown in, I (t) represents the image of t；

S2：Four phase Fourier transformations of four feature passages of given image are calculated, phase spectrum is extracted, by inverse Fourier The image of four feature passages of transform reconstruction, so as to generate notable figure.

S3：The notable figure obtained to step S2 carries out binaryzation division, extracts candidate's interest region；Extract candidate's region of interest The method in domain is：Binaryzation notable figure, wherein positive pixel number is exceeded the region of the threshold value of setting as candidate's interest Region；Threshold value therein is set according to formula (8)：

Threshold=Mean+n × Variance, (8)

Wherein Mean is the average of image intensity value, and Variance is the variance of image intensity value, and n is artificial setting value. Therefore, when certain region of the inswept view picture remote sensing image of sliding window, if in the corresponding binaryzation region of the image just Pixel number is more than Threshold, then extract the image block in now scanning window as candidate's interest region, the otherwise area Domain is considered as background area.

S4：Testing image block is obtained by effective sub- window searching algorithm (ESS algorithms) scanning search frame, remote sensing target is formed Image block training set；

S5：SIFT feature extraction is carried out to remote sensing target image block training set, is calculated using K-means++ clustering algorithms SIFT feature, generates sparse dictionary.；

S6：SIFT feature is mapped using spatial pyramid：For the SIFT feature of image is given birth to by clustering algorithm Into length be the dictionary of K, each image is divided into into 3 tomographic image sub-block such as Fig. 2 according to 1 × 1,2 × 2,4 × 4, and for each layer Weights are assigned, weight is followed successively by 1/4,1/4,1/2.Dictionary table is carried out to the feature of each sub-block and is connected into 21K length up to after converging Dictionary represent.

S7：Nuclear sparse expression：By the way that simultaneously input feature vector and dictionary base are mapped to into higher-dimension or infinite dimensional feature space Middle F carries out similarity mode so as to obtain more distinctive sparse coefficient.Shown in nuclear sparse expression such as formula (9)：

S8：Solve nuclear sparse expression：Core is solved using based on the orthogonal matching pursuit algorithm (KPOMP algorithms) of principal component analysis Rarefaction representation：Setting 0<ξ《1,0<ω《K, 0<H≤K, K are degree of rarefication, and h is that principal component analyses the dimension after dimensionality reduction, calculates sparse word The nuclear matrix Q ∈ R of allusion quotation^K×KCorresponding regularization characteristic vector, for each regularization characteristic vector, calculates its core vector Y ∈ R^1×K, then dimensionality reduction is carried out to Q and Y by matrix B, as a result respectively A=QB^T=(a₁,a₂,a₃,…,a_K)∈R^K×hAnd y=YB^T ∈R^1×h, then using the new matrix A for producing as dictionary, vectorial y generates sparse coefficient v as input feature vector with OMP algorithms ∈R^1×KRepresent as image vector.

S9：The spatial pyramid vector representation of target：The spatial pyramid vector representation of target is using maximum convergence side Method carries out final convergence to core sparse coding, shown in maximum assemblage method therein such as formula (10)：

Max:R=max (| v₁|,|v₁|,|v₂|,|v₃|,…,|v_M|) (10)

Wherein v_iIt is V=(v that step S8 is obtained₁,v₂,v₃,…,v_M)∈R^M×(21K)In element, 1≤i≤M, M is SIFT The number of feature, K is degree of rarefication, r ∈ R^1×(21K)For the final coding vector of the image block.

Claims

1. a kind of rapid remote sensing target detection recognition methods based on nuclear sparse expression, it is characterised in that：Comprise the following steps：

S1：Create four RGB feature passages；

S2：Four phase Fourier transformations of four feature passages of given image are calculated, phase spectrum is extracted, by inverse Fourier transform The image of four feature passages is rebuild, so as to generate notable figure；

S4：Testing image block is obtained by effective sub- window searching algorithm scanning search frame, remote sensing target image block training set is formed；

S6：SIFT feature is mapped using spatial pyramid；

S7：Nuclear sparse expression；

S8：Solve nuclear sparse expression；

S9：The spatial pyramid vector representation of target；

2. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 1, it is characterised in that： Shown in nuclear sparse expression in step S7 such as formula (1)：

\hat{v} = \arg \min_{v} | | v | |_{1}, | | Y - v Q | |_{2} \leq ξ - - - (1)

WhereinFor sparse coding coefficient, ξ is reconstructed residual, K (■, ■) be mercer kernel functions, x=[x₁,x₂,x₃,…,x_N]∈R^N×DFor the SIFT feature of training image collection,For the sparse dictionary after mapping,For the feature set after mapping, U=[u₁,u₂,u₃,…,u_K]∈R^K×DFor visual dictionary.

3. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 2, it is characterised in that： In step S8, using based on the orthogonal matching pursuit algorithm of principal component analysis nuclear sparse expression is solved：Setting 0<ξ ＜＜ 1,0< ω ＜＜ K, 0<H≤K, K are degree of rarefication, and h is that principal component analyses the dimension after dimensionality reduction, calculates the nuclear matrix Q ∈ R of sparse dictionary^K×KIt is right The regularization characteristic vector answered, for each regularization characteristic vector, calculates its core vector Y ∈ R^1×K, then by matrix B to Q Dimensionality reduction is carried out with Y, as a result respectively A=QB^T=(a₁,a₂,a₃,…,a_K)∈R^K×hAnd y=YB^T∈R^1×h, then newly producing Matrix A as dictionary, vectorial y generates sparse coefficient v ∈ R as input feature vector with OMP algorithms^1×KAs image vector table Show.

4. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 1, it is characterised in that： In step S3, the method for extracting candidate's interest region is：Binaryzation notable figure, wherein positive pixel number is exceeded and sets The region of fixed threshold value is used as candidate's interest region.

5. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 4, it is characterised in that： The threshold value is set according to formula (2)：

Threshold=Mean+n × Variance, (2)

6. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 1, it is characterised in that： In step S5, SIFT feature is calculated using K-means++ clustering algorithms, generate sparse dictionary.

7. the rapid remote sensing target detection recognition methods based on nuclear sparse expression according to claim 1, it is characterised in that： In step S9, the spatial pyramid vector representation of target is carried out final using maximum assemblage method to core sparse coding Converge, shown in maximum assemblage method therein such as formula (3)：

Max:R=max (| v₁|,|v₁|,|v₂|,|v₃|,…,|v_M|) (3)

Wherein v_iIt is V=(v that step S8 is obtained₁,v₂,v₃,…,v_M)∈R^M×(21K)In element, 1≤i≤M, M is SIFT feature Number, K is degree of rarefication, r ∈ R^1×(21K)For the final coding vector of the image block.