CN109063750B - SAR target classification method based on CNN and SVM decision fusion - Google Patents

SAR target classification method based on CNN and SVM decision fusion

Info

Publication number
CN109063750B
CN109063750B (granted from application CN201810781174.4A)
Authority
CN
China
Prior art keywords
layer
matrix
column vector
size
update
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810781174.4A
Other languages
Chinese (zh)
Other versions
CN109063750A (en)
Inventor
Zhang Lei (张磊)
Li Qingwei (李青伟)
Liu Hongwei (刘宏伟)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Xian Cetc Xidian University Radar Technology Collaborative Innovation Research Institute Co Ltd
Original Assignee
Xidian University
Xian Cetc Xidian University Radar Technology Collaborative Innovation Research Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University, Xian Cetc Xidian University Radar Technology Collaborative Innovation Research Institute Co Ltd filed Critical Xidian University
Priority to CN201810781174.4A priority Critical patent/CN109063750B/en
Publication of CN109063750A publication Critical patent/CN109063750A/en
Application granted granted Critical
Publication of CN109063750B publication Critical patent/CN109063750B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06F — ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 — Pattern recognition
    • G06F 18/20 — Analysing
    • G06F 18/24 — Classification techniques
    • G06F 18/241 — Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F 18/2411 — Classification techniques relating to the classification model based on the proximity to a decision surface, e.g. support vector machines
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 — Computing arrangements based on biological models
    • G06N 3/02 — Neural networks
    • G06N 3/04 — Architecture, e.g. interconnection topology
    • G06N 3/045 — Combinations of networks

Abstract

The invention discloses an SAR target classification method based on decision fusion of CNN and SVM, whose idea is as follows: determine an SAR image set comprising λ classes of targets and divide it to obtain a training set and a test set; standardize the test set and divide it into subsets, obtaining a first test subset testx1, a second test subset testx2 and a third test subset testx3; then obtain the normalized feature matrix traindata of the standardized training set trainx and the normalized feature matrices testdata1, testdata2 and testdata3 of testx1, testx2 and testx3, and from these obtain a parameter matrix W and an offset vector bb; determine a first column vector C1, a second column vector C2 and a third column vector C3, and from them the final prediction category column vector; finally, compute the probability P(v1, v2) that a class-v2 target is identified as class v1 and record it as the SAR target classification result based on CNN and SVM decision fusion.

Description

SAR target classification method based on CNN and SVM decision fusion
Technical Field
The invention belongs to the technical field of radars, and particularly relates to an SAR target classification method based on decision fusion of CNN and SVM, which is suitable for identifying and classifying radar targets.
Background
The support vector machine (SVM) is a common machine learning classification model with good performance on small-sample, nonlinear and high-dimensional classification and recognition tasks. The SVM is a supervised classifier that learns a hyperplane from training data which completely separates targets of different classes while maximizing the margin, i.e., the sum of the distances from the hyperplane to the closest points on either side of it. If the targets are linearly inseparable, a suitable kernel function can map them from the low-dimensional space to a high-dimensional space in which a separating hyperplane can be found. By introducing slack variables, a small number of misclassified samples are tolerated while the margin is maximized, which suits practical tasks better and reduces the influence of overfitting.
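As a concrete illustration of the soft-margin idea just described, the following minimal sketch trains a linear SVM with hinge loss by subgradient descent; it is not the classifier used later in the method, and the data, learning rate and regularization coefficient are illustrative assumptions.

```python
import numpy as np

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Minimal soft-margin linear SVM (hinge loss + L2 penalty).

    X: (N, F) feature matrix; y: (N,) labels in {-1, +1}.
    Returns weight vector w and bias b of the hyperplane w.x + b = 0.
    """
    n, f = X.shape
    w, b = np.zeros(f), 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        mask = margins < 1              # samples violating the margin
        # subgradient of  lam*||w||^2 + mean(max(0, 1 - y*(w.x + b)))
        gw = 2 * lam * w - (y[mask, None] * X[mask]).sum(axis=0) / n
        gb = -y[mask].sum() / n
        w -= lr * gw
        b -= lr * gb
    return w, b

# toy usage: two linearly separable clusters
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (50, 2)), rng.normal(2, 1, (50, 2))])
y = np.hstack([-np.ones(50), np.ones(50)])
w, b = train_linear_svm(X, y)
print((np.sign(X @ w + b) == y).mean())  # training accuracy
```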
The convolutional neural network (CNN) is a feedforward neural network generally comprising an input layer, convolutional layers, activation layers, pooling layers, fully-connected layers and an output layer. CNNs have achieved great success in image processing: they avoid complex preprocessing of images, accept raw images directly, and deliver good recognition performance. A conventional network treats an image as one or more two-dimensional arrays flattened into vectors and fully connects the feature maps of adjacent layers, so the number of parameters to be determined during training is enormous, training takes very long, and may even fail; the CNN avoids this problem through local connections and weight sharing. The CNN extracts the various features of the image with its convolutional layers and downsamples the input data with its pooling layers, reducing the number of parameters of the model and the degree of overfitting. A convolutional layer extracts a patch of the image through a sliding window, convolves the patch with the convolution kernel, and sums the resulting products into one output value of the layer.
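The sliding-window convolution described above can be written down directly. A minimal sketch assuming stride 1 and "valid" borders, so that an a × a input and a k × k kernel produce an (a−k+1) × (a−k+1) output, matching the size rule d_i = d_{i−1} − k + 1 used later in step 2.2:

```python
import numpy as np

def conv2d_valid(image, kernel, stride=1):
    """'Valid' 2-D convolution by an explicit sliding window:
    each k x k patch is multiplied element-wise with the kernel
    and the products are summed into one output pixel."""
    k = kernel.shape[0]
    out = (image.shape[0] - k) // stride + 1
    result = np.zeros((out, out))
    for r in range(out):
        for c in range(out):
            patch = image[r*stride:r*stride + k, c*stride:c*stride + k]
            result[r, c] = np.sum(patch * kernel)
    return result

img = np.arange(32 * 32, dtype=float).reshape(32, 32)
ker = np.ones((5, 5)) / 25.0          # 5x5 averaging kernel
print(conv2d_valid(img, ker).shape)   # (28, 28), i.e. 32 - 5 + 1
```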
When an SVM is used for classification, the feature extraction applied to the data strongly affects the final result: an appropriate feature extraction method can greatly improve the classification accuracy, whereas an inappropriate one can degrade it.
Moreover, when SAR target images are classified with an SVM, two different targets may look similar at certain azimuth angles, so recognition near those angles is not ideal.
Disclosure of Invention
The invention aims to provide a method for classifying SAR targets based on decision fusion of CNN and SVM, so as to improve the recognition rate of SAR images and enable different types of target images to be classified correctly.
The main idea of the invention is as follows: take one part of the SAR image data set as the training set and another part as the test set, and standardize the size of every picture in both sets; input the standardized training data into an untrained CNN network for training, fine-tuning it with the back-propagation (BP) algorithm; input the standardized test set into the trained CNN network, feed the CNN output into the SVM for decision fusion classification, and finally obtain the prediction category of the target.
In order to achieve the technical purpose, the invention is realized by adopting the following technical scheme.
A SAR target classification method based on CNN and SVM decision fusion comprises the following steps:
step 1, determining an SAR image set, wherein the SAR image set comprises λ classes of targets, and dividing the λ classes of targets to obtain a training set and a test set; then standardizing the test set and dividing it into subsets to obtain a first test subset testx1, a second test subset testx2 and a third test subset testx3; wherein λ ≥ 2;
step 2, setting a CNN network and training to obtain a trained CNN network;
step 3, obtaining the normalized feature matrix traindata of the standardized training set, the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3 according to the first test subset testx1, the second test subset testx2, the third test subset testx3 and the training set;
step 4, training the normalized feature matrix traindata of the standardized training set trainx with a traditional SVM classification algorithm to obtain a parameter matrix W and an offset vector bb;
step 5, calculating prediction category column vectors of the first test subset testx1, the second test subset testx2 and the third test subset testx3 according to results obtained in the step 3 and the step 4, and respectively marking the prediction category column vectors as a first column vector C1, a second column vector C2 and a third column vector C3;
step 6, obtaining a final prediction category column vector according to the first column vector C1, the second column vector C2 and the third column vector C3;
step 7, obtaining, from the final prediction category column vector, the probability P(v1, v2) that a class-v2 target is identified as class v1, where 1 ≤ v1 ≤ m, 1 ≤ v2 ≤ m, 2 ≤ m ≤ λ; the probability P(v1, v2) that a class-v2 target is identified as class v1 is the SAR target classification result based on CNN and SVM decision fusion.
The invention has the following advantages:
First, the invention uses a CNN network to extract features from the data, which extracts the features well and reduces the interference of noise and similar effects on classification.
Second, owing to the SVM classifier, the method performs well on the classification and recognition of small samples.
Third, decision fusion of the prediction results of the same target at different azimuth angles further improves the final classification performance.
Drawings
The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
FIG. 1 is a flow chart of an SAR target classification method based on decision fusion of CNN and SVM in the present invention;
FIG. 2 is a graph of the results of object classification using the method of the present invention.
Detailed Description
Referring to fig. 1, the flowchart of the decision fusion classification method based on SVM and CNN of the present invention, the method comprises the following steps:
Step 1: data initialization.
1.1) The invention is directed to target recognition in radar SAR images, where the target class of each SAR image is known. Determine an SAR image set comprising λ classes of targets, λ ≥ 2; each class of target corresponds to a label, giving the labels y_tr1, y_tr2, …, y_trλ of the λ classes. Each class contains at least one target, and all targets within a class cover the same azimuth and pitch angles.
SAR imaging of the λ classes of targets is carried out at pitch angles from σ1 to σ2 with interval σ3 and azimuth angles from ξ1 to ξ2 with interval ξ3, yielding a set of SAR images, where 0° ≤ σ1 ≤ 90°, 0° ≤ σ2 ≤ 90°, σ1 ≤ σ2, σ3 > 0, 0° ≤ ξ1 ≤ 360°, 0° ≤ ξ2 ≤ 360°, ξ1 ≤ ξ2, ξ3 > 0. Each SAR image contains one target and has size c × t, with c ≥ a and t ≥ a.
From these SAR images, select those with pitch angle θ1 and azimuth angles from ξ1 to ξ2 at azimuth interval ξ4, obtaining the SAR images of the m classes of targets, whose corresponding labels are y_tr1, y_tr2, …, y_trm; take these SAR images together with the labels of the m classes of targets as the training set, where σ1 ≤ θ1 ≤ σ2, 0 < ξ4 ≤ ξ3, 2 ≤ m ≤ λ.
Likewise, select the SAR images with pitch angle θ2 and azimuth angles from ξ1 to ξ2 at azimuth interval ξ4, obtaining the SAR images of the m′ classes of targets, whose corresponding labels are y_tr1, y_tr2, …, y_trm′; take these SAR images together with the labels of the m′ classes of targets as the test set, where σ1 ≤ θ2 ≤ σ2, θ1 ≠ θ2, and m′ has the same value as m.
Import the training set and the test set into the commercial software MATLAB and standardize the size of each SAR image in both sets to a × a, obtaining the standardized training set trainx and the standardized test set testx; trainx comprises the standardized SAR images and the labels corresponding to the m classes of targets, and testx comprises the standardized SAR images and the labels corresponding to the m′ classes of targets.
1.2) According to the azimuth angle of each standardized SAR image in the standardized test set testx, divide testx into three test subsets: a first test subset testx1, a second test subset testx2 and a third test subset testx3. testx1 comprises the labels corresponding to m1′ classes of targets, recorded as the label column vector y_te1 of testx1; testx2 comprises the labels corresponding to m2′ classes of targets, recorded as the label column vector y_te2 of testx2; testx3 comprises the labels corresponding to m3′ classes of targets, recorded as the label column vector y_te3 of testx3.
The azimuth angles corresponding to testx1, testx2 and testx3 are staggered by Δξ, the azimuth interval of each standardized SAR image in the first test subset testx1, with j indexing the standardized SAR images in testx, so that the three subsets contain the same targets observed at three adjacent azimuth angles. The label column vectors y_te1, y_te2 and y_te3 have dimensions matching their respective subsets.
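A minimal sketch of the subset division of step 1.2, under the assumption, consistent with the decision fusion of step 9, that the test images are ordered by azimuth and every three consecutive azimuth samples are dealt to the three subsets; all variable names are illustrative.

```python
import numpy as np

def split_by_azimuth(testx, yte, azimuths):
    """Split a standardized test set into three subsets whose azimuth
    angles are staggered by one azimuth interval, so each subset sees
    the same targets at adjacent azimuth angles."""
    order = np.argsort(azimuths)              # sort images by azimuth
    testx, yte = testx[order], yte[order]
    subsets = [(testx[i::3], yte[i::3]) for i in range(3)]
    (testx1, yte1), (testx2, yte2), (testx3, yte3) = subsets
    return (testx1, yte1), (testx2, yte2), (testx3, yte3)

# toy usage: 36 images at azimuths 0, 10, ..., 350 degrees
images = np.zeros((36, 32, 32))
labels = np.arange(36) % 3
az = np.arange(0, 360, 10)
(t1, y1), (t2, y2), (t3, y3) = split_by_azimuth(images, labels, az)
print(t1.shape, t2.shape, t3.shape)   # (12, 32, 32) each
```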
Step 2: the CNN network is initialized.
2.1) Set up a CNN network comprising n+1 layers in total, where n is a positive integer greater than 3. Layer 0 (type0) is the input layer; layer i (typei), 1 ≤ i ≤ n−2, is either a convolutional layer or a pooling layer; layer n−1 (typen−1) is a fully-connected layer; layer n (typen) is the output layer. The total number of feature maps of layer i is p_i, a positive integer greater than 0; like n, p_i can be set arbitrarily.
The total number of feature maps of layer 0 is p_0 = 1, that of layer n−1 is p_{n−1} = 1, and that of layer n is p_n = 1; a pooling layer has the same number of feature maps as the layer above it. Each feature map in layer i has size d_i × d_i, d_i ≤ a; each feature map in layer n−1 has size (d_{n−2}·d_{n−2}·p_{n−2}) × 1, and each feature map in layer n has size λ × 1.
If layer i (typei) is a convolutional layer, layer i is connected to layer i−1 through convolution kernels; all convolution kernels in the CNN network have the same size k × k, k ≤ d_i, and each kernel slides by a stride of q per step, 1 ≤ q ≤ k. Set the learning rate to α, α > 0.
If layer i (typei) is a pooling layer, set the sampling interval of the pooling layer to sc, 0 < sc ≤ d_{i−1}, where d_{i−1} × d_{i−1} is the size of each feature map in layer i−1, d_{i−1} ≤ a; all pooling layers have the same sampling interval. At each update, b standardized SAR images are input, where b is a divisor of the total number of standardized SAR images in the standardized training set trainx.
Initialization: let k_i^{h1,h2}(l) denote the convolution kernel connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l-th update; its initial value k_i^{h1,h2}(0) is a k × k matrix whose k·k elements are random numbers between −H_i and H_i, where 0 ≤ h1 ≤ p_{i−1}, 0 ≤ h2 ≤ p_i, p_{i−1} denotes the total number of feature maps of layer i−1 and p_i that of layer i; the bound H_i is computed from the convolution kernel size k × k and the numbers of feature maps of layers i−1 and i (its formula is given as an image in the original).
2.2) From the layer type typei and the size d_{i−1} of each feature map in layer i−1, compute the size d_i × d_i of each feature map in layer i, 1 ≤ i ≤ n−2.
If layer i (typei) is a convolutional layer:
d_i = d_{i−1} − k + 1
If layer i (typei) is a pooling layer:
d_i = d_{i−1} / sc
Layer 0 (type0) is the input layer and layer 1 (type1) is a convolutional layer connected to it through convolution kernels, so the recursion starts from the input size, d_0 = a.
2.3) Initialization of the full-connection matrix initial value ffw0 between layer n−1 and layer n and of the offset vector initial value ffb0 between layer n−1 and layer n.
ffw0 is a (d_n × (d_{n−1}·d_{n−1}·p_{n−1})) matrix; each element of ffw0 is a random number between −HH and HH, where the bound HH is computed from the sizes of layers n−1 and n (its formula is given as an image in the original).
ffb0 is a d_n × b matrix; each element of ffb0 is 0.
Initialization of the update counter: l denotes the l-th update; its initial value is 0 and its maximum value is the total number of standardized SAR images in trainx divided by b.
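The bounds H_i and HH appear only as image formulas in the original; a common choice consistent with the quantities they are said to involve is the Glorot/Xavier uniform bound sqrt(6/(fan_in + fan_out)). The sketch below assumes that form and should be read as an assumption, not as the patent's exact formula.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_conv_kernel(k, p_prev, p_cur):
    """Kernel connecting one feature map of layer i-1 to one of layer i:
    a k x k matrix of uniform values in [-H_i, H_i] (Glorot-style bound
    assumed: fan_in = p_prev*k*k, fan_out = p_cur*k*k)."""
    h = np.sqrt(6.0 / (p_prev * k * k + p_cur * k * k))
    return rng.uniform(-h, h, size=(k, k))

def init_fc(n_out, n_in):
    """Full-connection matrix ffw0 with uniform values in [-HH, HH]
    (same Glorot-style assumption) and zero offset vector ffb0."""
    hh = np.sqrt(6.0 / (n_in + n_out))
    return rng.uniform(-hh, hh, size=(n_out, n_in)), np.zeros((n_out, 1))

kernel = init_conv_kernel(k=5, p_prev=6, p_cur=12)   # layer sizes from the simulation
ffw0, ffb0 = init_fc(n_out=3, n_in=300)              # 300 flattened features, 3 classes
print(kernel.shape, ffw0.shape, ffb0.shape)
```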
And step 3: CNN network forward input
B standardized SAR images are randomly selected in the first time in a standardized training set trainx to form a data set B after the first time of updatinglAnd updating the data set B for the first timelDeleting the data from the standardized training set trainx, and updating the data set B for the first timelInputting the data into the CNN network set in the step 2, carrying out forward propagation to obtain values of each layer of neural network, and finally obtaining the first time of updatingOutput o of new CNN networkl
3.1) If layer i (typei) is a convolutional layer, each feature map in layer i has size d_i × d_i. Let the value of the p_m-th feature map in layer i be the d_i × d_i matrix M_i^{pm}, and the value of the p_n-th feature map in layer i−1 be the d_{i−1} × d_{i−1} matrix M_{i−1}^{pn}, where 1 ≤ p_m ≤ p_i and 1 ≤ p_n ≤ p_{i−1}.
Let Q(α1, α2) be the k × k submatrix of M_{i−1}^{pn} spanning rows (α1−1)·q+1 through (α1−1)·q+k and columns (α2−1)·q+1 through (α2−1)·q+k, where 1 ≤ α1 ≤ d_i, 1 ≤ α2 ≤ d_i, d_i > k, q > 0. The element in row α1 and column α2 of M_i^{pm} is obtained by convolving Q(α1, α2) of every feature map p_n of layer i−1 with the corresponding convolution kernel, summing the results over p_n, adding the convolutional layer bias, and applying the activation function (the explicit formula is given as an image in the original; a sigmoid activation is consistent with the derivative form o·(1−o) used in step 4.3). All feature maps in a convolutional layer have the same size; ⊛ denotes the convolution operation.
3.2) If layer i (typei) is a pooling layer, each feature map in layer i has size d_i × d_i. Let the value of the p_m-th feature map in layer i be the d_i × d_i matrix M_i^{pm}, and the value of the p_m-th feature map in layer i−1 be the d_{i−1} × d_{i−1} matrix M_{i−1}^{pm}, 1 ≤ p_m ≤ p_i.
Let M_i^{pm}(α3, α4) denote the element in row α3 and column α4 of M_i^{pm}; it is computed from the sc × sc block of M_{i−1}^{pm} whose top-left element lies in row 2·α3−1 and column 2·α4−1 (for sc = 2), where 1 ≤ α3 ≤ d_{i−1}/2 and 1 ≤ α4 ≤ d_{i−1}/2 (the explicit pooling formula is given as an image in the original). All feature maps in a pooling layer have the same size.
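The pooling formula itself appears as an image in the original; the block indexing above (a block of side sc whose top-left element is at row 2·α3−1 for sc = 2) is consistent with non-overlapping mean pooling, which the following sketch assumes.

```python
import numpy as np

def mean_pool(fmap, sc):
    """Downsample a feature map by averaging non-overlapping sc x sc
    blocks: output element (a3, a4) is the mean of the block whose
    top-left corner is ((a3-1)*sc+1, (a4-1)*sc+1) in 1-based indexing."""
    d = fmap.shape[0] // sc
    return fmap[:d*sc, :d*sc].reshape(d, sc, d, sc).mean(axis=(1, 3))

fmap = np.arange(28 * 28, dtype=float).reshape(28, 28)
print(mean_pool(fmap, 2).shape)   # (14, 14), matching the simulation's layers
```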
3.3) Let the value of the p_m-th feature map in layer n−2 be the d_{n−2} × d_{n−2} matrix M_{n−2}^{pm}; layer n−2 has p_{n−2} feature maps, each of size d_{n−2} × d_{n−2}. The p_{n−2} feature maps of layer n−2 are unrolled into a vector fv that is stored in the p_{n−1} feature maps of layer n−1: elements (p_m−1)·(d_{n−2})² + (r−1)·d_{n−2} + 1 through (p_m−1)·(d_{n−2})² + r·d_{n−2} of fv correspond one to one, and are equal in value, to elements 1 through d_{n−2} of the r-th column of M_{n−2}^{pm}, where 1 ≤ r ≤ d_{n−2}. The vector fv is a (p_{n−2}·d_{n−2}·d_{n−2}) × 1 column vector.
3.4) From the full-connection matrix initial value ffw0 between layer n−1 and layer n and the offset vector initial value ffb0 between layer n−1 and layer n, the final output matrix initial value o_0 is obtained as:
o_0 = ffw0·fv + ffb0
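Steps 3.3 and 3.4 amount to unrolling the last feature maps column by column into fv and applying one affine map; a minimal sketch, with the column-major unrolling reproducing the index correspondence of step 3.3:

```python
import numpy as np

def flatten_maps(feature_maps):
    """Unroll p feature maps of size d x d column by column into fv,
    so elements (pm-1)*d*d + (r-1)*d + 1 .. (pm-1)*d*d + r*d of fv hold
    column r of map pm (1-based), as in step 3.3."""
    # Fortran order stacks each map column by column.
    return np.concatenate([fm.flatten(order="F") for fm in feature_maps])

def fc_forward(ffw, ffb, fv):
    """Step 3.4: final output o = ffw . fv + ffb."""
    return ffw @ fv + ffb.ravel()

maps = [np.arange(25, dtype=float).reshape(5, 5) for _ in range(12)]
fv = flatten_maps(maps)                  # length 12*5*5 = 300
ffw = np.zeros((3, 300)); ffb = np.zeros(3)
print(fv.shape, fc_forward(ffw, ffb, fv).shape)
```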
Step 4: fine-tune the CNN network parameters according to the quantities of step 3 and the labels of the training data, comprising the following substeps:
4.1) The b standardized SAR images in the data set B_l of the l-th update contain b targets; let these b targets belong to η classes, whose corresponding labels y_tr1, y_tr2, …, y_trη form the label-set column vector y_η after the l-th update, where 1 ≤ η ≤ b.
Set up a λ × b zero matrix. If the element in column dd of the label-set column vector y_η has value y_η(dd), 1 ≤ dd ≤ b, change the element of the λ × b zero matrix in row σ = y_η(dd) and column dd to 1, leaving all other elements 0; this yields the label matrix y_ηl of the η classes of targets after the l-th update, where 1 ≤ σ ≤ λ.
4.2) From the final output matrix o_l after the l-th update and the label matrix y_ηl of the η classes of targets after the l-th update, compute the error matrix e_l after the l-th update:
e_l = o_l − y_ηl
4.3) From the error matrix e_l after the l-th update, compute the cost function L_l, the residual matrix od_l of layer n, and the residual matrix fvd_l of layer n−1 after the l-th update:
L_l = (1/(2b))·Σ_{μ=1..λ} Σ_{ρ=1..b} e_l(μ, ρ)²
od_l = e_l·(o_l·(1 − o_l))
fvd_l = ffw_l^T·od_l
where e_l(μ, ρ) denotes the element of e_l in row μ and column ρ, 1 ≤ μ ≤ λ, 1 ≤ ρ ≤ b, and the products in od_l are element-wise; ffw_l^T denotes the transpose of ffw_l, the full-connection matrix between layer n−1 and layer n after the l-th update.
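A minimal sketch of substeps 4.1 through 4.3: the one-hot label matrix, the error matrix, the squared-error cost and the output residual od_l = e_l·o_l·(1 − o_l) taken element-wise; the normalization of the cost by the batch size b is an assumption, the cost formula being an image in the original.

```python
import numpy as np

def one_hot(labels, num_classes):
    """Label matrix y_eta_l: a (lambda x b) matrix whose column dd has a 1
    in the row given by the dd-th label and 0 elsewhere (step 4.1)."""
    y = np.zeros((num_classes, labels.size))
    y[labels, np.arange(labels.size)] = 1.0
    return y

def output_residuals(o, y):
    """Steps 4.2-4.3: error e = o - y, squared-error cost, and the
    output residual od = e * o * (1 - o), all element-wise."""
    e = o - y
    b = o.shape[1]
    cost = 0.5 * np.sum(e ** 2) / b      # assumed batch-mean normalization
    od = e * o * (1.0 - o)
    return e, cost, od

o = np.full((3, 18), 0.5)                # 3 classes, batch of b = 18
y = one_hot(np.zeros(18, dtype=int), 3)
e, cost, od = output_residuals(o, y)
print(cost, od.shape)
```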
4.4) From the above quantities, compute the sensitivity matrix of the p_m-th feature map in layer i.
For layer n−1: the residual matrix fvd_l of layer n−1 after the l-th update is reshaped into d_{n−1} × d_{n−1} sensitivity matrices; the ss-th column of the sensitivity matrix of a feature map in layer n−1 is fvd_l((ss−1)·d_{n−1} : ss·d_{n−1}), i.e. the elements of fvd_l from row (ss−1)·d_{n−1} to row ss·d_{n−1}, where 1 ≤ ss ≤ d_{n−1} and p_{n−1} denotes the total number of feature maps of layer n−1.
The sensitivities are then propagated backwards through the layers i1 = n−1, n−2, …, 2, 1, layer 0 (type0) being the input layer and layer n (typen) the output layer.
If layer i1 (typei1) is a pooling layer, the sensitivity matrix of the h1-th feature map in layer i1 is obtained by convolving the sensitivity matrices of the feature maps of layer i1+1 with the correspondingly flipped convolution kernels and summing over the p_{i1+1} feature maps of layer i1+1 (the explicit formula is given as an image in the original); here k_{i1+1}^{h1,h2}(l) denotes the convolution kernel connecting the h1-th feature map in layer i1 and the h2-th feature map in layer i1+1 after the l-th update, and fp denotes the first setting function, which satisfies: if D1 = fp(D) with D a k × k matrix, then
D1(xx, yy) = D(k−xx+1, k−yy+1)
where D1(xx, yy) denotes the element of D1 in row xx and column yy, and D(k−xx+1, k−yy+1) the element of D in row k−xx+1 and column k−yy+1; that is, fp rotates the matrix by 180°.
If layer i1 (typei1) is a convolutional layer, the sensitivity matrix of the h1-th feature map in layer i1 is obtained by expanding the sensitivity matrix of the corresponding feature map in layer i1+1 with the second setting function ex (the explicit formula is given as an image in the original), where the value of each feature map in layer i1 is a matrix of the corresponding layer size; ex satisfies:
ex(Y) = Y ⊗ 1_{c1×c2}
the Kronecker product of Y with the c1 × c2 all-ones matrix 1_{c1×c2}, so that each element of Y is replicated over a c1 × c2 block; l1 and l2 denote the total numbers of rows and columns of Y, respectively.
4.5) From the above quantities, compute the gradient matrix dk_i^{h1,h2}(l) of the convolution kernel connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l-th update, the full-connection matrix gradient dffw_l after the l-th update, and the full-connection offset gradient dffb_l after the l-th update. dk_i^{h1,h2}(l) is obtained by convolving the h1-th feature map of layer i−1 with the sensitivity matrix of the h2-th feature map of layer i, averaged over the b images of the batch, and
dffw_l = od_l·fv^T / b
dffb_l = (1/b)·Σ_pp od_l(pp)
(the explicit formulas are given as images in the original). When l and i take the same values and h1 and h2 take different values, the convolution kernel gradient matrices connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after each update are all obtained in the same manner; b denotes the total number of standardized SAR images in each update's data set; fv^T denotes the transpose of fv; od_l(pp) denotes all elements of the pp-th column of the matrix od_l; od_l denotes the residual matrix of layer n after the l-th update.
Step 5: add 1 to the value of l and compute the convolution kernel k_i^{h1,h2}(l) connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l-th update, the convolutional layer bias b_l after the l-th update, the full-connection matrix ffw_l between layer n−1 and layer n after the l-th update, and the offset vector ffb_l between layer n−1 and layer n after the l-th update:
k_i^{h1,h2}(l) = k_i^{h1,h2}(l−1) − α·dk_i^{h1,h2}(l−1)
b_l = b_{l−1} − α·db_{l−1}
ffw_l = ffw_{l−1} − α·dffw_{l−1}
ffb_l = ffb_{l−1} − α·dffb_{l−1}
where dk_i^{h1,h2}(l−1) denotes the gradient matrix, after the (l−1)-th update, of the convolution kernel connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i; the value of the p_n-th feature map in layer i−1 is the d_{i−1} × d_{i−1} matrix M_{i−1}^{pn}, 1 ≤ i ≤ n−2; b_{l−1} denotes the convolutional layer bias after the (l−1)-th update, b_0 = 0, α > 0; db_{l−1} denotes the gradient of the offset vector after the (l−1)-th update; b denotes the total number of standardized SAR images in each update's data set.
Then return to step 3, and repeat until l reaches its maximum value l_max, the total number of standardized SAR images in trainx divided by b. This yields the convolution kernels k_i^{h1,h2}(l_max) connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l_max-th update for i = 1, 2, …, n−2 (from the kernels connecting layer 0 and layer 1 through the kernels connecting layer n−3 and layer n−2), the convolutional layer bias b_{l_max} after the l_max-th update, the full-connection matrix ffw_{l_max} between layer n−1 and layer n after the l_max-th update, and the offset vector ffb_{l_max} between layer n−1 and layer n after the l_max-th update, the bias having evolved from its initial value b_0, the full-connection matrix from its initial value ffw0, and the offset vector from its initial value ffb0. At this point the set CNN network has finished training and is recorded as the trained CNN network.
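Step 5 is plain gradient descent with learning rate α applied to every parameter, with batches drawn without replacement until trainx is exhausted. A minimal sketch of that loop; forward_backward stands in for steps 3 and 4 and is a hypothetical placeholder, not a function defined by the patent.

```python
import numpy as np

def sgd_train(params, trainx, labels, forward_backward, batch=18, alpha=1.0):
    """Step 5 as a loop: draw b images without replacement, compute the
    gradients of steps 3-4 (abstracted into forward_backward), and update
    every parameter as theta_l = theta_{l-1} - alpha * dtheta_{l-1}."""
    n = trainx.shape[0]
    order = np.random.default_rng(0).permutation(n)
    for l in range(n // batch):                     # l_max = n / b updates
        idx = order[l * batch:(l + 1) * batch]
        grads = forward_backward(params, trainx[idx], labels[idx])
        for name in params:                         # kernels, bias, ffw, ffb
            params[name] -= alpha * grads[name]
    return params

# toy usage with a dummy gradient function
params = {"ffw": np.zeros((3, 300)), "ffb": np.zeros((3, 1))}
dummy = lambda p, x, y: {k: np.zeros_like(v) for k, v in p.items()}
sgd_train(params, np.zeros((36, 300)), np.zeros(36), dummy)
```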
Step 6: extract the feature matrices of the training data and the test data with the trained CNN network.
6.1) Pass the standardized training set trainx, the first test subset testx1, the second test subset testx2 and the third test subset testx3 through the trained CNN network to obtain, respectively, the feature matrix trainx_new of the standardized training set trainx, the feature matrix testx1_new of the first test subset testx1, the feature matrix testx2_new of the second test subset testx2 and the feature matrix testx3_new of the third test subset testx3.
6.2) Normalize trainx_new, testx1_new, testx2_new and testx3_new with a data normalization algorithm to obtain, respectively, the normalized feature matrix traindata of the standardized training set trainx, the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3, calculated as:
traindata = (trainx_new − mean(trainx_new)) / std(trainx_new)
testdata1 = (testx1_new − mean(testx1_new)) / std(testx1_new)
testdata2 = (testx2_new − mean(testx2_new)) / std(testx2_new)
testdata3 = (testx3_new − mean(testx3_new)) / std(testx3_new)
where the mean function takes the column means of a matrix and the std function takes the column standard deviations of a matrix.
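Step 6.2 is the usual column-wise standardization; a minimal sketch using the column means and standard deviations named above:

```python
import numpy as np

def column_standardize(feat):
    """(feat - column mean) / column standard deviation, as in step 6.2."""
    return (feat - feat.mean(axis=0)) / feat.std(axis=0)

trainx_new = np.random.default_rng(0).normal(5.0, 2.0, size=(100, 300))
traindata = column_standardize(trainx_new)
print(traindata.mean(axis=0)[:3], traindata.std(axis=0)[:3])  # ~0 and ~1
```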
Step 7: train the normalized feature matrix traindata of the standardized training set trainx with a traditional SVM classification algorithm to obtain the parameter matrix W and the offset vector bb, where W is a p_{n−1} × m matrix and bb is a row vector of p_{n−1} elements; p_n denotes the total number of feature maps of layer n and p_{n−1} the total number of feature maps of layer n−1.
Step 8: from the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3, together with the parameter matrix W and the offset vector bb, compute the prediction category column vectors of testx1, testx2 and testx3, recorded respectively as the first column vector C1, the second column vector C2 and the third column vector C3:
fen1 = testdata1·W^T + repmat(bb, [size(testdata1,1), 1])
fen2 = testdata2·W^T + repmat(bb, [size(testdata2,1), 1])
fen3 = testdata3·W^T + repmat(bb, [size(testdata3,1), 1])
where size(testdata1,1) denotes the number of rows of the matrix testdata1, and repmat(bb, [size(testdata1,1), 1]) denotes a size(testdata1,1)-row, p_{n−1}-column matrix each of whose rows is the offset vector bb. fen1 is a size(testdata1,1) × p_{n−1} matrix in which each row has exactly one element of value 1, the remaining elements of the row being 0. If the element of value 1 in row ψ1 of fen1 lies in column Δ1, then C1(ψ1) = Δ1, where 1 ≤ ψ1 ≤ size(testdata1,1), 1 ≤ Δ1 ≤ p_{n−1}, and C1(ψ1) denotes the ψ1-th element of the first column vector C1; letting ψ1 = 1, 2, …, size(testdata1,1) yields the first column vector C1, which comprises size(testdata1,1) elements.
Analogously, size(testdata2,1) denotes the number of rows of testdata2, and repmat(bb, [size(testdata2,1), 1]) a size(testdata2,1)-row, p_{n−1}-column matrix each of whose rows is bb; fen2 is a size(testdata2,1) × p_{n−1} matrix in which each row has exactly one element of value 1, the remaining elements being 0. If the element of value 1 in row ψ2 of fen2 lies in column Δ2, then C2(ψ2) = Δ2, where 1 ≤ ψ2 ≤ size(testdata2,1) and 1 ≤ Δ2 ≤ p_{n−1}; letting ψ2 = 1, 2, …, size(testdata2,1) yields the second column vector C2, which comprises size(testdata2,1) elements.
Likewise, size(testdata3,1) denotes the number of rows of testdata3, and repmat(bb, [size(testdata3,1), 1]) a size(testdata3,1)-row, p_{n−1}-column matrix each of whose rows is bb; fen3 is a size(testdata3,1) × p_{n−1} matrix in which each row has exactly one element of value 1, the remaining elements being 0. If the element of value 1 in row ψ3 of fen3 lies in column Δ3, then C3(ψ3) = Δ3, where 1 ≤ ψ3 ≤ size(testdata3,1) and 1 ≤ Δ3 ≤ p_{n−1}; letting ψ3 = 1, 2, …, size(testdata3,1) yields the third column vector C3, which comprises size(testdata3,1) elements.
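A minimal sketch of step 8's scoring and class assignment. The patent describes each row of fen as containing a single 1 in the winning column; taking the argmax of the raw scores testdata·Wᵀ + bb is the equivalent operation assumed here, with numpy broadcasting standing in for MATLAB's repmat.

```python
import numpy as np

def predict_classes(testdata, W, bb):
    """fen = testdata . W^T + bb broadcast over rows (step 8);
    the predicted class of each row is the index of its largest score
    (1-based, matching the patent's column numbering)."""
    fen = testdata @ W.T + bb          # broadcasting replaces repmat
    return np.argmax(fen, axis=1) + 1

W = np.eye(3, 300)                      # illustrative parameter matrix
bb = np.zeros(3)
testdata1 = np.random.default_rng(0).normal(size=(10, 300))
C1 = predict_classes(testdata1, W, bb)
print(C1.shape, C1.min() >= 1)
```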
Step 9: decision fusion classification.
Decide each row of the first column vector C1, the second column vector C2 and the third column vector C3 to obtain the final prediction category column vector.
Take the values of the δ-th row elements of C1, C2 and C3, recorded respectively as C1(δ), C2(δ) and C3(δ), where 1 ≤ δ ≤ S and S is the total number of elements of the first column vector C1. Let C be the S-dimensional column vector to be obtained; the value C(δ) of its δ-th row element is determined as follows:
If C1(δ) = C2(δ) or C1(δ) = C3(δ), then:
C(δ) = C1(δ).
If C2(δ) = C3(δ), then:
C(δ) = C2(δ).
If C1(δ), C2(δ) and C3(δ) are pairwise different, one of the values C1(δ), C2(δ) and C3(δ) is selected at random as the value C(δ).
Letting δ = 1, 2, …, S yields the values C(1) through C(S) of the S-dimensional column vector C, recorded as the final prediction category column vector, which is an S-dimensional column vector.
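The decision rule of step 9 is a two-out-of-three majority vote with random tie-breaking, sketched directly below.

```python
import numpy as np

def fuse_votes(C1, C2, C3, rng=np.random.default_rng(0)):
    """Step 9: per row, keep any value on which two predictors agree;
    if all three disagree, pick one of them at random."""
    C = np.empty_like(C1)
    for d in range(C1.size):
        if C1[d] == C2[d] or C1[d] == C3[d]:
            C[d] = C1[d]
        elif C2[d] == C3[d]:
            C[d] = C2[d]
        else:
            C[d] = rng.choice([C1[d], C2[d], C3[d]])
    return C

print(fuse_votes(np.array([1, 1, 1]), np.array([1, 2, 2]), np.array([3, 2, 3])))
# -> [1 2 x], where x is a random pick among {1, 2, 3}
```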
Step 10: from the prediction result obtained in step 9, count the number N(v1, v2) of class-v2 targets identified as class v1, and compute the probability P(v1, v2) that a class-v2 target is identified as class v1.
Suppose the element in row ψ4 of the final prediction category column vector has value v1 and the element in row ψ4 of the label column vector y_te1 corresponding to the first test subset testx1 has value v2. Count, among the S elements of the final prediction category column vector, the total number of rows whose element value is v1, recorded as S1, S1 < S; then count, among those S1 rows, how many have value v2 in the corresponding rows of the label column vector y_te1, recorded as N(v1, v2), the number of class-v2 targets identified as class v1, where 1 ≤ ψ4 ≤ S. The probability P(v1, v2) that a class-v2 target is identified as class v1 is then computed from N(v1, v2) and S1 (the explicit formula is given as an image in the original), where 1 ≤ v1 ≤ m and 1 ≤ v2 ≤ m.
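A minimal sketch of step 10's statistic. The final normalization formula is an image in the original; the version below assumes P(v1, v2) = N(v1, v2)/S1, i.e. normalization by the number S1 of samples identified as class v1, since S1 is the quantity the step computes.

```python
import numpy as np

def confusion_probability(pred, true, v1, v2):
    """N(v1, v2): samples of true class v2 identified as class v1;
    S1: samples identified as class v1. Assumed P = N / S1."""
    s1 = np.sum(pred == v1)
    n_v1_v2 = np.sum((pred == v1) & (true == v2))
    return n_v1_v2 / s1 if s1 else 0.0

pred = np.array([1, 1, 2, 1, 3, 2])
true = np.array([1, 2, 2, 1, 3, 1])
print(confusion_probability(pred, true, 1, 2))   # 1 of 3 predicted-1 samples is class 2
```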
The invention is further illustrated below by means of a simulation example:
1) Setting the CNN network parameters and SVM classifier parameters:
The number of layers n of the CNN network is set to 7. The first layer is the input layer with one 32 × 32 feature map; the second layer is a convolutional layer with 6 feature maps of size 28 × 28; the third layer is a pooling layer with 6 feature maps of size 14 × 14 and sampling interval sc = 2; the fourth layer is a convolutional layer with 12 feature maps of size 10 × 10; the fifth layer is a pooling layer with 12 feature maps of size 5 × 5 and sampling interval sc = 2; the sixth layer is a vector-expansion layer with 300 feature maps of size 1 × 1; the seventh layer is the output layer. The convolution kernel size is 5 × 5, the kernel slides with stride 1, the learning rate is 1, and b = 18 pictures are input per update.
The SVM parameters are set as follows: the learning rate is 1.5, the regularization coefficient is 10^1.5, and a linear kernel function is adopted (consistent with the linear scoring fen = testdata·W^T + bb used in step 8).
2) Training set and test set
The training set consists of SAR images of three classes of tanks, T95, BT-7 and IS-4, at a pitch angle of 50° and azimuth angles from 0° to 360° at an azimuth interval of 10°; the test set consists of SAR images of the same three classes of tanks, T95, BT-7 and IS-4, at a pitch angle of 60° and azimuth angles from 0° to 360° at an azimuth interval of 10°.
3) Simulation content
Decision fusion classification was carried out on the training set and the test set with the method of the invention; the classification result is shown in fig. 2, the target classification result graph obtained with the method of the invention.
Some of the experimental data of the present invention are shown in Table 3, and the experimental result data of the present invention are shown in Table 4.
(Tables 3 and 4 are reproduced as images in the original publication.)
As can be seen from Tables 3 and 4, the method can accurately identify and classify SAR targets, and the classification accuracy is greatly improved compared with using the CNN only for feature extraction followed by SVM classification without decision fusion.
The foregoing description is only an example of the present invention and is not intended to limit the invention, so that it will be apparent to those skilled in the art that various modifications and variations in form and detail can be made therein without departing from the spirit and scope of the invention.

Claims (6)

1. A SAR target classification method based on CNN and SVM decision fusion is characterized by comprising the following steps:
step 1, determining an SAR image set, wherein the SAR image set comprises λ classes of targets, and dividing the λ classes of targets to obtain a training set and a test set; then standardizing the test set and dividing it into subsets to obtain a first test subset testx1, a second test subset testx2 and a third test subset testx3; wherein λ ≥ 2;
in step 1, the training set and the test set are obtained as follows:
the SAR image set comprises λ classes of targets, λ ≥ 2; each class of target corresponds to a label, giving the labels y_tr1, y_tr2, …, y_trλ of the λ classes; each class contains at least one target, and all targets within a class cover the same azimuth and pitch angles;
SAR imaging of the λ classes of targets is carried out at pitch angles from σ1 to σ2 with interval σ3 and azimuth angles from ξ1 to ξ2 with interval ξ3, yielding a set of SAR images, where 0° ≤ σ1 ≤ 90°, 0° ≤ σ2 ≤ 90°, σ1 ≤ σ2, σ3 > 0, 0° ≤ ξ1 ≤ 360°, 0° ≤ ξ2 ≤ 360°, ξ1 ≤ ξ2, ξ3 > 0; each SAR image comprises one target and has size c × t, with c ≥ a and t ≥ a;
from these SAR images, those with pitch angle θ1 and azimuth angles from ξ1 to ξ2 at azimuth interval ξ4 are selected, obtaining the SAR images of the m classes of targets, whose corresponding labels are y_tr1, y_tr2, …, y_trm; these SAR images together with the labels of the m classes of targets are taken as the training set, where σ1 ≤ θ1 ≤ σ2, 0 < ξ4 ≤ ξ3, 2 ≤ m ≤ λ;
likewise, those with pitch angle θ2 and azimuth angles from ξ1 to ξ2 at azimuth interval ξ4 are selected, obtaining the SAR images of the m′ classes of targets, whose corresponding labels are y_tr1, y_tr2, …, y_trm′; these SAR images together with the labels of the m′ classes of targets are taken as the test set, where σ1 ≤ θ2 ≤ σ2, θ1 ≠ θ2, and m′ has the same value as m;
the first test subset testx1, the second test subset testx2 and the third test subset testx3 are obtained as follows:
the size of each SAR image in the training set and the test set is standardized to a × a, obtaining the standardized training set trainx and the standardized test set testx, where trainx comprises the standardized SAR images and the labels corresponding to the m classes of targets, and testx comprises the standardized SAR images and the labels corresponding to the m′ classes of targets;
according to the azimuth angle of each standardized SAR image in testx, the standardized test set testx is divided into three test subsets, a first test subset testx1, a second test subset testx2 and a third test subset testx3, wherein testx1 comprises the labels corresponding to m1′ classes of targets, recorded as the label column vector y_te1 of testx1; testx2 comprises the labels corresponding to m2′ classes of targets, recorded as the label column vector y_te2 of testx2; and testx3 comprises the labels corresponding to m3′ classes of targets, recorded as the label column vector y_te3 of testx3;
Step 2, setting a CNN network and training to obtain a trained CNN network;
step 3, obtaining the normalized feature matrix traindata of the standardized training set, the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3 according to the first test subset testx1, the second test subset testx2, the third test subset testx3 and the training set;
step 4, training the normalized feature matrix traindata of the standardized training set trainx with a traditional SVM classification algorithm to obtain a parameter matrix W and an offset vector bb;
step 5, calculating prediction category column vectors of the first test subset testx1, the second test subset testx2 and the third test subset testx3 according to results obtained in the step 3 and the step 4, and respectively marking the prediction category column vectors as a first column vector C1, a second column vector C2 and a third column vector C3;
step 6, obtaining a final prediction category column vector according to the first column vector C1, the second column vector C2 and the third column vector C3;
step 7, obtaining, from the final prediction category column vector, the probability P(v1, v2) that a class-v2 target is identified as class v1, where 1 ≤ v1 ≤ m, 1 ≤ v2 ≤ m, 2 ≤ m ≤ λ; the probability P(v1, v2) that a class-v2 target is identified as class v1 is the SAR target classification result based on CNN and SVM decision fusion.
2. The SAR target classification method based on CNN and SVM decision fusion as claimed in claim 1, characterized in that in step 2, the trained CNN network is obtained by the following processes:
2.1) setting up a CNN network comprising n+1 layers in total, where n is a positive integer greater than 3; layer 0 (type0) is the input layer; layer i (typei), 1 ≤ i ≤ n−2, is a convolutional layer or a pooling layer; layer n−1 (typen−1) is a fully-connected layer; layer n (typen) is the output layer; the total number of feature maps of layer i is p_i, a positive integer greater than 0;
initialization: let k_i^{h1,h2}(l) denote the convolution kernel connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l-th update; its initial value k_i^{h1,h2}(0) is a k × k matrix whose k·k elements are random numbers between −H_i and H_i, where the bound H_i is computed from the convolution kernel size k × k and the numbers of feature maps of layers i−1 and i (its formula is given as an image in the original), 0 ≤ h1 ≤ p_{i−1}, 0 ≤ h2 ≤ p_i, p_{i−1} denotes the total number of feature maps of layer i−1 and p_i the total number of feature maps of layer i;
the full-connection matrix initial value ffw0 between layer n−1 and layer n is a (d_n × (d_{n−1}·d_{n−1}·p_{n−1})) matrix, each element of which is a random number between −HH and HH, the bound HH being computed from the sizes of layers n−1 and n (its formula is given as an image in the original);
the offset vector initial value ffb0 between layer n−1 and layer n is a d_n × b matrix, each element of which is 0; the final output matrix initial value is o_0 = ffw0·fv + ffb0, where fv is a (p_{n−2}·d_{n−2}·d_{n−2}) × 1 column vector whose elements (p_m−1)·(d_{n−2})² + (r−1)·d_{n−2} + 1 through (p_m−1)·(d_{n−2})² + r·d_{n−2} correspond one to one, and are equal in value, to elements 1 through d_{n−2} of the r-th column of M_{n−2}^{pm}, the d_{n−2} × d_{n−2} matrix holding the value of the p_m-th feature map in layer n−2; p_{n−2} denotes the total number of feature maps of layer n−2; 1 ≤ p_m ≤ p_i; d_i × d_i denotes the size of each feature map in layer i, 1 ≤ i ≤ n−2; d_{n−1}·d_{n−1} denotes the size of each feature map in layer n−1 and d_n·d_n the size of each feature map in layer n;
initialization: l denotes the l-th update; its initial value is 0 and its maximum value is the total number of standardized SAR images in trainx divided by b;
2.2) the b standardized SAR images in the data set B_l of the l-th update contain b targets; letting these b targets belong to η classes, their corresponding labels y_tr1, y_tr2, …, y_trη form the label-set column vector y_η after the l-th update, where 1 ≤ η ≤ b;
setting up a λ × b zero matrix: if the element in column dd of the label-set column vector y_η has value y_η(dd), 1 ≤ dd ≤ b, the element of the λ × b zero matrix in row σ = y_η(dd) and column dd is changed to 1 while all other elements remain 0, yielding the label matrix y_ηl of the η classes of targets after the l-th update, where 1 ≤ σ ≤ λ;
from the final output matrix o_l after the l-th update and the label matrix y_ηl of the η classes of targets after the l-th update, the error matrix e_l after the l-th update is computed:
e_l = o_l − y_ηl
2.3) from the error matrix e_l after the l-th update, the cost function L_l, the residual matrix od_l of layer n and the residual matrix fvd_l of layer n−1 after the l-th update are computed as:
L_l = (1/(2b))·Σ_{μ=1..λ} Σ_{ρ=1..b} e_l(μ, ρ)²
od_l = e_l·(o_l·(1 − o_l))
fvd_l = ffw_l^T·od_l
where e_l(μ, ρ) denotes the element of e_l in row μ and column ρ, 1 ≤ μ ≤ λ, 1 ≤ ρ ≤ b, and the products in od_l are element-wise; ffw_l^T denotes the transpose of ffw_l, the full-connection matrix between layer n−1 and layer n after the l-th update;
2.4) computing the gradient matrix dk_i^{h1,h2}(l) of the convolution kernel connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after the l-th update, the full-connection matrix gradient dffw_l after the l-th update and the full-connection offset gradient dffb_l after the l-th update; dk_i^{h1,h2}(l) is obtained by convolving the h1-th feature map of layer i−1 with the sensitivity matrix of the h2-th feature map of layer i, averaged over the b images of the batch, and
dffw_l = od_l·fv^T / b
dffb_l = (1/b)·Σ_pp od_l(pp)
(the explicit formulas are given as images in the original), where, when l and i take the same values and h1 and h2 take different values, the convolution kernel gradient matrices connecting the h1-th feature map in layer i−1 and the h2-th feature map in layer i after each update are all obtained in the same manner; b denotes the total number of standardized SAR images in each update's data set; fv^T denotes the transpose of fv; od_l(pp) denotes all elements of the pp-th column of the matrix od_l; od_l denotes the residual matrix of layer n after the l-th update;
2.5) add 1 to the value of l, and calculate the convolution kernel k_{h1,h2}^(l) connecting the h1-th feature map in the (i-1)-th layer and the h2-th feature map in the i-th layer after the l-th update, the convolutional layer bias b_l after the l-th update, the fully connected matrix ffw_l between the (n-1)-th layer and the n-th layer after the l-th update, and the offset vector ffb_l between the (n-1)-th layer and the n-th layer after the l-th update; the calculation expressions are respectively:

k_{h1,h2}^(l) = k_{h1,h2}^(l-1) − α · dk_{h1,h2}^(l-1)

b_l = b_{l-1} − α · db_{l-1}

ffw_l = ffw_{l-1} − α · dffw_{l-1}

ffb_l = ffb_{l-1} − α · dffb_{l-1}

wherein dk_{h1,h2}^(l-1) represents the convolution kernel gradient matrix connecting the h1-th feature map in the (i-1)-th layer and the h2-th feature map in the i-th layer after the (l-1)-th update; let the p_n-th feature map in the (i-1)-th layer be a d_{i-1} × d_{i-1} matrix a_{p_n}^{i-1}, and let δ_{p_m}^{i} denote the sensitivity matrix of the p_m-th feature map in the i-th layer; b_{l-1} represents the convolutional layer bias after the (l-1)-th update, b_0 = 0, α > 0; k_{h1,h2}^(l-1) represents the convolution kernel connecting the h1-th feature map in the (i-1)-th layer and the h2-th feature map in the i-th layer after the (l-1)-th update, and k_{h1,h2}^(0) represents the initial value of that convolution kernel; db_{l-1} represents the bias vector gradient after the (l-1)-th update, whose p_m-th element is

db_{l-1}(p_m) = (1/b) Σ_{u,v} δ_{p_m}^{i}(u,v)

summed over all elements of the sensitivity matrices of the b images;
b represents the total number of normalized SAR images included in the data set after each update;
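Step 2.5 is a plain gradient-descent step with learning rate α; a minimal sketch, with all parameters gathered in dictionaries whose keys are illustrative:

```python
def step_2_5_update(params, grads, alpha):
    """Step 2.5 sketch: one gradient-descent step over every CNN parameter."""
    # params / grads: dicts keyed e.g. 'k', 'b', 'ffw', 'ffb' (names assumed)
    return {name: value - alpha * grads[name] for name, value in params.items()}
```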
then return to step 2.2), and repeat until the l'-th (final) update has been computed, obtaining the convolution kernel k'_{h1,h2}^{i-1,i} connecting the h1-th feature map in the (i-1)-th layer and the h2-th feature map in the i-th layer after the l'-th update, the convolutional layer bias b' after the l'-th update, the fully connected matrix ffw' between the (n-1)-th layer and the n-th layer after the l'-th update, and the offset vector ffb' between the (n-1)-th layer and the n-th layer after the l'-th update; for the convolution kernel k'_{h1,h2}^{i-1,i} after the l'-th update, let i = 1, 2, …, n-2, thereby obtaining the convolution kernels after the l'-th update from k'_{h1,h2}^{0,1}, connecting the h1-th feature map in layer 0 and the h2-th feature map in layer 1, through k'_{h1,h2}^{n-3,n-2}, connecting the h1-th feature map in the (n-3)-th layer and the h2-th feature map in the (n-2)-th layer; correspondingly, the convolutional layer bias runs from its initial value b_0 to the bias b' after the l'-th update, the fully connected matrix between the (n-1)-th layer and the n-th layer runs from its initial value ffw_0 to ffw' after the l'-th update, and the offset vector between the (n-1)-th layer and the n-th layer runs from its initial value ffb_0 to ffb' after the l'-th update; at this point the constructed CNN network has been trained, and it is recorded as the trained CNN network.
3. A CNN and SVM decision fusion based SAR target classification method according to claim 2, characterized in that in step 3, the normalized feature matrix traindata of the normalized training set trainx, the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3 are obtained by:

3.1) passing the normalized training set trainx, the first test subset testx1, the second test subset testx2 and the third test subset testx3 through the trained CNN network, respectively, to obtain a feature matrix trainx_new of the normalized training set trainx, a feature matrix testx1_new of the first test subset testx1, a feature matrix testx2_new of the second test subset testx2 and a feature matrix testx3_new of the third test subset testx3;
3.2) normalizing the feature matrices trainx_new, testx1_new, testx2_new and testx3_new, respectively, using a data normalization algorithm, resulting in the normalized feature matrix traindata of the normalized training set trainx, the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3, calculated respectively as:

traindata = (trainx_new − mean(trainx_new)) ./ std(trainx_new)

testdata1 = (testx1_new − mean(testx1_new)) ./ std(testx1_new)

testdata2 = (testx2_new − mean(testx2_new)) ./ std(testx2_new)

testdata3 = (testx3_new − mean(testx3_new)) ./ std(testx3_new)

wherein the mean function returns the column-wise mean of a matrix, the std function returns the column-wise standard deviation of a matrix, and ./ denotes element-wise division, the row vectors of means and standard deviations being applied to every row.
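A sketch of this column-wise standardization in NumPy (broadcasting stands in for repeating the mean and std row vectors; the helper name is ours):

```python
import numpy as np

def standardize_columns(features):
    """Step 3.2 sketch: subtract each column's mean, divide by its std."""
    mu = features.mean(axis=0)     # column-wise mean (the mean function)
    sigma = features.std(axis=0)   # column-wise std (the std function)
    return (features - mu) / sigma

# e.g. traindata = standardize_columns(trainx_new)
#      testdata1 = standardize_columns(testx1_new)
```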
4. The method for classifying SAR targets based on CNN and SVM decision fusion as claimed in claim 3, wherein in step 5, the first column vector C1, the second column vector C2 and the third column vector C3 are obtained by:

according to the normalized feature matrix testdata1 of the first test subset testx1, the normalized feature matrix testdata2 of the second test subset testx2 and the normalized feature matrix testdata3 of the third test subset testx3, as well as the parameter matrix W and the offset vector bb, the prediction category column vectors of the first test subset testx1, the second test subset testx2 and the third test subset testx3, denoted respectively the first column vector C1, the second column vector C2 and the third column vector C3, are calculated as follows:
fen1 = testdata1 · W^T + repmat(bb, [size(testdata1,1), 1])

fen2 = testdata2 · W^T + repmat(bb, [size(testdata2,1), 1])

fen3 = testdata3 · W^T + repmat(bb, [size(testdata3,1), 1])
wherein size(testdata1,1) represents the number of rows of the matrix testdata1; repmat(bb, [size(testdata1,1), 1]) represents a matrix of size(testdata1,1) rows and p_{n-1} columns, each row of which is the offset vector bb; fen1 is a size(testdata1,1) × p_{n-1} matrix in which each row contains exactly one element equal to 1, all remaining elements being 0; if the column index of the element equal to 1 in the ψ1-th row of fen1 is Δ1, then C1(ψ1) = Δ1, where 1 ≤ ψ1 ≤ size(testdata1,1), 1 ≤ Δ1 ≤ p_{n-1}, and C1(ψ1) denotes the value of the ψ1-th element of the first column vector C1; let ψ1 = 1, 2, …, size(testdata1,1), thereby obtaining the first column vector C1, which contains size(testdata1,1) elements;

likewise, size(testdata2,1) represents the number of rows of the matrix testdata2; repmat(bb, [size(testdata2,1), 1]) represents a matrix of size(testdata2,1) rows and p_{n-1} columns, each row of which is the offset vector bb; fen2 is a size(testdata2,1) × p_{n-1} matrix in which each row contains exactly one element equal to 1, all remaining elements being 0; if the column index of the element equal to 1 in the ψ2-th row of fen2 is Δ2, then C2(ψ2) = Δ2, where 1 ≤ ψ2 ≤ size(testdata2,1), 1 ≤ Δ2 ≤ p_{n-1}, and C2(ψ2) denotes the value of the ψ2-th element of the second column vector C2; let ψ2 = 1, 2, …, size(testdata2,1), thereby obtaining the second column vector C2, which contains size(testdata2,1) elements;

likewise, size(testdata3,1) represents the number of rows of the matrix testdata3; repmat(bb, [size(testdata3,1), 1]) represents a matrix of size(testdata3,1) rows and p_{n-1} columns, each row of which is the offset vector bb; fen3 is a size(testdata3,1) × p_{n-1} matrix in which each row contains exactly one element equal to 1, all remaining elements being 0; if the column index of the element equal to 1 in the ψ3-th row of fen3 is Δ3, then C3(ψ3) = Δ3, where 1 ≤ ψ3 ≤ size(testdata3,1), 1 ≤ Δ3 ≤ p_{n-1}, and C3(ψ3) denotes the value of the ψ3-th element of the third column vector C3; let ψ3 = 1, 2, …, size(testdata3,1), thereby obtaining the third column vector C3, which contains size(testdata3,1) elements.
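Since each row of fen is one-hot, the prediction reduces to a score matrix followed by a per-row argmax; a sketch (1-based class indices as in the claim; the helper name is ours):

```python
import numpy as np

def predict_classes(testdata, W, bb):
    """Claim 4 sketch: fen = testdata * W^T + bb repeated per row; the column
    index of the single 1 in each row of fen is the predicted class."""
    fen = testdata @ W.T + bb          # NumPy broadcasting stands in for repmat
    return np.argmax(fen, axis=1) + 1  # 1-based column index Delta

# C1 = predict_classes(testdata1, W, bb), and likewise for C2 and C3
```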
5. The method for classifying SAR target based on CNN and SVM decision fusion as claimed in claim 4, wherein in step 6, the final prediction class column vector is obtained by:
a decision is made on each row of the first column vector C1, the second column vector C2 and the third column vector C3 to obtain the final prediction category column vector;

take the value of the δ-th row element of each of the first column vector C1, the second column vector C2 and the third column vector C3, denoted respectively C1(δ), the value of the δ-th row element of the first column vector C1, C2(δ), the value of the δ-th row element of the second column vector C2, and C3(δ), the value of the δ-th row element of the third column vector C3, where 1 ≤ δ ≤ S and S is the total number of rows of the first column vector C1; let the S-dimensional column vector be C; the value C(δ) of the δ-th row element of the S-dimensional column vector C is then determined as follows:
if C1(δ) = C2(δ) or C1(δ) = C3(δ), then:

C(δ) = C1(δ);

if C2(δ) = C3(δ), then:

C(δ) = C2(δ);
if C1(δ), C2(δ) and C3(δ) are pairwise distinct, randomly select any one of C1(δ), C2(δ) and C3(δ) as the value C(δ) of the δ-th row element of the S-dimensional column vector C;
let δ = 1, 2, …, S, thereby obtaining the value C(1) of the 1st row element through the value C(S) of the S-th row element of the S-dimensional column vector C, which is recorded as the final prediction category column vector; the final prediction category column vector is an S-dimensional column vector.
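This decision rule is two-out-of-three majority voting with a random tie-break; a sketch (the function name and rng argument are ours):

```python
import numpy as np

def fuse_votes(C1, C2, C3, rng=np.random.default_rng()):
    """Claim 5 sketch: per-row two-out-of-three vote with a random tie-break."""
    C = np.empty_like(C1)
    for d in range(len(C1)):
        if C1[d] == C2[d] or C1[d] == C3[d]:
            C[d] = C1[d]          # C1 agrees with at least one other vector
        elif C2[d] == C3[d]:
            C[d] = C2[d]          # C2 and C3 agree
        else:
            C[d] = rng.choice((C1[d], C2[d], C3[d]))  # pairwise distinct
    return C
```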
6. The SAR target classification method based on CNN and SVM decision fusion as claimed in claim 5, characterized in that in step 7, the probability k_{v1v2} that a class-v2 target is identified as class v1 is obtained as follows:
if the value of the ψ4-th row element of the final prediction category column vector is v1 and the value of the ψ4-th row element of the label column vector yte1 corresponding to the first test subset testx1 is v2, first count the total number of rows, among the S elements of the final prediction category column vector, whose element value is v1, denoted S1, S1 < S; then, among those S1 rows, count the number of rows whose corresponding element in the label column vector yte1 has the value v2, denoted n_{v1v2}, i.e. the number of class-v2 targets identified as class v1, where n_{v1v2} ≤ S1; the probability k_{v1v2} that class v2 is identified as class v1 is then:

k_{v1v2} = n_{v1v2} / S1

wherein 1 ≤ v1 ≤ m, 1 ≤ v2 ≤ m.
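Under this reading, k_{v1v2} is the fraction of rows predicted as class v1 whose true label is v2; a sketch (the count symbol n_{v1v2} follows the reconstruction above, and the function name is ours):

```python
import numpy as np

def misclass_probability(pred, labels, v1, v2):
    """Claim 6 sketch: k_{v1v2} = n_{v1v2} / S1, the share of rows predicted
    as class v1 whose true label in yte1 is v2."""
    predicted_v1 = (pred == v1)                    # rows identified as class v1
    S1 = int(predicted_v1.sum())                   # total number of such rows
    n_v1v2 = int(np.sum(predicted_v1 & (labels == v2)))
    return n_v1v2 / S1 if S1 > 0 else 0.0
```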
CN201810781174.4A 2018-07-17 2018-07-17 SAR target classification method based on CNN and SVM decision fusion Active CN109063750B (en)





Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant