CN113095416A - Small sample SAR target classification method based on mixed loss and graph attention - Google Patents
- Publication number: CN113095416A
- Application number: CN202110408623.2A
- Authority
- CN
- China
- Prior art keywords
- training
- test
- layer
- node
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Abstract
The invention provides a small sample SAR target classification method based on mixed loss and graph attention, which comprises the following steps: acquiring a training sample set and a test sample set; constructing a network model H based on mixed loss and graph attention; performing iterative training on H; and obtaining a target classification result of the small sample SAR image. The invention updates the parameters of all first convolution layers and all second convolution layers in the embedded network module E, and the parameters of all first fully-connected layers and all second fully-connected layers in the graph attention network module G, through the mixed loss value l of the training task set, formed as the weighted sum of the classification loss value l_C and the embedding loss value l_E of the training task set, thereby enhancing the similarity between features of the same SAR target class and the differences between features of different SAR target classes. In addition, data enhancement effectively reduces the risk of overfitting during model training, so the method improves the classification precision of small sample SAR targets.
Description
Technical Field
The invention belongs to the technical field of radar image processing, relates to an SAR target classification method, and particularly relates to a small sample SAR target classification method based on mixed loss and graph attention, which can be used for SAR target classification when only a small number of SAR images of a target can be acquired.
Background
Synthetic aperture radar (SAR) operates around the clock, in all weather, at long range and at high resolution, and because a two-dimensional high-resolution SAR image of a target contains rich information such as the target's shape, size and texture, SAR is widely used for target classification in military fields such as battlefield reconnaissance. SAR target classification is an algorithm that, on a computer system, extracts features from the SAR image data of a target acquired from a sensor and assigns a target class according to the extracted features. Although many traditional SAR target classification methods based on manually selected features and hand-designed classifiers have been produced, these traditional methods require a specific algorithm to be designed for each specific target based on extensive experience and strong domain expertise, which is time-consuming and hard to generalize. In recent years, SAR target classification based on deep learning has realized data-driven classification: such methods autonomously learn and extract features effective for classification from the data and use these features to classify the target, so features need not be selected manually, no classifier needs to be designed, less domain expertise is required, and the methods generalize easily to new target classes. They have therefore achieved excellent performance and are widely studied and used in industry.
However, some targets observed by SAR are non-cooperative small-sample SAR targets, that is, only a small number of SAR images of them can be acquired, often just one to a few dozen images per target. SAR target classification methods based on deep learning generally require a large number of training samples to train a model that achieves high classification accuracy on test samples, so for small-sample SAR targets these methods suffer from low classification accuracy due to the shortage of training samples.
To address this problem, the prior art improves the model structure to design special models with low requirements on the number of samples, thereby improving the classification accuracy on small-sample SAR targets. For example, patent application CN111191718A, entitled "Small sample SAR target recognition method based on graph attention network", discloses a method that first acquires a small number of labeled SAR images and a large number of unlabeled SAR images of a target and performs denoising, then iteratively trains an auto-encoder with the denoised images to obtain feature vectors of all SAR images, and finally constructs an initial adjacency matrix and iteratively trains a graph attention network with the initial adjacency matrix and the feature vectors of all SAR images; the trained graph attention network can classify the targets in the unlabeled SAR images using an attention mechanism. The graph attention network adopted by that method needs few labeled SAR images during class prediction, and the attention mechanism improves the network's classification accuracy. The drawback, however, is that after the auto-encoder encodes and decodes each SAR image, the loss is computed independently between each reconstruction and its original SAR image, so the extracted features of the same SAR target class have low similarity, the differences between features of different SAR target classes are weak, and the model's classification accuracy on small-sample SAR targets remains low.
Disclosure of Invention
The invention aims to overcome the above defects in the prior art and provides a small sample SAR target classification method based on mixed loss and graph attention, to solve the technical problem of low classification accuracy caused by the low similarity between features of the same SAR target class and the weak differences between features of different SAR target classes in the prior art.
In order to achieve the purpose, the technical scheme adopted by the invention comprises the following steps:
(1a) Acquire a number of synthetic aperture radar (SAR) images covering C different target classes, where each target class corresponds to M SAR images of size h × h, each SAR image contains one target, C ≥ 10, M ≥ 200, and h = 128;
(1b) Label the target class in each SAR image. Randomly select C_train target classes and take their C_train × M SAR images, together with the label of each image, as the training sample set; take the C_test × M SAR images of the remaining C_test target classes and their labels as the test sample set, where C_train + C_test = C and 3 ≤ C_test ≤ 5;
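The class split and the one-hot labels used throughout steps (1b) and (3b) can be sketched as follows. This is a minimal illustration with hypothetical function names, not the patent's implementation:

```python
import numpy as np

def one_hot(label_index, num_classes):
    """Steps (1b)/(3b): encode a target-class index as a one-hot label vector."""
    v = np.zeros(num_classes)
    v[label_index] = 1.0
    return v

def split_classes(c_total, c_test, rng):
    """Step (1b): randomly split the C target classes into disjoint
    training-class and test-class index sets (C_train + C_test = C)."""
    order = rng.permutation(c_total)
    return set(order[c_test:].tolist()), set(order[:c_test].tolist())
```

Note that the split is over classes, not images: training and test classes are disjoint, which is what makes the test a genuine small-sample problem.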
(2) Constructing a network model H based on mixed loss and graph attention:
The network model H based on mixed loss and graph attention comprises a data enhancement module D, an embedded network module E, a node feature initialization module I and a graph attention network module G cascaded in sequence. The embedded network module E comprises several first convolution modules E_C cascaded in sequence followed by a second convolution module E_L; each first convolution module E_C consists of a first convolution layer, a first batch normalization layer, a Mish activation layer and a max pooling layer stacked in sequence, and the second convolution module E_L consists of a second convolution layer and a second batch normalization layer stacked in sequence. The graph attention network module G comprises several graph update layers U cascaded in sequence; each graph update layer U consists of an edge feature construction module U_E, an attention weight calculation module U_W and a node feature update module U_N stacked in sequence, where each edge feature construction module U_E comprises several first fully-connected layers stacked in sequence, and each node feature update module U_N comprises a second fully-connected layer;
(3) performing iterative training on a network model H based on mixed loss and graph attention:
(3a) Initialize the iteration counter n and the maximum number of iterations N, with N ≥ 1000, and set n = 0;
(3b) From the training sample set, randomly select a group of C_test target classes, C_test × M SAR images in total, and one-hot encode the label of each SAR image to obtain a C_test-dimensional label vector per image. Then, from the C_test × M SAR images, randomly select the K SAR images of each target class, together with their label vectors, as the training support sample set, and take the remaining C_test × (M − K) SAR images and their label vectors as the training query sample set. After one-hot encoding, the c-th element of an image's label vector indicates the probability that the target in that image belongs to the c-th of the C_test target classes. The a-th training support sample and the b-th training query sample each consist of an SAR image and its corresponding label vector, with 1 ≤ K ≤ 10;
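The support/query sampling of step (3b) amounts to drawing one "episode". A minimal sketch, assuming images are grouped per class (function and variable names are illustrative, not from the patent):

```python
import random

def sample_episode(images_by_class, c_test, k, rng):
    """Step (3b): sample one episode -- C_test classes, K support images per
    class, and the remaining M - K images of each class as query images."""
    classes = rng.sample(sorted(images_by_class), c_test)
    support, query = [], []
    for cls in classes:
        imgs = list(images_by_class[cls])
        rng.shuffle(imgs)
        support += [(im, cls) for im in imgs[:k]]   # K labeled support samples
        query += [(im, cls) for im in imgs[k:]]     # remaining samples to classify
    return support, query
```

A fresh episode is drawn at every iteration of step (3b), so the model never sees the same small-sample task twice.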
(3c) Combine the training support sample set with each training query sample into a training task, obtaining a training task set, and propagate the training task set forward as input to the mixed-loss and graph attention based network model H:
(3c1) The data enhancement module D performs data enhancement on each SAR image in the training task set: apply a power transformation to each SAR image, add noise to the power-transformed image, apply a flip transformation to the noise-added image, and apply a rotation transformation to the flipped image, obtaining the enhanced training task set;
where each enhanced training task corresponds to a training task, each enhanced training support sample corresponds to a training support sample of that task, and each enhanced training query sample corresponds to a training query sample;
(3c2) The embedded network module E maps each SAR image of each enhanced training task in the enhanced training task set, obtaining a set of training embedding vector groups, and uses the embedding loss function L_E to compute the embedding loss value l_E of the training task set from the training embedding vector groups:
where each enhanced training task corresponds to a training embedding vector group, each enhanced training support sample corresponds to a training embedding vector, and each enhanced training query sample corresponds to a training embedding vector; log(·) denotes the logarithm with base the natural constant e, exp(·) the exponential with base e, and Σ a summation; for each training task, the class center of the c-th target class is obtained by averaging the training embedding vectors of the SAR images of the c-th target class contained in the training support sample set, and one of these centers is the class center of the same target class as the target in the SAR image of the training query sample; d denotes the metric function, with d(p, q) = ‖p − q‖²;
(3c3) The node feature initialization module I constructs a virtual label vector and, in each training embedding vector group, concatenates each support training embedding vector with the label vector of its corresponding SAR image, while concatenating the query training embedding vector of each group with the virtual label vector, obtaining the set of training-node layer-1 feature groups:
where the virtual label vector is a C_test-dimensional vector whose elements are all 1; each training embedding vector group corresponds to a training-node layer-1 feature group, and each training embedding vector corresponds to a training-node layer-1 feature;
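The concatenation of step (3c3) can be sketched as below. This is an illustrative reading of the step with hypothetical names; the support nodes carry their known labels, while the query node carries the all-ones virtual label vector because its class is unknown:

```python
import numpy as np

def init_node_features(support_embs, support_label_vecs, query_emb, c_test):
    """Step (3c3): concatenate each support embedding with its one-hot label
    vector, and the query embedding with an all-ones virtual label vector."""
    nodes = [np.concatenate([e, y]) for e, y in zip(support_embs, support_label_vecs)]
    nodes.append(np.concatenate([query_emb, np.ones(c_test)]))
    return np.stack(nodes)   # (C_test*K + 1) nodes, each of dim emb_dim + C_test
```

The resulting node matrix is the layer-1 input of the graph attention network module G.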
(3c4) The graph attention network module G takes the set of training-node layer-1 feature groups as input and performs class prediction on the target in the SAR image of the training query sample corresponding to each training-node layer-1 feature group, obtaining the set of training prediction result vectors, where each training prediction result vector is C_test-dimensional and its c-th element represents the predicted probability that the target in the SAR image of the corresponding training query sample belongs to the c-th target class;
(3c5) Using the classification loss function L_C, compute the classification loss value l_C of the training task set from the training prediction result vectors and all label vectors in the training query sample set:
where the c-th element of the b-th training prediction result vector and y_{b,c}, the c-th element of the label vector of the corresponding SAR image, enter the loss;
(3d) Compute the mixed loss value l of the training task set as the weighted sum of its classification loss value l_C and its embedding loss value l_E, l = λ·l_C + (1 − λ)·l_E, and then use the stochastic gradient descent algorithm to update, through the mixed loss value l, the parameters of all first convolution layers and all second convolution layers in the embedded network module E and the parameters of all first fully-connected layers and all second fully-connected layers in the graph attention network module G, where λ is a weight with 0.7 ≤ λ < 1;
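Steps (3c5) and (3d) can be sketched numerically as follows; a minimal illustration assuming the prediction vectors are already normalized probabilities (function names are not from the patent):

```python
import numpy as np

def classification_loss(pred_vectors, label_vectors, eps=1e-12):
    """Step (3c5): mean cross-entropy between the training prediction result
    vectors and the one-hot label vectors of the query samples."""
    p = np.asarray(pred_vectors)
    y = np.asarray(label_vectors)
    return float(-np.mean(np.sum(y * np.log(p + eps), axis=1)))

def mixed_loss(l_c, l_e, lam=0.8):
    """Step (3d): l = lam * l_C + (1 - lam) * l_E, with 0.7 <= lam < 1."""
    assert 0.7 <= lam < 1
    return lam * l_c + (1 - lam) * l_e
```

With λ ≥ 0.7 the classification term dominates, while the embedding term keeps pulling same-class features together as a regularizer.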
(3e) Judge whether n ≥ N; if so, obtain the trained mixed-loss and graph attention based network model H'; otherwise, set n = n + 1 and return to step (3b);
(4) obtaining a target classification result of the small sample SAR image:
(4a) One-hot encode the label of each SAR image in the test sample set to obtain a C_test-dimensional label vector per image. Then, from the C_test × M SAR images of the test sample set, randomly select the K SAR images of each target class, together with their label vectors, as the test support sample set, and take the remaining C_test × (M − K) SAR images and their label vectors as the test query sample set, where the e-th test support sample and the g-th test query sample each consist of an SAR image and its corresponding label vector;
(4b) Combine the test support sample set with each test query sample into a test task, obtaining a test task set, and propagate it forward as input to the trained mixed-loss and graph attention based network model H':
(4b1) The trained embedded network module E' maps each SAR image of each test task in the test task set, obtaining a set of test embedding vector groups,
where each test task corresponds to a test embedding vector group, each test support sample corresponds to a test embedding vector, and each test query sample corresponds to a test embedding vector;
(4b2) The node feature initialization module I constructs a virtual label vector and, in each test embedding vector group, concatenates each support test embedding vector with the label vector of its corresponding SAR image, while concatenating the query test embedding vector of each group with the virtual label vector, obtaining the set of test-node layer-1 feature groups,
where each test embedding vector group corresponds to a test-node layer-1 feature group, and each test embedding vector corresponds to a test-node layer-1 feature;
(4b3) The trained graph attention network module G' takes the set of test-node layer-1 feature groups as input and performs class prediction on the target in the SAR image of the test query sample corresponding to each test-node layer-1 feature group, obtaining the set of test prediction result vectors. Each test prediction result vector is C_test-dimensional, and its c-th element represents the probability that the target in the SAR image of the corresponding test query sample belongs to the c-th target class; the index of the maximum element of each test prediction result vector is taken as the predicted class of the target in the SAR image of the corresponding test query sample.
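The final decision rule of step (4b3) is a plain argmax over the prediction vector; a one-line sketch:

```python
def predict_class(prediction_vector):
    """Step (4b3): the predicted class is the index of the maximum element
    of the test prediction result vector."""
    return max(range(len(prediction_vector)), key=lambda c: prediction_vector[c])
```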
Compared with the prior art, the invention has the following advantages:
1. The invention updates the parameters of all first convolution layers and all second convolution layers in the embedded network module E and the parameters of all first fully-connected layers and all second fully-connected layers in the graph attention network module G through the mixed loss value l of the training task set, formed as the weighted sum of the classification loss value l_C and the embedding loss value l_E of the training task set, thereby enhancing the similarity between features of the same SAR target class and the differences between features of different SAR target classes.
2. During training of the mixed-loss and graph attention based network model H, the data enhancement module D effectively mitigates the risk of model overfitting by enhancing all SAR images. Moreover, the data enhancement module D, the embedded network module E and the node feature initialization module I extract effective features of each SAR image and combine them with the corresponding label vectors, providing data support for the class prediction of the graph attention network module G. Compared with the prior art, the classification precision of small-sample SAR targets is further improved.
Experimental results show that the method can obtain higher classification accuracy in small sample SAR target classification.
Drawings
FIG. 1 is a flow chart of an implementation of the present invention.
FIG. 2 is a flow chart of an implementation of the present invention for iterative training of a network model H based on mixed loss and graph attention.
Fig. 3 is a flow chart of an implementation of the present invention to obtain a target classification result of a small sample SAR image.
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments.
Referring to fig. 1, the present invention includes the steps of:
(1a) Acquire a number of synthetic aperture radar (SAR) images covering C different target classes, where each target class corresponds to M SAR images of size h × h, each SAR image contains one target, C ≥ 10, M ≥ 200, and h = 128;
(1b) Label the target class in each SAR image. Randomly select C_train target classes and take their C_train × M SAR images, together with the label of each image, as the training sample set; take the C_test × M SAR images of the remaining C_test target classes and their labels as the test sample set, where C_train + C_test = C and 3 ≤ C_test ≤ 5;
The invention trains the model with target classes containing sufficient SAR images; the trained model can then achieve high classification precision on small-sample SAR targets of other classes. Therefore, the training sample set and the test sample set constructed in this step contain disjoint sets of SAR target classes;
step 2), constructing a network model H based on mixed loss and graph attention:
The network model H based on mixed loss and graph attention comprises a data enhancement module D, an embedded network module E, a node feature initialization module I and a graph attention network module G cascaded in sequence. The embedded network module E comprises five first convolution modules E_C cascaded in sequence followed by a second convolution module E_L; each first convolution module E_C consists of a first convolution layer, a first batch normalization layer, a Mish activation layer and a max pooling layer stacked in sequence, and the second convolution module E_L consists of a second convolution layer and a second batch normalization layer stacked in sequence. The graph attention network module G comprises three graph update layers U cascaded in sequence; each graph update layer U consists of an edge feature construction module U_E, an attention weight calculation module U_W and a node feature update module U_N stacked in sequence, where each edge feature construction module U_E comprises five first fully-connected layers stacked in sequence, and each node feature update module U_N comprises a second fully-connected layer;
the embedded network module E and the attention network module G have the following specific parameters:
The convolution kernels of the five first convolution layers in the first convolution modules E_C of the embedded network module E are all 3 × 3, with stride 1 and padding 1; the first three first convolution layers have 64 convolution kernels each, and the fourth and fifth first convolution layers have 128 each; the pooling kernels of the five max pooling layers are all 2 × 2, with sliding stride 2. The convolution kernel of the second convolution layer in the second convolution module E_L is 4 × 4, with stride 1;
In each graph update layer U of the graph attention network module G, the edge feature construction module U_E has 96 neurons in each of its first four first fully-connected layers and 1 neuron in the fifth first fully-connected layer; the second fully-connected layer of the node feature update module U_N in the first two graph update layers has 24 neurons, and the second fully-connected layer of the node feature update module U_N in the third graph update layer has C_test neurons;
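The stated parameters can be checked with the standard convolution output-size formula: each 3 × 3 convolution with stride 1 and padding 1 preserves the spatial size, each 2 × 2 max pool with stride 2 halves it, so five modules E_C reduce a 128 × 128 image to 4 × 4, and the 4 × 4 kernel of E_L (assuming no padding, which the text does not state explicitly) collapses that to a 1 × 1 map, i.e. a single embedding vector per image:

```python
def conv_out(size, kernel, stride=1, pad=0):
    """Spatial output size of a convolution or pooling layer."""
    return (size + 2 * pad - kernel) // stride + 1

size = 128                                      # h x h input SAR image, h = 128
for _ in range(5):                              # five first convolution modules E_C
    size = conv_out(size, 3, stride=1, pad=1)   # 3x3 conv, stride 1, pad 1: unchanged
    size = conv_out(size, 2, stride=2, pad=0)   # 2x2 max pool, stride 2: halved
embedding_map = conv_out(size, 4, stride=1, pad=0)  # second conv E_L, 4x4 kernel
```

After the loop `size` is 4 (128 → 64 → 32 → 16 → 8 → 4) and `embedding_map` is 1.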
Step 3) iterative training is carried out on the network model H based on the mixed loss and the drawing attention, and the implementation steps are as shown in FIG. 2:
(3a) Initialize the iteration counter n and the maximum number of iterations N, with N ≥ 1000, and set n = 0;
(3b) From the training sample set, randomly select a group of C_test target classes, C_test × M SAR images in total, and one-hot encode the label of each SAR image to obtain a C_test-dimensional label vector per image. Then, from the C_test × M SAR images, randomly select the K SAR images of each target class, together with their label vectors, as the training support sample set, and take the remaining C_test × (M − K) SAR images and their label vectors as the training query sample set. After one-hot encoding, the c-th element of an image's label vector indicates the probability that the target in that image belongs to the c-th of the C_test target classes. The a-th training support sample and the b-th training query sample each consist of an SAR image and its corresponding label vector, with 1 ≤ K ≤ 10;
To ensure that the trained model achieves high classification precision on small-sample SAR targets of other classes, the invention simulates the test process during training: it randomly selects from the training sample set SAR images covering the same number of SAR target classes as in testing, and divides all selected SAR images and their label vectors into two subsets, the training support sample set and the training query sample set. The training support sample set contains few SAR images, only C_test × K, simulating the small number of SAR images that can be acquired of small-sample SAR targets; the SAR images in the training query sample set simulate the SAR images to be classified. The training support sample set thus provides data support for the model to predict the classes of the SAR images in the training query sample set;
(3c) Combine the training support sample set with each training query sample into a training task. Each training task is itself a small-sample SAR target problem: the model must use the training support sample set with known classes to predict the class of the SAR image in each training query sample. Combining all training tasks yields the training task set, which is propagated forward as input to the mixed-loss and graph attention based network model H:
(3c1) The data enhancement module D performs data enhancement on each SAR image in the training task set: apply a power transformation to each SAR image, add noise to the power-transformed image, apply a flip transformation to the noise-added image, and apply a rotation transformation to the flipped image, obtaining the enhanced training task set;
where each enhanced training task corresponds to a training task, each enhanced training support sample corresponds to a training support sample of that task, and each enhanced training query sample corresponds to a training query sample;
Using the data enhancement module D, the training task set changes in each iteration, which increases the diversity of the training task set and reduces the risk of model overfitting during training. The data enhancement module D is implemented as follows: apply a power transformation to each SAR image B, i.e. B1 = (B/255)^γ × 255, to obtain the SAR image B1; add noise to each SAR image B1, i.e. B2 = B1 + noise, to obtain the SAR image B2; apply a flip transformation to each SAR image B2, i.e. B3 = flip(B2), to obtain the SAR image B3; apply a rotation transformation to each SAR image B3, i.e. B' = rot(B3), to obtain the enhanced SAR image B'. Here γ denotes a power randomly taken in the range [0.7, 1.3]; noise denotes noise uniformly distributed in [−α, α], with α randomly taken in the range (0, 50]; flip(·) denotes a random left-right or up-down flip, each mode with probability 1/2; and rot(·) denotes a random clockwise rotation by 90°, 180° or 270°, each angle with probability 1/3;
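The augmentation chain above can be sketched with NumPy. This is an illustrative reading: the text does not state the distributions from which γ and α are drawn, so uniform sampling is assumed here, and `np.rot90` rotates counterclockwise rather than clockwise, which covers the same three 90°-multiple outcomes:

```python
import numpy as np

def augment(img, rng):
    """Step (3c1): power transform, additive noise, random flip, random
    rotation; `img` is an h x h array with gray values in [0, 255]."""
    gamma = rng.uniform(0.7, 1.3)                         # B1 = (B/255)^gamma * 255
    b1 = (img / 255.0) ** gamma * 255.0
    alpha = rng.uniform(0.0, 50.0)                        # alpha in (0, 50] (assumed uniform)
    b2 = b1 + rng.uniform(-alpha, alpha, size=img.shape)  # noise ~ U[-alpha, alpha]
    b3 = np.flip(b2, axis=int(rng.integers(0, 2)))        # up-down or left-right, p = 1/2
    return np.rot90(b3, k=int(rng.integers(1, 4)))        # 90/180/270 deg, p = 1/3 each
```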
(3c2) The embedded network module E maps each SAR image in each enhanced training task of the enhanced training task set to obtain the training embedded vector group set, and uses the embedding loss function LE to compute the embedding loss value lE of the training task set from the training embedded vector group sets:
wherein the training embedded vector set corresponding to each enhanced training task contains the training embedding vectors of the enhanced training support samples and of the enhanced training query sample; log(·) denotes the logarithm to the natural base e; exp(·) denotes the exponential to the natural base e; Σ denotes summation; the class center of the c-th target class is obtained by averaging the training embedding vectors corresponding to the SAR images of the c-th target class contained in the training support sample set; the class center of the target class to which the target in the SAR image of the training query sample belongs is denoted accordingly; and d denotes a metric function, with d(p, q) = ‖p − q‖²;
In this step, the embedded network module E mapping each SAR image to an embedding vector amounts to extracting the features of each SAR image. The embedding loss computes, for each training task, the class center of each SAR target class and measures the distance between each class center and the embedding vector of the training query sample. Under the constraint of the embedding loss function, the embedding vector of the training query sample is pulled closer to the class center of its own class and pushed further from the class centers of other classes. After multiple iterations, the embedded network module E extracts features with higher similarity within the same SAR target class and stronger differences between different SAR target classes, which improves the category prediction accuracy of the subsequent graph attention network module G for the SAR image in each training query sample;
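Assuming a prototypical-style formulation consistent with this description (class centers as mean support embeddings, squared Euclidean metric d(p, q) = ‖p − q‖²), the embedding loss for one query can be sketched as:

```python
import numpy as np

def embedding_loss(support, support_labels, query, query_label):
    """Sketch of the embedding loss L_E (hypothetical implementation):
    negative log-probability of the query's own class under a softmax over
    negative squared distances to the class centers."""
    classes = np.unique(support_labels)
    # Class center: mean of the support embeddings of each target class
    centers = np.stack([support[support_labels == c].mean(axis=0)
                        for c in classes])
    d = ((centers - query) ** 2).sum(axis=1)   # squared distance to each center
    log_p = -d - np.log(np.exp(-d).sum())      # log softmax over -d
    return -log_p[query_label]
```

A query embedding near its own class center then yields a small loss, while one near a wrong center yields a large loss, which is exactly the pull/push behaviour described above.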
(3c3) The node feature initialization module I constructs a virtual label vector, splices each training embedding vector of a support sample in each training embedded vector set with the label vector of the corresponding SAR image, and splices the training embedding vector of the query sample in each training embedded vector set with the virtual label vector, obtaining the training node 1-layer feature group set
wherein the virtual label vector is a Ctest-dimensional vector whose elements are all equal to 1; each training embedded vector set corresponds to one training node 1-layer feature set, and each training embedding vector to one training node 1-layer feature;
In this step, splicing the training embedding vector of each enhanced training support sample with its label vector adds class information to each training embedding vector, yielding the training node 1-layer features and providing data support for the model's class prediction of the SAR image in each training query sample. For each training query sample whose class is to be predicted, the training embedding vector of its SAR image is spliced with the virtual label vector, which ensures that the dimensions of all training node 1-layer features are equal;
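The splicing described above can be sketched as follows; this is a hypothetical NumPy illustration, with the all-ones virtual label vector following the statement above:

```python
import numpy as np

def init_node_features(support_emb, support_labels, query_emb, n_classes):
    """Sketch of the node feature initialization module I: support embeddings
    are spliced with one-hot label vectors, the query embedding with the
    virtual label vector (a C_test-dimensional all-ones vector), so every
    node 1-layer feature has the same dimension."""
    one_hot = np.eye(n_classes)[support_labels]            # one-hot labels
    support_nodes = np.concatenate([support_emb, one_hot], axis=1)
    virtual = np.ones((1, n_classes))                      # virtual label vector
    query_node = np.concatenate([query_emb[None, :], virtual], axis=1)
    return np.concatenate([support_nodes, query_node], axis=0)

feats = init_node_features(np.zeros((4, 8)), np.array([0, 0, 1, 1]),
                           np.zeros(8), 2)
print(feats.shape)  # (5, 10): 4 support nodes + 1 query node, 8 + 2 dims
```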
(3c4) The graph attention network module G takes the training node 1-layer feature group set as input and performs category prediction on the targets in the SAR images of the training query samples corresponding to the training node 1-layer features, obtaining the training prediction result vector set, wherein each training prediction result vector has dimension Ctest and its c-th element denotes the predicted probability that the target in the SAR image of the corresponding training query sample belongs to the c-th target class;
In this step, the graph attention network module G performs category prediction on the SAR image of the training query sample corresponding to each training node 1-layer feature. The module first regards each SAR image in the training task as a node and connects every pair of nodes with two oppositely directed edges, forming a fully connected directed graph in which every node and every edge carries features. The initial feature of each node, i.e. the input feature of the 1st graph update layer U, is the training node 1-layer feature; the features are then updated repeatedly by the graph update layers U, the i-th graph update layer U turning the input training node i-layer features into the training node i+1-layer features. The feature of each edge in the i-th graph update layer U is computed from the features of its endpoint nodes, and each edge feature corresponds to one attention weight; the attention weights guide each node in aggregating features from the other nodes, and the node features are updated accordingly. After three rounds of feature updates, the graph attention network module G converts the training node 4-layer feature corresponding to the SAR image of each training query sample into its predicted label. The specific implementation steps of the whole process are as follows:
(3c41) Initialize the iteration counter i; since the graph attention network module G contains three graph update layers U, the maximum number of iterations is 3; set i = 1;
(3c42) The edge feature construction module UE of the i-th graph update layer U constructs edge features from all the training node i-layer features contained in each training node i-layer feature set, obtaining the training edge i-layer feature group set
wherein each training edge i-layer feature set corresponds to one training node i-layer feature set; each training edge i-layer feature is constructed from the i-layer features of the h-th and the j-th training node; abs(·) takes the element-wise absolute value; and the edge features are produced by the sequentially stacked first fully connected layers contained in the edge feature construction module UE of the i-th graph update layer U;
(3c43) The attention weight calculation module UW of the i-th graph update layer U calculates an attention weight for every training edge i-layer feature contained in each training edge i-layer feature set, obtaining the training i-layer attention weight group set
wherein ,representing training edge i-layer feature setCorrespondingly training the i-layer attention weight group,representing training edge i-layer featuresThe corresponding training i-layer attention weight,from 1 to C representing removal of htestA set of positive integers of (M-K);
In this step, the attention weight calculation module UW calculates the attention weights among the nodes from the feature of each edge; the weights are normalized, i.e. the attention weights from one node to all other nodes sum to 1;
(3c44) node characteristic updating module U of ith graph updating layer UNUsing training node i-layer feature set setsAnd training the i-layer attention weight setUpdating the node characteristics to obtain a training node i +1 layer characteristic group set
wherein each training node i+1-layer feature set corresponds to one training node i-layer feature set and each training node i+1-layer feature to one training node i-layer feature; a‖b denotes splicing vector b after vector a; and the spliced vector is input to the second fully connected layer of the node feature update module UN contained in the i-th graph update layer U;
In this step, the node feature update module UN lets each node aggregate features from all other nodes through the attention weights; the aggregated feature is spliced with the node's own feature, and the spliced result is input to the second fully connected layer to obtain the updated feature, i.e. the training node i-layer feature is updated to the training node i+1-layer feature. With the node feature aggregation guided by the attention weights, the node corresponding to the SAR image of each training query sample can acquire class information from nodes of the same class, and after multiple updates this is converted into the predicted class;
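The three sub-modules of one graph update layer described in steps (3c42) to (3c44) can be sketched together as follows; the single linear maps `w_edge` and `w_node` are hypothetical stand-ins for the stacked first fully connected layers of UE and the second fully connected layer of UN:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def graph_update_layer(nodes, w_edge, w_node):
    """Sketch of one graph update layer U: edge features are element-wise
    absolute differences of node features, each edge is scored to an
    attention weight, the weights of each node over the others are
    normalized to sum to 1, and each node aggregates, splices and maps."""
    # Edge i-layer features: abs(v_h - v_j), shape (n, n, f)
    edge_feats = np.abs(nodes[:, None, :] - nodes[None, :, :])
    scores = edge_feats @ w_edge                 # (n, n) raw attention scores
    np.fill_diagonal(scores, -np.inf)            # a node does not attend to itself
    attn = softmax(scores, axis=1)               # each row sums to 1
    agg = attn @ nodes                           # attention-weighted aggregation
    spliced = np.concatenate([nodes, agg], axis=1)   # v_h || aggregated feature
    return spliced @ w_node                      # second fully connected layer

rng = np.random.default_rng(0)
nodes = rng.standard_normal((6, 8))              # 6 nodes, 8-dim i-layer features
out = graph_update_layer(nodes, rng.standard_normal(8),
                         rng.standard_normal((16, 8)))
print(out.shape)  # (6, 8)
```

Stacking three such layers, as the module G does, turns the node 1-layer features into the node 4-layer features used for prediction.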
(3c45) Judge whether i = 3 holds; if so, apply a softmax transformation to each training node 4-layer feature contained in each training node 4-layer feature set of the obtained training node 4-layer feature group set to obtain the training prediction result vector set; otherwise let i = i + 1 and execute step (3c42), wherein each training prediction result vector corresponds to one training node 4-layer feature;
(3c5) Use the classification loss function LC to calculate the classification loss value lC of the training task set from the training prediction result vector set and all label vectors in the training query sample set:
wherein the c-th element of each training prediction result vector gives the predicted probability of the c-th target class, and yb,c is the value of the c-th element of the label vector of the SAR image corresponding to the b-th training prediction result vector;
(3d) Compute the weighted sum of the classification loss value lC and the embedding loss value lE of the training task set to obtain the mixed loss value l of the training task set, l = λlC + (1 − λ)lE; then update the parameters of all first convolution layers and all second convolution layers in the embedded network module E and the parameters of all first fully connected layers and all second fully connected layers in the graph attention network module G through the mixed loss value l using the stochastic gradient descent algorithm, where λ is the weight and 0.7 ≤ λ < 1;
In this step, updating the parameters of the whole model with the mixed loss value l, obtained by weighting the classification loss value lC and the embedding loss value lE, enhances the similarity between features of the same SAR target class and the differences between features of different SAR target classes, thereby improving the classification accuracy;
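As a minimal illustration, the weighting of the two loss terms can be sketched as follows (the default λ is an arbitrary value inside the stated range, not one taken from the patent):

```python
def mixed_loss(l_c, l_e, lam=0.8):
    """Sketch of the mixed loss l = lam*l_C + (1 - lam)*l_E,
    with the weight lam constrained to 0.7 <= lam < 1 as stated above."""
    if not (0.7 <= lam < 1.0):
        raise ValueError("lambda must satisfy 0.7 <= lambda < 1")
    return lam * l_c + (1.0 - lam) * l_e
```

Because λ ≥ 0.7, the classification loss dominates the gradient while the embedding loss acts as a feature-shaping regularizer.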
(3e) Judge whether n ≥ N holds; if so, the trained mixed-loss and graph-attention network model H' is obtained; otherwise let n = n + 1 and execute step (3b);
step 4) obtaining a target classification result of the small sample SAR image, wherein the implementation steps are as shown in FIG. 2:
(4a) for test sample setCarrying out one-hot coding on the label of each SAR image to obtain C of each SAR imagetestDimension label vectors, then from the test sample setC of (A)testRandomly selecting K SAR images contained in each target category and corresponding label vectors from the xM SAR images as a test support sample setC is remainedtest(M-K) SAR images and corresponding label vectors serving as test query sample sets wherein ,representing the e-th test support sample consisting of the SAR image and its corresponding tag vector,representing a g-th test query sample consisting of the SAR image and the corresponding label vector;
(4b) supporting the test with a sample setWith each test query sampleCombined into test tasks Obtaining a set of test tasksAnd will beForward propagation as input to the trained mixed-loss and attention-based network model H':
(4b1) trained embedded network module E' pair test task setEach test task in (1)Mapping each SAR image to obtain a test embedded vector group set
wherein ,representing test tasksCorresponding test embedded vector groups satisfy e ≠ CtestOf K +1Presentation of test support samplesThe corresponding test-embedded vector is inserted into the vector,representing test query samplesA corresponding test embedding vector;
(4b2) The node feature initialization module I constructs a virtual label vector, splices each test embedding vector of a support sample in each test embedded vector set with the label vector of the corresponding SAR image, and splices the test embedding vector of the query sample in each test embedded vector set with the virtual label vector, obtaining the test node 1-layer feature group set
wherein each test embedded vector set corresponds to one test node 1-layer feature set, and each test embedding vector to one test node 1-layer feature;
(4b3) The trained graph attention network module G' takes the test node 1-layer feature group set as input and performs category prediction on the targets in the SAR images of the test query samples corresponding to the test node 1-layer features, obtaining the test prediction result vector set; the dimension index of the maximum value in each test prediction result vector is the predicted category of the target in the SAR image of the corresponding test query sample, wherein each test prediction result vector has dimension Ctest and its c-th element denotes the probability that the target in the SAR image of the corresponding test query sample belongs to the c-th target class;
In this step, the test prediction result vector set is obtained in the same way as in step (3c4); only the input data of the trained graph attention network module G' is changed.
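The conversion from a test prediction result vector to a predicted category described in step (4b3) amounts to taking the index of the maximum element, e.g. (hypothetical helper):

```python
import numpy as np

def predict_class(pred_vector):
    """Sketch: the predicted category of a test query sample is the index
    of the maximum element of its C_test-dimensional test prediction
    result vector."""
    return int(np.argmax(pred_vector))

print(predict_class(np.array([0.1, 0.7, 0.2])))  # 1
```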
The technical effects of the present invention are further explained by combining simulation experiments as follows:
1. simulation experiment conditions and contents:
The hardware platform of the simulation experiment is: GPU NVIDIA GeForce RTX 3090; the software platform is: the Windows 10 operating system. The data set of the simulation experiment is the public MSTAR data set, in which the SAR sensor is of the high-resolution spotlight type with a resolution of 0.3 m × 0.3 m, works in the X band with HH polarization, the pitch angle is 17°, the azimuth angle varies continuously from 0° to 360° at intervals of about 5°, and the size after image cropping is 128 × 128 pixels. The consolidated MSTAR data set contains 10 classes of ground military vehicle targets, i.e. C = 10, with models 2S1, BMP-2, BRDM-2, BTR-60, BTR-70, D-7, T-62, T-72, ZIL-131 and ZSU-234, and each target class has 210 SAR images, i.e. M = 210.
To compare the small-sample SAR target classification accuracy with the existing small-sample SAR target recognition method based on the graph attention network, 1050 SAR images of 5 target categories and the label of each SAR image are selected from the MSTAR data set as the training sample set, i.e. Ctrain = 5, and the 1050 SAR images of the remaining 5 target categories and the label of each SAR image as the test sample set, i.e. Ctest = 5. Meanwhile, the number K of training/test support samples sampled per target class in each training/test task is 10, and the number M − K of training/test query samples is 200. The division of the target classes in the training and test sample sets and the number of SAR images per target class are shown in Table 1:
TABLE 1
The classification confusion matrix and the average classification accuracy are compared by simulation with those of the existing small-sample SAR target recognition method based on the graph attention network, and the results are shown in Table 2:
TABLE 2
As can be seen from Table 2, the average accuracy of the small sample SAR target classification of the invention is improved by 5.1% compared with the prior art.
The foregoing description is only exemplary of the invention and is not intended to limit the invention, and it will be apparent to those skilled in the art that various changes and modifications in form and detail may be made without departing from the principles and arrangements of the invention, but these changes and modifications are within the scope of the invention as defined in the appended claims.
Claims (4)
1. A small sample SAR target classification method based on mixed loss and graph attention is characterized by comprising the following steps:
(1a) Acquiring a plurality of Synthetic Aperture Radar (SAR) images containing C different target categories, wherein each target category corresponds to M SAR images with the size of h multiplied by h, each SAR image contains 1 target, C is more than or equal to 10, M is more than or equal to 200, and h is 128;
(1b) for the purpose in each SAR imageMarking the mark class and randomly selecting the mark class containing CtrainTotal of C for each object classtrainTaking x M SAR images and labels of each SAR image as training sample setsMixing the rest CtestTotal of C for each object classtestTaking x M SAR images and labels of each SAR image as test sample sets wherein Ctrain+Ctest=C,3≤Ctest≤5;
(2) Constructing a network model H based on mixed loss and graph attention:
constructing a mixed loss and graph attention-based network model H comprising a data enhancement module D, an embedded network module E, a node feature initialization module I and a graph attention network module G which are sequentially cascaded, wherein the embedded network module E comprises a plurality of first volume modules E which are sequentially cascadedCAnd a second convolution module ELEach first convolution module ECComprises a first convolution layer, a first batch of normalization layers, a Mish activation layer, a maximum pooling layer and a second convolution module E which are sequentially stackedLThe second convolution layer and the second batch of normalization layer are sequentially stacked; the graph attention network module G comprises a plurality of graph updating layers U which are sequentially cascaded, and each graph updating layer U comprises edge feature building modules U which are sequentially stackedEAttention weight calculation module UWAnd node feature update module UNEach edge feature building block UEComprises a plurality of first full-connection layers which are stacked in sequence, and each node feature updating module UNA second full-link layer is included;
(3) performing iterative training on a network model H based on mixed loss and graph attention:
(3a) Initialize the iteration counter n, with the maximum number of iterations N, N ≥ 1000, and set n = 0;
(3b) from a training sample setRandomly selecting a group containing CtestTotal of C for each object classtestMultiplying the SAR images by x M, and performing one-hot coding on the label of each SAR image to obtain C of each SAR imagetestDimension label vector, then from CtestRandomly selecting K SAR images contained in each target category and corresponding label vectors from the xM SAR images as training support sample setsC is remainedtest(M-K) SAR images and corresponding label vectors serving as training query sample setsAfter one-hot coding, the C-dimensional element in the tag vector of each SAR image indicates that the target in the SAR image belongs to CtestThe probability of the c-th object class of the object classes,representing the a-th training support sample consisting of the SAR image and its corresponding label vector,representing a b-th training query sample consisting of the SAR image and the corresponding label vector, wherein K is more than or equal to 1 and less than or equal to 10;
(3c) will train and support the sample setWith each training query sampleCombined into a training task Obtaining a training task setAnd will beForward propagation as input to the mixed-loss and graph attention based network model H:
(3c1) data enhancement module D pairs training task setPerforming data enhancement on each SAR image: performing power transformation on each SAR image, adding noise to the SAR image subjected to power transformation, performing turnover transformation on the SAR image subjected to noise addition, and performing rotation transformation on the SAR image subjected to turnover transformation to obtain an enhanced training task set
wherein ,representing training tasksA corresponding one of the enhanced training tasks is,representing training tasksTraining support sample in (1)The corresponding reinforced training support sample is used for training,representing training query samplesCorresponding enhanced training query samples;
(3c2) The embedded network module E maps each SAR image in each enhanced training task of the enhanced training task set to obtain the training embedded vector group set, and uses the embedding loss function LE to compute the embedding loss value lE of the training task set from the training embedded vector group sets:
wherein the training embedded vector set corresponding to each enhanced training task contains the training embedding vectors of the enhanced training support samples and of the enhanced training query sample; log(·) denotes the logarithm to the natural base e; exp(·) denotes the exponential to the natural base e; Σ denotes summation; the class center of the c-th target class is obtained by averaging the training embedding vectors corresponding to the SAR images of the c-th target class contained in the training support sample set; the class center of the target class to which the target in the SAR image of the training query sample belongs is denoted accordingly; and d denotes a metric function, with d(p, q) = ‖p − q‖²;
(3c3) Node characteristic initialization module I constructs a virtual label vectorAnd embedding a set of vectors for each trainingIn the condition that a is not equal to CtestEach training embedded vector of K +1Splicing with the label vector of the corresponding SAR image, and simultaneously embedding each training vector groupTraining embedded vector inAnd virtual tag vectorSplicing to obtain a training node 1-layer feature group set
wherein , is represented by CtestA vector in which the element values of each dimension are all 1,representing training embedded vector setsThe corresponding training node layer 1 feature set,representing training embedded vectorsCorresponding training node 1 layer characteristics;
(3c4) graph attention network module G trains node layer 1 feature group set through inputTo pairIncluding each training node layer 1 feature setTraining node level 1 features inCorresponding training query samplesCarrying out category prediction on targets in the SAR image to obtain a training prediction result vector set wherein ,representing training node layer 1 featuresCorresponding dimension is CtestThe c-th element represents the 1-level feature of the training nodeCorresponding training query samplesThe prediction probability that the target in the included SAR image belongs to the c-th target class;
(3c5) using a classification loss function LCPredicting the result vector set by trainingAnd training the query sample setAll the label vectors in (1) calculate the training task setIs a classification loss value lC:
wherein the c-th element of each training prediction result vector gives the predicted probability of the c-th target class, and yb,c is the value of the c-th element of the label vector of the SAR image corresponding to the b-th training prediction result vector;
(3d) Compute the weighted sum of the classification loss value lC and the embedding loss value lE of the training task set to obtain the mixed loss value l of the training task set, l = λlC + (1 − λ)lE; then update the parameters of all first convolution layers and all second convolution layers in the embedded network module E and the parameters of all first fully connected layers and all second fully connected layers in the graph attention network module G through the mixed loss value l using the stochastic gradient descent algorithm, where λ is the weight and 0.7 ≤ λ < 1;
(3e) judging whether N is greater than or equal to N, if so, obtaining a trained network model H' based on the mixing loss and the graph attention, otherwise, enabling N to be N +1, and executing the step (3 b);
(4) obtaining a target classification result of the small sample SAR image:
(4a) For test sample setCarrying out one-hot coding on the label of each SAR image to obtain C of each SAR imagetestDimension label vectors, then from the test sample setC of (A)testRandomly selecting K SAR images contained in each target category and corresponding label vectors from the xM SAR images as a test support sample setC is remainedtest(M-K) SAR images and corresponding label vectors serving as test query sample sets wherein ,representing the e-th test support sample consisting of the SAR image and its corresponding tag vector,representing a g-th test query sample consisting of the SAR image and the corresponding label vector;
(4b) supporting the test with a sample setWith each test query sampleCombined into test tasks Obtaining a set of test tasksAnd will beForward propagation as input to the trained mixed-loss and attention-based network model H':
(4b1) trained embedded network module E' pair test task setEach test task in (1)Mapping each SAR image to obtain a test embedded vector group set
wherein ,representing test tasksCorresponding test embedded vector groups satisfy e ≠ CtestOf K +1Presentation of test support samplesThe corresponding test-embedded vector is inserted into the vector,representing test query samplesA corresponding test embedding vector;
(4b2) node characteristic initialization module I constructs a virtual label vectorAnd embedding the vector set for each testIn the condition that e ≠ CtestEach test embedding vector of K +1Splicing with the label vector of the corresponding SAR image, and simultaneously embedding each test into a vector groupTest embedded vector inAnd virtual tag vectorSplicing to obtain a 1-layer feature group set of the test nodes
wherein each test embedded vector set corresponds to one test node 1-layer feature set, and each test embedding vector to one test node 1-layer feature;
(4b3) the trained graph attention network module G' tests the node layer 1 feature group set through inputTo pairIncluding each test node a layer 1 feature setTest node level 1 features inCorresponding test query sampleIncludedCarrying out category prediction on the target in the SAR image to obtain a test prediction result vector setEach test prediction vectorThe dimension number corresponding to the medium maximum value isCorresponding test query sampleIncluding a prediction category of the target in the SAR image, wherein,representing test node level 1 featuresCorresponding dimension is CtestThe element value of the c-th dimension represents the test node level 1 featureCorresponding test query sampleThe probability that the object in the included SAR image belongs to the c-th object class.
2. The mixed-loss and attention-based small-sample SAR target classification method according to claim 1, characterized in that the mixed-loss and attention-based network model H in the step (2) is embedded in a first convolution module E included in a network module ECIs five, the figure notes the number of figure update layers U comprised by the force network module GFor three, each graph updating layer U comprises an edge feature building module UEThe number of the first full connection layers in (1) is five, and the specific parameters of the embedded network module E and the graph attention network module G are as follows:
first volume module E included in embedded network module ECThe sizes of the convolution kernels of the five first convolution layers in the three first convolution layers are all 3 multiplied by 3, the step lengths are all 1, the filling is all 1, the number of the convolution kernels of the first three first convolution layers is 64, the number of the convolution kernels of the fourth and fifth first convolution layers is 128, the sizes of the pooling kernels of the five maximum pooling layers are all 2 multiplied by 2, and the sliding step lengths are all 2; second convolution module ELThe size of the convolution kernel of the second convolution layer in (1) is 4 × 4, and the step size is 1;
the graph attention network module G comprises an edge feature building module U in each graph updating layer UEThe number of the neurons of the first four first full connection layers is 96, and the number of the neurons of the fifth first full connection layer is 1; node feature update module U included in the first two graph update layers UNThe number of the neurons of the second full-connection layer is 24, and the third graph updating layer U comprises a node feature updating module UNThe number of neurons in the second fully-connected layer in (1) is Ctest。
3. The small-sample SAR target classification method based on mixed loss and graph attention according to claim 1, wherein the data enhancement module D in step (3c1) performs data enhancement on each SAR image in the training task set as follows: perform a power transformation on each SAR image B, i.e. B1 = (B/255)^γ × 255, to obtain the SAR image B1; add noise to each SAR image B1, i.e. B2 = B1 + noise, to obtain the SAR image B2; perform a flip transformation on each SAR image B2, i.e. B3 = flip(B2), to obtain the SAR image B3; perform a rotation transformation on each SAR image B3, i.e. B' = rot(B3), to obtain the enhanced SAR image B', where γ denotes the power, taken randomly from the range [0.7, 1.3]; noise denotes noise uniformly distributed over [-α, α], with α taken randomly from the range (0, 50]; flip(·) denotes a random left-right or up-down flip, each flip mode having probability 1/2; and rot(·) denotes a random clockwise rotation by 90°, 180° or 270°, each rotation angle having probability 1/3.
4. The method for classifying SAR targets based on mixed loss and graph attention of claim 1, wherein the graph attention network module G in step (3c4) is used for training node layer 1 feature set setsIncluding each training node layer 1 feature setTraining node level 1 features inPerforming category prediction, comprising the following steps:
(3c41) Initialize the iteration counter i, with a maximum of 3 iterations, and set i = 1;
(3c42) The edge feature construction module U_E of the i-th graph update layer U constructs edge features from all the training node i-layer features contained in each training node i-layer feature set of the training node i-layer feature group set, obtaining the training edge i-layer feature group set;
wherein each training edge i-layer feature set corresponds to one training node i-layer feature set; v_h^i and v_j^i denote the i-layer features of the h-th and j-th training nodes; the training edge i-layer feature e_hj^i is obtained from the pair of training node i-layer features v_h^i and v_j^i, with abs(·) taking the absolute value of each element, and f_E^i(·) denoting the plurality of first fully-connected layers, stacked sequentially in the above order, of the edge feature construction module U_E comprised in the i-th graph update layer U;
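A minimal numpy sketch of the edge feature step, assuming the common element-wise absolute-difference form e_hj = f_E(abs(v_h − v_j)) and ReLU hidden activations (neither is stated explicitly in the claim); the 96/96/96/96/1 layer widths follow claim 2:

```python
import numpy as np

def edge_feature(v_h, v_j, weights, biases):
    """Edge feature from a node pair: |v_h - v_j| passed through five
    stacked fully-connected layers (hidden ReLU assumed, last layer linear)."""
    x = np.abs(v_h - v_j)
    for k, (w, b) in enumerate(zip(weights, biases)):
        x = w @ x + b
        if k < len(weights) - 1:
            x = np.maximum(x, 0.0)   # ReLU on the four hidden layers
    return x

rng = np.random.default_rng(1)
dims = [48, 96, 96, 96, 96, 1]       # 48-dim node feature assumed
ws = [rng.standard_normal((dims[k + 1], dims[k])) * 0.1 for k in range(5)]
bs = [np.zeros(dims[k + 1]) for k in range(5)]
e = edge_feature(rng.standard_normal(48), rng.standard_normal(48), ws, bs)
```

The final layer has a single neuron, so each node pair yields a scalar edge feature that the next step can normalize into an attention weight.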
(3c43) The attention weight calculation module U_W of the i-th graph update layer U calculates an attention weight for all the training edge i-layer features contained in each training edge i-layer feature set of the training edge i-layer feature group set, obtaining the training i-layer attention weight group set;
wherein each training i-layer attention weight group corresponds to one training edge i-layer feature set; w_hj^i denotes the training i-layer attention weight corresponding to the training edge i-layer feature e_hj^i, obtained by normalizing the edge features over the index j, which ranges over the set of positive integers from 1 to C_test(M − K) with h removed;
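One plausible realisation of the normalization in U_W is a softmax over all node indices except h itself; the claim only specifies the index set, so the softmax form is an assumption here:

```python
import numpy as np

def attention_weights(edge_feats, h):
    """Normalize the scalar edge features of node h into attention
    weights over every other node index (softmax form assumed)."""
    idx = [j for j in range(len(edge_feats)) if j != h]
    scores = np.array([edge_feats[j] for j in idx])
    exp = np.exp(scores - scores.max())   # max-shift for numerical stability
    return dict(zip(idx, exp / exp.sum()))

w = attention_weights(np.array([0.1, 2.0, -1.0, 0.5]), h=1)
```

By construction the weights of each node sum to 1 and carry no self-weight for h, matching the "1 to C_test(M − K) with h removed" index set.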
(3c44) The node feature update module U_N of the i-th graph update layer U updates the node features using the training node i-layer feature group set and the training i-layer attention weight group set, obtaining the training node (i+1)-layer feature group set;
wherein each training node (i+1)-layer feature set corresponds to one training node i-layer feature set; v_h^(i+1) denotes the training node (i+1)-layer feature corresponding to the training node i-layer feature v_h^i; a | b denotes splicing vector b after vector a; and f_N^i(n) denotes inputting n into the second fully-connected layer of the node feature update module U_N comprised in the i-th graph update layer U;
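A sketch of one plausible form of the update, assuming (as in standard graph attention networks) that the "a | b" concatenation joins the node's own feature with the attention-weighted sum of the other nodes' features before the fully-connected layer; the exact aggregation in the claim is not recoverable from the text:

```python
import numpy as np

def update_node(feats, attn, fc, h):
    """Node update v_h' = f_N(v_h | sum_j w_hj * v_j): concatenate the
    node's feature with its attention-weighted neighbourhood aggregate,
    then apply one FC layer (linear here; activation unspecified)."""
    agg = sum(w_hj * feats[j] for j, w_hj in attn.items())
    x = np.concatenate([feats[h], agg])   # the claimed "a | b" splicing
    return fc @ x

rng = np.random.default_rng(2)
feats = rng.standard_normal((4, 8))     # 4 nodes, 8-dim features (assumed)
attn = {0: 0.2, 2: 0.5, 3: 0.3}         # attention weights of node h = 1
fc = rng.standard_normal((24, 16))      # 24 output neurons as in claim 2
v_new = update_node(feats, attn, fc, 1)
```

The output width (24 for the first two layers, C_test for the third) is what lets the third layer's node features be read directly as class scores.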
(3c45) Judge whether i = 3 holds; if so, perform a softmax transformation on the training node 4-layer features contained in each training node 4-layer feature set of the obtained training node 4-layer feature group set, obtaining the set of training prediction result vectors, wherein each training prediction result vector corresponds to one training node 4-layer feature; otherwise, let i = i + 1 and execute step (3c42).
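The final softmax of step (3c45) maps each C_test-dimensional node feature to a class-probability vector; a minimal sketch (C_test = 3 chosen only for illustration):

```python
import numpy as np

def predict(v4):
    """Softmax over a final-layer node feature whose width equals the
    number of test classes C_test, yielding a probability vector."""
    e = np.exp(v4 - v4.max())   # max-shift for numerical stability
    return e / e.sum()

p = predict(np.array([1.0, 3.0, 0.5]))   # C_test = 3 assumed
```

The predicted class is simply the index of the largest probability, and the vector itself feeds the mixed loss during training.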
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110408623.2A CN113095416B (en) | 2021-04-16 | 2021-04-16 | Small sample SAR target classification method based on mixed loss and graph attention |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113095416A true CN113095416A (en) | 2021-07-09 |
CN113095416B CN113095416B (en) | 2023-08-18 |
Family
ID=76677939
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110408623.2A Active CN113095416B (en) | 2021-04-16 | 2021-04-16 | Small sample SAR target classification method based on mixed loss and graph attention |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113095416B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113592008A (en) * | 2021-08-05 | 2021-11-02 | 哈尔滨理工大学 | System, method, equipment and storage medium for solving small sample image classification based on graph neural network mechanism of self-encoder |
CN113655479A (en) * | 2021-08-16 | 2021-11-16 | 西安电子科技大学 | Small sample SAR target classification method based on deformable convolution and double attention |
CN115131580A (en) * | 2022-08-31 | 2022-09-30 | 中国科学院空天信息创新研究院 | Space target small sample identification method based on attention mechanism |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110516561A (en) * | 2019-08-05 | 2019-11-29 | 西安电子科技大学 | SAR image target recognition method based on DCGAN and CNN |
CN111191718A (en) * | 2019-12-30 | 2020-05-22 | 西安电子科技大学 | Small sample SAR target identification method based on graph attention network |
WO2021000906A1 (en) * | 2019-07-02 | 2021-01-07 | 五邑大学 | Sar image-oriented small-sample semantic feature enhancement method and apparatus |
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021000906A1 (en) * | 2019-07-02 | 2021-01-07 | 五邑大学 | Sar image-oriented small-sample semantic feature enhancement method and apparatus |
CN110516561A (en) * | 2019-08-05 | 2019-11-29 | 西安电子科技大学 | SAR image target recognition method based on DCGAN and CNN |
CN111191718A (en) * | 2019-12-30 | 2020-05-22 | 西安电子科技大学 | Small sample SAR target identification method based on graph attention network |
Non-Patent Citations (2)
Title |
---|
He Kai; Feng Xu; Gao Shengnan; Ma Xitao: "Fine-grained image classification algorithm based on multi-scale feature fusion and repeated attention mechanism", Journal of Tianjin University (Science and Technology), no. 10 *
Liu Chen; Qu Changwen; Zhou Qiang; Li Zhi; Li Jianwei: "SAR image target classification based on convolutional neural network transfer learning", Modern Radar, no. 03 *
Also Published As
Publication number | Publication date |
---|---|
CN113095416B (en) | 2023-08-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113095416A (en) | Small sample SAR target classification method based on mixed loss and graph attention | |
CN111199214B (en) | Residual network multispectral image ground object classification method | |
CN103955702B (en) | SAR image terrain classification method based on depth RBF network | |
CN105931253B (en) | A kind of image partition method being combined based on semi-supervised learning | |
CN110309868A (en) | In conjunction with the hyperspectral image classification method of unsupervised learning | |
CN110633708A (en) | Deep network significance detection method based on global model and local optimization | |
CN113486981A (en) | RGB image classification method based on multi-scale feature attention fusion network | |
CN112085059B (en) | Breast cancer image feature selection method based on improved sine and cosine optimization algorithm | |
CN107992891A (en) | Based on spectrum vector analysis multi-spectral remote sensing image change detecting method | |
CN107292341A (en) | Adaptive multi views clustering method based on paired collaboration regularization and NMF | |
CN109446894A (en) | The multispectral image change detecting method clustered based on probabilistic segmentation and Gaussian Mixture | |
CN113095409A (en) | Hyperspectral image classification method based on attention mechanism and weight sharing | |
CN113655479B (en) | Small sample SAR target classification method based on deformable convolution and double attentions | |
CN106446806B (en) | Semi-supervised face identification method based on the sparse reconstruct of fuzzy membership and system | |
CN114169442A (en) | Remote sensing image small sample scene classification method based on double prototype network | |
CN115311478A (en) | Federal image classification method based on image depth clustering and storage medium | |
CN115631396A (en) | YOLOv5 target detection method based on knowledge distillation | |
CN111179272B (en) | Rapid semantic segmentation method for road scene | |
CN113420593B (en) | Small sample SAR automatic target recognition method based on hybrid inference network | |
CN114998688A (en) | Large-view-field target detection method based on YOLOv4 improved algorithm | |
WO2022100607A1 (en) | Method for determining neural network structure and apparatus thereof | |
CN115116539A (en) | Object determination method and device, computer equipment and storage medium | |
CN109063750B (en) | SAR target classification method based on CNN and SVM decision fusion | |
CN111222534A (en) | Single-shot multi-frame detector optimization method based on bidirectional feature fusion and more balanced L1 loss | |
CN116824485A (en) | Deep learning-based small target detection method for camouflage personnel in open scene |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||