Disclosure of Invention
Technical problem to be solved
In order to avoid the defects of the prior art, the invention provides a radar radiation source sorting method based on deep clustering, and aims to solve the problems that the existing radar radiation source sorting method needs manual feature extraction, the sorting accuracy is low, and the like.
Technical scheme
A radar radiation source sorting method based on depth clustering is characterized by comprising the following steps:
step 1: preprocessing a radar radiation source data set to obtain a carrier frequency, a pulse width and a signal time-frequency diagram;
step 2: constructing a convolutional self-encoder network;
the convolutional self-encoder network comprises 16 layers, and the structure of the convolutional self-encoder network sequentially comprises the following layers: first convolution layer → first pooling layer → second convolution layer → second pooling layer → third convolution layer → third pooling layer → Flatten layer → first fully-connected layer → second fully-connected layer → Reshape layer → first upsampling layer → fourth convolution layer → second upsampling layer → fifth convolution layer → third upsampling layer → sixth convolution layer, wherein the first 8 layers constitute the encoder and the second 8 layers constitute the decoder;
the number of convolution kernels in the first convolution layer to the sixth convolution layer is set to be 16, 8, 8, 8, 16 and 1 respectively, and the sizes of the convolution kernels are all set to be 3
Step length is set to 1, the activation function is a ReLU function, the filling mode selects and uses 'same' filling, the first to third pooling layers all adopt a maximum pooling mode, and the size of the pooling area core is set to 2
Step length is set to be 1, the number of neurons of the first full connection layer and the second full connection layer is 32 and 512 respectively, the activation function is a ReLU function, and the size of an up-sampling window of the first up-sampling layer to the third up-sampling layer is 2
2;
And step 3: constructing a fully-connected self-encoder network:
the fully-connected self-encoder network comprises 6 layers, and the structure of the fully-connected self-encoder network is as follows in sequence: the first hidden layer → the second hidden layer → the third hidden layer → the fourth hidden layer → the fifth hidden layer → the output layer, wherein, the first 3 layers compose the encoder, the last three layers compose the decoder;
setting the number of the neurons of the first hidden layer to 8, 8, 16, 8 and 8 respectively, wherein the activation function is a ReLU function, the number of the neurons of the output layer is 2, and the activation function is a sigmoid function;
and 4, step 4: network pre-training:
inputting the signal time-frequency diagram into a convolutional self-encoder network for pre-training to obtain a feature extraction radar radiation source intra-pulse feature extraction encoder network, and inputting the carrier frequency and the pulse width into a fully-connected self-encoder network for pre-training to obtain a radar radiation source inter-pulse feature extraction encoder network;
and 5: feature fusion:
splicing the outputs of the radar radiation source intra-pulse feature extraction encoder network and the inter-pulse feature extraction encoder network corresponding to each sample in the radar radiation source data set to obtain the joint features corresponding to the samples;
step 6: and (3) estimating the number of clusters:
estimating the clustering number of the radar radiation source data set by using a density peak clustering algorithm;
and 7: data clustering:
clustering samples in a radar radiation source data set to obtain a clustering label corresponding to each sample, and defining the clustering label as a label of the sample;
and 8: network fine adjustment:
splicing the outputs of the two encoder networks, inputting the spliced outputs into a full connection layer and an output layer, transforming the networks into a classification network, training the classification network by using a pseudo label supervision network, and realizing fine adjustment of network parameters, wherein the number of neurons of the full connection layer is set to be 32, an activation function is a ReLU function, the number of nodes of the output layer is equal to the number of clusters obtained by estimation in the step 6, and the activation function is a softmax function;
and step 9: and (3) outputting a sorting result:
and (5) repeatedly executing the step (5) to the step (8) for multiple times, and taking the last data clustering result as a final radar radiation source sorting result.
The further technical scheme of the invention is as follows: the data preprocessing in step 1 is specifically as follows:
(1a) intercepting radar radiation source signals within a period of time by using a reconnaissance receiver to obtain a radar radiation source data set;
(1b) measuring the carrier frequency and the pulse width of each radar radiation source signal sample in the radar radiation source data set, and normalizing the carrier frequency and the pulse width of each sample in the radar radiation source data set by using a min-max normalization method;
(1c) and calculating by using short-time Fourier transform to obtain a time-frequency graph of each sample in the radar radiation source data set, and performing min-max normalization and downsampling on the time-frequency graph to obtain a signal time-frequency graph.
The further technical scheme of the invention is as follows: the step 4 is as follows:
inputting the signal time-frequency diagram into a convolutional self-encoder network, iteratively updating the weight of the convolutional self-encoder network for N times by utilizing an Adam algorithm, discarding a decoder part of a trained convolutional self-encoder, and obtaining a pre-trained radar radiation source intra-pulse feature extraction encoder network;
inputting the carrier frequency and the pulse width into a fully-connected self-encoder network, iteratively updating the weight of the network for N times by using an Adam algorithm, discarding a decoder part of the trained fully-connected self-encoder, and obtaining a pre-trained radar radiation source inter-pulse characteristic extraction encoder network.
The further technical scheme of the invention is as follows: and 7, clustering the samples in the radar radiation source data set by adopting a k-means clustering algorithm.
Advantageous effects
In the prior art, the intra-pulse characteristics of radar radiation source signals need to be manually extracted, the prior knowledge such as the known clustering number is needed, and the characteristic mining of the signals is not sufficient. The method uses a self-encoder to extract the intra-pulse and inter-pulse characteristics of radar radiation source signals, fuses the inter-pulse and intra-pulse characteristics to form combined characteristics, and then adopts a strategy of clustering and network fine adjustment to perform network fine adjustment alternately, thereby improving the sorting accuracy; on the other hand, in order to reduce the dependence on the prior knowledge, the invention estimates the clustering number by using a density peak value clustering algorithm, provides input parameter information for a depth clustering algorithm and improves the intelligence of the method. Compared with the prior art, the invention has the following advantages:
firstly, because the neural network is utilized to automatically extract the intra-pulse characteristics of the radar radiation source signals, the artificial design and calculation of the intra-pulse characteristics are avoided, the manual intervention is reduced, and the intelligence and the self-adaptive capability of the invention are improved.
Secondly, the invention utilizes the density peak value clustering algorithm to estimate the clustering number, solves the problem that the k-means clustering needs to preset the clustering number, reduces the prior knowledge required by the invention, and can realize the sorting of unknown radar radiation sources.
Thirdly, because the intra-pulse and inter-pulse characteristics of the radar radiation source are fused to form a combined characteristic, the combined characteristic for sorting simultaneously contains the intra-pulse and inter-pulse information of the radar radiation source, and the sorting accuracy of the radar radiation source can be improved; on the other hand, the invention combines clustering and feature extraction, alternately carries out network fine adjustment and sample clustering, enables the network to learn the features which are easy to distinguish by a clustering algorithm, and improves the sorting accuracy.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
The following describes the implementation steps of the present invention with reference to fig. 1.
Step 1, data preprocessing.
And intercepting radar radiation source signals within a period of time by using a reconnaissance receiver to obtain a radar radiation source data set.
And measuring the carrier frequency and the pulse width of each radar radiation source signal sample in the radar radiation source data set, and normalizing the carrier frequency and the pulse width of each sample in the radar radiation source data set by using a min-max normalization method.
Calculating by using short-time Fourier transform to obtain a time-frequency diagram of each sample in the radar radiation source data set, and performing min-max normalization and downsampling on the time-frequency diagram to obtain a time-frequency diagram with the size of 64
64 signal time-frequency diagram.
The min-max is normalized as follows:
wherein, the data of the radar radiation source after normalization processing is represented, the data of the radar radiation source before normalization processing is represented,
and
respectively representing maximum and minimum operations.
And 2, constructing a convolutional self-encoder network.
A16-layer convolutional self-encoder network is built, and the structure sequentially comprises the following steps: the first convolution layer → the first pooling layer → the second convolution layer → the second pooling layer → the third pooling layer → the Flatten layer → the first fully-connected layer → the second fully-connected layer → the Reshape layer → the first upsampled layer → the fourth convolution layer → the second upsampled layer → the fifth convolution layer → the third upsampled layer → the sixth convolution layer, wherein the first 8 layers constitute the encoder and the second 8 layers constitute the decoder.
The number of convolution kernels in the first convolution layer to the sixth convolution layer is set to be 16, 8, 8, 8, 16 and 1 respectively, and the sizes of the convolution kernels are all set to be 3
Step length is set to 1, the activation function is a ReLU function, the filling mode selects and uses 'same' filling, the first to third pooling layers all adopt a maximum pooling mode, and the size of the pooling area core is set to 2
Step length is set to be 1, the number of neurons of the first full connection layer and the second full connection layer is 32 and 512 respectively, the activation function is a ReLU function, and the size of an up-sampling window of the first up-sampling layer to the third up-sampling layer is 2
2。
The mathematical model of the ReLU function is represented as follows:
wherein,f(x)representing input values of a networkxResponse after the activation function ReLU.
And 3, constructing a fully-connected self-encoder network.
A6-layer full-connection self-encoder network is built, and the structure of the network is as follows in sequence: first hidden layer → second hidden layer → third hidden layer → fourth hidden layer → fifth hidden layer → output layer, wherein, the first 3 layers compose the encoder, and the last three layers compose the decoder.
The number of the neurons of the first hidden layer, the second hidden layer, the third hidden layer and the fifth hidden layer is respectively set to be 8, 8, 16, 8 and 8, the activation function is a ReLU function, the number of the neurons of the output layer is set to be 2, and the activation function is a sigmoid function.
The mathematical model of the ReLU function is represented as follows:
wherein,f(x)representing input values of a networkxResponse after the activation function ReLU.
The mathematical model of the sigmiod function is represented as follows:
wherein,g(z) Representing the response of the input value of the network after passing through the activation function sigmoid,e (。)expressed as natural constantseBottom exponential operation.
And 4, pre-training the network.
Inputting a signal time-frequency diagram of a sample in a radar radiation source data set into a convolutional self-encoder network, iteratively updating the weight of the network 100 times by using an Adam algorithm, discarding a decoder part of a trained convolutional self-encoder, and obtaining a pre-trained radar radiation source intra-pulse characteristic extraction encoder network.
Inputting the carrier frequency and the pulse width of a sample in a radar radiation source data set into a fully-connected self-encoder network, iteratively updating the weight of the network 100 times by using an Adam algorithm, discarding a decoder part of a trained fully-connected self-encoder, and obtaining a pre-trained radar radiation source inter-pulse characteristic extraction encoder network.
The Adam algorithm is as follows:
wherein,
grepresenting loss function at current iteration
The gradient of (a) of (b) is,
a value-assignment operation is represented and,
the gradient operator is represented by a gradient operator,
representing the loss function of the current iteration when the iterative training is carried out on the radar radiation source identification network,
representing the weight of the current iteration when the radar radiation source identification network is iteratively trained,
mrepresents a gradient with an initial value of 0
gIs estimated by the first moment of (a) of (b),
representing the exponential decay rate of the first moment estimate at a value of 0.9,
represents a gradient with an initial value of 0
gIs estimated by the second order moment of (a),
representing the exponential decay rate of the second moment estimate at a value of 0.999,
Tit is shown that the transpose operation,
representing a parameter with a preset value of 0.001,
expressed as a value of 10
-8Is constant.
And 5, fusing the characteristics.
And splicing the outputs of the radar radiation source intra-pulse feature extraction encoder network and the inter-pulse feature extraction encoder network corresponding to each sample in the radar radiation source data set to obtain the joint features corresponding to the samples.
And 6, estimating the number of clusters.
And estimating the clustering number of the radar radiation source data set by using a density peak value clustering algorithm based on the joint characteristics of the samples.
The specific steps of estimating the cluster number of the radar radiation source data sets by using the density peak clustering algorithm are as follows:
firstly, calculating the distance between every two samples in a radar radiation source data set according to the following formula:
wherein, dist: (
f i ,
f j ) Indicating first in radar radiation source data set
iA sample and
jthe distance between the individual samples of the sample,
f i denotes the first
iThe joint feature vector of each sample is then calculated,
f j is shown as
jThe joint feature vector of each sample is then calculated,
f iu is shown as
iThe first of the joint features of the samples
uThe value of the characteristic is compared with the value of the characteristic,
f ju is shown as
jThe first of the joint features of the samples
uThe value of the characteristic is used as the characteristic value,
which means that the square root operation is performed,
it is indicated that the summing operation is performed,
representing an absolute value taking operation;
secondly, arranging the sample distances obtained in the first step in ascending order, and taking the sample distance with the rank less than or equal to the top 2% and the maximum value as a truncation distanced c ;
Thirdly, calculating the local density of each sample according to the following formula:
wherein,
indicating first in radar source data set
iLocal density of samples, F for the radar radiation source data setA joint feature matrix;
fourthly, calculating the center offset distance of each sample according to the following formula:
wherein,
indicating first in radar radiation source data set
iA center offset of a sample;
the fifth step, respectively using the local density
And center offset distance
According to each sample, for the horizontal and vertical coordinates
And
and marking all sample points on the two-dimensional plane to obtain a decision graph of the data set, observing and counting the number of samples distributed at the upper right corner in the decision graph, wherein the number of the samples is an estimated value of the clustering data.
And 7, clustering the data.
Based on the joint characteristics of the samples, clustering the samples in the radar radiation source data set by using a k-means clustering algorithm to obtain a clustering label corresponding to each sample, and defining the clustering label as the label of the sample.
And 8, fine-tuning the network.
And (3) splicing the outputs of the two encoder networks, inputting the spliced outputs into a full connection layer and an output layer, transforming the networks into a classification network, training by using a pseudo label supervision network, and realizing fine adjustment of network parameters, wherein the number of neurons of the full connection layer is set to be 32, an activation function is a ReLU function, the number of nodes of the output layer is equal to the number of clusters obtained by estimation in the step 6, and the activation function is a softmax function.
And 9, outputting a sorting result.
And (5) repeatedly executing the step 5 to the step 8 for multiple times (15 times), and taking the data clustering result of the last time as a final radar radiation source sorting result.
The effect of the present invention is further explained by combining the simulation experiment as follows:
1. simulation conditions are as follows:
the hardware platform of the simulation experiment of the invention is as follows: the processor is Intel (R) core (TM) i9-9980HK, the main frequency is 2.40GHZ, and the memory is 64 GB.
The software platform of the simulation experiment of the invention is as follows: WINDOWS 10 operating system, MATLAB R2018a, Keras.
2. Simulation content and result analysis:
the radar radiation source samples to be sorted used in the simulation experiment are generated by MATLAB simulation, 10 different types of radar signals are simulated together, the specific parameters are shown in Table 1, and the jitter rate of all jitter parameters is 10%. The sampling frequency of the receiving system is 2.5GHz, 500 samples are simulated under the signal-to-noise ratios of 10 dB, 8 dB, 6 dB, 4 dB, 2dB, 0 dB and-2 dB of each type of radiation source, so that 35000 samples are generated in total, and 5000 samples are generated under each signal-to-noise ratio.
Simulation experiment 1: cluster number estimation
The method utilizes sample data generated by simulation to estimate the clustering number of 5000 samples under each signal-to-noise ratio by using density peak clustering. 20 replicates were performed at each signal-to-noise ratio, and the mode of the 20 results was taken as the result of the final cluster number estimation, as shown in table 2 below. As can be seen from Table 2, when the SNR is greater than or equal to 2dB, the cluster number can be correctly estimated, and when the SNR is 0 or-2 dB, the cluster number estimated by the density peak algorithm is 14, and the estimation is wrong. From this, it can be concluded that the number of clusters estimated when the signal-to-noise ratio is high for the density peak clusters is relatively reliable.
TABLE 2 estimation of cluster population at different SNR
Signal to noise ratio/dB
|
-2
|
0
|
2
|
4
|
6
|
8
|
10
|
Number of clusters
|
14
|
14
|
10
|
10
|
10
|
10
|
10 |
Simulation experiment 2: radar radiation source depth clustering sorting under different signal-to-noise ratios
In order to further verify the performance of the sorting method based on the deep clustering, experiments are carried out under the condition of different signal-to-noise ratios, the total number of correctly sorted samples of 10 different types of signals under each signal-to-noise ratio is counted, then the total number of correctly sorted samples of 10 different types of signals under each signal-to-noise ratio is divided by the total number of samples 5000 under each signal-to-noise ratio to obtain the correct rate of radar radiation source sorting under each signal-to-noise ratio, and all calculation results are drawn into a table 3.
TABLE 3 sorting accuracy List under different SNR
Signal to noise ratio/dB
|
10
|
8
|
6
|
4
|
2
|
0
|
-2
|
Accuracy rate
|
94.99%
|
90.51%
|
90.86%
|
87.8%
|
96.65%
|
90.01%
|
84.05% |
As can be seen from Table 1, the invention is less affected by noise, and the sorting accuracy is higher than 80% under any signal-to-noise ratio condition, and the sorting accuracy can reach 84.05% even under the condition of-2 dB. Therefore, the invention effectively inhibits the influence of noise, extracts proper intra-pulse and inter-pulse characteristics and improves the performance of radar radiation source sorting.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications or substitutions can be easily made by those skilled in the art within the technical scope of the present disclosure.