CN113239880A

CN113239880A - Radar radiation source identification method based on improved random forest

Info

Publication number: CN113239880A
Application number: CN202110613814.2A
Authority: CN
Inventors: 武斌; 黄静; 李鹏; 张葵; 王钊; 武佳玥; 荆泽寰; 袁士博
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2021-06-02
Filing date: 2021-06-02
Publication date: 2021-08-10

Abstract

The invention discloses a radar radiation source identification method based on an improved random forest IRFC, which mainly solves the problems of low speed and repeated voting of the traditional random forest algorithm. The implementation scheme is as follows: simulating and generating a radar signal data set by using commercial software; performing feature extraction on the data set, and dividing a training sample set and a testing sample set; the method comprises the steps of improving a traditional random forest classifier, namely obtaining an improved random forest IRFC by eliminating a decision tree with low classification precision and a decision tree which is easy to vote and repeat; training the improved random forest IRFC by using a training set; and sending the test set signals to a trained IRFC network, and outputting radar signal prediction categories. The invention can effectively improve the random forest classification speed, fully extract the radar signal characteristics, improve the signal identification rate and can be used for radar signal identification in a complex electromagnetic environment.

Description

Radar radiation source identification method based on improved random forest

Technical Field

The invention belongs to the technical field of signal processing, and particularly relates to a radar signal identification method which can be used for electronic information reconnaissance, electronic support and threat warning systems.

Background

With the development of the electronic information field, electronic countermeasure plays an important role in electronic information reconnaissance, electronic support and threat alarm systems, and radar radiation source signal identification is an important link in electronic countermeasure.

However, the increasingly complex electromagnetic environment and the increasingly diverse new radar models currently present new challenges to the field of electronic countermeasures. How to effectively utilize the intercepted signals to identify and confirm an individual radar so as to position and track a radiation source; the problems not only put new requirements on the research of radar radiation source individual identification technology, but also play a very important role in subsequent accurate identification.

Wu Gaojie et al in 2016 published a random forest-based radar radiation source individual identification method, which obtains an identification result by sending a constructed pulse multidimensional fine feature vector into a random forest classifier after dimensionality reduction. Liu song et al in "random forest based radar signal intra-pulse modulation recognition" published in journal "telecommunication sciences" 2016 No. 5 fused the shape and texture features of a radar signal time-frequency graph, and sent to a random forest classifier for signal recognition. In addition, as the most typical and most commonly used combined classifier algorithm, the random forest classifier is also applied to other various fields, such as Liujian's model for predicting solar radiation based on random forest' and Liuqian's research for simulating and predicting fruit quality of greenhouse netted melon based on random forest' which all adopt the random forest classifier directly in the classification process. However, in the current big data environment, the direct adoption of the random forest algorithm causes the problems of slow recognition speed and excessively long training time.

Disclosure of Invention

The invention aims to provide a radar radiation source identification method based on an improved random forest aiming at the defects of a traditional random forest classification RFC algorithm in a big data environment so as to improve identification speed and accuracy.

The technical idea of the invention is as follows: by eliminating decision trees with low classification precision and decision trees which are easy to vote repeatedly in the original RFC, the improved random forest IRFC with higher diagnosis speed and higher classification precision is constructed, and the improved random forest IRFC is applied to radar radiation source signal identification, so that the identification accuracy is improved.

According to the above thought, the implementation scheme of the invention comprises the following steps:

1) using MATLAB software to simulate and generate a data set of radar signals, wherein the data set comprises LFM signals containing 9 types of different phase noises, pulse widths, bandwidths and carrier frequencies, and each type of signals respectively generates 1000 signals from 0-20dB every 4dB of signal-to-noise ratio to serve as the data set for experiment;

2) outputting the data set signals generated in the step 1) in a sequence form, and extracting the following characteristics of the data set signals:

performing Morlet wavelet transform on the signal sequence to obtain an envelope component of the signal sequence, and extracting three output characteristics of rising edge time, pulse width and top drop of the envelope;

extracting the bispectrum characteristic of the phase noise by using contour integral on the signal sequence, and calculating the waveform entropy E of the signal sequence according to the bispectrum characteristic_bEntropy of energy E_nAnd singular value entropy E_svdThese three output characteristics;

performing VDM decomposition on the signal sequence to obtain three output characteristics of signal sequence bandwidth, center frequency and Lagrange multiplier;

3) synthesizing the output characteristics into a nine-dimensional characteristic vector matrix, taking the characteristic vector matrix of each signal sequence and the signal category of each signal sequence as one piece of data, synthesizing a new data set, randomly extracting 800 samples from each signal-to-noise ratio of each signal in the data set as a training set, and taking the remaining 200 samples as a test set;

4) the RFC of the random forest classifier is improved:

4a) training an original RFC model by using a training set, inputting a test set into the trained RFC, setting a precision threshold q, evaluating the classification precision of each decision tree, eliminating the decision trees with the classification precision lower than the set precision threshold q, and obtaining a sub-forest comprising w decision trees;

4b) traversing w decision trees of the sub-forest to obtain all path information of the sub-forest;

4c) calculating similarity S between every two decision trees in the son forest through path information_abConstructing a similarity matrix M;

4d) setting a similarity threshold c, comparing the similarity S of each row in the similarity matrix M with the similarity threshold c, and classifying the decision trees in the similarity matrix M to obtain M categories of the decision trees;

4e) selecting a decision tree with the highest classification precision from each category and combining to obtain an improved random forest IRFC model;

5) sampling the training set by a bootstrap sampling method to obtain a training subset, and inputting the training subset into an improved random forest IRFC model for model training; after each round of training, carrying out classification accuracy evaluation on the obtained model by using data which are not sampled in the training set, and stopping training when the classification accuracy reaches an expected value to obtain a trained IRFC;

6) inputting the test set data into the trained IRFC, outputting the predicted LFM signal class C of each test data_iAnd obtaining the identification result of the radiation source signal.

The invention has the following advantages:

1) according to the invention, as the random forest classification method is improved, the decision tree which is easy to vote and repeat is eliminated, the problem of decision tree voting and repeating in the traditional random forest algorithm under a big data environment is solved, and the radar radiation source signal identification speed is effectively improved;

2) according to the method, the problem of low accuracy of traditional random forest recognition in a big data environment is solved by eliminating the decision tree with low classification accuracy, and the accuracy of radar radiation source signal recognition is effectively improved.

Drawings

FIG. 1 is a flow chart of the overall implementation of the present invention.

FIG. 2 is a graph of simulation results of the recognition accuracy of the present invention.

Detailed Description

Embodiments and effects of the present invention will be described in detail below with reference to the accompanying drawings.

Referring to fig. 1, the implementation of this embodiment includes the following steps:

step 1: a radar signal data set is generated.

The data set comprises LFM pulse signals containing 9 different phase noises, pulse widths, bandwidths and carrier frequencies, and the simulation steps are as follows:

1.1) setting the pulse width of an ideal emission signal of a radiation source to be 10us, the bandwidth to be 10MHz, the carrier frequency to be 10GHz and the sampling frequency to be 100 MHz;

1.2) adding three different phase noises S1, S2, S3 to the ideal transmission signal, the phase coefficients of which are set as shown in table 1, resulting in LFM signals carrying different phase noises.

TABLE 1 three phase noise phase modulation factor settings

Frequency offset/Hz	1K	10K	100K	500K	1000K
						S1 phase coefficient	1	0.1	0.01	0.001	0.0001
S2 phase coefficient	0.5	0.08	0.008	0.002	0.0001
						S3 phase coefficient	0.8	0.2	0.07	0.004	0.006

1.3) respectively passing a first LFM signal containing phase noise S1 through a first Butterworth filter F1, passing a second LFM signal containing phase noise S2 through a second Butterworth filter F2, and passing a third LFM signal containing phase noise S3 through a third Butterworth filter F3 to obtain three LFM signals containing phase noise passing through the Butterworth filters, wherein the three LFM signals are marked as E1, E2 and E3;

wherein the parameter settings of three different butterworth filters are shown in table 2:

TABLE 2 Butterworth filter parameter settings

	F1	F2	F3
				Sampling frequency/Hz	20000	30000	30000
Cut-off frequency/Hz	200	150	200

1.4) adding noises with signal-to-noise ratios of 0dB, 4dB, 8dB, 12dB, 16dB and 20dB into three LFM signals E1, E2 and E3 containing phase noise and passing through the Butterworth filters, so that each LFM signal containing phase noise and passing through the Butterworth filters respectively generates 1000 samples at each signal-to-noise ratio point, and the total number of the samples is 18000;

1.5) changing the pulse width to 7us, the bandwidth to 5MHz and the carrier frequency to 8GHz, and repeating the processes from 1.2) to 1.4) to obtain 18000 samples of three new different signals E4, E5 and E6 under different signal-to-noise ratios;

1.6) changing the pulse width to 15us, the bandwidth to 20MHz and the carrier frequency to 12GHz, repeating the processes from 1.2) to 1.4) and obtaining 18000 samples of three new signals E7, E8 and E9 under different signal-to-noise ratios;

1.7) the samples obtained in 1.4), 1.5), 1.6) were combined together as a data set, resulting in a total of 54000 analog signal samples for 9 different types of signals at 6 different signal-to-noise ratios.

Step 2: features of the data set signals are extracted.

2.1) outputting the generated data set signals in a sequence form, and performing Morlet wavelet transform on the signal sequence to obtain an envelope component of the signal sequence:

where a is a scale factor, b is a panning factor, t denotes time, s (t) denotes the input signal sequence, ψ^*Representing the conjugate function of a Morlet wavelet function, WT_ψs(a, b) is the result of Morlet wavelet transform of the signal sequence;

2.2) extracting three output characteristics of rising edge time, pulse width and top drop of the envelope component:

2.2.1) setting the time corresponding to the rising edge of the envelope component reaching the maximum amplitude of the envelope component by 10 percent as the starting time t of the rising edge₁(ii) a Setting the time corresponding to the rising edge of the envelope component reaching 90% of the maximum amplitude of the envelope component as the time t of the rising edge ending₂The rising edge time t of the envelope component is obtained_r＝t₂-t₁；

2.2.2) setting the time corresponding to the rising edge of the envelope component reaching 50% of the maximum amplitude of the envelope component as the measurement start time t of the pulse width₃(ii) a Setting the time corresponding to the falling edge of the envelope component falling to 50% of the maximum amplitude of the envelope component as the ending time t of the pulse width measurement₄The pulse width tau of the envelope component is obtained as t₄-t₃；

2.2.3) the time corresponding to the first arrival of the envelope component at 90% of the maximum amplitude of the envelope component is denoted t₅The envelope component reaches 90% of the maximum amplitude of the envelope component for the last timeThe corresponding time is denoted as t₆Calculating t₅And t₆The variance of the top amplitude of the envelope component between the two to obtain an envelope top drop TD;

2.3) extracting the bispectrum characteristic of the phase noise by using contour integral on the signal sequence, and calculating the waveform entropy E of the signal sequence according to the bispectrum characteristic_bEntropy of energy E_nAnd singular value entropy E_svdThese three output characteristics:

p_i＝|r_i|/||R||

p_ij＝|b(i,j)|/||B||

wherein the content of the first and second substances,

represents the sum of the bispectral estimates over all the integration paths, L represents the number of integration paths, | r_iL represents the sum of the bispectral estimates over each round of the integration path;

b (i, j) represents the sum of all elements in each row of the bispectral matrix of each bispectral value on the integration path, and I, J represents the row and column number of the bispectral matrix respectively;

β_i(i ═ 1, 2., N) is the singular value of the ith bispectrum estimation result, and N is the number of bispectrum singular value estimates;

2.4) carrying out VDM decomposition on the signal sequence to obtain V intrinsic mode function components;

2.5) obtaining Signal sequence Bandwidth

Center frequency omega_kAnd lagrange multiplier

Three output characteristics:

2.5.1) initializing time domain Bandwidth

Center frequency

Time domain Lagrange multiplier lambda¹(t), a secondary penalty factor alpha, a noise tolerance zeta and an iteration number N, and a precision epsilon is set;

2.5.2) versus frequency domain bandwidth

And center frequency omega_kUpdating, namely updating the formula as follows:

wherein

Is the time domain signal bandwidth at the nth +1 iteration of the kth eigenmode function component,

is the ith eigenmode function component after VDM decomposition,

is a sequence of signals that are input to the device,

is a time domain lagrangian multiplier;

respectively correspond to

Fourier transform of (1);

2.5.3) updating the frequency domain Lagrange multiplier

The update formula is as follows:

wherein the content of the first and second substances,

is the frequency domain lagrangian multiplier iterated the (n + 1) th time,

is the frequency domain lagrangian multiplier for the nth iteration;

2.5.4) calculating the frequency domain bandwidth of the n +1 th iteration of each eigenmode function component

Bandwidth of frequency domain with nth iteration

The modulus of the difference is then summed over V moduli, denoted as Y, i.e.

Comparing Y with precision ε:

if Y > ε and N < N, repeat 2.5.2) to 2.5.4);

if Y is less than or equal to epsilon, the process is completedIterating to obtain frequency domain bandwidth

Center frequency omega_kFrequency domain lagrange multiplier

Three features.

And step 3: a training set and a test set are obtained.

Synthesizing the features output in 2.2), 2.3) and 2.5) into a nine-dimensional feature vector matrix, and synthesizing a new data set by using the feature vector matrix of each signal sequence and the signal category to which the feature vector matrix belongs as a piece of data;

800 samples are randomly drawn from the data set at each signal-to-noise ratio for each type of signal as a training set, leaving 200 samples as a test set.

And 4, step 4: and improving the RFC of the random forest classifier.

4.1) carrying out bootstrap sampling for r times in a training set to obtain r training subsets, and forming a feature subset for each training subset by randomly selecting features;

4.2) recursively executing the following operations on each node from the root node according to the training subset and the feature subset to generate a decision tree to form a random forest:

4.2.1) corresponding each tangent point a of each feature A in the feature subset according to the training subset D of the current node O_iThe training subset D is arranged at each tangent point a_iAre all divided into₁And D₂Two subsets;

calculating all tangent points a of each feature A_iCoefficient of kini of

Wherein the content of the first and second substances,

a being of feature A_iPoint cutting; gini (D)₁)＝2p₁(1-p₁) Representing a first training subset D₁Coefficient of kini of (p)₁Is D₁In signal class C_iThe probability of (d); gini (D)₂)＝2p₂(1-p₂) Representing a second training subset D₂Coefficient of kini of (p)₂Is D₂In signal class C_iThe probability of (d);

4.2.2) from all the characteristics A and their possible values tangent point a_iIn the method, the tangent point with the smallest kini coefficient is taken as the optimal tangent point, and the characteristic of the tangent point is the optimal characteristic. Dividing the current node O into two sub-nodes according to the optimal characteristics;

4.2.3) repeating the steps 4.2.1) and 4.2.2) on the obtained two child nodes, and dividing the child nodes;

4.2.4) repeating 4.2.1) to 4.2.3) until all nodes are leaf nodes, and completing the construction of the decision tree;

4.2.5) performing operations from 4.2.1) to 4.2.4) on the r training subsets to obtain r decision trees, and combining the r decision trees to form a random forest RFC model;

4.3) training the acquired random forest RFC model:

4.3.1) sampling the training set by a bootstrap sampling method to obtain a new training subset b_i；

4.3.2) new training subset b_iInputting the random forest RFC model to carry out model training to obtain a currently trained random forest RFC model;

4.3.3) carrying out classification accuracy evaluation on the trained random forest RFC model by using data which is not sampled in the training set:

when the classification accuracy rate does not reach the expected value, returning to the step 4.3.1);

and when the classification accuracy reaches an expected value, stopping training to obtain a trained random forest RFC model.

4.4) inputting the test set into the trained RFC model, setting a precision threshold q, evaluating the classification precision of each decision tree, eliminating the decision trees with the classification precision lower than the set precision threshold q, and obtaining a sub-forest comprising w decision trees;

4.5) traversing w decision trees of the sub-forest to obtain all path information of the sub-forest;

4.6) calculating the similarity S between every two decision trees in the son forest according to the path information and whether the root nodes between every two decision trees are the same or not_ab：

If decision tree DT_aAnd decision tree DT_bIf the root nodes are different, the similarity of the two trees is 0;

if decision tree DT_aAnd decision tree DT_bIf the root nodes are the same, the similarity S is calculated by the following formula_ab：

Wherein S is_abIs decision tree DT_aAnd decision tree DT_bSimilarity between, L_iIs decision tree DT_aThe ith path and decision tree DT_bIs the cosine similarity between each path, l is the decision tree DT_aTotal number of paths of, MaxSim_iIs DT_aThe ith path pair DT_bMaximum similarity of paths;

4.7) constructing a similarity matrix M by similarity as follows:

4.8) setting a similarity threshold c, comparing the similarity of each row in the similarity matrix M with the similarity threshold c, and classifying the decision tree in the similarity matrix M:

firstly, classifying decision trees with the similarity of the first row exceeding a threshold value c in a similarity matrix M into one class, and then determining a decision tree DT of the ith row_iWhether the classification is a certain class, i is more than or equal to 2 and less than or equal to w: if so, skipping the row; whether or notThen, classifying the decision trees with the row similarity exceeding a threshold value c into one class;

after completing the similarity comparison of w rows in the similarity matrix M, dividing the decision tree into M categories;

4.9) selecting the decision tree with the highest classification precision from each category, and combining the decision tree as an improved random forest IRFC model.

And 5: and obtaining a new training subset to train the improved random forest IRFC model.

5.1) sampling the training set by a bootstrap sampling method to obtain a final training subset e_i；

5.2) final training subset e_iInputting the model into an improved random forest IRFC model for model training to obtain the currently trained improved random forest IRFC model;

5.3) carrying out classification accuracy evaluation on the trained improved random forest IRFC model by using data which is not sampled in the training set:

when the classification accuracy rate does not reach the expected value, returning to the step 5.1);

and when the classification accuracy reaches an expected value, stopping training to obtain the trained improved random forest IRFC.

Step 6: inputting the test set data into a trained modified random forest IRFC, and outputting the predicted LFM radiation source signal class C of each test data x_iAnd obtaining the identification result of the radiation source signal.

Then for each test data x, the output result is determined by each decision tree together, and the expression is as follows:

wherein, C_iIndicates the class value of the output, h (x) ═ C_iIndicates that the predicted result is C_iN represents the total number of categories, and m is the number of decision trees.

The effects of the present invention can be further illustrated by the following simulations.

1. Simulation conditions are as follows:

the hardware tools are as follows: the commercial computer and the chip are an Intel Core i5-6500 processor, the main frequency is 3.20GHz, the memory is 8GB, and the hard disk is 1 TB; operating the system: windows 7; developing a tool: matlab 2014a, spyder 3.3.6.

2. Simulation content:

and (3) respectively inputting the nine-dimensional feature vector matrix in the step (3) into the improved random forest IRFC classifier and the conventional KNN classifier, support vector machine SVM classifier, decision tree DT classifier and random forest RFC classifier to obtain the signal type identification accuracy of the classifiers on the radiation source signal types in the step (1) under different signal to noise ratios, as shown in the attached figure 2.

As can be seen from FIG. 2, the accuracy of the radiation source signal identification of the invention is obviously higher than that of other classifiers, and the accuracy of the radiation source signal identification of the invention is increased along with the increase of the signal-to-noise ratio, so that a good identification effect can be achieved.

The foregoing description is only an example of the present invention and is not intended to limit the present invention, and it will be apparent to those skilled in the art that modifications and variations in form and detail may be made without departing from the spirit and structure of the invention, but these modifications and variations are within the scope of the invention as defined in the appended claims.

Claims

1. The radar radiation source identification method based on the improved random forest is characterized by comprising the following steps:

4) the RFC of the random forest is improved:

4a) training an original random forest RFC model by using a training set, inputting a test set into the trained random forest RFC, setting a precision threshold q, evaluating the classification precision of each decision tree, eliminating the decision trees with the classification precision lower than the set precision threshold q, and obtaining a sub-forest comprising w decision trees;

5) sampling the training set by a bootstrap sampling method to obtain a final training subset, and inputting the training subset into an improved random forest IRFC model for model training; after each round of training, carrying out classification accuracy evaluation on the obtained model by using data which are not sampled in the training set, and stopping training when the classification accuracy reaches an expected value to obtain a trained IRFC;

2. The method according to claim 1, wherein the LFM signals of 9 different phase noise, pulse width, bandwidth and carrier frequency in 1) are respectively set as follows:

1a) setting the pulse width of an ideal emission signal of a radiation source to be 10us, the bandwidth to be 10MHz, the carrier frequency to be 10GHz and the sampling frequency to be 100 MHz;

1b) adding three different phase noises S1, S2 and S3 to an ideal transmitting signal to obtain an LFM signal carrying the different phase noises;

1c) the LFM signal with phase noise S1 is passed through a first butterworth filter F1, the LFM signal with phase noise S2 is passed through a second butterworth filter F2, and the LFM signal with phase noise S3 is passed through a third butterworth filter F3. Obtaining three LFM signals containing phase noise and passing through the Butterworth filter, and recording the LFM signals as E1, E2 and E3;

1d) noise with signal-to-noise ratios of 0dB, 4dB, 8dB, 12dB, 16dB and 20dB is added into three LFM signals E1, E2 and E3 which pass through the Butterworth filter, so that each LFM signal which passes through the Butterworth filter and contains phase noise respectively generates 1000 samples at each signal-to-noise ratio point, and 18000 samples are recorded;

1e) changing the pulse width to be 7us, the bandwidth to be 5MHz and the carrier frequency to be 8GHz, repeating the processes 1b) to 1d) to obtain 18000 samples of three new different signals E4, E5 and E6 under different signal-to-noise ratios;

1f) changing the pulse width to be 15us, the bandwidth to be 20MHz and the carrier frequency to be 12GHz, repeating the processes 1b) to 1d), and obtaining 18000 samples under different signal-to-noise ratios of three new signals E7, E8 and E9;

1g) the samples obtained in 1d), 1e), 1f) were combined together as a data set, resulting in 54000 analog signal samples of 9 different types of signals at 6 different signal-to-noise ratios.

3. The method as claimed in claim 2, wherein the phase noise phase modulation coefficients of three different phase noises S1, S2, S3 in 1b) are set as follows:

the phase modulation coefficients of the first phase noise S1 are 1, 0.1, 0.01, 0.001 and 0.0001 when the frequency offsets of the ideal transmitting signal are 1KHz, 10KHz, 100KHz, 500KHz and 1000KHz, respectively;

the phase modulation coefficients of the second phase noise S2 are 0.5, 0.08, 0.008, 0.002, and 0.0001 when the frequency offsets of the ideal transmission signal are 1KHz, 10KHz, 100KHz, 500KHz, and 1000KHz, respectively;

the phase modulation coefficients of the third phase noise S3 are 0.8, 0.2, 0.07, 0.004, and 0.006 when the frequency offsets of the ideal transmission signal are 1KHz, 10KHz, 100KHz, 500KHz, and 1000KHz, respectively.

4. Method according to claim 2, characterized in that the three different butterworth filters F1, F2, F3 parameters in 1c) are set as follows:

the sampling frequency of the first butterworth filter F1 is 20000Hz, and the cut-off frequency is 200 Hz;

the sampling frequency of the second butterworth filter F2 is 30000Hz, and the cut-off frequency is 150 Hz;

the sampling frequency of the third butterworth filter F3 is 30000Hz and the cut-off frequency is 200 Hz.

5. The method of claim 1, wherein the Morlet wavelet transform is performed on the signal sequence in 2) and has the following formula:

where a is a scale factor, b is a translation factor, t denotes time, s (t) denotes the input signal sequence, #^*Represents MorleConjugate function of t wavelet function, WT_ψsAnd (a, b) is the result of Morlet wavelet transform of the signal sequence.

6. The method according to claim 1, wherein the waveform entropy E of the signal sequence in 2) is calculated according to the bispectral features_bEntropy of energy E_nAnd singular value entropy E_svdThe three output characteristics are expressed as follows:

wherein the content of the first and second substances,

| R | | represents the sum of the bispectrum estimates over all the integration paths, L represents the number of integration paths, | R_iL represents the sum of the bispectral estimates over each round of the integration path;

b (i, j) represents a bispectrum matrix of each bispectrum value on the integration path, I, J is the row and column number of the bispectrum matrix respectively;

β_iand (i ═ 1, 2., N) is a singular value of the ith bispectrum estimation result, and N is the number of bispectrum singular value estimates.

7. The method of claim 1, wherein the 4c) calculating similarity between every two decision trees in the forest_abThe formula is as follows:

wherein S is_abIs decision tree DT_aAnd decision tree DT_bSimilarity between, L_iIs decision tree DT_aThe ith path and decision tree DT_bIs the cosine similarity between each path, l is the decision tree DT_aTotal number of paths of, MaxSim_iIs DT_aThe ith path pair DT_bThe maximum similarity of the paths of (1).

8. The method according to claim 1, wherein the decision tree in the similarity matrix M in 4d) is classified as follows:

classifying decision trees with the similarity of the first row in the similarity matrix M exceeding a threshold value c into one category;

determining the decision Tree DT of the first line_iWhether the classification is a certain class, i is more than or equal to 2 and less than or equal to w: if so, skipping the row; otherwise, classifying the decision trees with the row similarity exceeding a threshold value c into one class;

and after the similarity comparison of w rows in the similarity matrix M is completed, dividing the decision tree into M categories.

9. The method of claim 1, wherein 6) outputs the predicted LFM signal class for each test datum, which is determined by all decision trees in the IRFC model, as follows:

wherein, C_iDenotes the predicted LFM signal class, h (x) ═ C_iIndicates that the predicted result is C_iM is the number of decision trees in the IRFC model, and N represents the total number of signal categories.