CN113840297A

CN113840297A - Frequency spectrum prediction method based on radio frequency machine learning model drive

Info

Publication number: CN113840297A
Application number: CN202111169587.5A
Authority: CN
Inventors: 周福辉; 丁锐; 徐铭; 袁璐; 吴雨航; 吴启晖; 董超; 黄洋
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing Xingpuzhi Information Technology Co ltd
Priority date: 2021-10-08
Filing date: 2021-10-08
Publication date: 2021-12-24
Anticipated expiration: 2041-10-08
Also published as: CN113840297B

Abstract

The invention discloses a frequency spectrum prediction method based on radio frequency machine learning model driving, which comprises the following steps: s1, collecting frequency spectrum data, and preprocessing the collected frequency spectrum data; s2, determining the step length M of the input data for the order of the autoregressive model according to the Chichi information criterion; s3, expanding the linear combination process of the autoregressive model into an M-layer network structure, and introducing new trainable parameters into the M-layer network structure to construct an M-layer frequency spectrum prediction network model driven by a radio frequency machine learning model; s4, training the spectrum prediction network model by using the training set data; s5, judging whether training is finished or not, if so, inputting the test set data into the trained frequency spectrum prediction network model, outputting a prediction result, and ending the process; if not, after adding one to the training iteration number, returning to the step S4 until the maximum iteration number is reached. The invention not only endows the network with interpretability, improves the prediction performance, but also accelerates the convergence speed of the network.

Description

Frequency spectrum prediction method based on radio frequency machine learning model drive

Technical Field

The invention relates to the technical field of communication, in particular to a frequency spectrum prediction method based on radio frequency machine learning model driving.

Background

The spectrum prediction is a vital technology in a cognitive radio network, and the use state of a future spectrum is deduced by learning the rule among historical spectrum data, so that a large amount of time and energy consumed in the spectrum sensing process are reduced, and the spectrum sensing efficiency is improved. Meanwhile, the spectrum prediction is helpful for improving the performance of spectrum management and control, and is an essential part in spectrum decision. Existing spectrum prediction methods can be divided into two categories, model-based prediction methods and data-based prediction methods. Model-based methods, such as autoregressive models, are based on the ideal assumption that data at the current point in time can be predicted from a linear combination of past data, which is often impractical in real scenarios, and therefore can lead to poor prediction performance, especially at low signal-to-noise ratios.

In recent years, data-driven methods have attracted attention because of their powerful feature learning capabilities. The differential moving average autoregressive model, the time-delay neural network and the long-short term memory neural network are compared in a published paper of "Experimental results on the prediction of memory in network for spectrum prediction in mobile radio bases" (IEEE trans. Cogn. Commun. Net., vol.6, No.2, pp.771-782,2020) of Ozyegen O, Mohammadja S, Kavurmacolog E and the like, and experiments prove that the data-driven method obtains higher precision than the model-driven prediction method. But network parameters are difficult to determine when addressing specific prediction problems, as the selection of these parameters is not based on a specific theory. Yu L, Chen J, Ding G et al, in its published paper, "Spectrum Prediction Based on Taguchi Method in Deep Learning With Long Short-Term Memory" (IEEE Access, vol.6, pp.45923-45933,2018), analyzed the predicted performance under different network configurations and analyzed the effect of network hyper-parameters on the predicted performance. Meanwhile, experiments prove that the long-term and short-term memory neural network can obtain better prediction performance than the traditional multilayer perceptron. A cross-band spectrum prediction method based on deep migration learning is disclosed in a patent application 'a cross-band spectrum prediction method based on deep migration learning' (application number CN201910995721.3 application publication number CN110730046A) proposed by China people liberation army engineering university. The method utilizes the similarity of each channel between frequency bands, and digs out the internal rules and the association among the frequency spectrum data through historical frequency spectrum data measured on the channels of other frequency bands, and migrates to the current frequency band so as to predict the frequency spectrum state of the current frequency band at the future moment, thereby solving the problem of insufficient training data to a certain extent, but the migration learning method has more limitations. Firstly, a pre-trained model is needed which already achieves good predictive performance on a large data set, and secondly, the prediction problem aimed at by the pre-trained model must have high similarity with the problem to be solved, which has certain limitations in practical application.

These data-only approaches all assume that there is sufficient tag data to complete network training, which is not possible in practical communication scenarios. The reason for this is that it is difficult to acquire a large amount of high-quality tag data in practice. In addition, the convergence rate of the algorithm is low, and real-time spectrum prediction is difficult to realize, so that the prediction performance cannot accurately reflect the use state of the real spectrum, and the real-time requirement of a future wireless communication network cannot be met. Therefore, there is a need to develop new spectrum prediction schemes.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a frequency spectrum prediction method based on the driving of a radio frequency machine learning model, and not only is network interpretability endowed, but also the prediction performance is improved and the convergence speed of the network is accelerated by introducing the knowledge in the communication field and providing a lightweight design method.

In order to achieve the purpose, the invention adopts the following technical scheme:

a spectrum prediction method based on radio frequency machine learning model driving, the spectrum prediction method comprises the following steps:

s1, collecting frequency spectrum data, and preprocessing the collected frequency spectrum data;

s2, determining the step length M of the input data for the order of the autoregressive model according to the Chichi information criterion;

s3, expanding the linear combination process of the autoregressive model into an M-layer network structure, and introducing new trainable parameters into the M-layer network structure to construct an M-layer frequency spectrum prediction network model driven by a radio frequency machine learning model;

s4, training the spectrum prediction network model by using the training set data;

s5, judging whether training is finished or not, if so, inputting the test set data into the trained frequency spectrum prediction network model, outputting a prediction result, and ending the process; if not, the training iteration number is increased by one, and the process returns to the step S4.

In order to optimize the technical scheme, the specific measures adopted further comprise:

further, in step S1, the process of collecting the spectrum data includes:

when the master user is busy, the received signal is a fading signal subjected to channel fading and additive white Gaussian noise interference, x_t(n)＝h_t(n)s_t(n)+ε_t(n); wherein s is_t(n) denotes an nth sampling point, x, of a signal transmitted at the t-th time slot_t(n) represents the nth sample point of the received signal in the t-th time slot, h (n) represents the channel fading function, epsilon (n) represents the mean value of 0 and the variance of sigma²Additive white gaussian noise of (1);

when the primary user is idle, the received signal contains only noise, x_t(n)＝ε_t(n)。

Further, in step S1, the preprocessing of the collected spectrum data includes the following steps:

and S11, calculating the power value of the spectrum signal by adopting the following formula:

wherein N represents the number of signal sampling points in each time slot, | · | represents the modulo operation, x (t) is the spectral power data obtained for t time slot, x_t(n) represents an nth sampling point of a received signal at a t-th slot;

s12, normalizing the frequency spectrum signal by adopting the following formula:

where x (t) represents the spectral data at time t, x_minRepresents the minimum value, x, of all spectral data_maxRepresenting the maximum of all spectral data.

Further, in step S2, the process of determining the step size M of the input data according to the akachi pool information criterion for the auto-regressive model includes the following steps:

s21, setting the initial model order p to 1 for the p-order autoregressive model;

s22, calculating and recording the AIC value of the p-th order autoregressive model according to the following formula:

y_AIC＝-2ln(L)+2k

wherein L represents the maximum likelihood function of the model, k is the number of variables in the model, and y_AICRepresenting the corresponding AIC value of the model;

s23, let the model order p be p +1, return to step S22, recalculate and record the AIC value until p is p_maxSo far, proceed to step S24;

and S24, screening out the model order corresponding to the minimum AIC value as the step length M of the input data.

Further, p is_maxIs 10.

Further, in step S3, the process of developing the linear combination process of the autoregressive model into an M-layer network structure includes the following steps:

expanding the calculation process of the autoregressive structure into a multi-layer network in deep learning, taking linear operation of each order as one layer of the network, and training by a deep learning method to obtain optimized network parameters;

the m-th network computing process comprises the following steps:

in the formula, x_T+m-MRepresenting the incoming data for the m-th layer network,

weight parameter of input data of mth layer, M is total number of network layers, T is current time slot, y_mIndicating the output data of the m-th layer network.

Further, in step S3, the process of introducing new trainable parameters into the M-layer network structure to construct an M-layer spectrum prediction network model driven by the radio frequency machine learning model includes the following steps:

for the mth layer, the calculation formula is transformed into:

where T denotes the current time slot, x_T+m-MInput data representing a network of m-th layers, y_mOutput data representing an m-th layer network;

and alpha_mTwo learnable parameters representing the m-th tier network,

is weighting of current input dataParameter, α_mThe weighting parameter is output by the upper layer network and is used for selectively forgetting the past partial output; the nonlinear activation function σ (-) is used to avoid linear input data combinations.

Further, in step S4, the process of training the spectrum prediction network model by using the training set data includes the following steps:

s41, randomly initializing trainable parameters of the network according to a monotone decreasing rule;

s42, setting the maximum iteration times and the learning rate, and taking an Adam optimization algorithm as a network training optimizer;

s43, making the iteration time epoch equal to 1;

s44, inputting the training data into the spectrum prediction network model in batches for training, and reversely propagating the training error of each batch to optimize the network parameters; the error loss function adopts the most common mean square error loss function, and the calculation formula is as follows:

wherein, y_iRepresenting the true result of the ith set of samples,

the prediction result of the ith group of samples is shown, and Q represents the total number of samples.

And S45, when all the data of all the batches in the training data are completely propagated reversely, making epoch equal to epoch +1, and returning to the step S44 until the epoch reaches the maximum iteration number.

The invention discloses a frequency spectrum prediction method based on radio frequency machine learning model driving, which mainly solves the problems of low prediction accuracy, poor real-time performance, network lack of interpretability and the like of the existing frequency spectrum prediction method. The method comprises the following implementation steps: collecting frequency spectrum data; preprocessing frequency spectrum data; determining the order of the model according to the Chichi information criterion so as to determine the step length of input data; expanding the linear combination structure of the autoregressive model into a multilayer network; adding new trainable parameters in a network to construct a frequency spectrum prediction framework driven by a radio frequency machine learning model; training a network model by using the training data; judging whether the network training is finished; inputting the test set data into a network; and outputting a prediction result. The invention introduces the knowledge in the communication field, overcomes the black box effect of the traditional deep learning network and increases the interpretability of the network; the network structure is simple, trainable parameters are few, the network training speed is accelerated, and the method can be applied to actual communication scenes; compared with pure model driving and pure data driving methods, the method improves the spectrum prediction performance.

The invention has the beneficial effects that:

first, the invention introduces an autoregressive structure as expert knowledge of a new prediction network framework, overcomes the black box effect of the network and increases the interpretability of the network compared with the traditional deep learning framework.

Second, the invention designs a network framework in which the network elements of each layer contain only two trainable parameters. Compared with a pure data driving method, the method greatly reduces the network scale, improves the training speed of the network, and provides guarantee for the real-time requirement of the actual communication scene.

Thirdly, compared with a pure model-driven method, the method does not need ideal assumption in the model, improves the spectrum prediction performance by utilizing the method based on the radio frequency machine learning model drive, and can be applied to a future actual wireless communication system.

Drawings

Fig. 1 is a flowchart of a spectrum prediction method based on radio frequency machine learning model driving according to an embodiment of the present invention.

FIG. 2 is a diagram of a predictive network framework according to an embodiment of the invention.

Fig. 3 is a comparison graph of prediction accuracy of the spectrum prediction method according to the embodiment of the present invention and other existing spectrum prediction methods under different snr conditions.

Fig. 4 is a graph comparing the training convergence rates of the spectrum prediction method according to the embodiment of the present invention and other existing spectrum prediction methods.

Detailed Description

The present invention will now be described in further detail with reference to the accompanying drawings.

It should be noted that the terms "upper", "lower", "left", "right", "front", "back", etc. used in the present invention are for clarity of description only, and are not intended to limit the scope of the present invention, and the relative relationship between the terms and the terms is not limited by the technical contents of the essential changes.

Fig. 1 is a flowchart of a spectrum prediction method based on radio frequency machine learning model driving according to an embodiment of the present invention. Referring to fig. 1, the spectrum prediction method includes the steps of:

and S1, collecting the frequency spectrum data and preprocessing the collected frequency spectrum data.

And S2, determining the step length M of the input data for the autoregressive model according to the Chichi information criterion.

And S3, expanding the linear combination process of the autoregressive model into an M-layer network structure, and introducing new trainable parameters into the M-layer network structure to construct the M-layer frequency spectrum prediction network model driven by the radio frequency machine learning model.

And S4, training the spectrum prediction network model by using the training set data.

The foregoing spectrum prediction method is further described with reference to the drawings.

Step 1, collecting frequency spectrum data.

Spectral prediction requires decisions to be made in two cases based on samples of the received signal.

First, when the primary user is busy, the received signal is a fading signal subjected to channel fading and additive white gaussian noise interference, x_t(n)＝h_t(n)s_t(n)+ε_t(n) of (a). Wherein s is_t(n) denotes an nth sampling point, x, of a signal transmitted at a t-th time slot_t(n) denotes the nth of the received signal in the t-th slotSample points, h (n) denotes the channel fading function, ε (n) denotes the mean 0, and the variance σ²White additive gaussian noise.

Second, when the primary user is idle, the received signal contains only noise x_t(n)＝ε_t(n)。

In the embodiment of the present invention, a common BPSK signal is used as the transmission signal. The active mode of the primary user is modeled as a process of switching between switch states and the on and off durations are modeled as two random processes. Meanwhile, we assume that the two random processes of turning on and off are independent of each other.

And 2, preprocessing the frequency spectrum data.

According to the spectrum data obtained in step 1, we first calculate the signal power value:

wherein, N represents the number of signal sampling points in each time slot, | · | represents the modulo operation, and x (t) is the spectrum power data obtained for the t-th time slot. In the example of the present invention, we set N1000.

For a deep neural network structure, although the deeper network can further extract features, the deeper network inevitably brings about the problems of gradient disappearance or gradient explosion. Thus, after completing the signal power calculation, we perform an initial normalization operation on the input data:

Step 3, determining the step length M of input data according to the model order of the Chichi information criterion (AIC):

we propose a spectrum prediction framework based on the driving of a radio frequency machine learning model by using knowledge in the field of an autoregressive model, so that the autoregressive model needs to be firstly ordered to determine the input step length of the network.

AIC is a standard for measuring the fitting superiority of a statistical model, is built on the concept of entropy, and provides a standard for balancing the complexity of an estimated model and the superiority of fitting data. Usually, it is a weighted function of the fitting accuracy and the unknown number of parameters, y_AIC-2ln (l) +2 k. Wherein, L represents the maximum likelihood function under the model, k is the number of variables in the model, y_AICRepresenting the corresponding AIC value of the model. When there is a large difference between the two models, this difference is mainly reflected in the likelihood function term, and when the difference is not significant, the second term plays a major role, so that the model with a small number of parameters is considered to be a more appropriate model. In general, the aim of AIC is to find a model that can best interpret data but contains the fewest parameters.

For a p-order autoregressive model, setting initial p to be 1, setting maximum p to be 10, calculating AIC values corresponding to different model orders p, wherein p corresponding to the minimum AIC value is the most appropriate model order, namely the step length M of input data.

And 4, developing the linear combination process of the autoregressive model into an M-layer network structure.

The formula for the autoregressive structure is calculated as follows:

wherein x is_tRepresenting the value of the sequence at time t, p being the order of the autoregressive model,

representing a weight parameter, c being a constant term, ε_tIs an error term.

The autoregressive structure adopts the traditional least square method to estimate the weight parameters

But instead of the other end of the tubeThe precision of the parameters obtained by the method is low. The conventional prediction scheme based on deep learning has a large-scale network architecture, needs to train a large number of parameters, and has a low convergence rate. Furthermore, its black box architecture makes the network unexplainable. Aiming at the problems, the calculation process of the autoregressive structure is expanded into a multi-layer network in deep learning, linear operation of each order is used as one layer of the network, and optimized network parameters are obtained through training by a deep learning method. The calculation process of the m-th network is as follows:

wherein x is_T+m-MRepresenting the incoming data for the m-th layer network,

And 5, introducing new trainable parameters into the network to construct a spectrum prediction framework driven by a radio frequency machine learning model.

In order to better learn the distribution of the spectrum data, the invention introduces new trainable parameters on the basis of an autoregressive structure to selectively forget the output result of the previous layer, and simultaneously introduces a nonlinear activation function to avoid linear input data combination. The complete network structure consists of M cascaded layers. Each layer has the same architecture but different trainable parameters. For the m-th layer network, the calculation process is as follows:

where T denotes the current time slot, x_T+m-MInput data representing a network of m-th layers, y_mIndicating the output data of the m-th layer network.

And alpha_mTwo trainable parameters representing the m-th network,

is a weighting parameter, alpha, of the current input data_mIs a weighting parameter for the output of the network of the previous layer, and is used for selectively forgetting the past partial output. A non-linear activation function σ (-) is introduced to avoid linear input data combinations.

From the above formula, it can be seen that, unlike the conventional deep learning black box structure, the whole network structure of the present invention is modified from an autoregressive structure, which gives interpretability to the network. Compared with the traditional autoregressive model in which the least square method is used for parameter estimation, the network provided by the invention obtains the optimized parameters through deep learning method training, so that more accurate parameter estimation can be obtained. Finally, only two trainable parameters per network element can accelerate the network training process.

And 6, training the network model by using the training data.

Firstly, the trainable parameters of the network are initialized randomly, and since the sequence is always related to the data point closest to the current time to the highest degree, and the distance is longer, the relevance is lower, the initial trainable parameters of the network are initialized randomly according to a monotonous decreasing rule. The number of initial network training iterations epoch is equal to 1, the maximum number of iterations is 100, the learning rate is 0.001, and an Adam optimization algorithm is used as a network training optimizer.

And secondly, inputting the training data into the network for training in batches, wherein the batch size can be adjusted, and the training error of each batch is propagated reversely so as to optimize the network parameters. The error loss function adopts the most common mean square error loss function, and the calculation formula is as follows:

wherein, y_iRepresenting the true result of the ith set of samples,

When all the data of all the batches in the training data are completely propagated reversely, the data are 1 epoch.

And 7, judging whether the network training is finished.

And judging whether the current epoch reaches the maximum iteration number, if so, performing the step 7, otherwise, adding one to the epoch, and continuing to perform the second step of network training of the step 6.

And 8, inputting the test set data into a network.

And 9, outputting a prediction result.

The effect of the present invention will be further explained with the simulation experiment.

1. Simulation conditions and parameter setting:

the simulation experiment of the invention is carried out on a simulation platform of Python3.6, Pytroch 1.5.1. The computer CPU is of type Inter core i7, and is equipped with independent display card of type Inviad gel GTX 1660 SUPER.

The sampling point of the signal in each time slot is 1000 points. The switching duration of the primary user is modeled as two random processes that follow an exponential distribution. The input step size of the network is 7, ordered by the AIC criterion. The maximum iteration number of the network training is 100, the learning rate is 0.001, an Adam optimization algorithm is selected as a network training optimizer, and the data volume of each batch is 100.

2. Simulation content:

FIG. 3 is a comparison graph of prediction accuracy under different SNR conditions using the present invention and the prior art. The abscissa in fig. 3 represents different signal-to-noise ratios (dB) and the ordinate represents prediction accuracy. The polygonal line marked by a rectangle represents a prediction accuracy curve adopting the method, the polygonal line marked by a circle represents a prediction accuracy curve of a traditional frame based on the long-term and short-term memory neural network under different signal-to-noise ratios, and the polygonal line marked by a diamond represents a prediction accuracy curve of a traditional autoregressive model under different signal-to-noise ratios. The signal-to-noise ratio varies from-1 dB to 14 dB. By comparison, the prediction accuracy of the method is obviously higher than that of other existing methods. From the general trend, the prediction accuracy increases as the signal-to-noise ratio increases. When the signal-to-noise ratio is 0dB, the prediction accuracy of the method reaches about 74%, which exceeds the accuracy of the method based on the long-short term memory neural network by about 10% and exceeds the accuracy of the method based on the autoregressive model by about 21%; at a signal-to-noise ratio of 4dB, the prediction accuracy of the method reaches about 81%, which exceeds the accuracy of the method based on the long-short term memory neural network by about 8% and exceeds the accuracy of the method based on the autoregressive model by about 18%. In addition, under the condition of a lower signal-to-noise ratio, the prediction accuracy of the autoregressive model is poor, but the accuracy is obviously improved along with the increase of the signal-to-noise ratio, and when the signal-to-noise ratio reaches 8dB, the prediction accuracy based on the autoregressive model exceeds the method based on the long-short term memory neural network, but both are lower than the method provided by the invention.

Fig. 4 is a graph comparing training speeds of networks using the present invention and the prior art. In fig. 4, the abscissa represents the number of training sessions (times), and the ordinate represents the loss function value. The broken lines marked by rectangles represent the convergence curves of the method of the invention, and the broken lines marked by circles represent the training convergence curves of the framework based on the long-short term memory neural network. By comparing the training speed convergence curves obtained by the two methods, the training speed of the method is obviously higher than that of the existing method based on the long-term and short-term memory neural network. The inventive method achieves convergence only at about 10 training sessions, whereas the long-short term memory neural network-based method achieves convergence at about 30 x training sessions.

By integrating the simulation results and analysis, the frequency spectrum prediction network framework based on the radio frequency machine learning model drive can achieve higher prediction accuracy than the existing method, is low in network complexity and faster in convergence speed, and can be better applied to actual communication scenes.

The above is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above-mentioned embodiments, and all technical solutions belonging to the idea of the present invention belong to the protection scope of the present invention. It should be noted that modifications and embellishments within the scope of the invention may be made by those skilled in the art without departing from the principle of the invention.

Claims

1. A frequency spectrum prediction method based on radio frequency machine learning model driving is characterized by comprising the following steps:

s5, judging whether training is finished or not, if so, inputting the test set data into the trained frequency spectrum prediction network model, outputting a prediction result, and ending the process; if not, after adding one to the training iteration number, returning to the step S4 until the maximum iteration number is reached.

2. The method for predicting frequency spectrum driven by radio frequency machine learning model according to claim 1, wherein the step S1, the collecting of frequency spectrum data includes:

3. The radio frequency machine learning model-driven spectrum prediction method according to claim 1, wherein in step S1, the preprocessing of the collected spectrum data includes the following steps:

4. The radio frequency machine learning model-driven spectrum prediction method according to claim 1, wherein in step S2, the autoregressive model is ranked according to the Chi information criterion, and the process of determining the step size M of the input data comprises the following steps:

y_AIC＝-2ln(L)+2k

wherein L represents the maximum under the modelLikelihood function, k is the number of variables in the model, y_AICRepresenting the corresponding AIC value of the model;

5. The radio frequency machine learning model-driven spectrum prediction method according to claim 4, wherein p is_maxIs 10.

6. The radio frequency machine learning model-driven spectrum prediction method according to claim 1, wherein the process of developing the linear combination process of the autoregressive model into the M-layer network structure in step S3 includes the following steps:

the m-th network computing process comprises the following steps:

7. The method according to claim 6, wherein in step S3, the process of building an M-layer radio frequency machine learning model-driven spectrum prediction network model by introducing new trainable parameters into the M-layer network structure includes the following steps:

for the mth layer, the calculation formula is transformed into:

and alpha_mTwo learnable parameters representing the m-th tier network,

is a weighting parameter, alpha, of the current input data_mThe weighting parameter is output by the upper layer network and is used for selectively forgetting the past partial output; the nonlinear activation function σ (-) is used to avoid linear input data combinations.

8. The method for spectrum prediction driven by a radio frequency machine learning model according to claim 1, wherein the step S4 of training the spectrum prediction network model by using the training set data includes the following steps:

s43, making the iteration time epoch equal to 1;

wherein, y_iRepresenting the true result of the ith set of samples,

representing the prediction result of the ith group of samples, and Q represents the total number of samples;