CN114650199A

CN114650199A - Deep neural network channel estimation method and system based on data driving

Info

Publication number: CN114650199A
Application number: CN202111637126.6A
Authority: CN
Inventors: 施毅; 孙浩; 熊云彩; 周唯; 沈连丰; 燕锋; 夏玮玮
Original assignee: Nanjing Rongzhi Information Innovation Research Institute Co ltd
Current assignee: Nanjing Rongzhi Information Innovation Research Institute Co ltd
Priority date: 2021-12-30
Filing date: 2021-12-30
Publication date: 2022-06-21

Abstract

The invention adopts a data-driven mode to iteratively train a network model, designs a data-driven channel estimation communication system suitable for a deep neural network, adopts various data for data driving, improves the training times of a perceptron, simulates a channel environment and continuously fits real channel distribution, continuously optimizes an extreme value through a forward and reverse algorithm, can acquire channel state information by utilizing data training, and improves the communication quality.

Description

Deep neural network channel estimation method and system based on data driving

Technical Field

The invention relates to the technical field of communication, in particular to a deep neural network channel estimation method and system based on data driving.

Background

With the development of communication and signal fields and the continuous improvement of communication systems, the wireless communication technology can realize low-delay mutual communication through the propagation of radio signals at any time, but the traditional communication technology still has serious intersymbol interference, low spectrum utilization rate, multipath effect and other influences, which all include the design of a plurality of system bottom layer reasons and the corresponding selection of frequency, bandwidth resource allocation and channel model technology. In the twenty-first century, with the introduction of discrete fourier transform (OFDM) and inverse discrete fourier transform (ifft) into Orthogonal Frequency Division Multiplexing (OFDM) and Multiple Input Multiple Output (MIMO) systems, the problem of improving the spectrum utilization and reducing inter-symbol interference can be truly achieved, and the transmission capacity of a communication system can be improved by transmission and reception of Multiple antennas. The introduction of the MIMO-OFDM system improves the system capacity and the spectrum efficiency to a certain extent, but the performance of the communication system is an important index for measuring the development progress of the communication system, and solving the signal distortion and recovering the check received signal in the wireless communication system is an important field needing to be researched at present.

In the development of communication technology, in order to solve the problems that signals at a receiving end cannot be recovered, signal distortion, received signal precision improvement and the like, the technical field of channel estimation is facing the era of rapid development. Whether the channel estimation accuracy is high or not can determine the quality of the whole MIMO-OFDM communication system to a great extent, the application fields can be millimeter wave large-scale MIMO system and 5G communication MIMO communication system, and the channel state information is realized by corresponding channel estimation algorithm. At present, the conventional channel estimation algorithm mainly includes a data-aided channel estimation algorithm, which inserts a certain number of pilot channels and regular pilot channels into a transmission channel, and then estimates channel characteristics through pilot signal information by using a corresponding algorithm, but this requires other data to help estimate the channel and is easy to generate pilot pollution, resulting in a large system overhead, and other problems; the blind channel estimation algorithm is used for counting channel characteristics through a large number of data sets, pilot signal overhead can not be generated, but the blind channel estimation algorithm has the problems of too much data quantity requirement, low efficiency and the like; a semi-blind channel estimation algorithm, which mainly includes a Least Square (LS) algorithm and a Minimum Mean Square Error (MMSE) by a channel estimation method combining blind channel estimation and data-aided channel estimation, but the performance of the algorithm still cannot achieve a good effect, noise interference exists in an actual channel, and a serious inter-symbol interference problem exists when channel state information is estimated, so a new channel estimation method and system are needed to solve the problem at present.

With the gradual maturity of deep learning technology and the specific practical application of advanced neural network technology in computer vision, speech recognition and other directions in recent years, the achievements of deep learning in channel estimation, signal detection, feedback of channel state information and other aspects verify that the technology has more obvious advantages compared with the field of traditional channel estimation algorithm, has strong learning capability to extract features, analyze a system model and have flexible network structure, and the features can effectively estimate channel state information and improve estimation precision, compared with the traditional channel estimation method, the channel estimation algorithm based on the deep neural network can test training data by using a data model to achieve the purpose of estimating the channel state information, wherein the deep neural network based on data driving can firstly monitor and fit the channel characteristics through the input of a data set, the learning processes signal distortion in a supervision mode, continuously iterates and updates data to prevent accuracy loss to the maximum extent, and then the obtained channel characteristics can be directly used in actual channel estimation, so that the actual characteristics can be better met, more accurate channel state information is output, and the performance of a communication system is improved.

Disclosure of Invention

The invention provides a deep neural network channel estimation method and system based on data driving, aiming at the problem that the performance, system accuracy and frequency spectrum utilization rate of a channel estimation algorithm are seriously insufficient. The antenna array technology is realized by utilizing a space-time coding technology, so as to solve the influence of multipath effect in a communication system and establish a multi-input multi-output orthogonal frequency division multiplexing system based on channel frequency domain response vectors, simulation data and input, the system is used as a transmission channel among signal streams, a deep neural network training iterative network model based on data driving is adopted, channel characteristics are learned, a propagation algorithm and a rejection strategy are utilized to optimize a minimum extreme value and check and recover received data, a product of a [0, 1] vector value and a neuron output value is randomly generated through a Bernoulli function to determine the activation state of a neuron, then the network model carries out coefficient matrix and bias vector calculation on an antenna receiving end channel parameter estimation module of a channel estimation system, an estimated optimal value in an error threshold range is updated after iterative training, and finally, according to an output channel matrix, compensating for channel characteristics. The invention adopts a data-driven mode to iteratively train a network model, designs a data-driven channel estimation communication system suitable for a deep neural network, adopts various data for data driving, improves the training times of a perceptron, simulates a channel environment and continuously fits real channel distribution, continuously optimizes an extreme value through a forward and reverse algorithm, can acquire channel state information by utilizing data training, and improves the communication quality.

In order to effectively illustrate the above object of the present invention, the present invention provides a deep neural network channel estimation method and system based on data driving, which is characterized by comprising the following steps:

(1) training test data for obtaining channel response data and simulation data set

(2) Creating data-driven based deep neural network communication system

(3) Propagation algorithm iterative optimization threshold range optimization extremum

(4) Refusing strategy to optimize neural network activation state and raising iteration efficiency

The deep neural network channel estimation method and system based on data driving are realized by the following technical scheme in the steps of (1) and (2):

in a MIMO OFDM system, multi-antenna transmission and multi-antenna reception are achieved by using antenna array technologyThe method comprises the steps that a driven deep neural network carries out channel estimation to obtain channel state information, the source of training data comprises compiling DeepMIMO simulation data set testing data, sampling collection of data is carried out in a limited testing training space through a deep neural learning framework, and a data set D_data＝[h₁，h₂，...，h_n-1，h_n]Where H ∈ H^T*L，H^T*LIn order to simulate a training data matrix, H '═ C (H x F + theta x), H' is H matrix, real part and imaginary part are extracted and then convolution operation is carried out, theta, x are data set input parameters, F is convolution vector, and the training collection process of the data set is divided into the following steps: firstly, randomly dividing original data into m data sets which are not equal to each other, inputting the m data sets into parameters, randomly extracting n data subsets from the m data sets, putting the n data subsets into a training frame, and then carrying out training iteration on the divided subsets for n times by different methods and then carrying out mean value calculation to obtain the data; subsequently saving the obtained data to D_data(ii) a Channel response data is obtained by using minimum mean square error algorithm, wherein MMSE objective function is

F_MMSE(θ)＝min(F(θ)，L{M})＝min L{(M-θH_x)(M-θH_x)^H}

Where θ is the matrix to be solved by the algorithm:

calculating theta', F by taking its derivative_MMSEThe value of (theta) can be found by setting its first derivative to 0_MMSE(θ)：

Wherein K_MNFor values at the inserted pilot, K_MN＝K_HHIn which K is_HHIs calculated for the autocorrelation matrix

The simulation data set and the channel response data are obtained through the technology, and the two parts of data are input into the deep neural network together to carry out the next iterative training and learn the channel characteristics, so that the channel state information is better estimated;

the deep neural network channel estimation method and system based on data driving, wherein the (3) comprises the following technical scheme:

after the data samples and the data set are used as a deep neural network supervision and prediction channel model, fitting nonlinear data and outputting the fitted data as the input of the neuron of the next layer through continuously learning channel characteristics and parameters of a hidden layer, and continuously and repeatedly learning to obtain a coefficient matrix tau_iAnd the offset vector omega, and then outputting a final result. Wherein the data sample is a channel response matrix obtained by a minimum mean square error algorithm

And the data set is D_dataThe two parts are input into a deep neural network as a whole data, and fit a nonlinear relation through weighted summation of neurons in a hidden layer and an activation function between the output of a lower-layer node and the input of an upper-layer node, wherein the step is a forward propagation algorithmProcess, for the (n, m) th neuron output of the hidden layer:

in the formula tau_iIs a coefficient matrix, ω is an offset vector, i

Outputting a data value for the (n, m) th neuron,

as an activation function:

mapping the characteristics of the neuron by an activation function, and inputting a data output end to an input end of another layer so as to strengthen the mapping capability of a nonlinear model of each layer of the neuron, a sparse network and solve the problem of gradient disappearance;

coefficient matrix tau obtained by front line propagation algorithm_iThe sum offset vector ω is passed through a loss function to calculate the corresponding difference value for τ_iThe sum omega is close to the actual output of the sample by the minimum difference value, the minimum extreme value is optimized by using the loss function, and then the result of optimizing the final extreme value by using the back propagation algorithm is realized by utilizing the multiple iterations of the gradient descent algorithm; wherein the corresponding loss function:

where r is the actual output of the neuron, a is the desired output, τ_iCoefficient matrix, ω bias vector, first order bias derivative is taken for coefficient matrix and bias vector:

will iterate τ_iCoefficient matrix, ω bias vector solution:

τ_i＝-η(δ(τ_ix+ω)-a)δ′(τ_ix+ω)·x-τ

ω＝-ηδ′(τ_ix+ω)(δ(τ_ix+ω)-a)+ω

will tau_iAfter the coefficient matrix and the omega bias vector are obtained, the output value theta of the ith neuron of the output layer is solved through the value_l：

In the formula r_iIs a target value, z_iThe hidden layer output, x is the input vector,

vector matrix, with w +1 layers

As a vector:

updating theta_l：

Where E is the sum of squares error and y_i，r_iRespectively, the output value and the target expected value, and finally obtaining the ith output of the w layer of the output layer:

the deep neural network channel estimation method and system based on data driving, wherein the implementation technical scheme of the (4) is as follows: after the data set is input into a front-line propagation algorithm of the deep neural network for training and testing, the obtained result is subjected to error calculation and then input into the neural network again for inverse propagation iteration to update a coefficient matrix and a bias vector, and in order to prevent over-fitting, a rejection strategy (Dropout) is used for solving the problem. Generation of random probability vectors p using Bernoulli functions_iValue p of_i∈[0，1]Introduction of loss rate K_i，Ki＝p_i(1-p_i) The Dropout strategy has a regularization effect, with the expectation that individual neurons are deleted being:

in the formula tau_iIs a matrix of coefficients, x_iFor inputting vectors, the number of activated neurons in deep neural network training is solved through loss rate parameters, and the problems of overlarge resource consumption and overfitting of a network system are solved.

The invention has the following beneficial effects:

the learning characteristic of the deep neural network is applied to a channel estimation algorithm, the strong learning capability of the neural network is used for carrying out characteristic extraction and analysis on a system model, the neural network is trained and iterated to output all layers of neurons through a flexible network structure, training test data of the neural network is derived from channel response data and a data set obtained by a minimum mean square error algorithm, two parts of data are input into the network to achieve supervision and fitting of the channel characteristic, then the neural network is supervised in a channel transmission process to process signal distortion, and continuously iterated and updated data are lost with the maximum precision, so that the real characteristic can be better met, the performance of a communication system is improved, and the frequency spectrum utilization rate and the signal transmission effectiveness are improved.

Drawings

FIG. 1 is an algorithmic flow chart of the present invention

FIG. 2 is a deep neural network communication system architecture of the present invention

FIG. 3 is a model of a neuron perceptron of the present invention

FIG. 4 is a deep neural network architecture of the present invention

Detailed Description

The deep neural network channel estimation method and system based on data driving provided by the invention fully utilize the characteristics of the deep neural network supervision training data, flexible network structure and learning channel characteristics, and take two parts of data as input, thereby achieving the purposes of training a learning neuron model and fitting channel distribution. On the premise of the technology of rapid development in the aspects of channel estimation, signal detection, feedback of channel state information and the like by utilizing deep learning, data streams in the MIMO-OFDM communication system are transmitted, so that channel parameters are estimated at a receiving end, and the communication performance is improved by algorithm optimization. The deep neural network based on data driving can firstly carry out supervision fitting on the characteristics of a channel through the input of a data set, learn to process signal distortion in a supervision mode, continuously and iteratively update data to lose the maximum precision, and then can directly use the obtained channel characteristics in actual channel estimation, so that the deep neural network can better accord with the actual characteristics, and more accurate channel state information is output.

The detailed description is made with reference to the implementation steps of the channel estimation algorithm shown in fig. 1, which mainly includes the following steps:

(1) obtaining training test data

(2) Iterative updating coefficient matrix and bias vector of propagation algorithm

(3) The training depth is corrected by the rejection strategy, and the training efficiency is improved

The array antenna technology utilizing space-time coding can solve the problems of low multipath effect and low frequency spectrum utilization rate in system model communication, a multi-input multi-output communication system is built by the technology to serve as a transmission channel between signal streams, and training data are collected by using a simulation data setSet of D_data＝[h₁，h₂，...，h_n-1，h_n]，h∈H^T*L，H^T*LFor simulating a training data matrix, H '═ C (H x F + theta x), H' is H matrix, real part and imaginary part are extracted, convolution operation is carried out, theta, x are data set input parameters, and F is convolution vector; the second part of data obtains approximate channel state information by a method of the lowest mean square error, F_MMSE(θ)＝min(F(θ)，L{M})＝min L{(M-θH_x)(M-θH_x)^H}，

Where θ is the matrix to be solved by the algorithm:

is calculated to obtain

As shown in fig. 2;

the method obtains the simulation data set and the channel response data, and the two parts of data are input into the deep neural network together to carry out the next iterative training and learn the channel characteristics, so that the channel state information is better estimated. The deep neural network is composed of an input layer, a hidden layer and an output layer, a data set and a channel matrix are used as data input, coefficient matrix and bias vector calculation are carried out through the hidden layer, wherein the hidden layer comprises a plurality of neurons, each neuron can be called a perceptron, the output and the output of each perceptron are formed by continuous iteration optimization of the previous layer of iteration and minimum extreme values obtained in a threshold range, as shown in figure 3, input data X belongs to { X ∈ { X } X₁，x₂，...，x_n}，

I-th perceptron of l-th layer, y_iIn order to be output, the output is,

continuously learning channel characteristics through hidden layersFitting nonlinear data and outputting the fitted data as the input of the next layer of neuron, and continuously and repeatedly learning to obtain coefficient matrix tau_iAnd a bias vector ω, which outputs for the (n, m) th neuron of the hidden layer:

outputting a data value for the (n, m) th neuron,

as an activation function:

the characteristics of the neurons are mapped out through an activation function, a complex communication system model comprises a plurality of data transmission types, a plurality of nonlinear data are input in the neural network training process, and in order to fit the data, the data which are more consistent with channel distribution are output, and the problem is solved by adopting the activation function. The method inputs a data output end to an input end of another layer so as to strengthen the mapping capability of a nonlinear model of each layer of the neuron, sparsely network and solve the problem of gradient disappearance, and a coefficient matrix tau_iThe sum offset vector ω is passed through a loss function to calculate the corresponding difference value for τ_iThe sum omega is close to the actual output of the sample by the minimum difference value, the minimum extreme value is optimized by using the loss function, and then the result of optimizing the final extreme value by using the back propagation algorithm is realized by utilizing the multiple iterations of the gradient descent algorithm; wherein corresponding loss function

δ(τ_ix + ω) ═ r, r actual output of the neuron, a desired output, τ_iCoefficient matrix, ω bias vector, first order bias derivative is taken for coefficient matrix and bias vector:

will iterate τ_iCoefficient matrix, ω bias vector solution: tau is_i＝-η(δ(τ_ix+ω)-a)δ′(τix+ω)·x-τ，ω＝-ηδ′(τ_ix+ω)(δ(τ_ix + ω) -a) + ω, and r_iAfter the coefficient matrix and the omega bias vector are obtained, the output value theta of the ith neuron of the output layer is solved through the value_l：

r_iIs a target value, Z_iThe hidden layer output, X is the input vector,

vector matrix, with w +1 layers

As a vector:

updating theta_l：

E is the sum of squares error, y_i，r_iRespectively, the output value and the target expected value, and finally obtaining the ith output of the W layer of the output layer:

calculating each layer output by using a forward propagation algorithm of a deep neural network, solving a difference value by using a reverse solution algorithm of gradient descent and optimizing an extreme value, and randomly generating [0, 1] by using a rejection strategy (Dropout strategy) and a Bernoulli function in order to prevent overfitting when data is fitted]The product of the vector value and the output value of the neuron determines the leaving of the neuron, and the Bernoulli function generates a random probability vector p_iValue p of_i∈[0，1]Introduction of lossRate K_i，K_i＝p_i(1-p_i) The Dropout strategy has a regularization effect, with the expectation that individual neurons are deleted being:

τ_iis a matrix of coefficients, x_iThe number of neurons activated in deep neural network training is solved by loss rate parameters for inputting vectors, so that the neural network can have a deeper and more efficient network model, as shown in fig. 4. The channel state information is estimated by using a deep neural network, various data are driven and trained by using data, the training times of a perceptron are increased, the channel environment is simulated, real channel distribution is continuously fitted, the channel state information is obtained through data training, and the communication quality is improved.

Claims

1. The deep neural network channel estimation method and system based on data driving are characterized by comprising the following steps:

(1) in the algorithm of channel estimation, the deep neural network is used for estimating and calculating channel estimation state information in consideration of the key effectiveness and reliability of test data on neural network training, in order to supplement the shortage of data quantity, the deep neural network based on data driving is adopted, and the data source of the deep neural network is based on channel response data obtained by a DeepMIMO simulation data set and Minimum Mean Square Error (MMSE) algorithm;

(2) acquiring a data set through simulation data and using the data set as deep neural network training data, inputting the deep neural network training data into neurons, performing iterative training, and solving channel data distortion and learning channel characteristics in a supervision mode after multiple iterative training;

(3) inputting the channel response data and the simulation data set into a deep neural network for training and learning, monitoring the channel characteristics, and outputting a neuron coefficient matrix and a bias vector after sequentially passing through a hidden layer perceptron;

(4) in order to optimize and reduce estimation errors, a forward propagation algorithm is used for continuously outputting an optimized coefficient matrix and an optimized bias vector, and a loss function and a gradient descent algorithm are used for calculating a difference value and then gradient descent iteration is carried out again to obtain an extremum value of optimization minimization;

(5) in the neural network training process, in order to avoid the situation that data seriously deviates from the real characteristics due to an excessive phenomenon occurring in the fitting process, a discarding strategy (Dropout) is adopted to prevent the excessive fitting;

(6) and outputting a channel matrix and performing channel equalization.

2. The data-driven deep neural network channel estimation method and system according to claim 1, wherein the method comprises the following steps (1):

on an MIMO-OFDM (Multiple-Input Multiple-Output-Orthogonal Frequency Division Multiplexing) system, an antenna array technology is utilized to achieve the purposes that a plurality of antennas transmit and receive signals, a deep neural network based on data driving is used for channel estimation, channel state information is obtained, a training data source of the deep MIMO simulation data set comprises compiling DeepMIMO simulation data set testing data, sampling collection of the data is carried out in a limited testing training space through a deep neural learning framework, and a data set is obtained

=

Wherein

，

In order to simulate the matrix of training data,

，

is composed of

The matrix extracts the real part and the imaginary part and then carries out convolution operation,

x is the input parameter of the data set, F is the convolution vector, and the training collection process of the data set is divided into: firstly, randomly dividing original data into m data sets which are not equal to each other, inputting the m data sets into parameters, randomly extracting n data subsets from the m data sets, putting the n data subsets into a training frame, and then carrying out training iteration on the divided subsets for n times by different methods and then carrying out mean value calculation to obtain the data; subsequently saving the obtained data to

(ii) a Channel response data is obtained by using minimum mean square error algorithm, wherein MMSE objective function is

:

Wherein

For the matrix to be solved by the algorithm:

calculated by taking its derivative

，

The value can be found by setting its first derivative to 0

：

L{

Wherein

In order to insert the value at the pilot,

wherein

Is calculated for the autocorrelation matrix

：

The simulation data set and the channel response data are obtained through the steps, and the two parts of data are input into the deep neural network together to carry out the next iterative training and learn the channel characteristics, so that the channel state information is better estimated.

3. The data-driven deep neural network channel estimation method and system according to claim 1, wherein the method comprises the steps of (2) (4):

after the data samples and the data sets are used as a deep neural network supervision and prediction channel model, fitting nonlinear data and outputting the fitted data as the input of the neuron of the next layer through continuously learning channel characteristics and parameters of a hidden layer, and continuously and repeatedly learning to obtain a coefficient matrix

And an offset vector

Then outputting a final result; wherein the data sample is a channel response matrix obtained by a minimum mean square error algorithm

And a data set of

The two parts are input into a deep neural network as a whole data, and fit a nonlinear relation through weighted summation of neurons in a hidden layer and an activation function between the output of a lower layer node and the input of an upper layer node, wherein the step is a forward propagation algorithm process, and the first part of the hidden layer is subjected to the second step

The individual neurons output:

in the formula

In the form of a matrix of coefficients,

in order to be a vector of the offset,

is a first

The individual neurons output a data value that is,

as an activation function:

the characteristic of the neuron is mapped out through an activation function, and a data output end is input to an input end of another layer, so that the mapping capability of a nonlinear model of each layer of the neuron is enhanced, a network is sparse, and the problem of gradient disappearance is solved.

4. The data-driven deep neural network channel estimation method and system according to claim 1, wherein the method comprises the steps of (3) (4):

coefficient matrix obtained by using front line propagation algorithm

And an offset vector

Calculating the corresponding difference value by a loss function in order to

And

the minimum difference value is close to the actual output of the sample, the minimum extreme value is optimized by using the loss function, and then the result of optimizing the final extreme value by using the back propagation algorithm is realized by utilizing the multiple iterations of the gradient descent algorithm; wherein the corresponding loss function:

，

where the actual output of the r neuron, a desired output,

the matrix of coefficients is a matrix of coefficients,

and (3) calculating a first-order partial derivative of the coefficient matrix and the bias vector:

will be iterated

The matrix of coefficients is a matrix of coefficients,

solving bias vectors:

5. the data-driven deep neural network channel estimation method and system according to claim 4, wherein the method comprises:

will be provided with

The matrix of coefficients is a matrix of coefficients,

after the bias vector is obtained, the value is used to solve the second layer of the output layer

Output value of each neuron

：

，

In the formula

In order to achieve the target value,

the output of the hidden layer is output,

in order to input the vector, the vector is input,

vector matrix of

Layer(s)

As a vector:

update

：

=

In the formula, E is the sum of squares error,

，

respectively as output value and target desired value, and finally obtaining the output layer

First of a layer

And (3) outputting:

6. the data-driven deep neural network channel estimation method and system based on claim 1, comprising the following steps (5):

after a data set is input into a front-line propagation algorithm of a deep neural network for training and testing, the obtained result is subjected to error calculation and then input into the neural network again for carrying out back propagation iteration to update a coefficient matrix and a bias vector, and in order to prevent over-fitting, a rejection strategy (Dropout) is used for solving the problem; generation of random probability vectors using bernoulli functions

Value of

Introduction of loss rate

，

The Dropout strategy has a regularization effect, with the expectation that individual neurons are deleted being:

in the formula

In the form of a matrix of coefficients,

for inputting the vector, the number of the neurons activated in deep neural network training is solved through loss rate parameters, and the problems of overlarge resource consumption and overfitting of a network system are solved.