CN111901024B - MIMO channel state information feedback method based on fitting depth learning resistance - Google Patents

MIMO channel state information feedback method based on fitting depth learning resistance Download PDF

Info

Publication number
CN111901024B
CN111901024B CN202010745080.9A CN202010745080A CN111901024B CN 111901024 B CN111901024 B CN 111901024B CN 202010745080 A CN202010745080 A CN 202010745080A CN 111901024 B CN111901024 B CN 111901024B
Authority
CN
China
Prior art keywords
layer
model
channel matrix
state information
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010745080.9A
Other languages
Chinese (zh)
Other versions
CN111901024A (en
Inventor
李鑫滨
赵海红
韩赵星
于海峰
骆曦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yanshan University
Original Assignee
Yanshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yanshan University filed Critical Yanshan University
Priority to CN202010745080.9A priority Critical patent/CN111901024B/en
Publication of CN111901024A publication Critical patent/CN111901024A/en
Application granted granted Critical
Publication of CN111901024B publication Critical patent/CN111901024B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/02Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas
    • H04B7/04Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas
    • H04B7/06Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station
    • H04B7/0613Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission
    • H04B7/0615Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal
    • H04B7/0619Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal using feedback from receiving side
    • H04B7/0621Feedback content
    • H04B7/0626Channel coefficients, e.g. channel state information [CSI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/02Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas
    • H04B7/04Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas
    • H04B7/06Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station
    • H04B7/0613Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission
    • H04B7/0615Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal
    • H04B7/0619Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal using feedback from receiving side
    • H04B7/0621Feedback content
    • H04B7/0632Channel quality parameters, e.g. channel quality indicator [CQI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L25/00Baseband systems
    • H04L25/02Details ; arrangements for supplying electrical power along data transmission lines
    • H04L25/0202Channel estimation
    • H04L25/024Channel estimation channel estimation algorithms
    • H04L25/0254Channel estimation channel estimation algorithms using neural network algorithms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Power Engineering (AREA)
  • Error Detection And Correction (AREA)

Abstract

本发明公开了一种基于抗拟合深度学习的MIMO道状态信息反馈方法,属于通信领域,包括以下步骤:首先,构建Anti‑overfitting CSI net模型,将信道矩阵分为实部和虚部分别输入进用户端的编码器,编码器包含卷积层、全连接层,数据经过编码经过反馈链路,到达接收端,在接收端的解码器包含抗拟合层,全连接层、RefineNet层、卷积层,最终输出预测的信道矩阵。模型构建完成后,将模型进行离线训练,首先初始化模型参数,误差收敛后保存模型,最后将训练好保存的模型在线进行预测信道状态信息。本发明可以进一步提高信息矩阵的恢复精度,保证系统发射端得到准确的信道状态信息,提高系统的通信质量。

Figure 202010745080

The invention discloses a MIMO channel state information feedback method based on anti-fitting deep learning, which belongs to the field of communication and includes the following steps: firstly, an Anti-overfitting CSI net model is constructed, and a channel matrix is divided into a real part and an imaginary part for input respectively The encoder that enters the user side includes a convolutional layer and a fully connected layer. The data is encoded and passed through the feedback link to reach the receiving end. The decoder at the receiving end includes an anti-fitting layer, a fully connected layer, a RefineNet layer, and a convolutional layer. , and finally output the predicted channel matrix. After the model is built, the model is trained offline. First, the model parameters are initialized, and the model is saved after the error converges. Finally, the trained and saved model is used to predict the channel state information online. The invention can further improve the recovery precision of the information matrix, ensure that the system transmitter obtains accurate channel state information, and improve the communication quality of the system.

Figure 202010745080

Description

基于抗拟合深度学习的MIMO信道状态信息反馈方法MIMO channel state information feedback method based on anti-fitting deep learning

技术领域technical field

本发明涉及通信领域,尤其是一种基于抗拟合深度学习的大规模MIMO道状态信息反馈方法。The invention relates to the field of communication, in particular to a massive MIMO channel state information feedback method based on anti-fitting deep learning.

背景技术Background technique

大规模多输入多输出(MIMO)技术作为第五代(5G)通信系统的关键技术,具有频谱效率高、系统容量大、系统鲁棒性强等优点,为保障信道估计得到的信道状态信息能够精确的反馈到发送端,MIMO系统相对于OFDM系统具有更高的数据传输速率,并且提高了系统的可靠性。因此,大规模的MIMO技术越来越受到工业界和学术界的关注。然而,大规模MIMO技术的显著优势在很大程度上取决于发射机可以获得下行链路的信道状态信息。在频分双工大规模MIMO系统中,基站需要通过接收端的反馈来获取下行CSI。然而,大规模天线阵列的使用导致了信道反馈开销的急剧增加。As the key technology of the fifth generation (5G) communication system, massive multiple-input multiple-output (MIMO) technology has the advantages of high spectral efficiency, large system capacity, and strong system robustness. With accurate feedback to the transmitting end, the MIMO system has a higher data transmission rate than the OFDM system, and improves the reliability of the system. Therefore, massive MIMO technology has attracted more and more attention from industry and academia. However, the significant advantages of massive MIMO technology depend to a large extent on the transmitter's availability of channel state information for the downlink. In a frequency division duplex massive MIMO system, the base station needs to obtain downlink CSI through feedback from the receiver. However, the use of large-scale antenna arrays leads to a dramatic increase in channel feedback overhead.

在MIMO无线通信系统中,传统的信道状态信息反馈方法存在严重的缺点。目前MIMO信道状态信息反馈中传统的研究方法,这些传统方法已广泛应用到通信中,但存在很多缺点。首先,它们严重依赖于假设信道是稀疏的。然而,通道在任何基础上都不是完全稀疏的,甚至可能没有可解释的结构。其次,压缩感知算法使用随机投影,没有充分利用信道结构。并且现有的信号重构算法多为迭代法,重构速度较慢。In a MIMO wireless communication system, the traditional channel state information feedback method has serious shortcomings. At present, the traditional research methods in MIMO channel state information feedback have been widely used in communication, but there are many shortcomings. First, they rely heavily on the assumption that the channel is sparse. However, channels are not completely sparse on any basis, and may even have no interpretable structure. Second, compressed sensing algorithms use random projections and do not fully exploit the channel structure. In addition, most of the existing signal reconstruction algorithms are iterative methods, and the reconstruction speed is relatively slow.

为了实现精度高和高效率的信道反馈方法,基于深度学习的反馈方案被提出,深度学习在信道反馈中的应用,具有良好的反馈效果,使得通信具有良好性能,保障了系统的稳定性。采用深度学习理论在通信系统的接收端对信道信息进行离线训练数据,在线恢复数据,极大减轻反馈链路的负担,提高了恢复信道状态信息的精度。但目前这些研究未考虑深度学习方法在信道信息反馈中存在过拟合的问题,过拟合会导致预测性能降低,模型训练结果下降等问题,导致最终预测的信道状态信息精度不足,通信无法保证质量。因此,如何找到一种可以提高恢复速度与精度的深度学习算法是信道信息反馈方案的关键。In order to realize the channel feedback method with high accuracy and high efficiency, a feedback scheme based on deep learning is proposed. The application of deep learning in channel feedback has a good feedback effect, which makes the communication have good performance and ensures the stability of the system. Using deep learning theory to conduct offline training data for channel information at the receiving end of the communication system, and restore data online, greatly reduce the burden of the feedback link and improve the accuracy of restoring channel state information. However, at present, these studies have not considered the problem of over-fitting in the channel information feedback of deep learning methods. Over-fitting will lead to problems such as reduced prediction performance and model training results, resulting in insufficient accuracy of the final predicted channel state information, and communication cannot be guaranteed. quality. Therefore, how to find a deep learning algorithm that can improve the recovery speed and accuracy is the key to the channel information feedback scheme.

发明内容SUMMARY OF THE INVENTION

本发明需要解决的技术问题是提供一种基于抗拟合深度学习的MIMO信道状态信息反馈方法,解决目前深度学习方法存在过拟合导致预测信道状态信息精度不足的问题。The technical problem to be solved by the present invention is to provide a MIMO channel state information feedback method based on anti-fitting deep learning, which solves the problem of insufficient accuracy of predicted channel state information due to overfitting in the current deep learning method.

为解决上述技术问题,本发明所采用的技术方案是:For solving the above-mentioned technical problems, the technical scheme adopted in the present invention is:

基于抗拟合深度学习的MIMO信道状态信息反馈方法,包括以下步骤:The MIMO channel state information feedback method based on anti-fitting deep learning includes the following steps:

(1)构建Anti-overfitting CSI net模型,利用卷积神经网络作为编码器和解码器,卷积神经网络可以通过加强相邻层神经元之间的局部连接模式来利用空间局部相关性,信道矩阵H的实部和虚部作为Anti-overfitting CSI net模型的输入;(1) Constructing an Anti-overfitting CSI net model, using convolutional neural networks as encoders and decoders, convolutional neural networks can utilize spatial local correlation by strengthening the local connection patterns between neurons in adjacent layers, and the channel matrix The real and imaginary parts of H are used as the input of the Anti-overfitting CSI net model;

(2)信道矩阵H数据进入编码器,编码器位于发送数据的用户端,将信道矩阵H编码为一个低维度的数据,编码器包含卷积层和全连接层;(2) The channel matrix H data enters the encoder, the encoder is located at the user end sending the data, and the channel matrix H is encoded into a low-dimensional data, and the encoder includes a convolution layer and a fully connected layer;

(3)数据经过编码后进入反馈链路,到达接收端;(3) After the data is encoded, it enters the feedback link and reaches the receiving end;

(4)在接收端的基站,解码器开始进行译码,将编码端低维的数据进行从新构建Anti-overfitting CSI net模型;接收端的解码器包含抗拟合层,全连接层、RefineNet层、卷积层,输出预测的信道矩阵;(4) At the base station at the receiving end, the decoder starts decoding, and rebuilds the Anti-overfitting CSI net model from the low-dimensional data at the encoding end; the decoder at the receiving end includes an anti-fitting layer, a fully connected layer, a RefineNet layer, and a volume Multilayer, output the predicted channel matrix;

(5)Anti-overfitting CSI net模型构建完成后,将模型进行离线训练,首先初始化模型参数,误差收敛后保存模型,最后将训练好保存的Anti-overfitting CSI net模型在线进行预测信道状态信息。(5) After the construction of the Anti-overfitting CSI net model is completed, the model is trained offline. First, the model parameters are initialized, and the model is saved after the error converges. Finally, the trained and saved Anti-overfitting CSI net model is used to predict the channel state information online.

本发明技术方案的进一步改进在于:步骤(2)所述包括:The further improvement of the technical solution of the present invention is: step (2) includes:

在编码器的卷积层,这一层使用尺寸为3×3的内核来生成两个特征图;在卷积层之后,我们将特征图重塑成一个向量,并使用一个全连接层来生成码字s,这是一个大小为M×1的向量;卷积层和全连接层模拟压缩感知的投影并充当编码器。In the convolutional layer of the encoder, this layer uses a kernel of size 3×3 to generate two feature maps; after the convolutional layer, we reshape the feature map into a vector and use a fully connected layer to generate codeword s, which is a vector of size M × 1; convolutional and fully connected layers simulate the projection of compressed sensing and act as encoders.

本发明技术方案的进一步改进在于:步骤(4)所述的解码器工作流程表示为:The further improvement of the technical solution of the present invention is: the decoder workflow described in step (4) is expressed as:

在接收端获得码字s,随后使用神经网络层(作为解码器)将其映射回通道矩阵H;解码器的第一层为抗拟合单元,我们加入随机失活(Dropout)算法,其方法为在每一个训练周期随机丢掉一定比例的节点信息,也就是将一定比例的上一层输出在这次的训练阶段变零,让下一层的节点根据剩下的信息决定数值;第二层是一个以经过随机失活处理的s为输入,输出两个大小为Nc×Nt的矩阵的全连通层,作为H的实部和虚部的初始估计;然后,初始估计数被输入到几个不断细化重建的细分网络单元中;RefineNet层包括多个RefineNet单元,每个RefineNet单元由四层组成,在RefineNet单元中,第一层是输入层,所有剩下的3层使用3×3个内核;第二层和第三层分别生成8和16个特征图,最后一层生成H的最终重构;通过适当的补零,将三个卷积层生成的特征图设置为与输入通道矩阵大小Nc×Nt相同的大小;选取ReLU(x)=max(x,0)作为激活函数,对每一层进行批量归一化处理;The codeword s is obtained at the receiving end, and then the neural network layer (as a decoder) is used to map it back to the channel matrix H; the first layer of the decoder is an anti-fitting unit, and we add a random dropout (Dropout) algorithm, its method In order to randomly lose a certain proportion of node information in each training cycle, that is, to change a certain proportion of the output of the previous layer to zero in this training phase, and let the nodes of the next layer determine the value according to the remaining information; the second layer is a fully connected layer that takes the randomly deactivated s as input and outputs two matrices of size N c ×N t as the initial estimates of the real and imaginary parts of H; then, the initial estimates are input to In several subdivision network units that are continuously refined and reconstructed; the RefineNet layer includes multiple RefineNet units, and each RefineNet unit consists of four layers. In the RefineNet unit, the first layer is the input layer, and all the remaining 3 layers use 3 ×3 kernels; the second and third layers generate 8 and 16 feature maps, respectively, and the last layer generates the final reconstruction of H; with appropriate zero-padding, the feature maps generated by the three convolutional layers are set to The input channel matrix size is the same as N c ×N t ; ReLU(x)=max(x,0) is selected as the activation function, and batch normalization is performed on each layer;

通过一系列RefineNet单元对信道矩阵进行细化后,将信道矩阵输入到最终的卷积层,使用sigmoid函数将值缩放到[0,1]范围。After the channel matrix is refined through a series of RefineNet units, the channel matrix is input to the final convolutional layer, and the values are scaled to the [0, 1] range using the sigmoid function.

本发明技术方案的进一步改进在于:步骤(5)包括以下内容:The further improvement of the technical scheme of the present invention is: step (5) comprises the following content:

为了训练Anti-overfitting CSI net,我们对编码器和解码器的所有内核和偏置值使用端到端学习;参数集记为Θ={Θende};Anti-overfitting CSI net的输入为Hi,重构的信道矩阵为

Figure GDA0003266171180000031
值得注意的是,Anti-overfitting CSI net的输入和输出都是归一化的通道矩阵,其元素在[0,1]范围内缩放;与自动编码器类似,Anti-overfitting CSI net是一种无监督学习算法;损失函数为均方误差(mean squared error,MSE),计算方法如下:To train the Anti-overfitting CSI net, we use end-to-end learning for all kernel and bias values of the encoder and decoder; the parameter set is denoted as Θ={Θ ende }; the input to the Anti-overfitting CSI net is H i , the reconstructed channel matrix is
Figure GDA0003266171180000031
It is worth noting that both the input and output of Anti-overfitting CSI net are normalized channel matrices whose elements are scaled in the range [0, 1]; similar to autoencoders, Anti-overfitting CSI net is a Supervised learning algorithm; the loss function is mean squared error (MSE), and the calculation method is as follows:

Figure GDA0003266171180000041
Figure GDA0003266171180000041

其中||·||2是欧几里得范数,T是在训练集的样本总数。where ||·|| 2 is the Euclidean norm and T is the total number of samples in the training set.

本发明技术方案的进一步改进在于:随机失活算法原理数据经过输入层、多层隐藏层、输出层;在正常数据传输过程中,每一层的神经元与下一层神经元都进行全部连接,这样当大量数据进行训练时,经常会造成过拟合的现象,因此,随机失活算法被提了出来,它将每一层与下一层进行相连的神经元进行随机失活,使神经元与下一层的神经元断开,这样做的目的减少训练数据,达到防止过拟合的目的;随机失活算法只是在训练时将神经元随机失活,得到训练好的模型后,在测试时所有神经元将又重新连接在一起,经过神经网络的计算,最终输出高精度的计算结果,使之获得完整的信道状态信息。The further improvement of the technical solution of the present invention is: the principle data of the random deactivation algorithm passes through the input layer, the multi-layer hidden layer and the output layer; in the normal data transmission process, the neurons of each layer are all connected to the neurons of the next layer. , so that when a large amount of data is used for training, it often causes overfitting. Therefore, a random deactivation algorithm is proposed, which randomly deactivates the neurons connected to each layer and the next layer, so that the neural The purpose of this is to reduce the training data and prevent overfitting; the random deactivation algorithm just randomly deactivates the neurons during training. During the test, all neurons will be reconnected together, and after the calculation of the neural network, the final output of high-precision calculation results, so that it can obtain complete channel state information.

由于采用了上述技术方案,本发明取得的技术进步是:Owing to having adopted the above-mentioned technical scheme, the technical progress that the present invention obtains is:

本发明提供一种基于抗拟合深度学习的大规模MIMO信道状态信息反馈方法,解决目前深度学习方法存在过拟合导致预测信道状态信息精度不足的问题,进一步提高信息矩阵的恢复精度,保证系统发射端得到准确的信道状态信息,提高系统的通信质量。The invention provides a massive MIMO channel state information feedback method based on anti-fitting deep learning, solves the problem of insufficient accuracy of predicted channel state information due to overfitting in the current deep learning method, further improves the restoration accuracy of the information matrix, and ensures the system The transmitting end obtains accurate channel state information to improve the communication quality of the system.

附图说明Description of drawings

图1为本发明的Anti-overfitting CSI net模型构建和训练流程图;Fig. 1 is the Anti-overfitting CSI net model construction and training flow chart of the present invention;

图2为本发明的Anti-overfitting CSI net模型网络架构图;Fig. 2 is the Anti-overfitting CSI net model network architecture diagram of the present invention;

图3为本发明的神经网络正常训练示意图;3 is a schematic diagram of the normal training of the neural network of the present invention;

图4为本发明的加入抗拟合单元训练示意图。FIG. 4 is a schematic diagram of adding an anti-fitting unit for training according to the present invention.

具体实施方式Detailed ways

下面结合实施例对本发明做进一步详细说明:Below in conjunction with embodiment, the present invention is described in further detail:

如图1所示,一种基于抗拟合深度学习的MIMO道状态信息反馈方法,包括以下步骤:As shown in Figure 1, a MIMO channel state information feedback method based on anti-fitting deep learning includes the following steps:

(1)构建AOCN(Anti-overfitting CSI net,AOCN)模型,利用卷积神经网络作为编码器和解码器,卷积神经网络可以通过加强相邻层神经元之间的局部连接模式来利用空间局部相关性,信道矩阵H的实部和虚部作为它的输入;(1) Construct AOCN (Anti-overfitting CSI net, AOCN) model, using convolutional neural network as encoder and decoder, convolutional neural network can utilize spatial localization by strengthening the local connection pattern between neurons in adjacent layers correlation, the real and imaginary parts of the channel matrix H as its input;

(2)信道矩阵H数据进入编码器,编码器位于发送数据的用户端,将信道矩阵H编码为一个低维度的数据,编码器包含卷积层和全连接层。包括以下内容:在编码器的卷积层,这一层使用尺寸为3×3的内核来生成两个特征图。在卷积层之后,我们将特征图重塑成一个向量,并使用一个全连接层来生成码字s,这是一个大小为M×1的向量。前两层模拟压缩感知的投影并充当编码器;(2) The channel matrix H data enters the encoder. The encoder is located at the user end that sends the data, and encodes the channel matrix H into a low-dimensional data. The encoder includes a convolution layer and a fully connected layer. It consists of the following: In the convolutional layer of the encoder, this layer uses a kernel of size 3×3 to generate two feature maps. After the convolutional layer, we reshape the feature map into a vector and use a fully connected layer to generate the codeword s, which is a vector of size M × 1. The first two layers simulate the projection of compressed sensing and act as encoders;

(3)数据经过编码后进入反馈链路,到达接收端;(3) After the data is encoded, it enters the feedback link and reaches the receiving end;

(4)在接收端的基站,解码器开始进行译码,将编码端低维的数据进行从新构建。接收端的解码器包含抗拟合层,全连接层、RefineNet层、卷积层,输出预测的信道矩阵。解码器工作流程表示为:在接收端获得码字s,随后使用神经网络层(作为解码器)将其映射回通道矩阵H。解码器的第一层我们加入随机失活(Dropout)算法。(4) At the base station at the receiving end, the decoder starts decoding, and reconstructs the low-dimensional data at the encoding end. The decoder at the receiving end includes an anti-fitting layer, a fully connected layer, a RefineNet layer, a convolutional layer, and outputs the predicted channel matrix. The decoder workflow is expressed as: the codeword s is obtained at the receiver and subsequently mapped back to the channel matrix H using a neural network layer (as the decoder). In the first layer of the decoder, we add a random dropout algorithm.

随机失活算法原理如图3和图4所示,随机失活算法原理数据经过输入层、多层隐藏层、输出层。在正常数据传输过程中,数据如图3进行传输,每一层的神经元与下一层神经元都进行全部连接,这样当大量数据进行训练时,经常会造成过拟合的现象,因此,随机失活算法被提了出来,它将每一层与下一层进行相连的神经元进行随机失活,使神经元与下一层的神经元断开,这样做的目的减少训练数据,达到防止过拟合的目的。随机失活算法只是在训练时将神经元随机失活,得到训练好的模型后,在测试时所有神经元将又重新连接在一起,经过神经网络的计算,最终输出高精度的计算结果,使之获得完整的信道状态信息。The principle of the random deactivation algorithm is shown in Figure 3 and Figure 4. The principle of the random deactivation algorithm The data passes through the input layer, the multi-layer hidden layer, and the output layer. In the normal data transmission process, the data is transmitted as shown in Figure 3, and the neurons of each layer are all connected to the neurons of the next layer, so when a large amount of data is used for training, it will often cause overfitting. Therefore, The random deactivation algorithm is proposed, which randomly deactivates the neurons connected to the next layer in each layer, so that the neurons are disconnected from the neurons in the next layer. The purpose of this is to reduce the training data and achieve The purpose of preventing overfitting. The random deactivation algorithm just randomly deactivates neurons during training. After the trained model is obtained, all neurons will be reconnected during testing. After the calculation of the neural network, the final output of high-precision calculation results, so that to obtain complete channel state information.

第二层是一个以经过随机失活处理的s为输入,输出两个大小为Nc×Nt的矩阵的全连通层,作为H的实部和虚部的初始估计。然后,初始估计数被输入到几个不断细化重建的细分网络单元中。每个RefineNet单元由四层组成,在RefineNet单元中,第一层是输入层,所有剩下的3层使用3×3个内核。第二层和第三层分别生成8和16个特征图,最后一层生成H的最终重构。通过适当的补零,将三个卷积层生成的特征图设置为与输入通道矩阵大小Nc×Nt相同的大小。选取ReLU(x)=max(x,0)作为激活函数,对每一层进行批量归一化处理。The second layer is a fully connected layer that takes the randomly deactivated s as input and outputs two matrices of size N c ×N t as initial estimates of the real and imaginary parts of H. The initial estimates are then fed into several subdivision network units that are continuously refined and reconstructed. Each RefineNet unit consists of four layers, in a RefineNet unit, the first layer is the input layer, and all the remaining 3 layers use 3×3 kernels. The second and third layers generate 8 and 16 feature maps, respectively, and the last layer generates the final reconstruction of H. The feature maps generated by the three convolutional layers are set to the same size as the input channel matrix size Nc × Nt with appropriate zero-padding. Select ReLU(x)=max(x,0) as the activation function, and batch normalize each layer.

通过一系列RefineNet单元对信道矩阵进行细化后,将信道矩阵输入到最终的卷积层,使用sigmoid函数将值缩放到[0,1]范围。After the channel matrix is refined through a series of RefineNet units, the channel matrix is input to the final convolutional layer, and the values are scaled to the [0, 1] range using the sigmoid function.

(5)Anti-overfitting CSI net模型构建完成,如图2所示,然后将模型进行离线训练,首先初始化模型参数,误差收敛后保存模型,最后将训练好保存的Anti-overfittingCSI net模型在线进行预测信道状态信息。为了训练Anti-overfitting CSI net,我们对编码器和解码器的所有内核和偏置值使用端到端学习。参数集记为Θ={Θen,Θde}。Anti-overfitting CSI net的输入为Hi,重构的信道矩阵为

Figure GDA0003266171180000061
值得注意的是,Anti-overfitting CSI net的输入和输出都是归一化的通道矩阵,其元素在[0,1]范围内缩放。与自动编码器类似,Anti-overfitting CSI net是一种无监督学习算法。损失函数为均方误差(mean squared error,MSE),计算方法如下:(5) The construction of the Anti-overfitting CSI net model is completed, as shown in Figure 2, and then the model is trained offline. First, the model parameters are initialized, and the model is saved after the error converges. Finally, the trained and saved Anti-overfitting CSI net model is predicted online. Channel state information. To train the Anti-overfitting CSI net, we use end-to-end learning for all kernel and bias values of the encoder and decoder. The parameter set is denoted as Θ = {Θ en , Θ de }. The input of Anti-overfitting CSI net is H i , and the reconstructed channel matrix is
Figure GDA0003266171180000061
It is worth noting that both the input and output of the Anti-overfitting CSI net are normalized channel matrices whose elements are scaled in the range [0, 1]. Similar to autoencoders, Anti-overfitting CSI net is an unsupervised learning algorithm. The loss function is mean squared error (MSE), and the calculation method is as follows:

Figure GDA0003266171180000062
Figure GDA0003266171180000062

其中||·||2是欧几里得范数,T是在训练集的样本总数。where ||·|| 2 is the Euclidean norm and T is the total number of samples in the training set.

以上所述的实施例仅仅是对本发明的优选实施方式进行描述,并非对本发明的范围进行限定,在不脱离本发明设计精神的前提下,本领域普通技术人员对本发明的技术方案做出的各种变形和改进,均应落入本发明权利要求书确定的保护范围内。The above-mentioned embodiments are only to describe the preferred embodiments of the present invention, and do not limit the scope of the present invention. On the premise of not departing from the design spirit of the present invention, those of ordinary skill in the art can Such deformations and improvements shall fall within the protection scope determined by the claims of the present invention.

Claims (3)

1.基于抗拟合深度学习的MIMO信道状态信息反馈方法,其特征在于包括以下步骤:1. The MIMO channel state information feedback method based on anti-fitting deep learning is characterized by comprising the following steps: (1)构建Anti-overfitting CSI net模型,利用卷积神经网络作为编码器和解码器,卷积神经网络通过加强相邻层神经元之间的局部连接模式来利用空间局部相关性,信道矩阵H的实部和虚部作为Anti-overfitting CSI net模型的输入;(1) Construct an Anti-overfitting CSI net model, using convolutional neural network as encoder and decoder, convolutional neural network utilizes spatial local correlation by strengthening the local connection pattern between neurons in adjacent layers, the channel matrix H The real and imaginary parts are used as the input of the Anti-overfitting CSI net model; (2)信道矩阵H数据进入编码器,编码器位于发送数据的用户端,将信道矩阵H编码为一个低维度的数据,编码器包含卷积层和全连接层;(2) The channel matrix H data enters the encoder, the encoder is located at the user end sending the data, and the channel matrix H is encoded into a low-dimensional data, and the encoder includes a convolution layer and a fully connected layer; (3)数据经过编码后进入反馈链路,到达接收端;(3) After the data is encoded, it enters the feedback link and reaches the receiving end; (4)在接收端的基站,解码器开始进行译码,将编码端低维的数据进行从新构建;接收端的解码器包含抗拟合层、全连接层、RefineNet层、卷积层,输出预测的信道矩阵;(4) At the base station at the receiving end, the decoder starts to decode and reconstructs the low-dimensional data at the encoding end; the decoder at the receiving end includes an anti-fitting layer, a fully connected layer, a RefineNet layer, and a convolution layer, and outputs the predicted channel matrix; (5)Anti-overfitting CSI net模型构建完成后,将模型进行离线训练,首先初始化模型参数,误差收敛后保存模型,最后将训练好保存的Anti-overfitting CSI net模型在线进行预测信道状态信息;(5) After the construction of the Anti-overfitting CSI net model is completed, the model is trained offline. First, the model parameters are initialized, and the model is saved after the error converges. Finally, the trained and saved Anti-overfitting CSI net model is used to predict the channel state information online; 步骤(4)所述的解码器工作流程表示为:The decoder workflow described in step (4) is expressed as: 在接收端获得码字s,随后利用卷积神经网络将其映射回通道矩阵H;解码器的第一层为抗拟合层,我们加入随机失活算法,其方法为在每一个训练周期随机丢掉一定比例的节点信息,也就是将一定比例的上一层输出在这次的训练阶段变零,让下一层的节点根据剩下的信息决定数值;第二层是一个以经过随机失活处理的s为输入,输出两个大小为Nc×Nt的矩阵的全连通层,作为H的实部和虚部的初始估计,其中:Nc为接收天线数,Nt为发射天线数;然后,初始估计数被输入到几个不断细化重建的细分网络单元中;RefineNet层包括多个RefineNet单元,每个RefineNet单元由四层组成,在RefineNet单元中,第一层是输入层,所有剩下的3层使用3×3个内核;第二层和第三层分别生成8和16个特征图,最后一层生成H的最终重构;通过适当的补零,将三个卷积层生成的特征图设置为与输入通道矩阵大小Nc×Nt相同的大小;选取ReLU(x)=max(x,0)作为激活函数,对每一层进行批量归一化处理;The codeword s is obtained at the receiving end, and then mapped back to the channel matrix H using a convolutional neural network; the first layer of the decoder is an anti-fitting layer, and we add a random deactivation algorithm, which is randomly selected in each training cycle. Losing a certain proportion of node information, that is, changing a certain proportion of the output of the previous layer to zero in this training phase, and letting the nodes of the next layer determine the value according to the remaining information; the second layer is a random deactivation method. The processed s is the input and outputs a fully connected layer of two matrices of size N c ×N t as the initial estimates of the real and imaginary parts of H, where: N c is the number of receive antennas and N t is the number of transmit antennas ; Then, the initial estimates are input into several subdivision network units that are continuously refined and reconstructed; the RefineNet layer includes multiple RefineNet units, each RefineNet unit consists of four layers, in the RefineNet unit, the first layer is the input layer , all remaining 3 layers use 3 × 3 kernels; the second and third layers generate 8 and 16 feature maps, respectively, and the last layer generates the final reconstruction of H; with appropriate zero-padding, the three volumes The feature map generated by the stacked layer is set to the same size as the input channel matrix size N c ×N t ; ReLU(x)=max(x, 0) is selected as the activation function, and batch normalization is performed on each layer; 通过一系列RefineNet单元对信道矩阵进行细化后,将信道矩阵输入到最终的卷积层,使用sigmoid函数将值缩放到[0,1]范围;After the channel matrix is refined through a series of RefineNet units, the channel matrix is input to the final convolutional layer, and the values are scaled to the [0,1] range using the sigmoid function; 步骤(5)包括以下内容:Step (5) includes the following: 为了训练Anti-overfitting CSI net,我们对编码器和解码器的所有内核和偏置值使用端到端学习;参数集记为Θ={Θende},其中Θen为编码参数集,Θde为解码参数集;Anti-overfitting CSI net的输入为Hi,重构的信道矩阵为
Figure FDA0003266171170000021
其中,Hi为真实的信道矩阵,fde为解码函数,fen为编码函数,si为第i个信道矩阵的压缩后的码字;值得注意的是,Anti-overfitting CSI net的输入和输出都是归一化的通道矩阵,其元素在[0,1]范围内缩放;损失函数为均方误差(mean squared error,MSE),计算方法如下:
To train the Anti-overfitting CSI net, we use end-to-end learning for all kernel and bias values of the encoder and decoder; the parameter set is denoted as Θ={Θ ende }, where Θ en is the encoding parameter set, Θ de is the decoding parameter set; the input of Anti-overfitting CSI net is H i , and the reconstructed channel matrix is
Figure FDA0003266171170000021
Among them, H i is the real channel matrix, f de is the decoding function, f en is the encoding function, and si is the compressed codeword of the i -th channel matrix; it is worth noting that the input and The outputs are all normalized channel matrices whose elements are scaled in the range [0,1]; the loss function is mean squared error (MSE), which is calculated as follows:
Figure FDA0003266171170000022
Figure FDA0003266171170000022
其中||·||2是欧几里得范数,T是在训练集的样本总数。where ||·|| 2 is the Euclidean norm and T is the total number of samples in the training set.
2.根据权利要求1所述的基于抗拟合深度学习的MIMO信道状态信息反馈方法,其特征在于:步骤(2)包括:2. The MIMO channel state information feedback method based on anti-fitting deep learning according to claim 1, wherein step (2) comprises: 在编码器的卷积层,这一层使用尺寸为的内核来生成两个特征图;在卷积层之后,我们将特征图重塑成一个向量,并使用一个全连接层来生成码字s,这是一个大小为的向量;卷积层和全连接层模拟压缩感知的投影并充当编码器。In the convolutional layer of the encoder, this layer uses a kernel of size to generate two feature maps; after the convolutional layer, we reshape the feature map into a vector and use a fully connected layer to generate the codeword s , which is a vector of size ; the convolutional and fully connected layers simulate the projection of compressed sensing and act as encoders. 3.根据权利要求1所述的基于抗拟合深度学习的MIMO信道状态信息反馈方法,其特征在于:随机失活算法原理数据经过输入层、多层隐藏层、输出层;在正常数据传输过程中,每一层的神经元与下一层神经元都进行全部连接,随机失活算法将每一层与下一层进行相连的神经元进行随机失活,使神经元与下一层的神经元断开;随机失活算法在训练时将神经元随机失活,得到训练好的模型后,在测试时所有神经元将又重新连接在一起。3. The MIMO channel state information feedback method based on anti-fitting deep learning according to claim 1, is characterized in that: the principle data of random deactivation algorithm passes through input layer, multi-layer hidden layer, output layer; in normal data transmission process The neurons of each layer are all connected to the neurons of the next layer, and the random deactivation algorithm randomly deactivates the neurons that are connected to the next layer, so that the neurons of the next layer are connected to the neurons of the next layer. Unit disconnection; the random deactivation algorithm randomly deactivates neurons during training, and after the trained model is obtained, all neurons will be reconnected during testing.
CN202010745080.9A 2020-07-29 2020-07-29 MIMO channel state information feedback method based on fitting depth learning resistance Active CN111901024B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010745080.9A CN111901024B (en) 2020-07-29 2020-07-29 MIMO channel state information feedback method based on fitting depth learning resistance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010745080.9A CN111901024B (en) 2020-07-29 2020-07-29 MIMO channel state information feedback method based on fitting depth learning resistance

Publications (2)

Publication Number Publication Date
CN111901024A CN111901024A (en) 2020-11-06
CN111901024B true CN111901024B (en) 2021-11-05

Family

ID=73182650

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010745080.9A Active CN111901024B (en) 2020-07-29 2020-07-29 MIMO channel state information feedback method based on fitting depth learning resistance

Country Status (1)

Country Link
CN (1) CN111901024B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4346174A1 (en) * 2022-09-30 2024-04-03 Nokia Technologies Oy Devices, methods and apparatuses for channel prediction

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117413496A (en) * 2021-06-12 2024-01-16 华为技术有限公司 Link adaptation supporting artificial intelligence
CN115604731A (en) * 2021-07-09 2023-01-13 华为技术有限公司(Cn) Communication method and device
CN113660015B (en) * 2021-08-11 2022-08-16 东南大学 Online wireless channel acquisition optimization method under assistance of environment knowledge base
CN117751559A (en) * 2021-11-02 2024-03-22 Oppo广东移动通信有限公司 CSI feedback method, device, equipment and storage medium
CN118120156A (en) * 2021-12-31 2024-05-31 Oppo广东移动通信有限公司 Training method, device, equipment and storage medium of channel information feedback model
CN117200838A (en) * 2022-06-01 2023-12-08 华为技术有限公司 Communication method and device
CN115484000A (en) * 2022-08-05 2022-12-16 北京结慧科技有限公司 Channel state information feedback method, electronic device, and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108390706A (en) * 2018-01-30 2018-08-10 东南大学 A kind of extensive mimo channel state information feedback method based on deep learning
CN109474316A (en) * 2018-11-22 2019-03-15 东南大学 A Channel Information Compression Feedback Method Based on Deep Recurrent Neural Networks
CN109525292A (en) * 2018-12-24 2019-03-26 东南大学 A kind of channel information compression feedback method using bit-level optimization network
CN110350958A (en) * 2019-06-13 2019-10-18 东南大学 A kind of more multiplying power compressed feedback methods of CSI of extensive MIMO neural network based
CN110912598A (en) * 2019-11-22 2020-03-24 中原工学院 CSI feedback method for massive MIMO system based on long and short-term attention mechanism

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11003988B2 (en) * 2016-11-23 2021-05-11 General Electric Company Hardware system design improvement using deep learning algorithms
CN110311718B (en) * 2019-07-05 2022-06-10 东南大学 A Quantization and Inverse Quantization Method in Massive MIMO Channel State Information Feedback
CN111464220B (en) * 2020-03-10 2021-06-29 西安交通大学 A Reconstruction Method of Channel State Information Based on Deep Learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108390706A (en) * 2018-01-30 2018-08-10 东南大学 A kind of extensive mimo channel state information feedback method based on deep learning
CN109474316A (en) * 2018-11-22 2019-03-15 东南大学 A Channel Information Compression Feedback Method Based on Deep Recurrent Neural Networks
CN109525292A (en) * 2018-12-24 2019-03-26 东南大学 A kind of channel information compression feedback method using bit-level optimization network
CN110350958A (en) * 2019-06-13 2019-10-18 东南大学 A kind of more multiplying power compressed feedback methods of CSI of extensive MIMO neural network based
CN110912598A (en) * 2019-11-22 2020-03-24 中原工学院 CSI feedback method for massive MIMO system based on long and short-term attention mechanism

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
A fast channel state information feedback scheme based on compressive sensing for underwater acoustic MIMO systems;Hui Ma等;《2016 35th Chinese Control Conference (CCC)》;20160729;全文 *
Controlled dropout: A different approach to using dropout on deep neural network;ByungSoo Ko等;《2017 IEEE International Conference on Big Data and Smart Computing (BigComp)》;20170216;摘要,第I-II节,图1-图2 *
Sequence-Dropout Block for Reducing Overfitting Problem in Image Classification;Ledan Qian等;《IEEE Access》;20200327;第8卷;全文 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4346174A1 (en) * 2022-09-30 2024-04-03 Nokia Technologies Oy Devices, methods and apparatuses for channel prediction

Also Published As

Publication number Publication date
CN111901024A (en) 2020-11-06

Similar Documents

Publication Publication Date Title
CN111901024B (en) MIMO channel state information feedback method based on fitting depth learning resistance
CN110474716B (en) Method for establishing SCMA codec model based on noise reduction self-encoder
CN108847876B (en) A Massive MIMO Time-varying Channel State Information Compression Feedback and Reconstruction Method
CN109672464B (en) FCFNN-based large-scale MIMO channel state information feedback method
CN110311718B (en) A Quantization and Inverse Quantization Method in Massive MIMO Channel State Information Feedback
CN108390706B (en) Large-scale MIMO channel state information feedback method based on deep learning
CN109617584A (en) A Design Method of Beamforming Matrix for MIMO System Based on Deep Learning
CN111713035B (en) MIMO multi-antenna signal transmission and detection technology based on artificial intelligence
CN113269077B (en) Underwater acoustic communication signal modulation mode identification method based on improved gating network and residual error network
CN108462517A (en) A kind of MIMO link self-adaption transmission methods based on machine learning
CN114268328B (en) Convolution code decoding method and convolution code encoding and decoding method based on bidirectional LSTM
CN110445581A (en) The method for reducing the channel decoding bit error rate based on convolutional neural networks
CN115021787A (en) Channel state information feedback method based on complex convolutional neural network
Xie et al. Communication-efficient framework for distributed image semantic wireless transmission
CN116484192A (en) Abnormal node detection method of unsupervised heteroleptic heterograph
CN114630207B (en) Multi-sensing-node sensing data collection method based on noise reduction self-encoder
CN108964672A (en) A kind of polarization code coding method based on deep neural network
CN115865145A (en) A Transformer-based Massive MIMO Channel State Information Feedback Method
CN118279642A (en) Swin transducer-based wireless image semantic communication system and design method thereof
CN114389730B (en) A beamforming design method for MISO system based on deep learning and dirty paper coding
CN116155333A (en) A Channel State Information Feedback Method Applicable to Massive MIMO Systems
CN113992313B (en) Balanced network assisted SCMA encoding and decoding method based on deep learning
CN113225110B (en) Joint Optimization Method of Dispersion Matrix and 3-D Constellation of STSK System Based on Improved Genetic Algorithm
CN115065392A (en) Beam forming design method for realizing MISO downlink sum rate maximization under dirty paper coding condition
CN115664893A (en) Deep learning-based symbol detection method in MIMO communication system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant