WO2023272739A1 - Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus - Google Patents

Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus Download PDF

Info

Publication number
WO2023272739A1
WO2023272739A1 PCT/CN2021/104367 CN2021104367W WO2023272739A1 WO 2023272739 A1 WO2023272739 A1 WO 2023272739A1 CN 2021104367 W CN2021104367 W CN 2021104367W WO 2023272739 A1 WO2023272739 A1 WO 2023272739A1
Authority
WO
WIPO (PCT)
Prior art keywords
decoding
samples
information sequence
codeword
discriminator
Prior art date
Application number
PCT/CN2021/104367
Other languages
French (fr)
Chinese (zh)
Inventor
郑凤
庞博文
池连刚
Original Assignee
北京小米移动软件有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京小米移动软件有限公司 filed Critical 北京小米移动软件有限公司
Priority to PCT/CN2021/104367 priority Critical patent/WO2023272739A1/en
Priority to CN202180002056.9A priority patent/CN115804067A/en
Publication of WO2023272739A1 publication Critical patent/WO2023272739A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/37Decoding methods or techniques, not specific to the particular type of coding provided for in groups H03M13/03 - H03M13/35
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L25/00Baseband systems
    • H04L25/02Details ; arrangements for supplying electrical power along data transmission lines

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Theoretical Computer Science (AREA)
  • Power Engineering (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Error Detection And Correction (AREA)

Abstract

The present disclosure relates to the field of communication, and provided within is a channel decoding method. A technical solution of the present disclosure, in essence, is: decoding, on the basis of a pre-trained neural network model, a received codeword to be decoded that is obtained after an information sequence undergoes channel transmission, so as to obtain a decoded codeword corresponding to the received codeword; wherein the pre-trained neural network model comprises a decoding generator, and the decoding generator decodes the received codeword and outputs the decoded codeword.

Description

信道译码方法及装置、用于信道译码的神经网络模型的训练方法及装置Channel decoding method and device, neural network model training method and device for channel decoding 技术领域technical field
本公开涉及移动通信技术领域,特别涉及一种信道译码方法及装置,以及一种用于信道译码的神经网络模型的训练方法及装置。The present disclosure relates to the technical field of mobile communication, and in particular to a channel decoding method and device, and a training method and device for a neural network model used for channel decoding.
背景技术Background technique
随着5G技术的商用化,对于无线通信系统的数据传输速率、数据传输量等提出了更高要求,为此需要提供一种具有更低误码率和更高数据传输速率的信道译码技术来支持无线通信系统的各种业务数据的传输需求。With the commercialization of 5G technology, higher requirements are put forward for the data transmission rate and data transmission volume of wireless communication systems. Therefore, it is necessary to provide a channel decoding technology with lower bit error rate and higher data transmission rate. To support the transmission requirements of various business data of the wireless communication system.
发明内容Contents of the invention
本公开提出了一种信道译码方法及装置,通过基于预训练的神经网络模型来实现对码字的译码,从而提供了一种具有低误码率、低译码时长以及低译码复杂度的信道译码方案。此外,本公开还提出了一种用于信道译码的神经网络模型的训练方法及装置,以通过迭代训练获得能够用于信道译码的神经网络模型。The present disclosure proposes a channel decoding method and device, which realizes decoding of codewords based on a pre-trained neural network model, thereby providing a channel decoding method with low bit error rate, low decoding time and low decoding complexity. Degree channel decoding scheme. In addition, the present disclosure also proposes a training method and device for a neural network model for channel decoding, so as to obtain a neural network model that can be used for channel decoding through iterative training.
本公开的第一方面实施例提供了一种用于信道译码的神经网络模型的训练方法,所述神经网络模型包括译码生成器和译码辨别器,所述方法包括:获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。The embodiment of the first aspect of the present disclosure provides a training method for a neural network model for channel decoding, the neural network model includes a decoding generator and a decoding discriminator, and the method includes: obtaining information sequences including The information sequence training set of samples, and obtain the received codeword samples to be decoded based on the information sequence samples; use the received codeword samples as the input features of the decoding generator, and use the information sequence samples and the The decoded codeword sample output by the decoding generator is used as the input feature of the decoded discriminator, and whether the information sequence sample and the decoded codeword sample can be distinguished is used as the output of the decoded discriminator feature, perform iterative training to obtain the pre-trained neural network model.
可选地,所述基于所述信息序列样本获得待译码的接收码字样本包括:基于所述信息序列样本,通过编码器获得编码码字样本;对所述编码码字样本进行调制以获得调制码字样本;以及将所述调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。Optionally, the obtaining a received codeword sample to be decoded based on the information sequence sample includes: obtaining an encoded codeword sample by an encoder based on the information sequence sample; modulating the encoded codeword sample to obtain modulating codeword samples; and inputting the modulated codeword samples into a noise-added channel to obtain received codeword samples transmitted through the channel.
可选地,迭代训练中的每轮训练包括:基于从用于本轮训练的信息序列样本获得的接收码字样本,通过所述译码生成器获得与所述信息序列样本对应的译码码字样本;基于所述译码码字样本和所述信息序列样本,通过所述译码辨别器确定能否区分所述译码码字样本和所述信息序列样本;如果确定能够区分所述译码码字样本和所述信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;以及如果确定无法区分所述译码码字样本和所述信息序列样本,结束所述迭代训练并获得所述预训练的神经网络模型。Optionally, each round of training in iterative training includes: based on the received codeword samples obtained from the information sequence samples used for this round of training, the decoding code corresponding to the information sequence samples is obtained by the decoding generator word sample; based on the decoded codeword sample and the information sequence sample, determine whether the decoded codeword sample and the information sequence sample can be distinguished through the decoding discriminator; if it is determined that the decoded codeword sample and the information sequence sample can be distinguished codeword samples and the information sequence samples, update the decoding generator and the decoding discriminator through the backpropagation method, and repeat the above steps until all the information sequence samples used for this round of training have passed The next round of training starts after the decoding generator and the decoding discriminator process; and if it is determined that the decoding codeword sample and the information sequence sample cannot be distinguished, the iterative training is ended and the pre-training neural network model.
可选地,所述译码辨别器和所述译码生成器的训练目标表示为:Optionally, the training target of the decoding discriminator and the decoding generator is expressed as:
Figure PCTCN2021104367-appb-000001
Figure PCTCN2021104367-appb-000001
其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
Figure PCTCN2021104367-appb-000002
表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本, x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
Figure PCTCN2021104367-appb-000003
表示概率分布。
Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
Figure PCTCN2021104367-appb-000002
Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
Figure PCTCN2021104367-appb-000003
represents a probability distribution.
可选地,基于以下公式使用梯度上升法对所述译码辨别器进行更新:Optionally, the decoding discriminator is updated using a gradient ascent method based on the following formula:
Figure PCTCN2021104367-appb-000004
Figure PCTCN2021104367-appb-000004
其中,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000005
表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
Figure PCTCN2021104367-appb-000006
表示在上一次输入至所述译码辨别器中的信息序列样本。
Wherein, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000005
Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
Figure PCTCN2021104367-appb-000006
Indicates the information sequence samples last input to the decoder discriminator.
可选地,基于以下公式使用梯度下降法对所述译码生成器进行更新:Optionally, the decoding generator is updated using a gradient descent method based on the following formula:
Figure PCTCN2021104367-appb-000007
Figure PCTCN2021104367-appb-000007
其中,G表示所述译码生成器,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000008
表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
Wherein, G represents the decoding generator, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000008
represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
可选地,所述编码器为低密度奇偶校验LDPC码编码器。Optionally, the encoder is a low density parity check LDPC code encoder.
可选地,所述调制器为二进制相移键控BPSK调制。Optionally, the modulator is binary phase shift keying BPSK modulation.
可选地,所述加噪信道为以下之一:加性白噪声AWGN信道;以及瑞利信道。Optionally, the noise adding channel is one of the following: additive white noise AWGN channel; and Rayleigh channel.
可选地,所述神经网络模型为生成对抗神经网络GAN模型。Optionally, the neural network model is a GAN model for generating an adversarial neural network.
本公开第二方面实施例提供了一种信道译码方法,包括:基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字;其中,所述预训练的神经网络模型包括译码生成器,所述译码生成器对所述接收码字进行译码以输出所述译码码字。The embodiment of the second aspect of the present disclosure provides a channel decoding method, including: based on the pre-trained neural network model, decoding the received codeword to be decoded after the information sequence is transmitted through the channel, so as to obtain the A decoding codeword corresponding to the received codeword; wherein, the pre-trained neural network model includes a decoding generator, and the decoding generator decodes the receiving codeword to output the decoding code Character.
可选地,所述神经网络模型还包括译码辨别器,所述预训练的神经网络模型通过以下过程获得:获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。Optionally, the neural network model further includes a decoding discriminator, and the pre-trained neural network model is obtained through the following process: obtaining an information sequence training set including information sequence samples, and obtaining the information sequence to be translated based on the information sequence samples. The received codeword sample of the code; take the received codeword sample as the input feature of the decoding generator, and use the information sequence sample and the decoded codeword sample output by the decoding generator as the decoding The input feature of the discriminator, and whether the information sequence sample can be distinguished from the decoded codeword sample is used as the output feature of the decoding discriminator, and iterative training is performed to obtain the pre-trained neural network model.
可选地,所述基于所述信息序列样本获得待译码的接收码字样本包括:基于所述信息序列样本,通过编码器获得编码码字样本;对所述编码码字样本进行调制以获得调制码字样本;以及将所述调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。Optionally, the obtaining a received codeword sample to be decoded based on the information sequence sample includes: obtaining an encoded codeword sample by an encoder based on the information sequence sample; modulating the encoded codeword sample to obtain modulating codeword samples; and inputting the modulated codeword samples into a noise-added channel to obtain received codeword samples transmitted through the channel.
可选地,迭代训练中的每轮训练包括:基于从用于本轮训练的信息序列样本获得的接收码字样本,通过所述译码生成器获得与所述信息序列样本对应的译码码字样本;基于所述译码码字样本和所述信息序列样本,通过所述译码辨别器确定能否区分所述译码码字样本和所述信息序列样本;如果确定能够区分所述译码码字样本和所述信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;以及如果确定无法区分所述译码码字样本和所述信息序列样本,结束所述迭代训练并获得所述预训练的神经网络模型。Optionally, each round of training in iterative training includes: based on the received codeword samples obtained from the information sequence samples used for this round of training, the decoding code corresponding to the information sequence samples is obtained by the decoding generator word sample; based on the decoded codeword sample and the information sequence sample, determine whether the decoded codeword sample and the information sequence sample can be distinguished through the decoding discriminator; if it is determined that the decoded codeword sample and the information sequence sample can be distinguished codeword samples and the information sequence samples, update the decoding generator and the decoding discriminator through the backpropagation method, and repeat the above steps until all the information sequence samples used for this round of training have passed The next round of training starts after the decoding generator and the decoding discriminator process; and if it is determined that the decoding codeword sample and the information sequence sample cannot be distinguished, the iterative training is ended and the pre-training neural network model.
可选地,所述译码辨别器和所述译码生成器的训练目标表示为:Optionally, the training target of the decoding discriminator and the decoding generator is expressed as:
Figure PCTCN2021104367-appb-000009
Figure PCTCN2021104367-appb-000009
其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
Figure PCTCN2021104367-appb-000010
表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
Figure PCTCN2021104367-appb-000011
表示概率分布。
Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
Figure PCTCN2021104367-appb-000010
Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
Figure PCTCN2021104367-appb-000011
represents a probability distribution.
可选地,基于以下公式使用梯度上升法对所述译码辨别器进行更新:Optionally, the decoding discriminator is updated using a gradient ascent method based on the following formula:
Figure PCTCN2021104367-appb-000012
Figure PCTCN2021104367-appb-000012
其中,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000013
表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
Figure PCTCN2021104367-appb-000014
表示在上一次输入至所述译码辨别器中的信息序列样本。
Wherein, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000013
Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
Figure PCTCN2021104367-appb-000014
Indicates the information sequence samples last input to the decoder discriminator.
可选地,基于以下公式使用梯度下降法对所述译码生成器进行更新:Optionally, the decoding generator is updated using a gradient descent method based on the following formula:
Figure PCTCN2021104367-appb-000015
Figure PCTCN2021104367-appb-000015
其中,G表示所述译码生成器,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000016
表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
Wherein, G represents the decoding generator, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000016
represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
可选地,所述编码器为低密度奇偶校验LDPC码编码器。Optionally, the encoder is a low density parity check LDPC code encoder.
可选地,所述调制器为二进制相移键控BPSK调制。Optionally, the modulator is binary phase shift keying BPSK modulation.
可选地,所述加噪信道为以下之一:加性白噪声AWGN信道;以及瑞利信道。Optionally, the noise adding channel is one of the following: additive white noise AWGN channel; and Rayleigh channel.
可选地,所述神经网络模型为生成对抗神经网络GAN模型。Optionally, the neural network model is a GAN model for generating an adversarial neural network.
本公开的第三方面实施例提供了一种用于信道译码的神经网络模型的训练装置,所述神经网络模型包括译码生成器和译码辨别器,所述装置包括:获取模块,用于获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以及训练模块,用于以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。The embodiment of the third aspect of the present disclosure provides a neural network model training device for channel decoding, the neural network model includes a decoding generator and a decoding discriminator, the device includes: an acquisition module, used Obtaining an information sequence training set including information sequence samples, and obtaining received codeword samples to be decoded based on the information sequence samples; and a training module, configured to use the received codeword samples as the decoding generator The input feature is to use the information sequence sample and the decoded code word sample output by the decoding generator as the input feature of the decoding discriminator, and to distinguish the information sequence sample from the decoded code Word samples are used as the output features of the decoding discriminator, and iterative training is performed to obtain the pre-trained neural network model.
本公开的第四方面实施例提供了一种信道译码装置,包括:译码模块,用于基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字;其中,所述预训练的神经网络模型包括译码生成器,所述译码生成器对所述接收码字进行译码以输出所述译码码字。The embodiment of the fourth aspect of the present disclosure provides a channel decoding device, including: a decoding module, used for receiving codewords to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model Decoding is performed to obtain a decoding codeword corresponding to the received codeword; wherein, the pre-trained neural network model includes a decoding generator, and the decoding generator decodes the receiving codeword to output the decoded codeword.
本公开的第五方面实施例提供了一种电子设备,包括:存储器;处理器,与所述存储器连接,配置为通过执行所述存储器上的计算机可执行指令,以实现上述第一方面实施例的用于信道译码的神经网络模型的训练方法或第二方面实施例所述的信道译码方法。The embodiment of the fifth aspect of the present disclosure provides an electronic device, including: a memory; a processor connected to the memory and configured to implement the above-mentioned embodiment of the first aspect by executing computer-executable instructions on the memory A neural network model training method for channel decoding or the channel decoding method described in the embodiment of the second aspect.
本公开的第六方面实施例提供了一种电子设备,包括:模型训练器,用于基于包括信息序列样本以及与所述信息序列样本对应的待译码的接收码字样本获得预训练的神经网络模型,其中,所述神经网络模型包括译码生成器和译码辨别器,所述模型训练器用于以 所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型;译码器,所述译码器用于基于所述预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字。The embodiment of the sixth aspect of the present disclosure provides an electronic device, including: a model trainer, which is used to obtain a pre-trained neural network based on information sequence samples and received codeword samples corresponding to the information sequence samples to be decoded. A network model, wherein the neural network model includes a decoding generator and a decoding discriminator, the model trainer is used to use the received codeword sample as the input feature of the decoding generator, and the information sequence The sample and the decoded codeword sample output by the decoding generator are used as the input features of the decoding discriminator, and whether the information sequence sample and the decoded codeword sample can be distinguished is used as the decoding discrimination The output features of the device, perform iterative training to obtain the pre-trained neural network model; the decoder, the decoder is used to obtain the information sequence to be obtained after the information sequence is transmitted through the channel based on the pre-trained neural network model The decoded received codeword is decoded to obtain a decoded codeword corresponding to the received codeword.
本公开第七方面实施例提出了一种计算机存储介质,其中,所述计算机存储介质存储有计算机可执行指令;所述计算机可执行指令被处理器执行后,能够实现上述第一方面实施例的用于信道译码的神经网络模型的训练方法或第二方面实施例所述的信道译码方法。The embodiment of the seventh aspect of the present disclosure provides a computer storage medium, wherein the computer storage medium stores computer-executable instructions; after the computer-executable instructions are executed by a processor, the above-mentioned embodiment of the first aspect can be implemented. A training method for a neural network model for channel decoding or the channel decoding method described in the embodiment of the second aspect.
本公开实施例还提供了一种用于信道译码的神经网络模型的训练方法及装置,通过获取包括信息序列样本的信息序列训练集并基于所述信息序列样本获得待译码的接收码字样本,以及以接收码字样本作为译码生成器的输入特征,以信息序列样本和译码生成器输出的译码码字样本作为译码辨别器的输入特征,并以能否区分信息序列样本和译码码字样本作为译码辨别器的输出特征,进行迭代训练以获得经训练的神经网络模型。由此得出的经训练的神经网络模型能够对信息序列经信道传输后的待译码的接收码字进行译码以得到原信息序列。An embodiment of the present disclosure also provides a training method and device for a neural network model for channel decoding, by obtaining an information sequence training set including information sequence samples and obtaining the received code to be decoded based on the information sequence samples This, and take the received codeword sample as the input feature of the decoding generator, take the information sequence sample and the decoded codeword sample output by the decoding generator as the input feature of the decoding discriminator, and use whether the information sequence sample can be distinguished And the decoded codeword samples are used as the output features of the decoded discriminator, and iterative training is performed to obtain the trained neural network model. The trained neural network model thus obtained can decode the received codeword to be decoded after the information sequence is transmitted through the channel to obtain the original information sequence.
本公开实施例提供了一种信道译码方法及装置,通过基于预训练的神经网络模型对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与接收码字对应的译码码字,其中预训练的神经网络模型包括用于对接收码字进行译码以输出译码码字的译码生成器。由此,能够通过预训练的神经网络模型来实现对待译码码字的译码,从而实现了具有低误码率、低译码时长以及低译码复杂度的信道译码方案。The embodiment of the present disclosure provides a channel decoding method and device, which decodes the received codewords to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model, so as to obtain the received codewords A corresponding decoding codeword, wherein the pre-trained neural network model includes a decoding generator for decoding the received codeword to output the decoding codeword. Therefore, the decoding of the codeword to be decoded can be realized through the pre-trained neural network model, thereby realizing a channel decoding scheme with low bit error rate, low decoding duration and low decoding complexity.
本公开附加的方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本公开的实践了解到。Additional aspects and advantages of the disclosure will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the disclosure.
附图说明Description of drawings
本公开上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present disclosure will become apparent and understandable from the following description of the embodiments in conjunction with the accompanying drawings, wherein:
图1为根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图;FIG. 1 is a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure;
图2为根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图;FIG. 2 is a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure;
图3为根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图;FIG. 3 is a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure;
图4为根据本公开实施例的一种信道译码方法的流程示意图;FIG. 4 is a schematic flowchart of a channel decoding method according to an embodiment of the present disclosure;
图5为本公开实施例提供的一种用于信道译码的神经网络模型的训练方法装置的结构示意图;FIG. 5 is a schematic structural diagram of a training method and device for a neural network model for channel decoding provided by an embodiment of the present disclosure;
图6为本公开实施例提供的一种信道译码装置的结构示意图;FIG. 6 is a schematic structural diagram of a channel decoding device provided by an embodiment of the present disclosure;
图7为本公开实施例提供的一种电子设备的结构示意图;FIG. 7 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure;
图8为本公开实施例提供的一种电子设备的结构示意图。FIG. 8 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
具体实施方式detailed description
下面详细描述本公开的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本公开,而不能理解为对本公开的限制。Embodiments of the present disclosure are described in detail below, examples of which are illustrated in the drawings, in which the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present disclosure and should not be construed as limiting the present disclosure.
5G的商用化推动了人类生活方式、教育的进步,使得生活更加智能,并且加速了社会信息化的进程,这同时也导致了用户数据和系统容量的指数级膨胀,由此,对于新一代无线通信系统的数据的传输速率、系统容量提出了更高的要求。The commercialization of 5G has promoted the progress of human life style and education, made life more intelligent, and accelerated the process of social informatization, which also led to the exponential expansion of user data and system capacity. Therefore, for the new generation of wireless The data transmission rate and system capacity of the communication system put forward higher requirements.
现有的基于置信传播算法的信道译码算法,在接收到信息序列后,所有变量节点接收到对应的接收值,每个变量节点将传递一个可靠性消息给与它相邻的所有校验节点,每个校验节点接收到可靠性消息后做出处理,并将传递一个新的可靠性消息给与它相邻的所有变量节点,这个过程即可被视为一次迭代,一次迭代后将进行判决,若满足校验方程,则译码结束并输出判决结果,否则再次迭代,以此类推,直至满足校验方程或达到最大迭代次数。由于需要不断迭代来实现译码,现有的基于置信传播算法的信道译码算法复杂度高,译码时间长,且译码器件实现复杂。In the existing channel decoding algorithm based on the belief propagation algorithm, after receiving the information sequence, all variable nodes receive the corresponding received value, and each variable node will transmit a reliability message to all the check nodes adjacent to it , each check node will process after receiving the reliability message, and will pass a new reliability message to all the variable nodes adjacent to it, this process can be regarded as an iteration, after an iteration, the Judgment, if the verification equation is satisfied, the decoding ends and the judgment result is output, otherwise iterates again, and so on until the verification equation is satisfied or the maximum number of iterations is reached. Due to the need for continuous iterations to achieve decoding, the existing channel decoding algorithm based on the belief propagation algorithm has high complexity, long decoding time, and complex implementation of decoding devices.
为此,信道译码技术正在处于新一代技术的演进阶段。新一代信道译码技术的一个重要特点就是要维持海量数据传输并且在提升传输速率的同时保持较低的误码率。同时,不同业务类型根据性能需求对于信道译码技术有不同的要求,如eMBB业务数据中,长码使用LDPC码,而短码使用Polar码。因此,需要提供一种误码率更低的、译码速度更快的、译码模型复杂度更低的信道译码方案来支持各种业务数据的传输需求。For this reason, channel decoding technology is in the evolution stage of a new generation technology. An important feature of the new generation of channel decoding technology is to maintain massive data transmission and maintain a low bit error rate while increasing the transmission rate. At the same time, different service types have different requirements for channel decoding technology according to performance requirements. For example, in eMBB service data, LDPC codes are used for long codes, and Polar codes are used for short codes. Therefore, it is necessary to provide a channel decoding scheme with lower bit error rate, faster decoding speed and lower decoding model complexity to support the transmission requirements of various service data.
本公开提出了一种信道译码方法及装置,通过基于预训练的神经网络模型来实现对码字的译码,从而提供了一种具有低误码率、低译码时长以及低译码复杂度的信道译码方案。此外,本公开还提出了一种用于信道译码的神经网络模型的训练方法及装置,以通过训练过程获得能够用于信道译码的神经网络模型。The present disclosure proposes a channel decoding method and device, which realizes decoding of codewords based on a pre-trained neural network model, thereby providing a channel decoding method with low bit error rate, low decoding time and low decoding complexity. Degree channel decoding scheme. In addition, the present disclosure also proposes a training method and device for a neural network model for channel decoding, so as to obtain a neural network model that can be used for channel decoding through a training process.
下面结合附图对本申请所提供的信道译码方法及其装置进行详细地介绍。The channel decoding method and device thereof provided by the present application will be described in detail below with reference to the accompanying drawings.
图1示出了根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图,神经网络模型包括译码生成器和译码辨别器。如图1所示,该方法包括以下步骤。Fig. 1 shows a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure. The neural network model includes a decoding generator and a decoding discriminator. As shown in Figure 1, the method includes the following steps.
步骤S101,获取包括信息序列样本的信息序列训练集,并基于信息序列样本获得待译码的接收码字样本。Step S101, obtaining an information sequence training set including information sequence samples, and obtaining received codeword samples to be decoded based on the information sequence samples.
其中,神经网络模型可以为生成对抗神经网络(GAN,Generative Adversarial Networks)模型。Wherein, the neural network model may be a Generative Adversarial Networks (GAN, Generative Adversarial Networks) model.
GAN为人工智能网络的一种特征学习方法,把原始数据通过一些简单的但是非线性的模型转换为更高层次的、更加抽象的表达,只要通过足够多的转换组合,非常复杂的特征也能被学习。目前,GAN已经在计算机视觉、图像处理、语音识别、自然语言处理等领域应用并取得了优越的性能。GAN is a feature learning method of artificial intelligence network. It converts the original data into a higher-level and more abstract expression through some simple but nonlinear models. As long as there are enough conversion combinations, very complex features can also be be learned. At present, GAN has been applied in computer vision, image processing, speech recognition, natural language processing and other fields and has achieved superior performance.
其中,信息序列训练集是用于对神经网络模型进行训练的数据集,其可以包括多个信息序列。信息序列可以为例如二进制信息序列。Wherein, the information sequence training set is a data set used for training the neural network model, which may include multiple information sequences. The information sequence may be, for example, a binary information sequence.
此外,基于信息序列样本获得接收码字样本,该接收码字样本为经编码后获得的,因此,该接收码字样本为需要译码的码字样本。In addition, the received codeword samples are obtained based on the information sequence samples, and the received codeword samples are obtained after encoding. Therefore, the received codeword samples are codeword samples that need to be decoded.
步骤S102,以接收码字样本作为译码生成器的输入特征,以信息序列样本和译码生成器输出的译码码字样本作为译码辨别器的输入特征,并以能否区分信息序列样本和译码码字样本作为译码辨别器的输出特征,执行迭代训练以获得预训练的神经网络模型。其中,译码生成器基于接收码字样本输出译码码字样本。Step S102, take the received codeword sample as the input feature of the decoding generator, use the information sequence sample and the decoded codeword sample output by the decoding generator as the input feature of the decoding discriminator, and determine whether the information sequence sample can be distinguished and the decoded codeword samples are used as the output features of the decoded discriminator, and iterative training is performed to obtain a pre-trained neural network model. Wherein, the decoding generator outputs decoded codeword samples based on the received codeword samples.
神经网络模型中的译码生成器用于对待译码的接收码字样本进行译码而输出对应的译码码字样本,而译码辨别器用于对信息序列样本和从译码生成器获得的译码码字样本进行区分。通过以能够区分信息序列样本和译码码字样本作为译码辨别器的输出特征来执行迭代训练过程,能够获得经训练的神经网络模型,其中该经训练的神经网络模型中的译码生成器能够将从信息序列获得的接收码字译码为译码辨别器无法将其与信息序列区分开的译码码字,即该译码码字为信息序列(至少从译码辨别器的角度),也就是说,译码生成器能够将待译码的接收码字经译码还原为信息序列。The decoding generator in the neural network model is used to decode the received codeword samples to be decoded and output the corresponding decoded codeword samples, while the decoding discriminator is used to compare the information sequence samples and the decoded codeword samples obtained from the decoding generator. codeword samples to distinguish. By performing an iterative training process with the output features capable of distinguishing information sequence samples and decoding codeword samples as the decoding discriminator, a trained neural network model can be obtained, wherein the decoding generator in the trained neural network model Be able to decode the received codeword obtained from the information sequence into a decoded codeword that cannot be distinguished from the information sequence by the decoded discriminator, i.e. the decoded codeword is the information sequence (at least from the perspective of the decoded discriminator) , that is to say, the decoding generator can restore the received codeword to be decoded into an information sequence after decoding.
根据本发明实施例,通过获取包括信息序列样本的信息序列训练集并基于所述信息序列样本获得待译码的接收码字样本,以及以接收码字样本作为译码生成器的输入特征,以信息序列样本和译码生成器输出的译码码字样本作为译码辨别器的输入特征,并以能否区分信息序列样本和译码码字样本作为译码辨别器的输出特征,进行迭代训练以获得经训练的神经网络模型。由此得出的经训练的神经网络模型能够对信息序列经信道传输后的待译码的接收码字进行译码以得到原信息序列。According to an embodiment of the present invention, by obtaining an information sequence training set including information sequence samples and obtaining received codeword samples to be decoded based on the information sequence samples, and using the received codeword samples as input features of the decoding generator, to The information sequence samples and the decoded codeword samples output by the decoding generator are used as the input features of the decoding discriminator, and the ability to distinguish the information sequence samples and the decoded codeword samples as the output features of the decoding discriminator is used for iterative training to obtain a trained neural network model. The trained neural network model thus obtained can decode the received codeword to be decoded after the information sequence is transmitted through the channel to obtain the original information sequence.
图2示出了根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图,基于图1所示的实施例,在本示例实施例中,描述了基于信息序列样本获得待译码的接收码字样本的具体实施方式。FIG. 2 shows a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure. Based on the embodiment shown in FIG. 1 , in this example embodiment, an information sequence based The samples obtain a specific implementation of the received codeword samples to be decoded.
如图2所示,图1所示的步骤S101具体可以包括如下步骤。As shown in FIG. 2 , step S101 shown in FIG. 1 may specifically include the following steps.
步骤S201,基于信息序列样本,通过编码器获得编码码字样本。In step S201, based on the information sequence samples, codeword samples are obtained through an encoder.
可以利用编码器对信息序列样本进行编码以获得编码码字样本。其中,编码器可以为在移动通信系统中用于信道编码方案的编码器,例如低密度奇偶校验码(LDPC,Low Density Parity Check)编码器、极化(Polar)码编码器、Turbo码编码器。其中由于LDPC码是一种具有稀疏校验矩阵的线性分组码,其特征完全由奇偶校验矩阵决定,即其具有结构化构造,更适于深度学习网络学习,从而相对于非结构化码而言更易于获得适于LDPC码译码的神经网络模型。An encoder may be used to encode information sequence samples to obtain encoded codeword samples. Wherein, the encoder can be an encoder used in a channel coding scheme in a mobile communication system, such as a low density parity check code (LDPC, Low Density Parity Check) encoder, a polar (Polar) code encoder, a Turbo code encoder device. Among them, since LDPC code is a linear block code with a sparse check matrix, its characteristics are completely determined by the parity check matrix, that is, it has a structured structure, which is more suitable for deep learning network learning. In other words, it is easier to obtain a neural network model suitable for decoding LDPC codes.
步骤S202,对编码码字样本进行调制以获得调制码字样本。Step S202: Modulate the encoded codeword samples to obtain modulated codeword samples.
在获得编码码字样本之后,可以通过对编码码字样本进行调制获得调制码字样本。其中,例如,可以采用二进制相移键控(BPSK,Binary Phase Shift Keying)调制、频移键控(FSK,Frequency Shift Keying)调制等调制方案来对编码码字样本进行调制。After the encoded codeword samples are obtained, modulated codeword samples can be obtained by modulating the encoded codeword samples. Wherein, for example, binary phase shift keying (BPSK, Binary Phase Shift Keying) modulation, frequency shift keying (FSK, Frequency Shift Keying) modulation and other modulation schemes can be used to modulate the encoded codeword samples.
步骤S203,将调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。Step S203, input the modulated codeword samples into the noise adding channel to obtain the received codeword samples transmitted through the channel.
经过调制获得的调制码字样本可经过加噪信道的传输得出接收码字样本,由此接收码字样本为具有噪声的码字。The modulated codeword samples obtained through modulation can be transmitted through the noise-added channel to obtain received codeword samples, so that the received codeword samples are codewords with noise.
在示例实施例中,加噪信道可以为加性白噪声(AWGN,Additive White Gaussian Noise)信道或瑞利信道。In an example embodiment, the noise-added channel may be an Additive White Gaussian Noise (AWGN) channel or a Rayleigh channel.
例如,在具体示例中,可以将长度为L的信息序列x(其中,该信息序列的总长度为L,其含有信息的序列长小于等于L)输入LDPC编码器以获得长度为M(该长度M可以大于L或小于等于L,这取决于编码器所采用的编码方式)的码字u,码字u经过BPSK调制得到码字s,然而将码字s输入AWGN信道以获得带有噪声的码字y,其中y=s+n,n表示噪声。For example, in a specific example, an information sequence x with a length of L (wherein, the total length of the information sequence is L, and the length of the sequence containing information is less than or equal to L) can be input into the LDPC encoder to obtain a length of M (the length M can be greater than L or less than or equal to L, which depends on the codeword u used by the encoder), the codeword u is modulated by BPSK to obtain the codeword s, but the codeword s is input into the AWGN channel to obtain the noisy Codeword y, where y=s+n, n represents noise.
根据本发明实施例,通过对信息序列样本进行编码、调制、加噪信道传输后能够获得待译码的接收码字样本。According to the embodiment of the present invention, the received codeword samples to be decoded can be obtained by encoding, modulating, and adding noise to the channel of the information sequence samples.
图3示出了根据本公开实施例的一种用于信道译码的神经网络模型的训练方法的流程示意图,基于图1所示的实施例,在本示例实施例中,描述了迭代训练中每轮训练的具体实施方式。FIG. 3 shows a schematic flowchart of a training method for a neural network model for channel decoding according to an embodiment of the present disclosure. Based on the embodiment shown in FIG. 1, in this example embodiment, it describes the iterative training The specific implementation of each round of training.
如图3所示,图1所示的步骤S102具体可以包括如下步骤。As shown in FIG. 3 , step S102 shown in FIG. 1 may specifically include the following steps.
步骤S301,基于从用于本轮训练的信息序列样本获得的接收码字样本,通过译码生成器获得与信息序列样本对应的译码码字样本。Step S301, based on the received codeword samples obtained from the information sequence samples used for the current round of training, the decoded codeword samples corresponding to the information sequence samples are obtained through the decoding generator.
对于每轮训练,可以采用一个或多个信息序列样本来进行。例如,可以将从一个或多个信息序列样本获得的接收码字样本输入译码生成器以获得分别与一个或多个信息序列样本对应的一个或多个译码码字样本。For each round of training, one or more information sequence samples can be used. For example, received codeword samples obtained from one or more information sequence samples may be input into the decoding generator to obtain one or more decoded codeword samples respectively corresponding to the one or more information sequence samples.
步骤S302,基于译码码字样本和信息序列样本,通过译码辨别器确定能否区分译码码字样本和信息序列样本。Step S302, based on the decoded codeword samples and the information sequence samples, determine whether the decoded codeword samples and the information sequence samples can be distinguished through the decoding discriminator.
从译码生成器输出的一个或多个译码码字样本以及相应的一个或多个信息序列样本被输入译码辨别器,译码辨别器确定是否能够区分译码码字样本和信息序列样本。One or more decoded codeword samples and corresponding one or more information sequence samples output from the decode generator are input into a decode discriminator, which determines whether the decoded codeword samples and the information sequence samples can be distinguished .
在示例实施例中,译码辨别器和译码生成器的训练目标可以表示为:In an example embodiment, the training objectives of the decode discriminator and the decode generator can be expressed as:
Figure PCTCN2021104367-appb-000017
Figure PCTCN2021104367-appb-000017
其中,G表示译码生成器,D表示译码辨别器,V(D,G)表示信息序列样本与译码码字样本之间的差异,
Figure PCTCN2021104367-appb-000018
表示译码生成器和译码辨别器的训练目标,其中译码生成器的训练目标为能够最小化信息序列样本与译码码字样本之间的差异以及译码辨别器的训练目标为能够最大化地区分信息序列样本与译码码字样本之间的差异,x表示译码辨别器的输入数据,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
Figure PCTCN2021104367-appb-000019
表示概率分布。其中,p z(z)可以服从正态分布。
Among them, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoded codeword samples,
Figure PCTCN2021104367-appb-000018
Represents the training objectives of the decoding generator and the decoding discriminator, where the training objective of the decoding generator is to minimize the difference between the information sequence samples and the decoded codeword samples, and the training objective of the decoding discriminator is to be able to maximize The difference between the information sequence sample and the decoded codeword sample is distinguished in the z region, x represents the input data of the decoding discriminator, x~p data (x) represents that x obeys the data distribution input to the decoding generator, z Indicates the input noise, z~p z (z) indicates that z obeys the noise variable distribution input to the decoding generator,
Figure PCTCN2021104367-appb-000019
represents a probability distribution. Wherein, p z (z) may obey a normal distribution.
译码辨别器的训练目标是在保持译码生成器不变的情况下对译码辨别器进行更新来实现,而译码生成器的训练目标是基于更新后的译码辨别器对信息序列样本与译码码字样本之间的差异的区分情况对译码生成器进行更新来实现。The training goal of the decoding discriminator is to update the decoding discriminator while keeping the decoding generator unchanged, and the training target of the decoding generator is to analyze the information sequence samples based on the updated decoding discriminator. This is accomplished by updating the decoding generator to differentiate between the differences from the decoded codeword samples.
步骤S303,如果确定能够区分译码码字样本和信息序列样本,通过反向传播法对译码生成器以及译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过译码生成器和译码辨别器处理之后开始下轮训练。Step S303, if it is determined that the decoded codeword sample and the information sequence sample can be distinguished, update the decoding generator and the decoding discriminator through the back propagation method, and repeat the above steps until all the information sequence samples used for this round of training After being processed by the decoding generator and the decoding discriminator, the next round of training starts.
如果译码辨别器能够区分译码码字样本和信息序列样本,这表明译码生成器从接收码字样本所生成的译码码字样本与信息序列样本之前的差异可被译码辨别器区分,即译码生成器对接收码字样本的译码没有完全还原信息序列,因此,当前神经网络模型并不适用于对接收码字的译码,需要对译码生成器进行更新以使得其能够学习到更适合的译码算法,同时需要对译码辨别器进行更新以使得译码辨别器的辨别能力增强,通过训练过程来不断更新译码生成器和译码辨别器,从而最终达到纳什均衡状态。If the decoding discriminator can distinguish the decoded codeword samples from the information sequence samples, it means that the difference between the decoded codeword samples generated by the decoding generator from the received codeword samples and the information sequence samples can be distinguished by the decoding discriminator , that is, the decoding generator does not completely restore the information sequence to the decoding of the received codeword samples. Therefore, the current neural network model is not suitable for decoding the received codewords, and the decoding generator needs to be updated so that it can Learn a more suitable decoding algorithm, and at the same time need to update the decoding discriminator to enhance the discriminative ability of the decoding discriminator, and continuously update the decoding generator and the decoding discriminator through the training process, so as to finally reach the Nash equilibrium state.
例如,对于输入至译码辨别器中的一个译码码字样本和一个信息序列样本,如果译码辨别器能够区分该译码码字样本和该信息序列样本,则对译码生成器以及译码辨别器进行更新。具体的,通过使用梯度下降法对译码生成器进行更新以使得更新后的译码生成器能够最小化信息序列样本与译码码字样本之间的差异,以及通过使用使用梯度上升法对译码辨别器进行更新以使得更新后的译码辨别器能够最大化地区分信息序列样本与译码码字样本之间的差异。For example, for a decoded codeword sample and an information sequence sample input to the decoding discriminator, if the decoding discriminator can distinguish the decoded codeword sample from the information sequence sample, then the decoding generator and the decoding The code discriminator is updated. Specifically, the decoding generator is updated by using the gradient descent method so that the updated decoding generator can minimize the difference between the information sequence samples and the decoded codeword samples, and the decoding The code discriminator is updated so that the updated decoding discriminator can maximize the difference between the information sequence samples and the decoded code word samples.
在示例实施例中,基于以下公式使用梯度上升法对译码辨别器进行更新:In an example embodiment, the decoding discriminator is updated using gradient ascent based on the following formula:
Figure PCTCN2021104367-appb-000020
Figure PCTCN2021104367-appb-000020
其中,D表示译码辨别器,
Figure PCTCN2021104367-appb-000021
表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至译码辨别器中的信息序列样本,
Figure PCTCN2021104367-appb-000022
表示在上一次输入至译码辨别器中的信息序列样本。
Among them, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000021
Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the i time,
Figure PCTCN2021104367-appb-000022
Indicates the information sequence samples input to the decoder discriminator last time.
在示例实施例中,基于以下公式使用梯度下降法对译码生成器进行更新:In an example embodiment, the decoding generator is updated using gradient descent based on the following formula:
Figure PCTCN2021104367-appb-000023
Figure PCTCN2021104367-appb-000023
其中,G表示译码生成器,D表示所述辨别器,
Figure PCTCN2021104367-appb-000024
表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至译码生成器中的接收码字样本。
Among them, G represents the decoding generator, D represents the discriminator,
Figure PCTCN2021104367-appb-000024
Represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
在对译码生成器以及译码辨别器进行更新后,使用更新后的译码生成器以及更新后的译码辨别器重复以上步骤直至用于本轮训练的所有信息序列样本都经过译码生成器和译码辨别器处理之后开始。After updating the decoding generator and the decoding discriminator, use the updated decoding generator and the updated decoding discriminator to repeat the above steps until all the information sequence samples used for this round of training are decoded and generated After the discriminator and decode discriminator processing begins.
也就是说,在对译码生成器以及译码辨别器进行更新后,如果用于本轮训练的一个或多个信息序列样本中存在尚未经过译码生成器和译码辨别器处理的信息序列样本,则将从尚未处理的信息序列样本中任意一个获得的接收码字样本输入译码生成器以获得与该信息序列样本对应的译码码字样本,并将该信息序列样本和译码码字样本输入译码辨别器以确定能够区分该二者,如果能够区分,则再次对译码生成器和译码辨别器进行更新,以此类推,直至用于本轮训练的一个或多个信息序列样本中没有未经过译码生成器和译码辨别器处理的信息序列样本,即用于本轮训练的所有信息序列样本均已被使用,则可以开始下轮训练。That is to say, after the decoding generator and the decoding discriminator are updated, if there are information sequences that have not been processed by the decoding generator and the decoding discriminator in one or more information sequence samples used for this round of training sample, then input the received codeword sample obtained from any of the unprocessed information sequence samples into the decoding generator to obtain the decoding codeword sample corresponding to the information sequence sample, and combine the information sequence sample and the decoding code Word samples are input into the decoding discriminator to determine that the two can be distinguished. If they can be distinguished, the decoding generator and the decoding discriminator are updated again, and so on, until one or more information used for this round of training There are no information sequence samples in the sequence samples that have not been processed by the decoding generator and the decoding discriminator, that is, all the information sequence samples used for this round of training have been used, and the next round of training can start.
步骤S304,如果确定无法区分译码码字样本和信息序列样本,结束迭代训练并获得预训练的神经网络模型。Step S304, if it is determined that the decoded codeword sample and the information sequence sample cannot be distinguished, the iterative training is ended and a pre-trained neural network model is obtained.
如果译码辨别器无法区分出译码码字样本和信息序列样本,这表明译码生成器能够根据接收码字样本完全还原信息序列样本,因此,当前神经网络模型适用于对接收码字的译码,迭代训练结束,从而获得预训练的神经网络模型。If the decoding discriminator cannot distinguish the decoded codeword samples from the information sequence samples, it indicates that the decoding generator can completely restore the information sequence samples according to the received codeword samples. Therefore, the current neural network model is suitable for decoding the received codeword samples. Code, the iterative training ends, so as to obtain the pre-trained neural network model.
根据本发明实施例,在迭代训练中,通过梯度上升法对译码辨别器进行不断更新以及通过梯度下降法对译码生成器进行不断更新,从而能够获得适用于对接收码字进行译码的经训练的神经网络模型。According to the embodiment of the present invention, in the iterative training, the decoding discriminator is continuously updated by the gradient ascending method and the decoding generator is continuously updated by the gradient descent method, so that it is possible to obtain A trained neural network model.
图4示出了根据本公开实施例的一种信道译码方法的流程示意图,如图4所示,该方法包括以下步骤。Fig. 4 shows a schematic flowchart of a channel decoding method according to an embodiment of the present disclosure. As shown in Fig. 4 , the method includes the following steps.
步骤S401,基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与接收码字对应的译码码字。Step S401, based on the pre-trained neural network model, decode the received codeword to be decoded obtained after the information sequence is transmitted through the channel, so as to obtain the decoded codeword corresponding to the received codeword.
其中,预训练的神经网络模型包括译码生成器,译码生成器对接收码字进行译码以输出译码码字。Wherein, the pre-trained neural network model includes a decoding generator, and the decoding generator decodes the received codeword to output the decoded codeword.
在本实施例中,可以将信息序列经信道传输后得出的接收码字输入预训练的神经网络模型,该预训练的神经网络模型通过其译码生成器对接收码字进行译码以输出与接收码字对应的译码码字。In this embodiment, the received codeword obtained after the information sequence is transmitted through the channel can be input into the pre-trained neural network model, and the pre-trained neural network model decodes the received codeword through its decoding generator to output The decoded codeword corresponding to the received codeword.
其中,神经网络模型可以为生成对抗神经网络(GAN,Generative Adversarial Networks)模型。利用预训练的神经网络模型,尤其是GAN模型来实现信道译码,可以带来如下优势:Wherein, the neural network model may be a Generative Adversarial Networks (GAN, Generative Adversarial Networks) model. Using the pre-trained neural network model, especially the GAN model to realize channel decoding can bring the following advantages:
1、低误码率:随着训练次数的增加,译码性能逐渐接近最大后验概率性能,并且误码率低于置信传播算法。1. Low bit error rate: With the increase of training times, the decoding performance gradually approaches the maximum a posteriori probability performance, and the bit error rate is lower than that of the belief propagation algorithm.
2、低译码时长:基于神经网络模型的信道译码在神经网络模型训练完成后译码时长将变得很短,降低了译码的时延。2. Low decoding time: The channel decoding based on the neural network model will shorten the decoding time after the training of the neural network model is completed, which reduces the decoding delay.
3、译码算法复杂度低:GAN基于反向传播,而不需要马尔科夫链,从而降低译码算法复杂度。3. Low complexity of decoding algorithm: GAN is based on backpropagation and does not require Markov chain, thus reducing the complexity of decoding algorithm.
根据本发明实施例,通过基于预训练的神经网络模型来实现对初始码字经信道传输后得出的接收码字的译码,从而实现了具有低误码率、低译码时长以及低译码复杂度的信道译码方案。According to the embodiment of the present invention, the decoding of the received codeword obtained after the initial codeword is transmitted through the channel is realized by using a pre-trained neural network model, thereby realizing a low bit error rate, low decoding time and low decoding time. Code complexity channel decoding scheme.
预训练的神经网络模型可以基于如上参考图1-3所描述的用于信道译码的神经网络模型的训练方法得出,具体步骤可参考以上描述,在此不再赘诉。The pre-trained neural network model can be obtained based on the training method of the neural network model for channel decoding described above with reference to FIGS.
与上述几种实施例提供的用于信道译码的神经网络模型的训练方法相对应,本公开还提供一种用于信道译码的神经网络模型的训练装置,由于本公开实施例提供的用于信道译码的神经网络模型的训练装置与上述几种实施例提供的用于信道译码的神经网络模型的训练方法相对应,因此在用于信道译码的神经网络模型的训练方法的实施方式也适用于本实施例提供的用于信道译码的神经网络模型的训练装置,在本实施例中不再详细描述。图5是根据本公开提出的用于信道译码的神经网络模型的训练装置的结构示意图。Corresponding to the training methods for the neural network model for channel decoding provided by the above embodiments, the present disclosure also provides a training device for the neural network model for channel decoding. The training device for the neural network model for channel decoding corresponds to the training method for the neural network model for channel decoding provided by the above-mentioned several embodiments, so in the implementation of the training method for the neural network model for channel decoding The method is also applicable to the training device for the neural network model for channel decoding provided in this embodiment, and will not be described in detail in this embodiment. Fig. 5 is a schematic structural diagram of a training device for a neural network model for channel decoding proposed according to the present disclosure.
图5为本公开实施例提供的一种用于信道译码的神经网络模型的训练装置500的结构示意图,神经网络模型包括译码生成器和译码辨别器。FIG. 5 is a schematic structural diagram of a neural network model training device 500 for channel decoding provided by an embodiment of the present disclosure. The neural network model includes a decoding generator and a decoding discriminator.
如图5所示,用于信道译码的神经网络模型的训练装置500包括:As shown in Figure 5, the training device 500 for the neural network model of channel decoding comprises:
获取模块501,用于获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以及An acquisition module 501, configured to acquire an information sequence training set including information sequence samples, and obtain received codeword samples to be decoded based on the information sequence samples; and
训练模块502,用于以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。The training module 502 is configured to use the received codeword samples as the input features of the decoding generator, and use the information sequence samples and the decoded codeword samples output by the decoding generator as the decoding discrimination The input feature of the discriminator, and whether the information sequence sample can be distinguished from the decoded codeword sample is used as the output feature of the decoding discriminator, and iterative training is performed to obtain the pre-trained neural network model.
根据本发明实施例,通过获取包括信息序列样本的信息序列训练集并基于所述信息序列样本获得待译码的接收码字样本,以及以接收码字样本作为译码生成器的输入特征,以信息序列样本和译码生成器输出的译码码字样本作为译码辨别器的输入特征,并以能否区分信息序列样本和译码码字样本作为译码辨别器的输出特征,进行迭代训练以获得经训练的神经网络模型。由此得出的经训练的神经网络模型能够对信息序列经信道传输后的待译码的接收码字进行译码以得到原信息序列。According to an embodiment of the present invention, by obtaining an information sequence training set including information sequence samples and obtaining received codeword samples to be decoded based on the information sequence samples, and using the received codeword samples as input features of the decoding generator, to The information sequence samples and the decoded codeword samples output by the decoding generator are used as the input features of the decoding discriminator, and the ability to distinguish the information sequence samples and the decoded codeword samples as the output features of the decoding discriminator is used for iterative training to obtain a trained neural network model. The trained neural network model thus obtained can decode the received codeword to be decoded after the information sequence is transmitted through the channel to obtain the original information sequence.
在一些实施例中,所述获取模块501用于:基于信息序列样本,通过编码器获得编码码字样本;对编码码字样本进行调制以获得调制码字样本;以及将调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。In some embodiments, the obtaining module 501 is configured to: obtain coded codeword samples through an encoder based on information sequence samples; modulate coded codeword samples to obtain modulated codeword samples; and input modulated codeword samples into noise channel to obtain received codeword samples after channel transmission.
在一些实施例中,所述训练模块502用于:基于从用于本轮训练的信息序列样本获得的接收码字样本,通过译码生成器获得与信息序列样本对应的译码码字样本;基于译码码 字样本和信息序列样本,通过译码辨别器确定能否区分译码码字样本和信息序列样本;如果确定能够区分译码码字样本和信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;如果确定无法区分译码码字样本和信息序列样本,结束迭代训练并获得预训练的神经网络模型。In some embodiments, the training module 502 is configured to: obtain the decoded codeword samples corresponding to the information sequence samples through the decoding generator based on the received codeword samples obtained from the information sequence samples used for the current round of training; Based on the decoded codeword samples and information sequence samples, determine whether the decoded codeword samples and information sequence samples can be distinguished through the decoding discriminator; if it is determined that the decoded codeword samples and information sequence samples can be distinguished, use the back propagation method The decoding generator and the decoding discriminator are updated, and the above steps are repeated until all information sequence samples used for this round of training have been processed by the decoding generator and the decoding discriminator, and then start downloading rounds of training; if it is determined that the decoded codeword samples and information sequence samples cannot be distinguished, the iterative training is ended and a pre-trained neural network model is obtained.
在一些实施例中,所述译码辨别器和所述译码生成器的训练目标表示为:In some embodiments, the training objectives of the decoding discriminator and the decoding generator are expressed as:
Figure PCTCN2021104367-appb-000025
Figure PCTCN2021104367-appb-000025
其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
Figure PCTCN2021104367-appb-000026
表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
Figure PCTCN2021104367-appb-000027
表示概率分布。
Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
Figure PCTCN2021104367-appb-000026
Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
Figure PCTCN2021104367-appb-000027
represents a probability distribution.
在一些实施例中,基于以下公式使用梯度上升法对所述译码辨别器进行更新:In some embodiments, the decoding discriminator is updated using gradient ascent based on the following formula:
Figure PCTCN2021104367-appb-000028
Figure PCTCN2021104367-appb-000028
其中,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000029
表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
Figure PCTCN2021104367-appb-000030
表示在上一次输入至所述译码辨别器中的信息序列样本。
Wherein, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000029
Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
Figure PCTCN2021104367-appb-000030
Indicates the information sequence samples last input to the decoder discriminator.
在一些实施例中,基于以下公式使用梯度下降法对所述译码生成器进行更新:In some embodiments, the decoding generator is updated using a gradient descent method based on the following formula:
Figure PCTCN2021104367-appb-000031
Figure PCTCN2021104367-appb-000031
其中,G表示所述译码生成器,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000032
表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
Wherein, G represents the decoding generator, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000032
represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
在一些实施例中,所述编码器为低密度奇偶校验LDPC码编码器。In some embodiments, the encoder is a low density parity check LDPC code encoder.
在一些实施例中,所述调制器为二进制相移键控BPSK调制。In some embodiments, the modulator is binary phase shift keyed BPSK modulation.
在一些实施例中,所述加噪信道为以下之一:加性白噪声AWGN信道;以及瑞利信道。In some embodiments, the noise adding channel is one of: an additive white noise AWGN channel; and a Rayleigh channel.
在一些实施例中,所述神经网络模型为生成对抗神经网络GAN模型。In some embodiments, the neural network model is a GAN model.
图6为本公开实施例提供的一种信道译码装置600的结构示意图。FIG. 6 is a schematic structural diagram of a channel decoding apparatus 600 provided by an embodiment of the present disclosure.
如图6所示,所述装置600包括:译码模块601,用于基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字;其中,所述预训练的神经网络模型包括译码生成器,所述译码生成器对所述接收码字进行译码以输出所述译码码字。As shown in FIG. 6, the device 600 includes: a decoding module 601, configured to decode the received codeword to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model, to obtain A decoded codeword corresponding to the received codeword; wherein, the pre-trained neural network model includes a decoded generator, and the decoded generator decodes the received codeword to output the decoded codeword numbers.
通过实施本实施例,基于预训练的神经网络模型对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与接收码字对应的译码码字,其中预训练的神经网络模型包括用于对接收码字进行译码以输出译码码字的译码生成器。由此,能够通过预训练的神 经网络模型来实现对待译码码字的译码,从而实现了具有低误码率、低译码时长以及低译码复杂度的信道译码方案。By implementing this embodiment, based on the pre-trained neural network model, the received codeword to be decoded after the information sequence is transmitted through the channel is decoded to obtain the decoded codeword corresponding to the received codeword, wherein the pre-trained The neural network model of includes a decoding generator for decoding received codewords to output decoded codewords. Therefore, the decoding of the codeword to be decoded can be realized through the pre-trained neural network model, thereby realizing a channel decoding scheme with low bit error rate, low decoding time and low decoding complexity.
在一些实施例中,所述神经网络模型还包括译码辨别器,所述预训练的神经网络模型通过以下过程获得:获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。In some embodiments, the neural network model further includes a decoding discriminator, and the pre-trained neural network model is obtained through the following process: obtaining an information sequence training set including information sequence samples, and obtaining The received codeword sample to be decoded; the received codeword sample is used as the input feature of the decoding generator, and the information sequence sample and the decoded codeword sample output by the decoding generator are used as the Decoding the input features of the discriminator, and taking whether the information sequence sample and the decoded codeword sample can be distinguished as the output feature of the decoding discriminator, performing iterative training to obtain the pre-trained neural network model .
在一些实施例中,所述基于所述信息序列样本获得待译码的接收码字样本包括:基于所述信息序列样本,通过编码器获得编码码字样本;对所述编码码字样本进行调制以获得调制码字样本;以及将所述调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。In some embodiments, the obtaining the received codeword samples to be decoded based on the information sequence samples includes: obtaining encoded codeword samples by an encoder based on the information sequence samples; and modulating the encoded codeword samples obtaining modulation codeword samples; and inputting the modulation codeword samples into a noise-added channel to obtain received codeword samples transmitted through the channel.
在一些实施例中,迭代训练中的每轮训练包括:基于从用于本轮训练的信息序列样本获得的接收码字样本,通过所述译码生成器获得与所述信息序列样本对应的译码码字样本;基于所述译码码字样本和所述信息序列样本,通过所述译码辨别器确定能否区分所述译码码字样本和所述信息序列样本;如果确定能够区分所述译码码字样本和所述信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;以及如果确定无法区分所述译码码字样本和所述信息序列样本,结束所述迭代训练并获得所述预训练的神经网络模型。In some embodiments, each round of training in the iterative training includes: based on the received codeword samples obtained from the information sequence samples used for the current round of training, the decoding generator corresponding to the information sequence samples is obtained by the decoding generator. Codeword samples; based on the decoded codeword samples and the information sequence samples, determine whether the decoded codeword samples and the information sequence samples can be distinguished through the decoding discriminator; if it is determined that the information sequence samples can be distinguished The decoding codeword samples and the information sequence samples are updated by the back propagation method to the decoding generator and the decoding discriminator, and repeat the above steps until all the information sequence samples are used for this round of training Start the next round of training after being processed by the decoding generator and the decoding discriminator; and if it is determined that the decoding codeword samples and the information sequence samples cannot be distinguished, end the iterative training and obtain the Pretrained neural network models.
在一些实施例中,所述译码辨别器和所述译码生成器的训练目标表示为:In some embodiments, the training objectives of the decoding discriminator and the decoding generator are expressed as:
Figure PCTCN2021104367-appb-000033
Figure PCTCN2021104367-appb-000033
其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
Figure PCTCN2021104367-appb-000034
表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
Figure PCTCN2021104367-appb-000035
表示概率分布。
Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
Figure PCTCN2021104367-appb-000034
Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
Figure PCTCN2021104367-appb-000035
represents a probability distribution.
在一些实施例中,基于以下公式使用梯度上升法对所述译码辨别器进行更新:In some embodiments, the decoding discriminator is updated using gradient ascent based on the following formula:
Figure PCTCN2021104367-appb-000036
Figure PCTCN2021104367-appb-000036
其中,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000037
表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
Figure PCTCN2021104367-appb-000038
表示在上一次输入至所述译码辨别器中的信息序列样本。
Wherein, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000037
Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
Figure PCTCN2021104367-appb-000038
Indicates the information sequence samples last input to the decoder discriminator.
在一些实施例中,基于以下公式使用梯度下降法对所述译码生成器进行更新:In some embodiments, the decoding generator is updated using a gradient descent method based on the following formula:
Figure PCTCN2021104367-appb-000039
Figure PCTCN2021104367-appb-000039
其中,G表示所述译码生成器,D表示所述译码辨别器,
Figure PCTCN2021104367-appb-000040
表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
Wherein, G represents the decoding generator, D represents the decoding discriminator,
Figure PCTCN2021104367-appb-000040
represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
在一些实施例中,所述编码器为低密度奇偶校验LDPC码编码器。In some embodiments, the encoder is a low density parity check LDPC code encoder.
在一些实施例中,所述调制器为二进制相移键控BPSK调制。In some embodiments, the modulator is binary phase shift keyed BPSK modulation.
在一些实施例中,所述加噪信道为以下之一:加性白噪声AWGN信道;以及瑞利信道。In some embodiments, the noise adding channel is one of: an additive white noise AWGN channel; and a Rayleigh channel.
在一些实施例中,所述神经网络模型为生成对抗神经网络GAN模型。In some embodiments, the neural network model is a GAN model.
根据本公开的实施例,本公开还提供了一种电子设备和一种计算机可读存储介质。如图7所示,是根据本公开实施例的电子设备的框图。电子设备旨在表示各种形式的数字计算机,诸如,膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置,诸如,个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例,并且不意在限制本文中描述的和/或者要求的本公开的实现。According to an embodiment of the present disclosure, the present disclosure also provides an electronic device and a computer-readable storage medium. As shown in FIG. 7 , it is a block diagram of an electronic device according to an embodiment of the present disclosure. Electronic device is intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are by way of example only, and are not intended to limit implementations of the disclosure described and/or claimed herein.
如图7所示,该电子设备包括:一个或多个处理器710、存储器720,以及用于连接各部件的接口,包括高速接口和低速接口。各个部件利用不同的总线互相连接,并且可以被安装在公共主板上或者根据需要以其它方式安装。处理器可以对在电子设备内执行的指令进行处理,包括存储在存储器中或者存储器上以在外部输入/输出装置(诸如,耦合至接口的显示设备)上显示GUI的图形信息的指令。在其它实施方式中,若需要,可以将多个处理器和/或多条总线与多个存储器和多个存储器一起使用。同样,可以连接多个电子设备,各个设备提供部分必要的操作(例如,作为服务器阵列、一组刀片式服务器、或者多处理器系统)。图7中以一个处理器710为例。As shown in FIG. 7 , the electronic device includes: one or more processors 710 , a memory 720 , and interfaces for connecting various components, including high-speed interfaces and low-speed interfaces. The various components are interconnected using different buses and can be mounted on a common motherboard or otherwise as desired. The processor may process instructions executed within the electronic device, including instructions stored in or on the memory, to display graphical information of a GUI on an external input/output device such as a display device coupled to an interface. In other implementations, multiple processors and/or multiple buses may be used with multiple memories and multiple memories, if desired. Likewise, multiple electronic devices may be connected, with each device providing some of the necessary operations (eg, as a server array, a set of blade servers, or a multi-processor system). In FIG. 7, a processor 710 is taken as an example.
存储器720即为本公开所提供的非瞬时计算机可读存储介质。其中,所述存储器存储有可由至少一个处理器执行的指令,以使所述至少一个处理器执行本公开所提供的数据传输方法。本公开的非瞬时计算机可读存储介质存储计算机指令,该计算机指令用于使计算机执行本公开所提供的数据传输方法。The memory 720 is a non-transitory computer-readable storage medium provided in the present disclosure. Wherein, the memory stores instructions executable by at least one processor, so that the at least one processor executes the data transmission method provided in the present disclosure. The non-transitory computer-readable storage medium of the present disclosure stores computer instructions, and the computer instructions are used to cause a computer to execute the data transmission method provided by the present disclosure.
存储器720作为一种非瞬时计算机可读存储介质,可用于存储非瞬时软件程序、非瞬时计算机可执行程序以及模块,如本公开实施例中的数据传输方法对应的程序指令/模块。处理器710通过运行存储在存储器720中的非瞬时软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例中的数据传输方法。As a non-transitory computer-readable storage medium, the memory 720 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as program instructions/modules corresponding to the data transmission method in the embodiments of the present disclosure. The processor 710 executes various functional applications and data processing of the server by running the non-transitory software programs, instructions and modules stored in the memory 720, that is, implements the data transmission method in the above method embodiments.
存储器720可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据定位电子设备的使用所创建的数据等。此外,存储器720可以包括高速随机存取存储器,还可以包括非瞬时存储器,例如至少一个磁盘存储器件、闪存器件、或其他非瞬时固态存储器件。可选地,存储器720可选包括相对于处理器710远程设置的存储器,这些远程存储器可以通过网络连接至定位电子设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 720 may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required by at least one function; the data storage area may store data created according to the use of the positioning electronic device, and the like. In addition, the memory 720 may include a high-speed random access memory, and may also include a non-transitory memory, such as at least one magnetic disk storage device, a flash memory device, or other non-transitory solid-state storage devices. Optionally, the storage 720 may optionally include storages that are set remotely relative to the processor 710, and these remote storages may be connected to the positioning electronic device through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
电子设备还可以包括:输入装置730和输出装置740。处理器710、存储器720、输入装置730和输出装置740可以通过总线或者其他方式连接,图7中以通过总线连接为例。The electronic device may further include: an input device 730 and an output device 740 . The processor 710, the memory 720, the input device 730, and the output device 740 may be connected via a bus or in other ways, and connection via a bus is taken as an example in FIG. 7 .
输入装置730可接收输入的数字或字符信息,以及产生与定位电子设备的用户设置以及功能控制有关的键信号输入,例如触摸屏、小键盘、鼠标、轨迹板、触摸板、指示杆、一个或者多个鼠标按钮、轨迹球、操纵杆等输入装置。输出装置740可以包括显示设备、辅助照明装置(例如,LED)和触觉反馈装置(例如,振动电机)等。该显示设备可以包括但不限于,液晶显示器(LCD)、发光二极管(LED)显示器和等离子体显示器。在一些实施方式中,显示设备可以是触摸屏。The input device 730 can receive input numbers or character information, and generate key signal input related to user settings and function control of the positioning electronic equipment, such as a touch screen, a keypad, a mouse, a trackpad, a touchpad, a pointing stick, one or more Input devices such as mouse buttons, trackballs, joysticks, etc. The output device 740 may include a display device, an auxiliary lighting device (eg, LED), a tactile feedback device (eg, a vibration motor), and the like. The display device may include, but is not limited to, a liquid crystal display (LCD), a light emitting diode (LED) display, and a plasma display. In some implementations, the display device may be a touch screen.
此处描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、专用ASIC(专用集成电路)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括:实施在一个或者多个计算机程序中,该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释,该可编程处理器可以是专用或者通用可编程处理器,可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令,并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described herein can be implemented in digital electronic circuitry, integrated circuit systems, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpreted on a programmable system including at least one programmable processor, the programmable processor Can be special-purpose or general-purpose programmable processor, can receive data and instruction from storage system, at least one input device, and at least one output device, and transmit data and instruction to this storage system, this at least one input device, and this at least one output device an output device.
这些计算程序(也称作程序、软件、软件应用、或者代码)包括可编程处理器的机器指令,并且可以利用高级过程和/或面向对象的编程语言、和/或汇编/机器语言来实施这些计算程序。如本文使用的,术语“机器可读介质”和“计算机可读介质”指的是用于将机器指令和/或数据提供给可编程处理器的任何计算机程序产品、设备、和/或装置(例如,磁盘、光盘、存储器、可编程逻辑装置(PLD)),包括,接收作为机器可读信号的机器指令的机器可读介质。术语“机器可读信号”指的是用于将机器指令和/或数据提供给可编程处理器的任何信号。These computing programs (also referred to as programs, software, software applications, or codes) include machine instructions for a programmable processor and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine language calculation program. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or means for providing machine instructions and/or data to a programmable processor ( For example, magnetic disks, optical disks, memories, programmable logic devices (PLDs), including machine-readable media that receive machine instructions as machine-readable signals. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
为了提供与用户的交互,可以在计算机上实施此处描述的系统和技术,该计算机具有:用于向用户显示信息的显示装置(例如,CRT(阴极射线管)或者LCD(液晶显示器)监视器);以及键盘和指向装置(例如,鼠标或者轨迹球),用户可以通过该键盘和该指向装置来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互;例如,提供给用户的反馈可以是任何形式的传感反馈(例如,视觉反馈、听觉反馈、或者触觉反馈);并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide for interaction with the user, the systems and techniques described herein can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user. ); and a keyboard and pointing device (eg, a mouse or a trackball) through which a user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and can be in any form (including Acoustic input, speech input or, tactile input) to receive input from the user.
可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如,作为数据服务器)、或者包括中间件部件的计算系统(例如,应用服务器)、或者包括前端部件的计算系统(例如,具有图形用户界面或者网络浏览器的用户计算机,用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如,通信网络)来将系统的部件相互连接。通信网络的示例包括:局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein can be implemented in a computing system that includes back-end components (e.g., as a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., as a a user computer having a graphical user interface or web browser through which a user can interact with embodiments of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system can be interconnected by any form or medium of digital data communication, eg, a communication network. Examples of communication networks include: Local Area Network (LAN), Wide Area Network (WAN) and the Internet.
计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。A computer system may include clients and servers. Clients and servers are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.
本领域技术人员还可以了解到本申请实施例列出的各种说明性逻辑块(illustrative logical block)和步骤(step)可以通过电子硬件、电脑软件,或两者的结合进行实现。这样的功能是通过硬件还是软件来实现取决于特定的应用和整个系统的设计要求。本领域技术人员可以对于每种特定的应用,可以使用各种方法实现所述的功能,但这种实现不应被理解为超出本申请实施例保护的范围。Those skilled in the art can also understand that various illustrative logical blocks and steps listed in the embodiments of the present application can be implemented by electronic hardware, computer software, or a combination of both. Whether such functions are implemented by hardware or software depends on the specific application and overall system design requirements. Those skilled in the art may use various methods to implement the described functions for each specific application, but such implementation should not be understood as exceeding the protection scope of the embodiments of the present application.
图8示出了根据本公开实施例的另一电子设备的框图。FIG. 8 shows a block diagram of another electronic device according to an embodiment of the present disclosure.
如图8所示,电子设备800包括模型训练器810和译码器820。As shown in FIG. 8 , an electronic device 800 includes a model trainer 810 and a decoder 820 .
其中,模型训练器810用于基于包括信息序列样本以及与所述信息序列样本对应的待译码的接收码字样本获得预训练的神经网络模型,其中,所述神经网络模型包括译码生成器和译码辨别器。Wherein, the model trainer 810 is used to obtain a pre-trained neural network model based on information sequence samples and received codeword samples corresponding to the information sequence samples to be decoded, wherein the neural network model includes a decoding generator and decoding discriminator.
具体地,模型训练器用于以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。Specifically, the model trainer is used to use the received codeword samples as the input features of the decoding generator, and use the information sequence samples and the decoded codeword samples output by the decoding generator as the decoding The input feature of the discriminator, and whether the information sequence sample can be distinguished from the decoded codeword sample is used as the output feature of the decoding discriminator, and iterative training is performed to obtain the pre-trained neural network model.
译码器820用于基于所述预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字。The decoder 820 is configured to decode the received codeword to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model, so as to obtain a decoding code corresponding to the received codeword Character.
本申请还提供一种可读存储介质,其上存储有指令,该指令被计算机执行时实现上述任一方法实施例的功能。The present application also provides a readable storage medium on which instructions are stored, and when the instructions are executed by a computer, the functions of any one of the above method embodiments are realized.
本申请还提供一种计算机程序产品,该计算机程序产品被计算机执行时实现上述任一方法实施例的功能。The present application also provides a computer program product, which implements the functions of any one of the above method embodiments when executed by a computer.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机程序。在计算机上加载和执行所述计算机程序时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机程序可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机程序可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,高密度数字视频光盘(digital video disc,DVD))、或者半导体介质(例如,固态硬盘(solid state disk,SSD))等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product comprises one or more computer programs. When the computer program is loaded and executed on the computer, all or part of the processes or functions according to the embodiments of the present application will be generated. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices. The computer program can be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer program can be downloaded from a website, computer, server or data center Transmission to another website site, computer, server or data center by wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, a magnetic tape), an optical medium (for example, a high-density digital video disc (digital video disc, DVD)), or a semiconductor medium (for example, a solid state disk (solid state disk, SSD)) etc.
本领域普通技术人员可以理解:本申请中涉及的第一、第二等各种数字编号仅为描述方便进行的区分,并不用来限制本申请实施例的范围,也表示先后顺序。Those of ordinary skill in the art can understand that: the first, second and other numbers involved in this application are only for convenience of description, and are not used to limit the scope of the embodiments of this application, and also indicate the sequence.
本申请中的至少一个还可以描述为一个或多个,多个可以是两个、三个、四个或者更多个,本申请不做限制。在本申请实施例中,对于一种技术特征,通过“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”等区分该种技术特征中的技术特征,该“第一”、“第二”、“第三”、“A”、“B”、“C”和“D”描述的技术特征间无先后顺序或者大小顺序。At least one in this application can also be described as one or more, and multiple can be two, three, four or more, and this application does not make a limitation. In this embodiment of the application, for a technical feature, the technical feature is distinguished by "first", "second", "third", "A", "B", "C" and "D", etc. The technical features described in the "first", "second", "third", "A", "B", "C" and "D" have no sequence or order of magnitude among the technical features described.
本申请中各表所示的对应关系可以被配置,也可以是预定义的。各表中的信息的取值仅仅是举例,可以配置为其他值,本申请并不限定。在配置信息与各参数的对应关系时,并不一定要求必须配置各表中示意出的所有对应关系。例如,本申请中的表格中,某些行示出的对应关系也可以不配置。又例如,可以基于上述表格做适当的变形调整,例如,拆分,合并等等。上述各表中标题示出参数的名称也可以采用通信装置可理解的其他名称,其参数的取值或表示方式也可以通信装置可理解的其他取值或表示方式。上述各表在实现时,也可以采用其他的数据结构,例如可以采用数组、队列、容器、栈、线性表、指针、链表、树、图、结构体、类、堆、散列表或哈希表等。The corresponding relationships shown in the tables in this application can be configured or predefined. The values of the information in each table are just examples, and may be configured as other values, which are not limited in this application. When configuring the corresponding relationship between the information and each parameter, it is not necessarily required to configure all the corresponding relationships shown in the tables. For example, in the table in this application, the corresponding relationship shown in some rows may not be configured. For another example, appropriate deformation adjustments can be made based on the above table, for example, splitting, merging, and so on. The names of the parameters shown in the titles of the above tables may also adopt other names understandable by the communication device, and the values or representations of the parameters may also be other values or representations understandable by the communication device. When the above tables are implemented, other data structures can also be used, for example, arrays, queues, containers, stacks, linear tables, pointers, linked lists, trees, graphs, structures, classes, heaps, hash tables or hash tables can be used Wait.
本申请中的预定义可以理解为定义、预先定义、存储、预存储、预协商、预配置、固化、或预烧制。Predefined in this application can be understood as defining, predefining, storing, prestoring, prenegotiating, preconfiguring, curing, or prefiring.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those skilled in the art can appreciate that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above is only a specific implementation of the application, but the scope of protection of the application is not limited thereto. Anyone familiar with the technical field can easily think of changes or substitutions within the technical scope disclosed in the application. Should be covered within the protection scope of this application. Therefore, the protection scope of the present application should be determined by the protection scope of the claims.

Claims (25)

  1. 一种用于信道译码的神经网络模型的训练方法,其特征在于,所述神经网络模型包括译码生成器和译码辨别器,所述方法包括:A method for training a neural network model for channel decoding, wherein the neural network model includes a decoding generator and a decoding discriminator, and the method includes:
    获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以及obtaining an information sequence training set including information sequence samples, and obtaining received codeword samples to be decoded based on the information sequence samples; and
    以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。Using the received codeword samples as input features of the decoding generator, using the information sequence samples and the decoded codeword samples output by the decoding generator as input features of the decoding discriminator, and Taking whether the information sequence sample can be distinguished from the decoded codeword sample as an output feature of the decoding discriminator, iterative training is performed to obtain the pre-trained neural network model.
  2. 如权利要求1所述的方法,其特征在于,所述基于所述信息序列样本获得待译码的接收码字样本包括:The method according to claim 1, wherein said obtaining a received codeword sample to be decoded based on said information sequence sample comprises:
    基于所述信息序列样本,通过编码器获得编码码字样本;Obtaining encoded codeword samples through an encoder based on the information sequence samples;
    对所述编码码字样本进行调制以获得调制码字样本;以及modulating the encoded codeword samples to obtain modulated codeword samples; and
    将所述调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。The modulated codeword samples are input into a noise adding channel to obtain received codeword samples transmitted through the channel.
  3. 如权利要求1所述的方法,其特征在于,所述迭代训练中的每轮训练包括:The method according to claim 1, wherein each round of training in the iterative training comprises:
    基于从用于本轮训练的信息序列样本获得的接收码字样本,通过所述译码生成器获得与所述信息序列样本对应的译码码字样本;Based on the received codeword samples obtained from the information sequence samples used for the current round of training, the decoded codeword samples corresponding to the information sequence samples are obtained by the decoding generator;
    基于所述译码码字样本和所述信息序列样本,通过所述译码辨别器确定能否区分所述译码码字样本和所述信息序列样本;determining whether the decoded codeword samples and the information sequence samples can be distinguished by the decoding discriminator based on the decoded codeword samples and the information sequence samples;
    如果确定能够区分所述译码码字样本和所述信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;以及If it is determined that the decoded codeword sample and the information sequence sample can be distinguished, update the decoded generator and the decoded discriminator through the backpropagation method, and repeat the above steps until used for this round of training All the information sequence samples of are processed by the decoding generator and the decoding discriminator to start the next round of training; and
    如果确定无法区分所述译码码字样本和所述信息序列样本,结束所述迭代训练并获得所述经训练的神经网络模型。If it is determined that the decoded codeword samples and the information sequence samples cannot be distinguished, the iterative training is ended and the trained neural network model is obtained.
  4. 如权利要求3所述的方法,其特征在于,所述译码辨别器和所述译码生成器的训练目标表示为:The method according to claim 3, wherein the training target of the decoding discriminator and the decoding generator is expressed as:
    Figure PCTCN2021104367-appb-100001
    Figure PCTCN2021104367-appb-100001
    其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
    Figure PCTCN2021104367-appb-100002
    表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
    Figure PCTCN2021104367-appb-100003
    表示概率分布。
    Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
    Figure PCTCN2021104367-appb-100002
    Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
    Figure PCTCN2021104367-appb-100003
    represents a probability distribution.
  5. 如权利要求3所述的方法,其特征在于,基于以下公式使用梯度上升法对所述译码辨别器进行更新:The method according to claim 3, wherein the decoding discriminator is updated using a gradient ascending method based on the following formula:
    Figure PCTCN2021104367-appb-100004
    Figure PCTCN2021104367-appb-100004
    其中,D表示所述译码辨别器,
    Figure PCTCN2021104367-appb-100005
    表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
    Figure PCTCN2021104367-appb-100006
    表示在上一次输入至所述译码辨别器中的信息序列样本。
    Wherein, D represents the decoding discriminator,
    Figure PCTCN2021104367-appb-100005
    Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
    Figure PCTCN2021104367-appb-100006
    Indicates the information sequence samples last input to the decoder discriminator.
  6. 如权利要求3所述的方法,其特征在于,基于以下公式使用梯度下降法对所述译码生成器进行更新:The method according to claim 3, wherein the decoding generator is updated using a gradient descent method based on the following formula:
    Figure PCTCN2021104367-appb-100007
    Figure PCTCN2021104367-appb-100007
    其中,G表示所述译码生成器,D表示所述译码辨别器,
    Figure PCTCN2021104367-appb-100008
    表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
    Wherein, G represents the decoding generator, D represents the decoding discriminator,
    Figure PCTCN2021104367-appb-100008
    represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
  7. 如权利要求2所述的方法,其特征在于,所述编码器为低密度奇偶校验LDPC码编码器。The method according to claim 2, wherein the encoder is a Low Density Parity Check (LDPC) code encoder.
  8. 如权利要求2所述的方法,其特征在于,所述调制器为二进制相移键控BPSK调制。The method according to claim 2, characterized in that the modulator is binary phase shift keying BPSK modulation.
  9. 如权利要求2所述的方法,其特征在于,所述加噪信道为以下之一:The method according to claim 2, wherein the noise adding channel is one of the following:
    加性白噪声AWGN信道;以及Additive white noise AWGN channel; and
    瑞利信道。Rayleigh channel.
  10. 如权利要求1-9中任一项所述的方法,其特征在于,所述神经网络模型为生成对抗神经网络GAN模型。The method according to any one of claims 1-9, wherein the neural network model is a GAN model for generating an adversarial neural network.
  11. 一种信道译码方法,其特征在于,包括:A channel decoding method, characterized in that, comprising:
    基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字;Based on the pre-trained neural network model, decoding the received codeword to be decoded obtained after the information sequence is transmitted through the channel, so as to obtain the decoded codeword corresponding to the received codeword;
    其中,所述预训练的神经网络模型包括译码生成器,所述译码生成器对所述接收码字进行译码以输出所述译码码字。Wherein, the pre-trained neural network model includes a decoding generator, and the decoding generator decodes the received codeword to output the decoded codeword.
  12. 如权利要求10所述的方法,所述神经网络模型还包括译码辨别器,所述预训练的神经网络模型通过以下过程获得:The method according to claim 10, the neural network model also includes a decoding discriminator, and the pre-trained neural network model is obtained through the following process:
    获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得接收码字样本;以及obtaining an information sequence training set including information sequence samples, and obtaining received codeword samples based on the information sequence samples; and
    以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。Using the received codeword samples as input features of the decoding generator, using the information sequence samples and the decoded codeword samples output by the decoding generator as input features of the decoding discriminator, and Taking whether the information sequence sample can be distinguished from the decoded codeword sample as an output feature of the decoding discriminator, iterative training is performed to obtain the pre-trained neural network model.
  13. 如权利要求12所述的方法,其特征在于,所述基于所述信息序列样本获得待译码的接收码字样本包括:The method according to claim 12, wherein said obtaining the received codeword samples to be decoded based on said information sequence samples comprises:
    基于所述信息序列样本,通过编码器获得编码码字样本;Obtaining encoded codeword samples through an encoder based on the information sequence samples;
    对所述编码码字样本进行调制以获得调制码字样本;以及modulating the encoded codeword samples to obtain modulated codeword samples; and
    将所述调制码字样本输入加噪信道以获得经信道传输后的接收码字样本。The modulated codeword samples are input into a noise adding channel to obtain received codeword samples transmitted through the channel.
  14. 如权利要求12所述的方法,其特征在于,所述迭代训练中的每轮训练包括:The method according to claim 12, wherein each round of training in the iterative training comprises:
    基于从用于本轮训练的信息序列样本获得的接收码字样本,通过所述译码生成器获得与所述信息序列样本对应的译码码字样本;Based on the received codeword samples obtained from the information sequence samples used for the current round of training, the decoded codeword samples corresponding to the information sequence samples are obtained by the decoding generator;
    基于所述译码码字样本和所述信息序列样本,通过所述译码辨别器确定能否区分所述译码码字样本和所述信息序列样本;determining whether the decoded codeword samples and the information sequence samples can be distinguished by the decoding discriminator based on the decoded codeword samples and the information sequence samples;
    如果确定能够区分所述译码码字样本和所述信息序列样本,通过反向传播法对所述译码生成器以及所述译码辨别器进行更新,并重复上述步骤直至用于本轮训练的所有信息序列样本都经过所述译码生成器和所述译码辨别器处理之后开始下轮训练;以及If it is determined that the decoded codeword sample and the information sequence sample can be distinguished, update the decoded generator and the decoded discriminator through the backpropagation method, and repeat the above steps until used for this round of training All the information sequence samples of are processed by the decoding generator and the decoding discriminator to start the next round of training; and
    如果确定无法区分所述译码码字样本和所述信息序列样本,结束所述迭代训练并获得所述经训练的神经网络模型。If it is determined that the decoded codeword samples and the information sequence samples cannot be distinguished, the iterative training is ended and the trained neural network model is obtained.
  15. 如权利要求14所述的方法,其特征在于,所述译码辨别器和所述译码生成器的训练目标表示为:The method according to claim 14, wherein the training objectives of the decoding discriminator and the decoding generator are expressed as:
    Figure PCTCN2021104367-appb-100009
    Figure PCTCN2021104367-appb-100009
    其中,G表示所述译码生成器,D表示所述译码辨别器,V(D,G)表示所述信息序列样本与所述译码码字样本之间的差异,
    Figure PCTCN2021104367-appb-100010
    表示所述译码生成器和所述译码辨别器的训练目标,其中所述译码生成器的训练目标为能够最小化所述信息序列样本与所述译码码字样本之间的差异以及所述译码辨别器的训练目标为能够最大化地区分所述信息序列样本与所述译码码字样本之间的差异,x表示输入所述译码辨别器的信息序列样本,x~p data(x)表示x服从输入至所述译码生成器的数据分布,z表示输入噪声,z~p z(z)表示z服从输入至所述译码生成器的噪声变量分布,
    Figure PCTCN2021104367-appb-100011
    表示概率分布。
    Wherein, G represents the decoding generator, D represents the decoding discriminator, V(D, G) represents the difference between the information sequence samples and the decoding codeword samples,
    Figure PCTCN2021104367-appb-100010
    Represents the training target of the decoding generator and the decoding discriminator, wherein the training target of the decoding generator is to be able to minimize the difference between the information sequence samples and the decoding codeword samples and The training goal of the decoding discriminator is to maximize the difference between the information sequence samples and the decoding codeword samples, x represents the information sequence samples input to the decoding discriminator, x~p data (x) means that x obeys the data distribution input to the decoding generator, z means input noise, and z~p z (z) means that z obeys the noise variable distribution input to the decoding generator,
    Figure PCTCN2021104367-appb-100011
    represents a probability distribution.
  16. 如权利要求14所述的方法,其特征在于,基于以下公式使用梯度上升法对所述译码辨别器进行更新:The method according to claim 14, wherein the decoding discriminator is updated using a gradient ascending method based on the following formula:
    Figure PCTCN2021104367-appb-100012
    Figure PCTCN2021104367-appb-100012
    其中,D表示所述译码辨别器,
    Figure PCTCN2021104367-appb-100013
    表示经更新的译码辨别器,m表示参与本轮训练的信息序列样本的数量,x i表示在第i次输入至所述译码辨别器中的信息序列样本,
    Figure PCTCN2021104367-appb-100014
    表示在上一次输入至所述译码辨别器中的信息序列样本。
    Wherein, D represents the decoding discriminator,
    Figure PCTCN2021104367-appb-100013
    Represents the updated decoding discriminator, m represents the number of information sequence samples participating in the current round of training, x i represents the information sequence samples input to the decoding discriminator for the ith time,
    Figure PCTCN2021104367-appb-100014
    Indicates the information sequence samples last input to the decoder discriminator.
  17. 如权利要求14所述的方法,其特征在于,基于以下公式使用梯度下降法对所述译码生成器进行更新:The method according to claim 14, wherein the decoding generator is updated using a gradient descent method based on the following formula:
    Figure PCTCN2021104367-appb-100015
    Figure PCTCN2021104367-appb-100015
    其中,G表示所述译码生成器,D表示所述译码辨别器,
    Figure PCTCN2021104367-appb-100016
    表示经更新的译码生成器,m表示参与本轮训练的信息序列样本的数量,y i表示在第i次输入至所述译码生成器中的接收码字样本。
    Wherein, G represents the decoding generator, D represents the decoding discriminator,
    Figure PCTCN2021104367-appb-100016
    represents the updated decoding generator, m represents the number of information sequence samples participating in the current round of training, and y i represents the received codeword samples input to the decoding generator for the ith time.
  18. 如权利要求11-17中任一项所述的方法,其特征在于,所述神经网络模型为生成对抗神经网络GAN模型。The method according to any one of claims 11-17, wherein the neural network model is a GAN model for generating an adversarial neural network.
  19. 一种用于信道译码的神经网络模型的训练装置,其特征在于,所述神经网络模型包括译码生成器和译码辨别器,所述装置包括:A training device for a neural network model for channel decoding, characterized in that the neural network model includes a decoding generator and a decoding discriminator, and the device includes:
    获取模块,用于获取包括信息序列样本的信息序列训练集,并基于所述信息序列样本获得待译码的接收码字样本;以及An acquisition module, configured to acquire an information sequence training set including information sequence samples, and obtain received codeword samples to be decoded based on the information sequence samples; and
    训练模块,用于以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型。A training module, configured to use the received codeword samples as the input features of the decoding generator, and use the information sequence samples and the decoded codeword samples output by the decoding generator as the decoding discriminator The input feature of the input feature, and whether the information sequence sample can be distinguished from the decoded codeword sample is used as the output feature of the decoding discriminator, and iterative training is performed to obtain the pre-trained neural network model.
  20. 一种信道译码装置,其特征在于,包括:A channel decoding device is characterized in that it comprises:
    译码模块,用于基于预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字;The decoding module is used to decode the received codewords to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model, so as to obtain the decoded codewords corresponding to the received codewords;
    其中,所述预训练的神经网络模型包括译码生成器,所述译码生成器对所述接收码字进行译码以输出所述译码码字。Wherein, the pre-trained neural network model includes a decoding generator, and the decoding generator decodes the received codeword to output the decoded codeword.
  21. 一种电子设备,其中,包括:An electronic device, comprising:
    存储器;memory;
    处理器,与所述存储器连接,配置为通过执行所述存储器上的计算机可执行指令,以能够实现权利要求1-10任一项所述的方法。A processor, connected to the memory, configured to implement the method according to any one of claims 1-10 by executing computer-executable instructions on the memory.
  22. 一种电子设备,其中,包括:An electronic device, comprising:
    存储器;memory;
    处理器,与所述存储器连接,配置为通过执行所述存储器上的计算机可执行指令,以能够实现权利要求11-18任一项所述的方法。A processor, connected to the memory, configured to implement the method according to any one of claims 11-18 by executing computer-executable instructions on the memory.
  23. 一种电子设备,包括:An electronic device comprising:
    模型训练器,用于基于包括信息序列样本以及与所述信息序列样本对应的待译码的接收码字样本获得预训练的神经网络模型,其中,所述神经网络模型包括译码生成器和译码辨别器,所述模型训练器用于以所述接收码字样本作为所述译码生成器的输入特征,以所述信息序列样本和所述译码生成器输出的译码码字样本作为所述译码辨别器的输入特征,并以能否区分所述信息序列样本和所述译码码字样本作为所述译码辨别器的输出特征,执行迭代训练以获得所述预训练的神经网络模型;A model trainer, configured to obtain a pre-trained neural network model based on information sequence samples and received codeword samples to be decoded corresponding to the information sequence samples, wherein the neural network model includes a decoding generator and a decoding A code discriminator, the model trainer is used to use the received codeword samples as the input features of the decoding generator, and use the information sequence samples and the decoded codeword samples output by the decoding generator as the The input feature of the decoding discriminator, and whether the information sequence sample and the decoding codeword sample can be distinguished as the output feature of the decoding discriminator, perform iterative training to obtain the pre-trained neural network Model;
    译码器,所述译码器用于基于所述预训练的神经网络模型,对信息序列经信道传输后得出的待译码的接收码字进行译码,以获得与所述接收码字对应的译码码字。a decoder, the decoder is configured to decode the received codeword to be decoded after the information sequence is transmitted through the channel based on the pre-trained neural network model, so as to obtain the codeword corresponding to the received codeword The decoding codeword of .
  24. 一种计算机存储介质,其中,所述计算机存储介质存储有计算机可执行指令;所述计算机可执行指令被处理器执行后,能够实现权利要求1-10任一项所述的方法。A computer storage medium, wherein the computer storage medium stores computer-executable instructions; after the computer-executable instructions are executed by a processor, the method according to any one of claims 1-10 can be implemented.
  25. 一种计算机存储介质,其中,所述计算机存储介质存储有计算机可执行指令;所述计算机可执行指令被处理器执行后,能够实现权利要求11-18任一项所述的方法。A computer storage medium, wherein the computer storage medium stores computer-executable instructions; after the computer-executable instructions are executed by a processor, the method according to any one of claims 11-18 can be implemented.
PCT/CN2021/104367 2021-07-02 2021-07-02 Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus WO2023272739A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2021/104367 WO2023272739A1 (en) 2021-07-02 2021-07-02 Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus
CN202180002056.9A CN115804067A (en) 2021-07-02 2021-07-02 Channel decoding method and device, and training method and device of neural network model for channel decoding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/104367 WO2023272739A1 (en) 2021-07-02 2021-07-02 Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus

Publications (1)

Publication Number Publication Date
WO2023272739A1 true WO2023272739A1 (en) 2023-01-05

Family

ID=84692207

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/104367 WO2023272739A1 (en) 2021-07-02 2021-07-02 Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus

Country Status (2)

Country Link
CN (1) CN115804067A (en)
WO (1) WO2023272739A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108650201A (en) * 2018-05-10 2018-10-12 东南大学 Channel equalization method, interpretation method based on neural network and corresponding equipment
CN109361404A (en) * 2018-09-28 2019-02-19 华南理工大学 A kind of LDPC decoding system and interpretation method based on semi-supervised deep learning network
CN110351212A (en) * 2019-08-10 2019-10-18 南京理工大学 Based on the channel estimation methods of convolutional neural networks under fast fading channel
CN111106839A (en) * 2019-12-19 2020-05-05 北京邮电大学 Polarization code decoding method and device based on neural network
CN111901000A (en) * 2020-08-03 2020-11-06 电子科技大学 Channel decoding method based on Incepton structure
US20210201003A1 (en) * 2019-12-30 2021-07-01 Affectiva, Inc. Synthetic data for neural network training using vectors

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108650201A (en) * 2018-05-10 2018-10-12 东南大学 Channel equalization method, interpretation method based on neural network and corresponding equipment
CN109361404A (en) * 2018-09-28 2019-02-19 华南理工大学 A kind of LDPC decoding system and interpretation method based on semi-supervised deep learning network
CN110351212A (en) * 2019-08-10 2019-10-18 南京理工大学 Based on the channel estimation methods of convolutional neural networks under fast fading channel
CN111106839A (en) * 2019-12-19 2020-05-05 北京邮电大学 Polarization code decoding method and device based on neural network
US20210201003A1 (en) * 2019-12-30 2021-07-01 Affectiva, Inc. Synthetic data for neural network training using vectors
CN111901000A (en) * 2020-08-03 2020-11-06 电子科技大学 Channel decoding method based on Incepton structure

Also Published As

Publication number Publication date
CN115804067A (en) 2023-03-14

Similar Documents

Publication Publication Date Title
CN110226289B (en) Receiver and method for decoding
US11334467B2 (en) Representing source code in vector space to detect errors
KR102617929B1 (en) Method and apparatus for pre-training graph neural network
US9941900B1 (en) Techniques for general-purpose lossless data compression using a recurrent neural network
US11722148B2 (en) Systems and methods of data compression
CN111563593B (en) Training method and device for neural network model
CN110569969A (en) Network model structure sampling method and device and electronic equipment
Chen et al. Practical polar code construction over parallel channels
Yang et al. Nonlinear programming approaches to decoding low-density parity-check codes
WO2021034387A1 (en) Flexible huffman tree approximation for low latency encoding
Cao et al. Minimal sets for capacity-approaching variable-length constrained sequence codes
KR102213345B1 (en) Method for constructing parity-check concatenated polar codes and apparatus therefor
US9595976B1 (en) Folded integer encoding
KR102607536B1 (en) Optimizer learning method and apparatus, electronic device and readable storage medium
CN112580723B (en) Multi-model fusion method, device, electronic equipment and storage medium
CN111539224B (en) Pruning method and device of semantic understanding model, electronic equipment and storage medium
WO2023272739A1 (en) Channel decoding method, apparatus, training method for neural network model used for channel decoding, and apparatus
CN112804029A (en) Transmission method, device and equipment of BATS code based on LDPC code and readable storage medium
Andi et al. Fast decoding of polar codes using tree structure
US20230069079A1 (en) Statistical K-means Clustering
CN111160552B (en) News information recommendation processing method, device, equipment and computer storage medium
CN111488972B (en) Data migration method, device, electronic equipment and storage medium
CN111553169B (en) Pruning method and device of semantic understanding model, electronic equipment and storage medium
CN111539225B (en) Searching method and device for semantic understanding framework structure
CN113872610A (en) LDPC code neural network training and decoding method and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21947684

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE