CN110875746A - Hardware architecture of high-speed GII decoder - Google Patents

Hardware architecture of high-speed GII decoder Download PDF

Info

Publication number
CN110875746A
CN110875746A CN201811016480.5A CN201811016480A CN110875746A CN 110875746 A CN110875746 A CN 110875746A CN 201811016480 A CN201811016480 A CN 201811016480A CN 110875746 A CN110875746 A CN 110875746A
Authority
CN
China
Prior art keywords
unit
parallelism
module
gii
syndrome
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811016480.5A
Other languages
Chinese (zh)
Inventor
王中风
李文杰
林军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN201811016480.5A priority Critical patent/CN110875746A/en
Publication of CN110875746A publication Critical patent/CN110875746A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/03Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words
    • H03M13/05Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words using block codes, i.e. a predetermined number of check bits joined to a predetermined number of information bits
    • H03M13/11Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words using block codes, i.e. a predetermined number of check bits joined to a predetermined number of information bits using multiple parity bits
    • H03M13/1102Codes on graphs and decoding on graphs, e.g. low-density parity check [LDPC] codes
    • H03M13/1105Decoding

Landscapes

  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Error Detection And Correction (AREA)

Abstract

The invention discloses a hardware architecture of a high-speed Generalized Integrated Interleaved (GII) code decoder. The existing literature only stays in a software level for the research of the decoding algorithm of the GII, and the invention firstly proposes the hardware architecture of the GII decoder. The architecture fully utilizes the characteristics of the GII code and has extremely high speed. In order to meet the requirement of high throughput rate, full parallelism is adopted on an interlace level; and on the symbol level, the syndrome computing unit, the nested syndrome computing unit, the chien search unit and the numerical value computing unit all adopt partial parallelism. Aiming at two parts of calculation of nested syndromes and key equation coefficient updating in an algorithm, the invention designs a corresponding high-efficiency hardware architecture. In the inverse matrix multiplication module, complex matrix inversion is avoided, and a lookup table is used for realizing the inverse matrix multiplication. The framework can achieve extremely high throughput rate, and is very suitable for being applied to scenes requiring high-speed and low-power FEC schemes by considering extremely low decoding complexity of the GII code.

Description

Hardware architecture of high-speed GII decoder
Technical Field
The invention relates to the technical field of integrated circuits and communication, in particular to a hardware architecture of a high-speed GII decoder.
Background
The Generalized Integrated Interleaved (GII) code is a multilevel code that has been proposed to address error correction in distributed storage systems. The GII code is flexible in structure, and can be selected according to performance indexes regardless of code length, layer number or check bits. The most primitive GII code is ambiguous in coding algorithm and very complex in decoding algorithm, which has been ignored for a long time by academia. In recent years, a GII coding algorithm and a more efficient decoding algorithm have been proposed in succession, so that GII code re-emphasis has been brought to relevant researchers.
The decoding algorithm of the GII code is a layered decoding algorithm, and the characteristic enables the GII code to be decoded in the first layers when errors are few, so the corresponding decoding complexity is low, and the power consumption of the decoder is also at a lower level. On the other hand, the GII code adopting the BM correlation algorithm can multiplex the previous results in the decoding process, so that the iteration times of the BM are greatly reduced, and the GII decoder has extremely low decoding delay due to the characteristic.
At present, the GII code research only stays in the application of a storage system and in the software implementation stage, and various communication systems such as optical communication and the like all want to have an error correction code decoder with high throughput, low delay and low power consumption. Therefore, the GII code has great potential, becomes an error correction code scheme in communication standards such as Ethernet and the like, and has great significance for the research on the hardware architecture of the GII decoder.
Disclosure of Invention
The invention provides a hardware framework of a GII decoder for the first time, and the framework aims to meet the requirements of high-speed communication systems with throughput rates of more than or equal to 100Gbps such as Ethernet and the like on the decoder.
The corresponding GII decoding hardware architecture mainly comprises the following parts:
● syndrome calculation unit, for the whole GII code word, the syndrome parallel calculation of all interlaces; for a single interleaver, J symbols are computed in parallel. The syndrome calculation unit calculates only the low-order syndromes.
● key equation calculation units, using ribM algorithm, the number of ribM calculation units is equal to the number of interlaves, the number of basic units in each ribM unit is equal to three times of the maximum error correction capability.
● the searching and error value calculating module adopts all interlaves to be parallel, and the single interlave further adopts J symbols to be parallel. The error value calculation is only performed after the last iteration is completed and is used for error correction of the GII codeword.
● high-order nested syndrome calculation unit calculates error by calculatingDecoded higher order syndrome of interleave and correctly decoded r of interleave (α)j) And multiplying the conversion matrix to obtain a high-order nested syndrome.
● the inverse matrix multiplication module converts the higher order nested syndromes to higher order syndromes.
● the key equation coefficient update unit updates the coefficients stored in the registers of the key equation computation unit before each iteration.
Drawings
FIG. 1 is a top level framework diagram of the GII decoder architecture proposed by the present invention;
FIG. 2 is a block diagram of a key equation calculation unit;
FIG. 3 is an architecture diagram of a decodable interleave high order syndrome computation module;
FIG. 4 shows r (α)j) Calculating an architecture diagram of the unit;
FIG. 5 is a top level architecture diagram of a high order nested syndrome computation unit;
fig. 6 is an architecture diagram of a key equation coefficient update unit.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be further described with reference to the accompanying drawings. The embodiments described below by referring to the drawings are exemplary and intended to be illustrative of the present invention and are not to be construed as limiting the present invention.
The top level architecture of the GII decoder proposed by the present invention is shown in FIG. 1. The syndrome computing unit only computes low-order syndromes, and in order to achieve higher throughput, the syndrome computing unit adopts a partially parallel structure, and the parallelism J can be adjusted along with the parameter indexes. The received code word enters a syndrome calculation unit to perform syndrome calculation on one hand, and is stored in the RAM on the other hand.
The last clock cycle of the syndrome calculation initializes the registers of the key equation calculation unit of fig. 2 with the resulting syndromes. Then, the key equation calculation unit carries out the first calculation, and the required clock period is 2tL-1Wherein t isL-1Is the minimum value in the error correction capability. Key prescriptionThe polynomial expression (x) obtained by the program computing unit is sent to a chien searching unit for root searching, and the polynomial expression (x) and the trigonometric searching unit are simultaneously used for root searching
Figure BSA0000169960410000021
Sent to a high-order nested syndrome calculation unit for calculation.
The key equation in the framework adopts a ribM algorithm and four polynomials
Figure BSA0000169960410000022
The coefficients of Λ (x) and b (x) are updated iteratively. The critical path of the riBM architecture has only one galois field multiplication and one galois field addition.
The top-level architecture of a five-layer high-order nested syndrome computation unit of GII code with error correction capability of t ═ 28, 20, 16, 13, 12 is shown in fig. 5, e (α)j) The calculation unit is used for calculating the higher order syndrome of the correctly decoded interleave, and the specific structure is shown in FIG. 3. to avoid extra clock cycles, the unit is synchronized with the Chien search, the value of the higher order syndrome is stored in the shift register, e (α) according to the parallelism of the Chien search and the distribution of the GII code body error correction capabilityj) The computing units can be flexibly adjusted, e.g., serially e (α)j) When the number of clock cycles required by the computing unit is greater than that of the chien search, the serial connection can be changed into partial parallel connection to realize the synchronization of the module and the chien search module. Such synchronization is intended to meet the high throughput requirements without introducing additional clock cycles. The order of the syndromes in the shift register goes from high to low from left to right.
On the other hand, the higher order r of all interlaves (α)j) The decoding flag bit of the Qian search marks whether the corresponding interleaver is correctly decoded, and r is obtained (α)j)-e(αj) While an incorrectly decoded interleave only needs r (α)j) The hardware architecture of the module in which this calculation process is performed is shown in FIG. 4. to obtain the higher-order nested syndrome, it is necessary to further obtain the previous e (α)j) And r (α)j) The conversion matrix is multiplied.
r(αj) The computation unit is essentially similar to a partially parallel syndrome computation unit, which reads the interleave in RAM and computes r (α)j). The cell in fig. 5 contains all syndrome orders from 24 th to 55 th. In each decoding stage, only the module with the corresponding order is executed. The module of matrix multiplication outputs the high order nested syndrome that gets, its input has the multiplexer, in order to choose the module of the corresponding order in different decoding stages.
And the high-order nested syndrome obtains a high-order syndrome through an inverse matrix multiplication unit. The inverse matrix multiplication unit does not adopt a Galois field matrix inversion module, but uses a lookup table to realize the inversion of the matrix, thereby greatly reducing the complexity of hardware.
The higher order syndromes obtained by the inverse matrix multiplication module are not arranged in the order of the undecodable interlaces, so a shift net is added to rearrange the order of the higher order syndromes.
Before the next key equation calculation, the key equation coefficient updating unit calculates to obtain the data to be updated
Figure BSA0000169960410000031
And
Figure BSA0000169960410000032
the specific structure of the unit is shown in fig. 6, and the structure only includes a simple multiply-add unit.
In subsequent iterations, the key equation computation unit does not perform all iterations. Updating the output pairs of the cells with the key equation coefficients at the beginning of each decoding phase
Figure BSA0000169960410000033
And
Figure BSA0000169960410000034
the coefficients of (a) are reinitialized. Reinitialized
Figure BSA0000169960410000035
And
Figure BSA0000169960410000036
the coefficients of (2) contain information of higher-order syndromes, so that only additional iteration is needed after the original iteration, and 2t iterations are not needed again.
Each decoding stage thereafter is similar to that analyzed above. But the value calculation module is only synchronous with the chien search in the last decoding stage to calculate the error value. And adding the error numerical value obtained by calculation with the original code word read out from the RAM to obtain the corrected code word.

Claims (9)

1. A hardware architecture for a high-speed GII decoder comprising:
● syndrome calculation units with full parallelism at the Interleave level and partial parallelism at the symbol level, for receiving the code word from the channel and calculating all the low order syndromes.
● the key equation computation unit is still fully parallel at the interleave level. In each decoding stage, the partial registers in the key equation calculation units corresponding to the interlaves which cannot be decoded in the previous stage are assigned again.
● Qian search module with full parallelism at the level of interlave and partial parallelism at the level of sign, which is used to calculate the number of roots of the error location polynomial and determine whether each interlave can be translated.
● numerical computation module with complete parallelism of Interleave level and partial parallelism of symbol level, which is used to compute error numerical value.
● the high-order nested syndrome calculation unit comprises a high-order syndrome calculation module of interplay capable of decoding, r (α) of all interplayj) A calculation module and a matrix multiplication module.
● the inverse matrix multiplication module is used to get the inverse matrix of the sub-matrix of the transformation matrix and multiply with the high order nested syndrome to get the high order syndrome.
● the shift network passes the higher order syndromes to the corresponding interlaces in the correct order.
● the key equation coefficient updating unit adopts the full parallelism of interlave to update the coefficients of KES part of the irresolvable interlave.
2. The syndrome computation unit as recited in claim 1, wherein full parallelism at an interleave level and partial parallelism at a symbol level are employed to achieve high throughput. The calculation unit calculates only the low order syndromes and initializes the key equation calculation unit with these syndromes without storing them in the storage unit.
3. The key equation computation unit of claim 1, being still fully parallel at the interleave level. With the riBM architecture, the number of basic computation units corresponds to the value of the maximum error correction capability of the GII code.
4. A chien search module as claimed in claim 1, the module having a degree of parallelism in accordance with the syndrome computation unit. At the end of each decoding stage, the chien search uses the error location polynomial to find the error location and outputs the flag bit whether the interleave is solvable or not.
5. The numerical computation module of claim 1, wherein the parallelism is consistent with the chien search, and the module computes the error value only at the end of the whole decoding process.
6. The higher order nested syndrome computation unit of claim 1 is divided into three parts, inter higher order syndrome computation module for decodable, r (α) for all interj) The unit can be designed to be serial or partially parallel according to the difference between the required number of clock cycles and the error correction capability of adjacent layers, r (α) of all interlavesj) A calculation module is also performed concurrently with the chien search, which reads the codewords stored in the memory and calculates r (α) of the corresponding orderj). The moduleFull parallelism is adopted at an interlace level, and partial parallelism is adopted at a symbol level.
7. The inverse matrix multiplication module of claim 1 comprising a basic matrix multiplication unit and a look-up table. And the lookup table outputs the corresponding elements of the inverse matrix to the matrix multiplication calculating unit according to the serial number of the irresolvable interleaver.
8. The key equation coefficient update unit of claim 1, which performs the polynomial fitting by using a multiply-add calculation similar to a convolution calculation
Figure FSA0000169960400000021
And
Figure FSA0000169960400000022
and updating the key equation calculation unit using the updated terms.
9. A key equation coefficient update unit as claimed in claim 7, which unit uses the riBM algorithm
Figure FSA0000169960400000023
And
Figure FSA0000169960400000024
the coefficient updating rule of (2) only needs the multiplication and addition of the Galois field to complete the operation, and the key path is also the multiplication and addition of the Galois field.
CN201811016480.5A 2018-08-29 2018-08-29 Hardware architecture of high-speed GII decoder Pending CN110875746A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811016480.5A CN110875746A (en) 2018-08-29 2018-08-29 Hardware architecture of high-speed GII decoder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811016480.5A CN110875746A (en) 2018-08-29 2018-08-29 Hardware architecture of high-speed GII decoder

Publications (1)

Publication Number Publication Date
CN110875746A true CN110875746A (en) 2020-03-10

Family

ID=69716403

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811016480.5A Pending CN110875746A (en) 2018-08-29 2018-08-29 Hardware architecture of high-speed GII decoder

Country Status (1)

Country Link
CN (1) CN110875746A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130086455A1 (en) * 2011-10-03 2013-04-04 Samsung Electronics Co., Ltd. Method and apparatus of qc-ldpc convolutional coding and low-power high throughput qc-ldpc convolutional encoder and decoder
US20180034481A1 (en) * 2016-07-28 2018-02-01 Indian Institute Of Science Reed-solomon decoders and decoding methods
CN107688506A (en) * 2017-08-31 2018-02-13 华中科技大学 A kind of BCH decoding systems of flowing structure

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130086455A1 (en) * 2011-10-03 2013-04-04 Samsung Electronics Co., Ltd. Method and apparatus of qc-ldpc convolutional coding and low-power high throughput qc-ldpc convolutional encoder and decoder
US20180034481A1 (en) * 2016-07-28 2018-02-01 Indian Institute Of Science Reed-solomon decoders and decoding methods
CN107688506A (en) * 2017-08-31 2018-02-13 华中科技大学 A kind of BCH decoding systems of flowing structure

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
赵景琰等: "并行化的BCH编解码器设计" *
陈宏铭等: "适合NAND 闪存控制器的 BCH 纠错编译码器VLS I 实现" *

Similar Documents

Publication Publication Date Title
US5642367A (en) Finite field polynomial processing module for error control coding
Lee A high-speed low-complexity Reed-Solomon decoder for optical communications
CN101277119B (en) Method for complexing hardware of Reed Solomon code decoder as well as low hardware complex degree decoding device
US20130198592A1 (en) Turbo code parallel interleaver and parallel interleaving method thereof
CN101483442B (en) BCH decoder for configuring error correcting capability according to Nand Flash extra space
CN112468161B (en) RS high-speed coding circuit
KR100260415B1 (en) High speed serial error position polynomual calculation circuit
Ahmed et al. VLSI architectures for soft-decision decoding of Reed-Solomon codes
JP3834122B2 (en) Error position detection polynomial calculator
CN101488762B (en) Area compact and fast BCH parallel decoding method
JPWO2009075143A1 (en) Decoding device, data storage device, data communication system, and decoding method
KR101094574B1 (en) APPARATUS FOR PERFORMING THE HIGH-SPEED LOW-COMPELEXITY PIPELINED BERLEKAMP-MASSEY ALGORITHM OF BCH decoder AND METHOD THEREOF
CN112653474A (en) Design method of compact LDPC-CC decoder for reducing average iteration number
CN100417031C (en) Method of realizing Reed Solomen convolution code in broadband radio insertion system
KR100756424B1 (en) An Area-Efficient Reed-Solomon Decoder using Pipelined Recursive Technique
CN107688506B (en) BCH decoding system with flow structure
CN110875746A (en) Hardware architecture of high-speed GII decoder
Zhang et al. Systematically re-encoded algebraic soft-decision Reed–Solomon decoder
Zhang et al. Ultra-compressed three-error-correcting BCH decoder
WO2006120691A1 (en) Galois field arithmetic unit for error detection and correction in processors
KR101267958B1 (en) Bch decoder, memory system having the same and decoding method
Park et al. High-speed low-complexity Reed-Solomon decoder using pipelined Berlekamp-Massey algorithm
Zhang et al. Generalized backward interpolation for algebraic soft-decision decoding of Reed-Solomon codes
Zhang et al. Low-power high-efficiency architecture for low-complexity chase soft-decision Reed–Solomon decoding
KR101226439B1 (en) Rs decoder, memory system having the same and decoding method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200310

WD01 Invention patent application deemed withdrawn after publication