CN102255618A

CN102255618A - Low-overhead transient fault automatic correction circuit for high speed adder

Info

Publication number: CN102255618A
Application number: CN2010101769465A
Authority: CN
Inventors: 张民选; 孙岩; 陈吉华; 李少青; 赵振宇; 马卓; 张明; 何小威; 乐大珩; 张均安; 谭晓强; 段志奎
Original assignee: National University of Defense Technology
Current assignee: National University of Defense Technology
Priority date: 2010-05-20
Filing date: 2010-05-20
Publication date: 2011-11-23
Anticipated expiration: 2030-05-20
Also published as: CN102255618B

Abstract

Transient faults in combinational logic become an important challenge in VLSI circuit design. As a typical component of combinational logic, adders are widely used in arithmetic units. The invention discloses a low-overhead high-speed adder transient fault automatic correction circuit. This structure realizes the automatic correction of transient faults in high-speed adders with low overhead by exploiting the inherent hardware redundancy and time redundancy that exist in a large number of adder circuits, which significantly reduces the area and performance overhead of fault tolerance; The fault correction technology of the C unit, combined with the inherent hardware redundancy and time redundancy, further enhances the transient fault correction capability of the adder. The proposed adder has a better area-delay overhead tradeoff than other structures.

Description

A Low Overhead High Speed Adder Transient Fault Automatic Correction Circuit

技术领域 technical field

本发明属于集成电路设计领域，主要涉及到集成电路芯片的瞬态故障预防和恢复领域，特指利用电路的设计加固技术，实现低开销的高速加法器瞬态故障自动校正，从而提高电路可靠性的结构和技术。The invention belongs to the field of integrated circuit design, mainly relates to the field of transient fault prevention and recovery of integrated circuit chips, in particular refers to the use of circuit design reinforcement technology to realize automatic correction of transient faults of high-speed adders with low overhead, thereby improving circuit reliability structure and technology.

背景技术 Background technique

集成电路中的瞬态故障主要由各种高能粒子引起。高能粒子射入集成电路时，将发生瞬时充放电，充放的电荷被敏感区域吸收，将导致集成电路逻辑状态发生改变，引起运行错误或输出错误，严重影响了集成电路的可靠性。引起瞬态故障的高能粒子，主要来源于宇宙辐射环境、核辐射环境和封装材料等。过去曾经认为由于高能粒子在穿越大气层时能量迅速减弱，将不会引起地面上的集成电路发生功能错误。但是随着制造工艺的不断发展，超大规模集成电路的特征尺寸越来越小，其栅长度、节点尺寸、深度、氧化层厚度等都相应减小，P-N结临界电荷也大大下降。另一方面，集成电路工作频率越来越高，工作电压越来越低。这些变化都使集成电路对单粒子效应表现出了更强的敏感性。所以较低能量的粒子也有可能影响集成电路的正常运行。研究显示，瞬态故障引起的失效是集成电路最重要的失效原因。因此，不仅在空间环境中，甚至在地面上的集成电路也面临着高能粒子的威胁。应用于军事和航空航天领域的集成电路必须考虑高能粒子引起的瞬态故障，以提高系统的可靠性。Transient faults in integrated circuits are mainly caused by various high-energy particles. When high-energy particles are injected into the integrated circuit, instantaneous charge and discharge will occur, and the charged and discharged charge will be absorbed by the sensitive area, which will cause the logic state of the integrated circuit to change, cause operation errors or output errors, and seriously affect the reliability of the integrated circuit. The high-energy particles that cause transient faults mainly come from the cosmic radiation environment, nuclear radiation environment and packaging materials. It was thought in the past that high-energy particles would not cause malfunctions in integrated circuits on the ground because of the rapid loss of energy as they travel through the atmosphere. However, with the continuous development of the manufacturing process, the feature size of VLSI is getting smaller and smaller, and its gate length, node size, depth, oxide layer thickness, etc. are all reduced accordingly, and the critical charge of the P-N junction is also greatly reduced. On the other hand, the operating frequency of integrated circuits is getting higher and higher, and the operating voltage is getting lower and lower. These changes have made integrated circuits more sensitive to single event effects. So lower energy particles may also affect the normal operation of integrated circuits. Studies have shown that failures caused by transient faults are the most important cause of failure in integrated circuits. Therefore, not only in the space environment, but even integrated circuits on the ground are also facing the threat of high-energy particles. Integrated circuits used in military and aerospace fields must consider transient faults caused by high-energy particles to improve system reliability.

以前有研究指出，与时序电路相比，组合电路对高能粒子的敏感性较低，不易产生瞬态故障。但是由于存储器等时序元件结构规整，可以使用奇偶校验或纠错编码(Error CorrectionCodes，ECC)等方法进行保护。组合电路结构复杂，规则性较差，无法应用类似的方法进行保护。而且随着集成电路工艺的发展，组合电路中的瞬态故障将赶上甚至超过存储器，成为导致集成电路失效的主要原因，必须考虑瞬态故障的保护。Previous studies have pointed out that compared with sequential circuits, combinational circuits are less sensitive to high-energy particles and less prone to transient failures. However, due to the regular structure of sequential components such as memories, methods such as parity check or error correction codes (Error Correction Codes, ECC) can be used for protection. Combination circuits are complex in structure and poor in regularity, so similar methods cannot be applied for protection. Moreover, with the development of integrated circuit technology, the transient fault in the combined circuit will catch up with or even surpass the memory, and become the main reason for the failure of the integrated circuit, so the protection of the transient fault must be considered.

在航天航空等可靠性要求很高的应用中，组合电路一般通过三模冗余(Triple ModularRedundancy，TMR)进行加固。三模冗余将原始电路复制为三个副本，并对电路输出利用三选二表决器进行选择，从而确保结果的正确性。然而三模冗余会带来很大的面积、功耗和性能开销，对于普通的集成电路来说成本太高，无法大规模应用。而双模冗余将原始的电路复制为两个副本，并对输出结果进行比较以检测瞬态故障，降低了容错的开销。但是这种方法无法自动纠正电路中的故障，需要增加额外的错误处理机制，这又增加的设计的复杂性。In applications with high reliability requirements such as aerospace, combined circuits are generally reinforced by Triple Modular Redundancy (TMR). Triple-mode redundancy copies the original circuit into three copies, and selects the output of the circuit with two-out-of-three voters to ensure the correctness of the results. However, triple-mode redundancy will bring a large area, power consumption and performance overhead, and the cost is too high for ordinary integrated circuits to be used on a large scale. The dual-mode redundancy copies the original circuit into two copies, and compares the output results to detect transient faults, reducing the cost of fault tolerance. However, this method cannot automatically correct faults in the circuit, and an additional error handling mechanism needs to be added, which increases the complexity of the design.

为了解决以上问题，研究者提出两种技术：副本纠错和时移纠错技术。图1(a)表示了副本纠错的结构，它将C单元(其电路结构如图2所示)应用到双模冗余结构中，代替原来两部分副本输出的比较电路，可以自动纠正电路中的瞬态故障，但面积开销仍然较大。图1(b)表示了时移纠错的结构，它没有采用双模冗余结构，而是在电路的输出后加入一个延时单元，原始输出与延时后的输出同时进入C单元，也可以自动纠正电路中的瞬态故障，与副本纠错相比，面积开销大大降低，但电路的延时会有较大增加。In order to solve the above problems, the researchers proposed two technologies: replica error correction and time-shift error correction technology. Figure 1(a) shows the structure of the copy error correction, which applies the C unit (its circuit structure shown in Figure 2) to the dual-mode redundant structure, replacing the comparison circuit output by the original two parts of the copy, which can automatically correct the circuit transient faults in , but the area overhead is still large. Figure 1(b) shows the structure of time-shift error correction. It does not adopt the dual-mode redundant structure, but adds a delay unit after the output of the circuit. The original output and the delayed output enter the C unit at the same time. Transient faults in the circuit can be automatically corrected. Compared with replica error correction, the area overhead is greatly reduced, but the delay of the circuit will be greatly increased.

发明内容 Contents of the invention

副本纠错和时移纠错技术虽然降低了三模冗余的开销，并且解决了双模冗余只能检错无法纠错的问题，但是它们仍然存在较大的面积开销或延时开销，限制了两种技术的应用。组合电路内部存在大量的固有冗余资源，包括硬件冗余和时间冗余。然而，已有的技术只单独对某一种冗余进行研究，很少有技术充分开发和利用这两种资源。如果能够充分开发这些硬件冗余和时间冗余资源用于容错，将可以大大降低副本纠错和时移纠错技术的面积和延时开销，使得资源利用更加高效，以很小的代价设计高可靠的电路。Although copy error correction and time-shift error correction technologies reduce the overhead of triple-mode redundancy and solve the problem that dual-mode redundancy can only detect errors but not correct them, they still have large area overhead or delay overhead. The application of both techniques is limited. There are a large number of inherently redundant resources in combinational circuits, including hardware redundancy and time redundancy. However, existing technologies only study a certain kind of redundancy, and few technologies fully develop and utilize these two resources. If these hardware redundancy and time redundancy resources can be fully developed for fault tolerance, the area and delay overhead of replica error correction and time-shift error correction technology can be greatly reduced, making resource utilization more efficient, and designing high reliable circuit.

加法器是组合逻辑电路的一种最典型元件，广泛应用于集成电路的各种运算单元中。加法器的可靠性对整个芯片的可靠性具有重要的影响。本发明基于以上思想，提出了一种低开销的高速加法器瞬态故障自动校正电路，主要的技术点包括下列几个方面：An adder is one of the most typical components of a combinational logic circuit and is widely used in various arithmetic units of integrated circuits. The reliability of the adder has an important influence on the reliability of the whole chip. Based on the above ideas, the present invention proposes a low-overhead high-speed adder transient fault automatic correction circuit, and the main technical points include the following aspects:

1.开发加法器中固有的硬件冗余资源用于错误校正，降低了容错的面积开销；1. Develop the inherent hardware redundancy resources in the adder for error correction, reducing the area overhead of fault tolerance;

2.开发加法器中固有的时间冗余资源用于错误校正和面积降低，不但降低了容错的延时开销，还一定程度上降低了电路的面积；2. Develop the inherent time redundancy resources in the adder for error correction and area reduction, which not only reduces the delay overhead of fault tolerance, but also reduces the circuit area to a certain extent;

3.将基于C单元的错误校正技术与开发固有硬件和时间冗余相结合，增强加法器的容错能力，使之可以自行纠正其中的瞬态故障。3. Combining C-unit-based error correction techniques with developing inherent hardware and time redundancy to enhance the fault tolerance of the adder so that it can self-correct transient faults in it.

本发明公开的高速加法器瞬态故障校正电路重点关注降低容错的开销和提高纠错的能力。本发明的技术优势在于：The high-speed adder transient fault correction circuit disclosed by the invention focuses on reducing the overhead of fault tolerance and improving the ability of error correction. The technical advantage of the present invention is:

1.能够以较小的面积和延时开销实现瞬态故障的防护，只需很小的成本，并且能够保持高速加法器的性能优势；1. It can realize the protection of transient faults with a small area and delay overhead, requires only a small cost, and can maintain the performance advantages of high-speed adders;

2.充分开发了加法器中固有的硬件和时间冗余资源，提高资源利用效率，减小不必要的资源浪费；2. Fully exploit the inherent hardware and time redundant resources in the adder, improve resource utilization efficiency, and reduce unnecessary waste of resources;

3.能够对电路中的瞬态故障自动校正，实现了较强的容错能力，大大增强了加法器的可靠性。3. It can automatically correct the transient faults in the circuit, realizes strong fault tolerance, and greatly enhances the reliability of the adder.

本发明所提出的技术可以扩展到其它组合电路结构中。随着工艺的快速发展，组合电路的瞬态故障问题变得越来越严重，开发组合电路中固有的冗余资源来提高其可靠性，所需开销较小，具有重要的意义和实用价值。The technique proposed by the present invention can be extended to other combinational circuit structures. With the rapid development of technology, the problem of transient faults in combinational circuits has become more and more serious. It is of great significance and practical value to develop the inherent redundant resources in combinational circuits to improve their reliability and require less overhead.

附图说明 Description of drawings

图1副本纠错和时移纠错结构；Figure 1 Replica error correction and time shift error correction structure;

图2C单元电路结构；Figure 2C unit circuit structure;

图3本发明公开的高速加法器总体结构；The overall structure of the high-speed adder disclosed by the present invention of Fig. 3;

图4本发明公开的高速加法器瞬态故障校正电路组进位G/P产生部分；Fig. 4 The high-speed adder transient fault correction circuit group carry G/P generation part disclosed by the present invention;

图5本发明公开的高速加法器瞬态故障校正电路进位树部分；Fig. 5 The carry tree part of the high-speed adder transient fault correction circuit disclosed by the present invention;

图6传统基于稀疏树高速加法器的关键路径和非关键路径示意；Fig. 6 schematic diagram of critical path and non-critical path of traditional high-speed adder based on sparse tree;

图7本发明公开的高速加法器瞬态故障校正电路关键路径和非关键路径部分；Fig. 7 The critical path and non-critical path parts of the high-speed adder transient fault correction circuit disclosed by the present invention;

图8本发明与其它结构的面积、延时和面积延时积比较。Figure 8 compares the area, delay and area delay product of the present invention with other structures.

具体实施方式 Detailed ways

以下结合附图，详细说明本发明公开的低开销的高速加法器瞬态故障自动校正电路的结构和工作过程。The structure and working process of the low-overhead high-speed adder transient fault automatic correction circuit disclosed by the present invention will be described in detail below in conjunction with the accompanying drawings.

本发明公开的低开销的高速加法器瞬态故障自动校正电路由三个部分构成，如图3所示，分别为组进位G/P产生部分、进位树部分以及部分和产生与选择部分。为了清楚地阐述，以下以64位加法器为例说明。事实上，本发明的结构适用于任何位宽的加法器。The low-overhead high-speed adder transient fault automatic correction circuit disclosed by the present invention consists of three parts, as shown in FIG. 3 , which are the group carry G/P generation part, the carry tree part, and the partial sum generation and selection part. In order to illustrate clearly, a 64-bit adder is taken as an example below. In fact, the structure of the present invention is applicable to adders of any bit width.

组进位G/P产生部分产生4位一组的G/P组进位信号。输入为4位的A_i+3～A_i和B_i+3～B_i，经过三级逻辑，输出组进位产生信号G_i+3，i和组进位传播信号P_i+3，i，如图4所示。图中最上层的4个实心矩形自右至左分别表示如下操作：The group carry G/P generating part generates a 4-bit group G/P group carry signal. The input is 4-bit A _i+3 ～A _i and B _i+3 ～B _i , after three-level logic, the output group carry generation signal G _i+3,i and group carry propagation signal P _i+3,i , such as Figure 4 shows. The four solid rectangles at the top of the figure represent the following operations from right to left:

${G G}_{i i} = = {A A}_{i i} \cdot \cdot {B B}_{i i},, {P P}_{i i} = = {A A}_{i i} &CirclePlus; &CirclePlus; {B B}_{i i} - - - - - - [[Equ Equivalent . . 11]]$

${G G}_{i i + + 11} = = {A A}_{i i + + 11} \cdot \cdot {B B}_{i i + + 11},, {P P}_{i i + + 11} = = {A A}_{i i + + 11} &CirclePlus; &CirclePlus; {B B}_{i i + + 11} - - - - - - [[Equ Equivalent . . 22]]$

${G G}_{i i + + 22} = = {A A}_{i i + + 22} \cdot &Center Dot; {B B}_{i i + + 22},, {P P}_{i i + + 22} = = {A A}_{i i + + 22} &CirclePlus; &CirclePlus; {B B}_{i i + + 22} - - - - - - [[Equ Equivalent . . 33]]$

${G G}_{i i + + 33} = = {A A}_{i i + + 33} \cdot &Center Dot; {B B}_{i i + + 33},, {P P}_{i i + + 33} = = {A A}_{i i + + 33} &CirclePlus; &CirclePlus; {B B}_{i i + + 33} - - - - - - [[Equ Equivalent . . 44]]$

中间层的2个实心圆形自右至左分别表示如下操作：The two solid circles in the middle layer represent the following operations from right to left:

G_i+1，i＝G_i+1+P_i+1·G_i，P_i+1，i＝P_i+1·P_i [Equ.5]G _i+1,i =G _i+1 +P _i+1 G _i ,P _i+1,i =P _i+1 P _i [Equ.5]

G_i+3，i+2＝G_i+3+P_i+3·G_i+2，P_i+3，i+2＝P_i+3·P_i+2 [Equ.6]G _i+3,i+2 =G _i+3 +P _i+3 G _i+2 ,P _i+3,i+2 =P _i+3 P _i+2 [Equ.6]

最下层的1个实心圆形表示如下操作：A solid circle at the bottom indicates the following operations:

G_i+3，i＝G_i+3，i+2+P_i+3，i+2·G_i+1，i，P_i+3，i＝P_i+3，i+2·P_i+1，i [Equ.7]G _i+3,i =G _i+3,i+2 +P _i+3,i+2 G _i+1,i ,P _i+3,i =P _i+3,i+2 P _{i +1,i} [Equ.7]

在64位加法器中，共有16组图4中的组进位G/P产生模块。为了增强组进位G/P产生部分的可靠性，每个模块都复制一份，两个副本使用相同的输入，而输出连到稀疏进位树的不同信号。In the 64-bit adder, there are 16 groups of carry G/P generation modules in Figure 4. In order to enhance the reliability of the group carry G/P generation part, each module has a copy, the two copies use the same input, and the output is connected to a different signal of the sparse carry tree.

稀疏进位树部分由稀疏进位节点整列、增加的一级进位节点和最后的一组C单元组成，如图5所示。稀疏进位节点阵列在图中为蓝色框中的部分，它们与一个N/4位的Kogge-Stone(KS)加法器的进位节点阵列类似，对于64位加法器，共由16列组成，每列共4级，每个实心圆形节点实现以下功能：The part of the sparse carry tree is composed of the entire column of sparse carry nodes, an increased one-level carry node and the last group of C units, as shown in Figure 5. The sparse carry node array is the part in the blue box in the figure. They are similar to the carry node array of an N/4-bit Kogge-Stone (KS) adder. For a 64-bit adder, it consists of 16 columns. Each There are 4 levels in total, and each solid circular node realizes the following functions:

G_i+1，i＝G_i+1+P_i+1·G_i，P_i+1，i＝P_i+1·P_i [Equ.8]G _i+1,i =G _i+1 +P _i+1 G _i ,P _i+1,i =P _i+1 P _i [Equ.8]

稀疏进位节点阵列中的空心圆形实现以下功能：The hollow circles in the sparse carry node array implement the following functions:

G_i+1，i＝G_i+1+P_i+1·G_i [Equ.9]G _{i+1, i} ＝G _i+1 +P _i+1 ·G _i [Equ.9]

在稀疏进位节点阵列中，黑色的节点和连线用于计算4i+3(i＝0，2，4，…，14)位的进位，而红色的节点和连线用于计算4i+3(i＝1，3，5，…，15)位的进位，黑色和红色两部分之间没有信号连接，相互独立，互不影响。这样，黑色和红色两部分形成天然的硬件资源冗余，我们利用这种固有冗余特性用于容错，可大大提高资源利用率。In the sparse carry node array, the black nodes and lines are used to calculate the carry of 4i+3 (i=0, 2, 4, ..., 14), while the red nodes and lines are used to calculate 4i+3 ( i=1, 3, 5, . . . , 15) bit carry, there is no signal connection between the black and red parts, and they are independent of each other and do not affect each other. In this way, the black and red parts form natural hardware resource redundancy. We use this inherent redundancy feature for fault tolerance, which can greatly improve resource utilization.

为了开发冗余，在稀疏进位节点阵列后增加一级进位节点，从而使得每4i+3(i＝0，1，2，…，15)位的进位信号都由黑色和红色两部分计算。组进位G/P产生部分的两组输出分别与黑色和红色信号相连。这样加法器进位树形成两个独立的副本。In order to develop redundancy, a one-level carry node is added behind the sparse carry node array, so that every 4i+3 (i=0, 1, 2, . . . , 15) bit carry signal is calculated by two parts, black and red. The two groups of outputs of the group carry G/P generating part are respectively connected with the black and red signals. This way the adder carry tree forms two independent copies.

在增加的一级进位节点后，我们增加一级C单元。这样，加法器进位树与副本纠错的结构类似，可以自动改正其中出现的任何瞬态故障。与副本纠错不同的是，我们不需要增加大量的硬件开销，而是通过开发固有的硬件冗余来实现硬件的复制，大大降低了容错的面积开销。After adding one level of carry nodes, we add one level of C units. In this way, the carry tree of the adder is similar to the structure of the error correction of the replica, which can automatically correct any transient faults that occur in it. Unlike copy error correction, we do not need to add a lot of hardware overhead, but realize hardware replication by developing inherent hardware redundancy, which greatly reduces the area overhead of fault tolerance.

图6表示了传统的稀疏树加法器的关键路径与非关键路径，其中非关键路径即为部分和产生与选择部分的结构，它由两组4位的串行进位加法器(Ripple Carry Adder，RCA)和一组4位多路选择器(MUX)组成。两组4位的进位传播加法器分别产生进位为“0”的部分和psum0和进位为“1”的部分和psum1，psum0与psum1连接到多路选择器，由关键路径得到的进位信号进行选择，最终得到加法器的和。由于非关键路径上的延时远小于关键路径的延时，因此存在较多的松弛时间没有得到充分的利用。这部分松弛时间即为加法器中的固有时间冗余资源。本发明通过对非关键路径进行改造，充分开发固有的时间冗余资源进行容错，并利用时间冗余减小面积，大大降低了容错的时间开销，同时减小了芯片的面积。Figure 6 shows the critical path and non-critical path of the traditional sparse tree adder, wherein the non-critical path is the structure of the partial sum generation and selection part, which consists of two sets of 4-bit serial carry adders (Ripple Carry Adder, RCA) and a set of 4-bit multiplexers (MUX). Two sets of 4-bit carry-propagation adders generate the carry-in "0" part and psum0 and the carry-out part and psum1 respectively, psum0 and psum1 are connected to the multiplexer, and are selected by the carry signal obtained from the critical path , and finally get the sum of the adder. Since the delay on the non-critical path is much smaller than that on the critical path, there are more slack times that are not fully utilized. This part of the slack time is the inherent time redundancy resource in the adder. The present invention fully develops inherent time redundancy resources for fault tolerance by transforming non-critical paths, and utilizes time redundancy to reduce the area, thereby greatly reducing the time overhead of fault tolerance and simultaneously reducing the area of the chip.

本发明提出的部分和产生与选择部分如图7所示。我们在非关键路径上只使用一个进位为“0”的4位串行进位加法器，在4位加法器的输出连接一组延时单元τ。4位加法器的原始输出和经过τ单元延时的输出连接到一组C单元中，这形成一个类似时移纠错的结构。与时移纠错不同的是，由于是在非关键路径中，我们利用了加法器中的时间冗余来构造时移纠错，不会增加延时，即不会对加法器的性能造成任何损失。The part proposed by the present invention and the generation and selection part are shown in FIG. 7 . We only use a 4-bit serial carry adder whose carry is "0" on the non-critical path, and a group of delay units τ are connected to the output of the 4-bit adder. The original output of the 4-bit adder and the delayed output of the τ unit are connected to a group of C units, which forms a structure similar to time-shifted error correction. Different from time-shift error correction, because it is in the non-critical path, we use the time redundancy in the adder to construct time-shift error correction, which will not increase the delay, that is, it will not cause any impact on the performance of the adder. loss.

C单元的输出即为psum0，在C单元之后，再连接一个已经提出的RIC模块，从而产生psum1。psum0和psum1接入多路选择器进行选择输出最终结果。这里我们通过使用RIC结构来开发时间冗余，减少一个4位串行加法器，用以换取面积的降低。因此，非关键路径中的固有时间冗余资源得到了充分的利用，不仅增加了加法器自动纠错的能力，而且减少了器件的使用，从而降低面积。The output of the C unit is psum0, and after the C unit, a proposed RIC module is connected to generate psum1. psum0 and psum1 are connected to the multiplexer to select and output the final result. Here we exploit time redundancy by using the RIC structure, reducing a 4-bit serial adder in exchange for area reduction. Therefore, the inherent time redundancy resources in non-critical paths are fully utilized, which not only increases the automatic error correction capability of the adder, but also reduces the use of devices, thereby reducing the area.

我们对本发明提出的加法器结构与其它加法器在面积、延时和面积延时积(Area DelayProduct，ADP)三方面进行比较，如图8所示。比较以无纠错能力的标准KS加法器为参照，参与比较的可靠加法器有三模冗余加法器(TMR)、具有RIC结构的三模冗余加法器(TMR+RIC)、副本纠错加法器(ECD)和时移纠错加法器(ECTO)。从结果可以看到，三模冗余加法器的面积约为标准KS加法器的300％以上，因此面积延时积高达3.96；具有RIC结构的三模冗余加法器通过增加较小的延时使得面积在一定程度上减小，因此面积延时积比三模冗余加法器有所降低；副本纠错加法器由于只使用双模冗余，面积约为标准KS加法器的250％左右，因此面积延时积降低到2.39；时移纠错加法器具有最大的延时，但是其面积开销却降低到很小，因此面积延时积相应较小，约为1.96；而本发明提出的加法器由于充分开发了加法器中的硬件冗余和时间冗余，因此面积和延时都大大降低，分别约为标准KS加法器的112％和106％。面积和延时的大幅度降低使得本发明的结构具有最小的面积延时积，约为1.19。We compare the adder structure proposed by the present invention with other adders in terms of area, delay and area delay product (Area Delay Product, ADP), as shown in Figure 8. The comparison is based on the standard KS adder without error correction capability. The reliable adders involved in the comparison include triple-mode redundant adder (TMR), triple-mode redundant adder with RIC structure (TMR+RIC), and replica error-correcting adder device (ECD) and time shift error correction adder (ECTO). It can be seen from the results that the area of the three-mode redundant adder is about 300% of the standard KS adder, so the area delay product is as high as 3.96; The area is reduced to a certain extent, so the area delay product is lower than that of the three-mode redundant adder; because the copy error correction adder only uses dual-mode redundancy, the area is about 250% of the standard KS adder, Therefore the area delay product is reduced to 2.39; The time-shift error correction adder has the maximum delay, but its area overhead is reduced to very little, so the area delay product is correspondingly smaller, about 1.96; and the addition proposed by the present invention Due to the full development of the hardware redundancy and time redundancy in the adder, the area and delay are greatly reduced, which are about 112% and 106% of the standard KS adder, respectively. The substantial reduction in area and delay results in the structure of the present invention having the smallest area delay product of about 1.19.

综上所述，鉴于瞬态故障自动校正的可靠加法器面积开销和延时开销较大的问题，本发明公开了一种低开销的高速加法器瞬态故障自动校正电路，通过充分开发加法器中天然存在的硬件冗余和时间冗余资源，从而大幅度降低高速加法器瞬态故障自动校正的面积和延时开销。所提出的技术可以扩展到其它组合电路结构中。随着工艺的快速发展，组合电路的瞬态故障问题变得越来越严重，开发组合电路中固有的冗余资源来提高其可靠性，所需开销较小，具有一定的实用价值。To sum up, in view of the problem of large area overhead and delay overhead of reliable adders for automatic correction of transient faults, the present invention discloses a low-overhead high-speed automatic correction circuit for transient faults of adders. By fully developing the adder The hardware redundancy and time redundancy resources that exist naturally in the system can greatly reduce the area and delay overhead of automatic correction of transient faults of high-speed adders. The proposed technique can be extended to other combinational circuit structures. With the rapid development of technology, the problem of transient faults in combinational circuits is becoming more and more serious. It is of practical value to develop inherent redundant resources in combinational circuits to improve their reliability, which requires less overhead.

Claims

1. a low overhead high-speed adder transient fault automatic correction circuit, characterized in that:

By combining the inherent hardware redundancy in the carry tree of the critical path of the parallel adder and the C unit with transient fault correction capability, the automatic correction of transient faults in the critical path circuit can be realized with a low area overhead; by using the sparse tree The combination of structure and the development of inherent hardware redundancy greatly reduces the wiring complexity, further reduces the area, and improves performance at the same time; by combining the inherent time redundancy in the non-critical path of the parallel adder and the C Unit combination can realize automatic correction of transient faults in non-critical path circuits with low delay overhead; by combining Inverted Carry-In (RIC) technology with the development of inherent time redundancy, non-critical The slack time of the critical path is exchanged for hardware consumption, further reducing the area of the adder; the specific circuit form includes three components: group carry (G/P) generation, carry tree on the critical path, and partial sum generation and selection on the non-critical path part; the group carry (G/P) generation part includes two groups of independent group carry (G/P) generation modules to form a copy structure; the sparse tree part of the carry tree is combined with a (N/4) Kogge-Stone The carry tree is the same. After the sparse carry tree, one-level carry logic needs to be added to generate two sets of independent carry signals from the parity carry signal; the added one-level carry logic is followed by a group of C units to form a copy error correction structure; The sum generation and selection part uses a group of 4-bit serial carry adders, and the input (Cin) is "0"; the 4-bit serial carry adder is followed by a group of C units, and the two inputs of the C units are respectively 4-bit strings The output of the carry bit adder and the output of the 4-bit serial carry adder after the delay of the τ unit form a time-shift error correction structure; after the C unit, the RIC structure is used to generate two outputs (Psum0 and Psum1), and finally after a multiple The path selector (MUX) is selected to obtain the final output result of the adder.