US20050027777A1 - High speed low power 4-2 compressor - Google Patents

High speed low power 4-2 compressor Download PDF

Info

Publication number
US20050027777A1
US20050027777A1 US10/932,593 US93259304A US2005027777A1 US 20050027777 A1 US20050027777 A1 US 20050027777A1 US 93259304 A US93259304 A US 93259304A US 2005027777 A1 US2005027777 A1 US 2005027777A1
Authority
US
United States
Prior art keywords
xor
gate
multiplexer
xnor
nand
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/932,593
Inventor
Young-Chul Rhee
Sang-Suk Kim
Dong-Wook Lee
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020030061766A external-priority patent/KR100553702B1/en
Application filed by Samsung Electronics Co Ltd filed Critical Samsung Electronics Co Ltd
Priority to US10/932,593 priority Critical patent/US20050027777A1/en
Assigned to SAMSUNG ELECTRONICS, CO. LTD. reassignment SAMSUNG ELECTRONICS, CO. LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KIM, SANG-SUK, LEE, DONG-WOOK, RHEE, YOUNG-CHUL
Publication of US20050027777A1 publication Critical patent/US20050027777A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F5/00Methods or arrangements for data conversion without changing the order or content of the data handled
    • G06F5/06Methods or arrangements for data conversion without changing the order or content of the data handled for changing the speed of data flow, i.e. speed regularising or timing, e.g. delay lines, FIFO buffers; over- or underrun control therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/60Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers
    • G06F7/607Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers number-of-ones counters, i.e. devices for counting the number of input lines set to ONE among a plurality of input lines, also called bit counters or parallel counters

Definitions

  • the present invention generally relates to a logic circuit and, more particularly, to a 4-2 compressor for a multiplier.
  • Multiplication is one of the primary operations in general-purpose microprocessors and special purpose digital signal processors.
  • the speed of the multiplication operation often directly determines how fast the processors are capable of operating.
  • a multiplier generates plural partial summation results and then sums the partial results to compute a product.
  • the summation performance influences the performance of the multiplier.
  • the multiplier circuit grows in complexity, and, as a result, the multiplier occupies an increasingly larger area of a central processing unit (CPU) to be embedded in the data processing system.
  • CPU central processing unit
  • multipliers with the desired capability have array structures or Wallace tree structures.
  • a fast array multiplier performing fast multiplication is composed of a Booth encoder, a partial-product summation tree (PPST), and a final adder.
  • the PPST is responsible for a significant portion of the total multiplication operation delay, and is implemented commonly by a Wallace tree with 3-2 counters (i.e., full adders) and 4-2 compressors (i.e., half adders) as basic cells.
  • a Wallace tree with 3-2 counters i.e., full adders
  • 4-2 compressors i.e., half adders
  • FIG. 1 is a block diagram for illustrating a conventional 4-2 compressor 10 .
  • the 4-2 compressor 10 is described in “ Design of high - speed low - power 3-2 counter and 4-2 compressor for fast multipliers ”, by Shen-Fu Hsiao etc.; Feb. 19, 1998, ELECTRONICS LETTERS, Vol. 34, No. 4, pp. 341-343.
  • the 4-2 compressor 10 can be found in FIG. 3 a of the paper.
  • the 4-2 compressor 10 is composed of two cascaded 3-2 counters 12 and 14 .
  • Each 3-2 counter can be realized using two multiplexers.
  • the first 3-2 counter 12 When four input data I 1 , I 2 , I 3 and I 4 are externally input to the 4-2 compressor 10 , the first 3-2 counter 12 generates a carry-out bit Cout and a sum bit S in response to the three input data I 1 , I 2 and I 3 .
  • the carry-out bit Cout is output to a 4-2 compressor at the next stage (not shown) connected to the current 4-2 compressor 10 , and the sum bit S is output to the second 3-2 counter 14 , respectively.
  • the second 3-2 counter 14 generates a carry bit Carry and a sum bit Sum as final outputs, in response to the sum bit S, the input data I 4 , and a carry-input bit Cin from a previous 4-2 compressor at a previous stage (not shown).
  • the compressor 10 receives the four input data I 1 . I 2 , I 3 and I 4 , and generates two final outputs Carry and Sum. Therefore, the compressor 10 is referred to as a “4-2 compressor”.
  • the 4-2 compressor 10 four multiplexers are used. On the assumption that the delay required for one of the multiplexers is 1 MUX, then it follows that the 4-2 compressor 10 requires 4 MUXs of delay.
  • FIG. 2 is a circuit diagram for illustrating a conventional double-pass transistor-logic (DPL) 4-2 compressor 20 to reduce the delay of the 4-2 compressor 10
  • FIG. 3 is a detailed circuit diagram for illustrating a dual-railed multiplexer shown in FIG. 2 .
  • the DPL 4-2 compressor 20 is described in FIG. 3 b of the Hsiao paper cited above. Referring to FIGS. 2 and 3 , the 4-2 compressor 20 reduces the delay from 4 MUXs to 3 MUXs, through the use of pass-transistor multiplexers (PTMs) 21 - 25 and 27 composed of CMOS transmission gates (TGs).
  • PTMs pass-transistor multiplexers
  • TGs CMOS transmission gates
  • the 4-2 compressor 20 generates two carry bits Cout and Carry, and a sum bit Sum, in response to externally input four input data I 1 , I 2 , I 3 and I 4 , and a carry-input bit Cin from the previous 4-2 compressor (not shown).
  • the 4-2 compressor 20 is constructed with dual rails being suitable for high speed operation.
  • the 4-2 compressor 20 is referred to as a double-pass transistor-logic (DPL) 4-2 compressor.
  • the 4-2 compressor 20 has eight gates to be driven by the output of the pass-transistor multiplexers (PTMs) composed of CMOS transmission gates (TGs).
  • PTMs pass-transistor multiplexers
  • TGs CMOS transmission gates
  • the PMOS transistor is twice the size of the NMOS transistor.
  • the internal load capacitance is doubled, which is expressed as 1Cgp ⁇ 2Cgn. Therefore, the total internal load capacitance of the 4-2 compressor 20 can be expressed as 24Cgn by converting it on the basis of the NMOS transistor.
  • pass-transistor logic is used for transmitting a source instead of driving gates.
  • FIG. 4 is a detailed circuit diagram for illustrating an XOR/XNOR logic element shown in FIG. 4 .
  • the 4-2 compressor 30 comprises multiplexers 33 and 37 composed of transmission gates, and XOR/XNOR logic elements 31 , 32 , 34 and 36 .
  • the multiplexers 33 and 37 are coupled to output ports of carry bits Cout and Carry, and the XOR/XNOR logic elements 31 , 32 , 34 and 36 are used for generating a sum bit Sum.
  • Each of nodes N 3 and N 4 of the 4-3 compressor 30 has the internal load capacitance of 3Cgp+3Cgn.
  • a total internal load capacitance of the 4-2 compressor 30 becomes 6Cgp+6Cgn.
  • the total internal load capacitance can be expressed as 18Cgn by converting it on the basis of the NMOS transistor.
  • This 4-2 compressor 30 is referred to as a reduced-internal-capacitance (RIC) 4-2 compressor.
  • the nodes N 3 and N 4 are used for driving gates of the multiplexers 33 and 37 , and used as sources of the XOR/XNOR logic elements 31 , 32 , 34 and 36 , so that the driving gate capacity is relatively decreased. In other words, the capability of driving gates is decreased, although the amount of the internal load capacitance is reduced. Thus the efficiency of the operating time of the RIC 4-2 compressor 30 is inferior to the DPL 4-2 compressor 20 . In addition, the power consumption and the area of the RIC 4-2 compressor 30 are increased in proportion to increment of wiring based on the dual-railed structure.
  • a 4-2 compressor generating a sum bit and a carry bit as a function of four input data comprising: a first logic circuit for performing a NAND operation and a NOR operation of first and second input data, generating an XOR/XNOR operation result of the first and second input data by using the NAND and NOR results, and generating a carry-out bit to a next stage by selecting either the NAND operation result or the NOR operation result; a second logic circuit for generating a selection signal in response to a third input data, a fourth input data and the XOR/XNOR operation result from the first logic circuit; a third logic circuit for generating the sum bit by selecting either a carry-input bit or an inverted carry-input bit in response to the selection signal from the second logic circuit; and a fourth logic circuit for generating the carry bit by selecting either the inverted carry-input bit or an inverted fourth data in response to the selection signal from the second logic circuit.
  • the first logic circuit comprises: a first XOR/XNOR logic circuit for performing the NAND operation and the NOR operation, respectively, and for generating the XOR/XNOR operation result by using the NAND and NOR operation results; and a first multiplexer for generating the carry-out bit for the following stage by selecting either the NAND operation result or the NOR operation result in response to an inverted third data, wherein the first multiplexer is a single railed multiplexer.
  • the second logic circuit preferably comprises: a second XOR/XNOR logic circuit for performing a NAND operation and a NOR operation of the third input data and fourth input data, and for generating an XOR/XNOR operation result of the third and fourth input data by using the NAND and NOR operation results; and a second multiplexer for generating the selection signal by selecting either the XOR operation result or the XNOR operation result from the first XOR/XNOR logic circuit, in response to the XOR/XNOR operation result from the second XOR/XNOR logic circuit, wherein the second multiplexer is a dual railed multiplexer.
  • the third logic circuit preferably comprises a third multiplexer for generating the sum bit, by selecting one of the carry-input bit and the inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the third multiplexer is a single railed multiplexer.
  • the fourth logic circuit preferably comprises a fourth multiplexer for generating the carry bit, by selecting one of the inverted fourth input data and the inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the fourth multiplexer is a single railed multiplexer.
  • Each XOR/XNOR logic circuit may comprise: a NAND gate for performing the NAND operation of a first input data and the second input data; a NOR gate for performing the NOR operation of a second input data and the first input data; an XNOR means for generating an XNOR result of the first and the second input data in response to the outputs of the NOR gate and the NAND gate; and an XOR means for generating an XOR result of the first and the second input data in response to the outputs of the NOR gate and the NAND gate.
  • the XNOR means may comprise: a first inverter for inverting the output from the NAND gate while the output of the NOR gate is a first logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is a second logic state; and a first full-swing means for making full-swing the output of the first inverter.
  • the XOR means may comprise: a second inverter for inverting the output from the NOR gate while the output of the NAND gate is the second logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is a second logic state; and a second full-swing means for making full-swing the output of the second inverter.
  • the first inverter may comprise a P type transistor and a N type transistor having a current path between a power supply voltage source and an output port of the NOR gate in series, and a control electrode controlled by the first input data “a”.
  • the first full-swing means may comprise a P type transistor having a current path between an output port of the first inverter and the output port of the NOR gate, and control electrode controlled by the first input data “a”. The first full-swing means may be turned on when the output signal of the NOR gate is the second logic level, and increases the voltage level of the output signal of the first inverter as high as the power supply voltage level.
  • the second inverter may comprise a P type transistor and a N type transistor having a current path between a ground voltage source and an output port of the NAND gate in series, and a control electrode controlled by the output of the NOR gate.
  • the second full-swing means may comprise a N type transistor having a current path between an output port of the second inverter and the output port of the NAND gate, and control electrode controlled by the first input data “a”.
  • the second full-swing means is preferably turned on when the output signal of the NAND gate is the first logic level, and decreases the voltage level of the output signal of the second inverter as low as the ground voltage level.
  • FIG. 1 is a block diagram for illustrating a conventional 4-2 compressor using 3-2 counters
  • FIG. 2 is a circuit diagram for illustrating a conventional DPL 4-2 compressor
  • FIG. 3 is a detailed circuit diagram for illustrating a dual railed multiplexer shown in FIG. 2 ;
  • FIG. 4 is a circuit diagram for illustrating a conventional RIC 4-2 compressor
  • FIG. 5 is a detailed circuit diagram for illustrating an XOR/XNOR logic shown in FIG. 4 ;
  • FIG. 6 is a circuit diagram for illustrating a 4-2 compressor according to the preferred embodiment of the present invention.
  • FIG. 7A is a detailed circuit diagram for illustrating an XOR/XNOR logic element using NAND/NOR logic elements to generate an XOR/XNOR result, shown in FIG. 6 ;
  • FIG. 7B is a detailed circuit diagram for illustrating another embodiment of XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6 ;
  • FIG. 8 is a detailed circuit diagram for illustrating a single railed multiplexer shown in FIG. 6 ;
  • FIG. 9 is a circuit diagram for illustrating a simulation environment of the 4-2 compressor.
  • FIGS. 10 through 12 are charts for illustrating simulation results related to delay and power consumption, and a multiplication result of the delay and power consumption of the 4-2 compressors shown in FIGS. 2, 4 and 6 , respectively, under the simulation environment shown in FIG. 9 .
  • FIG. 6 is a circuit diagram for illustrating a 4-2 compressor 100 according to a preferred embodiment of the present invention.
  • FIG. 7A is a detailed circuit diagram for illustrating an XOR/XNOR logic circuit shown in FIG. 6 .
  • FIG. 7B is a detailed circuit diagram for illustrating another embodiment of XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6
  • FIG. 8 is a detailed circuit diagram for illustrating a single railed multiplexer shown in FIG. 6 .
  • the 4-2 compressor 100 comprises four logic circuits 110 , 120 , 130 and 140 .
  • the first logic circuit 110 generates a NAND/NOR result of externally input first and second data I 1 and I 2 , and an XOR/XNOR result by using the NAND/NOR result, and generates a carry-out bit Cout in response to the NAND/NOR result.
  • the second logic circuit 120 generates a selection signal, in response to the NAND/NOR results from the first logic circuit 110 and externally input third and fourth input data I 3 and I 4 .
  • the third logic circuit 130 generates a sum bit Sum by selecting either a carry-input bit Cin from a previous stage or an inverted carry-input bit ⁇ overscore (Cin) ⁇ , in response to the selection signal from the second logic circuit 120 .
  • the fourth logic circuit 140 generates a carry bit Carry by selecting either the inverted carry-input bit ⁇ overscore (Cin) ⁇ or an inverted fourth input data ⁇ overscore (I 4 ) ⁇ , in response to the selection signal from the second logic circuit 120 .
  • the structures of the logic circuits are described as follows.
  • the first logic circuit 110 includes a first XOR/XNOR logic element 52 and a single railed first multiplexer 62 .
  • the first XOR/XNOR logic element 52 generates a NAND/NOR result of the first and second data I 1 and I 2 , and an XOR/XNOR result by using the NAND/NOR result.
  • the first multiplexer 62 generates a carry-out bit Cout by selecting either the NAND result or the NOR result from the first XOR/XNOR logic circuit 52 , in response to the inverted third data ⁇ overscore (I 3 ) ⁇ .
  • the second logic circuit 120 includes a second XOR/XNOR logic element 54 and a dual railed second multiplexer 64 .
  • the second XOR/XNOR logic element 54 generates an XOR/XNOR result by using a NAND/NOR result of the third and fourth data I 3 and I 4 .
  • the second multiplexer 64 outputs the XOR/XNOR result from the first XOR/XNOR logic element 52 , in response to the XOR/XNOR result from the second XOR/XNOR logic element 54 .
  • the XOR/XNOR result from the second multiplexer 64 is used as a selection signal to output the sum bit Sum and the carry bit Carry of the 4-2 compressor 100 .
  • the third logic circuit 130 includes a single railed third multiplexer 66 , which receives the carry-input bit Cin from the previous stage and an inverted carry-input bit ⁇ overscore (Cin) ⁇ and generates the sum bit Sum in response to the output of the second multiplexer 64 .
  • the fourth logic circuit 140 includes a single railed fourth multiplexer 68 , which receives the inverted carry-input bit ⁇ overscore (Cin) ⁇ and the inverted fourth data ⁇ overscore (I 4 ) ⁇ and generates the carry bit Carry in response to the output of the second multiplexer 64 .
  • the externally input carry-input bit Cin and the fourth data I 4 are inverted through inverters 72 and 74 , respectively. This inversion is used for generating an inversion signal required by the single railed multiplexers 66 and 68 .
  • the sum bit Sum and carry bits Cout and Carry are output through buffers 84 , 82 and 86 coupled to the multiplexers 66 , 62 and 68 , respectively.
  • the XOR/XNOR logic elements 52 and 54 using the NAND/NOR logic elements to obtain XOR/XNOR results are used at an input stage of the 4-2 compressor 100 .
  • Each of the XOR/XNOR logic elements 52 and 54 adopts a single input manner and a dual input manner, so that the XOR/XNOR logic elements 52 and 54 do not require an inverter for generating an inversion signal. As a result, a delay originated from the inverter can be prevented.
  • the 4-2 compressor 100 uses the single railed multiplexers to limit the resulting circuit area and the power consumption of the 4-2 compressor.
  • the 4-2 compressor 100 uses the outputs of the XOR/XNOR logic circuits 52 and 54 as a source.
  • the 4-2 compressor 100 does not require two nodes N 1 and N 2 of FIG. 2 , or N 3 and N 4 of FIG. 4 , but requires only one node N 5 .
  • the gate drive at the node N 5 has only 2Cgp+2Cgn of internal load capacitance.
  • the internal load capacitance can be expressed as 6Cgn by converting it on the basis of the NMOS transistor. It is reduced in internal load capacitance by 18Cgn, as compared to the conventional DPL 4-2 compressor 20 shown in FIG. 2 , and is reduced by 12Cgn, as compared to the conventional RIC 4-2 compressor 30 shown in FIG. 4 .
  • each of the XOR/XNOR logic elements 52 and 54 is composed of a NAND gate, a NOR gate, inverters INV 1 and INV 2 for inverting the output signal of the NAND gate and the NOR gate, and full-swing transistors MP 3 and MN 3 for making full swing, the output of the inverters INV 1 and INV 2 .
  • the NAND logic includes PMOS transistors MP 11 and MP 12 and NMOS transistors MN 11 and MN 12 .
  • the PMOS transistor MP 12 and the NMOS transistors MN 11 and MN 12 have current paths which are sequentially formed in series between a power supply voltage VDD and a ground voltage VSS, and have gates controlled by second input data “b”, first input data “a”, and the second input data “b”, respectively.
  • the PMOS transistor MP 11 has a drain coupled to an output port of the NAND gate, a source coupled to the power supply voltage VDD, and a gate controlled by a first input signal.
  • the NOR logic includes PMOS transistors MP 13 and MP 14 and NMOS transistors MN 13 and MN 14 .
  • the PMOS transistors MP 13 MP 14 and the NMOS transistor MN 14 have current paths which are sequentially formed in series between a power supply voltage VDD and a ground voltage VSS, and have gates controlled by first input data “a”, second input data “b”, and the second input data “b”, respectively.
  • the NMOS transistor MN 13 has a drain coupled to an output port of the NOR gate, a source coupled to the ground voltage VSS, and a gate controlled by the first input data “a”.
  • the NAND gate includes a first input port for receiving a first input data “a”, and a second input port.
  • the NOR gate includes a third input port coupled to the first input port of the NAND gate to receive the first input data “a”, and a fourth input port to receive the second input data “b”.
  • the second input port of the NAND gate receive the second input data “b”, and is coupled to the fourth input port of the NOR gate.
  • Each of output nodes N 11 and N 12 of the NAND gate and the NOR gate is coupled to the first inverter INV 1 and the second inverter INV 2 .
  • the first inverter INV 1 is composed of a first PMOS transistor MP 1 and a first NMOS transistor MN 1 .
  • Each of the transistors MP 1 and MN 1 includes a current path shaped between a power supply voltage source V DD and the output node N 12 in series, and a gate controlled by the output of the NAND gate.
  • the second inverter INV 2 is composed of a second PMOS transistor MP 2 and a second NMOS transistor MN 2 .
  • Each of the transistors MP 2 and MN 2 includes a current path shaped between a ground voltage source V SS and the output node N 11 in series, and a gate controlled by the output of the NOR gate.
  • a third PMOS transistor MP 3 and a third NMOS transistor MN 3 are coupled, respectively, to make full swing, the output signal from the output nodes N 11 and N 12 .
  • the PMOS transistor MP 3 includes a drain coupled to the output node N 12 of the NOR gate, a source coupled to the output node N 21 of the first inverter INV 1 , and a gate controlled by the first input data “a”.
  • the NMOS transistor MN 3 includes a drain coupled to the output node N 11 of the NAND gate, a source coupled to the output node N 22 of the second inverter INV 2 , and a gate controlled by the first input data “a”. Operation of the XOR/XNOR logic circuit 52 and 54 using the NAND/NOR result to generate the XOR/XNOR result is described as follows.
  • the NAND gate and the NOR gate included in each of the XOR/XNOR logic circuits 52 and 54 each executes the NAND operation and the NOR operation, respectively.
  • the NAND operation result and the NOR operation result are input to the first inverter INV 1 and the second inverter INV 2 , respectively.
  • the first inverter INV 1 generates an XNOR result of the input data “a” and “b” (i,e., ⁇ overscore (a ⁇ b) ⁇ ) by inversing the output signal of the NAND gate while the output signal of the NOR gate is a logic low level, and outputting the output signal of the NAND gate without inversion while the output signal of the NOR gate is a logic high level.
  • the second inverter INV 2 generates an XOR result of the input data “a” and “b” (i,e., a ⁇ b) by inversing the output signal of the NOR gate while the output signal of the NAND gate is a logic high level, and outputting the output signal of the NOR gate without inversion while the output signal of the NAND gate is a logic low level.
  • the first inverter INV 1 receiving the output signal of the NAND gate generates an output signal with V DD -Vtn voltage level when both the output signals of the NAND gate and the NOR gate are logic high levels.
  • the output of the first inverter INV 1 is reduced by a threshold voltage Vtn of the MOS transistor MN 1 , so that the output signal is not fully swung.
  • the third PMOS transistor MP 3 coupled in parallel to the first NMOS transistor MN 1 of the first inverter INV 1 is turned on.
  • the transistor MP 3 charges on the output signal of the XOR/XNOR logic circuit 52 or 54 .
  • the XOR/XNOR logic element 52 or 54 can output the complete high voltage V DD .
  • the second inverter INV 2 receiving the output signal of the NOR gate generates an output signal having Vtp voltage level when both the output signals of the NAND gate and the NOR gate are logic low levels.
  • the output of the second inverter INV 2 is increased by a threshold voltage Vtp of the MOS transistor MP 2 , so that the output signal is not fully swung.
  • the third NMOS transistor MN 3 coupled in parallel to the second PMOS transistor MP 2 of the second inverter INV 2 is turned on.
  • the transistor MN 3 is discharged.
  • the XOR/XNOR logic element 52 or 54 can output the complete low voltage Vss.
  • the inverters INV 1 and INV 2 generate an XNOR result (i.e., ⁇ overscore (a ⁇ b) ⁇ ) and an XOR result (i.e., a ⁇ b) of the input data I 1 and I 2 , or I 3 and I 4 (i.e., “a” and “b”) of the NAND gate and the NOR gate, respectively, without fluctuation of the voltage originated- from the threshold voltage of the transistor included in the inverters INV 1 and INV 2 .
  • FIG. 7B Another embodiment of the XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6 will now be described with reference to FIG. 7B .
  • the XOR/XNOR logic circuit shown in FIG. 7B includes the same components as the XOR/XNOR logic circuit shown in FIG. 7A . That is, the XOR/XNOR logic circuit includes a NAND gate, a NOR gate, first and second inverters INV 1 and INV 2 for inverting the NAND and NOR gates, PMOS and NMOS transistors MP 3 and MN 3 for fully swinging an output of the inverters INV 1 and INV 2 .
  • a drain of the PMOS transistor MP 3 is connected to an output port N 12 of the NOR gate in FIG. 7A
  • a drain of the PMOS transistor MP 14 in FIG. 7B is connected to a drain of the PMOS transistor MP 13 of the NOR gate, not to the output port N 12 of the NOR gate, i.e., a drain of the PMOS transistor MP 14 .
  • the PMOS transistor MP 3 is coupled to the output node N 21 of the inverter INV 1 , which makes the voltage of the output port of the first inverter INV 1 have the same value as a voltage of an output port N 12 of the NOR gate, i.e., the voltage of the PMOS transistor MP 14 .
  • the PMOS transistor MP 3 shown in FIG. 7B has the same function as the PMOS transistor MP 3 shown in FIG. 7A .
  • the voltage level of the output signal of the first inverter INV 1 is not the voltage level of the drain of the PMOS transistor MP 14 , which is the voltage level of the output port of the NOR gate, but rather, the voltage level of the drain of the PMOS transistor MP 13 . Therefore the power supply voltage VDD is transferred through one transistor MP 13 , and not two transistors MP 13 and MP 14 as in the embodiment of FIG. 7A . As a result, the operation speed of the logic circuit is enhanced.
  • the NMOS transistor MN 3 is used to make the voltage level of the output port of the second inverter INV 2 equal to the voltage level of the output port N 11 of the NAND gate, i.e., a voltage level of a drain of the NMOS transistor MN 11 .
  • the NMOS transistor MN 3 shown in FIG. 7B has the same function as the NMOS transistor MN 3 shown in FIG. 7A .
  • the voltage level of the output signal of the second inverter INV 2 is not the voltage level of the drain of the NMOS transistor MN 11 , which is the voltage level of the output port of the NAND gate, but rather, the voltage level of the drain of the NMOS transistor MN 12 . Therefore the ground voltage VSS is transferred through one NMOS transistor MN 12 , and not two transistors MN 11 and MN 12 as in the embodiment of FIG. 7A . As a result, the operation speed of the logic circuit is enhanced.
  • the XOR/XNOR results ⁇ overscore (a ⁇ b) ⁇ /a ⁇ b are input to the second multiplexer 64 , and are used as a source and a gate of the CMOS transmission gates.
  • the NAND result and the NOR result of the first XOR/XNOR logic circuit 52 are input to the first multiplexer 62 , and are output as a carry-out bit Cout.
  • the output signal of the second multiplexer 64 is input to the single railed multiplexers 66 and 68 , with a carry-input bit Cin from the previous stage, for generating the sum bit Sum and the carry bit Carry.
  • FIG. 9 the performance of the 4-2 compressor 100 as compared to the performance of the conventional 4-2 compressors 20 and 30 in the same simulation environment, will now be discussed.
  • FIG. 9 is a circuit diagram for illustrating a simulation environment of the 4-2 compressor.
  • the 4-2 compressor 20 , 30 or 100 is located between marks of “A” and “B”.
  • Input data ⁇ overscore (I 1 ) ⁇ , ⁇ overscore (I 2 ) ⁇ , ⁇ overscore (I 3 ) ⁇ and ⁇ overscore (I 4 ) ⁇ are input though four buffers, output data ⁇ overscore (Sum) ⁇ and ⁇ overscore (Carry) ⁇ are observed through two buffers, respectively.
  • load capacitors C L marked as C 1 and C 2 are coupled to output ports of the 4-2 compressor.
  • the delay, and the power consumption of the 4-2 compressors, and the multiplication result of the delay and the power consumption are measured through the load capacitors C 1 and C 2 .
  • the 4-2 compressors 20 , 30 and 100 adopt a 0.18 ⁇ m CMOS design rule, and use 1.8V as a power supply voltage V DD .
  • an input frequency is 50 MHz
  • an input slop is 0.2 ns
  • a temperature is 25° C., respectively.
  • FIGS. 10 through 12 are charts for illustrating simulation results related to delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressors 20 , 30 and 100 shown in FIGS. 2, 4 and 6 , respectively, under the simulation environment shown in FIG. 9 .
  • a simulation result of the DPL 4-2 compressor 20 shown in FIG. 2 is marked as ‘D’
  • a simulation result of the RIC 4-2 compressor 30 shown in FIG. 4 is marked as ‘R’
  • a simulation result of the 4-2 compressor 100 according to the present invention shown in FIG. 6 is marked as ‘P’, respectively.
  • FIGS. 10 through 12 it can be determined that the delay, the power consumption, and the multiplication result of the delay and the power consumption of the 4-2 compressor 100 according to the present invention shown in FIG. 6 are reduced remarkably, as compared to the conventional 4-2 compressors 20 and 30 .
  • Table 1 shows the simulation results of the DPL compressor 20 and the 4-2 compressor 100 .
  • Table 2 shows the simulation results of the RIC compressor 30 and the compressor 100 .
  • the numbers expressed in round brackets show normalized simulation results based on the simulation results of the conventional 4-2 compressors 20 and 30 .
  • the delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressor 100 are improved, as compared to the DPL 4-2 compressor 20 in FIG. 2 .
  • the delay is improved by about 2-3%
  • the power consumption is improved by about 23-36%
  • the multiplication result is improved by about 26-38%, as compared to the DPL 4-2 compressor 20 .
  • the delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressor 100 are improved, as compared to the RIC 4-2 compressor 30 in FIG. 4 .
  • the delay is improved by about 11-14%
  • the power consumption is improved by about 28-39%
  • the multiplication result is improved by about 38-45%, as compared to the RIC 4-2 compressor 30 .
  • the 4-2 compressor according to the present invention can operate at high speed with low power consumption and small circuit area.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Logic Circuits (AREA)

Abstract

A high speed low powered 4-2 compressor according to the present invention performs an XOR/XNOR operation of input data by using a single input type NAND/NOR logic circuit and a dual input type NAND/NOR logic circuit. Thus, delays to generate complementary signals are avoided. In addition, the 4-2 compressor uses a single railed multiplexer instead of a dual railed multiplexer, so that gate drive nodes and internal load capacitance are reduced. As a result, circuit area and power consumption of the 4-2 compressor are reduced.

Description

  • This application is a continuation-in-part application of U.S. application Ser. No. 10/023,686 filed on Dec. 18, 2001, the contents of which are herein incorporated by reference in their entirety.
  • FIELD OF THE INVENTION
  • The present invention generally relates to a logic circuit and, more particularly, to a 4-2 compressor for a multiplier.
  • BACKGROUND OF THE INVENTION
  • Multiplication is one of the primary operations in general-purpose microprocessors and special purpose digital signal processors. The speed of the multiplication operation often directly determines how fast the processors are capable of operating. Generally, a multiplier generates plural partial summation results and then sums the partial results to compute a product. The summation performance influences the performance of the multiplier. However, as technology progresses, a user demands increasingly fast data processing systems. To satisfy the demand, the multiplier circuit grows in complexity, and, as a result, the multiplier occupies an increasingly larger area of a central processing unit (CPU) to be embedded in the data processing system. Referring to “IEEE-754 Floating Point specification”, which can be downloaded from the web site http://standards.ieee.org/, contemporary multipliers must be capable of performing a multiplication operation for 54 bits of input data. Thus, a significant amount of circuit area is required to implement a multiplier for input data of large bit width.
  • To improve the performance of the multiplier, several techniques have been developed. Generally, multipliers with the desired capability have array structures or Wallace tree structures. A fast array multiplier performing fast multiplication is composed of a Booth encoder, a partial-product summation tree (PPST), and a final adder. The PPST is responsible for a significant portion of the total multiplication operation delay, and is implemented commonly by a Wallace tree with 3-2 counters (i.e., full adders) and 4-2 compressors (i.e., half adders) as basic cells. Hence, design of these counters and the compressors will be a significant factor for improving the performance of the multiplier. The structure of the 4-2 compressor is as follows.
  • FIG. 1 is a block diagram for illustrating a conventional 4-2 compressor 10. The 4-2 compressor 10 is described in “Design of high-speed low-power 3-2 counter and 4-2 compressor for fast multipliers”, by Shen-Fu Hsiao etc.; Feb. 19, 1998, ELECTRONICS LETTERS, Vol. 34, No. 4, pp. 341-343. The 4-2 compressor 10 can be found in FIG. 3 a of the paper.
  • Referring to FIG. 1, the 4-2 compressor 10 is composed of two cascaded 3-2 counters 12 and 14. Each 3-2 counter can be realized using two multiplexers. When four input data I1, I2, I3 and I4 are externally input to the 4-2 compressor 10, the first 3-2 counter 12 generates a carry-out bit Cout and a sum bit S in response to the three input data I1, I2 and I3. The carry-out bit Cout is output to a 4-2 compressor at the next stage (not shown) connected to the current 4-2 compressor 10, and the sum bit S is output to the second 3-2 counter 14, respectively. The second 3-2 counter 14 generates a carry bit Carry and a sum bit Sum as final outputs, in response to the sum bit S, the input data I4, and a carry-input bit Cin from a previous 4-2 compressor at a previous stage (not shown). As described above, the compressor 10 receives the four input data I1. I2, I3 and I4, and generates two final outputs Carry and Sum. Therefore, the compressor 10 is referred to as a “4-2 compressor”. In the 4-2 compressor 10, four multiplexers are used. On the assumption that the delay required for one of the multiplexers is 1 MUX, then it follows that the 4-2 compressor 10 requires 4 MUXs of delay.
  • FIG. 2 is a circuit diagram for illustrating a conventional double-pass transistor-logic (DPL) 4-2 compressor 20 to reduce the delay of the 4-2 compressor 10, and FIG. 3 is a detailed circuit diagram for illustrating a dual-railed multiplexer shown in FIG. 2. The DPL 4-2 compressor 20 is described in FIG. 3 b of the Hsiao paper cited above. Referring to FIGS. 2 and 3, the 4-2 compressor 20 reduces the delay from 4 MUXs to 3 MUXs, through the use of pass-transistor multiplexers (PTMs) 21-25 and 27 composed of CMOS transmission gates (TGs). The detailed structure of the 4-2 compressor 20 is as follows.
  • The 4-2 compressor 20 generates two carry bits Cout and Carry, and a sum bit Sum, in response to externally input four input data I1, I2, I3 and I4, and a carry-input bit Cin from the previous 4-2 compressor (not shown). The 4-2 compressor 20 is constructed with dual rails being suitable for high speed operation. The 4-2 compressor 20 is referred to as a double-pass transistor-logic (DPL) 4-2 compressor.
  • At two internal nodes N1 and N2, the 4-2 compressor 20 has eight gates to be driven by the output of the pass-transistor multiplexers (PTMs) composed of CMOS transmission gates (TGs). For example, on the assumption that an internal load capacitance of a P-channel MOS transistor is Cgp and an internal load capacitance of a N-channel MOS transistor is Cgn, the internal load capacitance at the node N2 is 4Cgp+4Cgn, and the internal load capacitance at the node N2 is 4Cgp+4Cgn, respectively. In this case, the total internal load capacitance of the 4-2 compressor 20 becomes 8Cgp+8Cgn. Generally, the PMOS transistor is twice the size of the NMOS transistor. Thus, the internal load capacitance is doubled, which is expressed as 1Cgp≈2Cgn. Therefore, the total internal load capacitance of the 4-2 compressor 20 can be expressed as 24Cgn by converting it on the basis of the NMOS transistor.
  • Generally, pass-transistor logic is used for transmitting a source instead of driving gates. Thus, it is difficult for the 4-2 compressor 20 to show low-power and high-speed performance of the pass-transistor logic, and moreover, the power consumption and the area of the 4-2 compressor 20 increase in proportion to the increment of wiring based on the dual railed structure.
  • To solve these problems, a reduced internal capacitance (RIC) 4-2 compressor 30 is used, as illustrated in FIG. 4, which is described with reference to FIG. 3 c of the Hsiao paper cited above. FIG. 5 is a detailed circuit diagram for illustrating an XOR/XNOR logic element shown in FIG. 4. Referring to FIGS. 4 and 5, the 4-2 compressor 30 comprises multiplexers 33 and 37 composed of transmission gates, and XOR/ XNOR logic elements 31, 32, 34 and 36. The multiplexers 33 and 37 are coupled to output ports of carry bits Cout and Carry, and the XOR/ XNOR logic elements 31, 32, 34 and 36 are used for generating a sum bit Sum. Each of nodes N3 and N4 of the 4-3 compressor 30 has the internal load capacitance of 3Cgp+3Cgn. Thus, a total internal load capacitance of the 4-2 compressor 30 becomes 6Cgp+6Cgn. The total internal load capacitance can be expressed as 18Cgn by converting it on the basis of the NMOS transistor. This 4-2 compressor 30 is referred to as a reduced-internal-capacitance (RIC) 4-2 compressor.
  • However, the nodes N3 and N4 are used for driving gates of the multiplexers 33 and 37, and used as sources of the XOR/ XNOR logic elements 31, 32, 34 and 36, so that the driving gate capacity is relatively decreased. In other words, the capability of driving gates is decreased, although the amount of the internal load capacitance is reduced. Thus the efficiency of the operating time of the RIC 4-2 compressor 30 is inferior to the DPL 4-2 compressor 20. In addition, the power consumption and the area of the RIC 4-2 compressor 30 are increased in proportion to increment of wiring based on the dual-railed structure.
  • SUMMARY OF THE INVENTION
  • It is therefore an object of the present invention to provide a high-speed low-power 4-2 compressor by reducing the total internal load capacitance and reducing the circuit area and power consumption of the 4-2 compressor.
  • In order to attain the above objects, according to an aspect of the present invention, there is provided a 4-2 compressor generating a sum bit and a carry bit as a function of four input data comprising: a first logic circuit for performing a NAND operation and a NOR operation of first and second input data, generating an XOR/XNOR operation result of the first and second input data by using the NAND and NOR results, and generating a carry-out bit to a next stage by selecting either the NAND operation result or the NOR operation result; a second logic circuit for generating a selection signal in response to a third input data, a fourth input data and the XOR/XNOR operation result from the first logic circuit; a third logic circuit for generating the sum bit by selecting either a carry-input bit or an inverted carry-input bit in response to the selection signal from the second logic circuit; and a fourth logic circuit for generating the carry bit by selecting either the inverted carry-input bit or an inverted fourth data in response to the selection signal from the second logic circuit.
  • In a preferred embodiment, the first logic circuit comprises: a first XOR/XNOR logic circuit for performing the NAND operation and the NOR operation, respectively, and for generating the XOR/XNOR operation result by using the NAND and NOR operation results; and a first multiplexer for generating the carry-out bit for the following stage by selecting either the NAND operation result or the NOR operation result in response to an inverted third data, wherein the first multiplexer is a single railed multiplexer.
  • The second logic circuit preferably comprises: a second XOR/XNOR logic circuit for performing a NAND operation and a NOR operation of the third input data and fourth input data, and for generating an XOR/XNOR operation result of the third and fourth input data by using the NAND and NOR operation results; and a second multiplexer for generating the selection signal by selecting either the XOR operation result or the XNOR operation result from the first XOR/XNOR logic circuit, in response to the XOR/XNOR operation result from the second XOR/XNOR logic circuit, wherein the second multiplexer is a dual railed multiplexer.
  • The third logic circuit preferably comprises a third multiplexer for generating the sum bit, by selecting one of the carry-input bit and the inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the third multiplexer is a single railed multiplexer.
  • The fourth logic circuit preferably comprises a fourth multiplexer for generating the carry bit, by selecting one of the inverted fourth input data and the inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the fourth multiplexer is a single railed multiplexer.
  • Each XOR/XNOR logic circuit may comprise: a NAND gate for performing the NAND operation of a first input data and the second input data; a NOR gate for performing the NOR operation of a second input data and the first input data; an XNOR means for generating an XNOR result of the first and the second input data in response to the outputs of the NOR gate and the NAND gate; and an XOR means for generating an XOR result of the first and the second input data in response to the outputs of the NOR gate and the NAND gate.
  • The XNOR means may comprise: a first inverter for inverting the output from the NAND gate while the output of the NOR gate is a first logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is a second logic state; and a first full-swing means for making full-swing the output of the first inverter.
  • The XOR means may comprise: a second inverter for inverting the output from the NOR gate while the output of the NAND gate is the second logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is a second logic state; and a second full-swing means for making full-swing the output of the second inverter.
  • The first inverter may comprise a P type transistor and a N type transistor having a current path between a power supply voltage source and an output port of the NOR gate in series, and a control electrode controlled by the first input data “a”. The first full-swing means may comprise a P type transistor having a current path between an output port of the first inverter and the output port of the NOR gate, and control electrode controlled by the first input data “a”. The first full-swing means may be turned on when the output signal of the NOR gate is the second logic level, and increases the voltage level of the output signal of the first inverter as high as the power supply voltage level.
  • The second inverter may comprise a P type transistor and a N type transistor having a current path between a ground voltage source and an output port of the NAND gate in series, and a control electrode controlled by the output of the NOR gate. The second full-swing means may comprise a N type transistor having a current path between an output port of the second inverter and the output port of the NAND gate, and control electrode controlled by the first input data “a”. The second full-swing means is preferably turned on when the output signal of the NAND gate is the first logic level, and decreases the voltage level of the output signal of the second inverter as low as the ground voltage level.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The foregoing and other objects, features and advantages of the invention will be apparent from the more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
  • FIG. 1 is a block diagram for illustrating a conventional 4-2 compressor using 3-2 counters;
  • FIG. 2 is a circuit diagram for illustrating a conventional DPL 4-2 compressor;
  • FIG. 3 is a detailed circuit diagram for illustrating a dual railed multiplexer shown in FIG. 2;
  • FIG. 4 is a circuit diagram for illustrating a conventional RIC 4-2 compressor;
  • FIG. 5 is a detailed circuit diagram for illustrating an XOR/XNOR logic shown in FIG. 4;
  • FIG. 6 is a circuit diagram for illustrating a 4-2 compressor according to the preferred embodiment of the present invention;
  • FIG. 7A is a detailed circuit diagram for illustrating an XOR/XNOR logic element using NAND/NOR logic elements to generate an XOR/XNOR result, shown in FIG. 6;
  • FIG. 7B is a detailed circuit diagram for illustrating another embodiment of XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6;
  • FIG. 8 is a detailed circuit diagram for illustrating a single railed multiplexer shown in FIG. 6;
  • FIG. 9 is a circuit diagram for illustrating a simulation environment of the 4-2 compressor; and
  • FIGS. 10 through 12 are charts for illustrating simulation results related to delay and power consumption, and a multiplication result of the delay and power consumption of the 4-2 compressors shown in FIGS. 2, 4 and 6, respectively, under the simulation environment shown in FIG. 9.
  • DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
  • FIG. 6 is a circuit diagram for illustrating a 4-2 compressor 100 according to a preferred embodiment of the present invention. FIG. 7A is a detailed circuit diagram for illustrating an XOR/XNOR logic circuit shown in FIG. 6. FIG. 7B is a detailed circuit diagram for illustrating another embodiment of XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6, and FIG. 8 is a detailed circuit diagram for illustrating a single railed multiplexer shown in FIG. 6.
  • Referring first to FIG. 6, the 4-2 compressor 100 comprises four logic circuits 110, 120, 130 and 140. The first logic circuit 110 generates a NAND/NOR result of externally input first and second data I1 and I2, and an XOR/XNOR result by using the NAND/NOR result, and generates a carry-out bit Cout in response to the NAND/NOR result. The second logic circuit 120 generates a selection signal, in response to the NAND/NOR results from the first logic circuit 110 and externally input third and fourth input data I3 and I4. The third logic circuit 130 generates a sum bit Sum by selecting either a carry-input bit Cin from a previous stage or an inverted carry-input bit {overscore (Cin)}, in response to the selection signal from the second logic circuit 120. The fourth logic circuit 140 generates a carry bit Carry by selecting either the inverted carry-input bit {overscore (Cin)} or an inverted fourth input data {overscore (I4)}, in response to the selection signal from the second logic circuit 120. The structures of the logic circuits are described as follows.
  • The first logic circuit 110 includes a first XOR/XNOR logic element 52 and a single railed first multiplexer 62. The first XOR/XNOR logic element 52 generates a NAND/NOR result of the first and second data I1 and I2, and an XOR/XNOR result by using the NAND/NOR result. The first multiplexer 62 generates a carry-out bit Cout by selecting either the NAND result or the NOR result from the first XOR/XNOR logic circuit 52, in response to the inverted third data {overscore (I3)}.
  • The second logic circuit 120 includes a second XOR/XNOR logic element 54 and a dual railed second multiplexer 64. The second XOR/XNOR logic element 54 generates an XOR/XNOR result by using a NAND/NOR result of the third and fourth data I3 and I4. The second multiplexer 64 outputs the XOR/XNOR result from the first XOR/XNOR logic element 52, in response to the XOR/XNOR result from the second XOR/XNOR logic element 54. The XOR/XNOR result from the second multiplexer 64 is used as a selection signal to output the sum bit Sum and the carry bit Carry of the 4-2 compressor 100.
  • The third logic circuit 130 includes a single railed third multiplexer 66, which receives the carry-input bit Cin from the previous stage and an inverted carry-input bit {overscore (Cin)} and generates the sum bit Sum in response to the output of the second multiplexer 64.
  • The fourth logic circuit 140 includes a single railed fourth multiplexer 68, which receives the inverted carry-input bit {overscore (Cin)} and the inverted fourth data {overscore (I4)} and generates the carry bit Carry in response to the output of the second multiplexer 64.
  • The externally input carry-input bit Cin and the fourth data I4 are inverted through inverters 72 and 74, respectively. This inversion is used for generating an inversion signal required by the single railed multiplexers 66 and 68. The sum bit Sum and carry bits Cout and Carry are output through buffers 84, 82 and 86 coupled to the multiplexers 66, 62 and 68, respectively.
  • As shown in FIG. 6, the XOR/ XNOR logic elements 52 and 54 using the NAND/NOR logic elements to obtain XOR/XNOR results are used at an input stage of the 4-2 compressor 100. Each of the XOR/ XNOR logic elements 52 and 54 adopts a single input manner and a dual input manner, so that the XOR/ XNOR logic elements 52 and 54 do not require an inverter for generating an inversion signal. As a result, a delay originated from the inverter can be prevented.
  • In addition, the 4-2 compressor 100 uses the single railed multiplexers to limit the resulting circuit area and the power consumption of the 4-2 compressor. In this case, the 4-2 compressor 100 uses the outputs of the XOR/ XNOR logic circuits 52 and 54 as a source. Thus, the 4-2 compressor 100 does not require two nodes N1 and N2 of FIG. 2, or N3 and N4 of FIG. 4, but requires only one node N5. As a result, the gate drive at the node N5 has only 2Cgp+2Cgn of internal load capacitance. The internal load capacitance can be expressed as 6Cgn by converting it on the basis of the NMOS transistor. It is reduced in internal load capacitance by 18Cgn, as compared to the conventional DPL 4-2 compressor 20 shown in FIG. 2, and is reduced by 12Cgn, as compared to the conventional RIC 4-2 compressor 30 shown in FIG. 4.
  • Referring to FIG. 7A, each of the XOR/ XNOR logic elements 52 and 54 is composed of a NAND gate, a NOR gate, inverters INV1 and INV2 for inverting the output signal of the NAND gate and the NOR gate, and full-swing transistors MP3 and MN3 for making full swing, the output of the inverters INV1 and INV2.
  • Referring to FIG. 7B, the NAND logic includes PMOS transistors MP11 and MP12 and NMOS transistors MN11 and MN12. The PMOS transistor MP12 and the NMOS transistors MN11 and MN12 have current paths which are sequentially formed in series between a power supply voltage VDD and a ground voltage VSS, and have gates controlled by second input data “b”, first input data “a”, and the second input data “b”, respectively. The PMOS transistor MP11 has a drain coupled to an output port of the NAND gate, a source coupled to the power supply voltage VDD, and a gate controlled by a first input signal.
  • The NOR logic includes PMOS transistors MP13 and MP14 and NMOS transistors MN13 and MN14. The PMOS transistors MP13 MP14 and the NMOS transistor MN14 have current paths which are sequentially formed in series between a power supply voltage VDD and a ground voltage VSS, and have gates controlled by first input data “a”, second input data “b”, and the second input data “b”, respectively. The NMOS transistor MN13 has a drain coupled to an output port of the NOR gate, a source coupled to the ground voltage VSS, and a gate controlled by the first input data “a”.
  • The NAND gate includes a first input port for receiving a first input data “a”, and a second input port. The NOR gate includes a third input port coupled to the first input port of the NAND gate to receive the first input data “a”, and a fourth input port to receive the second input data “b”. The second input port of the NAND gate receive the second input data “b”, and is coupled to the fourth input port of the NOR gate.
  • Each of output nodes N11 and N12 of the NAND gate and the NOR gate is coupled to the first inverter INV1 and the second inverter INV2. The first inverter INV1 is composed of a first PMOS transistor MP1 and a first NMOS transistor MN1. Each of the transistors MP1 and MN1 includes a current path shaped between a power supply voltage source VDD and the output node N12 in series, and a gate controlled by the output of the NAND gate. The second inverter INV2 is composed of a second PMOS transistor MP2 and a second NMOS transistor MN2. Each of the transistors MP2 and MN2 includes a current path shaped between a ground voltage source VSS and the output node N11 in series, and a gate controlled by the output of the NOR gate.
  • To output nodes N21 and N22 of the inverters INV1 and INV2, a third PMOS transistor MP3 and a third NMOS transistor MN3 are coupled, respectively, to make full swing, the output signal from the output nodes N11 and N12. Briefly, the PMOS transistor MP3 includes a drain coupled to the output node N12 of the NOR gate, a source coupled to the output node N21 of the first inverter INV1, and a gate controlled by the first input data “a”. The NMOS transistor MN3 includes a drain coupled to the output node N11 of the NAND gate, a source coupled to the output node N22 of the second inverter INV2, and a gate controlled by the first input data “a”. Operation of the XOR/ XNOR logic circuit 52 and 54 using the NAND/NOR result to generate the XOR/XNOR result is described as follows.
  • First, when two data I1 and I2 or I3 and I4 (i.e., “a” and “b”) are input from an external source, the NAND gate and the NOR gate included in each of the XOR/ XNOR logic circuits 52 and 54 each executes the NAND operation and the NOR operation, respectively. The NAND operation result and the NOR operation result are input to the first inverter INV1 and the second inverter INV2, respectively.
  • The first inverter INV1 generates an XNOR result of the input data “a” and “b” (i,e., {overscore (a⊕b)}) by inversing the output signal of the NAND gate while the output signal of the NOR gate is a logic low level, and outputting the output signal of the NAND gate without inversion while the output signal of the NOR gate is a logic high level. The second inverter INV2 generates an XOR result of the input data “a” and “b” (i,e., a⊕b) by inversing the output signal of the NOR gate while the output signal of the NAND gate is a logic high level, and outputting the output signal of the NOR gate without inversion while the output signal of the NAND gate is a logic low level.
  • However, the first inverter INV1 receiving the output signal of the NAND gate generates an output signal with VDD-Vtn voltage level when both the output signals of the NAND gate and the NOR gate are logic high levels. The output of the first inverter INV1 is reduced by a threshold voltage Vtn of the MOS transistor MN1, so that the output signal is not fully swung. At this time, the third PMOS transistor MP3 coupled in parallel to the first NMOS transistor MN1 of the first inverter INV1, is turned on. Thus, the transistor MP3 charges on the output signal of the XOR/ XNOR logic circuit 52 or 54. According to the charging operation of the third PMOS transistor MP3, the XOR/ XNOR logic element 52 or 54 can output the complete high voltage VDD.
  • The second inverter INV2 receiving the output signal of the NOR gate generates an output signal having Vtp voltage level when both the output signals of the NAND gate and the NOR gate are logic low levels. The output of the second inverter INV2 is increased by a threshold voltage Vtp of the MOS transistor MP2, so that the output signal is not fully swung. At this time, the third NMOS transistor MN3 coupled in parallel to the second PMOS transistor MP2 of the second inverter INV2, is turned on. Thus, the transistor MN3 is discharged. According to the discharging operation of the third NMOS transistor MN3, the XOR/ XNOR logic element 52 or 54 can output the complete low voltage Vss.
  • As a result, the inverters INV1 and INV2 generate an XNOR result (i.e., {overscore (a⊕b)}) and an XOR result (i.e., a⊕b) of the input data I1 and I2, or I3 and I4 (i.e., “a” and “b”) of the NAND gate and the NOR gate, respectively, without fluctuation of the voltage originated- from the threshold voltage of the transistor included in the inverters INV1 and INV2.
  • Another embodiment of the XOR/XNOR logic element using the NAND/NOR logic element shown in FIG. 6 will now be described with reference to FIG. 7B.
  • The XOR/XNOR logic circuit shown in FIG. 7B includes the same components as the XOR/XNOR logic circuit shown in FIG. 7A. That is, the XOR/XNOR logic circuit includes a NAND gate, a NOR gate, first and second inverters INV1 and INV2 for inverting the NAND and NOR gates, PMOS and NMOS transistors MP3 and MN3 for fully swinging an output of the inverters INV1 and INV2. However, while a drain of the PMOS transistor MP3 is connected to an output port N12 of the NOR gate in FIG. 7A, a drain of the PMOS transistor MP14 in FIG. 7B is connected to a drain of the PMOS transistor MP13 of the NOR gate, not to the output port N12 of the NOR gate, i.e., a drain of the PMOS transistor MP14.
  • As previously stated, in the XOR/XNOR logic circuit of FIG. 7A, when output signals of the NAND and NOR gates are all high, an output signal of the first inverter INV1 does not fully swing and has a potential of VDD-Vtn. For this reason, the XOR/XNOR logic circuit using the NAND/NOR logic must output a fully high value VDD with the use of a charge operation of the PMOS transistor MP3. A voltage of an output port N21 of the first inverter INV1 is an instable voltage of VDD-Vtn. Therefore the PMOS transistor MP3 is coupled to the output node N21 of the inverter INV1, which makes the voltage of the output port of the first inverter INV1 have the same value as a voltage of an output port N12 of the NOR gate, i.e., the voltage of the PMOS transistor MP14.
  • The PMOS transistor MP3 shown in FIG. 7B has the same function as the PMOS transistor MP3 shown in FIG. 7A. In FIG. 7B, the voltage level of the output signal of the first inverter INV1 is not the voltage level of the drain of the PMOS transistor MP14, which is the voltage level of the output port of the NOR gate, but rather, the voltage level of the drain of the PMOS transistor MP13. Therefore the power supply voltage VDD is transferred through one transistor MP13, and not two transistors MP13 and MP14 as in the embodiment of FIG. 7A. As a result, the operation speed of the logic circuit is enhanced.
  • In the XOR/XNOR logic circuit of FIG. 7A, when the NAND gate 14 and the NOR gate 16 output a low level, the voltage of the output signal of the second inverter INV2 is the threshold voltage Vtp of the PMOS transistor MP2. Therefore, the NMOS transistor MN3 is used to make the voltage level of the output port of the second inverter INV2 equal to the voltage level of the output port N11 of the NAND gate, i.e., a voltage level of a drain of the NMOS transistor MN11.
  • The NMOS transistor MN3 shown in FIG. 7B has the same function as the NMOS transistor MN3 shown in FIG. 7A. In FIG. 7B, the voltage level of the output signal of the second inverter INV2 is not the voltage level of the drain of the NMOS transistor MN11, which is the voltage level of the output port of the NAND gate, but rather, the voltage level of the drain of the NMOS transistor MN12. Therefore the ground voltage VSS is transferred through one NMOS transistor MN12, and not two transistors MN11 and MN12 as in the embodiment of FIG. 7A. As a result, the operation speed of the logic circuit is enhanced.
  • Referring again to FIG. 6, the XOR/XNOR results {overscore (a⊕b)}/a⊕b are input to the second multiplexer 64, and are used as a source and a gate of the CMOS transmission gates. The NAND result and the NOR result of the first XOR/XNOR logic circuit 52 are input to the first multiplexer 62, and are output as a carry-out bit Cout. The output signal of the second multiplexer 64 is input to the single railed multiplexers 66 and 68, with a carry-input bit Cin from the previous stage, for generating the sum bit Sum and the carry bit Carry. With reference to FIG. 9, the performance of the 4-2 compressor 100 as compared to the performance of the conventional 4-2 compressors 20 and 30 in the same simulation environment, will now be discussed.
  • FIG. 9 is a circuit diagram for illustrating a simulation environment of the 4-2 compressor. Referring to FIG. 9, the 4-2 compressor 20, 30 or 100 is located between marks of “A” and “B”. Input data {overscore (I1)}, {overscore (I2)}, {overscore (I3)} and {overscore (I4)} are input though four buffers, output data {overscore (Sum)} and {overscore (Carry)} are observed through two buffers, respectively. Where, load capacitors CL marked as C1 and C2 are coupled to output ports of the 4-2 compressor. The delay, and the power consumption of the 4-2 compressors, and the multiplication result of the delay and the power consumption are measured through the load capacitors C1 and C2. The 4-2 compressors 20, 30 and 100 adopt a 0.18 μm CMOS design rule, and use 1.8V as a power supply voltage VDD. In the simulation, an input frequency is 50 MHz, an input slop is 0.2 ns, and a temperature is 25° C., respectively.
  • FIGS. 10 through 12 are charts for illustrating simulation results related to delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressors 20, 30 and 100 shown in FIGS. 2, 4 and 6, respectively, under the simulation environment shown in FIG. 9. In FIGS. 10 through 12, a simulation result of the DPL 4-2 compressor 20 shown in FIG. 2 is marked as ‘D’, a simulation result of the RIC 4-2 compressor 30 shown in FIG. 4 is marked as ‘R’, and a simulation result of the 4-2 compressor 100 according to the present invention shown in FIG. 6 is marked as ‘P’, respectively.
  • Referring to FIGS. 10 through 12, it can be determined that the delay, the power consumption, and the multiplication result of the delay and the power consumption of the 4-2 compressor 100 according to the present invention shown in FIG. 6 are reduced remarkably, as compared to the conventional 4-2 compressors 20 and 30.
  • Simulation results shown in FIGS. 10 through 12 are expressed below with reference to Table 1 and Table 2. Table 1 shows the simulation results of the DPL compressor 20 and the 4-2 compressor 100. Table 2 shows the simulation results of the RIC compressor 30 and the compressor 100. In Table 1 and Table 2, the numbers expressed in round brackets show normalized simulation results based on the simulation results of the conventional 4-2 compressors 20 and 30.
    TABLE 1
    Conventional
    DPL 4-2 compressor Proposed 4-2 compressor
    CL Power Delay × Power Delay ×
    [pF] Delay [ns] [μW] Power [fJ] Delay [ns] [μW] Power [fJ]
    0.02 0.33 3.67 1.21 0.32 2.83 0.90
    (1.00) (1.00) (1.00) (0.97) (0.77) (0.74)
    0.04 0.38 4.73 1.80 0.37 3.38 1.25
    (1.00) (1.00) (1.00) (0.97) (0.71) (0.69)
    0.06 0.42 5.81 2.44 0.41 3.96 1.62
    (1.00) (1.00) (1.00) (0.98) (0.68) (0.66)
    0.08 0.46 6.91 3.18 0.45 4.52 2.03
    (1.00) (1.00) (1.00) (0.98) (0.65) (0.64)
    0.10 0.50 7.99 4.00 0.49 5.08 2.49
    (1.00) (1.00) (1.00) (0.98) (0.64) (0.62)
  • TABLE 2
    Conventional
    RIC 4-2 compressor Proposed 4-2 compressor
    CL Power Delay × Power Delay ×
    [pF] Delay [ns] [μW] Power [fJ] Delay [ns] [μW] Power [fJ]
    0.02 0.37 3.91 1.45 0.32 2.83 0.90
    (1.00) (1.00) (1.00) (0.86) (0.72) (0.62)
    0.04 0.42 5.02 2.11 0.37 3.38 1.25
    (1.00) (1.00) (1.00) (0.88) (0.67) (0.59)
    0.06 0.46 6.14 2.82 0.41 3.96 1.62
    (1.00) (1.00) (1.00) (0.89) (0.64) (0.57)
    0.08 0.51 7.22 3.68 0.45 4.52 2.03
    (1.00) (1.00) (1.00) (0.88) (0.63) (0.55)
    0.10 0.55 8.28 4.55 0.49 5.08 2.49
    (1.00) (1.00) (1.00) (0.89) (0.61) (0.55)
  • As shown in Table 1, the delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressor 100 are improved, as compared to the DPL 4-2 compressor 20 in FIG. 2. In the 4-2 compressor 100, the delay is improved by about 2-3%, the power consumption is improved by about 23-36% and the multiplication result is improved by about 26-38%, as compared to the DPL 4-2 compressor 20.
  • As shown in Table 2, the delay, power consumption, and multiplication result of the delay and the power consumption of the 4-2 compressor 100 are improved, as compared to the RIC 4-2 compressor 30 in FIG. 4. In the 4-2 compressor 100, the delay is improved by about 11-14%, the power consumption is improved by about 28-39% and the multiplication result is improved by about 38-45%, as compared to the RIC 4-2 compressor 30. In this manner, the 4-2 compressor according to the present invention can operate at high speed with low power consumption and small circuit area.
  • While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (20)

1. A 4-2 compressor for generating a sum bit and a carry bit as a result of four input data bits comprising:
a first logic circuit for performing a NAND operation and a NOR operation of a first and second input data, for generating an XOR/XNOR operation result of the first and second input data using the NAND and NOR operation results, and for generating a carry-out bit for a following stage by selecting either the NAND operation result or the NOR operation result;
a second logic circuit for generating a selection signal in response to a third input data, a fourth input data and the XOR/XNOR operation result from the first logic circuit;
a third logic circuit for generating the sum bit by selecting one of a carry-input bit and an inverted carry-input bit in response to the selection signal from the second logic circuit; and
a fourth logic circuit for generating the carry bit by selecting one of the inverted carry-input bit and an inverted fourth input data in response to the selection signal from the second logic circuit.
2. The 4-2 compressor according to claim 1, wherein the first logic circuit comprises:
a first XOR/XNOR logic circuit for performing the NAND operation and the NOR operation, respectively, and for generating the XOR/XNOR operation result by using the NAND and NOR operation results; and
a first multiplexer for generating the carry-out bit for the following stage by selecting either the NAND operation result or the NOR operation result in response to an inverted third data, wherein the first multiplexer is a single railed multiplexer.
3. The 4-2 compressor according to claim 1, wherein the second logic circuit comprises:
a second XOR/XNOR logic circuit for performing a NAND operation and a NOR operation of the third input data and fourth input data, and for generating an XOR/XNOR operation result of the third and fourth input data by using the NAND and NOR operation results; and
a second multiplexer for generating the selection signal by selecting either the XOR operation result or the XNOR operation result from the first logic circuit, in response to the XOR/XNOR operation result from the second XOR/XNOR logic circuit, wherein the second multiplexer is a dual railed multiplexer.
4. The 4-2 compressor according to claim 1, wherein the third logic circuit comprises a third multiplexer for generating the sum bit, by selecting one of the carry-input bit and the inverted carry-input bit, in response to the selection signal from the second logic circuit, wherein the third multiplexer is a single railed multiplexer.
5. The 4-2 compressor according to claim 1, wherein the fourth logic circuit comprises a fourth multiplexer for generating the carry bit, by selecting one of the inverted fourth input data and the inverted carry-input bit, in response to the selection signal from the second logic circuit, wherein the fourth multiplexer is a single railed multiplexer.
6. The 4-2 compressor according to claim 2, wherein each XOR/XNOR logic circuit comprises:
a NAND gate for performing the NAND operation of a first data and a second data;
a NOR gate for performing the NOR operation of the first data and the second data;
an XNOR means for generating an XNOR result of the first and the second data in response to the outputs of the NOR gate and the NAND gate; and
an XOR means for generating an XOR result of the first and the second data in response to the outputs of the NOR gate and the NAND gate.
7. The 4-2 compressor according to claim 6, wherein the XNOR means comprises:
a first inverter for inverting the output from the NAND gate while the output of the NOR gate is a first logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is a second logic state; and
a first full-swing means for making full-swing the output of the first inverter.
8. The 4-2 compressor according to claim 6, wherein the XOR means comprises:
a second inverter for inverting the output from the NOR gate while the output of the NAND gate is the second logic state, and outputting the output from the NAND gate without inverting while the output of the NOR gate is the second logic state; and
a second full-swing means for making full-swing the output of the second inverter.
9. The 4-2 compressor according to claim 7, wherein the first inverter comprises a P type transistor and a N type transistor having a current path between a power supply voltage source and an output port of the NOR gate in series, and a control electrode controlled by the output of the NAND gate.
10. The 4-2 compressor according to claim 7, wherein the first full-swing means comprises a P type transistor having a current path between an output port of the first inverter and the output port of the NOR gate, and a control electrode controlled by the first data.
11. The 4-2 compressor according to claim 10, wherein the first full-swing means is turned on when the output signal of the NOR gate is the second logic level, and increases the voltage level of the output signal of the first inverter as high as the power supply voltage level.
12. The 4-2 compressor according to claim 7, wherein the NOR gate comprises:
a PMOS transistor MP13, a PMOS transistor MP14, and an NMOS transistor MN14 which have current paths sequentially formed in series between a power supply voltage VDD and a ground voltage VSS;
an NMOS transistor MN13 having a drain coupled to an output port of the NOR gate and a source coupled to the ground voltage VSS,
wherein the first full-swing means comprises a P-type transistor having a current path formed between the output port of the first inverter and the drain of the PMOS transistor MP13.
13. The 4-2 compressor according to claim 12, wherein the first full-swing means is turned on when the output signal of the NOR gate is the second logic level, and increases the voltage level of the output signal of the first inverter as high as the power supply voltage level.
14. The 4-2 compressor according to claim 8, wherein the second inverter comprises a P type transistor and a N type transistor having a current path between a ground voltage source and an output port of the NAND gate in series, and a control electrode controlled by the output of the NOR gate.
15. The 4-2 compressor according to claim 8, wherein the second full-swing means comprises an N type transistor having a current path between an output port of the second inverter and the output port of the NAND gate, and control electrode controlled by the first data.
16. The 4-2 compressor according to claim 13, wherein the second full-swing means is turned on when the output signal of the NAND gate is the first logic level, and decreases the voltage level of the output signal of the second inverter as low as the ground voltage level.
17. The 4-2 compressor according to claim 8, wherein the NAND gate comprises:
a PMOS transistor MP12, an NMOS transistor MN11, and an NMOS transistor MN12 which have current paths sequentially formed in series between a power supply voltage VDD and a ground voltage VSS;
a PMOS transistor MP11 having a drain coupled to an output port of the NAND gate and a source coupled to the power supply voltage VDD,
wherein the second full-swing means comprises an N-type transistor having a current path formed between the output port of the second inverter and the drain of the NMOS transistor MN12.
18. The 4-2 compressor according to claim 17, wherein the second full-swing means is turned on when the output signal of the NAND gate is the first logic level, and decreases the voltage level of the output signal of the second inverter as low as the ground voltage level.
19. The 4-2 compressor according to claim 15, wherein each XOR/XNOR logic circuit comprises:
a NAND gate for performing the NAND operation of a first data and a second data;
a NOR gate for performing the NOR operation of the first data and the second data;
an XNOR means for generating an XNOR result of the first and the second data in response to the outputs of the NOR gate and the NAND gate; and
an XOR means for generating an XOR result of the first and the second data in response to the outputs of the NOR gate and the NAND gate.
20. A 4-2 compressor for generating a sum bit and a carry bit as a result of four input data bits comprising:
a first XOR/XNOR logic circuit for performing a NAND operation and a NOR operation of a first and second input data, for generating an XOR/XNOR operation result of the first and second input data by using the NAND and NOR operation results;
a first multiplexer for generating the carry-out bit for following stage by selecting either the NAND operation result of the NOR operation result in response to an inverted third input data, wherein the first multiplexer is a single railed multiplexer;
a second XOR/XNOR logic circuit for performing a NAND operation and a NOR operation of a third input data and fourth input data, and for generating an XOR/XNOR operation result or the third and fourth input data by using the NAND and NOR operation results;
a second multiplexer for generating the selection signal by selecting either the XOR operation result or the XNOR operation result from the first XOR/XNOR logic circuit, in response to the XOR/XNOR operation result from the second XOR/XNOR logic circuit, wherein the second multiplexer is a dual railed multiplexer;
a third multiplexer for generating the sum bit, by selecting one of a carry-input bit and inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the third multiplexer is a single railed multiplexer; and
a fourth multiplexer for generating the carry bit, by selecting one of an inverted fourth input data and the inverted carry-input bit, in response to the selection signal from the second multiplexer, wherein the fourth multiplexer is a single railed multiplexer.
US10/932,593 2000-12-29 2004-09-02 High speed low power 4-2 compressor Abandoned US20050027777A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US10/932,593 US20050027777A1 (en) 2000-12-29 2004-09-02 High speed low power 4-2 compressor

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
KR00-85542 2000-12-29
KR10-2000-0085542A KR100423903B1 (en) 2000-12-29 2000-12-29 High speed low power 4-2 compressor
US10/023,686 US6904447B2 (en) 2000-12-29 2001-12-18 High speed low power 4-2 compressor
KR1020030061766A KR100553702B1 (en) 2003-09-04 2003-09-04 Full Adder
KR03-61766 2003-09-04
US10/932,593 US20050027777A1 (en) 2000-12-29 2004-09-02 High speed low power 4-2 compressor

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/023,686 Continuation-In-Part US6904447B2 (en) 2000-12-29 2001-12-18 High speed low power 4-2 compressor

Publications (1)

Publication Number Publication Date
US20050027777A1 true US20050027777A1 (en) 2005-02-03

Family

ID=19703930

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/023,686 Expired - Lifetime US6904447B2 (en) 2000-12-29 2001-12-18 High speed low power 4-2 compressor
US10/932,593 Abandoned US20050027777A1 (en) 2000-12-29 2004-09-02 High speed low power 4-2 compressor

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/023,686 Expired - Lifetime US6904447B2 (en) 2000-12-29 2001-12-18 High speed low power 4-2 compressor

Country Status (4)

Country Link
US (2) US6904447B2 (en)
KR (1) KR100423903B1 (en)
GB (1) GB2372854B (en)
TW (1) TW483243B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233760A1 (en) * 2006-03-29 2007-10-04 Sanu Mathew 3:2 Bit compressor circuit and method
US20090151837A1 (en) * 2007-12-18 2009-06-18 Gm Global Technology Operations, Inc. Pneumatic tire and configuration for reducing a tire noise
CN102882513A (en) * 2012-10-09 2013-01-16 北京大学 Full adder circuit and chip
US9966121B2 (en) 2016-06-14 2018-05-08 SK Hynix Inc. Comparison circuits and semiconductor devices employing the same
CN110190843A (en) * 2018-04-10 2019-08-30 北京中科寒武纪科技有限公司 Compressor circuit, Wallace tree circuit, multiplier circuit, chip and equipment
US20200136643A1 (en) * 2018-10-25 2020-04-30 Arm Limited Data Compressor Logic Circuit
CN111475136A (en) * 2020-04-10 2020-07-31 电子科技大学 Approximate 4-2 compressor with zero mean error

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6578063B1 (en) * 2000-06-01 2003-06-10 International Business Machines Corporation 5-to-2 binary adder
KR100505491B1 (en) * 2002-10-02 2005-08-03 전자부품연구원 4:2 compressor for fast arithmetic unit
US7302460B1 (en) * 2003-09-11 2007-11-27 Altera Corporation Arrangement of 3-input LUT's to implement 4:2 compressors for multiple operand arithmetic
US7290026B2 (en) * 2003-09-30 2007-10-30 International Business Machines Corporation Low-power high-speed 4-2 compressor with minimized transistor count
US7720902B2 (en) * 2006-02-28 2010-05-18 Sony Corporation Entertainment Inc. Methods and apparatus for providing a reduction array
US7508233B2 (en) * 2007-01-31 2009-03-24 National Yunlin University Of Science And Technology Full-adder of complementary carry logic voltage compensation
US20090063609A1 (en) * 2007-06-08 2009-03-05 Honkai Tam Static 4:2 Compressor with Fast Sum and Carryout
US20120265793A1 (en) * 2011-04-12 2012-10-18 Advanced Micro Devices, Inc. Merged compressor flop circuit
US10003342B2 (en) 2014-12-02 2018-06-19 Taiwan Semiconductor Manufacturing Company, Ltd. Compressor circuit and compressor circuit layout
CN108984149B (en) * 2018-08-07 2023-03-03 电子科技大学 Approximate 4-2 compressor with high speed and low power consumption
CN109542393B (en) * 2018-11-19 2022-11-04 电子科技大学 Approximate 4-2 compressor and approximate multiplier
US11106431B2 (en) * 2019-02-24 2021-08-31 DinoplusAI Holdings Limited Apparatus and method of fast floating-point adder tree for neural networks

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4766565A (en) * 1986-11-14 1988-08-23 International Business Machines Corporation Arithmetic logic circuit having a carry generator
US4831578A (en) * 1985-11-25 1989-05-16 Harris Semiconductor (Patents) Inc. Binary adder
US4839849A (en) * 1985-09-30 1989-06-13 Siemens Aktiengesellschaft Ripple-carry adder
US4931981A (en) * 1985-09-30 1990-06-05 Siemens Aktiengesellschaft Multi-place ripple-carry adder
US5040139A (en) * 1990-04-16 1991-08-13 Tran Dzung J Transmission gate multiplexer (TGM) logic circuits and multiplier architectures
US5151875A (en) * 1990-03-16 1992-09-29 C-Cube Microsystems, Inc. MOS array multiplier cell
US5694061A (en) * 1995-03-27 1997-12-02 Casio Computer Co., Ltd. Semiconductor device having same conductive type MIS transistors, a simple circuit design, and a high productivity
US5805491A (en) * 1997-07-11 1998-09-08 International Business Machines Corporation Fast 4-2 carry save adder using multiplexer logic
US5818747A (en) * 1995-01-27 1998-10-06 Sun Microsystems, Inc. Small, fast CMOS 4-2 carry-save adder cell
US6043675A (en) * 1997-01-30 2000-03-28 Fujitsu Limited Logic circuit utilizing capacitive coupling, an AD converter and a DA converter
US6505226B1 (en) * 1996-09-06 2003-01-07 Hyundai Electronics Industries Co., Ltd. High speed parallel adder
US6535902B2 (en) * 1996-08-29 2003-03-18 Fujitsu Limited Multiplier circuit for reducing the number of necessary elements without sacrificing high speed capability

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4901270A (en) * 1988-09-23 1990-02-13 Intel Corporation Four-to-two adder cell for parallel multiplication
JP2597736B2 (en) * 1990-07-17 1997-04-09 株式会社東芝 Fast multiplier
KR0167302B1 (en) * 1995-12-30 1999-01-15 문정환 4:2 compressing circuit of multiplier
TW421757B (en) * 1996-06-06 2001-02-11 Matsushita Electric Ind Co Ltd Arithmetic processor
KR100233856B1 (en) * 1996-12-09 1999-12-01 정선종 High speed 4-2 adder with cpl logic
JP2000056949A (en) * 1998-08-10 2000-02-25 Mitsubishi Electric Corp 4-2 compressor circuit and multiplier

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4839849A (en) * 1985-09-30 1989-06-13 Siemens Aktiengesellschaft Ripple-carry adder
US4931981A (en) * 1985-09-30 1990-06-05 Siemens Aktiengesellschaft Multi-place ripple-carry adder
US4831578A (en) * 1985-11-25 1989-05-16 Harris Semiconductor (Patents) Inc. Binary adder
US4766565A (en) * 1986-11-14 1988-08-23 International Business Machines Corporation Arithmetic logic circuit having a carry generator
US5151875A (en) * 1990-03-16 1992-09-29 C-Cube Microsystems, Inc. MOS array multiplier cell
US5040139A (en) * 1990-04-16 1991-08-13 Tran Dzung J Transmission gate multiplexer (TGM) logic circuits and multiplier architectures
US5818747A (en) * 1995-01-27 1998-10-06 Sun Microsystems, Inc. Small, fast CMOS 4-2 carry-save adder cell
US5694061A (en) * 1995-03-27 1997-12-02 Casio Computer Co., Ltd. Semiconductor device having same conductive type MIS transistors, a simple circuit design, and a high productivity
US6535902B2 (en) * 1996-08-29 2003-03-18 Fujitsu Limited Multiplier circuit for reducing the number of necessary elements without sacrificing high speed capability
US6505226B1 (en) * 1996-09-06 2003-01-07 Hyundai Electronics Industries Co., Ltd. High speed parallel adder
US6043675A (en) * 1997-01-30 2000-03-28 Fujitsu Limited Logic circuit utilizing capacitive coupling, an AD converter and a DA converter
US5805491A (en) * 1997-07-11 1998-09-08 International Business Machines Corporation Fast 4-2 carry save adder using multiplexer logic

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233760A1 (en) * 2006-03-29 2007-10-04 Sanu Mathew 3:2 Bit compressor circuit and method
US20090151837A1 (en) * 2007-12-18 2009-06-18 Gm Global Technology Operations, Inc. Pneumatic tire and configuration for reducing a tire noise
CN102882513A (en) * 2012-10-09 2013-01-16 北京大学 Full adder circuit and chip
US9966121B2 (en) 2016-06-14 2018-05-08 SK Hynix Inc. Comparison circuits and semiconductor devices employing the same
CN110190843A (en) * 2018-04-10 2019-08-30 北京中科寒武纪科技有限公司 Compressor circuit, Wallace tree circuit, multiplier circuit, chip and equipment
WO2019196727A1 (en) * 2018-04-10 2019-10-17 北京中科寒武纪科技有限公司 Compressor circuit, wallace tree circuit, multiplier circuit, chip and device
US20200136643A1 (en) * 2018-10-25 2020-04-30 Arm Limited Data Compressor Logic Circuit
CN111106825A (en) * 2018-10-25 2020-05-05 Arm 有限公司 Data compressor logic circuit
US10756753B2 (en) * 2018-10-25 2020-08-25 Arm Limited Data compressor logic circuit
CN111475136A (en) * 2020-04-10 2020-07-31 电子科技大学 Approximate 4-2 compressor with zero mean error

Also Published As

Publication number Publication date
KR20020056222A (en) 2002-07-10
GB2372854A (en) 2002-09-04
KR100423903B1 (en) 2004-03-24
GB0129922D0 (en) 2002-02-06
US6904447B2 (en) 2005-06-07
GB2372854B (en) 2003-02-05
US20020129077A1 (en) 2002-09-12
TW483243B (en) 2002-04-11

Similar Documents

Publication Publication Date Title
US6904447B2 (en) High speed low power 4-2 compressor
Abu-Khater et al. Circuit techniques for CMOS low-power high-performance multipliers
JPH10124297A (en) Multiplier circuit, adder circuit constituting the multiplier circuit, partial product bit compressing method for the multiplier circuit, and large-scale semiconductor integrated circuit applying the multiplier circuit
KR100203302B1 (en) Static and dynamic adder using n-mos
US6690204B1 (en) Limited switch dynamic logic circuit
US7519648B2 (en) Encoder for a multiplier
Kishore et al. Low power and high speed optimized 4-bit array multiplier using MOD-GDI technique
Kandpal et al. Design of low power and high speed XOR/XNOR circuit using 90 nm CMOS technology
US6066978A (en) Partial product generating circuit
US5732008A (en) Low-power high performance adder
US6990508B1 (en) High performance carry chain with reduced macrocell logic and fast carry lookahead
Archana et al. Design of low power and high speed ripple carry adder
US20020186050A1 (en) Logic circuit for true and complement signal generator
US7003545B1 (en) High performance carry chain with reduced macrocell logic and fast carry lookahead
US11831341B2 (en) Data compressor logic circuit
US11152942B2 (en) Three-input exclusive NOR/OR gate using a CMOS circuit
US5812521A (en) Static adder using BICMOS emitter dot circuits
Jeong et al. Robust high-performance low-power carry select adder
US7085796B1 (en) Dynamic adder with reduced logic
US20190034165A1 (en) Circuit for addition of multiple binary numbers
US6571269B1 (en) Noise-tolerant digital adder circuit and method
Namin et al. Low power design of a word-level finite field multiplier using reordered normal basis
Chang et al. A high speed low power 4: 2 compressor cell design
US7042246B2 (en) Logic circuits for performing threshold functions
EP0250174B1 (en) Incrementer and decrementer

Legal Events

Date Code Title Description
AS Assignment

Owner name: SAMSUNG ELECTRONICS, CO. LTD., KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:RHEE, YOUNG-CHUL;KIM, SANG-SUK;LEE, DONG-WOOK;REEL/FRAME:015768/0979;SIGNING DATES FROM 20040818 TO 20040819

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE