GB2345562A - Digital signal processor for performing fixed-point and/or integer arithmetic - Google Patents

Digital signal processor for performing fixed-point and/or integer arithmetic Download PDF

Info

Publication number
GB2345562A
GB2345562A GB0007909A GB0007909A GB2345562A GB 2345562 A GB2345562 A GB 2345562A GB 0007909 A GB0007909 A GB 0007909A GB 0007909 A GB0007909 A GB 0007909A GB 2345562 A GB2345562 A GB 2345562A
Authority
GB
United Kingdom
Prior art keywords
data
bit
bits
rounding
operating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0007909A
Other versions
GB2345562B (en
GB0007909D0 (en
Inventor
Il Teak Lim
Jun Ho Bahn
Kyu Seok Kim
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1019970045342A external-priority patent/KR100315303B1/en
Priority claimed from KR1019970045343A external-priority patent/KR100251546B1/en
Priority claimed from KR1019970045341A external-priority patent/KR100297544B1/en
Priority claimed from KR1019970045345A external-priority patent/KR100246472B1/en
Priority claimed from KR1019980012318A external-priority patent/KR100271571B1/en
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Priority claimed from GB9818919A external-priority patent/GB2330226B/en
Publication of GB0007909D0 publication Critical patent/GB0007909D0/en
Publication of GB2345562A publication Critical patent/GB2345562A/en
Publication of GB2345562B publication Critical patent/GB2345562B/en
Application granted granted Critical
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/57Arithmetic logic units [ALU], i.e. arrangements or devices for performing two or more of the operations covered by groups G06F7/483 – G06F7/556 or for performing logical operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/499Denomination or exception handling, e.g. rounding or overflow
    • G06F7/49942Significance control
    • G06F7/49947Rounding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Complex Calculations (AREA)

Abstract

A digital signal processor has ALUs 58, 72 and accumulating registers 64, 66, 76, 78 small in bit number. r-bit rounding bits are added to an N-bit data to be processed (wherein r<N) by aligning the N-bit data to the left or the right of an (N+r)-bit bus;after the data has been operated, the high- or low-order N bits from the operated data are derived respectively (Fig. 5) , thus enabling fixed-point and integer arithmetic to be performed. g-bit guard bits are also added to the high order bits of the (N+r)-bit data, prior to operating. The g+N+r-bit data are selectively operated by means of parallel accumulators 58, 64 and 66 and 72, 76 and 78, and then selectively rounded and/or saturated by rounding/saturation processor 86. Addition/subtraction arithmetic operations which include scaling can be performed by employing barrel shifter 60 in parallel with the accumulators. In one embodiment (not claimed), no rounding bits are used (Fig. 7).

Description

DIGITAL SIGNAL PROCESSOR This invention relates to a digital signal processor (DSP) for processing various digital signals depending upon any programs.
Conventionally, the DSP can employ any one of a floatingpoint arithmetic and a fixed-point arithmetic. The floating-point arithmetic has a disadvantage in that it provides more accurate result than, but costs considerably more than the fixed-point arithmetic. Accordingly, most DSPs employ the fixed-point arithmetic. In order to prevent the error generation, the DSP of fixed-point arithmetic system having a typical N bit wordlength makes use of an arithmetic logic unit (ALU) having a'8+2N'bit wordlength and an accumulating register. Herein,'8'refers to the extra bits for overflow guarding commonly used.
Because all the ALU and accumulating register is lengthened at least twice more than the data wordlength N, an operation amount in the DSP is not only increased, but also a response speed of the DSP is reduced. Also, there are DSPs without the extra bits.
Actually, first and second data 10 and 12 having each an integer part and a fractional part and an N bit wordlength, as shown in Fig. 1, is read out of memory (not shown), and multiplied by means of a multiplier 14. A multiplied data having at most 2N bit length is generated at the multiplier 14. Such a multiplied data is temporarily stored in a product register 16. The multiplied data stored in the product register 16 is moved toward an accumulating register 20 through an ALU (18) in a state in which it is shifted by the bit number corresponding to the integer parts of the first and second data. The data removed to the accumulating register 20 has a new value in the case of data being operated by the ALU. An overflow can be generated. in the data which is stored in the accumulating register 20. To this end, a serious error is generated in the data of the accumulating register 20. In order to reduce the serious error caused by the overflow, DSPs in TMS320C5x series developed by Texas Instrument Co.
Ltd., have a saturation operating function instead of having the overflow prevention bits. On the other hand, DSPs of DSP5600x series developed by Motorola Corp. add 8 extra bits to the accumulating register 20 for the purpose of overflow guarding as shown in Fig. 1. In this case, the ALU and the accumulating register 20 included-in the DSP have a'8+2N'bit length. When the 8 bit of extra bits are applied to the DSP having a 24 bit wordlength, each of the ALU and the accumulating register 20 included in the DSP has a 56 bit length; while when the 8 bit of extra bits are applied to the DSP having a 20 bit wordlength, each of them has a 46 bit length. The ALU and accumulating register, having a wordlength of more than 48 bits causes the die size of the DSP chip as well as the manufacturing cost thereof to be increased. Also, the operating speed of the DSP becomes slow because a propagation delay amount, in the ALU having a large wordlength, is great.
The data stored in the accumulating register 20, hereinafter referred to as"accumulated data", is saturated prior to being transferred to the memory. By this saturation process, the accumulated data is changed into a third data 22 having a bit length ecual to the first and second data 10 and 12. Prior to performing such a saturation process, some DSP allows a rounding operation to be performed. For example, the DSP of Motorola corp. converts the'8+2N'bit data stored in the accumulator 20 into a'8+N'bit data under a command word"rnd". The saturation process to be performed after that time, changes the rounded data into an N bit of third data 22.
The DSPs using the method as described above, additionally wastes one command word or one clock period for the rounding operation. Due to this, clock periods additionally wasted become great when the rounding operation is involved in a code segment including a looping or a block repeating. As a result, an operation amount performed by the DSP increases.
Furthermore, the DSP in TMS320C5x model of Texas Instrument Co. Ltd. shift the data to be calculated with the ALU to the left by 0 to 16 bits using a pre-scaling shifter arranged at the previous stage of the ALU. In this case, the shift operation from the'0'numbered bit to the '15'numbered bit is usually used for scaling the data, but the shift operation to the'16'numbered bit is used when performing a fixed-point arithmetic rather than an integer arithmetic. This results from a fact that the 16 bit data read out of the memory must be arrayed to high order bits of the 32 bit accumulating register. The shift operation from the'0'numbered bit to the 15 numbered bit for scaling the data may or may not be used effectively depending upon a given algorithm, particularly upon a algorithm coding method. When performing an algorithm implemented with a code in which the pre-scaling is not used effectively, the pre-scaling shifter causes the die size of the DSP chip to be enlarged and the propagation delay amount to be increased, without any useful advantage.
For example, the pre-scaling shifter included in the DSP of Motorola Corp. shift the data, by one bit, to the left or the right. Instead, the DSP of Motorola Corp. provides different multiplying command words for the fixed-point arithmetic and the integer arithmetic, thereby absorbing an operation which shifts the data, by 16 bits, to the left in the TMS320C5x of Texas Instrument Co. Ltd.
Accordingly, it is an object of the present invention to provide a digital signal processor that includes an ALU and an accumulating register having a short wordlength.
Further object of the present invention is to provide a digital signal processor that does not waste an additional clock period at the time of rounding of data.
Another object of the present invention is to provide a digital signal processor that is capable of effectively performing an integer arithmetic and/or a fixed-point arithmetic.
Still another object of the present invention is to provide a digital signal processor that is capable of performing an operation including a data scaling in a high speed.
Still another object of the present invention is to provide a digital signal processor that can eliminate a rounding procedure of data.
In order to achieve these and other objects of the invention, a digital signal processor according to an aspect of the present invention includes data input means for receiving a N-bit data; rounding bit adding means for adding r-bit rounding bits to the N-bit data from the data input means, wherein r is smaller than N; guard bit adding means for adding g-bit guard bits to the high-order bits of the data from the rounding bit adding means; means for operating the data from the guard bit adding means; and rounding/saturation means for making a rounding process, a saturation process and a combined process including the rounding and saturation processes for the data from the operating means.
A digital'signal processor according to still another aspect of the present invention includes data input means for receiving an N-bit data; an alignment unit for aligning the N-bit data from the data input means to the left of (N+r)-bit bus; means for operating the data from the (N+r)-bit bus; and deriving means for deriving only the high-order N bits from the data from the operating means.
A digital signal processor according to still another aspect of the present invention includes input means for receiving an N-bit data; means for operating the data from the input means and a data from a feedback loop; a memory connected to the feedback loop for temporarily storing the data from the operating means; means for scaling the data from the input means; and selecting means for selectively transferring the data from the scaling means and the data from the operating means to the memory.
A digital signal processor according to still another aspect of the present invention includes data input means for receiving a N-bit data; guard bit adding means for adding g-bit guard bits to the high-order bits of the Nbit data from the data input means; means for operating the data from the guard bit adding means; and saturating means for making a saturation process of the data in accordance with logical values of the high-order (g+1) bits of the data from the operating means.
For a better understanding of the present invention, embodiments will now be described, with reference to the accompanying drawings, in which: Fig. 1 is a view for schematically explaining an operation procedure in the conventional DSP of fixed-point arithmetic system ; Fig. 2 is a schematic view showing a configuration of a DSP according to an embodiment of the present invention; Fig. 3 is a view for explaining a fixed-point arithmetic procedure in which a rounding bit number is reduced to r smaller than a data bit number N by means of the rounding/saturation processor 86 shown in Fig. 2; Fig. 4 is a view for showing the part of performing the data rounding in Fig. 2; Fig. 5 is a block diagram for explaining the fixed-point arithmetic procedure and the integer arithmetic procedure in Fig. 2; Fig. 6 is a view for showing the part of performing an operation including the scaling in the DSP of Fig. 2; Fig. 7 is a schematic view showing a configuration of a DSP according to another embodiment of the present invention; and Fig. 8 is a schematic view for explaining a signal processing procedure in the DSP of Fig. 7.
Referring to Fig. 2, there is shown a DSP according to an embodiment of the present invention that includes a first register 30 for receiving N bit data from a first external bus 31, and first and second bit alignment units 32 and 34 that are connected, in parallel, to the first register 30.
The first external bus 31 is composed of N data lines commonly connected to a working memory (not shown), and which is used as an N bit first read-only data bus. The first register 30 temporarily stores the N bit data from the memory by way of the first external bus 31. The first and second bit alignment units 32 and 34 align the N bit data from the first register 30 at the left or right of (N+r) bit bus to thereby extend the data from N bit into (N+r) bit. More specifically, the first bit alignment unit 32 has a wiring that connects N bit output lines of the first register 30 to high-order N input terminals of the first input port consisting of (N+r) terminals of a first multiplexor 36. The second bit alignment unit 34 has a wiring that connects the N bit output lines of the first register 30 to low-order N input terminals of the second input port consisting of (N+r) terminals of the first multiplexor 36. A this time, the high-order r bits are occupied with sign bits (positive (+) or negative (-) bits), or a value of'0'in accordance with a sign-extension mode of the DSP. The first multiplexor 36 supplies (N+r) bit data at the first input port or (N+r) bit data at the second input port to a first guard bit adder 38 in accordance with a type of operation, that is, the fixedpoint/integer arithmetic. The first guard bit adder 38 adds the (N+r) bit data from the first multiplexor 36 to guard bits of g bit. All the g bit of guard bits are set to'0'or extends a sign of data. More specifically, all the g bit of guard bits have bits of logical value equal to the sign bit of (N+r) bit data when the sign-extension mode is set, and have bits of logical value of'0'when the sign-extension mode is reset. The sign-extension mode is set or reset under a specific command word. In order to add the g-bit guard bits to the (N+r) bit data, the first guard bit adder 38 has a wiring that connects the output port of the first multiplexor 36 consisting of (N+r) terminals to low-order (N+r) lines in (g+N+r) lines included in a first internal bus 35.
The DSP further includes a second register 40 and a multiplier 46 that are connected, in series, to the first external bus 31, and a second multiplexor 42 connected to the first and second external buses 31 and 33. The second register 40 is responsible for temporarily storing the N bit data from the memory received via the first external bus 31 like the first register 30. The second external bus 33 is composed of N lines commonly connected to a programmable memory and the second working memory, not shown, and which is used as the second read-only data bus.
The second multiplexor 42 supplies to a third register 44 any one of the N bit data from the first external bus 31 supplied to its first input port and the N bit data from the second external bus 33 supplied to its second input port. The third register 44 temporarily stores the N bit data from the second multiplexor 42. The multiplier 46 multiplies two data stored in the second and third registers 40 and 44. The result multiplied by the multiplier 46 has 2N bit. Accordingly, a fourth register 48 for temporarily storing the data from the multiplier 46 must have a length of 2N bit. A bit alignment/guard bit adder 50 connected between the fourth register 48 and a second internal bus 37 converts the 2N bit data stored in the fourth register 48 into (N+r) bit data (wherein 2N > N+r) and adds g-bit guard bits to the converted (N+r) bit data.
The DSP further includes third to fifth multiplexors 52 to 56 commonly connected to the first internal bus 35. The third multiplexor 52 has first to third input ports for receiving (g+N+r) bit data from the first, second and fourth internal buses 35,37 and 43, respectively. The third multiplexor 52 supplies any one of three data from the first, second and fourth internal buses 35,37 and 43 to a first ALU 58. The fourth multiplexor 54 includes a first input port 39 for receiving a logical value of'0', and second and third input ports for receiving the (g+N+r) bit data from the second and third internal buses 35 and 41. The first ALU 53 operates two data from the third and fourth multiplexors 52 and 54 and supplies the operated result to a sixth multiplexor 62. The fifth multiplexor 56 also supplies the (g+N+r) bit data from the first internal bus 35 and the (g+N+r) bit data from the third internal bus 41 to a barrel shifter 60 selectively. The barrel shifter 60 scales a logical value of the data from the fifth multiplexor 56 and supplies the scaled data to the sixth multiplexor 62. For such a data scaling, the barrel shifter 60 shifts the data from the fifth multiplexor 56, by the bit number corresponding to a scaling amount, to the left or the light. Also, the barrel shifter 60 is connected, in parallel, to the first ALU 58 to minimize a propagation delay time of data. Accordingly, the-DSP can perform the four arithmetical operations, the scaling, the four arithmetical operations including the scaling at a high speed. The sixth multiplexor 62 selectively supplies the four-arithmetical operated data of the g+N+r) bit from the first ALU 58 and the scaled data from the barrel shifter 60 to a fifth or sixth register 64 or 66. The data stored in the fifth or sixth register 64 or 66 is supplied to the third internal bus 41. The fifth and sixth registers 64 and 66 are accumulating registers, which configures a first accumulator along with the first ALU 58.
The fifth and sixth registers 64 and 66 have a length of (g+N+r) bit so as to temporarily store the (g+N+r) bit data, and the third internal bus 41 also consists of (g+N+r) lines.
The DSP further includes a seventh multiplexor 68 connected to the first and fourth internal buses 35 and 43, and an eighth multiplexor 70 connected to the first and second internal buses 35 and 37. The seventh multiplexor 68 includes a first input port 45 for receiving a logical value of'0', and second and third input ports for receiving the (g+N+r) bit data from the first and fourth internal buses 35 and 43, respectively. The seventh multiplexor 68 supplies any one of three data at its first to third input ports to a second ALU 72. The eighth multiplexor 70 has first and second input ports for receiving the (g+N+r) bit data from the first and second internal buses 35 and 37, respectively. The eighth multiplexor 70 supplies any one of two data from the first and second internal buses 35 and 37 to the second ALU 72.
The second ALU 72 operates two data from the seventh and eighth multiplexors 68 and 70 and supplies the operated result to a ninth multiplexor 74. The ninth multiplexor 74 selectively supplies the operated data of (g+N+r) bit from the second ALU 72 and the scaled data from the barrel shifter 60 to the seventh or eighth register 76 or 78. The seventh or eighth register 76 or 78 supplies the data from the ninth multiplexor 74 to the fourth internal bus 43.
The seventh or eighth register 76 or 78 is an accumulating register, which constitutes a second accumulator along with the second ALU 72. The second accumulator is connected, in parallel, to the first accumulator to make a parallel operation of a plurality of complex arithmetic equations. mach of the seventh and eighth registers 76 and 78 has a length of (g+N+r) bit so as to temporarily store the (g+N+r) bit data. Also, the seventh and eighth registers 76 and 78 allow a number of complex arithmetic equations to be operated at a high speed along with the fifth and sixth registers 64 and 66.
Furthermore, the DSP includes a tenth multiplexor 80 for selecting two (g+N+r) bit data from the third and fourth internal buses 41 and 43, and third and fourth bit alignment units 82 and 84 commonly connected to the tenth multiplexor 80, and a rounding/saturation processor 82.
The tenth multiplexor 80 commonly supplies to the third and fourth bit alignment units 82 and 84 and the rounding/saturation processor 86 any one of the (g+N+r) bit data, via the third internal bus 41, from the fifth or sixth register 64 or 66 and the (g+N+r) bit data, via the fourth internal bus 23, from the seventh or eighth register 76 or 78. The third and fourth bit alignment units 82 and 84 derive only N bits from the (g+N+r) bit data from the tenth multiplexor 80 and supplies the derived N bit data to the first and second input ports of the eleventh multiplexor 88. More specifically, the third bit alignment unit 82 derives only high-order (g+l) th bit to low-order N bit data in the (g+N+r) bit data from the tenth multiplexor 80 and supplies the derived N bit data to the first input port of the eleventh multiplexor 88. To this end, the third bit alignment unit 82 includes a wiring that connects N terminals from the high-order (g+l) th terminal, that is, the remaining N terminals except for the high-order g terminals and the low-order r terminals, in the (g+N+r) output terminals of the tenth multiplexor 80 to the first input port of the eleventh multiplexor. 88 consisting of N terminals. The-third bit alignment unit 82 converts the (g+N+r) bit data into N bit data only by means of the wring. The fourth bit alignment unit 84 derives only low-order N bit data in the (g+N+r) bit data from che tent. multiplexor 80 and supplies the derived N bit data to the second input port of the eleventh multiplexor 88. To this end, the fourth bit alignment unit 84 includes a wiring that connects the loworder N terminals, that is, the remaining N terminals except for the high-order (g+r) terminals, in the (g+N+r) output terminals of the tenth multiplexor 80 to the second input port of the eleventh multiplexor 88 consisting of N terminals. The fourth bit alignment unit 84 does not require a separate circuit block because it is configured with only the wiring along with the first to third bit alignment units 32,34 and 82. Accordingly, the first to fourth bit alignment units 32,34,82 and 84 are capable of simplifying the circuit configuration of DSP as well as performing the fixed-point and integer arithmetical operations in a high speed.
The rounding/saturation processor 86 is driven in a rounding process mode, a saturation process mode or a combined mode under a command from a controller (not shown).
In the rounding mode, the rounding/saturation processor 86 checks a logical value of"r"th bit in the (g+N+r) bit data from the tenth multiplexor 80. If the logical value of low-order"r"th bit is 1, the rounding/saturation processor 86 eliminates low-order r bits from the (g+N+r) data to generate a (g+N) bit data. The (g+N) bit data is added to 1 by means of the rounding/saturation processor 86. The g bits are removed from the added (g+N) bit data to generate N bit data. The rounding/saturation processor 86 applies the N bit data to the eleventh multiplexor 88.
Subsequently, in the saturation mode, the rounding/saturation processor 86 processes the data in accordance with the logical value of the high-order (g+1) bits from the (g+N+r) bit data from the tenth multiplexor 80. More specifically, the rounding/saturation processor 86 determines the generation of overflow in accordance with whether or not all logical values of the high-order (g+1) bits (i. e., g guard bits and one sign bit) in the (g+N+r) bit data from the tenth multiplexor 80 are same.
If so, the rounding/saturation processor 86 supplies the remaining N bit data except for the high-order g bit and the low-order r bit in the (g+N+r) bit data from the tenth multiplexor 80 to the third input port of the eleventh multiplexor 88 as the operated result. Otherwise, if not, the rounding/saturation processor 86 determines whether a logical value of most significant bit of g guard bits is '0'or'1'. If a logical value of the most significant bit is'0', then the rounding/saturation processor 86 considers the data from the tenth multiplexor 80 to be a positive (+) data provided with an overflow and supplies a maximum value of N bit data (i. e.,"0111---11"), in which only the most significant bit has a logical value of'0', to the third input port of the eleventh multiplexor 88. Otherwise, if a logical value of the most significant bit is'1', the rounding/saturation processor 86 considers the data from the tenth multiplexor 80 to be a negative- (-) data provided with an overflow and supplies N bit data (i. e.,"1000 00"), in which only the most significant bit has a logical value of'1', to the third input port of the eleventh multiplexor 88. As described above, the saturation processor 86 accurately process a saturated logical value (i. e., a logical value generating an overflow) of the data operated by means of the ALU 58 or 72 based on logical values of guard bits and a sign bit. In the combination mode, the rounding/saturation processor 86 performs the rounding process for the (g+N+r) bit data from the tenth multiplexor 80 and then performs the saturation process for the rounded (g+N) bit data. The N bit data rounded and saturated using the rounding/saturation processor 86 in the above manner is applied to the eleventh multiplexor 88.
In this case, the rounding/saturation processor 86 first performs the rounding process for the low-order bits, but performs the rounding and saturation process within one clock period, so that it does not waste an additional rounding process time. Accordingly, the rounding/saturation processor 86 provides an advantage in that it can round the fixed-point operated data without wasting an additional clock. The eleventh multiplexor 88 transfers any one of three N bit data form the third and fourth bit alignment units 82 and 84 and the rounding/saturation processor 86 to the third external bus 47. The third external bus 47 consists of N lines including a write only data bus and a write only address bus.
Fig. 3 shows a fixed-point arithmetic procedure, in which the number r of rounding bit is smaller than the number N of data bit, using the rounding/saturation processor 86 in Fig. 2. In Fig. 3, first and second data 90 and 92 is sequentially read out of a memory (not shown) and each of which has a length of N bit. The most significant bit of each first and second data 90 and 92 includes a single sign bit indicating whether its logical values is positive number or negative number. The first and second data 90 and 92 are operated by the multiplier 46 shown in Fig. 2 to generate a third data 94 having a length of 2N bit.
This third data 94 is changed into a (g+N+r)-bit fifth data 98 by means of the bit alignment/guard bit adder 50 shown in Fig. 2, wherein g and r represent the number of guard'bits and the number of rounding bits, respectively.
The fifth data 98 is obtained by cutting away s-bit loworder bits from the fourth data 96 and which becomes to be a shape of input and output data of the ALUs 54 and 72.
Further, the fifth data 98 is temporarily stored in the accumulating register 64 or 76, or 66 or 78 having a length of (g+N+r) bit by way of the ALU 58 or 72. The fifth data 98 stored in the accumulating register 64 or 76, or 66 or 78 is converted into a sixth data 100 having a length of N bit by processing it using the roundinc/satur2tion processor 86 shown in Figs. : 3 and 4. In a rounding-off/saturation combined mode, the fifth data 98 is first converted into a (g-N) bit data by rounding off it and, thereafter, is saturated in a different manner in accordance with whether or not logical values of the high order (g+l) bits (i.e., g guard bits and one sign bit) are same and whether the most significant guard bit is'1'or '0', whereby it is finally converted into N bit'data. If logical values of the high-order (g+l) bits are same, then the remaining N bits except for the high-order g bits in the rounded-off (g+N) bit data are converted into a sixth data 100. If logical values of the high order (g+1) bits are not same and the most significant bit is'0', then the rounded-off (g+N) bit data is regarded as a positive (+) data generating an overflow and converted into a N-bit sixth data 100, i. e.,"0111---11"in which the most significant bit only has a logical value of'0'. Otherwise, if logical values of the high-order (g+1) bits are not same and the most significant bit is'1', then the roundedoff (g+N) bit data is regarded as a negative (-) data generating an overflow and converted into a N-bit sixth data 100, i. e.,"IOOO 00"in which the most significant bit only has a logical value of'1'. In such a procedure, the saturation process is omitted in the rounding-off mode while the rounding-off mode is omitted in the saturation mode. The sixth data 100 calculated finally in accordance with this operation procedure has a very approximate value including an error minimized as much as possible.
Fig. 4 shows a part of the DSP in Fig. 2 that performs the rounding of data. In Fig. 4, a (g+N+r)-bit first data D1 is operated by means of the ALU 58 or 72 and, thereafter, temporarily stored in any one of the accumulating registers 64,66,76 and 78. In the first data D1, g represents the number of guard bits, N is the number of data bits stored in the memory (not shown) and r is the number of rounding bits. The third bit alignment unit 82 converts the first data D1 into an N-bit second data D2 with the aid of the wiring as described with reference to Fig. 2 without an additional processing time. The rounding/saturation processor 86 rounds the first data D1 and then saturates the rounded data as described with reference to Fig. 2, thereby converting it the N-bit second data D2. The eleventh multiplexor 88 selects an output data of the rounding/saturation processor 86 and an output data of the third bit alignment unit 32 in accordance with whether any one of a passing mode, the rounding process mode, the saturation process mode and the combined mode is selected by a command from a controller (not shown). More specifically, the eleventh multiplexor 88 selects. the output data of the rounding/saturation processor 86 when one of the rounding process, saturation process and combined modes is selected. On the other hand, when the pass mode is selected, the eleventh multiplexor 88 selects an output data of the third bit alignment unit 82. By means of the rounding/saturation processor 86, the third bit alignment unit 82 and the eleventh multiplexor 88, the DSP does not waste an additional time (i. e., a clock period) and hence process the data at a high speed.
Fig. 5 is a block diagram for explaining the fixed-point arithmetic process and the integer arithmetic process in Fig. 2. In Fig. 5, the register 3C stores the N bit data from the memory (not shown) temporarily. The first bit alignment unit 32 aligns the N bit data from the register 30 to the left of the (N+r) bit bus with the aid of the wiring. Likewise, the second bit alignment unit 34 aligns the N bit data from the register 30 to the right of the (N+r) bit bus with the aid of the wiring. Then, the logical value of high-order r bits is determined according to the sign extension mode. That is, if the sign extension mode is set up, all the high-order r bits have a sign bit of the N bit data. On the contrary, all the high-order r bits have"0"when the sign extension mode is reset. The accumulator 102 receives the (N+r) bit data from the first bit alignment unit 32 to perform a fixed-point arithmetic, or receives the (N+r) bit data from the second bit alignment unit 34 to perform an integer arithmetic. The accumulator 102 is provided with the ALU 58 or 72 and the registers 64 and 66, or the registers 76 and 78. The third bit alignment unit 82 transfers only the high-order N bits in the (N+r) bit data from the accumulator 102 to the memory with the aid of the wiring structure. Accordingly, the first and third bit alignment units 32 and 82 align the data to the left by means of the wiring, thereby performing the fixed-point arithmetic at a high speed. The fourth bit alignment unit 84 transfers only the low-order N bits in the (N+r) bit data from the accumulator 102 to the memory with the aid of the wiring structure.
Accordingly, the second and fourth bit alignment units 34 and 84 align the data to the right by means of the wiring, thereby performing the integer arithmetic at a high speed.
By aligning the data with the aid of the wiring as described above, the DSP does not require an additional circuit block such as shifter and allows the fast fixedpoint and integer arithmetics.
Fig. 6 shows a circuit configuration for performing a adding/subtracting arithmetical operation including a scaling in the DSP of Fig. 2. In Fig. 6, the barrel shifter 60 is coupled to the first ALU 58 in parallel and commonly connected to two accumulating registers 64 and 76.
When the adding/subtracting arithmetical operation includes scaling, the accumulating register 76 temporally stores data scaled previously by the barrel shifter 60.
The data stored in the accumulating register 76 is applied to the first ALU 58 to be performed by the adding/subtracting arithmetical operation, during the next cycle. A result of the adding/subtracting arithmetical operation is stored in the accumulating regist-er 64. The circuitry structure as described above has a critical path shorter than DSPs in TMS320C5x series which are developed by Texas Instrument Co. Ltd. and have a serial circuit of the barrel shifter and ALU. Furthermore, the circuitry structure is effective rather than DSPs having only one register and a parallel circuit of the barrel shifter and ALU. That. is, the circuit structure consumes cycles smaller than other DSPs. More specifically, the data progresses from the first external bus 31, via the first register 30, the third multiplexor 52, the first ALU 58, the sixth multiplexor 62, the fifth register 64 and the eleventh multiplexor 88 in turn, to the third external bus 47; from the first external bus 31, via the first register 30, the fifth multiplexor 56, the barrel shifter 60, the sixth multiplexor 62, the fifth register 64 and the eleventh multiplexor 88 in turn, to the third external bus 47; or from the first external bus 31, via the first register 30, the fifth multiplexor 56, the barrel shifter 60, the seventh register 76 and the eleventh multiplexor 88 in turn, to the third external bus 47. The barrel shifter 60 is connected, in parallel, to the first ALU 58, so that the DSP is capable of minimizing the propagation delay time. Also, the number of clock required for the operation can be minimized because two accumulating registers are provided to one ALU. Accordingly, the DSP can perform the adding/subtracting arithmetical operation, the scaling, and the adding/subtracting arithmetical operation including the scaling at a high speed.
Referring to Fig. 7, there is shown a DSP according to another embodiment of the present invention. The DSP shown in Fig. 7, is most suitable for performing"AC-3"suggested as an audio compression and reconstruction algorithm by Dolby Co. Ltd. More specifically, the DSP of Fig. 7 performs effectively signal processing steps for recovering a pulse code modulated signal from a signal compressed by means of the AC-3. An accumulating register and ALU included in the DSP of Fig. 7 have a word length shorter than those in prior DSPs. Actually, if the word length of data is N, the DSP according to another embodiment of present invention requires a accumulating register and ALU each having (g+N) bit length. That is, the number of rounding bits is established as"0"so that the effect of the DSP having a (g+N+r) bit ALU and accumulating register is maximized. In the DSP including the (g+N) bit ALU and accumulating register, an FFT coefficient storing and butterfly arithmetical operation must be appropriately performed to minimize arithmetical errors generated at the arithmetical operation. This results from that the arithmetical errors are large or small in accordance with the selection of the storing and operating methods. To select the storing and operating methods appropriately, it is identified whether the errors are offset or not, through an analysis of the fixed-point arithmetical operating steps. The process of selecting the storing and operating methods will be described in the following.
Return to Fig. 7, the DSP according to another embodiment of the present invention includes a first register 130 for receiving N bit data from a first external bus 131, and a guard bit adder 132 connected to the first register 130.
The first external, bus 131 is composed of N data lines commonly connected to a working memory (not shown), and which is used as an N-bit first read-only data bus. The first register 130 temporarily stores the N bit data from the memory by way of the first external bus 131. The guard bit adder 132 adds g-bit guard bits to the N bit data from the first register 130. The g-bit guard bits are set to'0' or extends a sign of data. More specifically, all the gbit guard bits have a logical value identical to the sign bit of data when the sign-extension mode is set, and have a logical value of'0'when the sign-extension mode is reset. In order to add the g-bit guard bits to the N bit data, the guard bit adder 132 has a wiring that connects the output port of the first register 130 consisting of N terminals to the low-order N lines in (g+N) lines included in a first internal bus 135.
The DSP further includes a second register 136 and a multiplier 140 that are connected, in series, to the first external bus 131, and a first multiplexor 134 connected to the first and second external buses 131 and 133. The second register 136 is responsible for temporarily storing the N bit data from the memory received via the first external bus 131 like the first register 130. The second external bus 133 is composed of N lines commonly connected to a programmable memory and the second working memory, not shown, and which is used as the second read-only data bus. In the programmable memory, a cosine constant and a sine constant for the IFFT are stored as the same sign.
This prevents a generation of error in the reverse transform. The first multiplexor 134 supplies to a third register 138 any one of the N bit data from the first external bus 131 supplied to its first input port-and the N bit data from the second external bus 133 supplied to its second input port. The third register 138 temporarily stores the N bit data from the first multiplexor 134. The multiplier 140 multiplies two data stored in the second and third registers 136 and 138. The result multiplied by the multiplier 46 may have 2N bit. Accordingly, a fourth register 142 for temporarily storing the data from the multiplier 140 has a length of 2N bit. A bit controller 144 connected between the fourth register 142 and a second internal bus 137 converts the 2N bit data stored in the fourth register 142 into N bit data and adds g-bit guard bits to the converted (N+r) bit data. To this end, the bit controller 144 has a wiring that connects the output port of the fourth register 142 consisting of N terminals to the low-order N lines in the (g+N) lines included in the second internal bus 137.
The DSP further includes second to fourth multiplexors 146 to 150 commonly connected to the first internal bus 135.
The second multiplexor 146 has first to third input ports for receiving a (g+N) bit data from the first, second and fourth internal buses 135,137 and 143, respectively. The second multiplexor 146 supplies any one of three data from the first, second and fourth internal buses 135,137 and 143 to a first ALU 152. The third multiplexor 148 includes a first input port 139 for receiving a logical value ouf'0', and second and third input ports for receiving the (g+N) bit data from the first and third internal buses 135 and 141. The third multiplexor 148 supplies any one of three data at its first to third input ports to the first ALU 152. The first ALU 152 operates two data from the second and third multiplexors 146 and 148 and supplies the operated result to a fifth multiplexor 156. The-fourth multiplexor 150 also supplies the (g+N) bit data from the first internal bus 135 and the (g+N) bit data from the third internal bus 141 to a barrel shifter 154 selectively.
The barrel shifter 154 scales a logical value of the data from the fourth multiplexor 150 and supplies the scaled data to the fifth multiplexor 156. For such a data scaling, the barrel shifter 154 shifts the data from the fourth multiplexor 150, by the bit number corresponding to a scaling amount, to the left or the right. Also, the barrel shifter 154 is connected, in parallel, to the first ALU 152 to minimize a propagation delay time-of data.
Accordingly, the DSP can perform the four arithmetical operations, the scaling, and the four arithmetical operations including the scaling at a high speed. The fifth multiplexor 156 selectively supplies the fourarithmetical operated data of the (g+N) bit from the first ALU 152 and the scaled data from the barrel shifter 154 to a fifth or sixth register 158 or 160. The data stored in the fifth or sixth register 158 or 160 is supplied to the third internal bus 141. The fifth and sixth registers 158 and 160 are accumulating registers, which configures a first accumulator along with the first ALU 152. The fifth and sixth registers 158 and 160 have a length of (g+N) bit so as to temporarily store the (g+N) bit data, and the third internal bus 141 also consists of (g+N) lines.
The DSP further includes a sixth multiplexor 162 connected to the first and fourth internal buses 135 and 143, and a seventh multiplexor 164 connected to the first and second internal buses 135 and 137. The sixth multiplexor 162 includes a first input port 145 for receiving a logical value of'0', and second and third input ports for receiving the (g+N) bit data from the first and fourth internal buses 135 and 143, respectively. The sixth multiplexor 162 supplies any one of three data at its first to third input ports to a second ALU 166. The seventh multiplexor 164 has first and second input ports for receiving the (g+N) data form the first and second internal buses 135 and 137, respectively. The seventh multiplexor 164 supplies any one of two data from the first and second internal buses 135 and 137 to the second ALU 166. The second ALU 166 operates two data from the sixth and seventh multiplexors 162 and 164 and supplies the operated result to an eighth multiplexor 168. The eighth multiplexor 168 selectively supplies the operated data of (g+N) bit from the second ALU 166 and the scaled data from the barrel shifter 154 to the seventh or eighth register 170 or 172. The seventh or eighth register 170 or 172 supplies the data from the eighth multiplexor 168 to the fourth internal bus 143. The seventh or eighth register 170 or 172 is an accumulating register, which constitutes a second accumulator along with the second ALU 166. The second accumulator is connected, in parallel, to the first accumulator to make a parallel operation of at least two complex arithmetic equations. Each of the seventh and eighth registers 170 and 172 has a length of (g+N) bit so as to temporarily store the (g+N) bit data.
Also, the seventh and eighth registers 170 and 172 allow a number of complex arithmetic equations to be operated at a 'high speed along with the fifth and sixth registers 153 and 160.
Furthermore, the DSP includes a ninth multiplexor 174 for selecting two (g+N) bit data from the third and fourth internal buses 141 and 143, and a bit alignment units 176 and a saturation processor 178 commonly connected~ to the ninth multiplexor 174. The ninth multiplexor 174 commonly supplies to the bit alignment units 176 and the saturation processor 178 any one of the (g+N) bit data, via the third internal bus 41, from the fifth or sixth register 158 or 160 and the (g+N) bit data, via the fourth internal bus 123, from the seventh or eighth register 170 or 172. The bit alignment units 176 derive only N bits from the (g+N) bit data from the ninth multiplexor 174 and supplies the derived N bit data to the first input port of the eleventh multiplexor 180. More specifically, the bit alignment unit 176 derives. only the low-order N bit data in the (g+N) bit data from the ninth multiplexor 174 and supplies the derived N bit data to the first input port of the tenth multiplexor 180. To this end, the bit alignment unit 176 includes a wiring that connects the low-order N terminals, that is, the remaining N terminals except for the highorder g terminals in the (g+N) output terminals of the ninth multiplexor 174 to the first input port of the tenth multiplexor 180 consisting of N terminals. The bit alignment unit 176 does not require an additional circuit block because it is configured with the wiring only.
Accordingly, the bit alignment unit 176 is capable of simplifying the circuit configuration of the DSP and allows the fixed-point arithmetic to be performed at a high speed. The saturation processor 178 performs a saturation process under a command from a controller (not shown). The saturation processor 178 processes the data in accordance with logical values of the high-order (g+1) bits in the (g+N) bit data from the ninth multiplexor 174.
More specifically, the saturation processor 178 determines the generation of overflow in accordance with whether or not all logical values of the high-order (g+1) bits (i. e., g guard bits and one sign bit) in the (g+N) bit data from the ninth multiplexor 174 are same. If so, the saturation processor 178 supplies the low-order N bit data in the (g+N) bit data from the ninth multiplexor 174 to the second input port of the tenth multiplexor. 180 as the operated result. Otherwise, if not, the saturation processor 178 determines whether a logical value of the most significant bit of g guard bits is'0'or'1'. If a logical value of the most significant bit is'0', then the saturation processor 178 considers the data from the ninth multiplexor 174 to be a positive (+) data provided with an overflow and supplies a maximum value of N bit data (i. e., "Olllwll"), in which only the most significant bit has a logical value of'0', to the second input port of the tenth multiplexor 180. Otherwise, if a logical value of the most significant bit is'1', the saturation processor 178 considers the data from the ninth multiplexor 174 to be a negative (-) data provided with an overflow and supplies N bit datafi. e.,"1000 00"), in which only the most significant bit has a logical value of'1', to the second input port of the tenth multiplexor 180. As described above, the saturation processor 178 accurately process a saturated logical value (i. e., a logical value generating an overflow) of the data operated by means of the ALU 152 or 166 based on logical values of guard bits and a sign bit. As described above, in the data output from the ninth multiplexor 174, the saturation process only is performed, but the rounding process is not processed. This result in an improvement in the data operation speed. Th tenth multiplexor 180 transfers any one of two N bit data from the bit alignment unit 176 and the saturation processor 178 to the third external bus 147. The third external bus 147 consists of N lines including a write data bus and a write address bus.
Fig. 8 shows a fixed-point arithmetic procedure performed by means of the DSP in Fig. 7. In Fig. 8, first and second data 190 and 192 are sequentially read out of a memory (not shown) and each of which has a length of N bit. The most significant bit of each first and second data 190 and 192 includes a single sign bit indicating whether its logical values is positive number or negative number. The first and second data 190 and 192 are operated by the multiplier 140 shown in Fig. 5 to generate a third data 194 having a length of 2N bit. This third data 194 is shift by a predetermined bits when it is transferred from the multiplier 140 to the fourth register 142 and is converted into a fourth data 196. The fourth data 196 is converted into a fifth data 198 having a wordlength of (g+N) when it is transferred, via the bit controller 144 in Fig. 10, to the ALU 152 or 166. At this time, the bit controller 144 cuts away the low-order N bits of the fourth data 196 and adds g-bit guard bits to the remaining high-order bits of the N bits of the fourth data 196. The fifth data 198 is operated by means of the ALUs 152 and 166 and then temporarily stored in the accumulating register 158 or 170, or 160 or 172 having a length of (g+N) bit. The fifth data 198 stored in the accumulating register 158 or 170, or 160 or 172 is converted into a sixth data 200 having a length of N bit by processing it using the saturation processor 178 shown in Fig. 7. At this time, the fifth data 198 is processed in a different marner in accordance with whether or not logical values of the high-order (g+1) bits (i. e., g guard bits and one sign bit) are same and whether the most significant guard bit is'1'or'0', and it is. finally converted into N bit data. If logical values of the highorder (g+1) bits are same, then the fifth data 198 is converted into the sixth data 200 having the remaining N bits except for the high-order g bits in the (g+N) bits.
If logical values of the high order (g+1) bits are not same and the most significant bit is'0', then the fifth data 198 is regarded as a positive (+) data generating an overflow and converted into a N-bit sixth data 200, i. e., "0111 11"in which the most significant bit only has a logical value of'0'. Otherwise, if logical values of the high-order. (g+1) bits are not same and the most significant bit is'1', then the fifth data is regarded as a negative (-) data generating an overflow and converted into a N-bit sixth data 200, i. e.,"1000 00"in which the most significant bit only has a logical value of'1'. As described above, in the DSP according to the present embodiment the lengths of ALU and accumulating register are reduced to (g+N) bits to thereby reduce the circuitry size. As a result, the response speed of the DSP according to the present embodiment becomes fast. It will be proved from an analysis of the quantization error generated in the operation procedure that the extent of error in the DSP according to an embodiment of the present invention is minimized.
First, it is assumed that a data cut-away occurs in a process in which two data (a, b) are multiplied by means of the multiplier in Fig. 7 and transferred from the multiplier 140 to the ALU 152 or 166. The quantization error generated in this process can be formulated into the following equations: Tr (aXb) = a X b + et Tr' (ax (-b)) =-Tr (aX (-b)) Tr' (a#b) = a # b -e@ .... (1) When applying the equations (1) to the IFFT algorithm having a block length of 8, an error in the IFFT output has two types of distribution. In the first distribution, when maintaining original signs of the cosine and sine constants or inverting two types of constant sign, the average value of the quantization error in the IFFT output can be given as the following expression: { (0, 6m=), (0, 2mc), (0, 2me), (0,-2me), (0, 2 ! ne), (0,-2me), (0,-2me), (0,- 2m.), ),(0,-6me)} In the second distribution, when a sign in any one of two types of constant is inverted, the average value of the quantization error in the IFFT output can be given as the following expression: { (6me, 0), (2me, 0), (2me, 0), (-2me. 0), (2me, 0), (-2me, 0), (-2me, 0), (- 2me, 0), (-6me, 0)} (3) All the quantization error in the formula (2) is positioned at the imaginary number part ; while all the quantization error in the formula (3) is positioned at the real number part. The above-mentioned distribution of the quantization error becomes different by the post-IFFT process. Prior to explaining this, the post-IFFT process will be briefly described. In the post-IFFT process, complex number samples in which the complex number IFFT step is performed is re-distributed in accordance with the corresponding block length. In other words, when the block length is 128, the post-I. FFT can be formulated into the following equation: for (n=0 ; n < 128; n++) {y [n] = (Re (z [n]) * xcosl [n]-Im (z [n] * xsinl [n]) + j (Im (z [n]) * xcosl [n] + Re (z [n]) * xcosl [n]) (4) In the equation (4), xcosl [n] is-cos (2r (8n +1)/ (32128)) and xsinl [n] is sin (2r (8n +1)/ (32-128)). As seen from the equation (4), because the quantization error in the complex number IFFT output is distributed in the imaginary number part in the first case, the quantization error having a different sign is distributed in the real number part and the imaginary part at the post-IFFT output.
On the other hand, because the quantization error in the complex number IFFT output is distributed at the real number part in the second case, the quantization error having the same sign is distributed in the real number part and the imaginary part at the pcst-IFFT output. The cuantization error in the first case is canceled txi a window operation followed by the post-IFFT ; while the quantization error in the second case exists eve when the window operation followed by the pcst-IFFT is performed.
Therefore, since the cosine and sine constants for the IFFT are stored in the memory (i. e., the programmable memory or the working memory) as the same sign, quantization errors due to the data cut-away in the DSP according to an embodiment of the present invention are canceled each other.
As described above, the DSP according to the present embodiment controls the number of data bits (i. e., the wordlength) supplied to the ALU with the aid of the wiring and allows the number of rounding bits (i. e., r) to be smaller than N, so that it can shorten the length of the ALU and the accumulating register. If the number of the rounding bits is"0", it can further shorten the length of the ALU and the accumulating register. Also, in the DSP according to the present embodiment, the rounding of data is simultaneously performed at the time of the saturation process of data to thereby eliminate a clock required for the rounding process of data. Accordingly, the DSP according to the present embodiment can perform a signal process requiring the rounding, at a high speed.
Further, in the DSP according to the present embodiment, the bit alignment units implemented with the wiring are arrange at the pre-stage or the post-stage of the accumulator, thereby performing the fixed-point and integer arithmetics in a high speed as well as simplifying the circuit configuration.
Furthermore, the DSP according to the present embodiment. is capable of minimizing the propagation delay time as well as reducing the number of clocks required for the operation by connecting the barrel shifter to the ALU in parallel. Accordingly, the DSP according to the present embodiment. can perform the operation, the scaling and the operation including the scaling at a high speed.
Furthermore, the DSP according to the present embodiment makes a parallel operation of two complex arithmetical equations using a pair of ALUs connected in parallel, thereby operating a number of complex arithmetical equations at a high speed.
Although the present invention has been explained by the embodiments shown in the drawings described above, it should be understood to the ordinary skilled person in the art that the invention is not limited to the embodiments, but rather than that various changes or modifications thereof are possible without departing from the spirit of the invention. Accordingly, the scope of the invention shall be determined only by the appended claims and their equivalents.
The present application is a divisional application derived from patent application number 9818919. 4 (GB 2 330 226 A). The disclosure of GB 2 330 226 as published is incorporated herein by reference, including the text of the claims as originally filed.

Claims (11)

  1. Claims: 1. A digital signal processor, comprising: data input means for receiving an N-bit data; an alignment unit for aligning the N-bit data from the data input means to the left of a (N+r)-bit bus; means for operating the data from the alignment unit through the (N+r)-bit bus; and deriving means for deriving only the high-order N bits from the data from the operating means.
  2. 2. The digital signal processor according to claim 1, wherein: guard bit adding means is provided for adding g-bit guard bits to the high-order bits of the data from the alignment unit; said means for operating data operates the data from the guard bit adding means; said deriving means derives low-order N bits beginning at the high-order (g+1) numbered bit from the data from the operating means; rounding/saturation means is provided for performing a rounding process, saturation process and a combined process including the rounding and saturation processes for the data from the operating means; and selecting means for selecting the data from any one of the deriving means and the rounding/saturating means is provided.
  3. 3. A digital signal processor according to claim 2, wherein: said guard bit adding means adds g-bit guard bits to the high-order bits of the data from the alignment unit through the (N+r)-bit bus; said means for operating data operates the data from the guard bit adding means and a data from a feedback loop; a memory is connected to the feedback loop for temporarily storing the data from the operating means; means for scaling the data from the guard bit adding means is provided; further selecting means is provided for selectively transferring the data from the scaling means and the data from the operating means to the memory; said means for deriving the low-order N bits beginning at the high-order (g+1) numbered bit derives the bits from the data from the memory; and said rounding/saturation means processes data from the memory.
  4. 4. A digital signal processor, comprising: data input means for receiving an N-bit data; an alignment unit for aligning the N-bit data from the data input means to the right of a (N+r)-bit bus; means for operating the data from the alignment unit through the (N+r)-bit bus; and deriving means for deriving only the low-order N bits from the data from the operating means.
  5. 5. A digital signal processor according to claim 4, wherein: guard bit adding means is provided for adding g-bit guard bits to the high-order bits of the data from the alignment unit; said means for operating data operates the data from the guard bit adding means; rounding/saturation means is provided for making a rounding process, a saturation process and a combined process including the rounding and saturation processes for the data from the operating means; and selecting means for selecting the data from any one of the deriving means and the rounding/saturating means is provided.
  6. 6. A digital processor according to claim 5, wherein: said guard bit adding means adds g-bit guard bits to the high-order bits of the data from the (N+r)-bit bus; said means for operating data operates the data from the guard bit adding means and a data from a feedback loop; a memory is connected to the feedback loop for temporarily storing the data from the operating means; means for scaling the data from the guard bit adding means is provided; further selecting means is provided for selectively transferring the data from the scaling means and the data from the operating means to the memory; said means for deriving the low-order N bits derives the bits from the data from the memory; and said rounding/saturation means processes data from the memory.
  7. 7. A digital signal processor according to claim 1, wherein: a further alignment unit is provided for aligning the N-bit data from the data input means to the right of a further (N+r)-bit bus; first selecting means is provided for selecting the data from any one of said alignment unit or said further alignment unit; said means for operating data operates the data from the first selecting means; further deriving means is provided for deriving only the low-order N bits from the data from the operating means; and second selecting means is provided for selecting the data from any one of said deriving means and said further deriving means.
  8. 8. A digital signal processor according to claim 7, wherein: said first selecting means selects the data from any one of said (N+r)-bit bus or said further (N+r)-bit bus; guard bit adding means is provided for adding g-bit guard bits to the high-order bits of the data from the first selecting means; said means for operating data operates the data from the guard bit adding means; said deriving means derives the low-order N bits beginning at the high-order (g+1) numbered bit from the data from the operating means; rounding/saturation means is provided for making a rounding process, a saturation process and a combined process including the rounding and saturation processes for the data from the operating means; and said second selecting means selects the data from any one of said deriving means, said further deriving means and the rounding/saturating means.
  9. 9. A digital signal processor according to claim 8, wherein: said means for operating data operates the data from the guard bit adding means and a data from a feedback loop; a memory is connected to the feedback loop for temporarily storing the data from the operating means; means for scaling the data from the guard bit adding means is provided; third selecting means is provided for selectively transferring the data from the scaling means and the data from the operating means to the memory; said second deriving means derives the low-order N bits beginning at the high-order (g+1) numbered bit from the data from the memory; said further deriving means for deriving the loworder N bits from the data from the memory; and said rounding/saturation means processes data from the memory.
  10. 10. The digital signal processor as claimed in any one of claims 1 to 9, wherein a said alignment unit is implemented with a wiring.
  11. 11. The digital signal processor as claimed in any one of claims l to 10, wherein a said deriving means is implemented with a wiring.
GB0007909A 1997-08-30 1998-08-28 Digital signal processor Expired - Fee Related GB2345562B (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
KR1019970045342A KR100315303B1 (en) 1997-08-30 1997-08-30 Digital signal processor
KR1019970045345A KR100246472B1 (en) 1997-08-30 1997-08-30 Digital signal processor
KR1019970045341A KR100297544B1 (en) 1997-08-30 1997-08-30 Digital signal processor
KR1019970045343A KR100251546B1 (en) 1997-08-30 1997-08-30 Digital signal processor
KR1019980012318A KR100271571B1 (en) 1998-04-07 1998-04-07 Digital signal processor
GB9818919A GB2330226B (en) 1997-08-30 1998-08-28 Digital signal processor

Publications (3)

Publication Number Publication Date
GB0007909D0 GB0007909D0 (en) 2000-05-17
GB2345562A true GB2345562A (en) 2000-07-12
GB2345562B GB2345562B (en) 2000-12-27

Family

ID=27547317

Family Applications (2)

Application Number Title Priority Date Filing Date
GB0007912A Expired - Fee Related GB2345563B (en) 1997-08-30 1998-08-28 Digital signal processor
GB0007909A Expired - Fee Related GB2345562B (en) 1997-08-30 1998-08-28 Digital signal processor

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GB0007912A Expired - Fee Related GB2345563B (en) 1997-08-30 1998-08-28 Digital signal processor

Country Status (1)

Country Link
GB (2) GB2345563B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2411975A (en) * 2003-12-09 2005-09-14 Advanced Risc Mach Ltd Arithmetic data processing apparatus for returning the high half of a result after data processing
GB2376773B (en) * 2001-03-12 2005-09-21 Touch Technologies Inc Apparatus and method for display and/or precision operations of numerical values in binary systems
US9557994B2 (en) 2004-07-13 2017-01-31 Arm Limited Data processing apparatus and method for performing N-way interleaving and de-interleaving operations where N is an odd plural number

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0534760A2 (en) * 1991-09-26 1993-03-31 International Business Machines Corporation High speed multiplier device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4760544A (en) * 1986-06-20 1988-07-26 Plessey Overseas Limited Arithmetic logic and shift device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0534760A2 (en) * 1991-09-26 1993-03-31 International Business Machines Corporation High speed multiplier device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2376773B (en) * 2001-03-12 2005-09-21 Touch Technologies Inc Apparatus and method for display and/or precision operations of numerical values in binary systems
GB2411975A (en) * 2003-12-09 2005-09-14 Advanced Risc Mach Ltd Arithmetic data processing apparatus for returning the high half of a result after data processing
GB2411975B (en) * 2003-12-09 2006-10-04 Advanced Risc Mach Ltd Data processing apparatus and method for performing arithmetic operations in SIMD data processing
US7761693B2 (en) 2003-12-09 2010-07-20 Arm Limited Data processing apparatus and method for performing arithmetic operations in SIMD data processing
US9557994B2 (en) 2004-07-13 2017-01-31 Arm Limited Data processing apparatus and method for performing N-way interleaving and de-interleaving operations where N is an odd plural number

Also Published As

Publication number Publication date
GB2345562B (en) 2000-12-27
GB2345563B (en) 2000-12-27
GB2345563A (en) 2000-07-12
GB0007912D0 (en) 2000-05-17
GB0007909D0 (en) 2000-05-17

Similar Documents

Publication Publication Date Title
US6209017B1 (en) High speed digital signal processor
US4754421A (en) Multiple precision multiplication device
US6009451A (en) Method for generating barrel shifter result flags directly from input data
US6233597B1 (en) Computing apparatus for double-precision multiplication
US4700324A (en) Digital circuit performing an arithmetic operation with an overflow
US6115729A (en) Floating point multiply-accumulate unit
US6463451B2 (en) High speed digital signal processor
US20140136588A1 (en) Method and apparatus for multiplying binary operands
US5402368A (en) Computing unit and digital signal processor using the same
GB2345562A (en) Digital signal processor for performing fixed-point and/or integer arithmetic
KR100481586B1 (en) Apparatus for modular multiplication
JPH10503311A (en) Galois field polynomial multiply / divide circuit and digital signal processor incorporating the same
US5065353A (en) Adder control method and adder control circuit
US7047271B2 (en) DSP execution unit for efficient alternate modes for processing multiple data sizes
KR100315303B1 (en) Digital signal processor
JPH02205923A (en) Arithmetic processor
KR0175974B1 (en) Processor for selectively performing multiplaying dividing operation s based on pipeline structure
KR100251547B1 (en) Digital signal processor
US5381380A (en) Divide circuit having high-speed operating capability
KR100246472B1 (en) Digital signal processor
KR100297544B1 (en) Digital signal processor
KR100251546B1 (en) Digital signal processor
JP2951685B2 (en) Fixed-point arithmetic unit
JP3137131B2 (en) Floating point multiplier and multiplication method
KR100451712B1 (en) Method and apparatus for multiplication

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20020828