JPH1091614A

JPH1091614A - Method for converting idct into integer

Info

Publication number: JPH1091614A
Application number: JP8242926A
Authority: JP
Inventors: Tomoko Ide; 知子井手
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1996-09-13
Filing date: 1996-09-13
Publication date: 1998-04-10

Abstract

PROBLEM TO BE SOLVED: To execute Arai's algorithm (Trans IEICE, E71, pp. 1095-1097, Nov '88) by 16-bit integer operation. SOLUTION: In the row algorithm (a) of 8×8 IDCT, a carry-down number is determined in accordance with the input values of all WAZANKIs (calculator for old Japanese mathematics), and when carry-down is required, WAZAN (old Japanese mathematics) is executed after the carry-down and the carry-down value of a result is determined in accordance with the input values of integrators 101 to 104. An integrator 105 executes normal integration. In column algorithm (b), d6, d13, d14, d24, and d31 are found out by WAZAN followed by input judgement, and in other cases, normal WAZAN is executed. Integrators 106 to 109 similarly execute integration followed by input judgement and an integrator 110 executes normal integration. Consequently the method can be applied to a processor for executing the parallel processing of 16-bit unit. When the number of operation bits is reduced, an area for forming an exclusive LSI can be reduced.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ＩＤＣＴ演算用の
回路におけるＩＤＣＴの整数化法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for converting an IDCT into an integer in an IDCT operation circuit.

【０００２】[0002]

【従来の技術】従来の８点ＩＤＣＴ演算方法（Ｙ．Ａｒ
ａｉ他，ＴｒａｎｓＩＥＩＣＥ，Ｅ７１，ｐｐ．１０
９５−１０９７，Ｎｏｖ‘８８）を図２に示す。2. Description of the Related Art A conventional 8-point IDCT calculation method (Y.Ar
Ai et al., Trans IEICE, E71, pp. 10
95-1097, Nov'88) is shown in FIG.

【０００３】図２（ａ）において、プリスケーリング部
２０１では、ＤＣＴ係数ｘ（ｉ）と定数ｃ（ｉ）の積算
を行いｐ（ｉ）を得、これを演算部２０２に入力する。
ここで、ｉを０〜７の整数とし、ｃ（ｉ）は（数１）で
定義された実数である。In FIG. 2A, a prescaling section 201 multiplies a DCT coefficient x (i) and a constant c (i) to obtain p (i), and inputs this to an arithmetic section 202.
Here, i is an integer from 0 to 7, and c (i) is a real number defined by (Equation 1).

【０００４】[0004]

【数１】 (Equation 1)

【０００５】（数１）において、ＳＱＲＴ（２）を２の
平方根とする。In equation (1), SQRT (2) is a square root of 2.

【０００６】演算部２０２の詳細を図２（ｂ）に示す。
ここで、黒丸は和算器、四角形は中に記述した定数との
積算を行う積算器を示す。ｂ１〜ｂ５は（数２）〜（数
６）で定義された定数である。FIG. 2B shows the details of the arithmetic unit 202.
Here, black circles indicate adders, and squares indicate integrators that perform integration with constants described therein. b1 to b5 are constants defined by (Equation 2) to (Equation 6).

【０００７】[0007]

【数２】 (Equation 2)

【０００８】[0008]

【数３】 (Equation 3)

【０００９】[0009]

【数４】 (Equation 4)

【００１０】[0010]

【数５】 (Equation 5)

【００１１】[0011]

【数６】 (Equation 6)

【００１２】該アルゴリズムを整数演算のみで実行する
場合、積算を整数化すればよい。即ち、各積算器におい
て、Ｍｂを整数とし、ｂ１〜ｂ５に２のＭｂ乗を掛けた
値を四捨五入整数化し、定数Ｂ１〜Ｂ５を予め用意す
る。積算を実行するとき、まず、該定数との積算をし、
その結果をＭｂビットの丸め機能付き右シフト演算する
ことにより、整数化できる。When the algorithm is executed only by an integer operation, the integral may be converted to an integer. That is, in each integrator, Mb is an integer, and a value obtained by multiplying b1 to b5 by 2 to the power of Mb is rounded to an integer to prepare constants B1 to B5 in advance. When performing integration, first, integration with the constant is performed,
The result can be converted to an integer by performing a right shift operation with a rounding function of Mb bits.

【００１３】次に、一般的な画像の復号化方法を図２
（ｃ）に示す。ハフマン復号部２０３において、入力ビ
ット列をハフマン復号してｈ（ｉ）を得、逆量子化部２
０４にてｈ（ｉ）とスケールファクタＱ（ｉ）の掛け算
によりＤＣＴ係数ｘ（ｉ）を得る。Ｑ（ｉ）が初期値Ｑ
０（ｉ）と変数αの積で表される場合、Ｑ０（ｉ）にｃ
（ｉ）を掛けた値を予め用意しておけば、図２（ａ）の
プリスケーリング部での積算をなくすことができる。よ
って、８点ＩＤＣＴの計算が積算５回、和算２９回のみ
で実行できるので、この方法は最も演算量の少なく、高
速化されたものとして知られている。Next, a general image decoding method will be described with reference to FIG.
It is shown in (c). In the Huffman decoding unit 203, the input bit sequence is Huffman decoded to obtain h (i), and the inverse quantization unit 2
At 04, a DCT coefficient x (i) is obtained by multiplying h (i) by the scale factor Q (i). Q (i) is the initial value Q
0 (i) and the variable α, when Q0 (i) is c
If the value multiplied by (i) is prepared in advance, the integration in the pre-scaling unit in FIG. 2A can be eliminated. Therefore, since the calculation of the eight-point IDCT can be performed only by 5 times of integration and 29 times of addition, this method is known to have the least amount of calculation and to be speeded up.

【００１４】これを８×８点に拡張したものを図２
（ｄ）に示す。FIG. 2 shows an extension of this to 8 × 8 points.
(D).

【００１５】プリスケーリング部２０５においてｘ
（ｉ，ｊ）とｃ（ｉ）×ｃ（ｊ）の積ｐ（ｉ，ｊ）を求
め、これをｉ行毎に分割する。ｋを０〜７の整数とし、
ｉ行の８個の要素ｐ（ｉ，ｋ）を演算部２０６に入力す
る。In the pre-scaling unit 205, x
The product p (i, j) of (i, j) and c (i) × c (j) is obtained, and this is divided for every i rows. k is an integer from 0 to 7,
The eight elements p (i, k) in the i-th row are input to the arithmetic unit 206.

【００１６】演算部２０６ではｐ（ｉ，ｋ）をｐ（ｋ）
に代入し、図２（ｂ）と同様の処理を実行する。出力ｙ
（ｋ）をｐｒ（ｉ，ｋ）代入し、８行全てにつきこれを
繰り返し、６４個のｐｒ（ｉ，ｊ）を得る。ｐｒ（ｉ，
ｊ）をｊ列毎に分割し、ｊ列の８個の要素ｐｒ（ｋ，
ｊ）を演算部２０７に入力する。The arithmetic unit 206 converts p (i, k) into p (k)
And performs the same processing as in FIG. Output y
(K) is substituted for pr (i, k), and this is repeated for all eight rows to obtain 64 pr (i, j). pr (i,
j) is divided into j columns, and eight elements pr (k,
j) is input to the arithmetic unit 207.

【００１７】演算部２０７ではｐｒ（ｋ，ｊ）をｐ
（ｋ）に代入し、図２（ｂ）と同様の処理を実行する。
出力ｙ（ｋ）をＹ（ｋ，ｊ）に代入し、８列全てにつき
これを繰り返し、６４個のＹ（ｉ、ｊ）を出力するもの
である。The arithmetic unit 207 calculates pr (k, j) as p
(K) and execute the same processing as in FIG. 2 (b).
The output y (k) is substituted for Y (k, j), and this is repeated for all eight columns to output 64 Y (i, j).

【００１８】一方、画像復号化処理をプロセッサにて実
行する動きが、近年、活発化し、画像処理用の特殊命令
を備えた製品が開発されている。その中に、６４ビット
演算用のプロセッサを用いて、１６ビット演算を４並列
同時に実行できるものも計画されている。On the other hand, the movement of executing a picture decoding process by a processor has recently become active, and products provided with special instructions for picture processing have been developed. Among them, a processor capable of simultaneously executing four 16-bit operations in parallel using a 64-bit operation processor is also planned.

【００１９】上記８×８ＩＤＣＴが１６ビット演算で実
行できれば、４並列処理が可能となり、さらに高速化す
ることができる。If the above-mentioned 8 × 8 IDCT can be executed by 16-bit operation, 4-parallel processing becomes possible and the speed can be further increased.

【００２０】また、ＭＰＥＧ２（ＭｏｖｉｎｇＰｉｃ
ｔｕｒｅＥｘｐｅｒｔｓＧｒｏｕｐｐａｒｔ２）
で定めるＩＤＣＴの規格が‘９５年１１月に修正された
（ＴｅｃｈｎｉｃａｌＣｏｒｒｉｇｅｎｄｕｍ２
ｆｏｒＩＳＯ／ＩＥＣ１３８１８ー２，ＩＳＯ／Ｉ
ＥＣＤＩＳ１３８１８ー４）。これは、ＩＤＣＴの
実行結果が、（ー２５６）以下（ー３８４）以上のとき
（ー２５６）を出力し、２５５以上３８４未満のとき
（２５５）を出力し、ー３８５以下、または、３８４以
上の場合、出力値を規定しないというものである。Also, MPEG2 (Moving Pic)
cure Experts Group part2)
Was revised in November 1995 (Technical Corrigendum 2
for ISO / IEC 13818-2, ISO / I
EC DIS 13818-4). This means that when the execution result of the IDCT is (−256) or less (−384) or more, (−256) is output, and when it is 255 or more and less than 384 (255), −385 or less or 384 or more is output. In the case of, the output value is not specified.

【００２１】この規格を上記８×８ＩＤＣＴに適用する
と、図２（ｄ）の演算部２０６への入力ｐ（ｉ、ｊ）の
ビット数がＭｐビットの場合、演算部２０６、及び演算
部２０７での和算、及び積算には、最大で（Ｍｐ＋２）
ビット必要となる。When this standard is applied to the 8 × 8 IDCT, if the number of bits of the input p (i, j) to the arithmetic unit 206 in FIG. 2D is Mp bits, the arithmetic unit 206 and the arithmetic unit 207 (Mp + 2)
A bit is needed.

【００２２】[0022]

【発明が解決しようとする課題】しかし、ＩＥＥＥのＩ
ＤＣＴ演算の精度基準（Ｓｔｄ１１８０−１９９０，Ｄ
ｅｃ６‘９０）を満足するためには、ｐ（ｉ，ｊ）は最
低でも１６ビット必要である。これは、演算部２０６、
２０７において、最大１８ビット必要とすることを意味
し、４並列処理に適さないという問題が生じる。However, the IEEE I
DCT calculation accuracy criterion (Std1180-1990, D
To satisfy (ec6'90), p (i, j) needs at least 16 bits. This is the operation unit 206,
In 207, it means that a maximum of 18 bits are required, which causes a problem that it is not suitable for 4-parallel processing.

【００２３】[0023]

【課題を解決するための手段】上記問題を解決するため
に、図２（ｂ）において演１７ビット以上の算結果を出
力する演算器については、該演算器への入力値の大きさ
により、桁下げするか否かの判定をする。その結果、桁
下げを要する場合は、予め入力を桁下げした後に演算を
実行する。これにより、演算結果を常に１６ビット以内
の値にすることが可能となる。In order to solve the above problem, in FIG. 2 (b), for an arithmetic unit which outputs a calculation result of 17 bits or more, the size of the input value to the arithmetic unit is determined by the following formula. It is determined whether to carry down. As a result, if a carry is required, the calculation is performed after the carry is previously carried out. This makes it possible to always set the operation result to a value within 16 bits.

【００２４】図１に本発明の構成を示す。FIG. 1 shows the configuration of the present invention.

【００２５】図１（ａ）は、８×８ＩＤＣＴの行アルゴ
リズム部を示す。入力ｐ（ｋ）、途中の演算の結果を示
すｄ１〜ｄ３１、ｓ０〜ｓ７、出力ｙ（ｋ）は１６ビッ
ト以内の整数である。ここで、ｋは０〜７の整数とす
る。FIG. 1A shows a row algorithm section of the 8 × 8 IDCT. The input p (k), d1 to d31 and s0 to s7 indicating the result of the intermediate operation, and the output y (k) are integers of 16 bits or less. Here, k is an integer of 0 to 7.

【００２６】行アルゴリズム部における全ての和算に
は、図３に示すクリップ演算を使用する。The clip operation shown in FIG. 3 is used for all the additions in the row algorithm section.

【００２７】図３において、ｚ１，ｚ２を１６ビット以
内の整数とし、ｓｆｔ（ｚ１）、ｓｆｔ（ｚ２）をｚ
１，ｚ２の現在にいたるまでの桁下げ数とする。In FIG. 3, z1 and z2 are integers within 16 bits, and sft (z1) and sft (z2) are z
1, z2, the number of carry down to the present.

【００２８】比較部３０１はｓｆｔ（ｚ１）とｓｆｔ
（ｚ２）の大きい方をｓｆｔＭに代入する。次にｓｆｔ
ｄ（ｚ１）、ｓｆｔｄ（ｚ２）を（数７）により求め
る。The comparison unit 301 calculates sft (z1) and sft (z1).
The larger of (z2) is substituted for sftM. Then sft
d (z1) and sftd (z2) are obtained by (Equation 7).

【００２９】[0029]

【数７】 (Equation 7)

【００３０】判定部３０２は、ｚ１をｓｆｔｄ（ｚ１）
ビット、ｚ２をｓｆｔｄ（ｚ２）ビット右シフトしたも
ののいずれかが１６ビットのとき、ｓｆｔｄ（ｚ１）、
ｓｆｔｄ（ｚ２）、ｓｆｔＭに１を加算する。The determination unit 302 converts z1 into sftd (z1)
When any of the bits z2 shifted right by sftd (z2) bits is 16 bits, sftd (z1),
1 is added to sftd (z2) and sftM.

【００３１】桁下げ部３０３は、ｚ１を（ｓｆｔｄ（ｚ
１）ビット）丸め機能付き右シフト演算したものをＺ１
に、ｚ２を（ｓｆｔｄ（ｚ２）ビット）丸め機能付き右
シフト演算したものをＺ２に代入する。演算部３０４
は、Ｚ１とＺ２の和算をした結果ｚ３を出力する。The carry unit 303 converts z1 into (sftd (z
1) Bit) The result of right shift operation with rounding function is Z1
Then, the value obtained by performing z2 (sftd (z2) bits) right shift operation with a rounding function is assigned to Z2. Arithmetic unit 304
Outputs the result z3 of the sum of Z1 and Z2.

【００３２】経歴加算部３０５はｓｆｔ（ｚ３）にｓｆ
ｔＭを代入する。The history adding unit 305 adds sf (z3) to sf (z3).
Substitute tM.

【００３３】図３において、ｚ３＝ｚ１＋ｚ２を得る場
合は、ｚ１、ｚ２の極性が同じであるときのみ判定部３
０２を動作させる。また、ｚ３＝ｚ１ーｚ２を得る場合
は、ｚ１、ｚ２の極性が異なるときのみ判定部３０２を
動作させる。この演算をクリップ演算と定義する。In FIG. 3, when z3 = z1 + z2 is obtained, the determination unit 3 only determines when the polarities of z1 and z2 are the same.
02 is operated. When z3 = z1−z2 is obtained, the determination unit 302 is operated only when the polarities of z1 and z2 are different. This operation is defined as a clip operation.

【００３４】また、常に判定部３０２を動作させずに実
行する和算を経歴付き和算と定義する。Further, the summation executed without operating the determination unit 302 is defined as summation with history.

【００３５】図１（ａ）において、クリップ演算により
ｄ１〜ｄ８を求めるとき、ｓｆｔ（ｐ（ｋ））に０を代
入しておく。In FIG. 1A, when d1 to d8 are obtained by clipping, 0 is substituted for sft (p (k)).

【００３６】次に行アルゴリズム部における積算部１０
１〜１０５について記述する。図１（ａ）の積算部１０
１の詳細を図４に示す。Next, the integrating section 10 in the row algorithm section
1 to 105 will be described. Integrator 10 of FIG.
4 is shown in FIG.

【００３７】図４の定数部４０１において、（数２）で
定義した定数ｂ１と２のＭｂ１乗を積算し、四捨五入し
たものを定数Ｂ１とする。Ｂ１とｄ６の積算結果をＤ１
６とし、桁下げ部４０２において、これをｓｆ（ｄ１
６）ビットの丸め機能付き右シフト演算しｄ１６を得
る。ここで、ｓｆ（ｄ１６）は判定部４０３において得
られる値である。判定部４０３において、ｓｆ（ｄ１
６）の初期値としてＭｂ１を代入しておく。また、ｄ１
６を得るまでに桁下げしたビット数をｓｆｔ（ｄ１６）
とし、初期値としてｓｆｔ（ｄ６）代入しておく。ｄ６
を表し得るビット数を求め、該ビット数が１６で、且
つ、ｓｆｔ（ｄ６）が０の場合、ｓｆ（ｄ１６）、ｓｆ
ｔ（ｄ１６）に１を加算する。In the constant section 401 of FIG. 4, the constant b1 defined by (Equation 2) is multiplied by 2 to the power of Mb1, and the result is rounded to obtain a constant B1. The integration result of B1 and d6 is D1
6, and the carry-down unit 402 sets this to sf (d1
6) Right shift operation with bit rounding function is performed to obtain d16. Here, sf (d16) is a value obtained in the determination unit 403. In the determination unit 403, sf (d1
Mb1 is substituted as the initial value of 6). Also, d1
Sft (d16) is the number of bits lowered to obtain 6.
And sft (d6) is substituted as an initial value. d6
Is obtained, and when the number of bits is 16 and sft (d6) is 0, sf (d16), sf
One is added to t (d16).

【００３８】図１（ａ）の積算部１０２において、図４
のｂ１を（数３）で定義したｂ２に、Ｍｂ１をＭｂ２
に、入力ｄ６をｄ４に置き換え、Ｂ２、Ｄ１５を求め
る。また、ｓｆ（ｄ１５）にＭｂ２を、ｓｆｔ（ｄ１
５）にｓｆｔ（ｄ４）を初期値として代入する。判定部
４０３は、ｄ４が１６ビット且つｓｆｔ（ｄ４）が０の
場合には、２をｓｆ（ｄ１５）、ｓｆｔ（ｄ１５）に加
算し、ｄ４が１６ビット且つｓｆｔ（ｄ４）が１の場
合、または、ｄ４が１５ビットの場合には、１をｓｆ
（ｄ１５）、ｓｆｔ（ｄ１５）に加算する。桁下げ部４
０２はＤ１５をｓｆ（ｄ１５）ビットの丸め機能付き右
シフト演算しｄ１５を得る。In the integrating section 102 shown in FIG.
Is changed to b2 defined by (Equation 3), and Mb1 is changed to Mb2.
Then, the input d6 is replaced with d4, and B2 and D15 are obtained. Also, Mb2 is set in sf (d15), and sft (d1
5) Substitute sft (d4) as an initial value. When d4 is 16 bits and sft (d4) is 0, the determination unit 403 adds 2 to sf (d15) and sft (d15), and when d4 is 16 bits and sft (d4) is 1, Alternatively, when d4 is 15 bits, 1 is set to sf
(D15), and added to sft (d15). Carring part 4
02 performs a right shift operation on D15 with a rounding function of sf (d15) bits to obtain d15.

【００３９】積算部１０３において、図４のｂ１を（数
４）で定義したｂ３に、Ｍｂ１をＭｂ３に、入力ｄ６を
ｄ１４に置き換え、Ｂ３、Ｄ２２を求める。また、ｓｆ
（ｄ２２）にＭｂ３を、ｓｆｔ（ｄ２２）にｓｆｔ（ｄ
１４）を初期値として代入する。判定部４０３は、ｄ１
４が１６ビットの場合には、１をｓｆ（ｄ２２）、ｓｆ
ｔ（ｄ２２）に加算する。桁下げ部４０２はＤ２２をｓ
ｆ（ｄ２２）ビットの丸め機能付き右シフト演算しｄ２
２を得る。In the integrating unit 103, B3 and D22 are obtained by replacing b1 in FIG. 4 with b3 defined by (Equation 4), Mb1 with Mb3, and input d6 with d14. Also, sf
(D22) is Mb3, and sft (d22) is sft (d
14) is assigned as an initial value. The determination unit 403 calculates d1
If 4 is 16 bits, 1 is sf (d22), sf
Add to t (d22). The carry unit 402 sets D22 to s
right shift operation with round function of f (d22) bits and d2
Get 2.

【００４０】積算部１０４において、図４のｂ１を（数
５）で定義したｂ４に、Ｍｂ１をＭｂ４に、入力ｄ６を
ｄ３に置き換え、Ｂ４、Ｄ１２を求める。また、ｓｆ
（ｄ１２）にＭｂ４を、ｓｆｔ（ｄ１２）にｓｆｔ（ｄ
３）を初期値として代入する。判定部４０３の判定動作
は積算部１０１と同じで、桁下げ部４０２はＤ１２をｓ
ｆ（ｄ１２）ビットの丸め機能付き右シフト演算しｄ１
２を得る。In the integrating unit 104, B4 and D12 are obtained by replacing b1 in FIG. 4 with b4 defined by (Equation 5), Mb1 with Mb4, and input d6 with d3. Also, sf
(D12) is Mb4, and sft (d12) is sft (d
3) is substituted as an initial value. The determining operation of determining section 403 is the same as that of integrating section 101, and carry-down section 402 sets D12 to s
right shift operation with round function of f (d12) bits and d1
Get 2.

【００４１】積算部１０５において、図４のｂ１を（数
６）で定義したｂ５に、Ｍｂ１をＭｂ５に、入力ｄ６を
ｄ１３に置き換え、Ｂ５、Ｄ２２を求める。また、ｓｆ
（ｄ２２）にＭｂ５を、ｓｆｔ（ｄ２２）にｓｆｔ（ｄ
１３）を初期値として代入する。このとき、判定部４０
３は判定動作を行わず、ｓｆ（ｄ２２）、ｓｆｔ（ｄ２
２）を更新しない。桁下げ部４０２はＤ２２をｓｆ（ｄ
２２）ビットの丸め機能付き右シフト演算しｄ２２を得
る。In the integrating section 105, B5 and D22 are obtained by replacing b1 in FIG. 4 with b5 defined by (Equation 6), Mb1 with Mb5, and input d6 with d13. Also, sf
(D22) is Mb5, and sft (d22) is sft (d
13) is assigned as an initial value. At this time, the determination unit 40
No. 3 performs no determination operation, and sf (d22) and sft (d2
Do not update 2). The carry unit 402 sets D22 to sf (d
22) Right shift operation with bit rounding function is performed to obtain d22.

【００４２】行アルゴリズムの出力ｙ（ｋ）は、各々ｓ
ｆｔ（ｙ（ｋ））ビットの桁下げされている。よって、
列アルゴリズム部に入力する前に、ｓｆｔ（ｙ（ｋ））
ビットの左シフト演算を行う。The output y (k) of the row algorithm is s
ft (y (k)) bits have been carried down. Therefore,
Before inputting to the column algorithm section, sft (y (k))
Performs a bit left shift operation.

【００４３】図１（ｂ）は８×８ＩＤＣＴの列アルゴリ
ズム部を示す。FIG. 1B shows an 8 × 8 IDCT column algorithm unit.

【００４４】ｐ（ｋ）、ｄ０〜ｄ３１，ｓ０〜ｓ７、ｙ
（ｋ）は１６ビット以内の整数である。ここで、ｋを０
〜７の整数とする。P (k), d0-d31, s0-s7, y
(K) is an integer within 16 bits. Where k is 0
It is assumed to be an integer of up to 7.

【００４５】和算について、ｄ１〜ｄ５、ｄ７，ｄ８、
ｓ１，ｓ０，ｓ２，ｓ４，ｓ６を求めるときは通常の和
算を使用し、ｓｆｔ（ｄ１）〜ｓｆｔ（ｄ５）、ｓｆｔ
（ｄ７）、ｓｆｔ（ｄ８）、ｓｆｔ（ｓ１）、ｓｆｔ
（ｓ０）、ｓｆｔ（ｓ２）、ｓｆｔ（ｓ４）、ｓｆｔ
（ｓ６）に０を代入する。ｄ６，ｄ１３，ｄ１４，ｄ
２４，ｄ３１には行アルゴリズム部で記述したクリップ
演算、ｄ２３，ｓ３，ｓ５，ｓ７には経歴付き和算を使
用する。For the addition, d1 to d5, d7, d8,
When obtaining s1, s0, s2, s4, and s6, normal addition is used, and sft (d1) to sft (d5), sft
(D7), sft (d8), sft (s1), sft
(S0), sft (s2), sft (s4), sft
Substitute 0 for (s6). d6, d13, d14, d
The clip operation described in the row algorithm section is used for 24 and d31, and the summation with history is used for d23, s3, s5, and s7.

【００４６】桁上げ部１１１において、ｄ２３は、ｓｆ
ｔ（ｄ２３）ビットの左シフト演算により更新される。In the carry unit 111, d23 is sf
It is updated by a left shift operation of t (d23) bits.

【００４７】桁上げ部１１２において、ｓ７は、ｓｆｔ
（ｓ７）ビットの左シフト演算により更新される。In the carry section 112, s7 is sft
(S7) The bit is updated by a left shift operation.

【００４８】桁上げ部１１３において、ｓ３は、ｓｆｔ
（ｓ３）ビットの左シフト演算により更新される。In the carry unit 113, s3 is sft
Updated by (s3) bit left shift operation.

【００４９】桁上げ部１１４において、ｓ５は、ｓｆｔ
（ｓ５）ビットの左シフト演算により更新される。In the carry section 114, s5 is sft
It is updated by a left shift operation of (s5) bits.

【００５０】ｙ（ｋ）を求めるときには通常の和算を使
用する。When obtaining y (k), ordinary addition is used.

【００５１】積算について、積算部１０６は積算部１０
１と同じ処理によりｄ１６を出力し、積算部１１０は積
算部１０５と同じ処理によりｄ２１を出力する。Regarding the multiplication, the multiplication unit 106
D16 is output by the same processing as in 1, and the integrating section 110 outputs d21 by the same processing as the integrating section 105.

【００５２】積算部１０７において、判定動作以外は積
算部１０２と同じ処理によりｄ１５を出力する。このと
きの判定動作は、ｄ４が１６ビットの場合には、２をｓ
ｆ（ｄ１５）、ｓｆｔ（ｄ１５）に加算し、ｄ４が１５
ビットの場合には、１をｓｆ（ｄ１５）、ｓｆｔ（ｄ１
５）に加算するものである。The integrating section 107 outputs d15 by the same processing as that of the integrating section 102 except for the judging operation. At this time, when d4 is 16 bits, 2 is determined as s.
f (d15) and sft (d15), and d4 becomes 15
In the case of a bit, 1 is set to sf (d15) and sft (d1
5) is added.

【００５３】積算部１０８において、判定動作以外は積
算部１０３と同じ処理によりｄ２２を出力する。このと
きの判定動作は、ｄ１４が１６ビットで、且つ、ｓｆｔ
（ｄ１４）が０の場合、ｓｆ（ｄ２２）、ｓｆｔ（ｄ２
２）に１を加算するものである。The accumulator 108 outputs d22 by the same processing as that of the accumulator 103 except for the determination operation. The determination operation at this time is that d14 is 16 bits and sft
When (d14) is 0, sf (d22) and sft (d2
1 is added to 2).

【００５４】積算部１０９において、判定動作以外は積
算部１０４と同じ処理によりｄ１２を出力する。このと
きの判定動作は、ｄ３が１６ビットの場合、ｓｆ（ｄ１
２）、ｓｆｔ（ｄ１２）に１を加算するものである。The accumulator 109 outputs d12 by the same processing as that of the accumulator 104 except for the determination operation. At this time, when d3 is 16 bits, sf (d1
2), 1 is added to sft (d12).

【００５５】[0055]

【発明の実施の形態】図５に本発明の第１の実施例を示
す。FIG. 5 shows a first embodiment of the present invention.

【００５６】図５（ａ）において、まず、８×８の行列
で示される１２ビットのＤＣＴ係数ｘ（ｉ、ｊ）をプリ
スケーリング部５０１に入力する。ここで、ｉ、ｊは０
〜７の整数とする。In FIG. 5A, first, a 12-bit DCT coefficient x (i, j) represented by an 8 × 8 matrix is input to the pre-scaling unit 501. Here, i and j are 0
It is assumed to be an integer of up to 7.

【００５７】プリスケーリング部５０１において、最大
値検出部５０２はｘ（ｉ，ｊ）の各ｉ行の最大値を検出
し、これを表し得るビット数が１２であるとき、桁上げ
数ＳＣＬ（ｉ）に０を代入し、１１であるとき１を、１
０ビット以内のとき２を代入する。In prescaling section 501, maximum value detecting section 502 detects the maximum value of each i row of x (i, j), and when the number of bits that can represent this is 12, carry number SCL (i ) Is substituted for 0, and when it is 11, 1 is set to 1
If it is within 0 bits, 2 is substituted.

【００５８】積算部５０３において、（ｃ（ｉ）×ｃ
（ｊ））に２の１７乗を掛けた値を四捨五入整数化した
ものを定数Ｃ（ｉ、ｊ）とし、ｘ（ｉ，ｊ）との積算結
果をＸ（ｉ，ｊ）とする。ここで、ｃ（ｎ）は（数１）
で定義される定数である。In the integrating section 503, (c (i) × c
A value obtained by multiplying (j)) by 2 to the 17th power is rounded to an integer, and the result is a constant C (i, j), and the result of integration with x (i, j) is X (i, j). Here, c (n) is (Equation 1)
Is a constant defined by

【００５９】桁下げ部５０４において、Ｘ（ｉ，ｊ）を
（１１ーＳＣＬ（ｉ）ーＣｒ（ｉ））ビットの丸め機能
付き右シフト演算により１６ビットにした結果をＰｒ
（ｉ，ｊ）とする。ここで、Ｃｒ（ｉ）を、Ｃｒ（７）
＝２、Ｃｒ（６）＝１、それ以外０である整数とする。
次に、Ｐｒ（ｉ，ｊ）の６４個の要素をｉ行毎に分割
し、列方向の８個の要素Ｐｒ（ｉ，ｋ）を行アルゴリズ
ム部５０５に入力する。The result of converting X (i, j) to 16 bits by a right shift operation with a rounding function of (11-SCL (i) -Cr (i)) bits in the carry unit 504 is Pr
(I, j). Here, Cr (i) is replaced by Cr (7)
= 2, Cr (6) = 1, otherwise 0.
Next, 64 elements of Pr (i, j) are divided for every i rows, and eight elements Pr (i, k) in the column direction are input to the row algorithm unit 505.

【００６０】行アルゴリズム部５０５の詳細を図５
（ｂ）に示し、各ｉ行毎の要素Ｐｒ（ｉ，ｋ）をｐ
（ｋ）に代入する。ここで、ｋを０〜７の整数とする。The details of the row algorithm unit 505 are shown in FIG.
(B), the element Pr (i, k) for each i-th row is represented by p
Substitute (k). Here, k is an integer of 0 to 7.

【００６１】和算については、ｄ１〜ｄ３１、ｓ０〜ｓ
７を求めるときには、前述したクリップ演算を使用す
る。ここで、ｄ５，ｄ６を求めるとき、桁下げの判定に
はｐ（２）のみを使用する。ｄ１，ｄ３を求めるときも
同様に、ｐ（１）のみで判定する。For the addition, d1 to d31, s0 to s
7 is obtained by using the above-described clip operation. Here, when determining d5 and d6, only p (2) is used for the determination of carry-down. Similarly, when obtaining d1 and d3, determination is made only with p (1).

【００６２】ｓ０〜ｓ７を求めた後に１ビットの丸め機
能付き右シフト演算をし、ｓｆｔ（ｓ０）〜ｓｆｔ（ｓ
７）に１を加算する。ｙ（０）〜ｙ（７）は経歴付き和
算により求める。After obtaining s0 to s7, a right shift operation with a 1-bit rounding function is performed, and sft (s0) to sft (sft)
Add 1 to 7). y (0) to y (7) are obtained by summing with a history.

【００６３】積算については、積算部５０９〜５１３
は、前述した図１（ａ）の積算部１０１〜１０５と同じ
動作をする。ここで、Ｍｂ１〜Ｍｂ５を１３とする。For the integration, the integration units 509 to 513
Performs the same operation as the integration units 101 to 105 in FIG. Here, Mb1 to Mb5 are set to 13.

【００６４】行アルゴリズム部５０５の出力ｙ（ｋ）、
ｓｆｔ（ｙ（ｋ））は、桁下げ部５０６に入力される。The output y (k) of the row algorithm unit 505,
sft (y (k)) is input to the carry unit 506.

【００６５】桁下げ部５０６において、ｉ行の８個の要
素ｙ（ｋ）に対し、（ＳＣＬ（ｉ）＋Ｃｒ（ｉ）ーｓｆ
ｔ（ｙ（ｋ）））ビットの丸め機能付き右シフト演算し
た結果をＰｒ２（ｉ，ｋ）とする。８行すべての結果で
ある６４個のＰｒ２（ｉ，ｊ）を求め、これをｊ列毎に
分割し、行方向の８個の要素Ｐｒ２（ｋ，ｊ）を列アル
ゴリズム部５０７に入力する。In the carry unit 506, (SCL (i) + Cr (i) -sf) is applied to the eight elements y (k) in the i-th row.
The result of the right shift operation with the rounding function of (t (y (k))) bits is defined as Pr2 (i, k). 64 Pr2 (i, j), which are the results of all eight rows, are obtained, divided into j columns, and eight elements Pr2 (k, j) in the row direction are input to the column algorithm unit 507.

【００６６】列アルゴリズム部５０７の詳細を図１
（ｂ）に示す。まず、各ｊ列毎の要素Ｐｒ２（ｋ，ｊ）
をｐ（ｋ）に代入する。ｄ６を前述したクリップ演算に
より求めるとき、桁下げの判定にはｐ（２）のみを使用
する。それ以外の演算は前述した列アルゴリズム部と同
じであり、Ｍｂ１〜Ｍｂ５を１３とする。ｙ（ｋ）を求
めた後、Ｐｃ（ｋ，ｊ）に代入し、桁下げ部５０８に入
力する。FIG. 1 shows details of the column algorithm unit 507.
(B). First, an element Pr2 (k, j) for each j column
To p (k). When d6 is obtained by the above-described clipping operation, only p (2) is used for the determination of carry-down. Other operations are the same as those of the column algorithm unit described above, and Mb1 to Mb5 are set to 13. After obtaining y (k), it substitutes for Pc (k, j) and inputs it to the carry unit 508.

【００６７】桁下げ部５０８において、全ての列のＰｃ
（ｋ，ｊ）に対し、６ビットの丸め機能付き右シフト演
算をする。該結果がー２５６以下の場合はー２５６に、
２５５以上の場合は２５５にクリッピングすることによ
り９ビットにし、８×８ＩＤＣＴの結果であるＹ（ｉ，
ｊ）を得る。In carry-down section 508, Pc of all columns
A right shift operation with a 6-bit rounding function is performed on (k, j). If the result is -256 or less, the result is -256,
If it is 255 or more, it is clipped to 255 to make it 9 bits, and Y (i,
j) is obtained.

【００６８】次に、本発明の第２の実施例を図６に示
す。Next, a second embodiment of the present invention is shown in FIG.

【００６９】図６（ａ）において、まず、８×８の行列
で示される１２ビットのＤＣＴ係数ｘ（ｉ、ｊ）をプリ
スケーリング部６０１に入力する。ここで、ｉ、ｊは０
〜７の整数である。In FIG. 6A, first, a 12-bit DCT coefficient x (i, j) represented by an 8 × 8 matrix is input to a pre-scaling unit 601. Here, i and j are 0
-7.

【００７０】プリスケーリング部６０１において、最大
値検出部６０２はｘ（ｉ，ｊ）の６４個の要素の最大値
を検出する。この値を表し得るビット数が８以下である
とき、８から該ビット数を引いた値をＳＣＬ＿Ｂとす
る。In the prescaling section 601, the maximum value detecting section 602 detects the maximum value of 64 elements of x (i, j). When the number of bits that can represent this value is 8 or less, a value obtained by subtracting the number of bits from 8 is set as SCL_B.

【００７１】最大値検出部６０３は、各ｉ行毎の８個の
要素ｘ（ｉ，ｋ）をＳＣＬ＿Ｂビット左シフトしたもの
の中の最大値を検出する。ここで、ｋは０〜７の整数で
ある。この値を表し得るビット数をｎとし、ｎ＞９であ
るときＳＣＬ（ｉ）＝１２ーｎとし、１０＞ｎ＞７であ
るときＳＣＬ（ｉ）＝１１ーｎとする。The maximum value detecting section 603 detects the maximum value among the eight elements x (i, k) for each i row shifted left by SCL_B bits. Here, k is an integer of 0-7. The number of bits that can represent this value is n, SCL (i) = 12−n when n> 9, and SCL (i) = 11−n when 10>n> 7.

【００７２】積算部６０４において、（ｃ（ｉ）×ｃ
（ｊ））に２の１７乗を掛けた値を四捨五入整数化した
ものを定数Ｃ（ｉ、ｊ）とし、ｘ（ｉ，ｊ）との積算結
果Ｘ（ｉ，ｊ）を出力する。ここで、ｃ（ｎ）は（数
１）で定義される定数である。In the integrating section 604, (c (i) × c
A value obtained by multiplying (j)) by 2 to the 17th power is converted to a rounded integer to obtain a constant C (i, j), and an integration result X (i, j) with x (i, j) is output. Here, c (n) is a constant defined by (Equation 1).

【００７３】桁下げ部６０５において、Ｘ（ｉ，ｊ）を
（１１ーＳＣＬ＿Ｂ−ＳＣＬ（ｉ）ーＣｒ（ｉ））ビッ
トの丸め機能付き右シフト演算により１６ビットにした
ものをＰｒ（ｉ，ｊ）とする。ここで、Ｃｒ（ｉ）を、
Ｃｒ（７）＝２、Ｃｒ（６）＝１、それ以外０である整
数とする。次に、Ｐｒ（ｉ，ｊ）の６４個の要素をｉ行
毎に分割し、列方向の８個の要素Ｐｒ（ｉ，ｋ）を行ア
ルゴリズム部６０６に入力する。The carry-down unit 605 converts X (i, j) to 16 bits by a right shift operation with a rounding function of (11-SCL_B-SCL (i) -Cr (i)) bits to Pr (i, j). j). Here, Cr (i) is
Cr (7) = 2, Cr (6) = 1, and other integers of 0. Next, the 64 elements of Pr (i, j) are divided every i rows, and the eight elements Pr (i, k) in the column direction are input to the row algorithm unit 606.

【００７４】行アルゴリズム部６０６の詳細を図１
（ａ）に示す。各ｉ行の要素Ｐｒ（ｉ，ｋ）をｐ（ｋ）
に代入する。ｄ５，ｄ６，ｄ１，ｄ３は第１の実施例の
行アルゴリズムと同じ方法で求める。それ以外は前述し
た行アルゴリズムと同じであり、Ｍｂ１〜Ｍｂ５を１３
とし、ｙ（ｋ）、ｓｆｔ（ｙ（ｋ））を出力する。The details of the row algorithm unit 606 are shown in FIG.
(A). The element Pr (i, k) of each i-th row is p (k)
Substitute for d5, d6, d1 and d3 are obtained by the same method as the row algorithm of the first embodiment. Otherwise, it is the same as the row algorithm described above, and Mb1 to Mb5 are 13
And y (k) and sft (y (k)) are output.

【００７５】桁下げ部６０７において、ｓｆｔ（ｙ
（ｋ））の中の最大値を求め、これをｓｆｔＭに代入す
る。ｓｆｔＭが（ＳＣＬ（ｉ）＋Ｃｒ（ｉ））以上の場
合は、ｙ（ｋ）を（ｓｆｔＭ−ｓｆｔ（ｙ（ｋ）））ビ
ットの丸め機能付き右シフト演算し、上記以外の場合は
丸め機能の無い右シフト演算したものをＰｒ２（ｉ，
ｋ）に代入する。また、ＳＣＬ（ｉ）からｓｆｔＭを減
算し、ＳＣＬ（ｉ）を更新する。これを８行全てにつき
実行し、６４個のＰｒ２（ｉ，ｊ）、８個の更新された
ＳＣＬ（ｉ）を求める。その後、Ｐｒ２（ｉ，ｊ）をｊ
列毎に分割し、行方向の８個の要素Ｐｒ２（ｋ，ｊ）と
８個のＳＣＬ（ｋ）を最大値検出部６０８に入力する。In the carry-down section 607, sft (y
The maximum value in (k)) is obtained, and this is substituted for sftM. If sftM is equal to or more than (SCL (i) + Cr (i)), y (k) is right-shifted with (sftM-sft (y (k))) bits rounding function, otherwise rounding function To the right shift operation without Pr2 (i,
k). Also, sftM is subtracted from SCL (i) to update SCL (i). This is performed for all eight rows to find 64 Pr2 (i, j) and eight updated SCL (i). Then, Pr2 (i, j) is changed to j
Each column is divided, and eight elements Pr2 (k, j) in the row direction and eight SCL (k) are input to the maximum value detection unit 608.

【００７６】最大値検出部６０８において、各ｊ列につ
いて、８個の要素Ｐｒ２（ｋ，ｊ）を（ＳＣＬ（ｋ）＋
Ｃｒ（ｋ）＋２）ビット右シフトした値のうちの最大値
を求める。この値を表し得るビット数を１４から差し引
いたものをＳＣＬ＿ｃ（ｊ）とし、これが２以上のとき
は２とする。これを８列すべてにつき実行し、８個のＳ
ＣＬ＿ｃ（ｊ）を求める。In the maximum value detecting section 608, for each j column, eight elements Pr2 (k, j) are converted into (SCL (k) +
The maximum value among the values shifted right by Cr (k) +2) bits is obtained. A value obtained by subtracting the number of bits that can represent this value from 14 is set as SCL_c (j), and is set to 2 when the number is 2 or more. This is performed for all eight columns, and eight S
Find CL_c (j).

【００７７】桁下げ部６０９において、各ｊ列につき、
Ｐｒ２（ｋ，ｊ）を（２＋ＳＣＬ（ｋ）＋Ｃｒ（ｋ）ー
ＳＣＬ＿ｃ（ｊ））ビット丸め機能付きの右シフト演算
したものをＰｃ（ｋ，ｊ）に代入する。In the lowering section 609, for each j column,
Pr2 (k, j) is right-shifted with (2 + SCL (k) + Cr (k) -SCL_c (j)) bit rounding function, and is substituted for Pc (k, j).

【００７８】列アルゴリズム部６１０において、Ｐｃ
（ｋ，ｊ）をｐ（ｋ）に代入する。列アルゴリズム部の
詳細を図６（ｂ）に示す。In the column algorithm section 610, Pc
Substitute (k, j) for p (k). FIG. 6B shows the details of the column algorithm unit.

【００７９】和算については、ｄ１〜ｄ２３，ｓ１，ｓ
０，ｓ２，ｓ４，ｓ６を求めるときは通常の和算をし、
ｓｆｔ（ｄ１）〜ｓｆｔ（ｄ２３）、ｓｆｔ（ｓ１），
ｓｆｔ（ｓ０）、ｓｆｔ（ｓ２），ｓｆｔ（ｓ４），ｓ
ｆｔ（ｓ６）に０を代入する。ｄ２４，ｄ３１には前述
したクリップ演算を使用する。ｓ３，ｓ５，ｓ７、には
前述した経歴付き和算を使用する。For the addition, d1 to d23, s1, s
When obtaining 0, s2, s4, and s6, perform normal addition,
sft (d1) to sft (d23), sft (s1),
sft (s0), sft (s2), sft (s4), s
Substitute 0 for ft (s6). The clip operation described above is used for d24 and d31. For s3, s5, and s7, the above-described summation with history is used.

【００８０】桁上げ部６２６において、ｓ７は、ｓｆｔ
（ｓ７）ビットの左シフト演算により更新される。In the carry section 626, s7 is sft
(S7) The bit is updated by a left shift operation.

【００８１】桁上げ部６２７において、ｓ３は、ｓｆｔ
（ｓ３）ビットの左シフト演算により更新される。In the carry section 627, s3 is sft
Updated by (s3) bit left shift operation.

【００８２】桁上げ部６２８において、ｓ５は、ｓｆｔ
（ｓ５）ビットの左シフト演算により更新される。In carry section 628, s5 is sft
It is updated by a left shift operation of (s5) bits.

【００８３】積算について、積算部６２１は、前述した
図１（ａ）の積算部１０１から判定動作を除いた処理に
よりｄ１６を出力する。積算部６２４も図１（ａ）の積
算部１０４から判定動作を除いた処理によりｄ１２を出
力する。Regarding the integration, the integrating section 621 outputs d16 by the processing except for the judging operation from the integrating section 101 of FIG. 1A described above. The accumulating section 624 also outputs d12 by the processing excluding the determining operation from the accumulating section 104 of FIG.

【００８４】積算部６２２は、判定動作以外は図１
（ａ）の積算部１０２と同じ処理によりｄ１５を出力す
る。このときの判定動作は、ｄ４が１５ビット以上の場
合には、ｓｆ（ｄ１５）、ｓｆｔ（ｄ１５）に１を加算
するものである。The accumulator 622 operates as shown in FIG.
D15 is output by the same processing as that of the integrating unit 102 in (a). The determination operation at this time is to add 1 to sf (d15) and sft (d15) when d4 is 15 bits or more.

【００８５】積算部６２３は、判定動作以外は図１
（ｂ）の積算部１０８と同じ処理によりｄ２２を出力す
る。このときの判定動作は、ｄ１４が１６ビットの場合
には、ｓｆ（ｄ２２）、ｓｆｔ（ｄ２２）に１を加算す
るものである。The accumulator 623 operates as shown in FIG.
D22 is output by the same process as the integration unit 108 in (b). The determination operation at this time is to add 1 to sf (d22) and sft (d22) when d14 is 16 bits.

【００８６】積算部６２５は、図１（ａ）の積算部１０
５と同じ処理によりｄ２１を出力する。ｙ（ｋ）を求め
た後、これをＰｃ２（ｋ，ｊ）に代入し、桁下げ部６１
１に入力する。The accumulating section 625 corresponds to the accumulating section 10 shown in FIG.
D21 is output by the same processing as in step 5. After obtaining y (k), it is substituted for Pc2 (k, j),
Enter 1

【００８７】桁下げ部６１１において、各ｊ列の８個の
要素Ｐｃ２（ｋ，ｊ）を（４＋ＳＣＬ＿Ｂ＋ＳＣＬ＿ｃ
（ｊ））ビットの丸め機能付き右シフト演算する。該結
果がー２５６以下の場合はー２５６に、２５５以上の場
合は２５５にクリッピングすることにより９ビットにし
たものをＹ（ｋ，ｊ）に代入する。８列全てにつきこれ
を実行し、８×８ＩＤＣＴの６４個の出力結果であるＹ
（ｉ，ｊ）を得る。In the carry unit 611, the eight elements Pc2 (k, j) in each j column are converted to (4 + SCL_B + SCL_c
(J)) Right shift operation with bit rounding function is performed. If the result is less than or equal to -256, the result is clipped to -256, and if it is more than 255, 255 is substituted into Y (k, j). This is performed for all eight columns, and 64 output results of 8 × 8 IDCT, Y
(I, j) is obtained.

【００８８】次に、本発明の第３の実施例を図７に示
す。本実施例では、演算器への入力値の大きさにより桁
下げ数を決定する機能、及び、演算アルゴリズムの途中
での丸め機能を除くことを前提とし、この場合に必要と
されるビット数の削減方法を示す。よって、演算には１
６ビット以上の整数を使用する。Next, a third embodiment of the present invention is shown in FIG. In this embodiment, it is assumed that the function of determining the number of carry-downs according to the magnitude of the input value to the arithmetic unit and the rounding function in the middle of the arithmetic algorithm are excluded, and the number of bits required in this case is reduced. The method of reduction is shown. Therefore, 1
Use an integer of 6 bits or more.

【００８９】図７（ａ）において、まず、８×８の行列
で示される１２ビットのＤＣＴ係数ｘ（ｉ、ｊ）をプリ
スケーリング部７０１に入力する。ここで、ｉ、ｊは０
〜７までの整数とする。積算部７０２において、（ｃ
（ｉ）×ｃ（ｊ））に２の１６乗を掛けた値を四捨五入
整数化したものを定数Ｃ（ｉ、ｊ）とし、ｘ（ｉ，ｊ）
との積算結果Ｘ（ｉ，ｊ）を出力する。ここで、ｃ
（ｎ）は（数１）で定義される定数である。桁下げ部７
０３において、Ｘ（ｉ，ｊ）を５ビットの右シフト演算
した値をＰｒ（ｉ，ｊ）とする。Ｐｒ（ｉ，ｊ）は２１
ビットの整数である。次に、Ｐｒ（ｉ，ｊ）の６４個の
要素をｉ行毎に分割し、列方向の８個の要素Ｐｒ
（ｉ，ｋ）を行アルゴリズム部７０４に入力する。ここ
で、ｋは０〜７までの整数とする。In FIG. 7A, first, a 12-bit DCT coefficient x (i, j) represented by an 8 × 8 matrix is input to the pre-scaling unit 701. Here, i and j are 0
It is assumed to be an integer of up to 7. In the integrating unit 702, (c
A value obtained by multiplying (i) × c (j)) by 2 to the 16th power is rounded to an integer C (i, j), and x (i, j)
And outputs the integration result X (i, j). Where c
(N) is a constant defined by (Equation 1). Carrying part 7
In 03, a value obtained by right-shifting X (i, j) by 5 bits is set to Pr (i, j). Pr (i, j) is 21
It is a bit integer. Next, 64 elements of Pr (i, j) are divided every i rows, and 8 elements Pr in the column direction
(I, k) is input to the row algorithm unit 704. Here, k is an integer from 0 to 7.

【００９０】行アルゴリズム部７０４の詳細を図７
（ｂ）に示す。まず、各ｉ行の要素Ｐｒ（ｉ，ｋ）をｐ
（ｋ）に代入する。ここで、黒丸は和算器を示す。積算
器７０６、積算器７０８は、３６２との積算結果を８ビ
ットの右シフト演算した値を出力する。積算器７０７
は、６６９との積算結果を８ビットの右シフト演算した
値を出力する。積算器７０９は、２７７との積算結果を
８ビットの右シフト演算した値を出力する。積算器７１
０は、１９６との積算結果を８ビットの右シフト演算し
た値を出力する。桁下げ部７１１は２ビットの右シフト
演算により１９ビットのｙ（ｋ）を得、これをＰｃ
（ｉ，ｋ）に代入する。行アルゴリズムにおける全ての
和算、積算には最低でも２３ビットを必要とする。The details of the row algorithm unit 704 are shown in FIG.
(B). First, the element Pr (i, k) of each i row is p
Substitute (k). Here, black circles indicate adders. The integrators 706 and 708 output a value obtained by performing an 8-bit right shift operation on the result of integration with 362. Integrator 707
Outputs a value obtained by performing an 8-bit right shift operation on the result of integration with 669. The integrator 709 outputs a value obtained by performing an 8-bit right shift operation on the result of integration with 277. Integrator 71
0 outputs a value obtained by performing an 8-bit right shift operation on the result of integration with 196. The carry unit 711 obtains 19-bit y (k) by a 2-bit right shift operation,
Substitute (i, k). All additions and accumulations in the row algorithm require at least 23 bits.

【００９１】８行全てにつき行アルゴリズムを実行した
後、Ｐｃ（ｉ，ｊ）をｊ列毎に分割し、行方向の８個の
要素Ｐｃ（ｋ，ｊ）を列アルゴリズム部７０５に入力す
る。After executing the row algorithm for all eight rows, Pc (i, j) is divided into j columns, and eight elements Pc (k, j) in the row direction are input to the column algorithm unit 705.

【００９２】列アルゴリズム部７０５の詳細を図７
（ｃ）に示す。まず、各ｊ列の要素Ｐｃ（ｋ，ｊ）をｐ
（ｋ）に代入する。ここで、黒丸は和算器を示す。桁下
げ部７１２は１ビットの右シフト演算を実行する。積算
器７１３〜積算器７１７は図７（ｂ）の積算器７０６〜
積算器７１０と同じ動作をする。桁下げ部７１８は８ビ
ットの丸め機能付き右シフト演算を行う。該結果がー２
５６以下の場合はー２５６に、２５５以上の場合は２５
５にクリッピングすることにより９ビットのｙ（ｋ）を
得る。さらに、これをＹ（ｋ，ｊ）に代入する。列アル
ゴリズム部におけるすべての和算、積算には最低でも２
０ビットを必要とする。The details of the column algorithm unit 705 are shown in FIG.
It is shown in (c). First, the element Pc (k, j) of each j column is p
Substitute (k). Here, black circles indicate adders. The carry unit 712 performs a 1-bit right shift operation. The integrators 713 to 717 correspond to the integrators 706 to 706 in FIG.
The same operation as that of the integrator 710 is performed. The carry unit 718 performs an 8-bit right shift operation with a rounding function. The result is -2
If it is 56 or less, -256; if it is 255 or more, it is 25.
By clipping to 5, 9-bit y (k) is obtained. Further, this is substituted for Y (k, j). At least 2 for all addition and multiplication in the column algorithm section
Requires 0 bits.

【００９３】８列全て対し列アルゴリズムを実行し、８
×８ＩＤＣＴの６４個の出力結果Ｙ（ｉ、ｊ）を得る。The column algorithm is executed for all eight columns, and
The 64 output results Y (i, j) of the × 8 IDCT are obtained.

【００９４】[0094]

【発明の効果】本発明によれば、８×８ＩＤＣＴを１６
ビットの整数演算で実行できるので、１６ビット演算を
４並列同時に実行できるような、特殊命令を備えた６４
ビットプロセッサで処理できる。この場合のＩＤＣＴの
演算量は約１／４に減少する。According to the present invention, the 8 × 8 IDCT is 16
Since it can be executed by an integer operation of bits, it is possible to execute four 16-bit operations in parallel at the same time.
Can be processed by a bit processor. In this case, the computation amount of the IDCT is reduced to about 1/4.

【００９５】また、演算に必要とするビット数を削減で
きる点から、本アルゴリズムを使用した８×８ＩＤＣＴ
を専用ＬＳＩで構成する場合、配線数、フリップフロッ
プ数を削減でき、ＬＳＩの面積を削減できる。Also, since the number of bits required for the operation can be reduced, the 8 × 8 IDCT
Is composed of a dedicated LSI, the number of wirings and the number of flip-flops can be reduced, and the area of the LSI can be reduced.

[Brief description of the drawings]

【図１】本発明による８×８ＩＤＣＴの行アルゴリズ
ム、及び列アルゴリズムの構成を示す図。FIG. 1 is a diagram showing a configuration of a row algorithm and a column algorithm of 8 × 8 IDCT according to the present invention.

【図２】従来のＩＤＣＴアルゴリズムを示す図。FIG. 2 is a diagram showing a conventional IDCT algorithm.

【図３】本発明における、結果を常に１６ビット以内の
値とする和算部の構成を示す図。FIG. 3 is a diagram illustrating a configuration of a summing unit according to the present invention that always sets a result to a value within 16 bits.

【図４】本発明における、結果を常に１６ビット以内の
値とする積算部の構成を示す図。FIG. 4 is a diagram illustrating a configuration of an integrating unit that always sets a result to a value within 16 bits according to the present invention.

【図５】本発明の第１の実施例を示す図。FIG. 5 is a diagram showing a first embodiment of the present invention.

【図６】本発明の第２の実施例を示す図。FIG. 6 is a diagram showing a second embodiment of the present invention.

【図７】本発明の第３の実施例を示す図。FIG. 7 is a diagram showing a third embodiment of the present invention.

[Explanation of symbols]

１０１〜１１０・・・積算部、１１１〜１１４・・・桁
上げ部、２０１・・・プリスケーリング部、２０２・・
・演算部、２０３・・・ハフマン復号部、２０４・・・
逆量子化部、２０５・・・プリスケーリング部、２０
６，２０７・・・演算部、３０１・・・比較部、３０２
・・・判定部、３０３・・・桁下げ部、３０４・・・演
算部、３０５・・・経歴加算部、４０１・・・定数部、
４０２・・・桁下げ部、４０３・・・判定部、５０１・
・・プリスケーリング部、５０２・・・最大値検出部、
５０３・・・積算部、５０４，５０６，５０８・・・桁
下げ部、５０５・・・行アルゴリズム部、５０７・・・
列アルゴリズム部、５０９〜５１３・・・積算部、６０
１・・・プリスケーリング部、６０２，６０３，６０８
・・・最大値検出部、６０４・・・積算部、６０５，６
０７，６０９，６１１・・・桁下げ部、６０６・・・行
アルゴリズム部、６１０・・・列アルゴリズム部、６２
１〜６２５・・・積算部、６２６〜６２８・・・桁上げ
部、７０１・・・プリスケーリング部、７０２・・・積
算部、７０３・・・桁下げ部、７０４・・・行アルゴリ
ズム部、７０５・・・列アルゴリズム部、７０６〜７１
０，７１３〜７１７・・・積算部、７１１，７１２，７
１８・・・桁下げ部。························ 101 to 110 ······················
-Operation unit, 203 ... Huffman decoding unit, 204 ...
Inverse quantization unit, 205... Prescaling unit, 20
6,207: arithmetic unit, 301: comparison unit, 302
··· Judgment unit, 303 ··· Carry down unit, 304 ··· Operation unit, 305 ··· History adding unit, 401 ··· Constant unit,
402: carry-down unit, 403: judgment unit, 501
..Pre-scaling section, 502... Maximum value detecting section,
503: accumulation unit, 504, 506, 508: carry unit, 505: row algorithm unit, 507 ...
Column algorithm part, 509 to 513...
1: Prescaling unit, 602, 603, 608
... Maximum value detector, 604 ... Integrator, 605,6
07, 609, 611: carry-down unit, 606: row algorithm unit, 610: column algorithm unit, 62
1 to 625: accumulation unit, 626 to 628: carry unit, 701: prescaling unit, 702: accumulation unit, 703: carry unit, 704: row algorithm unit, 705... Column algorithm section, 706 to 71
0, 713 to 717: integrating section, 711, 712, 7
18 ... Carry-down part.

Claims

[Claims]

1. A DCT coefficient x (i, j) represented by an 8 × 8 matrix is input to a pre-scaling unit, where i and j are integers from 0 to 7, and n is 1 in the pre-scaling unit. ~
An integer of 7, SQRT (2) is a square root of 2, c (n) is (1/2 × cos (n × π / 16)), c (0) is SQRT (2) / 4, and x (i , J) and (c (i) × c (j)) are output, and the 64 Pr (i, j) are divided into i rows, and On the other hand, k is an integer of 0 to 7 and eight column elements pr
(I, k) is substituted for p (k), and p (k) is input to a row algorithm unit. In the row algorithm unit, p (2) + p (6) is calculated for an operation when k is an even number. d5, p (2) -p (6) is d
6, p (0) + p (4) is d7, p (0) -p (4) is d8
When b1 is SQRT (2), b1 × d6 is d16, d5−d16 is d23, d7 + d5 is s0, d7−d5 is s6, d8−d23 is s4, d8 + d23 is s2, and k is an odd number. , P (1) + p (7) is d1, and p (1) -p (7) is d
3, p (5) + p (3) is d2, p (5) -p (3) is d4
D1−d2 is d14, d1 + d2 is s1, d4−d3 is d13, b2 is (1 / (cos (6π / 16)), b2 × d
4 as d15, b4 as (1 / (cos (2π / 16)), b4 × d
3 as d12, b3 as SQRT (2), b3 × d14 as d22, and b5 as (SQRT (2) / (2 × cos (2π / 1)
6)), b5 × d13 is d21, d21-d15 is d31, d12-d21 is d24, s1-d24 is s5, s5 + d22 is s3, s3 + d3
Let 1 be s7, and for the calculation results of the even and odd parts of k, s0 + s1 is y (0), s0-s1 is y (7), s4 + s5 is y (6), s4-s5 is y (1), s2 + s3 is y (2), s2-s3 is y (5), s6 + s7 is y (4), s6-s7 is y (3), and the output result y (k) of the row algorithm is obtained. y
(K) is substituted for pc (i, k), and this is repeated for all eight rows. After obtaining 64 pc (i, j), the pc (i, j) is divided into j columns. , J, the eight row-directional elements pc (k, j) of column j are substituted for p (k), and p (k) is input to the column algorithm unit. To get y (k),
ID for substituting y (k) for each j column into yc (k, j) and repeating this for all eight columns to obtain yc (i, j)
In the CT algorithm, in the pre-scaling unit, M is set to 16 or 17, and a value obtained by multiplying (c (i) × c (j)) by 2 to the power of M is rounded to C (i, j). , The carry number for each i row is SCL (i), and Cr (i) is
r (7) is an integer of 0 to 2; Cr (6) is 0 to 1
The following integers, and other integers that are 0, (x (i,
j) × C (i, j)) by (M-6-SCL (i) -Cr
(I) Result P of right shift operation with bit rounding function
r (i, j) is output, this is divided for every i rows, Pr (i, k) of the i rows is substituted for p (k), and this is input to the row algorithm section. In addition of integers z1 and z2 within 16 bits, z
Sft is the number of carry bits of 1, z2 up to the present
(Z1) and sft (z2), and the larger of the two is sf
tM, sftd (z1) is (sftM-sft (z1)), sftd (z2) is (sftM-sft (z2)), z1 is sftd (z1) bits, and z2 is sftd (z
2) The result of right shift operation with bit rounding function is Z
1, Z2, and the result of executing the addition of Z1 and Z2 is z3
When the operation of substituting sftM for sft (z3) is a sum with history, and using this, z3, which is the result of (z1 + z2) and (z1-z2), is always within 16 bits, z3 = Z1 + z2, the polarities of z1 and z2 are the same, z1 is sftd (z1) bits, and z2 is sftd (z
2) When any of the right-shifted bits is 16 bits, z1 is (sftd (z1) +1) bits, and z2 is the right-shifted value with rounding function of (sftd (z1) +1) bits to Z1 and Z2. Substituting, (Z1 + Z
2) is performed, and z3 which is always an integer within 16 bits
And perform an operation of substituting (sftM + 1) for sft (z3). The polarities of z1 and z2 are the same, and z1 is sftd
When both (z1) bits and z2 shifted right by sftd (z2) bits are 15 bits or less, or when the polarities of z1 and z2 are different, z1 is sftd (z1) bits, and z2 is sftd (z
2) The right-shifted value with the bit rounding function is Z1,
Substituting into Z2, executing (Z1 + Z2) to obtain z3, sf
When sftM is substituted for t (z3) to obtain z3 = z1−z2, the polarities of z1 and z2 are different, z1 is sftd (z1) bits, and z2 is sftd (z
2) When any of the right-shifted bits is 16 bits, z1 is (sftd (z1) +1) bits, z2 is (sftd (z2) +1) bits, and the right-shifted values with the rounding function are added to Z1 and Z2. Substituting, (Z1-Z
2) is performed, and z3 which is always an integer within 16 bits
And perform an operation of substituting (sftM + 1) for sft (z3). The polarities of z1 and z2 are different, and z1 is changed to sftd (z
1) bits and z2 shifted right by sftd (z2) bits are both 15 bits or less, or when the polarities of z1 and z2 are the same, z1 is sftd (z1) bits, and z2 is sftd ( z
2) The right-shifted value with the bit rounding function is Z1,
Substituting into Z2, executing (Z1-Z2) to obtain z3, sf
The operation method of substituting sftM for t (z3) is a clip operation, substituting 0 for sft (p (k)), performing all the additions in the row algorithm unit by the clip operation, and integrating in the row algorithm unit. , Where b1 is 2 Mb1
B1 is a value obtained by rounding the value obtained by multiplying the power to an integer,
(D6 × B1) is D16, the value obtained by multiplying b2 by 2 Mb2 and rounded to an integer is B2, and (d4 × B
2) is D15, a value obtained by multiplying b3 by 2 to the power of Mb1 and rounded to an integer is B3, and (d14 × B3) is D2.
The value obtained by multiplying 2 and b4 by 2 to the power of Mb4 is rounded to an integer, and the resulting value is B4, and (d3 × B4) is calculated by dividing D12 and b5 by 2
The value obtained by multiplying the value obtained by multiplying the power of
When (d13 × B5) is D21 and d12 is obtained, sf (d12) is Mb4, sft
Substituting sft (d3) for (d12), if d3 is 16 bits and sft (d3) is 0, sf (d1
2), 1 is added to sft (d12), and D1 is added to d12.
2 is substituted with the result of right shift operation with sf (d12) bit rounding function. When d15 is obtained, Mb2 and sft are used for sf (d15).
Substituting sft (d4) for (d15), if d4 is 16 bits and sft (d4) is 0, add 2 to sf (d15) and sft (d15), and d4 is 16 bits, When sft (d4) is 1 or when d4 is 15 bits, 1 is added to sf (d15) and sft (d15), and d1
5 is substituted with the value obtained by performing a right shift operation with sf (d15) bit rounding function on D15. When d16 is obtained, Mb1 and sft are used for sf (d16).
Substituting sft (d6) for (d16), if d6 is 16 bits and sft (d6) is 0, sf (d1
6), 1 is added to sft (d16), and D1 is added to d16.
6 is substituted with the result of right shift operation with sf (d16) bit rounding function, and when d21 is obtained, sf (d21) is Mb5, sft
Substituting sft (d13) for (d21), D21 for d21
21 is substituted with the result of right shift operation with sf (d21) bit rounding function, and when d22 is obtained, sf (d22) is Mb3, sft
Substituting sft (d14) for (d22), d14 becomes 16
In the case of bits, sf (d22) and sft (d22) are 1
Is added to d22, and the value obtained by performing a right shift operation with sf (d22) bit rounding function on D22 is substituted for d22. After obtaining y (k), y (k) is converted to (SCL (i) + Cr
(I) -sft (y (k))) A right-shift operation with a rounding function of bits is substituted for Pc (i, k), and this operation is repeated for all eight rows to obtain 64 Pc (i, j). )
Is obtained, the Pc (i, j) is divided into j columns, and the eight elements Pc (k, j) in the j columns are substituted for p (k), which is input to the column algorithm unit. , In the column algorithm section, regarding the addition, d1 to d5, d7, d8, s1, s
When obtaining 0, s2, s4, and s6, perform normal addition,
sft (d1) to sft (d5), sft (d7), s
ft (d8), sft (s1), sft (s0), sf
0 is substituted into t (s2), sft (s4), and sft (s6), and d6, d13, d14, d24, and d31 are clip operations described in the row algorithm section, d23, s
3, s5, and s7 are used as summations with a history, and d23 is substituted with sft (d23) bits left-shifted, and sft (s0) to s0 are substituted for s0 to s7.
ft (s7) -bit left-shifted operation is substituted,
When obtaining y (k), normal summation is used. For integration, Mb1 to Mb5, B1 to B5, D16,
The definitions of D15, D22, D12, and D21 are the same based on the description of the row algorithm, and d16 and d21 are obtained in the same manner as the row algorithm. When d12 is obtained, Mb4 is set to sf (d12), and sf
Substituting 0 into t (d12), and when d3 is 16 bits,
One is added to sf (d12) and sft (d12), and d1
2 is substituted with a value obtained by performing a right shift operation with sf (d12) bit rounding function on D12, and when d15 is obtained, Mb2 is set to sf (d15) and sf
Substituting 0 into t (d15), and when d4 is 16 bits,
2 is added to sf (d15) and sft (d15), and d4
Is 15 bits, sf (d15), sft (d1
Add 1 to 5), and add d15 to d15 as sf (d15)
Substituting the result of the right shift operation with the bit rounding function to obtain d22, Mb3 and sft are used for sf (d22).
Substituting sft (d14) for (d22), d14 becomes 16
Bit and sft (d14) is 0, sf
(D22), 1 is added to sft (d22), and d22
Is substituted for sf (d22) bits with a right shift operation with a rounding function, and after calculating y (k), y (k) is right shifted with a 6-bit rounding function. If it is less than 256, it is substituted by -256, and if it is more than 255, the value clipped to 255 is substituted into Y (i, k), and this is repeated for all eight columns to obtain Y (i, j). I
Integer method of DCT.

2. The method according to claim 1, wherein the summation in the row algorithm section uses a clipping operation when obtaining d1 to d31 and s0 to s7, and after obtaining s0 to s7, a 1-bit right with rounding function. After performing a shift operation, sft (s0) ~
An IDCT integerization method characterized in that 1 is added to sft (s7) and y (0) to y (7) are obtained by summing with a history.

3. The method according to claim 1, wherein y
After obtaining (k), the maximum value of sft (y (k)) is obtained, and this is substituted for sftM.
(I) + Cr (i)), y (k) is set to (sfM
−sft (y (k))) Right shift operation with a rounding function of bits, and in other cases, a right shift operation without a rounding function is performed, and the result is substituted into Pc (i, k), and the result is substituted from SCL (i). Subtract sftM and repeat for all 8 rows, 64 Pc (i, j), 8 updated SCL
After calculating (i), Pc (i, j) is divided into j columns, and the eight elements Pc (k, j) in each j column are shifted right by (SCL (k) + Cr (k) +2) bits. The maximum value of the shifted values is obtained, and the value obtained by subtracting the number of bits that can represent this from 14 is SCL_c.
(J), and when this is 2 or more, 2 and Pc (k,
j) by itself (2 + SCL (k) + Cr (k) -SCL
_C (j)) to which the right-shifted operation with the bit rounding function is substituted and input to the column algorithm unit.
When obtaining 23, s0, s1, s2, s4, and s6, normal addition is performed, and sft (d1) to sft (d23), s
ft (s0), sft (s1), sft (s2), sf
Substituting 0 into t (s4) and sft (s6) to obtain d31, d
24, a clipping operation is used, and s5, s3, and s7 use a history-added operation. Substitute s3 with the sft (s3) -bit left-shifted operation, and substitute s5 with sft (s
5) Substitute a left-shifted operation of bits, substitute itself for s7 with a left-shifted operation of sft (s7) bits, and use ordinary addition when obtaining y (k).
For integration, D16 is Mb1 bit, D12 is M
b4 bits and D21 are right-shifted with Mb5 bits rounding function as d16, d12 and d21, and sft
(D16), 0 for sft (d12), and sft (d21)
To obtain d15, Mb2 is substituted for sf (d15), and sf
Substituting 0 into t (d15), and when d4 is 15 bits or more, adding 1 to sf (d15) and sft (d15),
Substitute the value obtained by performing a right shift operation with sf (d15) bit rounding function on D15 for d15. When obtaining d22, Mb3 is set to sf (d22) and sf
Substituting 0 into t (d22), and when d14 is 16 bits, adding 1 to sf (d22) and sft (d22),
Substitute the right-shift operation of D22 with the sf (d22) bit rounding function for d22 into d22, obtain y (k), and then convert y (k) to (4 + SCL_c
(J)) A right shift operation with a bit rounding function is performed, and if the result is -256 or less, the result obtained by clipping to -256 and 255 to 255 is substituted into Y (k, j). This method is repeated to obtain Y (i, j), which is an IDCT integer conversion method.