JP3095348B2

JP3095348B2 - Discrete cosine transform and inverse discrete cosine transform device in data compression / decompression device

Info

Publication number: JP3095348B2
Application number: JP2969196A
Authority: JP
Inventors: 孝夫松本; 浩一堀内
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1996-02-16
Filing date: 1996-02-16
Publication date: 2000-10-03
Anticipated expiration: 2016-02-16
Also published as: JPH09223124A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device that performs the discrete cosine transform and the inverse discrete cosine transform at high speed in a data compression/decompression device, and in particular to an improved technique that uses two-dimensional rotation operations.

【０００２】[0002]

2. Description of the Related Art In recent years, computers such as workstations and PCs and semiconductor devices such as VLSIs have become faster and cheaper, and digital image compression/decompression schemes such as JPEG, H.261, and MPEG have been standardized. As a result, digital compression and decompression of image data has become commonplace, demand for it has grown, and techniques that achieve still higher compression ratios and higher quality have become necessary.

[0003] The recently standardized image compression/decompression schemes such as JPEG, H.261, and MPEG adopt a DCT coding method. The image data is subjected to a discrete cosine transform (hereinafter also called "DCT"), the resulting DCT coefficients are linearly quantized and then variable-length coded to compress the image, and the reverse process is performed to decompress it.

[0004] The DCT is one of the orthogonal transforms, and its output, the DCT coefficients, corresponds to the spatial-frequency components contained in the original image data. Equation 7 is the transform equation for a one-dimensional N-point DCT, that is, a DCT of N input data f[x] (0 ≤ x ≤ N−1).

【０００５】[0005]

(Equation 7) Here, K is a constant, and C[u] (0 ≤ u ≤ N−1) satisfies Equation 8.

【０００６】[0006]

(Equation 8)

In image processing that compresses and decompresses with such a DCT coding method, speeding up the DCT and the inverse DCT is one of the most important factors in achieving a high compression ratio and high image quality. Various studies have addressed speeding up the DCT operation, and one factor that hinders it is the large number of multiplications (see, for example, W. Chen et al., ''A Fast Computational Algorithm for the Discrete Cosine Transform,'' IEEE Trans. Commun., COM-25, pp. 1004-1009, 1977).
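Equations 7 and 8 are not reproduced in this text, so the following is only a minimal sketch of why multiplications dominate. It assumes the conventional form F[u] = K · C[u] · Σ f[x] · cos((2x+1)uπ / 2N), with C[0] = 1/√2 and C[u] = 1 otherwise. The default value of K and the function name `dct_1d` are illustrative assumptions, not taken from the patent.

```python
import math

def dct_1d(f, K=None):
    """Directly evaluate an N-point DCT; uses N*N cosine multiplications."""
    N = len(f)
    if K is None:
        K = math.sqrt(2.0 / N)  # assumed normalization; the text only says K is a constant
    F = []
    for u in range(N):
        c_u = 1.0 / math.sqrt(2.0) if u == 0 else 1.0  # assumed form of C[u]
        acc = 0.0
        for x in range(N):
            # one multiplication per (u, x) pair
            acc += f[x] * math.cos((2 * x + 1) * u * math.pi / (2 * N))
        F.append(K * c_u * acc)
    return F

# A constant input has only a DC component.
coeffs = dct_1d([1.0] * 8)
```

Evaluated this way, an 8-point transform needs 64 multiplications by cosine constants, which is the cost that the fast algorithms cited above set out to reduce.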

[0007] However, it has been shown that all, or almost all, of the many multiplications required in the DCT operation can be expressed using two-dimensional rotation operations of the form shown in Equation 1 (see, for example, A. Ligtenberg et al., ''A single chip solution for an 8 by 8 two dimensional DCT,'' IEEE Intl. Symp. on Circuits and Systems, ISCAS-87, pp. 1128-1131, 1987; C. Loeffler et al., ''Algorithm-architecture mapping for custom DSP chips,'' IEEE Intl. Symp. on Circuits and Systems, ISCAS-88, pp. 1953-1956, 1988).

[0008] Specifically, for K = √2 and N = 8 in Equation 7, for example, the calculation flow graph shown in FIG. 6(a) is obtained (see, for example, K. R. Rao et al., "Image Coding Technology: DCT and Its International Standards", Ohmsha, pp. 403-426). The figure shows that many of the multiplications required in a one-dimensional 8-point DCT are replaced by two-dimensional rotation operations for three angles (−π/16, −2π/16, −5π/16). FIG. 6(b) is a table explaining the meaning of the symbols used in FIG. 6(a).

[0009] It follows that the DCT operation can be performed at high speed by speeding up the two-dimensional rotation operation of Equation 1. Methods for speeding up the two-dimensional rotation operation have therefore been proposed; the two representative ones are as follows.

(1) The first conventional method performs, instead of the two-dimensional rotation operation of Equation 1, the two-dimensional rotation operation of Equation 9, which is obtained by transforming Equation 1.

【００１０】[0010]

(Equation 9)

This method exploits the fact that the two-dimensional rotation operation of Equation 1 requires four multiplications per rotation, whereas that of Equation 9 requires only three. It has been reported that, with this method, a one-dimensional DCT that conventionally required 16 multiplications can be realized with 11 multiplications (Loeffler et al., ''Practical fast 1-D DCT algorithms with 11 multiplications,'' IEEE Intl. Conf. on Acoust., Speech, and Signal Process., ICASSP-89, pp. 988-991, 1989).
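Equation 9 itself is not reproduced in this text. As a hedged illustration of the idea, the sketch below uses one well-known three-multiplication factorization of a plane rotation; it may differ in form from the patent's Equation 9. The function names are illustrative, and the rotation convention x' = x·cos θ − y·sin θ, y' = x·sin θ + y·cos θ is an assumption chosen because it matches the theoretical values quoted later for θ = −π/16.

```python
import math

def rotate_4mul(x, y, theta):
    """Plain rotation: four multiplications per call."""
    c, s = math.cos(theta), math.sin(theta)
    return x * c - y * s, x * s + y * c

def make_rotate_3mul(theta):
    """Rotation by a fixed angle using three multiplications per call."""
    c, s = math.cos(theta), math.sin(theta)
    a, b = c + s, s - c            # constants precomputed once per angle
    def rotate(x, y):
        t = c * (x + y)            # multiplication 1
        return t - a * y, t + b * x  # multiplications 2 and 3
    return rotate

rot = make_rotate_3mul(-math.pi / 16)
x3, y3 = rot(50.0, 50.0)
x4, y4 = rotate_4mul(50.0, 50.0, -math.pi / 16)
```

The saving comes from trading one multiplication for extra additions, which is worthwhile only because the angle, and hence the constants, are fixed in advance.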

[0011] With this method, for example, the two-dimensional 8-point DCT of 8 × 8 pixel data required for image compression/decompression such as JPEG, H.261, and MPEG, which conventionally needed 256 multiplications, needs only 176.

(2) The second conventional method uses an algorithm called CORDIC (COordinate Rotation DIgital Computer) (see, for example, J. E. Volder, ''The CORDIC trigonometric computing technique,'' IRE Trans. Electron. Comput., EC-8, pp. 330-334, 1959; J. S. Walther, ''A unified algorithm for elementary function,'' AFIPS Conf., 38, pp. 379-385, 1971).

[0012] In this method, the recurrence relations of Equations 10, 11, and 12 are executed repeatedly, with δ[i] chosen at each step so that z[i] of Equation 12 converges to 0.

【００１３】[0013]

(Equation 10)

【００１４】[0014]

(Equation 11)

【００１５】[0015]

(Equation 12) Here, i is an integer, δ[i] = ±1, x[0], y[0], and z[0] satisfy Equation 13, and θ[i] satisfies Equation 14.

【００１６】[0016]

(Equation 13)

【００１７】[0017]

(Equation 14) Then the values (x[n], y[n]) obtained from Equations 10 and 11, multiplied by the correction value C[n] of Equation 15, converge to the desired theoretical values (x', y').

【００１８】[0018]

(Equation 15) That is, Equation 16 holds for large n.

【００１９】[0019]

(Equation 16)

This method decomposes an arbitrary rotation angle (θ) into a sum or difference of predetermined discrete angles related to powers of two (for example, θ = θ1 − θ2 + θ3 ...), and thereby replaces a complicated two-dimensional rotation operation with the repetition of simple binary operations (digit shifts and additions/subtractions).
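Equations 10 to 16 are not reproduced in this text. The following floating-point sketch assumes the textbook CORDIC rotation recurrence, x[i+1] = x[i] − δ[i]·2^(−i)·y[i], y[i+1] = y[i] + δ[i]·2^(−i)·x[i], z[i+1] = z[i] − δ[i]·atan(2^(−i)), with δ[i] taken as the sign of z[i]. The function name `cordic_rotate` and the use of floating point instead of 16-bit fixed point are illustrative assumptions.

```python
import math

def cordic_rotate(x, y, theta, n=16):
    """Rotate (x, y) by theta using n shift-and-add style micro-rotations."""
    z = theta
    for i in range(n):
        d = 1.0 if z >= 0 else -1.0          # data-dependent direction choice
        x, y = x - d * y * 2.0**-i, y + d * x * 2.0**-i
        z -= d * math.atan(2.0**-i)          # angle table lookup in hardware
    gain = 1.0
    for i in range(n):
        gain /= math.sqrt(1.0 + 2.0**(-2 * i))  # correction constant C[n]
    return gain * x, gain * y, gain

xr, yr, c16 = cordic_rotate(50.0, 50.0, -math.pi / 16)
```

Note that the direction `d` depends on the running angle `z`, so each iteration must wait for the previous one; this is the data dependence that the description later identifies as a drawback.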

[0020] FIG. 7 is a block diagram showing a hardware implementation of the two-dimensional rotation operation by the CORDIC method. This conventional device consists broadly of a discrete-angle rotation unit 710 and a control unit 720, and operates in synchronization with a clock (not shown). The selectors 711 and 712 pass the input data x[0] and y[0] on the first clock, and the output data of the adder/subtractors 717 and 718 from the next clock onward, to the following registers 713 and 714.

[0021] The two sets of data sent directly from the registers 713 and 714 and via the shifters 715 and 716 are combined in the adder/subtractors 717 and 718, respectively. The adder/subtractor 717 corresponds to the operation of Equation 10, and the adder/subtractor 718 to that of Equation 11. The control unit 720 consists of an LUT (look-up table) 721, a Z calculation unit 722, and a determination unit 723. The LUT 721 is a ROM that stores in advance the many angles θ[i] satisfying Equation 14, the Z calculation unit 722 is an arithmetic unit that computes z[i] of Equation 12, and the determination unit 723 is a comparator that decides whether δ[i] in Equation 12 should be 1 or −1.

[0022] The function (addition or subtraction) of the adder/subtractors 717 and 718 is determined dynamically on every clock by an instruction from the control unit. As a result, Equations 10, 11, and 12 are each executed once per clock. For the CORDIC method it has been shown that adequate accuracy is secured when the number of iterations equals the word length of the input data (see, for example, G. L. Haviland et al., ''A CORDIC Arithmetic Processor Chip,'' IEEE Trans. Comput., C-29, pp. 68-78, 1980). For example, to perform a two-dimensional rotation operation on input data with a 16-bit word length, Equations 10, 11, and 12 are each repeated 16 times.

[0023] Table 1 below shows the calculation process when 16-bit fixed-point input data (x, y) = (50, 50) is rotated by θ = −π/16 using the CORDIC method.

【００２４】[0024]

[Table 1]

The values in Table 1 are 16-bit data expressed as fixed-point data with the binary point at the eighth bit from the least significant end. As the table shows, the values (58.787303, 39.286418) obtained by multiplying (x[16], y[16]) in the bottom row by the constant C[16] (= 0.6072529...) agree well with the theoretical values (58.793780, 39.284748).

【００２５】[0025]

[Problems to be Solved by the Invention] However, although the DCT and inverse DCT using the two-dimensional rotation operations of the conventional methods above reduce the number of operations to some extent, they are not fast enough as a technique for compressing and decompressing today's large volumes of image data in real time. In addition, when these operations are implemented in hardware, the circuit becomes large.

[0026] That is, the first conventional method merely reduces the four multiplications required in a two-dimensional rotation operation to three. The complex operation of multiplication is still required, so the time saved by this method is insufficient. The second conventional method, on the other hand, must perform as many shifts and additions/subtractions as the word length of the input data in order to secure accuracy. When the word length is long, a very large number of shifts and additions/subtractions is needed, and a single two-dimensional rotation operation takes a long time.

[0027] Furthermore, as FIG. 7 shows, a hardware implementation of the second conventional method requires a look-up table, a comparator, and so on. These circuits consume many gates, which makes it difficult to implement the DCT as an LSI. Moreover, in image compression/decompression, two-dimensional image data is divided into blocks of 8 × 8 pixels and the DCT/inverse DCT is repeated for each block in turn. With such a hardware configuration, a two-dimensional rotation operation on one set of 16-bit input data takes 16 clocks, so repeating it for 48 sets of input data, for example, takes as many as 768 clocks.

[0028] When the second conventional method is implemented in software, the type of operation (addition or subtraction) in Equations 10 and 11 cannot be determined until the result of Equation 12 is available. In other words, a conditional decision is needed for every iteration, so the pipeline of the CPU or other processor that executes the software cannot be fully exploited. As a result, speeding up is hindered.

[0029] The present invention has been made in view of these problems. Its first object is to provide a high-speed discrete cosine transform and inverse discrete cosine transform device that performs the multiplications required in the conventional two-dimensional rotation operation using only shifts and additions/subtractions, and that performs the two-dimensional rotation operation with fewer shifts and additions/subtractions than the CORDIC method requires.

[0030] A second object of the present invention is to provide a discrete cosine transform and inverse discrete cosine transform device that does not need the look-up table, comparator, and so on required when the conventional CORDIC method is implemented in hardware, and that can therefore be built from a small circuit. A third object of the present invention is to provide a high-speed discrete cosine transform and inverse discrete cosine transform device in which, when fixed-angle two-dimensional rotation operations are repeated on successive input data, the number of clocks required per set of input data is smaller than with the CORDIC method.

【００３１】[0031]

SUMMARY OF THE INVENTION To solve these problems, the present invention is a method of performing the DCT/inverse DCT using two-dimensional rotation operations of a predetermined angle (θ). The method comprises a step of calculating the input data (x[0], y[0]) to be rotated; a step of executing n times, on the input data (x[0], y[0]), the discrete-angle rotation operation of Equation 5, which is expressed using δ[i], p[i], and q[i] determined in advance; and a step of correcting the output data (x[n], y[n]) finally obtained by that step.

[0032] With this method, the minimum steps needed to perform a rotation by a predetermined angle, rather than an arbitrary one, are determined in advance from Equations 2 to 4, and those steps consist only of shifts and additions/subtractions. The number of multiplications required by the conventional methods is therefore reduced, and so is the number of processing steps. In addition, by arranging the two-dimensional rotation operation of this method as a pipeline that performs the discrete-angle rotation operations in sequence, the look-up table, comparator, and so on required by the conventional CORDIC method become unnecessary, and a fast, compact DCT/inverse DCT device with a small circuit scale is realized.

【００３３】[0033]

BEST MODE FOR CARRYING OUT THE INVENTION For a one-dimensional N-point DCT device, the configuration (processing steps) can be determined in the following order.

(1) Using the theory of Loeffler et al. cited above, identify the kinds (angles) of two-dimensional rotation operation needed for the one-dimensional N-point DCT, and then determine a configuration that realizes each identified operation as follows.

(2) For each identified angle, determine n values each of δ (δ[0] to δ[n−1]), p (p[0] to p[n−1]), and q (q[0] to q[n−1]) that satisfy Equations 2, 3, and 4.

[0034] In this determination, it is preferable that δ[i], p[i], and q[i] satisfy Equations 2, 3, and 4 for as small a value of n as possible, and that p[i+1] − p[i] ≠ 1 and q[i+1] − q[i] ≠ 1.

(3) Based on the n, δ[i], p[i], and q[i] so determined, determine the configuration of a pipeline that repeats the operation of Equation 5.

[0035] That is, the pipeline consists of n stages, one for each of i = 0 to n−1, each executing the corresponding operation of Equation 5. Because each stage executes the operation of Equation 5 once, it can be built from two registers that hold x[i] and y[i] and two adders or subtractors selected according to the value of δ[i]. The power-of-two operations (down-shifts) by p[i] and q[i] can be realized by the wiring between the registers and the adders or subtractors (connecting them with the corresponding bits offset).

[0036] A DCT device with the configuration determined in this way performs the DCT on N input pixel data at high speed.

【００３７】[0037]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The DCT device according to the present invention is described in detail below with reference to the drawings.

(First Embodiment)

The first embodiment is a DCT device that performs a one-dimensional 8-point DCT at high speed.

(Overall Configuration and Function)

FIG. 1 is a block diagram showing the configuration of the device.

[0038] The device consists of a preprocessing unit 101, three rotation units 102 to 104, and six correction units 105 to 110. f[0] to f[7] are the pixel data input to the device, and F[0] to F[7] are the DCT coefficients it outputs. FIG. 2 is a calculation flow graph showing the operations performed by the device; it corresponds to the calculation flow graph of FIG. 6 for the conventional method.

[0039] The preprocessing unit 101 consists of adders, subtractors, selectors, registers, and so on, and performs the additions/subtractions 201 shown in FIG. 2. As FIG. 7 and FIG. 2 show, the operations in the preprocessing unit 101 are the same as in the conventional method. Specifically, from the eight input pixel data f[0] to f[7], it computes the DCT coefficients F[0] and F[4] and the input data for the rotation units 102 to 104 by simple operations such as addition and subtraction.

[0040] The first rotation unit 102, the second rotation unit 103, and the third rotation unit 104 perform two-dimensional rotation operations at the fixed angles −2π/16, −π/16, and −5π/16, respectively, and correspond to R'(−2π/16), R'(−π/16), and R'(−5π/16) in FIG. 2. The detailed configuration and operation of these three rotation units 102 to 104 are described later.

[0041] The six correction units 105 to 110 consist of multipliers and the like, and perform the multiplications 205 to 210 shown in FIG. 2, respectively. As FIG. 7 and FIG. 2 show, the operations in these six correction units 105 to 110 differ from the conventional method only in the multipliers. k2, k1, and k5 in FIG. 2 are all constants with the following values: k2 = 0.892658, k1 = 0.968133, k5 = 0.684574. Their significance is explained later.

(Configuration and Operation of the Rotation Units)

Next, the components of the device that differ most from a device based on the conventional methods, namely the three rotation units 102 to 104, are described in detail. Because the rotation units 102 to 104 are built from the same kind of circuit, only the second rotation unit 103, which performs the rotation by −π/16 at high speed, is described here.

[0042] FIG. 3 is a circuit diagram showing the configuration of the second rotation unit 103. The second rotation unit 103 consists broadly of five discrete-angle rotation stages 340 to 344 and an output buffer 345, arranged as a pipeline in which the result of each stage is propagated to the next stage by a clock (not shown).

[0043] Each discrete-angle rotation stage consists of two 16-bit registers REGx[i] and REGy[i], an adder ADDx[i] (or ADDy[i]), and a subtractor SUBy[i] (or SUBx[i]). The output buffer 345 consists of two 16-bit registers REGx[5] and REGy[5]. The wires in the figure are basically 16-line buses that connect corresponding bits, but where a number is shown beside a wire, the wire is connected offset by that number of bits in the direction given by its sign. For example, "−2" is written beside the wire connecting the register REGx[0] to the subtractor SUBy[0]; this means that the output of the register REGx[0] is shifted down by 2 bits before it is input to the subtractor SUBy[0].

[0044] The following explains how the second rotation unit 103 configured as above performs a two-dimensional rotation by −π/16 while maintaining 16-bit accuracy. First, on the first clock, the two input data x[0] and y[0] output from the preprocessing unit 101 are stored in the registers REGx[0] and REGy[0].

[0045] Next, on the second clock, the adder ADDx[0] and the subtractor SUBy[0] perform the following operations, and the results are stored in the registers REGx[1] and REGy[1].

REGx[1] = REGx[0] + 2^(-2) · REGy[0]
REGy[1] = REGy[0] − 2^(-2) · REGx[0]

Similarly, on the third clock, the subtractor SUBx[1] and the adder ADDy[1] perform the following operations, and the results are stored in the registers REGx[2] and REGy[2].

[0046]

REGx[2] = REGx[1] − 2^(-4) · REGy[1]
REGy[2] = REGy[1] + 2^(-4) · REGx[1]

In the same way, when the sixth clock has elapsed, the final result of the second rotation unit 103 is stored in the registers REGx[5] and REGy[5].
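The two stages written out above can be sketched with integer shifts. This is a minimal illustration that assumes Equation 5 has the form x[i+1] = x[i] − δ[i]·2^(−p[i])·y[i], y[i+1] = y[i] + δ[i]·2^(−q[i])·x[i], and that the data are held with 8 fractional bits; the helper name `rotation_stage` is hypothetical. Only the first two stages are shown because the shift amounts of the remaining three are not given in this part of the text.

```python
def rotation_stage(x, y, delta, p, q):
    """One hard-wired stage: fixed shifts, delta selects add or subtract."""
    return x - delta * (y >> p), y + delta * (x >> q)

Q = 8                      # assumed number of fractional bits
x0 = y0 = 50 << Q          # (50, 50) in fixed point, 0x3200

# Stage 0: REGx[1] = REGx[0] + 2^-2 * REGy[0], REGy[1] = REGy[0] - 2^-2 * REGx[0]
x1, y1 = rotation_stage(x0, y0, -1, 2, 2)
# Stage 1: REGx[2] = REGx[1] - 2^-4 * REGy[1], REGy[2] = REGy[1] + 2^-4 * REGx[1]
x2, y2 = rotation_stage(x1, y1, +1, 4, 4)

print(hex(x1), hex(y1), hex(x2), hex(y2))
```

Because `delta`, `p`, and `q` are constants for each stage, no comparison or table lookup is needed at run time; in hardware the shifts reduce to wiring. The truncation behaviour of the actual circuit may differ from Python's `>>`, so later stages need not match Tables 2 and 3 bit for bit.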

[0047] Tables 2 and 3 below show the clock-by-clock changes in the values (in hexadecimal) held in the registers REGx[0] to REGx[5] and REGy[0] to REGy[5] when (50, 50) is given as the input data (x[0], y[0]).

【００４８】[0048]

[Table 2]

【００４９】[0049]

[Table 3]

A "−" in the tables indicates that the register holds a value for input data other than (50, 50). The real-number column of Table 2 shows the 16-bit data in the column to its left interpreted (in decimal) as 16-bit fixed-point data with the binary point at the eighth bit from the least significant end.

[0050] The results (REGx[5], REGy[5]) finally obtained in the second rotation unit 103 are sent to the third correction unit 107 and the fourth correction unit 108, respectively, and multiplied by √2·k1. The values obtained by multiplying (REGx[5], REGy[5]) by k1 approximate the theoretical values (x', y'). These values compare with the values C·(x[16], y[16]) obtained by the conventional CORDIC method as follows.

[0051]
(x', y') = (58.793780, 39.284748)
k1·(REGx[5], REGy[5]) = (58.791401, 39.288812)
C·(x[16], y[16]) = (58.787303, 39.286418)
As this comparison shows, the data obtained by this device is closer to the theoretical values than the data obtained by the CORDIC method. That is, although this device executes only five operations, it performs a two-dimensional rotation more accurately than the CORDIC method does with 16 operations.
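The comparison can be reproduced in floating point with a short sketch. Several points are assumptions rather than material from the tables: the stage update x ← x − δ·2^p·y, y ← y + δ·2^p·x is inferred from the REGx[2]/REGy[2] expressions in the text; only the first two (δ, p) pairs are stated in the description, and the remaining three are reconstructed here, not copied from Table 5; and the names `STAGES`, `rotate_shift_add` and `scale_factor` are hypothetical. Because the sketch uses floating point instead of 16-bit registers, its result differs slightly from the fixed-point figures quoted above.

```python
import math

# (delta, p) for each stage of the theta = -pi/16 rotation.
# The first two pairs appear in the description; the last three are ASSUMED.
STAGES = [(-1, -2), (+1, -4), (-1, -6), (+1, -9), (-1, -13)]


def rotate_shift_add(x, y, stages=STAGES):
    """Apply one shift-and-add stage per (delta, p) pair, with q[i] = 0."""
    for delta, p in stages:
        x, y = x - delta * (2.0 ** p) * y, y + delta * (2.0 ** p) * x
    return x, y


def scale_factor(stages=STAGES):
    """k = product of cos(theta[i]), where tan(theta[i]) = 2^p[i]."""
    k = 1.0
    for _, p in stages:
        k *= 1.0 / math.sqrt(1.0 + 4.0 ** p)
    return k


theta = -math.pi / 16
x0, y0 = 50.0, 50.0
xn, yn = rotate_shift_add(x0, y0)
k = scale_factor()
exact = (x0 * math.cos(theta) - y0 * math.sin(theta),
         x0 * math.sin(theta) + y0 * math.cos(theta))
print("shift-add result:", k * xn, k * yn)
print("exact rotation:  ", exact)
```

The alternating signs in `STAGES` are consistent with the adder and subtractor assignment given in the explanation of symbols, which is some support for the reconstruction but not a confirmation of Table 5.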

[0052] As is apparent from the configuration of the second rotation operation unit 103, the components corresponding to the LUT 721, the z calculation unit 722 and the determination unit 723, which the CORDIC method shown in FIG. 7 requires, are unnecessary. Moreover, although in the above example the two-dimensional rotation of one set of input data finished in six clocks, in actual image compression/decompression processing, where many sets of input data arrive one after another, the pipeline effect brings the number of clocks needed per set of input data close to 1. This differs greatly from the conventional device, which always needs 16 clocks per rotation of one set of input data.
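The pipeline effect can be illustrated with a small clock-counting sketch. It assumes a chain of six registers (REG[0] to REG[5]) through which one item advances per clock and into which a new input can be fed every clock; the function names are hypothetical.

```python
def clocks_for_pipeline(num_inputs: int, depth: int = 6) -> int:
    """Clock a chain of `depth` registers until every input reaches the last one."""
    regs = [None] * depth            # regs[0] is the input register
    fed = done = clocks = 0
    while done < num_inputs:
        nxt = fed if fed < num_inputs else None
        if nxt is not None:
            fed += 1
        regs = [nxt] + regs[:-1]     # every register passes its item to the next stage
        clocks += 1
        if regs[-1] is not None:     # an item has reached the final register
            done += 1
    return clocks


def clocks_for_iterative(num_inputs: int, iterations: int = 16) -> int:
    """A non-pipelined unit that spends `iterations` clocks on each input."""
    return num_inputs * iterations


for n in (1, 8, 64, 1024):
    p = clocks_for_pipeline(n)
    print(n, p, round(p / n, 3), clocks_for_iterative(n))
```

For a single input the sketch reports six clocks, matching the example above; for 64 inputs it reports 69 clocks, or about 1.08 clocks per input, against 1024 clocks for the 16-clock iterative unit.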

[0053] Admittedly, another way to realize the CORDIC method in hardware would be to eliminate the control unit 720 in FIG. 7 and connect 16 stages of the discrete angle rotation operation unit 710 in series as a pipeline. Even such a configuration, however, clearly falls far short of the present operation unit 103 in operation speed and circuit compactness, since unit 103 needs no shifter at all and performs a high-precision two-dimensional rotation with only five stages of adder/subtractor circuits.

(Theoretical Consideration)
Next, the process by which the configuration of the second rotation operation unit 103 of the DCT device according to the present invention was determined as shown in FIG. 3, and the configurations of the other rotation operation units 102 and 104, are described.

[0054] First, the following statement is proved: "(x[n], y[n]), obtained by repeating the operation of Equation 5 n times using n values each of δ (δ[0] to δ[n−1]), p (p[0] to p[n−1]) and q (q[0] to q[n−1]) determined so as to satisfy Equations 2, 3 and 4, is related to the theoretical value (x', y') to be obtained as shown in Equation 17 (where x[0] and y[0] have the relationship shown in Equation 18)."

【００５５】[0055]

【数１７】 [Equation 17]

【００５６】[0056]

【数１８】 [Equation 18]

First, let Δθ denote the difference on the left-hand side of Equation 2. That is,

【００５７】[0057]

【数１９】 [Equation 19]

Since Δθ ≈ 0, the theoretical value (x', y') can be expressed as follows.

【００５８】[0058]

【数２０】 [Equation 20]

Transforming Equation 20 using Equation 1 gives

【００５９】[0059]

【数２１】 [Equation 21]

Then, from Equations 18 and 19,

【００６０】[0060]

【数２２】 [Equation 22]

Decomposing this into a product of n matrices gives

【００６１】[0061]

【数２３】 [Equation 23]

Since δ[i] = ±1,

【００６２】[0062]

【数２４】 [Equation 24]

From Equations 3 and 4,

【００６３】[0063]

【数２５】 [Equation 25]

Collecting the k[i] factors gives

【００６４】[0064]

【数２６】 [Equation 26]

From Equations 5 and 6,

【００６５】[0065]

【数２７】 [Equation 27]

Thus Equation 17 has been proved.

Next, the process of determining the specific circuit configuration of each of the rotation operation units 102 to 104 is described. The discrete angle rotation operation unit that forms each stage of the rotation operation units 102 to 104 executes the operation shown in Equation 5. Therefore, once all the required values of δ (δ[0] to δ[n−1]), p (p[0] to p[n−1]) and q (q[0] to q[n−1]) are determined for the three angles θ = −2π/16, −π/16 and −5π/16, the specific circuit configuration of each of the rotation operation units 102 to 104 follows uniquely.

[0066] The process of determining the required n values of δ[i], p[i] and q[i] is described below.
(Step 1) First, so as to secure at least the accuracy of the CORDIC method, the allowable error angle E is set equal to the absolute value of z[16] (= 0.000005), which is the allowable error angle in the CORDIC method.

[0067] That is, E = 0.000005.
(Step 2) Next, let k[i] = cos θ[i]. Setting k[i] in this way simplifies Equations 3 and 4 to Equations 28 and 29, respectively, and makes the comparison with the CORDIC method clear.

【００６８】[0068]

【数２８】 [Equation 28]

【００６９】[0069]

【数２９】 [Equation 29]

From Equation 29, q[i] = 0 for all i.
(Step 3) Next, δ[0] and θ[0] that minimize Δθ[0] (= |θ − δ[0]·θ[0]|) are determined.

[0070] Taking into account δ[i] = ±1 and the condition of Equation 28, in the case of the second rotation operation unit 103 (θ = −π/16), for example, it follows that Δθ[0] is minimized by setting θ[0] = tan^{-1}(2^{-2}) and δ[0] = −1. Thus p[0] (= −2), q[0] (= 0) and δ[0] (= −1) are determined for the second rotation operation unit 103.
(Step 4) Similarly, δ[1] and θ[1] that minimize Δθ[1] (= |θ − δ[0]·θ[0] − δ[1]·θ[1]|) are determined.

[0071] In the case of the second rotation operation unit 103, it follows that Δθ[1] is minimized by setting θ[1] = tan^{-1}(2^{-4}) and δ[1] = 1. Thus p[1] (= −4), q[1] (= 0) and δ[1] (= 1) are determined.
(Step 5) When p[i], q[i] and δ[i] are determined in this way for i = 0, 1, 2, ... in turn, Δθ[i] < E is satisfied for the first time at i = 4 in the case of the second rotation operation unit 103, and the determination process ends.
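Steps 1 to 5 can be sketched as a greedy search, shown below. This is one possible reading of the procedure, not the patent's own implementation: the function name `decompose_angle`, the candidate range for p (0 down to −16) and the stage limit are assumptions. The sketch has been checked only against the θ = −π/16 walk-through above; for other angles it may select a different number of stages than Tables 4 and 6.

```python
import math


def decompose_angle(theta, tolerance=0.000005,
                    p_candidates=range(0, -17, -1), max_stages=32):
    """Greedily pick (delta[i], p[i]) with tan(theta[i]) = 2^p[i] and q[i] = 0.

    At each step the pair that minimizes the remaining angle error is chosen,
    and the search stops as soon as the error drops below `tolerance` (E).
    """
    stages = []
    residual = theta                       # theta minus the angles applied so far
    while abs(residual) >= tolerance and len(stages) < max_stages:
        _, delta, p = min(
            (abs(residual - d * math.atan(2.0 ** p)), d, p)
            for d in (-1, +1)
            for p in p_candidates
        )
        residual -= delta * math.atan(2.0 ** p)
        stages.append((delta, p))
    return stages, residual


stages, residual = decompose_angle(-math.pi / 16)
for i, (delta, p) in enumerate(stages):
    print(f"i={i}: delta={delta:+d}, p={p}")
print("remaining error:", abs(residual))
```

For θ = −π/16 the search ends after five stages (i = 0 to 4), with the first two stages equal to the values stated in Steps 3 and 4.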

[0072] The values of δ[i], p[i] and so on determined in this way are shown in Tables 4, 5 and 6 below.

【００７３】[0073]

【表４】 [Table 4]

【００７４】[0074]

【表５】 [Table 5]

【００７５】[0075]

【表６】 [Table 6]

Table 4 corresponds to the first rotation operation unit 102 (θ = −2π/16), Table 5 to the second rotation operation unit 103 (θ = −π/16), and Table 6 to the third rotation operation unit 104 (θ = −5π/16). These tables also show k[i] and the value of k defined by Equation 6.

[0076] The above explanation shows why the second rotation operation unit 103 has the circuit configuration shown in FIG. 3. The circuit configurations of the first rotation operation unit 102 and the third rotation operation unit 104 can readily be inferred from the configuration of the second rotation operation unit 103 shown in FIG. 3 and the values of δ[i] and p[i] shown in Tables 4 to 6 (q[i] is zero throughout). That is, the first rotation operation unit 102 and the second rotation operation unit 103 are each five-stage pipelines, and the third rotation operation unit 104 is a six-stage pipeline.

[0077] As described above, the six correction units 105 to 110 perform correction using values obtained by multiplying the value used in the conventional method shown in FIG. 6 (= √2) by the constants k2, k1 and k5. This is because, as Equation 17 shows, the theoretical value (x', y') to be obtained is equal (approximately) to the data (x[n], y[n]) output from each rotation operation unit multiplied by the correction value k. The specific values of k2, k1 and k5 are those listed last in the k columns of Tables 4, 5 and 6, respectively, and are determined uniquely at the same time as n, δ[i], p[i] and q[i].

(Second Embodiment)
The second embodiment relates to a DCT device that performs, at high speed, a two-dimensional 8th-order DCT, that is, a DCT on input data of 8 × 8 pixels.

(Overall Configuration and Function)
FIG. 4 is a calculation flow graph showing the overall configuration of this device. Each "1-D DCT" in the figure expands into the calculation flow graph shown in FIG. 5. The calculation flow graph of FIG. 5 is the same as that of FIG. 2 in the first embodiment with the final-stage correction (multiplication by √2·k) removed.

[0078] In FIG. 4, f[0][0] to f[7][7] are the 64 input pixel data, and F[0][0] to F[7][7] are the DCT coefficients obtained by this device. k00 to k77 are constants that satisfy Equation 30.

【００７９】[0079]

【数３０】 [Equation 30]

Whereas a configuration block diagram (FIG. 1) was shown for the first embodiment, it is omitted for this embodiment. Because the correspondence between each constituent block and the calculation flow graph is the same as in the first embodiment, the configuration of this device can readily be derived from the calculation flow graphs of FIGS. 4 and 5.

[0080] That is, the portion of this device corresponding to FIG. 5 (hereinafter, a "1-D DCT module") has the configuration shown in FIG. 1 with the six correction units 105 to 110 removed. As is apparent from FIG. 4, the device as a whole consists of the eight 1-D DCT modules in the left column of the figure (hereinafter, the "front stage"), the eight 1-D DCT modules in the right column (hereinafter, the "rear stage"), and 64 correction units.

[0081] The reason a two-dimensional 8th-order DCT is realized by the device configured as above is as follows. In general, a two-dimensional Nth-order DCT is given by Equation 31.

【００８２】[0082]

【数３１】 [Equation 31]

The two-dimensional Nth-order DCT on the right-hand side of Equation 31 decomposes into a plurality of one-dimensional Nth-order DCTs. That is, it decomposes into a front stage, which performs a one-dimensional Nth-order DCT on each of N sets of input data obtained by dividing the N × N input data into groups of N, and a rear stage, which performs a one-dimensional Nth-order DCT on each of N sets of output data obtained by regrouping the front-stage outputs into groups of N under a fixed relationship.
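The front-stage/rear-stage decomposition can be checked numerically with the sketch below. Equation 31 is not reproduced in this text, so the sketch assumes the common orthonormal DCT-II normalization, which may differ from the patent's constant K; the function names and the test block are hypothetical.

```python
import math


def coeff(k, n):
    """Normalization factor of the (assumed) orthonormal DCT-II."""
    return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)


def dct_1d(v):
    """One-dimensional DCT-II of a sequence of length n."""
    n = len(v)
    return [coeff(u, n) * sum(v[x] * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                              for x in range(n))
            for u in range(n)]


def dct_2d_row_column(block):
    """Front stage: 1-D DCT of each row. Rear stage: 1-D DCT of each column."""
    n = len(block)
    rows = [dct_1d(row) for row in block]                          # front stage
    cols = [dct_1d([rows[y][u] for y in range(n)]) for u in range(n)]  # rear stage
    return [[cols[u][v] for u in range(n)] for v in range(n)]


def dct_2d_direct(block):
    """Two-dimensional DCT evaluated directly as a double sum."""
    n = len(block)
    return [[coeff(u, n) * coeff(v, n) * sum(
                block[y][x]
                * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                * math.cos((2 * y + 1) * v * math.pi / (2 * n))
                for y in range(n) for x in range(n))
             for u in range(n)]
            for v in range(n)]


block = [[float((3 * x + 5 * y * y + x * y) % 17) for x in range(8)] for y in range(8)]
a = dct_2d_row_column(block)
b = dct_2d_direct(block)
max_diff = max(abs(a[v][u] - b[v][u]) for v in range(8) for u in range(8))
print("largest difference between the two methods:", max_diff)
```

The two results agree to within floating-point rounding, which is the property that lets eight row modules followed by eight column modules compute the 8 × 8 transform.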

[0083] Since this device performs a two-dimensional 8th-order DCT, setting K = √2 and N = 8 yields the calculation flow graphs of FIGS. 4 and 5. Hence the configuration shown by these flow graphs realizes a two-dimensional 8th-order DCT. Next, the significance of the 64 correction units of this device is explained.

[0084] As the calculation flow graphs of FIGS. 4 and 5 show, this device is basically equivalent to sixteen devices of the first embodiment put together. As noted above, however, each 1-D DCT module differs from the device of the first embodiment in having no internal correction unit. Using the property F[s·u] = s·F[u], which follows from the one-dimensional Nth-order DCT of Equation 7, the correction that a front-stage 1-D DCT module would otherwise need (for example, multiplication by √2·ki) is merged with the correction that a rear-stage 1-D DCT module would otherwise need (for example, multiplication by √2·kj), so that two corrections are replaced by a single one (for example, multiplication by 2·ki·kj).
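The merging of the two corrections relies only on the linearity of the 1-D transform, which the following sketch illustrates. The scale factors in `SCALE` are made-up stand-ins for √2·ki, and `core_1d` is a plain unscaled cosine-sum transform chosen for illustration, not the flow graph of FIG. 5.

```python
import math

N = 8
# Hypothetical per-output correction factors standing in for sqrt(2)*k_i.
SCALE = [1.0 + 0.1 * i for i in range(N)]


def core_1d(v):
    """Linear 1-D transform with no output correction (plain cosine sums)."""
    return [sum(v[x] * math.cos((2 * x + 1) * u * math.pi / (2 * N)) for x in range(N))
            for u in range(N)]


def two_pass(block, correct_each_pass):
    """Row pass then column pass, optionally correcting the outputs of every pass."""
    def pass_1d(v):
        out = core_1d(v)
        return [SCALE[u] * out[u] for u in range(N)] if correct_each_pass else out

    rows = [pass_1d(row) for row in block]
    cols = [pass_1d([rows[y][u] for y in range(N)]) for u in range(N)]
    return [[cols[u][v] for u in range(N)] for v in range(N)]


block = [[float((7 * x + 3 * y) % 11) for x in range(N)] for y in range(N)]

corrected_twice = two_pass(block, correct_each_pass=True)    # 2 x 64 multiplications
uncorrected = two_pass(block, correct_each_pass=False)
corrected_once = [[SCALE[u] * SCALE[v] * uncorrected[v][u] for u in range(N)]
                  for v in range(N)]                          # 64 multiplications

max_diff = max(abs(corrected_twice[v][u] - corrected_once[v][u])
               for v in range(N) for u in range(N))
print("largest difference:", max_diff)
```

Correcting once at the end with the product of the two factors gives the same coefficients as correcting after each pass, while halving the number of correction multiplications from 128 to 64.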

[0085] Accordingly, in this device the 64 correction units multiply the output data of the rear-stage 1-D DCTs by the correction values shown in Equation 30, so that the two-dimensional 8th-order DCT coefficients are obtained in few processing steps. The detailed operation of each 1-D DCT module is the same as in the first embodiment and is not repeated here.

(Comparison with Conventional Methods)
Next, the types and numbers of operations required for a two-dimensional 8th-order DCT are compared between this device and the conventional methods.

[0086] First, for comparison with the first conventional method, the number of multiplications this device executes when given one block of input data consisting of 8 × 8 pixels is determined. Because of the two "1/√2" factors shown in FIG. 5, each 1-D DCT module requires two multiplications, and the device consists of 16 1-D DCT modules and 64 correction units, so a total of 96 multiplications (= 2 × 16 + 64) is performed.

[0087] This is a further large reduction compared with the first conventional method, which, as described above, reduced 256 multiplications to 176. In standards such as MPEG the value of the constant K in Equation 31 differs from that of this embodiment (K = √2), but this can be handled by scaling the values of k00 to k77 in advance and does not change the number of multiplications required.

[0088] In an image compression device or the like, each DCT coefficient output from the DCT device is divided by its corresponding predetermined step size in the linear quantization that follows. Therefore, merging the processing in the 64 correction units of this DCT device with the division in the linear quantization can also reduce the number of multiplications in the image compression device as a whole.

[0089] Next, for comparison with the CORDIC method, the second conventional method, the types and numbers of operations needed to perform rotation operations for three angles (−2π/16, −π/16, −5π/16) at a 16-bit data word length are determined. Processing common to the conventional method and this device, namely multiplication by the correction value (C or k), is excluded from the comparison.

[0090] In the CORDIC device shown in FIG. 7, each discrete angle rotation operation requires two shifts, three additions/subtractions (for x[i], y[i] and z[i]) and one condition test, so the 48 discrete angle rotation operations require a total of 96 shifts, 144 additions/subtractions and 48 condition tests. In this device, by contrast, the −2π/16 rotation requires 10 additions/subtractions, the −π/16 rotation requires 10, and the −5π/16 rotation requires 12, so only 32 additions/subtractions are needed in total. Moreover, this device does not need the control components required by the CORDIC method, namely the LUT, the z calculation unit and the determination unit.
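The operation counts above follow from simple arithmetic, restated in the sketch below. The stage counts (5, 5 and 6) are those given earlier for the three rotation operation units; the variable names are illustrative only.

```python
CORDIC_ITERATIONS = 16          # iterations per rotation at a 16-bit word length
NUM_ANGLES = 3                  # -2*pi/16, -pi/16 and -5*pi/16

# Operations needed by one CORDIC iteration, as listed in the text.
per_iteration = {"shifts": 2, "add_sub": 3, "condition_tests": 1}
cordic_total = {op: count * CORDIC_ITERATIONS * NUM_ANGLES
                for op, count in per_iteration.items()}

# Pipeline depth of each rotation operation unit; every stage has two adder/subtractors.
stages_per_unit = {"-2pi/16": 5, "-pi/16": 5, "-5pi/16": 6}
device_add_sub = sum(2 * stages for stages in stages_per_unit.values())

print(cordic_total)       # {'shifts': 96, 'add_sub': 144, 'condition_tests': 48}
print(device_add_sub)     # 32
```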

[0091] Thus, with this device, the types and numbers of operations required by the conventional methods are drastically reduced and the circuit configuration becomes compact. This effect is considered to stem essentially from the fact that the device is made up of processes or circuits that perform rotations by predetermined angles rather than by arbitrary angles. The DCT device according to the present invention has been described above on the basis of embodiments, but the invention is of course not limited to these embodiments. That is:
(1) In the first and second embodiments the invention is realized as a logic circuit (hardware), but it may also be realized as a program (software) executed on a general-purpose CPU. A software implementation obtains the same effect as the hardware one, namely a reduction in processing steps.
(2) The first and second embodiments describe devices that perform the DCT, but the same holds for a device that performs the inverse DCT. For example, the two-dimensional inverse DCT is generally expressed by Equation 32, and like the two-dimensional DCT it can be expressed as a calculation flow graph using two-dimensional rotation operations.

【００９２】[0092]

【数３２】 [Equation 32]

(3) The DCT devices of the first and second embodiments perform an 8th-order DCT, but the invention is not limited to this order. If the order N is fixed in advance, the required rotation angles θ follow from N, and hence the circuit configuration of the rotation operation unit R'(θ) that performs the rotation by angle θ is uniquely determined.
(4) The DCT devices of the first and second embodiments perform the DCT defined by Equation 7 and Equation 31, respectively, but a DCT under another definition may be used as long as it can be expressed using two-dimensional rotation operations.
(5) In the first embodiment, n, δ[i], p[i] and q[i] were determined by setting k[i] = cos θ[i] and repeatedly choosing, for each i, the δ[i] and p[i] that minimize Δθ[i], but the invention is not limited to this method. For example, k[i] = sin θ[i] is also conceivable.

[0093] However, it is preferable to determine p[i] and q[i] such that p[i+1] − p[i] ≠ 1 and q[i+1] − q[i] ≠ 1, so that n, δ[i], p[i] and q[i] satisfying Equations 2, 3 and 4 are obtained for a smaller n.
(6) The DCT devices of the first and second embodiments operate on image data, but the invention is not limited to this; the data may be, for example, audio data.

【００９４】[0094]

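[Effects of the Invention] As is apparent from the above description, the discrete cosine transform and inverse discrete cosine transform devices according to the present invention perform the DCT/inverse DCT using two-dimensional rotation operations by predetermined angles (θ), and each two-dimensional rotation is decomposed into the minimum necessary number of rotations by predetermined discrete angles. Each of those rotations is expressed by multiplications by powers of two and additions/subtractions.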
/ Inverse DCT is performed using a two-dimensional rotation operation at a predetermined angle (θ), and the two-dimensional rotation operation is decomposed into a minimum necessary number of rotation operations at a predetermined discrete angle. Each rotation operation is represented by multiplication by a power of 2 and addition / subtraction.

[0095] Consequently, the many multiplications required by the conventional methods are replaced with repetitions of simple logical operations (shifts and additions/subtractions), and a two-dimensional rotation of higher accuracy is performed with fewer operations than the conventional CORDIC method requires, so that high-speed discrete cosine transform and inverse discrete cosine transform devices are realized.

[0096] Further, according to the discrete cosine transform and inverse discrete cosine transform methods of the present invention, the two-dimensional rotation can be decomposed in advance into rotations by predetermined discrete angles under a clear guideline. This makes it easy to determine in advance the optimum types and number of logical operations needed for the two-dimensional rotation.

[0097] Further, according to the discrete cosine transform and inverse discrete cosine transform devices of the present invention, the two-dimensional rotation is implemented as a pipeline that performs simple discrete-angle rotations in sequence. This eliminates the look-up table, comparator and the like required by the conventional CORDIC method and allows the two-dimensional rotation operation unit to be built with a small circuit scale, so that a high-speed and compact DCT/inverse DCT device is realized.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the configuration of a DCT device according to the first embodiment of the present invention.

FIG. 2 is a calculation flow graph showing the one-dimensional 8th-order DCT computation performed by that device.

FIG. 3 is a circuit diagram showing the detailed configuration of the second rotation operation unit 103 of that device.

FIG. 4 is a calculation flow graph showing the two-dimensional 8th-order DCT computation performed by a DCT device according to the second embodiment of the present invention.

FIG. 5 is a calculation flow graph showing in detail the content of "1-D DCT" in the calculation flow graph of FIG. 4.

FIG. 6 is a calculation flow graph showing the one-dimensional 8th-order DCT computation in the prior art.

FIG. 7 is a block diagram showing the configuration when the two-dimensional rotation operation of the conventional CORDIC method is implemented in hardware.

[Explanation of symbols]

101: Pre-processing unit
102 to 104: Rotation operation units
105 to 110: Correction units
340 to 344: Discrete angle rotation operation units
345: Output buffer
300 to 305, 310 to 315: Registers
320, 322, 324, 331, 333: Adders
321, 323, 330, 332, 334: Subtractors

Continuation of the front page
(58) Fields searched (Int.Cl.⁷, DB name): G06F 17/14; H03M 7/30; H04N 1/41; H04N 7/30; JICST file (JOIS)

Claims

(57) [Claims]

1. A discrete cosine transform device that performs a discrete cosine transform in a data compression device by using a two-dimensional rotation operation by a predetermined angle (θ) expressed by Equation 1, comprising: an input data calculation circuit that calculates, from the data to be transformed, input data (x[0], y[0]) to be subjected to the two-dimensional rotation operation; a discrete angle rotation operation circuit that, on the input data (x[0], y[0]) calculated by the input data calculation circuit, executes the discrete-angle rotation operation shown in Equation 5 n times, from i = 0 to i = n−1, using n values each of δ (δ[0] to δ[n−1]), p (p[0] to p[n−1]) and q (q[0] to q[n−1]) that are obtained in advance on the basis of the predetermined angle (θ) and satisfy Equations 2, 3 and 4; and a correction circuit that corrects the output data by calculating the product of the output data (x[n], y[n]) finally obtained by the discrete angle rotation operation circuit and a constant including the predetermined coefficient (k) shown in Equation 6. (Equation 1) Here, (x, y) is the data before rotation, and (x', y') is the theoretical value after rotation. (Equation 2) (Equation 3) (Equation 4) Here, δ[i] = ±1, p[i] and q[i] are integers, n is a positive integer, E is a predetermined allowable error angle, and k[i] is a real number. (Equation 5) (Equation 6)

2. The discrete cosine transform device according to claim 1, wherein: the discrete angle rotation operation circuit comprises n stages of rotation operators (R(0) to R(n−1)) connected in series as a pipeline; the rotation operator R(i) of each stage comprises a register REGx[i] for storing the data x[i], a register REGy[i] for storing the data y[i], and two two-input arithmetic units ALUx[i] and ALUy[i]; the register REGx[i] and the two-input arithmetic units ALUx[i] and ALUy[i] are connected such that the data x[i] output from the register REGx[i] is shifted by q[i] bits and input to one input of the two-input arithmetic unit ALUx[i], and is shifted by p[i] bits and input to one input of the two-input arithmetic unit ALUy[i]; the register REGy[i] and the two-input arithmetic units ALUx[i] and ALUy[i] are connected such that the data y[i] output from the register REGy[i] is shifted by p[i] bits and input to the other input of the two-input arithmetic unit ALUx[i], and is shifted by q[i] bits and input to the other input of the two-input arithmetic unit ALUy[i]; the two-input arithmetic units ALUx[i] and ALUy[i] perform addition or subtraction based on δ[i]; and, except when they belong to the last-stage rotation operator R(n−1), the two-input arithmetic units ALUx[i] and ALUy[i] output their results to the registers REGx[i+1] and REGy[i+1], respectively, of the next-stage rotation operator R(i+1).

3. An inverse discrete cosine transform device that performs an inverse discrete cosine transform in a data decompression device by using a two-dimensional rotation operation by a predetermined angle (θ) expressed by Equation 1, comprising: an input data calculation circuit that calculates, from the data to be transformed, input data (x[0], y[0]) to be subjected to the two-dimensional rotation operation; a discrete angle rotation operation circuit that, on the input data (x[0], y[0]) calculated by the input data calculation circuit, executes the discrete-angle rotation operation shown in Equation 5 n times, from i = 0 to i = n−1, using n values each of δ (δ[0] to δ[n−1]), p (p[0] to p[n−1]) and q (q[0] to q[n−1]) that are obtained in advance on the basis of the predetermined angle (θ) and satisfy Equations 2, 3 and 4; and a correction circuit that corrects the output data by calculating the product of the output data (x[n], y[n]) finally obtained by the discrete angle rotation operation circuit and a constant including the predetermined coefficient (k) shown in Equation 6.

4. The inverse discrete cosine transform device according to claim 3, wherein: the discrete angle rotation operation circuit comprises n stages of rotation operators (R(0) to R(n−1)) connected in series as a pipeline; the rotation operator R(i) of each stage comprises a register REGx[i] for storing the data x[i], a register REGy[i] for storing the data y[i], and two two-input arithmetic units ALUx[i] and ALUy[i]; the register REGx[i] and the two-input arithmetic units ALUx[i] and ALUy[i] are connected such that the data x[i] output from the register REGx[i] is shifted by q[i] bits and input to one input of the two-input arithmetic unit ALUx[i], and is shifted by p[i] bits and input to one input of the two-input arithmetic unit ALUy[i]; the register REGy[i] and the two-input arithmetic units ALUx[i] and ALUy[i] are connected such that the data y[i] output from the register REGy[i] is shifted by p[i] bits and input to the other input of the two-input arithmetic unit ALUx[i], and is shifted by q[i] bits and input to the other input of the two-input arithmetic unit ALUy[i]; the two-input arithmetic units ALUx[i] and ALUy[i] perform addition or subtraction based on δ[i]; and, except when they belong to the last-stage rotation operator R(n−1), the two-input arithmetic units ALUx[i] and ALUy[i] output their results to the registers REGx[i+1] and REGy[i+1], respectively, of the next-stage rotation operator R(i+1).