JPH05300026A

JPH05300026A - Discrete cosine transformer device and inverse discrete cosine transformer device

Info

Publication number: JPH05300026A
Application number: JP19111392A
Authority: JP
Inventors: Mitsuharu Oki; 光晴大木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1992-02-21
Filing date: 1992-07-17
Publication date: 1993-11-12

Abstract

PURPOSE:To simplify the configuration by decreasing the scale of an inner product arithmetic operation circuit and to attain high speed arithmetic operation by decreasing number of times of the arithmetic operation. CONSTITUTION:Data of 8-row 8-column are inputted from an input terminal IN in the order of columns and fed to a 1st 4-degree inner product arithmetic operation circuit 42 via a 1st rearrangement circuit 41. An output of the inner product arithmetic operation circuit 42 is fed to a 2nd octal inner product arithmetic operation circuit 44 via a 64-word 2nd rearrangement circuit 43. An output of the inner product arithmetic operation circuit 44 is fed to a 3rd 4-degree inner product arithmetic operation circuit 45, and an output of the inner product arithmetic operation circuit 45 is fed to an output terminal OUT via the 64-word 3rd rearrangemenent circuit 46.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、デジタル画像処理等に
好適な離散コサイン変換装置及び逆離散コサイン変換装
置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a discrete cosine transform device and an inverse discrete cosine transform device suitable for digital image processing and the like.

【０００２】[0002]

【従来の技術】従来、デジタル画像処理に適した各種の
離散的直交変換が知られており、そのうち、離散的コサ
イン変換（ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｍ
ｓｆｏｒｍ：ＤＣＴ）は帯域圧縮に適し、処理方式が
比較的簡単である。2. Description of the Related Art Conventionally, various discrete orthogonal transforms suitable for digital image processing have been known, and among them, discrete cosine transform (discrete cosine transform) is known.
sform: DCT) is suitable for band compression and has a relatively simple processing method.

【０００３】このＤＣＴはＮ次の場合、第１行のすべて
が１／√２、第２行以下はｃｏｓ｛（２ｘ＋１）ｋπ／２Ｎ｝（ｘ＝０、１・・・Ｎ−１；ｋ＝１・・・Ｎ−１）の要
素からなる行列〔Ｎ〕を用いて、変換及び逆変換（ＩＤ
ＣＴ）が定義され、２次元の場合、次のように表わされ
る。In the DCT of the Nth order, 1 / √2 is all in the first row, and cos {(2x + 1) kπ / 2N} (x = 0, 1 ... N-1; k in the second row and below. = 1 ... N-1) using a matrix [N] consisting of elements
CT) is defined and is two-dimensional, it is expressed as follows.

【０００４】[0004]

【数１】〔Ｙ〕＝〔Ｎ〕〔Ｘ〕^t〔Ｎ〕[Equation 1] [Y] = [N] [X] ^t [N]

【数２】〔Ｘ〕＝^t〔Ｎ〕〔Ｙ〕〔Ｎ〕[Equation 2] [X] = ^t [N] [Y] [N]

【０００５】なお、行列の規模が２^N行２^N列のとき、
数１式には１／２^N+1の係数が掛るが、Ｎ＋１ビットの
データシフトと等価であるから、この係数の記載は省略
する。When the matrix size is 2 ^N rows and 2 ^N columns,
A coefficient of 1/2 ^{N + 1 is applied to} the equation ( ¹ ), but since it is equivalent to a data shift of N + 1 bits, the description of this coefficient is omitted.

【０００６】ところで、数１、数２式のような行列デー
タの乗算には、図２４に示すような、内積演算回路と並
べ替え回路とからなる乗算装置が従来用いられていた。
図２４において、１０、２０は内積演算回路であって、
簡単のために、いずれも４行４列の規模の行列に対応す
る４次構成とされ、並べ替え回路３０を介して接続され
る。By the way, for the multiplication of the matrix data represented by the equations (1) and (2), a multiplication device including an inner product calculation circuit and a rearrangement circuit as shown in FIG. 24 has been conventionally used.
In FIG. 24, 10 and 20 are inner product arithmetic circuits,
For simplification, each has a quaternary configuration corresponding to a matrix of a size of 4 rows and 4 columns, and is connected via a rearrangement circuit 30.

【０００７】即ち、端子ＩＮから次の数３式のようなデ
ータ行列〔Ｘ〕が入力され、一方の内積演算回路１０に
おいて、数４式のような係数行列〔Ａ〕との内積演算が
行なわれる。That is, a data matrix [X] such as the following formula 3 is input from the terminal IN, and one inner product calculating circuit 10 performs an inner product calculation with the coefficient matrix [A] such as the formula 4. Be done.

【０００８】[0008]

【数３】 [Equation 3]

【数４】 [Equation 4]

【０００９】内積演算回路１０は、３個の単位遅延器１
１₁、１１₂、１１₃ が逆順に縦続接続されて、その出
力端、両接続中点及び入力端に４個のラッチ１２₁、１
２₂、１２₃ 及び１２₄ がそれぞれ接続され、各ラッチ
１２₁〜１２₄にそれぞれ縦続する乗算器１３₁〜１３
₄に係数ＲＯＭ１４₁〜１４₄がそれぞれ接続され、各
乗算器１３₁〜１３₄の出力が加算器１５に接続され
て、有限インパルス応答（ＦｉｎｉｔｅＩｍｐｕｌｓ
ｅＲｅｓｐｏｎｓｅ：ＦＩＲ）型のトランスバーサ
ルフィルタ構成とされる。The inner product calculation circuit 10 includes three unit delay units 1
1 ₁ , 11 ₂ and 11 ₃ are cascade-connected in reverse order, and four latches 12 ₁ and 1 are provided at the output end, both connection midpoints and the input end.
Multipliers 13 _{1 to} 13 to which 2 ₂ , 12 ₃ and 12 ₄ are respectively connected and cascaded to the respective latches 12 _{1 to} 12 ₄
₄ the coefficient ROM 14 ₁ to 14 ₄ are respectively connected, the outputs of the multipliers _{_131-134} are connected to an adder 15, a finite impulse response (Finite Impuls
e Response (FIR) type transversal filter configuration.

【００１０】同様に、内積演算回路２０もＦＩＲ型トラ
ンスバーサルフィルタ構成とされ、対応する各要素の符
号の十の位の数字を「２」に替えて重複説明を省略す
る。ただし、ＲＯＭ２４₁〜２４₄に格納される係数ｂ
_ijがＲＯＭ１４₁〜１４₄の係数ａ_ijと異なる。Similarly, the inner product calculation circuit 20 also has a FIR type transversal filter configuration, and the tens digit of the code of each corresponding element is changed to "2" to omit duplicated description. However, the coefficient b stored in the ROMs 24 _{1 to} 24 ₄
_ij is different from the coefficients a _ij of the ROMs 14 _{1 to} 14 ₄ .

【００１１】並べ替え回路３０は１対のＲＡＭ３１及び
３２と、入力側及び出力側の切換スイッチ３３及び３４
とで構成され、両スイッチ３３及び３４は、１対のＲＡ
Ｍ３１及び３２の一方にデータが書き込まれる期間に、
他方からデータが読み出されるように連動して切り換え
られる。ＲＡＭ３１及び３２の容量は、前述のような４
行４列の規模の行列に対応して、それぞれ１６ワードと
される。The rearrangement circuit 30 includes a pair of RAMs 31 and 32, and input side and output side changeover switches 33 and 34.
And both switches 33 and 34 have a pair of RAs.
During the period when data is written to one of M31 and M32,
The data is read from the other side so that the data is read in conjunction with each other. The RAMs 31 and 32 have a capacity of 4 as described above.
There are 16 words each corresponding to a matrix having a size of 4 rows.

【００１２】次に、図２５を参照しながら、図２４の従
来例の行列データ乗算について説明する。Next, the conventional matrix data multiplication of FIG. 24 will be described with reference to FIG.

【００１３】入力端子ＩＮから、図２５のＡに示すよう
な16ワード単位の入力行列〔Ｘ〕のデータａが第１列
（ｘ₁₁、ｘ₂₁、ｘ₃₁、ｘ₄₁）〜第４列（ｘ₁₄、ｘ₂₄、ｘ
₃₄、ｘ ₄₄）の順序で供給される。From the input terminal IN, as shown in FIG.
Data a of the input matrix [X] in units of 16 words is the first column
(X₁₁, X_{twenty one}, X₃₁, X₄₁) -4th column (x₁₄, X_{twenty four}, X
₃₄, X ₄₄) In order.

【００１４】単位データの入力開始時点ｔ₀から３サイ
クル分の時間３Ｔが経過したｔ₁時点では、単位遅延器
１１₁、１１₂及び１１₃の各出力端に第１列のデータ
ｘ₁₁、ｘ₂₁及びｘ₃₁が存在すると共に、４番目のデータ
ｘ₄₁が遅延器１１₃の入力端に存在する。[0014] In t ₁ when the time 3T for three cycles from the input start point t ₀ has elapsed the unit data, the unit delay 11 _1, 11 ₂ and 11 ₃ of the data x ₁₁ of the first column to the output terminals, x ₂₁ and x ₃₁ are present, and the fourth data x ₄₁ is present at the input of the delay device 11 ₃ .

【００１５】この状態で、各ラッチに共通のイネイブル
パルスが供給されて、第１列の４個のデータｘ₁₁、
ｘ₂₁、ｘ₃₁及びｘ₄₁が４個のラッチ１２₁、１２₂、１
２₃及び１２₄ にそれぞれ取り込まれ、図２５のＢ、
Ｄ、Ｆ及びＨに示すように、入力開始時点ｔ₀から４Ｔ
時間経過後のｔ₂時点から４Ｔ時間にわたって保持され
る。In this state, a common enable pulse is supplied to each latch, and the four data x ₁₁ in the first column,
x ₂₁ , x ₃₁ and x ₄₁ have four latches 12 ₁ , 12 ₂ , 1
2 ₃ and 12 ₄ , respectively, and in FIG. 25B,
As shown in D, F, and H, 4T from the input start time t ₀
It is held for 4T hours from time t ₂ after the passage of time.

【００１６】ＲＯＭ１４₁、１４₂、１４₃及び１４₄
には係数行列〔Ａ〕の各列の係数ａ _i1、ａ_i2、ａ_i3及び
ａ_i4（ｉ＝１、２、３、４）が格納されており、同図の
Ｃ、Ｅ、Ｇ及びＪに示すように、ｔ₂時点以後の１サイ
クルごとに、対応する乗算器１３₁、１３₂、１３₃及
び１３₄に順次供給され、それぞれ対応するラッチ１２
₁、１２₂、１２₃及び１２₄に保持された第１列のデ
ータｘ_i1（ｉ＝１、２、３、４）と乗算される。ROM14₁, 14₂, 14₃And 14_Four
Is the coefficient a of each column of the coefficient matrix [A]. _i1, A_i2, A_i3as well as
a_i4(I = 1, 2, 3, 4) is stored, and
As shown in C, E, G and J, t₂1 size after time
Corresponding multiplier 13 for each clou₁, 13₂, 13₃Over
And 13_FourAre sequentially supplied to the corresponding latches 12
₁, 12₂, 12₃And 12_FourThe first row of data held by
Data x_i1It is multiplied by (i = 1, 2, 3, 4).

【００１７】即ち、ｔ₂時点以降の１、２、３及び４番
目の各サイクルで、係数行列の１、２、３及び４行の係
数ａ_1j、ａ_2j、ａ_3j及びａ_4j（ｊ＝１、２、３、４）が
入力行列の第１列のデータｘ₁₁、ｘ₂₁、ｘ₃₁及びｘ₄₁と
乗算される。That is, in the 1st, _2nd , 3rd and 4th cycles after time t ₂ , the coefficients a _1j , a _2j , a _3j and a _4j (j = 1, 2, 3, 4) is multiplied with the first column of data x ₁₁ , x ₂₁ , x ₃₁ and x ₄₁ of the input matrix.

【００１８】加算器１５において、各乗算器１３₁〜１
３₄の出力が加算されて、同図のＫに示すように、ｔ₂
時点以降の４サイクルで次の数５式に示すような積の行
列〔Ｕ〕の第１列のデータｕ₁₁、ｕ₂₁、ｕ₃₁及びｕ₄₁が
得られる。In the adder 15, each of the multipliers 13 ₁ to 1
3 ₄ output is addition of, as shown in the K in the figure, t ₂
The data u ₁₁ , u ₂₁ , u ₃₁ and u ₄₁ in the first column of the product matrix [U] as shown in the following equation 5 are obtained in four cycles after the time point.

【００１９】[0019]

【数５】〔Ｕ〕＝〔Ａ〕〔Ｘ〕[Equation 5] [U] = [A] [X]

【００２０】一方、同図のＡに示すように、ｔ₂時点で
行列〔Ｘ〕の第２列のデータｘ₁₂、ｘ₂₂、ｘ₃₂及びｘ₄₂
の入力が開始されて、前述と同様に、ｔ₂時点から４Ｔ
時間後の時点ｔ₃では、第２列のデータｘ₁₂、ｘ₂₂、ｘ
₃₂及びｘ₄₂がそれぞれラッチ１２₁、１２₂、１２₃及
び１２₄ にラッチされている。また、ｔ₃時点以降の１
サイクルごとに、ＲＯＭ１４₁、１４₂、１４₃及び１
４₄から、前述と同様に、行列〔Ａ〕の各列の係数
ａ_i1、ａ_i2、ａ_i3及びａ_i4（ｉ＝１、２、３、４）が順
次出力される。On the other hand, as shown in A of the figure, the data x ₁₂ , x ₂₂ , x ₃₂ and x ₄₂ of the second column of the matrix [X] at time t _2.
Is started, and 4T from the time point t ₂ as described above.
At time t ₃ after time, the data x ₁₂ , x ₂₂ , x in the second column
₃₂ and x ₄₂ are latched in latches 12 ₁ , 12 ₂ , 12 ₃ and 12 ₄ , respectively. Also, 1 after t ₃
ROM 14 ₁ , 14 ₂ , 14 ₃ and 1 for each cycle
4 ₄ in the same manner as described above, the coefficient a _i1 each column of the matrix [A], a _i2, a _i3 and a _i4 (i = 1,2,3,4) are sequentially output.

【００２１】以下前述と同様にして、ｔ₃時点以降の４
サイクルで前出数５式に示すような積の行列〔Ｕ〕の第
２列のデータｕ₁₂、ｕ₂₂、ｕ₃₂及びｕ₄₂が得られる。Thereafter, in the same manner as described above, 4 after time t ₃
In the cycle, the data u ₁₂ , u ₂₂ , u _32, and u ₄₂ in the second column of the product matrix [U] as shown in the above equation 5 are obtained.

【００２２】以下同様にして、次のｔ₄時点以降の４サ
イクルで、積の行列〔Ｕ〕の第３列のデータｕ₁₃〜ｕ
₄₃が得られ、その次のｔ₅時点以降の４サイクルで、積
の行列〔Ｕ〕の第４列のデータｕ₁₄〜ｕ₄₄が得られる。In the same manner, the data u ₁₃ to u in the third column of the product matrix [U] are obtained in four cycles after the next time t _4.
₄₃ is obtained, and the data u _{14 to} u _{44 in} the fourth column of the product matrix [U] are obtained in the next 4 cycles after the time point t ₅ .

【００２３】上述のようにして得られた行列〔Ｕ〕の１
６ワードの列順のデータは並べ替え回路３０のＲＡＭ３
１及び３２に交互に書き込まれる。書き込み時のアドレ
スと読み出し時のアドレスとを変えることにより、ＲＡ
Ｍ３１及び３２から行順で交互に読出された行列〔Ｕ〕
のデータが第２の内積演算回路２０に供給され、上述と
全く同様にして、第２の係数行列〔Ｂ〕と乗算されて、
次の数６式で表わされる積の行列〔Ｙ〕のデータが端子
ＯＵＴに導出される。1 of the matrix [U] obtained as described above
The 6-word column-order data is the RAM 3 of the rearrangement circuit 30.
Alternately written to 1 and 32. RA is changed by changing the write address and the read address.
Matrix alternately read from M31 and M32 in row order [U]
Is supplied to the second inner product arithmetic circuit 20 and is multiplied by the second coefficient matrix [B] in the same manner as described above.
The data of the product matrix [Y] represented by the following equation 6 is derived at the terminal OUT.

【００２４】[0024]

【数６】〔Ｙ〕＝〔Ｕ〕〔Ｂ〕＝〔Ａ〕〔Ｘ〕〔Ｂ〕[Equation 6] [Y] = [U] [B] = [A] [X] [B]

【００２５】[0025]

【発明が解決しようとする課題】ところで、行列の規模
が８行８列の場合、数１式の定数行列〔Ｎ〕は、次の数
７式のように表される。By the way, when the scale of the matrix is 8 rows and 8 columns, the constant matrix [N] of the equation 1 is expressed by the following equation 7.

【００２６】[0026]

【数７】ここに、要素ａ〜ｎは、図２６に示すように、角度π／
16を単位とする所定角の余弦である。[Equation 7] Here, the elements a to n have an angle of π /, as shown in FIG.
It is a cosine with a specified angle in units of 16.

【００２７】また、ＤＣＴ及びＩＤＣＴを定義する数１
式から明らかなように、行列〔Ｙ〕の要素ｙ_ijは行列
〔Ｘ〕の要素ｘ_ijの１次式で表現される。Also, the number 1 that defines DCT and IDCT
As is clear from the equation, the element y _ij of the matrix [Y] is expressed by a linear equation of the element x _ij of the matrix [X].

【００２８】従って、図２７に示すように、８行８列の
要素ｘ₁₁〜ｘ₈₈が列順に入力されて６４次のベクトルと
なる〔Ｘc 〕と、８行８列の要素ｙ₁₁〜ｙ₈₈が列順に出
力されて６４次のベクトルとなる〔Ｙc 〕との間には、
次の数８式で表される関係が成立する。Therefore, as shown in FIG. 27, elements x _{11 to} x ₈₈ of 8 rows and 8 columns are input in column order to form a vector of 64th order [Xc], and elements y _{11 to} y of 8 rows and 8 columns. ₈₈ is output in column order and becomes a 64th order vector [Yc]
The relationship expressed by the following equation 8 is established.

【００２９】[0029]

【数８】〔Ｙc 〕＝〔Ｍ〕〔Ｘc 〕ここに〔Ｍ〕は６４行６４列の定数行列である。## EQU8 ## [Yc] = [M] [Xc] where [M] is a constant matrix of 64 rows and 64 columns.

【００３０】ところが、前述のような従来の行列データ
乗算装置では、この数８式の演算を行う場合、例えば６
４次の内積演算回路を用いて一挙に計算するため、回路
規模が膨大になり、構成が複雑になると共に、演算回数
が多くなって演算速度が制約されるという問題があっ
た。However, in the conventional matrix data multiplication device as described above, when the operation of the equation 8 is performed, for example, 6
Since the calculation is performed all at once using the fourth-order inner product arithmetic circuit, there are problems that the circuit scale becomes enormous, the configuration becomes complicated, the number of arithmetic operations increases, and the arithmetic speed is restricted.

【００３１】かかる点に鑑み、この発明の目的は、回路
規模が小さく、構成が簡単であると共に、演算回数が減
少して高速演算が可能な行列データ乗算装置を提供する
ところにある。In view of the above points, an object of the present invention is to provide a matrix data multiplication device which has a small circuit scale, a simple structure, and a reduced number of operations, which enables high-speed operations.

【００３２】[0032]

【課題を解決するための手段】この発明の第１の手段
は、行列の内積を演算する内積演算回路と、行列のデー
タ成分を所定の順序に並べ替える並べ替え回路とを備え
る離散コサイン変換装置において、係数が＋１及び−１
である４次の第１の内積演算回路４２と、係数が０、＋
１及び−１である８次の第２の内積演算回路４４と、定
数行列のデータ成分が格納されたメモリを含む第３の内
積演算回路４５とを設け、８行８列の入力データを第１
の並べ替え回路４１を介して上記第１の内積演算回路に
供給し、上記第１の内積演算回路の出力を第２の並べ替
え回路４３を介して上記第２の内積演算回路に供給し、
上記第２の内積演算回路の出力を直接に上記第３の内積
演算回路に供給すると共に、上記第３の内積演算回路の
出力を第３の並べ替え回路４６を介して導出するように
したことを特徴とする離散コサイン変換装置である。A first means of the present invention is a discrete cosine transform device including an inner product arithmetic circuit for calculating the inner product of a matrix and a rearrangement circuit for rearranging the data components of the matrix in a predetermined order. Where the coefficients are +1 and -1
And a coefficient of 0, +
An 8th-order second inner product arithmetic circuit 44 of 1 and -1 and a third inner product arithmetic circuit 45 including a memory in which the data components of the constant matrix are stored are provided, and the input data of 8 rows and 8 columns is 1
Is supplied to the first inner product arithmetic circuit via the rearrangement circuit 41, and the output of the first inner product arithmetic circuit is supplied to the second inner product arithmetic circuit via the second rearrangement circuit 43.
The output of the second inner product arithmetic circuit is directly supplied to the third inner product arithmetic circuit, and the output of the third inner product arithmetic circuit is derived via the third rearrangement circuit 46. Is a discrete cosine transform device.

【００３３】この発明の第２の手段は、行列の内積を演
算する内積演算回路と、行列のデータ成分を所定の順序
に並べ替える並べ替え回路とを備える逆離散コサイン変
換装置において、定数行列のデータ成分が格納されたメ
モリを含む第４の内積演算回路７２と、係数が０、＋１
及び−１である８次の第５の内積演算回路７３と、係数
が＋１及び−１である４次の第６の内積演算回路７５と
を設け、８行８列の入力データを第４の並べ替え回路７
１を介して上記第４の内積演算回路に供給し、上記第４
の内積演算回路の出力を直接に上記第５の内積演算回路
に供給し、上記第５の内積演算回路の出力を第５の並べ
替え回路７４を介して上記第６の内積演算回路に供給す
ると共に、上記第６の内積演算回路の出力を第６の並べ
替え回路７６を介して導出するようにしたことを特徴と
する逆離散コサイン変換装置である。The second means of the present invention is an inverse discrete cosine transform device comprising an inner product calculating circuit for calculating the inner product of a matrix and a rearrangement circuit for rearranging the data components of the matrix in a predetermined order. A fourth inner product calculation circuit 72 including a memory in which data components are stored, and coefficients 0, +1
And an −1st fifth inner product arithmetic circuit 73 of −1 and a fourth sixth inner product arithmetic circuit 75 of coefficients +1 and −1 are provided, and input data of 8 rows and 8 columns is input to the fourth Rearrangement circuit 7
1 to the fourth inner product calculation circuit,
Of the inner product arithmetic circuit is directly supplied to the fifth inner product arithmetic circuit, and the output of the fifth inner product arithmetic circuit is supplied to the sixth inner product arithmetic circuit via the fifth rearrangement circuit 74. In addition, the inverse discrete cosine transform device is characterized in that the output of the sixth inner product calculating circuit is derived via the sixth rearrangement circuit 76.

【００３４】この発明の第３の手段は、行列の内積を演
算する内積演算回路と、行列のデータ成分を所定の順序
に並べ替える並べ替え回路とを備える離散コサイン変換
装置において、シリアルに供給される行列データを所定
個毎に並列化する並列化回路８１と、係数が＋１及び−
１である４次の第１の内積演算回路と、係数が０、＋１
及び−１である８次の第２の内積演算回路と、定数行列
のデータ成分が格納されたメモリを含む第３の内積演算
回路とを設け、上記第１、第２、第３の内積演算回路を
それぞれ上記所定個並列に配し、８行８列の入力データ
を第１の並べ替え回路４１を介して上記並列化回路に供
給し、上記並列化回路から出力された並列データの各デ
ータを上記所定個のそれぞれの第１の内積演算回路（加
算回路４２′₁〜４２′₄）に供給し、上記各第１の内
積演算回路の出力を直接に上記所定個の内の対応する上
記第２の内積演算回路４４₁〜４４₄に供給し、上記各
第２の内積演算回路の出力を直接に上記所定個の内の対
応する上記第３の内積演算回路４５₁〜４５₄に供給す
ると共に、上記所定個の第３の内積演算回路の出力をシ
リアルデータに変換（回路８２）した後第３の並べ替え
回路４６を介して導出するようにしたことを特徴とする
離散コサイン変換装置である。A third means of the present invention is serially supplied to a discrete cosine transform device provided with an inner product calculating circuit for calculating the inner product of a matrix and a rearrangement circuit for rearranging the data components of the matrix in a predetermined order. Parallelizing circuit 81 for parallelizing a predetermined number of matrix data, and coefficients of +1 and −
Quaternary first inner product arithmetic circuit with 1 and coefficients 0, +1
And an −1st second inner product arithmetic circuit that is −1 and a third inner product arithmetic circuit that includes a memory in which the data components of the constant matrix are stored, and the first, second, and third inner product arithmetic operations are performed. Each of the predetermined number of circuits is arranged in parallel, and input data of 8 rows and 8 columns is supplied to the parallelization circuit through the first rearrangement circuit 41, and each data of the parallel data output from the parallelization circuit. Is supplied to each of the predetermined first inner product arithmetic circuits (adding circuits 42 ' _{1 to} 42' ₄ ), and the output of each of the first inner product arithmetic circuits directly corresponds to the above-mentioned corresponding one of the predetermined number. The second inner product arithmetic circuits 44 _{1 to} 44 ₄ are supplied, and the outputs of the respective second inner product arithmetic circuits are directly supplied to the corresponding third inner product arithmetic circuits 45 _{1 to} 45 ₄ of the predetermined number. At the same time, the outputs of the predetermined number of third inner product arithmetic circuits are converted into serial data. The discrete cosine transform device is characterized in that after (circuit 82), it is derived through a third rearrangement circuit 46.

【００３５】この発明の第４の手段は、行列の内積を演
算する内積演算回路と、行列のデータ成分を所定の順序
に並べ替える並べ替え回路とを備える逆離散コサイン変
換装置において、シリアルに供給される行列データを所
定個毎に並列化する並列化回路９１と、定数行列のデー
タ成分が格納されたメモリを含む第４の内積演算回路
と、係数が０、＋１及び−１である８次の第５の内積演
算回路と、係数が＋１及び−１である４次の第６の内積
演算回路とを設け、上記第４、第５、第６の内積演算回
路をそれぞれ上記所定個並列に配し、８行８列の入力デ
ータを第４の並べ替え回路７１を介して上記並列化回路
に供給し、上記並列化回路から出力された並列データの
各データを上記所定個の内の対応する上記第４の内積演
算回路７２ ₁〜７２₄に供給し、上記各第４の内積演算
回路の出力を直接に上記所定個の内の対応する上記第５
の内積演算回路７３₁〜７３₄に供給し、上記各第５の
内積演算回路の出力を直接に上記所定個の内の対応する
上記第６の内積演算回路に供給すると共に、上記所定個
の第６の内積演算回路（加算回路７５′₁〜７５′₄）
の出力をシリアルデータに変換（回路９２）した後第６
の並べ替え回路７６を介して導出するようにしたことを
特徴とする逆離散コサイン変換装置である。The fourth means of the present invention is to calculate the inner product of matrices.
The inner product calculation circuit that performs the calculation and the data components of the matrix in a predetermined order
Inverse discrete cosine transformation with a rearrangement circuit for rearranging
In the conversion device, the queue data serially supplied is stored.
A parallelization circuit 91 that parallelizes every fixed number and a constant matrix data
Fourth inner product arithmetic circuit including a memory storing data components
And an eighth-order fifth dot product with coefficients 0, +1 and -1
Arithmetic circuit and sixth inner product of order 4 with coefficients +1 and -1
An arithmetic circuit is provided, and the fourth, fifth, and sixth inner product arithmetic operations are performed.
Each of the above paths is arranged in parallel and the input data of 8 rows and 8 columns is arranged.
Data via the fourth rearrangement circuit 71.
Of the parallel data output from the parallel circuit
Each data is assigned to the corresponding fourth inner product of the predetermined number
Arithmetic circuit 72 ₁~ 72_FourAnd the above fourth inner product calculation
The output of the circuit directly corresponds to the fifth of the predetermined ones.
Inner product arithmetic circuit 73₁~ 73_FourTo each of the above 5th
The output of the dot product arithmetic circuit directly corresponds to the above predetermined number
In addition to supplying to the sixth inner product calculating circuit,
Sixth inner product calculating circuit (adding circuit 75 '₁~ 75 '_Four)
6th after converting the output of serial number into serial data (circuit 92)
The rearrangement circuit 76 of FIG.
It is a featured inverse discrete cosine transform device.

【００３６】[0036]

【作用】この発明によれば、内積演算回路の規模を小さ
くし、構成を簡単にすると共に、演算回数を減少させて
高速演算を可能にすることができる。According to the present invention, it is possible to reduce the scale of the inner product arithmetic circuit, simplify the configuration, and reduce the number of arithmetic operations to enable high-speed arithmetic.

【００３７】[0037]

【実施例】以下、図１〜図１９を参照しながら、この発
明による行列データ乗算装置の一実施例について説明す
る。すなわちこの発明の一実施例の構成を図１に示し、
その要部の構成を図２及び図３に示す。DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the matrix data multiplication device according to the present invention will be described below with reference to FIGS. That is, the structure of one embodiment of the present invention is shown in FIG.
The structure of the main part is shown in FIGS.

【００３８】図１において、入力端子ＩＮから８行８列
のデータが、前出図２７のベクトル〔Ｘc 〕に示すよう
に、列順で入力され、６４ワードの第１の並べ替え回路
４１を介して、４次の第１の内積演算回路４２に供給さ
れる。この内積演算回路４２の出力は、６４ワードの第
２の並べ替え回路４３を介して、８次の第２の内積演算
回路４４に供給される。内積演算回路４４の出力が４次
の第３の内積演算回路４５に供給され、内積演算回路４
５の出力は６４ワードの第３の並べ替え回路４６を介し
て、出力端子ＯＵＴに導出される。In FIG. 1, data of 8 rows and 8 columns is input from the input terminal IN in column order as shown in the vector [Xc] of FIG. 27, and the first rearrangement circuit 41 of 64 words is input. It is supplied to the first-order inner product calculation circuit 42 of the fourth order via the. The output of the inner product arithmetic circuit 42 is supplied to the 8th-order second inner product arithmetic circuit 44 via the second rearrangement circuit 43 of 64 words. The output of the inner product calculating circuit 44 is supplied to the third inner product calculating circuit 45 of the fourth order,
The output of 5 is led to the output terminal OUT via the third rearrangement circuit 46 of 64 words.

【００３９】後述のように、第１の内積演算回路４２の
係数は、＋１及び−１だけであり、第２の内積演算回路
４４の係数は、０、＋１及び−１だけである。また、第
３の内積演算回路４５の係数はＤＣＴに特有の値とな
る。As will be described later, the coefficients of the first inner product calculating circuit 42 are only +1 and -1, and the coefficients of the second inner product calculating circuit 44 are only 0, +1 and -1. Further, the coefficient of the third inner product calculating circuit 45 has a value peculiar to DCT.

【００４０】図２において、５０は４次の内積演算回路
であって、図１の内積演算回路４２に相当し、３個の単
位遅延器５１₁、５１₂、５１₃ が逆順に縦続接続され
て、その出力端、両接続中点及び入力端に４個のラッチ
５２₁、５２₂、５２₃、５２₄ がそれぞれ接続され
る。ラッチ５２₁ 〜５２₄ の出力が、それぞれスイッチ
５３₁〜５３₄の＋側接点に供給されると共に、２の補
数回路５４₁ 〜５４₄ を介して、スイッチ５３₁〜５３
₄の−側接点にそれぞれ供給される。スイッチ５３₁〜
５３₄の各出力が加算器５５に供給される。In FIG. 2, a fourth-order inner product calculating circuit 50 corresponds to the inner product calculating circuit 42 of FIG. 1, and three unit delay devices 51 ₁ , 51 ₂ , 51 ₃ are cascaded in reverse order. Thus, four latches 52 ₁ , 52 ₂ , 52 ₃ , 52 ₄ are connected to the output terminal, both connection midpoints and the input terminal, respectively. The output of latch 52 ₁ to 52 ₄ is supplied to a + side contact of the switch 53 ₁ to 53 ₄ respectively, via the two's complement circuit 54 ₁ to 54 _4, the switch 53 ₁ to 53
It is supplied to each of the ₄ negative contacts. Switch 53 ₁ ~
Each output of 53 ₄ is supplied to the adder 55.

【００４１】各スイッチ５３₁〜５３₄は、各補数回路
５４₁ 〜５４₄ と共に係数が＋１、−１だけの乗算器を
構成し、システム制御回路５６により互いに独立に切り
換えられる。Each of the switches 53 _{1 to} 53 ₄ constitutes a multiplier having coefficients of +1 and −1 together with each of the complement circuits 54 _{1 to} 54 ₄ , and they are independently switched by the system control circuit 56.

【００４２】また、２の補数回路５４₁ 〜５４₄ は、周
知のように、否定回路と加算回路とで構成される。As is well known, the 2's complement circuits 54 _{1 to} 54 ₄ are composed of a negation circuit and an addition circuit.

【００４３】図３において、６０は８次の内積演算回路
であって、図１の内積演算回路４４に相当し、入力ＩＮ
が８個の切換スイッチ６１₁〜６１₈の各第１の接点に
供給されると共に、８個の補数回路６２₁〜６２₈を介
して、スイッチ６１₁〜６１ ₈の各第２の接点に供給さ
れる。スイッチ６１₁〜６１₈の第３の接点には係数０
がそれぞれ供給され、スイッチ６１₁〜６１₈の各出力
がそれぞれ加算器６３ ₁〜６３₈に供給される。In FIG. 3, reference numeral 60 designates an eighth-order inner product arithmetic circuit.
Which corresponds to the inner product calculation circuit 44 of FIG.
There are eight changeover switches 61₁~ 61₈To each first contact of
8 complement circuits 62 as supplied₁~ 62₈Through
Then switch 61₁~ 61 ₈Supplied to each second contact of
Be done. Switch 61₁~ 61₈The third contact of is a coefficient 0
Are supplied respectively, and the switch 61₁~ 61₈Each output of
Are adders 63 ₁~ 63₈Is supplied to.

【００４４】これらの加算器６３₁〜６３₈の各出力が
それぞれスイッチ６４₁〜６４₈を通じて１６個のラッ
チ６５₀、６５₁〜６５₁₅に供給される。これらの各１
対のラッチ６５₀、６５₁；６５₂、６５₃・・・６５
₁₄、６５₁₄の出力が８個の切換スイッチ６６₁、６６₂
・・・６６₈の各１対の接点に供給される。スイッチ６
６₁〜６６₈の各出力がそれぞれ加算器６３₁〜６３₈
に供給される。The outputs of the adders 63 _{1 to} 63 ₈ are supplied to 16 latches 65 ₀ and 65 _{1 to} 65 ₁₅ through switches 64 ₁ to 64 _8, respectively. Each one of these
Pair of latches 65 ₀ , 65 ₁ ; 65 ₂ , 65 ₃ ... 65
The outputs of ₁₄ and 65 ₁₄ are eight changeover switches 66 ₁ and 66 ₂
It is supplied to the contacts of each pair of ... 66 _8. Switch 6
6 _1-66 sum the outputs of ₈ each 63 _1-63 ₈
Is supplied to.

【００４５】またラッチ６５₀〜６５₁₅の各出力がそれ
ぞれパラレル／シリアル変換器６７に供給される。この
変換器６７から出力端子ＯＵＴが導出される。The outputs of the latches 65 _{0 to} 65 ₁₅ are supplied to the parallel / serial converter 67, respectively. The output terminal OUT is derived from the converter 67.

【００４６】ここで切換スイッチ６１₁〜６１₈は、補
数回路６２₁〜６２₈と共に、係数が０、＋１、−１だ
けの乗算器をそれぞれ構成し、スイッチ６４₁〜６
４₈、６６₁〜６６₈と共に、システム制御回路６８に
より互いに独立に切り換えられる。The change-over switch 61 _1-61 ₈ where, together with the complement circuit 62 ₁ to 62 _8, the coefficient is 0, + 1, -1 only multipliers constitute respectively, switch 64 _1-6
4 ₈ and 66 _{1 to} 66 ₈ are independently switched by the system control circuit 68.

【００４７】次に、図４〜図１１をも参照しながら、図
１の実施例の動作について説明する。Next, the operation of the embodiment shown in FIG. 1 will be described with reference to FIGS.

【００４８】図１の実施例においては、ＤＣＴのための
６４行６４列の定数行列〔Ｍ〕を次の数９式に示すよう
な６個の行列に分解している。In the embodiment of FIG. 1, the constant matrix [M] of 64 rows and 64 columns for DCT is decomposed into 6 matrices as shown in the following equation (9).

【００４９】[0049]

【数９】〔Ｍ〕＝〔Ｗ〕〔Ｖ〕〔ＴＳ〕〔Ｒ〕〔Ｌ〕〔Ｑ〕／８[M] = [W] [V] [TS] [R] [L] [Q] / 8

【００５０】行列〔Ｑ〕、〔Ｒ〕及び〔Ｗ〕が第１、第
２及び第３の並べ替え回路４１、４３及び４６にそれぞ
れ対応すると共に、行列〔Ｌ〕、〔ＴＳ〕及び〔Ｖ〕が
第１、第２及び第３の内積演算回路４２、４４及び４５
にそれぞれ対応する。行列〔Ｑ〕〜〔Ｗ〕は何れも６４
行６４列であり、図４〜図１１に示されるように、それ
ぞれ多数の０要素を含む疎行列（ＳｐａｒｓｅＭａｔ
ｒｉｘ）である。The matrices [Q], [R] and [W] correspond to the first, second and third rearrangement circuits 41, 43 and 46, respectively, and the matrices [L], [TS] and [V]. ] Is the first, second and third inner product arithmetic circuits 42, 44 and 45
Respectively correspond to. The matrices [Q] to [W] are all 64
As shown in FIGS. 4 to 11, each row has 64 columns, and each sparse matrix (Sparse Mat) includes a large number of 0 elements.
rix).

【００５１】なお、この図４〜図１１において、＋及び
−はそれぞれ＋１及び−１を表しており、他の行列を示
す後出各図においても同様である。In FIGS. 4 to 11, + and-represent +1 and -1, respectively, and the same applies to the subsequent figures showing other matrices.

【００５２】並べ替え回路４１では、図４に示されるよ
うに、行列〔Ｑ〕の各行各列とも、１か所だけが＋１
で、残りの６３個の要素は全て０であるから、６４ワー
ドの入力データ〔Ｘｃ〕の並べ替えが行われる。In the rearrangement circuit 41, as shown in FIG. 4, only one place is +1 for each row and each column of the matrix [Q].
Since the remaining 63 elements are all 0, the 64 words of input data [Xc] are rearranged.

【００５３】内積演算回路４２において、この並べ替え
られたデータ〔Ｑ〕〔Ｘｃ〕が、図５の行列〔Ｌ〕で表
されるような演算処理を受ける。同図に明らかなよう
に、この行列〔Ｌ〕は、＋１及び−１の要素のみで、同
形の４行４列の小行列が対角線上に１６個並び、他の部
分が全て０要素の疎行列であるから、図２に示したよう
な４次の内積演算回路５０で演算処理することができ
る。In the inner product calculation circuit 42, the rearranged data [Q] [Xc] is subjected to calculation processing as represented by the matrix [L] in FIG. As is clear from the figure, this matrix [L] has only +1 and -1 elements, and 16 small 4-matrix 4-matrixes of the same shape are lined up diagonally and all other parts are sparse with 0 elements. Since it is a matrix, it can be processed by the quadratic inner product calculating circuit 50 as shown in FIG.

【００５４】図２において、入力端子ＩＮから、６４ワ
ード単位のデータ〔Ｑ〕〔Ｘｃ〕が供給され、それぞれ
４個のデータが４個のラッチ５２₁、５２₂、５２₃、
５２ ₄に取り込まれ、４Ｔ時間にわたって保持される。In FIG. 2, from the input terminal IN, 64 words
Data [Q] [Xc] in units of code are supplied,
4 pieces of data 4 pieces of latch 52₁, 52₂, 52₃,
52 _FourAnd held for 4T hours.

【００５５】４個のスイッチ５３₁、５３₂、５３₃、
５３₄は、行列〔Ｌ〕の４行４列の小行列の要素が＋１
であるか−１であるかにより、＋側または−側に切り換
えられて、各ラッチ５２〜５２₄に保持されたデータに
＋１または−１の係数が乗算され、加算器５５で加算さ
れて、端子ＯＵＴから出力される。Four switches 53 ₁ , 53 ₂ , 53 ₃ ,
53 _4, the elements of four rows and four columns of submatrix of the matrix [L] is +1
By either -1 or is, the + side or - is switched to the side, the coefficient of +1 or -1 to the data held in the respective latches 52 to 52 ₄ are multiplied and summed by the adder 55, It is output from the terminal OUT.

【００５６】内積演算回路４２から出力された６４ワー
ドのデータ〔Ｌ〕〔Ｑ〕〔Ｘｃ〕は、第２の並べ替え回
路４３において、図６及び図７〜図１０に示す行列
〔Ｒ〕で表されるように並べ替えられる。The 64-word data [L] [Q] [Xc] output from the inner product calculating circuit 42 is converted into the matrix [R] shown in FIGS. 6 and 7 to 10 in the second rearrangement circuit 43. Sorted as represented.

【００５７】この並べ替えられたデータ〔Ｒ〕〔Ｌ〕
〔Ｑ〕〔Ｘｃ〕が、第２の内積演算回路４４において、
図１１、図１２の行列〔ＴＳ〕で表されるような演算処
理を受ける。同図に明らかなように、この行列〔ＴＳ〕
は、それぞれ０、＋１及び−１の要素のみであり、そし
て、各列の各奇数行の要素、又はその下隣りの偶数行の
要素の少なくともどちらか一方は０であるような、１６
行１６列の小行列が対角線上に４個並び、他の部分が全
て０要素の疎行列であるから、図３に示したような８次
の内積演算回路６０で演算処理することができる。This rearranged data [R] [L]
[Q] [Xc] is the second inner product calculation circuit 44
The arithmetic processing as represented by the matrix [TS] in FIGS. 11 and 12 is performed. As is clear from the figure, this matrix [TS]
Are 0, +1 and −1 respectively, and at least one of the elements in each odd row of each column or the elements in the even rows below it is 0, 16
Since four small matrixes each having 16 rows and 16 columns are arranged diagonally and all other parts are sparse matrices, the operation can be performed by the 8th-order inner product operation circuit 60 as shown in FIG.

【００５８】行列［ＴＳ］は１６×１６の対角線上の
小行列以外の部分は全て０であるから、１６次の内積演
算回路で計算できる。しかも、その小行列の要素は全て
０、±１のみであるから、入力されてくるデータ、その
値に対して２の補数器を介した値、及び０の３個の値を
選択した値を加算すれば良い。さらに、行列［ＴＳ］の
各１６×１６小行列をよくみると、各列の第０行目と第
１行目のどちらかは必ず０である。そして、各列の第２
行目と第３行目のどちらかは必ず０である。同様に、各
列の第２ｋ行目と第２ｋ＋１行目のどちらかは必ず０で
ある（ｋ＝２〜７）。従って実際には８個の加算器を使
用した図３の回路で構成できる。Since the matrix [TS] is all 0 except for the small matrix on the 16 × 16 diagonal line, it can be calculated by a 16th-order inner product calculation circuit. Moreover, since all the elements of the sub-matrix are 0 and ± 1, only the input data, the value through the 2's complementer for that value, and the value selected from the three values of 0 are selected. All you have to do is add. Further, if each 16 × 16 submatrix of the matrix [TS] is examined carefully, it is always 0 in either the 0th row or the 1st row of each column. And the second of each row
Either line 3 or line 3 is always 0. Similarly, either the 2k-th row or the 2k + 1-th row of each column is always 0 (k = 2 to 7). Therefore, in practice, the circuit of FIG. 3 using eight adders can be used.

【００５９】通常、１６次の内積演算回路には１６個の
加算器を必要とするが、上述のとおり、各列の第２ｋ行
目と第２ｋ＋１行目のどちらかは必ず０である（ｋ＝０
〜７）から、図３に示すように８個の加算器で計算する
ことが出来る。即ち、図３の１６個のラッチは、演算開
始時刻に図示省略したクリア信号により、０がセットさ
れる。Normally, a 16th-order inner product arithmetic circuit requires 16 adders, but as described above, either the 2k-th row or the 2k + 1-th row of each column is always 0 (k = 0
.About.7), it can be calculated by eight adders as shown in FIG. That is, the 16 latches in FIG. 3 are set to 0 by a clear signal (not shown) at the calculation start time.

【００６０】ラッチ６５₀には、［ＴＳ_ii］（ｉ＝１〜
４）の第０行目と入力ベクトル（ｉ＝１のときは、
［Ｒ］［Ｌ］［Ｑ］［Ｘｃ］の０〜１５番目の要素、ｉ
＝２のときは、［Ｒ］［Ｌ］［Ｑ］［Ｘｃ］の１６〜３
１番目の要素、ｉ＝３のときは、［Ｒ］［Ｌ］［Ｑ］
［Ｘｃ］の３２〜４７番目の要素、ｉ＝４のときは、
［Ｒ］［Ｌ］［Ｑ］［Ｘｃ］の４８〜６３番目の要素）
との計算途中結果、及び計算結果が格納される。The latch 65 ₀ has [TS _ii ] (i = 1 to 1).
4) line 0 and the input vector (when i = 1,
The 0th to 15th elements of [R] [L] [Q] [Xc], i
= 2, 16 to 3 of [R] [L] [Q] [Xc]
When the first element, i = 3, [R] [L] [Q]
The 32nd to 47th elements of [Xc], when i = 4,
(48th to 63rd elements of [R] [L] [Q] [Xc])
The intermediate calculation result and the calculation result are stored.

【００６１】ラッチ６５₁には、［ＴＳ_ii］（ｉ＝１〜
４）の第１行目と入力ベクトルとの計算途中結果、及び
計算結果が格納される。セレクト１は、［ＴＳ_ii］（ｉ
＝１〜４）の第０行目または第１行目の０でないほうの
値が１のときは入力端子からの値を選択し、−１のとき
は入力端子からの値に対して２の補数器を介した値（−
１倍した値）を選択し、第０行目と第１行目がともに０
であるときには０を選択する。The latch 65 ₁ has [TS _ii ] (i = 1 to 1).
The intermediate calculation result of the first line of 4) and the input vector and the calculation result are stored. Select 1 is [TS _ii ] (i
= 1 to 4), the value from the input terminal is selected when the non-zero value in the 0th row or the 1st row is 1, and when it is -1, the value from the input terminal is 2 The value (-
(Value multiplied by 1) is selected, and both the 0th line and the 1st line are 0
If 0, 0 is selected.

【００６２】セレクト９は、第０行目が０でないときは
ラッチ６５₀側にし、第１行目が０でないときはラッチ
６５₁側にする。第０行目または第１行目のどちらか一
方は必ず０であるから、セレクト９は上記のように選択
することにより、ラッチ６５ ₀とラッチ６５₁の両方を
選択するという矛盾は起きない。また、第０行目と第１
行目が共に０であるときは、どちらを選択しても良い。
セレクト１、セレクト９の制御信号は、制御回路６８に
より制御される。Select 9 is selected when the 0th line is not 0.
Latch 65₀To the side and latch when the first row is not 0
65₁Turn it to the side. Either 0th line or 1st line
One is always 0, so select 9 is selected as above
The latch 65 ₀And latch 65₁Both of
There is no contradiction of choice. Also, the 0th line and the 1st line
If both lines are 0, either one may be selected.
Control signals for select 1 and select 9 are sent to the control circuit 68.
More controlled.

【００６３】例えば、ｊサイクル目（ｊ＝０〜１５）に
上述の入力ベクトルのｊ番目の要素が入力端子から入力
されてくる。この時刻において、図３においてどのよう
な計算が行われるかを以下で述べる。For example, at the j-th cycle (j = 0 to 15), the j-th element of the above-mentioned input vector is input from the input terminal. What kind of calculation is performed in FIG. 3 at this time will be described below.

【００６４】上述の［ＴＳ_ii］（ｉ＝１〜４）の第０行
第ｊ列目の要素が１である場合、ラッチ６５₀に格納さ
れている値（ｊ−１サイクル目までに計算された、第０
行目と入力されてくるデータとの計算途中結果）はスイ
ッチ６６₁を介して加算器６３₁に入力され、そして入
力ベクトルのｊ番目の要素もスイッチ６１₁を介して加
算器６３₁に入力される。そして加算器６３₁で、ｊ−
１サイクル目までに計算された、第０行目と入力されて
くるデータとの計算途中結果と、入力ベクトルのｊ番目
の要素の加算が行われ、この加算結果はスイッチ６４₁
を介してラッチ６５₀に格納される。When the element at the 0th row and the jth column of the above [TS _ii ] (i = 1 to 4) is 1, the value stored in the latch 65 ₀ (calculated by the j-1th cycle) The 0th
The intermediate calculation result of the row and the input data) is input to the adder 63 ₁ via the switch 66 ₁ , and the j-th element of the input vector is also input to the adder 63 ₁ via the switch 61 _1. To be done. Then, in the adder 63 ₁ , j−
The intermediate calculation result of the 0th row and the input data calculated up to the first cycle and the j-th element of the input vector are added, and the addition result is the switch 64 ₁
It is stored in the latch 65 ₀ via.

【００６５】上述の［ＴＳ_ii］（ｉ＝１〜４）の第０行
第ｊ列目の要素が−１である場合、ラッチ６５₀に格納
されている値（ｊ−１サイクル目までに計算された、第
０行目と入力されてくるデータとの計算途中結果）はス
イッチ６６₁を介して加算器６３₁に入力され、そして
入力ベクトルのｊ番目の要素に対して−１倍した値もス
イッチ６１₁を介して加算器６３₁に入力される。そし
て加算器６３₁で、ｊ−１サイクル目までに計算され
た、第０行目と入力されてくるデータとの計算途中結果
と、入力ベクトルのｊ番目の要素に対して−１倍した値
の加算が行われ、この加算結果はスイッチ６４₁を介し
てラッチ６５₀に格納される。When the element at the 0th row and the jth column of the above [TS _ii ] (i = 1 to 4) is -1, the value stored in the latch 65 ₀ (by the j-1th cycle) The calculated intermediate calculation result of the 0th row and the input data) is input to the adder 63 ₁ via the switch 66 ₁ and multiplied by -1 with respect to the j-th element of the input vector. The value is also input to the adder 63 ₁ via the switch 61 ₁ . Then, in the adder 63 ₁ , the intermediate calculation result of the 0th row and the input data calculated up to the j−1th cycle, and the value obtained by multiplying the jth element of the input vector by −1. Is performed and the result of this addition is stored in the latch 65 ₀ via the switch 64 ₁ .

【００６６】即ち、第０行第ｊ列目の要素が±１のとき
は、第０行目と入力されてくるデータとの計算途中結果
の更新（入力ベクトルのｊ番目の要素に対して±１倍し
た値の加算）が行われる。That is, when the element in the 0th row and the jth column is ± 1, the mid-calculation results of the 0th row and the input data are updated (± j for the jth element of the input vector). The addition of the value multiplied by 1) is performed.

【００６７】上述の［ＴＳ_ii］（ｉ＝１〜４）の第１行
第ｊ列目の要素が１である場合、ラッチ６５₁に格納さ
れている値（ｊ−１サイクル目までに計算された、第１
行目と入力されてくるデータとの計算途中結果）はスイ
ッチ６６₁を介して加算器６３₁に入力され、そして入
力ベクトルのｊ番目の要素もスイッチ６１₁を介して加
算器６３₁に入力される。そして加算器６３₁で、ｊ−
１サイクル目までに計算された、第１行目と入力されて
くるデータとの計算途中結果と、入力ベクトルのｊ番目
の要素の加算が行われ、この加算結果はスイッチ６４₁
を介してラッチ６５₁に格納される。When the element in the 1st row and the jth column of the above-mentioned [TS _ii ] (i = 1 to 4) is 1, the value stored in the latch 65 ₁ (calculated by the j-1th cycle) Was done, first
The intermediate calculation result of the row and the input data) is input to the adder 63 ₁ via the switch 66 ₁ , and the j-th element of the input vector is also input to the adder 63 ₁ via the switch 61 _1. To be done. Then, in the adder 63 ₁ , j−
The intermediate calculation result of the first row and the input data calculated up to the first cycle and the j-th element of the input vector are added, and the addition result is the switch 64 ₁
It is stored in the latch 65 ₁ via.

【００６８】上述の［ＴＳ_ii］（ｉ＝１〜４）の第１行
第ｊ列目の要素が−１である場合、ラッチ６５₁に格納
されている値（ｊ−１サイクル目までに計算された、第
１行目と入力されてくるデータとの計算途中結果）はス
イッチ６６₁を介して加算器６３₁に入力され、そして
入力ベクトルのｊ番目の要素に対して−１倍した値もス
イッチ６１₁を介して加算器６３₁に入力される。そし
て加算器６３₁で、ｊ−１サイクル目までに計算され
た、第１行目と入力されてくるデータとの計算途中結果
と、入力ベクトルのｊ番目の要素に対して−１倍した値
の加算が行われ、この加算結果はスイッチ６４₁を介し
てラッチ６５₁に格納される。When the element in the 1st row and the jth column of the above-mentioned [TS _ii ] (i = 1 to 4) is -1, the value stored in the latch 65 ₁ (by the j-1th cycle) The calculated intermediate result of the calculation of the first row and the input data) is input to the adder 63 ₁ via the switch 66 ₁ and multiplied by -1 with respect to the j-th element of the input vector. The value is also input to the adder 63 ₁ via the switch 61 ₁ . Then, in the adder 63 ₁ , intermediate calculation results of the first row and the input data calculated up to the j−1th cycle, and a value obtained by multiplying the jth element of the input vector by −1. Are added and the result of this addition is stored in the latch 65 ₁ via the switch 64 ₁ .

【００６９】即ち、第１行第ｊ列目の要素が±１のとき
は、第１行目と入力されてくるデータとの計算途中結果
の更新（入力ベクトルのｊ番目の要素に対して±１倍し
た値の加算）が行われる。That is, when the element in the 1st row and the jth column is ± 1, the mid-calculation results of the 1st row and the input data are updated (± for the jth element of the input vector). The addition of the value multiplied by 1) is performed.

【００７０】また、第０行第ｊ列目と第１行第ｊ列目の
要素が共に０であるときは、スイッチ６６₁でどちらか
のラッチが選択され、加算器６３₁を介してスイッチ６
４₁で選択されたラッチに再度格納されるが、このとき
スイッチ６１₁で０が選択され加算器６３₁に入力され
るので、加算器６３₁では実質、加算は行われない。つ
まり実質ラッチ６５₀、６５₁ともに、データの更新は
行われない。When the elements in the 0th row and the jth column and the 1st row and the jth column are both 0, one of the latches is selected by the switch 66 ₁ and is switched via the adder 63 _1. 6
4 ₁ but is again stored in the selected latched, so this time 0 the switch 61 ₁ is input to the selected adders 63 _1, the adder 63 _1, substantially, the addition is not performed. That is, the data is not updated in both the substantial latches 65 ₀ and 65 ₁ .

【００７１】このようにして、１６サイクル後には、ラ
ッチ６５₀には第０行目と入力されてくるデータとの計
算結果が格納され、ラッチ６５₁には第１行目と入力さ
れてくるデータとの計算結果が格納されていることにな
る。In this manner, after 16 cycles, the latch 65 ₀ stores the calculation result of the 0th row and the input data, and the latch 65 ₁ receives the 1st row. It means that the calculation result with the data is stored.

【００７２】同様に、ラッチ６５₂〜６５₁₅には、［Ｔ
Ｓ_ii］（ｉ＝１〜４）の第２〜１５行目と入力ベクトル
との計算途中結果が格納され、１６サイクル後には、計
算結果（第２〜１５行目と入力されてくるデータとの計
算結果）が格納されていることになる。Similarly, the latches 65 _{2 to} 65 ₁₅ have [T
S _ii ] (i = 1 to 4), the intermediate calculation results of the 2nd to 15th lines and the input vector are stored, and after 16 cycles, the calculation result (the 2nd to 15th lines and the input data) (Calculation result of) is stored.

【００７３】これら計算結果はパラレル／シリアル変換
器６７に入力され、第０行目と入力されてくるデータと
の計算結果、第１行目と入力されてくるデータとの計算
結果、．．．第１５行目と入力されてくるデータとの計
算結果の順にシリアル出力される。These calculation results are input to the parallel / serial converter 67, the calculation result of the 0th row and the input data, the calculation result of the 1st row and the input data ,. ．． It is serially output in the order of the calculation result of the 15th line and the input data.

【００７４】かくして、１６×１６行列［ＴＳ₁₁］との
計算、１６×１６行列［ＴＳ₂₂］との計算、１６×１６
行列［ＴＳ₃₃］との計算、１６×１６行列［ＴＳ₄₄］と
の計算が、それぞれ１６サイクルずつかけて行われる。
即ち、６４サイクルかけて行列［ＴＳ］との計算が行わ
れる。Thus, calculation with 16 × 16 matrix [TS ₁₁ ], calculation with 16 × 16 matrix [TS ₂₂ ], 16 × 16
The calculation with the matrix [TS ₃₃ ] and the calculation with the 16 × 16 matrix [TS ₄₄ ] are performed in 16 cycles each.
That is, the calculation with the matrix [TS] is performed over 64 cycles.

【００７５】内積演算回路４４から出力された６４ワー
ドのデータ〔ＴＳ〕〔Ｒ〕〔Ｌ〕〔Ｑ〕〔Ｘｃ〕は、更
に、第３の内積演算回路４５において、図１３、図１４
の行列〔Ｖ〕で表されるような演算処理を受ける。同図
に明らかなように、この行列〔Ｖ〕は、それぞれ４行４
列の小行列が対角線上に４個並び、他の部分が全て０要
素の疎行列であるから、前出図２４に示すような通常の
４次内積演算回路４５で演算処理することができる。The 64-word data [TS] [R] [L] [Q] [Xc] output from the inner product arithmetic circuit 44 is further processed by the third inner product arithmetic circuit 45 in FIGS.
[V] of the above matrix [V] is received. As is clear from the figure, this matrix [V] has 4 rows and 4 rows, respectively.
Since four sub-matrixes of columns are arranged diagonally and all other parts are sparse matrices with 0 elements, the arithmetic processing can be performed by the ordinary quadratic inner product arithmetic circuit 45 as shown in FIG.

【００７６】内積演算回路４５から出力された６４ワー
ドのデータ〔Ｖ〕〔ＴＳ〕〔Ｒ〕〔Ｌ〕〔Ｑ〕〔Ｘｃ〕
は、第３の並べ替え回路４６において、図１５及び図１
６〜図１９に示す行列〔Ｗ〕で表されるように並べ替え
られて、所望の出力データ〔Ｗ〕〔Ｖ〕〔ＴＳ〕〔Ｒ〕
〔Ｌ〕〔Ｑ〕〔Ｘｃ〕が得られる。64-word data [V] [TS] [R] [L] [Q] [Xc] output from the inner product calculating circuit 45.
In the third rearrangement circuit 46 shown in FIG.
6 to 19 are rearranged as represented by the matrix [W], and desired output data [W] [V] [TS] [R]
[L] [Q] [Xc] is obtained.

【００７７】なお、実際には、Actually,

【数１０】〔Ｙｃ〕＝〔Ｍ〕〔Ｘｃ〕＝〔Ｗ〕〔Ｖ〕〔ＴＳ〕〔Ｒ〕〔Ｌ〕〔Ｑ〕〔Ｘｃ〕／８であるから、この出力結果を８で割らなくてはいけない
が、これは値を３ビットシフトすればよく、回路的には
何も要らないので、図中では省略してある。[Equation 10] [Yc] = [M] [Xc] = [W] [V] [TS] [R] [L] [Q] [Xc] / 8 Since this output result is not divided by 8, However, this is not necessary in the figure, since it is sufficient to shift the value by 3 bits and no circuit is required.

【００７８】図１の実施例においては、各内積演算回路
４２、４４及び４５の演算処理を表す行列〔Ｌ〕、〔Ｔ
Ｓ〕及び〔Ｖ〕が何れも疎行列であるため、乗算回数を
少なくして、各内積演算回路を小規模にすることができ
る。In the embodiment shown in FIG. 1, matrices [L] and [T] representing the arithmetic processing of the inner product arithmetic circuits 42, 44 and 45, respectively.
Since S] and [V] are both sparse matrices, it is possible to reduce the number of multiplications and reduce the size of each inner product calculation circuit.

【００７９】また、内積演算回路４２については、行列
〔Ｌ〕の係数が０と＋１、−１だけであるため、また内
積演算回路４４については、行列〔ＴＳ〕の係数が０と
＋１、−１だけで各行に＋１又は−１の要素が２個並ぶ
ことがないため、図２及び図３に示すように、各乗算器
の構成を簡単にすることができると共に、内積演算時に
丸め誤差が発生することがない。Further, for the inner product arithmetic circuit 42, the coefficients of the matrix [L] are only 0, +1 and -1, and for the inner product arithmetic circuit 44, the coefficients of the matrix [TS] are 0, +1 and-. Since only one element does not form two +1 or -1 elements in each row, the configuration of each multiplier can be simplified as shown in FIGS. 2 and 3, and a rounding error occurs during inner product calculation. There is nothing to do.

【００８０】ここでは図２７に示すように、８行８列の
要素ｘ₁₁〜ｘ₈₈が列順に入力されて、８行８列の要素ｙ
₁₁〜ｙ₈₈が列順に出力される場合について述べたが、こ
れ以外の順番で入出力させる場合には、並べ替え回路４
１及び４６を別の適切な順番にデータを並べ替える並べ
替え回路で置き替える事により対応することができる。Here, as shown in FIG. 27, elements x _{11 to} x ₈₈ of 8 rows and 8 columns are input in the column order, and an element y of 8 rows and 8 columns is input.
_{The case where 11 to} y ₈₈ are output in the column order has been described, but when inputting and outputting in the other order, the rearrangement circuit 4
This can be dealt with by replacing 1 and 46 with a rearrangement circuit that rearranges data in another appropriate order.

【００８１】こうして上述の装置によれば、内積演算回
路の規模を小さくし、構成を簡単にすると共に、演算回
数を減少させて高速演算を可能にすることができるもの
である。As described above, according to the above-mentioned device, the scale of the inner product calculating circuit can be reduced, the structure can be simplified, and the number of calculations can be reduced to enable high-speed calculation.

【００８２】更に図２０は、本発明による逆離散コサイ
ン変換装置の一例の構成図である。この図において、入
力端子ＩＮから８行８列のデータが６４ワードの第４の
並べ替え回路７１を介して、４次の第４の内積演算回路
７２に供給される。この内積演算回路７２の出力が８次
の第５の内積演算回路７３に供給される。内積演算回路
７３の出力は、６４ワードの第５の並べ替え回路７４を
介して、４次の第６の内積演算回路７５に供給される。
内積演算回路７５の出力は６４ワードの第６の並べ替え
回路７６を介して、出力端子ＯＵＴに導出される。Further, FIG. 20 is a block diagram of an example of the inverse discrete cosine transform device according to the present invention. In this figure, data of 8 rows and 8 columns is supplied from an input terminal IN to a fourth inner product arithmetic circuit 72 of the fourth order via a fourth rearrangement circuit 71 of 64 words. The output of the inner product calculating circuit 72 is supplied to the eighth-order fifth inner product calculating circuit 73. The output of the inner product arithmetic circuit 73 is supplied to the fourth inner product arithmetic circuit 75 of the fourth order via the fifth rearrangement circuit 74 of 64 words.
The output of the inner product calculation circuit 75 is led to the output terminal OUT via the sixth rearrangement circuit 76 of 64 words.

【００８３】すなわち図２０の実施例においては、ＩＤ
ＣＴのための６４行６４列の定数行列〔ＩＭ〕を次の数
１０式に示すような６個の行列に分解している。That is, in the embodiment of FIG. 20, ID
A constant matrix [IM] of 64 rows and 64 columns for CT is decomposed into 6 matrices as shown in the following Expression 10.

【００８４】[0084]

【数１１】〔ＩＭ〕＝^t〔Ｑ〕〔Ｌ〕^t〔Ｒ〕^t〔ＴＳ〕^t〔Ｖ〕^t〔Ｗ〕／８[IM] = ^t [Q] [L] ^t [R] ^t [TS] ^t [V] ^t [W] / 8

【００８５】行列^t〔Ｗ〕、^t〔Ｒ〕及び^t〔Ｑ〕が第
４、第５及び第６の並べ替え回路７１、７４及び７６に
それぞれ対応すると共に、行列^t〔Ｖ〕、^t〔ＴＳ〕及
び〔Ｌ〕が第４、第５及び第６の内積演算回路７２、７
３及び７５にそれぞれ対応する。ここで上述の行列
〔Ｌ〕、〔ＴＳ〕及び〔Ｖ〕は、それらを形成する小行
列が何れも対角線上に配列されており、各転置行列
^t〔Ｌ〕、^t〔ＴＳ〕及び^t〔Ｖ〕も同様の形になるた
め、逆変換の場合にも、図１の実施例と同様の構成で対
応することができる。ちなみに〔Ｌ〕＝^t〔Ｌ〕であ
る。The matrices ^t [W], ^t [R] and ^t [Q] respectively correspond to the fourth, fifth and sixth rearrangement circuits 71, 74 and 76, and the matrices ^t [V], ^t [V], ^t [V]. TS] and [L] are the fourth, fifth and sixth inner product arithmetic circuits 72, 7
3 and 75, respectively. Here, in the above-mentioned matrices [L], [TS], and [V], the small matrices forming them are all arranged on a diagonal line, and each transposed matrix
^{Since t} [L], ^t [TS], and ^t [V] have the same shape, the same structure as that of the embodiment of FIG. By the way, [L] = ^t [L].

【００８６】ただし、^t〔ＴＳ〕の演算を行う内積演算
回路７３は、^t〔ＴＳ〕が各行の奇数列とその右隣りの
偶数列の要素のどちらか一方は少なくとも０であるか
ら、例えば図２１の構成で演算を行う。[0086] However, the inner product computation circuit 73 for performing an operation of ^t [TS], since ^t [TS] is at least 0 either the even column elements of adjacent its right an odd column of each row, for example, FIG. Calculation is performed with the configuration of 21.

【００８７】図２１において、６０′は８次の内積演算
回路であって、図２０の内積演算回路７３に相当し、１
５個の単位遅延器６１′₁、６１′₂ 〜６１′₁₅が逆順
に縦続接続されて、その出力端、各接続中点及び入力端
に１６個のラッチ６２′₁、６２′₂ 〜６２′₁₆がそれ
ぞれ接続され、各１対のラッチ６２′₁、６２′₂ ；６
２′₃、６２′₄・・・６２′₁₅、６２′₁₆の出力が８
個の切換スイッチ６３′₁、６３′₂・・・６３′₈の
各１対の接点に供給される。スイッチ６３′₁〜６３′
₈の各出力が、８個の切換スイッチ６４′₁〜６４′₈
の各＋側接点に供給されると共に、８個の補数回路６
５′₁〜６５′₈を介して、スイッチ６４′₁〜６４′
₈の各−側接点に供給される。スイッチ６４′₁〜６
４′₈の第３の接点には係数０がそれぞれ供給され、ス
イッチ６４′₁〜６４′₈の各出力が加算器６６′に供
給される。In FIG. 21, reference numeral 60 'denotes an eighth-order inner product arithmetic circuit, which corresponds to the inner product arithmetic circuit 73 of FIG.
Five unit delay devices 61 ′ ₁ , 61 ′ _{2 to} 61 ′ ₁₅ are cascade-connected in reverse order, and 16 latches 62 ′ ₁ , 62 ′ _{2 to} 62 are provided at the output end, each connection midpoint and the input end. ′ ₁₆ are respectively connected to each pair of latches 62 ′ ₁ , 62 ′ ₂ ; 6
2 _'3, 62' _4, ... 62 _'15, 62' ₁₆ output is 8
The changeover switches 63 ′ ₁ , 63 ′ _2, ... 63 ′ ₈ are supplied to each pair of contacts. Switch 63 ' _{1 to} 63'
Each output of the _8, 8 of the change-over switch 64 _'1-64' ₈
Is supplied to each + side contact of the
5 through _'1-65' _8, the switch 64 _'1-64'
It is supplied to each negative side contact of ₈ . Switch 64 ' _{1 to} 6
4 'in the third contacts ₈ are supplied coefficient 0, respectively, the switch 64' _1-64 'each output of ₈ adder 66' is supplied to the.

【００８８】切換スイッチ６４′₁〜６４′₈は、補数
回路６５′₁〜６５′₈と共に、係数が０、＋１、−１
だけの乗算器をそれぞれ構成し、スイッチ６３′₁〜６
３′ ₈と共に、システム制御回路６７′により互いに独
立に切り換えられる。Changeover switch 64 '₁~ 64 '₈Is the complement
Circuit 65 '₁~ 65 '₈With the coefficients 0, +1, -1
And a switch 63 '.₁~ 6
3 ' ₈Together with the system control circuit 67 ',
It can be switched to vertical.

【００８９】従ってこの図２１において、入力端子ＩＮ
から、６４ワード単位のデータ〔ＴＳ〕〔Ｒ〕〔Ｌ〕
〔Ｑ〕〔Ｘｃ〕が供給され、それぞれ１６個のデータが
１６個のラッチ６２′₁〜６２′₁₆に取り込まれ、１６
Ｔ時間にわたって保持される。Therefore, in FIG. 21, the input terminal IN
From 64 words in units of data [TS] [R] [L]
[Q] [Xc] is supplied, and 16 pieces of data are taken into ₁₆ latches 62 ′ _{1 to} 62 ′ ₁₆ ,
Hold for T hours.

【００９０】８個のスイッチ６３′₁〜６３′₈は、行
列〔ＴＳ〕の１６行１６列の小行列の各々隣り合う要素
のどちらかが０であるかにより、０でない側に切り換え
られて、各ラッチ６２′₁〜６２′₁₆に保持されたデー
タのうち８個のデータが取り出される。ただし各々隣り
合う要素が両方とも０である場合、対応する各スイッチ
６３′₁〜６３′₈は、どちらの側に切り換えられても
良い。The eight switches 63 ′ _{1 to} 63 ′ ₈ are switched to the non-zero side depending on which one of the adjacent elements of the 16 × 16 submatrix of the matrix [TS] is 0. , 8 out of the data held in each of the latches 62 ' _{1 to} 62' ₁₆ . However, when the elements adjacent to each other are both 0, the corresponding switches 63 ′ _{1 to} 63 ′ ₈ may be switched to either side.

【００９１】８個のスイッチ６４′₁〜６４′₈は、上
記取り出された８個のデータに対応する１６行１６列の
小行列の要素が０、＋１または−１のいずれかであるか
により、０側、＋側または−側に切り換えられて、上記
取り出された８個のデータに０、＋１または−１の係数
が乗算され、加算器６６′で加算されて、端子ＯＵＴか
ら出力される。The eight switches 64 ' ₁ to 64' ₈ depend on whether the element of the 16-row by 16-column sub-matrix corresponding to the extracted eight data is 0, +1 or -1. , 0 side, + side or − side, and the eight data thus taken out are multiplied by the coefficient of 0, +1 or −1, added by the adder 66 ′ and output from the terminal OUT. ..

【００９２】また図２２は、本発明による離散コサイン
変換装置において、上述の第１、第２及び第３の内積演
算回路４２、４４及び４５をそれぞれ並列化することに
よって動作速度の高速化を図ったものである。この図に
おいて、並べ替え回路４１で〔Ｑ〕の演算を行った後、
この出力〔Ｑ〕〔Ｘｃ〕に対して、シリアルに出力され
てくる４つのデータを１組として変換回路８１でパラレ
ル化して、４入力加算回路４２′₁〜４２′₄に入力す
る。Further, FIG. 22 shows a discrete cosine transform device according to the present invention in which the above-mentioned first, second and third inner product arithmetic circuits 42, 44 and 45 are respectively arranged in parallel to increase the operating speed. It is a thing. In this figure, after performing the operation of [Q] in the rearrangement circuit 41,
With respect to this output [Q] [Xc], four sets of serially output data are parallelized by the conversion circuit 81 and input to the four-input addition circuits 42 ′ _{1 to} 42 ′ ₄ .

【００９３】この４入力加算回路４２′₁では〔Ｌ〕の
第１行目、第５行目、第９行目・・・第６１行目を演算
し、４入力加算回路４２′₂では〔Ｌ〕の第２行目、第
６行目、第１０行目・・・第６２行目を演算し、４入力
加算回路４２′₃では〔Ｌ〕の第３行目、第７行目、第
１１行目・・・第６３行目を演算し、４入力加算回路４
２′₄では〔Ｌ〕の第４行目、第８行目、第１２行目・
・・第６４行目を演算する。[0093] 'first row of the ₁ [L], the fifth row, the ninth row ... second line 61 is calculated, 4 input adder circuit 42' This four-input adder circuit 42, ₂ [ The second line, the sixth line, the tenth line ... The 62nd line of L] are calculated, and in the 4-input addition circuit 42 ′ ₃ , the 3rd line, the 7th line of [L], 11th line: 4th-input addition circuit 4 by calculating the 63rd line
The fourth row of the 2 _'4 [L], line 8, line 12 -
.... Calculate the 64th line.

【００９４】演算〔Ｒ〕は、単なる並べ替えであるが、
それは〔Ｌ〕〔Ｑ〕〔Ｘｃ〕の第１行目、第５行目、第
９行目・・・第６１行目の１６個のデータと、第２行
目、第６行目、第１０行目・・・第６２行目の１６個の
データと、第３行目、第７行目、第１１行目・・・第６
３行目の１６個のデータと、第４行目、第８行目、第１
２行目・・・第６４行目の１６個のデータとの４つの組
に分けて、続く演算〔ＴＳ〕の４つの小行列である〔Ｔ
Ｓ₁₁〕〔ＴＳ₂₂〕〔ＴＳ₃₃〕〔ＴＳ₄₄〕の演算を可能と
させる為のものである。The operation [R] is a simple rearrangement,
It is 16 data of the 1st line, the 5th line, the 9th line ... 61st line of [L] [Q] [Xc] and the 2nd line, 6th line, 16th data in 10th line ... 62nd line, 3rd line, 7th line, 11th line ... 6th line
16 pieces of data on the 3rd line, 4th line, 8th line, 1st line
2nd line ... 4 sub-matrices of the operation [TS] divided into 4 groups with 16 data in the 64th line [T
This is for enabling the calculation of S ₁₁ ] [TS ₂₂ ] [TS ₃₃ ] [TS ₄₄ ].

【００９５】従って図２２の回路の場合、４入力加算回
路４２′₁から〔Ｌ〕〔Ｑ〕〔Ｘｃ〕の第１行目の値、
第５行目の値、第９行目の値・・・第６１行目の値が出
力され、４入力加算回路４２′₂から〔Ｌ〕〔Ｑ〕〔Ｘ
ｃ〕の第２行目の値、第６行目の値、第１０行目の値・
・・第６２行目の値が出力され、４入力加算回路４２′
₃から〔Ｌ〕〔Ｑ〕〔Ｘｃ〕の第３行目の値、第７行目
の値、第１１行目の値・・・第６３行目の値が出力さ
れ、４入力加算回路４２′₄から〔Ｌ〕〔Ｑ〕〔Ｘｃ〕
の第４行目の値、第８行目の値、第１２行目の値・・・
第６４行目の値が出力されるので、演算〔Ｒ〕を行う回
路は必要なく、そのまま４入力加算回路４２′₁〜４
２′₄の出力を、それぞれ内積演算回路４４₁〜４４₄
に入力させればよい。Therefore, in the case of the circuit of FIG. 22, the values of the first row of [L] [Q] [Xc] from the 4-input adding circuit 42 ' ₁
The value of the fifth line, the value of the ninth line ... The value of the 61st line is output, and the 4-input adder circuit 42 ′ ₂ outputs [L] [Q] [X.
c] second row value, sixth row value, tenth row value
..Values in the 62nd row are output and 4-input addition circuit 42 '
The values of ₃ to [L] [Q] [Xc] on the third line, the value on the seventh line, the value on the eleventh line ... The value on the 63rd line are output, and the 4-input addition circuit 42 is output. ' ₄ to [L] [Q] [Xc]
The value of the 4th line, the value of the 8th line, the value of the 12th line ...
Since the value on the 64th line is output, a circuit for performing the operation [R] is not necessary, and the 4-input addition circuits 42 ' ₁ to 4'
The output of the 2 _'4, respectively inner product computation circuits 44 ₁ to 44 ₄
You can input to.

【００９６】すなわち内積演算回路４４₁では〔Ｌ〕
〔Ｑ〕〔Ｘｃ〕の第１行目の値、第５行目の値、第９行
目の値・・・第６１行目の値を使用して、〔ＴＳ₁₁〕の
演算を行う。内積演算回路４４₂、４４₃、４４₄でも
同様にして、それぞれ〔ＴＳ₂₂〕〔ＴＳ₃₃〕〔ＴＳ₄₄〕
の演算を行う。That is, in the inner product calculation circuit 44 ₁ , [L]
The value of [Q] [Xc] on the first line, the value on the fifth line, the value on the ninth line ... The value on the 61st line is used to calculate [TS ₁₁ ]. Similarly, the inner product arithmetic circuits 44 ₂ , 44 ₃ , and 44 ₄ respectively perform [TS ₂₂ ] [TS ₃₃ ] [TS ₄₄ ].
Is calculated.

【００９７】さらに内積演算回路４４₁の出力を内積演
算回路４５₁に入力する事で、内積演算回路４５₁では
〔Ｖ₁₁〕の演算を行う。内積演算回路４５₂、４５₃、
４５ ₄でも同様にして、それぞれ〔Ｖ₂₂〕〔Ｖ₃₃〕〔Ｖ
₄₄〕の演算を行う。Further, the inner product calculating circuit 44₁Inner product output
Arithmetic circuit 45₁Input to the inner product arithmetic circuit 45₁Then
[V₁₁] Is performed. Inner product calculation circuit 45₂, 45₃,
45 _FourBut in the same way, [V_{twenty two}] [V₃₃] [V
₄₄] Is performed.

【００９８】かくして、内積演算回路４５₁〜４５₄の
出力からは〔Ｖ〕〔ＴＳ〕〔Ｒ〕〔Ｌ〕〔Ｑ〕〔Ｘｃ〕
が出力されるので、これを変換回路８２でシリアル化し
て、そして最後に〔Ｗ〕を並べ替え回路４６で行えば、
出力〔Ｙｃ〕が求まる。Thus, from the outputs of the inner product arithmetic circuits 45 _{1 to} 45 ₄ , [V] [TS] [R] [L] [Q] [Xc].
Is output, so if this is serialized by the conversion circuit 82 and finally [W] is performed by the rearrangement circuit 46,
The output [Yc] is obtained.

【００９９】さらに図２３は、本発明による逆離散コサ
イン変換装置において、上述の第４、第５及び第６の内
積演算回路７２、７３及び７５をそれぞれ並列化するこ
とによって動作速度の高速化を図ったものである。この
図において、並べ替え回路７１で^t〔Ｗ〕の演算を行っ
た後、この出力^t〔Ｗ〕〔Ｙｃ〕に対して、シリアルに
出力されてくる４つのデータを１組として変換回路９１
でパラレル化して、内積演算回路７２₁〜７２₄に入力
する。Further, FIG. 23 shows that in the inverse discrete cosine transform device according to the present invention, the fourth, fifth and sixth inner product arithmetic circuits 72, 73 and 75 are parallelized to increase the operating speed. It is intended. In this figure, after the calculation of ^t [W] is performed by the rearrangement circuit 71, four data that are serially output for this output ^t [W] [Yc] are set as one set and the conversion circuit 91 is set.
In and parallelized, and inputs to the inner product computation circuit 72 ₁ to 72 _4.

【０１００】この内積演算回路７２₁〜７２₄では
^t〔Ｖ₁₁〕〜^t〔Ｖ₄₄〕の演算を行う。この内積演算回
路７２₁〜７２₄の出力を、それぞれ内積演算回路７３
₁〜７３ ₄に入力する。この内積演算回路７３₁〜７３
₄では^t〔ＴＳ₁₁〕〜^t〔ＴＳ₄₄〕の演算を行う。This inner product calculation circuit 72₁~ 72_FourThen
^t[V₁₁] ~^t[V₄₄] Is performed. This inner product operation time
Road 72₁~ 72_FourOutput of the inner product calculation circuit 73
₁~ 73 _FourTo enter. This inner product calculation circuit 73₁~ 73
_FourThen^t[TS₁₁] ~^t[TS₄₄] Is performed.

【０１０１】この内積演算回路７３₁〜７３₄の出力
を、そのまま４入力加算回路７５′₁〜７５′₄に入力
させる。この４入力加算回路７５′₁〜７５′₄では
〔Ｌ₁₁〕〜〔Ｌ₄₄〕の演算を行う。かくして、４入力加
算回路７５′₁〜７５′₄の出力からは〔Ｌ〕^t〔Ｒ〕
^t〔ＴＳ〕^t〔Ｖ〕^t〔Ｗ〕〔Ｙｃ〕が出力されるの
で、これを変換回路９２でシリアル化して、そして最後
に^t〔Ｑ〕を並べ替え回路７６で行えば、出力〔Ｘｃ〕
が求まる。The outputs of the inner product arithmetic circuits 73 _{1 to} 73 ₄ are input to the 4-input adder circuits 75 ' ₁ to 75' ₄ as they are. In the 4-input adder circuit 75 _'1-75' ₄ performs calculation of [L _11] - [L _44]. Thus, from the output of 4-input adder circuit 75 _'1-75' ₄ [L] ^t [R]
^{Since t} [TS] ^t [V] ^t [W] [Yc] is output, if this is serialized by the conversion circuit 92, and finally ^t [Q] is performed by the rearrangement circuit 76, the output [Xc ]
Is required.

【０１０２】[0102]

【発明の効果】以上詳述のように、この発明によれば、
所要の定数行列を複数の疎行列に分解して、一方の疎行
列の要素を０、＋１及び−１とすると共に、他方の低次
の疎行列の要素を定数行列のデータ成分とするようにし
たので、内積演算回路の回路規模が小さく、構成が簡単
になると共に、演算回数が低減して演算速度が向上した
離散コサイン変換装置及び逆離散コサイン変換装置が得
られる。As described above in detail, according to the present invention,
The required constant matrix is decomposed into a plurality of sparse matrices, and the elements of one sparse matrix are 0, +1 and -1, and the elements of the other low-order sparse matrix are the data components of the constant matrix. Therefore, the circuit scale of the inner product arithmetic circuit is small, the configuration is simple, and the discrete cosine transform device and the inverse discrete cosine transform device in which the number of calculations is reduced and the calculation speed is improved can be obtained.

[Brief description of drawings]

【図１】本発明による離散コサイン変換装置の一例の構
成図である。FIG. 1 is a configuration diagram of an example of a discrete cosine transform device according to the present invention.

【図２】その要部の構成を示すブロック図である。FIG. 2 is a block diagram showing a configuration of a main part thereof.

【図３】その要部の構成を示すブロック図である。FIG. 3 is a block diagram showing a configuration of a main part thereof.

【図４】その要部の動作を説明するための行列を示す図
である。FIG. 4 is a diagram showing a matrix for explaining the operation of the main part thereof.

【図５】その要部の動作を説明するための行列を示す図
である。FIG. 5 is a diagram showing a matrix for explaining the operation of the main part thereof.

【図６】その要部の動作を説明するための行列を示す図
である。FIG. 6 is a diagram showing a matrix for explaining the operation of the main part.

【図７】その要部の動作を説明するための行列を示す図
である。FIG. 7 is a diagram showing a matrix for explaining the operation of the main part.

【図８】その要部の動作を説明するための行列を示す図
である。FIG. 8 is a diagram showing a matrix for explaining the operation of the main part thereof.

【図９】その要部の動作を説明するための行列を示す図
である。FIG. 9 is a diagram showing a matrix for explaining the operation of the main part.

【図１０】その要部の動作を説明するための行列を示す
図である。FIG. 10 is a diagram showing a matrix for explaining the operation of the main part.

【図１１】その要部の動作を説明するための行列を示す
図である。FIG. 11 is a diagram showing a matrix for explaining the operation of the main part.

【図１２】その要部の動作を説明するための行列を示す
図である。FIG. 12 is a diagram showing a matrix for explaining the operation of the main part.

【図１３】その要部の動作を説明するための行列を示す
図である。FIG. 13 is a diagram showing a matrix for explaining the operation of the main part.

【図１４】その要部の動作を説明するための行列を示す
図である。FIG. 14 is a diagram showing a matrix for explaining the operation of the main part.

【図１５】その要部の動作を説明するための行列を示す
図である。FIG. 15 is a diagram showing a matrix for explaining the operation of the main part.

【図１６】その要部の動作を説明するための行列を示す
図である。FIG. 16 is a diagram showing a matrix for explaining the operation of the main part.

【図１７】その要部の動作を説明するための行列を示す
図である。FIG. 17 is a diagram showing a matrix for explaining the operation of the main part.

【図１８】その要部の動作を説明するための行列を示す
図である。FIG. 18 is a diagram showing a matrix for explaining the operation of the main part.

【図１９】その要部の動作を説明するための行列を示す
図である。FIG. 19 is a diagram showing a matrix for explaining the operation of the main part.

【図２０】本発明による逆離散コサイン変換装置の一例
の構成図である。FIG. 20 is a configuration diagram of an example of an inverse discrete cosine transform device according to the present invention.

【図２１】その要部の構成を示すブロック図である。FIG. 21 is a block diagram showing a configuration of a main part thereof.

【図２２】本発明による離散コサイン変換装置の他の例
の構成図である。FIG. 22 is a configuration diagram of another example of the discrete cosine transform device according to the present invention.

【図２３】本発明による逆離散コサイン変換装置の他の
例の構成図である。FIG. 23 is a configuration diagram of another example of the inverse discrete cosine transform device according to the present invention.

【図２４】従来の装置の構成を示すブロック図である。FIG. 24 is a block diagram showing a configuration of a conventional device.

【図２５】従来例の装置の動作を説明するためのタイム
チャート図である。FIG. 25 is a time chart diagram for explaining the operation of the conventional device.

【図２６】この発明の説明のための図である。FIG. 26 is a diagram for explaining the present invention.

【図２７】この発明の説明のための図である。FIG. 27 is a diagram for explaining the present invention.

[Explanation of symbols]

４１第１の並べ替え回路４２係数が＋１及び−１である４次の第１の内積演算
回路４３第２の並べ替え回路４４係数が０、＋１及び−１である８次の第２の内積
演算回路４５定数行列のデータ成分が格納されたメモリを含む
第３の内積演算回路４６第３の並べ替え回路41 1st rearrangement circuit 42 4th order inner product arithmetic circuit whose coefficients are +1 and -1 43 2nd rearrangement circuit 44 8th order 2nd inner product whose coefficients are 0, +1 and -1 Arithmetic circuit 45 Third inner product arithmetic circuit including memory storing data components of constant matrix 46 Third rearrangement circuit

Claims

[Claims]

1. An inner product calculation circuit for calculating an inner product of a matrix,
In a discrete cosine transform device including a rearrangement circuit for rearranging data components of a matrix in a predetermined order, a fourth-order first inner product arithmetic circuit having coefficients of +1 and -1, and a coefficient of 0, +1 and -1 The second inner product arithmetic circuit of the 8th order and the third inner product arithmetic circuit including the memory in which the data components of the constant matrix are stored are provided, and the input data of 8 rows and 8 columns is converted into the first rearrangement circuit. To the first inner product arithmetic circuit, and to supply the output of the first inner product arithmetic circuit to the second inner product arithmetic circuit via the second rearrangement circuit. Is directly supplied to the third inner product arithmetic circuit and the output of the third inner product arithmetic circuit is derived via a third rearrangement circuit. ..

2. An inner product calculating circuit for calculating an inner product of a matrix,
In an inverse discrete cosine transform device including a rearrangement circuit for rearranging data components of a matrix in a predetermined order, a fourth inner product arithmetic circuit including a memory in which data components of a constant matrix are stored, and coefficients of 0, +1 and An eighth-order fifth inner product arithmetic circuit which is −1 and a fourth-order sixth inner product arithmetic circuit whose coefficients are +1 and −1 are provided, and the input data of 8 rows and 8 columns is fourth sorted circuit. To the fourth inner product arithmetic circuit, the output of the fourth inner product arithmetic circuit is directly supplied to the fifth inner product arithmetic circuit, and the output of the fifth inner product arithmetic circuit is supplied to the fifth inner product arithmetic circuit. The inverse discrete cosine is characterized in that the sixth inner product arithmetic circuit is supplied through the rearrangement circuit and the output of the sixth inner product arithmetic circuit is derived through the sixth rearrangement circuit. Converter.

3. An inner product calculating circuit for calculating an inner product of a matrix,
In a discrete cosine transform device including a rearrangement circuit that rearranges the data components of a matrix in a predetermined order, a parallelization circuit that parallelizes matrix data that is serially supplied for each predetermined number, and a coefficient of +1 and -1 A certain fourth-order inner product arithmetic circuit, an eighth-order second inner product arithmetic circuit whose coefficients are 0, +1 and -1, and a third inner-product arithmetic operation including a memory in which data components of a constant matrix are stored. A circuit, and the predetermined number of the first, second, and third inner product arithmetic circuits are arranged in parallel, and input data of 8 rows and 8 columns is input to the parallelization circuit through the first rearrangement circuit. Each data of the parallel data supplied and output from the parallelization circuit is supplied to each of the predetermined number of the first inner product arithmetic circuits, and the output of each of the first inner product arithmetic circuits is directly input to the predetermined number of the predetermined inner product arithmetic circuits. Corresponding to the second inner product arithmetic circuit in The output of each of the second inner product arithmetic circuits is directly supplied to the corresponding third inner product arithmetic circuit of the predetermined number, and the output of the predetermined third inner product arithmetic circuits is serialized. A discrete cosine transform device characterized in that it is converted into data and then derived through a third rearrangement circuit.

4. An inner product calculating circuit for calculating an inner product of a matrix,
In an inverse discrete cosine transform device including a rearrangement circuit that rearranges the matrix data components in a predetermined order, a parallelization circuit that parallelizes the matrix data that is serially supplied for each predetermined number, and a constant matrix data component A fourth inner product arithmetic circuit including a stored memory; an eighth-order fifth inner product arithmetic circuit whose coefficients are 0, +1 and -1; and a fourth-order sixth inner product arithmetic coefficient whose coefficients are +1 and -1 An arithmetic circuit, the fourth, fifth, and sixth inner product arithmetic circuits are arranged in parallel, and the input data of 8 rows and 8 columns is input to the parallelizing circuit via a fourth rearrangement circuit. And each data of the parallel data output from the parallelization circuit is supplied to the corresponding fourth inner product calculation circuit in the predetermined number, and the output of each fourth inner product calculation circuit is directly output. The fifth inner product corresponding to the predetermined number To the corresponding sixth inner product arithmetic circuit among the predetermined number, and at the same time to supply the output of each of the fifth inner product arithmetic circuits to the corresponding sixth inner product arithmetic circuit. An inverse discrete cosine transform device characterized in that the output is converted into serial data and then derived through a sixth rearrangement circuit.