JPS6151269A

JPS6151269A - Data processing device

Info

Publication number: JPS6151269A
Application number: JP17242384A
Authority: JP
Inventors: Katsuaki Owada; 大和田　克明
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1984-08-21
Filing date: 1984-08-21
Publication date: 1986-03-13

Abstract

PURPOSE:To set a bit reverse reading processing part to a vector and to process data with high speed which needs to replace data by installing a bit replacing means and a shift means of the vector data. CONSTITUTION:A register (REG)A is a register which stores data d0, d1...dm-1 of an (m) bit composition read from a vector register by a data line 300. An inverting circuit BRV is an inverting data output part which outputs one lower level bit o REGA, namely, data bm-1,...bm-l+1, bm-l by inverting a bit storing posi tion from dm-1 to dm-l. Composite data of a m bit from data b0 to bm-1 becomes input data of a shifter SFT and 0 or 1 of a constant C is forcibly inputted from b0 to bm-l+1. These data are shifted from right to left by the shifter SFT, by indication of a shift mode MOD, (m) bit data up to data k0-Km-1 are stored in REGB, and are sent through a data line 600 to the vector register.

Description

[Detailed description of the invention]

〔産業上の利用分野〕本発明はベクトルデータを処理するデータ処理装置に関
する。〔従来の技術〕コンピュータの応用分野が拡大するにつれて。コンピュータの高速処理・への要請は際限なく広がりつ
つある。スーパーコンピュータは、科学技術分野でのこ
のような要請に応えるものとして、近年特に開発が活発
化している。スーパーコンピュータは一般には膨大なベ
クトルデータをパイプライン処理するベクトル処理装置
な持つことにより高速化を実現しているものが多い。ところで、科学技術計算の１つに高速フーリエ変換（Ｆ
ＦＴ）がある。ＦＦＴは、偏微分方程式の境界値問題の
解析、音声信号や地震波、脳、１　　　　　　　波・経
済変動データなど０各種の信号の７゛クトル解析等、理
学、工学９社会科学などの分野に適用され、最近の高速
ベクトル処理装置の出現に伴い、それに滴１．たＦＦＴ
のアルゴリズムの研究も盛んになってきている。ＦＦＴ
アルゴリズムは、１’９６５年にＣ！ｏｏｌｅｙとＴｕ
ｋｅｙによりＦＦＴに関する論文”　Ａｎ　Ａｌｇｏｒ
ｉｔｈｍ　ｆｏｒｔｈｅ　Ｍａｃｈｉｌ、ｅ　Ｃａ１ｃ
ｕｌａｔｉｏｎ　ｏｆ　Ｉ：！ｏｍｐｌｅｘ　Ｆｏｕｒ
−ｉｅｒ　５ｅｒｉｅｓ　’、　Ｍａｔｈ、　Ｃ！ｏｍ
ｐｕｔ、、　１９＋　２９７−３０１（１９６５’）が
発表されて以来種々のアルゴリズムが提案されてきた。このようなＦＦＴのアルゴリズムの１つの特長は、変換
処理においてデータのビット連続み（ビットリバース）
が必要となるケースが出現することである。例えば、入
力データｆ　（ｉ）を高速フーリエ変換部９００により
フーリエ変換した結果をＦ　（ｉ）とすると、第２図（
ａ）に示す様（ミ８個のデータｆ（ｉ）　（ｉ　＝　０
〜７）に対し入力データｆ（ｉ）はそのままでよいが出
力時にはビットリバースが必要となるケースがある。更
には第２６　（ｂ）に示す如く、入力データｆ（１）を
ビットリバースすることにより、出力時にはＦ（１）は
そのままでよいケース等がある。以下倉口〔発明が解決しようとする問題点〕第３図は従来のＦＦＴに使用されるプログラムの一例を
示すもので、プログラム上ステップＡに相当する部分が
いわゆる入力データのビット連続みな行なっている部分
である。このようなビット連続みによるメモリデータの
並べ換えを行なう場合、一般に対象とするデータの要素
数及びデータ幅からメモリデータに対するメモリアドレ
ス生成時にビット連続み処理及びシフト処理が必要とな
る。従来、これらの処理部分はベクトル化が困難とされ
、従って、スカラ演算で行なっていた。このため、入力
データ量が多い場合はこれらの処理に要する時間が大き
く。ＦＦＴの実行性能を著しく低下させるという欠点があっ
た。本発明の目的は、ベクトルデータのビット入換え手段と
シフト手段を備えることにより上記欠点を除去してビッ
ト連続み処理部分のベクトル化を可能とし、データ入換
え処理を必要とするデータ処理の高速化を達成したデー
タ処理装置を提供することにある。〔問題点を解決するための手段〕本発明によれば、複数の要素から成るベクトルデータを
保持する複数個のベクトルレジスタを具備し、該ペルト
ルレジスタから順次取り出したベクトルデータを処理す
るデータ処理装置において、前記ベクトルレジスタの１
つから順次取り出したｍビット（ｍ≧２）構成のベクト
ル要素データに対し、該デー、夕のうち１ビット（１≦
ｍ）を抽出し、ビット格納位置を反転して出力する反転
手段と、残り（ｍ−１）ビットに対しては予め定めた定
数を出力する定数出力手段と、該定数出力手段の出力定
数データと前記反転手段により生成されたデータとの合
成データに対するシフト動作を規定するシフトモードを
保持し、かつシフト数を保持する保持手段と、該保持手
段により前記合成データをシフトするシフト動作手段と
を有することを特徴とするデータ処理装置が得られる。以下余白〔実施例〕次に本発明の実施例について図面を参照して説明する。第１図は１本発明の第１の実施例によるデータ処理装置
を示した図である。第１因において。１０及び２０はそれぞれ複数の要素から成るベクトルデ
ータを保持するベクトルレジスタ、３０は該ベクトルレ
ジスタ１０及び２０の一方からの続出データを選択する
選択回路、４０は該ベクトルレジスタ１０及び２０の一
方への書込みデータを選択する選択回路、５０は命令を
解読し、ベクトルレジスタへの読出し／書込み制御。選択回路５０．４０への選択指示及び機能回路６０への
動作開始指示等を行なう制御回路、６０は前記選択回路
３０により選択されたデータに対し、後述するビット入
換えを行なう本発明の特徴とする機能回路である。通常
９選択回路３０の下にはいくつかの機能回路が並列に接
続されている。第１図を用いて通常のベクトル処理の流れを簡単に説明
する。まず、演算に先立ちメモリ（因示せず）上からベクトル
データを読出し、楼素分だけベクトルレジスタへロード
する。この場合、メモリから読出されたデータはデータ
線７００を介して本データ処理装置へ送られてくる。制
御回路５０は命令語を解読し２選択線ＳＬ１により選択
回路４０においてデータ線７００を選択し２選択された
データ４０１又は４０２は２．指定されたベクトルレジ
スタ１０又は２０に対し、書込指示線ＷＥ１又はＷＢ２
及び書込みアドレスＷＡ１　又はＷＡ２により指定され
たベクトルレジスタへ書込まれる。次にロードされた前記ベクトルデータは、読出しアドレ
スＲ’Ａ　Ｉ又はＲＡ２によりデータ線１００又は２０
０上に読み出され２選択回路３０において選択線ＳＬ２
によりどち゛らか選ばれ、データ線６００を介して機能
回路６０に対して順次送出される。機能回路６０は、動
作指示線ＭＯＤに応動して、送られてきたデータに対し
演算を実行し、その結果を順次データ線６００を介して
ベクトルレジスタへ送出する。ベクトルレジスタはベク
トル制御部５０からの指示により書込みアドレスＷＡ１
又はＷｉ２により指定されたエリアへ演算結果を順次書
込んでいく。また演算結果をメモリ上へ格納する場合は、読出しアド
レスＲＡ１又はＲＡ２によりベクトルレジスタからデー
タを読み出し、データ線７０１を介してメモリ部へ送出
される。第４図は第１因の機能回路６０を示すものである。機能
回路６０は１ｍビット構成の入力ベクトルデータを格納
するレジスタＲＥＧ　Ａと、レジスタＲＥＧＡから１ビ
ットを抽出し格納位置を反転させてデータを出力する反
転回路ＢＲＶと。該反転されて出力されたデータと（ｍ−１）ピント分に
対する定数とを入力とするシフタ回路ＳＦＴと、該シフ
タＳＦＴに対するシフトモードを保持するフリップフロ
ップＭＲと、該シフタＳＦＴに対するシフト数を保持す
るレジスタＢＣＲと、シフタＳＦＴの出力データを格納
するレジスタＲＥＧＢとから構成される。第５図は、第４因の反転回路ＢＲＶとシフタ回路ＳＦＴ
とを含む部分の具体的な回路例を示す図である。ＲＥＧ
’Ａはデータ線３００によりベクトルレジスタから読み
出されたｍビット構成のデータｄＯ＋　ｄｌ　＋　ｄ２
　＋・・・・・ｄｍ　１＋　ｄｍ　１＋１　＋・・・・
・・。ａｍ−１を格納するレジスタである。ＢＲＶはＲＥＧ　
Ａの下位１ビット即ち、データ（ｉｍ−’１からａ、＋
二対しビット格納位置を反転して５．データｂｍｌ＋ｂ
ｍ−１＋１．・・・・・ｂｍ−１を出力する反転データ
出力部である。データｂＯから１）ｍ−１までのｍビッ
トの合成データはシフタ回路ＳＦＴの入力データとなり
、この時、　ｂｏから１）ｍ−１−１までは予め定めら
れた定数Ｃ（Ｃ＝Ｏ又は１）が強制的に入力される。こ
のようなビット合成部の詳細を表１に示す。表１即ち、ビット　ｂｏから１）ｍ−１−１までは定数Ｃを
入力し＋ｂｍ−１からｂｌ−１に対してはＲＥ（）Ａの
データｄｍ−１＋　ｄｍ−２＋　”””＋　ｄｍ−１−
Ｈ＋　ｄｍ−１を入力する。以上のデータは次にシック回路ＳＦＴにより。右／左にシフトされる。シック回路ＳＦＴに対しては、
シフトモード線Ｍ及びシフト数指示線ＳＦＣを介して各
フリップフロップＭＲ及びＳＣＲが接続されており、こ
れらの指示線によりシフトされたデータｋＯからｋ。−
１までのｍビットのデータはレジスタＲＥＧＢに格納さ
れデータ線６００を介してベクトルレジスタへ送出され
る。表２は一例として、ｍ＝８．１＝４の場合の第５図の回
路動作を表わした図であり、有効シフト数はＯ〜４ビッ
トまでとし１Ｍ−０の時は左シフ）、Ｍ＝１の時は右シ
フトを行なうものとする。また左シフトの場合、空いた
位置には′）　　　　　　　　”０・が挿入されるもの
とする。第５図をま表２において、Ｃ＝’０とした時の
ｋＯ〜に７発生回路を示した図であり＋　ｔＯ〜ｔ７は
シフト数８７０表２をデコーダＤによりデコードした信号で２例えば５ｙｃ
＝ｏならｔｏ＝１，５ＦＣ＝４ならｔ４＝１となる。ま
た、　Ｇ１は論理積ゲート、Ｇ２は論理和ゲートを表わ
す。次に、第１因のデータ処理装置を用いて。ＦＦＴなどにしばしば使用されるベクトルデータの入換
え動作の一具体例を第６図を参照して説明する。第６図（ａ）に示したメモリ７０に、基準アドレスＭＡ
から順次２ωバイト幅（ω≧０）のデータｆ（０）、　
　ｆ（１）、・・・、ｆ（ｉ）が格納されていて、　Ｆ
ＦＴのプロセスにおいて第２図の（ｂ）に示す様な入力
データの入換えを１ビットの反転回路ＢＲＶで行なう場
合を説明する。この場合、要素数１の有効ビット数をｐ
とすると、シフトモードＭ及びシフト数ＳＦＣは以下の
様に規定される。 −例として１ｉ＝７１　ω−：２＋　１−４＋　　ｐ−
３の場合を第６図に示す。まず、第６ｅ（ｂ）の如く、
ベクトルレジスタ１０に対しポインタ１（有効ビット数
：６）を順次ロードする。次にベクトルレジスタ１０の
ベクトルデータ即ち前記ポインタ１を選択回路３０を介
してシフタ付反転回路を有する機能回路６０へ送出する
。ｆ（１）のデータ幅は４バイトであるため、１−ｐ−１
、ω−２となりこのときのシフトモードＭＲには０”、
Ｆ２Ｏ，Ｈには００１”をセットする。シック付ビット反転回路（第４図のＢＲＶとＳＦＴとを
含む部分）はＲＥＧＡに格納された３２ビットのデータ
（ポインタ１を含む）の下位４ビットをビット反転させ
、かつ、上位２８ビットに対しては定数“０”を出力し
、これらの合成データをシフト回路ＳＦＴにより、左へ
１ピントシフトさせてレジスタＲＥＧＢ　（第４因）へ
格納する。即ち、ベクトルレジスタ１０の内容は以下の
様に変換される。[Industrial Application Field] The present invention relates to a data processing device that processes vector data. [Prior Art] As the application fields of computers expand. The demand for high-speed computer processing is expanding without limit. Supercomputers have been particularly actively developed in recent years to meet such demands in the science and technology fields. In general, many supercomputers achieve high speed by having a vector processing device that processes huge amounts of vector data in a pipeline. By the way, fast Fourier transform (F
FT). FFT is applied to fields such as science, engineering, social science, analysis of boundary value problems of partial differential equations, 7 vector analysis of various signals such as audio signals, seismic waves, brain, wave, and economic fluctuation data. , With the recent advent of high-speed vector processing devices, there are a number of new features: 1. FFT
Research on algorithms is also gaining momentum. FFT
The algorithm was developed by C! in 1'965. ooley and Tu
Paper on FFT by key” An Algor
Ithm for the Machil, e Ca1c
ulation of I:! complex four
-ier 5eries', Math, C! om
Various algorithms have been proposed since the publication of ``Put,'' 19+ 297-301 (1965'). One of the features of this type of FFT algorithm is that the data is bit-continuous (bit reversed) during the conversion process.
There will be cases where this is necessary. For example, if input data f (i) is Fourier transformed by the fast Fourier transform unit 900 and F (i) is the result, then FIG.
As shown in a) (8 data f(i) (i = 0
7), the input data f(i) may be left as is, but there are cases where bit reversal is required at the time of output. Furthermore, as shown in Section 26 (b), there are cases where input data f(1) is bit-reversed so that F(1) can be left as is at the time of output. Kuraguchi [Problems to be Solved by the Invention] Figure 3 shows an example of a program used in a conventional FFT, in which the part corresponding to step A in the program processes so-called consecutive bits of input data. This is the part where you are. When rearranging memory data based on bit continuity, bit continuity processing and shift processing are generally required when generating a memory address for the memory data based on the number of elements and data width of the target data. Conventionally, these processing parts have been considered difficult to vectorize, and therefore have been performed using scalar operations. Therefore, when the amount of input data is large, the time required for these processes is large. This has the disadvantage that the execution performance of FFT is significantly reduced. An object of the present invention is to eliminate the above-mentioned drawbacks by providing a bit swapping means and a shifting means for vector data, and to enable vectorization of bit contiguous processing parts, thereby enabling high-speed data processing that requires data swapping processing. The objective is to provide a data processing device that achieves [Means for Solving the Problems] According to the present invention, data processing is provided that includes a plurality of vector registers that hold vector data consisting of a plurality of elements, and processes vector data sequentially taken out from the Peltle registers. In the apparatus, one of the vector registers
For vector element data of m bits (m≧2) sequentially extracted from data, one bit (1≦
m), an inverting means for inverting the bit storage position and outputting it; a constant outputting means for outputting a predetermined constant for the remaining (m-1) bits; and output constant data of the constant outputting means. holding means for holding a shift mode that defines a shift operation for the composite data of and the data generated by the inverting means and for holding a shift number; and a shift operation means for shifting the composite data by the holding means. A data processing device is obtained which is characterized by having the following features. DESCRIPTION OF THE PREFERRED EMBODIMENTS [Embodiments] Next, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a diagram showing a data processing apparatus according to a first embodiment of the present invention. In the first cause. 10 and 20 are vector registers each holding vector data consisting of a plurality of elements; 30 is a selection circuit for selecting subsequent data from one of the vector registers 10 and 20; and 40 is a selection circuit for selecting successive data from one of the vector registers 10 and 20. A selection circuit 50 selects write data, and 50 decodes the command and controls read/write to the vector register. A control circuit that instructs the selection circuit 50, 40 to select, and instructs the functional circuit 60 to start operation, etc., and 60 is a feature of the present invention that performs bit swapping, which will be described later, on data selected by the selection circuit 30. It is a functional circuit that Normally, several functional circuits are connected in parallel under the 9 selection circuit 30. The flow of normal vector processing will be briefly explained using FIG. First, prior to calculation, vector data is read from a memory (not shown) and loaded into a vector register by the number of row elements. In this case, the data read from the memory is sent to the present data processing device via the data line 700. The control circuit 50 decodes the command word, selects the data line 700 in the selection circuit 40 using the 2 selection line SL1, and the selected data 401 or 402 is 2. Write instruction line WE1 or WB2 for specified vector register 10 or 20
and is written to the vector register specified by write address WA1 or WA2. Next, the loaded vector data is transferred to the data line 100 or 20 by the read address R'A I or RA2.
0 on the selection line SL2 in the 2 selection circuit 30.
one of them is selected and sent out sequentially to the functional circuit 60 via the data line 600. Functional circuit 60 executes arithmetic operations on the sent data in response to operation instruction line MOD, and sequentially sends the results to the vector register via data line 600. The vector register is set to the write address WA1 according to instructions from the vector control unit 50.
Alternatively, the calculation results are sequentially written into the area specified by Wi2. Further, when storing the calculation result in the memory, data is read from the vector register using the read address RA1 or RA2, and sent to the memory section via the data line 701. FIG. 4 shows the functional circuit 60 of the first cause. The functional circuit 60 includes a register REGA that stores input vector data having a 1 m bit configuration, and an inversion circuit BRV that extracts 1 bit from the register REGA, inverts the storage position, and outputs the data. A shifter circuit SFT that receives the inverted and output data and a constant for (m-1) focus portions, a flip-flop MR that holds a shift mode for the shifter SFT, and a shift number for the shifter SFT. It is composed of a register BCR that stores the output data of the shifter SFT, and a register REGB that stores the output data of the shifter SFT. Figure 5 shows the fourth factor, the inverting circuit BRV and the shifter circuit SFT.
FIG. 3 is a diagram illustrating a specific circuit example of a portion including. REG
'A is m-bit data dO+dl+d2 read from the vector register via the data line 300
+・・・・dm 1+ dm 1+1 +・・・・
.... This is a register that stores am-1. BRV is REG
The lower 1 bit of A, that is, the data (im-'1 to a, +
5. Invert the bit storage positions for the two pairs. data bml+b
m-1+1. ...This is an inverted data output section that outputs bm-1. The m-bit composite data from data bO to 1)m-1 becomes the input data of the shifter circuit SFT, and at this time, from bo to 1)m-1-1 is a predetermined constant C (C=O or 1). ) is forcibly entered. Details of such a bit combining section are shown in Table 1. Table 1 That is, input the constant C from bit bo to 1) m-1-1, and input the data of RE()A from +bm-1 to bl-1 dm-1+ dm-2+ """+ dm- 1-
Input H+ dm-1. The above data is then processed by thick circuit SFT. Shifted to the right/left. For thick circuit SFT,
Each flip-flop MR and SCR are connected via a shift mode line M and a shift number instruction line SFC, and data kO to k shifted by these instruction lines are connected. −
Data of m bits up to 1 is stored in register REGB and sent to the vector register via data line 600. As an example, Table 2 is a diagram showing the circuit operation of FIG. 5 when m=8.1=4, and the effective shift number is from O to 4 bits (left shift when 1M-0), M When =1, a right shift is performed. In addition, in the case of left shift, it is assumed that ``0'' is inserted in the vacant position. In Figure 5 and Table 2, the 7 generation circuit is shown for kO ~ when C = '0. In the figure, tO to t7 is a signal obtained by decoding the number of shifts 870 in Table 2 by decoder D. For example, 5yc
If =o, then to=1, and if 5FC=4, then t4=1. Further, G1 represents an AND gate, and G2 represents an OR gate. Next, using the data processing device of the first cause. A specific example of a vector data interchange operation often used in FFT and the like will be explained with reference to FIG. The reference address MA is stored in the memory 70 shown in FIG. 6(a).
2ω byte width (ω≧0) data f(0) sequentially from
f(1),..., f(i) are stored, and F
A case will be described in which input data is exchanged as shown in FIG. 2(b) in the FT process using a 1-bit inversion circuit BRV. In this case, the number of effective bits of element number 1 is p
Then, shift mode M and shift number SFC are defined as follows. - For example 1i=71 ω-: 2+ 1-4+ p-
Case 3 is shown in FIG. First, as in Section 6e(b),
Pointer 1 (effective number of bits: 6) is sequentially loaded into the vector register 10. Next, the vector data of the vector register 10, ie, the pointer 1, is sent via the selection circuit 30 to the functional circuit 60 having an inversion circuit with a shifter. Since the data width of f(1) is 4 bytes, 1-p-1
, ω-2, and the shift mode MR at this time is 0”,
Set "001" in F2O,H. The bit inversion circuit with thick (the part including BRV and SFT in Figure 4) converts the lower 4 bits of the 32-bit data (including pointer 1) stored in REGA. The bits are inverted and a constant "0" is output for the upper 28 bits, and the combined data is shifted by one pin point to the left by the shift circuit SFT and stored in the register REGB (fourth factor). , the contents of the vector register 10 are converted as follows.

〔０〕チ２４．００００００００　（０）→[0] Chi24.00000000 (0) →

〔０〕■２
４．００００００００　（０）[0]■2
4.00000000 (0)

〔０〕チ２４．００００
０００１　（１）→［０：］　、Ｘ−２４，０００１０
０００（１６）[0] Chi 24.0000
0001 (1) → [0:], X-24,00010
000 (16)

〔０〕繋２４．００００００１０　　（
２）→[0] Tsunai 24.00000010 (
2)→

〔０〕チ２４．００００１０００　（８）[0] Chi24.00001000 (8)

〔０〕
チ２４．００００００１１　（３）→[0]
Chi24.00000011 (3)→

〔０〕薫２４．０
００１１０００　（２４）[0] Kaoru 24.0
0011000 (24)

〔０〕餐２４．００００　［
１１００（４）→[0] Meal 24.0000 [
1100 (4) →

〔０〕チ２４．０００００１００　＜
　４　）[0] Chi24.00000100 <
4)

〔０〕餐２４．０００００１０１　（５）→〔
０〕子２４．０００１０１００　（２０）[0] Dinner 24.00000101 (5) → [
0] Child 24.00010100 (20)

〔０〕蒼２４
．０００００１．１０　（６）→[0] Ao24
．． 000001.10 (6)→

〔０〕苦２４，０００
１１０００　（１２）[0] Koku 24,000
11000 (12)

〔０〕藁２４．０００００１１１
　　（７）→[0] Straw 24.00000111
(7) →

〔０〕藁２４．［Ｉ［］０１１１１］Ｃｌ
　（２８）（注）［０’：ｌ＊ｎ・・・ｎビン１分のデ
ータがすべて”０°′であることを表わす。これらのデータは第６図（Ｃ）の如く、順次選択回路４
０を介してベクトルレジスタ１０に格納される。次にベ
クトルレジスタ２０に対し、メモリ７０上の基準アドレ
スＭＡを８要素順次格納する。然る後、ベクトルレジス
タ１０の内容及びベクトルレジスタ２０の内容を各要素
対応に読出して加算回路（特に図示していない）で演算
し、結果をベクトルレジスタ２０へ第６図（ｄ）の如＜
、格納する。この時点でベクトルレジスタ２０内の各要
素データは、メモリ上のベクトルデータｆ（１）のメモ
リアドレスを示していることになる。従って５次にベク
トルレジスタ２０の内容を順次取出し、該取出したデー
タをメモリ；１　　　　　　　アトｖ、ｌ：し１メ９す
上７゛ら順次データを読出してベクトルレジスタ１０へ
、第６図（ｅ）の如く格納する。メモリアドレスｊに格
納されているデータをＭ　（ｊ）とすると、第６因から
れかるように。Ｍ（ＭＡ　＋　［１）　　→　ｆ（０ン、　　　Ｍ（Ｍ
Ａ　＋１（Ｓ）　　→　ｆ（４ンＭ（ＭＡ＋　８）→ｆ
（２）、　　Ｍ（ＭＡ＋２４）→ｆ（６）Ｍ（ＭＡ＋４
）→ｆ（１）、　　Ｍ’（ＭＡ＋２０）→ｆ（５ンＭ（
ＭＡ＋１２）→ｆ（３）、　　Ｍ（ＭＡ＋２８）→ｆ（
７）となり、ベクトルレジスタ１０には順次、ｆ（０）
。ｆ（４）、　　ｆ（２）、　　ｆ（６）、　　ｆ（１）
、　　ｆ（５）、　　ｆ（３）、　　ｆ（７）。即ち、入換えられたデータが格納されたことになる。なお、第２図（ａ月二示した様なＦＦＴの変換データの
出力時にビットリバースさせる場合も前記同様なシック
付反転回路を使用してアドレス計算を行ない２例えばベ
クトルレジスタ１ｏに格納されているＦＦＴの結果Ｆ　
（０）　、　　Ｆ　（１）　、・・・川Ｆ　（６）　、
　？　（７）をベクトルレジスタ２０の内容即ちメモリ
アドレス上へ順次格納すればよい。即ち以下に示す様に
メモリへ格納される。Ｍ（ＭＡ＋　０）←Ｆ（［１）、　　Ｍ（ＭＡ＋　４）
←Ｆ（４）Ｍ（ＭＡ＋　８）←Ｆ（２）、　　Ｍ　（Ｍ
Ａ＋　１２）←Ｆ（６）Ｍ（ＭＡ＋１６）４−Ｆ（１）
、　　Ｍ（ＭＡ＋２０）４−Ｆ（５）ＭＣＭＡ＋２４）
←Ｆ（３）、　　Ｍ　（ＭＡ　＋　２８　）←Ｆ（７）
更に、」二記に説明した実施例と同じ効果を持つ本発明
の第２の実施例によるデータ処理装置の機能回路６０を
第７図に示す。第７図の回路は、前記反転回路ＢＲＶと
シフト回路ＳＦＴの接続が第４図の回路と逆の関係にな
っている。即ち、レジスタＲＫＧＡに格納されたｍビットのデータ
ｄｏ、ｄ１．・・・・ｄｍ−１に対しシフトモードＭ及
びシフト数ＳＦＣより指定されたシフト動作をシフト回
路１３ＦＴで行なう。次にシフトされたデータｋＯｒｋ
ｊ・・・・・・ｋｍ−１，・・・・・・＋　ｋｍ　Ｉの
下位１ビットを反転回路ＢＲＶに入力する。該反転回路
ＢＲＶの出力ｂｍ−１＋・・・・・、ｂｍ−１にはビッ
ト格納位瞳が反転されたデータｒ　ｋｍ　Ｓｒ　ｋｍ−
２＋　　・・・・・ｋｍ−１４１＋　ｋｍ　１が出力さ
れ、この時、上位（ｍ−１）ピント即ちｂＯ＋ｂ＋＋・
・・・・・＋ｂｍ（−＋に対しては定数Ｃ（Ｏ又は１）
が常時出力され８合成されたデータがレジスタＲＥＧ　
Ｂに格納される。〔発明の効果〕本発明は以上説明した様に、シック付反転回路によりピ
ット連続み処理もベクトル化可能となりメモリ上のデー
タの入換え処理を高速化できるという効果があり、特に
科学技術計算における高速データ処理の有効な手段とな
り得るものである。[0] Straw 24. [I[]01111]Cl
(28) (Note) [0': l*n...indicates that the data for one bin of n are all "0°'. These data are sequentially passed through the selection circuit as shown in Figure 6 (C). 4
0 is stored in the vector register 10. Next, eight elements of the reference address MA on the memory 70 are sequentially stored in the vector register 20. After that, the contents of the vector register 10 and the contents of the vector register 20 are read out for each element, and an adder circuit (not particularly shown) performs an operation, and the result is transferred to the vector register 20 as shown in FIG. 6(d).
,Store. At this point, each element data in the vector register 20 indicates the memory address of the vector data f(1) on the memory. Therefore, the contents of the vector register 20 are sequentially taken out in the fifth step, and the retrieved data are read out sequentially from the memory; ). Letting the data stored at memory address j be M (j), as can be seen from the sixth factor. M(MA + [1) → f(0n, M(M
A +1(S) → f(4nM(MA+8) →f
(2), M(MA+24)→f(6)M(MA+4
)→f(1), M'(MA+20)→f(5nM(
MA+12) → f(3), M(MA+28) → f(
7), and the vector register 10 sequentially stores f(0)
. f(4), f(2), f(6), f(1)
, f(5), f(3), f(7). In other words, the replaced data is stored. In addition, when bit-reversing the output of FFT converted data as shown in Figure 2 (A-2), an address calculation is performed using the same inverting circuit with thick as described above, and the address is stored in the vector register 1o. FFT result F
(0), F (1), ... River F (6),
? (7) may be sequentially stored in the contents of the vector register 20, that is, on the memory addresses. That is, it is stored in memory as shown below. M(MA+ 0)←F([1), M(MA+ 4)
←F(4)M(MA+8)←F(2), M(M
A+ 12)←F(6)M(MA+16)4-F(1)
, M(MA+20)4-F(5)MCMA+24)
←F(3), M (MA + 28)←F(7)
Furthermore, FIG. 7 shows a functional circuit 60 of a data processing apparatus according to a second embodiment of the present invention, which has the same effects as the embodiment described in Section 2. In the circuit shown in FIG. 7, the connection between the inverting circuit BRV and the shift circuit SFT is reversed to that in the circuit shown in FIG. 4. That is, m-bit data do, d1 . ...The shift circuit 13FT performs a shift operation specified by shift mode M and shift number SFC for dm-1. Next shifted data kOrk
j...km-1,...+km The lower 1 bit of I is input to the inversion circuit BRV. The output bm-1+, bm-1 of the inverting circuit BRV contains data r km Sr km- whose bit storage pupil is inverted.
2+...km-141+km 1 is output, and at this time, the upper (m-1) focus, that is, bO+b++・
...+bm (constant C (O or 1) for -+
is constantly output and the 8 combined data is stored in register REG.
It is stored in B. [Effects of the Invention] As explained above, the present invention has the effect that the inverting circuit with thick enables the processing of continuous pits to be vectorized and speeds up the processing of exchanging data on the memory. This can be an effective means of high-speed data processing.

[Brief explanation of drawings]

第１因は本発明の第１の実施例によるデータ処理装置を
示したブロック因１．第２（２）は高速フーリエ変換時
のデータ入換えを示したブロック図、第３図は従来の高
速フーリエ変換用プログラムの一例を示した図、第４因
は第１因の機能回路６０の一例を示したブロック図、第
５肉は第４図の反転回路ＢＲＶとシフタ回路ＳＦＴとを
含む部分の回路図、第６図は上記実施例によるデータ処
理装置によるデータ入換え処理の動作を説明するための
図、第７（８）は本発明の第２の実施例によるデータ処
理装置の機能回路６０を示したブロック図である。１０．２０・・・ベクトルレジスタ、３０．４０・・・
選折回路、５０・・・制御回路、６０・・・機能回路。ＲＥＧＡ、　ＲＫＧＢ、　ＳＣ！Ｒ・・・レジスタ、Ｍ
Ｒ・・・フリップフロップ、　Ｇ１・・・論理積グー）
、Ｇ２・・・論理和ゲート、Ｄ・・・デコーダ。代理人（７１２７）弁理士後藤洋介：）６０機能回路ＮＤ第６図（α）ＭＡ　　　墨坪アドレスｆ（０）〜ｆ（７）　　：　　公グトルデータ（ｅ）公りトルレヅスタ（１ωThe first factor is block factor 1 which shows the data processing device according to the first embodiment of the present invention. 2(2) is a block diagram showing data exchange during fast Fourier transform, FIG. 3 is a diagram showing an example of a conventional fast Fourier transform program, and the fourth factor is the functional circuit 60 of the first factor. A block diagram showing an example, the fifth part is a circuit diagram of a portion including the inversion circuit BRV and shifter circuit SFT in FIG. Figure 7 (8) is a block diagram showing a functional circuit 60 of a data processing device according to a second embodiment of the present invention. 10.20... Vector register, 30.40...
Sorting circuit, 50... control circuit, 60... functional circuit. REGA, RKGB, SC! R...Register, M
R...flip-flop, G1...logical product)
, G2...OR gate, D...decoder. Agent (7127) Patent attorney Yosuke Goto:) 60 functional circuit ND Fig. 6 (α) MA Sumitsubo address f(0) to f(7): Kogutle data (e) Kori Torrezusta (1ω

Claims

[Claims] 1. In a data processing device that includes a plurality of vector registers holding vector data consisting of a plurality of elements and processes vector data sequentially taken out from the vector registers, one of the vector registers Extract 1 bit (1≦m) of the m-bit (m≧2) vector element data sequentially extracted from the data,
Inverting means for inverting and outputting the bit storage position, and the remaining bit storage position (
m-1) For bits, a constant output means for outputting a predetermined constant, and a shift operation for converting the output constant data of the constant output means and the data generated by the inversion means into composite data are defined. A data processing device comprising: holding means for holding a shift mode and a shift number; and shift operation means for shifting the composite data using the holding means. 2. In a data processing device that includes a plurality of vector registers holding vector data consisting of a plurality of elements and processes vector data sequentially taken out from the vector registers, m bits sequentially taken out from one of the vector registers. (m≧2) retains a shift mode that defines a shift operation for vector element data of the configuration;
and a holding means for holding a shift number, a shift operation means for shifting the vector element data by the holding means, and one bit (1≦m) of the m-bit shift data generated by the shift operation means. A data processing device comprising: inverting means for inverting and outputting a bit storage position; and constant output means for outputting a predetermined constant for the remaining (m-1) bits.