JPH0214363A

JPH0214363A - Fast fourier transform method and arithmetic apparatus therefor

Info

Publication number: JPH0214363A
Application number: JP1059841A
Authority: JP
Inventors: John E Whelchel; ジョン　イー　ウェルチェル; James F Mcarthur; ジェームズ　エフ　マッカーサー
Original assignee: E Systems Inc
Current assignee: Raytheon Co
Priority date: 1988-03-14
Filing date: 1989-03-14
Publication date: 1990-01-18
Also published as: DE3908276A1; GB8905801D0; GB2216693A; GB2216693B; IL89604A; IL89604A0

Abstract

PURPOSE: To execute the FFT of digital data without using a switch by preventing an input memory and an output memory data word from being transmitted to the other side of a channel except performing the transmission within a Fourier transformation operator. CONSTITUTION: Digital data is received by the window composed of N complex number data words and the data is stored within R independent parallel channels. Next, these data is sequenced within a LogR N serially arrayed stages. Each stage has a Fourier transformation operator 10 having an arithmetic element to be a data address operator FFT or DFT, the RAM for each channel, a rotation factor operator deforming data to be transformed without performing cross channel communication and a new phase system operator. First and third substitution operators are transformed into phase shift operators. By making the complex number data word perform a phase rotation in each channel by these operators, the action which is equivalent to the switching of the data word is made.

Description

【発明の詳細な説明】り産業上の利用分野〕本発明は、−組のディジタルデータのフーリエ変換を実
行する方法および装置に関するものである。特に、連続
的に受信した電気信号からサンプリングした個別信号を
高速フーリエ変換（ＦＦＴ）するプロセッサに関するも
のである。DETAILED DESCRIPTION OF THE INVENTION FIELD OF INDUSTRIAL APPLICATION The present invention relates to a method and apparatus for performing a Fourier transform of a set of digital data. In particular, it relates to a processor that performs fast Fourier transform (FFT) on individual signals sampled from continuously received electrical signals.

[Conventional technology]

このようなＦＦＴ　（高速フーリエ変換）技術には、マ
ルチプルディスクリートフーリエ変換（ＤＦＴｓ）の高
速演算が包含されている。このＤＦＴは、これら個別（
ディスクリート）信号の時間領域と周波数領域との間の
関係について説明するためのツール（手段）として使用
される。この高速フーリエ変換を行なう装置および方法
によって変換すべきデータワードの数量と、ＤＦＴ（Ｎ
２）を演算するために必要な操作の回数との関係から、
これの効率が決定される。Such FFT (fast Fourier transform) techniques include the high speed operation of multiple discrete Fourier transforms (DFTs). This DFT is calculated using these individual (
It is used as a tool to explain the relationship between the time domain and frequency domain of discrete (discrete) signals. The number of data words to be transformed by this fast Fourier transform apparatus and method and the DFT (N
From the relationship with the number of operations required to calculate 2),
The efficiency of this is determined.

大規模なりＦＴをマルチプル小規模ＤＦＴｓ　（例えば
、２または４の基数Ｒで）代用できるならば、必要な操
作回数を相当程度減少させることができる。If multiple small-scale DFTs (eg, with a base R of 2 or 4) can be substituted for large-scale FTs, the number of required operations can be reduced considerably.

更に、このマルチプル小規模ＤＦＴｓの演算は、各ステ
ップが類似のステップの多段プロセスより成っている。Furthermore, the operation of this multiple small-scale DFTs consists of a multi-stage process where each step is similar.

このことによって、プロセッサは、より少ない特有のコ
ンポーネントを有するＦＦＴを演算することが可能とな
る。これらステージ（段）Ｂの数は、サンプルのサイズ
と基数とに基いてＢ＝ＬｏｇＦＬＮで関連付けられてい
る。This allows the processor to compute an FFT with fewer unique components. The numbers of these stages B are related based on the sample size and cardinality by B=LogFLN.

しかし乍ら、演算ステージ数が大きくなると、丸めの誤
差のために演算が複雑化すると共に、演算結果の精度が
低下してしまう欠点がある。従ってサンプルサイズ１１
　Ｎ　Ｉ＋が増大した場合の処理効率を向上させるため
に、ＦＦＴ演算方法は、ＤＦＴ基数基数上び演算段Ｂの
数量との間の妥協によって得られるもので、一般に基数
はデータバスにおけるスイッチングコストと複雑度によ
って４または８に限定されている。However, as the number of calculation stages increases, the calculations become more complex due to rounding errors, and the accuracy of the calculation results decreases. Therefore the sample size is 11
In order to improve the processing efficiency when N I+ increases, the FFT calculation method is obtained by a compromise between the DFT radix number and the number of calculation stages B, and the radix is generally a combination of the switching cost and the number of processing stages in the data bus. Limited to 4 or 8 depending on complexity.

従来、ＦＦＴを演算する数種類の独特な方法が提案され
ており、これら演算をハードウェアで実行した場合には
異なったアーキテイクチェアで実現されている。先ず、
第１の従来例は、Ｃｏｏｌｅｙおよび７ｕｋｅｙ氏らに
よろもので、”Ｍａｔｈ　Ｃｏｍｐｕｔｅｒ’″１９６
５年４月ｖｏｌ　１９．第２９７〜３０１頁の“Ａｎ　
Ａｌｇｏｒｉｔｈｍ　ｆｏｒｔｈｅ　Ｍａｃｈｉｎｅ　
Ｃａ１ｃｕｌａｔｉｏｎ　ｏｆ　Ｃｏｍｐｌｅｘ　Ｆｏ
ｕｒｉｅｒＳｅｒｉｅｓ”に開示されている。このタイ
プのものには″可変幾何学″が包含されており、これは
、データアドレス動作がステージからステージで変化す
ることを意味する。In the past, several unique methods for calculating FFT have been proposed, and when these calculations are performed in hardware, they are realized using different architectural chairs. First of all,
The first conventional example is by Cooley and 7ukey et al., "Math Computer'" 196
April 5th vol 19. “An” on pages 297-301
Algorithm for the Machine
Calculation of Complex Fo
This type includes a "variable geometry," meaning that the data addressing behavior varies from stage to stage.

また、第２のタイプのものは、Ｐｅａｓｅ氏によって紹
介された″一定幾何学”タイプのもので、′″Ｊｏｕｒ
ｎａｌ　ｏｆ　ｔｈｅ　Ａｓ５ｏｃｉａｔｉｏｎ　ｆｏ
ｒ　Ｃｏａ＋ｐｕｔｉｎｇ　Ｍａｃｈｉｎｅｒｙ”１９
６８年４月ｖｏｌ　１５．第２５２−２６４頁の“Ａｄ
ａｐｔａｔｉｏｎ　ｏｆ　ｔｈｅ　Ｆａｓｔ　Ｆｏｕｒ
ｉｅｒ　Ｔｒａｎｓｆｏｒｍ　ｆｏｒ　Ｐａｒａｌｌｅ
ｌ　ｆ”ｒｏｃｅｓｓｉｎｇ”に３己載されている。こ
こでデータのアドレス動作はステージからステ、−ジま
では同一のままである。このハードウェアの簡素化の実
現は、可変幾何学タイプのものに関する″回転要素”Ｒ
ＯＭの順序における変化である。これら両方のタイプの
ものにおいて、ＲＯＭ要素の順序はステージ毎に変化す
ると共に、一般に、これはアドレスカウンターで取扱わ
わる。The second type is the ``constant geometry'' type introduced by Mr. Pease, and the ``Jour''
nal of the As5ocation fo
rCoa+putting Machinery”19
April 1968 vol 15. “Ad” on pages 252-264
aptation of the Fast Four
ier Transform for Paralle
There are 3 articles listed in ``rocessing''. Here, the data address operation remains the same from stage to stage. The realization of this hardware simplification is due to the "rotating element" R for variable geometry types.
A change in the order of OM. In both of these types, the order of the ROM elements changes from stage to stage and is generally handled by an address counter.

最近の開発によれば、パイプラインプロセッサの導入で
ある。このアーキティクチェアによって。A recent development is the introduction of pipeline processors. By this architectural chair.

演算負荷を順次のパラレルステージに分割するので、Ｒ
チャンネルの同時の処理が可能となる。可変幾何学アー
キティクチェアにおけるパイプラインプロセッサの周知
の一例は、ＭｃＣｌｅｌｌａｒおよびＰｕｒｄｙ氏らに
よる”Ａｐｐｌｉｃａｔｉｏｎｓ　ｏｆ　Ｄｉｇｉｔａ
ｌ　Ｓｉｇｎａｌ　Ｐｒｏｃｅｓｓｉｎｇ”　１９７８
年Ｐｒｅｎｔｉｃｅ　）ｔａ１１社のＡｌａｎ１／、Ｏ
ｐｐｅｎｈｅｉｍｉ集者、第２６８−２７８頁に記載さ
れている。データの並列入力率を遅延させないために、
各ステージにおける演算素子自身は２段の４ポインｌ−
ＦＦＴ　（ＤＦＴＳの代りに）であり、各ポイントは、
−段当り４個の演算プロセッサより構成されている。Since we divide the computational load into sequential parallel stages, R
Simultaneous processing of channels is possible. A well-known example of a pipelined processor in a variable geometry architectural chair is the one published by McClellar and Purdy et al. in “Applications of Digital.
l Signal Processing” 1978
Prentice ) ta11's Alan1/, O
ppenheimi, pp. 268-278. In order not to delay the parallel input rate of data,
The arithmetic element itself in each stage consists of two stages of 4 points l-
FFT (instead of DFTS) and each point is
- Consists of four arithmetic processors per stage.

しかし乍ら、基数が８またはそれ以上に増加して更に並
列化すると、コミュテータまたはクロスバ−スイッチの
数が増加することによって、かなり高価なものとなって
しまう問題点がある。However, when the number of radixes increases to 8 or more and further parallelization is performed, there is a problem that the number of commutators or crossbar switches increases, resulting in a considerably high cost.

一定幾何学アーキティクチェアを利用したパイプライン
化ＦＦＴプロセッサがＣｏｒｉｎｔｈｉｏｓ氏によって
開発され、ＩＥＥＥ　Ｔｒａｎｓａｃｔｉｏｎｓ　ｏｎ
　Ｃｏｍｐｕｔｅｒｓ。A pipelined FFT processor using a constant geometry architectural chair was developed by Corinthios and published in IEEE Transactions on
Computers.

１９７１年６月　ｖｏｌ、ｃ−２０，“Ｔｈｅ　Ｄｅｓ
ｉｇｎ　ｏｆ　ａ　Ｃ１ａｓｓ　。June 1971 vol, c-20, “The Des
ign of a C1ass.

ｆ　Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓ　ｆｏｒｍ
　Ｃｏｍｐｕｔｅｒｓ”、第６１７〜６２３頁に開示さ
れている。このアーキティクチェアでも、クロスチャン
ネル通信の目的のためにスイッチングおよびゲート作用
が要求されている。f Fast Fourier Trans form
Computers", pages 617-623. This architectural chair also requires switching and gating for purposes of cross-channel communication.

このようなプロセッサでは同様に、複雑で大規模なメモ
リが要求され、これによって基数が増大するにつれて、
更に、扱いにくくなる（即ち、メモリ長はＮ／Ｒ”の関
数であるので、所望のメモリユニットの数はＲ２となる
からである）、このプロセッサに関する技術が米国特許
（ｃｏｒｉｎｔｈｉｏｓ発明）第３，７５４，１２８号
、１９７３年８月２１日に開示されると共に、ＩＥＥ　
Ｔｒａｎｓａｃｔｉｏｎ　Ｏｎ　Ｃｏｍｐｕｔｅｒｓ、
　１９７５年Ｉ月ｖｏ１．ｃ−２４．第８０−９２頁の
“Ａ　Ｐａｒａｌｌｅｌ　Ｒａｄｉｘ４　Ｆａｓｔ　Ｆ
ｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ　Ｃｏｍｐｕｔｅｒ
”にも記載されている。Such processors also have complex and large memory requirements, which, as the radix increases,
Furthermore, it becomes unwieldy (i.e., the desired number of memory units is R2 since the memory length is a function of "N/R"). No. 754,128, August 21, 1973, and IEE
Transaction On Computers,
January 1975 vol.1. c-24. “A Parallel Radix4 Fast F” on pages 80-92
Ourier Transform Computer
” is also stated.

他の開発は、上述した装置よおび方法の特定の特徴に関
するものである。Ｐｅｒｒｙの米国特許第４，１５９．
５２８号（１９７９年６月２６日）には、フーリエ変換
によって導入された位相シフトの補正が記載されている
。この技術によれば、バーレルスイッチおよび遅延素子
を利用して、小規模なりＦＴｓからの出力に対して適当
な位相補正をこれら出力を大規模なフーリエ変換に組込
む前に実行する。また、ＭｃＧｅｅ氏の米国特許第４，
５３４，００９号（１９８５年８月６日）によれば、前
述のＭｃＣｌａｌａｎおよびＰｕｒｄｙ氏のアーキティ
クチェアを実行し、スイッチとシフトレジスタを利用し
て演算ユニットの演算効率を向上させている。Other developments relate to certain features of the apparatus and methods described above. Perry U.S. Pat. No. 4,159.
No. 528 (June 26, 1979) describes the correction of phase shifts introduced by Fourier transforms. This technique utilizes barrel switches and delay elements to perform appropriate phase corrections on the outputs from smaller scale FTs before incorporating them into the larger scale Fourier transform. Also, Mr. McGee's U.S. Patent No. 4,
No. 534,009 (August 6, 1985) implements the aforementioned McClan and Purdy architectural chair and utilizes switches and shift registers to improve the computational efficiency of arithmetic units.

これら上述した単一チャンネルおよびマルチチャンネル
ＦＦＴプロセッサアーキティクチェアのすべてのものは
、スイッチを利用した相互チャンネル通信路のあるタイ
プのものが必要となる。これら通信路は可変幾何学のケ
ースにおいて時間およびステージと共に大幅に変化して
しまうものである。All of these single-channel and multi-channel FFT processor architectural chairs described above require some type of cross-channel communication path using a switch. These channels can vary significantly with time and stage in variable geometry cases.

[Purpose of the invention]

従って、本発明の目的は、クロスチャンネル通信路にお
いて、スイッチを用いずデジタルデータのＦＦＴ　（高
速フーリエ変換）を実行する装置を提供することである
。SUMMARY OF THE INVENTION Accordingly, it is an object of the present invention to provide an apparatus for performing FFT (Fast Fourier Transform) of digital data in a cross-channel communication path without using a switch.

また、本発明の他の目的は、ステージ間構造が実質的に
二重化されている新規な収縮性幾何学方法を実行するこ
とによって、デジタルデータのＦＦＴを演算する装置を
提供することである。Another object of the present invention is to provide an apparatus for computing an FFT of digital data by implementing a novel contractile geometry method in which the interstage structure is substantially doubled.

更に１本発明の他の目的は、基数サイズがスイッチング
構成の複雑度および／またはコストによって制限を受け
ないで、デジタルデータのＦＦＴ演算する装置を提供す
ることである。Yet another object of the present invention is to provide an apparatus for performing FFT operations on digital data where the radix size is not limited by the complexity and/or cost of the switching arrangement.

本発明の他の目的は、フーリエ変換の位相シフト特性を
データシャフリング（振るわし）の一部分に利用してデ
ジタルデータのＦＦＴ演算する装置を提供することであ
る。Another object of the present invention is to provide an apparatus that performs FFT calculations on digital data by utilizing the phase shift characteristics of Fourier transform as part of data shuffling.

また、更に、本発明の目的は、クロスチャンネル通信路
においてスイッチングデータのステップ無しで、デジタ
ルデータのＦＦＴ演算を実行できる装置を提供すること
である。It is a further object of the present invention to provide an apparatus capable of performing FFT operations on digital data without switching data steps in a cross-channel communication path.

〔Example〕

以下１図面を参照し乍ら本発明を詳述する。 The invention will now be described in detail with reference to one drawing.

先ず、本発明は、高速フーリエ変換（以下、ＦＦＴと称
す）を実行する装置および方法であり、これは前述のＰ
ｅａｓｅ氏による一定幾何学方法から得られる新規な収
縮性幾何学方法を演算するものである。デジタルデータ
がＮ個の複素数データワードから成るウィンドウで受信
されると共に、Ｒ個の独立した並列チャンネル内に記憶
される。First, the present invention is an apparatus and method for performing fast Fourier transform (hereinafter referred to as FFT), which is based on the above-mentioned P
A new contractile geometry method obtained from the constant geometry method by Ease is computed. Digital data is received in windows of N complex data words and stored in R independent parallel channels.

次に、これらデータはＬｏｇＲＮ個、直列に配列された
ステージ内に順序付けされる。各ステージは、データア
ドレスオペレータ（作用素）（Ｎ×Ｎ個のマトリックス
は、ＲＡＭシャフル（振わし）オペレータとして既知で
ある）と、ＦＦＴまたはＤＦＴ（核として既知である）
である演算素子を有するフーリエ変換オペレータと、各
チャンネルに対するＲＡＭと、クロスチャンネル通信を
行わず変換すべきデータを変形する回転ファクタオペレ
ータならびに新規な位相システムオペレータとを有して
いる。These data are then ordered into LogRN serially arranged stages. Each stage consists of a data address operator (the NxN matrix is known as the RAM shuffle operator) and an FFT or DFT (known as the kernel).
, a RAM for each channel, a rotation factor operator that transforms the data to be transformed without cross-channel communication, and a novel phase system operator.

概念的に考察して、このオペレーションは、この汎用の
ＦＦＴプロセッサ（前述のＰｅａｓｅおよびＣｏｒｉｎ
ｔｈｉｏｓを参照のこと）に使用されている、各ステー
ジ用の標準のグローバル（全方位）シャフルオペレータ
を３つの置換オペレータへ因数分解することによって達
成される。Conceptually, this operation is similar to this general-purpose FFT processor (the aforementioned Pease and Corin
This is achieved by factorizing the standard global shuffle operator for each stage, as used in thios (see thios), into three permutation operators.

これら置換オペレータの第２番目を、各チャンネル中の
１個のデータエレメント上に強制的に作用させるいこれ
によって２番目のオペレータを適当な核に与える。例え
ば、基数４の実施例において、これは以下のような４つ
のデータワードを意味する。即ち、４つのチャンネルの
各々からの４つのデータワードを各ステージで４ポイン
トＦＦＴ核に同時に表現することである。The second of these permutation operators is forced onto one data element in each channel, thereby giving the second operator to the appropriate kernel. For example, in a radix-4 embodiment, this means four data words: That is, four data words from each of the four channels are simultaneously represented at each stage into a four-point FFT kernel.

また、第１および第３番目の置換オペレータを位相シフ
トオペレータに変換する。これ、らオペレータによって
、各チャンネルそれぞれにおいて複素数データワードに
位相回転を行わせることによってデータワードのスイッ
チングと等価な作用となる（核に対する表現と一緒に適
当なデータワードを得ること）。このような方法におい
て、各チャンネル内のＲＡＭの比較的直接的な前方アド
レス作用を利用することによって、従来例に開示された
ようなスイッチを利用する代わりに、適当なデータワー
ドを得ることができる。Also, convert the first and third permutation operators into phase shift operators. This operator performs a phase rotation on the complex data word in each channel, which is equivalent to switching the data word (obtaining the appropriate data word together with the representation for the nucleus). In such a manner, by utilizing the relatively direct forward addressing of the RAM within each channel, the appropriate data words can be obtained instead of utilizing switches as disclosed in the prior art. .

本例においては、Ｎ＝６４でＲ＝４（即ち、３ステージ
生じる）の条件の下で実施され、特に説明したものは例
外とする。これらの値は１例えば例示目的のみであり、
決して限定するためのものではない。一般にこれらＮと
Ｒとは２の整数のべき数であるが、本発明の方法はあら
ゆる基数ＲおよびサンプルサイズＮに適用できるもので
ある。This example is carried out under the conditions of N=64 and R=4 (that is, three stages occur), with the exception of those specifically described. These values are for illustrative purposes only, e.g.
It is by no means intended to be limiting. Generally, N and R are integer powers of 2, but the method of the present invention is applicable to any base R and sample size N.

従来の一定幾何学ＦＦＴ演算方式は以下のマトリックス
で表わすことができる（マトリックスのオペレーション
の順序は右側から左側へ読む）；Ｆ、、Ｘ　＝　Ｆ４Ｓ
、／、Ｇ　Ｉ　Ｄ、、Ｆ、Ｓ、／、ＧＩ　Ｄ６４Ｆ４Ｓ
、／、６Ｘステージ３　　ｊステージ２１　ステージ１
ここで、Ｘ：Ｎ＝６４の複素数データワードの入力行データベク
トル。The conventional constant geometry FFT operation method can be represented by the following matrix (the order of operations in the matrix is read from right to left); F, , X = F4S
,/,G I D,,F,S,/,G I D64F4S
, /, 6X stage 3 j stage 21 stage 1
where: X: the input row data vector of N=64 complex data words.

Ｓ４／、＆：シャフルオペレータで、これにって４デー
タロツドを入力ベクトルから離間した１６デ一タワード対面させて選択させると共に、これらデータを一緒にグループ化してフーリエ変換オペレータ用の入力とする。S4/, &: A shuffle operator which selects 4 data rods facing 16 data words spaced from the input vector and groups these data together as input for the Fourier transform operator.

Ｆ４：フーリエ変換オペレータであり、Ｎ×Ｎ個の正方
ブロックマトリックスで、ブロック項に沿って４ポイントＤＦＴｓを包含する。F4: Fourier transform operator, which is an N×N square block matrix, containing 4-point D FTs along the block terms.

Ｄ６４１　Ｄｔｓ　：後述する回転要素（ｔｗｉｄｄｌ
ｅ　ｆａｃｔｏｒｓ）Ｆ６４：桁反転順序で列を有する
ＤＦＴマトリックス。D641 Dts: Rotating element (twiddl) described later
e factors) F64: DFT matrix with columns in digit-reversed order.

上述したＳ４／、、シャフルオペレータによって、行ベ
クトル中のデータの順序を、　Ｏ，Ｌ、２．・・・、Ｎ
−１ノシーケンスをＯ、Ｎ／Ｒ，２Ｎ／Ｒ，−、（Ｒ−
Ｌ）Ｎ／Ｒｊｌ。The order of data in the row vector is changed to O, L, 2, by the above-mentioned S4/, shuffle operator. ..., N
-1 sequence is O, N/R, 2N/R, -, (R-
L) N/Rjl.

（Ｎ／Ｒ）　＋１　、・［（Ｒ−１）Ｎ／Ｒｊ＋１．・
・　（Ｎ／Ｒ）−１、・・・Ｎ−１に変化する。同一の
効果が隣接チャンネルに関連するＲ個のパラレル入力チ
ャンネルＮ／［（データワードをシフトさせることによ
って達成される。(N/R) +1, ・[(R-1)N/Rj+1.・
・ Changes to (N/R)-1,...N-1. The same effect is achieved by shifting the R parallel input channels N/[(data words) associated with adjacent channels.

Ｎ＝６４とＲ＝４に対して、０，１，２．・・、６３の
入カシケンスが以下のように変えられる０、　１．２．・・・・・・・・・、１５１６．１７，
１８．・・・・・・・・・、３１３２．３３，３４．・
・・・・・・・・、４７４８．４９，５０．・・・・・
・・・・、５３オペレータＦ４は更に正確に記述すれば
（Ｉｘ＋、×Ｆ４）となる。ここで×は２つのマトリッ
クスのクロネツカー積（にｒｏｎｅｃｋｅｒ　ｐｒｏｄ
ｕｃｔ）である。オペレータとしてこの７１−リックス
は、４つのデータワドに対する４ポイン１−フーリエ変
換に相等するもので、これらワードはその入力に表現さ
れると共に、完全なパスがＮ個の複素数データワードの
ウィンドウを通って形成されるまで４個のデータワード
の連続グループに対する処理を繰返すようになる。Ｎ＝
６４に対して１６回繰返えすオペレーションが実行され
る。For N=64 and R=4, 0, 1, 2. ..., 63 input sequences are changed as follows: 0, 1.2.・・・・・・・・・、1516.17、
18. ......, 3132.33,34.・
......, 4748.49,50.・・・・・・
..., 53 Operator F4 can be described more accurately as (Ix+, xF4). Here, × is the Kronecker product of two matrices (ronecker prod
uct). As an operator, this 71-lix is equivalent to a 4-point 1-Fourier transform on the 4 data words that are represented at its input, and the complete path is through a window of N complex data words. The process is then repeated for successive groups of four data words until one is formed. N=
The operation is repeated 16 times for 64.

前述の゛′回転要素′″がＰｅａｓｅ特許に開示されて
おり以下のように規定される。The aforementioned "rotating element" is disclosed in the Pease patent and is defined as follows.

Ｄ、４　＝Ｄｉａｇ（Ｉｔｓ　＋　ｏｌｌｆ　ｔ　ＤＬ
ｓ　ｚ　Ｄｘｔ’）ここで、これら回転要素は、前述のｐｅａｓｅによる一定幾何学
ＦＦＴの導出結果から得られた係数である。これら値は
、上述の等式から演算可能であると共に。D, 4 = Diag(Its + olf t DL
s z Dxt') Here, these rotational elements are coefficients obtained from the derivation result of the constant geometry FFT using Pease described above. These values can be calculated from the equations above.

与えられたＮおよびＲに対して一定なものである。is constant for given N and R.

ベクトルＸを便宜上、後述の論議から削除するものとす
る。このようにして得られたち、のはＤＦＴマトリック
スのマトリックス因子Ｆ□である。Vector X will be deleted from the discussion below for convenience. What is thus obtained is the matrix factor F□ of the DFT matrix.

上述した従来例の考察によれば、同様なシャフルオペレ
ータおよびフーリエ変換オペレータが各ステージ毎に存
在する。回転要素のみ各ステージにおいて独特なもので
ある。更に、シャフルオペレータは全体的なもので、こ
れらによってすべてのチャンネルからのデータをアドレ
スし、６４データワードのウィンドウの時より大きな時
間間隔でアドレス動作する。このような特徴によって、
ハードウェア構造がシングルチャンネル動作に強制され
る傾向にある。According to the consideration of the conventional example described above, a similar shuffle operator and Fourier transform operator exist for each stage. Only the rotating elements are unique at each stage. Additionally, the shuffle operators are global, allowing them to address data from all channels and at larger time intervals than in a window of 64 data words. Due to these characteristics,
The trend is for hardware architectures to be forced into single channel operation.

データワードが第２ステージに入ると、これらデータワ
ードが同一データチャンネル内に存在するために、次の
Ｓ４／１Ｇオペレータ用の適当なシーケンスとは最早な
っていない、これらデータを操作して第２ステージに適
当なシーケンスで表われるようにする必要がある。第１
図に関連して、フーリエ変換オペレータ１０からのデー
タワードが図示の数で表わされた順序で４つのチャンネ
ルＡ〜Ｄ中に出力される。しかし乍ら、次のステージへ
の入力データが循環数で表示された４つのデータワード
のグループで必要となる。これらデータワードをそれ自
身の適切なチャンネルに整列させるために、クロスチャ
ンネル通信が要求される。When the data words enter the second stage, they are no longer in the proper sequence for the next S4/1G operator because they are in the same data channel, so they can be manipulated into the second stage. They need to appear on stage in an appropriate sequence. 1st
With reference to the figure, data words from Fourier transform operator 10 are output into four channels AD in the numerical order shown. However, the input data to the next stage is required in groups of four data words expressed in cycles. Cross-channel communication is required to align these data words to their own appropriate channels.

従来例ではこのことはスイッチを利用することによって
達成されていた。例えば、前述のＭｃＣｌｅｌｌａｎ、
　Ｐｕｒｄｙによるパイプラインアーキティクチェアに
よれば、コミュテータスイッチと遅延量の異なったＦＩ
ＦＯメモリの組合せを利用してクロスチャンネル操作を
実行している。In the prior art, this was accomplished by using switches. For example, the aforementioned McClellan,
According to the pipeline architecture chair by Purdy, a commutator switch and an FI with different amounts of delay are used.
A combination of FO memories is utilized to perform cross-channel operations.

本発明による収縮性幾何学方法を達成する第１ステツプ
として、ならびにスイッチなしでＲ個のパラレルチャン
ネルを処理するために、従来シャフルオペレータを以下
の３つのオペレータに因数分解する。As a first step in achieving the contractile geometry method according to the invention, and in order to process R parallel channels without switches, we factorize the conventional shuffle operator into the following three operators:

ＳＪ工、＝ＳｐｆＳ尺Ｓｐここで、ＳＰｆ　：高速循環シャフル（すべてのデータワード）ＳＰＬ：ランダムアクセスメモリ（ＲＡＭ）シャフル、
各チャンネルから１個のデータワードに対して演算するので、個々のＲＡＭを各チャンネルに利用できる。SJ Eng, = SpfS shuffle, where: SPf: Fast circular shuffle (all data words) SPL: Random access memory (RAM) shuffle;
Since we operate on one data word from each channel, separate RAM is available for each channel.

ＳＰ　＝低速循環シャフル（すべての第４番目のデータ
ワード）。SP = slow circular shuffle (every 4th data word).

このＳｐラシャルサイクルは、１６個のデータワードの
大きなグループ内の４つのデータワードの組である。Ｎ
＝６４で、１６個の４つのグループとなる。This Sp rashal cycle is a set of 4 data words within a larger group of 16 data words. N
= 64, resulting in 4 groups of 16.

１６個のデータワードの第１グループは全く変化しない
、この１６個の第２グループ内の４つのデータワードの
各組は以下に示したように１個のデータワードによって
循環される。The first group of 16 data words does not change at all, and each set of 4 data words within this second group of 16 is rotated by one data word as shown below.

１６個の第３グループにおける４データワードの各々は
、２データワードによって循環すると共に。Each of the 4 data words in the 16 third group is rotated by 2 data words and so on.

３データワードによって第４グループ中に循環する。Cycle through the fourth group by three data words.

前述した一定幾何学方法を利用して第２および第３ステ
ージグローバルシヤフルを以下に示すような３つの置換
オペレータで置換する（第１ステージシヤフルは変化し
ないままで残存する）Ｆ６４：Ｆ４ＳｐｆＳ　Ｓｐ１Ｄ
ｉＧＦ４ＳｐｆＳえ５ＰＩＤ６Ｊ＋５ｊｘｓこのｉｔ　
Ｄ　＋＋回転要素は、各々を（Ｓｐ−’Ｓｐ）で掛算す
ると共に、　ＳｐをＩＩ　Ｄ　Ｉ＋因数生成用ｓｐ−”
ｏｌ、’ｓρを介してＤＩＧの代りに通過させると共に
、５Ｐ−１ＤＧ４ＳｐをＤ６４の代りに通過させること
によって（最初の部分が再編成を表示）再編成される。Using the constant geometry method described above, replace the second and third stage global shuffles with three permutation operators as shown below (the first stage shuffle remains unchanged): F64:F4SpfS Sp1D
iGF4SpfSe5PID6J+5jxs this it
D ++ Rotation elements are each multiplied by (Sp-'Sp), and Sp is II D I + sp- for factor generation
It is reorganized by passing it through ol,'sρ instead of DIG and passing 5P-1DG4Sp instead of D64 (the first part indicates the reorganization).

次に、各５ｐ−１を隣接チャンネルからＳｐと一緒に溶
は込ませ、次にこれらをキャンセルして以トの式を形成
する。Next, each 5p-1 is infused together with Sp from an adjacent channel, and then these are canceled to form the following equation.

ＦＧ４＝ＦｉＳｐｆＳＦＬＩ　ＤＩ　Ｇ　’　５ＰＦ４
ｓＰｔ’ｓＲ＋　［）Ｇ４　’　５ＰＦ４Ｓ＋７１ＧＳ
ｐｆおよびｓｐはＦ４を通って通過し、このＦ４を変形
しないと共に、位相シフトオペレータとＤｐ　ｏｐ　’
を生成させ、更に、本発明の基本的な収縮性幾何学方法
を形成する。FG4=FiSpfSFLI DI G' 5PF4
sPt'sR+ [)G4' 5PF4S+71GS
pf and sp pass through F4, do not transform this F4, and use the phase shift operator and Dp op'
and further form the basic contractile geometry method of the present invention.

Ｆｓ４”ＤＰＨＦ４ＳＢＩＤｘｉ’ＤＰＩ４Ｆ＊ＤＰＨ
’Ｓ　ＩＤ５４’Ｆ＋ＤＰＨ５＊／１ｇ一定幾何学的ア
ーキティクチュアの大部分を。Fs4"DPHF4SBIDxi'DPI4F*DPH
'S ID54'F+DPH5*/1g Most of the constant geometry architecture.

回転要素および位相回転オペレータに現われる差と共に
保持する。また、第１ステージシヤフルオペレータも保
持する。Hold together with the differences appearing in the rotation element and phase rotation operator. It also holds a first stage shuffle operator.

位相シフトオペレータＤｐとＤｐ＋によってクロスチャ
ンネルスイッチングと等価なものが達成され、これには
複素数データワードの係数を変形させている。これら位
相シフトオペレータによって各チャンネル中の複素数デ
ータワードのＲの位相を、３６０／　Ｒ度の倍数だけシ
フトさせる（例えば、Ｒ＝４に対して、位相シフトは第
１チヤンネルに対してＯであり、第２チヤンネルに対し
てπ／２であり、第３チヤンネルに対してπであり、第
４チヤンネルに対して３π／２である。即ち、１゜Ｊｙ
　　１＋　　Ｊで掛算することに等価である）。The equivalent of cross-channel switching is achieved by phase shift operators Dp and Dp+, which involve transforming the coefficients of a complex data word. These phase shift operators shift the phase of R of the complex data word in each channel by a multiple of 360/R degrees (e.g., for R=4, the phase shift is O for the first channel; π/2 for the second channel, π for the third channel and 3π/2 for the fourth channel, i.e. 1°Jy
(equivalent to multiplying by 1+J).

これらのオペレータによってＤＦＴ核の“シフト”特性
が利用され、ここで、入力領域（ドメイン）における循
環環シフトは変換領域における複素指数による掛算に相
等するものである。これらシフトによってデータワード
を他のチャンネルからのデータワードを利用しないで適
当なシーケンスで配置でき、これによってクロスチャン
ネルスイッチングの必要性を除去できる。These operators take advantage of the "shift" property of the DFT kernel, where a cyclic ring shift in the input domain is equivalent to a multiplication by a complex index in the transform domain. These shifts allow data words to be placed in the proper sequence without utilizing data words from other channels, thereby eliminating the need for cross-channel switching.

第２図において、本発明によれば、フーリエ変換オペレ
ータ１０を導入するデータの位相を、前述したように位
相シフトユニット５内でシフトする。このＦＦＴｌ０を
出力するデータワードが図示した番号の順序で４つのチ
ャンネルＡ−Ｄ中に出力される。第１図に示したデータ
ワードの順序とは対比されて、上述の前段ＦＦＴシフト
によって次ステージへの入力に必要なデータワードが別
個のチャンネル中に現在現われるようになる（次のステ
ージは円で表示されている）。四角で表わされた次のス
テージへの次の入力セットを同様に配列する。これらチ
ャンネルからの４つの入力のこれら選択および次の組が
ＲＡＭアドレスユニット２０を介して実現される。In FIG. 2, according to the invention, the phase of the data introducing the Fourier transform operator 10 is shifted in the phase shift unit 5 as described above. The data words outputting this FFTl0 are output into four channels A-D in the numerical order shown. In contrast to the order of data words shown in FIG. displayed). The next set of inputs to the next stage, represented by squares, are similarly arranged. These selections and the next set of four inputs from these channels are realized via the RAM address unit 20.

もう一つの位相シフト処理が、以下の理由によって、デ
ータが各ステージのＦＦＴに存在した後で実行される。Another phase shift operation is performed after the data is present in the FFT of each stage for the following reasons.

即ち、第２図を更に参照してデータポイント（１３，１
７，３３および４９）の第２の組が並列であるが、これ
らの適当な宛先ＲＡＭから１つシフトするからである。That is, with further reference to FIG.
7, 33 and 49) are parallel, but shifted by one from their appropriate destination RAM.

再び、スイッチングデータ通路の代りに、シフト特性を
利用し、（今回は逆方向で）更に、以下のステージのＦ
４演算ユニット出力を適当な掛算子（１＋　Ｊ＋　　１
＋　　ＪがＲ＝４に対して準備）を項毎に掛算すること
によって、これと同じ効果を達成させる。これらは簡単
な位相シフトであり、これをＲＯＭ保持係数ＯｓＧ′と
ＤＧ４’に吸収することができ、これらは通常、各Ｆ、
の後の基数４変換で実現される。Again, instead of a switching data path, we use the shifting characteristic (this time in the opposite direction) to further
4 operation unit outputs by an appropriate multiplier (1+J+1
This same effect is achieved by multiplying term-by-term by +J (prepared for R=4). These are simple phase shifts that can be absorbed in the ROM retention coefficients OsG' and DG4', which are typically
It is realized by radix-4 conversion after .

このような収縮性幾何学方法における簡素化は、位相シ
フトオペレータを回転要素で溶込ませることによって実
現できる。このことによって演算負荷を減少でき、これ
は各チャンネルにおけるデータの単一の複素乗算を確立
すると共に、回転要素および位相シフトオペレータに必
要な２つの複素乗算を置換することによって減少できる
１合併（溶込んだ）オペレーションはＲＯＭユニットに
よって処理される。Simplifications in such contractile geometry methods can be achieved by incorporating the phase shift operator with rotating elements. This reduces the computational load, which can be reduced by establishing a single complex multiplication of the data in each channel and replacing the two complex multiplications required for the rotating element and phase shift operators. (input) operations are handled by the ROM unit.

このような合併（溶込み）動作は直接溶造みまたはＲＡ
ＭシャフルオペレータＳ、を介して位相シフトオペレー
ションＤＰＨ’を以下のように通過させることによって
実現できる。Such a merging (penetration) operation is called direct melting or RA.
This can be achieved by passing the phase shift operation DPH' through the M shuffle operator S, as follows.

１溶込み　溶込み１ＦＧ４”ＤＰｌ−ＩＦ４Ｓｌｔ　　Ｄｔｓ″ＤＰＭＦ４
ＤＰＨ″Ｓ艮ｌＤ＆４’ＦｎＤＰＨ３Ｊ工。1 Penetration Penetration 1 FG4”DPl-IF4Slt Dts”DPMF4
DPH″S艮ld＆4′FnDPH3J工.

ステージ１１　ステージ２　１　ステージ３最終結果は
減少した収縮性幾何学方法である。Stage 11 Stage 2 1 Stage 3 The final result is a reduced contractile geometry method.

Ｆｅ２”ＤＰ？１Ｆ４ＳＩＬ　　ＤＰＨ２Ｆ４Ｓえ　Ｄ
ＰＨｔＦ４Ｄ四Ｓｊ、ｓステージ１１ステ！−ジ２１　
ステージ３ここで。Fe2”DP?1F4SIL DPH2F4Se D
PHtF4D4Sj, s stage 11 stage! -ji21
Stage 3 here.

Ｄｐ）１□”Ｄｔｓ’ＤＰＳおよびｏｐＨ□＝ＤＰＨ″ＤＩ、４（ＤｐＨ″はＤＰＨ’を再編成させたものである。）次に、第３図を参照にし乍らＮ＝６４．Ｒ＝４゜３ステ
ージＦＦＴ装置によって本発明の収縮性方法を実施する
動作を説明する。データの流れは左から右へ移るものと
する。Dp) 1□"Dts' DPS and opH□=DPH"DI, 4 (DpH" is a rearrangement of DPH'.) Next, referring to FIG. 3, N=64.R= The operation of implementing the contractility method of the present invention using a 4° three-stage FFT device will be described.It is assumed that the data flow moves from left to right.

シャフルユニット１０はデータワード９０を受信すると
共に、これらを４つのチャンネルＡ−Ｄへ配列する。こ
のシャフルユニット１００によって前述したＳ４／０．
シャフルを実行するものと仮定すると、データがＳ、／
工、シャフルによって与えられた順序ですでに与えられ
ているこのシャフルユニットを取除くことができる６位
相回転器１１０は掛算ユニットであり、これによってデ
ータワードの位相をシフトさせるので、この結果、第２
図に示した順序が次のフーリエ変換オペレーションの後
に得られる。基数４に対しては、この位相回転器１１０
は各チャンネルをｊのべき数で簡単に掛算し。Shuffle unit 10 receives data words 90 and arranges them into four channels AD. With this shuffle unit 100, the above-mentioned S4/0.
Assuming we perform a shuffle, the data is S,/
The six-phase rotator 110, which can remove this shuffle unit already given in the order given by the shuffle, is a multiplication unit, which shifts the phase of the data word, so that the result is 2
The sequence shown is obtained after the next Fourier transform operation. For radix 4, this phase rotator 110
Simply multiply each channel by the power of j.

例えば、チャンネルＡは１（ｊ’）で、チャンネルＢは
ｊ　（ｊｌ）で、Ｃは−１（ｊ”）で、更にＤは−ｊ（
ｊ３）で掛算する。もっと高い基数オペレーションに対
しては、この掛算は更に複雑なものである（例えば、基
数８に対しては、８のチャンネルの各々の位相を４５°
だけシフトさせる必要がある）。フーリエ変換オペレー
タ１２０によってＤＦＴオペレーションを実行し、４デ
ータワードの組を各チャンネルから得てこれにフーリエ
変換する。回転エレメント１３０には、予じめ決められ
た係数を記憶するメモリと、これらデータワードをこれ
ら係数で掛算する掛算器とが設けられている。ＲＡＭユ
ニット１４０によってＲＡＭシャフル（ＳＲ）を実行す
ると共に、ステージ間でデータインターフェイスとして
作用する。For example, channel A is 1 (j'), channel B is j (jl), C is -1 (j"), and D is -j (
Multiply by j3). For higher radix operations, this multiplication is even more complex (e.g., for radix 8, the phase of each of the 8 channels is 45°
(need to be shifted). A DFT operation is performed by Fourier transform operator 120 to obtain and Fourier transform a set of four data words from each channel. The rotating element 130 is provided with a memory for storing predetermined coefficients and a multiplier for multiplying these data words by these coefficients. RAM unit 140 performs RAM shuffling (SR) and acts as a data interface between stages.

６４より大きなＮに対しては、ステージ２を繰返すが、
第２ステージおよび次のステージに、各前段ＦＦＴ位相
回転器１１０の前の後段ＦＦＴが設けられているのが例
外である。前述したように、後段ＦＦＴ位相回転器によ
って、　ＲＡＭから適当に受信されるべき第２のデータ
組が得られる。Ｎ＝６４に対しては、この関数は再編成
された回転要素Ｄ１％ｌＤ□に吸収されていた。For N greater than 64, repeat stage 2, but
The exception is that the second stage and the next stage are provided with a post-FFT before each pre-FFT phase rotator 110. As previously mentioned, a post-FFT phase rotator provides a second set of data to be appropriately received from the RAM. For N=64, this function was absorbed into the reorganized rotating element D1%lD□.

前述した減少型収縮性幾何学法が第４図に具現化されて
おり、ここで、識別番号は第３図のものに対応する。位
相回転器１５０が、位相シフト乗算子と合併された回転
係数を包含した合併オペレータである。The previously described reduced contractile geometry method is embodied in FIG. 4, where the identification numbers correspond to those in FIG. Phase rotator 150 is a merge operator that includes a rotation coefficient merged with a phase shift multiplier.

第４図のデバイスのステージ毎の対称性が、第３ステー
ジの位相回転器１１０を１個の回転オペレータで合併さ
せると共に、第１ステージの位相回転器１１０のオペレ
ータをＲＡＭエレメントでアドレス付けすることによっ
て更に強調される。これらの変化によって各ステージが
ＲＡＭエレメントと、フーリエ変換オペレータと、プロ
グラマブルＲＯＭ　（ＰＩｔＯＭ）の調整された回転係
数を有するデバイスを創作する。The stage-to-stage symmetry of the device of FIG. 4 allows the third stage phase rotator 110 to be merged with a single rotation operator while addressing the first stage phase rotator 110 operator with a RAM element. is further emphasized by These changes create a device in which each stage has a RAM element, a Fourier transform operator, and an adjusted rotation factor of a programmable ROM (PItOM).

第５図には、ＦＦＴプロセッサユニットが図示されてお
り、このユニットをマルチステージ（多段）ＦＦＴ装置
におけるステージとして使用すると共に、これら３つの
要素を包含している。Ｎ＝６４およびＲ＝４に対しては
３つのＦＦＴプロセッサユニットが必要となる。FIG. 5 illustrates an FFT processor unit that is used as a stage in a multi-stage FFT device and includes these three elements. Three FFT processor units are required for N=64 and R=4.

第５図に示したＦＦＴプロセッサユニット２００を７つ
のエレメントで構成でき、これらはＩ１０ＲＡＭエレメ
ント２２０．　ＲＡＭアドレスエレメント２４０１回転
メモリエレメント２６０２回転アドレスエレメント２８
０．　ＦＦＴ演算エレメント３００２組込テスト（ＢＩ
ＴＥ）エレメント３２０とコントロールニレメン１へで
ある。The FFT processor unit 200 shown in FIG. 5 can be configured with seven elements, including the I10RAM element 220. RAM address element 2401 rotation memory element 2602 rotation address element 28
0. FFT calculation element 3002 built-in test (BI
TE) element 320 and control element 1.

Ｉ１０ＲＡＭエレメント２２０は順次のステージ間のデ
ータインターフェイスである。このエレメント２２０は
曲設（即ち、ステージ１に対しては、図示しないシャフ
ルオペレータのようなエレメントを介してシステム入力
から）人力データ２２５を受信する。これらデータは２
つの１６ビツトコンポーネントを有しており、位相を有
する複素データワードを構成している。Ｉｌｏ）ＩＡＭ
エレメント２２０を４つの同一ボートに分離することが
でき、これらボートは演算エレメント３００の４つのボ
ートと直接インターフェイスしている。各ボートは、再
帰性データバッファおよびダブルバッファＲＡＭモジュ
ールより構成することができる。I10 RAM element 220 is the data interface between sequential stages. This element 220 receives human power data 225 (ie, for stage 1, from system input via an element such as a shuffle operator, not shown). These data are 2
It has two 16-bit components to form a complex data word with phase. Ilo)IAM
Element 220 can be separated into four identical boats that interface directly with the four boats of computational element 300. Each boat can be comprised of a recursive data buffer and a double buffer RAM module.

これら４つのＲＡＭモジュールの各々を、マルチプレッ
クス処理された入力および出力を有するマルチプルＲＡ
Ｍ　（例えば、Ｎ　：４０９６に対して２個の２ｋＸ　
１Ｓ　ＲＡＭ）より構成することができる。この結果新
たな入力データを、データがＲＡＭから読出された時と
同時にＲＡＭ内に書込むことができる。Each of these four RAM modules can be configured into multiple RAMs with multiplexed inputs and outputs.
M (for example, two 2kX for N :4096
1S RAM). As a result, new input data can be written into the RAM at the same time as the data is read from the RAM.

このような方法において、ＦＦＴプロセッサユニットを
１００％利用できる。　ＲＡＭエレメント２２０によっ
てバス２２７内の書込みアドレスコマンドを受信すると
共に、ＲＡＭアドレスエレメント２４０からバス２２９
内の読取リアドレスコマンドを受信する。バス２３５中
の出力データをＦＦＴ？′ｉＬ算エレメント３００に供
給する。In such a method, 100% of the FFT processor unit can be utilized. A write address command is received on bus 227 by RAM element 220 and a write address command is received on bus 229 from RAM address element 240.
Receive read read address command within. FFT the output data in bus 235? 'iL calculation element 300.

このＲＡＭアドレスエレメント２４０をマルチプルＰＲ
ＯＭ　（例えば、Ｎ＝４０９６に対して４つの２　Ｘ　
ｓ　ＰＲＯＭ）より構成できるＲＡＭアドレスを読取り
書込みアドレスならびに複素数／実数アドレスに分割す
る。This RAM address element 240 can be multiple PR
OM (e.g. 4 2X for N=4096
s PROM) into read/write addresses and complex/real addresses.

複素数アドレス部分によって読取りまたは書込みアドレ
スを利用し、複素数データ値を記憶する。The complex address portion uses read or write addresses to store complex data values.

実数アドレスによってＬＳＢを複素数アドレスに加算し
、このアドレスによって各複素数データ値の同相成分お
よび直交成分にアクセス制御する。例えば、ＯのＬＳＢ
値を同相、即ち、実数成分に対応させ、ｌのＬＳＢ値を
直交、即ち、虚数成分、に対応させる。これによって、
ＦＦＴデータを、複素数アドレスを利用して、ＬＳＢ値
によって実数または虚数成分を決定する固有の認識に基
いて複素データとして参照できる。The real address adds the LSB to the complex address, and the address controls access to the in-phase and quadrature components of each complex data value. For example, the LSB of O
The values are in-phase, ie, correspond to the real component, and the LSB values of l are made orthogonal, ie, correspond to the imaginary component. by this,
FFT data can be referred to as complex data based on the inherent recognition that the real or imaginary component is determined by the LSB value using a complex number address.

すべてのＲＡＭ書込みアドレスを、ＦＦＴステージ毎の
特定の複素数フーリエ変換演算に相当する計数値によっ
て順次ドライブする。順次の複素数ポイントを、最初、
実数コンポーネントならびに次に虚数コンポーネントで
順次書込む、従って、実数カウンタによってＩ１０ＲＡ
Ｍエレメント２２０をアドレスできる。このようなシー
ケンシャル計数が４つのボートの各々に対して同時に実
行されると共に、その後、単一バス２２７に残存するよ
うになる。All RAM write addresses are sequentially driven by counts corresponding to a particular complex Fourier transform operation for each FFT stage. Sequential complex points are first,
Write sequentially with the real component and then the imaginary component, so by the real counter I10RA
M element 220 can be addressed. Such sequential counting is performed simultaneously for each of the four boats and then remains on a single bus 227.

読取リアドレス作動は、ＦＦＴ演算エレメント３００へ
の入力データのＲＡＭシャフル（Ｓ、）のために、書込
みアドレス動作よりかなり多く包含されている。Read re-addressing operations involve considerably more than write-addressing operations due to the RAM shuffling (S,) of the input data to the FFT arithmetic element 300.

データがその適当なチャンネルのＲＡＭに順番に書込ま
九るので、読取り（読出し）アドレスによって、各フー
リエ変換オペレーションに対する必要なデータワードを
選択する必要がある。この読出しアドレス動作は、ＦＲ
ＯＭメモリ内のインストラクション（命令）によって実
行され、このメモリは、最新の複素数フーリエ変換演算
数をトラッキングするシーケンシャルカウンタによって
アドレス付けられる。工／○ＲＡＭエレメント２２０中
の４個のＲＡＭの各々には、バス２２９内において並列
に独立したアドレス動作が必要となる。Since data is written to the RAM in its appropriate channel in sequence, it is necessary to select the required data word for each Fourier transform operation by read address. This read address operation is performed by FR
It is executed by instructions in OM memory, which is addressed by a sequential counter that tracks the most recent complex Fourier transform operation. Each of the four RAMs in RAM element 220 requires independent addressing in parallel within bus 229.

回転メモリエレメント２６０をマルチプルＰＲＯＭで構
成することができ（例えば、Ｎ＝４０９６に対して、８
個の２　ｋＸ　８ＦＲＯＭ）　、これらＰＲＯＭには、
ＦＦＴ演算エレメント３００に供給される回転要素係数
が含まれている。これにデータを４個のパラレル１６ビ
ツトボートを介してバス２６５に送給する。Rotating memory element 260 can be configured with multiple PROMs (for example, N=4096, 8
(2 kX 8FROM), these PROMs include:
Contains rotational element coefficients supplied to the FFT calculation element 300. It sends data to bus 265 via four parallel 16-bit ports.

後述するように、実際に記憶された回転要素が、予じめ
決められたＮおよびＲに対する変換の１ステージに必要
となる。この記憶されたデータから、回転アドレスエレ
メント２８０内の回転アドレススキーム（予定）によっ
て正しい係数が、特定のステージおよび変換サイズ当り
のフーリエ変換オペレーションに対して選択されるよう
になる。As will be explained below, an actual stored rotational element is required for one stage of the transformation for predetermined N and R. From this stored data, a rotational addressing scheme within rotational address element 280 ensures that the correct coefficients are selected for a Fourier transform operation per particular stage and transform size.

回転アドレスエレメント２８０をマルチプルＰＲＯＭ（
例えば、Ｎ＝４０９６に対して３個の２　ｋ　Ｘ　８　
ＦＲＯＭ）より構成でき、これらＦＲＯＭを、ＲＡＭア
ドレスエレメント２４０内のような複素数フーリエ変換
オペレーション番号をトラッキングするカウンタによっ
て制御する。このエレメントによって、アドレスが適当
な値だけインクレメントされて異なった通過量や変換サ
イズを補償するようなマスキングスキームを利用する。The rotary address element 280 is a multiple PROM (
For example, 3 2 k X 8 for N=4096
FROM), which are controlled by counters that track complex Fourier transform operation numbers, such as in RAM address element 240. This element utilizes a masking scheme in which the address is incremented by an appropriate value to compensate for different throughputs and transform sizes.

インストラクションがライン２７５を介して回転メモリ
エレメント２６０に送給される。Instructions are sent to rotating memory element 260 via line 275.

コントロールエレメント３４０によって、他のエレメン
トに対してボード上のコントロール信号を発生させると
共に、クロック・イン信号３５５を供給してすべてのＩ
１０データワードと係数とを同期化する。Control element 340 generates on-board control signals for other elements and provides clock-in signal 355 to control all I/Os.
Synchronize the 10 data words and coefficients.

ＢＩＴＥエレメント３２０によって出力データ３０５を
ライン３１５を利用してテストデータ用のソースとして
引き上げる６例えば、出力データ３０５の１つのポート
を引き上げることによって、このＢＩＴＥエレメント３
２０によって１６−１にするデータマルチプレックサお
よび外部コントロール３２５を利用でき、これによって
、ライン３３５を介して外部のモニタ用の１６ビツトの
内の１つを選択する。By pulling up output data 305 by BITE element 320 as a source for test data using line 315, for example, by pulling up one port of output data 305.
20 to 16-1 data multiplexer and external control 325 is available to select one of the 16 bits for external monitoring via line 335.

また、ＦＦＴ演算エレメント（ＦＦＴＣＥ）　３００に
ついて第６図を参照し乍ら説明する。先ず、同図におい
て、識別番号は第５図のものと同一のものを使用した。Further, the FFT calculation element (FFTCE) 300 will be explained with reference to FIG. First, in this figure, the same identification numbers as those in FIG. 5 were used.

このＦＦＴＣＥ３００をシングルチップの独特なデザイ
ンとすることができ、この機能を、例えば。This FFTCE300 can be a unique single-chip design, and this functionality can be achieved by e.g.

ＩＢＭ　ＳＰＥチップのような現存のチップをカスケー
ド接続することによって実現できる。このＦＦＴＣＨの
機能は、ＦＦＴマトリックスオペレーションを演算する
と共に、アレイスケーリングおよびラウンディングが得
られる。バス２３５を介して入力データを受信し、更に
、バス２６５から回転係数を受信し、ライン３４５を介
してコントロールファンクションならびにライン３５５
のクロック・イン信号を受信する。出力データをバス３
０５を介して供給する（第５図の各ポートまたはバスに
隣接した数はポートまたはバス当りのライン数を表わす
）。This can be achieved by cascading existing chips such as the IBM SPE chip. This FFTCH functionality computes FFT matrix operations and provides array scaling and rounding. Receives input data via bus 235 and further receives rotation coefficients from bus 265 and control functions via line 345 as well as line 355.
Receives the clock in signal of Output data to bus 3
05 (the number adjacent to each port or bus in FIG. 5 represents the number of lines per port or bus).

このＦＦＴＣＥ３００によって基数４の４ポートＤＦＴ
が以下の式を解くことによって演算されるＴ（に）＝Ｃ
Ｈ（ｋ）＋Ｄ（ｋ＋１）＋Ｄ（ｋ＋２）＋Ｄ（ｋ＋３）
］　ｘ　ｃ（ｋ）Ｔ　（ｋ＋　１　）＝［Ｄ　（ｋ）−
ｊＤ　（ｋ＋１）−Ｄ　（ｋ＋２）＋ｊＤ　（ｋ＋３）
　］　Ｘ　Ｃ（ｋ＋　１）Ｔ（ｋ＋２）＝［Ｄ（ｋ）−
Ｄ（ｋ＋１）十〇（ｋ＋２）−〇（ｋ＋３）］　ｘｃ（
ｋ＋２）Ｔ　（ｋ＋３）＝ＣＨ（ｋ）＋ｊＤ　（ｋ＋　
１）−Ｄ　（ｋ＋　２）−ｊＤ　（ｋ＋３）　］　Ｘ　
Ｃ（ｋ＋３）ｋ＝ｏ、４．８・・・、Ｎ−４に対してこ
こで、Ｔ　（ｉ）　：出力ベクトルＤ（ｉ）：データ入力ベクトルＣ（ｉ）：回転係数または位相回転ベクトルｊニー１の
２乗根但し、これは基数４であるので、位相回転オペレーショ
ンが、Ｃ（ｉ）ベクトルを加えることによってこれら等
式中に包含されている。高次の基数オペレーションでは
、位相回転オペレーションは、個別の複素数乗算オペレ
ーション（図示せず）が必要となる。この代りに、位相
回転器を回転メモリエレメント係数と合併させて、複素
数乗算ステップを省略することもできる。Using this FFTCE300, radix 4 4-port DFT
is calculated by solving the following equation: T(to)=C
H(k)+D(k+1)+D(k+2)+D(k+3)
] x c(k)T (k+ 1)=[D(k)−
jD (k+1)-D (k+2)+jD (k+3)
] X C(k+ 1)T(k+2)=[D(k)−
D(k+1) 〇(k+2)−〇(k+3)] xc(
k+2)T (k+3)=CH(k)+jD (k+
1)-D (k+ 2)-jD (k+3)]
For C(k+3)k=o, 4.8..., N-4, here, T(i): Output vector D(i): Data input vector C(i): Rotation coefficient or phase rotation vector j square root of 1. However, since this is radix 4, a phase rotation operation is included in these equations by adding the C(i) vector. For higher order radix operations, the phase rotation operation requires a separate complex multiplication operation (not shown). Alternatively, the phase rotator can be merged with the rotating memory element coefficients and the complex multiplication step can be omitted.

データをバス２３５を介してデータワードとして受信す
ることができる。これらデータを同相成分と直交成分と
から成る複素数とすることもできる。Data may be received as data words via bus 235. These data can also be made into complex numbers consisting of in-phase components and orthogonal components.

各成分は１６ビツトで固定ポイントで、２の補数である
。４つのデータワードを同時に受信し、ベクトルＤ　（
ｉ）を上述した等式中に形成する。同時に、４つの回転
係数をバス２６５から４つのデータワードと同期して受
信し、ベクトルＣ（ｉ）を形成する。Each component is 16 bits, fixed point, and two's complement. Four data words are received simultaneously and the vector D (
i) is formed in the equation described above. At the same time, four rotation coefficients are received from bus 265 synchronously with four data words to form vector C(i).

本例においてＮ＝６４によって、ＦＦＴＣＥは１６個の
ベクトルＤ　（ｉ）とＣ（ｉ）とをステージを介して各
通路で受信すると共に、１６個のベクトルＴ　（ｉ）を
出力する。これらベクトルの各々は４つのデータワード
より構成される。出力をバス３０５を介して次のステー
ジに送給する。With N=64 in this example, the FFTCE receives 16 vectors D (i) and C(i) on each pass through the stage and outputs 16 vectors T (i). Each of these vectors consists of four data words. The output is routed to the next stage via bus 305.

また、第６図において、ＦＦＴＣＥ３００はコントロー
ルワード１′スケールフアクタ出力”　３４７を出力し
、これは、後続の処理中にオーバーフローすることを回
避するために必要な右側シフトの数に等しいものである
。この出力を次のステージで″スケールファクタ入力”
　３４９として受信する。Also, in FIG. 6, the FFTCE 300 outputs a control word 1' scale factor output" 347, which is equal to the number of right shifts required to avoid overflowing during subsequent processing. .This output is used as ``scale factor input'' in the next stage.
349.

このＦＦＴＣＥのアーキティクチュアの実施例として第
７図に示すものがある。ここでは、第５図の参照番号と
同じものを使用する。ベクトルＤ　（ｉ）のデータワー
ドがバス２３５中に入力されると共に、適当にスケール
処理される。演算論理ユニット（ＡＬＵ）３１２．遅延
３１４．　　′ｊ”マルチプライヤ（乗算器）３１６お
よびバス２６５からのＣ（ｉ）ベクトルを利用した複素
数マルチプライヤ３１８によって、バス３０５より出力
されるベクトルＴ　（ｉ）が得られる。An example of this FFTCE architecture is shown in FIG. The same reference numbers as in FIG. 5 are used here. The data words of vector D (i) are input onto bus 235 and scaled appropriately. Arithmetic logic unit (ALU) 312. Delay 314. 'j' multiplier 316 and a complex multiplier 318 utilizing the C(i) vector from bus 265 provides the vector T (i) output from bus 305.

Ｎ＝６４．Ｒ＝４の収縮性幾何学ＦＦＴプロセッサユニ
ットの回転メモリエレメント（第４図の２６０）中に記
憶されたベクトルＣ（ｉ）に対する値が第８図に示され
ている。フーリエ変換演算の計数が列Ａに図示されてい
る。各ステージのベクトルＣ（ｉ）の４つの値が列Ｂ−
Ｄに示されている。各値は項ｅｘｐ（ｊ２　ｔｃ　ｍ／
Ｎ）の”　ｍ　”を表わし、Ｎ＝６４である。N=64. The values for the vector C(i) stored in the rotating memory element (260 in FIG. 4) of the R=4 contractile geometry FFT processor unit are shown in FIG. The counts for the Fourier transform operation are illustrated in column A. The four values of vector C(i) at each stage are column B-
Shown in D. Each value is the term exp(j2 tc m/
N) represents "m", and N=64.

ステージ２の回転係数値がステージ１の回転係数値のサ
ブセットであるように見られる。より大きなＮに対して
この関係は後段に発展する。五位相回転器のべき数が回
転要素と合併した場合には、繰返性が破壊される。図示
のステージ３は、ｊマルチプライヤのべき数をステージ
３のユニット回転要素と合併した結果である。ｊマルチ
プライヤのべき数がステージ３の回転要素と合併しない
場合には、ステージ３のすべての回転要素値は零となる
。It can be seen that the stage 2 rotation coefficient values are a subset of the stage 1 rotation coefficient values. For larger N, this relationship is developed later. If the powers of the five-phase rotator are combined with the rotating elements, repeatability is destroyed. Stage 3 as shown is the result of merging the power of the j multiplier with the unit rotation element of stage 3. If the powers of the j multiplier do not merge with the rotating elements of stage 3, then all rotating element values of stage 3 will be zero.

ＲＯＭの要求度は、第８図に示したような回転係数を組
合せることによって更に減少できる。回転係数エレメン
ト（第４図の２８０）中のハードワイヤ処理したアドレ
ススキームによって、適当なＣ（ｉ）値を、フーリエ変
換演算数およびステージ数を利用して選択する。最後の
ステージにおいて。ROM requirements can be further reduced by combining rotation factors as shown in FIG. A hardwired addressing scheme in the rotation coefficient element (280 in FIG. 4) selects the appropriate C(i) value using the number of Fourier transform operations and the number of stages. At the last stage.

円が付けられた値のみアクセスされる。更に高いＮにつ
いては、パターンは類似しており、但し、円が付けられ
たグループの変動が繰返えされる。Only values marked with a circle are accessed. For higher N, the pattern is similar, except that the fluctuations of the circled groups are repeated.

例えば、Ｎ＝２５６で、各列には（０，６４，１２８，
１９２）の４つの連続的な繰返しが包含される。For example, if N=256, each column has (0, 64, 128,
192) are included.

上述した方法のソフトウェアシュミレーションによって
基数が２．４．８またはそれより高次の基数のパイプラ
インＦＦＴで利用できることが表わされる。基数が増大
するにつれて、スループットレートも上昇する。これは
、ＦＦＴ演算プロセッサ（各ステージにおける）を、高
いスルーグツ１−レトが起るようにそれ自身の内部アー
キテイクチュア内で十分な量のパラレリズム（数似性）
で構成できるならば相当程度のパラレリズムのために上
昇する。所望の処理スピードに依存して、高次基数アー
キティクチェアによってＦＦＴ演算エレメント（例えば
、８ポイントＦＦＴ等）内にパイプラインプロセッサを
必要とする。Software simulation of the method described above shows that it can be used with pipelined FFTs of radix 2.4.8 or higher. As the radix increases, the throughput rate also increases. This forces the FFT arithmetic processor (at each stage) to have a sufficient amount of parallelism within its own internal architecture such that high throughput occurs.
If it can be composed of , it will increase due to a considerable degree of parallelism. Depending on the desired processing speed, a high-order radix architectural chair may require a pipeline processor within the FFT calculation element (eg, 8-point FFT, etc.).

上述した方法は、周波数の１／１０を取る（ＤＩＦ）タ
イプのものであったが、時間の１／１０を取る（ＤＩＴ
）タイプのバージョンをＤＩＦ方法から発生させること
ができ、この場合、ＯＦＴマトリックスが対称形の時に
、ＤＦＴマトリックスのマトリックス配置Ｆ　Ｇ　４を
実行することによって発生させる。このＦ　６４は、シ
ャフルマトリックスＳ４／１ｇで掛算することによって
対称形にでき、このマトリックスによって、Ｆ＠４の行
を自然の順序（周波数を増大して）に逆変換する。この
ＤＩＴバージョンは以下に与えられる（ＤＩＦバージョ
ンをＳＪ、、によって掛算すると共にその結果を変換し
た後に）、Ｆ、　、　Ｔ＝Ｓ工ｓ／＊Ｄｐ　Ｆ４ＤＰ　
ｔｓ　ＴＦ４ＤρｚＳ　ＴＦ４Ｕｐ　Ｓｔｓ八ここで、Ｓ□６へ＝ＳＪ工、ＴＳζはＳビの配置行列であり、Ｆｓ４ＴはＤＩＴ　ＤＦＴマトリックスである。The method described above was of the type that takes 1/10 of the frequency (DIF), but it takes 1/10 of the time (DIT).
) type version can be generated from the DIF method, in this case by performing a matrix arrangement F G 4 of the DFT matrix when the OFT matrix is symmetric. This F 64 can be made symmetric by multiplication by a shuffle matrix S4/1g, which transforms the rows of F@4 back to their natural order (increasing frequency). This DIT version is given below (after multiplying the DIF version by SJ, , and converting the result), F, , T=S/*Dp F4DP
ts TF4DρzS TF4Up Sts8 Here, To S□6=SJ Engineering, T Sζ is the arrangement matrix of Sbi, and Fs4T is the DIT DFT matrix.

Ｆ４が対角線に沿って４ポイント核叶Ｔを包含するブロ
ック正方行列であると共に、各ＤＦＴが自然の順序であ
るものと仮定し、従って対称の４Ｘ４マトリツクスであ
るので、このマトリックスＦ。This matrix F4 is a block square matrix containing a 4-point kernel T along the diagonal and assumes that each DFT is in natural order, thus being a symmetric 4X4 matrix.

が対称形であると共に、配置行列によって変化しないも
のである。位相回転および回転要素を包含するＩＩ　ｐ
　Ｉ＋マトリックスのすべてがすでに正方形であり、こ
れによって対称形となる。シャフルマトリックスＳ４／
ＩＧおよびＳＦＬが配置行列オペレーションによって変
化する。但し、回転要素マトリックスは、現在Ｆ４マト
リックスに先行するもので、これは、時間の１／１０を
取るプロセラ、すの特性および方法である。is symmetrical and does not change depending on the placement matrix. II p including phase rotation and rotating elements
All of the I+ matrices are already square, which gives them symmetry. Shuffle matrix S4/
IG and SFL vary with placement matrix operations. However, the rotating element matrix is currently the predecessor to the F4 matrix, which is a feature and method of the processor that takes 1/10th of the time.

また、変形例によれば、位相シフトオペレータの機能を
循環性コミュテータスイッチで実施できる。クロスチャ
ンネル通信を導入するｍ、ＦＦｒ演算エレメントの外側
で行なう。このような実施例は低速の低次の基数オペレ
ーションにとって好都合である。According to a variant, the function of the phase shift operator can also be implemented with a circular commutator switch. Introducing cross-channel communication is done outside the FFr computational element. Such an embodiment is advantageous for slow, low-order radix operations.

前述した技術を再帰性ＦＦＴプロセッサにも応用できる
ものである。同じハードウェアが繰返して利用できる再
帰性プロセッサの応用例は、低処理速度が受入れられる
ならば、適切なものである。The technique described above can also be applied to a recursive FFT processor. Recursive processor applications, where the same hardware can be used repeatedly, are suitable if low processing speeds are acceptable.

本発明の種々の実施例を詳述したが、当業者によれば変
形や変更を加え得ることは容易である。Although various embodiments of the invention have been described in detail, modifications and changes will readily occur to those skilled in the art.

従って、上述した実施例は単に一例にすぎず、本発明の
技術的思想を限定するものではなく、添付したすべての
クレームおよびそれらの均等物によって規定されるもの
である。Accordingly, the embodiments described above are illustrative only and are not intended to limit the spirit of the invention, which is to be defined by the accompanying claims and their equivalents.

[Brief explanation of the drawing]

第１図は、従来のＦＦＴプロセッサでのデータワードに
ついての説明図；第２図は、本発明のＦＦＴプロセッサでのデータワード
の位相シフトされた順序についての説明図第３図は、第
２図の位相シフトスキームを包含する３ステージＦＦＴ
プロセツサの構成図；第４図は、第３図のデバイスの減
少型収縮性幾何学バージョンの構成図；第５図は、第４図のプロセッサの単一ステージの構成図
；第６図は、第５図のＦＦＴ演算エレメントのブロック図
；第７図は、第６図のエレメントの演算アーキテイクチュ
ア；第８図は、第７図のエレメントで使用される回転係数を
表わす図、および第９図は、第８図で示した回転係数の小さなＲＯ阿バー
ジョンである。５・・・位相シフトユニット１０・・・フーリエ変換オペレータ２０・・アドレスユニット１００・・・シャフルユニット１１０・・・位相回転器２００・・・ＦＦＴ　フロセッサユニット２８０・・・
回転アドレスエレメント３４７・・・スケールファクタ出力３４９・・・スケールファクタ入力FIG. 1 is an illustration of the data words in a conventional FFT processor; FIG. 2 is an illustration of the phase-shifted order of data words in the FFT processor of the present invention; FIG. A three-stage FFT encompassing a phase shift scheme of
A block diagram of a processor; FIG. 4 is a block diagram of a reduced contractile geometry version of the device of FIG. 3; FIG. 5 is a block diagram of a single stage of the processor of FIG. 4; A block diagram of the FFT calculation element shown in FIG. 5; FIG. 7 shows the calculation architecture of the element shown in FIG. 6; FIG. The figure shows the RO version with a small rotation coefficient shown in FIG. 5... Phase shift unit 10... Fourier transform operator 20... Address unit 100... Shuffle unit 110... Phase rotator 200... FFT processor unit 280...
Rotation address element 347...Scale factor output 349...Scale factor input

Claims

[Scope of Claims] 1. In performing a fast Fourier transform operation on complex data words, the data words are received, arranged in R channels, and R channels corresponding to the R channels are arranged. an input memory means having an output; a plurality of series connected stages for converting the data words, wherein a first of these stages converts the data words in R channels into the input memory means; output memory means connected at the end of the series connected stages for outputting said data words, each of said stages having said R channels for connecting said stages; R input means and output means corresponding to R of the data words; and R shuffling means for temporally aligning R of said data words into vectors, each of said vectors wherein said R of said data words from each of said channels; a Fourier transform operator for Fourier transforming each of these vectors; R vectors for shifting the phase of these data words before they are input to said Fourier transform operator; first multiplier means and R for shifting the phase of the data words after outputting them to the Fourier transform operator;
R second multiplier means, where said second multiplier means is not the first stage of said series arranged stages, and R third multiplier means for providing a predetermined coefficient to each of said data words. multiplier means for not transmitting said input memory and output memory data words of one of said channels to the other of said channels except within said Fourier transform operator. Fourier transform calculation device. 2 one of the R third multiplier means
2. An arithmetic device according to claim 1, wherein said first multiplier means is integrated into one of said first multiplier means. 3. The arithmetic device according to claim 1, wherein one of the third multiplier means is merged with one of the second multiplier means. 4. The Fourier transform operator is provided with a block square matrix, and R point Fourier transforms are provided along the diagonal of the block.
Arithmetic device as described in section. 5. The arithmetic device according to claim 1, wherein the first multiplier means includes multiplier means for shifting the phase of the data word by an integral multiple of 360/R degrees. 6. Each of said third multiplier means includes: one or more programmable ROM means containing said predetermined coefficients; and tracking means for counting the number of operations performed by said Fourier transform operator. , and an address element for selecting the coefficient from the programmable ROM means using the tracking means. 7 receiving means for receiving the input stream of N digital data words in R channels for Fourier transformation into an input stream of N digital data words, each having a phase; Log_RN serially connected stages for said data loads; processing means for processing within the vectors, each of these stages comprising: shuffling means having an N×N matrix for temporally aligning said data words R in vectors; one of said data words from each of said R channels subtracted at intervals of N/R from said stream of digital data words; shifting means for shifting the phase of the R data words in each of said channels by an integer multiple of 360/R degrees, further Fourier transforming each of said vectors and having a plurality of matrix operators of radix R; A Fourier transform device comprising: a Fourier transform means. 8. In performing a fast Fourier transform on a digital data word having a phase, a plurality of serially connected Fourier transform means having a plurality of channels and performing a Fourier transform on a group of these data words; a plurality of phase shifting means for transforming said data words by means of each of said phase shifting means, placing said data words in a suitable order for the next stage of said Fourier transformation means;
A processor characterized in that the data words are not transmitted between the channels. 9. The device comprises a plurality of data channels and a plurality of shuffle operators for rearranging digital data, and each of these shuffle operators rearranges data from only one of the data channels. A high-speed Fourier transform calculation device for digital data. 10 for fast Fourier transforming digital data words, each having a phase; a plurality of data channels; a plurality of serially connected Fourier transform means for computing the Fourier transform of groups of these data words; a plurality of phase shifting means provided with one of the data words from each of said channels and for modifying the phase of one data word in said channel prior to said Fourier means; further prior to said Fourier transforming means; 1 of the said channels
a plurality of shuffling means for arranging data words in one of the channels so that said data words in one of said channels are not transmitted to another channel. 11. The transform device according to claim 10, wherein the Fourier transform means is provided with a block square matrix, and the Fourier transform is provided along the diagonal of the block. 12 R said data channels are provided, and each said phase shift means is configured to adjust the phase of said data word by 360/
12. The conversion device according to claim 11, further comprising multiplier means for shifting by an integral multiple of the R degree. 13 comprising a plurality of data channels and means for computing a Fourier transform of a plurality of groups of digital data, each of said groups being provided with one of said data words from each of said channels, said data words being transferred to said channels; A fast Fourier transform device characterized in that no data is transmitted between the two. 14. A digital device comprising a processor having a plurality of stages and a plurality of channels for digital data, the path for the digital data in each of these channels being fixed and without a switch. Fourier transform device for data. 15. comprising a plurality of channels and a plurality of connected stages for computing the Fourier transform of complex data words, further comprising phase shifting means for transforming the data words;
Data words, characterized in that the arrangement of the data words in one of the stages results in a predetermined order of the words for the next stage, such that no data is transmitted between these channels. Transformation device. 16 providing four said channels, and providing said phase shifting means with multiplier means for multiplying the data word in each of said channels by a power of j, such that the powers of j for each of these channels are consecutive; 16. The transforming device according to claim 15, wherein the integer powers of start from zero. 17. Providing R channels, the shifting means further comprising multiplier means for shifting the phase of the data word present in each of the R channels by a multiple of 360/R degrees; 16. The apparatus of claim 15, wherein the multiples of 360/R are consecutive integers starting with zero. 18 Perform a fast Fourier transform operation on the complex data word,
a plurality of serially connected stages for converting these data words; each of these stages has R input and output means connecting the stages and corresponding to the R channels; R shuffling means for aligning R in time within the vectors, each of these vectors being provided with one of the data words from each of these channels, and Fourier means for performing a Fourier transform on one of these vectors. a transform operator; R first multiplier means for shifting the phase of said data words before they are input to said Fourier transform operator; applying a predetermined coefficient to each of said data words; R second multiplier means for providing one of said channels between said input means and output means;
A fast Fourier transform computing device characterized in that data words in one channel are not transmitted to other channels. 19. Apparatus according to claim 18, characterized in that one of said R second multiplier means is merged with one of said R first multiplier means. 20. The apparatus of claim 18, wherein the Fourier transform operator is provided with a block square matrix having R-point Fourier transforms along the diagonal of the block. 21. each of said second multiplier means includes one or more programmable ROM means containing predetermined coefficients; and tracking means for counting the number of operations performed by said Fourier transform operator; 19. The apparatus of claim 18, further comprising an address element for selecting said coefficients from said programmable ROM means using said tracking means. 22 has R channels and Log_RN serially connected stages, and in performing a Fourier transform operation on an input stream of N digital data words, each of these stages has R channels of these digital words in a vector. an N×N matrix aligned in time, where each of said vectors is provided with one data word from each of said R channels subtracted from said data word stream at intervals of N/R. A Fourier transform device comprising: phase shifting means for transforming the data word without transmitting the data word between the channels; and Fourier transform means for Fourier transforming each of the vectors. 23. The Fourier transform device according to claim 22, wherein the Fourier transform means is provided with a block square matrix, and R-point Fourier transform is provided on this block square matrix along the diagonal of the block. 24 Having a plurality of Fourier transform means arranged in series, in performing Fourier transform of a digital data word, a plurality of phase shift means for transforming the data word are provided before the Fourier transform means, and these Fast Fourier transform processor, characterized in that each of the phase shifting means places said data words in an appropriate sequence for the next stage of said Fourier transform means. 25. in computing a fast Fourier transform of a digital data word in a plurality of data channels, computing a Fourier transform of said group of digital data words;
a plurality of series-arranged Fourier transform means, each group being provided with one of said data words from each of said channels, further comprising: a plurality of serially arranged Fourier transform means; A fast Fourier transform device, characterized in that it comprises a plurality of phase shift means for transforming the digital data word, the path followed by the digital data word being fixed and without switches. 26 Fast Fourier transforming an input stream of N digital data words, each with phase: (a) partitioning these data words into R channels; (b) steps (c) to (f) (c) aligning R of these data words in time into N/R vectors, where the data words in each of these vectors are applied to the input at N/R intervals; (d) shifting the phase of the data word, wherein the phase of the data word in each channel is , for each channel by a multiple of 360/R degrees; (e) computing the Fourier transform of each of said vectors; and (f) multiplying by a predetermined coefficient. A fast Fourier transform method characterized by: 27 In performing a fast Fourier transform operation on an input stream of N digital data words in a plurality of serially arranged stages, each of these stages: (a) transforms R of said data words into N/R vectors; (b) shifting the phase of the data words, wherein the phase of the data words in these channels is adjusted to A fast Fourier transform method, comprising: shifting each of the channels by a multiple of 360/R degrees; and further comprising: (c) calculating a Fourier transform of each of the vectors.