JP2003067349A

JP2003067349A - SIMD type microprocessor performing table conversion process corresponding to hysteresis of input value

Info

Publication number: JP2003067349A
Application number: JP2001261300A
Authority: JP
Inventors: Kosuke Takato; 浩資高藤
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2001-08-30
Filing date: 2001-08-30
Publication date: 2003-03-07

Abstract

PROBLEM TO BE SOLVED: To realize, in a SIMD type microprocessor, a table conversion process for nonlinear operations that responds to the hysteresis of input values in a small number of operation cycles. SOLUTION: This SIMD type microprocessor has a plurality of processor elements. A predetermined operation in the processor elements is defined whose output data for each input data is prepared in advance in a predetermined table. The operation is performed successively on the plurality of processor elements in a predetermined order. A table converter is provided in which the input data consists of state data and pre-conversion data, the output data consists of post-conversion data and next-state data, and the next-state data in the output data becomes the state data of the input data in the operation for the next processor element in the succession.

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a microprocessor, and more particularly to a SIMD (Single Instruction-stream, Multiple Data-stream) type microprocessor.

[0002]

[Description of the Related Art] In a SIMD type microprocessor, the same arithmetic operation can be executed simultaneously on a plurality of data items with a single instruction. Because of this capability, SIMD type microprocessors are frequently used in applications where the operation is identical but the data volume is very large (for example, image processing).

[0003] In ordinary arithmetic processing in a SIMD type microprocessor, a plurality of arithmetic units (processor elements, PEs) are arranged side by side and the same operation is executed simultaneously on a plurality of data items. However, in "nonlinear processing", where the content of the operation cannot be expressed by a single arithmetic expression, the expression changes according to the data being processed, so the same processing cannot be executed simultaneously. The data must then be processed sequentially, item by item, and the benefit of SIMD is lost.

[0004] Even in an ordinary SISD (Single Instruction-stream, Single Data-stream) type processor, nonlinear processing in which the arithmetic expression changes according to the data being processed can make the program very complicated. To avoid this, a common approach is to prepare in advance the processed result for every possible pre-operation value, arrange the results in a table, and obtain the post-operation data by table conversion based on the pre-operation data. Specifically, the table is stored in RAM (Random Access Memory), and the data read from the RAM, using as the address pointer the sum of the pre-operation data and the start address of the table, is taken as the post-operation data. For example, when the operation data (pre-operation and post-operation data) is 8 bits wide, a conversion table of 256 bytes is required.
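The lookup described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function `clip_curve`, the base address `TABLE_BASE`, and the RAM size are hypothetical choices made only to show how the pre-operation value plus the table's start address forms the address pointer.

```python
TABLE_BASE = 0x100          # hypothetical start address of the table in RAM
ram = bytearray(1024)       # hypothetical RAM model


def clip_curve(x):
    """Hypothetical nonlinear operation: a different expression per input range."""
    if x < 64:
        return 0
    if x < 192:
        return (x - 64) * 2
    return 255


# Precompute the result for every possible 8-bit input (256 bytes in total).
for x in range(256):
    ram[TABLE_BASE + x] = clip_curve(x)


def table_convert(x):
    """Post-operation data = RAM contents at (table start address + input)."""
    return ram[TABLE_BASE + x]
```

Once the table is filled, the branching in `clip_curve` is no longer executed at conversion time; every input costs one memory read.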

[0005] The size of this conversion table grows as a power of two with the bit width of the operation data. For cases where the bit width is considerably large, methods have therefore also been disclosed in which the operation data is divided into arbitrary intervals and an approximate expression for each interval is held as a table.

[0006] When this table conversion is adopted in a SIMD type microprocessor, it was thought that a table would be required for each operation unit. For example, in a 256-SIMD processor (that is, a SIMD processor with 256 operation units, or PEs), table conversion of 8-bit operation data (pre-operation and post-operation data) requires as many 256-byte tables (in RAM) as there are operation units, that is, 256 of them. This posed a serious cost problem. Various inventions have therefore been devised and disclosed for table conversion of nonlinear operations in SIMD type microprocessors.
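The memory cost in this example can be checked with a short calculation. The sketch below assumes each table entry is rounded up to a whole number of bytes; that rounding rule and the function names are illustrative assumptions, not taken from the source.

```python
def table_bytes(bits):
    """Size of one full conversion table for `bits`-wide operation data."""
    entries = 1 << bits                  # one entry per possible input value
    bytes_per_entry = (bits + 7) // 8    # assumption: entries occupy whole bytes
    return entries * bytes_per_entry


def per_pe_table_bytes(bits, num_pes):
    """Total size when every PE holds its own copy of the table."""
    return table_bytes(bits) * num_pes
```

For 8-bit data and 256 PEs this gives 256 tables of 256 bytes each, or 65,536 bytes of table RAM in total.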

[0007] Japanese Unexamined Patent Publication No. 5-67203 proposes a method in which pre-operation data is output sequentially to the outside from an output register built into each SIMD-unit PE (processor element), table conversion is performed sequentially outside, and the converted data is input sequentially to an input register built into the PE. Because this method needs only one conversion table, the cost increase is kept down; but since the processing is ultimately sequential, the processing time grows with the number of PEs, which is a disadvantage in speed. If this conversion is executed in parallel with normal processing in the PEs, the total processing time can be reduced, but problems remain: the input and output registers are dedicated to the conversion and cannot be used for other purposes, and when the converted data is needed, the PEs must wait for the conversion time, so parallel processing is not possible.

[0008] The situation is much the same in Japanese Unexamined Patent Publication No. 7-219919, which has a single nonlinear unit. A dedicated register used as a shift register is provided in each PE, and a nonlinear unit is inserted to form a ring-shaped communication path through which the nonlinear operation is performed; but in this case too, as many processing cycles as there are PEs are required.

[0009] Japanese Unexamined Patent Publication No. 9-305550 proposes a method in which pairs of pre-conversion data and post-conversion data are input sequentially from outside, each PE compares the input pre-conversion data with its own pre-operation data, and each PE that finds a match stores the post-conversion data and uses that value as its post-operation data. In this case, the processing time depends on the number of values the pre-operation data can take (that is, the number of words in the conversion table), so a speedup is obtained when this number of words is smaller than the number of PEs. However, when the pre-operation data is 8-bit data, about 256 cycles are needed regardless of the number of PEs, so the processing time is still long. A scheme that performs this conversion in parallel with other normal processing is also conceivable, but it suffers from the same problems as Japanese Unexamined Patent Publication No. 5-67203 above.

[0010] Further, Japanese Patent No. 2812292 proposes a method in which each PE supplies its pre-operation data as an address pointer to a conversion-table RAM having as many output ports as there are PEs, thereby obtaining the post-operation data. With this method the conversion completes in about one cycle, but increasing the number of output ports drives up the cost of the RAM, and in particular a RAM with more than a few dozen ports is not feasible. The method therefore cannot be applied to a SIMD type microprocessor with a large number of PEs.

[0011] In the prior art described above, nonlinear processing of the data in each PE of a SIMD type microprocessor equipped with a single table converter can be classified into the following two types of sequential data processing.

[0012] Processing method (1): Pre-conversion data is taken out of a PE and used as the address pointer to the corresponding table value, and the global processor (GP) or an external memory controller loads the data at that address into the PE, thereby realizing table conversion. This is repeated for all PEs in turn.
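The per-PE sequential lookup described above can be sketched as a simple loop. The function name and the returned step counter are illustrative assumptions; the counter is included only to make the cost (one table access per PE) visible.

```python
def convert_sequential(pe_values, table):
    """Convert each PE's value in PE order using a single shared table.

    Returns the converted values and the number of table accesses made.
    """
    results = []
    steps = 0
    for value in pe_values:            # one PE at a time, in PE order
        results.append(table[value])   # the value itself is the address pointer
        steps += 1
    return results, steps
```

The number of steps equals the number of PEs, even when several PEs hold the same value.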

[0013] As described later, the global processor (GP) is a main component of the SIMD type microprocessor. The GP itself is a SISD type processor; it contains a program RAM and a data RAM, decodes program code, and performs various arithmetic operations and various kinds of control over the SIMD type microprocessor as a whole.

[0014] Processing method (2): When the pre-conversion data is, for example, 8-bit data, a variable (register) that is incremented from 0 to 255 is set up in the GP. Each PE compares each of these values from 0 to 255 with its own pre-operation data, and on a match changes its operation result flag (sets it to "1"). The corresponding post-conversion data is loaded only into the PEs whose flag is set. Once the variable has been incremented from 0 to 255, the table conversion is complete.
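The broadcast-and-compare scheme described above can be sketched as follows. The function name and the step counter are illustrative assumptions; the inner list comprehension stands in for the comparison that all PEs perform in parallel in hardware.

```python
def convert_broadcast(pe_values, table, bits=8):
    """Sweep a counter over every possible input value; matching PEs load the result.

    Returns the converted values and the number of sweep steps.
    """
    results = list(pe_values)
    steps = 0
    for candidate in range(1 << bits):                 # GP counter: 0 .. 2**bits - 1
        flags = [v == candidate for v in pe_values]    # each PE sets its result flag
        for i, flag in enumerate(flags):
            if flag:                                   # only flagged PEs are loaded
                results[i] = table[candidate]
        steps += 1
    return results, steps
```

Here the step count is fixed at 2**bits (256 for 8-bit data), independent of how many PEs there are.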

[0015] With processing method (1), even if several PEs hold the same pre-operation data, the GP has no means of knowing this, so the processing must in the end be repeated as many times as there are PEs. With processing method (2), the GP has no means of knowing in advance the contents of each PE's pre-operation data, so the operation must in the end be repeated for the total number of possible pre-conversion values, that is, 256 times in the example above.

[0016] When processing method (1) is followed, the values stored in the register files of the individual PEs are table-converted one at a time, in the order in which the PEs are arranged. In doing so, additional processing may be required, such as dynamically changing the table used for conversion according to the history (that is, the hysteresis) of the pre-conversion data input to the table conversion unit, or monitoring the history of the input pre-conversion data itself. The prior art, however, has concentrated solely on speeding up table conversion. Techniques for table conversion that take the hysteresis of the input values into account have, at present, hardly been disclosed.

[0017] To realize table conversion that tracks the hysteresis of input values in this way, each PE or the GP must perform an operation that selects, from the result of the table conversion of one data item, the table or input state to apply to the next data item. This increases the overall number of cycles required for table conversion.

[0018]

[Problem to be Solved by the Invention] An object of the present invention is to realize, in nonlinear operations on a SIMD type microprocessor and in particular in the sequential per-PE table conversion described above (processing method (1)), table conversion that responds to the hysteresis of input values with a smaller number of operation cycles.

[0019]

[Means for Solving the Problem] The present invention has been made to achieve the above object. The table converter according to claim 1 of the present invention is a table converter in a SIMD type microprocessor that has a plurality of processor elements, in which a predetermined operation in the processor elements is defined whose output data for each input data is prepared in advance in a predetermined table, and in which the operation is performed successively on the plurality of processor elements in a predetermined order, wherein the input data consists of state data and pre-conversion data, the output data consists of post-conversion data and next-state data, and the next-state data in the output data becomes the state data of the input data in the operation for the next processor element in the succession.

[0020] The table converter according to claim 2 of the present invention is the table converter according to claim 1, in which the bit length of the state data and the next-state data can be changed and set.

[0021] The table converter according to claim 3 of the present invention is the table converter according to claim 1 or 2, in which the state data of the input data is either the next-state data of the output data from the immediately preceding operation in the succession of processor-element operations, or a fixed set value, and which of the two is used is determined by a condition flag provided in each processor element.

[0022] The SIMD type microprocessor according to claim 4 of the present invention is a SIMD type microprocessor equipped with the table converter according to any one of claims 1 to 3, in which the operation is performed successively on the plurality of processor elements in a predetermined order.

[0023]

[Embodiments of the Invention] Preferred embodiments of the present invention are described below with reference to the drawings.

[0024] FIG. 1 is a block diagram showing the schematic configuration of a SIMD type microprocessor 2 according to the present invention. The SIMD type microprocessor 2 broadly consists of a global processor 4, a register file 6, an operation array 8, and a table conversion unit 50. The table conversion unit 50 includes a memory and register file control unit 52 and a table memory 54.

[0025] (1) Global processor: The global processor 4 itself is a so-called SISD type processor. It contains a program RAM 10 and a data RAM 12 (see FIG. 2), decodes the program, and generates various control signals. These control signals are supplied not only to its various built-in blocks but also to the register file 6 and the operation array 8. When a GP (global processor) instruction is executed, it performs various arithmetic operations and program control using its built-in general-purpose registers, ALU (arithmetic logic unit), and so on.

[0026] (2) Register file: The register file holds the data processed by PE (processor element) instructions. A PE (processor element) 3 is, as is well known, the structural unit that executes an individual operation in a SIMD type microprocessor. As the register file 6 and the operation array 8 in FIG. 2 show, the SIMD type microprocessor 2 of FIG. 2 contains four PEs 3. Normally the number of PEs 3 is set to, for example, 256. The PE instructions are SIMD type instructions that perform the same processing simultaneously on the plurality of data items held in the register file 6. Reading data from and writing data to the register file 6 is controlled by the global processor 4. Data that is read out is sent to the operation array 8 and, after processing in the operation array 8, written back to the register file 6.

[0027] The register file 6 can also be accessed from outside the processor 2; specific registers can be read and written from outside, independently of control by the global processor 4.

[0028] (3) Operation array: The arithmetic processing of PE instructions is performed here. All control of the processing is performed by the global processor 4.

[0029] (4) Memory and register file control circuit 52 in the table conversion unit 50: This circuit controls access to the register file 6 from outside the processor 2. It is also connected to the table memory (table RAM) 54, which stores the tables for table conversion, and can perform table conversion on the data in the register file 6.

[0030] FIG. 2 is a block diagram showing the detailed configuration of the SIMD type microprocessor 2 that is a preferred embodiment of the present invention. FIG. 2 shows an example with four PEs. As noted above, the number of PEs 3 is normally set to, for example, 256.

[0031] The global processor 4 contains a program RAM 10 for storing the processor's program and a data RAM 12 for storing operation data. It also contains a program counter (PC) 14 that holds the program address; registers G0, G1, G2, and G3 (16, 18, 20, 22), which are general-purpose registers for storing operation data; a stack pointer (SP) 24 that holds the address in the data RAM to which registers are saved during register save and restore; a link register (LS) (not shown) that holds the caller's address on a subroutine call; an LI register (not shown) and an LN register (not shown) that likewise hold the branch-source address on an IRQ (Interrupt ReQuest) and on an NMI (Non-Maskable Interrupt ReQuest), respectively; and a processor status register (P) (not shown) that holds the state of the processor.

[0032] GP instructions are executed using these registers together with an instruction decoder (not shown), an ALU (not shown), a memory control circuit (not shown), an interrupt control circuit (not shown), an external I/O control circuit (not shown), and a GP operation control circuit 56.

[0033] When a PE instruction is executed, the register file 6 and the operation array 8 are controlled using the instruction decoder, a register file control circuit 58, and a PE operation control circuit (not shown). In addition, data can be transferred from the data RAM 12 to the register files 6 of a plurality of PEs.

[0034] The register file 6 contains thirty-two 8-bit registers 34 per PE, and the sets of 32 registers for (for example) 256 PEs form an array. For each PE, the registers 34 are called R0, R1, R2, ..., R31. Each register 34 has one read port and one write port to the operation array 8 and is accessed from the operation array 8 over an 8-bit bus shared for reading and writing. Of the 32 registers, 24 (R0 to R23) are accessible from outside the processor; any of these registers 34 can be read or written by supplying a clock (CLK), an address (Address), and read/write control (RWB) from outside. The remaining eight registers 34 (R24 to R31) are used for temporary storage of PE operation data.

[0035] Data from the data RAM 12 of the global processor 4 can also be written to the remaining eight registers 34 (R24 to R31). Subject to a write control signal from the global processor 4 and the contents of the data (flag) stored in the condition register (T register) 53 in the operation array 8 (described later), data from the data RAM 12 of the global processor 4 can be written simultaneously to the eight registers 34 (R24 to R31) in the register file 6 of a plurality of PEs 3 (those that satisfy the condition). The data RAM 12 also has a 64-bit output port, so 64 bits can be written simultaneously to the eight registers 34 of a single PE 3 as well.
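The conditional write described above can be sketched as a masked broadcast. This is only an illustration: the function name, the list-of-lists register model, and the assumption that a set T flag means "write enabled" are not stated in the source.

```python
def masked_write(register_files, t_flags, data8, first_reg=24):
    """Write eight bytes (64 bits) into R24..R31 of every PE whose T flag is set.

    register_files: one list of 32 register values per PE.
    t_flags: one condition flag per PE (assumption: True enables the write).
    data8: eight byte values taken from the GP data RAM.
    """
    if len(data8) != 8:
        raise ValueError("expected exactly eight bytes (64 bits)")
    for regs, flag in zip(register_files, t_flags):
        if flag:
            regs[first_reg:first_reg + 8] = data8
```

PEs whose flag is clear keep their previous register contents, so a single write from the GP updates only the selected subset of PEs.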

[0036] The operation array 8 contains a 16-bit ALU 36, a 16-bit A register 38, and an F register 40. An operation by a PE instruction takes data read from the register file 6 or data supplied by the global processor 4 as one input of the ALU 36 and the contents of the A register 38 as the other input. The result is stored in the A register 38. An operation is thus performed between the data supplied from registers R0 to R31 (34) or the global processor 4 and the data stored in the A register 38.

[0037] The 8-bit data from the register file 6 is shifted left by an arbitrary number of bits by the shift/extension circuit 44 and input to the ALU 36.
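As a rough model of this data path, the sketch below shifts an 8-bit register value left and combines it with the 16-bit A register. Addition is used only as an example operation, and the zero extension and 16-bit wraparound are assumptions; the source does not specify the extension mode or overflow behaviour.

```python
def pe_alu_add(a_reg, reg8, shift):
    """Return the new A register value: A + (8-bit register value << shift).

    Assumptions: the operand is zero-extended and the result wraps at 16 bits.
    """
    operand = (reg8 & 0xFF) << shift      # shift/extension stage
    return (a_reg + operand) & 0xFFFF     # 16-bit ALU, result goes back to A
```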

[0038] <<First Embodiment>> The SIMD type microprocessor 2 according to the present invention has one table conversion unit 50 and uses the tables it contains to perform nonlinear operations on the data in each PE 3. In addition, the SIMD type microprocessor 2 according to the present invention performs the following sequential data processing: "Pre-conversion data is taken out of a PE and used as the address pointer to the corresponding table value, and the GP or an external memory controller loads the data at that address into the PE, thereby performing table conversion. This is repeated for all PEs in turn."

[0039] Furthermore, the SIMD type microprocessor 2 according to the present invention selects one table from a plurality of tables according to the hysteresis of the input data before table conversion. That is, it performs the following processing:
(1) A plurality of conversion tables are prepared for a single nonlinear operation. Each table (case) is regarded as one "state". This forms a state machine.
(2) The tables differ from one another in content.
(3) By monitoring the state transition after each input data item is table-converted, the conversion table used for the next (subsequent) data conversion is selected dynamically.

[0040] That is, once the input data before table conversion and the "state" at that time are fixed, the data after table conversion and the next "state" are uniquely determined. In such table conversion, including the post-conversion transition-destination state value in the output data makes it possible to form a state machine, and hence to select tables dynamically and respond to the hysteresis of input values.

[0041] FIG. 3 shows a memory configuration, in particular a table memory configuration, for realizing the table conversion described above. The table memory of FIG. 3 is for the case where the data handled is n bits wide (n ≥ 0) and the "state" is represented by s bits (s ≥ 0). If the bit width of one word is w bits, then naturally s + n ≤ w.

[0042] In the memory table of FIG. 3, at the address whose address pointer is the combination of the n-bit pre-conversion data and the "state" represented by s bits, the n-bit post-conversion data and the next transition-destination state (data), represented by s bits, are stored. The s state bits may be placed either above or below the n data bits.
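The table layout described above can be sketched with a few bit-packing helpers. The function names and the `state_high` switch are hypothetical; they only illustrate that the s state bits and n data bits share one address (and one stored word) and that either ordering works as long as s + n ≤ w.

```python
def check_layout(n_bits, s_bits, word_bits):
    """Validate s + n <= w and return the number of table words, 2**(n + s)."""
    if n_bits + s_bits > word_bits:
        raise ValueError("state and data bits do not fit in one word")
    return 1 << (n_bits + s_bits)


def make_address(state, data, n_bits, s_bits, state_high=True):
    """Combine the current state and the pre-conversion data into an address."""
    if state_high:
        return (state << n_bits) | data
    return (data << s_bits) | state


def split_word(word, n_bits, s_bits, state_high=True):
    """Split a stored word into (next_state, post_conversion_data)."""
    n_mask = (1 << n_bits) - 1
    s_mask = (1 << s_bits) - 1
    if state_high:
        return (word >> n_bits) & s_mask, word & n_mask
    return word & s_mask, (word >> s_bits) & n_mask
```

For example, with n = 8 and s = 2 the table holds 1,024 words, and state 2 with data 0x5A maps to address 0x25A when the state occupies the upper bits.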

[0043] FIG. 4 shows the detailed configuration of the table conversion unit 50 of the first embodiment in the SIMD type microprocessor 2 according to the present invention. The operation of the table conversion unit 50 is described below.

[0044] First, an address pointer ((n + s) bits) is generated from the pre-conversion data (n bits) read out through the memory and register file control circuit 52 in FIG. 2 and an initial state value (s bits). Using this address pointer, the post-conversion data (n bits) and the next transition-destination state value (s bits) are obtained from the table memory 62 of FIG. 3. This "next transition-destination state value" (s bits) is fed back to the address generation unit 60. Next, the following address pointer is generated from the pre-conversion data (n bits) read out next through the memory and register file control circuit 52 in FIG. 2 and the transition-destination state value (s bits). Based on this address pointer, the post-conversion data (n bits) and the (further) next transition-destination state value (s bits) are newly obtained from the table memory 62 of FIG. 3.
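The feedback loop described above can be sketched with a concrete table. The example below is an assumption chosen for illustration, not the patent's table: a two-state thresholder with hysteresis (n = 8, s = 1) whose thresholds `LOW_T` and `HIGH_T` are hypothetical. The state occupies the upper bits of both the address and the stored word.

```python
N_BITS = 8                 # width of the data
S_BITS = 1                 # width of the state
LOW_T, HIGH_T = 96, 160    # hypothetical switching thresholds


def build_hysteresis_table():
    """Precompute (next_state, output) for every (state, input) pair."""
    table = [0] * (1 << (N_BITS + S_BITS))
    for state in range(1 << S_BITS):
        for x in range(1 << N_BITS):
            if state == 0:
                next_state = 1 if x >= HIGH_T else 0   # low: switch up at HIGH_T
            else:
                next_state = 0 if x <= LOW_T else 1    # high: switch down at LOW_T
            out = 255 if next_state else 0
            table[(state << N_BITS) | x] = (next_state << N_BITS) | out
    return table


def convert_with_state(pe_values, table, initial_state=0):
    """Convert values in PE order, feeding each next state into the next lookup."""
    data_mask = (1 << N_BITS) - 1
    state = initial_state
    results = []
    for x in pe_values:
        word = table[(state << N_BITS) | x]   # address = state bits + data bits
        results.append(word & data_mask)      # post-conversion data
        state = word >> N_BITS                # next transition-destination state
    return results
```

With the inputs `[100, 170, 120, 90, 120]` the value 120 is converted to 255 the first time and to 0 the second time, because the state carried over from the preceding PE differs. Each PE still costs exactly one table access.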

[0045] The address pointer is formed through the first selector 64 and the second selector 66. The first selector 64 selects the n bits needed as pre-conversion data and places them at the appropriate position in the address pointer (the lower bits in the figure). The second selector 66 selects the s bits needed as the initial state value or the transition destination state value and places them at the appropriate position in the address pointer (the upper bits in the figure). The post-conversion data output and the next transition destination state value are obtained through the third selector 68 and the fourth selector 70. The third selector 68 selects n bits as post-conversion data from the appropriate position in the output data (the lower bits in the figure). The fourth selector 70 selects s bits as the next transition destination state value from the appropriate position in the output data (the upper bits in the figure).
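As a rough software analogy, the four selectors behave like bit-field packing and unpacking. The sketch below assumes the layout shown in the figure (data in the lower n bits, state in the s bits directly above); the function names and the example widths are hypothetical.

```python
def make_address(data_word, state_word, n_bits, s_bits):
    """Model of selectors 64 and 66: keep n data bits and s state bits."""
    data = data_word & ((1 << n_bits) - 1)      # lower n bits of the data
    state = state_word & ((1 << s_bits) - 1)    # lower s bits of the state
    return (state << n_bits) | data             # state above data


def split_output(output_word, n_bits, s_bits):
    """Model of selectors 68 and 70: split a table output word."""
    converted = output_word & ((1 << n_bits) - 1)
    next_state = (output_word >> n_bits) & ((1 << s_bits) - 1)
    return converted, next_state


# Example with n = 8 and s = 2 (values chosen only for illustration).
print(hex(make_address(0xAB, 0x3, 8, 2)))   # → 0x3ab
print(split_output(0x2CD, 8, 2))            # → (205, 2)
```

Changing `n_bits` and `s_bits` in this model corresponds to changing the selector settings; the table contents would have to be laid out to match.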

[0046] When the address pointer to the table memory 62 is set, the "initial state value" is used the first time, and from the second time onward the "transition destination state value" obtained immediately before is used. Which of the two is used is selected and controlled by the multiplexer 72 in the address generation unit 60.

[0047] Suppose the SIMD microprocessor 2 has k PEs. To table-convert the data stored in (the registers 34 of) the k PEs 3, the sequence of reading the register 34 via the memory and register file control circuit 52, performing the table conversion, and writing the result back to the register 34 is repeated k times in order, starting from the first PE 3. Moreover, repeating this sequence k times is sufficient; no other operation needs to be performed in addition.

[0048] Consequently, for the table conversion of the second and subsequent PEs, no computation is needed to determine the next "state", and the conversion table is therefore switched dynamically and automatically.

[0049] In the address generation unit 60 of FIG. 4, changing the setting of the first selector 64 changes the bit width of the data taken in (that is, the n of the n bits), subject to the condition s + n ≤ w. Likewise, changing the setting of the second selector 66 changes the bit width of the "state" taken in (that is, the s of the s bits), subject to the same condition. If the settings of the third selector 68 and the fourth selector 70 are adjusted accordingly, the post-conversion data and the next transition destination state value can still be obtained correctly. In other words, s and n can be set to any values that satisfy the following conditions.

[Equation 1] n ≥ 0, s ≥ 0, s + n ≤ w
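As a small worked check of these conditions, the sketch below tests a few (n, s) pairs against an upper bound w. The value w = 16 and the function names are assumptions chosen for illustration. The entry count follows from the address pointer being (n + s) bits wide.

```python
def valid_widths(n_bits, s_bits, w_bits):
    """Return True if n and s satisfy Equation 1 for the given w."""
    return n_bits >= 0 and s_bits >= 0 and n_bits + s_bits <= w_bits


def table_entries(n_bits, s_bits):
    """Number of distinct (n + s)-bit address pointers."""
    return 1 << (n_bits + s_bits)


W = 16  # assumed value, for illustration only
for n, s in [(8, 8), (12, 4), (12, 5), (16, 0)]:
    print(n, s, valid_widths(n, s, W))
```

With w = 16, the pair (12, 5) is rejected because 12 + 5 exceeds 16, while (16, 0) is allowed and corresponds to an ordinary table conversion with no state bits.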

[0050] <<Second Embodiment>> FIG. 5 shows the detailed configuration of the table conversion unit 50' of the second embodiment in the SIMD microprocessor 2' according to the present invention. Because it is substantially the same as the table conversion unit 50 shown in FIG. 4, identical parts are given the same reference numerals and their description is omitted; the description focuses on the differences.

[0051] The SIMD microprocessor 2' of FIG. 6 includes the table conversion unit 50' of the second embodiment shown in FIG. 5 as a component. Compared with the SIMD microprocessor 2 shown in FIG. 2, the SIMD microprocessor 2' of FIG. 6 adds a path by which an arbitrary one bit of the condition register 53 of each PE 3 is read out via the memory and register file control circuit 52'.

[0052] The following describes how the table conversion unit 50' of the second embodiment shown in FIG. 5 table-converts the data (pre-conversion data) stored in the register 34 of each PE 3. The address generation unit 60' sets the address pointer, and the transition destination state value used for it is either the "next transition destination state value" fed back from the immediately preceding table conversion (or the "initial value"), or a preset "set value". The choice between the "next transition destination state value" and the "set value" is made in the first multiplexer 72' according to the condition flag, that is, the content of one bit of the condition register 53 of each PE 3. For example, if the condition flag is set to "1", the preset "set value" is used as the transition destination state value. Selecting the "set value" means that the conversion table in use is fixed; in other words, hysteresis handling is temporarily suspended.
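The per-PE selection in the second embodiment can be modelled as below. This is a sketch under stated assumptions: the names, the toy table contents, and the choice of 0 as the set value are hypothetical, and the model assumes that the next state returned by a lookup is still carried forward after a PE whose flag selected the set value, which the text does not spell out.

```python
def build_table(n_bits=2):
    """Toy table: a 1-bit state selects threshold 3 (state 0) or 1 (state 1)."""
    table = {}
    for state in (0, 1):
        threshold = 3 if state == 0 else 1
        for data in range(1 << n_bits):
            out = 1 if data >= threshold else 0
            table[(state << n_bits) | data] = (out, out)
    return table


def convert_with_flags(table, inputs, flags, initial_state, set_value, n_bits):
    """Convert inputs in order; a flag of 1 forces the preset set value."""
    state = initial_state
    outputs = []
    for data, flag in zip(inputs, flags):
        # Model of multiplexer 72': fed-back state or fixed set value.
        current = set_value if flag else state
        converted, state = table[(current << n_bits) | data]
        outputs.append(converted)
    return outputs


table = build_table()
print(convert_with_flags(table, [3, 2, 2, 2], [0, 0, 1, 0], 0, 0, 2))
# → [1, 1, 0, 0]
print(convert_with_flags(table, [3, 2, 2, 2], [0, 0, 0, 0], 0, 0, 2))
# → [1, 1, 1, 1]
```

In the first call the third element is converted with the fixed table selected by the set value, so the hysteresis built up by the earlier inputs is ignored for that PE.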

[0053]

[Effects of the Invention] The present invention provides the following effects.

[0054] With the table conversion unit of the first embodiment of the present invention and a SIMD microprocessor including it, a table conversion process that changes the applied table according to the hysteresis of the input values can be realized without computing the transition of the applicable table in each PE or in the GP, and in the same number of cycles as an ordinary table conversion process.

[0055] Furthermore, the number of bits of the value fed back from the table conversion output data as the transition destination state value can be made variable. A more flexible state machine can therefore be constructed.

[0056] The table conversion unit of the second embodiment of the present invention and a SIMD microprocessor including it not only select the conversion table dynamically according to the hysteresis of the input values but also provide a path for selecting a fixed conversion table. Hysteresis-aware table conversion can therefore be mixed with ordinary table conversion against a fixed table that does not need to be changed dynamically.

[Brief description of drawings]

FIG. 1 is a block diagram showing the schematic configuration of a SIMD microprocessor according to the present invention.

FIG. 2 is a block diagram showing the detailed configuration of a SIMD microprocessor that is a preferred embodiment of the present invention.

FIG. 3 shows a table memory configuration for realizing the table conversion process according to the present invention.

FIG. 4 shows the detailed configuration of the table conversion unit of the first embodiment in a SIMD microprocessor according to the present invention.

FIG. 5 shows the detailed configuration of the table conversion unit of the second embodiment in a SIMD microprocessor according to the present invention.

FIG. 6 is a block diagram showing the detailed configuration of a SIMD microprocessor that is a preferred embodiment of the present invention and that includes the table conversion unit of the second embodiment shown in FIG. 5 as a component.

[Explanation of symbols]

2, 2' ... SIMD microprocessor
3 ... processor element
4 ... global processor
6 ... register file
8 ... arithmetic array
34 ... register
50, 50' ... table conversion unit
52, 52' ... memory and register file control circuit
53 ... condition register
54 ... table memory
60, 60' ... address generation unit
62 ... table memory
64 ... first selector
66 ... second selector
68 ... third selector
70 ... fourth selector
72, 72' ... first multiplexer

Claims

[Claims]

1. A table converter for a SIMD microprocessor that comprises a plurality of processor elements, in which a predetermined operation on a processor element is defined whereby output data corresponding to input data is prepared in advance in a predetermined table, and in which the operation is performed successively on the plurality of processor elements in a predetermined order, wherein the input data are state data and pre-conversion data, the output data are post-conversion data and next state data, and the next state data in the output data becomes the state data of the input data in the operation on the next processor element performed in succession.

2. The table converter according to claim 1, wherein the bit lengths of the state data and the next state data can be changed and set.

3. The table converter according to claim 1 or 2, wherein whether the state data of the input data is the next state data of the output data from the immediately preceding operation among the successively performed operations on the processor elements, or a fixed set value, is determined by a condition flag provided in each processor element.

4. A SIMD type microprocessor equipped with the table converter according to any one of claims 1 to 3, wherein a plurality of processor elements continuously perform the operation in a predetermined order.