JP4669781B2 - Processing equipment

Info

Publication number: JP4669781B2
Application number: JP2005370914A
Authority: JP
Inventors: 真小曽根; 誠岡田
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 2005-12-22
Filing date: 2005-12-22
Publication date: 2011-04-13
Anticipated expiration: 2025-12-22
Also published as: JP2007172400A

Description

The present invention relates to a processing apparatus including a reconfigurable circuit.

In recent years, wireless communication such as mobile phones, GPS, and VICS has become widespread, and the variety of wireless devices has grown. Implementing all of these radios or their functions in hardware increases cost and mounting area. This has led to the concept of a "software radio", in which hardware with general-purpose capability is installed and various functions are realized by switching the software loaded onto it.

Circuits used to realize a software radio include FPGAs (Field Programmable Gate Arrays) and DSPs (Digital Signal Processors). Patent Document 1 proposes a method of reusing a circuit configuration by dynamically reconfiguring an FPGA. A circuit of a type that can be changed dynamically is hereinafter referred to as a reconfigurable circuit.

An FPGA allows the circuit configuration to be designed relatively freely by writing circuit data after the LSI is manufactured, and is used for designing dedicated hardware. An FPGA includes basic cells, each consisting of a lookup table (LUT) that stores the truth table of a logic circuit and an output flip-flop, and programmable wiring resources that connect the basic cells. The desired logic operation is realized by writing the data to be stored in the LUTs and the wiring data. However, an LSI designed with an FPGA has a much larger mounting area and a higher cost than an ASIC (Application Specific IC) design. For this reason, a method has been proposed for reusing the circuit configuration by dynamically reconfiguring the FPGA (see, for example, Patent Document 1).
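As a rough illustration of the LUT-based basic cell described above, the following Python sketch models a LUT as a stored truth table indexed by the input bits. The function name `make_lut` and the bit ordering (first input is the most significant index bit) are assumptions made for this example and are not taken from the patent.

```python
def make_lut(truth_table):
    """Return a function that evaluates a k-input LUT.

    truth_table[i] is the output bit for the input combination whose bits,
    read as a binary number (first input = most significant bit), equal i.
    """
    size = len(truth_table)
    k = size.bit_length() - 1
    if size != 1 << k:
        raise ValueError("truth table length must be a power of two")

    def lut(*inputs):
        if len(inputs) != k:
            raise ValueError(f"expected {k} inputs")
        index = 0
        for bit in inputs:
            index = (index << 1) | (bit & 1)
        return truth_table[index]

    return lut


# "Writing circuit data" corresponds to choosing the table contents.
xor2 = make_lut([0, 1, 1, 0])
and2 = make_lut([0, 0, 0, 1])
```

Changing only the table contents turns the same cell into a different gate, which is the sense in which the logic is defined by written data rather than by fixed wiring.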

In a reconfigurable circuit that dynamically configures the operation functions of logic circuits and the connections between them, the number of bits processed by a logic circuit is generally equal to the number of bits in one memory address (see, for example, Patent Document 2).
Patent Document 1: Japanese Patent Laid-Open No. 10-256383
Patent Document 2: JP 2005-276854 A

In satellite broadcasting, for example, the broadcast mode may be switched depending on the season or other factors in order to adjust image quality. A satellite broadcast receiver has a separate circuit built into hardware for each broadcast mode in advance, and a selector switches between these circuits according to the current mode. The circuits for the other broadcast modes therefore sit idle in the meantime. When several dedicated circuits are used by switching between them, as in mode switching, and the switching interval is relatively long, the LSI can instead be reconfigured instantaneously at switching time rather than building in multiple dedicated circuits. This simplifies the circuit structure, improves versatility, and reduces mounting cost. To meet such needs, the manufacturing industry has become increasingly interested in dynamically reconfigurable LSIs. In particular, LSIs mounted on mobile terminals such as mobile phones and PDAs (Personal Digital Assistants) must be small; if an LSI can be dynamically reconfigured so that its functions are switched as needed for each use, its mounting area can be kept small.

An FPGA offers a high degree of freedom in circuit design and is general-purpose. On the other hand, to allow connections between all basic cells it must include a large number of switches and a control circuit that turns those switches on and off, which inevitably increases the mounting area of the control circuit. In addition, the connections between basic cells follow complicated wiring patterns, so the wiring tends to be long, and because many switches are connected to a single wire, delay increases. For these reasons, FPGA-based LSIs are often used only for prototyping and experiments, and are not suited to mass production when mounting efficiency, performance, and cost are considered. Furthermore, an FPGA-based LSI must send setting data to a large number of LUT-based basic cells, so configuring the circuit takes considerable time. FPGA-based LSIs are therefore not suitable for applications that require the circuit configuration to be switched instantaneously.

To solve these problems, reconfigurable processors built from multi-function elements called ALUs (Arithmetic Logic Units), each having a plurality of basic operation functions, have been under development in recent years. In a reconfigurable processor, setting command data controls the operation function configuration of the ALU circuits and the connection units, so that the desired arithmetic processing circuit is realized as a whole. Command data is generally created by generating a data flow graph (DFG) from a source program written in a high-level programming language such as C and then using that information.

As shown in FIG. 11, in a processor with a conventional memory configuration (see Patent Document 2), the number of bits processed by the ALU equals the number of bits in one memory address, and ALU input/output data is stored as is. Consequently, when the effective bit count of the ALU input/output data is small, upper bits that are not actually needed are also stored, and memory is wasted. The waste grows as the ALU processing width increases, for example to 32 bits or further to 64 bits.

The present invention has been made in view of these circumstances, and its object is to provide a processing apparatus including a reconfigurable circuit that contributes to a reduction in circuit scale.

To achieve the above object, a processing apparatus according to a first aspect of the present invention includes a reconfigurable circuit having an arithmetic unit composed of a plurality of logic circuits, each capable of selectively executing a plurality of arithmetic and logic operation functions, and a connection unit that holds the connection relationship between the plurality of logic circuits. The processing apparatus comprises: storage means for storing data used by the logic circuits; divided storage means for, when the effective bit count of data to be stored in the storage means exceeds the number of bits of a memory word designated by one address of the storage means, dividing the data into a plurality of portions each having the number of bits of the memory word and storing them in a plurality of memory words; and concatenated reading means for restoring the data to its original bit count when the data stored across the plurality of memory words by the divided storage means is read from those memory words.

A processing apparatus according to a second aspect of the present invention includes a reconfigurable circuit having an arithmetic unit composed of a plurality of logic circuits, each capable of selectively executing a plurality of arithmetic and logic operation functions, and a connection unit that holds the connection relationship between the plurality of logic circuits. The processing apparatus comprises: storage means for storing data used by the logic circuits; degenerate storage means for, when the effective bit count of data to be stored in the storage means does not exceed the number of bits of a memory word designated by one address of the storage means, changing the data to a restorable form whose bit count is no greater than that of the memory word and storing it in memory; and extended reading means for restoring the data to match the input bit count of the logic circuit when the data stored by the degenerate storage means is read.

A processing apparatus according to a third aspect of the present invention comprises the storage means, the divided storage means, the concatenated reading means, the degenerate storage means, and the extended reading means.

Furthermore, the number of bits of the memory word designated by one address of the storage means that stores data used by the logic circuits is smaller than the number of bits of data processed by the logic circuits.

Preferably, in the divided storage means, the storage area in which each of the portions having the number of bits of the memory word is stored is restricted according to the bit position of that portion in the original data.

In particular, the divided storage means or the degenerate storage means limits the number of data items output from the arithmetic unit according to the effective bit count of the data.

Also in particular, the concatenated reading means or the extended reading means limits the number of data items input to the arithmetic unit according to the effective bit count of the data.

Furthermore, the divided storage means or the degenerate storage means limits the number of data items output simultaneously from the arithmetic unit so that the total number of memory words used, which depends on the effective bit count of each data item output from the arithmetic unit, does not exceed the number of memory words that can be written simultaneously.

Still further, the concatenated reading means or the extended reading means limits the number of data items input simultaneously to the arithmetic unit so that the total number of memory words used, which depends on the effective bit count of each data item input to the arithmetic unit, does not exceed the number of memory words that can be read simultaneously.

Preferably, the divided storage means or the degenerate storage means either limits the number of data items output from the arithmetic unit according to the effective bit count of the data, or further limits the number of data items output simultaneously from the arithmetic unit so that the total number of memory words used, which depends on the effective bit count of each output data item, does not exceed the number of memory words that can be written simultaneously. Likewise, the concatenated reading means or the extended reading means either limits the number of data items input to the arithmetic unit according to the effective bit count of the data, or further limits the number of data items input simultaneously to the arithmetic unit so that the total number of memory words used, which depends on the effective bit count of each input data item, does not exceed the number of memory words that can be read simultaneously.

A processing apparatus according to a fourth aspect of the present invention is any one of the processing apparatuses described above, wherein, in generating the setting data that controls the operation of the reconfigurable circuit by setting the arithmetic and logic operation functions selectively executed by the plurality of logic circuits and the connection relationship between the plurality of logic circuits, the effective bit count of the input/output data of the logic circuits is determined based on a program describing the intended arithmetic processing to be executed by the plurality of logic circuits, and the setting data is generated accordingly.

According to the processing apparatus of the present invention, data can be stored in memory with a number of bits closer to its effective bit count than in a conventional reconfigurable circuit, so memory is used efficiently and less memory capacity is required. The smaller memory capacity in turn allows the circuit scale to be reduced.

An embodiment of a processing apparatus according to the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a processing apparatus including a reconfigurable circuit according to an embodiment of the present invention. As shown in FIG. 1, the processing apparatus 10 includes an integrated circuit device 26, which has a function that allows its circuit configuration to be reconfigured. The integrated circuit device 26 is configured as a single chip and includes a reconfigurable circuit 1, a setting unit 14, a control unit 18, an internal state holding circuit 20, an output circuit 22, and a path unit 24. The function of the reconfigurable circuit 1 can be changed by changing its settings.

The setting unit 14 has a first setting unit 14a, a second setting unit 14b, a third setting unit 14c, a fourth setting unit 14d, and a selector 16, and supplies the reconfigurable circuit 1 with setting data 40 for configuring the intended circuit. The path unit 24 functions as a feedback path and connects the output of the reconfigurable circuit 1 to the input of the reconfigurable circuit 1. The internal state holding circuit 20 and the output circuit 22 are each composed of a sequential circuit such as a data flip-flop (D-FF) or of a memory, and receive the output of the reconfigurable circuit 1. The internal state holding circuit 20 is connected to the path unit 24 and includes a storage unit 4 that stores data used by the reconfigurable circuit 1. The reconfigurable circuit 1 is configured as a combinational circuit or as a sequential circuit that includes state-holding elements such as D-FFs.

FIG. 2 shows a configuration example of the reconfigurable circuit 1 using an ALU array. As shown in FIG. 2, the reconfigurable circuit 1 has a structure with a plurality of groups of logic circuits (ALUs) 2, each including an arithmetic logic unit or the like whose function can be changed, and at least one connection unit 3 that is provided between the groups and can selectively establish connections between the ALUs 2 of different groups.

In the reconfigurable circuit 1, a plurality of ALUs 2 capable of selectively executing operation functions are arranged in a matrix; in the example of FIG. 2 they form an ALU array of X stages and Y columns. The ALUs 2 arranged in each stage form a group, and the processing result of one group is passed to the next group according to the connections selectively established in the connection unit 3. For data passed from an upper stage to a lower stage, setting a connection data set in the connection switches that switch the connections between logic circuits determines which ALU 2 in the lower stage receives the data. During operation, the circuit performs arithmetic processing according to the configuration information and outputs the result.
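The stage-by-stage data flow described above can be sketched in Python as follows. This is a minimal model under stated assumptions: the operation names in `OPS`, the function `run_array`, and the tuple format `(op, src_a, src_b)` used to stand in for the function selection and connection data are all hypothetical and are not defined by the patent.

```python
import operator

# Hypothetical operation table; the text only says each ALU can
# selectively execute several arithmetic and logic functions.
OPS = {
    "add": operator.add,
    "sub": operator.sub,
    "and": operator.and_,
    "or": operator.or_,
    "mov": lambda a, b: a,  # pass the first operand through
}


def run_array(inputs, stages):
    """Evaluate a small ALU array one stage at a time.

    inputs: values fed to the first stage.
    stages: list of stages; each stage is a list of (op, src_a, src_b)
            tuples. src_a and src_b index into the previous stage's
            outputs, playing the role of the connection data.
    """
    values = list(inputs)
    for stage in stages:
        values = [OPS[op](values[a], values[b]) for op, a, b in stage]
    return values


config = [
    [("add", 0, 1), ("sub", 2, 3)],  # stage 1: two ALUs
    [("and", 0, 1)],                 # stage 2: one ALU
]
result = run_array([6, 3, 12, 2], config)  # (6 + 3) & (12 - 2)
```

Replacing `config` changes both the function of each ALU and the routing between stages without touching `run_array`, which mirrors how new setting data reconfigures the same hardware.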

The function of each logic circuit (ALU) 2 and the connection relationship between the logic circuits are set based on the setting data 40 supplied by the setting unit 14 shown in FIG. 1. The setting data 40 is generated by the following procedure.

A program 36 to be realized by the integrated circuit device 26 is held in a setting storage unit 34. The program 36 describes a signal processing circuit, a signal processing algorithm, or the like in a high-level language such as C. A compiling unit 30 compiles the program 36 stored in the setting storage unit 34, converts it into a data flow graph (DFG) 38, and stores it in the setting storage unit 34. The data flow graph 38 represents, as a graph structure, the flow of operations and data from the input data (input variables and constants) to the output data. The compiling unit 30 generates the data flow graph 38 in accordance with the connection restrictions of the groups of ALUs 2 in the reconfigurable circuit 1.

A setting data generation unit 32 generates the setting data 40 from the data flow graph 38. The setting data 40 is data for mapping the data flow graph 38 onto the reconfigurable circuit 1, and defines the functions of the ALUs 2 in the reconfigurable circuit 1 and the connection relationship between the logic circuits.
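To make the mapping step more concrete, the sketch below represents a small data flow graph as a Python dictionary and assigns each node to the earliest ALU stage at which its operands are available. The patent does not specify a mapping algorithm; this earliest-stage levelization, the function name `assign_stages`, and the dictionary format are assumptions chosen only for illustration.

```python
def assign_stages(dfg):
    """Assign each DFG node to an ALU stage (1-based).

    dfg maps a node name to (op, operand, operand). Operands that are not
    node names are treated as primary inputs available at stage 0.
    """
    stage = {}

    def depth(name):
        if name not in dfg:
            return 0  # primary input or constant
        if name not in stage:
            _, a, b = dfg[name]
            stage[name] = 1 + max(depth(a), depth(b))
        return stage[name]

    for node in dfg:
        depth(node)
    return stage


# DFG for: t = a + b; u = c - d; y = t & u
dfg = {
    "t": ("add", "a", "b"),
    "u": ("sub", "c", "d"),
    "y": ("and", "t", "u"),
}
stages = assign_stages(dfg)
```

Here `t` and `u` land in the first stage and `y` in the second, so the graph fits a two-stage array; a real generator would also have to respect the connection restrictions mentioned in the text.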

(Embodiment 1)
FIG. 3 shows an example of the configuration of the storage unit 4, which stores data used by the reconfigurable circuit 1 in the processing apparatus 10 according to the present invention. In this embodiment, the ALU array of the reconfigurable circuit 1 processes 32 bits. As memory 5, the storage unit 4 has two RAMs, 5a and 5b, in each of which the memory word designated by one address is 16 bits wide. The storage unit 4 thus stores the output of 32-bit ALUs in RAMs with 16-bit memory words. In the figure, thick lines represent 32-bit data lines and thin lines represent 16-bit data lines.

The storage unit 4 includes a register 7a that receives the 32-bit data of the reconfigurable circuit 1 as is, and a register 7b that receives the lower 16 bits of the 32-bit data. The data in the 32-bit register 7a is input to a selection circuit (MUX) 6a that selects either the upper 16 bits or the lower 16 bits. The register 7b, which receives the lower 16 bits, is connected to the RAM 5b.

On the read side of the RAMs 5a and 5b, registers 7c and 7d, a bit extension circuit 8, a selection circuit 6b, and combining circuits 9a and 9b are provided. The bit extension circuit 8 extends the MSB (most significant bit) of the data read from the RAM 5b to generate the upper 16 bits. The selection circuit 6b selects either the output of the bit extension circuit 8 or the data from the RAM 5a. The combining circuit 9a generates the original data by taking the data from the RAM 5a as the lower 16 bits and the 16 bits obtained by extending its MSB as the upper 16 bits. The combining circuit 9b generates the original data by taking the data from the RAM 5b as the lower 16 bits and the output of the selection circuit 6b as the upper 16 bits.

Next, the operation of the storage unit 4 will be described. As shown in FIG. 3, the storage unit 4 treats the two RAMs 5a and 5b as a pair. If the effective bit count of the output data from an ALU 2 is 17 bits or more, the upper and lower halves are stored in the two RAMs 5a and 5b of the pair. That is, the selection circuit 6a selects the upper 16 bits of the data in the register 7a and stores them in the RAM 5a, while the lower 16 bits are stored in the RAM 5b. Of the pair of RAMs 5a and 5b that make up the memory 5, the RAM 5a is for the upper half and the RAM 5b is for the lower half.

For data stored in divided form, the upper 16 bits and lower 16 bits are read from the RAMs 5a and 5b respectively, concatenated, and input to the ALU 2 as the original 32-bit data. The selection circuit 6b selects the data from the RAM 5a as the upper 16 bits of the combining circuit 9b. The lower 16 bits of the combining circuit 9b are the data from the RAM 5b (the data in the register 7d). In this way, the 32-bit data divided between the RAMs 5a and 5b is concatenated by the combining circuit 9b and read out as the original 32-bit data.
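The divided store and concatenated read described above amount to a shift-and-mask operation on each side. The following Python sketch shows the arithmetic only; the function names `split_store` and `concat_read` are hypothetical, and the registers and selection circuits of FIG. 3 are not modeled.

```python
WORD_BITS = 16
WORD_MASK = (1 << WORD_BITS) - 1


def split_store(value32):
    """Split a 32-bit value into (upper, lower) 16-bit memory words.

    The upper word corresponds to what goes to RAM 5a and the lower
    word to what goes to RAM 5b.
    """
    value32 &= 0xFFFFFFFF
    return (value32 >> WORD_BITS) & WORD_MASK, value32 & WORD_MASK


def concat_read(upper, lower):
    """Rebuild the 32-bit value from its two 16-bit words."""
    return ((upper & WORD_MASK) << WORD_BITS) | (lower & WORD_MASK)


upper, lower = split_store(0x12345678)
restored = concat_read(upper, lower)
```

Because the split is a pure bit partition, the round trip is lossless for any 32-bit pattern.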

When the effective bit count of the output from an ALU is 16 bits or less, only the lower 16 bits of the 32 bits are stored, in either the RAM 5a or the RAM 5b. Either RAM may be used. For example, successive data items of 16 bits or less are stored in whichever of the RAM 5a and the RAM 5b has the larger unused area. When the lower 16 bits are stored in the RAM 5a, the selection circuit 6a selects the lower 16 bits and stores them in the RAM 5a; at that time, the RAM 5b may store a different data item of 16 bits or less. When only the lower 16 bits are stored in the RAM 5b, the lower 16 bits are stored in the RAM 5b from the register 7b; at that time, the RAM 5a may store a different data item of 16 bits or less.
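One possible placement policy consistent with this paragraph is sketched below in Python. The class `RamPair`, its `free` counters, and the tie-breaking rule (prefer RAM 5b when both have the same unused area) are assumptions for illustration; the text only gives "the RAM with the larger unused area" as an example.

```python
class RamPair:
    """Toy model of the RAM 5a / RAM 5b pair with 16-bit words."""

    def __init__(self, depth):
        # Number of unused words in each RAM.
        self.free = {"5a": depth, "5b": depth}

    def place(self, effective_bits):
        """Return the RAMs that each receive one word of this datum."""
        if effective_bits > 16:
            targets = ["5a", "5b"]  # upper word, lower word
        else:
            # Narrow data: pick the RAM with more unused words.
            # max() keeps the first maximum, so ties go to "5b".
            targets = [max(("5b", "5a"), key=lambda r: self.free[r])]
        if any(self.free[r] == 0 for r in targets):
            raise MemoryError("no free word in target RAM")
        for r in targets:
            self.free[r] -= 1
        return targets
```

For instance, after one wide datum has been stored, two narrow data items would go to RAM 5b and then RAM 5a, keeping the two RAMs evenly filled.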

Data for which only the lower 16 bits were stored is bit-extended when read from the RAM 5a or the RAM 5b and is input to the ALU 2 as the original 32-bit data. That is, when lower 16-bit data stored in the RAM 5a is read, the combining circuit 9a generates the original data by taking the data from the RAM 5a (the data in the register 7c) as the lower 16 bits and the 16 bits obtained by extending the MSB of the register 7c as the upper 16 bits.

How the upper bits are filled when extending the lower 16 bits depends on the data type. For example, when the 16-bit data is a code such as a character code, the upper 16 bits may always be filled with 0. When the data is of an integer type and negative numbers are represented in two's complement, the bits filled into the upper 16 bits must be the same as the MSB of the lower 16 bits. That is, the upper 16 bits are filled with 0 for a positive number and with 1 for a negative number. If the ALU 2 never uses the upper 16 bits for anything other than integer types, the bit extension circuit 8 may always perform the same processing as for integer types.
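The two extension rules described above (zero fill for codes, MSB replication for two's-complement integers) can be written compactly. The Python sketch below is illustrative only; the names `extend_to_32` and `to_signed32` and the `signed` flag are hypothetical and do not correspond to signals named in the patent.

```python
def extend_to_32(word16, signed=True):
    """Extend a stored 16-bit word to a 32-bit ALU input.

    signed=True replicates the MSB into the upper 16 bits
    (two's-complement integers); signed=False fills the upper
    16 bits with zeros (for example, character codes).
    """
    word16 &= 0xFFFF
    if signed and (word16 & 0x8000):
        return 0xFFFF0000 | word16
    return word16


def to_signed32(bits):
    """Interpret a 32-bit pattern as a two's-complement integer."""
    bits &= 0xFFFFFFFF
    return bits - (1 << 32) if bits & 0x80000000 else bits
```

With sign extension, the stored word `0xFFFE` is restored as the 32-bit pattern for -2, whereas zero extension of the same word would yield 65534; this is why the rule has to match the data type.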

When data for which only the lower 16 bits were stored in the RAM 5b is read, the bit extension circuit 8 extends the MSB (most significant bit) of the data read from the RAM 5b to generate the upper 16 bits. The selection circuit 6b selects the output of the bit extension circuit 8 as the upper 16 bits of the combining circuit 9b. The combining circuit 9b generates the original data by taking the data from the RAM 5b (the data in the register 7d) as the lower 16 bits and the output of the selection circuit 6b (the upper 16 bits generated by the bit extension circuit 8) as the upper 16 bits.

In this way, data for which only the lower 16 bits were stored in the RAM 5a or the RAM 5b is extended to the original 32-bit data by the combining circuit 9a or the combining circuit 9b when it is read.

When the effective bit count of the output data from the ALU 2 is 16 bits or less, the conventional storage unit 4a shown in FIG. 11 also stores the unneeded upper 16 bits in memory. The storage unit 4 of Embodiment 1 avoids storing this unneeded data, so data can be stored in the memory 5 more efficiently than before. As a result, less memory capacity is needed and the circuit scale can be reduced.
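The saving can be estimated with simple arithmetic. The Python sketch below compares the memory bits used by a conventional one-32-bit-word-per-datum layout with the 16-bit-word scheme of this embodiment; the function names and the list of effective widths are hypothetical sample values, not measurements from the patent.

```python
def words16_needed(effective_bits):
    """16-bit memory words used by one datum in this embodiment."""
    return 1 if effective_bits <= 16 else 2


def memory_bits(effective_bit_list, scheme):
    """Total memory bits used to store a list of data items."""
    if scheme == "conventional":  # one 32-bit word per datum
        return 32 * len(effective_bit_list)
    return 16 * sum(words16_needed(b) for b in effective_bit_list)


data = [8, 12, 16, 24, 32, 10]  # hypothetical effective widths in bits
conventional = memory_bits(data, "conventional")
proposed = memory_bits(data, "proposed")
```

For this sample, four of the six items fit in a single 16-bit word, so the total drops from 192 bits to 128 bits; the actual saving depends entirely on how many data items have 16 or fewer effective bits.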

In FIG. 3, the RAM 5a stores either 16-bit data or the upper half of 32-bit data, and the RAM 5b stores either 16-bit data or the lower half of 32-bit data. On the input side of the memory 5, the left RAM 5a is therefore provided with a selection circuit (MUX) 6a that selects either the upper 16 bits or the lower 16 bits, whereas the right RAM 5b needs no selection circuit because lower data is always written to it.

On the output side of the storage unit 4, a selection circuit 6b selects between the upper 16 bits output from the left RAM 5a (when the original data is 32-bit data) and the upper 16 bits obtained by bit-extending the output of the right RAM 5b (when the original data is 16-bit data). A combining circuit 9b then forms 32-bit data by taking the output of the selection circuit 6b as the upper 16 bits and the output of the right RAM 5b as the lower 16 bits. In addition, a combining circuit 9a is provided that bit-extends the output of the left RAM 5a into 32-bit data when lower 16-bit data is stored in the left RAM 5a. Because the upper 16 bits of the combining circuit 9a are always the 16 bits obtained by bit-extending the output of the left RAM 5a, it needs no selection circuit like the selection circuit 6b for choosing the upper 16 bits.

FIG. 4 shows an example circuit configuration from the ALUs 2 to the memory 5 when the storage unit 4 of FIG. 3 is used. Three ALUs 2 can access the pair of RAMs 5a and 5b of FIG. 3. Because the left RAM 5a must accept both upper and lower 16-bit halves, it is provided with a 6→1 selection circuit (MUX) 6c that selects one 16-bit item from six candidates (three ALUs × two halves). Because the right RAM 5b receives only lower 16-bit halves, it is provided with a 3→1 selection circuit (MUX) 6d that selects one 16-bit item from three candidates (three ALUs × one half).

When the 6→1 MUX 6c selects an upper 16-bit half, the 3→1 MUX 6d selects the lower 16-bit half of the same ALU's output. When the 6→1 MUX 6c selects a lower 16-bit half, the 3→1 MUX 6d may select the lower 16-bit half of a different ALU 2. The outputs of these three ALUs 2 are therefore limited to either two data items with 16 or fewer effective bits or one data item with 17 or more effective bits. The same holds for inputs to the ALUs 2: they are limited to two data items (kinds) with 16 or fewer effective bits or one data item (kind) with 17 or more effective bits. The input/output restriction on the ALUs 2 thus depends on the number of effective bits of the data.
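The relationship between the two selection circuits of FIG. 4 can be sketched as follows. The `Sel6` tuple, the function names, and the representation of ALU outputs as a list of 32-bit integers are assumptions made for illustration only.

```python
from typing import NamedTuple


class Sel6(NamedTuple):
    """Setting of the 6-to-1 MUX 6c: which ALU, and which half of its output."""
    alu: int    # 0, 1, or 2
    half: str   # "upper" or "lower"


def selection_is_consistent(sel6: Sel6, sel3_alu: int) -> bool:
    """If MUX 6c takes an upper half, MUX 6d must take the same ALU's lower half."""
    if sel6.half == "upper":
        return sel3_alu == sel6.alu
    return True  # two independent narrow items are allowed


def route(alu_outputs: list[int], sel6: Sel6, sel3_alu: int) -> tuple[int, int]:
    """Return the 16-bit words presented to RAM 5a and RAM 5b."""
    if not selection_is_consistent(sel6, sel3_alu):
        raise ValueError("upper half selected without the matching lower half")
    word = alu_outputs[sel6.alu]
    to_5a = (word >> 16) & 0xFFFF if sel6.half == "upper" else word & 0xFFFF
    to_5b = alu_outputs[sel3_alu] & 0xFFFF
    return to_5a, to_5b
```

With outputs `[0x12345678, 0x00001BCD, 0x00000042]`, selecting the upper half of ALU 0 forces MUX 6d to ALU 0 (one wide item), whereas selecting the lower half of ALU 1 leaves MUX 6d free to take ALU 2 (two narrow items).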

The number of effective bits of the input/output data of the ALU 2 is determined from the program 36 that describes the intended arithmetic processing to be executed by the reconfigurable circuit 1. When that processing is written in C, the number of effective bits can be determined from variable types or from comments. For example, a variable of type int (integer) is treated as having 32 effective bits, and a variable of type short as having 16. When creating the setting data 40, the setting data generation unit 32 uses the effective bit counts determined for the input/output data of the ALU 2 to generate setting data 40 that stores the data in the RAMs 5a and 5b under the restriction described above.
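A determination of this kind might look like the following sketch. The widths for `int` and `short` follow the example in the text (the C standard itself leaves them implementation-defined). The `/* bits=N */` comment annotation is a hypothetical format invented here, since the description does not specify how comments carry the width.

```python
import re

# Widths as given in the text's example; not guaranteed by the C standard.
TYPE_BITS = {"int": 32, "short": 16}

# Matches e.g. "short gain;" or "int acc; /* bits=12 */" (hypothetical annotation).
_DECL = re.compile(
    r"^\s*(int|short)\s+(\w+)\s*;\s*(?:/\*\s*bits\s*=\s*(\d+)\s*\*/)?"
)


def effective_bits(decl_line: str) -> tuple[str, int]:
    """Return (variable name, effective bit count) for a simple C declaration."""
    match = _DECL.match(decl_line)
    if match is None:
        raise ValueError(f"unsupported declaration: {decl_line!r}")
    ctype, name, override = match.groups()
    # A comment annotation, when present, takes precedence over the type width.
    return name, int(override) if override else TYPE_BITS[ctype]
```

A setting-data generator could then use the returned width to decide whether a variable needs one RAM word or a pair.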

FIG. 5 shows a configuration of the storage unit 4 that uses multiple pairs of RAMs 5a, 5b, 5c, and 5d. The storage unit 4 of FIGS. 3 and 4 uses one pair of RAMs 5a and 5b, but multiple pairs may be used as in FIG. 5. FIG. 5 uses two pairs, RAMs 5a and 5b and RAMs 5c and 5d. In this case the input/output data is limited to four data items (kinds) with 16 or fewer effective bits, two data items (kinds) with 17 or more effective bits, or one data item (kind) with 17 or more effective bits plus one data item (kind) with 16 or fewer effective bits.
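The three permitted combinations for the two-pair configuration can be encoded directly as a check. This sketch lists exactly the cases stated in the text as upper bounds on (wide, narrow) counts; the names are illustrative.

```python
# (max items with >= 17 effective bits, max items with <= 16 effective bits),
# taken one-for-one from the three cases listed for FIG. 5.
ALLOWED_MAXIMA = [(0, 4), (2, 0), (1, 1)]


def allowed_in_one_access(effective_bits: list[int]) -> bool:
    """True if the set of data widths falls within one of the listed cases."""
    wide = sum(1 for bits in effective_bits if bits > 16)
    narrow = len(effective_bits) - wide
    return any(wide <= w and narrow <= n for w, n in ALLOWED_MAXIMA)
```

For instance, widths `[12, 8, 16, 3]` and `[20, 10]` are accepted, while `[20, 20, 10]` is rejected.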

When two pairs of RAMs are used as in the storage unit of FIG. 5, the ALUs 2 of the reconfigurable circuit may be divided into two groups, one accessing the left pair of RAMs 5a and 5b and the other accessing the right pair of RAMs 5c and 5d. Alternatively, every ALU 2 may be allowed to access either pair of RAMs 5a, 5b, 5c, and 5d.

This embodiment has described the case in which 32-bit data is divided into upper and lower 16-bit halves and stored in the storage unit 4, but the data width and the stored width are not limited to 32 bits and 16 bits. The memory 5 need only consist of enough RAMs to hold the number of input/output bits of the ALU 2. For example, 16-bit data may be divided into two 8-bit parts for storage, and 64-bit data may be divided into upper and lower 32-bit halves. A configuration that divides data into an upper part and a lower part is the same as the configurations of FIGS. 3 and 4.

Data may also be divided into more than two parts for storage. For example, 64-bit data may be divided into four 16-bit parts. In that case the data is stored in four 16-bit RAMs according to four cases: 16 or fewer effective bits, 32 or fewer, 48 or fewer, and more than 48.

FIG. 6 shows an example configuration of the storage unit 4 in which data output from a 64-bit ALU array is stored in four 16-bit RAMs. In FIG. 6, outlined (hollow) lines represent 64-bit data lines, thick lines represent 32-bit data lines, and thin lines represent 16-bit data lines. The storage unit 4 of FIG. 6 performs the division of the storage unit 4 of FIG. 3 in two stages, and likewise performs extension and synthesis in two stages.

When the 64-bit data has more than 32 effective bits, it is divided into four 16-bit parts that are stored in the RAMs 5a, 5b, 5c, and 5d. When it has 32 or fewer effective bits, only the lower 32 bits are stored, in the RAMs 5a and 5b or in the RAMs 5c and 5d. When it has 16 or fewer effective bits, only the least significant 16 bits are stored, in any one of the RAMs 5a, 5b, 5c, and 5d.
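The two-stage scheme of FIG. 6 amounts to storing one, two, or four 16-bit words and sign-extending on readout. The following sketch assumes signed two's-complement data and represents the stored words as a list ordered from least significant; the assignment of words to particular RAMs is left out, and all names are illustrative.

```python
MASK64 = (1 << 64) - 1


def rams_used(effective_bits: int) -> int:
    """Number of 16-bit RAM words used for the three cases described for FIG. 6."""
    if effective_bits <= 16:
        return 1
    if effective_bits <= 32:
        return 2
    return 4


def store64(value: int, effective_bits: int) -> list[int]:
    """Split the low-order part of a 64-bit value into 16-bit words (LSW first)."""
    count = rams_used(effective_bits)
    return [(value >> (16 * i)) & 0xFFFF for i in range(count)]


def load64(words: list[int]) -> int:
    """Reassemble the stored words and sign-extend the result to 64 bits."""
    value = 0
    for i, word in enumerate(words):
        value |= (word & 0xFFFF) << (16 * i)
    bits = 16 * len(words)
    if bits < 64 and value & (1 << (bits - 1)):
        value |= MASK64 ^ ((1 << bits) - 1)  # fill the upper bits with the sign
    return value
```

A 64-bit value equal to -32768 is stored as the single word 0x8000 and restored in full on readout.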

The memory words addressed by one address in the paired RAMs need not have the same width. For example, the RAM 5a may be 8 bits wide and the RAM 5b 24 bits wide. In that case, data with 8 or fewer effective bits is stored in the RAM 5a or the RAM 5b, data with 9 to 24 effective bits is stored in the RAM 5b, and data with 25 or more effective bits is stored across the RAM 5a and the RAM 5b as a pair.
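For the 8-bit/24-bit example, the placement rule reduces to a small lookup. The sketch below returns the candidate placements as tuples of RAM names; the function name and return format are assumptions for illustration.

```python
def placements_8_24(effective_bits: int) -> list[tuple[str, ...]]:
    """Candidate placements when RAM 5a is 8 bits wide and RAM 5b is 24 bits wide."""
    if not 1 <= effective_bits <= 32:
        raise ValueError("effective_bits must be between 1 and 32")
    if effective_bits <= 8:
        return [("5a",), ("5b",)]   # either RAM alone is wide enough
    if effective_bits <= 24:
        return [("5b",)]            # only the 24-bit RAM is wide enough
    return [("5a", "5b")]           # both RAMs are needed as a pair
```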

FIG. 7 shows an example of the storage unit 4 in which the memory 5 consists of three RAMs of differing widths. In FIG. 7, thick solid lines represent 32-bit data lines, thin solid lines represent 16-bit data lines, and dotted lines represent 8-bit data lines. The ALU array of the reconfigurable circuit 1 operates on 32 bits. As the memory 5, the storage unit 4 includes a RAM 5a whose memory word per address is 16 bits and RAMs 5e and 5f whose memory word per address is 8 bits. The storage unit 4 stores the output of the 32-bit ALU in the 16-bit RAM 5a and the 8-bit RAMs 5e and 5f.

The storage unit 4 includes a register 7a that receives the 32-bit data of the reconfigurable circuit 1 as is, a register 7b that receives the lower 16 bits of the 32-bit data, and a register 7c that receives the least significant 8 bits of the 32-bit data. Here D1 to D4 denote the four 8-bit fields of the 32-bit data, from most significant to least significant. The data in the 32-bit register 7a is input to a selection circuit (MUX) 6a that selects the upper 16 bits (D1 and D2), the middle 16 bits (D2 and D3), or the lower 16 bits (D3 and D4). The output of the selection circuit 6a is input to the RAM 5a.

The data in the register 7b, which receives the lower 16 bits, is input to a selection circuit (MUX) 6b that selects either the upper 8 bits or the lower 8 bits of those 16 bits. The output of the selection circuit 6b is input to the RAM 5e. The register 7c, which receives the least significant 8 bits, is connected to the RAM 5f.

The read side of the RAMs 5a, 5e, and 5f is provided with registers 7d, 7e, 7f, 7g, and 7h, bit extension circuits 8a, 8b, and 8c, selection circuits 6c, 6d, 6e, and 6f, and synthesis circuits 9a, 9b, and 9c. The bit extension circuit 8a generates 8 bits by extending the MSB (most significant bit) of the 16-bit data read from the RAM 5a (register 7d). The bit extension circuit 8b generates 8 bits by extending the MSB of the 8-bit data read from the RAM 5e (register 7e). The bit extension circuit 8c generates 8 bits by extending the MSB of the 8-bit data read from the RAM 5f (register 7f).

The expansion circuit 7g doubles the output of the bit extension circuit 8b to 16 bits. The expansion circuit 7h doubles the output of the bit extension circuit 8c to 16 bits.

The selection circuit 6c selects between the data of the RAM 5a and the data of the expansion circuit 7g to supply the middle 16 bits (D2 and D3) of the synthesis circuit 9b. The selection circuit 6d selects among the data of the RAM 5a and the data of the expansion circuit 7g or 7h to supply the upper 16 bits (D1 and D2) of the synthesis circuit 9c. The selection circuit 6e selects between the output of the bit extension circuit 8a and the output of the bit extension circuit 8b to supply the upper 8 bits (D1) of the synthesis circuit 9b. The selection circuit 6f selects between the data of the RAM 5e and the output of the bit extension circuit 8c to supply the upper 8 bits (D3) of the lower 16 bits of the synthesis circuit 9c.

The operation of the storage unit 4 is described next. As shown in FIG. 7, the storage unit 4 treats the three RAMs 5a, 5e, and 5f as one set. When the output data from the ALU 2 has 25 or more effective bits, it is divided and stored across all three RAMs 5a, 5e, and 5f. Specifically, the selection circuit 6a selects the upper 16 bits of the data in the register 7a and stores them in the RAM 5a. The upper 8 bits of the lower 16 bits are selected by the selection circuit 6b and stored in the RAM 5e. The least significant 8 bits are stored in the RAM 5f.

Data with 25 or more effective bits that was divided and stored in this way is read back by reading the upper 16 bits, the upper 8 bits of the lower 16 bits, and the lower 8 bits from the RAMs 5a, 5e, and 5f respectively, combining them, and inputting the result to the ALU 2 as the original 32-bit data. The selection circuit 6d selects the data of the RAM 5a as the upper 16 bits of the synthesis circuit 9c. The selection circuit 6f selects the data of the RAM 5e (the data in the register 7e) as the upper 8 bits of the lower 16 bits of the synthesis circuit 9c. The lower 8 bits (D4) of the lower 16 bits of the synthesis circuit 9c are the data of the RAM 5f (register 7f). In this way, the 32-bit data divided and stored in the RAMs 5a, 5e, and 5f is concatenated by the synthesis circuit 9c and read out as the original 32-bit data.

When the output of the ALU has 17 to 24 effective bits, the middle 16 bits (D2 and D3) of the 32 bits are stored in the RAM 5a and the least significant 8 bits (D4) are stored in the RAM 5e. Specifically, the selection circuit 6a selects the middle 16 bits of the data in the register 7a and stores them in the RAM 5a. The least significant 8 bits (D4) are selected from the register 7b by the selection circuit 6b and stored in the RAM 5e.

Data with 17 to 24 effective bits that was divided and stored in this way is concatenated by the synthesis circuit 9b and read out as the original 32-bit data. The selection circuit 6c selects the data of the RAM 5a (the data in the register 7d) as the middle 16 bits (D2 and D3) of the 32 bits. The least significant 8 bits of the synthesis circuit 9b are the data of the RAM 5e. The selection circuit 6e selects the output of the bit extension circuit 8a as the most significant 8 bits of the synthesis circuit 9b.

When the output of the ALU has 16 or fewer effective bits, only the lower 16 bits of the 32 bits are stored, either in the RAM 5a or in the RAMs 5e and 5f. For example, successive data items of 16 or fewer bits are stored in whichever of the RAM 5a and the RAMs 5e and 5f has more unused space. When the lower 16 bits are stored in the RAM 5a, the selection circuit 6a selects the lower 16 bits (D3 and D4) and stores them in the RAM 5a. At that time the RAMs 5e and 5f may store a different data item of 16 or fewer bits. When the lower 16 bits are stored in the RAMs 5e and 5f, the selection circuit 6b selects the upper 8 bits (D3) from the register 7b and stores them in the RAM 5e, and the least significant 8 bits (D4) from the register 7c are stored in the RAM 5f. At that time the RAM 5a may store a different data item of 16 or fewer bits.

Data for which only the lower 16 bits were stored is bit-extended when read from the RAM 5a or the RAMs 5e and 5f and is input to the ALU 2 as the original 32-bit data. Specifically, when the lower 16 bits stored in the RAM 5a are read, the synthesis circuit 9a uses the data of the RAM 5a (the data in the register 7d) as the lower 16 bits and uses, as the upper 16 bits, the 8 bits obtained by extending the MSB of the register 7d, doubled to 16 bits, thereby regenerating the original data.

When the lower 16 bits stored in the RAMs 5e and 5f are read, the synthesis circuit 9c concatenates the read data. The least significant 8 bits are the data read from the RAM 5f. The selection circuit 6f selects the data of the RAM 5e as the upper 8 bits (D3) of the lower 16 bits. The bit extension circuit 8b extends the MSB of the register 7e, and the expansion circuit 7g doubles the result to 16 bits. The selection circuit 6d selects the data of the expansion circuit 7g as the upper 16 bits of the synthesis circuit 9c.

When the output of the ALU 2 has 8 or fewer effective bits, only the least significant 8 bits of the 32 bits are stored, in the RAM 5e or the RAM 5f. Either the selection circuit 6b selects the least significant 8 bits and stores them in the RAM 5e, or the least significant 8 bits (held in the register 7c) are stored in the RAM 5f. Alternatively, the lower 16 bits of the 32 bits may be stored in the RAM 5a, in which case storage and readout are the same as for data with 16 or fewer effective bits.

Data for which only the least significant 8 bits were stored is extended to the original 32 bits by the synthesis circuit 9b or 9c when read. When such data is read from the RAM 5e, the selection circuit 6c selects the data of the expansion circuit 7g as the middle 16 bits. The selection circuit 6e selects the output of the bit extension circuit 8b as the most significant 8 bits (D1) of the synthesis circuit 9b. The least significant 8 bits of the synthesis circuit 9b are the data read from the RAM 5e (held in the register 7e).

When data for which only the least significant 8 bits were stored is read from the RAM 5f, the selection circuit 6d selects the data of the expansion circuit 7h (the output of the bit extension circuit 8c doubled in width) as the upper 16 bits of the synthesis circuit 9c. The selection circuit 6f selects the output of the bit extension circuit 8c as the upper 8 bits (D3) of the lower 16 bits of the synthesis circuit 9c. The least significant 8 bits of the synthesis circuit 9c are the data read from the RAM 5f (held in the register 7f).

The storage unit 4 of FIG. 7 can store the lower 8 bits in the RAM 5e or the RAM 5f when the output data from the ALU 2 has 8 or fewer effective bits. With 9 to 16 effective bits, it can store the lower 16 bits in the RAM 5a or in the RAM 5e + RAM 5f. With 17 to 24 effective bits, it can store the lower 24 bits in the RAM 5a + RAM 5e. With 25 or more effective bits, it can store all 32 bits in the RAM 5a + RAM 5e + RAM 5f.
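The four cases for FIG. 7 can be collected into a behavioral model of storing and reading back. The sketch assumes signed two's-complement data, uses an `alt` flag (an invented parameter) to choose the second placement where the text offers two, and infers the read configuration from which RAMs hold data, whereas in the apparatus that choice would come from the setting data.

```python
def sign_extend(value: int, from_bits: int, to_bits: int = 32) -> int:
    """Copy the MSB of a from_bits-wide value into the bits above it."""
    value &= (1 << from_bits) - 1
    if value & (1 << (from_bits - 1)):
        value |= ((1 << to_bits) - 1) ^ ((1 << from_bits) - 1)
    return value


def store_fig7(word: int, effective_bits: int, alt: bool = False) -> dict[str, int]:
    """Return the RAM words written for a 32-bit word with the given effective width."""
    word &= 0xFFFFFFFF
    d3, d4 = (word >> 8) & 0xFF, word & 0xFF
    if effective_bits >= 25:
        return {"5a": word >> 16, "5e": d3, "5f": d4}      # D1D2, D3, D4
    if effective_bits >= 17:
        return {"5a": (word >> 8) & 0xFFFF, "5e": d4}      # D2D3, D4
    if effective_bits >= 9:
        return {"5e": d3, "5f": d4} if alt else {"5a": word & 0xFFFF}
    return {"5f": d4} if alt else {"5e": d4}


def load_fig7(stored: dict[str, int]) -> int:
    """Reassemble the 32-bit word, sign-extending whatever was not stored."""
    keys = set(stored)
    if keys == {"5a", "5e", "5f"}:
        return stored["5a"] << 16 | stored["5e"] << 8 | stored["5f"]
    if keys == {"5a", "5e"}:
        return sign_extend(stored["5a"] << 8 | stored["5e"], 24)
    if keys == {"5a"}:
        return sign_extend(stored["5a"], 16)
    if keys == {"5e", "5f"}:
        return sign_extend(stored["5e"] << 8 | stored["5f"], 16)
    (byte,) = stored.values()   # a single 8-bit RAM word
    return sign_extend(byte, 8)
```

For example, 0xFFF23456 has at most 24 effective bits, so it is stored as 0xF234 in RAM 5a and 0x56 in RAM 5e and is restored by extending bit 23.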

The narrower the RAM word, the better the RAM utilization and the smaller the required RAM capacity, but the RAM peripheral circuitry (MUXes and the like) and the amount of setting data increase. The best configuration therefore depends on the overall circuit scale and on the system.

In FIGS. 3, 4, 6, and 7, registers 7a, 7b, and so on are drawn on the input and output sides of the memory 5 for ease of understanding, but these registers need not be circuits that hold data; they may simply be data lines.

(Embodiment 2)
FIG. 8 shows a different example of the configuration of the storage unit 4 that stores data used by the reconfigurable circuit 1 in the processing apparatus 10 according to the present invention. As in the storage unit 4 of FIG. 3, output data from the ALU 2, which operates on 32 bits, is stored in RAMs whose memory word per address is 16 bits. In this embodiment, the ALU array of the reconfigurable circuit 1 operates on 32 bits.

As the memory 5, the storage unit 4 includes two RAMs whose memory word per address is 16 bits. The storage unit 4 stores the output of the 32-bit ALU 2 in these 16-bit RAMs. In the figure, thick lines represent 32-bit data lines and thin lines represent 16-bit data lines.

The storage unit 4 includes a register 7 that receives the 32-bit data of the reconfigurable circuit 1 and a selection circuit 6a that selects either the upper 16 bits or the lower 16 bits of the register 7. The output of the selection circuit 6a is connected to the RAM 5a. The lower 16 bits of the register 7 are connected to the RAM 5b.

The read side of the RAMs 5a and 5b is provided with selection circuits 6b and 6e, a register 7e, a bit extension circuit 8, and a synthesis circuit 9b. The selection circuit 6e selects either the one-memory-word data of the RAM 5a or that of the RAM 5b and inputs it to the register 7e. The register 7e is connected to the lower 16 bits of the synthesis circuit 9b. The bit extension circuit 8 extends the MSB (most significant bit) of the data in the register 7e to generate upper 16 bits. The selection circuit 6b selects either the output of the bit extension circuit 8 or the data of the RAM 5a. The synthesis circuit 9b regenerates the original data using the data in the register 7e as the lower 16 bits and the output of the selection circuit 6b as the upper 16 bits.

The operation of the storage unit 4 is described next. As shown in FIG. 8, the two RAMs 5a and 5b form a pair. When the output data from the ALU 2 has 17 or more effective bits, the upper 16 bits and the lower 16 bits are stored in the respective RAMs 5a and 5b of the pair. Specifically, the selection circuit 6a selects the upper 16 bits of the register 7 and stores them in the RAM 5a, and the lower 16 bits of the register 7 are stored in the RAM 5b. In this case, of the paired RAMs 5a and 5b of the memory 5, the RAM 5a holds the upper half and the RAM 5b the lower half.

Data divided and stored in this way is read back by reading the upper and lower 16 bits from the RAMs 5a and 5b respectively, combining them, and inputting the result to the ALU 2 as the original 32-bit data. The selection circuit 6b selects the data of the RAM 5a as the upper 16 bits of the synthesis circuit 9b. The selection circuit 6e selects the data of the RAM 5b and sets it in the register 7e, whose data becomes the lower 16 bits of the synthesis circuit 9b. The output of the bit extension circuit 8 is not used in this case. The upper 16 bits of the synthesis circuit 9b are thus the data of the RAM 5a output by the selection circuit 6b, and the lower 16 bits are the data of the RAM 5b (the data in the register 7e). In this way, the 32-bit data divided and stored in the RAMs 5a and 5b is concatenated by the synthesis circuit 9b and read out as the original 32-bit data.

When the output of the ALU 2 has 16 or fewer effective bits, only the lower 16 bits of the 32 bits are stored, in either the RAM 5a or the RAM 5b. Either RAM may be used. For example, successive data items of 16 or fewer bits are stored in whichever of the RAM 5a and the RAM 5b has more unused space. When the lower 16 bits are stored in the RAM 5a, the selection circuit 6a selects the lower 16 bits of the register 7 and stores them in the RAM 5a; the RAM 5b is not used at that time. When the lower 16 bits are stored in the RAM 5b, the lower 16 bits of the register 7 are stored in the RAM 5b; the RAM 5a is not used at that time.

Data for which only the lower 16 bits were stored is bit-extended when read from the RAM 5a or the RAM 5b and is input to the ALU 2 as the original 32-bit data. Specifically, when the lower 16 bits stored in the RAM 5a are read, the selection circuit 6e selects the data of the RAM 5a and sets it in the register 7e. The bit extension circuit 8 extends the MSB of the register 7e to generate the upper 16 bits. The selection circuit 6b selects the output of the bit extension circuit 8 as the upper 16 bits of the synthesis circuit 9b. The data in the register 7e (the data of the RAM 5a) becomes the lower 16 bits of the synthesis circuit 9b. The synthesis circuit 9b thus regenerates the original data with the data of the RAM 5a (the data in the register 7e) as the lower 16 bits and the 16 bits obtained by extending the MSB of the register 7e as the upper 16 bits.

When data for which only the lower 16 bits were stored in the RAM 5b is read, the selection circuit 6e selects the data of the RAM 5b and sets it in the register 7e. The bit extension circuit 8 extends the MSB of the register 7e to generate the upper 16 bits. The selection circuit 6b selects the output of the bit extension circuit 8 as the upper 16 bits of the synthesis circuit 9b. The data in the register 7e (the data of the RAM 5b) becomes the lower 16 bits of the synthesis circuit 9b. The synthesis circuit 9b thus regenerates the original data with the data of the RAM 5b (the data in the register 7e) as the lower 16 bits and the output of the selection circuit 6b (the upper 16 bits generated by the bit extension circuit 8) as the upper 16 bits.

In this way, data for which only the lower 16 bits were stored in the RAM 5a or the RAM 5b is extended by the synthesis circuit 9b and read out as the original 32-bit data.
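The read path of FIG. 8 can be summarized as a single function with one line per circuit element. The `mode` argument stands in for the selection signals that the setting data would provide; its three string values, like the function name, are assumptions for illustration.

```python
def read_fig8(ram5a_word: int, ram5b_word: int, mode: str) -> int:
    """Model the FIG. 8 read path.

    mode is "pair" (32-bit data split across both RAMs),
    "narrow_5a" (16-bit data held in RAM 5a), or
    "narrow_5b" (16-bit data held in RAM 5b).
    """
    if mode not in ("pair", "narrow_5a", "narrow_5b"):
        raise ValueError(f"unknown mode: {mode!r}")
    # Selection circuit 6e: choose the word that becomes the lower 16 bits.
    reg7e = (ram5a_word if mode == "narrow_5a" else ram5b_word) & 0xFFFF
    # Bit extension circuit 8: replicate the MSB of register 7e across 16 bits.
    extended = 0xFFFF if reg7e & 0x8000 else 0x0000
    # Selection circuit 6b: RAM 5a data for split data, extension otherwise.
    upper = (ram5a_word & 0xFFFF) if mode == "pair" else extended
    # Synthesis circuit 9b: concatenate upper and lower halves.
    return (upper << 16) | reg7e
```

Because the lower half always passes through the single register 7e, only one narrow item can be read per access, which is the restriction that distinguishes this configuration from that of FIG. 3.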

When the output of the ALU 2 has 16 or fewer effective bits, storage of the unneeded upper 16 bits is avoided as in the configuration of FIG. 3, so data can be stored in the storage unit 4 more efficiently than before. In FIG. 8, the left RAM 5a stores the upper or lower 16 bits of the 32-bit output data from the ALU 2, and the right RAM 5b stores the lower 16 bits. In the storage unit 4 of FIG. 3, data with 16 or fewer effective bits can be read from or written to the RAMs 5a and 5b simultaneously, whereas in the storage unit 4 of FIG. 8 only one of the RAMs 5a and 5b can be read or written at a time, even for data of 16 or fewer bits. However, FIG. 3 omits the selection circuit that selects the 32-bit output of the storage unit 4. When that circuit is included in the comparison, the configuration of FIG. 8 has fewer selection circuits (MUXes) than that of FIG. 3 and is therefore smaller in circuit scale.

FIG. 9 shows an example circuit configuration from the ALUs 2 to the memory 5 when the storage unit 4 of FIG. 8 is used. Three ALUs 2 can access the pair of RAMs 5a and 5b of FIG. 8. A 3→1 MUX 6f is provided that selects the output data of one of the 32-bit ALUs 2. The outputs of these three ALUs 2 are therefore limited to one data item regardless of the number of effective bits. The same holds for inputs to the ALUs 2: they are limited to one data item (kind) regardless of the number of effective bits. Because the input/output restriction on the ALUs 2 does not depend on the number of effective bits, the pair of RAMs 5a and 5b can be regarded as a single memory 5 and handled with the same processing as a conventional RAM configuration.

The number of effective bits of the input/output data of the ALU 2 is determined from the program 36, which describes the intended arithmetic processing to be executed by the reconfigurable circuit 1. When that processing is written in C, the number of effective bits can be determined from variable types or from comments. When creating the setting data 40, the setting data generation unit 32 uses the number of effective bits so determined to generate setting data 40 that stores the data in the RAMs 5a and 5b under the restrictions described above.
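As an illustration of determining the number of effective bits from a C variable type or a comment, the following sketch parses a single declaration. The type table, the `/* bits: N */` annotation format, and the function names are hypothetical choices made for this example. The specification does not define a comment syntax or say how the setting data generation unit 32 performs the analysis.

```python
import re

# Assumed table of fixed-width C types (from <stdint.h>) and their bit widths.
TYPE_BITS = {
    "int8_t": 8, "uint8_t": 8,
    "int16_t": 16, "uint16_t": 16,
    "int32_t": 32, "uint32_t": 32,
}

# Hypothetical annotation format for this sketch: /* bits: N */
COMMENT_RE = re.compile(r"/\*\s*bits:\s*(\d+)\s*\*/")
DECL_RE = re.compile(r"^\s*(\w+)\s+(\w+)\s*;")


def effective_bits(declaration):
    """Return (variable name, effective bit count) for one simple C declaration."""
    decl = DECL_RE.match(declaration)
    if decl is None or decl.group(1) not in TYPE_BITS:
        raise ValueError(f"unsupported declaration: {declaration!r}")
    bits = TYPE_BITS[decl.group(1)]
    note = COMMENT_RE.search(declaration)
    if note is not None:
        # A comment may narrow the width implied by the type, never widen it.
        bits = min(bits, int(note.group(1)))
    return decl.group(2), bits


def storage_mode(bits, word_bits=16):
    """Choose between one-word storage and split storage for a given width."""
    return "single word" if bits <= word_bits else "split across words"
```

For example, `effective_bits("int32_t acc; /* bits: 12 */")` yields `("acc", 12)`, which `storage_mode` maps to single-word storage, while an unannotated `int32_t` variable would be split across two words.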

FIG. 10 shows a configuration of the storage unit 4 that uses a plurality of pairs of RAMs 5a, 5b, 5c, and 5d. FIGS. 8 and 9 show a single pair of RAMs, but a plurality of pairs may be used, as in FIG. 10. FIG. 10 is an example in which two pairs, RAMs 5a and 5b and RAMs 5c and 5d, are each used in the storage unit configuration of FIG. 8 or FIG. 9. In this case, the input/output data is limited to two data (two kinds) regardless of the number of effective bits.

The storage unit 4 configurations of FIG. 3 and FIG. 8 impose different input/output restrictions on the ALUs 2. Which configuration to adopt is decided according to the characteristics of the system to which the processing apparatus 10 is applied. The two embodiments may also be combined.
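The difference between the two input/output restrictions can be expressed as two simple checks. This is a sketch, assuming 16-bit memory words and data of at most 32 bits. The function names are hypothetical, and the FIG. 3 style check follows the word-count condition stated in the claims rather than any circuit detail.

```python
def words_needed(effective_bits, word_bits=16):
    """Memory words one datum occupies: ceiling of effective_bits / word_bits."""
    return -(-effective_bits // word_bits)


def fits_fig3_style(effective_bits_list, ram_count, word_bits=16):
    """FIG. 3 style: total words used must not exceed the RAMs accessible at once."""
    total = sum(words_needed(bits, word_bits) for bits in effective_bits_list)
    return total <= ram_count


def fits_fig8_style(effective_bits_list, pair_count):
    """FIG. 8 style: at most one datum per RAM pair, whatever its width."""
    return len(effective_bits_list) <= pair_count
```

With one pair of RAMs, two 12-bit values pass the FIG. 3 style check (two words in two RAMs) but fail the FIG. 8 style check (two data, one pair). With two pairs, as in FIG. 10, the FIG. 8 style check accepts two data of any width.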

This embodiment, too, has been described for the case in which 32-bit data is divided into upper 16 bits and lower 16 bits and stored in the storage unit 4, but the data width and the width stored in the storage unit are not limited to 32 bits and 16 bits. For example, 16-bit data may be divided into two 8-bit parts for storage. Likewise, 64-bit data may be divided into upper 32 bits and lower 32 bits and stored in the storage unit. A configuration that stores data in two parts, upper and lower, is the same as the configurations of FIGS. 8 and 9.

Furthermore, data may be divided into more than two parts and stored in the storage unit. The number of bits of the memory word specified by one address need not be the same for each of the paired RAMs.
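Division into several parts of unequal width, and the matching concatenation on read, can be sketched as follows. The function names and the example widths of 16, 8, and 8 bits are assumptions chosen for illustration. The specification does not prescribe particular widths.

```python
def split_parts(value, widths):
    """Split value into parts, most significant first, with widths[i] bits each."""
    parts = []
    shift = sum(widths)
    for width in widths:
        shift -= width
        parts.append((value >> shift) & ((1 << width) - 1))
    return parts


def join_parts(parts, widths):
    """Concatenate parts (most significant first) back into one value."""
    value = 0
    for part, width in zip(parts, widths):
        value = (value << width) | part
    return value


widths = [16, 8, 8]                      # hypothetical RAM word widths
parts = split_parts(0x12345678, widths)  # one part per RAM
restored = join_parts(parts, widths)
```

Each part would be written to the RAM whose word width matches it, and reading all parts back restores the original 32-bit value.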

In FIGS. 8 and 9, registers 7, 7e, and so on are drawn on the input side and output side of the memory 5 to aid understanding, but these need not be circuits that hold data; they may simply be data lines.

According to the present invention, data can be stored efficiently in the storage unit 4, so the total memory capacity is reduced compared with the conventional configuration and the circuit scale can be reduced.

The circuit configurations of the processing apparatus 10, the reconfigurable circuit 1, and the storage unit 4 described in the embodiments are examples and may be changed and modified as desired. The configurations of the storage unit 4 shown in the embodiments are not exhaustive, and the storage unit 4 is not limited to them.

FIG. 1 is a block diagram showing the configuration of a processing apparatus including a reconfigurable circuit according to an embodiment of the present invention.
FIG. 2 is a block diagram showing an example of the configuration of the reconfigurable circuit according to the embodiment of the present invention.
FIG. 3 is a diagram showing an example of the configuration of a storage unit that stores data used by the reconfigurable circuit in the processing apparatus according to the present invention.
FIG. 4 is a diagram showing an example of the circuit configuration from the ALU to the memory when the storage unit configuration of FIG. 3 is used.
FIG. 5 is a diagram showing the configuration of a storage unit using a plurality of pairs of RAMs.
FIG. 6 is a diagram showing an example of the configuration of a storage unit in which data output from a 64-bit ALU array is stored in four 16-bit RAMs.
FIG. 7 is a diagram showing an example of a storage unit whose memory is composed of three RAMs with different bit widths.
FIG. 8 is a diagram showing a different example of the configuration of a storage unit that stores data used by the reconfigurable circuit in the processing apparatus according to the present invention.
FIG. 9 is a diagram showing an example of the circuit configuration from the ALU to the memory when the storage unit configuration of FIG. 8 is used.
FIG. 10 is a diagram showing the configuration of a storage unit using a plurality of pairs of RAMs.
FIG. 11 is a diagram showing an example of a storage unit in a conventional reconfigurable circuit.

Explanation of symbols

1 Reconfigurable circuit
2 Logic circuit (ALU)
3 Connection unit
4 Storage unit
5 Memory
5a, 5b, 5c, 5d, 5e, 5f RAM
6a, 6b, 6c, 6d, 6e, 6f Selection circuit
7, 7a, 7b, 7c, 7d, 7e, 7f Register
7g, 7h Enlargement circuit
8, 8a, 8b Bit extension circuit
9a, 9b, 9c Combining circuit
10 Processing apparatus
30 Compilation unit
32 Setting data generation unit
34 Setting storage unit
36 Program
38 Data flow graph
40 Setting data

Claims

A processing apparatus including a reconfigurable circuit that comprises an arithmetic unit composed of a plurality of logic circuits, each capable of selectively executing a plurality of arithmetic logic operation functions, and a connection unit that maintains a connection relationship between the plurality of logic circuits, the processing apparatus comprising:
storage means for storing data used in the logic circuits;
degenerate storage means for, when the number of bits of data to be stored in the storage means exceeds the number of bits of a memory word specified by one address of the storage means and the number of effective bits of the data does not exceed the number of bits of the memory word, changing the data to a number of bits equal to or less than the number of bits of the memory word and storing it in one memory word; and
extended read means for, when the data stored by the degenerate storage means has been changed to a number of bits equal to or less than the number of bits of the memory word, restoring the data to match the number of input bits of the logic circuit.

The processing apparatus according to claim 1, further comprising:
divided storage means for, when the number of bits of data to be stored in the storage means exceeds the number of bits of the memory word and the number of effective bits of the data also exceeds the number of bits of the memory word, dividing the data into a plurality of parts each having the number of bits of the memory word and storing the parts in a plurality of memory words; and
concatenated read means for restoring the data that the divided storage means has divided and stored in the plurality of memory words to its original number of bits when the data is read from the plurality of memory words.

The processing apparatus according to claim 2, wherein the divided storage means restricts the storage area in which each of the parts, divided to the number of bits of the memory word of the storage means, is stored, according to the bit position of that part in the original data.

The processing apparatus according to claim 1, wherein the divided storage means or the degenerate storage means limits the number of data output from the arithmetic unit according to the number of effective bits of the data.

The processing apparatus according to any one of claims 1 to 4, wherein the concatenated read means or the extended read means limits the number of data input to the arithmetic unit according to the number of effective bits of the data.

The processing apparatus according to claim 4, wherein the divided storage means or the degenerate storage means limits the number of data simultaneously output from the arithmetic unit so that the total number of memory words used, which depends on the number of effective bits of each datum output from the arithmetic unit, does not exceed the number of memory words that can be written simultaneously.

The processing apparatus according to claim 5, wherein the concatenated read means or the extended read means limits the number of data simultaneously input to the arithmetic unit so that the total number of memory words used, which depends on the number of effective bits of each datum input to the arithmetic unit, does not exceed the number of memory words that can be read simultaneously.

The processing apparatus according to any one of claims 1 to 7, wherein, in generating setting data that controls the operation of the reconfigurable circuit by setting the plurality of arithmetic logic operation functions selectively executed in the plurality of logic circuits and the connection relationship between the logic circuits, the number of effective bits of the input and output data of the logic circuits is determined based on a program describing the intended arithmetic processing to be executed by the plurality of logic circuits, and the setting data is generated.