JP2006004226A

JP2006004226A - Data arithmetic unit

Info

Publication number: JP2006004226A
Application number: JP2004180639A
Authority: JP
Inventors: Shiro Kobayashi; 士朗小林; Isamu Kozuka; 勇小塚
Original assignee: Asahi Kasei Corp
Current assignee: Asahi Kasei Corp
Priority date: 2004-06-18
Filing date: 2004-06-18
Publication date: 2006-01-05

Abstract

<P>PROBLEM TO BE SOLVED: To provide a data arithmetic unit suitable for improving the processing efficiency of a processor and reducing a program cost and a manufacturing cost. <P>SOLUTION: A data arithmetic part 12 comprises a register file composed of a plurality of registers and a computing element 202 performing operation by use of data of the register file 200. The data access between the register file 200 and a data memory 10 is performed in a group register unit consisting of four registers. The data arithmetic unit further comprises an index register file 208 storing, in case that two group registers are grouped as a register block, element index data designating the position of data in the register block, and selectors 204 and 206 selectively outputting, based on the element index data of the index register file 208, data of any one register among the registers of the register block to the computing element 202. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、データを演算する装置および方法に係り、特に、プロセッサの処理効率を向上し、プログラムコストおよび製造コストを低減するのに好適なデータ演算装置に関する。 The present invention relates to an apparatus and method for calculating data, and more particularly to a data calculation apparatus suitable for improving the processing efficiency of a processor and reducing program cost and manufacturing cost.

デジタル信号処理をプロセッサで行う場合、メモリに対する１回のデータアクセスで、複数のデータをまとめて読み込んだり書き込んだりすることがある。これは、例えば、演算器で語長の長いデータが必要な場合や、処理効率を向上するために演算器で並列処理を行う場合に生じる。
３２ビットのＲＩＳＣプロセッサは、記憶容量が１バイト（８ビット）のレジスタを４つ単位でまとめてグループレジスタを構成し、レジスタファイルとメモリとの間のデータアクセスをグループレジスタ単位（４バイト単位）で行う。メモリに対しては、４の整数倍のアドレスを先頭アドレスとして４バイトのデータをグループレジスタにまとめて読み込み、４の整数倍のアドレスを先頭アドレスとしてグループレジスタのデータをまとめて書き込む。例えば、メモリの先頭から順にデータを読み込む場合、グループレジスタには、１回目のデータアクセスで0000h〜0003h番地のアドレスのデータが、２回目のデータアクセスで0004h〜0007h番地のアドレスのデータがそれぞれ読み込まれる。 When digital signal processing is performed by a processor, a plurality of data may be collectively read or written by one data access to the memory. This occurs, for example, when data having a long word length is required by the arithmetic unit, or when parallel processing is performed by the arithmetic unit in order to improve processing efficiency.
The 32-bit RISC processor configures a group register by grouping registers each having a storage capacity of 1 byte (8 bits) in units of four, and data access between the register file and the memory is performed in units of group registers (units of four bytes). To do. For the memory, 4 bytes of data are collectively read into the group register with an address that is an integer multiple of 4 as the start address, and the group register data is collectively written with an address that is an integer multiple of 4 as the start address. For example, when reading data sequentially from the beginning of the memory, the group register reads the data at addresses 0000h to 0003h at the first data access and the data at addresses 0004h to 0007h at the second data access. It is.

このようなデータアクセスを行う際、メモリ上のデータの先頭アドレスがプロセッサのアクセス単位の整数倍でない場合に問題が生じる。この状況は、メモリ上のデータのアドレス境界（以下、データ境界という。）が、プロセッサのアクセス可能なアドレス境界（以下、アクセス境界という。）と一致していないため、境界不整列データアクセスと呼ばれる。 When performing such data access, a problem arises when the top address of the data on the memory is not an integral multiple of the access unit of the processor. This situation is called boundary-misaligned data access because the address boundary of data on the memory (hereinafter referred to as data boundary) does not match the address boundary accessible by the processor (hereinafter referred to as access boundary). .

境界不整列データアクセスが引き起こす問題として、処理効率の低下を挙げることができる。３２ビットのＲＩＳＣプロセッサの場合、アクセス境界は、４の整数倍である。４バイト長のデータを処理する場合、４バイト長のデータをメモリから読み込むが、そのデータの先頭アドレスが４の倍数でないと、所望の４バイトを取得するために２回のデータアクセスが必要となり、さらにその並べ替えをソフトウェアで行うため、処理効率が低下する。このような問題を解決するため、ＲＩＳＣプロセッサ用のプログラムでは、コンパイル時に、データ境界とアクセス境界が一致するようにすべてのデータを配置し、実行時には境界不整列データアクセスが起こらないようにしている。
しかしながら、データの論理構造が重複している場合は、コンパイラでは対処することができない。 A problem caused by boundary misaligned data access can be a reduction in processing efficiency. For a 32-bit RISC processor, the access boundary is an integer multiple of four. When processing data of 4 bytes length, data of 4 bytes length is read from the memory, but if the head address of the data is not a multiple of 4, two data accesses are required to obtain the desired 4 bytes. Further, since the rearrangement is performed by software, the processing efficiency is lowered. In order to solve such a problem, in the RISC processor program, all data is arranged so that the data boundary and the access boundary coincide with each other at the time of compilation, so that the unaligned data access does not occur at the time of execution. .
However, if the logical structure of data overlaps, the compiler cannot handle it.

図１０は、メモリ上のデータの配置を示す図である。
図１０において、メモリには、0000h番地〜000Fh番地のアドレスに１６バイトのデータs[0]〜s[15]が格納され、0100h番地〜010Fh番地のアドレスに１６バイトのデータr[0]〜r[15]が格納されている。
データの論理構造が重複しているとは、s[0]〜s[3]が１つのグループデータを構成するが、s[1]〜s[4]、s[2]〜s[5]およびs[3]〜s[6]もそれぞれ１つのグループデータを構成する場合である。例えば、下式（１）〜（４）に示すような相関演算を行う場合には、境界不整列データアクセスを回避することができない。 FIG. 10 is a diagram showing the arrangement of data on the memory.
In FIG. 10, the memory stores 16 bytes of data s [0] to s [15] at addresses 0000h to 000Fh, and 16 bytes of data r [0] to addresses 0100h to 010Fh. r [15] is stored.
The logical structure of data is duplicated. S [0] to s [3] constitute one group data, but s [1] to s [4] and s [2] to s [5] And s [3] to s [6] also constitute one group data. For example, when performing a correlation operation as shown in the following equations (1) to (4), it is not possible to avoid boundary misaligned data access.

p[0] = r[0]*s[0] + r[1]*s[1] + r[2]*s[2] + r[3]*s[3] + … …（１）
p[1] = r[0]*s[1] + r[1]*s[2] + r[2]*s[3] + r[3]*s[4] + … …（２）
p[2] = r[0]*s[2] + r[1]*s[3] + r[2]*s[4] + r[3]*s[5] + … …（３）
p[3] = r[0]*s[3] + r[1]*s[4] + r[2]*s[5] + r[3]*s[6] + … …（４） p [0] = r [0] * s [0] + r [1] * s [1] + r [2] * s [2] + r [3] * s [3] +… (1)
p [1] = r [0] * s [1] + r [1] * s [2] + r [2] * s [3] + r [3] * s [4] +… (2)
p [2] = r [0] * s [2] + r [1] * s [3] + r [2] * s [4] + r [3] * s [5] +… (3)
p [3] = r [0] * s [3] + r [1] * s [4] + r [2] * s [5] + r [3] * s [6] +… (4)

さらに、プログラムも４通り作成しなければならない。
まず、p[0] を求めるには、次の（１）〜（５）のステップを含むプログラムになる。p[4]、p[8]、…、p[4n]についても同様である。
（１）グループレジスタにs[0]〜s[3]を読み込む。
（２）〜（５）グループレジスタのデータに基づいて、r[0]*s[0]、r[1]*s[1]、r[2]*s[2]、r[3]*s[3]を演算する。 In addition, four programs must be created.
First, to obtain p [0], the program includes the following steps (1) to (5). The same applies to p [4], p [8], ..., p [4n].
(1) Read s [0] to s [3] into the group register.
(2) to (5) Based on the group register data, r [0] * s [0], r [1] * s [1], r [2] * s [2], r [3] * Calculate s [3].

次に、p[1] を求めるには、次の（１）〜（６）のステップを含むプログラムになる。p[5]、p[9]、…、p[4n+1]についても同様である。
（１）グループレジスタにs[0]〜s[3]を読み込む。
（２）〜（４）グループレジスタのデータに基づいて、r[0]*s[1]、r[1]*s[2]、r[2]*s[3]を演算する。
（５）グループレジスタにs[4]〜s[7]を読み込む。
（６）グループレジスタのデータに基づいて、r[3]*s[4]を演算する。 Next, to obtain p [1], the program includes the following steps (1) to (6). The same applies to p [5], p [9],..., p [4n + 1].
(1) Read s [0] to s [3] into the group register.
(2) to (4) Calculate r [0] * s [1], r [1] * s [2], r [2] * s [3] based on the data in the group register.
(5) Read s [4] to s [7] into the group register.
(6) Calculate r [3] * s [4] based on the group register data.

次に、p[2] を求めるには、次の（１）〜（６）のステップを含むプログラムになる。p[6]、p[10]、…、p[4n+2]についても同様である。
（１）グループレジスタにs[0]〜s[3]を読み込む。
（２），（３）グループレジスタのデータに基づいて、r[0]*s[2]、r[1]*s[3]を演算する。
（４）グループレジスタにs[4]〜s[7]を読み込む。
（５），（６）グループレジスタのデータに基づいて、r[2]*s[4]、r[3]*s[5]を演算する。 Next, to obtain p [2], the program includes the following steps (1) to (6). The same applies to p [6], p [10],..., p [4n + 2].
(1) Read s [0] to s [3] into the group register.
(2), (3) Calculate r [0] * s [2] and r [1] * s [3] based on the group register data.
(4) Read s [4] to s [7] into the group register.
(5), (6) Calculate r [2] * s [4] and r [3] * s [5] based on the group register data.

次に、p[3] を求めるには、次の（１）〜（６）のステップを含むプログラムになる。p[7]、p[11]、…、p[4n+3]についても同様である。
（１）グループレジスタにs[0]〜s[3]を読み込む。
（２）グループレジスタのデータに基づいて、r[0]*s[3]を演算する。
（３）グループレジスタにs[4]〜s[7]を読み込む。
（４）〜（６）グループレジスタのデータに基づいて、r[1]*s[4]、r[2]*s[5]、r[3]*s[6]を演算する。 Next, to obtain p [3], the program includes the following steps (1) to (6). The same applies to p [7], p [11], ..., p [4n + 3].
(1) Read s [0] to s [3] into the group register.
(2) Calculate r [0] * s [3] based on the group register data.
(3) Read s [4] to s [7] into the group register.
(4) to (6) r [1] * s [4], r [2] * s [5], and r [3] * s [6] are calculated based on the group register data.

このように、データの論理構造が重複している場合は、境界不整列データアクセスが生じてしまうため、処理効率が低下し、プログラムコスト（プログラムの作成に要する手間、プログラムのデータ容量および複雑さなどをいう。）が増加するという問題がある。このような問題を解決するため、特許文献１および非特許文献１記載のデジタル信号プロセッサが提案されている。 As described above, when the logical structure of the data overlaps, boundary-aligned data access occurs, so that the processing efficiency is reduced, and the program cost (the time required for creating the program, the data capacity and complexity of the program) is reduced. There is a problem that increases. In order to solve such problems, digital signal processors described in Patent Document 1 and Non-Patent Document 1 have been proposed.

図１１は、従来のデジタル信号プロセッサの構成を示すブロック図である。
特許文献１および非特許文献１記載の発明は、図１１に示すように、レジスタファイル２００と、データを整列するデータ整列バッファ９００とを備え、データ整列バッファ９００を介してデータメモリ１０からレジスタファイル２００にデータを読み込む。
データ整列バッファ９００は、整列バッファ９１０およびマルチプレクサ／バレルシフタ９１２を含む。 FIG. 11 is a block diagram showing the configuration of a conventional digital signal processor.
As shown in FIG. 11, the invention described in Patent Document 1 and Non-Patent Document 1 includes a register file 200 and a data alignment buffer 900 that aligns data, and the register file is transferred from the data memory 10 via the data alignment buffer 900. 200 reads the data.
Data alignment buffer 900 includes an alignment buffer 910 and a multiplexer / barrel shifter 912.

マルチプレクサ／バレルシフタ９１２は、データメモリ１０からの３２ビットと、整列バッファ９１０の出力からの３２ビットとを含む６４ビットのデータを受け取り、与えられたオフセット信号に応答して、６４ビットのデータ中３２ビットをその入力として選択し、選択した３２ビットのデータをレジスタファイル２００に供給する。
データ整列バッファ９００は、境界整列データアクセス（データ境界がアクセス境界と一致している状況をいう。）および境界不整列データアクセスに用いることができる。境界整列データアクセスでは、マルチプレクサ／バレルシフタ９１２を介して、指定したオペランドを単一のメモリ行からレジスタファイル２００に供給する。境界不整列データアクセスでは、整列バッファ９１０およびマルチプレクサ／バレルシフタ９１２を介して、指定したオペランドを２つのメモリ行からレジスタファイル２００に供給する。 The multiplexer / barrel shifter 912 receives 64 bits of data including 32 bits from the data memory 10 and 32 bits from the output of the alignment buffer 910, and in response to a given offset signal, 32 of the 64 bits of data. A bit is selected as its input, and the selected 32-bit data is supplied to the register file 200.
The data alignment buffer 900 can be used for boundary aligned data access (referring to a situation where the data boundary matches the access boundary) and boundary unaligned data access. In aligned data access, a specified operand is supplied to the register file 200 from a single memory line via the multiplexer / barrel shifter 912. In boundary misaligned data access, the specified operand is supplied to the register file 200 from two memory lines via the alignment buffer 910 and the multiplexer / barrel shifter 912.

図１２は、従来のデジタル信号プロセッサの動作を説明するための図である。
図１２において、データメモリ１０には、0000h番地〜0007h番地のアドレスに８バイトのデータs[0]〜s[7]が格納されている。
s[1]〜s[4]を読み込む境界不整列データアクセスを行う場合は、まず、s[0]〜s[3]を整列バッファ９１０に読み込む。次いで、例えば、図１２（ａ）に示すように、「０」を示すオフセット信号をマルチプレクサ／バレルシフタ９１２に与えると、マルチプレクサ／バレルシフタ９１２によりデータの選択が行われ、レジスタファイル２００にs[1]〜s[4]が供給される。 FIG. 12 is a diagram for explaining the operation of a conventional digital signal processor.
In FIG. 12, the data memory 10 stores 8-byte data s [0] to s [7] at addresses 0000h to 0007h.
When performing boundary misalignment data access for reading s [1] to s [4], first, s [0] to s [3] are read into the alignment buffer 910. Next, for example, as shown in FIG. 12A, when an offset signal indicating “0” is given to the multiplexer / barrel shifter 912, data selection is performed by the multiplexer / barrel shifter 912, and s [1] is stored in the register file 200. ~ S [4] is supplied.

また、s[2]〜s[5]を読み込む境界不整列データアクセスを行う場合は、例えば、図１２（ｂ）に示すように、「１」を示すオフセット信号をマルチプレクサ／バレルシフタ９１２に与えると、マルチプレクサ／バレルシフタ９１２によりデータの選択が行われ、レジスタファイル２００にs[2]〜s[5]が供給される。
また、s[3]〜s[6]を読み込む境界不整列データアクセスを行う場合は、例えば、図１２（ｃ）に示すように、「２」を示すオフセット信号をマルチプレクサ／バレルシフタ９１２に与えると、マルチプレクサ／バレルシフタ９１２によりデータの選択が行われ、レジスタファイル２００にs[3]〜s[6]が供給される。
特表２００２−５０９３１２号公報「ADSP-TS201 TigerSHARCR Processor Programming Reference」Analog Devices, Inc.,、Revision 0.1, June 2003, Part Number 82-000810-01、Ｐ．７−２４ Further, when performing boundary misalignment data access for reading s [2] to s [5], for example, as shown in FIG. 12B, an offset signal indicating “1” is given to the multiplexer / barrel shifter 912. The multiplexer / barrel shifter 912 selects data, and s [2] to s [5] are supplied to the register file 200.
Further, when performing boundary misalignment data access for reading s [3] to s [6], for example, as shown in FIG. 12C, an offset signal indicating “2” is given to the multiplexer / barrel shifter 912. The multiplexer / barrel shifter 912 selects data, and s [3] to s [6] are supplied to the register file 200.
Japanese translation of PCT publication No. 2002-509912 “ADSP-TS201 TigerSHARCR Processor Programming Reference” Analog Devices, Inc., Revision 0.1, June 2003, Part Number 82-000810-01, p. 7-24

しかしながら、特許文献１および非特許文献１記載の発明にあっては、少なくともプロセッサのアクセス単位に相当する記憶容量（図１１の例では、３２ビット）の整列バッファ９１０を設けなければならないため、製造コストが増加するという問題があった。
また、図１２の例では、境界不整列データアクセスのパターンを３つ示したが、この３つのアクセスパターンに対応するだけならば、オフセット信号は、２ビットですむ。しかしながら、例えば、s[0]、s[1]、s[4]、s[5]を読み込む境界不整列データアクセスなど、他のアクセスパターンにも対応しようとすると、オフセット信号のビット数がさらに必要となり、製造コストが増加するという問題があった。ちなみに、図１２の例では、境界不整列データアクセスの全アクセスパターンは、₈Ｃ₄−２＝６８通りあるので、全アクセスパターンに対応しようとすると、オフセット信号は、７ビット必要となる。
そこで、本発明は、このような従来の技術の有する未解決の課題に着目してなされたものであって、プロセッサの処理効率を向上し、プログラムコストおよび製造コストを低減するのに好適なデータ演算装置を提供することを目的としている。 However, in the inventions described in Patent Document 1 and Non-Patent Document 1, it is necessary to provide an alignment buffer 910 having a storage capacity (32 bits in the example of FIG. 11) corresponding to at least a processor access unit. There was a problem that the cost increased.
In the example of FIG. 12, three boundary misaligned data access patterns are shown. However, if only these three access patterns are supported, the offset signal may be 2 bits. However, for example, when trying to support other access patterns such as boundary misaligned data access for reading s [0], s [1], s [4], and s [5], the number of bits of the offset signal is further increased. There is a problem that the manufacturing cost increases. Incidentally, in the example of FIG. 12, since there are ₈ C ₄ −2 = 68 total access patterns for boundary misaligned data access, 7 bits are required for the offset signal in order to support all access patterns.
Therefore, the present invention has been made paying attention to such an unsolved problem of the conventional technology, and is suitable for improving the processing efficiency of the processor and reducing the program cost and the manufacturing cost. The object is to provide an arithmetic device.

上記目的を達成するために、本発明に係る請求項１記載のデータ演算装置は、複数のレジスタからなるレジスタファイルと、前記レジスタのデータを用いて演算を行う演算器とを備え、前記レジスタファイルと外部との間のデータアクセスを、複数の前記レジスタからなるグループレジスタ単位で行うデータ演算装置であって、インデックスデータを記憶するインデックスレジスタと、前記インデックスレジスタのインデックスデータに基づいて、前記レジスタファイルのレジスタのうち少なくとも２つの前記グループレジスタのなかから、１または複数の前記レジスタのデータを選択して前記演算器に出力するセレクタとを備える。 In order to achieve the above object, a data operation device according to claim 1 according to the present invention comprises: a register file comprising a plurality of registers; and an arithmetic unit that performs an operation using data in the register, wherein the register file A data arithmetic unit that performs data access between a register and an external unit in a group register unit including a plurality of registers, the index file storing index data, and the register file based on the index data of the index register And a selector that selects data of one or a plurality of the registers from at least two of the group registers and outputs the selected data to the arithmetic unit.

このような構成であれば、まず、境界不整列データアクセスが生じる可能性のあるデータ群を少なくとも２つのグループレジスタにまとめて読み込む。そして、インデックスレジスタにインデックスデータを格納すると、セレクタにより、インデックスレジスタのインデックスデータに基づいて、それらグループレジスタのなかから１または複数のレジスタのデータが選択されて演算器に出力され、演算器により演算が行われる。したがって、インデックスデータの格納、セレクタによる選択、および演算器による演算を繰り返し行えば、それらグループレジスタにまとめて読み込まれたデータ群のなかで演算に必要なデータが逐次選択されて演算器に出力される。いずれのデータを演算に用いるかは、インデックスデータによって選択することができるので、境界整列データアクセスとなるグループデータも、境界不整列データアクセスとなるグループデータも区別なく取り扱うことができる。すなわち、境界整列データアクセスとなるグループデータについては、グループレジスタの先頭のレジスタから順にデータを選択していけばよいし、境界不整列データアクセスとなるグループデータについては、グループレジスタの所定箇所のレジスタから順にデータを選択していけばよい。 With such a configuration, first, a data group that may cause boundary-unaligned data access is collectively read into at least two group registers. When the index data is stored in the index register, the selector selects one or a plurality of register data from the group register based on the index data of the index register and outputs the selected data to the arithmetic unit. Is done. Therefore, if index data storage, selection by the selector, and calculation by the arithmetic unit are repeated, data necessary for the calculation is sequentially selected from the group of data read in the group register and output to the arithmetic unit. The Which data is used for the calculation can be selected by the index data, so that it is possible to handle the group data for boundary aligned data access and the group data for boundary unaligned data access without distinction. That is, for group data to be boundary aligned data access, data may be selected in order from the first register of the group register, and for group data to be boundary unaligned data access, a register at a predetermined position of the group register You can select the data in order.

ここで、セレクタは、１または複数のレジスタのデータを選択するようになっていればどのような構成であってもよく、複数のレジスタのデータを選択する場合は、例えば、グループレジスタ内で連続したレジスタのデータであってもよいし、グループレジスタ内で分散したレジスタのデータであってもよい。
また、インデックスレジスタは、インデックスデータをあらゆる手段でかつあらゆる時期に記憶するものであり、インデックスデータをあらかじめ記憶してあるものであってもよいし、インデックスデータをあらかじめ記憶することなく、本装置の動作時に外部からの入力等によってインデックスデータを記憶するようになっていてもよい。 Here, the selector may have any configuration as long as it can select data of one or a plurality of registers. For example, when selecting data of a plurality of registers, the selector is continuous in the group register. The data of the registered registers may be used, or the data of the registers distributed in the group register may be used.
The index register stores the index data by any means and at any time. The index register may store the index data in advance, or without storing the index data in advance. The index data may be stored by an external input or the like during operation.

さらに、本発明に係る請求項２記載のデータ演算装置は、複数のレジスタからなるレジスタファイルと、前記レジスタのデータを用いて演算を行う演算器とを備え、前記レジスタファイルと外部との間のデータアクセスを、複数の前記レジスタからなるグループレジスタ単位で行うデータ演算装置であって、前記レジスタファイルのレジスタのうち少なくとも２つの前記グループレジスタをレジスタブロックとしてグループ化したときに、前記レジスタブロック内のデータの位置を指定するエレメントインデックスデータを記憶するインデックスレジスタと、前記インデックスレジスタのエレメントインデックスデータに基づいて、前記レジスタブロックのレジスタのなかからいずれか１つの前記レジスタのデータを選択して前記演算器に出力するセレクタとを備える。 Furthermore, the data operation device according to claim 2 according to the present invention includes a register file composed of a plurality of registers, and an arithmetic unit that performs an operation using the data of the register, and the data operation device between the register file and the outside A data operation device that performs data access in units of a group register composed of a plurality of registers, wherein when at least two of the group registers of the register file are grouped as a register block, An index register for storing element index data for designating a position of data; and based on the element index data of the index register, the data of any one of the registers is selected from the registers of the register block, and the arithmetic unit Out And a selector that.

このような構成であれば、まず、境界不整列データアクセスが生じる可能性のあるデータ群をレジスタブロックのレジスタにまとめて読み込む。そして、インデックスレジスタにエレメントインデックスデータを格納すると、セレクタにより、インデックスレジスタのエレメントインデックスデータに基づいて、レジスタブロックのレジスタのなかからいずれか１つのレジスタのデータが選択されて演算器に出力され、演算器により演算が行われる。したがって、エレメントインデックスデータの格納、セレクタによる選択、および演算器による演算を繰り返し行えば、レジスタブロックのレジスタにまとめて読み込まれたデータ群のなかで演算に必要なデータが逐次選択されて演算器に出力される。いずれのデータを演算に用いるかは、インデックスデータによって選択することができるので、境界整列データアクセスとなるグループデータも、境界不整列データアクセスとなるグループデータも区別なく取り扱うことができる。すなわち、境界整列データアクセスとなるグループデータについては、グループレジスタの先頭のレジスタから順にデータを選択していけばよいし、境界不整列データアクセスとなるグループデータについては、グループレジスタの所定箇所のレジスタから順にデータを選択していけばよい。 With such a configuration, first, a data group that may cause a boundary misaligned data access is collectively read into the register of the register block. When the element index data is stored in the index register, the selector selects one register data from the register block registers based on the element index data in the index register, and outputs the selected register data to the arithmetic unit. The calculation is performed by the instrument. Therefore, if the element index data is stored, the selection by the selector, and the operation by the arithmetic unit are repeated, the data necessary for the operation is sequentially selected from the data group read together in the register of the register block, and the operation unit is selected. Is output. Which data is used for the calculation can be selected by the index data, so that it is possible to handle the group data for boundary aligned data access and the group data for boundary unaligned data access without distinction. That is, for group data to be boundary aligned data access, data may be selected in order from the first register of the group register, and for group data to be boundary unaligned data access, a register at a predetermined position of the group register You can select the data in order.

ここで、グループレジスタのグループ化は、物理的に行ってもよいし、論理的に行ってもよい。前者の場合、例えば、レジスタファイルのなかで、レジスタブロックに属するグループレジスタと、他のレジスタとをそれぞれ別々のものとして構成することができる。以下、請求項７記載のデータ演算方法において同じである。
また、インデックスレジスタは、エレメントインデックスデータをあらゆる手段でかつあらゆる時期に記憶するものであり、エレメントインデックスデータをあらかじめ記憶してあるものであってもよいし、エレメントインデックスデータをあらかじめ記憶することなく、本装置の動作時に外部からの入力等によってエレメントインデックスデータを記憶するようになっていてもよい。 Here, grouping of the group registers may be performed physically or logically. In the former case, for example, in the register file, the group register belonging to the register block and the other registers can be configured separately. Hereinafter, the same applies to the data calculation method according to the seventh aspect.
The index register stores element index data at any time and at any time, and may store element index data in advance, or without storing element index data in advance. The element index data may be stored by an external input or the like when the apparatus is operating.

さらに、本発明に係る請求項３記載のデータ演算装置は、請求項２記載のデータ演算装置において、さらに、複数の前記インデックスレジスタからなるインデックスレジスタファイルと、与えられたセレクトデータに基づいて、前記インデックスレジスタファイルのなかからいずれか１つの前記インデックスレジスタのエレメントインデックスデータを選択して前記セレクタに出力する第２セレクタとを備える。 Furthermore, the data operation device according to claim 3 according to the present invention is the data operation device according to claim 2, wherein the data operation device is further based on an index register file including a plurality of the index registers and given select data. A second selector that selects element index data of any one of the index registers from the index register file and outputs the selected element index data to the selector.

このような構成であれば、セレクトデータが与えられると、第２セレクタにより、インデックスレジスタファイルのなかからいずれか１つのインデックスレジスタのエレメントインデックスデータが選択されてセレクタに出力され、セレクタにより、入力されたエレメントインデックスデータに基づいて、レジスタブロックのレジスタのなかからいずれか１つのレジスタのデータが選択されて演算器に出力される。 In such a configuration, when the select data is given, the element index data of any one of the index registers is selected from the index register file by the second selector and output to the selector, and is input by the selector. Based on the element index data, the data in one of the registers in the register block is selected and output to the computing unit.

さらに、本発明に係る請求項４記載のデータ演算装置は、請求項３記載のデータ演算装置において、さらに、前記セレクタに出力したエレメントインデックスデータを更新して前記インデックスレジスタファイルに格納するインデックスデータ更新手段を備える。
このような構成であれば、インデックスデータ更新手段により、セレクタに出力したエレメントインデックスデータが更新されてインデックスレジスタファイルに格納される。 Furthermore, the data operation device according to claim 4 according to the present invention is the data operation device according to claim 3, wherein the index data update further updates the element index data output to the selector and stores it in the index register file Means.
With such a configuration, the index data update means updates the element index data output to the selector and stores it in the index register file.

さらに、本発明に係る請求項５記載のデータ演算装置は、請求項２ないし４のいずれか１項に記載のデータ演算装置において、さらに、前記レジスタブロックを指定するバンドルインデックスデータが与えられたときに、与えられたバンドルインデックスデータ、および前記インデックスレジスタのエレメントインデックスデータに基づいて、当該バンドルインデックスデータにより指定される前記レジスタブロックおよび当該レジスタブロック内のデータの位置を示す第２エレメントインデックスデータを生成するインデックスデータ生成手段を備え、前記セレクタは、前記インデックスデータ生成手段で生成した第２エレメントインデックスデータに基づいて、複数の前記レジスタブロックのレジスタのなかからいずれか１つの前記レジスタのデータを選択して前記演算器に出力するようになっている。 Furthermore, the data operation device according to claim 5 of the present invention is the data operation device according to any one of claims 2 to 4, further comprising bundle index data specifying the register block. In addition, based on the given bundle index data and the element index data of the index register, the register block specified by the bundle index data and the second element index data indicating the position of the data in the register block are generated. Index data generating means for performing the processing, and the selector selects one of the registers of the plurality of register blocks based on the second element index data generated by the index data generating means. And outputs to the operation unit to select other data.

このような構成であれば、バンドルインデックスデータが与えられると、インデックスデータ生成手段により、与えられたバンドルインデックスデータおよびエレメントインデックスデータに基づいて、そのバンドルインデックスデータにより指定されるレジスタブロックおよびそのレジスタブロック内のデータの位置を示す第２エレメントインデックスデータが生成される。そして、セレクタにより、生成された第２エレメントインデックスデータに基づいて、複数のレジスタブロックのレジスタのなかからいずれか１つのレジスタのデータが選択されて演算器に出力される。 In such a configuration, when bundle index data is given, a register block designated by the bundle index data and its register block based on the given bundle index data and element index data by the index data generating means Second element index data indicating the position of the data within is generated. Then, based on the generated second element index data, the selector selects one register data from the registers of the plurality of register blocks and outputs the selected register data to the arithmetic unit.

さらに、本発明に係る請求項６記載のデータ演算装置は、請求項５記載のデータ演算装置において、前記インデックスデータ生成手段は、２以上の第１所定数の前記グループレジスタを前記レジスタブロックとして論理的にグループ化する第１バンドルモード、および前記第１所定数よりも多い第２所定数の前記グループレジスタを前記レジスタブロックとして論理的にグループ化する第２バンドルモードのうちいずれかを指定するバンドルモードデータが与えられたときに、与えられたバンドルモードデータ、前記バンドルインデックスデータおよび前記エレメントインデックスデータに基づいて、前記第２エレメントインデックスデータを生成するようになっている。 Furthermore, the data operation device according to claim 6 of the present invention is the data operation device according to claim 5, wherein the index data generation means uses two or more first predetermined number of group registers as the register block. A bundle that designates one of a first bundle mode for grouping and a second bundle mode for logically grouping a second predetermined number of group registers larger than the first predetermined number as the register block When the mode data is given, the second element index data is generated based on the given bundle mode data, the bundle index data, and the element index data.

このような構成であれば、第１バンドルモードおよび第２バンドルモードのうちいずれかを指定するバンドルモードデータが与えられると、インデックスデータ生成手段により、与えられたバンドルモードデータ、バンドルインデックスデータおよびエレメントインデックスデータに基づいて、第２エレメントインデックスデータが生成される。
一方、上記目的を達成するために、本発明に係る請求項７記載のデータ演算方法は、複数のレジスタからなるレジスタファイルと、前記レジスタのデータを用いて演算を行う演算器とを備え、前記レジスタファイルと外部との間のデータアクセスを、複数の前記レジスタからなるグループレジスタ単位で行うデータ演算装置を利用して演算を行うデータ演算方法であって、前記レジスタファイルのレジスタのうち少なくとも２つの前記グループレジスタをレジスタブロックとしてグループ化したときに、前記レジスタブロックの各グループレジスタに前記外部からデータを読み込むデータ読込ステップと、前記レジスタブロック内のデータの位置を指定するエレメントインデックスデータをインデックスレジスタに格納するインデックスデータ格納ステップと、前記インデックスレジスタのエレメントインデックスデータに基づいて、前記レジスタブロックのレジスタのなかからいずれか１つの前記レジスタのデータを選択して前記演算器に出力するデータ選択ステップと、前記演算器により演算を行う演算ステップとを含む。
さらに、本発明に係る請求項８記載のデータ演算方法は、請求項７記載のデータ演算方法において、さらに、前記インデックスデータ格納ステップ、前記データ選択ステップおよび前記演算ステップを繰り返し行う反復演算ステップを含む。 With such a configuration, when bundle mode data designating either the first bundle mode or the second bundle mode is given, the given bundle mode data, bundle index data, and elements are given by the index data generating means. Second element index data is generated based on the index data.
On the other hand, in order to achieve the above object, a data operation method according to claim 7 according to the present invention includes a register file including a plurality of registers, and an arithmetic unit that performs an operation using data of the registers, A data operation method for performing an operation using a data operation device that performs data access between a register file and the outside in units of a group register composed of a plurality of registers, wherein at least two of the registers of the register file When the group register is grouped as a register block, a data reading step for reading data from the outside into each group register of the register block, and element index data for designating the position of the data in the register block in the index register Index data to store A storage step; a data selection step of selecting data of any one of the registers from the registers of the register block based on element index data of the index register; and outputting the data to the computing unit; and And a calculation step for performing a calculation.
Furthermore, the data operation method according to claim 8 of the present invention is the data operation method according to claim 7, further comprising an iterative operation step of repeatedly performing the index data storage step, the data selection step, and the operation step. .

以上説明したように、本発明に係る請求項１記載のデータ演算装置によれば、境界不整列データアクセスとなる場合や、データの論理構造が重複している場合でも、レジスタファイルへのデータの読み込み回数が極端に増加することがないので、処理効率を向上することができるという効果が得られる。また、データの論理構造が重複している場合でも、プログラムの多少の変更で対応することができるので、プログラムコストを低減することができるという効果も得られる。さらに、インデックスレジスタおよびセレクタを設けるだけでよく、また、多くの境界不整列データアクセスのパターンに対応する場合でも、インデックスデータのビット数をさほど増やさなくてすむので、特許文献１記載の発明に比して、製造コストを低減することができるという効果も得られる。 As described above, according to the data operation device of the first aspect of the present invention, even when boundary unaligned data access occurs or when the logical structure of data overlaps, the data of the register file is stored. Since the number of readings does not increase extremely, an effect that the processing efficiency can be improved is obtained. In addition, even when the logical structure of data is duplicated, it is possible to cope with a slight change in the program, so that an effect of reducing the program cost can be obtained. Furthermore, it is only necessary to provide an index register and a selector, and even when dealing with many boundary-unaligned data access patterns, it is not necessary to increase the number of bits of the index data. And the effect that manufacturing cost can be reduced is also acquired.

さらに、本発明に係る請求項２記載のデータ演算装置によれば、境界不整列データアクセスとなる場合や、データの論理構造が重複している場合でも、レジスタファイルへのデータの読み込み回数が極端に増加することがないので、処理効率を向上することができるという効果が得られる。また、データの論理構造が重複している場合でも、プログラムの多少の変更で対応することができるので、プログラムコストを低減することができるという効果も得られる。さらに、インデックスレジスタおよびセレクタを設けるだけでよく、また、多くの境界不整列データアクセスのパターンに対応する場合でも、エレメントインデックスデータのビット数をさほど増やさなくてすむので、特許文献１記載の発明に比して、製造コストを低減することができるという効果も得られる。 Furthermore, according to the data operation device of the second aspect of the present invention, the number of times data is read into the register file is extremely small even when boundary-aligned data access occurs or when the logical structure of data is duplicated. Therefore, there is an effect that the processing efficiency can be improved. In addition, even when the logical structure of data is duplicated, it is possible to cope with a slight change in the program, so that an effect of reducing the program cost can be obtained. Furthermore, it is only necessary to provide an index register and a selector, and even when dealing with many boundary-unaligned data access patterns, it is not necessary to increase the number of bits of the element index data. In comparison, the manufacturing cost can be reduced.

さらに、本発明に係る請求項３記載のデータ演算装置によれば、複数のエレメントインデックスデータをインデックスレジスタファイルに格納しておき、セレクトデータによってそれらエレメントインデックスデータを選択的に使用することができるので、インデックスレジスタの使い勝手が向上し、プログラムを作成しやすくなるという効果が得られる。
さらに、本発明に係る請求項４記載のデータ演算装置によれば、エレメントインデックスデータを使用するたびに自動的に更新されるので、エレメントインデックスデータの更新をソフトウェアで行わなくてすみ、演算速度を向上することができるという効果が得られる。 Furthermore, according to the data arithmetic device according to claim 3 of the present invention, a plurality of element index data can be stored in the index register file, and the element index data can be selectively used by the select data. As a result, it is possible to improve the usability of the index register and to facilitate the creation of a program.
Furthermore, according to the data operation device of claim 4 according to the present invention, since the element index data is automatically updated every time it is used, it is not necessary to update the element index data by software, and the operation speed is increased. The effect that it can improve is acquired.

さらに、本発明に係る請求項５記載のデータ演算装置によれば、レジスタファイルに複数のレジスタブロックが存在する場合、バンドルインデックスデータによってそれらレジスタブロックを選択的に使用することができるので、レジスタの使い勝手が向上し、プログラムを作成しやすくなるという効果が得られる。
さらに、本発明に係る請求項６記載のデータ演算装置によれば、バンドルモードデータによってレジスタブロックの論理構造を変更することができるので、レジスタの使い勝手がさらに向上し、プログラムをさらに作成しやすくなるという効果が得られる。 Furthermore, according to the data arithmetic device according to claim 5 of the present invention, when there are a plurality of register blocks in the register file, the register blocks can be selectively used by the bundle index data. Usability is improved, and it is easy to create a program.
Furthermore, according to the data arithmetic device according to claim 6 of the present invention, since the logical structure of the register block can be changed by the bundle mode data, the usability of the register is further improved, and the program can be more easily created. The effect is obtained.

一方、本発明に係る請求項７記載のデータ演算方法によれば、データの論理構造が重複している場合でも、プログラムの多少の変更で対応することができるので、プログラムコストを低減することができるという効果も得られる。さらに、インデックスレジスタを利用するだけでよく、また、多くの境界不整列データアクセスのパターンに対応する場合でも、エレメントインデックスデータのビット数をさほど増やさなくてすむので、特許文献１記載の発明に比して、製造コストを低減することができるという効果も得られる。 On the other hand, according to the data operation method of the seventh aspect of the present invention, even if the logical structure of data is duplicated, it is possible to cope with a slight change in the program, so that the program cost can be reduced. The effect that it is possible is also acquired. Furthermore, it is only necessary to use an index register, and even when dealing with many boundary-unaligned data access patterns, it is not necessary to increase the number of bits of element index data so much, compared with the invention described in Patent Document 1. And the effect that manufacturing cost can be reduced is also acquired.

さらに、本発明に係る請求項８記載のデータ演算方法によれば、境界不整列データアクセスとなる場合や、データの論理構造が重複している場合でも、レジスタファイルへのデータの読み込み回数が極端に増加することがないので、処理効率を向上することができるという効果が得られる。 Furthermore, according to the data operation method of the present invention, the number of times data is read into the register file is extremely small even when boundary-aligned data access occurs or when the logical structure of data overlaps. Therefore, there is an effect that the processing efficiency can be improved.

以下、本発明の第１の実施の形態を図面を参照しながら説明する。図１ないし図６は、本発明に係るデータ演算装置の第１の実施の形態を示す図である。
本実施の形態は、本発明に係るデータ演算装置を、図１に示すように、デジタル信号処理を行う３２ビットＲＩＳＣ型のデジタル信号処理装置１００においてデータ演算部１２に適用したものである。 Hereinafter, a first embodiment of the present invention will be described with reference to the drawings. 1 to 6 are diagrams showing a first embodiment of a data arithmetic device according to the present invention.
In this embodiment, as shown in FIG. 1, the data operation device according to the present invention is applied to the data operation unit 12 in a 32-bit RISC type digital signal processing device 100 that performs digital signal processing.

まず、デジタル信号処理装置１００の構成を図１を参照しながら説明する。
図１は、デジタル信号処理装置１００のハードウェア構成を示すブロック図である。
デジタル信号処理装置１００は、図１に示すように、演算用のデータを記憶するデータメモリ１０と、データメモリ１０からデータを読み込んで演算を行うデータ演算部１２と、データメモリ１０のデータアドレスを生成するデータアドレス生成部１４と、デジタル信号処理装置１００全体の制御プログラムを記憶したプログラムメモリ１６と、プログラムメモリ１６の制御プログラムに基づいてデジタル信号処理装置１００全体を制御するプログラムシーケンサ１８とで構成されている。データメモリ１０、データ演算部１２、データアドレス生成部１４およびプログラムシーケンサ１８は、データを転送するための信号線であるバス２０で相互にかつデータ授受可能に接続され、プログラムメモリ１６は、プログラムシーケンサ１８に直接接続されている。 First, the configuration of the digital signal processing apparatus 100 will be described with reference to FIG.
FIG. 1 is a block diagram illustrating a hardware configuration of the digital signal processing apparatus 100.
As shown in FIG. 1, the digital signal processing apparatus 100 includes a data memory 10 that stores data for calculation, a data calculation unit 12 that reads data from the data memory 10 and performs calculation, and a data address of the data memory 10. A data address generation unit 14 to be generated, a program memory 16 that stores a control program for the entire digital signal processing device 100, and a program sequencer 18 that controls the entire digital signal processing device 100 based on the control program in the program memory 16. Has been. The data memory 10, the data operation unit 12, the data address generation unit 14, and the program sequencer 18 are connected to each other via a bus 20 that is a signal line for transferring data, and the program memory 16 is a program sequencer. 18 is directly connected.

データメモリ１０は、データ演算部１２で演算対象となるデータ、およびデータ演算部１２の演算結果のデータを格納する。また、データアドレス生成部１４またはプログラムシーケンサ１８内のレジスタの値を格納することもある。
データ演算部１２は、データメモリ１０に対して、４の整数倍のアドレス（0000h、0004h、0008h、000Ch…）をアクセス境界として４バイト単位でデータアクセスを行う。以下、データ演算部１２のアクセス境界をグループアドレスという。 The data memory 10 stores data to be calculated by the data calculation unit 12 and calculation result data of the data calculation unit 12. Further, the value of a register in the data address generation unit 14 or the program sequencer 18 may be stored.
The data operation unit 12 performs data access to the data memory 10 in units of 4 bytes using an address (0000h, 0004h, 0008h, 000Ch...) That is an integer multiple of 4 as an access boundary. Hereinafter, the access boundary of the data operation unit 12 is referred to as a group address.

データアドレス生成部１４は、データメモリ１０と、データ演算部１２、データアドレス生成部１４またはプログラムシーケンサ１８との間のデータ転送を実行する際に必要となるデータアドレスを生成する。また、データアドレスの生成に合わせて、データメモリ１０を制御するための信号を生成する。これらの動作は、プログラムシーケンサ１８からの制御信号により規定される。 The data address generation unit 14 generates a data address necessary for executing data transfer between the data memory 10 and the data calculation unit 12, the data address generation unit 14, or the program sequencer 18. Further, a signal for controlling the data memory 10 is generated in accordance with the generation of the data address. These operations are defined by control signals from the program sequencer 18.

図２は、データアドレスのデータ構造を示す図である。
データアドレスは、図２に示すように、ｍビットのデータからなり、データメモリ１０の最小記憶単位を１バイトとして２^mバイトのメモリ空間を１バイト単位でアクセス可能なアドレスを構成している。データアドレスは、上位（ｍ−２）ビットｂ_m-1，…，ｂ₂がグループアドレスを構成し、下位２ビットｂ₁，ｂ₀がグループアドレス間の細分されたアドレスを構成している。 FIG. 2 shows the data structure of the data address.
Data address, as shown in FIG. 2 consists of m bits of data, constitutes a accessible address memory space of 2 ^m bytes minimum storage unit of the data memory 10 as a byte by one byte. In the data address, upper (m−2) bits b _m−1 ,..., B ₂ constitute a group address, and lower 2 bits b ₁ and b ₀ constitute a subdivided address between group addresses.

データアドレス生成部１４は、より具体的には、複数のポインタレジスタを有し、データアドレスのうちグループアドレスをデータメモリ１０に出力し、データアドレスのうち下位２ビットをデータ演算部１２にそれぞれ出力する。
図１に戻り、プログラムシーケンサ１８は、プログラムメモリ１６から制御プログラムを読み出し、読み出した制御プログラムに従って、データメモリ１０、データ演算部１２、データアドレス生成部１４、プログラムメモリ１６およびプログラムシーケンサ１８を制御する。データメモリ１０、データ演算部１２およびデータアドレス生成部１４に対しては、バス２０を介して制御信号を出力し、プログラムメモリ１６は制御信号を直接出力する。 More specifically, the data address generation unit 14 has a plurality of pointer registers, outputs a group address among the data addresses to the data memory 10, and outputs lower 2 bits of the data address to the data operation unit 12, respectively. To do.
Returning to FIG. 1, the program sequencer 18 reads the control program from the program memory 16, and controls the data memory 10, the data calculation unit 12, the data address generation unit 14, the program memory 16, and the program sequencer 18 according to the read control program. . A control signal is output via the bus 20 to the data memory 10, the data calculation unit 12, and the data address generation unit 14, and the program memory 16 directly outputs the control signal.

次に、データ演算部１２の構成を図３および図４を参照しながら詳細に説明する。
図３は、データ演算部１２のハードウェア構成を示すブロック図である。
データ演算部１２は、図３に示すように、複数のレジスタからなるレジスタファイル２００と、レジスタファイル２００のデータを用いて演算を行う演算器２０２と、レジスタファイル２００のなかからデータを選択するセレクタ２０４，２０６と、複数のインデックスレジスタからなるインデックスレジスタファイル２０８と、インデックスレジスタファイル２０８のなかからデータを選択するセレクタ２１０，２１２と、インデックスレジスタファイル２０８のデータを更新する計算器２１４と、計算器２１４とバス２０のデータを選択するセレクタ２１６とで構成されている。 Next, the configuration of the data calculation unit 12 will be described in detail with reference to FIGS. 3 and 4.
FIG. 3 is a block diagram illustrating a hardware configuration of the data calculation unit 12.
As shown in FIG. 3, the data operation unit 12 includes a register file 200 composed of a plurality of registers, a calculator 202 that performs operations using data in the register file 200, and a selector that selects data from the register file 200. 204, 206, an index register file 208 composed of a plurality of index registers, selectors 210, 212 for selecting data from the index register file 208, a calculator 214 for updating data in the index register file 208, and a calculator And a selector 216 for selecting data on the bus 20.

レジスタファイル２００は、ｎ個のグループレジスタＲ₀〜Ｒ_n-1からなる。グループレジスタＲ₀は、４個のレジスタＲ₀[0]〜Ｒ₀[3]からなり、各レジスタＲ₀[0]〜Ｒ₀[3]は、１バイトの記憶容量を有している。他のグループレジスタＲ₁〜Ｒ_n-1についても同様の構成となっている。
セレクタ２０４は、グループレジスタＲ₀，Ｒ₁をレジスタブロックＡとしてレジスタブロックＡの各レジスタと接続し、セレクタ２１０からのセレクトデータに基づいて、レジスタブロックＡのレジスタのなかからいずれか１つのレジスタのデータを選択し、選択したデータを演算器２０２の第１オペランドとして演算器２０２に出力する。 The register file 200 includes n group registers R _{0 to} R _n−1 . The group register R ₀ includes four registers R ₀ [0] to R ₀ [3], and each register R ₀ [0] to R ₀ [3] has a storage capacity of 1 byte. The other group registers R _{1 to} R _n-1 have the same configuration.
The selector 204 connects the group registers R ₀ and R ₁ as the register block A to each register of the register block A, and selects one of the registers of the register block A based on the select data from the selector 210. Data is selected, and the selected data is output to the computing unit 202 as the first operand of the computing unit 202.

セレクタ２１０からのセレクトデータは、３ビットからなる。セレクタ２０４は、セレクトデータの最上位ビットが「０」であるときは、グループレジスタＲ₀のうち下位２ビットの値により指定されるレジスタのデータを選択する。すなわち、セレクトデータが「０００」であるときはレジスタＲ₀[0]のデータを、「００１」であるときはレジスタＲ₀[1]のデータを、「０１０」であるときはレジスタＲ₀[2]のデータを、「０１１」であるときはレジスタＲ₀[3]のデータをそれぞれ選択する。また、セレクトデータの最上位ビットが「１」であるときは、グループレジスタＲ₁のうち下位２ビットの値により指定されるレジスタのデータを選択する。すなわち、セレクトデータが「１００」であるときはレジスタＲ₁[0]のデータを、「１０１」であるときはレジスタＲ₁[1]のデータを、「１１０」であるときはレジスタＲ₁[2]のデータを、「１１１」であるときはレジスタＲ₁[3]のデータをそれぞれ選択する。 The select data from the selector 210 consists of 3 bits. When the most significant bit of the select data is “0”, the selector 204 selects the data of the register designated by the value of the lower 2 bits of the group register R ₀ . That is, when the select data is “000”, the data in the register R ₀ [0] is stored. When the select data is “001”, the data in the register R ₀ [1] is stored. When the select data is “010”, the register R ₀ [ ₀ ] is displayed. When the data in [2] is “011”, the data in the register R ₀ [3] is selected. When the most significant bit of the select data is “1”, the register data designated by the value of the lower 2 bits of the group register R ₁ is selected. In other words, the select data is the data of "100" when it is the register R ₁ [0], the data of "101" at which time the register R ₁ [1], "110" when it is the register R ₁ [ When the data of 2] is “111”, the data of the register R ₁ [3] is selected.

セレクタ２０６は、グループレジスタＲ₂，Ｒ₃をレジスタブロックＢとしてレジスタブロックＢの各レジスタと接続し、セレクタ２１２からのセレクトデータに基づいて、レジスタブロックＢのレジスタのなかからいずれか１つのレジスタのデータを選択し、選択したデータを演算器２０２の第２オペランドとして演算器２０２に出力する。データの選択手順は、セレクタ２０４と同様である。
インデックスレジスタファイル２０８は、エレメントインデックスデータを各インデックスレジスタに記憶する。 The selector 206 connects the group registers R ₂ and R ₃ as the register block B to each register of the register block B, and selects one of the registers of the register block B based on the select data from the selector 212. Data is selected, and the selected data is output to the computing unit 202 as the second operand of the computing unit 202. The data selection procedure is the same as that of the selector 204.
The index register file 208 stores element index data in each index register.

図４は、エレメントインデックスデータのデータ構造を示す図である。
エレメントインデックスデータは、図４に示すように、３ビットのデータからなり、最上位ビットｂ₂がグループレジスタＲ₀，Ｒ₁（Ｒ₂，Ｒ₃）の別を示し、下位２ビットｂ₁，ｂ₀がグループレジスタの各レジスタの別を示す。エレメントインデックスデータの各ビットは、セレクタ２１０，２１２からのセレクトデータの各ビットに対応している。また、エレメントインデックスデータの下位２ビットｂ₁，ｂ₀は、データアドレスの下位２ビットｂ₁，ｂ₀に対応している。 FIG. 4 is a diagram showing the data structure of element index data.
As shown in FIG. 4, the element index data is composed of 3-bit data. The most significant bit b ₂ indicates the group register R ₀ , R ₁ (R ₂ , R ₃ ), and the lower 2 bits b ₁ , b ₀ indicates the group register. Each bit of the element index data corresponds to each bit of the select data from the selectors 210 and 212. The lower 2 bits b _1, b ₀ of the element index data corresponds to the lower 2 bits b _1, b ₀ of the data address.

図３に戻り、セレクタ２１０は、プログラムシーケンサ１８からのセレクトデータに基づいて、インデックスレジスタファイル２０８のなかからいずれか１つのインデックスレジスタのエレメントインデックスデータを選択してセレクタ２０４に出力する。なお、プログラムシーケンサ１８では、セレクタ２１０を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２１０に出力する。 Returning to FIG. 3, the selector 210 selects element index data of any one of the index registers from the index register file 208 based on the select data from the program sequencer 18 and outputs the selected element index data to the selector 204. When the program sequencer 18 executes an instruction code that controls the selector 210, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 210 as select data.

セレクタ２１２は、プログラムシーケンサ１８からのセレクトデータに基づいて、インデックスレジスタファイル２０８のなかからいずれか１つのインデックスレジスタのエレメントインデックスデータを選択してセレクタ２０６に出力する。なお、プログラムシーケンサ１８では、セレクタ２１２を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２１２に出力する。 The selector 212 selects element index data of any one of the index registers from the index register file 208 based on the select data from the program sequencer 18 and outputs it to the selector 206. When the program sequencer 18 executes an instruction code that controls the selector 212, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 212 as select data.

計算器２１４は、セレクタ２１０またはセレクタ２１２からセレクトデータとして出力されるエレメントインデックスデータを更新してセレクタ２１６に出力する。エレメントインデックスデータは、レジスタブロックＡ，Ｂのレジスタからデータを取得する順序があらかじめ決まっている場合に、その順序でデータが選択できるように更新する。例えば、レジスタブロックＡ，Ｂの先頭方向から末尾方向に連続してデータを選択する場合は、エレメントインデックスデータに「１」を加算することにより更新する。末尾方向から先頭方向に連続してデータを選択する場合は、エレメントインデックスデータから「１」を減算することにより更新する。 The calculator 214 updates the element index data output as select data from the selector 210 or the selector 212 and outputs it to the selector 216. The element index data is updated so that the data can be selected in that order when the order of obtaining the data from the registers of the register blocks A and B is determined in advance. For example, when data is continuously selected from the head direction to the tail direction of the register blocks A and B, the data is updated by adding “1” to the element index data. When data is continuously selected from the tail direction to the head direction, the data is updated by subtracting “1” from the element index data.

セレクタ２１６は、プログラムシーケンサ１８からのセレクトデータに基づいて、データアドレス生成部１４からのデータアドレスのうち下位２ビットのデータ、および計算器２１４からのエレメントインデックスデータのうちいずれか一方を選択してインデックスレジスタファイル２０８に格納する。なお、プログラムシーケンサ１８では、セレクタ２１６を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２１６に出力する。 Based on the select data from the program sequencer 18, the selector 216 selects either the lower 2 bits of the data address from the data address generator 14 or the element index data from the calculator 214. Store in the index register file 208. Note that when the instruction code for controlling the selector 216 is executed, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 216 as select data.

次に、本実施の形態の動作を図５および図６を参照しながら説明する。
初めに、デジタル信号処理装置１００で上式（１）に示す相関演算を行う場合を説明する。ただし、s[n]とr[n]は、図１０に示すように、データメモリ１０に格納されているとする。
図５は、プログラムシーケンサ１８で実行される処理を示すフローチャートである。
プログラムシーケンサ１８では、プログラムメモリ１６から制御プログラムが読み出され、読み出された制御プログラムに従って、図５のフローチャートに示す相関演算処理がステップＳ１００〜Ｓ１５０の順で実行される。 Next, the operation of the present embodiment will be described with reference to FIGS.
First, the case where the digital signal processing apparatus 100 performs the correlation calculation shown in the above equation (1) will be described. However, it is assumed that s [n] and r [n] are stored in the data memory 10 as shown in FIG.
FIG. 5 is a flowchart showing processing executed by the program sequencer 18.
In the program sequencer 18, the control program is read from the program memory 16, and the correlation calculation processing shown in the flowchart of FIG. 5 is executed in the order of steps S100 to S150 in accordance with the read control program.

ステップＳ１００では、ポインタレジスタＰ０にr[0]のデータアドレスを設定する命令コードが実行されると、データアドレス生成部１４では、r[0]のデータアドレスのうちグループアドレスがポインタレジスタＰ０に格納される。また、データ演算部１２では、r[0]のデータアドレスのうち下位２ビットのデータがインデックスレジスタファイル２０８に格納される。r[0]のデータアドレスのうち下位２ビットは「００」なので、最上位ビットを「０」として３ビットのデータ「０００」がエレメントインデックスデータｒとして格納される。 In step S100, when the instruction code for setting the data address of r [0] in the pointer register P0 is executed, the data address generator 14 stores the group address in the data address of r [0] in the pointer register P0. Is done. In the data operation unit 12, the lower 2 bits of the data address of r [0] are stored in the index register file 208. Since the lower 2 bits of the data address of r [0] are “00”, the most significant bit is “0” and 3-bit data “000” is stored as element index data r.

ステップＳ１０２では、ポインタレジスタＰ１にs[0]のデータアドレスを設定する命令コードが実行されると、データアドレス生成部１４では、s[0]のデータアドレスのうちグループアドレスがポインタレジスタＰ１に格納される。また、データ演算部１２では、s[0]のデータアドレスのうち下位２ビットのデータがインデックスレジスタファイル２０８に格納される。s[0]のデータアドレスのうち下位２ビットは「００」なので、最上位ビットを「０」として３ビットのデータ「０００」がエレメントインデックスデータｓとして格納される。 In step S102, when an instruction code for setting the data address of s [0] in the pointer register P1 is executed, the data address generator 14 stores the group address in the data address of s [0] in the pointer register P1. Is done. In the data operation unit 12, the lower 2 bits of the data address of s [0] are stored in the index register file 208. Since the lower 2 bits of the data address of s [0] are “00”, the most significant bit is “0” and 3-bit data “000” is stored as element index data s.

ステップＳ１０４では、レジスタＡ０（グループレジスタＲ₄〜Ｒ_n-1のいずれか）の内容を初期化する命令コードが実行されると、データ演算部１２では、レジスタＡ０に「０」が設定される。
ステップＳ１０８では、ポインタレジスタＰ１が指し示すグループアドレスのデータをグループレジスタＲ₀に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からs[0]〜s[3]がグループレジスタＲ₀に読み込まれる。 In step S104, the instruction code to initialize the contents of the register A0 (either Group Register R ₄ ~R _n-1) is executed, the data calculating unit 12, "0" is set in the register A0 .
In step S108, when the instruction code for reading the data of the group address indicated by the pointer register P1 into the group register _R0 is executed, the data operation unit 12 stores s [0] to s [3] from the data memory 10 into the group register. Read into _R0 .

ステップＳ１１０では、ポインタレジスタＰ１を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ１のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のs[4]を指し示すことになる。
ステップＳ１１２では、ポインタレジスタＰ１が指し示すグループアドレスのデータをグループレジスタＲ₁に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からs[4]〜s[7]がグループレジスタＲ₁に読み込まれる。 In step S110, when the instruction code for adding the pointer register P1 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P1. On the data memory 10, s [4] that is 4 bytes ahead is indicated.
In step S112, when the instruction code read data of the group address pointer register P1 pointed to the group registers R ₁ is executed, the data calculating unit 12, s [4] from the data memory 10 ~s [7] is Group Register It is loaded into R _1.

ステップＳ１１４では、ポインタレジスタＰ１を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ１のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のs[8]を指し示すことになる。
ステップＳ１１６では、ポインタレジスタＰ０が指し示すグループアドレスのデータをグループレジスタＲ₂に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からr[0]〜r[3]がグループレジスタＲ₂に読み込まれる。
ステップＳ１１８では、ポインタレジスタＰ０を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ０のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のr[4]を指し示すことになる。 In step S114, when the instruction code for adding the pointer register P1 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P1. On the data memory 10, s [8] that is 4 bytes ahead is indicated.
At step S116, the instruction code reads data group address pointer register P0 pointed to a group register R ₂ is executed, the data calculating unit 12, r [0] from the data memory 10 ~r [3] is Group Register It is loaded into R _2.
In step S118, when the instruction code for adding the pointer register P0 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P0. On the data memory 10, it points to r [4] 4 bytes ahead.

ステップＳ１２０では、レジスタブロックＡ，Ｂのレジスタのデータを演算する命令コード（以下、ブロック演算命令コードという。）が実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「０００」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[0]のデータs[0]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１２により、エレメントインデックスデータｒ「０００」が選択されてセレクタ２０６に出力され、セレクタ２０６により、レジスタＲ₂[0]のデータr[0]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[0]とr[0]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓ，ｒにそれぞれ「１」が加算され、インデックスレジスタファイル２０８に格納される。 In step S120, when an instruction code for calculating the register data of the register blocks A and B (hereinafter referred to as a block operation instruction code) is executed, the data operation unit 12 is based on the select data from the program sequencer 18. The element index data s “000” is selected by the selector 210 and outputted to the selector 204, and the data s [0] of the register R ₀ [0] is selected by the selector 204 and outputted to the computing unit 202. Further, based on the select data from the program sequencer 18, the element index data r “000” is selected by the selector 212 and output to the selector 206, and the selector 206 outputs the data r [0] of the register R ₂ [0]. Is selected and output to the computing unit 202. Then, s [0] and r [0] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the first term of the above equation (1) is obtained. Further, “1” is added to the element index data s and r by the calculator 214 and stored in the index register file 208.

ステップＳ１２２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「００１」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[1]のデータs[1]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１２により、エレメントインデックスデータｒ「００１」が選択されてセレクタ２０６に出力され、セレクタ２０６により、レジスタＲ₂[1]のデータr[1]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[1]とr[1]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項および第２項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓ，ｒにそれぞれ「１」が加算され、インデックスレジスタファイル２０８に格納される。 In step S122, when the block operation instruction code is further executed, the data operation unit 12 selects the element index data s “001” by the selector 210 based on the select data from the program sequencer 18 and selects the selector 204. The selector 204 selects the data s [1] of the register R ₀ [1] and outputs it to the computing unit 202. Further, based on the select data from the program sequencer 18, the element index data r “001” is selected by the selector 212 and output to the selector 206, and the selector 206 outputs the data r [1] of the register R ₂ [1]. Is selected and output to the computing unit 202. The computing unit 202 multiplies s [1] and r [1], adds the multiplication result to the value of the register A0, and stores the result in the register A0. Thereby, the calculation results of the first term and the second term of the above formula (1) are obtained. Further, “1” is added to the element index data s and r by the calculator 214 and stored in the index register file 208.

ステップＳ１２４では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₀[2]，Ｒ₂[2]のデータs[2]，r[2]がそれぞれ選択される。そして、演算器２０２により、s[2]とr[2]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第３項の演算結果が得られる。 At step S124, the further the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₀ [2], data of _{R 2 [2] s [2} ], r [2] is selected. Then, s [2] and r [2] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 3rd term of the above formula (1) is obtained.

ステップＳ１２６では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₀[3]，Ｒ₂[3]のデータs[3]，r[3]がそれぞれ選択される。そして、演算器２０２により、s[3]とr[3]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第４項の演算結果が得られる。 In step S126, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₀ [3], the data s of _{R 2 [3] [3]} , Each r [3] is selected. Then, s [3] and r [3] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 4th term of the above formula (1) is obtained.

ステップＳ１２８では、ポインタレジスタＰ１が指し示すグループアドレスのデータをグループレジスタＲ₀に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からs[8]〜s[11]がグループレジスタＲ₀に読み込まれる。
ステップＳ１３０では、ポインタレジスタＰ１を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ１のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のs[12]を指し示すことになる。 In step S128, when an instruction code for reading the data of the group address indicated by the pointer register P1 into the group register _R0 is executed, the data operation unit 12 stores s [8] to s [11] from the data memory 10 into the group register. Read into _R0 .
In step S130, when the instruction code for adding the pointer register P1 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P1. On the data memory 10, s [12] that is 4 bytes ahead is indicated.

ステップＳ１３２では、ポインタレジスタＰ０が指し示すグループアドレスのデータをグループレジスタＲ₃に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からr[4]〜r[7]がグループレジスタＲ₃に読み込まれる。
ステップＳ１３４では、ポインタレジスタＰ０を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ０のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のr[7]を指し示すことになる。 In step S132, when the instruction code read data group address pointer register P0 pointed to a group register R ₃ is executed, the data calculating unit 12, r [4] from the data memory 10 ~r [7] is Group Register It is loaded into R _3.
In step S134, when the instruction code for adding the pointer register P0 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P0. On the data memory 10, it points to r [7] 4 bytes ahead.

ステップＳ１３６では、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[0]，Ｒ₃[0]のデータs[4]，r[4]がそれぞれ選択される。そして、演算器２０２により、s[4]とr[4]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第５項の演算結果が得られる。 In step S136, when the block operation instruction code is executed, the data operation unit 12 similarly uses the selectors 204 and 206 to store the data s [4] and r [[] in the registers R ₁ [0] and R ₃ [0]. 4] is selected respectively. Then, s [4] and r [4] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation results of the first to fifth terms of the above equation (1) are obtained.

ステップＳ１３８では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[1]，Ｒ₃[1]のデータs[5]，r[5]がそれぞれ選択される。そして、演算器２０２により、s[5]とr[5]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第６項の演算結果が得られる。 In step S138, when the block operation instruction code is further executed, the data operation unit 12 similarly uses the selectors 204 and 206 to store the data s [5], Rs [1], R3 [1] in the registers R ₁ [1], R ₃ [1]. r [5] is selected. Then, s [5] and r [5] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 6th term of the above formula (1) is obtained.

ステップＳ１４０では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[2]，Ｒ₃[2]のデータs[6]，r[6]がそれぞれ選択される。そして、演算器２０２により、s[6]とr[6]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第７項の演算結果が得られる。 In step S140, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [2], data s of _{R 3 [2] [6]} , Each r [6] is selected. Then, s [6] and r [6] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 7th term of the above formula (1) is obtained.

ステップＳ１４２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[3]，Ｒ₃[3]のデータs[7]，r[7]がそれぞれ選択される。そして、演算器２０２により、s[7]とr[7]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第８項の演算結果が得られる。 In step S142, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [3], the data s [7] of R ₃ [3], Each r [7] is selected. Then, s [7] and r [7] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 8th term of the above formula (1) is obtained.

ステップＳ１４４では、ポインタレジスタＰ１が指し示すグループアドレスのデータをグループレジスタＲ₁に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からs[12]〜s[15]がグループレジスタＲ₁に読み込まれる。
ステップＳ１４６では、ポインタレジスタＰ１を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ１のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のs[16]を指し示すことになる。 In step S144, when the instruction code read data of the group address pointer register P1 pointed to the group registers R ₁ is executed, the data calculating unit 12, s [12] from the data memory 10 ~s [15] is Group Register It is loaded into R _1.
In step S146, when the instruction code for adding the pointer register P1 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P1. On the data memory 10, it points to s [16] 4 bytes ahead.

ステップＳ１４８では、ポインタレジスタＰ０が指し示すグループアドレスのデータをグループレジスタＲ₂に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からr[8]〜r[11]がグループレジスタＲ₂に読み込まれる。
ステップＳ１５０では、ポインタレジスタＰ０を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ０のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のr[12]を指し示すことになる。
以下は、ステップＳ１２０〜Ｓ１５０の処理を必要な数だけ繰り返し行えばよい。 In step S148, when the instruction code read data group address pointer register P0 pointed to a group register R ₂ is executed, the data calculating unit 12, r [8] from the data memory 10 ~r [11] is Group Register It is loaded into R _2.
In step S150, when the instruction code for adding the pointer register P0 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P0. On the data memory 10, it points to r [12] 4 bytes ahead.
In the following, the processes in steps S120 to S150 may be repeated as many times as necessary.

次に、デジタル信号処理装置１００で上式（２）に示す相関演算を行う場合を説明する。ただし、s[n]とr[n]は、図１０に示すように、データメモリ１０に格納されているとする。
図６は、プログラムシーケンサ１８で実行される処理を示すフローチャートである。
プログラムシーケンサ１８では、プログラムメモリ１６から制御プログラムが読み出され、読み出された制御プログラムに従って、図６のフローチャートに示す相関演算処理がステップＳ２００〜Ｓ２５０の順で実行される。 Next, the case where the digital signal processing apparatus 100 performs the correlation calculation shown in the above equation (2) will be described. However, it is assumed that s [n] and r [n] are stored in the data memory 10 as shown in FIG.
FIG. 6 is a flowchart showing processing executed by the program sequencer 18.
In the program sequencer 18, the control program is read from the program memory 16, and the correlation calculation processing shown in the flowchart of FIG. 6 is executed in the order of steps S200 to S250 in accordance with the read control program.

ステップＳ２００では、ステップＳ１００と同一の処理が実行される。
ステップＳ２０２では、ポインタレジスタＰ１にs[1]のデータアドレスを設定する命令コードが実行されると、データアドレス生成部１４では、s[1]のデータアドレスのうちグループアドレスがポインタレジスタＰ１に格納される。また、データ演算部１２では、s[1]のデータアドレスのうち下位２ビットのデータがインデックスレジスタファイル２０８に格納される。s[1]のデータアドレスのうち下位２ビットは「０１」なので、最上位ビットを「０」として３ビットのデータ「００１」がエレメントインデックスデータｓとして格納される。 In step S200, the same process as step S100 is executed.
In step S202, when the instruction code for setting the data address of s [1] in the pointer register P1 is executed, the data address generation unit 14 stores the group address in the data address of s [1] in the pointer register P1. Is done. Further, in the data operation unit 12, the lower 2 bits of the data address of s [1] are stored in the index register file 208. Since the lower 2 bits of the data address of s [1] are “01”, the most significant bit is “0” and 3-bit data “001” is stored as element index data s.

ステップＳ２０４〜Ｓ２１８では、ステップＳ１０４〜Ｓ１１８と同一の処理が実行される。
ステップＳ２２０では、ブロック演算命令コードが実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「００１」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[1]のデータs[1]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１２により、エレメントインデックスデータｒ「０００」が選択されてセレクタ２０６に出力され、セレクタ２０６により、レジスタＲ₂[0]のデータr[0]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[1]とr[0]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓ，ｒにそれぞれ「１」が加算され、インデックスレジスタファイル２０８に格納される。 In steps S204 to S218, the same processing as in steps S104 to S118 is executed.
In step S220, when the block operation instruction code is executed, the data operation unit 12 selects the element index data s “001” by the selector 210 based on the select data from the program sequencer 18 and outputs it to the selector 204. Then, the selector 204 selects the data s [1] of the register R ₀ [1] and outputs it to the computing unit 202. Further, based on the select data from the program sequencer 18, the element index data r “000” is selected by the selector 212 and output to the selector 206, and the selector 206 outputs the data r [0] of the register R ₂ [0]. Is selected and output to the computing unit 202. Then, s [1] and r [0] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the first term of the above equation (2) is obtained. Further, “1” is added to the element index data s and r by the calculator 214 and stored in the index register file 208.

ステップＳ２２２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「０１０」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[2]のデータs[2]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１２により、エレメントインデックスデータｒ「００１」が選択されてセレクタ２０６に出力され、セレクタ２０６により、レジスタＲ₂[1]のデータr[1]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[2]とr[1]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項および第２項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓ，ｒにそれぞれ「１」が加算され、インデックスレジスタファイル２０８に格納される。 In step S222, when the block operation instruction code is further executed, the data operation unit 12 selects the element index data s “010” by the selector 210 based on the select data from the program sequencer 18 and selects the selector 204. The selector 204 selects the data s [2] of the register R ₀ [2] and outputs it to the computing unit 202. Further, based on the select data from the program sequencer 18, the element index data r “001” is selected by the selector 212 and output to the selector 206, and the selector 206 outputs the data r [1] of the register R ₂ [1]. Is selected and output to the computing unit 202. Then, s [2] and r [1] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation results of the first term and the second term of the above equation (2) are obtained. Further, “1” is added to the element index data s and r by the calculator 214 and stored in the index register file 208.

ステップＳ２２４では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₀[3]，Ｒ₂[2]のデータs[3]，r[2]がそれぞれ選択される。そして、演算器２０２により、s[3]とr[2]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第３項の演算結果が得られる。 In step S224, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₀ [3], the data s of _{R 2 [2] [3]} , r [2] is selected. Then, s [3] and r [2] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 3rd term of the above formula (2) is obtained.

ステップＳ２２６では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[0]，Ｒ₂[3]のデータs[4]，r[3]がそれぞれ選択される。そして、演算器２０２により、s[4]とr[3]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第４項の演算結果が得られる。 In step S226, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [0], data s of _{R 2 [3] [4]} , Each r [3] is selected. The computing unit 202 multiplies s [4] and r [3], adds the multiplication result to the value of the register A0, and stores the result in the register A0. Thereby, the calculation result of the 1st term-the 4th term of the above formula (2) is obtained.

ステップＳ２２８〜Ｓ２３４では、ステップＳ１２８〜Ｓ１３４と同一の処理が実行される。
ステップＳ２３６では、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[1]，Ｒ₃[0]のデータs[5]，r[4]がそれぞれ選択される。そして、演算器２０２により、s[5]とr[4]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第５項の演算結果が得られる。 In steps S228 to S234, the same processing as steps S128 to S134 is executed.
In step S236, when the block operation instruction code is executed, the data operation unit 12 similarly uses the selectors 204 and 206 to cause the data s [5] and r [[] in the registers R ₁ [1] and R ₃ [0]. 4] is selected respectively. Then, s [5] and r [4] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation results of the first to fifth terms of the above equation (2) are obtained.

ステップＳ２３８では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[2]，Ｒ₃[1]のデータs[6]，r[5]がそれぞれ選択される。そして、演算器２０２により、s[6]とr[5]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第６項の演算結果が得られる。 In step S238, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [2], data s of _{R 3 [1] [6]} , r [5] is selected. Then, s [6] and r [5] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 6th term of the above formula (2) is obtained.

ステップＳ２４０では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[3]，Ｒ₃[2]のデータs[7]，r[6]がそれぞれ選択される。そして、演算器２０２により、s[7]とr[6]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第７項の演算結果が得られる。 In step S240, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [3], the data s [7] of R ₃ [2], Each r [6] is selected. Then, s [7] and r [6] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation results of the first to seventh terms of the above equation (2) are obtained.

ステップＳ２４２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₀[0]，Ｒ₃[3]のデータs[8]，r[7]がそれぞれ選択される。そして、演算器２０２により、s[8]とr[7]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（２）の第１項〜第８項の演算結果が得られる。 In step S242, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₀ [0], the data s [8] of R ₃ [3], Each r [7] is selected. Then, s [8] and r [7] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 8th term of the above formula (2) is obtained.

ステップＳ２４４〜Ｓ２５０では、ステップＳ１４４〜Ｓ１５０と同一の処理が実行される。
以下は、ステップＳ２２０〜Ｓ２５０の処理を必要な数だけ繰り返し行えばよい。
このように、上式（１）に示す相関演算を行う場合と異なるのは、ステップＳ２０２，Ｓ２２０〜Ｓ２２６，Ｓ２３６〜Ｓ２４２の処理であるが、プログラムシーケンサ１８の制御プログラムとしては、ステップＳ２０２の記述だけが異なる。このことは、上式（３），（４）に示す相関演算を行う場合についても同様である。 In steps S244 to S250, the same processing as steps S144 to S150 is executed.
In the following, the processes in steps S220 to S250 may be repeated as many times as necessary.
As described above, what is different from the case where the correlation calculation shown in the above equation (1) is performed is the processing of steps S202, S220 to S226, and S236 to S242, but the control program of the program sequencer 18 is described in step S202. Only the difference. The same applies to the case where the correlation calculation shown in the above equations (3) and (4) is performed.

このようにして、本実施の形態では、複数のグループレジスタＲ₀〜Ｒ_n-1からなるレジスタファイル２００と、レジスタファイル２００のデータを用いて演算を行う演算器２０２と、エレメントインデックスデータを記憶するインデックスレジスタと、インデックスレジスタのエレメントインデックスデータに基づいて、２つのグループレジスタＲ₀，Ｒ₁（Ｒ₂，Ｒ₃）をレジスタブロックとしてレジスタブロックのレジスタＲ₀[0]〜Ｒ₁[3]（Ｒ₂[0]〜Ｒ₃[3]）のなかからいずれか１つのレジスタのデータを選択して演算器２０２に出力するセレクタ２０４，２０６とを備える。 Thus, in this embodiment, a register file 200 composed of a plurality of group registers R _{0 to} R _n−1 , an arithmetic unit 202 that performs an operation using the data of the register file 200, and element index data are stored. And two group registers R ₀ , R ₁ (R ₂ , R ₃ ) as register blocks based on the index register to be registered and the element index data of the index register, the registers R ₀ [0] to R ₁ [3] of the register block Selectors 204 and 206 that select data from any one of the registers (R ₂ [0] to R ₃ [3]) and output the selected data to the computing unit 202 are provided.

これにより、境界不整列データアクセスとなる場合や、データの論理構造が重複している場合でも、レジスタファイル２００へのデータの読み込み回数が極端に増加することがないので、処理効率を向上することができる。また、データの論理構造が重複している場合でも、プログラムの多少の変更で対応することができるので、プログラムコストを低減することができる。さらに、インデックスレジスタおよびセレクタ２０４，２０６を設けるだけでよく、また、多くの境界不整列データアクセスのパターンに対応する場合でも、エレメントインデックスデータのビット数をさほど増やさなくてすむので、特許文献１の発明に比して、製造コストを低減することができる。 As a result, the number of times data is read into the register file 200 does not increase extremely even when boundary misaligned data access occurs or when the logical structure of data overlaps, thereby improving processing efficiency. Can do. In addition, even when the logical structure of data is duplicated, it can be handled by a slight change in the program, so that the program cost can be reduced. Furthermore, it is only necessary to provide index registers and selectors 204 and 206, and even when dealing with many patterns of boundary-unaligned data access, it is not necessary to increase the number of bits of element index data. Compared to the invention, the manufacturing cost can be reduced.

さらに、本実施の形態では、複数のインデックスレジスタからなるインデックスレジスタファイル２０８と、与えられたセレクトデータに基づいて、インデックスレジスタファイル２０８のなかからいずれか１つのインデックスレジスタのエレメントインデックスデータを選択してセレクタ２０４，２０６に出力するセレクタ２１０，２１２とを備える。
これにより、複数のエレメントインデックスデータをインデックスレジスタファイル２０８に格納しておき、セレクトデータによってそれらエレメントインデックスデータを選択的に使用することができるので、インデックスレジスタの使い勝手が向上し、プログラムを作成しやすくなる。 Further, in the present embodiment, element index data of any one of the index registers is selected from the index register file 208 based on the index register file 208 composed of a plurality of index registers and the given select data. Selectors 210 and 212 that output to the selectors 204 and 206 are provided.
As a result, a plurality of element index data can be stored in the index register file 208, and the element index data can be selectively used by the select data, thereby improving the usability of the index register and making it easy to create a program. Become.

さらに、本実施の形態では、セレクタ２０４，２０６に出力したエレメントインデックスデータを更新してインデックスレジスタファイル２０８に格納する計算器２１４を備える。
これにより、エレメントインデックスデータを使用するたびに自動的に更新されるので、エレメントインデックスデータの更新をソフトウェアで行わなくてすみ、演算速度を向上することができる。 Furthermore, the present embodiment includes a calculator 214 that updates the element index data output to the selectors 204 and 206 and stores it in the index register file 208.
As a result, since the element index data is automatically updated every time it is used, the element index data need not be updated by software, and the calculation speed can be improved.

上記第１の実施の形態において、エレメントインデックスデータは、請求項１ないし４または７記載のインデックスデータに対応し、セレクタ２０４，２０６は、請求項１ないし４記載のセレクタに対応し、セレクタ２１０，２１２は、請求項３記載の第２セレクタに対応し、計算器２１４は、請求項４記載のインデックスデータ更新手段に対応している。 In the first embodiment, the element index data corresponds to the index data according to claims 1 to 4 or 7, the selectors 204 and 206 correspond to the selector according to claims 1 to 4, and the selectors 210, 212 corresponds to the second selector according to claim 3, and the calculator 214 corresponds to the index data update means according to claim 4.

次に、本発明の第２の実施の形態を図面を参照しながら説明する。図７ないし図９は、本発明に係るデータ演算装置の第２の実施の形態を示す図である。
本実施の形態は、本発明に係るデータ演算装置を、デジタル信号処理を行う３２ビットＲＩＳＣ型のデジタル信号処理装置１００においてデータ演算部１２に適用したものであり、上記第１の実施の形態と異なるのは、レジスタブロックに含まれるグループレジスタの数を動的に変更する点にある。なお、以下、上記第１の実施の形態と異なる部分についてのみ説明し、上記第１の実施の形態と重複する部分については同一の符号を付して説明を省略する。 Next, a second embodiment of the present invention will be described with reference to the drawings. 7 to 9 are diagrams showing a second embodiment of the data arithmetic device according to the present invention.
In the present embodiment, the data arithmetic device according to the present invention is applied to the data arithmetic unit 12 in the 32-bit RISC type digital signal processing device 100 that performs digital signal processing. The difference is that the number of group registers included in the register block is dynamically changed. Hereinafter, only the parts different from the first embodiment will be described, and the same parts as those in the first embodiment will be denoted by the same reference numerals and the description thereof will be omitted.

次に、データ演算部１２の構成を図７ないし図９を参照しながら詳細に説明する。
図７は、データ演算部１２のハードウェア構成を示すブロック図である。
データ演算部１２は、図７に示すように、レジスタファイル２００と、演算器２０２と、レジスタファイル２００のなかからデータを選択するセレクタ２２４，２２６と、複数のインデックスレジスタからなるインデックスレジスタファイル２２８と、インデックスレジスタファイル２２８のなかからデータを選択するセレクタ２３０，２３２と、インデックスレジスタファイル２２８のデータを更新する計算器２３４と、計算器２３４とバス２０のデータを選択するセレクタ２３６とで構成されている。 Next, the configuration of the data calculation unit 12 will be described in detail with reference to FIGS.
FIG. 7 is a block diagram illustrating a hardware configuration of the data calculation unit 12.
As shown in FIG. 7, the data calculation unit 12 includes a register file 200, a calculator 202, selectors 224 and 226 that select data from the register file 200, and an index register file 228 including a plurality of index registers. , Selectors 230 and 232 for selecting data from the index register file 228, a calculator 234 for updating data in the index register file 228, and a selector 236 for selecting data on the calculator 234 and the bus 20. Yes.

図８は、レジスタブロックの構成を示す図である。
本実施の形態では、１つのグループレジスタを１つのレジスタブロックとしてグループ化するバンドルモード０と、２つのグループレジスタを１つのレジスタブロックとしてグループ化するバンドルモード１と、４つのグループレジスタを１つのレジスタブロックとしてグループ化するバンドルモード２と、８つのグループレジスタを１つのレジスタブロックとしてグループ化するバンドルモード３とを規定する。 FIG. 8 is a diagram illustrating a configuration of a register block.
In the present embodiment, bundle mode 0 for grouping one group register as one register block, bundle mode 1 for grouping two group registers as one register block, and one register for four group registers. A bundle mode 2 for grouping as a block and a bundle mode 3 for grouping eight group registers as one register block are defined.

バンドルモード０では、図８に示すように、グループレジスタＲ₀をレジスタブロック０（バンドル番号「０」により特定されるレジスタブロックをいう。以下、同様に略記する。）とし、同様に、グループレジスタＲ₂〜Ｒ₇をレジスタブロック１〜７としてそれぞれグループ化する。
バンドルモード１では、グループレジスタＲ₀，Ｒ₁をレジスタブロック０とし、グループレジスタＲ₂，Ｒ₃をレジスタブロック１とし、グループレジスタＲ₄，Ｒ₅をレジスタブロック２とし、グループレジスタＲ₆，Ｒ₇をレジスタブロック３としてそれぞれグループ化する。 In the bundle mode 0, as shown in FIG. 8, the group register R ₀ is set as a register block 0 (referred to as a register block specified by a bundle number “0”, hereinafter abbreviated in the same manner). R _{2 to} R ₇ are grouped as register blocks 1 to 7, respectively.
In the bundle mode 1, the group registers R ₀ and R ₁ are the register block 0, the group registers R ₂ and R ₃ are the register block 1, the group registers R ₄ and R ₅ are the register block 2, and the group registers R ₆ and R ₇ is grouped as a register block 3 respectively.

バンドルモード２では、グループレジスタＲ₀〜Ｒ₃をレジスタブロック０とし、グループレジスタＲ₄〜Ｒ₇をレジスタブロック１としてそれぞれグループ化する。
バンドルモード３では、グループレジスタＲ₀〜Ｒ₇をレジスタブロック０としてそれぞれグループ化する。
図７に戻り、セレクタ２２４は、グループレジスタＲ₀〜Ｒ₇の各レジスタと接続し、セレクタ２３０からのセレクトデータに基づいて、グループレジスタＲ₀〜Ｒ₇のなかからいずれか１つのレジスタのデータを選択し、選択したデータを演算器２０２の第１オペランドとして演算器２０２に出力する。 In the bundle mode 2, the group registers R _{0 to} R ₃ are grouped as the register block 0 and the group registers R _{4 to} R ₇ are grouped as the register block 1, respectively.
In the bundle mode 3, the group registers R _{0 to} R ₇ are grouped as a register block 0, respectively.
Returning to FIG. 7, the selector 224 is connected to each of the registers of the group registers R _{0 to} R ₇ , and based on the select data from the selector 230, the data in any one of the group registers R _{0 to} R _7. And the selected data is output to the computing unit 202 as the first operand of the computing unit 202.

セレクタ２３０からのセレクトデータは、５ビットからなる。セレクタ２２４は、セレクトデータの上位３ビットが「０００」であるときは、グループレジスタＲ₀のうち下位２ビットの値により指定されるレジスタのデータを選択し、セレクトデータの上位３ビットが「００１」であるときは、グループレジスタＲ₁のうち下位２ビットの値により指定されるレジスタのデータを選択する。同様に、セレクトデータの上位３ビットが「０１０」、「０１１」、「１００」、「１０１」、「１１０」および「１１１」であるときは、グループレジスタＲ₂〜Ｒ₇のうち下位２ビットの値により指定されるレジスタのデータを選択する。 The select data from the selector 230 consists of 5 bits. When the upper 3 bits of the select data is “000”, the selector 224 selects the register data specified by the lower 2 bits of the group register R ₀ , and the upper 3 bits of the select data is “001”. when "a selects the data of the register specified by the lower two bits of the value of the group registers R _1. Similarly, when the upper 3 bits of the select data are “010”, “011”, “100”, “101”, “110” and “111”, the lower 2 bits of the group registers R _{2 to} R ₇ Select the register data specified by the value of.

セレクタ２２６は、グループレジスタＲ₀〜Ｒ₇の各レジスタと接続し、セレクタ２３２からのセレクトデータに基づいて、グループレジスタＲ₀〜Ｒ₇のなかからいずれか１つのレジスタのデータを選択し、選択したデータを演算器２０２の第２オペランドとして演算器２０２に出力する。データの選択手順は、セレクタ２２４と同様である。
インデックスレジスタファイル２２８は、エレメントインデックスデータを各インデックスレジスタに記憶する。 The selector 226 is connected to the respective registers of the group registers R ₀ to R _7, based on the select data from the selector 232 selects the data of one of the registers from among Group Register R ₀ to R _7, selected The obtained data is output to the computing unit 202 as the second operand of the computing unit 202. The data selection procedure is the same as that of the selector 224.
The index register file 228 stores element index data in each index register.

図９は、エレメントインデックスデータのデータ構造を示す図である。
エレメントインデックスデータは、図９に示すように、５ビットのデータからなり、バンドルモード０のときは、５ビットのうち下位２ビットを使用し、下位２ビットｂ₁，ｂ₀がグループレジスタの各レジスタの別を示す。
バンドルモード１のときは、５ビットのうち下位３ビットを使用し、上位３番目のビットｂ₂がグループレジスタＲ₀，Ｒ₁（Ｒ₂，Ｒ₃等）の別を示し、下位２ビットｂ₁，ｂ₀がグループレジスタの各レジスタの別を示す。 FIG. 9 is a diagram illustrating a data structure of element index data.
As shown in FIG. 9, the element index data consists of 5-bit data. In the bundle mode 0, the lower 2 bits of the 5 bits are used, and the lower 2 bits b ₁ and b ₀ are stored in the group registers. Indicates the different register.
In the bundle mode 1, the lower 3 bits of the 5 bits are used, the upper third bit b ₂ indicates the group register R ₀ , R ₁ (R ₂ , R _3, etc.), and the lower 2 bits b ₁ and b ₀ indicate the group registers.

バンドルモード２のときは、５ビットのうち下位４ビットを使用し、上位２，３番目のビットｂ₃，ｂ₂がグループレジスタＲ₀〜Ｒ₃（Ｒ₄〜Ｒ₇）の別を示し、下位２ビットｂ₁，ｂ₀がグループレジスタの各レジスタの別を示す。
バンドルモード３のときは、上位１〜３番目のビットｂ₄〜ｂ₂がグループレジスタＲ₀〜Ｒ₇の別を示し、下位２ビットｂ₁，ｂ₀がグループレジスタの各レジスタの別を示す。エレメントインデックスデータの各ビットは、セレクタ２３０，２３２からのセレクトデータの各ビットに対応している。また、エレメントインデックスデータの下位２ビットｂ₁，ｂ₀は、データアドレスの下位２ビットｂ₁，ｂ₀に対応している。 In the bundle mode 2, the lower 4 bits of the 5 bits are used, and the upper and second bits b ₃ and b ₂ indicate the group registers R _{0 to} R ₃ (R _{4 to} R ₇ ). The lower 2 bits b ₁ and b ₀ indicate the group registers.
In the bundle mode 3, the first to third bits b _{4 to} b ₂ indicate the group registers R _{0 to} R ₇ , and the lower two bits b ₁ and b ₀ indicate the group registers. . Each bit of the element index data corresponds to each bit of select data from the selectors 230 and 232. The lower 2 bits b _1, b ₀ of the element index data corresponds to the lower 2 bits b _1, b ₀ of the data address.

図７に戻り、セレクタ２３０は、プログラムシーケンサ１８からのセレクトデータに基づいて、インデックスレジスタファイル２２８のなかからいずれか１つのインデックスレジスタのエレメントインデックスデータを選択し、選択したエレメントインデックスデータ、並びにプログラムシーケンサ１８からのバンドルモードデータおよびバンドルインデックスデータに基づいてレジスタインデックスデータを生成し、生成したレジスタインデックスデータをセレクタ２２４に出力する。 Returning to FIG. 7, the selector 230 selects element index data of any one of the index registers from the index register file 228 based on the select data from the program sequencer 18, and selects the selected element index data and the program sequencer. Register index data is generated based on the bundle mode data and bundle index data from 18, and the generated register index data is output to the selector 224.

バンドルインデックスデータは、バンドル番号を示す５ビットのデータである。バンドルモード０を示すバンドルモードデータが与えられたときは、バンドルインデックスデータの下位３ビットを上位ビットとし、選択したエレメントインデックスデータの下位２ビットを下位ビットとして結合し、５ビットのレジスタインデックスデータを生成して出力する。 The bundle index data is 5-bit data indicating the bundle number. When bundle mode data indicating the bundle mode 0 is given, the lower 3 bits of the bundle index data are combined as the upper bits, the lower 2 bits of the selected element index data are combined as the lower bits, and the 5-bit register index data is Generate and output.

バンドルモード１を示すバンドルモードデータが与えられたときは、バンドルインデックスデータの下位２ビットを上位ビットとし、選択したエレメントインデックスデータの下位３ビットを下位ビットとして結合し、５ビットのレジスタインデックスデータを生成して出力する。
バンドルモード２を示すバンドルモードデータが与えられたときは、バンドルインデックスデータの下位１ビットを上位ビットとし、選択したエレメントインデックスデータの下位４ビットを下位ビットとして結合し、５ビットのレジスタインデックスデータを生成して出力する。 When bundle mode data indicating bundle mode 1 is given, the lower 2 bits of the bundle index data are combined as the upper bits, the lower 3 bits of the selected element index data are combined as the lower bits, and the 5-bit register index data is Generate and output.
When bundle mode data indicating the bundle mode 2 is given, the lower 1 bit of the bundle index data is combined as the upper bit, the lower 4 bits of the selected element index data is combined as the lower bit, and the 5-bit register index data is Generate and output.

バンドルモード３を示すバンドルモードデータが与えられたときは、選択したエレメントインデックスデータをそのままレジスタインデックスデータとして出力する。
なお、プログラムシーケンサ１８では、セレクタ２３０を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２３０に出力する。また、バンドルモードまたはバンドル番号を設定する命令コードを実行すると、その命令コードに対応するオペランドをバンドルモードデータまたはバンドルインデックスデータとしてセレクタ２３０に出力する。 When bundle mode data indicating the bundle mode 3 is given, the selected element index data is output as register index data as it is.
When the program sequencer 18 executes an instruction code for controlling the selector 230, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 230 as select data. When an instruction code for setting a bundle mode or bundle number is executed, an operand corresponding to the instruction code is output to the selector 230 as bundle mode data or bundle index data.

セレクタ２３２は、プログラムシーケンサ１８からのセレクトデータに基づいて、インデックスレジスタファイル２２８のなかからいずれか１つのインデックスレジスタのエレメントインデックスデータを選択し、選択したエレメントインデックスデータ、並びにプログラムシーケンサ１８からのバンドルモードデータおよびバンドルインデックスデータに基づいてレジスタインデックスデータを生成し、生成したレジスタインデックスデータをセレクタ２２６に出力する。レジスタインデックスデータの生成手順は、セレクタ２３０と同様である。なお、プログラムシーケンサ１８では、セレクタ２３２を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２３２に出力する。また、バンドルモードまたはバンドル番号を設定する命令コードを実行すると、その命令コードに対応するオペランドをバンドルモードデータまたはバンドルインデックスデータとしてセレクタ２３２に出力する。 The selector 232 selects element index data of any one of the index registers from the index register file 228 based on the select data from the program sequencer 18, and selects the selected element index data and the bundle mode from the program sequencer 18. Register index data is generated based on the data and bundle index data, and the generated register index data is output to the selector 226. The register index data generation procedure is the same as that of the selector 230. When the instruction code for controlling the selector 232 is executed, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 232 as select data. When an instruction code for setting a bundle mode or a bundle number is executed, an operand corresponding to the instruction code is output to the selector 232 as bundle mode data or bundle index data.

計算器２３４は、セレクタ２３０またはセレクタ２３２からセレクトデータとして出力されるレジスタインデックスデータを更新してセレクタ２３６に出力する。レジスタインデックスデータは、レジスタブロックのレジスタからデータを取得する順序があらかじめ決まっている場合に、その順序でデータが選択できるように更新する。レジスタインデックスデータの更新手順は、上記第１の実施の形態において計算器２１４によりエレメントインデックスデータを更新する手順と同じ要領で行うことができる。 The calculator 234 updates the register index data output as select data from the selector 230 or the selector 232 and outputs it to the selector 236. The register index data is updated so that data can be selected in the order in which the data acquisition order is determined in advance from the register of the register block. The procedure for updating the register index data can be performed in the same manner as the procedure for updating the element index data by the calculator 214 in the first embodiment.

セレクタ２３６は、プログラムシーケンサ１８からのセレクトデータに基づいて、データアドレス生成部１４からのデータアドレスのうち下位２ビットのデータ、および計算器２３４からのレジスタインデックスデータのうちいずれか一方を選択してインデックスレジスタファイル２２８に格納する。なお、プログラムシーケンサ１８では、セレクタ２３６を制御する命令コードを実行すると、その命令コードに対応するオペランドをセレクトデータとしてセレクタ２３６に出力する。 Based on the select data from the program sequencer 18, the selector 236 selects either the lower 2 bits of the data address from the data address generator 14 or the register index data from the calculator 234. Store in the index register file 228. When the instruction code for controlling the selector 236 is executed, the program sequencer 18 outputs an operand corresponding to the instruction code to the selector 236 as select data.

次に、本実施の形態の動作を説明する。
初めに、バンドルモード１で演算を行う場合を説明する。
プログラムシーケンサ１８では、バンドルモード１に設定する命令コードが実行されると、バンドルモード１を示すバンドルモードデータがデータ演算部１２に出力される。そして、バンドル番号「０」を設定する命令コードが実行されると、バンドルインデックスデータとして「０００００」がデータ演算部１２に出力される。 Next, the operation of the present embodiment will be described.
First, a case where calculation is performed in the bundle mode 1 will be described.
In the program sequencer 18, when the instruction code set to the bundle mode 1 is executed, bundle mode data indicating the bundle mode 1 is output to the data calculation unit 12. When the instruction code for setting the bundle number “0” is executed, “00000” is output to the data operation unit 12 as bundle index data.

データ演算部１２では、バンドルモードデータおよびバンドルインデックスデータが与えられると、セレクタ２３０，２３２により、エレメントインデックスデータが選択され、選択されたエレメントインデックスデータ、並びに与えられたバンドルモードデータおよびバンドルインデックスデータに基づいてレジスタインデックスデータが生成されてセレクタ２２４，２２６に出力される。このとき、バンドルインデックスデータの下位２ビットが「００」なので、「００ｘｘｘ」（「ｘ」は、選択されたエレメントインデックスデータのビットを示す。以下、同様に表記する。）がレジスタインデックスデータとして出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータ「００ｘｘｘ」に基づいて、グループレジスタＲ₀，Ｒ₁のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 In the data operation unit 12, when bundle mode data and bundle index data are given, the element index data is selected by the selectors 230 and 232, and the selected element index data and the given bundle mode data and bundle index data are selected. Based on this, register index data is generated and output to the selectors 224 and 226. At this time, since the lower 2 bits of the bundle index data are “00”, “00xxx” (“x” indicates the bit of the selected element index data. Hereinafter, the same is expressed) is output as the register index data. Is done. Then, based on the given register index data “00xxx” by the selectors 224 and 226, the data of any _one of the group registers R ₀ and R ₁ is selected and output to the arithmetic unit 202.

なお、セレクタ２３０，２３２に与えられるセレクトデータが異なるので、セレクタ２２４，２２６，２３０，２３２による選択はそれぞれ独立に行われる。
プログラムシーケンサ１８では、バンドル番号「１」を設定する命令コードが実行されると、バンドルインデックスデータとして「００００１」がデータ演算部１２に出力される。 Since the selection data given to the selectors 230 and 232 is different, the selection by the selectors 224, 226, 230, and 232 is performed independently.
In the program sequencer 18, when the instruction code for setting the bundle number “1” is executed, “00001” is output to the data operation unit 12 as bundle index data.

データ演算部１２では、バンドルインデックスデータの下位２ビットが「０１」なので、セレクタ２３０，２３２により、「０１ｘｘｘ」がレジスタインデックスデータとして生成・出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータ「０１ｘｘｘ」に基づいて、グループレジスタＲ₂，Ｒ₃のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。
バンドル番号「２」または「３」を設定する命令コードが実行された場合も同様であり、グループレジスタＲ₄，Ｒ₅またはＲ₆，Ｒ₇のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 In the data operation unit 12, since the lower 2 bits of the bundle index data are “01”, the selectors 230 and 232 generate and output “01xxx” as the register index data. Then, based on the given register index data “01xxx” by the selectors 224 and 226, the data of any one of the group registers R ₂ and R ₃ is selected and output to the computing unit 202.
The same applies to the case where the instruction code for setting the bundle number “2” or “3” is executed, and the data of any one register is selected from among the group registers R ₄ and R ₅ or R ₆ and R _7. To the calculator 202.

次に、バンドルモード２で演算を行う場合を説明する。
プログラムシーケンサ１８では、バンドルモード２に設定する命令コードが実行されると、バンドルモード２を示すバンドルモードデータがデータ演算部１２に出力される。そして、バンドル番号「０」を設定する命令コードが実行されると、バンドルインデックスデータとして「０００００」がデータ演算部１２に出力される。 Next, a case where calculation is performed in the bundle mode 2 will be described.
In the program sequencer 18, when an instruction code set to the bundle mode 2 is executed, bundle mode data indicating the bundle mode 2 is output to the data calculation unit 12. When the instruction code for setting the bundle number “0” is executed, “00000” is output to the data operation unit 12 as bundle index data.

データ演算部１２では、バンドルモードデータおよびバンドルインデックスデータが与えられると、バンドルインデックスデータの下位１ビットが「０」なので、セレクタ２３０，２３２により、「０ｘｘｘｘ」がレジスタインデックスデータとして生成・出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータ「０ｘｘｘｘ」に基づいて、グループレジスタＲ₀〜Ｒ₃のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 In the data operation unit 12, when bundle mode data and bundle index data are given, since the lower 1 bit of the bundle index data is “0”, the selectors 230 and 232 generate and output “0xxxx” as register index data. . Then, based on the given register index data “0xxxx” by the selectors 224 and 226, the data of any one of the group registers R _{0 to} R ₃ is selected and output to the arithmetic unit 202.

プログラムシーケンサ１８では、バンドル番号「１」を設定する命令コードが実行されると、バンドルインデックスデータとして「００００１」がデータ演算部１２に出力される。
データ演算部１２では、バンドルインデックスデータの下位１ビットが「１」なので、セレクタ２３０，２３２により、「１ｘｘｘｘ」がレジスタインデックスデータとして生成・出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータ「１ｘｘｘｘ」に基づいて、グループレジスタＲ₄〜Ｒ₇のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 In the program sequencer 18, when the instruction code for setting the bundle number “1” is executed, “00001” is output to the data operation unit 12 as bundle index data.
In the data operation unit 12, since the lower 1 bit of the bundle index data is “1”, “1xxxx” is generated and output as register index data by the selectors 230 and 232. Then, the selectors 224 and 226 select the data of any one of the group registers R _{4 to} R ₇ based on the given register index data “1xxx” and output the selected data to the arithmetic unit 202.

次に、バンドルモード３で演算を行う場合を説明する。
プログラムシーケンサ１８では、バンドルモード３に設定する命令コードが実行されると、バンドルモード３を示すバンドルモードデータがデータ演算部１２に出力される。
データ演算部１２では、バンドルモードデータが与えられると、セレクタ２３０，２３２により、選択されたエレメントインデックスデータがそのままレジスタインデックスデータとして出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータに基づいて、グループレジスタＲ₀〜Ｒ₇のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 Next, a case where calculation is performed in the bundle mode 3 will be described.
In the program sequencer 18, when the instruction code set to the bundle mode 3 is executed, bundle mode data indicating the bundle mode 3 is output to the data calculation unit 12.
In the data operation unit 12, when bundle mode data is given, the selector 230 and 232 output the selected element index data as it is as register index data. Then, the selectors 224 and 226 select data of any one of the group registers R _{0 to} R ₇ based on the given register index data, and output the selected data to the arithmetic unit 202.

次に、バンドルモード０で演算を行う場合を説明する。
プログラムシーケンサ１８では、バンドルモード０に設定する命令コードが実行されると、バンドルモード０を示すバンドルモードデータがデータ演算部１２に出力される。そして、バンドル番号「０」を設定する命令コードが実行されると、バンドルインデックスデータとして「０００００」がデータ演算部１２に出力される。 Next, a case where calculation is performed in the bundle mode 0 will be described.
In the program sequencer 18, when the instruction code to set the bundle mode 0 is executed, bundle mode data indicating the bundle mode 0 is output to the data calculation unit 12. When the instruction code for setting the bundle number “0” is executed, “00000” is output to the data operation unit 12 as bundle index data.

データ演算部１２では、バンドルモードデータおよびバンドルインデックスデータが与えられると、バンドルインデックスデータの下位３ビットが「０００」なので、セレクタ２３０，２３２により、「０００ｘｘ」がレジスタインデックスデータとして生成・出力される。そして、セレクタ２２４，２２６により、与えられたレジスタインデックスデータ「０００ｘｘ」に基づいて、グループレジスタＲ₀のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。 In the data operation unit 12, when bundle mode data and bundle index data are given, since the lower 3 bits of the bundle index data are “000”, the selectors 230 and 232 generate and output “000xx” as register index data. . Then, based on the given register index data “000xx” by the selectors 224 and 226, the data of any one register is selected from the group register R ₀ and output to the computing unit 202.

バンドル番号「１」〜「７」を設定する命令コードが実行された場合も同様であり、グループレジスタＲ₁，Ｒ₂，Ｒ₃，Ｒ₄，Ｒ₅，Ｒ₆またはＲ₇のなかからいずれか１つのレジスタのデータが選択されて演算器２０２に出力される。
このようにして、本実施の形態では、レジスタブロックを指定するバンドルインデックスデータが与えられたときに、与えられたバンドルインデックスデータ、およびインデックスレジスタのエレメントインデックスデータに基づいて、そのバンドルインデックスデータにより指定されるレジスタブロックおよびそのレジスタブロック内のデータの位置を示すレジスタインデックスデータを生成するセレクタ２３０，２３２を備える。 The same applies to the case where the instruction code for setting the bundle numbers “1” to “7” is executed, and any of the group registers R ₁ , R ₂ , R ₃ , R ₄ , R ₅ , R ₆ or R ₇ is selected. Data in one register is selected and output to the computing unit 202.
In this way, in this embodiment, when bundle index data specifying a register block is given, it is designated by the bundle index data based on the given bundle index data and the element index data of the index register. Selectors 230 and 232 for generating register index data indicating the register block to be processed and the position of data in the register block.

これにより、バンドルインデックスデータによって複数のレジスタブロックを選択的に使用することができるので、レジスタの使い勝手が向上し、プログラムを作成しやすくなる。
さらに、本実施の形態では、セレクタ２３０，２３２は、バンドルモードを示すバンドルモードデータが与えられたときに、与えられたバンドルモードデータ、バンドルインデックスデータおよびエレメントインデックスデータに基づいて、レジスタインデックスデータを生成するようになっている。
これにより、バンドルモードデータによってレジスタブロックの論理構造を変更することができるので、レジスタの使い勝手がさらに向上し、プログラムをさらに作成しやすくなる。 As a result, a plurality of register blocks can be selectively used according to the bundle index data, thereby improving the usability of the registers and facilitating the creation of a program.
Furthermore, in this embodiment, when the bundle mode data indicating the bundle mode is given, the selectors 230 and 232 select the register index data based on the given bundle mode data, bundle index data, and element index data. It is designed to generate.
Thereby, the logical structure of the register block can be changed by the bundle mode data, so that the usability of the register is further improved and the program can be more easily created.

上記第２の実施の形態において、セレクタ２２４，２２６は、請求項５記載のセレクタに対応し、セレクタ２３０，２３２は、請求項５または６記載のインデックスデータ生成手段に対応し、レジスタインデックスデータは、請求項５または６記載の第２エレメントインデックスデータに対応している。
なお、上記第１の実施の形態においては、グループレジスタＲ₀，Ｒ₁をレジスタブロックＡとしてグループ化したが、これに限らず、特定の２つのグループレジスタをレジスタブロックＡとしてグループ化すればよいので、他の組み合わせでグループ化することもできる。レジスタブロックＢについても同様である。 In the second embodiment, the selectors 224 and 226 correspond to the selector described in claim 5, the selectors 230 and 232 correspond to the index data generation means described in claim 5 or 6, and the register index data is , Corresponding to the second element index data.
In the first embodiment, the group registers R ₀ and R ₁ are grouped as the register block A. However, the present invention is not limited to this, and two specific group registers may be grouped as the register block A. So it can be grouped in other combinations. The same applies to the register block B.

また、上記第１の実施の形態においては、インデックスレジスタファイル２０８を設けて構成したが、これに限らず、インデックスレジスタを１つだけ設けて構成することもできる。この場合、セレクタ２１０，２１２も不要となる。
また、上記第２の実施の形態においては、インデックスレジスタファイル２２８を設けて構成したが、これに限らず、インデックスレジスタを１つだけ設けて構成することもできる。この場合、セレクタ２３０，２３２は選択機能を設けず、レジスタインデックスデータを生成する機能だけ設ければよい。 In the first embodiment, the index register file 208 is provided. However, the present invention is not limited to this, and only one index register may be provided. In this case, the selectors 210 and 212 are also unnecessary.
In the second embodiment, the index register file 228 is provided. However, the present invention is not limited to this, and only one index register may be provided. In this case, the selectors 230 and 232 need not have a selection function, but only a function for generating register index data.

また、上記第２の実施の形態においては、８つのグループレジスタＲ₀〜Ｒ₇を用いてレジスタブロックを構成したが、これに限らず、８つ未満のグループレジスタを用いてレジスタブロックを構成することもできるし、９つ以上のグループレジスタを用いてレジスタブロックを構成することもできる。また、バンドルインデックスデータおよびエレメントインデックスデータのビット数は、例えば、２つのグループレジスタを用いてレジスタブロックを構成する場合はそれぞれ３ビット、４つのグループレジスタを用いてレジスタブロックを構成する場合はそれぞれ４ビット、１６個のグループレジスタを用いてレジスタブロックを構成する場合はそれぞれ５ビットあればよい。いずれの場合も、上記第２の実施の形態と同じ要領で行うことができる。 In the second embodiment, the register block is configured using the eight group registers R _{0 to} R _7. However, the present invention is not limited to this, and the register block is configured using less than eight group registers. It is also possible to configure a register block using nine or more group registers. The number of bits of bundle index data and element index data is, for example, 3 bits each when a register block is configured using two group registers, and 4 bits when each register block is configured using four group registers. When a register block is configured by using 16 bits and 16 group registers, it is sufficient to have 5 bits each. In either case, the same procedure as in the second embodiment can be performed.

また、上記第２の実施の形態においては、８つのグループレジスタＲ₀〜Ｒ₇を用いてレジスタブロックを構成し、４つのバンドルモードを設定したが、これに限らず、バンドルモードを４つ未満として設定してもよい。この場合、バンドルインデックスデータのビット数は、３つのバンドルモード０〜２を設定する場合は４ビット、２つのバンドルモード０，１を設定する場合は３ビット、１つのバンドルモード０だけを設定する場合は２ビットあればよい。このことは、レジスタブロックの構成に用いるグループレジスタの数が８つ未満の場合、および９つ以上の場合であっても同様である。 In the second embodiment, a register block is configured using eight group registers R _{0 to} R ₇ and four bundle modes are set. However, the present invention is not limited to this, and the number of bundle modes is less than four. May be set as In this case, the number of bits of the bundle index data is 4 bits when three bundle modes 0 to 2 are set, 3 bits when two bundle modes 0 and 1 are set, and only one bundle mode 0 is set. In this case, only 2 bits are required. This is the same even when the number of group registers used in the configuration of the register block is less than 8 and when it is 9 or more.

また、上記第１および第２の実施の形態においては、データ演算部１２に１つの演算器２０２を設けて構成したが、これに限らず、データ演算部１２に複数の演算器２０２を設けて構成することもできる。上記第１の実施の形態の場合は、各演算器２０２ごとにセレクタ２０４，２０６，２１０，２１２を設ければよい。上記第２の実施の形態の場合は、各演算器２０２ごとにセレクタ２２４，２２６，２３０，２３２を設ければよい。 In the first and second embodiments, the data calculation unit 12 is provided with one calculation unit 202. However, the present invention is not limited to this, and the data calculation unit 12 is provided with a plurality of calculation units 202. It can also be configured. In the case of the first embodiment, selectors 204, 206, 210, and 212 may be provided for each computing unit 202. In the case of the second embodiment, selectors 224, 226, 230, and 232 may be provided for each computing unit 202.

また、上記第１および第２の実施の形態においては、エレメントインデックスデータまたはレジスタインデックスデータをセレクタ２０４，２２４に出力するセレクタ２１０，２３０と、エレメントインデックスデータまたはレジスタインデックスデータをセレクタ２０６，２２６に出力するセレクタ２１２，２３２とを設けて構成したが、これに限らず、セレクタ２１０，２３０を設けず、セレクタ２０４，２２４は、プログラムシーケンサ１８からのセレクトデータに基づいて、レジスタファイル２００のなかからいずれか１つのレジスタのデータを選択するように構成することもできる。また、セレクタ２１２，２３２を設けず、セレクタ２０６，２２６は、プログラムシーケンサ１８からのセレクトデータに基づいて、レジスタファイル２００のなかからいずれか１つのレジスタのデータを選択するように構成することもできる。 In the first and second embodiments, the selectors 210 and 230 that output the element index data or the register index data to the selectors 204 and 224, and the element index data or the register index data are output to the selectors 206 and 226. However, the present invention is not limited to this, and the selectors 210 and 224 are not provided, and the selectors 204 and 224 can be selected from the register file 200 based on select data from the program sequencer 18. Alternatively, the data of one register can be selected. Also, the selectors 212 and 232 are not provided, and the selectors 206 and 226 can be configured to select data of any one register from the register file 200 based on the select data from the program sequencer 18. .

セレクタ２１２，２３２を設けない後者の構成とした場合の動作を、図５のフローチャートに示す相関演算処理を援用して説明する。
ステップＳ１００では、ポインタレジスタＰ０にr[0]のデータアドレスを設定する命令コードが実行されると、データアドレス生成部１４では、r[0]のデータアドレスのうちグループアドレスがポインタレジスタＰ０に格納される。 The operation of the latter configuration without the selectors 212 and 232 will be described with the aid of the correlation calculation process shown in the flowchart of FIG.
In step S100, when the instruction code for setting the data address of r [0] in the pointer register P0 is executed, the data address generator 14 stores the group address in the data address of r [0] in the pointer register P0. Is done.

ステップＳ１０４では、レジスタＡ０の内容を初期化する命令コードが実行されると、データ演算部１２では、レジスタＡ０に「０」が設定される。
ステップＳ１０８では、ポインタレジスタＰ１が指し示すグループアドレスのデータをグループレジスタＲ₀に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からs[0]〜s[3]がグループレジスタＲ₀に読み込まれる。 In step S104, when an instruction code for initializing the contents of the register A0 is executed, the data operation unit 12 sets “0” in the register A0.
In step S108, when the instruction code for reading the data of the group address indicated by the pointer register P1 into the group register _R0 is executed, the data operation unit 12 stores s [0] to s [3] from the data memory 10 into the group register. Read into _R0 .

ステップＳ１２０では、ブロック演算命令コードが実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「０００」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[0]のデータs[0]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２０６により、レジスタＲ₂[0]のデータr[0]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[0]とr[0]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓに「１」が加算され、インデックスレジスタファイル２０８に格納される。 In step S 120, when the block operation instruction code is executed, the data operation unit 12 selects the element index data s “000” by the selector 210 based on the select data from the program sequencer 18 and outputs it to the selector 204. Then, the selector 204 selects the data s [0] of the register R ₀ [0] and outputs it to the computing unit 202. Further, based on the select data from the program sequencer 18, the selector 206 selects the data r [0] of the register R ₂ [0] and outputs it to the computing unit 202. Then, s [0] and r [0] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the first term of the above equation (1) is obtained. Further, “1” is added to the element index data s by the calculator 214 and stored in the index register file 208.

ステップＳ１２２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２１０により、エレメントインデックスデータｓ「００１」が選択されてセレクタ２０４に出力され、セレクタ２０４により、レジスタＲ₀[1]のデータs[1]が選択されて演算器２０２に出力される。また、プログラムシーケンサ１８からのセレクトデータに基づいて、セレクタ２０６により、レジスタＲ₂[1]のデータr[1]が選択されて演算器２０２に出力される。そして、演算器２０２により、s[1]とr[1]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項および第２項の演算結果が得られる。また、計算器２１４により、エレメントインデックスデータｓに「１」が加算され、インデックスレジスタファイル２０８に格納される。 In step S122, when the block operation instruction code is further executed, the data operation unit 12 selects the element index data s “001” by the selector 210 based on the select data from the program sequencer 18 and selects the selector 204. The selector 204 selects the data s [1] of the register R ₀ [1] and outputs it to the computing unit 202. Further, based on the select data from the program sequencer 18, the selector 206 selects the data r [1] of the register R ₂ [1] and outputs it to the computing unit 202. The computing unit 202 multiplies s [1] and r [1], adds the multiplication result to the value of the register A0, and stores the result in the register A0. Thereby, the calculation results of the first term and the second term of the above formula (1) are obtained. Further, “1” is added to the element index data s by the calculator 214 and stored in the index register file 208.

ステップＳ１３２では、ポインタレジスタＰ０が指し示すグループアドレスのデータをグループレジスタＲ₂に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からr[4]〜r[7]がグループレジスタＲ₂に読み込まれる。
ステップＳ１３４では、ポインタレジスタＰ０を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ０のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のr[7]を指し示すことになる。 In step S132, when the instruction code read data group address pointer register P0 pointed to a group register R ₂ is executed, the data calculating unit 12, r [4] from the data memory 10 ~r [7] is Group Register It is loaded into R _2.
In step S134, when the instruction code for adding the pointer register P0 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P0. On the data memory 10, it points to r [7] 4 bytes ahead.

ステップＳ１３６では、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[0]，Ｒ₂[0]のデータs[4]，r[4]がそれぞれ選択される。そして、演算器２０２により、s[4]とr[4]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第５項の演算結果が得られる。 In step S136, when the block operation instruction code is executed, the data operation unit 12 similarly uses the selectors 204 and 206 to store the data s [4] and r [] in the registers R ₁ [0] and R ₂ [0]. 4] is selected respectively. Then, s [4] and r [4] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation results of the first to fifth terms of the above equation (1) are obtained.

ステップＳ１３８では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[1]，Ｒ₂[1]のデータs[5]，r[5]がそれぞれ選択される。そして、演算器２０２により、s[5]とr[5]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第６項の演算結果が得られる。 In step S138, when the block operation instruction code is further executed, the data operation unit 12 similarly uses the selectors 204 and 206 to store the data s [5], Rs [1] and R2 [1] in the registers R ₁ [1] and R ₂ [1]. r [5] is selected. Then, s [5] and r [5] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 6th term of the above formula (1) is obtained.

ステップＳ１４０では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[2]，Ｒ₂[2]のデータs[6]，r[6]がそれぞれ選択される。そして、演算器２０２により、s[6]とr[6]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第７項の演算結果が得られる。 In step S140, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [2], data of _{R 2 [2] s [6} ], Each r [6] is selected. Then, s [6] and r [6] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 7th term of the above formula (1) is obtained.

ステップＳ１４２では、さらに、ブロック演算命令コードが実行されると、データ演算部１２では、同様に、セレクタ２０４，２０６により、レジスタＲ₁[3]，Ｒ₂[3]のデータs[7]，r[7]がそれぞれ選択される。そして、演算器２０２により、s[7]とr[7]が乗算され、その乗算結果がレジスタＡ０の値に加算されてレジスタＡ０に格納される。これにより、上式（１）の第１項〜第８項の演算結果が得られる。 In step S142, further, the block operation instruction code is executed, the data calculating unit 12, similarly, by the selector 204, the register R ₁ [3], the data s [7] of R ₂ [3], Each r [7] is selected. Then, s [7] and r [7] are multiplied by the arithmetic unit 202, and the multiplication result is added to the value of the register A0 and stored in the register A0. Thereby, the calculation result of the 1st term-the 8th term of the above formula (1) is obtained.

ステップＳ１４８では、ポインタレジスタＰ０が指し示すグループアドレスのデータをグループレジスタＲ₂に読み込む命令コードが実行されると、データ演算部１２では、データメモリ１０からr[8]〜r[11]がグループレジスタＲ₂に読み込まれる。
ステップＳ１５０では、ポインタレジスタＰ０を加算する命令コードが実行されると、データ演算部１２では、ポインタレジスタＰ０のグループアドレスが「１」加算される。データメモリ１０上では、４バイト先のr[12]を指し示すことになる。 In step S148, when the instruction code read data group address pointer register P0 pointed to a group register R ₂ is executed, the data calculating unit 12, r [8] from the data memory 10 ~r [11] is Group Register It is loaded into R _2.
In step S150, when the instruction code for adding the pointer register P0 is executed, the data operation unit 12 adds “1” to the group address of the pointer register P0. On the data memory 10, it points to r [12] 4 bytes ahead.

また、上記第１および第２の実施の形態においては、３２ビットＲＩＳＣ型のデジタル信号処理装置１００に適用したが、データメモリ１０に対するデータアクセスを、データメモリ１０の最小記憶単位よりも大きい単位で行うプロセッサであればどのようなプロセッサにも適用することができる。例えば、データメモリ１０の最小記憶単位が１バイトであれば、アクセス単位が１６ビット以上のプロセッサであり、データメモリ１０の最小記憶単位が２バイトであれば、アクセス単位が３２ビット以上のプロセッサである。 In the first and second embodiments, the present invention is applied to the 32-bit RISC type digital signal processing apparatus 100. However, data access to the data memory 10 is performed in a unit larger than the minimum storage unit of the data memory 10. The present invention can be applied to any processor as long as it performs. For example, if the minimum storage unit of the data memory 10 is 1 byte, the access unit is a processor of 16 bits or more. If the minimum storage unit of the data memory 10 is 2 bytes, the processor is an access unit of 32 bits or more. is there.

また、上記第１の実施の形態において、図５および図６のフローチャートに示す処理を実行するにあたってはいずれも、プログラムメモリ１６にあらかじめ格納されている制御プログラムを実行する場合について説明したが、これに限らず、これらの手順を示したプログラムが記憶された記憶媒体から、そのプログラムをプログラムメモリ１６に読み込んで実行するようにしてもよい。 In the first embodiment, the case where the control program stored in advance in the program memory 16 is executed in the processes shown in the flowcharts of FIGS. 5 and 6 has been described. However, the program may be read from the storage medium storing the program showing these procedures into the program memory 16 and executed.

ここで、記憶媒体とは、ＲＡＭ、ＲＯＭ等の半導体記憶媒体、ＦＤ、ＨＤ等の磁気記憶型記憶媒体、ＣＤ、ＣＤＶ、ＬＤ、ＤＶＤ等の光学的読取方式記憶媒体、ＭＯ等の磁気記憶型／光学的読取方式記憶媒体であって、電子的、磁気的、光学的等の読み取り方法のいかんにかかわらず、コンピュータで読み取り可能な記憶媒体であれば、あらゆる記憶媒体を含むものである。 Here, the storage medium is a semiconductor storage medium such as RAM or ROM, a magnetic storage type storage medium such as FD or HD, an optical reading type storage medium such as CD, CDV, LD, or DVD, or a magnetic storage type such as MO. / Optical reading type storage media, including any storage media that can be read by a computer regardless of electronic, magnetic, optical, or other reading methods.

また、上記第１および第２の実施の形態においては、本発明に係るデータ演算装置を、デジタル信号処理を行う３２ビットＲＩＳＣ型のデジタル信号処理装置１００においてデータ演算部１２に適用したが、これに限らず、本発明の主旨を逸脱しない範囲で他の場合にも適用可能である。 In the first and second embodiments, the data operation device according to the present invention is applied to the data operation unit 12 in the 32-bit RISC type digital signal processing device 100 that performs digital signal processing. However, the present invention can be applied to other cases without departing from the gist of the present invention.

デジタル信号処理装置１００のハードウェア構成を示すブロック図である。2 is a block diagram showing a hardware configuration of a digital signal processing apparatus 100. FIG. データアドレスのデータ構造を示す図である。It is a figure which shows the data structure of a data address. データ演算部１２のハードウェア構成を示すブロック図である。2 is a block diagram illustrating a hardware configuration of a data calculation unit 12. FIG. エレメントインデックスデータのデータ構造を示す図である。It is a figure which shows the data structure of element index data. プログラムシーケンサ１８で実行される処理を示すフローチャートである。3 is a flowchart showing processing executed by a program sequencer 18; プログラムシーケンサ１８で実行される処理を示すフローチャートである。3 is a flowchart showing processing executed by a program sequencer 18; データ演算部１２のハードウェア構成を示すブロック図である。2 is a block diagram illustrating a hardware configuration of a data calculation unit 12. FIG. レジスタブロックの構成を示す図である。It is a figure which shows the structure of a register block. エレメントインデックスデータのデータ構造を示す図である。It is a figure which shows the data structure of element index data. メモリ上のデータの配置を示す図である。It is a figure which shows arrangement | positioning of the data on memory. 従来のデジタル信号プロセッサの構成を示すブロック図である。It is a block diagram which shows the structure of the conventional digital signal processor. 従来のデジタル信号プロセッサの動作を説明するための図である。It is a figure for demonstrating operation | movement of the conventional digital signal processor.

Explanation of symbols

１００デジタル信号処理装置
１０データメモリ
１２プロセッサ
１４データアドレス生成部
１６プログラムメモリ
１８プログラムシーケンサ
２０バス
２００レジスタファイル
２０２演算器
２０４，２０６，２１０，２１２セレクタ
２０８，２２８インデックスレジスタファイル
２１４，２３４計算器
２１６セレクタ
２２４，２２６，２３０，２３２セレクタ
２３６セレクタ DESCRIPTION OF SYMBOLS 100 Digital signal processor 10 Data memory 12 Processor 14 Data address generation part 16 Program memory 18 Program sequencer 20 Bus 200 Register file 202 Calculator 204,206,210,212 Selector 208,228 Index register file 214,234 Calculator 216 Selector 224, 226, 230, 232 selector 236 selector

Claims

Data that includes a register file composed of a plurality of registers and a computing unit that performs computation using the data of the registers, and performs data access between the register file and the outside in a group register unit composed of the plurality of registers An arithmetic unit,
An index register for storing index data;
A selector that selects data of one or a plurality of the registers from at least two of the registers of the register file based on the index data of the index register and outputs the selected data to the arithmetic unit; A data operation device characterized by the above.

Data that includes a register file composed of a plurality of registers and a computing unit that performs computation using the data of the registers, and performs data access between the register file and the outside in a group register unit composed of the plurality of registers An arithmetic unit,
An index register for storing element index data for designating a position of data in the register block when grouping at least two of the group registers in the register file as a register block;
A data arithmetic device comprising: a selector that selects data of any one of the registers of the register block based on element index data of the index register and outputs the selected data to the arithmetic unit.

In claim 2,
Furthermore, an index register file composed of a plurality of the index registers,
A data selector comprising: a second selector that selects element index data of any one of the index registers from the index register file based on given select data and outputs the selected element index data to the selector .

In claim 3,
The data operation device further comprises index data updating means for updating the element index data output to the selector and storing it in the index register file.

In any one of Claims 2 thru | or 4,
Furthermore, when bundle index data designating the register block is given, the register block designated by the bundle index data based on the given bundle index data and the element index data of the index register, and Index data generating means for generating second element index data indicating the position of data in the register block;
The selector selects data of any one of the registers from a plurality of registers of the register block based on the second element index data generated by the index data generating means, and outputs the selected data to the arithmetic unit. A data operation device characterized by that.

In claim 5,
The index data generation means includes a first bundle mode for logically grouping two or more first predetermined number of the group registers as the register block, and a second predetermined number of the groups greater than the first predetermined number. When bundle mode data that designates one of the second bundle modes that logically group registers as the register block is given, the given bundle mode data, the bundle index data, and the element index data Based on this, the second element index data is generated.

Data that includes a register file composed of a plurality of registers and a computing unit that performs operations using the data of the registers, and performs data access between the register file and the outside in a group register unit composed of the plurality of registers A data calculation method for performing calculation using an arithmetic device,
A data reading step of reading data from the outside to each group register of the register block when grouping at least two of the group registers of the registers of the register file as a register block;
An index data storage step of storing in the index register element index data specifying the position of data in the register block;
A data selection step of selecting data of any one of the registers from the registers of the register block based on element index data of the index register and outputting the selected data to the computing unit;
A data operation method comprising: an operation step of performing an operation by the operation unit.

In claim 7,
The data calculation method further includes an iterative calculation step in which the index data storage step, the data selection step, and the calculation step are repeated.