JP2012150589A

JP2012150589A - Arithmetic unit

Info

Publication number: JP2012150589A
Application number: JP2011007719A
Authority: JP
Inventors: Masayuki Tokunaga; 将之徳永; Yoshifumi Yoshikawa; 宜史吉川
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2011-01-18
Filing date: 2011-01-18
Publication date: 2012-08-09

Abstract

PROBLEM TO BE SOLVED: To provide a processing unit improving use efficiency of computing resources and being able to prevent an increase in time required for processing of orders.SOLUTION: An arithmetic unit 100 of this embodiment comprises a program memory 21, an instruction fetch part 22 and a decode part 24. The program memory 21 memorizes an instruction A and an instruction B performing memory access processing to retrieve appointed data from a data memory 40 in stages mutually different of pipe line treatment. The instruction fetch part 22 fetches the instruction A and the instruction B at one time. The decode part 24 decodes the fetched instruction A and instruction B at one time.

Description

本発明の実施形態は、演算装置に関する。 Embodiments described herein relate generally to an arithmetic device.

従来、１つの命令の処理を複数の処理単位に分割し、各処理を別々のハードウェア回路で実行するパイプライン方式を採用した演算装置（例えばマイクロプロセッサ）が知られている。 2. Description of the Related Art Conventionally, there has been known an arithmetic device (for example, a microprocessor) adopting a pipeline system in which processing of one instruction is divided into a plurality of processing units and each processing is executed by a separate hardware circuit.

特開２００３−９９２４８号公報JP 2003-99248 A

パイプライン方式を採用した演算装置においては、例えばメモリなどの演算資源を効率的に利用することが求められている。本発明が解決しようとする課題は、演算資源の利用効率を向上させることが可能な演算装置を提供することである。 In an arithmetic device adopting a pipeline system, for example, it is required to efficiently use arithmetic resources such as a memory. The problem to be solved by the present invention is to provide a computing device capable of improving the utilization efficiency of computing resources.

実施形態の演算装置は、命令記憶部と、フェッチ部と、デコード部とを備える。命令記憶部は、演算資源を用いた演算処理を、パイプライン処理の相互に異なるステージで実行する第１命令および第２命令を記憶する。フェッチ部は、第１命令および第２命令を同時にフェッチする。デコード部は、フェッチされた第１命令および第２命令を同時にデコードする。 The arithmetic device according to the embodiment includes an instruction storage unit, a fetch unit, and a decoding unit. The instruction storage unit stores a first instruction and a second instruction that execute arithmetic processing using arithmetic resources at different stages of pipeline processing. The fetch unit fetches the first instruction and the second instruction at the same time. The decoding unit simultaneously decodes the fetched first instruction and second instruction.

本実施形態に係る演算装置のブロック図。The block diagram of the arithmetic unit which concerns on this embodiment. 実施形態に係るロードストアユニットのブロック図。The block diagram of the load store unit which concerns on embodiment. 命令Ａおよび命令Ｂのそれぞれの処理を説明するための図。The figure for demonstrating each process of the command A and the command B. FIG. 実施形態の演算装置の動作を示すタイミングチャート。The timing chart which shows operation | movement of the arithmetic unit of embodiment. 対比例の演算装置の動作を示すタイミングチャート。The timing chart which shows operation | movement of the arithmetic unit of a comparison. 対比例の演算装置の動作を示すタイミングチャート。The timing chart which shows operation | movement of the arithmetic unit of a comparison. 変形例の演算装置のブロック図。The block diagram of the arithmetic unit of a modification. 命令Ｃの処理を説明するための図。The figure for demonstrating the process of the command C. FIG. 変形例の演算装置の動作を示すタイミングチャート。The timing chart which shows operation | movement of the arithmetic unit of a modification.

図１は、本実施形態の演算装置１００の概略構成の一例を示すブロック図である。図１に示すように、演算装置１００は、制御ユニット１０と、ロードストアユニット２０と、データレジスタ３０と、データメモリ４０とを備える。演算資源の一例であるデータメモリ４０には各種のデータが格納される。 FIG. 1 is a block diagram illustrating an example of a schematic configuration of the arithmetic device 100 according to the present embodiment. As shown in FIG. 1, the arithmetic device 100 includes a control unit 10, a load / store unit 20, a data register 30, and a data memory 40. Various data are stored in the data memory 40, which is an example of a computing resource.

制御ユニット１０は、演算装置１００全体を制御する。ロードストアユニット２０は、制御ユニット１０からの指示を受けて、命令のフェッチと、フェッチした命令のデコードとを実行する。そして、ロードストアユニット２０は、デコードした命令に従って、データメモリ４０からのデータを読み出す処理（「メモリアクセス処理」と呼ぶ）を実行する。メモリアクセス処理の詳細な内容は後述する。また、データレジスタ３０は、メモリアクセス処理により読み出されたデータや後述のアドレス情報の算出に用いられる補正値（オフセット）を記憶する。 The control unit 10 controls the arithmetic device 100 as a whole. In response to an instruction from the control unit 10, the load / store unit 20 executes fetching of an instruction and decoding of the fetched instruction. Then, the load store unit 20 executes a process of reading data from the data memory 40 (referred to as “memory access process”) in accordance with the decoded instruction. Details of the memory access process will be described later. The data register 30 stores data read by the memory access process and a correction value (offset) used for calculating address information described later.

図２は、ロードストアユニット２０の詳細な構成の一例を示すブロック図である。図２に示すように、ロードストアユニット２０は、プログラムメモリ２１と、命令フェッチ部２２と、レジスタ２３ａ〜２３ｆと、デコード部２４と、アドレス情報算出部２５と、選択部２６と、読み出し部２７と、書き込み部２８とを備える。 FIG. 2 is a block diagram illustrating an example of a detailed configuration of the load store unit 20. As shown in FIG. 2, the load / store unit 20 includes a program memory 21, an instruction fetch unit 22, registers 23 a to 23 f, a decode unit 24, an address information calculation unit 25, a selection unit 26, and a reading unit 27. And a writing unit 28.

プログラムメモリ２１は、所定のプログラムコードで記述された２種類の命令（命令Ａ、命令Ｂ）を記憶する。本実施形態では、命令Ａおよび命令Ｂのそれぞれは、データメモリ４０からのデータの読み出しをロードストアユニット２０に行わせるための命令である。そして、命令Ａおよび命令Ｂのそれぞれの処理は、複数のステージ（工程）に分割され、相互に異なるステージで上述のメモリアクセス処理が行われる。さらに詳述すれば、図３に示すように、命令Ａの処理は、ステージＴ１とステージＴ２とステージＴ３とステージＴ４とに分割される。そして、ステージＴ１でフェッチ処理ＩＦが行われ、ステージＴ１の直後のステージＴ２でデコード処理ＩＤが行われ、ステージＴ２の直後のステージＴ３でメモリアクセス処理ＭＥＭが行われ、ステージＴ３の直後のステージＴ４でライトバック処理ＷＢが行われる。つまり、命令Ａの処理においては、第３段目のステージでメモリアクセス処理ＭＥＭが行われる。 The program memory 21 stores two types of instructions (instruction A and instruction B) described in a predetermined program code. In the present embodiment, each of the instruction A and the instruction B is an instruction for causing the load / store unit 20 to read data from the data memory 40. Each process of the instruction A and the instruction B is divided into a plurality of stages (processes), and the above-described memory access process is performed at different stages. More specifically, as shown in FIG. 3, the processing of the instruction A is divided into a stage T1, a stage T2, a stage T3, and a stage T4. Then, the fetch process IF is performed at stage T1, the decode process ID is performed at stage T2 immediately after stage T1, the memory access process MEM is performed at stage T3 immediately after stage T2, and stage T4 immediately after stage T3. Then, the write back processing WB is performed. That is, in the process of the instruction A, the memory access process MEM is performed in the third stage.

また、図３に示すように、命令Ｂの処理は、ステージＴ１とステージＴ２とステージＴ３とステージＴ４とステージＴ５とに分割される。そして、ステージＴ１でフェッチ処理ＩＦが行われ、ステージＴ２でデコード処理ＩＤが行われ、ステージＴ３でアドレス算出処理ＥＸが行われ、ステージＴ４でメモリアクセス処理ＭＥＭが行われ、ステージＴ５でライトバック処理ＷＢが行われる。つまり、命令Ｂの処理においては、第４段目のステージでメモリアクセス処理ＭＥＭが行われるので、命令Ａの処理に比べて、メモリアクセス処理ＭＥＭが行われるまでのステージ数が１だけ多い。各処理の詳細な内容については後述する。なお、本実施形態では、各ステージ（Ｔ１〜Ｔ５）の時間長は同じである。つまり、各ステージの処理に必要なクロックサイクル数は同じである。 Further, as shown in FIG. 3, the processing of instruction B is divided into stage T1, stage T2, stage T3, stage T4, and stage T5. Then, a fetch process IF is performed at stage T1, a decode process ID is performed at stage T2, an address calculation process EX is performed at stage T3, a memory access process MEM is performed at stage T4, and a write back process is performed at stage T5. WB is performed. That is, in the process of the instruction B, the memory access process MEM is performed in the fourth stage, so that the number of stages until the memory access process MEM is performed is one more than the process of the instruction A. Detailed contents of each process will be described later. In the present embodiment, the time length of each stage (T1 to T5) is the same. That is, the number of clock cycles required for the processing of each stage is the same.

再び図２に戻って説明を続ける。命令フェッチ部２２は、プログラムメモリ２１から、２つの命令を同時にフェッチできるように構成される。例えば命令フェッチ部２２は、２つのフェッチ回路から構成されてもよいし、ＶＬＩＷ（Very Long Instruction Word）方式が採用された構成であってもよい。命令フェッチ部２２は、制御ユニット１０からの指示に従って、プログラムメモリ２１に格納された命令Ａと命令Ｂを同時にフェッチする。より具体的には以下のとおりである。制御ユニット１０は、命令フェッチ部２２に対して、プログラムメモリ２１のうち、読み出すべき命令Ａおよび命令Ｂのそれぞれが格納された領域を示す命令アドレス情報を供給する。そして、命令フェッチ部２２は、制御ユニット１０からの命令アドレス情報を参照して、命令Ａおよび命令Ｂのそれぞれをプログラムメモリ２１から読み出し、読み出した命令Ａをレジスタ２３ａに書き込み、読み出した命令Ｂをレジスタ２３ｂに書き込む。 Returning to FIG. 2 again, the description will be continued. The instruction fetch unit 22 is configured to be able to fetch two instructions simultaneously from the program memory 21. For example, the instruction fetch unit 22 may be configured by two fetch circuits, or may be configured by employing a VLIW (Very Long Instruction Word) method. The instruction fetch unit 22 fetches the instruction A and the instruction B stored in the program memory 21 at the same time in accordance with an instruction from the control unit 10. More specifically, it is as follows. The control unit 10 supplies the instruction fetch unit 22 with instruction address information indicating areas in the program memory 21 in which the instruction A and the instruction B to be read are stored. Then, the instruction fetch unit 22 refers to the instruction address information from the control unit 10, reads each of the instruction A and the instruction B from the program memory 21, writes the read instruction A into the register 23a, and reads the read instruction B. Write to register 23b.

デコード部２４は、２つの命令を同時にデコードできるように構成されている。例えばデコード部２４は、２つのデコード回路から構成されてもよいし、ＶＬＩＷ方式が採用された構成であってもよい。デコード部２４は、レジスタ２３ａに書き込まれた命令Ａおよびレジスタ２３ｂに書き込まれた命令Ｂのそれぞれを同時にデコードする。本実施形態では、命令Ａを構成するプログラムコード内には、データメモリ４０のうち、読み出すべきデータが格納された領域を示すアドレス情報が記述されているので、デコード部２４は、デコードした命令Ａから、直ちにアドレス情報を特定することができる。そして、デコード部２４は、デコードした命令Ａから特定したアドレス情報をレジスタ２３ｃに書き込み、レジスタ２３ｃに書き込まれたアドレス情報は、後段の選択部２６へ供給される。 The decoding unit 24 is configured to be able to decode two instructions simultaneously. For example, the decoding unit 24 may be configured by two decoding circuits, or may be configured by adopting the VLIW method. The decode unit 24 simultaneously decodes the instruction A written in the register 23a and the instruction B written in the register 23b. In the present embodiment, address information indicating an area where data to be read out is stored in the data memory 40 is described in the program code constituting the instruction A. Therefore, the decoding unit 24 decodes the decoded instruction A. From this, address information can be identified immediately. Then, the decoding unit 24 writes the address information specified from the decoded instruction A to the register 23c, and the address information written to the register 23c is supplied to the selection unit 26 at the subsequent stage.

一方、命令Ｂを構成するプログラムコード内には、上述のアドレス情報は記述されておらず、アドレス情報を算出するための算出情報が記述されている。本実施形態では、算出情報は、アドレス情報の算出の際の基準値Ｉを示す基準情報と、データレジスタ３０のうち、基準値Ｉに加算すべき補正値Ｒが格納された領域を示すレジスタ情報とからなる。デコード部２４は、デコードした命令Ｂから特定した基準値Ｉをレジスタ２３ｄへ書き込み、レジスタ２３ｄに書き込まれた基準値Ｉは後段のアドレス情報算出部２５へ供給される。また、デコード部２４は、デコードした命令Ｂから特定したレジスタ情報をデータレジスタ３０へ供給する。データレジスタ３０は、デコード部２４からのレジスタ情報を受け付け、その受け付けたレジスタ情報が示す領域に格納された補正値Ｒを読み出す。そして、データレジスタ３０は、その読み出した補正値Ｒをレジスタ２３ｆへ供給する。 On the other hand, in the program code constituting the instruction B, the above address information is not described, but calculation information for calculating the address information is described. In the present embodiment, the calculation information includes reference information indicating the reference value I at the time of calculating the address information, and register information indicating a region in the data register 30 in which the correction value R to be added to the reference value I is stored. It consists of. The decoding unit 24 writes the reference value I specified from the decoded instruction B to the register 23d, and the reference value I written to the register 23d is supplied to the subsequent address information calculation unit 25. In addition, the decoding unit 24 supplies the register information specified from the decoded instruction B to the data register 30. The data register 30 receives the register information from the decoding unit 24 and reads the correction value R stored in the area indicated by the received register information. Then, the data register 30 supplies the read correction value R to the register 23f.

アドレス情報算出部２５は、レジスタ２３ｄから供給される基準値Ｉと、レジスタ２３ｆから供給される補正値Ｒとからアドレス情報を算出する。アドレス情報算出部２５は例えば加算器で構成され、レジスタ２３ｄから供給される基準値Ｉと、レジスタ２３ｆから供給される補正値Ｒとの加算結果を示す値がアドレス情報となる。アドレス情報算出部２５で算出されたアドレス情報はレジスタ２３ｅに書き込まれる。レジスタ２３ｅに書き込まれたアドレス情報は後段の選択部２６へ供給される。 The address information calculation unit 25 calculates address information from the reference value I supplied from the register 23d and the correction value R supplied from the register 23f. The address information calculation unit 25 is configured by an adder, for example, and a value indicating the addition result of the reference value I supplied from the register 23d and the correction value R supplied from the register 23f is address information. The address information calculated by the address information calculation unit 25 is written in the register 23e. The address information written in the register 23e is supplied to the selection unit 26 at the subsequent stage.

選択部２６は、レジスタ２３ｃに書き込まれたアドレス情報（つまりは、命令Ａによって指定されたアドレス情報）、および、レジスタ２３ｅに書き込まれたアドレス情報（つまりは、命令Ｂによって指定されたアドレス情報）のうちの何れか一方を選択して後段の読み出し部２７へ供給する。選択部２６は、例えばマルチプレクサなどの選択回路で構成され、一方の入力端には、レジスタ２３ｃに書き込まれたアドレス情報（命令Ａによって指定されたアドレス情報）が供給され、他方の入力端には、レジスタ２３ｅに書き込まれたアドレス情報（命令Ｂによって指定されたアドレス情報）が供給される。また、選択部２６には、デコード部２４からの選択制御信号（不図示）が供給される。選択部２６は、デコード部２４からの選択制御信号に応じて、一方の入力端および他方の入力端のうちの何れかに供給されるデータ（アドレス情報）を、後段の読み出し部２７へ供給する。 The selection unit 26 has address information written in the register 23c (that is, address information specified by the instruction A) and address information written in the register 23e (that is, address information specified by the instruction B). Any one of them is selected and supplied to the subsequent reading unit 27. The selection unit 26 includes a selection circuit such as a multiplexer, for example. Address information (address information specified by the instruction A) written in the register 23c is supplied to one input end, and the other input end receives The address information (address information specified by the instruction B) written in the register 23e is supplied. The selection unit 26 is supplied with a selection control signal (not shown) from the decoding unit 24. The selection unit 26 supplies data (address information) supplied to one of the one input terminal and the other input terminal to the subsequent reading unit 27 in accordance with the selection control signal from the decoding unit 24. .

読み出し部２７は、データメモリ４０のうち、選択部２６から供給されたアドレス情報が示す領域に格納されたデータを読み出す。読み出し部２７は、読み出したデータを書き込み部２８へ供給する。書き込み部２８は、読み出し部２７から供給されたデータをデータレジスタ３０の所定の領域に書き込む。 The reading unit 27 reads data stored in the area indicated by the address information supplied from the selection unit 26 in the data memory 40. The reading unit 27 supplies the read data to the writing unit 28. The writing unit 28 writes the data supplied from the reading unit 27 in a predetermined area of the data register 30.

図４は、演算装置１００の動作を説明するためのタイミングチャートである。以下、図４を参照しながら、演算装置１００の具体的な動作を説明する。本実施形態では、演算装置１００は、命令Ａと命令Ｂとをパイプライン方式で同時に実行する。以下、詳細な内容について説明する。ここでは、図４の各期間（Ｔ１１〜Ｔ５５）の時間長は、上述の各ステージ（Ｔ１〜Ｔ５）の時間長と同じである。 FIG. 4 is a timing chart for explaining the operation of the arithmetic device 100. Hereinafter, a specific operation of the arithmetic device 100 will be described with reference to FIG. In the present embodiment, the arithmetic unit 100 executes the instruction A and the instruction B simultaneously in a pipeline manner. Details will be described below. Here, the time length of each period (T11 to T55) in FIG. 4 is the same as the time length of each stage (T1 to T5) described above.

図４に示すように、最初の期間Ｔ１１は、命令Ａおよび命令Ｂのそれぞれの処理におけるステージＴ１（図３参照）となり、命令Ａのフェッチ処理ＩＦと命令Ｂのフェッチ処理ＩＦとが実行される。より具体的には、命令フェッチ部２２は、プログラムメモリ２１のうち、制御ユニット１０から供給された命令アドレス情報が示すそれぞれの領域から、命令Ａと命令Ｂを同時に読み出し、読み出した命令Ａをレジスタ２３ａに書き込み、読み出した命令Ｂをレジスタ２３ｂに書き込む。 As shown in FIG. 4, the first period T11 is stage T1 (see FIG. 3) in the processing of instruction A and instruction B, and the fetch processing IF of instruction A and the fetch processing IF of instruction B are executed. . More specifically, the instruction fetch unit 22 simultaneously reads the instruction A and the instruction B from the respective areas indicated by the instruction address information supplied from the control unit 10 in the program memory 21 and registers the read instruction A in the register. Write to the register 23b and write the read instruction B to the register 23b.

図４に示すように、期間Ｔ１１の直後の期間Ｔ２２は、命令Ａおよび命令Ｂのそれぞれの処理におけるステージＴ２（図３参照）となり、命令Ａのデコード処理ＩＤと命令Ｂのデコード処理ＩＤとが実行される。より具体的には、デコード部２４は、レジスタ２３ａに書き込まれた命令Ａとレジスタ２３ｂに書き込まれた命令Ｂとを同時にデコードする。そして、デコード部２４は、デコードした命令Ａから特定したアドレス情報をレジスタ２３ｃに書き込む。また、デコード部２４は、デコードした命令Ｂから特定した基準値Ｉをレジスタ２３ｄへ書き込むとともに、デコードした命令Ｂの内容から特定したレジスタ情報をデータレジスタ３０へ供給する。 As shown in FIG. 4, a period T22 immediately after the period T11 is a stage T2 (see FIG. 3) in each process of the instruction A and the instruction B, and the decoding process ID of the instruction A and the decoding process ID of the instruction B are Executed. More specifically, the decoding unit 24 simultaneously decodes the instruction A written in the register 23a and the instruction B written in the register 23b. Then, the decoding unit 24 writes the address information specified from the decoded instruction A to the register 23c. The decoding unit 24 writes the reference value I specified from the decoded instruction B to the register 23 d and supplies the register information specified from the content of the decoded instruction B to the data register 30.

図４に示すように、期間Ｔ２２の直後の期間Ｔ３３は、命令Ａおよび命令Ｂのそれぞれの処理におけるステージＴ３（図３参照）となり、命令Ａのメモリアクセス処理ＭＥＭが実行される一方、命令Ｂのアドレス算出処理ＥＸが実行される。より具体的には以下のとおりである。期間Ｔ３３では、図２の選択部２６には、一方の入力端に供給されるデータ（つまり、レジスタ２３ｃに書き込まれた、命令Ａによって指定されたアドレス情報）の選択を指示する選択制御信号がデコード部２４から供給される。これにより、読み出し部２７には、命令Ａによって指定されたアドレス情報が供給される。そして、読み出し部２７は、データメモリ４０のうち、選択部２６から供給されたアドレス情報が示す領域に格納されたデータを読み出す。以上が命令Ａのメモリアクセス処理ＭＥＭの内容である。 As shown in FIG. 4, a period T33 immediately after the period T22 is a stage T3 (see FIG. 3) in each processing of the instruction A and the instruction B, and the memory access process MEM of the instruction A is executed while the instruction B The address calculation process EX is executed. More specifically, it is as follows. In the period T33, the selection unit 26 in FIG. 2 receives a selection control signal for instructing selection of data supplied to one input terminal (that is, address information specified by the instruction A written in the register 23c). Supplied from the decoding unit 24. As a result, the address information specified by the instruction A is supplied to the reading unit 27. Then, the reading unit 27 reads the data stored in the area indicated by the address information supplied from the selection unit 26 in the data memory 40. The above is the contents of the memory access processing MEM of the instruction A.

また、期間Ｔ３３では、データレジスタ３０は、デコード部２４からのレジスタ情報が示す領域に格納された補正値Ｒを読み出し、その読み出した補正値Ｒをレジスタ２３ｆへ供給する。そして、アドレス情報算出部２５は、レジスタ２３ｄに書き込まれた基準値Ｉと、レジスタ２３ｆから供給された補正値Ｒとを加算して、命令Ｂによって指定されたアドレス情報を算出し、その算出したアドレス情報をレジスタ２３ｅに書き込む。以上が命令Ｂのアドレス算出処理ＥＸの内容である。 In the period T33, the data register 30 reads the correction value R stored in the area indicated by the register information from the decoding unit 24, and supplies the read correction value R to the register 23f. Then, the address information calculation unit 25 calculates the address information specified by the instruction B by adding the reference value I written in the register 23d and the correction value R supplied from the register 23f. Address information is written to the register 23e. The above is the content of the instruction B address calculation processing EX.

図４に示すように、期間Ｔ３３の直後の期間Ｔ４４は、命令Ａおよび命令Ｂのそれぞれの処理におけるステージＴ４（図３参照）となり、命令Ａのライトバック処理ＷＢが実行される一方、命令Ｂのメモリアクセス処理ＭＥＭが実行される。より具体的には以下のとおりである。期間Ｔ４４では、書き込み部２８は、ステージＴ３で読み出し部２７によって読み出されたデータ（命令Ａによって読み出しが指示されたデータ）を読み出し部２７から受け取り、その受け取ったデータをデータレジスタ３０の所定の領域に書き込む。以上が命令Ａのライトバック処理ＷＢの内容であり、期間Ｔ４４の終了とともに、命令Ａの処理は終了する。 As shown in FIG. 4, a period T44 immediately after the period T33 is a stage T4 (see FIG. 3) in each process of the instruction A and the instruction B, and the write-back process WB of the instruction A is executed, while the instruction B The memory access process MEM is executed. More specifically, it is as follows. In the period T44, the writing unit 28 receives the data read by the reading unit 27 in step T3 (data instructed to be read by the instruction A) from the reading unit 27, and receives the received data in a predetermined register of the data register 30. Write to the area. The above is the content of the write-back process WB of the instruction A, and the process of the instruction A ends with the end of the period T44.

また、期間Ｔ４４では、図２の選択部２６には、他方の入力端子に供給されるデータ（つまり、レジスタ２３ｅに書き込まれた、命令Ｂによって指定されたアドレス情報）の選択を指示する選択制御信号がデコード部２４から供給される。これにより、読み出し部２７には、命令Ｂによって指定されたアドレス情報が供給される。そして、読み出し部２７は、データメモリ４０のうち、選択部２６から供給されたアドレス情報が示す領域に格納されたデータを読み出す。以上が命令Ｂのメモリアクセス処理ＭＥＭの内容である。 Further, in the period T44, the selection control instructing the selection unit 26 in FIG. 2 to select the data supplied to the other input terminal (that is, the address information specified by the instruction B written in the register 23e). A signal is supplied from the decoding unit 24. As a result, the address information specified by the instruction B is supplied to the reading unit 27. Then, the reading unit 27 reads the data stored in the area indicated by the address information supplied from the selection unit 26 in the data memory 40. The above is the contents of the memory access processing MEM of the instruction B.

図４に示すように、期間Ｔ４４の直後の期間Ｔ５５は、命令Ｂの処理におけるステージＴ５（図３参照）となり、命令Ｂのライトバック処理ＷＢのみが実行される。より具体的には、期間Ｔ５５では、書き込み部２８は、期間Ｔ４４で読み出し部２７によって読み出されたデータ（命令Ｂによって読み出しが指示されたデータ）を読み出し部２７から受け取り、その受け取ったデータをデータレジスタ３０の所定の領域に書き込む。以上が命令Ｂのライトバック処理ＷＢの内容であり、期間Ｔ５５の終了とともに、命令Ｂの処理は終了する。 As shown in FIG. 4, a period T55 immediately after the period T44 is a stage T5 (see FIG. 3) in the process of the instruction B, and only the write back process WB of the instruction B is executed. More specifically, in the period T55, the writing unit 28 receives the data read by the reading unit 27 in the period T44 (data instructed to be read by the instruction B) from the reading unit 27, and receives the received data. Write to a predetermined area of the data register 30. The above is the content of the write-back process WB of the instruction B, and the process of the instruction B ends with the end of the period T55.

以上に説明したように、本実施形態によれば、メモリアクセス処理ＭＥＭが行われるステージが相互に異なる命令Ａおよび命令Ｂが同時に実行されるので、データメモリ４０の利用効率を向上させることができる。より具体的には、図４に示すように、本実施形態では、メモリアクセス処理ＭＥＭが行われるステージが「１」だけ相互に異なる命令Ａおよび命令Ｂが同時に実行されるので、データメモリ４０を、期間Ｔ３３および期間Ｔ４４にわたって連続して動作させることができる。これにより、データメモリ４０の利用効率を向上させることができる。また、本実施形態によれば、命令Ａのメモリアクセス処理ＭＥＭと命令Ｂのメモリアクセス処理ＭＥＭとが同じタイミングで行われることを防止できるので、命令の処理に要する時間が増大することを防止することもできる。 As described above, according to the present embodiment, since the instruction A and the instruction B that are different from each other in the stage in which the memory access process MEM is performed are executed at the same time, the utilization efficiency of the data memory 40 can be improved. . More specifically, as shown in FIG. 4, in the present embodiment, since the instruction A and the instruction B that are different from each other by “1” are executed simultaneously, the data memory 40 is stored in the memory access process MEM. The operation can be continued over the period T33 and the period T44. Thereby, the utilization efficiency of the data memory 40 can be improved. In addition, according to the present embodiment, it is possible to prevent the memory access process MEM of the instruction A and the memory access process MEM of the instruction B from being performed at the same timing, and thus it is possible to prevent an increase in time required for the instruction process. You can also.

ここで、パイプライン処理の相互に異なるステージでメモリアクセス処理ＭＥＭが行われる２つの命令を順番に実行する場合を想定する。２つの命令のうち、メモリアクセス処理ＭＥＭが行われるまでのステージ数が少ない方の命令を第１命令、メモリアクセス処理が行われるまでのステージ数が多い方の命令を第２命令として、第１命令→第２命令の順番でそれぞれの命令を実行する場合は、第１命令のメモリアクセス処理が行われてから、第２命令のメモリアクセス処理が行われるまでの期間が長くなるので、メモリが動作せずに待機状態となる期間も長くなる。これにより、メモリの利用効率が低くなるという問題が起こる。 Here, it is assumed that two instructions for performing the memory access process MEM are executed in order at different stages of the pipeline process. Of the two instructions, an instruction having a smaller number of stages until the memory access process MEM is performed is a first instruction, and an instruction having a larger number of stages until the memory access process is performed is a second instruction. When each instruction is executed in the order of the instruction → the second instruction, since the period from the memory access process of the first instruction until the memory access process of the second instruction is performed becomes longer, The period during which the apparatus is in a standby state without operating is also lengthened. This causes a problem that the use efficiency of the memory is lowered.

例えば図５−１に示すように、命令Ａ→命令Ｂの順番で実行する場合は、期間Ｔ３３において命令Ａのメモリアクセス処理ＭＥＭが行われ、期間Ｔ５５において命令Ｂのメモリアクセス処理ＭＥＭが行われる。すなわち、期間Ｔ４４においてデータメモリ４０は動作せずに待機状態となるので、データメモリ４０の利用効率が低くなる。 For example, as shown in FIG. 5A, when the instructions A are executed in the order of the instruction B, the memory access process MEM of the instruction A is performed in the period T33, and the memory access process MEM of the instruction B is performed in the period T55. . That is, since the data memory 40 is not operated in the period T44 and is in a standby state, the utilization efficiency of the data memory 40 is lowered.

一方、第２命令→第１命令の順番でそれぞれの命令を実行する場合は、第１命令のメモリアクセス処理ＭＥＭと、第２命令のメモリアクセス処理ＭＥＭとが同じタイミングで行われるおそれがある。この場合、第２命令のメモリアクセス処理ＭＥＭが終了するまで、第１命令のメモリアクセス処理ＭＥＭを実行できないので、結果として、命令の処理に要する時間が増大するという問題が起こる。 On the other hand, when the respective instructions are executed in the order of the second instruction → the first instruction, the memory access process MEM for the first instruction and the memory access process MEM for the second instruction may be performed at the same timing. In this case, since the memory access process MEM for the first instruction cannot be executed until the memory access process MEM for the second instruction is completed, there arises a problem that the time required for the instruction process increases as a result.

例えば図５−２に示すように、命令Ｂ→命令Ａの順番で実行する場合は、命令Ｂのメモリアクセス処理ＭＥＭが行われる期間と、命令Ａのメモリアクセス処理ＭＥＭが行われる期間とが共に期間Ｔ４４となってしまうので、命令Ｂのメモリアクセス処理ＭＥＭが終了するまで他の命令Ａのメモリアクセス処理ＭＥＭを行うことができず、結果として、命令の処理に要する時間が増大してしまう。 For example, as shown in FIG. 5B, when the instruction B is executed in the order of the instruction A, both the period during which the memory access process MEM for the instruction B is performed and the period during which the memory access process MEM for the instruction A is performed Since the period T44 is reached, the memory access process MEM of the other instruction A cannot be performed until the memory access process MEM of the instruction B is completed, and as a result, the time required for the instruction process increases.

以上より、パイプライン処理の相互に異なるステージでメモリアクセス処理ＭＥＭが行われる２つの命令を順番に実行する場合は、データメモリ４０の利用効率が低くなる、または、命令の処理に要する時間が増大するといった問題が起こる。 As described above, when two instructions for which the memory access process MEM is performed at different stages of the pipeline process are executed in order, the use efficiency of the data memory 40 is lowered or the time required for the instruction process is increased. Problem occurs.

これに対して、本実施形態によれば、データメモリ４０を、期間Ｔ３３および期間Ｔ４４にわたって連続して動作させることができるので、図５−１の場合に比べて、データメモリの利用効率を向上させることが可能になる。また、本実施形態によれば、命令Ａのメモリアクセス処理ＭＥＭが行われる期間と命令Ｂのメモリアクセス処理ＭＥＭが行われる期間とが同じ期間になることを防止できるので、図５−２の場合とは異なり、命令の処理に要する時間が増大することを防止できる。すなわち、本実施形態によれば、データメモリ４０の利用効率を向上させるとともに命令の処理に要する時間が増大することを防止できる。 On the other hand, according to the present embodiment, the data memory 40 can be operated continuously over the period T33 and the period T44, so that the utilization efficiency of the data memory is improved as compared with the case of FIG. It becomes possible to make it. Further, according to the present embodiment, the period during which the memory access process MEM for the instruction A is performed and the period during which the memory access process MEM for the instruction B are performed can be prevented from being the same as in the case of FIG. Unlike this, it is possible to prevent an increase in the time required for processing an instruction. That is, according to the present embodiment, it is possible to improve the utilization efficiency of the data memory 40 and to prevent an increase in time required for instruction processing.

（変形例）
以上、本発明の実施形態を説明したが、この実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 (Modification)
As mentioned above, although embodiment of this invention was described, this embodiment is shown as an example and is not intending limiting the range of invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

例えば、上述の実施形態では、メモリアクセス処理が行われるステージが「１」だけ相互に異なる２つの命令（命令Ａ、命令Ｂ）が同時に実行されているが、これに限らず、例えばメモリアクセス処理が行われるステージが「２」だけ相互に異なる２つの命令が同時に実行されてもよい。要するに、メモリアクセス処理が行われるステージが相互に異なる２つの命令が同時に実行されるものであればよい。 For example, in the above-described embodiment, two instructions (instruction A and instruction B) that are different from each other by “1” are executed at the same time. However, the present invention is not limited to this. Two instructions that are different from each other by “2” may be executed simultaneously. In short, it is only necessary that two instructions having different stages in which memory access processing is performed are executed simultaneously.

上述の実施形態では、データレジスタ３０には、補正値Ｒと、メモリアクセス処理によって読み出されたデータとが格納されているが、例えば補正値Ｒが格納されるレジスタと、メモリアクセス処理によって読み出されたデータとが別々のレジスタに格納される構成であってもよい。 In the above embodiment, the data register 30 stores the correction value R and the data read by the memory access process. For example, the data register 30 stores the correction value R and the data read by the memory access process. The configuration may be such that the output data is stored in separate registers.

上述の実施形態では、多重化して設けることができない演算資源の一例としてデータメモリを挙げ、その演算資源を用いた演算処理の一例としてメモリアクセス処理を挙げて説明したが、これに限らず、上述の演算資源や演算資源を用いた演算処理の内容は任意である。 In the above-described embodiment, a data memory has been described as an example of an operation resource that cannot be multiplexed, and a memory access process has been described as an example of an operation process using the operation resource. The contents of the calculation processing using the calculation resources and the calculation resources are arbitrary.

上述の実施形態では、基準値Iと補正値Ｒとが加算されることでアドレス情報が算出されているが、これに限らず、アドレス情報の算出方法は任意である。例えばデータレジスタ３０には、アドレス情報の算出に用いられる各種のパラメータ値が記憶され、命令Ｂによって指定された複数（例えば２つ）のパラメータ値に基づいてアドレス情報が算出される構成であってもよい。 In the above-described embodiment, the address information is calculated by adding the reference value I and the correction value R. However, the present invention is not limited to this, and the address information calculation method is arbitrary. For example, the data register 30 stores various parameter values used for calculating the address information, and the address information is calculated based on a plurality (for example, two) of parameter values specified by the instruction B. Also good.

上述の実施形態では、同時にフェッチ、デコードする命令は２つであるが、これに限らず、例えば、メモリアクセス処理ＭＥＭが行われるステージが相互に異なる３つ以上の命令を同時にフェッチ、デコードすることも可能である。以下、一例を挙げて説明する。図６は、メモリアクセス処理ＭＥＭが行われるステージが「１」だけ相互に異なる命令Ａ、命令Ｂおよび命令Ｃを同時に実行する演算装置２００の構成の一例を示すブロック図である。図６に示すように、ロードストアユニット２０は、上述の実施形態で説明した内容に加えて、レジスタ２３ｇ、２３ｈ、２３ｉ、２３ｊおよび２３ｋと、アドレス情報算出部５０と、アドレス情報算出部５１とをさらに備える。また、命令フェッチ部２２は、プログラムメモリ２１から、３つの命令を同時にフェッチできるように構成される。さらに、デコード部２４は、３つの命令を同時にデコードできるように構成されている。 In the above-described embodiment, two instructions are fetched and decoded at the same time. However, the present invention is not limited to this. For example, fetching and decoding three or more instructions at different stages in which the memory access processing MEM is performed is simultaneously performed. Is also possible. Hereinafter, an example will be described. FIG. 6 is a block diagram illustrating an example of the configuration of the arithmetic device 200 that simultaneously executes the instruction A, the instruction B, and the instruction C that are different from each other by “1” in the stage where the memory access process MEM is performed. As shown in FIG. 6, in addition to the contents described in the above-described embodiment, the load / store unit 20 includes registers 23g, 23h, 23i, 23j, and 23k, an address information calculation unit 50, an address information calculation unit 51, Is further provided. The instruction fetch unit 22 is configured to be able to fetch three instructions simultaneously from the program memory 21. Further, the decoding unit 24 is configured to be able to decode three instructions simultaneously.

図６の構成例では、プログラムメモリ２１は、所定のプログラムコードで記述された３種類の命令（命令Ａ、命令Ｂ、命令Ｃ）を記憶する。図７に示すように、命令Ｃの処理は、ステージＴ１とステージＴ２とステージＴ３とステージＴ４とステージＴ５とステージＴ６とに分割される。 In the configuration example of FIG. 6, the program memory 21 stores three types of instructions (instruction A, instruction B, and instruction C) described in a predetermined program code. As shown in FIG. 7, the processing of instruction C is divided into stage T1, stage T2, stage T3, stage T4, stage T5, and stage T6.

図７に示すように、ステージＴ１では、フェッチ処理ＩＦが行われる。より具体的には、命令フェッチ部２２は、プログラムメモリ２１のうち、制御ユニット１０から供給された命令アドレス情報を参照して命令Ｃを読み出す。命令フェッチ部２２は、読み出した命令Ｃを図６に示すレジスタ２３ｇに書き込む。 As shown in FIG. 7, a fetch process IF is performed at stage T1. More specifically, the instruction fetch unit 22 reads the instruction C with reference to the instruction address information supplied from the control unit 10 in the program memory 21. The instruction fetch unit 22 writes the read instruction C into the register 23g shown in FIG.

図７に示すように、ステージＴ２では、デコード処理ＩＤが行われる。より具体的には以下のとおりである。デコード部２４は、レジスタ２３ｇに書き込まれた命令Ｃをデコードする。ここでは、命令Ｃを構成するプログラムコード内には、上述のアドレス情報は直接記述されておらず、アドレス情報を算出するための算出情報が記述されている。当該算出情報は、アドレス情報の算出の際の基準値Ｉを示す基準情報と、データレジスタ３０のうち読み出すべき補正値Ｒ２が格納された領域を示すレジスタ情報とからなる。そして、デコード部２４は、デコードした命令Ｃから特定した基準値Ｉを図６に示すレジスタ２３ｈへ書き込むとともに、デコードした命令Ｃの内容から特定したレジスタ情報をデータレジスタ３０へ供給する。 As shown in FIG. 7, a decoding process ID is performed at stage T2. More specifically, it is as follows. The decoding unit 24 decodes the instruction C written in the register 23g. Here, in the program code constituting the instruction C, the above address information is not described directly, but calculation information for calculating the address information is described. The calculation information includes reference information indicating the reference value I when calculating the address information, and register information indicating an area in the data register 30 in which the correction value R2 to be read is stored. Then, the decoding unit 24 writes the reference value I specified from the decoded instruction C to the register 23 h shown in FIG. 6 and supplies the register information specified from the content of the decoded instruction C to the data register 30.

図７に示すように、ステージＴ３では、アドレス算出処理ＥＸ１が行われる。より具体的には以下のとおりである。図６に示すデータレジスタ３０は、デコード部２４からのレジスタ情報が示す領域に格納された補正値Ｒ２を読み出し、その読み出した補正値Ｒ２を図６に示すレジスタ２３ｉへ供給する。そして、図６に示すアドレス情報算出部５０は、自身が有する固有値Ｍと、レジスタ２３ｉに書き込まれた補正値Ｒ２とを乗算してオフセット値Ｘを算出する。アドレス情報算出部５０は、算出したオフセット値Ｘと、レジスタ２３ｈから供給された基準値Ｉとを図６に示すレジスタ２３ｊに書き込む。以上が命令Ｃのアドレス算出処理ＥＸ１の内容である。 As shown in FIG. 7, in the stage T3, an address calculation process EX1 is performed. More specifically, it is as follows. The data register 30 shown in FIG. 6 reads the correction value R2 stored in the area indicated by the register information from the decoding unit 24, and supplies the read correction value R2 to the register 23i shown in FIG. Then, the address information calculation unit 50 shown in FIG. 6 calculates the offset value X by multiplying the eigenvalue M possessed by itself and the correction value R2 written in the register 23i. The address information calculation unit 50 writes the calculated offset value X and the reference value I supplied from the register 23h in the register 23j shown in FIG. The above is the content of the instruction C address calculation processing EX1.

図７に示すように、ステージＴ４では、アドレス算出処理ＥＸ２が行われる。より具体的には、図６に示すアドレス情報算出部５１は、レジスタ２３ｊから供給されたオフセット値Ｘと基準値Ｉとを加算して、命令Ｃによって指定されたアドレス情報を算出する。アドレス情報算出部５１は、その算出したアドレス情報をレジスタ２３ｋに書き込む。以上が命令Ｃのアドレス算出処理ＥＸ２の内容である。 As shown in FIG. 7, in the stage T4, an address calculation process EX2 is performed. More specifically, the address information calculation unit 51 shown in FIG. 6 adds the offset value X supplied from the register 23j and the reference value I to calculate the address information specified by the instruction C. The address information calculation unit 51 writes the calculated address information in the register 23k. The above is the content of the instruction C address calculation processing EX2.

図７に示すように、ステージＴ５では、メモリアクセス処理ＭＥＭが行われる。より具体的には以下のとおりである。ここで、図６に示す選択部２６は、レジスタ２３ｃに書き込まれたアドレス情報（命令Ａによって指定されたアドレス情報）が供給される第１入力端子と、レジスタ２３ｅに書き込まれたアドレス情報（命令Ｂによって指定されたアドレス情報）が供給される第２入力端子と、レジスタ２３ｋに書き込まれたアドレス情報（命令Ｃによって指定されたアドレス情報）が供給される第３入力端子とを有する。選択部２６は、デコード部２４から供給される選択制御信号（不図示）に応じて、何れかの入力端子に供給されるアドレス情報を後段の読み出し部２７へ供給する。上記ステージＴ５では、選択部２６には、命令Ｃによって指定されたアドレス情報の選択を指示する選択制御信号がデコード部２４から供給される。これにより、読み出し部２７には、命令Ｃによって指定されたアドレス情報が供給される。そして、読み出し部２７は、データメモリ４０のうち、選択部２６から供給されたアドレス情報が示す領域に格納されたデータを読み出す。以上が命令Ｃのメモリアクセス処理ＭＥＭの内容である。 As shown in FIG. 7, at stage T5, a memory access process MEM is performed. More specifically, it is as follows. Here, the selection unit 26 shown in FIG. 6 has a first input terminal to which the address information (address information specified by the instruction A) written in the register 23c is supplied, and the address information (command) written in the register 23e. The second input terminal to which the address information specified by B) is supplied, and the third input terminal to which the address information (address information specified by the instruction C) written in the register 23k is supplied. In response to a selection control signal (not shown) supplied from the decoding unit 24, the selection unit 26 supplies address information supplied to one of the input terminals to the subsequent reading unit 27. In stage T5, the selection unit 26 is supplied with a selection control signal for instructing selection of the address information designated by the instruction C from the decoding unit 24. As a result, the address information specified by the instruction C is supplied to the reading unit 27. Then, the reading unit 27 reads the data stored in the area indicated by the address information supplied from the selection unit 26 in the data memory 40. The above is the contents of the memory access processing MEM of the instruction C.

図７に示すように、ステージＴ６では、ライトバック処理ＷＢが行われる。より具体的には、書き込み部２８は、読み出し部２７によって読み出されたデータ（命令Ｃによって読み出しが指示されたデータ）を読み出し部２７から受け取り、その受け取ったデータをデータレジスタ３０の所定の領域に書き込む。以上が命令Ｃのライトバック処理ＷＢの内容である。 As shown in FIG. 7, at the stage T6, the write-back process WB is performed. More specifically, the writing unit 28 receives data read by the reading unit 27 (data instructed to be read by the instruction C) from the reading unit 27, and receives the received data in a predetermined area of the data register 30. Write to. The above is the content of the write-back process WB of the instruction C.

以上に説明したように、命令Ｃの処理においては、第５段目のステージＴ５でメモリアクセス処理ＭＥＭが行われるので、命令Ａの処理に比べて、メモリアクセス処理ＭＥＭが行われるまでのステージ数が２だけ多く、命令Ｂの処理に比べて、メモリアクセス処理ＭＥＭが行われるまでのステージ数が１だけ多い。そして、命令Ａと命令Ｂと命令Ｃとが同時に実行されることにより、データメモリ４０を連続して動作させることができるとともに、命令Ａ、命令Ｂおよび命令Ｃのそれぞれのメモリアクセス処理ＭＥＭが同じタイミングで行われることを防止できる。要するに、第Ｎ段目のステージでメモリアクセス処理ＭＥＭが行われる命令と、第Ｎ＋１段目のステージでメモリアクセス処理ＭＥＭが行われる命令と、第Ｎ＋２段目のステージでメモリアクセス処理ＭＥＭが行われる命令とが同時に実行されることにより、データメモリ４０を連続して動作させることができるとともに、各命令のメモリアクセス処理ＭＥＭが同じタイミングで行われることを防止できる。 As described above, in the processing of the instruction C, the memory access processing MEM is performed in the fifth stage T5. Therefore, the number of stages until the memory access processing MEM is performed as compared with the processing of the instruction A. And the number of stages until the memory access process MEM is performed is one more than the process of the instruction B. The instruction A, the instruction B, and the instruction C are simultaneously executed, so that the data memory 40 can be operated continuously, and the memory access processes MEM of the instruction A, the instruction B, and the instruction C are the same. This can be done at the timing. In short, an instruction for performing the memory access process MEM in the Nth stage, an instruction for performing the memory access process MEM in the (N + 1) th stage, and a memory access process MEM in the (N + 2) th stage. By executing the instructions at the same time, the data memory 40 can be operated continuously, and the memory access processing MEM of each instruction can be prevented from being performed at the same timing.

また、上述の実施形態の変形例として、演算装置１００は、メモリアクセス処理によって読み出されたデータを用いて所定の演算を実行する実行ユニットをさらに備えることもできる。また、図８に示すように、演算装置１００は、命令Ａおよび命令Ｂの処理を同時に開始してから、２ステージ分だけ待機した後に、別の命令Ａおよび命令Ｂの処理を同時に開始することもできる。さらに、各ステージ（Ｔ１〜Ｔ５）の時間長は任意である。例えばアドレス算出処理が行われるステージの時間長が、フェッチ処理やデコード処理が行われるステージの時間長より長くなり、より多くのクロックサイクル数が必要となってもよい。 As a modification of the above-described embodiment, the arithmetic device 100 may further include an execution unit that executes a predetermined operation using data read by the memory access process. Further, as shown in FIG. 8, the arithmetic unit 100 starts processing of the instruction A and the instruction B simultaneously, waits for two stages, and then starts processing of another instruction A and the instruction B simultaneously. You can also. Furthermore, the time length of each stage (T1-T5) is arbitrary. For example, the time length of the stage where the address calculation processing is performed may be longer than the time length of the stage where the fetch processing or decoding processing is performed, and a larger number of clock cycles may be required.

１０制御ユニット
２０ロードストアユニット
２１プログラムメモリ
２２命令フェッチ部
２４デコード部
２７読み出し部
４０データメモリ
１００演算装置

DESCRIPTION OF SYMBOLS 10 Control unit 20 Load store unit 21 Program memory 22 Instruction fetch part 24 Decoding part 27 Reading part 40 Data memory 100 Arithmetic unit

Claims

An instruction storage unit for storing a first instruction and a second instruction for executing arithmetic processing using arithmetic resources at mutually different stages of pipeline processing;
A fetch unit for simultaneously fetching the first instruction and the second instruction;
A decoding unit for simultaneously decoding the fetched first instruction and the second instruction;
An arithmetic device comprising:

An arithmetic processing unit that executes the arithmetic processing according to the decoded first instruction, and executes the arithmetic processing according to the decoded second instruction after the arithmetic processing according to the decoded first instruction. In addition,
The arithmetic unit according to claim 1.

The first instruction executes the arithmetic processing at an Nth stage of pipeline processing, while the second instruction executes the arithmetic processing at an N + 1th stage of pipeline processing.
The arithmetic unit according to claim 1 or claim 2, wherein

The computing resource is a data memory for storing data.
The arithmetic unit according to any one of claims 1 to 3, characterized in that:

The first instruction describes address information indicating an area of the data memory in which the data to be read is stored, while the second instruction includes calculation information for calculating the address information. Described,
The address information is calculated using the calculation information between the time when the decoding unit decodes the first instruction and the second instruction and before the start of the arithmetic processing according to the second instruction. An address information calculation unit;
The arithmetic processing executed by the arithmetic processing unit is a process of reading the data stored in an area indicated by the address information in the data memory.
The arithmetic unit according to claim 4, wherein: