JPH1165923A

JPH1165923A - Arithmetic processing unit and memory access method

Info

Publication number: JPH1165923A
Application number: JP21962397A
Authority: JP
Inventors: Yoshihiko Imamura; 義彦今村
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1997-08-14
Filing date: 1997-08-14
Publication date: 1999-03-09

Abstract

PROBLEM TO BE SOLVED: To improve processing capability by comparing the leading address of each stored page with an address held in a data pointer (DP) register, determining which of a plurality of banks holds the page to be accessed, and transferring data to a data register. SOLUTION: Banks 45 and 46 of an internal memory 47 each store a page of a prescribed data amount stored at continuous addresses in a memory address space, and DP registers 30 and 31 store addresses in the memory address space. Data registers r0 and r1 transfer data between the internal memory 47 and an ALU 13 that performs arithmetic processing. The leading addresses of the pages stored in the banks 45 and 46 are compared with the addresses stored in the DP registers 30 and 31 to determine whether the pages to be accessed are present in the banks 45 and 46. When the pages to be accessed are present, their data is transferred from the banks 45 and 46 to the data registers by using the addresses stored in the DP registers 30 and 31.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

Technical Field of the Invention: The present invention relates to an arithmetic processing unit and a memory access method.

[0002]

Description of the Related Art: Microprocessors can be classified by general design philosophy into, for example, RISC (Reduced Instruction Set Computer) and CISC (Complex Instruction Set Computer) types. The program execution time, which determines the performance of a microprocessor on its own, is expressed by the following equation (1).

[0003]

[Equation 1] Program execution time = number of executed instructions (IC) × average number of clock cycles required per instruction (CPI) × clock cycle time (CCT) ... (1)
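As a rough numeric illustration of equation (1), the following Python sketch multiplies the three factors for two made-up workloads. The instruction counts, CPI values, and the 10 ns clock period are hypothetical figures chosen only to show how a lower IC can be offset by a higher CPI; they are not taken from this document.

```python
def program_execution_time(ic, cpi, cct_ns):
    """Equation (1): execution time = IC x CPI x CCT, returned in nanoseconds."""
    return ic * cpi * cct_ns

# Hypothetical RISC-like case: more instructions, CPI close to 1.
risc_like_ns = program_execution_time(ic=1_000_000, cpi=1.0, cct_ns=10.0)

# Hypothetical CISC-like case: fewer instructions, more cycles per instruction.
cisc_like_ns = program_execution_time(ic=600_000, cpi=2.0, cct_ns=10.0)

print(risc_like_ns, cisc_like_ns)
```

With these assumed numbers the first case finishes sooner even though it executes more instructions, which is the trade-off the two design philosophies weigh differently.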

[0004]

The design philosophy of a RISC microprocessor is to bring the CPI in equation (1) as close to 1 as possible by making full use of instruction pipelining. To that end, a RISC microprocessor simplifies the function of each instruction so that it suits pipelined execution: instructions have a single fixed length, and the instruction format is register-register (a load/store architecture, in which the source operands of an operation instruction are register operands only). In addition, a RISC microprocessor relies on static code scheduling by the compiler so that the instruction pipeline does not stall.

[0005]

In contrast, the design philosophy of a CISC microprocessor is to raise the functional level of each instruction and thereby reduce the IC in equation (1). For this reason, a CISC microprocessor uses instructions of several fixed lengths or of variable length, and its instruction formats mix register-memory and memory-memory formats (a memory operand may also be a source operand of an operation instruction). In other words, an operation can be performed directly between a register and memory.

[0006]

When data in memory is the target of an ALU (Arithmetic Logic Unit) operation, a RISC microprocessor needs at least two instructions, a load instruction and a store instruction, to perform the memory accesses. A CISC microprocessor, in contrast, needs no instructions that serve only for memory access. A CISC microprocessor does, however, need many bit fields within an instruction to specify memory addresses, and as noted above it often adopts variable-length instructions. Variable-length instructions tend to make the decoding circuit complicated and large. For this reason, CISC microprocessors currently use superscalar and out-of-order techniques to accelerate operations on data in memory and thereby shorten the program execution time.

[0007]

The memory access methods of conventional RISC and CISC microprocessors are described below. FIG. 26 is a diagram for explaining the general-purpose registers of conventional RISC and CISC microprocessors. As shown in FIG. 26, assume that a conventional microprocessor has, for example, a set of 16 general-purpose registers named r0 to r15. When these registers are implemented in a processor architecture with a three-operand instruction set, a total of three ports is required: two read ports and one write port.

[0008]

In a three-operand operation instruction, three register specifiers can be given in an ALU operation instruction, as shown in FIG. 27. In FIG. 27, the text to the right of ";" is a comment on the instruction written to the left of ";". The instruction shown in FIG. 27 executes "r2 ← r3 + r4". Registers r0 to r15 are general-purpose and are used to hold values temporarily. A processor with a load/store architecture executes load/store instructions on the general-purpose registers in order to load from and store to memory; there is no instruction that feeds memory data directly into the ALU. This type is common among RISC processors. As shown in FIG. 28, to process data in memory, the load instruction "lw r3, 0(r10)" must first be executed. Some CISC processors, in contrast, allow data in memory to be specified as an operand of an ALU operation instruction. In that case, however, the memory buffer is used directly without using the general-purpose registers.

[0009]

The pipeline processing of a conventional RISC processor is described below. Many RISC processors have a five-stage or eight-stage pipeline structure. For example, the R3000 (trade name) of MIPS employs, as shown in FIG. 29, a five-stage pipeline consisting of an IF (Instruction Fetch) stage, a DEC (Instruction Decode) stage, an ALU stage, a MEM (Memory) stage, and a WB (Write Back) stage. In this processor, an instruction is fetched (read) in the first stage, IF, and decoded in the second stage, DEC. If the instruction specifies a general-purpose register as a source register, data is read from the general-purpose register after the instruction is decoded in the DEC stage. Next, in the third stage, ALU, an ALU operation instruction is executed. If the fetched instruction is not an ALU instruction, nothing is done in the ALU stage and the data is passed unchanged to the output port of the ALU.

[0010]

Next, in the fourth stage, MEM, when the fetched instruction is a memory access instruction, the memory address for the access is output to the memory unit and the memory access is executed. Next, in the fifth stage, for an instruction that specifies a general-purpose register as the destination register, the result of the ALU operation is written back to the general-purpose register. For a memory read instruction (load instruction), the value is received from the memory unit and written to the general-purpose register. As shown in FIG. 29, a processor with a five-stage pipeline overlaps, for example in clock cycle X, the WB stage of code C1, the MEM stage of code C2, the ALU stage of code C3, the DEC stage of code C4, and the IF stage of code C5.
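The overlap in clock cycle X can be tabulated with a short sketch. It assumes, purely for illustration, that codes C1 to C5 enter the IF stage in consecutive cycles 0 to 4 with no stalls and that cycle X is cycle 4; the function name stage_in_cycle is hypothetical.

```python
STAGES = ["IF", "DEC", "ALU", "MEM", "WB"]

def stage_in_cycle(issue_cycle, cycle):
    """Return the stage an instruction occupies in `cycle`, or None if it is not in the pipeline."""
    index = cycle - issue_cycle
    if 0 <= index < len(STAGES):
        return STAGES[index]
    return None

# Codes C1..C5 enter the IF stage in cycles 0..4; cycle X is taken to be cycle 4.
X = 4
occupancy = {f"C{n}": stage_in_cycle(n - 1, X) for n in range(1, 6)}
print(occupancy)
```

Under these assumptions each of the five stages is busy with a different code in cycle X, matching the assignment listed in the paragraph.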

[0011]

As described above, because a RISC processor adopts a load/store instruction set architecture, ALU operation instructions and load/store instructions exist separately and independently of each other. To overlap arbitrary instructions, including these, the five-stage pipeline structure shown in FIG. 29, for example, is convenient: a memory access instruction and other instructions can be executed at the same time. Because only one memory access path (one set) is assumed to exist, a memory read and a memory write are never executed simultaneously in the same MEM stage. Moreover, when a memory access instruction and other instructions are each considered individually, some pipeline stages go unused. For example, a register-to-register transfer instruction does not use the function of the MEM stage, and a memory access instruction does not use the function of the ALU stage. Note that the address generation for a memory access is performed in a unit other than the ALU.

[0012]

In the five-stage pipeline processing shown in FIG. 29, when data in memory is the target of an ALU operation, the program is written, for example, as shown in FIG. 30. In the program shown in FIG. 30, the instruction "lw r2, 0(r10)" first loads the data at the memory address indicated by register r10 into register r2. Next, the instruction "addu r3, r2, r9" adds the values of registers r2 and r9 and assigns the result to register r3. Next, the instruction "sw r3, 0(r11)" stores (writes back) the value of register r3 to the memory address indicated by register r11. These operations are thus written as three instructions. Each instruction requires at least one clock cycle to execute, so executing three instructions requires three clock cycles. In practice, data read (loaded) from memory cannot be referenced by the instruction immediately following the load, so one more clock cycle is required.
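The cycle count in this paragraph can be reproduced with a small model. The sketch below is a deliberate simplification: it counts one issue cycle per instruction plus one extra cycle whenever an instruction reads the register loaded by the immediately preceding lw, and it ignores pipeline fill time. The Instr tuple and the issue_cycles function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple, Optional, Tuple

class Instr(NamedTuple):
    op: str                 # mnemonic, e.g. "lw", "addu", "sw"
    dest: Optional[str]     # register written, or None
    srcs: Tuple[str, ...]   # registers read

def issue_cycles(program):
    """Count issue cycles, adding one cycle for each load-use dependency."""
    cycles = 0
    prev = None
    for ins in program:
        if prev is not None and prev.op == "lw" and prev.dest in ins.srcs:
            cycles += 1  # loaded data is not yet available to the next instruction
        cycles += 1      # the instruction itself
        prev = ins
    return cycles

# The three-instruction sequence described for FIG. 30.
fig30 = [
    Instr("lw", "r2", ("r10",)),       # lw r2, 0(r10)
    Instr("addu", "r3", ("r2", "r9")), # addu r3, r2, r9
    Instr("sw", None, ("r3", "r11")),  # sw r3, 0(r11)
]
print(issue_cycles(fig30))
```

In this model the three instructions take four cycles: three for the instructions and one because addu needs r2 right after the load.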

[0013]

Problems to Be Solved by the Invention: In media processing such as image processing and audio processing, however, a given ALU operation must be performed repeatedly on data in a continuous memory address space. In this case, as shown in FIG. 31, the instructions "addi r10, 4" and "addi r11, 4" for updating the memory addresses must be added to the program shown in FIG. 30. As a result, executing the program shown in FIG. 31 requires at least five clock cycles, and the processing time becomes longer. In FIG. 31, the start address in memory of the source data for the addition is set using register r10, and the start address of the destination data is set using register r11.

[0014]

Furthermore, in the five-stage pipeline processing of the conventional microprocessor described above, memory accesses are executed in the MEM stage and only one memory access path is provided, so a memory read and a memory write cannot be executed simultaneously. A program must therefore be written with the memory read instruction and the memory write instruction as separate instructions, which has been a bottleneck in shortening the processing time. In a microprocessor, efficient use of the limited register resources is also very important in system design.

[0015]

The present invention has been made in view of the above problems of the prior art, and provides an arithmetic processing unit and a memory access method that can improve processing capability. The present invention also provides an arithmetic processing unit and a memory access method that can use register resources efficiently and, in particular, allow the general-purpose registers to perform their function properly.

[0016]

Means for Solving the Problems: To achieve the above object, an arithmetic processing unit according to the present invention comprises: an internal memory having a plurality of banks, each storing a page of a predetermined data amount stored at consecutive addresses in a memory address space; a data pointer register that stores an address in the memory address space; arithmetic means for performing arithmetic processing; a data register that transfers data between the internal memory and the arithmetic means; page presence determining means that stores the head addresses, in the memory address space, of the pages stored in the plurality of banks, compares the stored head addresses with the address stored in the data pointer register, and determines from the result of the comparison whether the page to be accessed is present in any of the plurality of banks; and control means that, when the determination shows that the page to be accessed is present in one of the plurality of banks, performs control so that data is transferred between the bank storing the page to be accessed and the data register by using the address stored in the data pointer register.

[0017]

In the arithmetic processing unit of the present invention, the page presence determining means determines which of the plurality of banks of the internal memory contains the page to be accessed by the arithmetic means, and, based on the result, a single data register is connected to the bank selected from the plurality of banks as the one storing the page to be accessed.

[0018]

The memory access method of the present invention is a method of accessing, from arithmetic means via a data register, an internal memory having a plurality of banks, each storing a page of a predetermined data amount stored at consecutive addresses in a memory address space. The method comprises: storing an address in the memory address space in a data pointer register; storing the head addresses, in the memory address space, of the pages stored in the plurality of banks; comparing the stored head addresses with the address stored in the data pointer register; determining, from the result of the comparison, whether the page to be accessed is present in any of the plurality of banks; and, when the determination shows that the page to be accessed is present in one of the plurality of banks, transferring data between the bank storing the page to be accessed and the data register by using the address stored in the data pointer register.
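The page presence determination and the subsequent transfer can be sketched in Python as follows. This is only one possible reading: the text specifies that stored head addresses are compared with the data pointer address but not how, so the sketch assumes page-aligned head addresses and a hypothetical page size of 256 bytes, and derives the page head from the pointer by dropping its low bits. The names Bank, find_bank, and read_via_dp are illustrative, and handling of a miss is left out.

```python
PAGE_SIZE = 256  # hypothetical; the text only says "a predetermined data amount"

class Bank:
    """One bank holding a single page plus that page's head address."""
    def __init__(self, head_address, data):
        assert head_address % PAGE_SIZE == 0 and len(data) == PAGE_SIZE
        self.head_address = head_address
        self.data = data

def find_bank(banks, dp_address):
    """Compare each stored head address with the page head of the DP address."""
    page_head = dp_address - (dp_address % PAGE_SIZE)
    for index, bank in enumerate(banks):
        if bank.head_address == page_head:
            return index
    return None  # the page to be accessed is in no bank

def read_via_dp(banks, dp_address):
    """On a hit, transfer the addressed byte from the bank to the data register."""
    index = find_bank(banks, dp_address)
    if index is None:
        return None
    bank = banks[index]
    return bank.data[dp_address - bank.head_address]

banks = [Bank(0x1000, bytes(range(256))), Bank(0x4000, bytes([7] * 256))]
print(find_bank(banks, 0x1010), read_via_dp(banks, 0x1010))
```

The two banks here hold pages from unrelated regions of the address space, and the lookup selects whichever bank holds the page that contains the pointer address.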

[0019]

Description of the Preferred Embodiments: A microprocessor according to an embodiment of the present invention is described below. First Embodiment: FIG. 1 is a configuration diagram of the microprocessor 1 of this embodiment. The microprocessor 1 has the general-purpose register module 11, multiplexers (MUX) 12 and 16, ALU (Arithmetic Logic Unit) 13, instruction page memory 35, and decoder 36 shown in FIG. 1, together with the DP registers 30 and 31 and the internal memory 47 shown in FIG. 2, all integrated on a single chip. In the microprocessor 1, the processing described below is performed based on a control signal S36 that reflects the decoding result of the decoder 36.

[0020]

The general-purpose register module 11 is a register set consisting of, for example, sixteen 32-bit registers r0 to r15 for general-purpose use. The number of registers in the general-purpose register module 11 depends heavily on the instruction set or chip architecture and is generally set in the range of 8 to 32. As shown in FIG. 1, the general-purpose register module 11 has seven ports: W, RA, RB, MA, MB, DA, and DB. Port W is a write port: data output from port AOUT of the ALU 13 is written to the general-purpose register module 11 via bus 23 and port W. Ports MA and MB are, as shown in FIG. 2, the ports to memory banks 45 and 46 of the internal memory 47, respectively, and transfer data to and from the ports of memory banks 45 and 46 via buses 26 and 27, respectively. Alternatively, ports MA and MB may be omitted and the connection to memory banks 45 and 46 realized by switching ports RA, RB, and W outside the general-purpose register module 11.

[0021]

Ports RA and RB are read ports: data read from the general-purpose register module 11 is output to the multiplexer 12 via ports RA and RB and buses 17 and 18. Ports DA and DB are ports for bidirectional communication between the multiplexer 16 and the data registers r0 and r1, respectively.

[0022]

In the general-purpose register module 11, ports W, RA, RB, MA, MB, DA, and DB can be used simultaneously. That is, read operations and write operations on the general-purpose register module 11 can be executed independently.

[0023]

In the microprocessor 1, the data registers r0 and r1 are used as buffers that serve as the interface for communicating with memory via buses 26 and 27, respectively. Registers r2 to r15 are used as ordinary general-purpose registers. In the microprocessor 1, when the ALU 13 uses data stored in the internal memory 47, for example, the data is first read into data register r0 via bus 26, and data register r0 is then accessed. The data may instead be read from the internal memory 47 into data register r1 via bus 27, and data register r1 then accessed. When the microprocessor 1 writes data to the internal memory 47, the data is first written to data register r0 or r1 and then transferred to the internal memory 47 via bus 26 or 27.

[0024]

The data registers r0 and r1 are connected in substantially the same way as the other registers r2 to r15 and, like them, can also be used as general-purpose registers. Specifically, when the multiplexer 16 selects the connection between buses 21 and 22 and ports DA and DB, for example, ports DA and DB perform the same function as ports RA and RB, respectively, and buses 21 and 22 perform the same function as buses 17 and 18. When the multiplexer 16 selects the connection between bus 23 and port DA or DB, port DA or DB performs the same function as port W. The data registers r0 and r1 are therefore written uniformly in a program as part of the general-purpose register group that includes registers r2 to r15, and are recognized in the same way by the decoder 36.

[0025]

The DP (Data Pointer) registers 30 and 31 store addresses in the memory address space of memory banks 45 and 46 of the internal memory 47, respectively. Unlike registers r0 to r15 of the general-purpose register module 11, the DP registers 30 and 31 are allocated from among a plurality of control registers. The DP registers 30 and 31 therefore cannot be specified directly with the register specifiers of an ALU operation instruction; they are specified by a data transfer instruction. Writing an address in the internal memory 47 to DP register 30 or 31 takes a step of one instruction. The access to the internal memory 47 through data registers r0 and r1, however, is not written in the program and is performed in the background, separately from the processing stated explicitly in the program. That is, accesses to the internal memory 47 are performed in parallel with the pipeline processing.

[0026]

In the microprocessor 1, to read data stored in memory bank 45 of the internal memory 47 shown in FIG. 2, for example, the address of that data within memory bank 45 is written to DP register 30. The data stored at the address in memory bank 45 written to DP register 30 is then read automatically by hardware from memory bank 45 into data register r0 via bus 26. The data is then transferred from data register r0 to port AIN or BIN of the ALU 13 via the multiplexer 16.

[0027]

Similarly, in the microprocessor 1, to read data stored in memory bank 46 of the internal memory 47 shown in FIG. 2, for example, the address of that data within memory bank 46 is written to DP register 31. The data stored at the address in memory bank 46 written to DP register 31 is then read automatically by hardware from memory bank 46 into data register r1 via bus 26. The data is then transferred from data register r1 to port AIN or BIN of the ALU 13 via the multiplexer 16.

[0028]

In this way, in the microprocessor 1, writing addresses in memory banks 45 and 46 to DP registers 30 and 31 causes the data stored at those addresses to be read automatically into data registers r0 and r1, respectively.

[0029]

To write data to given addresses in memory banks 45 and 46 of the internal memory 47, on the other hand, the addresses are first written to DP registers 30 and 31, respectively, and the data is then written to data registers r0 and r1, respectively. The data stored in data registers r0 and r1 is then written automatically by hardware to the addresses in memory banks 45 and 46 held in DP registers 30 and 31. In the microprocessor 1, the data transfer between data register r0 and memory bank 45 and the data transfer between data register r1 and memory bank 46 can be performed in the same pipeline cycle.
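The read and write behaviour described for the DP registers and data registers can be modelled roughly as below. The class BankChannel and its method names are hypothetical, each bank is reduced to a small list of words, and timing is not modelled: the "automatic" hardware transfers are simply performed inside the method that triggers them.

```python
class BankChannel:
    """One memory bank with its DP register and data register
    (for example bank 45, DP register 30, and data register r0)."""
    def __init__(self, words):
        self.bank = list(words)
        self.dp = 0
        self.data_register = 0

    def write_dp(self, address):
        """Writing the DP register triggers an automatic read into the data register."""
        self.dp = address
        self.data_register = self.bank[address]

    def write_data_register(self, value):
        """Writing the data register triggers an automatic store to the DP address."""
        self.data_register = value
        self.bank[self.dp] = value

source = BankChannel([10, 20, 30, 40])  # stands in for bank 45 / DP register 30 / r0
dest = BankChannel([0, 0, 0, 0])        # stands in for bank 46 / DP register 31 / r1

source.write_dp(2)                       # r0 now holds the word at address 2
dest.write_dp(1)                         # select the destination address
dest.write_data_register(source.data_register + 5)
print(dest.bank)
```

Note that write_dp on the destination channel also performs a read that the program never uses, which is the kind of redundant transfer a write-only mode for a data register would avoid.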

[0030]

When one of the data registers r0 and r1 is used for a memory write to the internal memory 47, a few cycles may be lost, because an automatic read of data into data register r0 or r1 is performed unnecessarily. To prevent this, at least one of the data registers r0 and r1 can be set to a write-only mode according to its intended use, which avoids the useless memory read (load) operations. As a characteristic of memory reads and writes in fields such as image processing, consecutive memory addresses are likely to be referenced. That is, a memory read instruction is often followed by another memory read instruction, and the two are likely to access consecutive memory addresses. For this reason, the DP registers 30 and 31 have a function that automatically increments the stored address by, for example, +1, -1, +4, -4, +8, or -8 each time the data stored in the internal memory 47 is accessed, which improves processing efficiency in image processing.
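The automatic address update can be illustrated with a minimal sketch. The class AutoIncrementDP is a hypothetical model, memory is represented as a plain list indexed by address, and the step is applied after every access; the text does not say whether the hardware updates the address before or after the access, so the ordering here is an assumption.

```python
ALLOWED_STEPS = (+1, -1, +4, -4, +8, -8)  # the example step sizes given in the text

class AutoIncrementDP:
    """Data pointer that advances by a fixed step on every access."""
    def __init__(self, address, step):
        if step not in ALLOWED_STEPS:
            raise ValueError("unsupported step")
        self.address = address
        self.step = step

    def access(self, memory):
        """Return the value at the current address, then advance the pointer."""
        value = memory[self.address]
        self.address += self.step
        return value

memory = list(range(100, 132))       # 32 addressable cells holding 100..131
dp = AutoIncrementDP(address=0, step=4)
values = [dp.access(memory) for _ in range(3)]
print(values, dp.address)
```

Because the pointer advances on its own, a loop over consecutive data needs no separate address update instructions of the kind added in the conventional program.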

[0031]

Memory banks 45 and 46 of the internal memory 47 shown in FIG. 2 each store a copy of data at consecutive addresses in the memory address space of the external main memory 50. The addresses in the memory address space of the external main memory 50 need not be contiguous between memory banks 45 and 46.

[0032] The ALU 13 has input ports AIN and BIN and a port AOUT. Based on a control signal S36 from the decoder 36, it performs an arithmetic or logical operation on the data input from the input ports AIN and BIN and outputs the resulting data from the port AOUT to the bus 23. The ALU 13 executes an operation in one clock cycle. Depending on the type of operation instruction, the ALU 13 may, based on the control signal S36 from the decoder 36, output the data input from the input port AIN or BIN unchanged from the output port AOUT without performing an operation.
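The behavior of the ALU 13, including the pass-through case, can be sketched as a small Python function. This is an illustrative model only: the operation names, the function name `alu`, and the 32-bit word width are assumptions that do not appear in the text.

```python
def alu(op, ain, bin_, width=32):
    """Hypothetical single-cycle ALU: returns the value driven on port AOUT.

    `op` stands in for the control signal S36; `width` is an assumed word size.
    """
    mask = (1 << width) - 1
    if op == "add":
        result = ain + bin_
    elif op == "sub":
        result = ain - bin_
    elif op == "and":
        result = ain & bin_
    elif op == "or":
        result = ain | bin_
    elif op == "pass_a":      # no operation: AIN is output unchanged
        result = ain
    elif op == "pass_b":      # no operation: BIN is output unchanged
        result = bin_
    else:
        raise ValueError(f"unknown operation: {op}")
    return result & mask      # wrap to the assumed word width
```

The two pass-through operations correspond to instructions such as register transfers, where the ALU stage only forwards its input.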

[0033] Hereinafter, the pipeline processing operation in the microprocessor 1 will be described. With the configuration shown in FIG. 1, the microprocessor 1 gives the data path used during instruction execution a four-stage pipeline structure, as described below. FIG. 4 is a diagram for explaining the pipeline processing of the microprocessor 1. As shown in FIG. 4, the microprocessor 1 adopts a four-stage pipeline consisting of an IF stage, a DEC stage, an ALU stage, and a WB (Write Back) stage. That is, the four-stage pipeline shown in FIG. 4 does not have the MEM stage of the five-stage pipeline shown in FIG. 29.

[0034] In the four-stage pipeline shown in FIG. 4, the processing in the IF stage, the DEC stage, and the ALU stage is the same as in the five-stage pipeline shown in FIG. 29 described above. In the four-stage pipeline shown in FIG. 4, the IF stage, the DEC stage, the ALU stage, and the WB stage are executed in an overlapped (multiplexed) manner. Specifically, in the first stage, the IF stage, the microprocessor 1 fetches (reads) the instruction to be executed next from the address on the instruction memory 35 indicated by the program counter 34 shown in FIG. 1. Next, in the second stage, the DEC stage, the instruction fetched in the IF stage is decoded by the decoder 36 shown in FIG. 1. The multiplexers 12 and 16, the ALU 13, and other elements are then controlled based on the control signal S36 corresponding to the decoding result.

[0035] If an instruction specifies any of the registers r₀ to r₁₅ of the general-purpose register module 11 as a source register, then after the instruction is decoded in the DEC stage, data is read from the registers r₀ to r₁₅ to the ports AIN and BIN of the ALU 13 via the multiplexer 12. That is, in the microprocessor 1, a program can access the data registers r₀ and r₁ in the same way as the other general-purpose registers r₂ to r₁₅. When a memory access to the internal memory 47 is necessary, a predetermined address on the internal memory 47 is first stored in the DP registers 30 and 31 shown in FIG. 2, and the data registers r₀ and r₁ are then accessed, which realizes the access to the internal memory 47. That is, the microprocessor 1 realizes memory access to the internal memory 47 via the data registers r₀ and r₁.

[0036] Next, in the third stage, the ALU stage, an ALU operation instruction is executed. If the fetched instruction is not an ALU operation instruction, nothing is performed in the ALU stage, and the data input from the ports AIN and BIN is output unchanged from the port AOUT of the ALU 13. Next, in the fourth stage, the WB stage, for an instruction that designates any of the data registers r₀ and r₁ and the registers r₂ to r₁₅ as the destination register, the operation result of the ALU 13 is written back to the designated register. At this time, writing data to the data register r₀ or r₁ indirectly writes the data back to the internal memory 47 via the buses 26 and 27.

[0037] In the four-stage pipeline processing shown in FIG. 4, when the code C₁ is an instruction involving a memory write operation to the internal memory 47 shown in FIG. 2 and the code C₃ is an instruction involving a memory read operation from the internal memory 47, the WB stage 41 of the code C₁ and the DEC stage 42 of the code C₃ perform memory accesses at the same time in the clock cycle Y shown in FIG. 5. In this case, because the microprocessor 1 provides the data registers r₀ and r₁, which serve as buffers for access to the internal memory 47, and provides two access paths to the internal memory 47, the WB stage 41 of the code C₁ and the DEC stage 42 of the code C₃ can be overlapped as shown in FIG. 5, provided that the code C₁ and the code C₃ access different banks of the internal memory 47. The pipeline processing is therefore not disturbed (does not stall).
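The condition under which the WB stage of one code and the DEC stage of a later code can overlap may be checked with the following Python sketch. It assumes an ideal schedule in which one instruction is issued per cycle, memory reads occur in the DEC stage, and memory writes occur in the WB stage; the function name `find_bank_conflicts` and the tuple layout of the program are hypothetical.

```python
def find_bank_conflicts(program):
    """Find cycles in which two codes access the same memory bank.

    `program` is a list of (name, kind, bank) tuples in issue order, where
    kind is "read", "write", or "none". Instruction i is fetched in cycle i,
    so its DEC stage is cycle i + 1 and its WB stage is cycle i + 3.
    """
    accesses = {}  # cycle -> list of (name, bank)
    for i, (name, kind, bank) in enumerate(program):
        if kind == "read":
            cycle = i + 1          # memory read happens in the DEC stage
        elif kind == "write":
            cycle = i + 3          # memory write happens in the WB stage
        else:
            continue
        accesses.setdefault(cycle, []).append((name, bank))

    conflicts = []
    for cycle in sorted(accesses):
        entries = accesses[cycle]
        for a in range(len(entries)):
            for b in range(a + 1, len(entries)):
                if entries[a][1] == entries[b][1]:
                    conflicts.append((cycle, entries[a][0], entries[b][0]))
    return conflicts
```

With C₁ writing to bank 45 and C₃ reading from bank 46, both accesses fall in the same cycle but the function reports no conflict; if both target bank 45, the shared cycle is reported.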

[0038] Hereinafter, examples of access operations to the internal memory 47 according to the description of a program in the microprocessor 1 will be described.

When a two-operand operation instruction is used

The following describes the access operation to the internal memory 47 when a program is written using two-operand operation instructions, each of which, as shown in FIG. 6, has an operation code (OP) and two operands specifying a [source register] and a [destination register].

[0039] For example, as shown in FIG. 7, when the data register r₀ is specified as the source register in a two-operand operation instruction, data is read into the data register r₀ from the address on the internal memory 47 stored in the DP register 30, the data held in the data register r₀ and the data stored in the register r₄ are then added, and the resulting data is written to the register r₄.

[0040] Also, for example, as shown in FIG. 8, when the data register r₁ is specified as the destination register in a two-operand operation instruction, data is read into the data register r₁ from the address on the internal memory 47 stored in the DP register 31, the data held in the data register r₁ and the data stored in the register r₅ are then added, and the resulting data is written to the register r₁. The data held in the data register r₁ is then written to the address on the internal memory 47 stored in the DP register 31.

[0041] Also, for example, as shown in FIG. 9, when the data register r₀ is specified as the source register and the data register r₁ is specified as the destination register in a two-operand operation instruction, data is read into the data registers r₀ and r₁ from the addresses on the internal memory 47 stored in the DP registers 30 and 31, respectively, the data held in the data register r₀ and the data held in the data register r₁ are then added, and the resulting data is written to the register r₁. The data held in the data register r₁ is then written to the address on the internal memory 47 stored in the DP register 31.
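The two-operand behavior of FIGS. 7 to 9 can be modeled in a few lines of Python. This is a minimal sketch under assumptions: the class `MemoryWindowCPU` and its methods are hypothetical, the internal memory is a flat list, and the automatic address update of the DP registers is left out to keep the read-modify-write path visible.

```python
class MemoryWindowCPU:
    """Hypothetical model in which r0 and r1 are windows onto internal memory."""

    def __init__(self, memory, dp0, dp1):
        self.memory = memory           # stands in for the internal memory 47
        self.dp = {0: dp0, 1: dp1}     # addresses held by DP registers 30 and 31
        self.regs = [0] * 16           # r0 to r15

    def _read(self, r):
        if r in self.dp:               # data register: load through its DP address
            self.regs[r] = self.memory[self.dp[r]]
        return self.regs[r]

    def _write(self, r, value):
        self.regs[r] = value
        if r in self.dp:               # data register: store through its DP address
            self.memory[self.dp[r]] = value

    def add(self, src, dst):
        """Two-operand add: dst <- dst + src."""
        total = self._read(dst) + self._read(src)
        self._write(dst, total)
        return total
```

Calling `add(0, 4)` corresponds to the case of FIG. 7, `add(5, 1)` to FIG. 8, and `add(0, 1)` to FIG. 9; only the register numbers change, never the instruction itself.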

[0042] When the operation instruction shown in FIG. 8 is executed consecutively, a pipeline hazard (stall) of two clock cycles occurs in the pipeline processing of the microprocessor 1 shown in FIG. 4, as shown in FIG. 10. This is because a write operation and a read operation on the data register r₁ conflict in the clock cycle T+3 shown in FIG. 10. That is, when the code C₁, which is the operation instruction shown in FIG. 8, proceeds to the DEC stage in the clock cycle T+1, the decoder 36 shown in FIG. 1 recognizes that the code C₁ is a load/store instruction and internally locks the register r₁. This lock is released when the code C₁ proceeds to the WB stage in the clock cycle T+3. In other words, a read-modify-write to the internal memory 47 remains locked until the write-back is performed. When an instruction accesses the locked register, the access is made to wait until the lock is released. In the clock cycle T+3, the output of the ALU 13 is written to the data register r₁, release of the lock is decided, and the DP register 31 is updated. Then, in the clock cycle T+4, data is read from the internal memory 47 into the data register r₁ according to the code C₂, after which the same processing as for the code C₁ is repeated.
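The two-cycle stall can be reproduced with a small timing model in Python. The sketch assumes that the lock on the destination register is taken in the DEC stage and held through the WB stage, so that the next instruction using that register enters its DEC stage one cycle after the previous WB stage; the function name `schedule` and the dictionary layout are hypothetical. Cycle numbers are offsets from T.

```python
def schedule(n_instructions, lock_destination=True):
    """Return DEC, ALU, and WB cycles (offsets from T) for each instruction."""
    rows = []
    prev_dec = prev_wb = None
    for _ in range(n_instructions):
        if prev_dec is None:
            dec = 1                            # first code: IF at T, DEC at T+1
        else:
            dec = prev_dec + 1                 # ideal: one code enters DEC per cycle
            if lock_destination:
                dec = max(dec, prev_wb + 1)    # wait until the lock is released
        stall = 0 if prev_dec is None else dec - (prev_dec + 1)
        wb = dec + 2
        rows.append({"dec": dec, "alu": dec + 1, "wb": wb, "stall": stall})
        prev_dec, prev_wb = dec, wb
    return rows
```

For two consecutive read-modify-write instructions on the same data register, the second one reaches its DEC stage at T+4 instead of T+2, which is the two-cycle stall described above; with `lock_destination=False` the model returns the ideal schedule.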

[0043] When a three-operand operation instruction is used

The following describes the access operation to the internal memory 47 when a program is written using three-operand operation instructions, each of which, as shown in FIG. 11, has an operation code (OP) and three operands specifying two [source registers] and a [destination register]. In the microprocessor 1, for example, the program shown in FIG. 30 for a conventional microprocessor can be written as shown in FIG. 12. In FIG. 12, the instruction "mov r₀, r₂" indicates that the data on the internal memory 47 held in the data register r₀ is transferred to the register r₂. Prior to execution of the instruction "mov r₀, r₂", the address of the data to be read in the memory bank 45 has been stored in the DP register 30 shown in FIG. 2, and the data stored at that address has been transferred by hardware to the data register r₀ via the bus 26. The data on the internal memory 47 held in the data register r₀ is then transferred to the register r₂ via the bus 24, the multiplexers 16 and 12, and the bus 23.

[0044] "add r₂, r₉, r₃" indicates that the data stored in the register r₂ and the data stored in the register r₉ are added and the result is stored in the register r₃. Accordingly, the data stored in the register r₂ is output to the port AIN of the ALU 13, the data stored in the register r₉ is output to the port BIN, the ALU 13 adds them, and the addition result is written from the port AOUT via the bus 23 to the register r₃ of the general-purpose register module 11.

[0045] Further, "mov r₃, r₁" means that the data stored in the register r₃ is transferred to the data register r₁. Accordingly, the data stored in the register r₃ is transferred to the data register r₁ via, for example, the bus 17, the multiplexer 12, the bus 23, the multiplexer 16, and the bus 25. Thereafter, the data held in the data register r₁ is automatically written by hardware, via the bus 27, to the address on the internal memory 47 stored in the DP register 31.

[0046] As described above, the microprocessor 1 executes a program written with the inter-register transfer instruction "mov", as shown in FIG. 12, in place of the load instruction "lw" and the store instruction "sw" shown in FIG. 30.

[0047] The program shown in FIG. 12 can also be written, for example, as shown in FIG. 13. In the program shown in FIG. 13, the data register r₀ and the register r₉ are specified as the source registers of the instruction "add", and the data register r₁ is specified as the destination register. In this case, however, the DP registers 30 and 31 must point to addresses in different banks of the internal memory 47. The program shown in FIG. 13 is executed by the microprocessor 1 in one clock cycle.

[0048] In the microprocessor 1, when data stored at consecutive memory addresses on the internal memory 47 is to be processed, a program is written, for example, as shown in FIG. 14. When the program shown in FIG. 14 is executed, the addresses stored in the DP registers 30 and 31 shown in FIG. 2 are updated successively by adding, for example, "+4", so that addresses at "+4" intervals in the internal memory 47 are accessed one after another via the data registers r₀ and r₁. In FIG. 13, "add r₀, r₉, r₁" indicates that the data held in the data register r₀ and the data stored in the register r₉ are added and the resulting data is stored in the data register r₁. The processing according to the program shown in FIG. 14 processes eight data items on the internal memory 47. The program shown in FIG. 14 may also be written using a loop.
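The effect of repeating "add r₀, r₉, r₁" over eight data items with a "+4" address update can be sketched in Python as follows. The sketch is an assumption-laden illustration: the function name `process_block`, the use of two Python lists for the source and destination banks, and the starting addresses are hypothetical, and the actual program of FIG. 14 is not reproduced here.

```python
def process_block(src_bank, dst_bank, r9, count=8, stride=4,
                  src_addr=0, dst_addr=0):
    """Repeat `add r0, r9, r1` while the DP registers step by `stride`.

    Returns the final addresses held by the two modeled DP registers.
    """
    dp30, dp31 = src_addr, dst_addr
    for _ in range(count):
        r0 = src_bank[dp30]      # hardware load through DP register 30
        r1 = r0 + r9             # add r0, r9, r1
        dst_bank[dp31] = r1      # hardware store through DP register 31
        dp30 += stride           # automatic address update
        dp31 += stride
    return dp30, dp31
```

The loop body contains only the addition; the loads, stores, and address updates are the parts that the hardware performs implicitly in the described microprocessor.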

[0049] In the microprocessor 1, for example, the register r₁₄ of the general-purpose register module 11 shown in FIG. 3 can be treated as a data register, and a DP register corresponding to the register r₁₄ can additionally be provided, giving three memory access paths. In this case, by writing a program as shown in FIG. 15, the data registers r₀, r₁₄, and r₁ can be specified in all three operands of the addition instruction "add".

[0050] As described above, according to the microprocessor 1, some of the general-purpose registers constituting the general-purpose register module 11 are used as the data registers r₀ and r₁, so that a program need not separately contain memory read instructions and memory write instructions for accessing the internal memory 47. In other words, the instruction set can be described uniformly as operations performed between data stored in general-purpose registers, including cases where memory access is required. That is, according to the microprocessor 1, access to the internal memory 47 is treated as an extension of access to the general-purpose registers, and software can use the data registers r₀ and r₁ as memory windows.

[0051] In addition, in the microprocessor 1, when the addresses of the internal memory 47 to be accessed and the order of access are determined in advance, the addresses stored in the DP registers 30 and 31 are updated automatically using the address update function of the DP registers 30 and 31, so the memory access procedure need not be written out explicitly each time in the program.

[0052] Therefore, when a plurality of data items stored at consecutive addresses in the memory address space of the internal memory 47 are accessed and processed in order, the address of the data to be accessed need not be specified each time, and only ALU operation instructions need be written in the program. At this time, data is read from the internal memory 47 into the data registers r₀ and r₁ immediately after the DP registers 30 and 31 are automatically updated, which prevents the ALU 13 from having to wait for a memory access to complete each time an ALU operation instruction is executed. That is, at the time the ALU 13 executes an ALU operation according to the program, the data can already have been read from the internal memory 47 into the data registers r₀ and r₁. As a result, overlapping (multiplexing) of instructions is realized efficiently in the arithmetic processing of the microprocessor 1, and, for example, an ALU operation instruction can be executed every clock cycle in pipeline processing. The description of programs by the user is also simplified.

[0053] Further, according to the microprocessor 1, two DP registers 30 and 31 are provided and ALU operation instructions are executed directly on data in the internal memory 47, so that, for example, the processing of reading data from the internal memory 47, processing it, and writing it back to the internal memory 47 can be written as one instruction. Moreover, when the bank of the data read from the internal memory 47 differs from the bank of the data written to the internal memory 47, this instruction can be executed within one clock cycle. When both of the DP registers 30 and 31 can specify all addresses of the internal memory 47, the instruction can be executed within one clock cycle if the address of the data read from the internal memory 47 differs from the address of the data written to the internal memory 47. For example, arithmetic processing involving memory access can be written in a program as one instruction, as shown in FIG. 13, and executed within one clock cycle.

[0054] Further, in the microprocessor 1, when instructions that write data back to the internal memory 47 (data transfer instructions that specify the data register r₀ or r₁ as the destination register) are executed consecutively, the automatic read function of the DP registers 30 and 31 can be stopped to eliminate unnecessary memory reads. As a result, memory writes can be executed consecutively, and overlapping (multiplexing) of instructions can be realized.

[0055] Further, because the microprocessor 1 adopts a memory configuration using the data registers r₀ and r₁ and the DP registers 30 and 31, the MEM stage of the conventional five-stage pipeline structure shown in FIG. 23 becomes unnecessary, and a four-stage pipeline structure can be adopted as shown in FIG. 4. That is, since write operations to the internal memory 47 are performed using the data registers r₀ and r₁, a write operation to the internal memory 47 can be carried out by the register write operation performed in the WB stage. It is therefore unnecessary to provide an independent memory access stage in the pipeline processing. As a result, the control circuit of the entire processor is simplified in the microprocessor 1, and exception processing, including external interrupts, can be handled flexibly.

[0056] Second Embodiment

The microprocessor of this embodiment has basically the same configuration as the microprocessor 1 of the first embodiment described above, except for the number of banks of the internal memory, the number of data registers and DP registers, and the provision of a multiplexer 82 in place of the multiplexers 12 and 16 shown in FIG. 1. FIG. 16 is a configuration diagram of the periphery of the internal memory 87 of the microprocessor 81 of this embodiment. As shown in FIG. 16, the microprocessor 81 includes the internal memory 87, DP registers 91, 92, and 93, data registers r₀, r₁-R, and r₁-W, the multiplexer 82, and buses 101 to 106.

[0057] Like the microprocessor 1 shown in FIG. 1, the microprocessor 81 also includes the ALU 13, the program counter 34, the instruction page memory 35, the decoder 36, and the buses 17, 18, 19, 20, and 23. In place of the register r₁ shown in FIG. 1, the microprocessor 81 includes the register r₁-R and the register r₁-W. That is, the microprocessor 81 is provided with the registers r₂ to r₁₅ in addition to the data registers r₀, r₁-R, and r₁-W shown in FIG. 16. In program descriptions, the microprocessor 81 logically treats the physical data registers r₁-R and r₁-W as a single data register r₁.

[0058] In the microprocessor 81, data is read from the address in the memory bank 110 of the internal memory 87 stored in the DP register 91 and is stored in the data register r₀. The data register r₁-R is dedicated to data read operations from the memory bank 111 of the internal memory 87; when the address of the logical data register r₁ is specified as the destination register in a two-operand operation instruction written in a program, it stores the data read from the address in the memory bank 111 stored in the DP register 92. The data register r₁-W is dedicated to data write operations to the memory bank 112 of the internal memory 87; when the address of the logical data register r₁ is specified as the destination register in a two-operand operation instruction written in a program, the data stored in the data register r₁-W is written to the address in the memory bank 112 stored in the DP register 93. The address of the logical data register r₁ may be specified directly as a logical address or, for example, indirectly via a logical register name.

[0059] Hereinafter, the operation of the microprocessor 81 will be described. The microprocessor 81 executes, for example, two-operand operation instructions having the format shown in FIG. 6. For example, as shown in FIG. 17, when the microprocessor 81 executes an instruction in which the data register r₁ is specified as the destination, the address in the memory bank 111 at which the data to be added is stored is first written to the DP register 92, whereupon the hardware reads that data into the data register r₁-R via the bus 102. Next, the ALU 13 shown in FIG. 1 adds the data stored in the register r₅ and the data stored in the data register r₁-R, and the resulting data is stored in the data register r₁-W via the bus 106. In addition, the address in the memory bank 112 to which the addition result is to be written back is written to the DP register 93. As a result, the addition result stored in the data register r₁-W is written back to the memory bank 112 by hardware via the bus 103.

[0060] Hereinafter, the case where the microprocessor 81 processes data stored at consecutive memory addresses on the internal memory 87 will be described. When the microprocessor 81 executes, for example, the program shown in FIG. 18 by the four-stage pipeline processing shown in FIG. 4, the result is as shown in FIG. 19(A). For each of the codes C₁ to C₆ shown in FIG. 18, in the pipeline processing shown in FIG. 19(A), the instruction "add r₀, r₁" is fetched in the IF stage. In the DEC stage, one of the data items to be added, which has been read into the data register r₀ from the address on the memory bank 110 stored in the DP register 91, is output from the data register r₀ to the port AIN of the ALU 13. At the same time, the other data item to be added, which has been read into the data register r₁-R from the address on the memory bank 111 stored in the DP register 92, is output from the data register r₁-R to the port BIN of the ALU 13. Next, in the ALU stage, the ALU 13 adds the two data items, and in the WB stage the addition result of the ALU 13 is written from the port AOUT to the data register r₁-W. Thereafter, the data stored in the data register r₁-W is written back to the address on the memory bank 112 stored in the DP register 93.

[0061] At this time, as shown in FIG. 19(A), the IF stage of the code C₁ is performed in the clock cycle T. As shown in FIG. 19(B), the memory read operations (decode stage) for the data register r₀ by the codes C₁ to C₆ are performed in order in the clock cycles T+1 to T+6. As shown in FIG. 19(C), the memory read operations (decode stage) for the data register r₁-R by the codes C₁ to C₆ are performed in order in the clock cycles T+1 to T+6. Further, as shown in FIG. 19(D), the memory write operations (WB stage) for the data register r₁-W by the codes C₁ to C₆ are performed in order in the clock cycles T+3 to T+8. That is, because the microprocessor 81 provides the data registers r₁-R and r₁-W as described above, a memory read operation and a memory write operation on the data register r₁ can be performed simultaneously. As a result, each of the codes C₁ to C₆ shown in FIG. 18 can be executed in one clock cycle.
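The cycle ranges stated for FIG. 19 follow directly from the four-stage pipeline when no stall occurs, as the following Python sketch shows. It assumes that code Cₖ is fetched in cycle T+k-1, reads the data registers r₀ and r₁-R in its DEC stage, and writes the data register r₁-W in its WB stage; the function name `access_timeline` and the string labels are hypothetical. Cycle numbers are offsets from T.

```python
def access_timeline(n_codes=6):
    """Map each cycle offset from T to the data-register accesses made in it."""
    timeline = {}
    for i in range(n_codes):
        code = f"C{i + 1}"
        dec, wb = i + 1, i + 3          # IF at T+i, DEC at T+i+1, WB at T+i+3
        timeline.setdefault(dec, []).append((code, "read r0"))
        timeline.setdefault(dec, []).append((code, "read r1-R"))
        timeline.setdefault(wb, []).append((code, "write r1-W"))
    return timeline


def cycles_with(timeline, prefix):
    """Return the sorted cycles containing an access whose label starts with prefix."""
    return sorted(c for c, acc in timeline.items()
                  if any(label.startswith(prefix) for _, label in acc))
```

In cycles T+3 to T+6 a read of r₁-R by one code and a write of r₁-W by an earlier code fall in the same cycle, which is possible only because the logical register r₁ is split into two physical registers tied to different banks.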

[0062] As described above, according to the microprocessor 81, even when two-operand operation instructions that perform a read-modify-write with the logical data register r₁ specified as the destination are executed repeatedly in succession, each operation instruction can be executed in one clock cycle. Further, according to the microprocessor 81, three different data items on the internal memory 87 are accessed in the same clock cycle using a two-operand operation instruction, so that an operation substantially equivalent to a three-operand operation can be realized.

【００６３】第３実施形態上述した第１実施形態および第２実施形態のマイクロプ
ロセッサでは、複数のデータレジスタと内部メモリの複
数のバンクとを、それぞれ１対１に固定して対応させた
場合について説明した。従って、これらのマイクロプロ
セッサでは、内部メモリのバンク数に対応した数のデー
タレジスタおよびＤＰレジスタを設ける必要がある。従
って、内部メモリのバンク数が多い場合には多数のデー
タレジスタおよびＤＰレジスタを備える必要がある。本
実施形態のマイクロプロセッサは、前述したマイクロプ
ロセッサ１において、単数のデータレジスタを介して、
内部メモリの複数のバンクをアクセス可能な構成をして
いる。 Third Embodiment. The microprocessors of the first and second embodiments described above were explained for the case where a plurality of data registers and a plurality of banks of the internal memory are fixedly associated one-to-one. These microprocessors therefore need data registers and DP registers in a number corresponding to the number of banks of the internal memory, so a large number of data registers and DP registers are required when the internal memory has many banks. The microprocessor of the present embodiment is based on the microprocessor 1 described above and is configured so that a plurality of banks of the internal memory can be accessed via a single data register.

【００６４】図２０は本実施形態のマイクロプロセッサ
１２１のＡＬＵ１３の周辺の構成図、図２１はマイクロ
プロセッサ１２１の内部メモリ１４７の周辺の構成図で
ある。なお、図２０および図２１において、図１および
図２に示した構成要素と同じ構成要素には同じ符号を付
してある。マイクロプロセッサ１２１は、図２０に示す
汎用レジスタモジュール１２３、マルチプレクサ１２
５，１２７、ＡＬＵ１３、プログラムカウンタ３４、イ
ンストラクションページメモリ３５およびデコーダ３６
と、図２１に示す内部メモリ１４７、ＤＰレジスタ１４
９、メインメモリ１５０、バンク選択モジュール１５
１、ローカルバス１５３および制御回路１８０とを、例
えば１チップ内に組み込んだ構成をしている。マイクロ
プロセッサ１２１は、データレジスタｒ₀と内部メモリ
１４７のバンク１６１〜１６４との関係以外は、例えば
４段パイプライン処理を含めて、基本的に、前述した第
１実施形態のマイクロプロセッサ１と同じである。FIG. 20 is a configuration diagram of the periphery of the ALU 13 of the microprocessor 121 of this embodiment, and FIG. 21 is a configuration diagram of the periphery of the internal memory 147 of the microprocessor 121. In FIGS. 20 and 21, components identical to those shown in FIGS. 1 and 2 are denoted by the same reference numerals. The microprocessor 121 incorporates, for example in a single chip, the general-purpose register module 123, multiplexers 125 and 127, ALU 13, program counter 34, instruction page memory 35, and decoder 36 shown in FIG. 20, together with the internal memory 147, DP register 149, main memory 150, bank selection module 151, local bus 153, and control circuit 180 shown in FIG. 21. Except for the relationship between the data register r₀ and the banks 161 to 164 of the internal memory 147, the microprocessor 121 is basically the same as the microprocessor 1 of the first embodiment described above, including, for example, the four-stage pipeline processing.

【００６５】汎用レジスタモジュール１２３は、図１に
示す汎用レジスタモジュール１１からデータレジスタｒ
₁およびバス２５，２７を除いた構成をしている。マル
チプレクサ１２５は、図１に示すバス２５，２２に対し
ての選択機能を備えていない点を除いて、図１に示すマ
ルチプレクサ１６と同じである。また、マルチプレクサ
１２７は、図１に示すバス２２に対しての選択機能を備
えていない点を除いて、図１に示すマルチプレクサ１２
と同じである。The general-purpose register module 123 has the configuration of the general-purpose register module 11 shown in FIG. 1 with the data register r₁ and the buses 25 and 27 removed. The multiplexer 125 is the same as the multiplexer 16 shown in FIG. 1, except that it has no selection function for the buses 25 and 22 shown in FIG. 1. The multiplexer 127 is the same as the multiplexer 12 shown in FIG. 1, except that it has no selection function for the bus 22 shown in FIG. 1.

【００６６】ＤＰレジスタ１４９は、マイクロプロセッ
サ１２１がアクセスを行うメモリアドレス空間の全域の
アドレスを記憶可能な記憶容量を備え、メモリリード時
には、データレジスタｒ₀に読み出すデータのメモリア
ドレス空間内のアドレスを記憶し、メモリライト時に
は、データレジスタｒ₀に記憶されたデータの書き込み
先であるメモリアドレス空間内のアドレスを記憶する。The DP register 149 has a storage capacity sufficient to hold any address in the entire memory address space accessed by the microprocessor 121. During a memory read, it stores the address in the memory address space of the data to be read into the data register r₀. During a memory write, it stores the address in the memory address space to which the data stored in the data register r₀ is to be written.

【００６７】ＤＰレジスタ１４９は、図２２に示すよう
に、３２ビットのフィールドを有し、３２ビットのアド
レスを記憶する。その結果、メモリアドレス空間とし
て、４Ｇバイトまでのアドレスを持つものを用いること
ができる。ＤＰレジスタ１４９の３２ビットフィールド
は、ページセレクトフィールド２００、ページオフセッ
トフィールド２０１およびフィールド２０２からなる。
ページセレクトフィールド２００は、ビット「９」〜
「３１」によって構成され、メインメモリ１５０のメモ
リアドレス空間におけるページ（メモリブロック）の開
始アドレス（オフセット）を指定する。本実施形態で
は、１ページの容量を５１２バイトとしている。また、
ページオフセットフィールド２０１は、ビット「２」〜
「８」によって構成され、ページ内のオフセットを示し
ている。ここで、ページ内では、４バイト単位でメモリ
アクセスが行われる。また、フィールド２０２は、ビッ
ト「０」，「１」によって構成され、アドレスを指定す
るためには使用されない。As shown in FIG. 22, the DP register 149 has a 32-bit field and stores a 32-bit address. As a result, a memory address space with addresses up to 4 Gbytes can be used. The 32-bit field of the DP register 149 consists of a page select field 200, a page offset field 201, and a field 202. The page select field 200 consists of bits "9" to "31" and specifies the start address (offset) of a page (memory block) in the memory address space of the main memory 150. In this embodiment, the capacity of one page is 512 bytes. The page offset field 201 consists of bits "2" to "8" and indicates the offset within the page. Within a page, memory is accessed in units of 4 bytes. The field 202 consists of bits "0" and "1" and is not used to specify an address.
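The field layout of the DP register 149 described for FIG. 22 can be illustrated with a minimal sketch. The function name `split_dp_address` and the constant names are hypothetical. Only the bit positions (31 to 9, 8 to 2, 1 to 0), the 512-byte page size, and the 4-byte access unit come from the text.

```python
PAGE_SELECT_SHIFT = 9     # page select field 200 occupies bits 9..31 (23 bits)
PAGE_OFFSET_SHIFT = 2     # page offset field 201 occupies bits 2..8 (7 bits)
PAGE_OFFSET_MASK = 0x7F   # 7 bits -> 128 four-byte words = 512 bytes per page


def split_dp_address(addr):
    """Split a 32-bit DP register value into (page select, page offset, unused)."""
    if not 0 <= addr <= 0xFFFFFFFF:
        raise ValueError("the DP register holds a 32-bit address")
    page_select = addr >> PAGE_SELECT_SHIFT
    page_offset = (addr >> PAGE_OFFSET_SHIFT) & PAGE_OFFSET_MASK
    unused = addr & 0x3   # field 202 (bits 0..1), not used for addressing
    return page_select, page_offset, unused


print(split_dp_address(0x1234))   # (9, 13, 0)
```

Byte address 0x1234 lies in page 9 (which starts at byte 9 × 512 = 4608) at word 13 of that page (byte offset 52).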

【００６８】内部メモリ１４７は、図２１に示すよう
に、４個のバンク１６１，１６２，１６３，１６４に分
割されている。バンク１６１，１６２，１６３，１６４
は、ローカルバス１５２を介して、ＤＰレジスタ１４９
に接続されている。ここで、ローカルバス１５２は、Ｄ
Ｐレジスタ１４９のうち、後述する図２２に示すページ
オフセットフィールド２０１のみに接続され、７ビット
のバス幅を有する。また、バンク１６１，１６２，１６
３，１６４は、ローカルバス１５３を介して、データレ
ジスタｒ₀に接続されている。The internal memory 147 is divided into four banks 161, 162, 163, and 164, as shown in FIG. 21. The banks 161, 162, 163, and 164 are connected to the DP register 149 via a local bus 152. The local bus 152 is connected only to the page offset field 201 of the DP register 149 shown in FIG. 22, and has a bus width of 7 bits. The banks 161, 162, 163, and 164 are also connected to the data register r₀ via the local bus 153.

【００６９】バンク１６１〜１６４は、それぞれ１ペー
ジ分の容量を持ち、後述するように、選択回路１７９か
らの選択信号Ｓ１７９₁〜Ｓ１７９₄に応じてメインメ
モリ１５０から読み込んだページを記憶する。バンク１
６１〜１６４に読み込まれるメインメモリ１５０上のデ
ータ（ページ）は、互いに重なり合うことはない。これ
は、選択回路１７９が、ＤＰレジスタ１４９に記憶され
たアドレスに対応するページがバンク１６１〜１６４に
記憶されている場合には、そのページに対してメモリア
クセスを行い、その同じページをメインメモリ１５０か
ら読み出す作業を行わないためである。従って、バンク
１６１〜１６４のうち、２以上のバンクに同一のメモリ
アドレスのデータが記憶されることはない。The banks 161 to 164 each have a capacity of one page and, as described later, store pages read from the main memory 150 in accordance with the selection signals S179₁ to S179₄ from the selection circuit 179. The data (pages) of the main memory 150 read into the banks 161 to 164 never overlap one another. This is because, when the page corresponding to the address stored in the DP register 149 is already stored in one of the banks 161 to 164, the selection circuit 179 causes that page to be accessed, and the same page is not read again from the main memory 150. Consequently, data of the same memory address is never stored in two or more of the banks 161 to 164.

【００７０】バンク１６１〜１６４は、データレジスタ
ｒ₀に対する読み出し動作を行う際に、それぞれイネー
ブル指示を示す選択信号Ｓ１７９₁〜Ｓ１７９₄を入力
すると、ＤＰレジスタ１４９に記憶されたアドレスのペ
ージオフセットフィールド２０１によって示されるペー
ジ内のアドレスから４バイトのデータを読み出し、この
読み出したデータをローカルバス１５３を介してデータ
レジスタｒ₀に転送する。データレジスタｒ₀は、この
転送されたデータを記憶する。In a read operation to the data register r₀, each of the banks 161 to 164 that receives its selection signal S179₁ to S179₄ indicating enable reads 4 bytes of data from the in-page address indicated by the page offset field 201 of the address stored in the DP register 149, and transfers the read data to the data register r₀ via the local bus 153. The data register r₀ stores the transferred data.

【００７１】バンク１６１〜１６４は、データレジスタ
ｒ₀からの書き込み動作において、それぞれイネーブル
指示を示す選択信号Ｓ１７９₁〜Ｓ１７９₄を入力する
と、データレジスタｒ₀に記憶された４バイトのデータ
をローカルバス１５３を介して入力し、ＤＰレジスタ１
４９に記憶されたアドレスのページオフセットフィール
ド２０１によって示されるページ内のアドレスに記憶す
る。In a write operation from the data register r₀, each of the banks 161 to 164 that receives its selection signal S179₁ to S179₄ indicating enable receives the 4 bytes of data stored in the data register r₀ via the local bus 153 and stores them at the in-page address indicated by the page offset field 201 of the address stored in the DP register 149.

【００７２】内部メモリ１４７は、ローカルバス１５３
を介してメインメモリ１５０に接続されている。メイン
メモリ１５０は、マイクロプロセッサ１２１のチップ内
に組み込んでも、あるいは、チップ外に設けてもよい。The internal memory 147 is connected to the main memory 150 via the local bus 153. The main memory 150 may be built into the chip of the microprocessor 121 or provided outside the chip.

【００７３】バンク選択モジュール１５１は、図２１に
示すように、ＤＰＣ(Data PointerCache)レジスタ１７
０，１７１，１７２，１７３、比較器１７４，１７５，
１７６，１７７および選択回路１７９を有する。ＤＰＣ
レジスタ１７０，１７１，１７２，１７３は、それぞれ
バンク１６１，１６２，１６３，１６４に記憶されてい
るページのメインメモリ１５０上の先頭アドレスを記憶
する。この先頭アドレスは、図２２に示すＤＰレジスタ
１４９に記憶されるアドレスのページセレクトフィール
ド２００に対応している。ここで、比較器１７４〜１７
７においては、ＤＰレジスタ１４９に記憶されたアドレ
スのうちページセレクトフィールド２００のみを比較す
れば良いため、ＤＰＣレジスタ１７０，１７１，１７
２，１７３は、ページセレクトフィールド２００に記憶
される３２ビットのアドレスのうち、ページセレクトフ
ィールド２００に対応するアドレスを記憶する２３ビッ
トを備えていればよい。As shown in FIG. 21, the bank selection module 151 has DPC (Data Pointer Cache) registers 170, 171, 172, and 173, comparators 174, 175, 176, and 177, and a selection circuit 179. The DPC registers 170, 171, 172, and 173 store the start addresses in the main memory 150 of the pages stored in the banks 161, 162, 163, and 164, respectively. This start address corresponds to the page select field 200 of the address stored in the DP register 149 shown in FIG. 22. Because the comparators 174 to 177 need to compare only the page select field 200 of the address stored in the DP register 149, the DPC registers 170, 171, 172, and 173 need only 23 bits, enough to hold the part of the 32-bit address stored in the DP register 149 that corresponds to the page select field 200.

【００７４】ＤＰＣレジスタ１７０，１７１，１７２，
１７３に記憶されたアドレスは、それぞれバンク１６
１，１６２，１６３，１６４に記憶されたページと、メ
インメモリ１５０に記憶されたページとが入れ替えられ
たときに、ＤＰレジスタ１４９に記憶されたアドレスの
ページセレクトフィールド２００によって更新される。The addresses stored in the DPC registers 170, 171, 172, and 173 are updated with the page select field 200 of the address stored in the DP register 149 when the pages stored in the banks 161, 162, 163, and 164, respectively, are swapped with pages stored in the main memory 150.

【００７５】比較器１７４，１７５，１７６，１７７
は、それぞれＤＰＣレジスタ１７０，１７１，１７２，
１７３に記憶されたメインメモリ１５０上のページの開
始アドレスと、ＤＰレジスタ１４９のページセレクタフ
ィールドに記憶されたアドレスとを比較し、その比較結
果を、比較データＳ１７４，Ｓ１７５，Ｓ１７６，Ｓ１
７７として選択回路１７９のポートｉｎ０，ｉｎ１，ｉ
ｎ２，ｉｎ３にそれぞれ出力する。なお、比較器１７
４，１７５，１７６，１７７における比較処理は、ＤＰ
レジスタ１４９に記憶されたアドレスが更新される度に
行われる。The comparators 174, 175, 176, and 177 compare the start addresses of the pages in the main memory 150 stored in the DPC registers 170, 171, 172, and 173, respectively, with the address stored in the page select field 200 of the DP register 149, and output the comparison results as comparison data S174, S175, S176, and S177 to the ports in0, in1, in2, and in3 of the selection circuit 179, respectively. The comparison in the comparators 174, 175, 176, and 177 is performed every time the address stored in the DP register 149 is updated.

【００７６】選択回路１７９は、比較データＳ１７４，
Ｓ１７５，Ｓ１７６，Ｓ１７７のそれぞれに基づいて、
比較結果が一致を示している場合にはイネーブルを指示
し、比較結果が一致していない場合にはディスイネーブ
ルを指示する選択信号Ｓ１７９₁，Ｓ１７９₂，Ｓ１７
９₃，Ｓ１７９₄をバンク１６１，１６２，１６３，１
６４にそれぞれ継続して出力する。なお、選択回路１７
９は、比較データＳ１７４〜Ｓ１７７が変化したとき
に、それぞれ選択信号Ｓ１７９₁〜Ｓ１７９₄を、イネ
ーブル指示とディスイネーブル指示との間で切り換え
る。Based on each of the comparison data S174, S175, S176, and S177, the selection circuit 179 continuously outputs the selection signals S179₁, S179₂, S179₃, and S179₄ to the banks 161, 162, 163, and 164, respectively. Each selection signal indicates enable when the corresponding comparison result shows a match and disable when it does not. When the comparison data S174 to S177 change, the selection circuit 179 switches the corresponding selection signals S179₁ to S179₄ between enable and disable.
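The comparison and selection described above can be sketched in software as follows. This is only an illustrative model of the comparators 174 to 177 and the enable outputs of the selection circuit 179. The function name `select_signals` and the example tag values are assumptions, and real hardware would perform the four comparisons in parallel.

```python
def select_signals(dpc_tags, dp_address):
    """Return the enable flags (S179_1..S179_4) for one DP register value.

    `dpc_tags` models the DPC registers 170..173: the page select value
    of the page currently held in each of the banks 161..164.
    """
    tag = dp_address >> 9                                   # page select field 200
    comparison_data = [stored == tag for stored in dpc_tags]  # S174..S177
    # Each selection signal indicates enable exactly where its comparator matched.
    return comparison_data


dpc = [0x10, 0x11, 0x2A, 0x03]             # hypothetical contents of the DPC registers
address = (0x2A << 9) | (5 << 2)           # page 0x2A, word 5 within the page
print(select_signals(dpc, address))        # [False, False, True, False]
```

Because no two banks hold the same page, at most one flag is true for any address.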

【００７７】また、選択回路１７９は、比較データＳ１
７４，Ｓ１７５，Ｓ１７６，Ｓ１７７の全てが不一致で
あることを示す場合には、ページフォルトを示す指示信
号Ｓ１７９₅を制御回路１８０に出力する。制御回路１
８０は、ページフォルトを示す指示信号Ｓ１７９₅を入
力すると、ＤＰレジスタ１４９に記憶されているアドレ
スのページセレクタフィールドによって示されるメイン
メモリ１５０上の開始アドレスからページを読み出し、
このページをローカルバス１５３を介して内部メモリ１
４７に読み込む。この内部メモリ１４７に読み込まれた
新たなページは、内部メモリ１４７のバンク１６１〜１
６４に記憶された４枚のページのうち、最も先にアクセ
スされたページと入れ替えられる。すなわち、ＬＲＵ(L
east Recently Used) 方式を採用する。なお、ページ入
れ替えのアルゴリズムは、ＬＲＵに限定されず、種々の
方法を採ることができる。ここで、ページフォルトの検
出は、キャッシュシステムを用いたマイクロプロセッサ
におけるキャッシュヒットの判断と同様に、回路上のク
リティカルパスになることが多い。ここで、クリティカ
ルパスは、ＬＳＩの最高動作周波数を決定する要因であ
る。When all of the comparison data S174, S175, S176, and S177 indicate a mismatch, the selection circuit 179 outputs an instruction signal S179₅ indicating a page fault to the control circuit 180. On receiving the instruction signal S179₅ indicating a page fault, the control circuit 180 reads a page from the start address in the main memory 150 indicated by the page select field 200 of the address stored in the DP register 149, and loads this page into the internal memory 147 via the local bus 153. The new page loaded into the internal memory 147 replaces, among the four pages stored in the banks 161 to 164, the page that was accessed longest ago. That is, the LRU (Least Recently Used) method is adopted. The page replacement algorithm is not limited to LRU, and various methods can be used. Like the cache-hit determination in a microprocessor using a cache system, page fault detection often becomes a critical path in the circuit. The critical path is a factor that determines the maximum operating frequency of the LSI.
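The page-fault condition and the LRU choice of the bank to replace can be sketched as below. The text does not say how the hardware tracks access order, so the per-bank `last_access_cycle` counters are a hypothetical bookkeeping scheme chosen for illustration, as are the function names.

```python
def is_page_fault(comparison_data):
    """True when every comparator reports a mismatch (signal S179_5)."""
    return not any(comparison_data)


def lru_victim(last_access_cycle):
    """Index of the bank whose page was accessed longest ago.

    `last_access_cycle[i]` is a hypothetical counter holding the cycle
    number of the most recent access to bank i.
    """
    return min(range(len(last_access_cycle)), key=last_access_cycle.__getitem__)


comparison = [False, False, False, False]   # S174..S177: no bank holds the page
last_access = [7, 3, 9, 5]                  # bank index 1 was used longest ago
if is_page_fault(comparison):
    print(lru_victim(last_access))          # prints 1
```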

【００７８】次に、上述したマイクロプロセッサ１２１
のメモリアクセス動作について説明する。図２３は、マ
イクロプロセッサ１２１のメモリアクセス動作を説明す
るためのフローチャートである。先ず、図２０に示すイ
ンストラクションページメモリ３５上のプログラムカウ
ンタ３４が指し示すアドレスから命令がフェッチされ、
デコーダ３６においてデコードされる。このとき、デコ
ードされた命令が、メインメモリ１５０のメモリアドレ
ス空間上のデータを用いた演算命令である場合には、当
該データのメインメモリ１５０のメモリアドレス空間内
のアドレスが、図２１に示すＤＰレジスタ１４９に記憶
される（ステップＳ１）。Next, the memory access operation of the microprocessor 121 described above will be explained. FIG. 23 is a flowchart for explaining the memory access operation of the microprocessor 121. First, an instruction is fetched from the address in the instruction page memory 35 shown in FIG. 20 that is indicated by the program counter 34, and is decoded by the decoder 36. If the decoded instruction is an operation instruction that uses data in the memory address space of the main memory 150, the address of that data in the memory address space of the main memory 150 is stored in the DP register 149 shown in FIG. 21 (step S1).

【００７９】次に、比較器１７４〜１７７において、そ
れぞれＤＰレジスタ１４９に記憶されたアドレスのペー
ジセレクトフィールド２００と、ＤＰＣレジスタ１７０
〜１７３に記憶されたアドレスとが比較され、それらの
比較データＳ１７４〜Ｓ１７７が選択回路１７９に出力
される（ステップＳ２）。Next, the comparators 174 to 177 compare the page select field 200 of the address stored in the DP register 149 with the addresses stored in the DPC registers 170 to 173, respectively, and output the resulting comparison data S174 to S177 to the selection circuit 179 (step S2).

【００８０】次に、選択回路１７９において、比較デー
タＳ１７４〜Ｓ１７７の何れかが、アドレスの一致を示
しているか否かが判断され（ステップＳ３）、一致して
いることを示す比較データがある場合には、その比較デ
ータＳ１７４〜Ｓ１７７に対応するバンク１６１〜１６
４に対して、イネーブルを示す指示信号Ｓ１７９₁〜１
７９₄を出力する。これによって、イネーブルを示す指
示信号Ｓ１７９₁〜１７９₄を入力したバンク１６１〜
１６４において、ＤＰレジスタ１４９に記憶されたアド
レスのページオフセットフィールド２０１に示されるペ
ージ内のアドレスからデータが読み出され、この読み出
されたデータが、ローカルバス１５３を介して、データ
レジスタｒ₀に転送され記憶される（ステップＳ４）。Next, the selection circuit 179 determines whether any of the comparison data S174 to S177 indicates an address match (step S3). If any comparison data indicates a match, the signal S179₁ to S179₄ indicating enable is output to the bank 161 to 164 corresponding to that comparison data. In the bank that receives the enable signal, data is read from the in-page address indicated by the page offset field 201 of the address stored in the DP register 149, and the read data is transferred to and stored in the data register r₀ via the local bus 153 (step S4).

【００８１】一方、選択回路１７９において、比較デー
タＳ１７４〜Ｓ１７７の全てが、不一致であることを示
していると判断された場合には（ステップＳ３）、前述
したＬＲＵのアルゴリズムを用いて、メインメモリ１５
０から対応するページをローカルバス１５３を介して内
部メモリ１４７に読み出すことを決定する（ステップＳ
５）。On the other hand, if the selection circuit 179 determines that all of the comparison data S174 to S177 indicate a mismatch (step S3), it is decided, using the LRU algorithm described above, to read the corresponding page from the main memory 150 into the internal memory 147 via the local bus 153 (step S5).

【００８２】そして、ＬＲＵのアルゴリズムによって入
れ替え対象となった内部メモリ１４７のバンク１６１〜
１６４に記憶されたページ内にダーティビットが存在す
るか否かが判断され（ステップＳ６）、ダーディビット
が存在する場合には、当該ページをメインメモリ１５０
にライトバックする（ステップＳ８）。ここで、ダーテ
ィビットは、例えば、マイクロプロセッサ１２１によっ
て、当該ページに書き込みが行われた場合に付される。It is then determined whether a dirty bit is set for the page stored in the bank, among the banks 161 to 164 of the internal memory 147, that the LRU algorithm selected for replacement (step S6). If a dirty bit is set, that page is written back to the main memory 150 (step S8). The dirty bit is set, for example, when the microprocessor 121 has written to the page.

【００８３】次に、ステップＳ５において、入れ替え対
象となった内部メモリ１４７のバンク１６１〜１６４
に、ＤＰレジスタ１４９に記憶されたアドレスによって
指し示されるページをメインメモリ１５０からローカル
バス１５３を介して読み込み、ページの入れ替えを行う
（ステップＳ７）。このとき、ＤＰレジスタ１４９に記
憶されたアドレスのページセレクトフィールド２００に
よって、入れ替えを行ったバンク１６１〜１６４に対応
するＤＰＣレジスタ１７０〜１７３に記憶されたアドレ
スを更新する。Next, the page pointed to by the address stored in the DP register 149 is read from the main memory 150 via the local bus 153 into the bank, among the banks 161 to 164 of the internal memory 147, that was selected for replacement in step S5, and the pages are swapped (step S7). At this time, the address stored in the DPC register 170 to 173 corresponding to the replaced bank 161 to 164 is updated with the page select field 200 of the address stored in the DP register 149.

【００８４】一方、入れ替えの対象となったバンク１６
１〜１６４に記憶されたページ内にダーティビットが存
在しない場合にも、当該入れ替えの対象となったページ
を、メインメモリ１５０から読み込んだページと入れ替
えるが、メインメモリ１５０へのライトバックは行わな
い（ステップＳ７）。On the other hand, if no dirty bit is set for the page stored in the bank 161 to 164 selected for replacement, that page is likewise replaced with the page read from the main memory 150, but no write-back to the main memory 150 is performed (step S7).
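Steps S1 to S8 of the flowchart can be summarized in a small behavioral model. This is a sketch under stated assumptions, not the circuit itself. The class `BankedMemory`, its attribute names, the access counter used for LRU, a single dirty flag per bank, and a dictionary standing in for the main memory 150 are all hypothetical. Unlike the timing example in the text, the banks here start empty.

```python
PAGE_BYTES = 512
WORDS_PER_PAGE = PAGE_BYTES // 4


class BankedMemory:
    """Toy model of the internal memory 147 with one-page banks."""

    def __init__(self, main_memory, n_banks=4):
        self.main = main_memory              # dict: page number -> list of words
        self.tags = [None] * n_banks         # DPC registers 170..173
        self.pages = [None] * n_banks        # contents of banks 161..164
        self.dirty = [False] * n_banks
        self.last_use = [0] * n_banks        # assumed LRU bookkeeping
        self.clock = 0
        self.write_backs = 0

    def _bank_for(self, address):
        tag = address >> 9                   # S1: address is held in the DP register
        self.clock += 1
        if tag in self.tags:                 # S2, S3: some comparator matched
            bank = self.tags.index(tag)
        else:                                # S3: all mismatched, page fault
            bank = min(range(len(self.tags)), key=self.last_use.__getitem__)  # S5
            if self.dirty[bank]:             # S6: victim page was written to
                self.main[self.tags[bank]] = self.pages[bank]                 # S8
                self.write_backs += 1
            blank = [0] * WORDS_PER_PAGE
            self.pages[bank] = list(self.main.get(tag, blank))                # S7
            self.tags[bank] = tag            # update the DPC register
            self.dirty[bank] = False
        self.last_use[bank] = self.clock
        return bank

    def read(self, address):
        bank = self._bank_for(address)
        return self.pages[bank][(address >> 2) & 0x7F]   # S4: word at page offset

    def write(self, address, value):
        bank = self._bank_for(address)
        self.pages[bank][(address >> 2) & 0x7F] = value
        self.dirty[bank] = True


mem = BankedMemory({})
mem.write(0x0000, 11)                 # page 0 is loaded and marked dirty
for addr in (0x0200, 0x0400, 0x0600, 0x0800):
    mem.read(addr)                    # the fifth distinct page evicts page 0
print(mem.tags, mem.write_backs)
```

In this run the fifth distinct page replaces the least recently used bank, which holds the dirty page 0, so exactly one write-back occurs.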

【００８５】図２４は、図２０および図２１に示すマイ
クロプロセッサ１２１におけるメモリアクセスのタイミ
ング図である。図２４（Ａ）に示すように、最初の５サ
イクルではバンク１６１に対してのアクセスおよびその
手続き（オペレーション）を行い、次の５サイクルでは
バンク１６２に対してのアクセスおよびその手続きを行
い、次の３サイクルではバンク１６３に対してのアクセ
スおよびその手続きを行う。なお、図２４（Ａ）におい
て、「Ｘ」はＤＰレジスタ１４９に記憶されたアドレス
の更新を行うオペレーションを示し、「Ｙ」はデータレ
ジスタｒ₀に対しての読み出し動作あるいは書き込み動
作を行うオペレーションを示している。ここで、バンク
１６１〜１６４は、既に有効なデータを保持しているも
のとする。FIG. 24 is a timing chart of memory access in the microprocessor 121 shown in FIGS. 20 and 21. As shown in FIG. 24(A), the bank 161 is accessed and the associated procedures (operations) are performed in the first five cycles, the bank 162 in the next five cycles, and the bank 163 in the following three cycles. In FIG. 24(A), "X" denotes an operation that updates the address stored in the DP register 149, and "Y" denotes an operation that performs a read or write to the data register r₀. Here, the banks 161 to 164 are assumed to already hold valid data.

【００８６】図２４（Ｂ）は、バンク１６１に対しての
アクセス動作を行う際にＤＰレジスタ１４９に記憶され
るアドレスのタイミングを示し、アドレス「Ａ０」，
「Ａ１」，「Ａ２」，「Ａ３」，「Ａ４」，「Ａ５」
は、図２２に示すページセレクトフィールド２００に対
応するアドレスが全て同一である。図２４（Ｃ）は、バ
ンク１６１に対して読み出し動作あるいは書き込み動作
が行われるデータを示している。図２４（Ｃ）に示すデ
ータは、データレジスタｒ₀に記憶される。FIG. 24(B) shows the timing of the addresses stored in the DP register 149 when the bank 161 is accessed. The addresses "A0", "A1", "A2", "A3", "A4", and "A5" all have the same value in the part corresponding to the page select field 200 shown in FIG. 22. FIG. 24(C) shows the data read from or written to the bank 161. The data shown in FIG. 24(C) is stored in the data register r₀.

【００８７】また、図２４（Ｄ）は、バンク１６２に対
してのアクセス動作を行う際にＤＰレジスタ１４９に記
憶されるアドレスのタイミングを示し、アドレス「Ｂ
０」，「Ｂ１」，「Ｂ２」，「Ｂ３」，「Ｂ４」，「Ｂ
５」は、図２２に示すページセレクトフィールド２００
に対応するアドレスが全て同一である。図２４（Ｅ）
は、バンク１６２に対して読み出し動作あるいは書き込
み動作が行われるデータを示している。図２４（Ｅ）に
示すデータは、データレジスタｒ₀に記憶される。FIG. 24(D) shows the timing of the addresses stored in the DP register 149 when the bank 162 is accessed. The addresses "B0", "B1", "B2", "B3", "B4", and "B5" all have the same value in the part corresponding to the page select field 200 shown in FIG. 22. FIG. 24(E) shows the data read from or written to the bank 162. The data shown in FIG. 24(E) is stored in the data register r₀.

【００８８】また、図２４（Ｆ）は、バンク１６３に対
してのアクセス動作を行う際にＤＰレジスタ１４９に記
憶されるアドレスのタイミングを示し、アドレス「Ｃ
０」，「Ｃ１」，「Ｃ２」，「Ｃ３」は、図２２に示す
ページセレクトフィールド２００に対応するアドレスが
全て同一である。図２４（Ｇ）は、バンク１６３に対し
て読み出し動作あるいは書き込み動作が行われるデータ
を示している。図２４（Ｇ）に示すデータは、データレ
ジスタｒ₀に記憶される。FIG. 24(F) shows the timing of the addresses stored in the DP register 149 when the bank 163 is accessed. The addresses "C0", "C1", "C2", and "C3" all have the same value in the part corresponding to the page select field 200 shown in FIG. 22. FIG. 24(G) shows the data read from or written to the bank 163. The data shown in FIG. 24(G) is stored in the data register r₀.

【００８９】さらに、図２４（Ｈ）は、バンク１６４に
対してのアクセス動作を行う際にＤＰレジスタ１４９に
記憶されるアドレスのタイミングを示し、図２４（Ｉ）
は、バンク１６４に対して読み出し動作あるいは書き込
み動作が行われるデータを示している。FIG. 24(H) shows the timing of the addresses stored in the DP register 149 when the bank 164 is accessed, and FIG. 24(I) shows the data read from or written to the bank 164.

【００９０】図２４（Ａ），（Ｂ），（Ｃ）に示すよう
に、１サイクル目では、バンク１６１を選択するために
ＤＰレジスタ１４９にアドレス「Ａ０」を書き込む。こ
れによって、データレジスタｒ₀に、バンク１６１内の
アドレス「Ａ０」のページオフセットフィールド２０１
によって示されるアドレスからデータが読み出され、図
２４（Ｃ）に示すように、バンク１６１のデータポート
からデータ「ｄ０」が出力され、このデータ「ｄ０」が
データレジスタｒ₀に記憶される。その後、マイクロプ
ロセッサ１２１において、前述した第１実施形態のマイ
クロプロセッサ１において述べたように、ＤＰレジスタ
１４９のアドレスが自動的に更新され、バンク内の連続
したアドレスが順次にアクセスされる。As shown in FIGS. 24(A), (B), and (C), in the first cycle the address "A0" is written to the DP register 149 in order to select the bank 161. As a result, data is read from the address in the bank 161 indicated by the page offset field 201 of the address "A0". As shown in FIG. 24(C), the data "d0" is output from the data port of the bank 161 and stored in the data register r₀. Thereafter, as described for the microprocessor 1 of the first embodiment, the address in the DP register 149 is automatically updated in the microprocessor 121, and consecutive addresses in the bank are accessed sequentially.

【００９１】データレジスタｒ₀には、ＤＰレジスタ１
４９に記憶されたアドレスを更新した次のサイクルで、
特に指定のない限りデータの先読みを実行する。従っ
て、２サイクル目では、アドレス「Ａ０」が更新されて
「Ａ１」となる。これによって、バンク１６１のアドレ
ス「Ａ１」からデータ「ｄ１」が読み出され、バンク１
６１のデータポートから出力される。この２サイクル目
では、バンク１６１が選択されているため、データレジ
スタｒ₀に記憶されたデータ「ｄ０」が、図２０に示す
ＡＬＵ１３の入力ポートに出力された直後に、バンク１
６１のデータポートからデータ「ｄ１」が出力される。Unless otherwise specified, data is prefetched into the data register r₀ in the cycle following the update of the address stored in the DP register 149. Accordingly, in the second cycle, the address "A0" is updated to "A1". As a result, the data "d1" is read from the address "A1" of the bank 161 and output from the data port of the bank 161. Because the bank 161 is selected in this second cycle, the data "d1" is output from the data port of the bank 161 immediately after the data "d0" stored in the data register r₀ is output to the input port of the ALU 13 shown in FIG. 20.

【００９２】以下、同様な処理が行われ、図２４（Ｊ）
に示すように、データレジスタｒ₀には、データ「ｄ
０」〜「ｄ１２」が順次に記憶される。Thereafter, similar processing is performed, and as shown in FIG. 24(J), the data "d0" to "d12" are stored sequentially in the data register r₀.

【００９３】以上説明したように、マイクロプロセッサ
１２１によれば、バンク選択モジュール１５１を設ける
ことで、４個のバンク１６１〜１６４に対して、単数の
ＤＰレジスタ１４９を用いてアクセスできる。なお、バ
ンク選択モジュール１５１のＤＰＣレジスタ１７０〜１
７３は、ユーザは通常の方法ではアクセスできず、ユー
ザが管理する必要がない。As described above, according to the microprocessor 121, providing the bank selection module 151 makes it possible to access the four banks 161 to 164 using a single DP register 149. The DPC registers 170 to 173 of the bank selection module 151 cannot be accessed by the user by ordinary means and do not need to be managed by the user.

【００９４】ところで、第１実施形態および第２実施形
態のように、内部メモリのバンク毎に固定したデータレ
ジスタを設けると、汎用レジスタの数に制限がある場合
には、汎用レジスタとして使用できるレジスタの数が減
ってしまい、処理能力の向上が抑えられる可能性があ
る。これに対して、マイクロプロセッサ１２１では、単
数のデータレジスタｒ₀が４個の全てのバンク１６１〜
１６４とデータ転送を行うことができるため、汎用レジ
スタモジュール１２３に設けられた１６個のレジスタの
うちの一つのみをデータレジスタｒ₀として設ければよ
い。その結果、汎用レジスタとして使用できるレジスタ
の数を増やすことができ、第１実施形態および第２実施
形態に比べて、処理能力をさらに向上できる。マイクロ
プロセッサ１２１は、特に、内部メモリのバンク数が多
い場合に有効である。Incidentally, if a fixed data register is provided for each bank of the internal memory as in the first and second embodiments, then when the number of general-purpose registers is limited, the number of registers usable as general-purpose registers decreases, which may limit the improvement in processing capability. In contrast, in the microprocessor 121, the single data register r₀ can transfer data to and from all four banks 161 to 164, so only one of the 16 registers provided in the general-purpose register module 123 needs to serve as the data register r₀. As a result, the number of registers usable as general-purpose registers can be increased, and processing capability can be improved further compared with the first and second embodiments. The microprocessor 121 is particularly effective when the internal memory has many banks.

【００９５】また、マイクロプロセッサ１２１によれ
ば、ＤＰＣレジスタ１７０〜１７３に、ＤＰレジスタ１
４９に記憶されたアドレスのページセレクトフィールド
２００に対応する部分のみを記憶するため、ＤＰレジス
タ１４９に記憶されたアドレスの全体を記憶する場合に
比べて、ＤＰＣレジスタ１７０〜１７３および比較器１
７４〜１７７の回路を縮小化できる。また、比較器１７
４〜１７７の比較処理を高速に行うことができる。Also, according to the microprocessor 121, the DPC registers 170 to 173 store only the part of the address stored in the DP register 149 that corresponds to the page select field 200. Compared with storing the entire address stored in the DP register 149, the circuits of the DPC registers 170 to 173 and the comparators 174 to 177 can therefore be made smaller, and the comparison in the comparators 174 to 177 can be performed at high speed.

【００９６】また、マイクロプロセッサ１２１によれ
ば、バンク数分の比較器１７４〜１７７を設けて比較処
理を並列化することで、ＤＰレジスタ１４９に記憶され
たアドレスの更新時と同一のサイクルで、バンク１６１
〜１６４から所望のデータを読み出すことができる。Also, according to the microprocessor 121, providing as many comparators 174 to 177 as there are banks and performing the comparisons in parallel allows the desired data to be read from the banks 161 to 164 in the same cycle in which the address stored in the DP register 149 is updated.

【００９７】また、マイクロプロセッサ１２１によれ
ば、ＤＰレジスタ１４９に記憶するアドレスを図２２に
示すような、ページセレクトフィールド２００とページ
オフセットフィールド２０１とで構成されるフォーマッ
トにしたことで、バンク選択モジュール１５１によって
同一のページがバンク１６１〜１６４のうち複数のバン
クに記憶されることを簡単な構成で回避でき、内部メモ
リ１４７を効率的に使用できる。また、ＤＰレジスタ１
４９のページオフセットフィールド２０１のみを内部メ
モリ１４７に接続すればよく、ローカルバス１５２のバ
ス幅を小さくできる。Also, according to the microprocessor 121, because the address stored in the DP register 149 has the format shown in FIG. 22, consisting of the page select field 200 and the page offset field 201, the bank selection module 151 can, with a simple configuration, prevent the same page from being stored in more than one of the banks 161 to 164, so the internal memory 147 can be used efficiently. In addition, only the page offset field 201 of the DP register 149 needs to be connected to the internal memory 147, so the bus width of the local bus 152 can be kept small.

【００９８】さらに、マイクロプロセッサ１２１によれ
ば、ＤＰレジスタ１４９に記憶されているアドレスのう
ち、必要なフィールドのみをＤＰＣレジスタ１７０およ
びバンク１６１〜１６４に転送しているため、アドレス
線の本数を抑制できる。なお、一般的に、アドレス線は
速いアクセス速度を得るために、配線が短く、容量が少
ない方が良い。また、アドレスとして用いる配線の本数
も少ないほうがよい。アクセスに伴う電力と、電源配線
容量の効果も無視できないからである。なお、マイクロ
プロセッサ１２１は、前述した第１実施形態のマイクロ
プロセッサ１の効果も同様に得ることができる。Furthermore, according to the microprocessor 121, only the necessary fields of the address stored in the DP register 149 are transferred to the DPC register 170 and the banks 161 to 164, so the number of address lines can be kept down. In general, to obtain a high access speed, address lines should be short and have low capacitance. The number of wires used for addresses should also be small, because the power consumed by accesses and the effect of power-supply wiring capacitance cannot be ignored. The microprocessor 121 also provides the same effects as the microprocessor 1 of the first embodiment described above.

【００９９】本発明は上述した実施形態には限定されな
い。例えば、上述したマイクロプロセッサ１２１では、
ローカルバス１５３を介して、ＤＰレジスタ１４９およ
びメインメモリ１５０とバンク１６１〜１６４との間の
データ伝送を行う場合について例示したが、マイクロプ
ロセッサ１２１において、例えば、図２５に示すよう
に、ローカルバス１５３ａ，１５３ｂを用いて、このデ
ータ伝送を実現してもよい。The present invention is not limited to the embodiments described above. For example, the microprocessor 121 described above was illustrated for the case where data transmission between the DP register 149 and the main memory 150 on the one hand and the banks 161 to 164 on the other is performed via the local bus 153. However, in the microprocessor 121 this data transmission may instead be realized using local buses 153a and 153b, as shown in FIG. 25, for example.

【０１００】この場合には、図２５に示すように、ロー
カルバス１５３ａにデータレジスタｒ₀を接続し、ロー
カルバス１５３ｂにメインメモリ１５０を接続する。ま
た、バンク１６１〜１６４を、それぞれマルチプレクサ
３０１〜３０４を介して、ローカルバス１５３ａおよび
１５３ｂに接続する。マルチプレクサ３０１〜３０４
は、それぞれバンク１６１〜１６４を、ローカルバス１
５３ａおよび１５３ｂの何れか一方と選択的に接続す
る。制御回路３００は、制御信号Ｓ３００ａをバンク１
６１〜１６４に出力し、データレジスタｒ₀との間のデ
ータの読み出しおよび書き込みを制御する。また、制御
回路３００は、制御信号Ｓ３００ｂをバンク１６１〜１
６４に出力し、メインメモリ１５０との間のページ入れ
替え処理を制御する。In this case, as shown in FIG. 25, the data register r₀ is connected to the local bus 153a, and the main memory 150 is connected to the local bus 153b. The banks 161 to 164 are connected to the local buses 153a and 153b via multiplexers 301 to 304, respectively. The multiplexers 301 to 304 selectively connect the banks 161 to 164, respectively, to either the local bus 153a or the local bus 153b. The control circuit 300 outputs a control signal S300a to the banks 161 to 164 to control the reading and writing of data to and from the data register r₀. The control circuit 300 also outputs a control signal S300b to the banks 161 to 164 to control the page replacement processing with the main memory 150.

【０１０１】図２５に示すような構成を採用すること
で、バンク１６１〜１６４のうち一のバンクとデータレ
ジスタｒ₀との間のデータ転送と、その他のバンクとメ
インメモリ１５０との間のページ入れ替え処理とを並行
して行う（多重化する）ことができる。そのため、ペー
ジフォルトに伴うページ入れ替え処理に要する時間を隠
蔽することができる。By adopting the configuration shown in FIG. 25, data transfer between one of the banks 161 to 164 and the data register r₀ can be performed in parallel with (multiplexed with) page replacement between another bank and the main memory 150. The time required for the page replacement caused by a page fault can therefore be hidden.

【０１０２】また、マイクロプロセッサ１２１では、汎
用レジスタモジュール１１に単数のＤＰレジスタ１４９
を設ける場合について例示したが、本発明は、複数のＤ
Ｐレジスタを設け、そのうち少なくとも一つが複数のバ
ンクと接続可能な構成にしてもよい。Also, although the microprocessor 121 was illustrated with a single DP register 149 provided in the general-purpose register module 11, the present invention may instead provide a plurality of DP registers, at least one of which is connectable to a plurality of banks.

【０１０３】また、マイクロプロセッサ１２１では、Ｄ
Ｐレジスタ１４９に記憶されたアドレスが更新される度
に、比較器１７４〜１７７で比較処理を行う場合につい
て例示したが、例えば、選択回路１７９あるいは制御回
路１８０によって、ＤＰレジスタ１４９に記憶されたペ
ージセレクトフィールド２００が更新されたか否かを監
視し、ページセレクトフィールド２００が更新された場
合にのみ、比較器１７４〜１７７における比較処理を行
うように制御してもよい。このように制御を行うこと
で、比較器１７４〜１７７における比較処理の実行頻度
を飛躍的に減少でき、低電力化を図ることができる。Also, the microprocessor 121 was illustrated for the case where the comparators 174 to 177 perform the comparison every time the address stored in the DP register 149 is updated. Alternatively, for example, the selection circuit 179 or the control circuit 180 may monitor whether the page select field 200 stored in the DP register 149 has been updated, and the comparators 174 to 177 may be controlled to perform the comparison only when the page select field 200 has been updated. Such control can drastically reduce how often the comparators 174 to 177 perform the comparison, which reduces power consumption.
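The effect of comparing only when the page select field changes can be estimated with a short sketch. The function name `count_comparisons` and the address stream are hypothetical. The stream merely borrows the 5 + 5 + 3 access pattern of the FIG. 24 example, with three consecutive pages assumed.

```python
def count_comparisons(addresses, only_on_page_change):
    """Count how often the comparator set would run for an address stream."""
    runs = 0
    previous_tag = None
    for addr in addresses:
        tag = addr >> 9                      # page select field 200
        if not only_on_page_change or tag != previous_tag:
            runs += 1                        # all comparators fire once
        previous_tag = tag
    return runs


# 13 word accesses: 5 in one page, 5 in the next, 3 in a third.
stream = (
    [0x0000 + 4 * i for i in range(5)]
    + [0x0200 + 4 * i for i in range(5)]
    + [0x0400 + 4 * i for i in range(3)]
)
print(count_comparisons(stream, False), count_comparisons(stream, True))   # 13 3
```

For this stream the comparators run 3 times instead of 13. The actual saving depends on how often a program crosses page boundaries.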

[0104] Although the microprocessor 121 described above has four banks in the internal memory 147, in the present invention the internal memory may have any number of banks, provided there are two or more. Also, although the DPC registers 170 to 173 of the microprocessor 121 described above store only the page select field 200 of the DP register 149, the DPC registers 170 to 173 may instead store the entire address stored in the DP register 149, or only the page select field 200 and the page offset field 201.
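As a rough illustration of why the DPC registers can hold either the page select field alone or a longer portion of the address, the sketch below extracts the page select field before comparing. The field widths, the bit layout (page select above page offset), and the function names are assumptions made for this example; the actual layout is the one shown for the DP register in FIG. 22.

```python
PAGE_OFFSET_BITS = 8  # assumed width of the page offset field
PAGE_SELECT_BITS = 8  # assumed width of the page select field


def page_select(address: int) -> int:
    """Extract the (assumed) page select field from an address."""
    return (address >> PAGE_OFFSET_BITS) & ((1 << PAGE_SELECT_BITS) - 1)


def page_offset(address: int) -> int:
    """Extract the (assumed) page offset field from an address."""
    return address & ((1 << PAGE_OFFSET_BITS) - 1)


def bank_hit(dpc_value: int, dp_address: int, dpc_holds_address: bool) -> bool:
    """Compare a DPC register with the DP register on the page select field.

    dpc_holds_address=False: the DPC register holds only the page select field.
    dpc_holds_address=True:  the DPC register holds a longer address value,
                             so the page select field is extracted first.
    """
    held_page = page_select(dpc_value) if dpc_holds_address else dpc_value
    return held_page == page_select(dp_address)
```

Either storage choice gives the same hit decision in this model; storing only the page select field simply needs fewer register bits.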

[0105] Although the microprocessor 1 has been described with two general-purpose registers of the general-purpose register module 11 used as the data registers r₀ and r₁, the number of data registers may be any number of one or more. Besides serving as an interface to the internal memory 47, the data registers r₀ and r₁ may be used, for example, as FIFO (First In First Out) memories for communication between microprocessors in a parallel processor equipped with a plurality of microprocessors. The data registers r₀ and r₁ may also be used as a local memory or a stack memory.

[0106] Although the microprocessor 1 has the multiplexer 12 and the multiplexer 16 provided independently, as shown in FIG. 1, the function of the multiplexer 16 may be incorporated into the multiplexer 12.

[0107] Although the internal memory 47 of the microprocessor 1 is divided into two banks, as shown in FIG. 2, the internal memory 47 may instead be divided into three or more banks, or may not be divided into banks at all.

[0108] Although the microprocessor 1 has the data registers r₀ and r₁ provided in the general-purpose register module 11 as part of the plurality of general-purpose registers, as shown in FIG. 1, the data registers r₀ and r₁ may instead be provided outside the general-purpose register module 11 and handled separately from the general-purpose registers.

[0109] In the microprocessor 81 having the memory structure shown in FIG. 16, three FIFO memories may be provided in place of the internal memory 87, with an input FIFO memory connected to the data register r₁-R and an output FIFO memory connected to the data register r₁-W.
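The FIFO variant can be pictured with the following minimal sketch, in which reading the data register r₁-R consumes the oldest item of an input FIFO and writing the data register r₁-W appends to an output FIFO. The class and function names are hypothetical, only the two FIFO connections described in the text are modelled, and the doubling operation is an arbitrary stand-in for an ALU operation.

```python
from collections import deque
from typing import Deque, Iterable, List


class FifoBackedRegisters:
    """Hypothetical model of data registers r1-R and r1-W backed by FIFOs."""

    def __init__(self, input_items: Iterable[int]) -> None:
        self.input_fifo: Deque[int] = deque(input_items)
        self.output_fifo: Deque[int] = deque()

    def read_r1(self) -> int:
        """Reading r1-R takes the oldest entry from the input FIFO."""
        return self.input_fifo.popleft()

    def write_r1(self, value: int) -> None:
        """Writing r1-W appends the value to the output FIFO."""
        self.output_fifo.append(value)


def double_stream(items: Iterable[int]) -> List[int]:
    """Process a stream using register-style reads and writes only."""
    regs = FifoBackedRegisters(items)
    while regs.input_fifo:
        regs.write_r1(regs.read_r1() * 2)
    return list(regs.output_fifo)
```

The loop body names only registers, so under this assumption the same instruction sequence would work whether the registers are backed by banked memory or by FIFOs.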

[0110]

[Effects of the Invention] As described above, according to the arithmetic processing device and the memory access method of the present invention, data can be transferred between the arithmetic means and the plurality of banks of the internal memory via a single data register, so register resources are used effectively. As a result, even when a general-purpose register is used as the data register, the general-purpose registers can still perform their function properly. Furthermore, according to the arithmetic processing device and the memory access method of the present invention, access to the internal memory is realized through the data register, so access to the internal memory can be handled in the same way as access to a register. This reduces the burden on the user when writing programs.
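The effect stated above, a single data register reaching whichever bank holds the page addressed by the data pointer, can be sketched in software as follows. This is an illustrative model only: the class `BankedMemory`, the four-word page size, the round-robin choice of the bank to replace, and the write-back of the replaced page are all assumptions made for this example and are not taken from the specification.

```python
from typing import List, Optional

PAGE_SIZE = 4  # assumed number of words per page


class BankedMemory:
    """Hypothetical model: one data register (r0) in front of several banks."""

    def __init__(self, main_memory: List[int], num_banks: int = 2) -> None:
        # main_memory length is assumed to be a multiple of PAGE_SIZE.
        self.main = main_memory
        self.head: List[Optional[int]] = [None] * num_banks  # head address per bank
        self.pages: List[List[int]] = [[0] * PAGE_SIZE for _ in range(num_banks)]
        self.dp = 0        # data pointer register
        self.misses = 0    # page replacements performed
        self._victim = 0   # next bank to replace (assumed round-robin policy)

    def _bank_for_dp(self) -> int:
        head = self.dp - self.dp % PAGE_SIZE
        # Compare the head address held for each bank with the DP address.
        for bank, held in enumerate(self.head):
            if held == head:
                return bank
        # Miss: replace one bank's page with the page from main memory.
        self.misses += 1
        bank = self._victim
        self._victim = (self._victim + 1) % len(self.head)
        old = self.head[bank]
        if old is not None:
            self.main[old:old + PAGE_SIZE] = self.pages[bank]  # assumed write-back
        self.pages[bank] = self.main[head:head + PAGE_SIZE]
        self.head[bank] = head
        return bank

    def read_r0(self) -> int:
        """Reading r0 returns the word addressed by the data pointer."""
        return self.pages[self._bank_for_dp()][self.dp % PAGE_SIZE]

    def write_r0(self, value: int) -> None:
        """Writing r0 stores the word at the address in the data pointer."""
        self.pages[self._bank_for_dp()][self.dp % PAGE_SIZE] = value
```

A program using this model sets `dp` and then reads or writes `r0` as it would any register; the bank selection and any page replacement happen behind that register access.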

[Brief description of the drawings]

FIG. 1 is a configuration diagram of a microprocessor according to a first embodiment of the present invention.

FIG. 2 is a diagram for explaining the data registers, the DP registers, and the internal memory of the microprocessor shown in FIG. 1.

FIG. 3 is a diagram for explaining the configuration of the general-purpose register module shown in FIG. 1.

FIG. 4 is a diagram for explaining four-stage pipeline processing in the microprocessor shown in FIG. 1.

FIG. 5 is a diagram for explaining pipeline processing in the microprocessor shown in FIG. 1.

FIG. 6 is a diagram for explaining the format of a two-operand operation instruction used in the microprocessor shown in FIG. 1.

FIG. 7 is a diagram for explaining the operation instruction shown in FIG. 6 when a data register is specified as the source register.

FIG. 8 is a diagram for explaining the operation instruction shown in FIG. 6 when a data register is specified as the destination register.

FIG. 9 is a diagram for explaining the operation instruction shown in FIG. 6 when data registers are specified as both the source register and the destination register.

FIG. 10 is a diagram for explaining the pipeline stall that occurs when the operation instruction shown in FIG. 8 is executed consecutively.

FIG. 11 is a diagram for explaining the format of a three-operand operation instruction used in the microprocessor shown in FIG. 1.

FIG. 12 is a diagram for explaining a program, for the microprocessor shown in FIG. 1, that describes the same processing as the program shown in FIG. 30 for a conventional microprocessor.

FIG. 13 is a diagram for explaining a program, for the microprocessor shown in FIG. 1, that describes the same processing as the program shown in FIG. 12 with data registers specified as the source register and the destination address.

FIG. 14 is a diagram for explaining a program, for the microprocessor shown in FIG. 1, that processes data stored at consecutive memory addresses in the internal memory.

FIG. 15 is a diagram for explaining a program, for the microprocessor shown in FIG. 1 configured with three memory access paths, that includes an add instruction in which data registers are specified for all three operands.

FIG. 16 is a configuration diagram of the periphery of the internal memory of a microprocessor according to a second embodiment of the present invention.

FIG. 17 is a diagram for explaining an example in which a data register is specified as the destination address in a two-operand operation instruction.

FIG. 18 is a diagram for explaining a program that processes data stored at consecutive memory addresses in the internal memory in the microprocessor according to the second embodiment of the present invention.

FIG. 19 is a diagram for explaining execution of the program shown in FIG. 18 by the four-stage pipeline processing shown in FIG. 4.

FIG. 20 is a configuration diagram of the periphery of the ALU of a microprocessor according to a third embodiment of the present invention.

FIG. 21 is a configuration diagram of the periphery of the internal memory of the microprocessor shown in FIG. 20.

FIG. 22 is a diagram for explaining the configuration of the DP register shown in FIG. 21.

FIG. 23 is a flowchart for explaining the memory access operation of the microprocessor shown in FIGS. 20 and 21.

FIG. 24 is a timing chart of memory access in the microprocessor shown in FIGS. 20 and 21.

FIG. 25 is a configuration diagram of a modification of the periphery of the internal memory of the microprocessor shown in FIG. 21.

FIG. 26 is a diagram for explaining the general-purpose registers of conventional RISC-type and CISC-type microprocessors.

FIG. 27 is a diagram for explaining an ALU operation instruction having three register specifiers in a conventional microprocessor.

FIG. 28 is a diagram for explaining an instruction that processes data in memory in a conventional microprocessor.

FIG. 29 is a diagram for explaining five-stage pipeline processing in a conventional microprocessor.

FIG. 30 is a diagram for explaining a program in which data in memory is the target of an ALU operation in the five-stage pipeline processing shown in FIG. 29.

FIG. 31 is a diagram for explaining a program, for a conventional microprocessor, that repeatedly accesses addresses separated by a fixed distance in the memory address space.

[Explanation of symbols]

1: microprocessor; 11, 123: general-purpose register module; 12, 16, 125, 127: multiplexer; 13: ALU; 21, 22, 23, 24, 25, 26, 27: bus; 30, 31, 149: DP register; 35: instruction page memory; 36: decoder; 45, 46: bank; 47, 147: internal memory; 50: external main memory; 150: main memory; 151: bank selection module; 152, 153: local bus; 161 to 164: bank; 170 to 173: DPC register; 174 to 177: comparator; 179: selection circuit; 180: control circuit

Claims

[Claims]

1. An arithmetic processing device comprising: an internal memory having a plurality of banks, each storing a page of a predetermined data amount stored at consecutive addresses in a memory address space; a data pointer register that stores an address in the memory address space; arithmetic means for performing arithmetic processing; a data register for transferring data between the internal memory and the arithmetic means; page presence/absence determination means for storing the head addresses, in the memory address space, of the pages stored in the plurality of banks, comparing the stored head addresses with the address stored in the data pointer register, and determining, based on a result of the comparison, whether a page to be accessed exists in any of the plurality of banks; and control means for controlling, when the determination finds that the page to be accessed exists in one of the plurality of banks, transfer of data between the bank storing the page to be accessed and the data register, using the address stored in the data pointer register.

2. The arithmetic processing device according to claim 1, further comprising a main memory having the memory address space, wherein, when the determination finds that the page to be accessed does not exist in any of the plurality of banks, the control means reads the page to be accessed from the main memory and replaces a page stored in the plurality of banks with it.

3. The arithmetic processing device according to claim 1, wherein the page presence/absence determination means comprises: a plurality of head address storage means for storing the head addresses, in the memory address space, of the pages stored in the plurality of banks; a plurality of comparing means for comparing the head addresses stored in the plurality of head address storage means, respectively, with the address stored in the data pointer register; and determining means for determining, based on comparison results of the plurality of comparing means, whether a page to be accessed exists in any of the plurality of banks, and wherein the control means performs control, based on a determination result of the determining means, so that data is transferred, using the address stored in the data pointer register, between the data register and the bank corresponding to the comparing means whose comparison found that the head address and the address stored in the data pointer register match.

4. The arithmetic processing device according to claim 1, wherein the page presence/absence determination means determines whether a page to be accessed exists in any of the plurality of banks each time the address stored in the data pointer register is updated.

5. The arithmetic processing device according to claim 1, wherein the address in the memory address space stored in the data pointer register includes an offset address of a page within the memory address space and an offset address of data within the page, and the page presence/absence determination means stores the offset addresses, within the memory address space, of the pages stored in the plurality of banks, and compares the stored page offset addresses with the page offset address included in the address stored in the data pointer register.

6. The arithmetic processing device according to claim 5, wherein the page presence/absence determination means performs the comparison only when the page offset address included in the address stored in the data pointer register is updated.

7. The arithmetic processing device according to claim 1, wherein data stored in said memory address space is stored in a plurality of banks of said internal memory so as not to be duplicated.

8. The arithmetic processing device according to claim 1, wherein a memory read operation is performed by transferring data read from the address, stored in the data pointer register, in the plurality of banks to the data register for storage, and transferring the data stored in the data register to the arithmetic means.

9. The arithmetic processing device according to claim 1, wherein a memory write operation is performed by writing the data stored in the data register to the address, stored in the data pointer register, in the plurality of banks.

10. The arithmetic processing device according to claim 1, wherein a part of a plurality of general-purpose registers is used as said data register.

11. A memory access method in which arithmetic means accesses, via a data register, an internal memory having a plurality of banks, each storing a page of a predetermined data amount stored at consecutive addresses in a memory address space, the method comprising: storing the head addresses, in the memory address space, of the pages stored in the plurality of banks; comparing the stored head addresses with an address stored in a data pointer register; determining, based on a result of the comparison, whether a page to be accessed exists in any of the plurality of banks; and, when the determination finds that the page to be accessed exists in one of the banks, transferring data between the bank storing the page to be accessed and the data register using the address stored in the data pointer register.