JPH06175848A

JPH06175848A - High-speed branch processing system for central processing unit

Info

Publication number: JPH06175848A
Application number: JP4325878A
Authority: JP
Inventors: Kazuto Nagatsuka; 一人永塚
Original assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Current assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Priority date: 1992-12-07
Filing date: 1992-12-07
Publication date: 1994-06-24

Abstract

PURPOSE:To fetch a branch destination address or a branch destination address with the high probability of branching next to a branching instruction for accelerating the branching instruction at a central processing unit (CPU) such as a microcomputer. CONSTITUTION:This system is constituted of a CPU 1 provided with a pipeline processing part 2 from fetch 3 to execution 4 and a program counter 5, a program counter stack means 7, a high-probability branch destination registering means 9, a program counter queue storage means 8 for the step of queuing a counter value and an instruction judging means 6 for analyzing an instruction 10 fetched 3, reloading the counter 5 with the branching destination address in an unconitional case, writing the branch destination in the counter when the instruction is registered, and loading the address value in the lowest step of the program counter queue storage means to the program counter 5 when the conditions are not satisfied.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、シーケンサ、マイコン
等の制御演算装置において、チップ内のアドレス比較用
テーブルに登録したアドレスに分岐する命令か、無条件
分岐命令をフェッチした時、その命令を実行する前にジ
ャンプ先の命令をフェッチし、分岐処理を高速化する中
央演算処理装置の高速分岐処理方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a control operation unit such as a sequencer or a microcomputer, which fetches an instruction for branching to an address registered in an address comparison table in a chip or an unconditional branch instruction. The present invention relates to a high-speed branch processing method of a central processing unit that fetches a jump destination instruction before executing it and speeds up branch processing.

【０００２】[0002]

【従来の技術】中央演算処理装置の分岐処理について従
来の処理方式例を説明する。図１２は、一般的な中央演
算処理装置の構成例である。2. Description of the Related Art An example of a conventional processing method for branching processing of a central processing unit will be described. FIG. 12 is a configuration example of a general central processing unit.

【０００３】中央演算処理装置1200はプログラムのフェ
ッチを行なうフェッチ部1210、デコードを行なうデコー
ド部1220、ソース・オペランドのフェッチを行なうソー
スフェッチ部1230、命令の実行を行なう実行部1240で構
成されている。実行部1240は、レジスタファイル1241、
プログラム・カウンタＰＣ1242、ＡＬＵ1243からなる。The central processing unit 1200 comprises a fetch unit 1210 for fetching a program, a decode unit 1220 for decoding, a source fetch unit 1230 for fetching source operands, and an execution unit 1240 for executing instructions. . The execution unit 1240 uses the register file 1241,
It consists of a program counter PC1242 and an ALU1243.

【０００４】フェッチ部1210、デコード部1220、ソース
フェッチ部1230、実行部1240は、それぞれ独立して動作
するので、４段のパイプライン処理（フェッチ、デコー
ド、ソースオペランドフェッチ、実行）を行なうことが
できる。Since the fetch unit 1210, the decode unit 1220, the source fetch unit 1230, and the execution unit 1240 operate independently of each other, four-stage pipeline processing (fetch, decode, source operand fetch, execution) can be performed. it can.

【０００５】図１３は、無条件分岐命令を含むプログラ
ム例であり、このプログラムを図１２に示した一般的な
中央演算処理装置の構成で処理したときのタイムチャー
トを図１４に示す。FIG. 13 shows an example of a program including an unconditional branch instruction, and FIG. 14 shows a time chart when this program is processed by the configuration of the general central processing unit shown in FIG.

【０００６】図１３に示したプログラムを図１２の装置
で実行すると、図１４のタイムチャートに示すように、
１サイクルずつ処理が進んでいく。まず、１サイクルで
はアドレスバス（Ａバス）に１０００番地が設定され、
データバス（Ｄバス）に１０００番地の内容であるＩＮ
ＣＡが設定される。そしてプログラム・カウンタ（Ｐ
Ｃ）は１０００番地になる。When the program shown in FIG. 13 is executed by the apparatus shown in FIG. 12, as shown in the time chart of FIG.
Processing progresses one cycle at a time. First, 1000 addresses are set on the address bus (A bus) in one cycle.
IN which is the contents of address 1000 on the data bus (D bus)
C A is set. And the program counter (P
C) is number 1000.

【０００７】第２サイクルでは、ＩＮＣＡがフェッチ
され、次のＪＭＰＬＯＯＰ命令がデータバスに設定さ
れる。同様に、第３サイクルではＩＮＣＡがデコード
され、ＪＭＰＬＯＯＰがフェッチされ、ＭＯＶＡ，
Ｂがデータバスに設定される。次の第４サイクルでＩＮ
ＣＡはソースフェッチされ、第５サイクルで実行され
る。In the second cycle, INC A is fetched and the next JMP LOOP instruction is set on the data bus. Similarly, in the third cycle, INC A is decoded, JMP LOOP is fetched, MOV A,
B is set to the data bus. IN the next 4th cycle
CA is source fetched and executed in the fifth cycle.

【０００８】分岐命令（ＪＵＭＰＬＯＯＰ）に注目す
ると、第３サイクルで、プログラム・フェッチ部1210が
インストラクション・メモリ1100からフェッチし、第４
サイクルでプログラム・デコード部1220が命令を分岐命
令と分岐先アドレスにデコードする。そして、第５サイ
クルではソースフェッチ部1230がソースオペランドのフ
ェッチを行ない、最後に第６サイクルでＪＵＭＰＬＯ
ＯＰ命令を実行する。Focusing on the branch instruction (JUMP LOOP), the program fetch unit 1210 fetches from the instruction memory 1100 in the third cycle,
In a cycle, the program decode unit 1220 decodes an instruction into a branch instruction and a branch destination address. Then, in the fifth cycle, the source fetch unit 1230 fetches the source operand, and finally in the sixth cycle, JUMP LO.
Execute the OP instruction.

【０００９】実行部1240では、プログラムカウンタ（Ｐ
Ｃ）の値を分岐先アドレスに書き換え、各パイプライン
の内容を無効にする。そして、サイクル７からのサイク
ルでは２０００番地以降をフェッチ−デコード−ソース
フェッチ−実行する処理を行なう。In the execution unit 1240, the program counter (P
The value of C) is rewritten to the branch destination address to invalidate the contents of each pipeline. Then, in the cycles from the cycle 7 onward, the processing of fetching-decoding-source fetching-execution from the address 2000 onward is executed.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、従来の
方式では、分岐命令の実行に伴ってパイプラインの内容
を全てクリアしてから分岐命令後の命令をフェッチ処理
から開始しなければならず、数サイクルのオーバヘッド
を生じるという問題がある。However, in the conventional method, it is necessary to clear all the contents of the pipeline as the branch instruction is executed and then start the instruction after the branch instruction from the fetch processing. There is a problem of causing cycle overhead.

【００１１】すなわち、図１４のタイムチャートにおい
ては、第６サイクルでＪＵＭＰ命令が実行された時点で
プログラムカウンタが２０００番地となり、パイプライ
ンの内容をすべて無効にしたのち２０００番地以降の命
令をフェッチから処理しており、第７サイクル〜第１０
サイクルでは実行部1240は稼働していない。４サイクル
分のオーバヘッドが生じていることになる。That is, in the time chart of FIG. 14, the program counter reaches the address 2000 when the JUMP instruction is executed in the sixth cycle, and after invalidating all the contents of the pipeline, the instructions after the address 2000 are fetched. Processing, 7th cycle to 10th cycle
The execution unit 1240 is not operating in the cycle. This means that there is an overhead of 4 cycles.

【００１２】本発明は、分岐処理におけるこのようなオ
ーバヘッドを解消し、分岐命令処理を高速化することを
目的とする。An object of the present invention is to eliminate such overhead in branch processing and speed up branch instruction processing.

【００１３】[0013]

【課題を解決するための手段】図１は、本発明の機能ブ
ロック図である。本発明は、命令１０のフェッチ３〜実
行４の処理をパイプライン処理する中央演算処理装置１
を前提とする。また、該中央演算処理装置１はプログラ
ム・カウンタ５を有する。FIG. 1 is a functional block diagram of the present invention. The present invention relates to a central processing unit 1 for pipeline processing of fetch 3 to execution 4 of an instruction 10.
Is assumed. Further, the central processing unit 1 has a program counter 5.

【００１４】また、命令判断手段６は、フェッチした命
令１０を入力とし、該命令１０が分岐命令か否かを判断
する。分岐命令でない場合には中央演算処理装置１のパ
イプライン処理部２が通常のフェッチ３〜実行４を行な
う。分岐命令の場合には、無条件分岐命令あるいは条件
分岐命令のいずれかに応じた処理を実行する。The instruction judging means 6 receives the fetched instruction 10 as an input and judges whether the instruction 10 is a branch instruction. If it is not a branch instruction, the pipeline processing unit 2 of the central processing unit 1 performs normal fetch 3 to execution 4. In the case of a branch instruction, the processing according to either the unconditional branch instruction or the conditional branch instruction is executed.

【００１５】前記命令判断手段６が命令１０を無条件分
岐命令と判断した場合には、命令判断手段６は、プログ
ラム・カウンタ５の内容を無条件分岐命令の分岐先アド
レスに書き換える。When the instruction judging means 6 judges that the instruction 10 is an unconditional branch instruction, the instruction judging means 6 rewrites the contents of the program counter 5 to the branch destination address of the unconditional branch instruction.

【００１６】プログラム・カウンタ・スタック手段７
は、無条件分岐命令のなかでもサブルーチンコール命令
の場合に、プログラム・カウンタ５の内容をスタックに
プッシュしたうえで、プログラム・カウンタ５の内容を
無条件分岐命令の分岐アドレスに書き換える。また、リ
ターンサブルーチン命令の場合は、スタックをホップし
てサブルーチンの戻り番地をプログラム・カウンタ５に
ロードする。Program counter stack means 7
In the case of a subroutine call instruction among unconditional branch instructions, pushes the contents of the program counter 5 onto the stack and then rewrites the contents of the program counter 5 with the branch address of the unconditional branch instruction. In the case of a return subroutine instruction, the stack is hopped and the return address of the subroutine is loaded into the program counter 5.

【００１７】分岐アドレス登録手段９は、分岐先アドレ
スに飛ぶ確率が高い場合に分岐先アドレスを登録してお
く。プログラム・カウンタ・キュー格納手段８は、プロ
グラム・カウンタ５の書き変わるタイミングにプログラ
ム・カウンタ値をキューイングする。パイプライン段数
よりも一段少ない退避用キューで、プログラム・カウン
タ５と同じビット幅を持つ。The branch address registration means 9 registers the branch destination address when the probability of jumping to the branch destination address is high. The program counter queue storing means 8 queues the program counter value at the timing when the program counter 5 is rewritten. The save queue is one stage smaller than the number of pipeline stages and has the same bit width as the program counter 5.

【００１８】命令判断手段６は、フェッチした命令１０
が条件分岐命令の場合に、分岐先アドレスが前記分岐ア
ドレス登録手段９に登録されているか否かを判断し、登
録してある場合にはプログラム・カウンタ５の内容を分
岐先アドレスに書き換える。The instruction judging means 6 uses the fetched instruction 10
Is a conditional branch instruction, it is judged whether or not the branch destination address is registered in the branch address registration means 9, and if it is registered, the contents of the program counter 5 are rewritten to the branch destination address.

【００１９】該条件分岐命令をパイプライン処理部２で
実行４した時点で、条件が成立した場合にはそのまま処
理を続けると、分岐先アドレス内の命令が次に実行され
る。一方、条件が不成立の場合には、プログラム・カウ
ンタ・キュー格納手段８はキューの最下段のアドレスを
プログラム・カウンタ５にロードする。このとき、キュ
ーの最下段には該条件分岐命令の次アドレスが入ってい
る。これによって、このアドレスをパイプライン処理部
２でフェッチ３〜実行４すると正しく命令が実行され
る。When the conditional branch instruction is executed 4 in the pipeline processing unit 2, if the condition is satisfied, if the processing is continued as it is, the instruction in the branch destination address is executed next. On the other hand, when the condition is not satisfied, the program counter queue storing means 8 loads the address of the lowest stage of the queue into the program counter 5. At this time, the next address of the conditional branch instruction is entered at the bottom of the queue. As a result, when this address is fetched 3 to 4 by the pipeline processing unit 2, the instruction is correctly executed.

【００２０】[0020]

【作用】まず、条件分岐命令において分岐する確率が高
い分岐先アドレスが存在する場合には、パイプライン処
理部２の実行４を通して該分岐先アドレスを分岐アドレ
ス登録手段９に登録しておく。First, if there is a branch destination address with a high probability of branching in the conditional branch instruction, the branch destination address is registered in the branch address registration means 9 through the execution 4 of the pipeline processing unit 2.

【００２１】また、プログラム・カウンタ・キュー格納
手段８は、プログラム・カウンタ５が書き変わるタイミ
ングごとにプログラム・カウンタ値をキューイングす
る。例えば、フェッチ３−デコード−ソース・フェッチ
−実行４の４段のパイプライン処理部２であれば３段の
退避キューを用意しておく。これによって、常に、実行
４で処理された命令アドレスの次アドレスがキューの最
下段に入っていることになる。Further, the program counter queue storing means 8 queues the program counter value every time the program counter 5 is rewritten. For example, in the case of the 4-stage pipeline processing unit 2 of fetch 3-decode-source fetch-execute 4, a 3-stage save queue is prepared. As a result, the next address of the instruction address processed in execution 4 is always in the bottom row of the queue.

【００２２】命令１０がパイプライン処理部２でフェッ
チ３されると、命令判断手段６はフェッチされた命令１
０のオペコードを分析する。命令１０が分岐命令以外で
あればパイプライン処理部２が通常のフェッチ〜実行を
行なう。When the instruction 10 is fetched 3 by the pipeline processing section 2, the instruction judging means 6 fetches the fetched instruction 1
Analyze 0 opcodes. If the instruction 10 is other than a branch instruction, the pipeline processing unit 2 performs normal fetch to execution.

【００２３】一方、命令１０が分岐命令の場合には、分
岐命令の種類によって対応する処理を実行する。まず、
命令１０が無条件分岐命令の場合には、命令判断手段６
は、プログラム・カウンタ５の内容を分岐先アドレスに
書き換える。これによって、無条件分岐命令の次に、分
岐先アドレスがフェッチ３〜実行４されることになる。On the other hand, when the instruction 10 is a branch instruction, the corresponding processing is executed depending on the type of the branch instruction. First,
When the instruction 10 is an unconditional branch instruction, the instruction judging means 6
Rewrites the contents of the program counter 5 to the branch destination address. As a result, the branch destination address is fetched 3 to 4 after the unconditional branch instruction.

【００２４】無条件分岐命令のなかでもサブルーチン・
コール命令の場合は、命令判断手段６は、プログラム・
カウンタ・スタック手段７にまずプログラム・カウンタ
５の内容をプッシュし、その後、分岐先アドレスである
サブルーチンのアドレスをプログラム・カウンタ５に書
き込む。これによって、プログラム・カウンタ・スタッ
ク手段７には、サブルーチン実行後の戻りアドレスが格
納されることになる。Among unconditional branch instructions, a subroutine
In the case of a call instruction, the instruction judging means 6
The contents of the program counter 5 are first pushed to the counter stack means 7, and then the address of the subroutine which is the branch destination address is written in the program counter 5. As a result, the return address after the execution of the subroutine is stored in the program counter stack means 7.

【００２５】そして、命令１０がリターン・サブルーチ
ン命令の場合は、プログラム・カウンタ・スタック手段
７をホップし、サブルーチンの戻りアドレスをプログラ
ム・カウンタ５に書き込む。これによって、リターンサ
ブルーチン命令の次に、戻り番地の命令がフェッチ３〜
実行４されることになる。When the instruction 10 is a return subroutine instruction, the program counter stack means 7 is hopped and the return address of the subroutine is written in the program counter 5. As a result, after the return subroutine instruction, the instruction at the return address is fetched 3 to
It will be executed 4.

【００２６】命令１０が条件分岐命令の場合、命令判断
手段６は分岐アドレス登録手段９に該条件分岐命令の分
岐先アドレスがあるか否かを判断する。分岐先アドレス
が登録されていない場合には、そのまま、通常のパイプ
ライン処理を続ける。When the instruction 10 is a conditional branch instruction, the instruction judging means 6 judges whether or not the branch address registering means 9 has a branch destination address of the conditional branch instruction. If the branch destination address is not registered, normal pipeline processing is continued.

【００２７】一方、分岐先アドレスが登録されている場
合には、命令判断手段６は分岐先アドレスをプログラム
・カウンタ５に書き込む。そして、条件分岐命令の次
に、分岐先アドレス以降の命令をパイプライン処理部２
で処理する。On the other hand, when the branch destination address is registered, the instruction judging means 6 writes the branch destination address in the program counter 5. Then, after the conditional branch instruction, the instructions after the branch destination address are processed by the pipeline processing unit 2
To process.

【００２８】実際に、該条件分岐命令がパイプライン処
理部２で実行４されると、条件が成立あるいは不成立に
なる。条件が成立した場合には、次にパイプライン処理
されている分岐先の命令を実行４すればよい。一方、条
件が不成立の場合は、条件分岐命令の次のアドレスの命
令を処理しなければならない。このアドレスはプログラ
ム・カウンタ・キュー格納手段８の最下段に入っている
ので、キューの最下段のアドレスをプログラム・カウン
タ５にロードしたうえ、パイプライン処理部２でフェッ
チ３〜実行４する。これによって、正しく条件分岐命令
の次のアドレスの命令が処理されることになる。When the conditional branch instruction is actually executed 4 in the pipeline processing unit 2, the condition is satisfied or not satisfied. If the condition is satisfied, the branch destination instruction that is pipelined next may be executed 4. On the other hand, if the condition is not satisfied, the instruction at the address next to the conditional branch instruction must be processed. Since this address is in the lowest stage of the program counter queue storing means 8, the address of the lowest stage of the queue is loaded into the program counter 5, and the pipeline processing unit 2 fetches 3 to 4 for fetching. As a result, the instruction at the address next to the conditional branch instruction is correctly processed.

【００２９】[0029]

【実施例】以下に、本発明の実施例を説明する。図２
は、一実施例のシステム構成図である。EXAMPLES Examples of the present invention will be described below. Figure 2
FIG. 1 is a system configuration diagram of an embodiment.

【００３０】本実施例は、従来システムと同様に中央処
理演算装置210 、インストラクションメモリ200 、デー
タ・メモリ290 よりなるが、中央処理演算装置210 を強
化している。This embodiment comprises a central processing unit 210, an instruction memory 200 and a data memory 290 as in the conventional system, but the central processing unit 210 is strengthened.

【００３１】中央処理演算装置210 は、プログラムのフ
ェッチを行なう命令フェッチ部220、フェッチした命令
をデコードする命令デコード部230 、ソースオペランド
のフェッチを行なうソースフェッチ部240 、命令の実行
を行なう実行部250 に加えて、命令の内容を判断する命
令判断部260 で構成する。また、命令フェッチ部220を
強化している。The central processing unit 210 includes an instruction fetch unit 220 for fetching a program, an instruction decode unit 230 for decoding the fetched instruction, a source fetch unit 240 for fetching a source operand, and an execution unit 250 for executing an instruction. In addition to the above, the instruction determining unit 260 for determining the content of the instruction is included. Also, the instruction fetch unit 220 is strengthened.

【００３２】命令フェッチ部220 は、フェッチ・キュー
221 およびプログラム・カウンタＰＣ222 、プログラム
・カウントをスタックするＰＣスタック223 、ＰＣをキ
ューイングするＰＣキュー224 で構成する。ＰＣキュー
224 は、フェッチ以下のパイプライン段数から１を引い
た段数である３段にしてある。The instruction fetch unit 220 uses the fetch queue.
221, a program counter PC222, a PC stack 223 for stacking program counts, and a PC queue 224 for queuing PCs. PC queue
The number 224 is three, which is the number of pipeline stages after fetching minus one.

【００３３】一方、命令判断部260 は、命令の解析を行
なう命令判断ロジック261 と分岐先のアドレスを格納し
ておくキャッシュ構造のテーブルである分岐先テーブル
262などで構成する。On the other hand, the instruction judging section 260 is a branch destination table which is a table having a cache structure for storing an instruction judging logic 261 for analyzing an instruction and a branch destination address.
It is composed of 262 etc.

【００３４】インストラクション・メモリ200 の入出力
バスとしてはインストラクション・アドレスバス270 と
インストラクション・データバス275 があり、インスト
ラクション・アドレスバス270 は命令フェッチ部220 の
プログラムカウンタ222 と接続され、インストラクショ
ン・データバス275 は命令フェッチ部220 のフェッチキ
ュー221 に接続されている。そして、インストラクショ
ン・メモリ200 は、インストラクション・アドレスバス
270 に入力されるプログラム・カウンタ値の内容をイン
ストラクション・データバス275 を介してフェッチキュ
ー221 に出力する。An instruction address bus 270 and an instruction data bus 275 are provided as the input / output buses of the instruction memory 200. The instruction address bus 270 is connected to the program counter 222 of the instruction fetch section 220 and is connected to the instruction data bus 275. Is connected to the fetch queue 221 of the instruction fetch unit 220. And the instruction memory 200 is the instruction address bus.
The contents of the program counter value input to 270 are output to the fetch queue 221 via the instruction data bus 275.

【００３５】フェッチキュー221 の出力は、命令デコー
ド部230 および命令判断部260 の命令判断ロジック261
に接続されている。フェッチキュー221 に格納された命
令は命令判断ロジック261 で分析し、該命令が分岐命令
であるか否かを判断する。The output of the fetch queue 221 is the instruction judgment unit 261 of the instruction decoding unit 230 and the instruction judgment unit 260.
It is connected to the. The instruction stored in the fetch queue 221 is analyzed by the instruction judgment logic 261 to judge whether or not the instruction is a branch instruction.

【００３６】図３は、一実施例の命令判断部260 の構成
図である。命令判断部260 は、命令判断ロジック261 と
分岐先テーブル262 、オアゲート300 、アンドゲート35
0 からなる。FIG. 3 is a block diagram of the instruction determination unit 260 of one embodiment. The instruction determination unit 260 includes an instruction determination logic 261, a branch destination table 262, an OR gate 300, and an AND gate 35.
It consists of 0.

【００３７】命令判断ロジック261 は、命令フェッチ部
220 のフェッチ・キュー221 と接続され、フェッチ・キ
ュー221 内の命令のうちオペコードだけ（オペランドを
除く部分）をその入力とする。命令判断ロジック261
は、該オペコードが分岐命令であるか、更に分岐命令で
あれば、無条件分岐命令か条件分岐命令かを判断する。
この判断は、無条件分岐命令および条件分岐命令のオペ
コードを命令判断ロジック261 内に持っておき、入力さ
れるオペコードとの一致を取ればよい。該オペコードを
無条件分岐命令と判断した場合にはオアゲート300 の一
方の入力を“１”、無条件分岐命令でなければ該入力を
“０”とする。一方、該オペコードを条件分岐命令と判
断した場合にはアンドゲート350 の一方の入力を
“１”、条件分岐命令でなければ該入力を“０”とす
る。The instruction determination logic 261 is an instruction fetch unit.
It is connected to the fetch queue 221 of 220, and only the opcode (a part excluding the operand) of the instructions in the fetch queue 221 is used as its input. Instruction decision logic 261
Determines whether the operation code is a branch instruction, or if it is a branch instruction, it is an unconditional branch instruction or a conditional branch instruction.
For this determination, the operation codes of the unconditional branch instruction and the conditional branch instruction are held in the instruction judgment logic 261, and the operation code to be input may be matched. If the operation code is judged to be an unconditional branch instruction, one input of the OR gate 300 is set to "1", and if it is not an unconditional branch instruction, the input is set to "0". On the other hand, if the operation code is judged to be a conditional branch instruction, one input of the AND gate 350 is set to "1", and if it is not a conditional branch instruction, the input is set to "0".

【００３８】分岐先テーブル262 は、キャッシュ構造の
テーブルであり、分岐命令が命令実行部250 で実行され
るごとに該分岐命令の分岐先アドレスをこのテーブルに
登録しておく。そして、フェッチ・キュー221 に命令が
入ると、該命令のオペランドのみを分岐先テーブル262
の入力とする。該オペランドが入力されると、該オペラ
ンドと分岐先テーブル262 内に格納されている分岐先ア
ドレスが比較され、一致したものがあると（ＨＩＴ）、
前述のアンドゲート350 のもう一方の入力を“１”と
し、一致するアドレスがない場合には該入力を“０”と
する。該アンドゲート350 の出力は、オアゲート300 の
もう一方の入力となる。オアゲート300 の出力はプログ
ラム・カウンタ222 のロード・タイミング信号（ＬＤ）
となる。The branch destination table 262 is a cache structure table, and each time a branch instruction is executed by the instruction executing section 250, the branch destination address of the branch instruction is registered in this table. Then, when an instruction enters the fetch queue 221, only the operand of the instruction is branched to the branch destination table 262.
And input. When the operand is input, the operand is compared with the branch destination address stored in the branch destination table 262, and if there is a match (HIT),
The other input of the AND gate 350 is set to "1", and when there is no matching address, the input is set to "0". The output of the AND gate 350 becomes the other input of the OR gate 300. The output of the OR gate 300 is the load timing signal (LD) of the program counter 222.
Becomes

【００３９】以上の構成により、フェッチ・キュー221
に命令が入力されたとき、該命令が無条件分岐命令であ
ればオアゲート300 の一方の入力が“１”、オアゲート
300の出力が“１”となる。ＬＤに“１”が入力された
時点で、プログラム・カウンタ222 はフェッチ・キュー
221 からオペランドである分岐先アドレスをロードす
る。With the above configuration, the fetch queue 221
When an instruction is input to the OR gate 300, if the instruction is an unconditional branch instruction, one input of the OR gate 300 is “1”,
The output of 300 becomes "1". At the time when “1” is input to LD, the program counter 222 fetches the fetch queue.
Load the branch destination address that is the operand from 221.

【００４０】一方、該命令が条件分岐命令であれば、ア
ンドゲート350 の一方の入力が“１”となり、このと
き、分岐先テーブル262 にＨＩＴする分岐先アドレスが
あればアンドゲート350 のもう一方の入力も“１”とな
り、アンドゲート350 の出力が“１”となって、オアゲ
ート300 の出力（ＬＤ）が“１”となる。これによっ
て、プログラム・カウンタ222 はフェッチ・キュー221
から分岐先アドレスをロードする。該命令が条件分岐命
令であっても、分岐先テーブル262 にＨＩＴする分岐先
アドレスがない場合には、アンドゲート350 のもう一方
の入力は“０”となり、よってアンドゲート350 の出力
が“０”、オアゲート300 の出力が“０”となる。よっ
て、プログラム・カウンタ222 はフェッチ・キュー221
から分岐先アドレスをロードすることはない。On the other hand, if the instruction is a conditional branch instruction, one input of the AND gate 350 becomes "1". At this time, if the branch destination table 262 has a branch destination address to be hit, the other one of the AND gate 350. Also becomes "1", the output of the AND gate 350 becomes "1", and the output (LD) of the OR gate 300 becomes "1". This causes the program counter 222 to fetch queue 221.
Load the branch destination address from. Even if the instruction is a conditional branch instruction, if there is no branch destination address to be hit in the branch destination table 262, the other input of the AND gate 350 becomes "0", so that the output of the AND gate 350 becomes "0". ", The output of the OR gate 300 becomes" 0 ". Therefore, the program counter 222 becomes the fetch queue 221.
The branch destination address is not loaded from.

【００４１】さらに、該命令が分岐命令でない場合に
は、条件分岐を示すアンドゲート350の一方の入力が
“０”、無条件分岐を示すオアゲート300 の一方の入力
が“０”となり、オアゲート300 の出力は“０”となる
ので、プログラム・カウンタ222はフェッチ・キュー221
から分岐先アドレスをロードすることはない。If the instruction is not a branch instruction, one input of the AND gate 350 indicating a conditional branch becomes "0" and one input of the OR gate 300 indicating an unconditional branch becomes "0", and the OR gate 300 Is 0, the program counter 222 fetches the fetch queue 221.
The branch destination address is not loaded from.

【００４２】図４は、無条件分岐命令を含むプログラム
の例である。このプログラムを図２に示した一実施例の
システム構成で処理すると、図５に示すタイムチャート
のような動作を行なう。FIG. 4 is an example of a program including an unconditional branch instruction. When this program is processed by the system configuration of the embodiment shown in FIG. 2, the operation shown in the time chart of FIG. 5 is performed.

【００４３】図４のプログラムのなかの１００１番地の
ＪＭＰＬＡＢＥＬという無条件分岐命令は、第２サイ
クルでインストラクション・データバス275 に設定さ
れ、第３サイクルでフェッチキュー221 に入る。命令判
断部260 の命令判断ロジック261 はフェッチキュー221
に入った該命令を分析して無条件分岐命令であることを
判断し、次サイクル（第４サイクル）に分岐先アドレス
２０００番地をプログラムカウンタ222 に格納し、フェ
ッチキュー221 に入るはずの次の命令ＡＤＤＡ，Ｃを
無効にする。第５サイクルからは分岐先の命令のフェッ
チ−デコード−ソースフェッチ−実行処理が行われる。
すなわち、第５サイクルでは分岐先の２０００番地の命
令であるＡＮＤＡ，Ｄがフェッチされ、第６サイクル
でデコード部230 によるデコード、第７サイクルでソー
スフェッチ部240 によるオペランドのフェッチ、第８サ
イクルで実行が行われる。これによって、第９サイクル
には２００１番地のＥＮＤ命令が実行され、処理を終了
する。An unconditional branch instruction called JMP LABEL at address 1001 in the program shown in FIG. 4 is set in the instruction data bus 275 in the second cycle and enters the fetch queue 221 in the third cycle. The instruction judgment logic 261 of the instruction judgment unit 260 is the fetch queue 221.
The entered instruction is analyzed to determine that it is an unconditional branch instruction, the branch destination address 2000 is stored in the program counter 222 in the next cycle (fourth cycle), and the next address that should enter the fetch queue 221 is stored. Invalidates the instruction ADD A, C. From the fifth cycle, fetch-decode-source fetch-execution processing of the branch destination instruction is performed.
That is, in the fifth cycle, the instruction at the branch destination address 2000, AND A, D, is fetched, in the sixth cycle, the decoding section 230 decodes, in the seventh cycle, the source fetch section 240 fetches the operand, and in the eighth cycle. Execution takes place. As a result, the END instruction at the address 2001 is executed in the ninth cycle, and the processing ends.

【００４４】従来のシステム構成（図１２）で同様の図
４のプログラムを実行した場合には、無条件分岐命令を
実行することによりプログラム・カウンタを分岐先に変
え、パイプラインの内容を無効にしていた。その結果、
無条件分岐命令の実行後のサイクルである第７サイクル
でプログラムカウンタの内容が２０００番地になり、第
８サイクルから分岐先の命令のフェッチ〜実行処理が行
われる。よって、２００１番地のＥＮＤ命令を実行して
処理を終了するのは第１２サイクルとなる。When the similar program of FIG. 4 is executed in the conventional system configuration (FIG. 12), the unconditional branch instruction is executed to change the program counter to the branch destination and invalidate the contents of the pipeline. Was there. as a result,
In the seventh cycle, which is the cycle after the execution of the unconditional branch instruction, the content of the program counter becomes address 2000, and from the eighth cycle, the fetch-execution processing of the branch destination instruction is performed. Therefore, it is the twelfth cycle that the END instruction at the address 2001 is executed and the processing is ended.

【００４５】すなわち、本実施例によると、このプログ
ラムの場合、３サイクル分分岐処理を高速化できること
になる。図６は、サブルーチンコール命令を含むプログ
ラム例、図７は、図６のプログラムを本実施例のシステ
ム構成（図２）で処理した場合のタイムチャートであ
る。That is, according to this embodiment, in the case of this program, the branch processing can be sped up for 3 cycles. FIG. 6 is a program example including a subroutine call instruction, and FIG. 7 is a time chart when the program of FIG. 6 is processed by the system configuration (FIG. 2) of this embodiment.

【００４６】このプログラム例では、１００１番地の命
令がサブルーチン・コール命令になっており、この命令
により２０００番地〜２００２番地のプログラムを実行
してた後、１００２番地に戻る。In this program example, the instruction at address 1001 is a subroutine call instruction, and after executing the program at addresses 2000 to 2002 by this instruction, the program returns to address 1002.

【００４７】本実施例のシステム構成（図２）で処理し
た場合には、図７に示すように、第２サイクルでＣＡＬ
ＬＬＡＢＥＬ命令がインストラクション・データバス
275に設定され、第３サイクルでフェッチキュー221 に
格納される。この時点で命令判断部260 の命令判断ロジ
ック261 が該命令がサブルーチン・コール命令であるこ
とを判断し、第４サイクルでフェッチキュー221 に格納
されるはずの１００２番地の命令の内容を無効にし、そ
の時点のプログラム・カウンタ222 の内容（１００２番
地）をＰＣスタック223 にプッシュし、分岐先アドレス
（２０００番地）をプログラム・カウンタ222 に格納す
る。そして、第５サイクルからはサブルーチンの処理に
移る。When processing is performed with the system configuration (FIG. 2) of this embodiment, as shown in FIG. 7, CAL is performed in the second cycle.
L LABEL instruction is an instruction data bus
275 and stored in the fetch queue 221 in the third cycle. At this point, the instruction determination logic 261 of the instruction determination unit 260 determines that the instruction is a subroutine call instruction, and invalidates the content of the instruction at address 1002 that should be stored in the fetch queue 221 in the fourth cycle, The contents (address 1002) of the program counter 222 at that time are pushed to the PC stack 223, and the branch destination address (address 2000) is stored in the program counter 222. Then, from the fifth cycle, the processing moves to the subroutine.

【００４８】すなわち、第５サイクルでは２０００番地
（ＡＮＤＡ，Ｃ）のフェッチが行われ、第６サイクル
では２００１番地（ＭＵＬＡ，Ｄ）のフェッチと２０
００番地（ＡＮＤＡ，Ｃ）のデコードが行われ、第７
サイクルでは２００２番地（ＲＴＳ）のフェッチと２０
０１番地（ＭＵＬＡ，Ｄ）のデコード、２０００番地
（ＡＮＤＡ，Ｃ）のソース・フェッチが行われる。That is, the fetch of address 2000 (AND A, C) is performed in the fifth cycle, and the fetch of address 2001 (MUL A, D) and 20 are performed in the sixth cycle.
No. 00 (AND A, C) is decoded and the 7th
In the cycle, fetching at address 2002 (RTS) and 20
Decoding of address 01 (MUL A, D) and source fetch of address 2000 (AND A, C) are performed.

【００４９】リターンサブルーチン命令（ＲＴＳ）がフ
ェッチキュー221 に格納されると、命令判断部260 の命
令判断ロジック261 がリターン命令であることを判断
し、次サイクル（第８サイクル）でＰＣスタック223 か
らプログラム・カウンタ222 にポップを行ない、フェッ
チキュー221 の内容を無効にする。これによって、プロ
グラム・カウンタ222 は１００２番地になる。さらに、
第８サイクルでは、２００２番地（ＲＴＳ）のデコード
と、２００１番地（ＭＵＬＡ，Ｄ）のソース・フェッ
チ、２０００番地（ＡＮＤＡ，Ｃ）の実行が行われ
る。When the return subroutine instruction (RTS) is stored in the fetch queue 221, the instruction decision logic 261 of the instruction decision unit 260 decides that it is a return instruction, and the PC stack 223 reads it from the next cycle (eighth cycle). Pop the program counter 222 and invalidate the contents of the fetch queue 221. As a result, the program counter 222 becomes the address 1002. further,
In the eighth cycle, decoding of address 2002 (RTS), source fetch of address 2001 (MUL A, D), and execution of address 2000 (AND A, C) are performed.

【００５０】第９サイクルからは、１００２番地からプ
ログラムのフェッチ〜実行の処理が行われる。以上の処
理により、図６のプログラムは、第１４サイクルで完了
する（第１４サイクルに１００３番地のＥＮＤ命令が実
行される）。From the ninth cycle, the program fetch-execution processing is performed from address 1002. With the above processing, the program of FIG. 6 is completed in the 14th cycle (the END instruction at address 1003 is executed in the 14th cycle).

【００５１】一方、従来のシステム構成（図１２）で図
６のプログラムを実行した場合には、１００１番地のＣ
ＡＬＬＬＡＢＥＬ命令を実行した次のサイクル（第７
サイクル）でプログラム・カウンタ222 が２０００番地
にセットされ、次サイクル（第８サイクルから２０００
番地以降のフェッチ〜実行が行われる。よって２０００
番地のＡＮＤＡ，Ｃ命令が実行されるのは第１１サイ
クルとなり、２００２番地のＲＴＳ命令は第１３サイク
ルとなる。ＲＴＳ命令が実行されると、次の第１４サイ
クルでプログラム・カウンタ222 が１００２番地にセッ
トされ、第１５サイクルから１００２番地以降のフェッ
チ〜実行が行われる。よって、従来構成では、図６のプ
ログラムは第１９サイクルで完了する（第１９サイクル
に１００３番地のＥＮＤ命令が実行される）。On the other hand, when the program of FIG. 6 is executed with the conventional system configuration (FIG. 12), C at address 1001
Next cycle after executing ALL LABEL instruction (7th cycle)
In the cycle, the program counter 222 is set to the address 2000, and the next cycle (from the 8th cycle to 2000)
Fetching to execution after the address is performed. Therefore 2000
The AND A, C instruction at the address is executed in the 11th cycle, and the RTS instruction at the address 2002 is in the 13th cycle. When the RTS instruction is executed, the program counter 222 is set to the address 1002 in the next 14th cycle, and the fetch-execution from the 15th cycle to the address 1002 and thereafter is performed. Therefore, in the conventional configuration, the program of FIG. 6 is completed in the 19th cycle (the END instruction at address 1003 is executed in the 19th cycle).

【００５２】これにより、本実施例の構成（図２）で
は、従来構成（図１２）よりも５サイクル分高速化され
ることになる。図８は、条件付きの分岐命令を含むプロ
グラムの例である。そして、このプログラムを本実施例
のシステム構成（図２）で実行したときのタイムチャー
トを図９に示す。As a result, the configuration of this embodiment (FIG. 2) is faster than the conventional configuration (FIG. 12) by 5 cycles. FIG. 8 is an example of a program including a conditional branch instruction. FIG. 9 shows a time chart when this program is executed by the system configuration (FIG. 2) of this embodiment.

【００５３】図８のプログラムは、前もって実行される
イニシャル・ルーチンと、イニシャル・ルーチンが実行
された後に実行されるメイン・ルーチンからなる。イニ
シャル・ルーチンを実行すると、００００番地のＭＯＶ
ＢＡＴ，ＬＯＯＰという分岐命令実行時に分岐先テー
ブル262 に分岐先アドレスＬＯＯＰが格納される。そし
て、０００１番地のＪＭＰＭＡＩＮという無条件分岐
命令でメイン・ルーチンに飛ぶ。The program shown in FIG. 8 is composed of an initial routine that is executed in advance and a main routine that is executed after the initial routine is executed. MOV at address 0000 when the initial routine is executed
The branch destination address LOOP is stored in the branch destination table 262 when a branch instruction of BAT, LOOP is executed. Then, an unconditional branch instruction called JMP MAIN at address 0001 jumps to the main routine.

【００５４】メイン・ルーチンは、１０００番地の命令
でＣに２を代入し、１００１番地〜１００２番地をＣの
値が０でない場合にループするものである。ここで、イ
ニシャル・ルーチンは前もって実行されているので、分
岐先テーブル262 にはＬＯＯＰのアドレスが格納されて
いるものとし、この前提のもとで、メイン・ルーチンが
実行される。The main routine substitutes 2 for C by the instruction at address 1000, and loops addresses 1001 to 1002 when the value of C is not 0. Here, since the initial routine has been executed in advance, it is assumed that the branch destination table 262 stores the address of LOOP, and the main routine is executed under this premise.

【００５５】図９のタイムチャートで説明すると、条件
分岐命令（ＪＮＥ１００１）が第４サイクルでフェッ
チキュー221 に格納されると、命令判断ロジック261 は
分岐先テーブル262 を検索し、分岐先アドレスがそこに
格納されていることを確認する。格納されているという
ことは、分岐する確率が高いということであり、命令判
断ロジック261 は第５サイクルでプログラムカウンタ22
2 に分岐先の１００１番地を格納するとともに、次にフ
ェッチされるはずのフェッチ・キューの内容を無効にす
る。第６サイクル以降では、分岐先のフェッチ〜実行が
行なわれる。Explaining with the time chart of FIG. 9, when the conditional branch instruction (JNE 1001) is stored in the fetch queue 221 in the fourth cycle, the instruction judgment logic 261 searches the branch destination table 262 and the branch destination address is Make sure it is stored there. The fact that it is stored means that there is a high probability of branching, and the instruction judgment logic 261 has the program counter 22 in the fifth cycle.
The branch destination address 1001 is stored in 2, and the contents of the fetch queue that should be fetched next are invalidated. In the sixth and subsequent cycles, the fetch-execution of the branch destination is performed.

【００５６】一方、ＰＣキュー224 はプログラム・カウ
ンタ222 が書き変わるタイミングでキューイングを行な
う。例えば、プログラム・カウンタ値‘１０００’が
‘１００１’に書き変わる時点でキューイングされ、Ｐ
Ｃの最上段に‘１０００’がプッシュされる。以降、カ
ウンタ値が書き変わる時点でＰＣキュー224 がプッシュ
されるので、第４サイクルで‘１０００’がＰＣキュー
224 の最下段にキューイングされることになる。第５サ
イクルでは、第４サイクルにプログラム・カウンタ222
にあった‘１００３’がＰＣキュー224 の最上段に入
り、第７サイクルで最下段に入る。On the other hand, the PC queue 224 performs queuing at the timing when the program counter 222 is rewritten. For example, when the program counter value "1000" is rewritten to "1001", it is queued and P
'1000' is pushed to the top of C. After that, since the PC queue 224 is pushed when the counter value is rewritten, '1000' is the PC queue in the fourth cycle.
It will be queued at the bottom of 224. In the fifth cycle, the program counter 222 in the fourth cycle
The matching '1003' enters the top stage of the PC queue 224, and enters the bottom stage in the seventh cycle.

【００５７】第７サイクルでは、条件分岐命令（ＪＮＥ
ＬＯＯＰ）が実行される。Ｃの値は第５サイクルで
‘２’に設定され、第６サイクルのＤＥＣＣで１減っ
て第７サイクルには‘１’になっている。そこで、第７
サイクルの条件分岐命令は条件と一致し（Ｃ≠０）、１
００１番地にループすることになる。この１００１番地
は、第４サイクルの条件分岐命令フェッチ時に分岐アド
レス・テーブル262 を参照してプログラム・カウンタ22
2 にセットされ、第６サイクルでフェッチ、第７サイク
ルでデコード、第８サイクルでソース・フェッチされて
いる。よって、第９サイクルで１００１番地の実行を行
なうことが可能である。実行の結果、Ｃの値は‘０’に
なる。また、第１０サイクルでは再び条件分岐命令（Ｊ
ＮＥＬＯＯＰ）が実行される。In the seventh cycle, the conditional branch instruction (JNE
LOOP) is executed. The value of C is set to '2' in the fifth cycle, decremented by 1 in DEC C in the sixth cycle, and becomes '1' in the seventh cycle. Therefore, the seventh
The conditional branch instruction of the cycle matches the condition (C ≠ 0), 1
It will loop to address 001. This address 1001 refers to the branch address table 262 at the time of the conditional branch instruction fetch in the fourth cycle, and the program counter 22
It is set to 2, fetched in the 6th cycle, decoded in the 7th cycle, and source fetched in the 8th cycle. Therefore, it is possible to execute the address 1001 in the ninth cycle. As a result of execution, the value of C becomes "0". In the 10th cycle, the conditional branch instruction (J
NE LOOP) is executed.

【００５８】一方、第７サイクルではフェッチ・キュー
221 によってＪＮＥＬＯＯＰ命令がフェッチされてい
る。第４サイクルの場合と同様に、ＬＯＯＰ（１００１
番地）が分岐アドレステーブル262 に格納されているか
否かを判断し、格納されているので、第８サイクルでは
プログラム・カウンタ222 を１００１番地にセットし、
フェッチ・キュー221 の内容は無効にする。また、ＰＣ
キュー224 には第７サイクルでのプログラム・カウンタ
222 の内容である１００３番地をプッシュする。On the other hand, in the seventh cycle, the fetch queue
A JNE LOOP instruction has been fetched by 221. Similar to the case of the fourth cycle, LOOP (1001
Address) is stored in the branch address table 262. Since it is stored, the program counter 222 is set to the address 1001 in the eighth cycle.
Invalidates the contents of fetch queue 221. Also, PC
Queue 224 has a program counter in the 7th cycle
Push the address 1003 which is the content of 222.

【００５９】第１０サイクルで再び条件分岐命令（ＪＮ
ＥＬＯＯＰ）が実行されると、このとき、条件は不成
立である（Ｃ＝０）。この場合、条件一致（第７サイク
ル）の場合とは異なり、ループはしないで次の番地であ
る１００３番地を実行することになる。そのため、第１
１サイクルではＰＣキュー224 の最下段から戻り番地で
ある１００３番地をプログラム・カウンタ222 にロード
し、フェッチ・キュー221 の内容、命令デコード部230
の内容、ソースフェッチ部240 の内容を無効にする。そ
して、次の第１２サイクルから、１００３番地以降のフ
ェッチ〜実行処理を開始する。これによって、１００３
番地は第１５サイクルで命令実行部250により実行可能
になる。In the 10th cycle, the conditional branch instruction (JN
When E LOOP) is executed, the condition is not satisfied at this time (C = 0). In this case, unlike the case of the condition match (seventh cycle), the next address 1003 is executed without looping. Therefore, the first
In one cycle, the return address 1003 from the bottom of the PC queue 224 is loaded into the program counter 222, and the contents of the fetch queue 221 and the instruction decoding unit 230 are loaded.
And the contents of the source fetch unit 240 are invalidated. Then, from the next twelfth cycle, the fetch-execution processing from the address 1003 onward is started. By this, 1003
The address can be executed by the instruction execution unit 250 in the 15th cycle.

【００６０】以上のように、条件付き分岐命令の場合、
分岐アドレステーブル262 に前もって分岐する確率の高
い分岐先アドレスを格納しておくことにより、条件分付
き分岐命令のフェッチの後、次に分岐先アドレスをフェ
ッチできる。よって、条件が成立する場合には、条件付
き分岐命令実行サイクルの２サイクル後には分岐先アド
レスの命令を実行可能になる。一方、条件が不成立の場
合には、次サイクルでＰＣキュー224 の最下段をプログ
ラム・カウンタ222 にロードし、続いてフェッチ〜実行
処理する。よって、条件不成立の場合は、条件付き分岐
命令実行サイクルの５サイクル後には次番地の実行が可
能になる。As described above, in the case of a conditional branch instruction,
By storing the branch destination address having a high probability of branching in advance in the branch address table 262, the branch destination address can be fetched next after the conditional branch instruction is fetched. Therefore, when the condition is satisfied, the instruction at the branch destination address can be executed two cycles after the conditional branch instruction execution cycle. On the other hand, if the condition is not satisfied, the lowermost stage of the PC queue 224 is loaded into the program counter 222 in the next cycle, and then fetch-execute processing is performed. Therefore, when the condition is not satisfied, the next address can be executed 5 cycles after the conditional branch instruction execution cycle.

【００６１】従来のシステム構成（図１２）で本プログ
ラム（図８）を実行した場合には、条件分岐命令実行
後、条件不成立の場合は次サイクルで次アドレスの実行
が可能だが、条件が成立する場合には５サイクル後に分
岐先アドレスの実行が可能になる。分岐先アドレスに分
岐する確率が高い場合には、条件が成立する都度５サイ
クルが無駄になり、本実施例は１回につき３サイクル分
高速化されることになる。When this program (FIG. 8) is executed with the conventional system configuration (FIG. 12), after execution of the conditional branch instruction, if the condition is not satisfied, the next address can be executed in the next cycle, but the condition is satisfied. If so, the branch destination address can be executed after 5 cycles. When the probability of branching to the branch destination address is high, 5 cycles are wasted each time the condition is satisfied, and this embodiment speeds up by 3 cycles each time.

【００６２】図１０は、条件分岐命令が連続するプログ
ラムの例であり、図１１は、図１０のプログラムを本実
施例のシステム構成（図２）で実行した場合のタイムチ
ャートである。FIG. 10 is an example of a program in which conditional branch instructions are continuous, and FIG. 11 is a time chart when the program of FIG. 10 is executed by the system configuration of this embodiment (FIG. 2).

【００６３】この場合、１００１番地（ＪＮＥＬ１）
をフェッチすると（第３サイクル）、命令判断部260 の
処理により次サイクル（第４サイクル）でプログラム・
カウンタ222 を分岐先アドレスであるＬ１（２０００番
地）にセットしたうえ、フェッチ・キュー221 の内容を
無効にする。また、第４サイクルでは、ＰＣキュー224
の最上段に１００２番地が格納される。In this case, address 1001 (JNE L1)
Is fetched (third cycle), the instruction decision unit 260 processes the program in the next cycle (fourth cycle).
The counter 222 is set to the branch destination address L1 (address 2000), and the contents of the fetch queue 221 are invalidated. In the fourth cycle, the PC queue 224
Address 1002 is stored in the uppermost row of.

【００６４】第５サイクルでは２０００番地（ＪＺＥ
Ｌ２）がフェッチされる。すると、命令判断部260 の処
理により次サイクル（第６サイクル）でプログラム・カ
ウンタを分岐先アドレスであるＬ２（３０００番地）に
セットしたうえ、フェッチ・キュー221 の内容を無効に
する。In the fifth cycle, address 2000 (JZE
L2) is fetched. Then, in the next cycle (sixth cycle), the instruction counter 260 sets the program counter to the branch destination address L2 (address 3000) and invalidates the contents of the fetch queue 221.

【００６５】一方、第６サイクルでは１００１番地（Ｊ
ＮＥＬ１）が実行される。この場合、Ａ＝０となるの
で条件は不一致となる。よって、Ｌ１（２０００番地に
は飛ばずに、１００２番地を次に実行することになる。
条件が不一致の場合には、次サイクル（第７サイクル）
でＰＣキュー224 の最下段の内容をプログラム・カウン
タ222 にロードしたうえ、パイプラインの内容（フェッ
チ、デコード、ソースフェッチ）を無効にする。その結
果、プログラム・カウンタの内容は１００２番地にな
り、第８サイクルから１００２番地以降のフェッチ〜実
行が行われる。On the other hand, in the sixth cycle, address 1001 (J
NE L1) is executed. In this case, since A = 0, the conditions do not match. Therefore, L1 (address 1002 is executed next without jumping to address 2000).
If the conditions do not match, the next cycle (7th cycle)
Then, the contents of the bottom of the PC queue 224 are loaded into the program counter 222, and the contents of the pipeline (fetch, decode, source fetch) are invalidated. As a result, the content of the program counter becomes the address 1002, and the fetch to execution of the address 1002 and subsequent addresses are performed from the eighth cycle.

【００６６】ここで、ＰＣキュー224 が単に一時退避用
のレジスタであったとすると、二つ目の条件分岐命令を
フェッチ後のサイクル６でレジスタの内容が更新されて
しまい、最初の条件分岐命令が不一致であった場合に、
元の処理アドレス（１００２番地）に戻ることができな
くなってしまう。よって、ＰＣキュー224 をパイプライ
ン段数４段より１段少ない３段に設定しておくことが意
味をもつ。If the PC queue 224 is simply a temporary save register, the contents of the register are updated in cycle 6 after fetching the second conditional branch instruction, and the first conditional branch instruction is If there is a mismatch,
It becomes impossible to return to the original processing address (address 1002). Therefore, it is significant to set the PC queue 224 to three stages, which is one stage less than the pipeline stage number of four stages.

【００６７】[0067]

【発明の効果】本発明によって、命令がフェッチキュー
に格納された時点で、命令判断部により命令解析を行な
うことにより、無条件分岐命令に関しては、分岐処理が
従来方式に比べ高速化できる。また、条件分岐命令に関
しても、分岐する確率の高い分岐命令に関して分岐先ア
ドレスを分岐先テーブルに登録しておくことにより、プ
ログラム実行全体でると高速化を図ることが可能にな
る。また、パイプライン段数より１段少ない段数のＰＣ
キューを持たせることにより、条件分岐命令が不一致の
場合でも元の処理アドレスに速やかに戻ることが可能で
ある。As described above, according to the present invention, when an instruction is stored in the fetch queue, the instruction judging section analyzes the instruction, thereby making it possible to accelerate the branch processing of the unconditional branch instruction as compared with the conventional method. Further, regarding the conditional branch instruction as well, by registering the branch destination address in the branch destination table for the branch instruction having a high probability of branching, it is possible to speed up the entire program execution. In addition, the number of stages is one less than the number of pipeline stages
By providing a queue, it is possible to quickly return to the original processing address even if the conditional branch instructions do not match.

[Brief description of drawings]

【図１】本発明のブロック図である。FIG. 1 is a block diagram of the present invention.

【図２】一実施例のシステム構成図である。FIG. 2 is a system configuration diagram of an embodiment.

【図３】一実施例の命令判断部の構成図である。FIG. 3 is a configuration diagram of an instruction determination unit according to an embodiment.

【図４】一実施例の無条件分岐命令を含むプログラム例
である。FIG. 4 is a program example including an unconditional branch instruction according to an embodiment.

【図５】一実施例の無条件分岐命令処理のタイムチャー
トである。FIG. 5 is a time chart of unconditional branch instruction processing according to an embodiment.

【図６】一実施例の無条件サブルーチンコール命令を含
むプログラム例である。FIG. 6 is a program example including an unconditional subroutine call instruction according to an embodiment.

【図７】一実施例の無条件サブルーチンコール命令処理
のタイムチャートである。FIG. 7 is a time chart of unconditional subroutine call instruction processing according to an embodiment.

【図８】一実施例の条件分岐命令を含むプログラム例で
ある。FIG. 8 is a program example including a conditional branch instruction according to an embodiment.

【図９】一実施例の条件分岐命令処理のタイムチャート
である。FIG. 9 is a time chart of conditional branch instruction processing according to an embodiment.

【図１０】一実施例の条件分岐命令が連続するプログラ
ム例である。FIG. 10 is an example of a program in which conditional branch instructions of one embodiment are consecutive.

【図１１】一実施例の条件分岐命令が連続する場合のタ
イムチャートである。FIG. 11 is a time chart when conditional branch instructions of one embodiment are consecutive.

【図１２】従来の中央処理装置のブロック図である。FIG. 12 is a block diagram of a conventional central processing unit.

【図１３】従来システムの動作を説明するプログラム例
である。FIG. 13 is a program example for explaining the operation of the conventional system.

【図１４】従来システムの動作を説明するタイムチャー
トである。FIG. 14 is a time chart explaining the operation of the conventional system.

[Explanation of symbols]

１中央演算処理装置２パイプライン処理部３フェッチ４実行５プログラム・カウンタ６命令判断手段７プログラム・カウンタ・スタック手段８プログラム・カウンタ・キュー格納手段９分岐アドレス登録手段１０命令 1 Central Processing Unit 2 Pipeline Processing Unit 3 Fetch 4 Execution 5 Program Counter 6 Instruction Judging Means 7 Program Counter Stacking Means 8 Program Counter Queue Storing Means 9 Branch Address Registering Means 10 Instructions

Claims

[Claims]

1. A central processing unit (1) comprising a pipeline processing section (2) for pipeline processing fetch (3) to execution (4) of an instruction (10) and a program counter (5), comprising: The contents of the executed instruction (10) are analyzed, and if the instruction (10) is an unconditional branch instruction, the program
Instruction judging means (6) for rewriting the counter (5) to the branch destination address of the unconditional branch instruction, and if the unconditional branch instruction is judged to be a subroutine call instruction, the instruction Before the program counter (5) is rewritten by the judging means (6), the contents of the program counter (5) are pushed onto the stack to store the return address, and the instruction judging means (6) stores If the conditional branch instruction is determined to be a return / subroutine instruction, the stack is hopped,
A high-speed branch processing system of a central processing unit, comprising: a program counter stack means (7) for setting a return address of a subroutine in a program counter (5).

2. A high-speed branch high-speed processing method for a central processing unit according to claim 1, wherein a branch address registering means (9) for registering a branch address with a high probability of branching, and the number of stages of queues are provided. It has the same bit width as the program counter (5) at (the number of pipeline stages of the central processing unit-1), and the program counter (5) is queued at the rewriting timing of the program counter (5) to The instruction determining means (6) further comprises a program counter queue storing means (8) for saving the contents of the counter (5), and the fetched instruction (1
0) is a conditional branch instruction, and if the branch destination address is registered in the branch address registering means (9), the program counter (5) is rewritten to the branch destination address and the condition of the conditional branch instruction is rewritten. The branch destination address is executed immediately after the condition is satisfied, and if the condition of the conditional branch instruction is not satisfied, the address to be processed next, which is the lowermost content of the program counter queue storing means (8), is programmed. A high-speed branch processing method of the central processing unit, which is characterized in that it is loaded into the counter (5).