JPH0756755A - Method for controlling processor

Info

Publication number: JPH0756755A
Application number: JP5201469A
Authority: JP
Inventors: Masato Motomura; 真人本村
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1993-08-13
Filing date: 1993-08-13
Publication date: 1995-03-03
Anticipated expiration: 2011-08-07
Also published as: JP2522176B2

Abstract

PURPOSE: To provide an effective prefetch technique, using a simple hardware configuration, for a processor that employs a continuation buffer or the like, by prefetching into a high-speed buffer memory in advance the instruction sequence designated by an instruction pointer stored in the continuation buffer. CONSTITUTION: A prefetch device 150 reads ahead, from the continuation buffer 120, a continuation that will be executed in the future by using continuation read-ahead information 123, and uses that continuation to prefetch the necessary instructions or data. The instruction pointer 227-1 of the continuation read ahead via the information 123 is transferred to a high-speed buffer memory monitor unit 211 in an instruction prefetch unit 210. Using high-speed buffer memory inspection information 152, the monitor unit 211 checks whether the buffer memory line containing the instruction designated by the pointer 227-1 is present in the high-speed buffer memory. If the line is not present, a prefetch request issuing unit 212 issues a prefetch request for that buffer memory line.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a processor that forms the core of a computer system, and more particularly to a high-performance microprocessor capable of processing a single program in parallel.

[0002]

[Prior Art] In a processor, and in a microprocessor in particular, it is common to use a high-speed buffer memory, typified by a cache memory, to hold both the instructions and the data needed for computation, and to perform the required processing by accessing that buffer at high speed.

[0003] When the desired instruction or data is not present in the high-speed buffer memory (hereinafter referred to as a buffer memory miss), it must be transferred from the main memory or the like, and the long time this transfer takes is one of the factors that degrade processor performance. The high-speed buffer memory consists of a plurality of buffer memory lines, and the transfer caused by a buffer memory miss is performed in units of the requested buffer memory line.

[0004] A conventional technique called prefetching addresses the performance loss caused by the transfer overhead of a buffer memory miss. Prefetching means loading the instructions or data that will be needed into the high-speed buffer memory in advance.

[0005] The prior art relating to prefetching is summarized below.

[0006] [1] A technique in which the necessary data is loaded into the high-speed buffer memory in advance, explicitly and under software control, by means of a prefetch instruction.

[0007] The compiler uses this instruction to arrange the program code so that the data that will be needed is brought into the high-speed buffer memory in advance.

[0008] Examples that actually adopt this technique include the Alpha architecture announced by Digital Equipment Corporation and the MC88110 architecture announced by Motorola.

[0009] [2] A technique in which a prefetch device is built in hardware and prefetching is carried out by that device.

[0010] One example is what is called a stream buffer in Norman P. Jouppi, "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers", The 17th International Symposium on Computer Architecture, IEEE Computer Society Press, 1990, pp. 364-373.

[0011] This is a variant of the long-known prefetch buffer: when a given instruction or data item is accessed, the instructions or data that follow it are accessed sequentially and stored in the stream buffer. If the program behaves sequentially, this approach can greatly increase the probability that the desired instruction or data is present in the high-speed buffer memory.

[0012] Another piece of prior art, for the element processors that make up a parallel processing computer, is the multithreaded architecture.

[0013] In a multithreaded architecture, a program is compiled into a plurality of threads that can be executed in parallel. Here, a thread is a sequence of instructions that is executed sequentially. The threads are distributed to the element processors that make up the parallel processing computer, and each processor executes its share of the computation. The basic idea of the multithreaded architecture is to assign several threads to one processor and, when a wait occurs (for inter-processor communication, for example), to switch execution to another thread so that the waiting time is put to use.

[0014] A representative example of a multithreaded architecture is R. S. Nikhil, G. M. Papadopoulos, Arvind, "*T: A Multithreaded Massively Parallel Architecture", The 19th International Symposium on Computer Architecture, IEEE Computer Society Press, 1992, pp. 156-167.

[0015] In a multithreaded architecture, the processor needs to know which threads are executable. For this purpose the processor usually contains a buffer called a continuation buffer. Here, a continuation is a pair consisting of an instruction pointer, which indicates the address of the first instruction of a thread, and a frame pointer, which designates the start address of the frame, that is, the data area used by the thread.

[0016] When execution of a thread starts, the first instruction of the thread is fetched according to the value of the instruction pointer. Subsequent instructions in the thread are fetched in the usual way, as directed by the program counter. The data needed to execute the instructions is fetched into registers from the frame designated by the frame pointer.

[0017]

[Problems to Be Solved by the Invention] The problems of the conventional prefetch techniques described above that the present invention sets out to solve are summarized in the following two items.

[0018] [1] It is difficult to insert prefetch instructions effectively into the instruction stream at compile time; issuing a prefetch instruction consumes at least one cycle; and the technique is effective only for prefetching data and cannot be used for prefetching instructions.

[0019] [2] Because the technique assumes that the program behaves sequentially, it often prefetches instructions or data that are never used. This is a problem not only because it raises the miss rate of the high-speed buffer memory but also because it wastes bandwidth between the main memory and the high-speed buffer memory.

[0020] An object of the present invention is to provide a method of controlling a processor that realizes a prefetch technique exploiting the characteristics of a processor which, like a multithreaded architecture, has a continuation buffer and performs computation by selecting one of the plurality of executable continuations stored in that buffer.

[0021]

[Means for Solving the Problems] The means of the present invention for solving the above problems are summarized in the following six items.

[0022] [1] A method of controlling a processor comprising: a continuation buffer that stores an instruction pointer designating the address of the first instruction of an instruction sequence, which is a series of executable instructions, and a frame pointer designating a data storage area that stores data corresponding to the instruction sequence; a program counter that designates the address of the instruction currently being executed; and a high-speed buffer memory that has a plurality of buffer memory lines and stores instructions and data; wherein, when the instruction pointer stored in the continuation buffer and the frame pointer corresponding to that instruction pointer are taken out, the instruction pointer is stored in the program counter, the instruction pointer stored in the program counter is taken out, the address at which the first instruction of the instruction sequence designated by the taken-out instruction pointer is stored is determined, and the arithmetic processing of the instruction sequence starting from the first instruction is executed sequentially on the data storage area designated by the frame pointer, the instruction sequence designated by the instruction pointer stored in the continuation buffer is prefetched into the high-speed buffer memory in advance.

[0023] [2] The method of controlling a processor according to claim 1, wherein the length of the instruction sequence is limited to a length that can be stored in one of the buffer memory lines constituting the high-speed buffer memory.

[0024] [3] The method of controlling a processor according to claim 1, wherein instruction sequence length information indicating the length of the instruction sequence is stored in the continuation buffer together with the instruction pointer, and all the buffer memory lines containing the required instruction sequence are prefetched into the high-speed buffer memory in advance by using the instruction pointer stored in the continuation buffer and the instruction sequence length information corresponding to that instruction pointer.

[0025] [4] The method of controlling a processor according to claim 1, wherein the required data storage area is prefetched into the high-speed buffer memory in advance by using the frame pointer stored in the continuation buffer.

[0026] [5] The method of controlling a processor according to claim 4, wherein the size of the data area is limited so that the data area can be stored within one of the buffer memory lines constituting the high-speed buffer memory.

[0027] [6] The method of controlling a processor according to claim 1, wherein frame size information indicating the size of the data area is stored in the continuation buffer together with the frame pointer, and the required data area is prefetched into the high-speed buffer memory in advance by using the frame pointer stored in the continuation buffer and the frame size information corresponding to that frame pointer.

[0028]

[Embodiments] FIG. 1 is a block diagram showing an embodiment of a processor to which the control method according to the present invention is applied.

[0029] In FIG. 1, the processor 110 comprises a continuation buffer 120, a program counter 130, a high-speed buffer memory 140, a prefetch device 150, an arithmetic execution unit 160, and a register 170.

[0030] The control method of the processor 110 is outlined below with reference to FIG. 1. The continuation buffer 120 stores a plurality of continuations, each consisting of a pair of an instruction pointer, which indicates the first instruction address of an executable thread, and a frame pointer, which indicates the storage area of the data corresponding to that instruction pointer. The continuations stored in the continuation buffer 120 are supplied either from another processor via continuation input information 111 or from the arithmetic execution unit 160 via continuation storage information 161.

[0031] When the arithmetic execution unit 160 becomes able to execute a new thread, one of the continuations stored in the continuation buffer 120 is selected, and the instruction pointer contained in the selected continuation is read out to the program counter 130 via instruction pointer read information 121. According to the value of the instruction pointer read out, instruction address notification information 131 gives the high-speed buffer memory 140 the address of the instruction to be executed, and the designated instruction is read out to the arithmetic execution unit 160 via instruction transfer information 141 and executed.

[0032] The frame pointer contained in the continuation selected from the continuation buffer 120 is read out to the arithmetic execution unit 160 via frame pointer read information 122. When the thread that starts at the instruction designated by the instruction pointer contains an instruction that accesses memory, the address of the data concerned is calculated using the frame pointer and is sent to the high-speed buffer memory 140 via data address notification information 162. The data is passed between the register 170 and the high-speed buffer memory 140 via data transfer information 142.

[0033] When a required instruction or data item is not present in the high-speed buffer memory 140, it is transferred from the main memory 180 to the high-speed buffer memory 140 via data transfer information 181. The high-speed buffer memory consists of a plurality of buffer memory lines, and transfers between the main memory 180 and the high-speed buffer memory 140 are performed in units of these buffer memory lines. The prefetch device 150 reads a continuation stored in the continuation buffer via continuation read-ahead information 123 and, using the instruction pointer or the frame pointer of that continuation respectively, requests the main memory 180 to prefetch instructions or data via prefetch request information 151.

[0034] FIG. 2 is a detailed block diagram of the prefetch device 150 and the continuation buffer 120, used to explain a first embodiment of the control method for the processor 110 of FIG. 1. In FIG. 2, the prefetch device 150 of the present invention comprises an instruction prefetch unit 210, a data prefetch unit 220, and an arbitrator 230.

[0035] The instruction prefetch unit 210 and the data prefetch unit 220 consist of high-speed buffer memory monitor units 211 and 221 and prefetch request issuing units 212 and 222, respectively. The continuation buffer 120 stores n continuations (n is a positive integer), from continuation (1) 226-1 to continuation (n) 226-n. Each continuation consists of an instruction pointer 227-1 and a frame pointer 227-2.

[0036] One way to construct the continuation buffer 120 is as a FIFO (first-in, first-out) queue. That is, a continuation supplied via the continuation input information 111 or the continuation storage information 161 enters at the tail of the queue and advances step by step toward the head. When a continuation is taken out for thread execution, it is read from continuation (1) 226-1 at the head of the queue via the instruction pointer read information 121 and the frame pointer read information 122.
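The FIFO arrangement described above can be modeled in software roughly as follows. This is only an illustrative sketch, not the hardware of the embodiment: the names `Continuation`, `ContinuationBuffer`, `enqueue`, `dequeue`, and `peek_ahead` are hypothetical, and a fixed capacity is assumed.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Continuation:
    instruction_pointer: int  # address of the thread's first instruction
    frame_pointer: int        # start address of the thread's frame


class ContinuationBuffer:
    """Hypothetical software model of a FIFO continuation buffer."""

    def __init__(self, capacity):
        self.capacity = capacity  # assumed fixed number of entries (n)
        self._queue = deque()

    def enqueue(self, continuation):
        # New continuations enter at the tail of the queue.
        if len(self._queue) >= self.capacity:
            raise OverflowError("continuation buffer is full")
        self._queue.append(continuation)

    def dequeue(self):
        # The continuation at the head is taken out for execution.
        return self._queue.popleft()

    def peek_ahead(self, m):
        # Read ahead up to m continuations from the head without removing
        # them, as a prefetch device might do.
        return list(self._queue)[:m]
```

Because the queue order fixes which thread runs next, reading ahead from the head tells a prefetcher exactly which instruction pointer and frame pointer will be needed.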

[0037] A first embodiment of the processor control method according to the present invention is described below with reference to FIG. 2.

[0038] Using the continuation read-ahead information 123, the prefetch device 150 reads ahead from the continuation buffer 120 a continuation that will be executed in the future, and uses it to prefetch the necessary instructions or data.

[0039] Examples of how to read ahead from the continuation buffer include reading a continuation when it reaches the head of the queue, and reading ahead the continuations up to the m-th from the head (m is a positive integer not greater than n). The instruction pointer 227-1 of a continuation read ahead via the continuation read-ahead information 123 is transferred to the high-speed buffer memory monitor unit 211 in the instruction prefetch unit 210.

[0040] Using high-speed buffer memory inspection information 152, the high-speed buffer memory monitor unit 211 checks whether the buffer memory line containing the instruction designated by the instruction pointer 227-1 is present in the high-speed buffer memory. If it is not, the prefetch request issuing unit 212 issues a prefetch request for that buffer memory line. The frame pointer 227-2 of the continuation read ahead is transferred to the high-speed buffer memory monitor unit 221 in the data prefetch unit.

[0041] Using the high-speed buffer memory inspection information 152, the high-speed buffer memory monitor unit 221 checks whether the buffer memory line containing the start address of the frame designated by the frame pointer 227-2 is present in the high-speed buffer memory. If it is not, the prefetch request issuing unit 222 issues a prefetch request for that buffer memory line. When prefetch requests from the instruction prefetch unit 210 and the data prefetch unit 220 occur at the same time, the arbitrator 230 arbitrates between them and requests the prefetch from the main memory 180 via the prefetch request information 151. One example of an arbitration method is always to give priority to prefetch requests from the instruction prefetch unit 210.

[0042] In the first embodiment of the control method described above, the prefetched buffer memory line is guaranteed to contain the instruction designated by the instruction pointer 227-1 or the data designated by the frame pointer 227-2, but it is not guaranteed to contain the whole thread that starts at the instruction designated by the instruction pointer 227-1, or the whole frame designated by the frame pointer 227-2. Consequently, a required instruction or data item may still be absent from the high-speed buffer memory 140 while the thread is executing.

[0043] To improve on this point, a second embodiment of the processor control method according to the present invention adds to the first embodiment a limit on the size of the thread or the frame, so that the thread or the frame is contained within a single buffer memory line. As a result, prefetching the buffer memory line designated by the instruction pointer 227-1 or the frame pointer 227-2 makes it possible to prefetch all of the thread's instructions and data into the high-speed buffer memory 140 before the thread is executed.

[0044] FIG. 3 is a detailed block diagram of the continuation buffer 120, used to explain a third embodiment of the control method for the processor 110 of FIG. 1. In FIG. 3, the continuation buffer 120 stores n continuations (n is a positive integer), from continuation (1) 326-1 to continuation (n) 326-n. Each continuation consists of an instruction pointer 327-1, a frame pointer 327-2, thread length information 327-3, and frame size information 327-4.

[0045] The third embodiment is characterized in that each continuation contains thread length information 327-3, which indicates the length of the thread, and frame size information 327-4, which indicates the size of the frame. Using this information, the prefetch device 150 performs the following prefetch operations. When prefetching instructions, it first uses the instruction pointer 327-1 and the thread length information 327-3 to calculate all the buffer memory lines that contain the thread. It then checks in turn, using the high-speed buffer memory inspection information 152, whether each of these lines is stored in the high-speed buffer memory 140, and issues a prefetch request for each line that is not.

[0046] When prefetching data, it uses the frame pointer 327-2 and the frame size information 327-4 to calculate all the buffer memory lines that contain the frame. It then checks in turn, using the high-speed buffer memory inspection information 152, whether each of these lines is stored in the high-speed buffer memory 140, and issues a prefetch request for each line that is not. This makes it possible to prefetch all of the thread's instructions and data into the high-speed buffer memory 140 before the thread is executed.
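The line calculation of the third embodiment, for either a thread (instruction pointer plus thread length) or a frame (frame pointer plus frame size), can be sketched as follows. The function names, the assumption that lengths are given in bytes, and the default 32-byte line size are hypothetical; the patent does not specify the units of the length information.

```python
def covering_lines(start, length, line_size=32):
    """Return base addresses of all lines overlapping [start, start + length)."""
    if length <= 0:
        return []
    first = start - (start % line_size)
    last_byte = start + length - 1
    last = last_byte - (last_byte % line_size)
    return list(range(first, last + line_size, line_size))


def missing_lines(start, length, resident_lines, line_size=32):
    """Return, in address order, the covering lines not yet resident.

    `resident_lines` is a set of line base addresses assumed to be present
    in the high-speed buffer memory; one prefetch request would be issued
    per returned line.
    """
    return [line for line in covering_lines(start, length, line_size)
            if line not in resident_lines]
```

For example, a 0x40-byte thread starting at 0x1010 overlaps the three lines at 0x1000, 0x1020, and 0x1040; if only 0x1020 is resident, requests are issued for the other two.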

[0047]

[Effects of the Invention] According to the processor control method of the present invention, an effective prefetch technique can be provided, with a simple hardware configuration, for a processor that employs a continuation buffer or a similar mechanism, such as a multithreaded architecture.

[0048] In particular, when the thread to be executed next is uniquely determined in advance, as when the continuation buffer is organized as a FIFO queue, the prefetch technique provided by the processor control method according to the present invention can reliably prefetch not only the data but also the instructions needed to execute the next thread, without prefetching useless instructions or data.

[Brief description of drawings]

[FIG. 1] A block diagram of an embodiment of a processor, illustrating the processor control method according to the present invention.

[FIG. 2] A block diagram showing the configuration of the continuation buffer and the prefetch device in FIG. 1.

[FIG. 3] A block diagram showing the configuration of the continuation buffer in FIG. 1.

[Explanation of symbols]

110 Processor
111 Continuation input information
120 Continuation buffer
121 Instruction pointer read information
122 Frame pointer read information
123 Continuation read-ahead information
130 Program counter
131 Instruction address notification information
140 High-speed buffer memory
142 Data transfer information
150 Prefetch device
151 Prefetch request information
152 High-speed buffer memory inspection information
160 Arithmetic execution unit
161 Continuation storage information
162 Data address notification information
170 Register
180 Main memory
181 Data transfer information
210 Instruction prefetch unit
220 Data prefetch unit
226-1, ..., 226-n Continuation (1), ..., continuation (n)
227-1 Instruction pointer
227-2 Frame pointer
230 Arbitrator
211, 221 High-speed buffer memory monitor unit
212, 222 Prefetch request issuing unit
226 Continuation
326 Continuation
326-1, ..., 326-n Continuation (1), ..., continuation (n)
327-1 Instruction pointer
327-2 Frame pointer
327-3 Thread length information
327-4 Frame size information

Claims

[Claims]

1. A method of controlling a processor comprising: a continuation buffer that stores an instruction pointer designating the address of the first instruction of an instruction sequence, which is a series of executable instructions, and a frame pointer designating a data storage area that stores data corresponding to the instruction sequence; a program counter that designates the address of the instruction currently being executed; and a high-speed buffer memory that has a plurality of buffer memory lines and stores instructions and data; wherein, when the instruction pointer stored in the continuation buffer and the frame pointer corresponding to that instruction pointer are taken out, the instruction pointer is stored in the program counter, the instruction pointer stored in the program counter is taken out, the address at which the first instruction of the instruction sequence designated by the taken-out instruction pointer is stored is determined, and the arithmetic processing of the instruction sequence starting from the first instruction is executed sequentially on the data storage area designated by the frame pointer, the instruction sequence designated by the instruction pointer stored in the continuation buffer is prefetched into the high-speed buffer memory in advance.

2. The method of controlling a processor according to claim 1, wherein the length of the instruction sequence is limited to a length that can be stored in the buffer memory line that constitutes the high speed buffer memory.

3. The method of controlling a processor according to claim 1, wherein instruction sequence length information indicating the length of the instruction sequence is stored in the continuation buffer together with the instruction pointer, and all the buffer memory lines containing the required instruction sequence are prefetched into the high-speed buffer memory in advance by using the instruction pointer stored in the continuation buffer and the instruction sequence length information corresponding to that instruction pointer.

4. The method of controlling a processor according to claim 1, wherein the required data storage area is prefetched into the high-speed buffer memory in advance by using the frame pointer stored in the continuation buffer.

5. The method of controlling a processor according to claim 4, wherein the size of the data area is limited so that the data area can be stored within one of the buffer memory lines constituting the high-speed buffer memory.

6. The method of controlling a processor according to claim 1, wherein frame size information indicating the size of the data area is stored in the continuation buffer together with the frame pointer, and the required data area is prefetched into the high-speed buffer memory in advance by using the frame pointer stored in the continuation buffer and the frame size information corresponding to that frame pointer.