JPS601655B2

JPS601655B2 - Data prefetch method

Info

Publication number: JPS601655B2
Application number: JP52142631A
Authority: JP
Inventors: 広幸坂本; 潔森島
Original assignee: Toshiba Corp; Nippon Electric Co Ltd
Current assignee: Toshiba Corp; NEC Corp
Priority date: 1977-11-30
Filing date: 1977-11-30
Publication date: 1985-01-16
Also published as: JPS5475964A

Description

【発明の詳細な説明】本発明はキャシュメモリを有する情報処理装置における
データプリフェッチ方式に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a data prefetch method in an information processing apparatus having a cache memory.

従来、先行制御における命令のプリフェッチに関する技
術は多いが、オペランドのプリフヱッチに関する技術に
ついては極めて少ない。ところがキャシュメモリを有す
る情報処理装置では、データ処理の実行中に必要なオペ
ランドを、その都度主メモリとブロックで対応づけられ
ているキャシユメモリにアクセスするが、キャシュメモ
リにそのオペランドが存在しない場合には主メモリから
キャシュメモリーこブロック転送がなされ、この間処理
動作は特さたれてしまう。Conventionally, there are many techniques related to prefetching of instructions in advance control, but there are very few techniques related to prefetching of operands. However, in an information processing device having a cache memory, necessary operands are accessed each time during execution of data processing from the cache memory, which is associated with the main memory in blocks, but if the operand does not exist in the cache memory, A block is transferred from the main memory to the cache memory, and processing operations are interrupted during this time.

この為、キャシュメモリを有する情報処理装置において
はキャシュメモリ内に必要なデータができるだけ多く記
憶していることが望ましいわけであり、これを実現する
方法として近い将釆必要と思われるデータをキヤシュメ
モリにブロック転送するプリフェッチ手段が必要とされ
る。本発明は上記プリフェツチの制御方式に関するもの
で、特に長いオペランドあるいは複数のオペランドの処
理を実行する命令で有効な情報処理装置におけるプリフ
ェッチ方式を提供することを目的とする。For this reason, it is desirable for information processing devices with cache memory to store as much necessary data as possible in the cache memory, and one way to achieve this is to store data that will be necessary in the near future in the cache memory. A prefetch means is required for block transfer. The present invention relates to the above prefetch control method, and an object of the present invention is to provide a prefetch method for an information processing device that is particularly effective for instructions that process long operands or multiple operands.

そして、本発明では先行制御の状態に関係なく、現在実
行している命令のオペランドのプリフェッチを行なう特
徴を有する。The present invention is characterized in that the operands of the currently executed instruction are prefetched regardless of the state of advance control.

また、本発明ではプリフェツチによる障害を取り除く機
能を有する。即ち、プリフェッチの実行中はプロセッサ
をストップしないため、プリフェッチの動作が完了する
前に同一ブロックに対して、新なＲＥＡＤ要求又はＷＲ
ＩＴＥ要求が発生する可能性がある。従って、前記ＲＥ
ＡＤ要求がプリフェッチの動作中に発生されると、主メ
モ，ＩＪの同一ブロックに対し２重のアクセスが行なわ
れるため時間の無駄が生ずる。また、ＷＲＩＴＥ要求が
プリフェツチ中に行なわれると、主メモリとキャシュメ
モリの内容に矛盾をきたすことになる。そこで、本発明
では前記の障害の発生を防止するため、プリフェッチと
同一ブロックに対してデータ転送要求が発生した場合、
プリフェッチが終了するまで、前記データ転送要求の処
理を待機させる手段を具備するものである。Furthermore, the present invention has a function of eliminating the trouble caused by prefetching. That is, since the processor is not stopped while prefetching is being performed, a new READ request or WR request is made to the same block before the prefetching operation is completed.
ITE requests may occur. Therefore, the RE
If an AD request is generated during a prefetch operation, the same block in the main memory and IJ will be accessed twice, resulting in wasted time. Additionally, if a WRITE request is made during prefetching, the contents of main memory and cache memory will be inconsistent. Therefore, in the present invention, in order to prevent the above-mentioned failure from occurring, when a data transfer request occurs for the same block as the prefetch,
The apparatus includes means for making the processing of the data transfer request wait until the prefetch is completed.

以下、図面を参照し本発明のデータプリフェッチ方式を
詳述する。Hereinafter, the data prefetch method of the present invention will be described in detail with reference to the drawings.

第１図はキャシュ・メモリを有する情報処理装置の概略
構成を示したブロック図である。FIG. 1 is a block diagram showing a schematic configuration of an information processing device having a cache memory.

同時において１川ま主メモリ、１１は主メモリー０とブ
ロックで対応づけられたデータを保持するキャシュ・メ
モリ、なおキヤシユ・メモリ１１は主メモリー０より高
速アクセス出来るメモリ素子によって構成されているこ
とは言うまでもない。１２は演算を実行する演算制御部
、１３は命令の解読、命令の分岐制御、オペランドのア
ドレス計算、オペランドアクセスなどを実行する命令制
御部、１４は命令制御部１３または演算制御部１２とキ
ャシュ・メモリー１あるいは主メモリー０とのデータ転
送を制御するメモリアクセス制御部、１５はマイクロプ
ログラム制御方式に必要な制御記憶部である。なお、第
１図ではマイクロプログラム制御による情報処理装置の
実施例を示したものであるが、本発明は制御記憶部１５
を有しないハードウェア制御による情報処理装置であっ
ても十分実施し得るものである。第２図は、本発明のプ
リフェッチ方式が適用される命令形式の一実施例である
。At the same time, 1 main memory, 11 is cache memory that holds data associated with main memory 0 in blocks, and cache memory 11 is composed of memory elements that can be accessed faster than main memory 0. Needless to say. 12 is an arithmetic control unit that executes arithmetic operations; 13 is an instruction control unit that executes instruction decoding, instruction branch control, operand address calculation, operand access, etc.; and 14 is an instruction control unit 13 or an arithmetic control unit 12 and a cache. A memory access control section 15 that controls data transfer with the memory 1 or the main memory 0 is a control storage section necessary for the microprogram control method. Although FIG. 1 shows an embodiment of the information processing device controlled by a microprogram, the present invention is based on the control storage unit 15.
The present invention can be implemented satisfactorily even with an information processing device controlled by hardware that does not have a hardware control system. FIG. 2 is an example of an instruction format to which the prefetch method of the present invention is applied.

同図で示されている命令語は命令コードとオペランド１
乃至オペランド３とによって構成されており、前記オペ
ランドー乃至オペランド３中のＡＩ，Ａ２，Ａ３は各々
のアドレス部、Ｔ１，Ｔ２，Ｔ３は各々のデータタイプ
（１坊隼数の符号形式、１坊隼数または文字のビット長
を示す。）、Ｎ１，Ｎ２，Ｎ３は各々の１０進数または
文字のオペランド長を示す。そして、この命令はオペラ
ンドーとオペランド２とのデータを演算し、結果をオペ
ランド３にストアする命令と仮定する。第３図は第１図
に示した情報処理装置において本発明のプリフェッチ方
式が適用される先行制御の深さを３段にした場合の実施
例を示したものである。The instruction word shown in the figure is the instruction code and operand 1.
AI, A2, and A3 in the operands and operands 3 are the respective address parts, and T1, T2, and T3 are the respective data types (code format of 1-bod Hayabusa number, 1-bob Hayabusa code format, ), N1, N2, and N3 indicate the operand length of each decimal number or character. Assume that this instruction is an instruction that operates on the data of operand and operand 2, and stores the result in operand 3. FIG. 3 shows an embodiment in which the depth of advance control to which the prefetch method of the present invention is applied in the information processing apparatus shown in FIG. 1 is set to three stages.

同図において、命令制御部１３で行なわれる処理動作を
１サイクルと称し、この１サイクルは命令の解読、オペ
ランドアドレスの計算、メモリコマンド発生、ページア
ドレスの実アドレスへの変換等を実行する。また、メモ
リアクセス制御部１４、キャシュメモリ１１、主メモリ
１０で行なわれる処理動作をＣサイクルと称し、このＣ
サイクルでは前記１サイクルで発生されたデータ転送要
求を受取り、キャシュメモリ１１あるいは必要に応じて
主メモリー０のアクセスを行ない、必要ならばアクセス
したデータを演算制御部１４へ転送する動作を行なう。
更に、演算制御部１４で行なわれる処理動作をＥサイク
ルと称し、このＥサイクルはメモリアクセス制御部１４
から転送されたデータを用いて演算の実行を行なう。さ
て、上託した第３図で示される３段の先行制御方式をと
り、第２図の命令を実行したときの処理動作を第４図の
タイミングチャートを参照し説明する。In the figure, the processing operation performed by the instruction control unit 13 is called one cycle, and this one cycle executes decoding of an instruction, calculation of an operand address, generation of a memory command, conversion of a page address into a real address, etc. Furthermore, the processing operations performed in the memory access control unit 14, cache memory 11, and main memory 10 are referred to as a C cycle.
In the cycle, the data transfer request generated in the previous cycle is received, the cache memory 11 or the main memory 0 is accessed as necessary, and the accessed data is transferred to the arithmetic control section 14 if necessary.
Furthermore, the processing operation performed by the arithmetic control unit 14 is referred to as an E cycle, and this E cycle is performed by the memory access control unit 14.
The calculation is performed using the data transferred from. Now, with reference to the timing chart of FIG. 4, the processing operation when the three-stage advance control method shown in FIG. 3 is adopted and the instructions of FIG. 2 are executed will be explained.

第４図において、１，Ｃ，Ｅは第３図の１サイクル、Ｃ
サイクル、Ｅサイクルでの処理に対応する。そして、本
発明の先行制御を第２図の命令の実行に適用すると、先
ず１１にて命令の最初の語、命令語がキャシュメモリ１
１のインストラクションバッフア（図示せず）から命令
制御部亀３に送られ、その命令の解読が行なわれる。こ
の１１時点で命令制御部１３は複数命令語である事、ま
たプリフェツチを行なう命令である事が判断される。次
に１２では命令の次の語（オペランドーに関する諸）が
キャシュメモリ１１から命令制御部１３に送られ、オペ
ランドアドレス生成回路によってオペランド１のアドレ
スが計算される。In Figure 4, 1, C, and E are 1 cycle in Figure 3, and C
It corresponds to processing in cycle and E cycle. When the advance control of the present invention is applied to the execution of the instruction shown in FIG.
The instruction is sent from the instruction buffer 1 (not shown) to the instruction control unit 3, and the instruction is decoded. At time point 11, the instruction control unit 13 determines that there are multiple instruction words and that the instruction is for prefetching. Next, at step 12, the next word of the instruction (related to the operand) is sent from the cache memory 11 to the instruction control section 13, and the address of operand 1 is calculated by the operand address generation circuit.

このオペランド１のアドレスはページアドレス変換回路
によってアドレス変換され、そのアドレスがメモリアド
レス制御部１４に転送されると同時に、命令制御部１３
内のメモリコマンド発生回路からブリフェツチ要求のコ
マンドをメリアクセス制御部１４に転送する。次に１３
では命令の次の語（オペランド２に関する諸）がキャシ
ュメモリ１１から命令制御部１３に転送され、上記と同
様にオペランドアドレス生成回路によってオペランド２
のアドレスが計算される。The address of operand 1 is converted by the page address conversion circuit, and at the same time, the address is transferred to the memory address control unit 14, and at the same time, the address is transferred to the instruction control unit 13.
A briffet request command is transferred from the memory command generation circuit in the memory access control unit 14. Next 13
Then, the next word of the instruction (related to operand 2) is transferred from the cache memory 11 to the instruction control unit 13, and similarly to the above, the operand address generation circuit generates the operand 2.
address is calculated.

このオペランド２のアドレスもページアドレス変換回路
によってアドレスが変換され、そのアドレスとメモリコ
マンド発生回路からのプリフェッチ要求のコマンドをメ
モリアクセス制御部１４に転送する。この１３と同じタ
イミングにてＣサイクルのＣＩが並列的に処理される。
即ち、ＣＩでは１２で発生したプリフェッチ要求を受取
ったメモリアクセス制御部１４は要求されたアドレスを
有するブロックがキヤシユメモリ１１に存在するかどう
かを調べ、もしキヤシユメモリ１１に存在しない場合は
主メモリ１０に対してブロック転送要求を発生する。こ
のブロック転送によるデー外まキャシュメモリ１１の対
応するブロックに書込まれるが、演算制御部１２には転
送されない。一方、要求されたアドレスを有するブロッ
クがキャシュメモリ１１に存在するならばメモリアクセ
ス制御部１４は、前記プリフェツチ要求に対して何の処
理も行なわれない。The address of this operand 2 is also converted by the page address conversion circuit, and the address and the prefetch request command from the memory command generation circuit are transferred to the memory access control unit 14. CI of C cycle is processed in parallel at the same timing as this 13.
That is, in the CI, upon receiving the prefetch request generated in step 12, the memory access control unit 14 checks whether a block having the requested address exists in the cache memory 11, and if it does not exist in the cache memory 11, the block is sent to the main memory 10. A block transfer request is generated. The data resulting from this block transfer is written into the corresponding block of the cache memory 11, but is not transferred to the arithmetic control unit 12. On the other hand, if a block having the requested address exists in the cache memory 11, the memory access control unit 14 does not perform any processing in response to the prefetch request.

次に１４では、命令のオペランド３に関する語がキヤシ
ュメモリ１１から命令制御部１３に送られ、オペランド
３のアドレス計算が行なわれる。Next, in step 14, a word regarding operand 3 of the instruction is sent from cache memory 11 to instruction control section 13, and address calculation for operand 3 is performed.

オペランド３のアドレスはストアオペランドを示すもの
であるため、プリフェツチの必要はない。この１４と同
じタイミングであるＣ２では、上記１３で発生したオペ
ランド２のプリフェッチ要求の処理が行なわれる。この
プリフェツチ要求の処理は上記したＣＩで実行したオペ
ランド１のプリフェッチ要求と同一の処理が行なわれる
。そして、オペランド２に対するプリフェツチ要求した
アドレスを有するブロックがキャシュメモリ１１に存在
しない場合は主メモリー０からブ。ック転送が行なわれ
、キャシュメモリ１１に確保される。最後にＥサイクル
で処理されるＥＩ乃至ＥＮでは実際の命令の実行が行な
われる。Since the address of operand 3 indicates a store operand, there is no need for prefetching. At C2, which is the same timing as 14, the prefetch request for operand 2 generated in 13 above is processed. The processing of this prefetch request is the same as the prefetch request for operand 1 executed by the CI described above. If there is no block in the cache memory 11 that has the address requested for prefetching of operand 2, the block is read from main memory 0. A block transfer is performed and the data is secured in the cache memory 11. Finally, in EI to EN processed in the E cycle, actual instructions are executed.

即ち、上記１２乃至１４において計算されたアドレスで
オペランドァクセス、演算の実行、結果のストアを行な
つ。上記の実施例では、先行制御を初期の１サイクルの
段階でプリフェッチ要求を発生させるもので、第４図の
１２および１３にオペランド１およびオペランド２のプ
リフェッチ要求を示すコマンドを命令制御部１３内のメ
モリコマンド発生回路からメモリアクセス制御部１４に
発生していた。That is, operand access, operation execution, and result storage are performed using the addresses calculated in steps 12 to 14 above. In the above embodiment, a prefetch request is generated at the initial stage of one cycle of advance control, and commands indicating prefetch requests for operands 1 and 2 are sent to 12 and 13 in FIG. 4 in the instruction control unit 13. This was generated from the memory command generation circuit to the memory access control unit 14.

しかしながら、プリフェツチ要求は上記実施例に限定さ
れるものではなく、例えば第１図の制御記憶部１５にブ
リフェッチ要求を発生する機能を具備させてもさしつか
えない。なお、制御記憶部１５を有しない情報処理装置
では第１図の演算制御部１２にプリフェッチ要求を発生
する機能を具備させてもさしつかえない。さて、次に前
記した制御記憶部１５にプリフェッチ要求を発生する機
能を具備せしめた場合の処理動作を第５図のタイミング
チャートを参照し説明する。However, the prefetch request is not limited to the above embodiment; for example, the control storage unit 15 shown in FIG. 1 may be provided with a function of generating a prefetch request. Note that in an information processing apparatus that does not have the control storage section 15, the arithmetic control section 12 shown in FIG. 1 may be provided with a function of generating a prefetch request. Next, the processing operation when the control storage section 15 described above is provided with a function of generating a prefetch request will be explained with reference to the timing chart of FIG.

なお、第５図に示すタイミングチャートはオペレーティ
ングシステムなどで使用されている制御命令の実行を実
施例として示したものである。前記制御命令ではＡ，Ｂ
，Ｃ、の３つのオペランド‘こ対して前処理、オペラン
ドアクセス、演算処理が必要とされている命令であって
、この場合のプリフェッチを以下に説明する。先ず、１
サイクルでは命令の解読が行なわれる。Note that the timing chart shown in FIG. 5 shows an example of the execution of control instructions used in an operating system or the like. In the above control command, A, B
, C, which requires preprocessing, operand access, and arithmetic processing for the three operands. Prefetch in this case will be explained below. First, 1
In the cycle, instructions are decoded.

次のＣサイクルではオペランドアクセスが行なわれ、こ
の時プリフェツチは行なわれない。次のＥサイクルでは
この命令の主な実行を行なう。即ち、ＥＩの期間でオペ
ランドＡに関する前処理が行なわれる。次にＥ２の期間
でオペランドＢの前処理と同時にオペランドＡのプリフ
ェッチが行なわれる。次にＥ３の期間でオペランドＡの
アクセスと演算処理が行なわれる。このＥ３の段階では
前記Ｅ２にてオペランドＡのプリフェツチが行なわれて
いるため、オペランドＡのアクセスと演算処理が実行す
ることが出釆る。次にＥ４の期間でオペランドＣの前処
理と同時にオペランドＢのプリフエツチが行なわれる。
次にＥ５の期間ではオペランドＢのアクセスと演算処理
が行なわれる。このＥ５の段階では前記Ｅ４にてオペラ
ンドＢのプリフェツチが行なわれているため、オペラン
ドＢのアクセスと演算処理が実行することが出来る。最
後にＥ６の期間ではオペランドＣのアクセスと演算処理
が行なわれる。前記のようにオペランドアクセス以前に
必然的に行なうべき他の仕事があるとき、これを積極的
に前段にもってきて、且つ同時にプリフェツチを行なう
ことにより優れた効果を発揮することが出釆る。上記し
たプリフェッチ動作をより効果的に使用するためには「
プリフェッチによるブロック転送が完全に終了してから
実際のオペランドアクセスが行なわれた方がよい訳だが
、プリフェッチに対する主メモリ１０のデータ転送が遅
れた時、またプリフヱッチ後のオペランドアクセスが命
令制御部１３または演算制御部１２から早い時期に出さ
れた時などは、プリフェツチが終了しないうちに新たな
オペランドアクセスが出る可能性がある。Operand access is performed in the next C cycle, and no prefetch is performed at this time. In the next E cycle, this instruction is mainly executed. That is, preprocessing regarding operand A is performed during the EI period. Next, in period E2, operand A is prefetched at the same time as operand B is preprocessed. Next, during period E3, access and arithmetic processing of operand A are performed. At this stage of E3, since operand A has been prefetched at E2, access and arithmetic processing of operand A may be executed. Next, in period E4, prefetching of operand B is performed simultaneously with preprocessing of operand C.
Next, in period E5, access and arithmetic processing of operand B are performed. At this stage of E5, since operand B has been prefetched at E4, access and arithmetic processing of operand B can be executed. Finally, in period E6, access and arithmetic processing of operand C are performed. As mentioned above, when there is other work that must necessarily be done before operand access, excellent effects can be achieved by actively bringing it to the front stage and performing prefetching at the same time. In order to use the prefetch operation described above more effectively,
It is better to perform the actual operand access after the block transfer by prefetch is completely completed, but if the data transfer of the main memory 10 for prefetch is delayed, or if the operand access after prefetch is delayed by the instruction control unit 13 or If the arithmetic control unit 12 issues the operand at an early stage, there is a possibility that a new operand access will occur before the prefetch is completed.

つまり、プリフェッチが終了していないうちに同一フロ
ッ外こ対して新たなオペランドアクセスが行なわれた場
合、２重のデータ転送要求が主メモリの同一ブロックに
出力されることがあり得る。この様な状態になった時、
プリフェッチの効果がなくなるばかりでなく、後のオペ
ランドアクセスに遅れを生じさせることも起り得る。ま
た、何らかの原因でプリフェツチと同一ブロックに書き
込み要求が発生したとき、書き込み要求がプリフェッチ
終了以前に受けつけられると主メモリ１０とキャシュメ
モリ１１の内容に矛盾をきたすことになる。この場合は
プリフェツチによるキヤシュメモリ１１の書き込みが終
了してから前記書き込み要求によるキャシュメモリへの
書き込みを行なわなければならない。本発明はキャシュ
ディレクトリ一に工夫をこらし、プリフェツチ要求のコ
マンド‘こよるデータ転送が完了しないうちにキャシュ
メモリの同一ブロックに対して新たなデータ転送要求が
発生した時に、主メモリの同一ブロックに対して２重の
アクセスを防止するための機能を有する。That is, if a new operand access is made outside the same flop before prefetching is completed, double data transfer requests may be output to the same block of the main memory. When you are in a situation like this,
Not only will the effect of prefetching be lost, but it may also cause a delay in subsequent operand access. Further, when a write request is generated in the same block as the prefetch for some reason, if the write request is accepted before the prefetch is completed, the contents of the main memory 10 and the cache memory 11 will be inconsistent. In this case, it is necessary to write to the cache memory 11 according to the write request after the prefetch has finished writing to the cache memory 11. In the present invention, the cache directory is devised so that when a new data transfer request occurs to the same block in the cache memory before the data transfer caused by the prefetch request command is completed, the same block in the main memory is It has a function to prevent double access.

第６図は上記したキャシュディレクトリ一の実施例を示
すものである。FIG. 6 shows an embodiment of the above cache directory.

同図において、ＡＯ乃至ＡＮは仮想アドレス、ＶＯ乃至
ＶＮはディレクトリ−の内容が有効かどうかを示すバリ
ディティビット、ＰＯ乃至ＰＮはどのブロックを使用す
るかをそのアルゴリズムで決められるラワンド・ロビン
カウンタ以外にプリフェッチ要求によるデータ転送が完
了していないことを示すペンディングビット、ＲＲは次
に使用されるブロック。ケーションを示す。この第６図
で示したキャシュディレクトリ−を用いたときの第１図
の情報処理装置におけるデータ転送の制御を第７図のフ
ロチャートを参照し以下に説明する。本発明ではデータ
転送要求が発生したとき前記ペンディングビットを調べ
、もし前記ペンディングビットがセットしていれば、以
前に同一ブロックに対してプリフィッチ要求が出力して
、前記ブロックの転送が終了していないことを意味し、
データ転送要求の処理は前記ペンディングビットがリセ
ット、即ちプリフエツチ動作が終了するまで待機される
。即ち、第７図において、命令制御部１３または演算制
御部１２からメモリアクセス制御部１４にデータ転送要
求が転送されて来ると、メモリアクセス制御部１４は対
応するブロックのペンディングビットをＳＩで調べる。In the figure, AO to AN are virtual addresses, VO to VN are validity bits that indicate whether the contents of the directory are valid, and PO to PN are non-rawand robin counters that can determine which block to use using the algorithm. The pending bit indicates that the data transfer due to the prefetch request has not been completed, and the RR indicates the next block to be used. to show the application. The control of data transfer in the information processing apparatus of FIG. 1 when using the cache directory shown in FIG. 6 will be described below with reference to the flowchart of FIG. 7. In the present invention, when a data transfer request occurs, the pending bit is checked, and if the pending bit is set, a prefetch request was previously output for the same block and the transfer of the block has been completed. means no,
Processing of the data transfer request is awaited until the pending bit is reset, ie, the prefetch operation is completed. That is, in FIG. 7, when a data transfer request is transferred from the instruction control section 13 or the arithmetic control section 12 to the memory access control section 14, the memory access control section 14 checks the pending bit of the corresponding block using SI.

ペンディングビットが１で且つディレクトリ‐がビツト
した時は同一ブロックがプリフェッチにによるデータ転
送待ちであるため、ペンディングビットが０になるまで
前記データ転送要求は受けつけられない。ペンディング
ビットが０となりデータ転送要求が受けつけられると、
メモリコマンドのタイプがＳ２で調べられる。When the pending bit is 1 and the directory is bit, the same block is waiting for data transfer by prefetch, so the data transfer request cannot be accepted until the pending bit becomes 0. When the pending bit becomes 0 and the data transfer request is accepted,
The type of memory command is checked in S2.

例えば前記メモリコマンドがライトコマンドであるなら
ば対応するブロックがキャシュメモリ１１に存在するか
どうかをＳ３で調べられる。そして、対応するブロック
がキャシュメモリ１１に存在（ヒット）する時はキャシ
ュメモリ１１の対応するブロックにデータをＳ４で書き
込み、Ｓ５で主メモリー０に対してライトデータ転送要
求を出す。しかし対応するブロックがキャシュメモリ１
１に存在しない（ミスヒット）の時はキャシュメモリ１
１へのデータの書き込みは行なわず、Ｓ５で主メモリー
０１こ対してライトデータ転送要求を出す。そしてライ
ト動作は終了する。次に、メモリコマンドのタイプがＳ
２で調べた時、リードコマンドであるならばＳ６にて普
通のリード要求か、あるいはプリフェツチ要求かを調べ
る。For example, if the memory command is a write command, it is checked in S3 whether a corresponding block exists in the cache memory 11. When the corresponding block exists (hit) in the cache memory 11, data is written to the corresponding block in the cache memory 11 in S4, and a write data transfer request is issued to the main memory 0 in S5. However, the corresponding block is cache memory 1
If it does not exist in cache memory 1 (miss hit), cache memory 1
No data is written to the main memory 01, and a write data transfer request is issued to the main memory 01 in S5. The write operation then ends. Next, the type of memory command is S.
When checked in step 2, if it is a read command, it is checked in step S6 whether it is a normal read request or a prefetch request.

そして普通のリード要求のときは対応するフロックがキ
ャシュメモリ１１に存在するかどうかをＳ７で調べ、存
在する（ヒット）時はＳ８にてキャシュメモリ１１の対
応するブロックをアクセスし、その読み出したデータを
演算制御部１２または命令制御部１３に転送し終了する
。一方Ｓ７で調べた結果対応するブロックがキャシュメ
モリ１１に存在しない（ミスヒット）の場合は、Ｓ９に
て演算制御部１２命令制御部１３の実行を停止せしめ、
ＳＩ川こて王〆モリ１０１こ対してリード要求を発生す
る。ＳＩＩでの主メモリ１０からｌｊードデータの転送
が完了すると、Ｓ１２にてキャシュメモリ１１の対応す
るブロックに前記リードデータを書き込み、更にＳＩ３
にてディレクトリ−の対応するブロックのバリデイティ
ビツトをセットする。前記により対応するブロックがキ
ャシュメモリ１１に確保されると、ＳＩ４にて前記リー
ドデータを演算制御部１２または命令制御部１３へ転送
され、Ｓ１５にて演算制御部１２、命令制御部１３の実
行を再開し終了する。In the case of a normal read request, it is checked in S7 whether the corresponding block exists in the cache memory 11, and if it exists (hit), the corresponding block in the cache memory 11 is accessed in S8, and the read data is is transferred to the arithmetic control unit 12 or instruction control unit 13, and the process ends. On the other hand, if the result of checking in S7 is that the corresponding block does not exist in the cache memory 11 (mishit), the execution of the arithmetic control unit 12 and instruction control unit 13 is stopped in S9,
A read request is issued to SI Kawakoteoh Shimori 101. When the transfer of the lj code data from the main memory 10 in the SII is completed, the read data is written in the corresponding block of the cache memory 11 in S12, and then the read data is written in the corresponding block of the cache memory 11 in S12.
sets the validity bit of the corresponding block in the directory. When the corresponding block is secured in the cache memory 11 as described above, the read data is transferred to the arithmetic control unit 12 or the instruction control unit 13 in SI4, and the execution of the arithmetic control unit 12 or instruction control unit 13 is performed in S15. Restart and end.

ところで、前記Ｓ６においてプリフェッチのコマ、ンド
であることを判断した時の処理はＳ１６に進み、前記Ｓ
１６にて対応するブロックがキャシュメモリ１１に存在
するかどうかを調べる。そして、キャシュメモリ１１に
対応するブロックが存在する（ヒット）時はプリフェッ
チコマンド｝こ対し何の動作も行なわず終了する。しか
し、対応するブロックがキャシュメモリ１１に存在しな
い（ミスヒット）場合はＳ１７にてディレクトリ−の対
応するペンディングビットをセットし、更にＳ１８にて
主メモリ１川こ対してリード要求を発生する。このリー
ド要求はＳ９乃至ＳＩ５に示した普通のリード要求処理
と異なり、演算制御部１２、命令制御部１３の実行動作
は停止されることなく続行されている。そして、ＳＩ９
での主メモリ１０からのりードデータ転送が完了すると
、Ｓ２０もこて前記リードデータはキャシュメモリ１１
の対応するブロックに書き込まれる。この時点では演算
制御部１２または命令制御部１３への転送は行なわれな
い。次に、Ｓ２１にてディレクトリ−の対応するブロッ
クのペンディングビットがリセットし、且つバリディビ
ットをセットして動作を終了する。By the way, when it is determined in S6 that it is a prefetch command, the process proceeds to S16, and the S
At step 16, it is checked whether the corresponding block exists in the cache memory 11. Then, when the corresponding block exists in the cache memory 11 (hit), the process ends without performing any operation in response to the prefetch command. However, if the corresponding block does not exist in the cache memory 11 (mishit), the corresponding pending bit of the directory is set in S17, and a read request is issued to the main memory 1 in S18. This read request is different from the normal read request processing shown in S9 to SI5, in which the execution operations of the arithmetic control section 12 and instruction control section 13 are continued without being stopped. And SI9
When the read data transfer from the main memory 10 is completed in step S20, the read data is transferred to the cache memory 11.
is written to the corresponding block. At this point, the data is not transferred to the arithmetic control section 12 or instruction control section 13. Next, in S21, the pending bit of the corresponding block in the directory is reset, and the valid bit is set, and the operation ends.

以上、説明したように、本発明によれば以下に示す特徴
と効果を有するものである。第１に本発明はプリフェツ
チ要求発生のアルゴリズムは命令実行中の有効な時期に
現在実行中の命令が近い将来必要とされるオペランドも
こ対して発生される。As described above, the present invention has the following features and effects. First, in the present invention, the algorithm for generating a prefetch request is generated at a valid time during instruction execution, also for operands that will be required in the near future by the currently executing instruction.

このため、将来の先行制御方式をとる情報処理装置の中
で行なわれる本来のプリフェッチあるいは先行制御の空
き時間を利用して行なうプリフェツチと異なり独自のも
のである。第２に長いオペランド、複数オペランドの命
令に対するプリフェッチに優れた効果を発揮する。即ち
、十進数命令、文字を扱う命令、オベレーテイングシス
テムで使用される制御命令などではオペランドに長く複
数にまたがることが多い。本発明はこのような命令の実
行中にプリフェッチ要求を出すタイミングを容易に選べ
ることができる。第３にプリフヱッチするオペランドの
長さを任意に選択できる。即ち、本発明では個々の命令
の実行中にプリフェツチ要求が出されるためその命令に
合った長さのオペランドをプリフェツチすることが可能
である。第４にプリフェッチされたデータが使用される
確率が極めて高い。For this reason, this method is unique, unlike the original prefetch that will be performed in an information processing apparatus that employs a future advance control method or the prefetch that is performed using idle time of advance control. Second, it exhibits an excellent prefetch effect for instructions with long operands or multiple operands. That is, decimal instructions, instructions that handle characters, control instructions used in operating systems, etc. often have long operands spanning multiple operands. The present invention allows the timing of issuing a prefetch request to be easily selected during the execution of such an instruction. Third, the length of the operand to be prefetched can be arbitrarily selected. That is, in the present invention, since a prefetch request is issued during the execution of an individual instruction, it is possible to prefetch an operand of a length suitable for that instruction. Fourth, the probability that prefetched data will be used is extremely high.

従来の先行制御の中で行なわれるブリフェッチは分岐命
令による分岐が意に反した場合は、その以前に出された
プリフェツチが意味を持たなくなることがあり得る。し
かし本発明では、命令の実行中にその命令が近い将来必
要とされるべきデータに対してプリフェッチ要求が発生
されるため、プリフヱツチされたデータが使用される確
率は極めて高い。In the case of a prefetch performed in conventional advance control, if a branch by a branch instruction is unexpected, the previously issued prefetch may become meaningless. However, in the present invention, since a prefetch request is generated for data that will be needed in the near future during the execution of an instruction, the probability that the prefetched data will be used is extremely high.

第５にプリフェッチ要求のコマンドもこよるデータ転送
が完了しないうちに、キャシュメモリの同一ブロックに
対して新たなデータ転送要求が発生したとしても、主メ
モリの同一ブロックに対する２重のアクセスは防止する
ことが出来る。Fifth, even if a new data transfer request occurs to the same block in the cache memory before the data transfer is completed, which also involves a prefetch request command, double access to the same block in the main memory should be prevented. I can do it.

[Brief explanation of the drawing]

第１図は本発明のプリフェツチ動作を適用する情報処理
装置のブロック図、第２図は本発明のプリフヱッチ動作
を必要とする形式の一実施例を示す図、第３図は第１図
で示した情報処理装置を処理サイクル毎に分割したブロ
ック図、第４図は第２図に示した命令形式のプＩＪフェ
ッチ処理を説明するタイミング図、第５図は他のプリフ
ェッチ処理を説明するタイミング図、第６図は本発明の
プリフェッチ処理に適用するキャシュディレクトリ一の
実施例を示す図、第７図は第６図のキャシュディレクト
リ一を有する第１図の情報処理装置の動作手順を示した
フローチャートである。１０……主メモリ、１１……キヤシユメモリ、１２・・
・・・・演算制御部、１３・・・・・・命令制御部、１
４．．・．・・メモリアクセス制御部。オー図オＺ図グミ図矛４図次５図次６図次７図FIG. 1 is a block diagram of an information processing device to which the prefetch operation of the present invention is applied, FIG. 2 is a diagram showing an embodiment of a format requiring the prefetch operation of the present invention, and FIG. 4 is a timing diagram illustrating pre-IJ fetch processing in the instruction format shown in FIG. 2, and FIG. 5 is a timing diagram illustrating other prefetch processing. , FIG. 6 is a diagram showing an embodiment of the cache directory 1 applied to the prefetch processing of the present invention, and FIG. 7 is a flowchart showing the operating procedure of the information processing apparatus of FIG. 1 having the cache directory 1 of FIG. 6. It is. 10...Main memory, 11...Cache memory, 12...
...Arithmetic control section, 13...Instruction control section, 1
4. ．．・．． ...Memory access control unit. O figure O Z figure Gummy Zuko 4 figure next 5 figure next 6 figure next 7 figure

Claims

[Claims]

1. A main memory, a cache memory that stores data associated with this main memory in blocks, an instruction that processes long operands or operands that exist in multiple locations, and a If the data of the operand to be executed does not exist in the cache memory, a prefetch means stores the data of the operand from the main memory to the cache memory by block transfer; The data prefetch is characterized by comprising means for causing processing of the data transfer request to wait until block transfer to the cache memory by the prefetch means is completed when a new data transfer request is generated. method.