JPH02191051A

JPH02191051A - Data processor

Info

Publication number: JPH02191051A
Application number: JP1012653A
Authority: JP
Inventors: Hajime Fukuzawa; 福澤　一
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-01-20
Filing date: 1989-01-20
Publication date: 1990-07-26
Anticipated expiration: 2010-05-01
Also published as: JPH0740246B2

Abstract

PURPOSE:To detect at high speed whether desired data exist in a block or not by holding the request information of a request for the block transfer of the data and comparing the address out of the block of this request with the address out of the block of a following request. CONSTITUTION:A holding means 17 is provided to hold the request when a first request 1 is cache miss hit and a comparing means 19 is provided to compare the address in the request information of a second request 2 following to the first request 1 with the address in the request information held by the holding means 17. Then, a means 24 is provided to request the block transfer of the data from a main memory with responding to the second request 2 when the second request 2 is the cache miss hit and the compared result of the comparing means 19 shows dissidence. Thus, it can be detected at high speed that whether the desired data exist in the block which requested the block transfer of the data to the main memory or not exist due to a cache miss.

Description

【発明の詳細な説明】玖■±芳本発明はデータ処理装置に関し、特にキャッシュメモリ
を有するデータ処理装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a data processing device, and more particularly to a data processing device having a cache memory.

従来技術キャッシュミスヒツトしたリクエストに応答して、主メ
モリにデータのブロック転送要求が行われるが、この場
合後続のリクエストの処理は以下の３通りのケースがあ
って、各ケースに応じて処理が夫々異なっている。Conventional technology In response to a cache miss request, a data block transfer request is made to the main memory. In this case, there are three cases in which subsequent requests are processed, and the processing is performed according to each case. Each one is different.

■後続リクエストがキャツシュヒツトの場合；この場合
は、データの順序性を保証するために、後続リクエスト
の処理は、先行するりクエストのブロック転送処理が終
了するまで待たされる。(2) When the subsequent request is a cash request: In this case, in order to guarantee the order of data, the processing of the subsequent request is waited until the block transfer process of the preceding request is completed.

■後続のリクエストがキャッシュミスヒラ１−シかつブ
ロック転送要求を行なったブロックにヒツトする場合：この場合は、先行するりクエストのブロック中に所望の
データが存在するために、後続のりクエストの処理は先
行するリクエストのブロック転送処理が終了するまで待
たされる。■When the subsequent request hits the block for which the block transfer request was made and there is a cache miss: In this case, since the desired data exists in the block of the preceding request, the subsequent request cannot be processed. is made to wait until the block transfer processing of the preceding request is completed.

■後続のリクエストがキャッシュミスヒットしかつブロ
ック転送要求を行なったブロックにもミスヒツトする場
合；この場合は、先行するブロック転送要求を行なっなリク
エストに連続して後続のリクエストのブロック転送要求
を行なうことができる。■If the subsequent request misses the cache and also misses the block for which the block transfer request was made; In this case, do not make the preceding block transfer request, and make the block transfer request of the subsequent request immediately after the request. Can be done.

従来のこの種のデータ処理装置では、上記３通りの状態
を検出して後続のリクエストの処理を行っており、■の
場合には後続リクエストの処理を直ちに行うことができ
るか、この場合も■、■と同様に後続リクエストの処理
を先行するリクエストのブロック転送要求が終了するま
で待たすタイプと、先行するリクエストのアドレス情報
をキヤッシュディレクトリに登録した後で、後続のリク
エストによって再びキャッシュディレクトリの索引から
やり直し、キャッシュミス検出を行い、その後に後続の
リクエストに対して主メモリにデータブロック転送要求
を行うタイプのものとがある。Conventional data processing devices of this type detect the above three conditions and process subsequent requests. , Similar to ■, there is a type in which processing of the subsequent request waits until the block transfer request of the preceding request is completed, and after registering the address information of the preceding request in the cache directory, the subsequent request reindexes the cache directory. There is a type that starts over from scratch, detects a cache miss, and then requests a data block transfer to the main memory in response to a subsequent request.

上述した従来のデータ処理装置では、後続リクエストに
ついて直ちに処理できるにもかかわらず待たされたり、
また先行するりクエストがキャッシュミスヒツトした場
合には、そのアドレス情報をキャッシュディレクトリに
登録した後に、必す後続のリクエストによってキャッシ
ュディレクトリの索引を行なって後続のリクエストの状
態をチエツクしなければならないという欠点がある。With the conventional data processing device described above, subsequent requests may be made to wait even though they can be processed immediately, or
Also, if a preceding quest misses the cache, the address information must be registered in the cache directory, and then the cache directory must be indexed by the subsequent request to check the status of the subsequent request. There are drawbacks.

最近の大型計算機では、その処理の高速化のために、１
つの処理を複数段のステージに分割して、処理の並列化
を図るパイプライン処理方式が一般的であり、各ステー
ジの遅延時間によって決定されるマシンサイクルを減少
させるために、パイプラインステージは増加傾向にある
。In recent large-scale computers, in order to speed up the processing, 1
Pipeline processing is common, in which one process is divided into multiple stages to parallelize the process.In order to reduce the machine cycle, which is determined by the delay time of each stage, the number of pipeline stages increases. There is a tendency.

従って、キャッシュディレクトリを索引するステージと
キャッシュのヒツトミスヒツトを判定するステージとが
同一ステージとはならず、複数のステージに分割される
場合かあり、またこのような高速なマシンサイクルで動
作する装置では、キャッシュのミスヒツトを検出した信
号で直接、後続のリクエストの動作を制御することは困
難であり、そのためにキャッシュのミスヒツトを検出し
たステージの次のステージで後続のリクエストの制御を
行なわなければならない。Therefore, the stage for indexing the cache directory and the stage for determining cache hits and misses may not be the same stage, but may be divided into multiple stages, and in devices that operate at such high speed machine cycles, It is difficult to directly control the operation of subsequent requests using a signal that detects a cache mishit, so subsequent requests must be controlled at the stage following the stage that detected a cache mishit.

そのなめに、先行するりクエストでキャッシュのミスヒ
ツトが検出された場合には、キャッシュディレクトリを
索引するステージからキャッシュのヒツトミスヒツトを
判定するステージまでに複数の後続のリクエストが保持
される状態となる。Therefore, if a cache miss is detected in a preceding request, a plurality of subsequent requests will be held between the stage of indexing the cache directory and the stage of determining a cache hit or miss.

このような状態で、先行するりクエストのアドレス情報
をキャッシュディレクトリに登録した後に、後続のリク
エストでキャッシュディレクトリを索引してキャッシュ
のヒツトミスヒツト状態をチエツクするためには、上記
の各ステージに保持されている後続のリクエストの退避
及び後続のリクエストによるキャッシュディレクトリの
索引のためのパイプラインステージの管理や、退避され
た後続のリクエストの各パイプラインステージへの回復
などの非常に複雑な制御と、相当なハードウェア量の投
資が必要になるという欠点がある。In such a state, after registering the address information of the preceding request in the cache directory, in order to index the cache directory in the subsequent request and check the hit-miss status of the cache, the address information held in each of the above stages is required. Management of pipeline stages for evacuation of subsequent requests and indexing of cache directories by subsequent requests, and restoration of ejected subsequent requests to each pipeline stage requires very complex control and considerable The disadvantage is that it requires investment in hardware.

発明の目的本発明の目的は、キャッシュミスヒツトしたりクエスト
に続く後のリクエストの処理性能を向上させ得るように
したデータ処理装置を提供することである。OBJECTS OF THE INVENTION An object of the present invention is to provide a data processing device that can improve the processing performance of subsequent requests following a quest or a cache miss.

発明の構成本発明によれは、主メモリの内容の一部写しをブロック
単位で保持するキャッシュメモリを有するデータ処理装
置であって、第１リクエストがキャッシュミスヒツトの
場合に、そのリクエスト情報を保持する保持手段と、前
記第１リクエストに続く第２のリクエストのリクエスト
情報内のアドレスと、前記保持手段に保持されているリ
クエスト情報内のアドレスとを比較する比較手段と、前
記第２リクエストがキャッシュミスヒツトの場合でかつ
前記比較手段の比較結果が不一致を示す場合に、前記第
２リクエストに応答して前記主メモリからのデータブロ
ック転送要求をなす手段とを有することを特徴とするデ
ータ処理装置が得られる。Structure of the Invention According to the present invention, there is provided a data processing device having a cache memory that holds a partial copy of the contents of the main memory in units of blocks, wherein when the first request is a cache miss, the request information is held. a holding means for comparing an address in the request information of a second request following the first request with an address in the request information held in the holding means; and means for requesting data block transfer from the main memory in response to the second request in the case of a mishit and when the comparison result of the comparison means indicates a mismatch. is obtained.

更に、本発明によれば、主メモリの内容の一部写しをブ
ロック単位で保持するキャッシュメモリと、前記キャッ
シュメモリの保持内容のディレクトリを記録するキャッ
シュディレクトリ手段と、前記キャッシュディレクトリ
手段の索引結果によりキャツシュヒツト状態を判定する
キャツシュヒツト判定手段とを含み、リクエストに応答
して前記キャッシュディレクトリ手段を索引する第１ス
テージと、この索引結果を用いて前記キャツシュヒツト
判定手段によりキャツシュヒツト状態を判定する第２ス
テージとを有するパイプライン処理方式のデータ処理装
置であって、キャッシュミスヒツトが判定されたリクエ
ストのリクエスト情報を保持するキャッシュミスヒツト
リクエスト保持手段と、このリクエストに続く後続リク
エストのリクエスト情報を前記第１及び第２ステージで
夫々保持する手段と、前記第２ステージでリクエスト情
報が保持されるリクエストに対する前記キャッシュディ
レクトリの索引結果を保持する索引結果保持手段と、前
記第２ステージで保持されているリクエスト情報内のア
ドレスと、前記キャッシュミスヒツトリクエスト保持手
段に保持されているリクエスト情報内のアドレスとを比
較する比較手段と、前記索引結果保持手段の内容がキャ
ッシュミスヒットを示しておりかつ前記比較手段の比較
結果が不一致を示している場合、前記第２ステージで保
持されているリクエストに応答して前記主メモリからの
データブロック転送要求をなす手段とを有することを特
徴とするデータ処理装置が得られる。Further, according to the present invention, there is provided a cache memory for holding a partial copy of the contents of the main memory in units of blocks, a cache directory means for recording a directory of the contents held in the cache memory, and an index result of the cache directory means. a first stage for indexing the cache directory means in response to a request; and a second stage for determining the cache hit status by the cache hit determination means using the index result. A pipeline processing type data processing device comprising: a cache miss request holding unit that holds request information of a request for which a cache miss has been determined; and a cache miss request holding unit that holds request information of a subsequent request following this request; an index result holding means for holding an index result of the cache directory for a request whose request information is held in the second stage; a comparison means for comparing the address with an address in the request information held in the cache miss request holding means, and a comparison result of the comparison means when the contents of the index result holding means indicate a cache miss; There is obtained a data processing apparatus characterized in that the data processing apparatus includes means for making a data block transfer request from the main memory in response to the request held in the second stage when the second stage indicates a mismatch.

更にはまた、本発明によれば、主メモリの内容の一部写
しをブロック単位で保持するキャッシュメモリと、前記
キャッシュメモリの保持内容のディレクトリを記録する
キャッシュディレクトリ手段と、前記キャッシュディレ
クトリ手段の索引結果によりキャツシュヒツト状態を判
定するキャツシュヒツト判定手段とを含み、リクエスト
に応答して前記キャッシュディレクトリ手段を索引する
第１ステージと、この索引結果を用いて前記キャツシュ
ヒツト判定手段によりキャツシュヒツト状態を判定する
第２ステージとを有するパイプライン処理方式のデータ
処理装置であって、キャッシミスヒットが判定されたリ
クエストのリクエスト情報を保持するキャッシュミスヒ
ツトリクエスト保持手段と、このキャッシュミスヒット
が判定されたリクエストのキャッシュ登録コンパートメ
ント情報を保持する手段と、このリクエストに続く後続
リクエストのリクエスト情報を前記第１及び第２ステー
ジ夫々で保持する手段と、前記第２ステージでリクエス
ト情報が保持されるリクエストに対する前記キャッシュ
ディレクトリの索引結果を保持する索引結果保持手段と
、前記第２ステージで保持されているリクエスト情報内
のアドレスと、前記キャッシュミスヒツトリクエスト保
持手段に保持されているリクエスト情報内のアドレスと
を比較する比較手段と、前記索引結果保持手段の内容が
キャッシュミスヒットを示しておりかつ前記比較手段の
比較結果か不一致を示している場合、前記キャッシュミ
スヒツトが判定されたリクエストに応答して前記主メモ
リからブロックデータを読出して前記キャッシュメモリ
に登録し、しかる後に前記第２ステージに保持されてい
るリクエスト情報内のアドレスと前記キャッシュミスヒ
ツトが判定されたリクエストの前記キャッシュ登録コン
パートメント情報とにより、前記第２ステ・−ジに保持
されているリクエストの所望のデータを前記キャッシュ
メモリから読出してリクエスト要求元へ送出する手段と
を有するとを特徴とするデータ処理装置が得られる。Furthermore, according to the present invention, there is provided a cache memory for holding a partial copy of the contents of the main memory in units of blocks, a cache directory means for recording a directory of contents held in the cache memory, and an index for the cache directory means. a first stage that indexes the cache directory means in response to a request, and a second stage that uses the index result to determine the cache hit status by the cache hit determination means using the index result; a pipeline processing type data processing device comprising: a cache miss request holding means for holding request information of a request for which a cache miss has been determined; and a cache registration compartment for the request for which a cache miss has been determined. means for holding information, means for holding request information of subsequent requests following this request in each of the first and second stages, and an index result of the cache directory for the request whose request information is held in the second stage. an index result holding means for holding an index result holding means; a comparison means for comparing an address in the request information held in the second stage with an address in the request information held in the cache miss request holding means; When the content of the index result holding means indicates a cache miss and the comparison result of the comparison means indicates a mismatch, block data is retrieved from the main memory in response to the request for which the cache miss has been determined. The second stage is read out and registered in the cache memory, and then the address in the request information held in the second stage and the cache registration compartment information of the request for which the cache miss has been determined are used in the second stage. There is obtained a data processing apparatus characterized in that the data processing apparatus comprises means for reading desired data of a request held in the cache memory from the cache memory and sending it to the request source.

火隻囮次に、本発明について図面を参照して説明する。fire ship decoy Next, the present invention will be explained with reference to the drawings.

第１図は本発明の一実施例のブロック図である図におい
て、ＬＡＲｌは要求元からのリクエストアドレスを受け
る論理アドレスレジスタ、ＴＬＢ２は論理アドレスから
実アドレスへのアドレス変換を行なうアドレス変換バッ
ファ、ＡＡ３，４はキャッシュディレクトリであるアド
レスアレイである。FIG. 1 is a block diagram of an embodiment of the present invention. In the diagram, LARl is a logical address register that receives a request address from a request source, TLB2 is an address translation buffer that converts an address from a logical address to a real address, and AA3 is a block diagram of an embodiment of the present invention. , 4 is an address array which is a cache directory.

比較器５はＬＡＲｌとＴＬＢ２の出力を比較する比較器
、比較器６．７はＴＬＢ２の出力とＡＡ３．４の出力を
それぞれ比較する比較器である。Comparator 5 is a comparator that compares the outputs of LARl and TLB2, and comparator 6.7 is a comparator that compares the outputs of TLB2 and AA3.4, respectively.

反転回路８．９は比較器６，７の出力をそれぞれ反転す
る反転回路、アンド回路１０は反転回路８゜９の出力の
論理積をとるアンド回路、ＦＦＩＩはアンド回路１０の
出力を受けるフリップフロップ、ＦＦ１２，１３は比較
器６，７の出力を夫々受けるフリップフロップである。An inverting circuit 8.9 is an inverting circuit that inverts the outputs of the comparators 6 and 7, an AND circuit 10 is an AND circuit that takes the logical product of the outputs of the inverting circuit 8.9, and FFII is a flip-flop that receives the output of the AND circuit 10. , FF12 and FF13 are flip-flops that receive the outputs of comparators 6 and 7, respectively.

ＰＡＲ１４はアドレス変換後の実アドレスを受ける実ア
ドレスレジスタ、ＤＡｌ、５．１６はキャッシュのデー
タを保持するキャッシュデータアレイ、ＣＭＡＲ１７は
キャッシュをミスしたリクエストの実アドレスを保持す
るキャッシュミスアドレスレジスタ、■ビット１８はＣ
ＭＡＲ１４に有効な情報が保持されていることを表示す
るフラグレジスタである。PAR14 is a real address register that receives the real address after address conversion, DAl, 5.16 is a cache data array that holds cache data, CMAR17 is a cache miss address register that holds the real address of a request that misses the cache, ■ bit 18 is C
This is a flag register that indicates that valid information is held in the MAR 14.

比較器１９は、ＣＭＡＲ１７に保持されている実アドレ
スのブロック外アドレスと、ＰＡＲ１４に保持されてい
る実アドレスのブロック外アドレスとを比較する比較器
、ナンド回路２０は、■ビット１８の出力と比較器１つ
の出力との論理積をとってその値を反転して出力するナ
ンド回路、アンド回路２１はこのナンド回路２０とＦＦ
１１の出力との論理積をとるアンド回路である。The comparator 19 compares the real address outside the block held in the CMAR 17 with the real address outside the block held in the PAR 14, and the NAND circuit 20 compares the output of the bit 18. The AND circuit 21, which is a NAND circuit that performs an AND with the output of one device, inverts the value, and outputs it, is connected to the NAND circuit 20 and the FF.
This is an AND circuit that performs logical product with the output of No. 11.

セレクタ２２はＤＡ１５．１６の出力を選択する選択回
路、ＤＡＲ２３はセレクタ２２で選択されたキャッシュ
の読出しデータを受けるデータアレイ読出しレジスタ、
ＭＡＲ２４はＰＡＲ１４から出力される実アドレスを受
けて、主メモリにデータのブロック転送要求のアドレス
を送出するメモリアドレスレジスタ、反転回路２５はＦ
Ｆ１１の出力を反転させる反転回路である。The selector 22 is a selection circuit that selects the output of DA15.16, and the DAR23 is a data array read register that receives read data from the cache selected by the selector 22.
MAR24 is a memory address register that receives the real address output from PAR14 and sends an address for a data block transfer request to the main memory.
This is an inverting circuit that inverts the output of F11.

ＬＡＲｌに保持される論理アドレスはセグメント部Ｓ、
ページ部Ｐ、ページ内アドレスＬから構成されており、
Ｓ及びＰかＴＬＢ２によって実アドレスに変換され、Ｌ
と合わせて実アドレスが生成される。このＴＬＢ２はＳ
及びＰの一部分でアドレスされ、Ｓ及びＰの残りの部分
がキ一部としてＴＬＢ２へ登録され、また論理アドレス
に対応する実アドレスがデータ部としてＴＬＢ２の同一
アドレスに登録される。The logical address held in LARl is the segment part S,
It consists of a page part P, an address L within the page,
S and P are converted to real addresses by TLB2, and L
A real address is also generated. This TLB2 is S
The remaining parts of S and P are registered as part of key in TLB2, and the real address corresponding to the logical address is registered as a data part at the same address in TLB2.

ＡＡ３，４はブロックを単位としてデータを登録するキ
ャッシュデータアレイのブロックアドレスを管理するキ
ャッシュディレクトリである。本実施例では、キャッシ
ュのブロックは６４バイトであるとする。ＡＡ３，４は
ＬＡＲｌのＬのうちの下位６ビツト（ブロック内アドレ
ス）を無視した上位のブロック外アドレス部でアドレス
され、残りのＳ、Ｐ部がアドレス変換を受けた実アドレ
スがキ一部として登録される。AA3 and AA4 are cache directories that manage block addresses of cache data arrays in which data is registered in units of blocks. In this embodiment, a cache block is assumed to be 64 bytes. AA3 and AA4 are addressed by the upper outside block address part ignoring the lower 6 bits (intra-block address) of L of LARl, and the remaining S and P parts are addressed by the real address that has undergone address conversion as a key part. be registered.

ＤＡ１５．１６はキャッシュディレクトリと対応するア
ドレス位置にブロックを単位としたデータをデータ部と
して登録するキャッシュデータアレイである。キャッシ
ュデータアレイからは、リクエスト要求元が要求するデ
ータの単位でデータが読出される。本実施例ではこのデ
ータの単位は８バイトであるとする。DA15.16 is a cache data array in which data in units of blocks is registered as a data portion at an address position corresponding to a cache directory. Data is read from the cache data array in units of data requested by the request source. In this embodiment, the unit of this data is assumed to be 8 bytes.

ＤＡ１５．１６はＰＡＲ１４のＬに対応する実アドレス
のうちの下位３ビツト（８バイト内アドレス）を無視し
た上位の８バイト外アドレス部でアドレスされ、対応す
る８バイトデータがそれぞれ読出される。DA15 and DA16 are addressed by the upper 8-byte external address part of the real address corresponding to L of PAR14, ignoring the lower 3 bits (address within 8 bytes), and the corresponding 8-byte data are read out.

本実施例のキャッシュ構成はＡＡ３とＤＡ１５とが、Ａ
Ａ４とＤ　Ａ　１６とがそれぞれ対応する２レベルキヤ
ツシユ構成である。即ち、ＡＡ３，４を同時に索引し、
ＤＡ１５及びＤＡ１６のうちのいずれに所望のデータが
登録されているかを決定し、ＤＡ１５，１６から読出さ
れたデータを選択してリクエスト要求元に送出する。Ａ
Ａ３，４の索引によって、所望のデータがＤＡ１５．１
６のいずれにも存在しないことが検出される（以下ギヤ
ッシュミスと言う）と、対応する実アドレスを主メモリ
に送出して、主メモリにデータのブロック転送を要求す
る。主メモリから読出されたブロックはキャッシュに登
録されると共に、要求データをリクエスト要求元へ送出
する。In the cache configuration of this embodiment, AA3 and DA15 are
This is a two-level cache configuration in which A4 and DA16 correspond to each other. That is, index AA3 and 4 at the same time,
It is determined which of the DA15 and DA16 the desired data is registered in, and the data read from the DA15 and DA16 is selected and sent to the request source. A
The desired data is found in DA15.1 by the index of A3 and 4.
If it is detected that the address does not exist in any of 6 (hereinafter referred to as a gear miss), the corresponding real address is sent to the main memory, and a block transfer of data is requested from the main memory. The block read from the main memory is registered in the cache, and the requested data is sent to the request source.

次に第１図の動作について詳細に説明する。演算部から
要求されたメモリアクセス論理アドレスはＬＡＲｌに受
付けられ、ＬＡＲｌのアドレスの一部でＴＬＢ２及びＡ
Ａ３，４がアドレスされる。Next, the operation shown in FIG. 1 will be explained in detail. The memory access logical address requested from the arithmetic unit is accepted by LARl, and part of the address of LARl is stored in TLB2 and A.
A3 and 4 are addressed.

ＴＬＢ２のアドレスはＳ及びＰの一部であり、残りのＳ
及びＰか比較器５で’Ｔ’　Ｌ　Ｂ　２から読出された
キ一部と比較される。比較の結果一致した場合は、ＴＬ
Ｂ２のデータ部すなわち実アドレスがＰＡＲ１４にセッ
トされる。ＬＡＲｌのＬはアドレス変換を受けずそのま
まＰＡＲ１４にセットされ、ＬＡＲｌに受付けられた論
理アドレスの実アドレスへのアドレス変換が終了する。The address of TLB2 is part of S and P, and the remaining S
and P is compared with the part read from 'T' L B 2 in comparator 5. If the comparison results match, the TL
The data portion of B2, ie, the real address, is set in PAR14. The L of LARl is set in PAR14 as it is without undergoing address translation, and the address translation of the logical address accepted by LARl into a real address is completed.

比較の結果不一致の場合は、−膜内によく知られている
ようにメモリ上のアドレス変換テーブル等を索引して論
理アドレスから実アドレスへの変換を行ない、結果か７
ｒ’　Ｌ　Ｂ　２に登録されて、実アドレスがＰＡＲ１
４にセットされアドレス変換が終了する。If the comparison results in a mismatch, - as is well known in the film, an address conversion table on the memory is indexed and the logical address is converted to a real address, and the result is 7.
It is registered in r' L B 2 and the real address is PAR1.
It is set to 4 and address translation ends.

ＡＡ３．４のアドレスはＬＡＲＩのＬのうちのブロック
外アドレス部であり、ＡＡ３．４に登録されている実ア
ドレスが比較器６，７にそれぞれ供給される。同時に、
ＴＬＢ２のデータ部から読出された実アドレスが比較器
６．７の相方に供給される。比較器６．７は比較の結果
一致の場合は論理１を出力し、不一致の場合は論理０を
出力する。The address of AA3.4 is the out-of-block address part of L of LARI, and the real addresses registered in AA3.4 are supplied to comparators 6 and 7, respectively. at the same time,
The real address read from the data section of TLB2 is supplied to a partner of comparator 6.7. Comparator 6.7 outputs logic 1 if the comparison results in a match, and outputs logic 0 if there is a mismatch.

今、比較器６で一致が検出されたとする。比較器６，７
の比較結果はそれぞれ１．２．１．３にセットされると
共に、反転回路８．９に供給される。Suppose now that comparator 6 detects a match. Comparators 6, 7
The comparison results are set to 1, 2, 1, and 3, respectively, and are supplied to the inverting circuit 8.9.

ＦＦ１２，１３はそれぞれ論理】、論理０にセットされ
る。反転回路８．９は比較器６．７の出力を反転させて
、それぞれ論理０．論理１を出力し、アンド回路１０に
出力する。FFs 12 and 13 are set to logic] and logic 0, respectively. The inverting circuit 8.9 inverts the output of the comparator 6.7 to a logic 0.0, respectively. It outputs a logic 1 and outputs it to the AND circuit 10.

アンド回［１０は反転回路８．９の出力の論理積をとり
、その結果をＦＦＩＩにセットする。従って、Ｆ　Ｆ　
１．１は論理０がセットされる。このＦＦ１ｌはキャツ
シュヒツトミスを示し、ヒツトの場合は論理０が、ミス
の場合は論理１が夫々セットされる。同時に実アドレス
がＰＡＲｌ　４にセットされると、ＰＡＲｌ４のＬに対
応する実アドレスのうちの８バイト外アドレスでＤＡｌ
、５．１６かアドレスされ、ＤＡｌ５，１６から８バイ
トデータが同時に読出されてセレクタ２２に供給される
。AND circuit [10 performs the logical product of the outputs of the inverting circuit 8.9 and sets the result to FFII. Therefore, F F
1.1 is set to logic 0. This FF1l indicates a cash hit miss; logic 0 is set in the case of a hit, and logic 1 is set in the case of a miss. At the same time, when the real address is set to PARl4, DAl is set at an address outside 8 bytes of the real address corresponding to L of PARl4
, 5.16 are addressed, and 8-byte data is simultaneously read from DAl5 and DAl16 and supplied to the selector 22.

ＦＦ１２．１３の出力である論理１．論理０はセレクタ
２２に供給され、その結果セレクタ２２ではＤＡｌ５か
ら読出された８バイトデータか有効となり、これが選択
されてＤＡＲ２３に供給される。Logic 1. which is the output of FF12.13. The logic 0 is supplied to the selector 22, and as a result, the 8-byte data read from the DAl5 becomes valid in the selector 22, which is selected and supplied to the DAR23.

ＦＦ１１の出力である論理０は反転回路２５で反転され
て論理１が出力され、ＤＡＲ２３に供給されてＤＡＲ２
３にセットされたＤＡｌ５からの読出しデータが有効デ
ータとしてリクエスト要求元に送出される。The logic 0 that is the output of the FF11 is inverted by the inverting circuit 25 and a logic 1 is output, which is supplied to the DAR 23 and output from the DAR 2.
The read data from DAl5 set to 3 is sent to the request source as valid data.

ＰＡＲｌ４にセットされた実アドレスはＣＭＡＲ１７，
ＭＡＲ２４に供給されるが、ＦＦＩＩの論理Ｏかアンド
回路２１に供給され、このアンド回路は論理積の結果と
して論理Ｏを出力し、それぞれ■ピッ１〜１８．結果Ｖビット１８はＣＭＡＲ１７の内容が有効でないこ
とを表示する論理０にセットされ、またＭＡＲ２４の内
容は無効にされ、主メモリに対しては何の゛動作指示も
行なわれない。The real address set in PARl4 is CMAR17,
The logic O of the FFII is supplied to the MAR 24, but the logic O of the FFII is also supplied to the AND circuit 21, which outputs the logic O as a result of the logical product, and the pins 1 to 18, respectively. As a result, V bit 18 is set to a logic zero indicating that the contents of CMAR 17 are not valid, and the contents of MAR 24 are invalidated and no operations are directed to main memory.

比較器７で一致が検出された場合にも、同様にしてＤＡ
ｌ６から読出された８バイトデータが有効データとして
ＤＡＲ２３を通してリクエスト要求元に出力される。Similarly, when a match is detected by the comparator 7, the DA
The 8-byte data read from l6 is output as valid data to the request source through DAR23.

次に、本発明の特徴的動作である連続するリクエスト１
．２が共にキャッシュをミスする場合の動作について詳
細に説明する。リクセスト１の論理アドレスがＬＡＲｌ
に受付けられると、ＴＬＢ２でアドレス変換が行なわれ
実アドレスがＰＡＲｌ４にセットされる。ＡＡ３，４の
索引ではキャッシュミスが検出される。即ち、比較器６
，７は共に比較結果不一致を示す論理０を出力してＦＦ
１２、１３に共に論理０をセットする。反転回路８　９
は比較器６．７の出力を夫々反転させて共に論理１を出
力し、アンド回路１０は論理積の結果としてキャッシュ
ミスを示す論理１を出力してＦＦＩＩに論理１をセット
する。Next, continuous request 1, which is the characteristic operation of the present invention,
．． 2 both miss the cache will be described in detail. The logical address of request 1 is LARl
When the address is accepted, TLB2 performs address translation and the real address is set in PARl4. A cache miss is detected in the indexes of AA3 and AA4. That is, comparator 6
, 7 both output logic 0 indicating that the comparison result does not match, and the FF
Both 12 and 13 are set to logic 0. Inverting circuit 8 9
inverts the outputs of the comparators 6 and 7 and outputs a logic 1, and the AND circuit 10 outputs a logic 1 indicating a cache miss as a result of the logical product, and sets the logic 1 in FFII.

この時、同時にリクエスト２の理論アドレスがＬＡＲｌ
に受付けられる。ＰＡＲｌ４にリクエスト１の実アドレ
スがセットされると、ＤＡ１５１６がアドレスされる，
この時、ＦＦＩＩが論理１を出力していることにより、
反転回路２５では反転結果としてＤＡＲ２３の内容を無
効とする論理０を出力し、ＤＡｌ５．１６からの読出し
データはいずれも無効となり、ＤＡＲ２３からリクエス
ト要求元へのデータ転送は抑止される。At this time, the theoretical address of request 2 is LARl.
will be accepted. When the real address of request 1 is set in PARl4, DA1516 is addressed.
At this time, since FFII is outputting logic 1,
The inversion circuit 25 outputs a logic 0 that invalidates the contents of the DAR 23 as the inversion result, all read data from the DAl 5.16 becomes invalid, and data transfer from the DAR 23 to the request source is inhibited.

ＰＡＲｌ４とＣＭＡＲ１７とブロック外アドレスが比較
器１９で比較されるか、今、■ビット１８はＣＭＡＲ１
７の内容が無効であることを示す論理０を出力しており
、それがナンド回路２０に供給されているために、ナン
ド回路２０の出力は強制的に論理１となってアンド回路
２１に供給される。その結果アンド回路２１ではＦＦ１
１の出力がそのまま有効となり、キャッシュミスを示ず
論理１が出力されてＶビット１８及びＭＡＲ２４に供給
される。PARl4, CMAR17, and the out-of-block address are compared in comparator 19, and now ■ bit 18 is CMAR1
7 is invalid, and since this is supplied to the NAND circuit 20, the output of the NAND circuit 20 is forced to become a logic 1 and is supplied to the AND circuit 21. be done. As a result, in the AND circuit 21, FF1
The output of 1 remains valid, indicating a cache miss, and a logic 1 is output and supplied to the V bit 18 and MAR 24.

ＰＡＲｌ　４のリクエスト１の実アドレスはＣＭＡＲ１
７にセットされ、■ビット１８は、ＣＭＡＲ１７の内容
が有効であること表示する論理１にセットされる。同時
にＰＡＲｌ４のリクエスト１の実アドレスはＭＡＲ２４
にセットされ、アンド回路２１の論理１の出力により、
その内容が有効となり、主メモリにリクエスト１のデー
タのブロック転送要求が送出される。The real address of request 1 of PARl 4 is CMAR1
bit 18 is set to a logic 1 indicating that the contents of CMAR17 are valid. At the same time, the real address of request 1 of PARl4 is MAR24
is set to , and the logic 1 output of the AND circuit 21 causes
The contents become valid, and a block transfer request for the data of request 1 is sent to the main memory.

キャッシュミスしたリクエスト１の実アドレスがＣＭＡ
Ｒ，１７及びＭＡＲ２４にセットされるのと同時に、後
続のリクエスト２がＴＬＢ２で実アドレスに変換されて
ＰＡＲｌ４にセットされる。The real address of request 1 that caused a cache miss is CMA
At the same time as being set in R,17 and MAR24, the subsequent request 2 is translated into a real address by TLB2 and set in PAR14.

さらに、ＡＡ３．４を索引した結果がＦＦ１１１２１３
にそれぞれセットされる。リクエスト１の場合と同様に
、リクエスト２もＡＡ３，４をミスヒツトし、ＦＦ１ｌ
、１２．１３にはそれぞれ論理１．論理０．論理Ｏがセ
ットされる。従って、ＰＡＲｌ　４にセットされたリク
エスト２の実アドレスによって、ＤＡ１５．１６から読
出されたデータはいずれも無効となり、ＤＡＲ２３から
リクエスト要求元へのデータ転送は抑止される。Furthermore, the result of indexing AA3.4 is FF111213
are set respectively. Similar to request 1, request 2 also misses AA3 and 4, and FF1l
, 12.13 have logic 1. Logic 0. Logic O is set. Therefore, the real address of request 2 set in PARl 4 invalidates any data read from DA 15.16, and data transfer from DAR 23 to the request source is inhibited.

この時、同時にリクエスト１の実アドレスを保持するＣ
ＭＡＲ１７のブロック外アドレスとＰＡＲｌ４にセット
されたリクエスト２の実アドレスのブロック外アドレス
とが比較される。■ビット１８は論理１を出力している
ために、ナンド回路２０では比較器１９の出力結果を反
転させた結果がそのまま有効となって出力されアンド回
路２１に供給される。At this time, the C that holds the real address of request 1 at the same time
The out-of-block address of MAR17 and the out-of-block address of the real address of request 2 set in PARl4 are compared. (2) Since the bit 18 outputs a logic 1, the NAND circuit 20 inverts the output result of the comparator 19 and outputs it as valid and supplied to the AND circuit 21.

比較の結果、一致の場合は比較器１９は論理１を出力し
、ナンド回路２０で反転を受けて論理０がアンド回路２
１に供給される。この場合は、ブロック外アドレスが一
致している場合であり、リクエスト２の要求データは先
行して主メモリにデータのブロック転送を要求したリク
エスト１のブロック中に存在していること示しており、
リクエスト２に対する主メモリへのデータのブロック転
送要求は抑止される。As a result of the comparison, if there is a match, the comparator 19 outputs a logic 1, which is inverted by the NAND circuit 20 and a logic 0 is output from the AND circuit 2.
1. In this case, the out-of-block addresses match, indicating that the requested data of request 2 exists in the block of request 1, which previously requested a block transfer of data to the main memory.
The request for request 2 to transfer a block of data to the main memory is suppressed.

つまり、ナンド回路２０からアンド回路２１へ供給され
る論理０によって、キャッシュミスを示すＦＦ１１の出
力の論理１が、アンド回路２１による論理積の結果とし
て、キャッシュをヒツトした場合と同様の論理０かアン
ド回路２１から出力されることにより、リクエスト２の
実アドレスを受けるＭＡＲ２４の内容が無効化されて、
主メモリへのデータのブロック転送要求は抑止される。In other words, due to the logic 0 supplied from the NAND circuit 20 to the AND circuit 21, the logic 1 output from the FF 11 indicating a cache miss is changed to a logic 0, which is the same as when the cache is hit, as a result of the AND circuit 21. By outputting from the AND circuit 21, the contents of the MAR 24 that receives the real address of request 2 are invalidated,
Requests to transfer blocks of data to main memory are suppressed.

比較器１つでの比較の結果が不一致の場合は、比較器１
つは論理０を出力し、ナンド回路２０で反転を受けて論
理１がアンド回路２１に供給される。この場合は、ブロ
ック外アドレスが不一致の場合であり、リクエスト２の
要求データは先行して主メモリにデータのブロック転送
を要求したリクエスト１のブロック中には存在していな
いことを示しており、ＡＡ３，４をミスしている場合に
は、そのまま主メモリにデータのブロック転送要求を行
なうことが可能である。If the comparison result with one comparator is inconsistent, comparator 1
One outputs a logic 0, which is inverted by a NAND circuit 20 and a logic 1 is supplied to an AND circuit 21. In this case, the out-of-block addresses do not match, indicating that the requested data of request 2 does not exist in the block of request 1, which previously requested a block transfer of data to the main memory. If AA3 and AA4 are missed, a data block transfer request can be directly made to the main memory.

今、ナンド回路２０からアンド回路２１へ論理１が供給
されていることにより、キャッシュのミスを示ずＦＦＩ
Ｉの出力の論理１と論理積がとられ、その結果としてア
ンド回路２１から主メモリにデータのブロック転送要求
を行う許可信号が出力されてＭＡＲ２４に供給される。Now, since a logic 1 is being supplied from the NAND circuit 20 to the AND circuit 21, there is no cache miss and the FFI
The logical product of the output of I and the logic 1 is taken, and as a result, a permission signal is outputted from the AND circuit 21 to request a data block transfer to the main memory, and is supplied to the MAR 24.

ＭＡＲ２４にはＰＡＲｌ４にセットされているリクエス
ト２の実アドレスがセットされ、主メモリに対してリク
エスト２のデータのブロック転送要求が先行するりクエ
スト１のデータのブロック転送要求に連続して送出され
る。The real address of request 2 set in PARl4 is set in MAR24, and a block transfer request for request 2 data is sent to the main memory in advance or in succession to a block transfer request for quest 1 data. .

この様に、先行して主メモリにデータのブロック転送を
要求したリクエストのリクエスト情報を保持し、この保
持されたリクエストのブロック外アドレスと後続のリク
エストのブロック外アドレスとを比較することにより、
後続のリクエストかキャッシュをミスしており、かつ先
行して主メモリにデータのブロック転送を要求したブロ
ック中に所望のデータが存在しているか否かを高速に検
出することにより、システムの処理性能を向上できるの
である。In this way, by holding the request information of the request that previously requested a block transfer of data to the main memory and comparing the out-of-block address of this held request with the out-of-block address of the subsequent request,
The processing performance of the system is improved by quickly detecting whether the desired data exists in the block for which a subsequent request misses the cache and the block of data previously requested to be transferred to the main memory. can be improved.

第２図は本発明の他の実施例のブロック図である。図に
おいて、ＰＡＲｌ４はキャッシュディレクトリを索引す
るための実アドレスを受ける実アドレスレジスタ、３．
４はキャッシュディレクトリであるアドレスアレイ、比
較器６，７はＰＡＲ１４とＡＡ３，４の出力をそれぞれ
比較する比較器である。FIG. 2 is a block diagram of another embodiment of the invention. In the figure, PARl4 is a real address register that receives a real address for indexing the cache directory; 3.
4 is an address array that is a cache directory, and comparators 6 and 7 are comparators that compare the outputs of PAR 14 and AA 3 and 4, respectively.

ＦＦ１２，１３は比較器６．７の比較結果、即ちキャッ
シュディレクトリの索引結果を受けるフリップフロップ
、ＰＡＲ２７はキャッシュデータアレイを索引するため
の実アドレスをＰＡＲ１４から受ける実アドレスレジス
タ、ＤＡ］、５．１６は主メモリのデータの写しを保持
するキャッシュデータアレイである。FF12 and FF13 are flip-flops that receive the comparison result of the comparator 6.7, that is, the cache directory index result; PAR27 is a real address register that receives the real address for indexing the cache data array from PAR14, DA], 5.16 is a cache data array that holds a copy of the data in main memory.

反転回路８，９はＦＦ１２，１３の出力をそれぞれ反転
する反転回路、アンド回路１０は反転回路８，９の出力
の論理積をとるアンド回路である。The inverting circuits 8 and 9 are inverting circuits that invert the outputs of the FFs 12 and 13, respectively, and the AND circuit 10 is an AND circuit that performs a logical product of the outputs of the inverting circuits 8 and 9.

ＭＡＲ２４はキャッシュミスした場合の実アドレスをＰ
ＡＲ２７から受けて主メモリにデータのブロック転送要
求のアドレスを送出するメモリアドレスレジスタ、セレ
クタ２２はＤＡ１５．１６の出力データをＦＦ１２，１
３の出力結果に従って選択する選択回路、ＤＡＲ２３は
セレクタ２２によって選択されたキャッシュからの読出
しデータを受けるデータアレイ読出しレジスタである。MAR24 sets the real address in case of cache miss as P
A memory address register that receives the address of the data block transfer request from the AR27 and sends it to the main memory.The selector 22 transfers the output data of the DA15.16 to the FF12,1
DAR 23 is a data array read register that receives read data from the cache selected by selector 22.

ＦＦ１１はアンド回路１０の出力を受けるフリップフロ
ップ、ＣＭＡＲ１７はキャッシュミスヒツトしたリクエ
ストの実アドレスを保持するキャッシュミスヒツトアド
レスレジスタ、■ビット１８はＣＭＡＲ１７に有効な情
報が保持されていることを表示するフラグレジスタ、比
較器１９はＣＭＡＲ１７に保持されている実アドレスの
ブロック外アドレスと、ＰＡＲ２７に保持されている実
アドレスのブロック外アドレスを比較する比較器である
。FF11 is a flip-flop that receives the output of the AND circuit 10, CMAR17 is a cache miss address register that holds the actual address of a request that has a cache miss, and ■Bit 18 is a flag that indicates that valid information is held in CMAR17. The register and comparator 19 are comparators that compare the real address outside the block held in the CMAR 17 and the real address outside the block held in the PAR 27.

ナンド回路２０はＶビット１８の出力と比較器１９の出
力の論理積をとってその値を反転して出力するナンド回
路、アンド回路２１はナンド回路２０とアンド回路１０
の出力の論理積をとるアンド回路、パイプライン周期制
御回路２６は各パイプラインの同期制御を司る回路であ
る。The NAND circuit 20 is a NAND circuit that takes the logical product of the output of the V bit 18 and the output of the comparator 19, inverts the value, and outputs it, and the AND circuit 21 is a combination of the NAND circuit 20 and the AND circuit 10.
The pipeline cycle control circuit 26, which is an AND circuit that takes the logical product of the outputs of the pipeline period control circuit 26, is a circuit that controls the synchronization of each pipeline.

ステージ１はキャッシュディレクトリの索引を行なうス
テージであり、ステージ２はキャッシュデータアレイの
索引とキャツシュヒツト、ミスヒットを判定するステー
ジである。Stage 1 is a stage for indexing the cache directory, and stage 2 is a stage for determining cache data array indexes, cache hits, and miss hits.

キャッシュが索引される場合、ＡＡ３．４が同時に索引
され、ＤＡ１５，１．６のうちのいずれに所望のデータ
が登録されているかを決定して、ＤＡ１５，１．６から
読出されたデータを選択してリクエスト要求元に送出す
る。ＡＡ３．４の索引によって、所望のデータがＤＡ１
５，１６のいずれにも存在しないことが検出されると、
対応する実アドレスを主メモリに送出して、主メモリに
データのブロック転送を要求する。主メモリから読出さ
れたブロックはキャッシュに登録されると共に、要求デ
ータをリクエスト要求元へ送出する。When the cache is indexed, AA3.4 is indexed at the same time, it is determined in which of DA15 and 1.6 the desired data is registered, and the data read from DA15 and 1.6 is selected. and sends it to the request source. The desired data is located in DA1 by the AA3.4 index.
If it is detected that it does not exist in either 5 or 16,
Sends the corresponding real address to main memory to request a block transfer of data from main memory. The block read from the main memory is registered in the cache, and the requested data is sent to the request source.

以上のように本実施例では、キャッシュは２コンパート
メント構成を取っているが、実際には４コンパートメン
トから１６コンパートメント構成を取るキャッシュが一
般である。As described above, in this embodiment, the cache has a two-compartment configuration, but in reality, caches generally have a four-compartment to 16-compartment configuration.

次に、第２図の動作について詳細に説明する。Next, the operation shown in FIG. 2 will be explained in detail.

リクエスト要求元から送出されたメモリアクセスの論理
アドレスは、第１図に示した様な既知構成のアドレス変
換制御部によって実アドレスに変換されて、ＰＡＲ１４
にセラｌ−される。ＰＡＲ１４に実アドレスがセットさ
れると、実アドレスのブロック外アドレス部の下位のセ
ットアドレス部でＡＡ３，４が索引され、同時にブロッ
ク外アドレスの残りのキーアドレス部が比較器６．７に
供給されてＡＡ３，４から読出されたキーアドレス部と
の比較が行なわれる。The logical address for memory access sent from the request source is converted into a real address by an address conversion control unit with a known configuration as shown in FIG.
Sera l- is carried out. When the real address is set in PAR14, AA3 and AA4 are indexed in the lower set address part of the out-of-block address part of the real address, and at the same time, the remaining key address part of the out-of-block address is supplied to the comparator 6.7. Then, a comparison is made with the key address part read from AA3 and AA4.

比較器６．７は比較の結果か一致の場合は論理１を出力
し、不一致の場合は論理Ｏを出力してその値をＦＦ１２
，１３にそれぞれセットする。この時同時に、ＰＡＲ１
４の実アドレスはＰＡＲ２７にセットされる。Comparator 6.7 outputs logic 1 if the comparison result is a match, and outputs logic O if it does not match, and sends the value to FF12.
, 13, respectively. At this time, PAR1
The real address of 4 is set in PAR27.

今、比較器６側で一致が検出され、ＦＦ１２゜１３にそ
れぞれ論理１、論理Ｏがセットされたとする。ＤＡ１５
，１６ではＰＡＲ２７によってアドレスされ、その索引
結果として夫々８バイトデータが読出されてセレクタ２
２に供給される。そしてＦＦ１２，１３の値に従ってＤ
Ａ１５の８バイトデータが選択されてＤＡＲ２３にセッ
トされる。Assume now that a match is detected on the comparator 6 side and logic 1 and logic O are set in FFs 12 and 13, respectively. DA15
, 16 are addressed by PAR 27, and 8-byte data is read out as the index result and sent to selector 2.
2. Then, D according to the values of FF12 and FF13.
The 8-byte data of A15 is selected and set in DAR23.

一方間時に、ＦＦ１２，１３の出力は反転回路８９に出
力され、それぞれ反転を受けた結果がアンド回路１０に
供給される。今の場合は、それぞれ論理Ｏ１論理１が供
給され、よってアンド回路ｌＯからは論理０が出力され
る。アンド回路１０の出力はキャツシュヒツト、ミスヒ
ツトを示しており、ヒツトの場合は論理０を、ミスヒツ
トの場合は論理１を夫々出力する。この出力はＦＦ１１
にセットされ、リクエスト要求元にＤＡＲ２３のデータ
が有効か無効かのフラグとして供給される。今の場合は
、キャツシュヒツトの場合であるから、ＤＡＲ２３のデ
ータと共にＦＦ１１から論理Ｏが供給される。On the other hand, the outputs of the FFs 12 and 13 are outputted to the inverting circuit 89, and the respective inverted results are supplied to the AND circuit 10. In this case, logic O1 and logic 1 are respectively supplied, so that logic 0 is output from AND circuit IO. The output of the AND circuit 10 indicates a hit or a miss; a logic 0 is output in the case of a hit, and a logic 1 is output in the case of a miss. This output is FF11
, and is supplied to the request source as a flag indicating whether the data in the DAR 23 is valid or invalid. In this case, since it is a cash hit case, a logic O is supplied from the FF 11 along with the data of the DAR 23.

ＰＡＲ２７にセットされた実アドレスは、ＣＭＡＲｌ７
、ＭＡＲ，２４にも供給されるが、アンド回路１０の出
力の論理Ｏがアンド回路２１に供給されているため、ア
ンド回路２１は論理０を出力して、■ビット１８及びＭ
ＡＲ２４に供給する。The real address set in PAR27 is CMARl7.
, MAR, 24, but since the logic O of the output of the AND circuit 10 is supplied to the AND circuit 21, the AND circuit 21 outputs logic 0, and bits 18 and M
Supply to AR24.

その結果、■ビット１８はＣＭＡＲｌ７の内容が有効で
ないことを表示する論理０にセットされ、またＭＡＲ２
４の内容は無効化され、主メモリに対しては何の動作指
示も行なわれない。As a result, ■ bit 18 is set to logic 0 indicating that the contents of CMAR17 are not valid, and MAR2
The contents of 4 are invalidated, and no operation instruction is given to the main memory.

次に、本実施例の特徴的動作である連続するりクエスト
１，２が共にキャッシュをミスする場合の動作について
詳細に説明する。リクエスト１はキャッシュをミスする
ために、リクエスト１の実アドレスがＰＡＲ２７にセッ
トされたタイミングでＦＦ１２，１３には共に論理Ｏが
セットされる。Next, the characteristic operation of this embodiment, which is the operation when consecutive quests 1 and 2 both miss the cache, will be described in detail. Since request 1 misses the cache, logical O is set in both FFs 12 and 13 at the timing when the real address of request 1 is set in PAR 27.

この時、同時に後続のリクエスト２かＰＡＲ１４にセッ
トされる。At this time, the subsequent request 2 or PAR14 is set at the same time.

以下、ステージ２でのリクエスト１のミスヒツト動作に
ついて説明する。アンド回路１０ではキャッシュミスヒ
ツトを検出して論理１を出力し、アンド回路２１．ＦＦ
ＩＩに供給する。ＦＦＩＩからはＤＡＲ２３中のデータ
が無効であることがリクエスト要求元に通知される。The mishit operation of request 1 in stage 2 will be described below. AND circuit 10 detects a cache miss and outputs logic 1, and AND circuit 21 . FF
Supply to II. The FFII notifies the request source that the data in the DAR 23 is invalid.

今、■ビット１８は０にセットされているため、ナンド
回路２０はアンド回路２１へ論理１を出力する。その結
果、アンド回路２１はアンド回路１０の出力の論理１と
の論理積の結果として論理１を出力し、ＭＡＲ２４とｖ
ビット１８に供給する。Since the ■ bit 18 is now set to 0, the NAND circuit 20 outputs a logic 1 to the AND circuit 21. As a result, the AND circuit 21 outputs a logic 1 as a result of the logical product of the output of the AND circuit 10 and the logic 1, and the MAR 24 and v
Supply bit 18.

従って、キャッシュミスヒツトを起こしたＰＡＲ２７の
リクエスト１の実アドレスはＭＡＲ２４に有効アドレス
としてセットされ、主メモリにリクエスト１のデータの
ブロック転送要求が送出される。また、ＰＡＲ２７のリ
クエスト１の実アドレスはＣＭＡＲｌ　７にセラ１〜さ
れ、同時にＶビット１８も論理１にセットされてＣＭＡ
Ｒｌ７のアドレスが有効であることを表示する。Therefore, the real address of request 1 in PAR 27 that caused the cache miss is set as a valid address in MAR 24, and a block transfer request for the data of request 1 is sent to the main memory. Also, the real address of request 1 of PAR27 is set to CMAR17, and at the same time, V bit 18 is also set to logic 1 and CMAR1 is set to logic 1.
Indicates that the address of Rl7 is valid.

ところで、あるリクエストでキャッシュのミスヒツトが
起きた場合には、後続のリクエストは各パイプラインス
テージ上で同期的に停止させなければならない。そのた
めには、キャッシュのミスヒツトが検出されたアンド回
路１０の出力を直接使用できればよいのであるが、各パ
イプラインステージを同期させる制御信号は、図示して
はいないが、非常に複雑な論理を取った後に生成される
。By the way, if a cache miss occurs in a certain request, subsequent requests must be stopped synchronously at each pipeline stage. To do this, it would be sufficient to directly use the output of the AND circuit 10 that detected the cache miss, but the control signals for synchronizing each pipeline stage require very complex logic (not shown). generated after

そのために、本実施例のように非常に高速なマシンサイ
クルで動作する装置では、アンド回路１０の出力を直接
使用していたのでは、その制御信号の遅延時間か間にあ
わなくなる。Therefore, in a device operating at a very high speed machine cycle like the present embodiment, if the output of the AND circuit 10 were directly used, the delay time of the control signal would not be enough.

従って本実施例では、アンド回路１０の出力はパイプラ
イン同期制御回Ｆ＃１２６に供給され、同回路２６内で
パイプライン同期制御信号を生成し、同回路２６中の図
示されてはいないフリップフロップで同制御信号を受け
た後各パイプラインステージに供給される。Therefore, in this embodiment, the output of the AND circuit 10 is supplied to the pipeline synchronization control circuit F#126, which generates a pipeline synchronization control signal, and the flip-flop (not shown) in the circuit 26 generates a pipeline synchronization control signal. After receiving the same control signal, it is supplied to each pipeline stage.

従って、リクエスト１でキャッシュミスヒツトが起こる
場合に、同期制御回路２６からパイプライン同期制御信
号が発せられたタイミングでは、リクエスト１の実アド
レスはすでにＣＭＡＲｌ７及びＭＡＲ２４にセットされ
ており、後続のリクエスト２の実アドレスはＰＡＲ２７
に、さらに新たな後続のリクエスト３の実アドレスはＰ
ＡＲ１４に夫々セットされている。Therefore, when a cache miss occurs in request 1, the real address of request 1 has already been set in CMARl7 and MAR24 at the timing when the pipeline synchronization control signal is issued from the synchronization control circuit 26, and the real address of request 1 is already set in CMARl7 and MAR24, and subsequent request 2 The real address is PAR27
, the real address of the new subsequent request 3 is P
Each is set in AR14.

この状態で、リクエスト２かキャッシュをミスヒツトし
ておりかつ先行して出されたリクエスト１のブロックデ
ータ中にも所望のデータか存在しないことを検出するた
めに、従来技術を使用する場合を考える。まず、リクエ
スト１のアドレス情報を対応するＡＡ３または４に登録
した後でＰＡＲｌ４に保持されているリクエスト３の実
アドレスをどこかに退避させなければならない。そして
その後でＰＡＲ２７のリクエスト２の実アドレスをＰＡ
Ｒ，１４に移し、ＡＡ３．４の再索引を行なう。そして
キャッシュミスヒツトが検出されると、リクエスト２の
実アドレスをＰＡＲ２７，ＭＡＲ２４を経由して主メモ
リに供給し、その後でリクエスト３の実アドレスをＰＡ
Ｒｌ　４に回復させなければならない。In this state, let us consider a case where the conventional technique is used to detect that request 2 misses the cache and desired data does not exist in the block data of request 1 issued previously. First, after registering the address information of request 1 in the corresponding AA3 or AA4, the real address of request 3 held in PAR14 must be saved somewhere. Then, the real address of request 2 of PAR27 is PA
R, 14 and performs re-indexing of AA3.4. When a cache miss is detected, the real address of request 2 is supplied to the main memory via PAR27 and MAR24, and then the real address of request 3 is supplied to the PA
Must be restored to Rl 4.

また、リクエスト２かキャツシュヒツトした場合には、
主メモリにブロック転送要求を行なえないために、リク
エスト２は先行するリクエストのブロック転送処理が終
了した後で、その処理を再開させなければならす、この
場合には退避させたリクエスト３との間で新たなリクエ
スト再開回復処理が必要になる。Also, if request 2 or catshhit is made,
Since a block transfer request cannot be made to the main memory, request 2 must restart its processing after the block transfer processing of the preceding request is completed. New request restart recovery processing is required.

このように、本実施例の装置に従来技術を用いるのは、
非常な制御の複雑さと相当のハードウェア量を必要とし
好ましくない。In this way, the use of the conventional technology in the device of this embodiment is as follows:
This is undesirable because it requires great control complexity and a considerable amount of hardware.

本発明によれば、この場合には、後続のリクエスト２及
びリクエスト３はステージ２及びリクエスト３に保持し
たままで制御が可能となる。According to the present invention, in this case, subsequent requests 2 and 3 can be controlled while being held in stages 2 and 3.

次に本発明の制御動作について述べる。今、パイプライ
ン同期制御信号により後続のリクエスト２及びリクエス
ト３の実アドレスはＰＡＲ２７及びＰＡＲｌ４で保持さ
れており、リクエスト２による３、４の索引結果はＦＦ
１２，１３で保持されている。Next, the control operation of the present invention will be described. Now, the real addresses of subsequent requests 2 and 3 are held in PAR27 and PAR14 by the pipeline synchronization control signal, and the index results of 3 and 4 by request 2 are FF
12 and 13 are held.

まず、ＣＭＡＲ１７に保持されている、先行して主メモ
リにブロック転送要求を行なったリクエスト１の実アド
レスのブロック外アドレスとＰＡＲ２７に保持されてい
るリクエスト２の実アドレスのブロック外アドレスとか
比較器１９で比較される。リクエスト２の実アドレスは
先行して出されたリクエスト１のブロックアドレスに不
一致のケースであるから、比較器１９は不一致を示す論
理Ｏを出力し、ナンド回路２０に供給する。従って、ナ
ンド回路２０は論理１を出力してアンド回路２１に供給
する。First, the comparator 19 compares the out-of-block address of the real address of request 1 that previously made a block transfer request to the main memory held in the CMAR 17 and the out-of-block address of the real address of request 2 held in the PAR 27. are compared. Since the real address of request 2 does not match the block address of request 1 issued previously, comparator 19 outputs a logic O indicating a mismatch and supplies it to NAND circuit 20 . Therefore, the NAND circuit 20 outputs a logic 1 and supplies it to the AND circuit 21.

リクエスト２はキャッシュミスヒツトするケースである
ので、リクエスト２によるＡＡ３，４の索引結果を保持
するＦＦ１２，１３にはそれぞれ論理０か保持されてお
り、アンドゲート１０はキッシュをミスしたことを示す
論理１を出力してアントゲ−１・２１に供給する。よっ
てアンドゲート２１は論理１を出力してＭＡＲ２４に出
力する。Since request 2 is a case of a cache miss, the FFs 12 and 13 that hold the index results of AA3 and 4 by request 2 each hold a logic 0, and the AND gate 10 holds a logic 0 indicating that the quiche has been missed. It outputs 1 and supplies it to Antogame 1/21. Therefore, the AND gate 21 outputs a logic 1 and outputs it to the MAR 24.

このアンドゲート２１の論理１の信号は、リクエスト２
かキャッシュをミスしておりかつ先行して出されたリク
エスト１のブロックデータ中にも所望のデータが存在し
ていないことを示しており、リクエスト２の主メモリへ
のブロック転送要求を連続して発行可能であることを示
している。The logic 1 signal of this AND gate 21 is the request 2
This indicates that the request 1 has missed the cache and the desired data does not exist in the block data of request 1 issued previously, and the block transfer request to the main memory of request 2 is Indicates that it can be issued.

従って、ＰＡＲ２７中のリクエスト２の実アドレスはＭ
　Ａ　Ｒ，２４で受付りられて、主メモリへブロック転
送要求か発せられる。リクエスト１とリクエスト２のブ
ロック転送の処理中は、ＤＡ１５゜１６からのデータは
使用されず、リクエスト１゜２のブロックデータがＤＡ
１５または１６に登録される時に同時に、所望のデータ
がＤＡＲ２３を通してリクエスト要求元に送出される。Therefore, the real address of request 2 in PAR27 is M
It is accepted at AR, 24, and a block transfer request is issued to the main memory. During the processing of block transfers for request 1 and request 2, data from DA15゜16 is not used, and block data from request 1゜2 is transferred to DA.
15 or 16, the desired data is simultaneously sent to the request source through the DAR 23.

そしてリクエスト１．リクエスト２のブロック転送処理
中にリクエスト１とリクエスト２のアドレス情報が図示
されていない別パスで対応するＡＡ３または４に登録さ
れる。リクエスト１とリクエスト２のブロック転送処理
中の間は、後続のリクエスト３はＰＡＲｌ４中で待たさ
れる。And request 1. During the block transfer process of request 2, the address information of requests 1 and 2 is registered in the corresponding AA 3 or 4 through separate paths (not shown). While request 1 and request 2 are undergoing block transfer processing, subsequent request 3 is kept waiting in PAR14.

尚、ＣＭＡＲ１７とＰＡＲ２７のブロック外アドレスが
一致することが比較器１つで検出された場合、即ち先行
して出されたリクエスト１のブロックデータ中にリクエ
スト２の所望のデータか存在していることが検出された
場合には、比較器１９は論理１を出力するため、ナンド
回路２０は論理Ｏを出力してアンド回路２１の出力が論
理Ｏとなり、その結果、リクエスト２の主メモリに対す
るブロック転送要求は送出されたい。そして、リクエス
ト１のブロックデータが主メモリより転送されてきて、
リクエスト１の所望のデータがリクエスト要求元に送出
され、ブロックデータがＤＡ１５または１６に登録され
た後、このフロックデ−タが登録された側のＤＡか索引
されてリクエスト２の所望のデータが読出され、リクエ
スト要求元に送出される。そしてこの処理が終了するま
では一連のキャッシュミスヒツト処理中として扱われる
。Note that if one comparator detects that the out-of-block addresses of CMAR17 and PAR27 match, that is, the desired data of request 2 exists in the block data of request 1 issued previously. is detected, the comparator 19 outputs a logic 1, the NAND circuit 20 outputs a logic O, and the output of the AND circuit 21 becomes a logic O. As a result, the block transfer of request 2 to the main memory The request should be sent. Then, the block data of request 1 is transferred from the main memory,
After the desired data of request 1 is sent to the request source and the block data is registered in DA 15 or 16, the DA on which this block data is registered is indexed and the desired data of request 2 is read out. , is sent to the request source. Until this processing is completed, it is treated as a series of cache miss processing.

この様に、本実施例では、先行して主メモリにデータの
ブロック転送を要求したリクエストのリクエスト情報を
保持し、この保持されたリクエストのブロック外アドレ
スと後続のリクエストのブロック外アドレスとを比較す
ることにより、後続のリクエストがキャッシュをミスし
ておりかつ先行して主メモリにデータのブロック転送を
要求したブロック中に所望のデータが存在しているか否
かの検出を極めて簡ｍに行なうことができ、各パイプラ
イン上に保持されている後続のリクエストの、退避及び
回復動作のような非常に複雑な制御を行なうことなしに
、そのままの状態で処理可能となるのである。In this way, in this embodiment, the request information of the request that previously requested a block transfer of data to the main memory is held, and the out-of-block address of this held request is compared with the out-of-block address of the subsequent request. By doing this, it is possible to extremely easily detect whether a subsequent request misses the cache and whether or not desired data exists in a block that has previously requested a block transfer of data to main memory. This allows subsequent requests held on each pipeline to be processed as they are without performing very complex control such as saving and restoring operations.

第３図は本発明の別の実線例を示ず１１７７２図であり
、第２図と同等部分は同一符号により示している、第２
図の例と異なる部分について説明すると、０ＭＲ２８は
キャッシュミスヒツトしたリクエストのキャッシュ登録
コンパートメント情報を保持するキャッシュミスヒツト
コンパートメントレジスタであり、セレクト２９．３０
はＦＦＩ２．１３の出力とＣＭＲ，２８の夫々対応する
信号との選択を行う選択器である。これ等セレクタ２９
．３０の出力によりセレクタ２２が制御されるようにな
っている。FIG. 3 is a diagram 11772 which does not show another solid line example of the present invention, and parts equivalent to those in FIG. 2 are indicated by the same reference numerals.
To explain the differences from the example in the figure, 0MR28 is a cache miss compartment register that holds cache registration compartment information for requests that have cache misses, and select 29.30
is a selector that selects between the output of FFI 2.13 and the corresponding signals of CMR, 28. These selector 29
．． The selector 22 is controlled by the output of the selector 30.

次に、本実施例の制御動作について述べる。今、後続の
リクエスト２はキャッシュミスヒツ゛トし、かつ先行す
るリクエストか要求するブロックデータ中にも所望のデ
ータが存在しない場合を考える。Next, the control operation of this embodiment will be described. Now, let us consider a case where the subsequent request 2 misses the cache and the desired data does not exist in the block data requested by the preceding request.

そして現在、パイプライン同期制御信号によって後続の
リクエスト２及びリクエスト３の実アドレスはＰＡＲ２
７及び１４で夫々保持されており、リクエスト２による
ＡＡ３．４の索引結果はＦＦ１２．１３で保持されてい
るとする。And now, due to the pipeline synchronization control signal, the real addresses of subsequent requests 2 and 3 are set to PAR2.
7 and 14, respectively, and the index result of AA3.4 based on request 2 is held in FF12.13.

ます、ＣＭＡＲ１７に保持されている、先行して主メモ
リにブロック転送要求を行なったりクエス１〜１の実ア
ドレスのブロック外アドレスと、ＰＡＲ２７に保持され
ているリクエスト２の実アドレスのブロック外アドレス
とか比較器１９で比較される。リクエスト２の実アドレ
スは先行して出されたリクエスト１のブロックアドレス
に不一致のケースであるから、比較器１９は不一致を示
す論理Ｏを出力し、ナンド回路２０に供給する。従って
ナンド回路２０は論理１を出力してアンド回路２１に供
給する。First, a block transfer request is made to the main memory in advance, which is held in CMAR17. A comparator 19 compares them. Since the real address of request 2 does not match the block address of request 1 issued previously, comparator 19 outputs a logic O indicating a mismatch and supplies it to NAND circuit 20 . Therefore, the NAND circuit 20 outputs a logic 1 and supplies it to the AND circuit 21.

リクエスト２はキャッシュをミスヒツトするケースであ
るので、リフニス）・２によるＡＡ３，４の索引結果を
保持するＦＦ１２，１３には、それぞれ論理Ｏが保持さ
れており、アンドゲート１０はキャッシュをミスしたこ
とを示す論理１を出力してアンドゲート２１に供給する
。よってアンドゲート２１は論理１を出力してＭＡＲ２
４に出力する。Since request 2 is a case of a cache miss, FF12 and FF13 that hold the index results of AA3 and 4 by Rifnis).2 each hold logic O, and AND gate 10 indicates that the cache has been missed. A logic 1 indicating this is output and supplied to the AND gate 21. Therefore, AND gate 21 outputs logic 1 and MAR2
Output to 4.

このアンドゲート２１の論理１の信号はリクエスト２が
キャッシュをミスしておりかつ先行して出されたリクエ
スト１のブロックデータ中にも所望のデータが存在して
いないことを示しており、リクエスト２の主メモリへの
ブロック転送要求を連続して発行可能であることを示し
ている。The logic 1 signal of this AND gate 21 indicates that request 2 has missed the cache and that the desired data does not exist in the block data of request 1 that was issued previously. This indicates that it is possible to issue block transfer requests to main memory in succession.

従って、ＰＡＲ２７中のリクエスト２の実アドレスはＭ
ＡＲ２４で受付けられて、主メモリへブロック転送要求
が発せられる。リクエスト１とリクエスト２のブロック
データの転送の処理中は、ＦＦ１２，１３及びアンド回
路１０の出力信号等は無効な信号として扱われ、ＤＡｉ
５．１６からのデータは使用されず、リクエスト１．２
のブロックデータがＤＡ１５または１６に登録される時
に同時に所望のデータがＤＡＲ２３を通してリクエスト
要求元に送出される。Therefore, the real address of request 2 in PAR27 is M
It is accepted by the AR 24 and a block transfer request is issued to the main memory. During the process of transferring the block data of request 1 and request 2, the output signals of FFs 12 and 13 and the AND circuit 10 are treated as invalid signals, and the DAi
Data from 5.16 is not used, request 1.2
When the block data is registered in the DA 15 or 16, the desired data is simultaneously sent to the request source through the DAR 23.

そして、リクエスト１．リクエスト２のブロック転送処
理中にリクエスト１とリクエスト２のアドレス情報が対
応するＡＡ３または４に登録される。リクエスト１とリ
クエスト２のブロックデータの転送の処理中の間は、後
続のリクエスト３はＰＡＲ１４中で待たされる。And request 1. During the block transfer process of request 2, the address information of request 1 and request 2 is registered in the corresponding AA3 or AA4. While the transfer of block data of requests 1 and 2 is being processed, the subsequent request 3 is kept waiting in the PAR 14.

次に、後続のリクエスト２がキャッシュにはミスヒット
し、一方先行するリクエスト１が要求するブロックデー
タ中に所望のデータが存在する場合を考える。そして、
今ＣＭＡＲ１７とＰＡＲ２７のブロック外アドレスか一
致することが比較器１９で検出されたとする。即ちこの
場合には、比較器１９は論理１を出力し、かつＶビット
１８が論理１を出力しているために、ナンド回路２０は
論理０を出力してアンド回路２１に供給する。そしてア
ンド回路２１は論理０を出力し、その結果リクエスト２
の主メモリに対するブロック転送要求は送出されず、リ
クエスト１の要求したブロックデータの処理が終了する
までリクエスト２の処理はＰＡＲ２７上で待たされる。Next, consider a case where the subsequent request 2 misses the cache, but the desired data is present in the block data requested by the preceding request 1. and,
Suppose now that the comparator 19 detects that the out-of-block addresses of CMAR17 and PAR27 match. That is, in this case, since the comparator 19 outputs a logic 1 and the V bit 18 outputs a logic 1, the NAND circuit 20 outputs a logic 0 and supplies it to the AND circuit 21. Then, the AND circuit 21 outputs logic 0, and as a result, request 2
A block transfer request to the main memory is not sent, and the processing of request 2 is made to wait on the PAR 27 until the processing of the block data requested by request 1 is completed.

そして、主メモリからリクエスト１の要求したブロック
データが転送されてくるまでの間に、゛図示されていな
いキャッシュリプレースメントアルゴリズム回路によっ
て決定された、リクエスト１のブロックデータのキャッ
シュ登録コンパートメント情報が０ＭＲ２８にセットさ
れる。このキャッシュ登録コンパートメント情報は２ビ
ツトからなる情報であり、論理１で書込むべきキャッシ
ュのコンパートメントを表示し、それぞれのビットは各
々ＤＡ１５．１６に対応する。従って、そのビットの組
合せは「１０」及びｒｏＩＪＩ、か存在しない。Then, until the block data requested by request 1 is transferred from the main memory, the cache registration compartment information of the block data of request 1 determined by a cache replacement algorithm circuit (not shown) is set to 0MR28. be done. This cache registration compartment information is information consisting of 2 bits, and indicates the compartment of the cache to be written to with a logic 1, and each bit corresponds to DA15.16. Therefore, the bit combination "10" and roIJI does not exist.

リクエスト１が要求したブロックデータか主メモリより
転送されてきて、ＤＡ１５もしくは１６に登録され、リ
クエスト１の所望のデータがリクエスト要求元に送出さ
れるとＰＡＲ２７で待機中のリクエスト２の処理が開始
される。When the block data requested by request 1 is transferred from the main memory and registered in DA 15 or 16, and the desired data of request 1 is sent to the request source, processing of request 2 that is waiting in PAR 27 is started. Ru.

リクエスト２の所望のデータはリクエスト１か主メモリ
に要求したブロックデータ中に存在するのであるから、
即ちリクエスト１とリクエスト２は同一のブロックアド
レスを有している。従って、リクエスト２でＤＡ１５．
１６を索引して、０ＭＲ２８で指示されたコンパートメ
ントのＤＡ１５もしくは１６から読出されたデータは、
リクエスト２が所望するデータである。Since the desired data of request 2 exists in request 1 or the block data requested from main memory,
That is, request 1 and request 2 have the same block address. Therefore, in request 2, DA15.
16 and the data read from DA15 or 16 of the compartment indicated by 0MR28 is
Request 2 is the desired data.

今、アンドゲート１０はキャッシュミスヒツトを示す論
理１を出力しているため、セレクタ２９゜３０ではＣＭ
Ｒ，２８の出力が有効となり、リクエスト２の所望する
データがＤＡＲ２３に読出されてリクエスト要求元へ転
送される。このリクエスト２の処理か終了するまでは、
一連のキャッシュミスヒツト処理中として扱われる。Now, the AND gate 10 is outputting a logic 1 indicating a cache miss, so the selectors 29 and 30 are outputting CM
The output of R, 28 becomes valid, and the desired data of request 2 is read out to DAR 23 and transferred to the request source. Until this request 2 is processed or finished,
It is treated as a series of cache misses being processed.

尚、後続するリクエスト２がキャッシュにヒツトした場
合には、アンドゲート１０はキャッシュにヒツトしたこ
とを示す論理Ｏを出力するため、アンドゲート２１の出
力もＯとなり、リクエスト２の主メモリに対するブロッ
ク転送要求は送出されたい。Note that when the subsequent request 2 hits the cache, the AND gate 10 outputs a logic O indicating that it hits the cache, so the output of the AND gate 21 also becomes O, and the block transfer of request 2 to the main memory is performed. The request should be sent.

リクエスト２はリクエスト１のキャッシュミスヒツト処
理が終了するまでの間、ＰＡＲ２７で待たされる。この
間リクエスト２のＡＡ３，４の索引結果もＦＦ１２．１
３で保持され続ける。ただし、リクエスト１のキャッシ
ュミスヒツト処理が終了するまでの間は、ＦＦ１２，１
３及び１０の出力はキャッシュの制御に関しては無効化
される。Request 2 is kept waiting in the PAR 27 until the cache miss processing of request 1 is completed. During this time, the index results for AA3 and 4 of request 2 are also FF12.1
It continues to be held at 3. However, until the cache miss processing of request 1 is completed, FF12, 1
Outputs 3 and 10 are disabled for cache control.

リクエスト１が要求したブロックデータがＤＡ１５もし
くは１６に登録され、リクエスト１の所望のデータがリ
クエスト要求元へ送出されると、リクエスト１のキャッ
シュミスヒツト処理が終了してリクエスト２の処理が再
開される。この時、ＦＦ１２．１３及びアンドゲート１
０によるキャッシュ制御情報が有効となり、セレクタ２
９．３０はＦＦ１２．１３の出力を選択する。そしてリ
クエスト２の所望のデータかＤＡ１５Ｌしくは１６から
ＤＡＲ２３に読出されてリクエスト要求元へ送出される
。When the block data requested by request 1 is registered in the DA 15 or 16 and the desired data of request 1 is sent to the request source, the cache miss processing of request 1 is completed and the processing of request 2 is resumed. . At this time, FF12.13 and AND gate 1
The cache control information based on 0 becomes valid, and selector 2
9.30 selects the output of FF12.13. Then, the desired data of request 2 is read out from the DA 15L or 16 to the DAR 23 and sent to the request source.

以上説明したように、本実施例では、先行して主メモリ
にデータのブロック転送を要求したリクエストのリクエ
スト情報及びキャッシュの登録コンパートメント情報を
保持しているので、後続のリクエストの状態の検出を極
めて簡単に行なうことができ、各パイプライン上に保持
されている後続のリクエストの退避及び回復動作のよう
な非常に複雑な制御を行なうことなしに、そのままの状
態で処理できるのである。As explained above, in this embodiment, since the request information and cache registration compartment information of the request that previously requested a block transfer of data to the main memory are held, it is extremely easy to detect the status of the subsequent request. This is easy to do, and can be processed as is without very complex control such as saving and restoring subsequent requests held on each pipeline.

几呪凶羞１軟土の如く、本発明によれば、後続のリクストがＡヤッ
シュミスでかつ先行して主メモリにデータのブロック転
送要求をしたブロック中に所望とするデータが存在して
いるか否かの検出を容易に高速に行えるので、システム
の処理性能が向上するという効果がある。According to the present invention, the following request is an A-yash miss, and whether or not the desired data exists in the block that was previously requested to transfer data to the main memory. This has the effect of improving the processing performance of the system because it can be detected easily and quickly.

[Brief explanation of the drawing]

第１図〜第３図は本発明の各実施例のブロック図である
。主要部分の符号の説明１・・・・・・論理アドレスレジスタ３４・・・・・・アドレスアレイ５．６，７．１９・・・・・・比較器１４．２７・・・・・・実アドレスレジスタ１５．１６
・・・・・・データアレイ１７・・・・・・キャッシュミスアドレスレジスタ１８・・・・・・フラグレジスタ２３・・・・・・データアレイ読出レジスタ２４・・・・・・メモリアドレスレジスタ２６・・・・
・・パイプライン同期制御回路２８・・・・・・キャッシュミスヒツトコンパートメン
トレジスタ1 to 3 are block diagrams of each embodiment of the present invention. Explanation of symbols of main parts 1...Logical address register 34...Address array 5.6, 7.19...Comparator 14.27...Real Address register 15.16
... Data array 17 ... Cache miss address register 18 ... Flag register 23 ... Data array read register 24 ... Memory address register 26・・・・・・
... Pipeline synchronization control circuit 28 ... Cache miss compartment register

Claims

[Claims]

(1) A data processing device having a cache memory that holds a partial copy of the contents of the main memory in units of blocks, wherein when the first request is a cache miss, a holding unit that holds the request information; a comparison means for comparing an address in the request information of a second request following the first request with an address in the request information held in the holding means; and means for requesting data block transfer from the main memory in response to the second request when the comparison result of the comparing means indicates a mismatch.

(2) A cache memory that holds a partial copy of the contents of the main memory in block units, a cache directory means that records a directory of the contents held in the cache memory, and a cache hit state determined based on the index result of the cache directory means. a first stage that indexes the cache directory means in response to a request; and a second stage that uses the index result to determine a cache hit state by the cache hit determination means. The data processing device employs a pipeline processing method, and includes a cache miss request holding unit that holds request information of a request for which a cache miss hit has been determined, and a cache miss request holding unit that holds request information of a subsequent request following this request. means for holding each stage, index result holding means for holding an index result of the cache directory for a request whose request information is held at the second stage, and an address in the request information held at the second stage. and a comparison means for comparing the address in the request information held in the cache miss request holding means, and the contents of the index result holding means indicate a cache miss and the comparison result of the comparison means is and means for requesting data block transfer from the main memory in response to the request held in the second stage if a mismatch is indicated.

(3) A cache memory that holds a partial copy of the contents of the main memory in units of blocks, a cache directory means that records a directory of the contents held in the cache memory, and a cache hit state determined based on the index result of the cache directory means. a first stage that indexes the cache directory means in response to a request; and a second stage that uses the index result to determine a cache hit state by the cache hit determination means. A data processing device using a pipeline processing method, comprising a cache miss request holding means for holding request information of a request for which a cache miss has been determined, and cache registration compartment information for the request for which a cache miss has been determined. means for holding request information of a subsequent request following this request in each of the first and second stages; and holding an index result of the cache directory for the request whose request information is held in the second stage. index result holding means, comparison means for comparing an address in the request information held in the second stage and an address in the request information held in the cache miss request holding means;
If the contents of the index result holding means indicate a cache miss and the comparison result of the comparison means indicates a mismatch, block data is retrieved from the main memory in response to the request for which the cache miss has been determined. is read out and registered in the cache memory, and then retained in the second stage based on the address in the request information retained in the second stage and the cache registration compartment information of the request for which the cache miss has been determined. A data processing device comprising means for reading desired data of a request being made from the cache memory and sending it to a request source.