JP3450132B2

JP3450132B2 - Cache control circuit

Info

Publication number: JP3450132B2
Application number: JP23350196A
Authority: JP
Inventors: 晋司渡部
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-09-04
Filing date: 1996-09-04
Publication date: 2003-09-22
Anticipated expiration: 2016-09-04
Also published as: JPH1078914A

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a cache control circuit, and more particularly to a cache control scheme for a fault-tolerant computer.

[0002]

[Prior Art] Conventionally, cache control schemes of this kind have been used in fault-tolerant computers with duplicated memory, with the aim of reflecting the cache contents in the duplicated memory efficiently without impairing fault tolerance.

[0003] Specifically, in such a fault-tolerant computer, each time the cache in a processor element is updated, the update is reflected in one memory element by a write-through method (sequential write-out processing). For the other memory element, all updates made to the cache are written out together at checkpoint time (batch write-out processing).
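The prior-art update policy described above can be sketched as follows. This is a minimal illustration only; the class and attribute names (`DuplexedMemoryModel`, `memory_a`, `memory_b`, `pending`) are hypothetical and are not taken from the cited publication.

```python
class DuplexedMemoryModel:
    """Toy model of the prior-art scheme: one memory element is updated
    write-through, the other only at checkpoints. All names are hypothetical."""

    def __init__(self):
        self.memory_a = {}   # memory element updated on every cache update
        self.memory_b = {}   # memory element updated only at checkpoints
        self.pending = {}    # cache updates made since the last checkpoint

    def store(self, addr, value):
        # Sequential write-out: reflect each update in memory_a immediately.
        self.memory_a[addr] = value
        self.pending[addr] = value

    def checkpoint(self):
        # Batch write-out: push all updates since the last checkpoint to memory_b.
        self.memory_b.update(self.pending)
        self.pending.clear()


model = DuplexedMemoryModel()
model.store(0x100, 7)
model.store(0x104, 9)
before = dict(model.memory_b)   # snapshot taken before any checkpoint
model.checkpoint()
after = dict(model.memory_b)    # snapshot taken after the checkpoint
```

In this sketch `memory_b` lags behind `memory_a` until `checkpoint()` runs, which is the cost that redundant processor elements are said to make unnecessary.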

[0004] The cache update processing in the above fault-tolerant computer is described in detail in Japanese Patent Laid-Open No. 7-271624.

[0005]

[Problems to Be Solved by the Invention] In the conventional cache control scheme described above, only the memory elements are made redundant; the case where the processor elements are also made redundant is not considered. When the processor elements are made redundant, a failure of a processor element itself can be covered by the alternate processor element, so the checkpoint processing described above becomes unnecessary and a more efficient system can be built. Moreover, even if one processor element fails, no recovery from memory is needed.

[0006] However, when the processor elements are made redundant, an error occurs in only one of the redundant processor elements, so the operation of correcting the error and writing the corrected data into the cache takes place only in the processor element where the error occurred. While the corrected data is being written into that processor element's cache, the processor cannot access the cache.

[0007] Consequently, if the processor issues a read to the cache while the processor element in which the error occurred is writing the corrected data, the error-free processor element returns read data from the cache to its processor immediately, whereas the processor element with the error cannot return read data to its processor until the write of the corrected data to the cache has finished.

[0008] As described above, correcting an error and writing the corrected data into the cache causes the redundant processor elements, which must operate in synchronization, to fall out of synchronization. Therefore, when the processor elements are also made redundant, error-corrected data cannot be written into the cache even if a correctable error is detected there. This problem does not arise in the prior art because its processor elements are not redundant.
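The loss of synchronization described above can be illustrated with a toy cycle-count model. The latencies used here (1 cycle for a cache read, 3 cycles for rewriting corrected data) and all names are assumed values chosen for illustration; the patent gives no such figures.

```python
READ_CYCLES = 1      # assumed latency of a cache read
REWRITE_CYCLES = 3   # assumed latency of rewriting corrected data into the cache


def reply_cycle(issue_cycle, cache_busy_until=None):
    """Cycle at which a read issued at issue_cycle returns data to the processor.

    If the cache is busy with a corrected-data rewrite, the read has to wait
    until that rewrite has finished."""
    start = issue_cycle
    if cache_busy_until is not None:
        start = max(start, cache_busy_until)
    return start + READ_CYCLES


# Only one element detected an error at cycle 10 and rewrites its cache alone.
busy_until = 10 + REWRITE_CYCLES

# Both elements issue the same read at cycle 11 because they run in lockstep.
reply_with_error = reply_cycle(11, busy_until)
reply_without_error = reply_cycle(11)
skew = reply_with_error - reply_without_error
```

Any non-zero `skew` means the two processors receive the same data on different cycles, which is exactly the condition a lockstep pair cannot tolerate.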

[0009] An object of the present invention is therefore to solve the above problem and to provide a cache control circuit that, even when an error occurs in the cache of a redundant processor element, can prevent the error from developing into an uncorrectable error without disturbing the synchronization of the redundant processor elements.

[0010]

[Means for Solving the Problems] A cache control circuit according to the present invention is a cache control circuit for an information processing system consisting of first and second processor elements that operate in synchronization with each other. Each processor element has a cache memory that stores part of the contents of a storage device, and write-back means that writes back to the storage device those contents of the cache memory that no longer match the contents of the storage device because of data writes. Each of the first and second processor elements includes error detection and correction means that detects errors in data read from the cache memory and corrects the detected errors, and instruction means that, in response to an error notification output from that error detection and correction means or from the error detection and correction means of the other processor element, instructs the write-back means to write the contents of the cache memory back to the storage device. The write-back means is configured so that, when writing the mismatched contents of the cache memory back to the storage device, it writes them back after any correctable error detected by the error detection and correction means has been corrected.

[0011] Another cache control circuit according to the present invention is a cache control circuit for an information processing system consisting of first and second processor elements that operate in synchronization with each other and each have a cache memory that stores part of the contents of a storage device. Each of the first and second processor elements includes: error detection and correction means that detects errors in data read from the cache memory and corrects detected correctable errors; address holding means that holds the read address of the cache; correction data holding means that, when the error detection and correction means detects an error, holds the data in which that error has been corrected; and means that, when either an error notification from the error detection and correction means of its own processor element or an error notification from the error detection and correction means of the other processor element is input, instructs that the data held in the correction data holding means be written into the cache memory at the address held in the address holding means. When the error detection and correction means of one of the first and second processor elements detects an uncorrectable error, that processor element is disconnected and processing continues on the other.

[0012] As described above, when an error occurs in the cache of a redundant processor element, the cache control scheme of the present invention removes the error from the cache without disturbing the synchronization of the redundant processor elements.

[0013] More specifically, the circuit has an error detection and correction circuit that detects errors in data read from the cache and corrects the erroneous data, and an interrupt generation circuit that, in accordance with the error notifications received from the error detection and correction circuits of all the redundant processor elements, generates an interrupt notifying software that an error has been detected, so that the software flushes the cache.

[0014] In an alternative approach, the circuit has: an error detection and correction circuit that detects errors in data read from the cache and corrects the erroneous data; address holding means that holds the cache read address; correction data holding means that, when the error detection and correction circuit detects an error, holds the read data from the cache with that error corrected; and correction data writing means that, on an error notification from the error detection and correction circuit of its own processor element or from any one or more of the other processor elements, issues an instruction to write the data held by the correction data holding means into the cache at the address held by the address holding means.

[0015] Flushing the cache in software by way of the interrupt generation circuit, or writing corrected data into the cache by way of the correction data writing means, makes it possible to remove erroneous data from the cache without disturbing the synchronization of the redundant processor elements.

[0016] Unlike in an ordinary system, in a fault-tolerant system a processor element has so far been disconnected from service, because of the synchronization problem between redundant processor elements, even when only a generally correctable error occurred in its cache. The present invention allows such a processor element to continue operating, which improves reliability.

[0017] Caches have grown in capacity, and with this growth the probability of soft errors in the cache, which are caused by radiation and the like rather than by hardware failure, has increased. Once a soft error has been corrected, the cache can operate exactly as before.

[0018] Particularly in fault-tolerant computers, where reliability is paramount, recovering from such cache soft errors and continuing to operate in the redundant configuration greatly improves reliability.

[0019]

[Embodiments of the Invention] Embodiments of the present invention are described next with reference to the drawings. FIG. 1 is a block diagram showing the configuration of the first embodiment of the present invention. In the figure, processor elements 1 and 2 are duplicated and therefore have exactly the same components. Each consists of an MPU (11, 21), a cache (12, 22), an error detection and correction circuit (13, 23), and an interrupt generation circuit (14, 24).

[0020] MPUs 11 and 21 are the microprocessors that perform arithmetic processing in processor elements 1 and 2, respectively. Caches 12 and 22 are the cache memories of MPUs 11 and 21, respectively. Error detection and correction circuits 13 and 23 detect and correct errors in data read from caches 12 and 22 at the direction of MPUs 11 and 21, respectively.

[0021] When error detection and correction circuit 13 detects an error in read data from cache 12, it simultaneously notifies interrupt generation circuit 14 in its own processor element 1 and interrupt generation circuit 24 in the other processor element 2, via error notification line 110, that an error has been detected. Likewise, error detection and correction circuit 23 simultaneously notifies interrupt generation circuits 14 and 24, via error notification line 120, that an error has been detected.

[0022] When notified by error detection and correction circuit 13 or error detection and correction circuit 23 that an error has been detected, interrupt generation circuits 14 and 24 raise an interrupt to MPUs 11 and 21, respectively, so that software recognizes that an error has occurred.

[0023] The operation of the first embodiment of the present invention is described with reference to FIG. 1. Because processor elements 1 and 2 are duplicated and their components operate identically, the description focuses on the components of processor element 1. Each component of processor element 2 corresponds to the component of the same name in processor element 1.

[0024] In processor element 1, MPU 11 performs arithmetic processing. Loads and stores in this processing read data from, and write data to, a main memory (not shown). Cache 12 is provided to speed up these accesses. On a read request from MPU 11, cache 12 sends the read data to error detection and correction circuit 13.

[0025] Error detection and correction circuit 13 checks whether the data from cache 12 contains an error. When an error is detected, error detection and correction circuit 13 simultaneously notifies interrupt generation circuit 14 in its own processor element 1 and interrupt generation circuit 24 in the other processor element 2, via error notification line 110, that an error has been detected.

[0026] If the detected error is correctable, error detection and correction circuit 13 returns the corrected data to MPU 11. If the error is uncorrectable, processor element 1 is treated as faulty and disconnected from service, and operation continues on the other duplicated processor element 2.

[0027] Interrupt generation circuit 14 raises an interrupt to MPU 11, to notify software that an error has occurred, when it is notified via error notification line 110 by error detection and correction circuit 13 that the read data from cache 12 contained an error, or when it is notified via error notification line 120 by error detection and correction circuit 23 that the read data from cache 22 contained an error.

[0028] Therefore, even when an error is detected in only one of processor elements 1 and 2, this interrupt is raised simultaneously to MPU 11 by interrupt generation circuit 14 and to MPU 21 by interrupt generation circuit 24.
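The cross-notification just described can be modeled in a few lines. This is only a behavioral sketch: the function names are hypothetical, the boolean signals stand in for error notification lines 110 and 120, and the OR combination follows the later description of interrupt generation circuits 14 and 24.

```python
def interrupt_request(line_110: bool, line_120: bool) -> bool:
    """Behavioral model of an interrupt generation circuit (14 or 24):
    an interrupt is requested when either error notification line is active."""
    return line_110 or line_120


def lockstep_cycle(error_in_pe1: bool, error_in_pe2: bool):
    """Return the interrupt requests seen by MPU 11 and MPU 21 in one cycle."""
    line_110 = error_in_pe1   # driven by error detection and correction circuit 13
    line_120 = error_in_pe2   # driven by error detection and correction circuit 23
    irq_mpu11 = interrupt_request(line_110, line_120)   # circuit 14
    irq_mpu21 = interrupt_request(line_110, line_120)   # circuit 24
    return irq_mpu11, irq_mpu21


only_pe1 = lockstep_cycle(True, False)
only_pe2 = lockstep_cycle(False, True)
no_error = lockstep_cycle(False, False)
```

Because both circuits see the same pair of lines, the two MPUs are interrupted together regardless of which element detected the error, so the lockstep relationship is preserved.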

[0029] At this point the erroneous data is still stored in the cache of the processor element in which the error was detected. The software that receives the interrupt therefore flushes the entire cache before the error can progress to an uncorrectable one.

[0030] Here, flushing the cache means the following series of operations: when writes from the processor to the cache have made the cache data inconsistent with the main memory data, at least all of the inconsistent cache data is written back to main memory so that main memory and cache agree, and all cache data present at the time of the flush is then invalidated. In the present invention, all data written back from the cache to main memory during a flush is sent to main memory through error detection and correction circuits 13 and 23. Thus, even if error detection and correction circuits 13 and 23 detect a correctable error, all of the inconsistent cache data is written back to main memory with the error corrected.
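The flush operation described above can be sketched as follows. The cache line layout (`stored`, `flip_mask`, `valid`, `dirty`) and the function names are hypothetical, and `ecc_read` is only a stand-in for the error detection and correction circuit: it undoes a recorded single-bit flip rather than decoding real check bits.

```python
def ecc_read(line):
    """Stand-in for the error detection and correction circuit: returns the
    stored word with the recorded single-bit soft error undone."""
    return line["stored"] ^ line["flip_mask"]


def flush(cache, main_memory):
    """Write back every valid dirty line through the correction path,
    then invalidate all lines."""
    for addr, line in cache.items():
        if line["valid"] and line["dirty"]:
            main_memory[addr] = ecc_read(line)   # corrected data reaches memory
        line["valid"] = False
        line["dirty"] = False


main_memory = {0x10: 0b0000, 0x20: 0b0110}
cache = {
    # Dirty line whose stored word 0b1011 has suffered a flip in bit 2.
    0x10: {"stored": 0b1011 ^ 0b0100, "flip_mask": 0b0100, "valid": True, "dirty": True},
    # Clean line: nothing to write back, but it is still invalidated.
    0x20: {"stored": 0b0110, "flip_mask": 0, "valid": True, "dirty": False},
}
flush(cache, main_memory)
```

After the flush, the corrupted copy no longer exists in the cache, and a later read refetches the corrected value from main memory.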

[0031] FIG. 3 is a block diagram showing the configuration of the second embodiment of the present invention. In the figure, processor elements 3 and 4 are duplicated as in the first embodiment and therefore have exactly the same components. Each consists of an MPU (31, 41), a cache (32, 42), an error detection and correction circuit (33, 43), address holding means (34, 44), correction data holding means (35, 45), and correction data writing means (36, 46).

[0032] MPUs 31 and 41 are the microprocessors that perform arithmetic processing in processor elements 3 and 4, respectively. Caches 32 and 42 are the cache memories of MPUs 31 and 41, respectively. Error detection and correction circuits 33 and 43 detect and correct errors in data read from caches 32 and 42 at the direction of MPUs 31 and 41, respectively.

[0033] Address holding means 34 and 44 hold the read addresses issued by MPUs 31 and 41 to caches 32 and 42. Correction data holding means 35 and 45 hold the read data returned from error detection and correction circuits 33 and 43 to MPUs 31 and 41.

[0034] When notified by error detection and correction circuit 33, via error notification line 130, that an error has been detected, correction data writing means 36 notifies correction data writing means 46 of the other processor element 4 that an error has been detected in its own processor element 3.

[0035] When correction data writing means 36 is notified by error detection and correction circuit 33 that an error has been detected, or is notified by correction data writing means 46 of the other processor element 4 that an error has been detected in processor element 4, it writes to cache 32 using the address and data held in address holding means 34 and correction data holding means 35. Correction data writing means 46 has the same function as correction data writing means 36.

[0036] The operation of the second embodiment of the present invention is described with reference to FIG. 3. Because processor elements 3 and 4 are duplicated and their components operate identically, the description focuses on the components of processor element 3. Each component of processor element 4 corresponds to the component of the same name in processor element 3.

[0037] In processor element 3, MPU 31 performs arithmetic processing. Loads and stores in this processing read data from, and write data to, a main memory (not shown). Cache 32 is provided to speed up these accesses. On a read request from MPU 31, cache 32 sends the read data to error detection and correction circuit 33.

[0038] Error detection and correction circuit 33 checks whether the data from cache 32 contains an error. When an error is detected, error detection and correction circuit 33 notifies correction data writing means 36, via error notification line 130, that an error has been detected.

[0039] If the detected error is correctable, error detection and correction circuit 33 returns the corrected data to MPU 31. If the error is uncorrectable, processor element 3 is treated as faulty and disconnected from service, and operation continues on the other duplicated processor element 4.

[0040] Address holding means 34 holds the read address from MPU 31 to cache 32. At this time, correction data holding means 35 holds the data read from cache 32 at the address held in address holding means 34; if the read data contains an error, it holds the data as corrected by error detection and correction circuit 33.

[0041] When notified by error detection and correction circuit 33, via error notification line 130, that an error has been detected, correction data writing means 36 reports to correction data writing means 46 of the other processor element 4 that an error has been detected in its own processor element 3. Because correction data writing means 46 has the same function, correction data writing means 36 also learns of an error detected in processor element 4 through the notification from correction data writing means 46. On an error detection notification from either error detection and correction circuit 33 or correction data writing means 46, correction data writing means 36 writes to cache 32 using the address in address holding means 34 and the data in correction data holding means 35.

[0042] Because the processor elements are duplicated, even when the error occurred in processor element 4 the addresses held in address holding means 34 and 44 are always the same, and the data held in correction data holding means 35 and 45 is also always the same, since the data in which the error was detected is held after correction. There is therefore no problem in using the address and data held within the own processor element 3.
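The second embodiment can be summarized with a small behavioral model. All class, method, and field names are hypothetical. Each cache entry is stored as `[stored_word, error_mask]`, and XORing with `error_mask` is a stand-in for the error detection and correction circuit, not a real ECC decoder.

```python
class ProcessorElement:
    """Toy model of processor element 3 or 4 (names are hypothetical)."""

    def __init__(self, cache):
        self.cache = cache        # addr -> [stored_word, error_mask]
        self.addr_latch = None    # address holding means (34 / 44)
        self.data_latch = None    # correction data holding means (35 / 45)

    def read(self, addr):
        stored, mask = self.cache[addr]
        corrected = stored ^ mask           # stand-in for circuit 33 / 43
        self.addr_latch = addr              # latch the read address
        self.data_latch = corrected         # latch the (corrected) reply data
        return corrected, mask != 0         # reply data, error notification

    def write_corrected(self):
        # Correction data writing means (36 / 46): rewrite the latched data
        # at the latched address, leaving a clean copy in the cache.
        self.cache[self.addr_latch] = [self.data_latch, 0]


def lockstep_read(pe_a, pe_b, addr):
    data_a, err_a = pe_a.read(addr)
    data_b, err_b = pe_b.read(addr)
    if err_a or err_b:
        # Both elements perform the same cache write in the same step,
        # so neither falls behind the other.
        pe_a.write_corrected()
        pe_b.write_corrected()
    return data_a, data_b


pe3 = ProcessorElement({0x40: [0b1011 ^ 0b0100, 0b0100]})   # soft error in bit 2
pe4 = ProcessorElement({0x40: [0b1011, 0]})                 # clean copy
replies = lockstep_read(pe3, pe4, 0x40)
```

The error-free element performs a redundant write of data it already holds. That extra write is the price of keeping both elements on the same sequence of cache operations.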

[0043] An example of the present invention is described next with reference to the drawings. FIG. 2 is a block diagram showing the configuration of error detection and correction circuit 13 of FIG. 1. In the figure, error detection and correction circuit 13 consists of error detection circuit 13a, syndrome generation circuit 13b, syndrome decoder 13c, error correction circuit 13d, and uncorrectable error detection circuit 13e. Error detection and correction circuit 23 has the same configuration as error detection and correction circuit 13.

[0044] An example of the first embodiment of the present invention is described with reference to FIGS. 1 and 2. MPUs 11 and 21 and caches 12 and 22 operate as described above for the first embodiment. FIG. 2 shows an example in which error detection and correction circuits 13 and 23 use an ECC (error checking and correction) code for error detection and correction. The ECC code normally used can correct 1-bit errors and detect 2-bit errors.

[0045] Error detection circuits 13a and 23a (error detection circuit 23a is not shown) detect whether a correctable error is present by taking the exclusive OR of all data bits and all ECC code bits. The result from error detection circuits 13a and 23a is output on error notification line 110 or error notification line 120 and notified to interrupt generation circuits 14 and 24.

[0046] Syndrome generation circuits 13b and 23b (syndrome generation circuit 23b is not shown) generate a syndrome according to a predefined syndrome generation pattern. When there is no error, the syndrome is normally all "0".

[0047] The generated syndrome is decoded by syndrome decoders 13c and 23c (syndrome decoder 23c is not shown), yielding a decode result with the same number of bits as the read data, in which only the bit corresponding to the erroneous data bit is "1". Like the syndrome, the decode result is all "0" when there is no error.

[0048] Error correction circuits 13d and 23d (error correction circuit 23d is not shown) take the bitwise exclusive OR of the decode result from syndrome decoders 13c and 23c and the read data from caches 12 and 22, thereby correcting a correctable 1-bit error.

[0049] Uncorrectable error detection circuits 13e and 23e (uncorrectable error detection circuit 23e is not shown) detect whether the error is an uncorrectable error of two or more bits. An error is judged uncorrectable when the result of syndrome generation circuits 13b and 23b is not all "0" even though error detection circuits 13a and 23a have detected no error, or when the result of syndrome generation circuits 13b and 23b is an undefined pattern.
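The roles of circuits 13a through 13e can be illustrated with a small single-error-correcting, double-error-detecting code. The sketch below uses an extended Hamming (8,4) code over a 4-bit data word, which is an assumption made for brevity: the patent does not specify the code or word width, and the function names and bit layout here are hypothetical.

```python
DATA_POS = (3, 5, 6, 7)   # codeword positions of the data bits; 1, 2, 4 are check bits


def encode(nibble):
    """Return an 8-bit codeword as a list; index 0 is the overall parity bit."""
    cw = [0] * 8
    for i, pos in enumerate(DATA_POS):
        cw[pos] = (nibble >> i) & 1
    for p in (1, 2, 4):
        # Check bit p makes the parity over all positions containing bit p even.
        cw[p] = sum(cw[k] for k in range(1, 8) if k & p) % 2
    cw[0] = sum(cw[1:]) % 2
    return cw


def check(cw):
    """Return (data, correctable_error, uncorrectable_error)."""
    # 13a: XOR of all data and check bits; odd parity signals a single-bit error.
    correctable = sum(cw) % 2 == 1
    # 13b: syndrome generation; zero for an error-free codeword.
    syndrome = 0
    for k in range(1, 8):
        if cw[k]:
            syndrome ^= k
    # 13c: decode the syndrome into a one-hot mask (all zero if syndrome is 0).
    mask = [1 if (syndrome != 0 and k == syndrome) else 0 for k in range(8)]
    # 13d: XOR the mask with the read word; applied whether or not an error exists.
    fixed = [bit ^ m for bit, m in zip(cw, mask)]
    # 13e: non-zero syndrome although 13a saw no error means two bits flipped.
    uncorrectable = (not correctable) and syndrome != 0
    data = sum(fixed[pos] << i for i, pos in enumerate(DATA_POS))
    return data, correctable, uncorrectable


clean = encode(0b1011)
one_flip = list(clean)
one_flip[5] ^= 1
two_flips = list(clean)
two_flips[5] ^= 1
two_flips[6] ^= 1
results = (check(clean), check(one_flip), check(two_flips))
```

In this particular code every non-zero syndrome names a valid position, so the "undefined pattern" case of circuit 13e does not arise; wider codes with unused syndrome values would need that additional check.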

[0050] Because recovery is impossible when an uncorrectable error is detected, the processor element in which the uncorrectable error was detected is disconnected from service, and the fault-tolerant system continues processing on the other duplicated processor element.

【００５１】The error correction performed by the error detection and correction circuits 13 and 23 is unnecessary if the data read from the caches 12 and 22 contains no error. In some systems, however, the error correction shown in FIG. 2 takes several clock cycles, and in that case the timing of the reply to the MPUs 11 and 21 differs depending on whether or not the error is corrected.

【００５２】If the timing of the reply to the MPUs 11 and 21 differs, the processor element 1 and the processor element 2 fall out of synchronization. To prevent this, the error detection and correction circuits 13 and 23 always operate so as to correct errors. That is, the output of the error correction circuits 13d and 23d is always what is returned to the MPUs 11 and 21.

【００５３】When the error detection and correction circuits 13 and 23 detect an error in the read data from the caches 12 and 22, a general system corrects the error present in the caches 12 and 22 by rewriting the corrected data to the caches 12 and 22. However, when the processor elements 1 and 2 are duplicated, the duplicated elements fall out of synchronization if only the processor element in which the error occurred rewrites the corrected data to its cache.

【００５４】Here, the interrupt generation circuits 14 and 24 OR the error notification signals on the error notification lines 110 and 120 and generate an interrupt to the MPUs 11 and 21. The interrupt is received by the operating system, which, on recognizing that an error was detected in the read data from the cache, issues an instruction to flush the caches 12 and 22. Instead of rewriting the corrected data to the cache, the caches 12 and 22 are flushed simultaneously, so that the data containing the error is removed from the caches without disturbing the synchronization of the processor elements 1 and 2.
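
The ORed interrupt and the simultaneous flush can be sketched as follows. This is only a behavioral model under stated assumptions: the class and function names are hypothetical, line 110 is assumed to carry the notification from processor element 1 and line 120 the one from processor element 2, and the write-back of modified lines that a real flush would perform is omitted.

```python
class ProcessorElement:
    """Toy model of one processor element with a dictionary as its cache."""

    def __init__(self, name: str):
        self.name = name
        self.cache = {}
        self.flush_count = 0

    def flush(self) -> None:
        # A real flush would first write modified lines back to memory (omitted).
        self.cache.clear()
        self.flush_count += 1


def interrupt(own_error: bool, other_error: bool) -> bool:
    """Interrupt generation: OR of the two error notification signals."""
    return own_error or other_error


def handle_read_errors(pe1: ProcessorElement, pe2: ProcessorElement,
                       error_110: bool, error_120: bool) -> None:
    """Both elements see the same ORed signal, so both flush or neither does."""
    irq1 = interrupt(error_110, error_120)
    irq2 = interrupt(error_120, error_110)
    for pe, irq in ((pe1, irq1), (pe2, irq2)):
        if irq:
            pe.flush()
```

Since the OR is symmetric, an error seen by only one element still triggers the same action in both, which is the property the description relies on to keep the pair in lockstep.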

【００５５】FIG. 4 is a block diagram showing the configuration of the correction data writing means 36 of FIG. 3. In the figure, the correction data writing means 36 comprises an error F/F (flip-flop) 36a, an address register 36b, a data register 36c, and an OR circuit 36d. The correction data writing means 46 has the same configuration as the correction data writing means 36.

【００５６】An example of the second embodiment of the present invention will be described with reference to FIGS. 3 and 4. The first embodiment of the present invention evicts the erroneous data from the cache by having software flush the cache. In contrast, the second embodiment holds the data in which an error occurred during a cache read together with its address, and writes the corrected data to the cache simultaneously in both of the duplicated processor elements. It thereby corrects the erroneous data in the cache without disturbing the synchronization of the duplicated processor elements.

【００５７】The MPUs 31 and 41 and the caches 32 and 42 are as described above in the explanation of the operation of the second embodiment of the present invention. The error detection and correction circuits 33 and 43 have the same configuration as the error detection and correction circuits 13 and 23 of the first example of the present invention, and are as described in that first example. When the error detection circuits of the error detection and correction circuits 33 and 43 detect a correctable error, the correction data writing means 36 and 46 are notified through the error notification lines 130 and 140 that an error has been detected.

【００５８】When the error detection and correction circuits 33 and 43 detect an error in the read data from the caches 32 and 42, a general system corrects the error present in the cache by rewriting the corrected data to the cache.

【００５９】However, when the processor elements 3 and 4 are duplicated, the duplicated elements fall out of synchronization if only the processor element in which the error occurred rewrites the corrected data to its cache. FIG. 3 therefore shows a configuration in which the corrected data is rewritten simultaneously to the caches of both processor elements, not just one, so that the data present in the caches is corrected without disturbing the synchronization of the duplicated elements.

【００６０】The address holding means 34 and 44 hold the address used when the MPUs 31 and 41 read the caches 32 and 42. The correction data holding means 35 and 45 hold the data after a correctable error has been corrected by the error detection and correction circuits 33 and 43.

【００６１】The detailed configuration of the correction data writing means 36 and 46 is as shown in FIG. 4. In the correction data writing means 36, the address held in the address holding means 34 is stored in the address register 36b, and the data held in the correction data holding means 35 is stored in the data register 36c. The error notification sent from the error detection and correction circuit 33 through the error notification line 130 is stored in the error F/F 36a.

【００６２】The fact, stored in the error F/F 36a, that an error was detected in the read data of the own processor's cache 32 is notified to the correction data writing means 46 of the other processor element 4.

【００６３】The OR circuit 36d ORs the error notification signal of the own processor element stored in the error F/F 36a with the error notification signal from the other processor element 4, and generates an enable signal for writing to the cache 32. For this write enable signal, the address presented to the cache 32 is the address stored in the address register 36b, and the data is the data stored in the data register 36c.

【００６４】Because the OR circuit 36d ORs the error notifications of the own processor element 3 and the other processor element 4, a write to the caches 32 and 42 is instructed simultaneously in both processor elements 3 and 4 even when the error occurred in only one of them.

【００６５】The addresses held in the address holding registers 34 and 44, and likewise the data held in the correction data holding means 35 and 45, are always equal between the two processor elements. Therefore, even when the error was detected in the other processor unit, the own processor unit is permitted to write to its cache using the address and data that it holds itself.

【００６６】In this way, when an error is detected in the read data from the cache in the duplicated processor elements and the cache is found to contain erroneous data, the erroneous data can be removed from the cache without disturbing the synchronization of the duplicated processor elements.

【００６７】[0067]

[Effects of the Invention] As described above, according to the cache control circuit of the present invention, an error in data read from a cache memory that stores a part of the contents of a storage device is detected and the detected error is corrected. In addition, the write-back means, which writes back to the storage device the stored contents of the cache memory that no longer match the contents of the storage device because of a data write, is instructed to write back the stored contents of the cache memory to the storage device in response to an error notification output from the other processor element. As a result, even if an error occurs in the cache of a redundant processor element, the error can be prevented from developing into an uncorrectable error without disturbing the synchronization of the redundant processor elements.

【００６８】According to another cache control circuit of the present invention, an error in data read from a cache memory that stores a part of the contents of a storage device is detected and the detected error is corrected. In addition, when at least one of an error notification from the own processor element and an error notification from the other processor element is input, an instruction is given to write the data held in the correction data holding means to the cache memory at the read address held in the address holding means. As a result, even if an error occurs in the cache of a redundant processor element, the error can be prevented from developing into an uncorrectable error without disturbing the synchronization of the redundant processor elements.

[Brief description of drawings]

【図１】FIG. 1 is a block diagram showing the configuration of the first embodiment of the present invention.

【図２】FIG. 2 is a block diagram showing the configuration of the error detection and correction circuit of FIG. 1.

【図３】FIG. 3 is a block diagram showing the configuration of the second embodiment of the present invention.

【図４】FIG. 4 is a block diagram showing the configuration of the correction data writing means of FIG. 3.

[Explanation of symbols]

1 to 4: Processor element
11, 21, 31, 41: MPU
12, 22, 32, 42: Cache
13, 23, 33, 43: Error detection and correction circuit
13a: Error detection circuit
13b: Syndrome generation circuit
13c: Syndrome decoder
13d: Error correction circuit
13e: Uncorrectable error detection circuit
14, 24: Interrupt generation circuit
34, 44: Address holding means
35, 45: Correction data holding means
36, 46: Correction data writing means
36a: Error F/F
36b: Address register
36c: Data register
36d: OR circuit
110, 120, 130, 140: Error notification line

Continuation of the front page
(56) References: JP-A-4-211841 (JP, A); JP-A-63-278162 (JP, A); JP-A-6-266574 (JP, A); JP-A-5-265790 (JP, A); JP-A-6-35736 (JP, A)
(58) Fields investigated (Int.Cl.⁷, DB name): G06F 12/08; G06F 11/16; G06F 12/16

Claims

(57) [Claims]

1. A cache control circuit of an information processing system comprising first and second processor elements that operate in synchronization with each other, each processor element including a cache memory that stores a part of the contents of a storage device and write-back means for writing back to the storage device the stored contents of the cache memory that no longer match the contents of the storage device because of a data write, wherein each of the first and second processor elements has: error detection and correction means for detecting an error in data read from the cache memory and correcting the detected error; and instruction means for instructing the write-back means to write back the stored contents of the cache memory to the storage device in response to an error notification output from the error detection and correction means and from the error detection and correction means of the other processor element, and wherein the write-back means is configured so that, when writing back the mismatched stored contents of the cache memory to the storage device, it writes them back to the storage device after the correctable error detected by the error detection and correction means has been corrected.

2. The cache control circuit according to claim 1, wherein the instruction means is configured to output to the write-back means an interrupt signal instructing it to write back the stored contents of the cache memory to the storage device.

3. A cache control circuit of an information processing system comprising first and second processor elements that operate in synchronization with each other, each processor element including a cache memory that stores a part of the contents of a storage device, wherein each of the first and second processor elements has: error detection and correction means for detecting an error in data read from the cache memory and correcting a detected correctable error; address holding means for holding a read address of the cache memory; correction data holding means for holding the data corrected by the error detection and correction means; and means for instructing, when at least one of an error notification from the error detection and correction means of the own processor element and an error notification from the error detection and correction means of the other processor element is input, that the data held in the correction data holding means be written to the cache memory at the address held in the address holding means in synchronization with the other processor element, and wherein, when an uncorrectable error is detected by the error detection and correction means of one of the first and second processor elements, that processor element is disconnected and the other of the first and second processor elements continues processing.