JPS6113261B2

Info

Publication number: JPS6113261B2
Application number: JP56156193A
Authority: JP
Inventors: Tadaaki Bando; Yasushi Fukunaga; Yoshinari Hiraoka; Hidekazu Matsumoto; Toshuki Ide; Takeshi Kato; Tetsuya Kawakami
Original assignee: Hitachi Engineering Co Ltd; Hitachi Ltd
Current assignee: Hitachi Engineering Co Ltd; Hitachi Ltd
Priority date: 1981-10-02
Filing date: 1981-10-02
Publication date: 1986-04-12
Also published as: JPS5858666A

Description

[Detailed description of the invention]

本発明は複数のプロセツサが１つの主記憶装置
（以下主メモリと略称する）を共用するデータ処
理装置に関する。ここで、複数のプロセツサのうち、少なくとも
１つは、命令を実行するために仮想アドレスでメ
モリアクセスを行うプロセツサであり、ここで
は、このプロセツサをジヨブプロセツサと称して
いる。また、少なくとも１つは、補助記憶装置とも称
される外部記憶装置（以下外部メモリと略称す
る）との入出力を行うために、仮想アドレスでメ
モリアクセスを行うプロセツサであり、ここで
は、このプロセツサをフアイルプロセツサと称し
ている。また、ジヨブプロセツサには、仮想アドレスで
メモリアクセスされるキヤツシユメモリが設けら
れている。更に、本発明におけるデータ処理装置は、各プ
ロセツサから共通に使用され、仮想アドレスを物
理アドレスに変換するアドレス変換装置を有する
ものである。本発明は、このようなデータ処理装置におい
て、フアイルプロセツサが主メモリの内容を書き
換え（ページのロールイン、ロールアウト）時
に、アドレス変換テーブルを更新するために生ず
る問題、即ち、キヤツシユメモリが主メモリのコ
ピーであるという従来の考え方がくずれることか
ら生ずる問題を解決するようにしたデータ処理装
置に関する。まず、本発明の背景を詳細に説明する。１つの主メモリを複数のプロセツサが共用する
データ処理装置は、一般にマルチプロセツサシス
テムと称されている。従来のマルチプロセツサシステムでは、一般
に、シングルコンピユータのコスト／パーフオー
マンスを圧迫しない形で、マルチシステム構成を
可能としてきた。そのため、プロセツサ台数が、
２台程度では、最高な構成であつたが、プロセツ
サ台数が更に増加すると、相互の干渉が大きくな
り、ハードウエア構成が大きくなりすぎるという
問題があつた。マルチプロセツサシステムにおいて、解決しな
ければならない問題の１つに仮想記憶方式があ
る。仮想記憶方式は、良く知られているが、これ
は、主メモリと外部メモリとを見掛け上、一体の
ものとみなし、プロセツサから要求された情報が
主メモリ内になく、外部メモリにある場合には、
主メモリの比較的使われていない一部分の情報を
外部メモリに転送し、要求された情報を外部メモ
リから主メモリへ転送するのを自動的にシステム
が行うものである。主メモリの情報を外部メモリに転送することを
ロールアウト、逆に、外部メモリの情報を主メモ
リへ転送することをロールインと称している。このような、ロールイン、ロールアウトの制御
を実施する為、一般には、主メモリと外部メモリ
とは、それぞれページと称される１つの単位に分
割されて使用されている。各ページに対応して、そのページが現在主メモ
リ上にあるかどうか、またある場合は、それに対
応する主メモリの物理アドレス（実アドレスとも
称されている）がいくらであるかの情報が、変換
テーブル上に置かれている。プロセツサからのメモリアクセス時のアドレス
は、仮想アドレスで与えられ、その上位アドレス
部で変換テーブルが索引され、物理アドレスへの
変換が行なわれる。このような、変換テーブルは、仮想アドレスの
ページ数分だけ必要となり、必要なメモリ容量が
大きくなり、そのため、容量の削減を図ろうとし
て変換テーブルをセグメントテーブルと、ページ
テーブルのような２レベルのアドレス変換を実施
するものが多い。変換テーブルは、上記したように、メモリ容量
を多く必要とするので、一般には、主メモリ上に
置かれている。そのため、プロセツサからのメモ
リアクセスがある毎に、主メモリの変換テーブル
をチエツクしていたのでは、１つのメモリアクセ
ス要求に対して必ず３回以上のメモリアクセスが
発生し、オーバーヘツドが無視できない。そこで、仮想記憶方式をとるプロセツサには、
TLBと称される高速バツフアを設けているもの
が多く、この高速バツフアには最近使用された仮
想アドレスに対応した物理アドレスが記憶されて
いる。これによれば、プロセツサからのメモリアクセ
スが発生すると、まず、TLBの中に、対応する
アドレスがないかどうかがチエツクされ、存在す
る場合には、アドレス変換のための、変換テーブ
ルのメモリアクセスが必要でなく、アドレス変換
のオーバーヘツドは少なくてアクセス可能とな
る。このような、TLBを含んだアドレス変換装
置は、従来は、プロセツサ毎におかれ、プロセツ
サ内の制御装置の手助けを借りて仮想制御方式を
実現していた。一方、性能を上げるためには、キヤツシユメモ
リと呼ばれる高速メモリを各プロセツサに設ける
ことが一般的になつている。このキヤツシユメモ
リは、主メモリの一部の写しで、通常、主メモリ
よりも５〜10倍高速なものとして設計されてい
る。プロセツサからのメモリアクセスは、まずキヤ
ツシユに該当するものがあるか否かのチエツクを
行い、あれば、その内容を返送し、なければ主メ
モリをアクセスすることになる。ところで、仮想記憶方式で、キヤツシユメモリ
を有する場合には、従来、プロセツサから見て
TLB、キヤツシユメモリ、主メモリの順で接続
されていた。これは、キヤツシユメモリは、実記
憶である主メモリの写しであるという考え方に基
づくもので、キヤツシユメモリのアクセスは、物
理アドレスに変換してから行なわなければならな
かつた。このような、従来方式には、次のような問題点
がある。第１の問題点は、実効的なメモリアクセス時間
が長くなることである。この理由は、プロセツサからキヤツシユメモリ
をアクセスする場合に、必ずTLBを通らなけれ
ばならないからである。すなわち、プロセツサか
らの仮想アドレスを、TLBにより物理アドレス
に変換してからキヤツシユメモリをアクセスする
ことになるからである。第２の問題点は、TLBが全プロセツサに必要
なため、プロセツサの数が増えた場合、それにつ
れてハードウエア物量が大きくなるばかりでな
く、TLBのずれを修正しなければならず、これ
が複雑になつていることである。 TLBのずれの修正は、あるプロセツサが、ペ
ージをスワツプした時に、変換システム、TLB
のエントリーを更新するが、当該プロセツサのみ
でなく、他のプロセツサにもその旨を連絡して、
該当部分をクリヤしなければならない。これは、一般にTLB Purge（Translation
Lookaside Buffer Purge）と称され、マルチプ
ロセツサを構成する場合の１つの重要なポイント
となつている。第３の問題点は、一般のジヨブプロセツサはメ
モリをアクセスする場合、仮想アドレスを用いる
が、入出力プロセツサは、TLBを持たないた
め、物理アドレスでアクセスすることから生ずる
もので、両者で、アドレスの受け渡しをする際に
変換が必要となるためにオーバーヘツドが増加す
ることである。第４の問題点は、第２の問題点と類似している
が、マルチプロセツサでなくても、高度なパイプ
ライン制御をするプロセツサでは命令をアクセス
するユニツトとオペランドをアクセスするユニツ
トは別々で、各々にキヤツシユメモリを持ち、高
速化を図ることが行なわれるが、この際にも、従
来方式では、TLBを各ユニツトに持たなければ
ならなくなり、ハードウエア量が増大するという
ことである。このような問題点を解決するために、キヤツシ
ユメモリを仮想アドレスでアクセスし、ヒツトし
ない場合のみアドレス変換を行うやり方が、特開
昭49−53339号公報の中に示されている。これに
よればキヤツシユメモリをアクセスする場合に毎
回アドレス変換を行う必要がなくなつて、高速化
を図ることが可能となる。しかしながら、この公報には、本方式は次の欠
点があるために、採用できないことを指摘してい
る。それは、 (1) ２つの異なつた仮想アドレスが同一の物理ア
ドレスを参照する場合、うまくゆかない。これ
はある仮想アドレスの内容を書き変えた時に同
一の物理アドレスを示す別の仮想アドレスで指
定されるキヤツシユメモリの内容が書き変つて
いなければならないためである。 (2) ページあるいはセグメントテーブルの内容を
書き変える場合に、キヤツシユの無効化をする
ためのスキヤンが必要である。 (3) ストレージキーは物理アドレスに対応してい
るために、プロテクシヨンチエツクが不可能と
なる。の３点である。特開昭56−38649号公報にも、同様に、キヤツ
シユメモリを仮想アドレスでアクセスする方法が
示されているが、上記の問題点の解決法は触れら
れていない。また、特開昭55−142476号公報には、アドレス
変換装置を複数のプロセツサで共有する方式が示
されている。これによれば、複数のプロセツサで
共有するために、マルチプロセツサ構成時に、経
済性を実現できるというものであるが、ここに
は、キヤツシユメモリは示されていない。本発明は、複数のプロセツサがあり、その中の
少なくともひとつのプロセツサが、仮想アドレス
でアクセスするキヤツシユメモリを持ち、アドレ
ス変換装置は全プロセツサで共有する構成に適用
されるものである。このような構成の目的は、アドレス変換装置に
要するハード量を削減した上で、実効メモリアク
セス時間の短縮をはかることにある。他の目的は、TLB間の一致化制御のような複
雑な処理が不要な方式を提供することである。他の目的は、入出力プロセツサなどからも仮想
アドレスでメモリをアクセスできる方式を提供
し、アドレスの一元管理を目指すものである。他の目的は、パイプライン制御を行う複数のユ
ニツトから成る計算機において、高速かつ経済的
な、メモリ制御方式を実現することである。しかしながらこの方式の最大の問題点は、特開
昭49−53339にも指摘されているように、主メモ
リのページあるいはセグメントテーブルを書換え
た場合に、キヤツシユメモリには、これが伝わら
ず、キヤツシユメモリの内容が主メモリの状態を
正しく反映しなくなることである。本発明の目的は、この問題点を解決したデータ
処理装置を提供することである。次に、前述した特開昭49−53339号公報に指摘
されている問題点について、本発明でどう解決し
ているかの考え方を説明する。まず第１の問題である２つの仮想アドレスが同
一の物理アドレスを示すことができないというこ
とであるが、このようなニーズは多重仮想記憶の
場合に必要であるが、多重仮想記憶が必要となる
第１の要因は、アドレスを指定するビツト長が24
ビツト程度と小さく仮想空間が2²⁴＝16Mega程度
でサイズが不十分である場合に必要となつたもの
で、2³²＝4Giga，2⁴⁸＝256Teraもの大きな仮想空
間をサポートできれば、多重仮想記憶のニーズは
少ない。また共通のサブルーチンや、データを異
なる仮想アドレスでアクセスすることは、エリヤ
管理も複雑となるために、良い方法とは言えな
い。従つて、全てのプログラム、データはユニー
クなアドレスを割付けられる単一仮想記憶の場合
は、これは問題点とはならない。次に第２の問題は、キヤツシユを仮想アドレス
でアクセスする場合の本質的なことであるが、こ
れを少し詳細に説明する。アドレス変換テーブル
を更新するのは、(i)あるプログラム実行中にミツ
シングページフオールトが発生し、必要なページ
をロールインする場合、あるいは、空エリヤを作
るためにあるページをロールアウトする場合と、
(ii)プログラムを生成して、ある仮想アドレスに割
付ける、あるいは削除する場合である。このよう
な場合に、キヤツシユメモリには、既にロールア
ウトして主メモリにはないデータが残つていた
り、新しいプログラムが生成されたのに、以前の
プログラムがキヤツシユメモリに残つていたりす
ることになる。これに対して、キヤツシユには、
既にロールアウトされた情報が残つているという
点は、キヤツシユメモリは、外部メモリも加えた
空間でのキヤツシユと考えれば良く、既にロールア
ウトされた情報を読出せても不都合は生じない。逆に主メモリのサイズが、キヤツシユの容量分
だけ増える訳で、本方式の利点と言うこともでき
る。主メモリに書込もうとした時には、ストアス
ルー方式のキヤツシユメモリでは、書込毎に毎回
キヤツシユと主メモリの両方を更新するために、
毎回アドレス変換装置を通り、ここで該ページが
主メモリ上にあるか否かチエツクされる。従つて
ロールアウトされたページを使いながら実行して
いるプログラムは、読出しで、キヤツシユがヒツ
トしている限り続行し、キヤツシユミスか、ある
いは、主メモリ書込が起つた時に、ページフオー
ルトが発生することになる。ここでアドレス変換
装置に於けるチエツクとは、ページの状態には(a)
主メモリ上に存在する、(b)ページング中、(c)主メ
モリ上に存在しないの３つの状態があり、一般の
プロセツサは、(a)の時のみアクセスが可能で、外
部メモリとの転送を行うフアイルプロセツサは(b)
の時にもアクセスが可能であり、この規則に合つ
ているか否かのチエツクである。ページフオールトが発生した場合には、キヤツ
シユメモリ内の該当するブロツクは無効化される
ため、以後はキヤツシユでヒツトしない。尚、リ
ードアクセス時にページフオールトが発生した場
合には、該ブロツクをキヤツシユに書込まないよ
うに制御する。次に、今まで実行していたプログラムが完了し
て、同一の仮想アドレスに新たにプログラムが生
成される場合には、外部メモリから転送されるの
が普通であるが、この場合、外部メモリからの転
送中に、キヤツシユを無効化する手段を設けるこ
とによつて解決される。具体的には、外部メモリ
からの主メモリへの転送は、仮想アドレスで行
い、このアドレスをキヤツシユメモリは監視して
おいて、もしもキヤツシユメモリ内に該当するブ
ロツクがあれば無効化を行うものである。次に第３のプロテクトの問題であるが、プロテ
クトは、物理アドレスよりも、仮想アドレスの方
が、プログラム毎にユニークなストレージキイを
割当てることが可能であり、良い方法であると考
えられる。但し書込みは、必ずアドレス変換装置
を通るために書込みプロテクトは必ずチエツクさ
れる。書込みプロテクトエラーが検出された時に
は、キヤツシユメモリの対応するブロツクは無効
化され、キヤツシユメモリに書込んだデータが使
用されないようにする。実行プロテクトは、メモ
リの読出しのために、キヤツシユ内に該当するも
のがあれば、ここで返送しアドレス変換装置を経
由しないために工夫が必要である。本発明では、
キヤツシユメモリを命令キヤツシユとデータキヤ
ツシユに分離した例を示しているが、この例で
は、実行プロテクトエラーが発生した場合にはキ
ヤツシユメモリに入れないように制御している。以上で、本発明の、従来問題点とされていた項
目に対する解決法が理解できたと思われるが、本
発明はマルチプロセツサに於ける問題点をも同様
な方法で解決している。従来のマルチプロセツサ
構成で、アドレス変換テーブルを書き換えた場合
には、他のプロセツサに対して指令を送つて他の
プロセツサが所有するTLBの無効化（TLB
Purge）を行つていた。これによつて、以前のプ
ログラムがキヤツシユメモリに残つていても
TLBを無効化することによつて、該当するキヤ
ツシユメモリは使用しないようにできたわけであ
る。本発明が適用されるマルチプロセツサに於て
は、あるプロセツサがアドレス変換テーブルを書
換えた時に他のプロセツサのキヤツシユメモリ
が、主メモリと一致しなくなる点が問題である
が、前述のように、外部メモリからの転送を仮想
アドレスで行い、キヤツシユメモリでは、この仮
想アドレスを監視しておいて、該当するブロツク
がキヤツシユメモリ内にあれば、無効化を行うこ
とによつて解決している。以下、本発明の一実施例を図面を参照して詳細
に説明する。第１図は本発明が適用されるデータ処理装置の
全体構成の一例を示す図である。第１図において、１０はプログラムおよびデー
タを格納する主メモリで、メモリバス１１、メモ
リコントローラ（MCU）１２を介して共通バス
５０に接続されている。２０は、主メモリ１０に格納されるべきプログ
ラムおよびデータを格納する外部メモリで、外部
メモリバス２１、フアイルプロセツサ（FCP）
２２を介して共通バス５０に接続されている。３
０は入出力プロセツサ（IOP）であり、図示しな
い各種入出力装置とのデータ転送の制御を行う。４０はジヨブプロセツサ（JOBP）であり、こ
こでは１つだけを示しているが、プログラム（命
令）の実行を行う。ジヨブプロセツサ４０は、命令キヤツシユ４
１、データキヤツシユ４２、Ｉユニツト４３およ
びＥユニツト４４により構成され、命令キヤツシユ４
１とＩユニツト４３はバス４５で接続され、デー
タキヤツシユ４２とＥユニツト４４はバス４６で
接続され、Ｉユニツト４３とＥユニツト４４はバ
ス４７で接続されている。このように、フアイルプロセツサ２２、入出力
プロセツサ３０およびジヨブプロセツサ４０は、
いずれも共通バス５０に接続され、メモリコント
ローラ１２を介して主メモリ１０をアクセス可能
になつている。ジヨブプロセツサ４０は、Ｉユニツト４３とＥユ
ニツト４４でパイプライン処理をするもので、前
記の如くそれぞれのユニツトに対して命令キヤツ
シユ４１とデータキヤツシユ４２を有する。尚プログラム（命令）が扱うデータはオペラン
ドとも呼ばれ、このデータキヤツシユのことをオ
ペランドキヤツシユと呼ぶ場合がある。次に実行すべき命令語をＩユニツト４３がアク
セスする場合、まず、命令キヤツシユ４１上にそ
の命令語が存在するか否かチエツクされ、存在す
る場合には、そのデータが命令語としてバス４５
を介してＩユニツト４３へ送られる。存在しない
場合は、命令語の仮想アドレスを共通バス５０を
介してメモリコントローラ１２へ送出する。メモリコントローラ１２では、仮想アドレスを
物理アドレスに変換してメモリバス１１を介して
主メモリ１０をアクセスする。得られたデータ
（命令）は、共通バス５０を介して、命令キヤツ
シユ４１へ送られ、さらにバス４５を介してＩユ
ニツト４３へ送られ、Ｉユニツト４３で処理され
ると同時に、命令キヤツシユ４１へ貯わえられ
る。Ｉユニツト４３では、この得られた命令を解読
し、Ｅユニツト４４に対して「何を為すべきか」
を指示する。Ｅユニツト４４は、この指令に基づ
き、必要なデータを内部のレジスタやデータキヤ
ツシユ４２から（データキヤツシユ４２上にない
場合は、命令キヤツシユと同様に主メモリ１０か
ら）集め、演算処理し、その結果を内部のレジス
タか主メモリ１０に格納する。後者の主メモリ１
０に結果を格納する際には、該当する位置のデー
タが既にデータキヤツシユ４２内に取込まれてい
るならば、そのデータも更新する。次に共通バス５０の構成例について説明する。
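
The access path described above (a cache looked up by virtual address, with translation performed only in the shared memory controller) can be sketched as follows. This is a minimal illustration, not the patented circuit: the class names `Mcu` and `VirtualCache`, the 4096-byte page size, the dictionary-based storage, and the choice not to allocate a cache entry on a write miss are all assumptions made for the example.

```python
PAGE = 4096  # assumed page size; the text gives no value at this point


class Mcu:
    """Shared memory controller holding the only virtual-to-physical mapping."""

    def __init__(self, page_table):
        self.page_table = page_table  # virtual page number -> physical page number
        self.memory = {}              # physical address -> stored value
        self.translations = 0         # counts how often translation was needed

    def _translate(self, va):
        self.translations += 1
        return self.page_table[va // PAGE] * PAGE + va % PAGE

    def read(self, va):
        return self.memory.get(self._translate(va), 0)

    def write(self, va, value):
        self.memory[self._translate(va)] = value


class VirtualCache:
    """Cache keyed by virtual address, so a hit needs no translation."""

    def __init__(self, mcu):
        self.mcu = mcu
        self.lines = {}  # virtual address -> cached value

    def read(self, va):
        if va in self.lines:          # hit: answered without the MCU
            return self.lines[va]
        value = self.mcu.read(va)     # miss: the MCU translates and reads
        self.lines[va] = value        # keep a copy for later reads
        return value

    def write(self, va, value):
        self.mcu.write(va, value)     # store-through: memory is always written
        if va in self.lines:          # update the copy only if already cached
            self.lines[va] = value
```

In this sketch a repeated read of the same address increments `translations` only once, while every write increments it, which mirrors the statement that writes always pass through the memory controller.
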
共通バス５０は第２図に示す様に、実際に情報を
転送するのに使用される起動バス５５、データバ
ス５６、応答バス５７と、これらのバス５５〜５
７をそれぞれどのプロセツサあるいはメモリコン
トローラが使用するかを決めるのに必要な起動バ
ス占有要求線５１、データバス占有要求線５２、
応答バス占有要求線５３とインタロツク信号線５
４を含んでおり、時分割で使用される。各バス５５〜５７の情報の中味は次の通りであ
る。 (1) 起動バス５５ (a) アドレス (b) アクセスの種類（例えばリードアクセスで
あるか／ライトアクセスであるか、また何バ
イトアクセスするか、等） (c) アクセスキー（MCU１２で行うプロテク
シヨンチエツクに使用する。） (2) データバス５６ (a) ライトデータ (b) リードデータ (3) 応答バス５７ (a) 終了信号 (b) リターンコード（アクセス中に、発生した
エラー及びページフオールトの情報）などである。これらのバス５５〜５７が、どの様に使用され
るかを第３図で示す。この図で示される様に、 (i) ａのリード要求とｂのリード応答 (ii) ａのリード要求とｄのライト応答 (iii) ｃのライト要求とｄのライト応答の３つの組み合せの転送が、同一のタイムスロツ
トで同時に可能となる。次にバス５５〜５７の使用の様子を第４図で示
す。この図では、タイムスロツト０でJOBP４０
がMCU１２にメモリリード起動をかけ、それに
対するリードデータがタイムスロツトＮとＮ＋１
で返されて来ており、またタイムスロツト１で
IOP３０がMCU１２にメモリライト起動をか
け、それに対する応答がタイムスロツトＮ＋２で
返されている。この様に共通バス５０では、起動
と応答を分離した、いわゆるスプリツト転送を行
う。また、主メモリ１０は複数のメモリアクセス
を処理出来る構成となつている。以上、述べてきたバス５５〜５７の転送を行う
に当つて、その前に占有制御を行う必要がある。
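
The three transfer combinations that the text says can share one time slot follow from which buses each transfer occupies. The sketch below derives them, assuming the bus usage implied by the bus contents listed above: a read request uses only the start bus, a read response uses the data and response buses, a write request uses the start and data buses, and a write response uses only the response bus. The dictionary `BUS_USE` and the function name are illustrative.

```python
from itertools import combinations

# Buses occupied by each transfer type (labels a-d follow Fig. 3 in the text).
BUS_USE = {
    "a: read request": {"start"},
    "b: read response": {"data", "response"},
    "c: write request": {"start", "data"},
    "d: write response": {"response"},
}


def compatible_pairs(bus_use):
    """Return the pairs of transfers that need no bus in common."""
    return [
        (x, y)
        for x, y in combinations(bus_use, 2)
        if not (bus_use[x] & bus_use[y])  # empty intersection: no conflict
    ]
```

Under these assumptions `compatible_pairs(BUS_USE)` yields exactly the pairs (a, b), (a, d) and (c, d), which are the three combinations named in the text.
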
これは転送を希望するプロセツサやメモリコント
ローラが、転送の１タイムスロツト前に、転送に
使用するバスに対する占有要求５１〜５３を出
し、これに対して優先順位を付けて転送を許可す
ることによつて行う。この優先順位の付け方は、
色々な方法が考えられるが、ここではその詳細に
ついては省略する。ただし、応答による占有要求
は、起動による占有要求より優先レベルを上げ
る。というのは、起動による占有要求によつて応
答が返せない事態になると、メモリコントローラ
上で起動の処理が詰まつてしまい、デツドロツク
状態となるからである。例えば、本実施例の場
合、第３図に示すｂのデータリード応答と、ｃの
データライト起動による占有要求が競合した場合
には前者が優先される。以上の占有制御の様子を簡略化して第５図に示
している。タイムスロツト０ではJOBP４０と
IOP３０がリード起動をしようとして、各々が起
動バス占有要求５１を出している。この内、
JOBP４０の方がIOP３０より優先レベルが高い
ものとすると、タイムスロツト１でJOBP４０は
起動バス５５を使用してリードの起動を行い、同
時に占有要求を止める。一方、IOP３０は占有が
許可されなかつたので、タイムスロツト１でも起
動バス占有要求５１を出したままとする。このス
ロツト１では、JOBP４０からの占有要求がなく
なるので、タイムスロツト２でIOP３０はリード
起動が可能となる。この様なシステムにおいて各プロセツサが、他
のプロセツサからのアクセスを排除して、すなわ
ちインタロツクして主メモリ１０をアクセスする
場合には、起動バス５５を他のプロセツサに使用
させない様にする。というのは、起動バス５５を
占有することで、他のプロセツサから今後発生す
る起動を排除し、また既に主メモリ１０内で処理
中のメモリ起動に対しては、データバス５６、応
答バス５７を使用して応答を返すことを可能にす
るためである。もし、これらの応答が返せない
と、メモリコントローラ１２上で起動の処理が詰
まつてしまい、デツドロツク状態になつてしまう
からである。次に、この起動バス５５の占有方法の一例を説
明する。メモリコントローラ１２をインタロツク
してアクセスしようとするプロセツサは、第６図
に示す様に起動バス占有要求５１が受付けられ、
起動バス５５に情報を転送するタイムスロツト
で、起動バス５５を占有していることを示すイン
タロツク信号５４を出す。そして、この信号によ
り他のプロセツサからの起動バス占有要求５１が
受付けられない様に制御する。これは例えば第７
図の回路によつて実現される。この図では、各占
有要求５１〜５３の優先判定回路６１は各プロセ
ツサごとに分散して持ち、インタロツク信号線５
４はオープン・コレクタの信号線としている。ま
ず、インタロツクの信号５４が出てない場合は、
各占有要求５１〜５３を優先判定回路６１でチエ
ツクし、自分の出した起動バス占有要求５１の優
先度が一番高い場合には、アンドゲート６２、オ
アゲート６３を通して起動バス５５の占有許可信
号６４が出る。従つて、このプロセツサは次のタ
イムスロツトで、起動バス５５に対して情報の転
送が可能である。また、この際プロセツサからイ
ンタロツク要求信号６５が出されていると、アン
ドゲート６８を介してＪ−Ｋフリツプフロツプ６
６がセツトされ、インタロツク信号５４が出力さ
れる。このインタロツク信号５４は、インタロツ
ク解除信号６７が出されるまで出力されており、
この間このプロセツサは起動バス５５を占有した
ままとなる。次に、他のプロセツサからインタロ
ツク信号５４が出されている場合には、アンドゲ
ート６２で優先判定回路６１の出力が禁止される
ので、起動バス占有許可信号６４が出ないため、
起動バス５５が使用できず、従つてメモリ起動も
出来ない。次にMCU１２について説明する。 MCU１２は、通常のメモリアクセスの処理の
他、仮想アドレスから物理アドレスへのアドレス
変換や、プロテクシヨンのチエツクを行う。また、各プロセツサ間で共通に使用され、高い
スループツトが要求されるため、リード処理とラ
イト処理は、第８図Ａ，Ｂに示すように、いくつ
かのステージ〜又は′〜′に分かれてお
り、複数個のアクセスを第８図Ｃに示すようにオ
ーバラツプさせて処理出来るようになつている。第９図は、MCU１２の構成の一例を示したも
のであるが、第８図Ａ，Ｂに示した各処理ステー
ジでは次のような動作を行う。 (A)：リード処理ステージの動作共通バス５０からリード起動受信起動バス５５上の仮想アドレス（VA）、アク
セスの種類（FUNC）、アクセスキー
（AKEY）を共通バス受信用レジスタ７１に
取込む。アドレス変換とプロテクシヨンチエツクアドレス変換装置７５より、仮想アドレス
（VA）で示されるページが、主メモリ１０に
あるか否かの判定を行い、ある場合には物理
アドレス（PA）に変換する。ない場合は、
いわゆるページフオールトとなる。また、この時プロテクシヨンチエツク回路
７６で、そのアクセスが許可されているもの
か否かの判定を行う。このアドレス変換装置７５とプロテクシヨ
ンチエツク回路７６については、後で詳細に
述べる。これらのプロテクシヨンチエツクの結果
と、ページフオールト情報は、他のエラー情
報と共にリターンコード（RC）として、ア
クセスの種類（FUNC）や物理アドレス
（PA）と共にアクセスレジスタ７２にセツト
される。メモリリード起動アクセスレジスタ７２にあるアクセスにエラ
ーやページフオールトが発生していない場合
には、メモリコントローラ７７が、アクセス
レジスタ７２上の物理アドレス（PA）で、
主メモリ１０にメモリ起動１５１をかけ、主
メモリ１０がその起動を受取つたら、アクセ
スの種類（FUNC）とリターンコード
（RC）を一時記憶レジスタ７３へ移す。また、アクセスレジスタ７２にあるアクセ
スが、既にエラーやページフオールトの発生
を示している場合は、メモリ起動をせず、前
記の情報を一時記憶レジスタ７３へ移す。リードデータ受信とデータ、応答バス占有
要求主メモリ１０からメモリバス１１を介して
リードデータ１５４を受取ると共に、アクセ
スの種類（FUNC）とリターンコード
（RC）を共通バス送出用レジスタ７４へ移
す。一方、共通バス５０に対してはデータバス
占有要求５２と応答バス占有要求５３を出力
する。リードデータ、応答バス転送の占有要求５２，５３が受付けられたら、
リードデータ１５４をバス１５５を介してデ
ータバス５６に転送し、また、終了信号とリ
ターンコード（RC）をバス１５６を介して
応答バス５７に転送し、それぞれアクセス元
のプロセツサに返す。 (B)：ライト処理ステージの動作 ′ 共通バス５０からライト起動受信起動バス５５上の仮想アドレス（VA）、アク
セスの種類（FUNC）、アクセスキー
（AKEY）及びデータバス５６上のライトデ
ータ（WD）を共通バス受信用レジスタ７１
に取込む。 ′ アドレス変換とプロテクシヨンチエツクライトデータを（WD）をアクセスレジスタ
７２にセツトすることを除いて、リード処理
ステージＡのと同じ動作をする。 ′ メモリライト起動ライトデータ（WD）１５３を主メモリ１０
に転送することを除いて、リード処理ステー
ジＡのと同じである。応答バス占有要求アクセスの種類（FUNC）とリターンコード
（RC）を共通バス送出用レジスタ７４へ移
す。一方、共通バス５０に対しては、応答バ
ス占有要求５３を出力する。 ′ 応答バス転送 ′の占有要求５３が受付けられたら、終了
信号とリターンコード（RC）をバス１５６
を介して応答バス５７に転送し、アクセス元
のプロセツサに返す。以上の様に、リードとライトの処理は各ステー
ジに分けられており、異なるアクセスの処理の異
なる番号のステージは、第８図Ｃに示す様に並行
して処理可能である。この図では、共通バス５０
からイ4Byteリード起動、ロ4Byteライト起動、
ハ16Byteリード起動を、それぞれタイムスロツ
ト０，１，２で受取つて処理している。そしてタ
イムスロツト２の場合を見ると、イのメモリリー
ド起動３と、ロのアドレス変換とプロテクシヨン
のチエツク２′と、ハの共通バスからのリード起
動受信１を並行して行つている。ここで、ハの
16Byteリードはイの4Byteリードに比べて、〜
のステージを４回繰り返しているが、これは
4Byteを単位としたメモリインタリーブを行つて
いるためである。以下、これについて説明する。第１０図は主メモリ１０の構成の一例を示した
図であり、メモリボード（MB）１４（１４ａ〜
１４ｄ）は4Byteのデータ幅で構成され、各メモ
リボード１４ａ，１４ｂ，１４ｃ，１４ｄは
4Byte単位に付加されたアドレスの下位2bitが
00，01，10，11であるデータを持つている。そし
て16Byteのデータは、4Byteずつのメモリボード
１４ａ，１４ｂ，１４ｃ，１４ｄ上にあるため、
16Byteリードではメモリボード１４で競合をお
こすこと無く、第８図Ｃの様に連続してメモリボ
ードを起動し、リードデータを読み出して来るこ
とが可能となる。この様な16Byteリードは、主
にキヤツシユミス時にキヤツシユメモリへデータ
を送るブロツク転送に使用される。Ｉユニツト４３やＥユニツト４４が命令キヤツ
シユ４１やデータキヤツシユ４２をアクセスする
場合は、16Byteよりもつと小さな単位（この例
では4Byteとする）で行うので、この16Byteリー
ド時にはＩユニツト４３やＥユニツト４４が必要
とした4Byteのデータが残りのデータより早く渡
される様に制御し、アクセス時間を短縮する。そ
してこのためには、第１０図Ｂのごとくアドレス
に応じて、MCU１２から起動をかけるメモリボ
ード１４の順番を変更すれば良い。次に、アドレス変換とプロテクシヨンチエツク
について詳細に説明する。第１１図は、第９図のアドレス変換装置７５を
中心として更に詳細に示した構成図であり、第１
２図は、アドレス変換の動作フローを示したもの
である。仮想アドレスから物理アドレスへの変換テーブ
ル１３０は、そのメモリ容量が大きいので、主メ
モリ１０の一部に置かれている。しかし、メモリ
アクセスが発生するたびに、仮想アドレスを物理
アドレスに変換するために、主メモリ１０をアク
セスしていてはオーバーヘツドが大きくなるた
め、最近アクセスしたアドレス変換情報を格納し
ておくTLB１１０がMCU１２に設けられてい
る。 TLB１１０には、アドレス変換テーブル１３
０の内、最近使用されたページの内容が格納され
ており、高速にアドレス変換が行なえるようにな
つている。TLB１１０における各ページの内容
は、有効ビツト（Ｖ）１１１、コネクト（Ｃ）ビ
ツト１１２、仮想アドレスの一部（VPA）１１
３、物理アドレスの一部（PPA）１１４、実行
プロテクシヨンビツト（EP）１１５およびスト
レージキー（SKEY）１１６からなつている。Ｖ
ビツト１１１とＣビツト１１２は、該当ページの
現在の状態を示し、Ｖビツト１１１が「０」の場
合は、TLB１１０の該当ページの内容が有効な
データでない（無効）ことを示す。Ｖビツト１１１とＣビツト１１２が共に「１」
の場合は、該当ページが、現在主メモリ１０と外
部メモリ２０との間で転送されていること、すな
わち、ページング中であることを示し、Ｖビツト
１１１が「１」で、Ｃビツト１１２が「０」の場
合は、該当ページが主メモリ１０にあり、メモリ
アクセス可能なことを示している。このように、ページング中である状態を付加し
ているのは、ページングを行つているエリアを
FCP２２からのページングアクセス以外のアク
セスができないようにするためである。本システムでは、仮想アドレスから物理アドレ
スへのアドレス変換を、MCU１２で、各プロセ
ツサに共通に行なわせているので、FCP２２に
よりページングを行なつているアクセスであつて
も、同じアドレス変換装置７５を経由することに
なり、そのページング中のエリアを他のプロセツ
サがアクセスすることを許可すると、データの破
壊や喪失につながる。従つて、上記した如く、Ｖ
ビツト１１１とＣビツト１１２が共に「１」を示
している場合には、FCP２２からのページング
アクセスのみ許可することにより、上記の不都合
を解決しているのである。次に仮想アドレスの一部（VPA）１１３は、
TLB１１０でアドレス変換を行う際に、該当す
る仮想アドレス（VA）の変換対がTLB１１０に
登録されているか否かをチエツクするためのもの
であり、また、物理アドレスの一部（PPA）１
１４はTLB１１０の変換対があつた時に、物理
アドレス（PA）を作成するためのものである。仮想アドレス（VA）は、セグメントアドレス
（SA）１２１、ページアドレス（PA）１２２、
ページ内アドレス（DISP）１２３からなり、上
記の物理アドレスの一部（PPA）１１４は、ペ
ージ内アドレス（DISP）１２３と連なつて物理
アドレス（PA）を作る。実行プロテクシヨンビツト１１５（EP）は、
データに対し誤まつて命令読出し、実行すること
を防ぐためのものであり、プロテクシヨンチエツ
ク回路７６でこのビツトが「１」のエリアに対し
て命令読出しすると実行プロテクトエラーとな
る。従つて本構成例の様に、JOBP４０で命令キ
ヤツシユ４１とデータキヤツシユ４２が分れてい
る場合には、命令キヤツシユ４１からのこのエリ
アに対するアクセスは、全て実行プロテクトエラ
ーとなる。ストレージキー（SKEY）１１６は、ライトプ
ロテクシヨンを行うためのもので、要求元プロセ
ツサから転送されてきたアクセスキー（AKEY）
と共にプロテクシヨンチエツク回路７６により、
ライトアクセスが許可されるか、禁止されるかを
調べられ、後者の場合はライトプロテクトエラー
となる。アクセスキー（AKEY）は、この様にSKEY１
１６との比較によるライトプロテクトエラーのチ
エツクに使う他、FCP２２からのページングア
クセスか否かの情報や、命令読出しであるか否か
の情報を含んでおり、これらのプロテクトチエツ
クにも使用する。次に、変換過程を、第１２図のフローチヤート
を参照して順次説明する。メモリアクセスの種類は大きく次の２つに分け
られる。すなわち、 (1) 一般のプロセツサによるメモリアクセス (2) FCP２２によるページング時のメモリアク
セスの２つである。この(1)，(2)のアクセスの区別は、
アクセスキーAKEY上にあり、信号線１４０を経
由してアドレス変換コントローラ１２５に伝えら
れる。まず、一般的な(1)の場合のメモリアクセスのア
ドレス変換やアクセスの許可の判定について説明
する。あるプロセツサ（JOBP４０又はIOP３０）か
ら出力された仮想アドレスは、共通バス５０を経
由してMCU１２内の共通バス受信用レジスタ７
１内の仮想アドレスレジスタ１２０にセツトされ
る。この仮想アドレスレジスタ１２０にセツトさ
れた仮想アドレスは、セグメントアドレス
（SA）１２１及びページアドレス（PA）１２２
の一部分１２０−２をアドレスとしてまずTLB
１１０をアクセスする。これにより読み出された
TLB１１０のエントリのＶビツト１１１および
Ｃビツト１１２は、アドレス変換コントローラ１
２５に伝えられ、そのパターンにより、その後の
処理が次の〜のように３つに分かれる。これは、第１２図のフローのステツプ（FO
５）に相当している。Ｖビツト１１１＝０，Ｃビツト１１２＝０の
時。これは、第１２図で、「０，０」と表示して
いるところであり、前述した如く、TLB１１０の
該当ページ（エントリ）は無効であり、主メモ
リ１０上の変換テーブル１３０を読み出す。
（Ｆ１０）この時、すなわち、TLBミス時の詳
細な動作は後述する。Ｖビツト１１１＝１，Ｃビツト１１２＝１の
時。第１２図で「１，１」の時であるが、この
時、仮想アドレスの一部分１２０−１とTLB
１１０の仮想アドレスの一部分VPA１１３を
コンパレータ１２４で比較した結果、一致し、
TLBヒツト信号１４１が出力されていれば
（Ｆ２０５）、該当ページは現在ページング中で
あることを示しているので、そのメモリアクセ
スを禁止し、アドレス変換コントローラ１２５
よりミツシングページフオールト信号１４２を
出力する。（Ｆ４５） TLBヒツト信号１４１が出力されていない
時は、TLBミスであるのでと同様に、主メ
モリ１０上の変換テーブル１３０を読み出す。
（Ｆ１０）Ｖビツト１１１＝１，Ｃビツト１１２＝０の
時。第１２図で、「１，０」の時であるが、ま
ず、TLBヒツト信号１４１がチエツクされ、
（Ｆ３０）出力されていない時は、プロテクシ
ヨンチエツク回路７６からのプロテクトエラー
信号１４３をチエツクし、エラーが発生してい
なければ、仮想アドレスレジスタ１２０のペー
ジ内アドレス部１２３とTLB１１０上の物理
アドレスの一部１１４を連結して物理アドレス
をセレクタ１２８を介しアクセスレジスタ７２
上に作成し、その物理アドレスをメモリアドレ
スバス１５２に送り、主メモリ１０をアクセス
するためメモリコントローラ７７よりメモリ起
動信号１５１を出力する。（Ｆ４０）次に(2)のFCP２２によるページング時のメモ
リアクセスについて説明する。 FCP２２より出力された仮想アドレスは、共
通バス５０を経由してMCU１２内の仮想アドレ
スレジスタ１２０にセツトされる。この場合も、まずTLB１１０をアクセスし、
アクセスしたTLB１１０のエントリのＶビツト
１１１及びＣビツト１１２のパターンにより、先
程と同様にその後の処理が３つに分かれる。Ｖビツト１１１＝０，Ｃビツト１１２＝０の
時。主メモリ１０の変換テーブル１３０の読み出
しを行う。（Ｆ１０）Ｖビツト１１１＝１，Ｃビツト１１２＝１の
時。この時、TLBヒツト信号１４１がチエツク
される。（Ｆ３０）TLBヒツトを示していれ
ば、アクセスレジスタ７２上で作成された物理
アドレスで主メモリ１０をアクセスする。（Ｆ
４０） TLBヒツト信号が出ていない場合は、主メ
モリ１０の変換テーブル１３０を読み出す。
（Ｆ１０）Ｖビツト＝１，Ｃビツト１１２＝０の時 TLBヒツト信号１４１がチエツクされる。
（Ｆ２１５） TLBヒツト信号１４１が出ている時は、禁
止区域をアクセスしていることになるので、
FCP２２にエラーを知らせる。（Ｆ２２０）次に、TLBミスの場合の、主メモリ１０上の
変換テーブル１３０を読み出す時の処理を説明す
る。変換テーブル１３０は、テーブルに必要なメモ
リ容量を減らすため、アドレス変換に必要な情報
を有するページテーブル１３２と、そのページテ
ーブル１３２の先頭アドレスを保持するセグメン
トテーブル１３１から成る。TLBミス時には、
まずセグメントテーブルの先頭アドレスを保持す
るレジスタ１２６（STOR）の内容と、仮想アド
レスレジスタ１２０のセグメントアドレス
（SA）１２１をアダー１２７で加算して物理アド
レスを作り、それでセグメントテーブル１３１の
該当する位置の内容をリードデータバス１５５上
に読み出して来る。このデータには、ページテー
ブル１３２の先頭アドレスが保持されており、こ
の値と、仮想アドレスレジスタ１２０のページア
ドレス（PA）１２２をアダー１２７で加算して
アドレスを作り、ページテーブル１３２から変換
に必要な情報を読み出す。（Ｆ１０）このページテーブル１３２には、Ｍビツトの
他、TLB１１０内の、Ｖビツト１１１と仮想ア
ドレスの一部１１３（VPA）を除く情報を含ん
でおり、このＭビツトとＣビツトがリードデータ
バス１５５の一部１５５−１を介してアドレス変
換コントローラ１２５に入力され、これらのビツ
トパターンにより次の様な処理を取る。Ｍビツト＝０，Ｃビツト＝０の時該当ページは主メモリ１０上に無く、外部メ
モリ２０上にあることを示しており、このペー
ジのアクセス要求に対してはミツシングページ
フオールト信号１４２を出して、該当プロセツ
サにページフオールトを知らせる。（Ｆ４５）Ｍビツト＝０，Ｃビツト＝１の時該当ページは現在ページング中であることを
示しているので、FCP２２以外のメモリアク
セスに対してはそのメモリアクセスを禁止し、
ミツシングページフオールト信号１４２を出
す。（Ｆ４５）FCP２２からのメモリアクセス
の場合は、TLB１１０に登録してアクセスを
行う。（Ｆ２０）Ｍビツト＝１，Ｃビツト＝０の時読み出されたリードデータの一部１５５−２
と仮想アドレスの一部１２０−１、及びＶビツ
ト「１」がTLB１１０に登録され、（Ｆ２０）
Ｖ，Ｃビツトのチエツクルーチンに戻る。以上述べたように、アドレス変換装置７５は、
各プロセツサからの仮想アドレスによるメモリア
クセスに対し、仮想アドレスから物理アドレスへ
のアドレス変換を集中して実施することが可能で
アドレス変換の制御が単純となる。また、FCP２２からのアクセスと、他のプロ
セツサからのアクセスとの制御方式を変更するこ
とにより、ページング中のページに対する他のプ
ロセツサからのアクセスを禁止することが可能
で、データの保全が可能となる。次に、ミツシングページフオールト時の動作に
ついて説明する。ページフオールト信号を要求元プロセツサが受
取つた時には、その時に実行していたタスクを中
断し、要求したアドレスを含むページを主メモリ
１０にロードするために、FCP２２に起動をか
ける。FCP２２はこの起動を受けて、該当ペー
ジを読み出し、これが完了すると終了割込みを発
生する。この時には必要なページは主メモリ１０
上にロールインされているために、前記中断され
たタスクを再開する。このタスクが中断されてい
る間、当該プロセツサは他のタスクを実行する。次に命令キヤツシユ４１とデータキヤツシユ４
２について説明する。第１３図は命令キヤツシユ
４１の構成例を示した図である。主メモリ１０か
らコピーして来たデータがキヤツシユデータ部８
１−Ｉ上にあり、そのデータのアドレスがデイレ
クトリイ８２−Ｉと無効化デイレクトリイ８３−
Ｉにあり、またこれらが有効か否かを示す情報が
有効ビツトレジスタ８４−Ｉにある。デイレクト
リイ８２−Ｉと無効化デイレクトリイ８３−Ｉの
内容は同じであり、性能を高めるため分けてあ
る。前者はＩユニツト４３がアクセスしたデータ
がキヤツシユデータ部８１−Ｉにあるか否かのチ
エツクに使用し、後者は他のプロセツサが主メモ
リ１０に書込んだデータがキヤツシユデータ部８
１−Ｉに取込まれている場合に、既にそのデータ
は古くなつているので無効化しなければならない
（これを無効化処理と呼ぶ）が、そのためのチエ
ツクに使用する。次に、この命令キヤツシユ４１の動作について
説明する。なお、命令キヤツシユ４１はデータキ
ヤツシユ４２とは異なり、ライトアクセスは処理
しない。第１４図はリードアクセスのキヤツシユミス時
のフロー、第１５図は無効化処理のフローを示し
ている。 (1) リードアクセス（第１４図参照）Ｉユニツト４３から起動信号９１−Ｉが来
たら、仮想アドレス９２−Ｉの一部、ここで
はビツト１８−２７でデイレクトリイ８２−
Ｉと有効ビツトレジスタ８４−Ｉの内容を読
み出し、デイレクトリイ８２−Ｉの内容と仮
想アドレス９２−Ｉのビツト０−１７をコン
パレータ１６０−Ｉで一致チエツクを行い、
またその内容をパリテイチエツカー１６１−
Ｉでチエツクする。そしてコンパレータ１６
０−Ｉが一致を示し、パリテイエラーが発生
してなく、かつ有効ビツトレジスタ８４−Ｉ
が有効であることを示しているならば、ゲー
ト１６９−Ｉを介してキヤツシユヒツト信号
１７０−Ｉが命令キヤツシユコントローラ１
６２−Ｉに出され、命令キヤツシユコントロ
ーラ１６２−Ｉは、仮想アドレスのビツト１
８−２９でアクセスされたキヤツシユデータ
部８１−Ｉの内容を、リードデータバス９４
−Ｉに乗せると共に、Ｉユニツト４３に対し
て終了信号９３−Ｉを返す。キヤツシユミスの場合は、命令キヤツシユ
コントローラ１６２−Ｉは、起動バス占有要
求５１を出す。占有要求５１が許可されたら、ゲート８５
−Ｉを開き、起動バス５５に仮想アドレス
（VA）、アクセスの種類（FUNC）、アクセス
キー（AKEY）を転送する。なお、このアク
セスキー（AKEY）には命令読み出しである
ことを付加する。セツト信号１７２−Ｉにより、仮想アドレ
スのビツト０〜１７をデイレクトリイ８２−
Ｉ、無効化デイレクトリイ８３−Ｉへ書き込
み、有効ビツトレジスタ８４−Ｉをセツトす
る。本処理をこの時点で行う理由は後で述べ
る。 MCU１２からデータバス５６を介して、
リードデータ（RD）が、また応答バス５７
を介して終了信号とリターンコード（アクセ
ス中に発生したエラー及びページフオールト
の情報）（RC）が送られてきたらレジスタ８
６−Ｉにラツチする。MCU１２の説明でも
述べたように、最初に送られて来たデータ
は、Ｉユニツト４３がアクセスしたデータで
あるので、リターンコード（RC）が次の(1)
〜(3)の状態を示している時（第１４図Ｂに示
す条件イ成立時、）は、Ｉユニツト４３に終
了信号９３−Ｉとリードデータ９４−Ｉとリ
ターンコード９５−Ｉを返す。 (1) No Error（エラーが発生してない時） (2) Page Fault（ページフオールトが発生
した時、） (3) Soft Error（ソフトによるエラー、例え
ばプロテクシヨンエラーが発生した時）また、Hard Error（ハードが原因のエラ
ー）の場合は、再度主メモリ１０をアクセス
することによつて、救える場合が多いので、
リトライを行う。この為、上記の信号を返さ
ないが、リトライ回数が規定回数を越えた場
合、すなわち、リトライオーバーの場合に
は、エラー報告を行うために、上記の信号を
返す。そして、主メモリ１０から読み出して
来たリードデータをキヤツシユデータ部８１
−Ｉに書き込む。，， MCU１２から送られてくる残り
のリードデータをキヤツシユデータ部８１−
Ｉに書込む。のステージで既にＩユニツト
４３に対しては終了信号９３−Ｉを戻してあ
るので、この間、Ｉユニツト４３は別な動作
が可能である。の段階ではこれに加えて、
〜のステージでエラーやページフオール
トが発生したかをチエツクし、発生してない
場合には命令キヤツシユ４１の動作を止め
る。エラーやページフオールトが発生している
場合には、のステージでセツトした有効ビ
ツトレジスタ８４−Ｉに対し、命令キヤツシ
ユコントローラ１６２−Ｉより有効ビツトク
リア信号１７１−Ｉを出して、クリアしキヤ
ツシユデータ部８１−Ｉの該当データを使用
出来ない様にする。また、のステージで
Hard Errorを起こし且つリトライオーバし
てない場合には（第１４図Ｂに示す条件ア成
立）、リトライを行うためのステージに飛
ぶ。以上がリードアクセスの処理手順であるが、先
程述べた様にキヤツシユ（命令、データキヤツシ
ユ共）では、いわゆる無効化処理が必要となる。
以下、その手順を説明する。 (2) 無効化処理（第１５図参照）起動バス５５転送中の仮想アドレス
（VA）とアクセスの種類（FUNC）を毎回レ
ジスタ８７に取込む。上記仮想アドレスのビツト１８−２７で無
効化デイレクトリイ８３−Ｉの内容を読出
し、無効化する必要があるか否かを無効化判
定回路１６５−Ｉでチエツクすると共に、そ
のアドレスのビツト１８−２７をレジスタ８
８−Ｉにセツトする。そして無効化が必要な場合は、レジスタ８
８−Ｉのアドレスで該当の有効ビツト８４−
Ｉをクリアする。このため無効化判定回路１
６５−Ｉから有効ビツトクリア信号１７１−
Ｉを出す。次に無効化が必要な場合を詳細に説明する。
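
The invalidation steps above can be sketched as a directory that watches the virtual addresses sent on the start bus. This is a simplified model under stated assumptions: a 32-bit virtual address with bit 0 as the most significant bit, so that bits 0-17 form the tag, bits 18-27 the directory index and bits 28-31 the offset within a 16-byte block; the class name `SnoopingDirectory` and the string labels for access type and requester are illustrative. Only the tag-match condition is modelled; the parity-error and directory-busy cases that the text lists next are left out.

```python
INDEX_BITS = 10   # address bits 18-27 select one of 1024 directory entries
OFFSET_BITS = 4   # address bits 28-31: byte within a 16-byte block (assumed)


def split(va):
    """Split a virtual address into (tag, index) as described in the lead-in."""
    tag = va >> (INDEX_BITS + OFFSET_BITS)
    index = (va >> OFFSET_BITS) & ((1 << INDEX_BITS) - 1)
    return tag, index


class SnoopingDirectory:
    """Invalidation directory plus valid bits for one virtually addressed cache."""

    def __init__(self, owner):
        self.owner = owner                       # name of the processor that owns the cache
        self.tags = [None] * (1 << INDEX_BITS)
        self.valid = [False] * (1 << INDEX_BITS)

    def fill(self, va):
        """Record a block fetched after a cache miss."""
        tag, index = split(va)
        self.tags[index] = tag
        self.valid[index] = True

    def hit(self, va):
        tag, index = split(va)
        return self.valid[index] and self.tags[index] == tag

    def snoop(self, va, func, source):
        """Watch one start-bus transfer; return True if the valid bit was cleared."""
        if func != "write" or source == self.owner:
            return False                         # reads and own writes never invalidate
        tag, index = split(va)
        if self.tags[index] == tag:              # another requester wrote a cached block
            self.valid[index] = False
            return True
        return False
```

For example, after `fill(0x12345670)` a write to `0x12345674` observed from another requester clears the entry, while the same write issued by the owner leaves it valid.
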
まず、無効化は、起動バス５５から取込んだアクセス
の種類（FUNC）がライトアクセスを示し、そ
れが他方からのものである時行う。そして、次
に示す条件のいずれかを満した時に無効化を行
う。 (a) レジスタ８７−Ｉのアドレスのビツト１８
−２７で無効化デイレクトリイ８３−Ｉを読
出し、その内容とアドレスのビツト０−１７
をコンパレータ１６３−Ｉで比較し、一致し
た時。 (b) 無効化デイレクトリイ８３−Ｉを読出した
際に、パリテイチエツカ１６４−Ｉでパリテ
イエラーが検出された時。 (c) 無効化デイレクトリイ８３−ＩをJOBP４０
内で使用している時。（にチエツクが出来
ないため）以上無効化処理について述べたが、次に(1)の
リードアクセスのステージでデイレクトリイ
８２−Ｉ、無効化デイレクトリイ８３−Ｉへア
ドレスを書込み、有効ビツトレジスタ８４−Ｉ
をセツトしなければならない理由を明らかにす
る。第１６図はリードアクセスがキヤツシユミス
になり、主メモリ１０をリードに行く場合と主
メモリ１０に対して他からライトアクセスが行
われた場合のキヤツシユの無効化処理が競合し
た時に、各部分がどの様に使用されるかをタイ
ムチヤートで示している。無効化処理について
は斜線で示してあり、それぞれタイムスロツト
１と３で起動バス５５、データバス５６を転送
中のライトアクセスに対して、タイムスロツト
２と４で無効化デイレクトリイ８３−Ｉのチエ
ツクを行い、タイムスロツト３と５の前半で有
効ビツトレジスタ８４−Ｉをクリアし無効化し
ている。一方、キヤツシユミスとなつたリード
アクセスは、タイムスロツト２で起動バス５５
にアドレスを転送しているので、主メモリ１０
へのアクセスの順番としては、タイムスロツト
１で起動バス５５を転送中のライトアクセスよ
り後で、タイムスロツト３で起動バス５５を転
送中のライトアクセスより前となる。従つてキ
ヤツシユと主メモリ１０のデータの一致を保つ
ためには、ライトアクセスが無効化デイレクト
リイ８３−Ｉをチエツクするタイムスロツト２
と４の間で、キヤツシユミスを起こしたリード
アクセスのアドレスを無効化デイレクトリイ８
３−Ｉに書込まなければならないし、またチエ
ツクの結果、有効ビツトレジスタ８４−Ｉを無
効化するタイムスロツト３と５の間で、有効ビ
ツトレジスタ８４−Ｉをセツトする必要があ
る。これらの制御が必要な理由は、主メモリ１
０で複数のメモリアクセスを同時に処理してい
るからである。なお本構成例では、アドレス情報を２ケ所、
すなわちデイレクトリイ８２−Ｉと無効化デイ
レクトリイ８３−Ｉに持つているため、無効化
デイレクトリイ８３−Ｉの方しか上記の制約を
受けないが、デイレクトリイを１ケ所に持つ場
合は、当然上記の制約を受ける。次にデータキヤツシユ４２について説明す
る。第１７図はデータキヤツシユ４２の構成例
を示した図であり、無効化処理の回路１８０−
Ｄは命令キヤツシユ４１と同じであるため省略
してある。尚第１３図と第１７図で、サフイツ
クスが違うだけのものは相当物である。第１３
図の命令キヤツシユではサフイツクスにＩ、第
１７図のデータキヤツシユではサフイツクスに
Ｄを使用している。命令キヤツシユ４１との大
きな違いは、ライトアクセスをサポートしなけ
ればならない点であり、このライトアクセス時
間を短縮するために共通バス送出用バツフア８
９−Ｄを設け、ライト時には仮想アドレス９２
−Ｄ、ライトデータ９５−Ｄ、制御情報９６−
Ｄをこのバツフア８９−Ｄにセツトしただけ
で、終了信号９３−ＤをＥユニツト４４に返
し、Ｅユニツト４４が次の処理を出来る様に制
御している。次に、このデータキヤツシユ４２の動作につ
いて説明する。但し、リードアクセスの処理は
命令キヤツシユ４１と同じであるので省略す
る。 (3) ライトアクセス（第１８図参照）。Ｅユニツト４４から起動信号９１−Ｄが来
たら、仮想アドレス９２−Ｄ、ライトデータ
９５−Ｄ、制御情報９６−Ｄ（アクセスの種
類、アクセスキー等）を共通バス送出用バツ
フア８９−Ｄにセツトし、Ｅユニツト４４に
対して終了信号９３−Ｄを返す。この際、デ
イレクトリイ８２−Ｄと有効ビツトレジスタ
８４−Ｄをチエツクしキヤツシユヒツト（信
号１７０−Ｄが出てる）ならば、キヤツシユ
データ部８１の仮想アドレスのビツト１８−
２７で示される位置にデータを書込む。データキヤツシユコントローラ１６２−Ｄ
より起動バス占有要求５１、データバス占有
要求５２を出す。両方の占有要求が許可されたら、ゲート８
５−Ｄを開き起動バス５５に仮想アドレス
（VA）、アクセスの種類（FUNC）、アクセス
キー（AKEY）を転送し、データバス５６に
はライトデータを転送する。 MCU１２から応答バス５７を通して終了
信号とリターンコードが送られてきたらレジ
スタ８６−Ｄにラツチする。そしてリターン
コードをチエツクし、エラーやページフオー
ルトを起こしてない時には共通バス送出用バ
ツフア８９−Ｄからそのアクセスを取り除
き、処理を終了する。一方、第１４図Ｂに示
したアの条件、すなわちHard Errorが発生
しかつリトライオーバしてない時には、リト
ライを行うためのステージに飛ぶ。上記以外の場合には、共通バス送出用バツ
フア８９−Ｄのアドレスで有効ビツトレジス
タ８４−Ｄをクリアすると共に、Ｅユニツト
４４に対してエラー、ページフオールトの発
生を報告する。有効ビツトレジスタ８４−Ｄ
をクリアする理由は、例えばプロテクシヨン
エラーの場合は、書込んではならないキヤツ
シユデータ部８１−Ｄのデータに対して、既
にのステージで書込みを行つているためで
ある。尚データキヤツシユ４２から主メモリ１０に
ライト起動したアドレスも起動バス５５からデ
ータキヤツシユのレジスタ８７−Ｄ（無効化処
理の回路１８０−Ｄに含まれている）にセツト
されるが、それに対しては、自分自身が出した
ものであるからデータキヤツシユコントローラ
１６２−Ｄより無効化処理の回路１８０−Ｄに
対して信号１７３−Ｄを送り無効化を行なわな
い様に制御する。命令キヤツシユ４１はライト
アクセスは行なわないので、この信号１７３−
Ｄに相当するものはない。以上詳細に説明したように、本発明によれば、
複数のプロセツサがあり、その中の少なくとも１
つのプロセツサは仮想アドレスでアクセスするキ
ヤツシユメモリを持ち、アドレス変換装置を全プ
ロセツサで共有する構成のデータ処理装置が実現
できる。

The present invention relates to a data processing device in which a plurality of processors share one main storage device (hereinafter referred to as main memory).

Of these processors, at least one accesses memory with virtual addresses in order to execute instructions; this processor is referred to here as a job processor.

At least one other processor accesses memory with virtual addresses in order to perform input/output with an external storage device, also called an auxiliary storage device (hereinafter referred to as external memory); this processor is referred to here as a file processor.

The job processor is provided with a cache memory that is accessed with virtual addresses.

The data processing device also has an address translation device that is used in common by all processors and translates virtual addresses into physical addresses.

In such a data processing device, the address translation table is updated when the file processor rewrites the contents of main memory (page roll-in and roll-out). The present invention addresses the problem this causes, namely that the conventional assumption that the cache memory is a copy of main memory no longer holds.

First, the background of the present invention is explained in detail.

A data processing device in which one main memory is shared by multiple processors is generally called a multiprocessor system.

Conventional multiprocessor systems have generally allowed multi-system configurations in a form that does not compromise the cost/performance of a single computer. As a result, the configuration was optimal for about two processors, but as the number of processors increased further, mutual interference grew and the hardware configuration became too large.

One of the problems that must be solved in a multiprocessor system is virtual memory. Virtual memory is well known: main memory and external memory are treated as apparently one unit, and when information requested by a processor is not in main memory but in external memory, the system automatically transfers a relatively little-used portion of main memory to external memory and transfers the requested information from external memory to main memory.

Transferring information from main memory to external memory is called roll-out; conversely, transferring information from external memory to main memory is called roll-in.

To carry out this roll-in and roll-out control, main memory and external memory are generally each divided into units called pages.

For each page, a translation table holds information on whether the page is currently in main memory and, if so, what its corresponding physical address (also called real address) in main memory is.

The address of a memory access from a processor is given as a virtual address; its upper address part is used to index the translation table, and translation into a physical address is performed.

Such a translation table needs one entry per virtual page, so the memory it requires is large. To reduce this capacity, many systems perform two-level address translation, splitting the translation table into a segment table and a page table.

Because the translation table requires a large amount of memory, as noted above, it is generally placed in main memory. If the translation table in main memory were checked on every memory access from the processor, each memory access request would always cause three or more memory accesses, and the overhead could not be ignored.

Therefore, many processors that use virtual memory are equipped with a high-speed buffer called a TLB, which stores the physical addresses corresponding to recently used virtual addresses.

With this arrangement, when a memory access from the processor occurs, the TLB is first checked for a corresponding address. If one is present, no memory access to the translation table is needed, and the access can proceed with little address translation overhead.

Conventionally, such an address translation device including a TLB was placed in each processor, and the virtual control method was realized with the help of the control unit within the processor.

On the other hand, to improve performance it has become common to provide each processor with a high-speed memory called a cache memory. The cache memory is a copy of part of main memory and is usually designed to be 5 to 10 times faster than main memory.

A memory access from the processor first checks whether the cache holds the corresponding item; if so, its contents are returned, and if not, main memory is accessed.

In a virtual memory system with a cache memory, the TLB, cache memory, and main memory were conventionally connected in that order as seen from the processor. This rests on the idea that the cache memory is a copy of main memory, which is real storage, so the cache memory had to be accessed after translation to a physical address.

This conventional approach has the following problems.

The first problem is that the effective memory access time becomes longer. This is because an access from the processor to the cache memory must always pass through the TLB; that is, the virtual address from the processor is translated into a physical address by the TLB before the cache memory is accessed.

The second problem is that every processor needs a TLB, so as the number of processors increases, not only does the amount of hardware grow accordingly, but TLB inconsistencies must also be corrected, and this correction becomes complicated.

To correct a TLB inconsistency, a processor that swaps a page updates the translation system and its TLB entry, but it must also notify the other processors so that they clear the corresponding entries. This is generally called TLB purge (translation lookaside buffer purge) and is one of the important points in configuring a multiprocessor.

The third problem arises because a general job processor accesses memory with virtual addresses, whereas an input/output processor, having no TLB, accesses it with physical addresses. A conversion is needed whenever addresses are passed between the two, which increases overhead.

The fourth problem is similar to the second. Even without a multiprocessor, a processor with advanced pipeline control has separate units for fetching instructions and for accessing operands, each with its own cache memory for speed. In that case too, the conventional approach requires a TLB in each unit, which increases the amount of hardware.

To solve these problems, Japanese Unexamined Patent Publication No. 49-53339 describes a method in which the cache memory is accessed with virtual addresses and address translation is performed only on a miss. With this method, address translation is no longer needed on every cache access, so higher speed can be achieved.

However, that publication points out that the method cannot be adopted because of the following three drawbacks.

(1) It does not work when two different virtual addresses refer to the same physical address. When the contents of one virtual address are rewritten, the cache contents designated by the other virtual address that indicates the same physical address must be rewritten as well.

(2) When the contents of a page or segment table are rewritten, a scan is required to invalidate the cache.

(3) Because storage keys correspond to physical addresses, the protection check becomes impossible.

Japanese Unexamined Patent Publication No. 56-38649 likewise describes a method of accessing a cache memory with virtual addresses, but it does not mention a solution to the problems above.

Japanese Unexamined Patent Publication No. 55-142476 describes a scheme in which an address translation device is shared by a plurality of processors. Because the device is shared, economy can be achieved in a multiprocessor configuration, but no cache memory is shown there.

The present invention applies to a configuration that has a plurality of processors, at least one of which has a cache memory accessed with virtual addresses, and in which the address translation device is shared by all processors.

The purpose of such a configuration is to shorten the effective memory access time while reducing the amount of hardware required for the address translation device.

Another purpose is to provide a scheme that needs no complicated processing such as consistency control between TLBs.

Another purpose is to provide a scheme in which input/output processors and the like can also access memory with virtual addresses, aiming at unified address management.

Another purpose is to realize a fast and economical memory control scheme in a computer composed of multiple units that perform pipeline control.

However, the biggest problem with this scheme, as also pointed out in Japanese Unexamined Patent Publication No. 49-53339, is that when a page or segment table in main memory is rewritten, the change is not conveyed to the cache memory, and the contents of the cache memory no longer correctly reflect the state of main memory.

An object of the present invention is to provide a data processing device that solves this problem.

Next, the thinking behind how the present invention resolves the problems pointed out in the above-mentioned Japanese Unexamined Patent Publication No. 49-53339 is explained.
The first problem is that two virtual addresses cannot point to the same physical address. Such sharing is needed in the case of multiple virtual memory. Multiple virtual memory was needed mainly because addresses were specified with only 24 bits, so the virtual space was small, about 2²⁴ = 16 Mega, and insufficient in size. If virtual spaces as large as 2³² = 4 Giga or 2⁴⁸ = 256 Tera can be supported, there is little need for multiple virtual memory. Furthermore, sharing subroutines or accessing data under different virtual addresses is not a good method because area management becomes complicated. Therefore, in a single virtual memory in which all programs and data are assigned unique addresses, this is not a problem.
Next, the second problem, which is essential when accessing a cache using virtual addresses, will be explained in some detail. The address translation table is updated (i) when a missing page fault occurs during execution of a program and the necessary page is rolled in, or a page is rolled out to create an empty area, and (ii) when a program is created and assigned to a certain virtual address, or deleted. In such cases the cache memory may hold data that has already been rolled out and is no longer in the main memory, or the previous program may remain in the cache memory even though a new program has been created.
As for the case where already rolled-out information remains in the cache, the cache memory can be regarded as a cache for the space including the external memory, and there is no problem even if information that has already been rolled out can be read. On the contrary, main memory is effectively enlarged by the capacity of the cache, which can be said to be an advantage of this method. When writing to main memory, with a store-through type cache memory, both the cache and main memory are updated each time a write is made.
Each write passes through the address translation device, where it is checked whether the page is in main memory. Therefore, a program running on rolled-out pages can continue reading as long as the cache hits, and a page fault occurs when a cache miss occurs or a write to main memory occurs. Here, the check in the address translation device works as follows. A page is in one of three states: (a) present in main memory, (b) being paged, or (c) not present in main memory. A general processor can access a page only in state (a), while the file processor, which performs transfers to and from external memory, can also access it in state (b); the check verifies that this rule is met. When a page fault occurs, the corresponding block in the cache memory is invalidated, so it no longer hits in the cache. Note that if a page fault occurs during read access, control is performed so that the block is not written into the cache.
Next, when the program that has been running is completed and a new program is created at the same virtual address, the new program is normally transferred from external memory. The problem is solved by providing a means to invalidate the cache during that transfer. Specifically, transfers from external memory to main memory are performed using virtual addresses, the cache memory monitors these addresses, and if a corresponding block exists in the cache memory, it is invalidated.
Next, regarding the third problem of protection, protection based on virtual addresses is considered a better method than protection based on physical addresses, since a unique storage key can be allocated to each program. Moreover, since writes always pass through the address translation device, write protection is always checked. When a write protection error is detected, the corresponding block in the cache memory is invalidated to prevent the data written to the cache memory from being used. Execution protection requires some contrivance, because a read that hits in the cache is answered from the cache and does not go through the address translation device. In the present invention,
an example is shown in which the cache memory is separated into an instruction cache and a data cache. In this example, if an execution protection error occurs, control is performed so that the data is not stored in the cache memory.
From the above, it should be understood how the present invention solves what have conventionally been regarded as problems; the present invention solves the problems of multiprocessors in a similar manner. In a conventional multiprocessor configuration, when the address translation table is rewritten, a command is sent to the other processors to invalidate the TLBs they own (TLB purge). Thus, even if the previous program remains in a cache memory, invalidating the TLB prevents that cache data from being used. In the multiprocessor to which the present invention is applied, the problem is that when one processor rewrites the address translation table, the cache memory of another processor no longer matches the main memory. This is resolved by performing the transfer from external memory using virtual addresses and having each cache memory monitor these virtual addresses and invalidate the corresponding block if it is in the cache memory.
Hereinafter, one embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 is a diagram showing an example of the overall configuration of a data processing device to which the present invention is applied. In FIG. 1, a main memory 10 stores programs and data, and is connected to a common bus 50 via a memory bus 11 and a memory controller (MCU) 12. An external memory 20 stores programs and data to be loaded into the main memory 10, and is connected to the common bus 50 via an external memory bus 21 and a file processor (FCP) 22. An input/output processor (IOP) 30 controls data transfer with various input/output devices (not shown). A job processor (JOBP) 40, only one of which is shown here, executes programs (instructions). The job processor 40 comprises an instruction cache 41, a data cache 42, an I unit 43 and an E unit 44. The instruction cache 41 and the I unit 43 are connected by a bus 45, the data cache 42 and the E unit 44 by a bus 46, and the I unit 43 and the E unit 44 by a bus 47. In this way, the file processor 22, input/output processor 30, and job processor 40 are all connected to the common bus 50 and can access the main memory 10 via the memory controller 12.
The job processor 40 performs pipeline processing using the I unit 43 and the E unit 44, and has the instruction cache 41 and the data cache 42, one for each unit, as described above. Note that the data handled by a program (instruction) is also called an operand, so the data cache is sometimes called an operand cache.
When the I unit 43 accesses the instruction word to be executed next, it is first checked whether the instruction word exists in the instruction cache 41; if it does, the data is sent to the I unit 43 via the bus 45 as the instruction word. If it does not, the virtual address of the instruction is sent to the memory controller 12 via the common bus 50. The memory controller 12 converts the virtual address into a physical address and accesses the main memory 10 via the memory bus 11. The obtained data (instruction) is sent to the instruction cache 41 via the common bus 50 and passed on to the I unit 43 via the bus 45, and at the same time is stored in the instruction cache 41. The I unit 43 decodes the obtained instruction and tells the E unit 44 what to do. Based on this direction, the E unit 44 collects the necessary data from internal registers and the data cache 42 (or, if it is not in the data cache 42, from the main memory 10 in the same way as for the instruction cache), performs arithmetic processing, and stores the result in an internal register or the main memory 10. In the latter case, if the data at the corresponding location has already been taken into the data cache 42, that data is also updated.
Next, a configuration example of the common bus 50 will be explained.
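Before turning to the bus, the store path of the E unit 44 described above (the result goes to main memory, and a copy already held in the data cache is updated as well) can be sketched with a small toy model. This is only an illustration under stated assumptions: the class and method names are hypothetical, the cache is modeled as a dictionary keyed by address, address translation in the MCU is omitted, and the choice not to allocate a cache entry on a write miss is an assumption that the text does not state.

```python
class StoreThroughCache:
    """Toy store-through data cache (hypothetical model, not the patent's circuit)."""

    def __init__(self, main_memory):
        self.main_memory = main_memory  # dict: address -> value
        self.lines = {}                 # cached copies: address -> value

    def read(self, addr):
        # Hit: answer from the cache without touching main memory.
        if addr in self.lines:
            return self.lines[addr], "hit"
        # Miss: fetch from main memory and keep a copy in the cache.
        value = self.main_memory[addr]
        self.lines[addr] = value
        return value, "miss"

    def write(self, addr, value):
        # Store-through: main memory is written on every store.
        self.main_memory[addr] = value
        # A copy already held in the cache is updated too.
        # Assumption: no cache entry is allocated on a write miss.
        if addr in self.lines:
            self.lines[addr] = value


memory = {0x100: 1, 0x104: 2}
cache = StoreThroughCache(memory)
first = cache.read(0x100)    # first access misses and fills the cache
cache.write(0x100, 7)        # updates main memory and the cached copy
cache.write(0x104, 9)        # updates main memory only
second = cache.read(0x100)   # now hits and sees the updated value
```

Because every store reaches main memory, the cache never holds the only up-to-date copy of a value, which is what allows a cached block to be invalidated without writing anything back.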
As shown in FIG. 2, the common bus 50 consists of a startup bus 55, a data bus 56 and a response bus 57, which actually transfer information, together with a startup bus occupancy request line 51, a data bus occupancy request line 52, a response bus occupancy request line 53 and an interlock signal line 54, which are needed to determine which processor or memory controller uses each of the buses 55 to 57. The common bus is used on a time-sharing basis. The contents of the information on each of the buses 55 to 57 are as follows.
(1) Startup bus 55
(a) Address
(b) Type of access (for example, read access or write access, number of bytes to access, etc.)
(c) Access key (for the protection check performed by the MCU 12)
(2) Data bus 56
(a) Write data
(b) Read data
(3) Response bus 57
(a) End signal
(b) Return code (information on errors and page faults that occurred during access), etc.
FIG. 3 shows how these buses 55 to 57 are used. As shown in this figure, three combinations can each be transferred simultaneously in the same time slot: (i) read activation a and read response b, (ii) read activation a and write response d, and (iii) write activation c and write response d.
Next, FIG. 4 shows an example of the use of the buses 55 to 57. In this figure, the JOBP 40 activates a memory read to the MCU 12 in time slot 0, and the read data for it is returned in time slots N and N+1; in time slot 1 the IOP 30 issues a memory write activation to the MCU 12, and its response is returned in time slot N+2. In this manner, the common bus 50 performs so-called split transfer, in which activation and response are separated. Further, the main memory 10 is configured to be able to process multiple memory accesses. Before performing a transfer on the buses 55 to 57 described above, it is necessary to perform occupancy control.
This is done as follows: a processor or memory controller that wishes to perform a transfer issues the occupancy requests 51 to 53 for the buses to be used one time slot before the transfer, these requests are prioritized, and the transfer is permitted accordingly. Various prioritization methods are conceivable, and the details are omitted here. However, an occupancy request for a response is given a higher priority level than an occupancy request for an activation. This is because if responses could not be returned because of occupancy requests for activation, activation processing would pile up in the memory controller, resulting in a deadlock. For example, in this embodiment, if the occupancy request for the data read response b shown in FIG. 3 conflicts with that for the data write activation c shown in FIG. 3, the former is given priority.
The above occupancy control is shown in simplified form in FIG. In time slot 0, the JOBP 40 and the IOP 30 each issue a startup bus occupancy request 51 in an attempt to activate a read. Assuming that the JOBP 40 has a higher priority level than the IOP 30, the JOBP 40 uses the startup bus 55 to activate its read in time slot 1 and at the same time withdraws its occupancy request. The IOP 30, whose occupancy was not permitted, continues to issue the startup bus occupancy request 51 in time slot 1. Since there is no occupancy request from the JOBP 40 in time slot 1, the IOP 30 can activate its read in time slot 2.
In such a system, when a processor accesses the main memory 10 while excluding access from other processors, that is, with an interlock, it does so by preventing the other processors from using the startup bus 55. Occupying the startup bus 55 excludes subsequent activations from other processors, while leaving the data bus 56 and the response bus 57 free to return responses to memory activations that are already being processed in the main memory 10. If these responses could not be returned, the activation processing would pile up in the memory controller 12, resulting in a deadlock.
Next, an example of a method for occupying the startup bus 55 will be explained. As shown in FIG., a processor attempting an interlocked access to the memory controller 12 issues an interlock signal 54, indicating that the startup bus 55 is occupied, in the time slot in which its startup bus occupancy request 51 has been accepted and it transfers information on the startup bus 55. This signal is used so that startup bus occupancy requests 51 from other processors are not accepted. This is realized, for example, by the circuit shown in FIG. 7. In this figure, the priority determination circuit 61 for the occupancy requests 51 to 53 is distributed to each processor, and the interlock signal line 54 is an open-collector signal line. First, when the interlock signal 54 is not being output, the priority determination circuit 61 checks the occupancy requests 51 to 53, and if the startup bus occupancy request 51 issued by the processor itself has the highest priority, an occupancy permission signal 64 for the startup bus 55 is output via an AND gate 62 and an OR gate 63. The processor can therefore transfer information on the startup bus 55 in the next time slot. At this time, if the interlock request signal 65 is being output by the processor, the JK flip-flop 66 is set via the AND gate 68 and the interlock signal 54 is output. The interlock signal 54 continues to be output until the interlock release signal 67 is issued. During this time, this processor continues to occupy the startup bus 55. Next, when the interlock signal 54 is being output by another processor, the output of the priority determination circuit 61 is inhibited by the AND gate 62, so the startup bus occupancy permission signal 64 is not output.
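The startup bus arbitration with interlock described above can be sketched as a small function. This is a behavioral illustration only, not the gate-level circuit: the function name `arbitrate`, the use of processor names as strings, and the fixed priority list are hypothetical, and the rule that the interlock holder itself is still granted the bus is an assumption drawn from the statement that it continues to occupy the startup bus.

```python
def arbitrate(requests, priority, interlock_owner=None):
    """Return the processor granted the startup bus for the next time slot, or None.

    requests: set of names currently raising a startup bus occupancy request
    priority: list of names, highest priority first (the order is an assumption)
    interlock_owner: name of the processor driving the interlock signal, if any
    """
    if interlock_owner is not None:
        # While the interlock signal is output, requests from other
        # processors are not accepted; only the holder may proceed.
        return interlock_owner if interlock_owner in requests else None
    # No interlock: the highest-priority requester wins.
    for name in priority:
        if name in requests:
            return name
    return None


order = ["JOBP", "IOP"]

# Time slot 0: both request, JOBP has priority and transfers in slot 1.
slot0 = arbitrate({"JOBP", "IOP"}, order)
# Time slot 1: JOBP has withdrawn its request, so IOP transfers in slot 2.
slot1 = arbitrate({"IOP"}, order)
# With JOBP holding the interlock, a request from IOP is not accepted.
locked = arbitrate({"IOP"}, order, interlock_owner="JOBP")
```

Only the startup bus is modeled here; the data and response buses stay available for responses to accesses already in progress, which is the point of interlocking on the startup bus alone.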
The startup bus 55 therefore cannot be used, and memory activation is not possible either.
Next, the MCU 12 will be explained. In addition to normal memory access processing, the MCU 12 performs address translation from virtual addresses to physical addresses and protection checks. Since it is used in common by all the processors and requires high throughput, read processing and write processing are each divided into several stages as shown in FIGS. 8A and 8B, so that multiple accesses can be processed in an overlapped manner as shown in FIG. 8C. FIG. 9 shows an example of the configuration of the MCU 12; the processing stages shown in FIGS. 8A and 8B perform the following operations.
(A) Operation of the read processing stages
Read activation reception: The virtual address (VA), access type (FUNC) and access key (AKEY) on the startup bus 55 are taken from the common bus 50 into the common bus reception register 71.
Address translation and protection check: The address translation device 75 determines whether the page indicated by the virtual address (VA) exists in the main memory 10 and, if so, converts the address into a physical address (PA). If not, a so-called page fault results. At the same time, the protection check circuit 76 determines whether the access is permitted. The address translation device 75 and the protection check circuit 76 will be described in detail later. The results of the protection check and the page fault information are set in the access register 72, together with other error information, as a return code (RC), along with the access type (FUNC) and the physical address (PA).
Memory read activation: If the access in the access register 72 has caused no error or page fault, the memory controller 77 issues a memory activation 151 to the main memory 10 using the physical address (PA) in the access register 72, and when the main memory 10 accepts the activation, the access type (FUNC) and return code (RC) are transferred to the temporary storage register 73. If the access in the access register 72 already indicates an error or page fault, the memory is not activated and the information is simply transferred to the temporary storage register 73.
Read data reception and data/response bus occupancy request: Read data 154 is received from the main memory 10 via the memory bus 11, and the access type (FUNC) and return code (RC) are transferred to the common bus sending register 74. Meanwhile, a data bus occupancy request 52 and a response bus occupancy request 53 are output to the common bus 50.
Read data and response bus transfer: When the occupancy requests 52 and 53 are accepted, the read data 154 is transferred to the data bus 56 via the bus 155, and the end signal and return code (RC) are transferred to the response bus 57 via the bus 156 and returned to the accessing processor.
(B) Operation of the write processing stages
Write activation reception: The virtual address (VA), access type (FUNC) and access key (AKEY) on the startup bus 55 and the write data (WD) on the data bus 56 are taken from the common bus 50 into the common bus reception register 71.
Address translation and protection check: The same operation as in the corresponding read processing stage is performed, except that the write data (WD) is also set in the access register 72.
Memory write activation: The same as in the corresponding read processing stage, except that the write data (WD) 153 is transferred to the main memory 10.
Response bus occupancy request: The access type (FUNC) and return code (RC) are transferred to the common bus sending register 74. Meanwhile, a response bus occupancy request 53 is output to the common bus 50.
Response bus transfer: When the occupancy request 53 is accepted, the end signal and return code (RC) are transferred to the response bus 57 via the bus 156 and returned to the accessing processor.
As described above, read and write processing is divided into stages, and stages with different numbers belonging to different accesses can be processed in parallel as shown in FIG. 8C. In this figure, a 4-byte read activation A, a 4-byte write activation B and a 16-byte read activation C are received from the common bus 50 in time slots 0, 1 and 2, respectively, and processed. Looking at time slot 2, (a) memory read activation 3 for A, (b) address translation and protection check 2' for B, and (c) read activation reception 1 for C from the common bus are performed in parallel. Here, the 16-byte read repeats stages four times in contrast to the 4-byte read; this is because memory interleaving is performed in units of 4 bytes, as explained below.
FIG. 10 is a diagram showing an example of the configuration of the main memory 10. Each memory board (MB) 14 (14a to 14d) has a data width of 4 bytes, and the memory boards 14a, 14b, 14c and 14d hold the data whose lower 2 bits of the address, counted in 4-byte units, are 00, 01, 10 and 11, respectively. Since the 16-byte data is spread over the memory boards 14a, 14b, 14c and 14d, 4 bytes on each,
when reading 16 bytes the memory boards can be activated in succession and the data read out continuously without any contention on the memory boards 14, as shown in FIG. 8C. Such a 16-byte read is mainly used for block transfer, which sends data to the cache memory on a cache miss. Since the I unit 43 and the E unit 44 access the instruction cache 41 and the data cache 42 in units smaller than 16 bytes (4 bytes in this example), during a 16-byte read the 4 bytes actually required by the I unit 43 or the E unit 44 are delivered ahead of the rest of the data in order to reduce access time. To do this, the order in which the MCU 12 activates the memory boards 14 is changed according to the address, as shown in FIG. 10B.
Next, address translation and protection checking will be explained in detail. FIG. 11 is a block diagram showing the address translation device 75 of FIG. 9 in more detail, and FIG. 12 shows the operational flow of address translation. The translation table 130 from virtual addresses to physical addresses is placed in a part of the main memory 10 because its memory capacity is large. However, accessing the main memory 10 to translate a virtual address on every memory access would cause a large overhead, so a TLB 110, which stores recently used address translation information, is provided in the MCU 12. The TLB 110 holds the entries of the address translation table 130 for the most recently used pages, so that address translation can be performed at high speed.
Each entry of the TLB 110 consists of a valid bit (V) 111, a connect bit (C) 112, a part of the virtual address (VPA) 113, a part of the physical address (PPA) 114, an execution protection bit (EP) 115 and a storage key (SKEY) 116. The V bit 111 and C bit 112 indicate the current state of the corresponding page. When the V bit 111 is "0", the contents of the corresponding entry in the TLB 110 are not valid. When the V bit 111 and C bit 112 are both "1", the page is currently being transferred between the main memory 10 and the external memory 20, that is, paging is in progress. When the V bit 111 is "1" and the C bit 112 is "0", the page is in the main memory 10 and can be accessed. The paging state is provided in order to prohibit accesses other than paging accesses from the FCP 22 to an area where paging is in progress. In this system, the MCU 12 performs address translation from virtual to physical addresses in common for all processors, so paging accesses by the FCP 22 also pass through the same address translation device 75. Allowing other processors to access an area being paged could lead to data corruption or loss. Therefore, as mentioned above, when the V bit 111 and C bit 112 are both "1", only paging accesses from the FCP 22 are permitted, which resolves this problem.
Next, the part of the virtual address (VPA) 113 is used, when address translation is performed with the TLB 110, to check whether the translation pair for the virtual address (VA) in question is registered in the TLB 110.
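As a rough illustration of how the V and C bits encode the page state and how the VPA field is compared, a minimal sketch follows. The class and member names are hypothetical, only the fields described up to this point are modeled, and field widths are left open because the text does not give them.

```python
from dataclasses import dataclass
from enum import Enum


class PageState(Enum):
    INVALID = "entry not valid"
    PAGING = "paging in progress"
    RESIDENT = "in main memory"


@dataclass
class TlbEntry:
    v: int    # valid bit (V) 111
    c: int    # connect bit (C) 112
    vpa: int  # part of the virtual address (VPA) 113

    def state(self):
        # V = 0 means the entry holds no valid data, whatever C is.
        if self.v == 0:
            return PageState.INVALID
        # V = 1: C distinguishes a page being paged from a resident page.
        return PageState.PAGING if self.c == 1 else PageState.RESIDENT

    def matches(self, va_part):
        # Comparison that decides whether the translation pair for
        # this virtual address is the one registered in the entry.
        return self.vpa == va_part


entry = TlbEntry(v=1, c=1, vpa=0x2A)
state = entry.state()
hit = entry.matches(0x2A)
```

Keeping the paging state in the entry itself lets the shared translation device tell a paging access by the FCP 22 apart from an ordinary access without any extra lookup.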
The part of the physical address (PPA) 114 is used to form the physical address (PA) when a translation pair in the TLB 110 is hit. The virtual address (VA) consists of a segment address (SA) 121, a page address (PA) 122 and an intra-page address (DISP) 123, and the part of the physical address (PPA) 114 is concatenated with the intra-page address (DISP) 123 to form the physical address (PA). The execution protection bit (EP) 115 prevents data from being erroneously read and executed as instructions; if an instruction read is made to an area where this bit is "1", the protection check circuit 76 raises an execution protection error. Therefore, if the instruction cache 41 and data cache 42 are separate in the JOBP 40, as in this configuration example, all accesses to such an area from the instruction cache 41 result in an execution protection error. The storage key (SKEY) 116 is for write protection: it is compared with the access key (AKEY) sent from the requesting processor, and the protection check circuit 76 checks whether the write access is permitted or prohibited; in the latter case, a write protection error occurs. Besides being compared with the SKEY 116 to check for write protection errors, the access key (AKEY) also carries information on whether the access is a paging access from the FCP 22 and whether it is an instruction read, and is used for those protection checks as well.
Next, the translation process will be explained step by step with reference to the flowchart of FIG. 12. Memory accesses can be broadly divided into two types: (1) memory access by a general processor, and (2) memory access during paging by the FCP 22. The distinction between (1) and (2) is carried in the access key AKEY and is conveyed to the address translation controller 125 via the signal line 140.
First, address translation and the determination of access permission for the general case (1) will be explained. A virtual address output from a processor (JOBP 40 or IOP 30) is set via the common bus 50 and the common bus reception register 71 in the MCU 12 into the virtual address register 120. Using a part 120-2 of the segment address (SA) 121 and page address (PA) 122 set in the virtual address register 120 as the address, the TLB 110 is first read. The V bit 111 and C bit 112 of the TLB 110 entry thus read are input to the address translation controller 125, and depending on their pattern, subsequent processing is divided into the following three cases (F05).
When V bit 111 = 0 and C bit 112 = 0: This is shown as "0, 0" in FIG. 12. As described above, the corresponding entry of the TLB 110 is invalid, so the translation table 130 in the main memory 10 is read.
(F10) The detailed operation at this time, that is, on a TLB miss, will be described later.
When V bit 111 = 1 and C bit 112 = 1: This is "1, 1" in FIG. 12. In this case, if the comparator 124 finds that the part 120-1 of the virtual address matches the part of the virtual address (VPA) 113 in the TLB 110 and the TLB hit signal 141 is output (F205), the page is currently being paged, so the memory access is prohibited and the address translation controller 125 outputs a missing page fault signal 142 (F45). If the TLB hit signal 141 is not output, it is a TLB miss, so the translation table 130 in the main memory 10 is read in the same way (F10).
When V bit 111 = 1 and C bit 112 = 0: This is "1, 0" in FIG. 12. The TLB hit signal 141 is checked first (F30). If it is output, the protect error signal 143 from the protection check circuit 76 is checked, and if no error has occurred, the intra-page address field 123 of the virtual address register 120 and the part of the physical address 114 in the TLB 110 are concatenated, and the resulting physical address is sent to the access register 72 via the selector 128.
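The three-way check for a general processor access, case (1), can be sketched as follows. The function names and the returned labels are hypothetical. Two points are assumptions rather than statements from the text: that a TLB miss in the "1, 0" case falls back to reading the translation table as in the other cases, and the width of the intra-page address, which is left as a parameter because the page size is not given here.

```python
def general_access(v, c, tlb_hit, protect_error=False):
    """Outcome of the TLB check for a general processor (JOBP or IOP).

    Returns "READ_TABLE" (F10), "PAGE_FAULT" (F45), "PROTECT_ERROR",
    or "ACCESS" (F40).
    """
    if v == 0:
        # Entry invalid: consult the translation table in main memory.
        return "READ_TABLE"
    if not tlb_hit:
        # Valid entry for a different page: TLB miss (assumed for "1, 0").
        return "READ_TABLE"
    if c == 1:
        # Page is being paged: general processors may not access it.
        return "PAGE_FAULT"
    if protect_error:
        return "PROTECT_ERROR"
    return "ACCESS"


def make_physical(ppa, disp, disp_bits):
    """Concatenate PPA with the intra-page address DISP.

    disp_bits (width of DISP) is an assumed parameter.
    """
    return (ppa << disp_bits) | disp


outcome = general_access(v=1, c=0, tlb_hit=True)
address = make_physical(ppa=0x3, disp=0x45, disp_bits=12)
```

Because the intra-page address passes through unchanged, only the upper part of the address needs to be held in the TLB entry.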
The physical address is then sent to the memory address bus 152, and the memory controller 77 outputs the memory activation signal 151 to access the main memory 10 (F40).
Next, memory access during paging by the FCP 22, case (2), will be explained. The virtual address output from the FCP 22 is set in the virtual address register 120 in the MCU 12 via the common bus 50. In this case too, the TLB 110 is accessed first, and depending on the pattern of the V bit 111 and C bit 112 of the accessed TLB 110 entry, subsequent processing is divided into three cases as before.
When V bit 111 = 0 and C bit 112 = 0: The translation table 130 in the main memory 10 is read (F10).
When V bit 111 = 1 and C bit 112 = 1: The TLB hit signal 141 is checked (F30). If a TLB hit is indicated, the main memory 10 is accessed using the physical address formed in the access register 72 (F40). If the TLB hit signal is not output, the translation table 130 in the main memory 10 is read (F10).
When V bit 111 = 1 and C bit 112 = 0: The TLB hit signal 141 is checked (F215). If the TLB hit signal 141 is output, a prohibited area is being accessed, so an error is reported to the FCP 22 (F220).
Next, the process of reading the translation table 130 in the main memory 10 on a TLB miss will be described. To reduce the memory capacity required for the table, the translation table 130 consists of a page table 132, which holds the information necessary for address translation, and a segment table 131, which holds the start addresses of the page tables 132. On a TLB miss, the adder 127 first adds the contents of the register 126 (STOR), which holds the start address of the segment table, and the segment address (SA) 121 of the virtual address register 120 to form a physical address, and the contents of that entry are read out onto the read data bus 155. This data holds the start address of the page table 132; the adder 127 adds this value to the page address (PA) 122 of the virtual address register 120 to form an address, and the translation information is read from the page table 132 (F10). Each entry of the page table 132 contains an M bit in addition to the TLB 110 information other than the V bit 111 and the part of the virtual address (VPA) 113. The M bit and C bit are input to the address translation controller 125 through a part 155-1 of the read data bus 155, and the following processing is performed depending on their pattern.
When M bit = 0 and C bit = 0: The page is not in the main memory 10 but in the external memory 20. A missing page fault signal 142 is issued in response to the access request for this page to notify the requesting processor of the page fault (F45).
When M bit = 0 and C bit = 1: The page is currently being paged, so memory access by any processor other than the FCP 22 is prohibited and a missing page fault signal 142 is issued (F45). In the case of a memory access from the FCP 22, the entry is registered in the TLB 110 and the access proceeds (F20).
When M bit = 1 and C bit = 0: A part 155-2 of the read data, the part 120-1 of the virtual address and the V bit "1" are registered in the TLB 110 (F20), and processing returns to the V and C bit check routine.
As described above, the address translation device 75
For memory accesses using virtual addresses from each processor, address translation from virtual addresses to physical addresses can be performed in a concentrated manner, which simplifies control of address translation. Additionally, by changing the control method for access from FCP22 and access from other processors, it is possible to prohibit access from other processors to the page being paged, making it possible to maintain data integrity. . Next, the operation at the time of a missing page fault will be explained. When the requesting processor receives a page fault signal, it interrupts the task it was executing at the time and activates the FCP 22 to load the page containing the requested address into main memory 10. In response to this activation, the FCP 22 reads the corresponding page, and when this is completed, generates an end interrupt. At this time, the required pages are 10 pages in main memory.
Resume the suspended task due to which it has been rolled up. While this task is suspended, the processor performs other tasks. Next, the instruction cache 41 and data cache 4
2 will be explained. FIG. 13 is a diagram showing an example of the structure of the instruction cache 41. The data copied from the main memory 10 is stored in the cache data section 8.
1-I, and the address of the data is on directory 82-I and invalidation directory 83-I.
I, and information indicating whether these are valid or not is in the valid bit register 84-I. The contents of the directory 82-I and the invalidation directory 83-I are the same, and are separated to improve performance. The former is used to check whether the data accessed by the I unit 43 is in the cache data section 81-I, and the latter is used to check whether the data written in the main memory 10 by another processor is in the cache data section 81-I.
1-I, the data is already old and must be invalidated (this is called invalidation processing), but it is used to check for this purpose. Next, the operation of this instruction cache 41 will be explained. Note that unlike the data cache 42, the instruction cache 41 does not process write accesses. FIG. 14 shows the flow when a read access cache miss occurs, and FIG. 15 shows the flow of invalidation processing. (1) Read access (see Figure 14) When the activation signal 91-I comes from the I unit 43, part of the virtual address 92-I, here bits 18-27, is used to access the directory 82-I.
I and the contents of the valid bit register 84-I are read, and a match is checked between the contents of the directory 82-I and bits 0 to 17 of the virtual address 92-I using the comparator 160-I.
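The directory lookup just described (bits 18-27 select an entry, bits 0 to 17 are compared) can be illustrated with a small sketch. This is not taken from the patent: the 32-bit address width, the bit numbering with bit 0 as the most significant bit, and the names `split_virtual_address` and `Directory` are assumptions made for illustration only.

```python
from typing import List, Tuple

# Assumptions for illustration: a 32-bit virtual address numbered with bit 0 as
# the most significant bit, and one directory entry per index value.
ADDR_BITS = 32
TAG_BITS = 18                   # virtual address bits 0-17 (compared field)
INDEX_BITS = 10                 # virtual address bits 18-27 (entry select)
INDEX_SHIFT = ADDR_BITS - 28    # bit 27 sits 4 positions above the LSB


def split_virtual_address(va: int) -> Tuple[int, int]:
    """Return (tag, index) extracted from a virtual address."""
    tag = va >> (ADDR_BITS - TAG_BITS)
    index = (va >> INDEX_SHIFT) & ((1 << INDEX_BITS) - 1)
    return tag, index


class Directory:
    """Hypothetical model of a directory plus its valid bits."""

    def __init__(self) -> None:
        entries = 1 << INDEX_BITS
        self.tags: List[int] = [0] * entries
        self.valid: List[bool] = [False] * entries

    def register(self, va: int) -> None:
        """Store the tag for this address and mark the entry valid."""
        tag, index = split_virtual_address(va)
        self.tags[index] = tag
        self.valid[index] = True

    def is_hit(self, va: int) -> bool:
        """A hit requires a valid entry whose stored tag matches the address."""
        tag, index = split_virtual_address(va)
        return self.valid[index] and self.tags[index] == tag
```

The parity check on the directory contents is left out of this sketch; in the text it is a further condition for a hit.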
The directory contents are also checked by the parity checker 161-I. If the comparator 160-I indicates a match, no parity error has occurred, and the valid bit register 84-I indicates that the entry is valid, a cache hit is indicated to the instruction cache controller 162-I. The instruction cache controller 162-I then transfers the contents of the cache data section 81-I, accessed with bits 18-29 of the virtual address, onto the read data bus 94-I and returns an end signal 93-I to the I unit 43.

In the case of a cache miss, the instruction cache controller 162-I issues an activation bus occupancy request 51. When the occupancy request 51 is granted, it opens the gate 85-I and transfers the virtual address (VA), access type (FUNC), and access key (AKEY) to the activation bus 55. Note that this access key (AKEY) carries an indication that the access is an instruction read. The set signal 172-I writes bits 0 to 17 of the virtual address to the directory 82-I and the invalidation directory 83-I and sets the valid bit register 84-I. The reason for performing this process at this point will be described later.

When the read data (RD) is sent from the MCU 12 via the data bus 56, and the end signal and the return code (RC; information on errors and page faults that occurred during the access) are sent via the response bus 57, they are latched into the register 86-I. As mentioned in the explanation of the MCU 12, the first data sent is the data accessed by the I unit 43. Therefore, when the return code (RC) indicates one of the states (1) to (3) below (see condition A shown in FIG. 14B), an end signal 93-I, read data 94-I, and return code 95-I are returned to the I unit 43.
(1) No Error (no error occurred)
(2) Page Fault (a page fault occurred)
(3) Soft Error (a software error such as a protection error occurred)
Hard Errors (errors caused by hardware) can often be recovered by accessing the main memory 10 again, so a retry is performed. In that case the above signals are not returned. However, if the number of retries exceeds the specified number, that is, in the case of retry over, the above signals are returned in order to report the error.

Then, the read data read from the main memory 10 is written to the cache data section 81-I. The remaining read data sent from the MCU 12 is likewise written to the cache data section 81-I. Since the end signal 93-I has already been returned to the I unit 43, the I unit 43 can perform other operations during this time. In addition, it is checked whether an error or page fault occurred in the preceding stages. If no error or page fault has occurred, the operation of the instruction cache 41 ends. If an error or page fault has occurred, the instruction cache controller 162-I outputs a valid bit clear signal 171-I to clear the valid bit register 84-I that was set earlier, so that the corresponding data in the cache data section 81-I is made unusable. If a Hard Error has occurred and the retry is not over (see condition A shown in FIG. 14B), the process jumps to the stage for retrying.

The above is the processing procedure for read access. As mentioned earlier, however, a cache (both the instruction cache and the data cache) also requires so-called invalidation processing.
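The return-code handling in the read-miss flow above can be sketched roughly as follows. This is an illustration under stated assumptions, not the patent's circuit: `handle_read_miss`, the `access_main_memory` callable, and the retry limit of 3 are hypothetical (the text only speaks of "the specified number" of retries).

```python
from enum import Enum


class ReturnCode(Enum):
    NO_ERROR = 1
    PAGE_FAULT = 2
    SOFT_ERROR = 3   # e.g. a protection error
    HARD_ERROR = 4   # hardware error, worth retrying


MAX_RETRIES = 3  # hypothetical value; the text does not give the limit


def handle_read_miss(access_main_memory, max_retries=MAX_RETRIES):
    """Return (return_code, data, keep_valid) for one read miss.

    access_main_memory is a hypothetical callable that performs one
    main-memory access and returns (ReturnCode, data).
    """
    retries = 0
    while True:
        rc, data = access_main_memory()
        # A hard error is retried silently until the retry limit is reached.
        if rc is ReturnCode.HARD_ERROR and retries < max_retries:
            retries += 1
            continue
        # Anything else (including retry over) is reported to the requester.
        # The valid bit set at the start of the miss survives only if the
        # access completed without an error or page fault.
        keep_valid = rc is ReturnCode.NO_ERROR
        return rc, data, keep_valid
```

When `keep_valid` is false, the caller would clear the valid bit for the entry so that the partially filled block cannot be used.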
The invalidation procedure is explained below.

(2) Invalidation processing (see FIG. 15)
The virtual address (VA) and access type (FUNC) being transferred on the activation bus 55 are loaded into the register 87-I each time. The contents of the invalidation directory 83-I are read using bits 18-27 of the virtual address, and the invalidation determination circuit 165-I checks whether or not invalidation is necessary. The address is held in the register 88-I. If invalidation is necessary, the valid bit 84-I corresponding to the address in the register 88-I is cleared. For this purpose, the invalidation determination circuit 165-I issues the valid bit clear signal 171-I. Next, cases in which invalidation is necessary will be explained in detail.
First, the access type (FUNC) taken from the activation bus 55 must indicate a write access, and the access must come from another source. Invalidation is then performed when any of the following conditions is met.
(a) The invalidation directory 83-I is read using bits 18-27 of the address in the register 87-I, its contents are compared with bits 0-17 of the address by the comparator 163-I, and they match.
(b) The parity checker 164-I detects a parity error when the invalidation directory 83-I is read.
(c) The invalidation directory 83-I is in use within the JOBP 40 (because the check cannot be performed).

The invalidation processing has been described above. Next, the reason why, in the read access procedure (1), it is necessary to write the address to the directory 82-I and the invalidation directory 83-I and to set the valid bit register 84-I at that point will be explained.

FIG. 16 is a time chart showing how each part is used when a read access results in a cache miss and the read of the main memory 10 conflicts with cache invalidation processing for write accesses to the main memory 10 from another source. Invalidation processing is indicated by diagonal hatching. For the write accesses being transferred on the activation bus 55 and data bus 56 in time slots 1 and 3, the invalidation directory 83-I is checked in time slots 2 and 4, respectively, and in the first half of time slots 3 and 5 the valid bit register 84-I is cleared to invalidate the entry. On the other hand, the address of the read access that resulted in a cache miss is transferred on the activation bus 55 in time slot 2, so in the order of access to the main memory 10 it comes after the write access transferred on the activation bus 55 in time slot 1 and before the write access transferred on the activation bus 55 in time slot 3. Therefore, in order to maintain consistency between the data in the cache and the main memory 10, the address of the read access that caused the cache miss must be registered in the invalidation directory 83-I between time slots 2 and 4, in which the invalidation directory 83-I is checked for the write accesses, and the valid bit register 84-I must be set between time slots 3 and 5, in which the valid bit register 84-I is invalidated as a result of the check. These controls are necessary because the main memory 10 processes multiple memory accesses simultaneously. In this configuration example, address information is stored in two places, namely the directory 82-I and the invalidation directory 83-I, so only the invalidation directory 83-I is subject to the above restrictions. If the directory is kept in one place, that directory is of course subject to them.

Next, the data cache 42 will be explained. FIG. 17 is a diagram showing an example of the configuration of the data cache 42, in which the details of the invalidation processing circuit 180-D are omitted because it is the same as in the instruction cache 41. Note that items in FIGS. 13 and 17 that differ only in their suffix are equivalent. The instruction cache shown in FIG. 13 uses the suffix I, and the data cache shown in FIG. 17 uses the suffix D. The major difference from the instruction cache 41 is that the data cache must support write access. To shorten the write access time, a common bus sending buffer 89-D is provided. On a write, simply setting the virtual address 92-D, write data 95-D, and control information 96-D in this buffer 89-D causes an end signal 93-D to be returned to the E unit 44, so that the E unit 44 can proceed to its next process.

Next, the operation of the data cache 42 will be explained. Since the read access process is the same as that of the instruction cache 41, its description is omitted.

(3) Write access (see FIG. 18)
When the activation signal 91-D comes from the E unit 44, the virtual address 92-D, write data 95-D, and control information 96-D (access type, access key, etc.) are set in the common bus sending buffer 89-D, and an end signal 93-D is returned to the E unit 44. At this time, the directory 82-D and valid bit register 84-D are checked, and in the case of a cache hit (signal 170-D is output), the data is written to the position in the cache data section 81-D indicated by bits 18-27 of the virtual address.

The data cache controller 162-D issues an activation bus occupancy request 51 and a data bus occupancy request 52. When both occupancy requests are granted, the gate 85-D is opened, the virtual address (VA), access type (FUNC), and access key (AKEY) are transferred to the activation bus 55, and the write data is transferred to the data bus 56. When the end signal and return code are sent from the MCU 12 via the response bus 57, they are latched into the register 86-D. The return code is then checked. If no error or page fault has occurred, the access is removed from the common bus sending buffer 89-D and the process ends. If a hard error has occurred and the retry is not over (see condition A shown in FIG. 14B), the process jumps to the stage for retrying. In all other cases, the valid bit register 84-D is cleared at the address held in the common bus sending buffer 89-D, and the occurrence of an error or page fault is reported to the E unit 44. The valid bit register 84-D is cleared because, for example, in the case of a protection error, data that should not have been written has already been written to the cache data section 81-D at the earlier stage.

It should be noted that the address issued when the data cache 42 writes to the main memory 10 is also loaded from the activation bus 55 into the register 87-D of the data cache (included in the invalidation processing circuit 180-D). For such accesses, the data cache controller 162-D sends a signal 173-D to the invalidation processing circuit 180-D so that invalidation is not performed. Since the instruction cache 41 does not perform write access, it has no equivalent to this signal 173-D.

As explained in detail above, according to the present invention, it is possible to realize a data processing device that has multiple processors, at least one of which has a cache memory that is accessed using virtual addresses, and in which the address translation device is shared by all the processors.
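The invalidation rule described earlier (a write access from another source, combined with conditions (a) to (c)) is what keeps a virtually addressed cache consistent when another processor rewrites main memory. A minimal sketch of that decision follows, assuming a 32-bit virtual address with bit 0 as the most significant bit; the function `snoop_write` and its parameters are hypothetical names, not elements of the patent.

```python
# Assumed layout: 32-bit virtual address, bit 0 = most significant bit.
TAG_SHIFT = 14       # virtual address bits 0-17 form the compared field
INDEX_SHIFT = 4      # virtual address bits 18-27 select the entry
INDEX_MASK = 0x3FF   # 10 index bits


def snoop_write(tags, valid, va, is_write, from_other_source,
                parity_error=False, directory_busy=False):
    """Clear the valid bit for va if invalidation is required.

    tags and valid model the invalidation directory and the valid bits.
    Returns True when the entry was invalidated.
    """
    # Only write accesses issued by another source are candidates; a cache's
    # own writes are excluded (modeled here by from_other_source=False).
    if not (is_write and from_other_source):
        return False
    index = (va >> INDEX_SHIFT) & INDEX_MASK
    tag = va >> TAG_SHIFT
    # (a) directory match, (b) parity error on the directory read,
    # (c) directory in use, so the check cannot be made: invalidate to be safe.
    if tags[index] == tag or parity_error or directory_busy:
        valid[index] = False
        return True
    return False
```

Conditions (b) and (c) err on the side of invalidating: when the directory cannot be trusted or cannot be read, discarding a possibly good entry costs only a later cache miss, while keeping a stale one would break consistency.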

[Brief description of the drawings]

FIG. 1 is a diagram showing the overall configuration of a data processing device to which the present invention is applied. FIG. 2 is a diagram showing an example of the configuration of the common bus in FIG. 1. FIG. 3 is a diagram showing which part of the common bus is used for each access. FIG. 4 is a diagram showing an example of how the common bus is used. FIG. 5 is a diagram showing occupancy control of the common bus. FIG. 6 is a diagram showing occupancy control of the common bus while the interlock signal is asserted. FIG. 7 is a diagram showing an example of the configuration of the occupancy control circuit. FIGS. 8A to 8C are diagrams showing an example of the processing flow in the MCU and showing that the MCU processes multiple accesses in an overlapped manner. FIG. 9 is a diagram showing an example of the configuration of the MCU. FIGS. 10A and 10B are diagrams showing an example of the configuration of a memory board and the order in which data is returned on a 16-byte read. FIG. 11 is a diagram showing the address translation device using the TLB. FIG. 12 is a diagram showing the flow of address translation. FIG. 13 is a diagram showing an example of the configuration of the instruction cache. FIG. 14 is an explanatory diagram of the processing flow for a read access to the cache. FIG. 15 is an explanatory diagram of the processing flow of cache invalidation. FIG. 16 is a diagram showing an example of the usage timing of each part of the cache. FIG. 17 is a diagram showing an example of the configuration of the data cache. FIG. 18 is an explanatory diagram of the processing flow of a write access.

10: main storage device, 12: memory access controller, 20: external storage device, 22: file processor, 30: input/output processor, 40: job processor, 41: instruction cache, 42: data cache, 43: I unit, 44: E unit, 50: common bus, 75: address translation device.

Claims

[Claims]
1. A data processing device comprising: a main storage device for storing programs and data; an external storage device for storing programs and data to be stored in the main storage device; at least one job processor that accesses the main storage device using virtual addresses in order to execute instructions; a file processor that accesses the main storage device using virtual addresses in order to input and output programs and data between the main storage device and the external storage device; and a memory access controller that is connected to the main storage device, is used in common by each processor, and includes an address translation device that translates virtual addresses into physical addresses, wherein the job processor has a cache memory that is accessed using virtual addresses, and the cache memory includes invalidation processing means that receives the virtual address used when the file processor writes to the main storage device and, if the cache memory holds a data block corresponding to this virtual address, makes that data block unusable.
2. A data processing device according to claim 1, wherein the cache memory comprises a data section that holds a copy of a part of the main storage device, a directory section that holds the virtual addresses on the main storage device of the data stored in the data section, and a valid display section that indicates whether or not the virtual address is valid; a latch register that latches the virtual address sent from the file processor; comparison means that compares the virtual address latched in the latch register with the virtual address held in the directory section; and means for clearing said valid display section in accordance with the comparison result.
3. A data processing device as described, wherein the address translation device has means for determining whether or not the physical address corresponding to the virtual address exists in the main storage device and for communicating the result of this determination to the processor that issued the access request.
4. A data processing device according to claim 1, wherein the job processor, the file processor, and the memory access controller are connected to a common bus.
5. A data processing device according to claim 4, wherein the cache memory is connected to the common bus and is adapted to receive virtual addresses on the common bus.