JP4306811B2

JP4306811B2 - Cache coherence unit with integrated message passing and memory protection for distributed shared memory multiprocessor systems

Info

Publication number: JP4306811B2
Application number: JP53615499A
Authority: JP
Inventors: ベバー，ボルフ−ディートリヒ
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1998-01-07
Filing date: 1998-12-22
Publication date: 2009-08-05
Anticipated expiration: 2018-12-22
Also published as: WO1999035581A1; WO1999035580A1; JP2001515632A; JP2001515633A; JP3983820B2

Description

出願中の他の出願の参照
本発明は、発明者がWofl-Dietrich Weber及びJaspal Kohliで、共通の譲受人であり、「統合されたメッセージ・パッシング・サポートを有する分散共有メモリマルチプロセッサのためのメモリ保護機構」の名称で、１９９８年１月７日に出願された審査中の出願シリアル番号９／００３，７７１に関係する。
発明の背景
１．発明の分野
本発明は、一般的にはコンピュータ通信プロトコルに関し、特に共有メモリ・キャッシュ・コヒーレンス及び保護された共有で無いメッセージ・パッシング（protected shared nothing message passing）の両方をサポートする統合されたプロトコルに関する。
２．背景技術の議論
あるクラスのマルチプロセッサ・データ・コンピュータシステムは、高速相互接続ネットワーク上で通信する複数のプロセッサ・ノードで構成される。各プロセッサ・ノードは、典型的にはプロセッサとローカル・ランダム・アクセス・メモリ（ＲＡＭ）を有する。演算上の問題は、異なるプロセッサ・ノードで利用可能な特定の資源を使用するか、又は結果を生成するのに必要な実時間を低減してそれにより演算を促進するように、プロセッサ・ノードに分割される。それゆえ、１つのプロセッサ・ノードで進行するプロセスはコンピュータシステムの他のプロセッサ・ノードで実行される演算に依存している。各種のプロセスは、情報を交換してプロセスを同期させるように相互接続上で通信する。
プロセッサが相互にどのように通信するかによって異なる２つの主たるマルチプロセッサ・プログラミングの例がある。共有メモリの例は、すべてのプロセッサが装置全体ですべてのメモリにアクセスするのを許す。プロセッサは、１つのプロセッサが所定のメモリ位置に値を書込み、他のプロセッサが同一のメモリ位置からその値を読むことにより、互いに通信する。これに対して、共有で無い（すなわち、メッセージ・パッシング）例では、各プロセッサはその固有メモリにだけアクセスすることができ、明瞭にメッセージを作成し、それらを他のプロセッサに送ることにより他のプロセッサと通信する。両方のプログラミング例は、それぞれ利点があり、両方共に使用されている。共有メモリ例の利点は、より効率的な通信を提供することであり、共有でない例は、１つのプロセスの他のすべてのプロセッサからのより良好な保護を提供することである。
従来技術のシステムは、１つ又は他のプログラミング例が可能である。もし両方が可能であれば、それらは２つの異なる形式の相互接続上でサポートされ、通常は共有メモリ及び関係するキャッシュ・コヒーレンス・プロトコルについては高性能相互接続で、メッセージ・パッシングについてはより低い性能の相互接続でサポートされる。
発明の概要
本発明の目的は、共有メモリ・キャッシュ・コヒーレンス・プロトコルと共有でないメッセージ・パッシング・プロトコルの同一の高性能相互接続上への統合を含み、プロセッサの共有メモリへの完全なアクセスは許すが、プロセッサの共有でないメモリ間には保護障壁を設ける。
本発明は、メッセージ・パッシング・プロトコルをスケーラブル共有メモリ・マルチプロセッシング・コンピュータシステムのキャッシュ・コヒーレンス管理に拡張するシステム及び方法に属する。スケーラブル共有メモリ・マルチプロセッシング・コンピュータシステムは、相互接続に接続された複数のプロセッサを有し、プロセッサは相互接続の上で相互に通信する。相互接続プロセッサは、共有メモリ・コンピュータシステム、共有でないコンピュータシステム、及び一部のプロセッサはメモリを共有するが他は共有しないハイブリッド・コンピュータシステムに適用可能な通信プロトコルを使用して、相互接続上でメッセージの送信と受信を行う。ハイブリッド・コンピュータシステムについては、プロセッサの共有メモリへの完全なアクセスは許すが、プロセッサの共有でないメモリ間には保護障壁を設けることが重要である。プロセッサ・ノードは、やって来るメッセージが（完全には保護されない）同一のコヒーレンス・グループ内から来るか、又は（共有でない保護を適用する）コヒーレンス・グループの外側から来るかを告げることが可能である。これにより、プロセッサ・ノード共有メモリと共有でないプロセッサ・ノードを、同一の相互接続上に一緒に存在させることを可能にする。これは、ノード識別番号（ＮＩＤｓ）、コヒーレンス・ノード番号（ＣＮＮｓ）、及びそれらの間のマッピング（地図）を使用して実現される。システムにおける各プロセッシング・ノードは、ＮＩＤが与えられる。メモリを共有する組内のノードは、それぞれＣＮＮが割り当てられ、ＣＮＮｓからＮＩＤｓまでの一貫したマッピングに一致する。このマッピングで、プロセッサ・ノードは、やって来たメッセージが（完全には保護されない）同一のコヒーレンス・グループ内から来たか、又は（共有でない保護を適用する）そのコヒーレンス・グループの外側から来たかを告けることができる。これにより、プロセッサ・ノード共有メモリと共有でないプロセッサ・ノードを、同一の相互接続上に一緒に存在させることを可能にする。
本発明のこれら及び他の目的は、以下の本発明の詳細な説明及び付属の図面から、当業者には明らかになるであろう。
【図面の簡単な説明】
図１は、本発明の多重プロセッサ・ノードを有するコンピュータシステムの機能ブロック図である。
図２は、メモリ・サブシステム、入力／出力サブシステム、メッシュ・コヒーレンス・ユニット（ＭＣＵ）及びそれぞれのキャッシュを有するプロセッサを含む図１のプロセッサ・ノードの例の機能ブロック図である。
図３は、メッシュ・コヒーレンス・ユニットのブロック図である。
好適な実施例の詳細な説明
本発明は、メッセージ・パッシング・プロトコルと必要な保護障壁をスケーラブル共有メモリ・マルチプロセッシング・コンピュータシステムに統合するシステム及び方法に属する。スケーラブル共有メモリ・マルチプロセッシング・コンピュータシステムは、相互接続に接続された複数のプロセッサを有し、プロセッサは相互接続の上で相互に通信する。メッセージが送られる伝統的な機構は、入力／出力チャンネルと相互接続を介する。これに対して、本発明は、同一の通信チャンネル、相互接続、キャッシュ・コヒーレンスとメッセージ・パッシングの両方を使用し、これがメッセージを交換するレートを著しく増加させる。各プロセッサは、プロセッサ間ノード・キャッシュ・コヒーレンス・プロトコルと密接に統合されたメッセージ・プロトコルを使用してメッセージの送信と受信を行うことにより、他の相互接続プロセッサと通信する。
図１は、本発明によるコンピュータシステム１００の機能ブロック図であり、多重プロセッサ・ノード１０２ａ−ｔと、接続されたノード間の点間通信を提供するプロセッサ・ノード相互接続１０４とを有する。それぞれのプロセッサ・ノード１０２ａ−ｔは、スタンドアロン・コンピュータシステムとして又はメモリを共有するコンピュータシステムを形成するように他のプロセッサ・ノード組み合わされて構成されている。「サイト」という語は、メモリにおける物理的なアドレス空間を共有するプロセッサ・ノードのグループを示すのに使用される。選択されたプロセッサ・ノード１０２ａ−ｄ，１０２ｆ−ｉ，１０２ｌ−ｏ及び１０２ｑ−ｔは、それぞれサイト１０６ａ，１０６ｂ，１０６ｃ及び１０６ｄとして構成されている。他のプロセッサ・ノード１０２ｅ，１０２ｊ，１０２ｋ及び１０２ｐも、相互接続１０４を介して接続されているが、メモリを共有していない。異なるサイトのプロセッサ・ノードは、メッセージ・パッシングを介して通信する。例えば、サイト１０６ａにおけるプロセッサ・ノードは、他のサイトのプロセッサ・ノード、例えばサイト１０６ｃのプロセッサ・ノード１０２ｎと相互接続１０４を介してメッセージを送ることにより通信する。各サイトは、典型的には、対称マルチプロセッサ（ＳＭＰ）上で動作するのに類似したオペレーティングシステムの単一コピーを動作させる。
プロセッサ・ノード１０２ａ−ｔ間の統合されたメッセージ・パッシング及びメモリ保護を有するキャッシュ・コヒーレンスは、例示のシステム１００に適用されることで、本発明が実現される。あるサイトにおけるプロセッサ・ノード、例えばサイト１０６ａのプロセッサ・ノード１０２ａ−ｄは、物理的なアドレスメモリ空間を共有する。更に、それぞれのプロセッサ・ノードは、それぞれのキャッシュメモリ２０４ａ−ｄ（図２）を有する多重プロセッサを有する。このように、キャッシュ・コヒーレンスは、プロセッサのキャッシュ間で、ノード内だけでなく、異なるプロセッサ・ノード内でも維持されなければならない。例えば、ノード１０２ａにおけるキャッシュは、ノード１０２ｂ−ｄにおけるキャッシュとコヒーレントでなければならない。
本発明は、サイト内のプロセッサ・ノードに共有物理アドレス空間へのアクセスを許し、サイト外のプロセッサ・ノードにはアクセスを拒絶するメモリ保護機構を有する。例えば、プロセッサ・ノード１０２ｅは、プロセッサ・ノード１０２ａにメッセージを送ることができる。しかし、プロセッサ・ノード１０２ｅはサイト１０６ａ内にはないので、プロセッサ・ノード１０２ｅはサイト１０６ａの物理アドレス空間でのメモリアクセス動作を実行できない。
メモリ保護機構は、ノード識別子（ＮＩＤｓ）及びコヒーレンス・ノード番号（ＣＮＮｓ）及びそれらの間のマッピングに依存している。システムにおける各プロセッサ・ノードは、システムに渡って特有のＮＩＤが与えられる。更に、サイト内の各プロセッサ・ノードは、サイトに特有のＣＮＮを割り当てられる。サイト内のプロセッサ・ノードは、それぞれそのサイト内の他のすべてのプロセッサ・ノードのＣＮＮｓとＮＩＤｓの間のマッピングを保持するテーブルを保持している。システム１００においては、サイト１０６ｄは、ＮＩＤ＝１５１、ＣＮＮ＝１のノード１０２ｔ、ＮＩＤ＝１５２、ＣＮＮ＝２のノード１０２ｓ、ＮＩＤ＝１５３、ＣＮＮ＝３のノード１０２ｒ、及びＮＩＤ＝１５４、ＣＮＮ＝４のノード１０２ｑを有している。プロセッサ・ノード間で通信されるメッセージは、常にソース・プロセッサ・ノードを識別するＮＩＤを有する。受けたノードは、そのマッピング・テーブルを使用して、やって来たメッセージが同一サイト内のプロセッシング・ノードから来たかそうでないかを決定する。もしソース・ノード・サイトが到着先ノード・サイトと異なれば、メモリ・アクセス保護を適用する。
図２は、プロセッサ・ノード１０２の機能ブロック図である。プロセッサ・ノード１０２は、例えば、図１のプロセッサ・ノード１０２ａ−ｔであり、それぞれキャッシュ２０４ａ−ｄを有するプロセッサ２０２ａ−ｄと、メモリ・サブシステム２０６と、入力／出力サブシステム２０８と、メッシュ・コヒーレンス・ユニット（ＭＣＵ）２１０とを有する。それぞれの機能ユニット２０２ａ−ｄ，２０６，２０８及び２１０は、バス２１２に接続され、ユニット間で制御、アドレス及びデータ信号を送ることができる。メッシュ・コヒーレンス・ユニット２１０は、相互接続１０４に接続されている。メッシュ・コヒーレンス・ユニット２１０は、プロセッサ間ノード・キャッシュ・コヒーレンス、プロセッサ間ノード・メッセージ・パッシング、及びプロセッサ間ノード・メモリ保護を統合する。
プロセッサ２０２ａ−ｄ、メモリ・サブシステム２０６、入力／出力サブシステム２０８、メッシュ・コヒーレンス・ユニット２１０、及びバス２１２は、１つの可能なプロセッサ・ノードの構成例を示し、例えば、異なる個数のプロセッサを使用することもできる。
図３は、メッシュ・コヒーレンス・ユニット２１０のブロック図であり、それはプロセッサ間ノード・キャッシュ・コヒーレンスを維持し、プロセッサ間ノード１０２ａ−ｔの間のメッセージ・パッシングをサポートし、及び許可されていないメモリ・アクセスから保護する。
メッシュ・コヒーレンス・ユニット２１０は、出力制御要素３１０、入力制御要素３１２、キャッシュ・コヒーレンス制御要素３１４、及び相互接続インターフェース３１６を有する。出力制御要素３１０はＣＮＮマップ・レジスタ３３２を有し、及び入力制御要素３１２はメモリ・アクセス制御要素３３４を有する。出力制御要素３１０は、バス２１から、ライン３２２を介して制御信号を、ライン３２４を介しデータ信号を受け、出力メッセージを相互接続インターフェース３１６に送り、指定されたプロセッサ・ノード１０２ａ−ｔに送る（図１を参照）。同様に、入力制御要素３１２は、相互接続インターフェース３１６からメッセージを受け、例えば１０２ａに送られるように、制御信号はライン３２６を介して、データ信号はライン３２８を介してバス２１２に送る（図１を参照）。
キャッシュ・コヒーレンス制御要素３１４は、キャッシュされたメモリ位置の状態情報を保持する。再び図２を参照して、サイト１０２内の各プロセッサ・ノードは、そのノード内のプロセッサに固有のメモリ・サブシステム２０６を有する。キャッシュ・コヒーレンス制御要素３１４（図３）は、メッシュ・コヒーレンス・ユニット２１０に対して相対的に固有のメモリ位置のコヒーレンシィを統合する。
ＣＮＮマップ・レジスタ３３２及びメモリ・アクセス制御要素３３４は、ノード識別番号（ＮＩＤｓ）、コヒーレンス・ノード番号（ＣＮＮｓ）、及びそれらの間のマッピングを使用して、プロセッサ・ノード間のメモリ保護を提供する。システム１００内の各ノード１０２（図１参照）は、ＮＩＤが与えられる。一旦ノードの組がメモリを共有させると、それぞれにＣＮＮが割り当てられ、すべてＣＮＮｓとＮＩＤｓの一貫したマッピングを採用する。このマッピングで、プロセッサ・ノードは、やって来るメッセージが（完全には保護されない）同一のコヒーレンス・グループから来たか、（共有でない保護が適用される）コヒーレンス・グループの外から来たかを告げることができる。これにより、プロセッサ・ノード共有メモリと共有でないプロセッサ・ノードを、同一の相互接続上に一緒に存在させることを可能にする。
これまで説明した例示の実施例は、説明のためであり、限界を意図したものではない。従って、当業者には、以下の請求の範囲の範囲及び趣旨から離れない範囲で、他の実施例が実現できることが認識されるであろう。 Reference to other pending applications The present invention is a common assignee with the inventor, Wofl-Dietrich Weber and Jasper Kohli, “distributed shared memory multiplayer with integrated message passing support. It is related to the pending serial number 9 / 003,771, filed January 7, 1998, under the name “Memory Protection Mechanism for Processors”.
Background of the Invention FIELD OF THE INVENTION The present invention relates generally to computer communication protocols, and in particular to support both shared memory cache coherence and protected shared nothing message passing. Related protocol.
2. 2. Background Art Discussion A class of multiprocessor data computer systems consists of multiple processor nodes communicating over a high speed interconnect network. Each processor node typically has a processor and local random access memory (RAM). Arithmetic issues can cause processor nodes to use specific resources available on different processor nodes, or to reduce the real time required to produce results and thereby facilitate operations. Divided. Therefore, processes that run on one processor node depend on operations performed on other processor nodes in the computer system. The various processes communicate over the interconnect to exchange information and synchronize the processes.
There are two main examples of multiprocessor programming that depend on how the processors communicate with each other. The shared memory example allows all processors to access all memory throughout the device. Processors communicate with each other by one processor writing a value to a given memory location and another processor reading the value from the same memory location. In contrast, in non-shared (ie, message passing) examples, each processor can only access its own memory, and clearly creates messages and sends them to other processors Communicate with the processor. Both programming examples have their advantages and both are used. The advantage of the shared memory example is that it provides more efficient communication, and the non-shared example is that it provides better protection from all other processors of a process.
Prior art systems are capable of one or other programming examples. If both are possible, they are supported on two different types of interconnects, usually high performance interconnects for shared memory and related cache coherence protocols, and lower performance for message passing Supported in the interconnect.
SUMMARY OF THE INVENTION An object of the present invention includes the integration of a shared memory cache coherence protocol and a non-shared message passing protocol onto the same high performance interconnect, and the complete integration of the processor into shared memory. However, a protective barrier is provided between non-shared memory of processors.
The present invention belongs to a system and method for extending a message passing protocol to cache coherence management in a scalable shared memory multiprocessing computer system. A scalable shared memory multiprocessing computer system has a plurality of processors connected to an interconnect, and the processors communicate with each other over the interconnect. Interconnect processors can be used on interconnects using communication protocols applicable to shared memory computer systems, non-shared computer systems, and hybrid computer systems where some processors share memory but not others. Send and receive messages. For hybrid computer systems, full access to the processor's shared memory is allowed, but it is important to provide a protective barrier between the processors' non-shared memory. The processor node can tell whether incoming messages come from within the same coherence group (which is not fully protected) or from outside the coherence group (which applies non-shared protection). This allows processor nodes that are not shared with processor node shared memory to exist together on the same interconnect. This is achieved using node identification numbers (NIDs), coherence node numbers (CNNs), and mappings (maps) between them. Each processing node in the system is given an NID. Each node in the set sharing the memory is assigned a CNN and matches a consistent mapping from CNNs to NIDs. With this mapping, the processor node reports whether the incoming message came from within the same coherence group (which is not fully protected) or from outside that coherence group (which applies non-shared protection). Can This allows processor nodes that are not shared with processor node shared memory to exist together on the same interconnect.
These and other objects of the present invention will become apparent to those skilled in the art from the following detailed description of the invention and the accompanying drawings.
[Brief description of the drawings]
FIG. 1 is a functional block diagram of a computer system having a multiprocessor node of the present invention.
FIG. 2 is a functional block diagram of the example processor node of FIG. 1 including a memory subsystem, an input / output subsystem, a mesh coherence unit (MCU), and a processor having respective caches.
FIG. 3 is a block diagram of the mesh coherence unit.
Detailed Description of the Preferred Embodiments The present invention belongs to a system and method that integrates a message passing protocol and the necessary protection barriers into a scalable shared memory multiprocessing computer system. A scalable shared memory multiprocessing computer system has a plurality of processors connected to an interconnect, and the processors communicate with each other over the interconnect. The traditional mechanism by which messages are sent is through the input / output channels and interconnections. In contrast, the present invention uses the same communication channel, interconnection, both cache coherence and message passing, which significantly increases the rate at which messages are exchanged. Each processor communicates with other interconnected processors by sending and receiving messages using a message protocol that is tightly integrated with the interprocessor node cache coherence protocol.
FIG. 1 is a functional block diagram of a computer system 100 according to the present invention having multiple processor nodes 102a-t and a processor node interconnect 104 that provides point-to-point communication between the connected nodes. Each processor node 102a-t is configured as a stand-alone computer system or in combination with other processor nodes to form a computer system that shares memory. The term “site” is used to indicate a group of processor nodes that share a physical address space in memory. The selected processor nodes 102a-d, 102f-i, 102l-o and 102q-t are configured as sites 106a, 106b, 106c and 106d, respectively. Other processor nodes 102e, 102j, 102k and 102p are also connected via interconnect 104, but do not share memory. Processor nodes at different sites communicate via message passing. For example, a processor node at site 106a communicates by sending a message over interconnect 104 with a processor node at another site, eg, processor node 102n at site 106c. Each site typically runs a single copy of an operating system similar to running on a symmetric multiprocessor (SMP).
Cache coherence with integrated message passing and memory protection between processor nodes 102a-t is applied to the exemplary system 100 to implement the present invention. The processor nodes at a site, eg, processor nodes 102a-d at site 106a, share the physical address memory space. In addition, each processor node has multiple processors with respective cache memories 204a-d (FIG. 2). Thus, cache coherence must be maintained between processor caches, not only in nodes, but also in different processor nodes. For example, the cache at node 102a must be coherent with the cache at nodes 102b-d.
The present invention has a memory protection mechanism that allows processor nodes in the site to access the shared physical address space and denies access to processor nodes outside the site. For example, the processor node 102e can send a message to the processor node 102a. However, since the processor node 102e is not in the site 106a, the processor node 102e cannot perform a memory access operation in the physical address space of the site 106a.
Memory protection mechanisms rely on node identifiers (NIDs) and coherence node numbers (CNNs) and the mapping between them. Each processor node in the system is given a unique NID across the system. In addition, each processor node in the site is assigned a CNN specific to the site. Each processor node in a site maintains a table that maintains a mapping between CNNs and NIDs of all other processor nodes in the site. In the system 100, the site 106d has a node 102t with NID = 151 and CNN = 1, a node 102s with NID = 152 and CNN = 2, a node 102r with NID = 153 and CNN = 3, and a node 102r with NID = 154 and CNN = 4. Node 102q. Messages communicated between processor nodes always have an NID that identifies the source processor node. The receiving node uses its mapping table to determine whether the incoming message came from a processing node in the same site or not. If the source node site is different from the destination node site, apply memory access protection.
FIG. 2 is a functional block diagram of the processor node 102. The processor node 102 is, for example, the processor node 102a-t of FIG. 1, each of the processors 202a-d having caches 204a-d, a memory subsystem 206, an input / output subsystem 208, and a mesh node. A coherence unit (MCU) 210. Each functional unit 202a-d, 206, 208 and 210 is connected to the bus 212 and can send control, address and data signals between the units. The mesh coherence unit 210 is connected to the interconnect 104. The mesh coherence unit 210 integrates interprocessor node cache coherence, interprocessor node message passing, and interprocessor node memory protection.
Processors 202a-d, memory subsystem 206, input / output subsystem 208, mesh coherence unit 210, and bus 212 illustrate one possible processor node configuration, for example, a different number of processors. It can also be used.
FIG. 3 is a block diagram of mesh coherence unit 210, which maintains interprocessor node cache coherence, supports message passing between interprocessor nodes 102a-t, and is not allowed memory.・ Protect from access.
The mesh coherence unit 210 has an output control element 310, an input control element 312, a cache coherence control element 314, and an interconnection interface 316. The output control element 310 has a CNN map register 332 and the input control element 312 has a memory access control element 334. The output control element 310 receives control signals from the bus 21 via line 322, data signals via line 324, sends output messages to the interconnect interface 316, and sends them to designated processor nodes 102a-t ( (See FIG. 1). Similarly, the input control element 312 receives messages from the interconnect interface 316 and sends control signals via line 326 and data signals via line 328 to the bus 212, for example, to 102a (FIG. 1). See).
The cache coherence control element 314 maintains cached memory location state information. Referring again to FIG. 2, each processor node in site 102 has a memory subsystem 206 that is specific to the processors in that node. Cache coherence control element 314 (FIG. 3) integrates coherency of memory locations that are unique relative to mesh coherence unit 210.
CNN map register 332 and memory access control element 334 provide memory protection between processor nodes using node identification numbers (NIDs), coherence node numbers (CNNs), and mappings between them. . Each node 102 (see FIG. 1) in the system 100 is given an NID. Once a set of nodes share memory, each is assigned a CNN and all employ a consistent mapping of CNNs and NIDs. With this mapping, the processor node can tell whether the incoming message came from the same coherence group (which is not fully protected) or from outside the coherence group (which applies non-shared protection) . This allows processor nodes that are not shared with processor node shared memory to exist together on the same interconnect.
The exemplary embodiments described so far are illustrative and not intended to be limiting. Accordingly, those skilled in the art will recognize that other embodiments can be implemented without departing from the scope and spirit of the following claims.

Claims

A computer system,
Interconnects,
A plurality of processor nodes having at least one processor connected to the bus and connected to the interconnect;
One or more nodes form a site that is a group of processor nodes sharing a physical address space of memory;
Each processor node is given node identification information that is unique across the system,
The processor node is
An output controller that controls the output of messages and memory access requests from the bus to the interconnect;
An input controller that controls input of messages and memory access requests from the interconnect to the bus;
A table storing a mapping between a coherence node number unique to the site and node identification information given to another processor node in the site to which the processor node belongs;
A memory protection unit that permits sharing of memory within a site by processor nodes within the same site, while prohibiting memory sharing within the site by processor nodes that are connected to the interconnect and located outside the same site. And
When a set of processor nodes is set to share memory, a coherence node number is assigned to each set of processor nodes, and a mapping between the coherence node number and node identification information is stored in the table. The computer is characterized in that the processor node refers to the table to determine whether or not the input message to which the node identification information is attached is a memory access request from a processor node in the same site. system.

The plurality of processor nodes are divided into a multiprocessor node system;
Memory of a first processor node belonging to at least one multiprocessor node system is accessible to a second processor node belonging to said multiprocessor node system;
The memory of the second processor node is accessible to the first processor node;
The input control unit of the first processor node has said in response to the memory access request from the interconnect, the memory access control unit for selectively rejecting memory access request from the interconnect The computer system according to claim 1.

The plurality of processor nodes are divided into a multiprocessor node system;
At least one multiprocessor node system has distributed shared memory;
Memory of a first processor node belonging to a multiprocessor node system is accessible to a second processor node belonging to the multiprocessor node system;
The memory of the second processor node is accessible to the first processor node;
The input control unit of the first processor node in response to a memory access request from the interconnect, using a node identification numbers, and the coherence node number, a mapping between its those The computer system according to claim 1, further comprising a memory access control unit that selectively rejects a memory access request from the interconnection.

The plurality of processor nodes are divided into a multiprocessor node system;
At least one multiprocessor node system has distributed shared memory;
Memory of a first processor node belonging to a multiprocessor node system is accessible to a second processor node belonging to the multiprocessor node system;
The memory of the second processor node is accessible to the first processor node;
In response to a memory access request from the interconnection, the input control unit of the first processor node is configured to output a node identification number, a coherence node number, and between the node identification number and the coherence node number. 2. The computer system of claim 1, further comprising a memory access controller that selectively rejects memory access requests from the interconnect using a mapping of .

Having a plurality of processor nodes connected to the interconnect;
At least one of the plurality of processor nodes is:
Memory connected to the bus;
A plurality of processors connected to the bus including at least one processor having a cache;
A mesh coherence unit, and
The mesh coherence unit is
A unique coherence node number within the site, given to other processor nodes within the site that is a group of processor nodes that share the physical address space of the memory to which the processor node belongs, and the system A table that stores mappings with unique node identification information across,
An output controller that controls the output of messages and memory access requests from the bus to the interconnect;
An input controller that controls input of messages and memory access requests from the interconnect to the bus;
A cache coherence control unit connected to the output control unit and the input control unit and controlling cache coherence between processor nodes; and
Processors belonging to the same site share the memory in the site, while processors belonging to the same site do not share the memory in the site,
When a set of processor nodes is set to share memory in the site, a coherence node number is assigned to each of the processor nodes constituting the set, and the mapping between the coherence node number and the node identification information is performed as described above. The processor node refers to the table and determines whether the message with the node identification information to be input is a memory access request from a processor node belonging to the same site. Characteristic computer system.