JPH10228458A

JPH10228458A - Multiprocessor computer having configurable hardware system domain

Info

Publication number: JPH10228458A
Application number: JP9342920A
Authority: JP
Inventors: P Dorogichen Daniel; ダニエル・ピー・ドロギチェン; J Mcclauclin Andrew; アンドリュー・ジェイ・マククロクリン; E Aneshanzuree Nicholas; ニコラス・イー・アネシャンズレー
Original assignee: Sun Microsystems Inc
Current assignee: Sun Microsystems Inc
Priority date: 1996-12-12
Filing date: 1997-12-12
Publication date: 1998-08-25
Anticipated expiration: 2017-12-12
Also published as: EP0848327A2; JP4124507B2; US5931938A; EP0848327B1; USRE41293E1; DE69738864D1; EP0848327A3

Abstract

PROBLEM TO BE SOLVED: To separate an entire computer into many independent units from the viewpoint of both software and hardwares by allowing a global address router and a global data router to interconnect different system unit groups. SOLUTION: Each LAS 55L3 on a system unit is connected to one of four different GABs 55G1. An arbiter 55L4 physically consists of four same parts, and each of the parts responds to an access request from lines 811 and communicates with the GAA 55G2 and one of the different LAS 55L3. That is, entire functions of a local part where address router 450 is connected and a global part establishes schedule of the four GABs 55G1 among competitive requests from six ports of all of system unit group 410. The decision to all of the four GABs 55G1 is mutually and simultaneously processed in LAA 55L4 of each system unit.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、電子コンピュータ
に関し、更に特定すれば、多数のプロセッサを可変集合
即ちドメイン群に動的に分離可能とし、処理上の独立性
を得ると共に、ハードウエアのエラーにも拘わらず動作
し続けることを可能にするマルチプロセッサ・アーキテ
クチャに関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an electronic computer, and more particularly to an electronic computer, in which a large number of processors can be dynamically separated into a variable set, that is, a group of domains, to obtain processing independence and to reduce hardware errors. Nevertheless, it relates to a multi-processor architecture that allows it to continue to operate.

【０００２】[0002]

【従来の技術】多数の単純な端末を駆動する集中型メイ
ンフレーム・コンピュータの多くは、パーソナル・コン
ピュータのネットワークによって置き換えられている。
これらのネットワークの殆どは、１台以上のサーバ・コ
ンピュータを組み込んでおり、そのサーバ・コンピュー
タは個々のユーザのためにデータやプログラムを格納し
ている。実際に、サーバは、メインフレームの属性の多
くを引き継いだ、高性能スーパーサーバ(superserver)
に発展しつつある。しかしながら、スーパーサーバは、
ネットワーク型システムにおいては機能が異なってお
り、したがってスーパーサーバのアーキテクチャは、メ
インフレームまたはパーソナル・コンピュータのアーキ
テクチャとは異なるものでなければならない。2. Description of the Related Art Many centralized mainframe computers that drive many simple terminals have been replaced by networks of personal computers.
Most of these networks incorporate one or more server computers, which store data and programs for individual users. In fact, the server is a high-performance superserver that has inherited many of the attributes of the mainframe.
It is developing into. However, the super server
Functionality is different in networked systems, and therefore the architecture of the superserver must be different from the architecture of the mainframe or personal computer.

【０００３】[0003]

【発明が解決しようとする課題】スーパーサーバが他の
アーキテクチャとは領域を異にするのは、それらが１つ
以上のオペレーティング・システム−または１種類のオ
ペレーティング・システムの１つ以上のバージョンを、
異なるジョブのためまたは異なるユーザのために同時に
実行可能でなければならないからである。What sets superservers apart from other architectures is that they contain one or more operating systems-or one or more versions of one type of operating system,
Because it must be able to run simultaneously for different jobs or for different users.

【０００４】また、スーパーサーバは、可用性および信
頼性が非常に高くなければならない。これらは、ハード
ウエア・エラーおよびソフトウエア・エラーに対する高
い耐性を有していなければならず、コンピュータが実行
している間はサービスを提供可能であることが望まし
い。パーソナル・コンピュータの単一（または密接に結
合された多数の）プロセッサ・アーキテクチャとは異な
り、更にスーパーコンピュータの大量（超）並列設計と
も異なり、スーパーサーバは、予測不可能な資源要求を
有する、広範囲にわたる多数のしかも多種多様のタスク
を、実行する柔軟性を必要とする。[0004] Also, the super server must have very high availability and reliability. They must be highly resistant to hardware and software errors, and should be able to provide services while the computer is running. Unlike the single (or many closely coupled) processor architectures of personal computers, and unlike the massively (super) parallel design of supercomputers, superservers have a wide range of unpredictable resource requirements. Need the flexibility to perform a large number and wide variety of tasks.

【０００５】多くの方法では、スーパーサーバは、超大
型コンピュータとしておよび小型コンピュータとして実
行することが要求される。これは、それらのアーキテク
チャに多数の矛盾（競合）する要求を行うことになる。[0005] Many methods require that the superserver run as a very large computer and as a small computer. This places a number of conflicting (conflicting) requirements on those architectures.

【０００６】[0006]

【課題を解決するための手段】本発明は、前述の問題お
よびその他の関連する問題を、ソフトウエアで構成可能
な「ハードウエア・ドメイン群」によって、ソフトウエ
アおよびハードウエア双方の観点から、コンピュータ全
体を多数の独立ユニットに分離する、総合的なコンピュ
ータ・アーキテクチャを提供する。即ち、異なるドメイ
ンが異なるオペレーティング・システムおよびアプリケ
ーションを、互いに独立して実行するだけでなく、他の
ドメインで発生した致命的（重大）なハードウエア・エ
ラーとは無関係に動作するものである。「クラスタ(clu
ster)」によって、複数のドメインがメモリ・アドレス
の共通範囲を共有することを可能とし、データ転送の高
速化を図る。特権的構成−制御ソフトウエアが、物理的
にコンピュータを変更することなく、コンピュータ資源
を複数のドメインおよびドメイン・クラスタに分割する
こと、およびいずれの時点においても複数のドメインお
よびクラスタを再構成することを、オペレータまたはソ
フトウエア・プロセスに可能にさせる。このアーキテク
チャを用いるコンピュータは、パーソナル・コンピュー
タやワークステーション・コンピュータにおいて共通に
用いられているマイクロプロセッサのような、容易に入
手可能な市販のコンポーネント(commodity component)
で構築することができる。SUMMARY OF THE INVENTION The present invention addresses the foregoing and other related problems by means of software configurable "hardware domains," which are used in terms of both software and hardware. Provides a comprehensive computer architecture that separates the whole into a number of independent units. That is, different domains not only run different operating systems and applications independently of each other, but also operate independently of fatal (serious) hardware errors that have occurred in other domains. "Cluster (clu
ster) "allows a plurality of domains to share a common range of memory addresses, thereby speeding up data transfer. Privileged configuration-the control software partitions computer resources into multiple domains and domain clusters without physically modifying the computer, and reconfigures multiple domains and clusters at any one time Is enabled by an operator or software process. Computers using this architecture are readily available, commercially available component components, such as microprocessors commonly used in personal and workstation computers.
Can be built with

【０００７】本発明は、完全に分離された環境におい
て、新しいバージョンのソフトウエアの検査（テスト）
を行いつつ、コンピュータの残りの部分で通常のタスク
を継続することが可能である。コンピュータの一部分が
拡張された診断または予防的保守を実行し、その一方
で、残りの部分が同時に通常のユーザ・タスクを実行す
ることができる。同じコンピュータの異なる部分が、異
なるオペレーティング・システム・ソフトウエア（また
は、同じソフトウエアの異なるバージョン、異なるチュ
ーニング（ｔｕｎｉｎｇ）、またはパラメータ設定）の
下で実行可能であり、タイムシェアやデータベース質問
（ｑｕｅｒｙ）、またはオンライン・トランザクション
処理や意思決定支援システムのような、複数の異なる種
類の作業負荷の最適化を図ることができる。[0007] The present invention is directed to testing new versions of software in a completely isolated environment.
While continuing normal tasks on the rest of the computer. One part of the computer can perform extended diagnostics or preventive maintenance, while the other part can perform normal user tasks at the same time. Different parts of the same computer can run under different operating system software (or different versions of the same software, different tunings or parameter settings), time sharing and database queries Or a plurality of different types of workloads, such as online transaction processing and decision support systems.

【０００８】コンピュータの各部分は、他の部分のソフ
トウエア・エラーだけでなく、ハード・メモリ・エラー
やアドレス要求ラインの誤動作のようなハードウエアの
障害にも不感応である。本発明によるコンピュータは、
ハードウエアの障害によって、同一ハードウエア・ドメ
イン内にはないいずれのプロセッサやメモリにも、アド
レスまたはデータ信号が誤って転送されることを防止す
ると共に、多くのシステム全体に関係する制御信号が異
なるドメイン内のハードウエアに影響を与えるのを物理
的に防止する。Each part of the computer is insensitive to hardware failures, such as hard memory errors and malfunctioning address request lines, as well as software errors in other parts. The computer according to the present invention
Hardware faults prevent address or data signals from being accidentally transferred to any processor or memory not in the same hardware domain, and many system-wide control signals differ. Physically prevent the hardware in the domain from being affected.

【０００９】更なる利点も、当業者には明白であろう。
例えば、双方向（対話的）ジョブは、異なるドメインで
実行することにより、バッチ・ジョブから分離すること
ができる。生産（プロダクション）タスクは、１つのド
メインにおいて中断することなく実行し、開発または問
題の分離を他のドメイン内で同時に行うことも可能であ
る。新しくリリースされるソフトウエアの互換性を、古
い版（バージョン）を同時に実行する同じシステム上で
検査することができる。ときには、複数の組織(organiz
ation)が同じシステムを共有し、別個のドメインを使用
し、各々がそれら自体のタスクに対するある程度のレベ
ルの資源占有性を保証することができ、この占有性は、
容易にスケジュールすることが可能であると共に、構成
要素を物理的に置き換えたり、あるいは信号線を手作業
で切り替えることもなく、ソフトウエア制御の下でドメ
インおよびクラスタを再構成するだけで、短い通知をも
って変更することも可能である。[0009] Further advantages will be apparent to those skilled in the art.
For example, interactive (interactive) jobs can be separated from batch jobs by running in different domains. Production tasks can run uninterrupted in one domain and development or problem isolation can occur simultaneously in other domains. The compatibility of the newly released software can be checked on the same system running the old version at the same time. Sometimes, multiple organizations (organiz
ation) share the same system and use separate domains, each of which can guarantee some level of resource occupancy for their own tasks,
Can be easily scheduled and short notices by simply reconfiguring domains and clusters under software control without physically replacing components or manually switching signal lines It is also possible to change with.

【００１０】端的に言えば、本発明によるコンピュータ
は、多数の個別システム・ユニットを有し、その各々が
プロセッサ、メモリ・セグメント、および／または入出
力アダプタを有する。中央相互接続部が、システム・ユ
ニット間でアドレスおよびデータを移送する。ドメイン
・コントローラが、ドメイン・フィルタを動的に構成
し、互いに独立して機能し、他のドメイン内の重要なハ
ードウエア・エラーにさえも無関係な複数のドメインを
形成する。１つのドメインのプロセッサ、メモリおよび
Ｉ／Ｏは、単一の一元化したコンピュータ・システムと
して機能し、それらの物理的位置は、同じシステム・ユ
ニットにあろうが、異なるシステム・ユニットにあろう
が無関係である。加えて、複数のドメインが動的にクラ
スタに相互接続し、それらのメモリ空間のいくつかまた
は全てを共有することができる。複数のドメインおよび
クラスタは、ソフトウエア制御の下で設定されるレジス
タの内容によって定義される。[0010] Briefly, a computer according to the present invention has a number of individual system units, each having a processor, a memory segment, and / or an input / output adapter. A central interconnect transports addresses and data between system units. A domain controller dynamically configures the domain filters and functions independently of one another to form domains that are independent of even significant hardware errors in other domains. One domain's processors, memory and I / O function as a single, centralized computer system, the physical locations of which may be on the same system unit or on different system units It is. In addition, multiple domains can dynamically interconnect with the cluster and share some or all of their memory space. The domains and clusters are defined by register contents set under software control.

【００１１】種々のシステム・ユニット間の通信は全
て、相互接続部上の「トランザクション」として発生す
る。トランザクションはメモリ・アドレスを含むことが
できるが、含まないものもある。通常のメモリ・トラン
ザクションは、非特権的アプリケーション・プログラム
が行うような、潜在的にキャッシュ可能な主メモリに対
して行われる。他のトランザクションは、（キャッシュ
不可能な）システム制御レジスタに対するもの、Ｉ／Ｏ
アダプタが使用するアドレス空間の部分に対するものが
あり、後者のものは、システム・ブート、ＯＳカーネ
ル、およびＩ／Ｏドライバのような、特権モード・コー
ドによってのみアクセス可能である。更に他のトランザ
クションには、割り込みをあげることができる。[0011] All communications between the various system units occur as "transactions" on the interconnect. Transactions can include memory addresses, but some do not. Normal memory transactions are performed on potentially cacheable main memory, such as by non-privileged application programs. Other transactions are to system control registers (non-cacheable), I / O
Some are for the portion of the address space used by the adapter, the latter being only accessible by privileged mode code, such as system boot, OS kernel, and I / O drivers. Still other transactions can be interrupted.

【００１２】複数のドメインは、ソフトウエアおよびハ
ードウエア双方において、互いに分離されている。個々
のサブシステムは、複数のシステム・カード、ボード、
または、メモリおよび／またはＩ／Ｏ機能を処理するた
めのハードウエアを潜在的に内蔵可能な他の複数のユニ
ットを備えることができる。個々のシステム・ユニット
は全てが完全なプロセッサの機能全てを内蔵しなければ
ならない訳ではないが、ドメインを形成する１組のユニ
ットは、とりわけ、完全なデータ処理システムの機能全
てを含んでいなければならない。単一のシステム・ユニ
ットが、１つのドメインを形成することもできる。いず
れのシステム・ユニットも、属することができるのは１
つのドメインのみである。ドメインは単一のデータ処理
システムとして機能し、その個々のシステム・ユニット
は互いに秘密を有さない。The domains are separated from each other in both software and hardware. Each subsystem consists of multiple system cards, boards,
Alternatively, other units can be provided that can potentially incorporate hardware for processing memory and / or I / O functions. The individual system units do not all have to contain all the functions of a complete processor, but the set of units forming the domain must, among other things, contain all the functions of a complete data processing system. Must. A single system unit may form one domain. Any system unit can belong to 1
Only one domain. A domain functions as a single data processing system, whose individual system units are not secret from each other.

【００１３】「ソフトウエアの分離」とは、あるドメイ
ンにおいて実行中のソフトウエアは、サブシステムにお
けるハードウエア障害がない限り、他のドメインで実行
中のソフトウエアに影響を与え得ないことを意味する。
このためには、各ドメインが、他のドメインとは共有し
ない、それ自体の１つの（または複数の）物理的プロセ
ッサ、複数のメモリ・ユニット、およびＩ／Ｏアダプタ
を有することが必要である。各システム・ユニットと共
通アドレス相互接続ハードウエアとの間のドメイン・フ
ィルタ・ハードウエアは、完全なシステム内に潜在的に
含まれる各ユニットに対する別個のビットを含むマスク
・レジスタを有する。これらのビットの状態が、他のシ
ステムのどれが同一ドメインのメンバであるのかを示
す。ユニットのインターフェースは、相互接続部からの
トランザクションが同じドメイン内のシステム・ユニッ
トにおいて発したものである場合にのみ、このトランザ
クションに応答する。このようにサブシステム間で分配
されるハードウエアは、別個のサービス・プロセッサの
ような、サブシステム外のエージェンシによってのみハ
ードウエアが制御可能である限り、ソフトウエアの分離
を保証するには十分である。[0013] "Software isolation" means that software running in one domain cannot affect software running in another domain unless there is a hardware failure in the subsystem. I do.
This requires that each domain has its own physical processor (s), multiple memory units, and I / O adapters that are not shared with other domains. The domain filter hardware between each system unit and the common address interconnect hardware has a mask register that contains a separate bit for each unit potentially included in the complete system. The state of these bits indicates which of the other systems are members of the same domain. The unit's interface responds to the transaction only if the transaction from the interconnect originated in a system unit in the same domain. The hardware distributed between subsystems in this manner is not sufficient to guarantee software separation, as long as the hardware can only be controlled by an agency outside the subsystem, such as a separate service processor. is there.

【００１４】加えて、「ハードウエアの分離」は、ある
ドメイン内で発生したハードウエア・エラーがコンピュ
ータ内の異なるドメインの動作に影響を与えないことを
示す。ハードウエアの分離は、個々のサブシステム間に
共通バス・アーキテクチャを用いている場合は、実施で
きない。何故なら、障害を発生したサブシステムがバス
全てを停止させる可能性があるからである。したがっ
て、クロスバー(crossbar)および／またはルータのよう
な、切り替え相互接続部をサブシステム間に用いる。１
つのサブシステム内のハードウエア障害は、それが異な
るサブシステムに属するものとして偽装させたり、ある
いは制御信号のパリティ・エラーのような、システム全
体に及ぶ致命的インターフェース信号を発生させる可能
性があるので、サブシステムのハードウエア分離は、サ
ブシステム自体以外に何らかの中央制御ロジックを必要
とし、制御信号の少なくともいくつかを、この中央ロジ
ックと各サブシステム間で１対１（ポイントツーポイン
ト）で結合することが必要となる。相互接続ハードウエ
アもドメイン・マスク・レジスタを有する場合、発生元
のドメイン内の各システムに「有効トランザクション」
信号を生成することができる。これによって、いずれの
ユニットも他のユニットのように偽装することを防止す
る。ソース・ドメイン以外のユニットは全てトランザク
ションを無視するので、これらは他のドメインから発せ
されたハードウエア・エラー信号にエラー状態を発生す
ることができない。相互接続ハードウエア自体における
障害も全てのドメインに影響を与える可能性があるが、
実際には、相互接続部は、サブシステム内のハードウエ
アに比較すると、小型で堅牢である。[0014] In addition, "hardware isolation" indicates that hardware errors that occur in one domain do not affect the operation of different domains in the computer. Hardware separation cannot be implemented if a common bus architecture is used between the individual subsystems. This is because the failed subsystem can shut down all buses. Thus, switching interconnects, such as crossbars and / or routers, are used between the subsystems. 1
Hardware failures within one subsystem can be disguised as belonging to different subsystems, or can cause fatal interface signals throughout the system, such as control signal parity errors. , Subsystem hardware isolation requires some central control logic other than the subsystem itself, and couples at least some of the control signals one-to-one (point-to-point) between this central logic and each subsystem. It is necessary. If the interconnect hardware also has a domain mask register, each system in the originating domain has a "valid transaction"
A signal can be generated. This prevents any unit from impersonating like other units. Since all units other than the source domain ignore the transaction, they cannot generate an error condition on hardware error signals originating from other domains. Failures in the interconnect hardware itself can also affect all domains,
In practice, the interconnects are small and robust when compared to the hardware in the subsystem.

【００１５】用途によっては、所定のドメインが、それ
らの個別にアドレス可能なメモリ空間の１つ以上のセグ
メントを共有することにより、互いとの通信に高帯域通
信を必要とする場合がある。本発明は、個別ドメインの
それに類似した特性を有するドメインのクラスタを提供
することができる。１つの個別システム・ユニットはそ
れ自体のクラスタとなることができ、あらゆる単一ユニ
ットは１つのクラスタのみのメンバとなることができ、
クラスタ関係は変化する。また、ドメインは正確に１つ
のクラスタ内にあり、クラスタは１つ以上のドメインを
含む。クラスタ関係が変化し得るものであることの必要
性は、ドメイン間でメモリを共有する際にそれを使用す
ることから生じたものである。ＡがドメインＢ，Ｃにメ
モリをエクスポートする場合、Ｂ，Ｃは相互接続部上で
互いのトランザクションに応答しなければならず、した
がって同じクラスタ内にいなければならない。この要件
は、上述のシステムにおいて、Ａ内の共有メモリからの
データの現在値が実際にＢまたはＣ内のキャッシュに位
置する場合があり、Ｂ内のプロセッサが新しい値をこの
アドレスに書き込む場合、Ｃのキャッシュ内のコピーを
無効化しなければならず、これを達成するためには、Ｃ
はＢからのトランザクションを全て見なければならない
という可能性から生じたものである。In some applications, certain domains may require high bandwidth communication to communicate with each other by sharing one or more segments of their individually addressable memory space. The present invention can provide a cluster of domains having properties similar to those of individual domains. One individual system unit can be its own cluster, any single unit can be a member of only one cluster,
Cluster relationships change. Also, the domains are in exactly one cluster, and a cluster includes one or more domains. The need for the cluster relationship to be variable stems from its use in sharing memory between domains. If A exports memory to domains B and C, B and C must respond to each other's transactions on the interconnect and must therefore be in the same cluster. This requirement is that in the system described above, if the current value of the data from the shared memory in A may actually be located in the cache in B or C, and the processor in B writes a new value to this address, The copy in C's cache must be invalidated, and to achieve this, C
Arises from the possibility that all transactions from B must be seen.

【００１６】クラスタ内のシステム・ユニットは、同じ
クラスタ内の他の各ユニットとメモリを共有する必要は
ないが、同じクラスタ内のユニットとのみメモリの共有
が可能である。システム・ユニットＡがある範囲の共有
アドレスをユニットＢにエクスポートする場合、Ｂは必
然的に同じクラスタにいるが、Ａ，Ｂと同じクラスタに
いるユニットＣは、このアドレス範囲を共有する必要は
ない。同じクラスタ内において、いずれのシステム・ユ
ニットも、トランザクションの発生元ユニットとして、
当該トランザクションを受けるが、このトランザクショ
ンに応答する必要性は必ずしもある訳ではない。即ち、
受信側ユニットはそれをフィルタ(filter)することも可
能である。実際には、複数のドメインは、メモリを共有
するためにのみクラスタ内で結合されており、当該クラ
スタの全てのメンバ・ドメイン内のシステム・ユニット
は全て、当該クラスタ内の全てのユニットからの、この
共有メモリに対応する特定の範囲のメモリ・アドレスに
対する通常のメモリ・トランザクションに応答するよう
に構成されている。共有メモリ自体は、クラスタ内の１
つのシステム・ユニット上に位置し、この範囲を、それ
自体以外のクラスタ内のドメインに「エクスポート」す
ると言う。（ユニットは、常にそれ自体のドメイン内の
ソース・ユニットからのトランザクション全てに応答す
る。これは、ここで用いられる意味では「エクスポー
ト」ではない。この用語は、ドメインの外側から発した
いくつかのメモリ・トランザクションへの応答の追加の
みを意味する。）したがって、システム・ユニットは、
それが異なるメモリ・アドレス範囲にエクスポートする
ユニットからはアクセス不可能な、キャッシュ可能メモ
リを含むことができる。ユニット内のシステム制御レジ
スタ群およびＩ／Ｏ装置群は、異なるドメイン内のユニ
ット群には全くアクセスすることはできない。たとえこ
れらのユニットが同じクラスタに属していてる場合で
も、アクセスすることはできない。A system unit in a cluster need not share memory with each other unit in the same cluster, but can share memory only with units in the same cluster. If system unit A exports a range of shared addresses to unit B, B is necessarily in the same cluster, but units C in the same cluster as A and B do not need to share this address range. . Within the same cluster, any system unit can be a transaction originating unit
While receiving the transaction, it is not always necessary to respond to the transaction. That is,
The receiving unit can also filter it. In practice, the domains are combined in a cluster only to share memory, and all system units in all member domains of the cluster are It is configured to respond to normal memory transactions for a specific range of memory addresses corresponding to the shared memory. The shared memory itself is the one in the cluster.
Located on one system unit, this range is said to be "exported" to domains in the cluster other than itself. (A unit always responds to all transactions from source units in its own domain. This is not an "export" in the sense used here. The term It only means adding a response to the memory transaction.) Therefore, the system unit
It may include a cacheable memory that is inaccessible from units exporting to different memory address ranges. The system control registers and I / O devices in a unit have no access to units in different domains. Even if these units belong to the same cluster, they cannot be accessed.

【００１７】クラスタ化によって、各システム・ユニッ
ト上のドメイン・レジスタに、共有メモリ・レジスタが
追加され、どのアドレスが共有されるのか、即ち、当該
クラスタにおける他のドメイン内の少なくとも１つのシ
ステム・ユニットにエクスポートされるのかを示す、範
囲レジスタ群も含むことができる。共有メモリ・レジス
タは、他のユニットのどれが、そのユニットとのアドレ
スの転送を行うことができるのかを示す。したがって、
システム・ユニットが、他のユニットからのトランザク
ションにおいて、アドレスに応答するのは、（ａ）ソー
ス・ユニットが同じドメインのメンバであること、また
は（ｂ）それが同じクラスタのメンバであることが共有
メモリレジスタ内に指定されており、そのアドレスが共
有するように指定されている範囲（あるとすれば）内に
ある場合にのみであり、しかも先に定義したような通常
のメモリ・トランザクションに対してのみ行う。相互接
続部内のドメイン・レジスタは、トランザクションを発
生したユニットと同じクラスタ内にある他のシステム・
ユニット群に有効性確認信号を送出することができる、
ドメイン・クラスタ・レジスタとなる。The clustering adds a shared memory register to the domain registers on each system unit, which addresses are shared, ie, at least one system unit in another domain in the cluster. Range registers that indicate if they are to be exported. The shared memory register indicates which of the other units can transfer addresses with that unit. Therefore,
The system unit responds to addresses in transactions from other units because (a) the source unit is a member of the same domain, or (b) it is a member of the same cluster. Specified in a memory register and only if the address is within the range (if any) specified to be shared, and for normal memory transactions as defined above. Only do it. The domain registers in the interconnect are used by other systems in the same cluster as the unit that originated the transaction.
A validity confirmation signal can be sent to the unit group,
It becomes a domain cluster register.

【００１８】[0018]

【発明の実施の形態】図１は、サーバまたは中型コンピ
ュータに典型的なアーキテクチャを有する従来技術のコ
ンピュータ１００を示す。コンピュータ１００は、マザ
ー・ボードまたはプラグイン・ボード上のプロセッサ１
１０、メモリ用の別個のボード１２０、およびＩ／Ｏア
ダプタ用の別個のボード１３０を有する。データ・バス
１４０およびアドレス・バス１５０が、異なる機能ブロ
ックを共に結合する。制御分配バス１６０は、エラー信
号を含む制御信号を種々のボードに送出する。システム
が大きい場合、起動、診断、および同様の機能に、専用
の制御およびサービス・ユニット１７０を有してもよ
い。FIG. 1 shows a prior art computer 100 having an architecture typical of a server or medium computer. The computer 100 includes a processor 1 on a mother board or a plug-in board.
10, a separate board 120 for memory and a separate board 130 for I / O adapters. A data bus 140 and an address bus 150 couple different functional blocks together. Control distribution bus 160 sends control signals, including error signals, to various boards. If the system is large, it may have dedicated control and service units 170 for startup, diagnostics, and similar functions.

【００１９】バー１０１は、コンピュータ１００のアド
レス空間全体を概略的に表している。プロセッサまたは
プロセッサ群（複数）は、メモリを含むボード即ちユニ
ット１２０に対するアドレスを全て、バス１５０上に送
出する。各ボードは、ある範囲のアドレスに応答するメ
モリを有する。各ボードは、通常メモリ・ボード上の機
械的スイッチまたはレジスタによって設定される、異な
るアドレス範囲を含む。また、プロセッサ１１０は、全
てのボード１３０上のＩ／Ｏアダプタ全てとも通信を行
う。A bar 101 schematically represents the entire address space of the computer 100. The processor or processors (s) send out all addresses on the bus 150 for the board or unit 120 containing the memory. Each board has memory responsive to a range of addresses. Each board contains a different address range, usually set by mechanical switches or registers on the memory board. The processor 110 also communicates with all of the I / O adapters on all the boards 130.

【００２０】図２は、多数のシステム・ユニット２１０
の各々が、プロセッサ自体の中に、メモリ、および互い
に結合されているＩ／Ｏアダプタ群を含み、当該ユニッ
トが潜在的にそれ自体で完全なコンピュータとして機能
することができるようにした、異なるアーキテクチャ２
００を示す。（しかしながら、システム・ユニットによ
っては、実際に、プロセッサのみ、メモリのみ、Ｉ／Ｏ
のみ、またはそれらの潜在的な全機能の下位の組み合わ
せのいくつかを含むものもある。）個々のシステム・ユ
ニット２１０は、同一ユニット内に、アドレスされたデ
ータを送信するか、あるいはシステム・ユニット群が差
し込まれているセンタプレーン相互接続構造(centerpla
ne interconnect structure)として構築されている、高
速ルータ２４０，２５０上を、同じ複合体内の他のシス
テム・ユニット群に、アドレスされたデータを送信す
る。制御分配バス２６０は、制御信号およびエラー信号
を全てのシステム・ユニット２１０に送出する。このよ
うなコンピュータは、典型的なパーソナル・コンピュー
タまたは中型コンピュータのバス組織に限定されるもの
ではない。例えば、データおよびアドレス・ルータ２４
０，２５０は、通常のポイントツーポイント配線、クロ
ス・ポイント・スイッチ、または多数の調停されたバス
として実施してもよい。システム２００全体は、メモリ
共有対称マルチプロセッサ・システムとして特徴付ける
ことができる。好ましくは、コヒーレントなキャッシュ
機能も用いる。この機構は、多数の従来からの方法で実
現すればよい。Sun Microsystems社から入手可能な市販
のCS6400は、このタイプの機械（マシン）の一例であ
る。FIG. 2 illustrates a number of system units 210.
Each include memory within the processor itself, and I / O adapters coupled to each other, allowing the unit to potentially function as a complete computer by itself. 2
00 is shown. (However, depending on the system unit, actually only the processor, only the memory, the I / O
Some may include only some or some of the sub-combinations of all of their potential functions. 2.) Individual system units 210 may transmit addressed data within the same unit, or the centerplane interconnect structure (centerpla) into which the system units are plugged.
The addressed data is transmitted over high speed routers 240, 250, which are constructed as ne interconnect structures, to other system units in the same complex. Control distribution bus 260 sends control and error signals to all system units 210. Such computers are not limited to the typical personal or medium computer bus organization. For example, data and address router 24
0,250 may be implemented as regular point-to-point wiring, cross-point switches, or multiple arbitrated buses. The entire system 200 can be characterized as a shared memory symmetric multiprocessor system. Preferably, a coherent cache function is also used. This mechanism may be implemented in a number of conventional ways. The commercially available CS6400 available from Sun Microsystems is an example of this type of machine.

【００２１】バー２０１は、コンピュータ２００のアド
レス空間を示す。各システム・ユニットは、潜在的にそ
れ自体で完全なコンピュータであるが、アドレス・ルー
タ２５０によって与えられる相互接続部は、共通のアド
レス空間全体に、全てのシステム・ユニットを配置す
る。即ち、各システム・ユニット２１０上のあらゆるメ
モリ位置の完全に承認されたアドレスは、他の全てのユ
ニット上のあらゆるメモリ位置のそれとは異なっていな
ければならない。A bar 201 indicates the address space of the computer 200. Although each system unit is potentially a complete computer by itself, the interconnect provided by the address router 250 places all system units throughout a common address space. That is, the fully authorized address of every memory location on each system unit 210 must be different from that of every memory location on every other unit.

【００２２】分配バス２６０上のエラー信号およびステ
ータス信号は、システム内のあらゆるユニット２１０に
影響を与える。例えば、エラー訂正コードは、ルータ２
５０上のビット・エラーを訂正し、コードが検出可能で
あるが訂正不可能なその他のエラーに対して致命的エラ
ー信号を生成する。このような致命的エラー信号は、通
常、当該エラーの原因となった障害を単一のシステム・
ユニットまたはルータの場所に閉じこめた場合でも、シ
ステム全体に影響を及ぼす。障害を発生したシステム・
ユニットは、連続的にエラー信号をアサートすることが
でき、システム全体を停止させる。CANCEL（時としてAB
ORTとも呼ぶ）信号が異なる状況を提示する。高性能シ
ステムには、思惑的にマルチサイクル処理を開始し、そ
れらの仮定が正しくなかった場合に処理を取り消すもの
がある。このような単一ドメイン・システムにおいてCA
NCELをアサートすると、システム全体のあるあらゆるユ
ニットを停止させることになる。The error and status signals on distribution bus 260 affect every unit 210 in the system. For example, the error correction code is
It corrects the bit errors on 50 and generates a fatal error signal for other errors where the code is detectable but not correctable. Such a catastrophic error signal usually indicates the fault that caused the error in a single system.
Even if trapped in a unit or router location, it will affect the entire system. The failed system
The unit can continuously assert an error signal, shutting down the entire system. CANCEL (sometimes AB
ORT) signal presents a different situation. Some high-performance systems speculatively start multi-cycle processing and cancel processing if those assumptions are incorrect. CA in such a single domain system
Asserting NCEL will stop any unit in the entire system.

【００２３】図３は１つの仮説的なコンピュータ３００
を示し、この中では、図２のユニット２１０に対応する
種々のシステム・ボード３１０が物理的に多数のドメイ
ンに分割され、各々がそれ自体の物理的に別個のデータ
・ルータまたはバス３４０、それ自体のアドレス・ルー
タまたはバス３５０、およびそれ自体の制御分配手段ま
たはバス３６０を有し、その上それ自体のシステム・コ
ントローラ３７０をも有することも可能である。実際に
は、コンピュータ３００は、多数の別々のコンピュータ
即ちドメインＳ１，Ｓ２，Ｓ３となる。加えて、多数の
ドメインは、それらのメモリ・アドレスの一部または全
てを共有して、アドレス・ルータ３５０の領域３５１に
よって示されるような、クラスタを形成することができ
る。図３では、ドメインＳ１は、それ自体が（変性）ク
ラスタＣＡであり、一方ドメインＳ２，Ｓ３は一体とな
ってクラスタＣＢを形成する。これら別々のクラスタの
アドレス空間は互いに重複していてもよく、各々が他と
は独立してそれ自体のオペレーティング・システムを実
行することができ、１つのドメイン・クラスタにおける
メモリ障害またはその他のハードウエア・エラーは、他
のドメイン・クラスタの動作に影響を与えることはな
い。FIG. 3 shows one hypothetical computer 300.
Wherein the various system boards 310 corresponding to the unit 210 of FIG. 2 are physically divided into a number of domains, each with its own physically separate data router or bus 340, It may have its own address router or bus 350 and its own control distribution means or bus 360, as well as its own system controller 370. In practice, computer 300 will be a number of separate computers or domains S1, S2, S3. In addition, multiple domains can share some or all of their memory addresses to form clusters, as indicated by area 351 of address router 350. In FIG. 3, domain S1 is itself a (denatured) cluster CA, while domains S2 and S3 together form a cluster CB. The address spaces of these separate clusters may overlap with each other, each may run its own operating system independently of the others, and may have memory failures or other hardware in one domain cluster Errors do not affect the operation of other domain clusters.

【００２４】バー３０１，３０２，３０３は、コンピュ
ータ３００のメモリ・アドレス空間が、３カ所の別個の
空間として処理可能であり、そのアドレスのいくらかま
たは全てが互いに重複していてもよいことを示す。加え
て、メモリ・アドレスの中には、３０４，３０５に示す
ように、複数のドメインとの間で、物理的に共有可能な
ものもある。下位の２つのアドレス・ルータ３５０をブ
リッジする領域３５１は、別々のドメイン間で共有され
るメモリアドレスを象徴化したものである。Bars 301, 302, 303 indicate that the memory address space of computer 300 can be treated as three separate spaces, some or all of whose addresses may overlap with each other. In addition, some memory addresses can be physically shared between multiple domains, as shown at 304 and 305. An area 351 bridging the lower two address routers 350 symbolizes a memory address shared between different domains.

【００２５】コンピュータ３００では、制御信号の多く
をそれらを適用することができないドメインから分離す
ることができる。したがって、ドメインＳ１内の致命的
エラー（アドレスまたは制御バスにおける訂正不可能な
エラーのような）は、当該ドメイン・クラスタの範囲内
でのみバス３６０上でARBSTOP信号を生成し、ドメイン
Ｓ２，Ｓ３の動作を継続させる。しかしながら、システ
ム３００は、永続的または少なくとも半永続的に手動で
構成されなければならない。即ち、再構成は、ボードの
再配置またはスイッチのリセットのために、システムの
完全な停止および配線のし直しまたは手作業による調節
を必要とする。このシステムは、可変量の資源を有する
異なるドメイン群やクラスタ群に動的にまたは容易に再
構成することはできない。In computer 300, many of the control signals can be separated from domains to which they cannot be applied. Thus, a fatal error in domain S1 (such as an uncorrectable error in the address or control bus) will generate an ARBSTOP signal on bus 360 only within the domain cluster in question and the domain S2, S3 Continue operation. However, the system 300 must be manually configured permanently or at least semi-permanently. That is, reconfiguration requires complete shutdown of the system and rewiring or manual adjustment for board relocation or switch reset. The system cannot be dynamically or easily reconfigured into different domains or clusters with variable amounts of resources.

【００２６】図４は、図２および図３のコンピュータ・
システム２００，３００を背景として構築し、ある例示
環境における本発明の好適な形態の概要を提示する。以
下に述べる詳細の多くは、それ自体本発明の概念に直接
関係するものではないが、この環境において本発明がど
のように機能するのかを理解するのに役立つ。FIG. 4 is a block diagram of the computer of FIG. 2 and FIG.
Building on the background of the systems 200, 300, an overview of the preferred form of the invention in an example environment is presented. Many of the details set forth below are not themselves directly related to the concepts of the present invention, but are helpful in understanding how the present invention functions in this environment.

【００２７】コンピュータ４００は、図３のユニット群
３１０に対応するシステム・ユニット群４１０を有す
る。データ・ルータ４４０は、全てのシステム・ユニッ
ト群４１０を物理的に相互接続する。アドレス・ルータ
４５０および制御バス４６０は、丁度図２におけるよう
に、全てのシステム・ユニット群を物理的に結合する。
コンピュータ４００においては、しかしながら、追加の
ドメイン・フィルタ４８０が、コンピュータ４００を、
互いに独立して動作可能なドメイン群およびクラスタ群
に電子的に分割する。ルータ４５０およびユニット群４
１０間のドメイン・フィルタの配置は、それがアドレス
信号および制御信号に作用しドメインへの分離を達成す
ることを象徴化している。好適な実施態様では、フィル
タ４８０は、アドレス・ルータ４５０の部分を形成する
チップの中に物理的に配置され、ルータ４５０自体は、
部分的にシステム・ユニット４１０内、および部分的に
共通のセンタプレーン構造内に物理的に配置される。ま
た、フィルタ４８０は、データ・ルータ４４０内に位置
する構成要素を含むこともできる。ドメイン・コンフィ
ギュレータ(domain configurator)４２０がフィルタ４
８０と通信し、任意にかつ動的にドメイン群およびクラ
スタ群を設定する。図４における例は、図３の対応する
マップ３０１〜３０３と同じメモリ・アドレス・マップ
４０１〜４０３を有する。The computer 400 has a system unit group 410 corresponding to the unit group 310 in FIG. Data router 440 physically interconnects all system units 410. Address router 450 and control bus 460 physically couple all system units, just as in FIG.
In computer 400, however, an additional domain filter 480 allows computer 400 to
It is electronically divided into domains and clusters that can operate independently of each other. Router 450 and unit group 4
The placement of the ten domain filters symbolizes that it acts on address and control signals to achieve domain separation. In a preferred embodiment, the filter 480 is physically located in a chip forming part of the address router 450, and the router 450 itself
Physically located partially within system unit 410 and partially within a common centerplane structure. Filter 480 may also include components located within data router 440. Domain configurator 420 is filter 4
80 and arbitrarily and dynamically set domains and clusters. The example in FIG. 4 has the same memory address maps 401-403 as the corresponding maps 301-303 in FIG.

【００２８】後の詳細説明について前もって述べておく
が、マップの隣にある番号は、種々の点におけるアドレ
スを示す。番号は１６進数であり、例示の実施態様で
は、'00 0000 0000'から'0F FFFF FFFF'までに及ぶ。
（図４は、可能な１６のシステム・ユニットの内８個の
みを示すので、この空間の前半である、アドレス'08 00
000000'までのみを含む。）また、システムは、システ
ム・レジスタやＩ／Ｏ装置にアクセスするために、別の
空間として、'10 0000 0000'から'1F FFFF FFFF'のアド
レスも採用している。本発明には関連のない詳細を少し
加えるが、例示のシステム・アーキテクチャは各システ
ム・ユニット４１０に４ギガバイト（ＧＢ）のアドレス
範囲を割り当てている。各範囲は割り当てられた単位番
号に４ＧＢをかけた数で開始するが、ユニット上に実際
に装着されるいずれのメモリも、その割り当てられた範
囲内であればどこから始まってどこで終わってもよい。
これは殆ど常に、実装されたメモリのアドレス範囲にお
いて穴あき状態を生ずる結果となるが、システム４００
はこの状況に容易に対処する。他のシステムは、しかし
ながら、全く異なるメモリ・アーキテクチャを有する本
発明を容易に実施することができる。As will be described in detail later, the numbers next to the map indicate addresses at various points. The numbers are hexadecimal and, in the exemplary embodiment, range from '00 0000 0000 'to' 0F FFFF FFFF '.
(FIG. 4 shows only eight of the 16 possible system units, so the first half of this space, address '08 00
Includes only up to 000000 '. Also, the system employs addresses from '10 0000 0000 'to' 1F FFFF FFFF 'as another space for accessing system registers and I / O devices. With some details not relevant to the present invention, the exemplary system architecture assigns each system unit 410 an address range of 4 gigabytes (GB). Each range starts with the assigned unit number multiplied by 4 GB, but any memory actually mounted on the unit may begin and end anywhere within the assigned range.
This almost always results in a puncture in the address range of the implemented memory, but the system 400
Addresses this situation easily. Other systems, however, can readily implement the present invention with completely different memory architectures.

【００２９】図５は、図４のシステム・ユニット４１０
を、データ・ルータ４４０、アドレス・ルータ４５０、
およびドメイン・フィルタ４８０の部分と共に示す。こ
れらはシステム・ユニットの一部をなし、これに結合さ
れているものである。図５は、本発明のドメイン・フィ
ルタによって管理される分配器４６０の個々の制御線を
殆ど示していない。図８ないし図１０において、代表的
な制御信号をより詳細に示しかつ検討することにする。
本例では、コンピュータ４００は、実装可能な１６のシ
ステム・ユニット４１０の内８個を有する。FIG. 5 shows the system unit 410 of FIG.
To the data router 440, the address router 450,
And a portion of the domain filter 480. These are part of the system unit and are connected to it. FIG. 5 shows little of the individual control lines of the distributor 460 managed by the domain filter of the present invention. 8-10, representative control signals will be shown and discussed in more detail.
In this example, computer 400 has eight of sixteen system units 410 that can be implemented.

【００３０】システム・ユニット４１０は、コンピュー
タ１００の主要構成要素１１０〜１３０の全てに対し
て、回路基板のような、１つの物理構造上に空間および
配線を含むが、これらの全てが特定のユニットに完全に
または部分的にでさえも設ける必要がある訳ではない。
プロセッサ・サブシステム５１０は、マイクロプロセッ
サ５１１を４個まで有することができ、各々それ自体の
キャッシュ５１２を有する。入出力サブシステム５３０
は２個のシステムＩ／Ｏバス５３１を含み、各々が種々
の従来からのＩ／Ｏアダプタ５３２を制御する。一方、
Ｉ／Ｏアダプタ５３２は、ディスク・ドライブ、端末コ
ントローラ、および通信ポートのような外部Ｉ／Ｏ装置
に、線５３３を結合する。メモリ・サブシステム５２０
は４個までのメモリ５２１のバンクを含み、各対が従来
からのパック／アンパック・モジュール(pack/unpack m
odule)５２２に結合する。完全に汎用なシステム・ユニ
ットの代わりとして、特殊化されたボードも使用可能で
ある。例えば、第１タイプのシステム・ユニットが、プ
ロセッサおよびメモリ・サブシステムに対してのみ配線
および場所を有してもよく、第２タイプが１個以上のＩ
／Ｏサブシステムのみを含むこともあろう。System unit 410 includes space and wiring on one physical structure, such as a circuit board, for all of the major components 110-130 of computer 100, all of which are specific units. It is not necessary to provide them completely or even partially.
The processor subsystem 510 can have up to four microprocessors 511, each with its own cache 512. I / O subsystem 530
Includes two system I / O buses 531, each controlling a variety of conventional I / O adapters 532. on the other hand,
I / O adapter 532 couples line 533 to external I / O devices such as disk drives, terminal controllers, and communication ports. Memory subsystem 520
Contains up to four banks of memory 521, each pair comprising a conventional pack / unpack module (pack / unpack m).
odule) 522. Specialized boards can be used as an alternative to completely general-purpose system units. For example, a first type of system unit may have wiring and locations only for the processor and memory subsystem, and a second type of one or more I / Os.
It may include only the / O subsystem.

【００３１】データ・ルータ４４０は、サブシステム５
１０〜５３０間でトランザクション・データの受け渡し
を行う。本実施例では、データ・ルータは、各システム
・ユニット４１０上のローカル部分５４Ｌ０およびセン
タプレーン上に配置されたグローバル部分５４Ｇ０の間
で物理的に分割される。図５において、ラベル５４Ｌ０
は、データ・ルータ４４０の内、構成要素５４Ｌ１〜５
４Ｌ３を有するローカル部分全体を示し、ラベル５４Ｇ
０は、構成要素５４Ｇ１〜５４Ｇ２を有するグローバル
部分全体を示す。The data router 440 is connected to the subsystem 5
Transaction data is exchanged between 10 and 530. In this embodiment, the data router is physically divided between a local portion 54L0 on each system unit 410 and a global portion 54G0 located on the center plane. In FIG. 5, the label 54L0
Are the components 54L1-5 of the data router 440.
Label L shows the entire local part with 4L3
0 indicates the entire global part having the components 54G1 to 54G2.

【００３２】ローカル・ルータ５４Ｌ０の各バッファ５
４Ｌ１は、例えば、２５６トランザクションを保持可能
な、少量の高速スタティック・ＲＡＭを有する。その通
常の目的は、アクティビティの可能なバーストに対抗し
て、データの流れの分離および平滑化を行うための保持
行列(holding queue)を提供することである。ローカル
・データ・スイッチ５４Ｌ２は全二重双方向１ｘ４クロ
スバーである。ローカル・データ・アービタ５４Ｌ３
は、グローバル・アービタ５４Ｇ２から許可を受け、せ
いぜい１つのバッファに、対応するトランザクション・
パケットを格納するように命令する。同時に、ローカル
・アービタ５４Ｌ３は、従来からの公正アルゴリズム(f
airness algorithm)を用いて、バッファの１つから待機
中のパケットを選択し、このパケットの代わりに、グロ
ーバル・アービタ５４Ｇ２への送信要求を発生する。グ
ローバル・データ・ルータ５４Ｇ０は、１つのシステム
・ユニットのＬＤＲ５４Ｌ０からのデータを、同一ユニ
ットまたは異なるユニットのＬＤＲ５４Ｌ０に転送す
る。この際、１６ｘ１６のクロスバー・アレイ５４Ｇ１
を用い、これがアービタ５４Ｇ２からの１６組の４ビッ
ト方向制御ロジック(steering logic)を受け取る。下位
では、これを、各システム・ユニット毎に１つとして、
１６個の１６入力マルチプレクサとして実現する。Each buffer 5 of the local router 54L0
4L1 has, for example, a small amount of high-speed static RAM capable of holding 256 transactions. Its usual purpose is to provide a holding queue for separating and smoothing the flow of data against possible bursts of activity. Local data switch 54L2 is a full-duplex bidirectional 1x4 crossbar. Local data arbiter 54L3
Receives permission from global arbiter 54G2 and stores at most one buffer with the corresponding transaction
Instructs to store the packet. At the same time, the local arbiter 54L3 uses the conventional fair algorithm (f
Using the airness algorithm), a waiting packet is selected from one of the buffers, and a transmission request to the global arbiter 54G2 is generated instead of this packet. The global data router 54G0 transfers data from the LDR 54L0 of one system unit to the LDR 54L0 of the same unit or a different unit. At this time, the 16 × 16 crossbar array 54G1
Which receives 16 sets of 4-bit steering logic from arbiter 54G2. At the lower level, this is one for each system unit,
Implemented as 16 16-input multiplexers.

【００３３】アドレス・ルータ４５０は、各システム・
ユニット４１０上のサブシステム５１０〜５３０間でア
ドレスの受け渡しを行うと共に、あるシステム・ユニッ
トから他のシステム・ユニットへのアドレスの受け渡し
も行う。データ・ルータと同様、５５Ｌ０で示すローカ
ル部分と、グローバル部分５５Ｇ０とを有する。この実
施態様では、アドレス・ルーティングは、ローカル（イ
ントラシステム）およびグローバル（インターシステ
ム）トランザクション双方に対して、同じ経路を進む。
ポート・コントローラ群５５Ｌ１およびメモリ・コント
ローラ５５Ｌ２は、サブシステム５１０〜５３０および
個々のルーティング・スイッチ５５Ｌ３間に、従来から
のインターフェースを設ける。この場合、個々のプロセ
ッサ５１１、Ｉ／Ｏバス５３１、メモリ・ユニット５２
１は、事実上ローカル・アドレス・スイッチ（ＬＡＳ：
local address switch）５５Ｌ３に直接接続されている
と見なしてもよい。ＬＡＳ群５５Ｌ３は、キャッシュ・
コヒーレンシ（ｃａｃｈｅｃｏｈｅｒｅｎｃｙ）のよう
な、多数の通常からの機能を実行する。本発明の目的の
ために、それらの機能は、システム・ユニット内のプロ
セッサ群５１１、Ｉ／Ｏバス群５３１、およびメモリ５
２１からのアドレスの、グローバル・アドレス・ルータ
５５Ｇ０との受け渡しを行うことである。The address router 450 is connected to each system
Addresses are transferred between the subsystems 510 to 530 on the unit 410, and addresses are transferred from one system unit to another system unit. Similar to the data router, it has a local part indicated by 55L0 and a global part 55G0. In this embodiment, address routing follows the same path for both local (intrasystem) and global (intersystem) transactions.
Port controller group 55L1 and memory controller 55L2 provide a conventional interface between subsystems 510-530 and individual routing switches 55L3. In this case, the individual processors 511, the I / O bus 531 and the memory unit 52
1 is effectively a local address switch (LAS:
(local address switch) may be regarded as being directly connected to 55L3. The LAS group 55L3 is
Performs a number of conventional functions, such as cache coherency. For the purposes of the present invention, their functions are: a processor group 511, an I / O bus group 531 and a memory 5 in the system unit.
21 is to transfer the address from the global address router 21 to the global address router 55G0.

【００３４】本実施例のアドレス・ルータ４５０のグロ
ーバル部分５５Ｇ０は、１６個のシステム・ユニット間
で共有される４系統のアドレス・バス５５Ｇ１を有す
る。別個のグローバル・アドレス・アービタ５５Ｇ２
が、各ユニット内のローカル・アドレス・アービタ５５
Ｌ４からのトランザクション要求に応答して、各アドレ
ス・バスを種々のシステム・ユニット４１０に割り当て
る。The global portion 55G0 of the address router 450 of this embodiment has four address buses 55G1 shared among 16 system units. Separate global address arbiter 55G2
Is the local address arbiter 55 in each unit.
Each address bus is assigned to a different system unit 410 in response to a transaction request from L4.

【００３５】本実施例では、システム・ユニット上の各
ＬＡＳ５５Ｌ３は、線９１５，９２２上の白抜き円によ
って象徴化されているように、４個のＧＡＢ５５Ｇ１の
異なる１つに接続する。アービタ５５Ｌ４は、物理的
に、４個の同一部分から成り、その各々が、線８１１か
らのアクセス要求に応答して、ＧＡＡ５５Ｇ２およびＬ
ＡＳ５５Ｌ３の異なる１つと通信を行う。即ち、アドレ
ス・ルータ４５０の結合されたローカル部分およびグロ
ーバル部分の全体的な機能は、システム・ユニット群４
１０全ての６個のポート（コントローラ５５Ｌ１の各々
から２つ）からの競合する要求間で、４個のＧＡＢ５５
Ｇ１のスケジュールを立てることである。４個のＧＡＢ
５５Ｇ１全てに対する決定は、各システム・ユニットの
ＬＡＡ５５Ｌ４において、互いに対して同時に処理され
る。In the present embodiment, each LAS 55L3 on the system unit connects to a different one of the four GABs 55G1, as symbolized by the open circles on lines 915 and 922. Arbiter 55L4 physically consists of four identical parts, each of which responds to an access request from line 811 with GAA 55G2 and L
It communicates with a different one of the AS55L3s. That is, the overall function of the combined local and global portions of the address router 450 is the system unit group 4
10 conflicting requests from all six ports (two from each of the controllers 55L1)
G1 schedule. 4 GABs
The decisions for all 55G1 are processed simultaneously for each other in the LAA 55L4 of each system unit.

【００３６】図６は、図５の通常のポート・コントロー
ラ５５Ｌ１内における、関連するアドレス・ルーティン
グを示す。各コントローラ・チップは、アドレス線およ
び制御線を含み、２個のプロセッサまたは２系統のＩ／
Ｏバスを、４系統のアドレス・バスのいずれかとインタ
ーフェースする。双方向ドライバ／レシーバ６１０が、
アウトバウンド（出）トランザクションを線６１１から
先入れ／先出し（ＦＩＦＯ）バッファ６２０に送出す
る。スイッチ群６２１は、ＦＩＦＯ出力を双方向ドライ
バ／レシーバ６３０に出力し、ＦＩＦＯ出力は線９１１
を通じて図５のローカル・アドレス・ルータ５５Ｌ３に
進む。線９１１からのインバウンド（入）トランザクシ
ョンは、ドライバ／レシーバ６３０からＦＩＦＯ６４０
に進む。マルチプレクサ群６４１は、格納されているト
ランザクションから選択を行い、それらをドライバ／レ
シーバ６１０に送出し、線６１１を通じて送信する。線
８１１はスイッチ群６２１、マルチプレクサ群６４１、
およびポート・コントローラ５５Ｌ１内の他の構成要素
（図示せず）の制御を行う。また、コントローラの構成
要素は、ステータス方法や他の情報を、図５のローカル
・アドレス・アービタ５５Ｌ４に送信する。FIG. 6 illustrates the relevant address routing within the conventional port controller 55L1 of FIG. Each controller chip includes address lines and control lines, and includes two processors or two I / O lines.
The O bus is interfaced with any of the four address buses. The bidirectional driver / receiver 610
An outbound transaction is sent out on line 611 to a first in / first out (FIFO) buffer 620. The switch group 621 outputs the FIFO output to the bidirectional driver / receiver 630, and outputs the FIFO output to the line 911.
Through the local address router 55L3 of FIG. An inbound transaction from line 911 is sent from driver / receiver 630 to FIFO 640.
Proceed to. Multiplexers 641 make a selection from the stored transactions and send them to driver / receiver 610 for transmission over line 611. Line 811 is a switch group 621, a multiplexer group 641,
And controls other components (not shown) in the port controller 55L1. Also, the components of the controller send the status method and other information to the local address arbiter 55L4 in FIG.

【００３７】図７は、図５の通常のメモリ・コントロー
ラ５５Ｌ２の関連部分を示す。このチップは、ポート・
コントローラ５５Ｌ１がＤＲＡＭメモリ・チップ５２１
の４個のバンクに対して行う機能と同様の機能を実行す
る。図５のローカル・アドレス・ルータ５５Ｌ３からの
線９１１は、トランザクションをＦＩＦＯ記憶部７１０
に供給する。クロスバー・スイッチ７２０は、これらト
ランザクションからのアドレスを、線７２１を介して４
個のメモリ・バンクに送出する。即ち、ユニット内に位
置するメモリ・セグメントのメモリ・アドレスの異なる
下位範囲(subrange)内にデータが位置する限り、同じシ
ステム・ユニット４１０上の多数のバンクが同時にデー
タの読み出しおよび書き込みを行うことができる。通常
の調停（アービトレーション）ロジック７２２が、個々
のＦＩＦＯ７１０を種々の出力７２１に割り当てる。図
９からの線９４８は、このシステム・ユニットには見る
ことができるようにされていないトランザクションから
のメモリ・アクセスを取り消す（キャンセルする）。FIG. 7 shows relevant parts of the conventional memory controller 55L2 of FIG. This chip
The controller 55L1 is a DRAM memory chip 521
Performs the same functions as those performed for the four banks. The line 911 from the local address router 55L3 in FIG.
To supply. Crossbar switch 720 passes the addresses from these transactions to line 4 via line 721.
To the number of memory banks. That is, as long as the data is located in different subranges of the memory addresses of the memory segments located within the unit, multiple banks on the same system unit 410 can simultaneously read and write data. it can. Normal arbitration logic 722 assigns individual FIFOs 710 to various outputs 721. Line 948 from FIG. 9 cancels (cancels) memory accesses from transactions that are not made visible to this system unit.

【００３８】図８は、図５のローカル・アドレス・アー
ビタ５５Ｌ４の詳細を示す。各システム・ユニット４１
０は、１つのＬＡＡチップ５５Ｌ４を含む。各ポート・
コントローラ５５Ｌ１は、ＦＩＦＯバッファ８１０内の
キューへのＧＡＢ要求信号を立ち上げる(raise)ことに
より、使用可能なグローバル・アドレス・バス５５Ｇ１
の１つにおいて、ショット(shot)を要求することができ
る。これらの線は、図５に示した通常のポート制御線８
１１の一部を形成する。調停ロジック８２０は、多数の
通常の公正アルゴリズムのいずれかを用いて、これらの
要求間で選択を行い、次いでＧＡＢ要求および方向制御
線８２１を立ち上げる。要求線は、ロジック８２０が特
定のグローバル・アドレス・バスへのアクセスを望んで
いるか否かについての指示を与える。グローバル・アド
レス・アービタ５５Ｇ２が、線８２２上の特定のアドレ
ス・バスに対する要求を許可した場合、アービタ・チッ
プ５５Ｌ４は、線８２３を通じて適切なＬＡＳチップ５
５Ｌ３に通知する。FIG. 8 shows details of the local address arbiter 55L4 of FIG. Each system unit 41
0 includes one LAA chip 55L4. Each port
Controller 55L1 raises the available global address bus 55G1 by raising a GAB request signal to a queue in FIFO buffer 810.
In one, a shot may be requested. These lines are the normal port control lines 8 shown in FIG.
11 is formed. The arbitration logic 820 makes a selection between these requests using any of a number of conventional fair algorithms, and then raises the GAB request and direction control line 821. The request line provides an indication as to whether the logic 820 wants to access a particular global address bus. If the global address arbiter 55G2 grants a request for a particular address bus on line 822, the arbiter chip 55L4 sends the appropriate LAS chip 5 via line 823.
Notify 5L3.

【００３９】ＬＡＡ５５Ｌ４は、多数の方法で、コンピ
ュータ４００の動作を中断することができる。システム
・ユニット上でのパリティ・エラーのような致命的エラ
ーは、線８２４上でARBSTOP制御信号を発生することが
できる。即ち、ＬＡＡはARBSTOP制御信号の発生器とし
て作用する。通常のコンピュータでは、この信号は、制
御分配器４６０を通じて、他の各システム・ユニットの
ＬＡＡ内のARBSTOP検出線８２７に直接同報通信する。
したがって、１つのユニットにおける致命的エラーが、
通常では、コンピュータ４００全体の各システム・ユニ
ットを停止させることによって、ユーザ・データの汚染
(corruption)を回避すると共に、障害分析のための早急
のダンプ(dump)を許す。図１０に関連付けて説明する
が、しかしながら、本コンピュータはこの信号を濾過(f
ilter)し、同じドメイン・クラスタ内のシステム・ユニ
ットだけが、当該ドメイン・クラスタ内のユニットの１
つからの出るARBSTOP信号を受けるようにする。The LAA 55L4 can interrupt the operation of the computer 400 in a number of ways. A fatal error, such as a parity error on the system unit, can generate an ARBSTOP control signal on line 824. That is, the LAA acts as a generator of the ARBSTOP control signal. In a typical computer, this signal would be broadcast directly through the control distributor 460 to the ARBSTOP detect line 827 in the LAA of each of the other system units.
Therefore, a fatal error in one unit is
Typically, shutting down each system unit of the entire computer 400 results in contamination of user data.
(corruption) as well as allowing an immediate dump for failure analysis. As will be described with reference to FIG. 10, however, the computer filters this signal (f
ilter), and only the system units in the same domain cluster are one of the units in the domain cluster.
To receive the ARBSTOP signal coming out of one.

【００４０】また、システム・ユニットは、他の全ユニ
ットに対するHOLD制御信号８２５を、それらの対応する
検出線上でアサートすることも可能である。通常では、
いずれのシステム・ユニットからのアウトバウンドHOLD
信号も、他の各ユニットの対応するインバウンドHOLD線
８２６に直接移動するので、当該システム・ユニットの
入力キューが保留中の処理で飽和したときはいつでも、
コンピュータ全体がそれ以上のトランザクションを要求
するのを禁止した。加えて、障害のあるシステム・ユニ
ット４１０は、連続してHOLDをアサートすることによ
り、コンピュータ全体を停止させる可能性がある。しか
しながら、図１０では、この信号も濾過するので、線８
２５上のアウトバウンドHOLDは、同じドメイン・クラス
タ内のシステム・ユニット群上の入来するHOLD８２６に
影響を与えるに過ぎない。The system units can also assert HOLD control signals 825 for all other units on their corresponding detection lines. Usually,
Outbound HOLD from any system unit
The signal also goes directly to the corresponding inbound HOLD line 826 of each other unit, so whenever the system unit's input queue is saturated with pending processing,
Prohibits the entire computer from requesting further transactions. In addition, the faulty system unit 410 may shut down the entire computer by asserting HOLD continuously. However, in FIG. 10, this signal is also filtered, so line 8
The outbound HOLD on 25 only affects the incoming HOLD 826 on the system units in the same domain cluster.

【００４１】このように、ローカル・アドレス・アービ
タ５５Ｌ４は、他のシステム・ユニットの動作に影響を
与える可能性がある、GAB REQ, ARBSTOP-out, HOLD-o
utのような制御信号の発生器として作用する。また、こ
れは、他のシステム・ユニットからのこれら制御信号GA
B GRANT, ARBSTOP-in, HOLD-inの受け取り側としても
作用する。従来のシステムであれば、全てのシステム・
ユニットからの出立制御信号を単に互いに結びつけ、そ
れらの他の全ユニットの受け取り側に送出するだけであ
る。しかしながら、本システムは、これらをドメイン・
フィルタに通すので、１つのＬＡＡにおいて発生された
信号は、同じドメインまたはドメイン・クラスタ内のユ
ニットのみのＬＡＡに影響を与えるに過ぎない。以下で
明らかとなろうが、他の動作装置も、異なるドメイン定
義にしたがって、ドメイン・フィルタが通過させたり、
阻止することができる制御信号の発生器および受け取り
側として作用する。As described above, the local address arbiter 55L4 has a possibility of affecting the operation of another system unit, such as GAB REQ, ARBSTOP-out, HOLD-o.
Acts as a generator of control signals such as ut. This is also because these control signals GA from other system units
Also acts as a recipient of B GRANT, ARBSTOP-in and HOLD-in. For conventional systems, all systems
The exit control signals from the units are simply tied together and sent to the receivers of all other units. However, the system does not
Because of the filtering, the signal generated at one LAA only affects the LAA of units within the same domain or domain cluster only. As will become apparent below, other operating devices may also be passed by a domain filter according to different domain definitions,
Acts as a generator and receiver of control signals that can be blocked.

【００４２】図９は、図４のドメイン・フィルタ４８０
の一部としての機能に集中して、図５のローカル・アド
レス・ルータ・チップ５５Ｌ３を詳細に示す。その動作
全体は複雑であり、トランザクション毎に多数のサイク
ルを必要とするが、この場合の目的に必要なのは、図５
の各アドレス・バス５５Ｇ１が所定のトランザクション
に対してアドレス・ビットおよびいくつかの制御信号を
搬送することのみである。FIG. 9 illustrates the domain filter 480 of FIG.
5, the local address router chip 55L3 of FIG. 5 is shown in detail. The overall operation is complex and requires a large number of cycles per transaction, but the purpose in this case is that of FIG.
Each address bus 55G1 carries address bits and some control signals for a given transaction.

【００４３】アウトバウンド・アドレス制御部９１０
は、線群９１１上の各ポート・コントローラ５５Ｌ１
（図５）からのトランザクション・アドレスを受け取
り、それらを、エラー訂正コード発生器９１２を通じ
て、ＦＩＦＯバッファ９１３に送出する。マルチプレク
サ９１４が、ローカル・アドレス・アービタ５５Ｌ４に
よって確立され線８２３を通じて通信されるそれらの特
性に応じて、待ちアドレス(waiting address)をドライ
バ９１６を通して出グローバル・アドレス線９１５上に
選択的に結合する。Outbound address control section 910
Represents each port controller 55L1 on the line group 911.
It receives the transaction addresses from (FIG. 5) and sends them through the error correction code generator 912 to the FIFO buffer 913. Multiplexer 914 selectively couples a waiting address through driver 916 onto outgoing global address line 915 depending on those characteristics established by local address arbiter 55L4 and communicated over line 823.

【００４４】線１０２３上のVALID信号が、特定のシス
テム・ユニット４１０に対してトランザクションが有効
であることを知らせるときはいつでも、従来の入来アド
レス・スイッチ９２０が入来アドレス線９２２を通じ
て、受信部９２１においてグローバル・アドレス・バス
からのトランザクション・アドレスを受け取る。この線
がインアクティブのままである場合、ＬＡＳ５５Ｌ３が
対応するバス・サイクルをアイドル・サイクルとして扱
い、何の動作も行わない。有効なトランザクションから
のアドレスは、線９２３を通って直接メモリ・コントロ
ーラ５５Ｌ２に進む。他のシステム・ユニットからのア
ドレスは、ＥＣＣデコーダ９２４およびキャッシュ・コ
ヒーレンシ・ユニット９３０を通じて、入来アドレス・
スイッチ９２５に進む。アドレスの中には、読み出し／
書き込み制御部９２６または応答制御部９２７を通じて
スイッチ９２５に進むものもある。最後に、スイッチユ
ニット９２５は、適正な線９１１への入来アドレスを、
ポート・コントローラ群５５Ｌ１の１つに通過させる。Whenever the VALID signal on line 1023 indicates to a particular system unit 410 that a transaction is valid, a conventional incoming address switch 920 may be connected to the receiving unit via incoming address line 922. At 921, a transaction address from the global address bus is received. If this line remains inactive, LAS 55L3 will treat the corresponding bus cycle as an idle cycle and take no action. The address from a valid transaction goes directly to memory controller 55L2 via line 923. Addresses from other system units are passed through the ECC decoder 924 and the cache coherency unit 930 to receive incoming address
Proceed to switch 925. Some of the addresses are read /
Some go to the switch 925 through the write control unit 926 or the response control unit 927. Finally, the switch unit 925 sets the incoming address on the appropriate line 911,
Pass through one of the port controller groups 55L1.

【００４５】ブロック９３０は、通常の方法で、図５の
キャッシュ間のコヒーレンシを維持する。線９３１は、
キャッシュ制御部９３０が動作を中断すべきことを判定
した場合、それ自体のシステム・ユニットからCANCEL制
御信号を生成する。高性能システムは、多数のクロック
・サイクルにわたって、処理を思惑的に実行しつつ、平
行して当該処理は本当に実行すべきか否かについて判定
を行うことができる。この処理を実行する条件が満たさ
れない場合、出線９３１は、制御分配器４６０を通じ
て、他のシステム・ユニット群全ての入来CANCEL線９３
２にCANCEL信号を同報通信する。これによって、キャッ
シュ制御部９３０はメモリ・コントローラ５５Ｌ２への
MEM CANCEL線９４８をアサートし、データを変更できる
ようになる前に、いずれのメモリ動作の完了も禁止す
る。例えば、メモリがＲＡＭから読み出しを行い、その
間に、システムはむしろ現在値がプロセッサ群の１つの
キャッシュ内に位置するか否かについて判定する。この
場合も、ドメイン・フィルタ４８０が、１つのシステム
・ユニットからのCANCEL-out信号９３１が、同じドメイ
ン・クラスタ内にはないユニットのCANCEL-in線９３２
に影響を与えるのを防止するので、各クラスタは、この
制御信号およびその他の制御信号に関しては、他のクラ
スタとは独立して動作することができる。線９３３も、
線９４８を経由するあらゆるボード上メモリの動作を取
り消す。これについては、以下で説明する。Block 930 maintains coherency between the caches of FIG. 5 in the usual manner. Line 931 is
When the cache control unit 930 determines that the operation should be interrupted, it generates a CANCEL control signal from its own system unit. High performance systems can perform a process speculatively over a number of clock cycles, while concurrently making a determination as to whether or not the process should really be performed. If the condition for executing this process is not satisfied, the outgoing line 931 is sent through the control distributor 460 to the incoming CANCEL line 93 of all the other system units.
2 to broadcast a CANCEL signal. As a result, the cache control unit 930 sends a request to the memory controller 55L2.
Assert the MEM CANCEL line 948 to inhibit the completion of any memory operations before data can be changed. For example, the memory reads from RAM while the system rather determines whether the current value is located in a cache of one of the processors. In this case as well, the domain filter 480 determines that the CANCEL-out signal 931 from one system unit is not connected to the CANCEL-in line 932 of a unit not in the same domain cluster.
, Each cluster can operate independently of the other clusters with respect to this and other control signals. Line 933 also
Cancel any on-board memory activity over line 948. This will be described below.

【００４６】システム４００は、異なるシステム・ユニ
ット群４１０において発生および終了するトランザクシ
ョン、および同じユニットにおいて発生および終了する
トランザクションの区別を行わない。全てのトランザク
ションは、本システムにおけるグローバル・アドレス・
バスを通過する。何故なら、ドメインまたはクラスタ内
の各キャッシュ・コントローラは、同じ群の他のキャッ
シュ全てのキャッシュ線におけるトランザクションにつ
いて知っていなければならないからである。The system 400 does not distinguish between transactions occurring and ending in different system units 410 and transactions originating and ending in the same unit. All transactions are handled by the global address
Pass the bus. This is because each cache controller in a domain or cluster must know about transactions on cache lines of all other caches in the same group.

【００４７】各アービタ・チップ５５Ｌ３における図４
のドメイン・フィルタ４８０のローカル部分９４０は、
同じシステム・ユニット４１０内の他のチップ５５Ｌ３
の全てに配置されている部分９４０と同一であり、常に
それと同じデータを搬送する。しかしながら、ブロック
９４０の各コピーは、バス群５５Ｇ１の異なる１つから
の入来アドレス線９２１を、線９２１を介して受け取
る。FIG. 4 for each arbiter chip 55L3
The local portion 940 of the domain filter 480 of
Other chips 55L3 in the same system unit 410
, And carry the same data at all times. However, each copy of block 940 receives an incoming address line 921 from a different one of bus groups 55G1 via line 921.

【００４８】比較器９４１は、線９２２からのアドレス
と、４個のレジスタ９４２および９４４〜９４６の各々
との間の一致を検出する。Comparator 941 detects a match between the address from line 922 and each of the four registers 942 and 944-946.

【００４９】ドメイン・マスク・レジスタ（ＤＭＲ:dom
ain mask register）９４２は、コンピュータ４００内
の可能なシステム・ユニット毎に１ビットとして、１６
ビットを有する。所与のドメイン内の各システム・ユニ
ットにおける各ドメイン・レジスタ（の各コピー）に対
するビット位置は、同じドメイン内のシステム・ユニッ
トのその他の全レジスタ内に「１」ビットを含む。図４
の例を用いると、第１の４個のシステム・ユニット（４
１０−０から４１０−３）を第１ドメインとして定義
し、次の２個（４１０−４および４１０−５）が第２ド
メインを形成し、次の２個（４１０−６および−７）は
第３ドメインを構成し、可能な１６個のシステム・ユニ
ットの内８個のみがあると仮定する。すると、８個の実
装されたシステム・ユニット４１０のドメイン・マスク
・レジスタ９４２は、以下の値を含む。The domain mask register (DMR: dom
ain mask register) 942 contains 16 bits, one bit for each possible system unit in computer 400.
With bits. The bit position for each domain register (each copy) in each system unit in a given domain includes a "1" bit in all other registers of the system unit in the same domain. FIG.
Using the example of, the first four system units (4
10-0 to 410-3) are defined as the first domain, the next two (410-4 and 410-5) form the second domain, and the next two (410-6 and -7) Assume that the third domain is configured and that there are only 8 out of 16 possible system units. Then, the domain mask registers 942 of the eight mounted system units 410 contain the following values:

【００５０】[0050]

【表１】 [Table 1]

【００５１】ここでも、同じシステム・ユニット４１０
内のレジスタ９４２の４個のコピーは全て、同じ値を含
む。Again, the same system unit 410
All four copies of register 942 contain the same value.

【００５２】線９２２は、現トランザクションを発行し
た特定のシステム・ユニット４１０の番号を表す信号を
含む。受信ユニットのＤＭＲ９４２の対応するビットが
オンになっていない場合、比較器９４１は、NON−DOMAI
N線９４３上に禁止信号を生成し、入来スイッチ９２５
がトランザクションを線９１１を通じて図５のポート群
５５Ｌ１に渡すのを禁止する。また、比較器は、線９４
９およびＯＲゲート９０１を通じて、線９４８上にMEMO
RY CANCEL禁止信号を生成する。この信号は、現トラン
ザクションがドメイン外で発した場合に、メモリ・コン
トローラ５５Ｌ２に、線９２３上のアドレスを無視する
ように命令する。これは効果的にドメインを分離し、他
のドメインで発生したトランザクションには不感応にす
る。Line 922 includes a signal representing the number of the particular system unit 410 that issued the current transaction. If the corresponding bit of the DMR 942 of the receiving unit is not turned on, the comparator 941 sets the NON-DOMAI
An inhibit signal is generated on the N line 943 and the incoming switch 925
Prohibits the transaction from passing to the port group 55L1 of FIG. 5 through the line 911. The comparator also has a line 94
9 and OR gate 901, MEMO on line 948
Generate RY CANCEL inhibit signal. This signal instructs the memory controller 55L2 to ignore the address on line 923 if the current transaction originated outside the domain. This effectively separates the domains and makes them insensitive to transactions originating in other domains.

【００５３】これまで述べてきたように、異なるドメイ
ンにおけるシステム・ユニット群は、図５の５３３のよ
うな専用配線によって相互接続される直列通信線のよう
な外部Ｉ／Ｏ装置を通じてのみ、互いにデータを交換し
合うことができる。大幅に高速化した方法により異なる
ドメインを協同させれば、コンピュータ４００は多くの
用途に拡張されよう。このために、図４のドメイン・フ
ィルタ４８０は、多数のドメインを共に集合化し、１つ
のクラスタにすることも可能である。１つのクラスタ内
のドメイン群は、それらのメモリの一部または全部を互
いに共有し合うことができる。１つのドメイン内のプロ
セッサが、予め規定されている範囲のアドレス空間にデ
ータを書き込む場合、同じクラスタの他のドメイン内に
あるプロセッサはこのデータを読むことができる。即
ち、１つのクラスタ内の異なるドメインはある範囲の共
有メモリを有する。このメモリは、クラスタ内のいずれ
のシステム・ユニットにも物理的に位置することがで
き、図５のグローバル・アドレス・ルータ５５Ｇ０を通
じてアクセスし、当該クラスタ内のあらゆるドメイン内
の他のいずれのシステム・ユニットに対するデータの転
送も、グローバル・データ・ルータ５４Ｇ０を通じて行
うことができる。As described above, the system units in the different domains communicate with each other only through external I / O devices, such as serial communication lines interconnected by dedicated wiring, such as 533 in FIG. Can be exchanged. Cooperating different domains in a greatly accelerated manner would extend computer 400 to many uses. To this end, the domain filter 480 of FIG. 4 can also aggregate multiple domains together into a single cluster. Domains within a cluster can share some or all of their memory with each other. If a processor in one domain writes data to a pre-defined range of address space, a processor in another domain of the same cluster can read this data. That is, different domains within a cluster have a range of shared memory. This memory can be physically located in any system unit in the cluster, accessed through the global address router 55G0 of FIG. 5 and any other system unit in any domain in the cluster. The transfer of data to the unit can also be performed through the global data router 54G0.

【００５４】ローカル・ドメイン・フィルタ９４０の各
コピー内に位置する共有メモリ・マスク・レジスタ９４
４は、どのシステム・ユニットが、図１０におけるクラ
スタ・レジスタ１０２０によって定義されるクラスタ内
の共有メモリとして、他のユニットにエクスポートすべ
き物理的ＲＡＭ５２１を含むのかについて定義する。同
じシステム・ユニット内の各ＳＭＭＲ９４４の内容は同
一である。Shared memory mask register 94 located in each copy of local domain filter 940
4 defines which system units include the physical RAM 521 to be exported to other units as shared memory in the cluster defined by the cluster register 1020 in FIG. The contents of each SMMR 944 within the same system unit is identical.

【００５５】各ＳＭＭＲ９４４は、コンピュータ４００
内の可能なシステム・ユニット毎に１ビットとして、１
６ビットを有し、各システム・ユニット４１０は、コン
ピュータ４００内のグローバル・アドレス・バス５５Ｇ
１毎に１つのコピーとして、それ自体のＳＭＭＲのコピ
ーを４個有する。所与のクラスタ内のシステム・ユニッ
ト４１０−ｉにおけるＳＭＭＲに対するビット位置ｊ
は、ユニット４０１−ｉがシステム・ユニット４１０−
ｊからのいずれかのメモリ・トランザクションに応答す
べき場合、「１」値を含む。図４に示した例に戻り、第
２ドメインの２個のユニット４１０−４，４１０−５が
第３ドメインの２個のユニット４１０−６，４１０−７
と共にクラスタを形成し、ユニット４１０−４は共有メ
モリを、ユニット４１０−６，４１０−７から成るドメ
インにエクスポートすると仮定する。即ち、ユニット４
１０−４上に物理的に実装されたアドレス番号のメモリ
の内少なくともいくつかは、そのメモリが後者のユニッ
ト上に実装されているかのように、ユニット４１０−
６，４１０−７上のプロセッサによって、同じアドレス
番号の下で、読み出しおよび書き込みが可能である。
（この場合も、可能な１６個のシステム・ユニットの内
あるのは８個のみなので、ビット位置８〜Ｆに対する値
は重要ではない。）したがって、システム・ユニット４
１０のＳＭＭＲは、次の値を含む。Each SMMR 944 has a computer 400
1 bit for each possible system unit within
6 bits, each system unit 410 has a global address bus 55G
It has four copies of its own SMMR, one copy for each. Bit position j for SMMR in system unit 410-i in a given cluster
Means that the unit 401-i is the system unit 410-
Include a "1" value if any memory transaction from j should be responded to. Returning to the example shown in FIG. 4, two units 410-4 and 410-5 of the second domain are replaced with two units 410-6 and 410-7 of the third domain.
And form a cluster with unit 410-4, assuming that unit 410-4 exports the shared memory to a domain consisting of units 410-6 and 410-7. That is, unit 4
At least some of the memory of the address numbers physically implemented on the unit 10-4, as if the memory were implemented on the latter unit, the unit 410-
Reading and writing are possible under the same address number by the processor on 6,410-7.
(Again, the values for bit positions 8-F are not significant, as only 8 of the 16 possible system units are available.) Therefore, system unit 4
The ten SMMRs include the following values:

【００５６】[0056]

【表２】 [Table 2]

【００５７】全てのレジスタにおいてビット位置８〜Ｆ
は「０」である。何故なら、それに対応するシステム・
ユニットが存在しないからである。ユニット４１０−０
ないし４１０−３には「１」の値がない。何故なら、こ
れらは同じドメインにあり、当該ドメインの中のユニッ
ト群には、メモリを他のドメインにエクスポートするも
のはないからである。ユニット４１０−４ないし４１０
−７に対するビット７までの「１」値は、これらのユニ
ットが、ユニット４１０−４ないし４１０−７全てから
の通常のメモリ・トランザクションに応答し、共有メモ
リを実施しなければならないことを示す。メモリはこれ
らのユニットの１つ（例えば、４１０−４）の上に位置
するが、特定の位置は、ＳＭＭＲ９４４から推論するこ
とはできない。共有メモリ上ではキャッシュ・コヒーレ
ンシが必要であるので、この共有メモリを用いるユニッ
トは全て、共有メモリを用いる他の全ユニットからの、
このアドレス範囲内のトランザクション全てを見ていな
ければならない。Bit positions 8 to F in all registers
Is “0”. Because the corresponding system
This is because the unit does not exist. Unit 410-0
To 410-3 have no value of "1". This is because they are in the same domain, and none of the units in that domain export memory to other domains. Units 410-4 through 410
A "1" value up to bit 7 for -7 indicates that these units must respond to normal memory transactions from all of units 410-4 through 410-7 and implement shared memory. The memory is located on one of these units (e.g., 410-4), but the specific location cannot be inferred from SMMR 944. Since cache coherency is required on shared memory, all units that use this shared memory will receive
You must see all transactions within this address range.

【００５８】システム・ユニットのメモリの全てが共有
されているか、あるいは全く共有されていないかを示す
には、レジスタ９４４のみで十分であろう。しかしなが
ら、殆ど全ての場合、クラスタのドメイン群間では、メ
モリの指定された部分のみを共有することが望ましい。
レジスタ９４５，９４６は、特定のクラスタにおいて共
有すべきアドレス範囲の境界を指定する。クラスタ内の
共有メモリに対するアクセスを有する、各システム・ユ
ニット上の各共有メモリ基底レジスタ（ＳＭＢＲ：shar
ed-memory base register)９４５は、共有すべきコンピ
ュータ４００の全アドレス空間内の最下位アドレスを含
む。図４の例では、ユニット４１０−４は、アドレス'0
4 0000 0000'ないし'04 FFFF FFFF'のメモリを物理的に
収容するが、エクスポートするのは上位１ＧＢ、即ち、
アドレス'04 C0000 0000'から'04FFFF FFFF'までのメモ
リだけである。４１ビットのアドレスの内、実際にレジ
スタ９４５に格納されるのは上位２５ビットのみである
ので、共有メモリの細分性(granularity)は６４Ｋバイ
トとなる。したがって、ユニット４１０−４ないし４１
０−７のＳＭＢＲは値'004 C000'を含む。基底アドレス
値を全く保持しないＳＭＢＲを指定するには、種々の方
法が存在する。この例では、かかるレジスタは値'000 0
000'を保持する。（これらのアドレス上の上位側に追加
した'0'はアドレス空間ビットであり、'0'はメモリ・ア
ドレス、'1'はレジスタ９４０のアドレス自体のような
システム・アドレスを示す）。Register 944 alone will be sufficient to indicate whether all of the memory of the system unit is shared or not at all. However, in almost all cases, it is desirable to share only a specified portion of the memory between the domain groups of the cluster.
Registers 945 and 946 specify the boundaries of the address range to be shared in a particular cluster. Each shared memory base register (SMBR: shar) on each system unit that has access to the shared memory in the cluster
The ed-memory base register 945 includes the lowest address in the entire address space of the computer 400 to be shared. In the example of FIG. 4, the unit 410-4 has the address' 0
4 0000 0000 'to '04 FFFF FFFF' memory is physically accommodated, but exported is the upper 1 GB, that is,
Only the memory from address '04 C0000 0000 'to' 04FFFF FFFF '. Since only the upper 25 bits of the 41-bit address are actually stored in the register 945, the granularity of the shared memory is 64 Kbytes. Therefore, the units 410-4 through 41
An SMBR of 0-7 contains the value '004 C000'. There are various ways to specify an SMBR that does not hold any base address value. In this example, such a register would have the value '000 0
Hold 000 '. (The '0' added to the upper side of these addresses is an address space bit, '0' indicates a memory address, and '1' indicates a system address such as the address of the register 940 itself).

【００５９】同様に、同じクラスタ内の各共有メモリ制
限レジスタ（ＳＭＬＲ：shared-memory limit registe
r）９４６は、共有アドレス範囲の最上位アドレスの上
位２５ビットを含む。この例では、システム・ユニット
４１０−４ないし４１０−７のＳＭＬＲは値'004 FFFF'
を保持し、最上位共有アドレスは、当該ユニット上の物
理メモリの最上位アドレス、即ち、'004 FFFF FFFF'と
同じであることを指定する。他の全ユニットのＳＭＬＲ
は指定無効値'000 0000'を保持する。Similarly, each shared memory limit register (SMLR: shared-memory limit register) in the same cluster
r) 946 includes the upper 25 bits of the highest address of the shared address range. In this example, the SMLR of system units 410-4 through 410-7 has the value '004 FFFF'
And that the highest shared address is the same as the highest address of the physical memory on the unit, that is, '004 FFFF FFFF'. SMLR of all other units
Holds the specified invalid value '000 0000'.

【００６０】[0060]

【表３】 [Table 3]

【００６１】レジスタ制御部９４７は、制御線１１４３
に、異なる値をレジスタ９４２，９４４，９４５，９４
６にロードさせる。これは、ドメイン群およびクラスタ
群におけるシステム・ユニット群４１０の動的再構成、
および各クラスタの共有メモリの位置の動的再構成を可
能にする。図１１および図１２は、この機能をどのよう
に行うかを説明するためのものである。各レジスタ組９
４０内に基底および境界レジスタ９４５，９４６の追加
コピーを配置すると、必要であれば、単一のドメイン・
クラスタ内において多数の範囲の共有アドレスが可能に
なる。あるいは、レジスタに基底アドレスおよび共有セ
グメント・サイズ、またはその他のパラメータを格納し
てもよい。The register control section 947 controls the control line 1143
And different values in registers 942, 944, 945, 94
Load 6 This includes dynamic reconfiguration of system units 410 in domains and clusters,
And allows dynamic reconfiguration of the shared memory location of each cluster. FIG. 11 and FIG. 12 are for explaining how to perform this function. Each register set 9
Placing additional copies of the base and boundary registers 945, 946 within 40 allows the single domain
Multiple ranges of shared addresses are possible within a cluster. Alternatively, the base address and shared segment size, or other parameters, may be stored in registers.

【００６２】NON-DOMAIN線９４３を用いて、ドメイン以
外からの他のトランザクションを禁止するのと丁度同じ
ように、非共有メモリからのトランザクションを禁止即
ち阻止することも可能である。この構成は非メモリ・ト
ランザクションの迅速な制御を可能にするが、メモリの
濾過(filtering)が比較器９４１において必要な時間が
長くなる。メモリ・サブシステム５２０におけるレイテ
ンシは、図５の他のサブシステムにおけるレイテンシよ
りも重大であるので、比較器９４１は、現トランザクシ
ョンのタイプを示す、線９２２からの従来の信号も受け
取ることが好ましい。線９２３が非メモリ・トランザク
ションを指定する場合、線９４３は、前述のように線９
１１を禁止するが、通常のメモリ・トランザクション
は、この時点では濾過せず、メモリ・サブシステム５２
０に進み、ここでその実行のための準備が開始される。
しかしながら、比較器９４１は、このユニットのドメイ
ン（ＤＭＲ９４２によって定義される）外のシステム・
ユニットから発するいずれの通常のメモリ・トランザク
ションに対しても、MEMORY CANCEL線９４８を活性化す
る。このメモリ・トランザクションに対して、レジスタ
９４５，９４６は、他のドメインと共有するメモリの範
囲外にあることを示す。即ち、このメモリ・トランザク
ションはＳＭＭＲ９４４に示されないシステム・ユニッ
トから発したメモリ・トランザクションである。次に、
この線９４８は、図７のスイッチ７２０においてトラン
ザクションを直接阻止し、その処理が部分的に既に開始
されていても、トランザクションが図５のバンク群５２
１のいずれに格納されているデータにも実際に影響を及
ぼすのを防止する。It is also possible to use the NON-DOMAIN line 943 to prohibit or prevent transactions from non-shared memory, just as prohibit other transactions from outside the domain. This configuration allows for quick control of non-memory transactions, but increases the time required for memory filtering in comparator 941. Since the latency in the memory subsystem 520 is more critical than in the other subsystems of FIG. 5, the comparator 941 also preferably receives a conventional signal from line 922 indicating the type of the current transaction. If line 923 specifies a non-memory transaction, line 943 will be connected to line 9 as described above.
11, but normal memory transactions are not filtered at this point and the memory subsystem 52
0, where preparation for its execution is started.
However, the comparator 941 does not allow the system outside the domain of this unit (defined by DMR 942).
The MEMORY CANCEL line 948 is activated for any normal memory transaction originating from the unit. For this memory transaction, registers 945 and 946 indicate that it is out of range of memory shared with other domains. That is, this memory transaction is a memory transaction originating from a system unit not shown in SMMR 944. next,
This line 948 directly blocks the transaction at switch 720 of FIG.
1 is prevented from actually affecting data stored in any one of them.

【００６３】こうして、コンピュータ４００は、ドメイ
ン群およびクラスタ群間の「ソフトウエア分離」を達成
した。異なるドメインは、例えば、全く異なるオペレー
ティング・システムを実行させてもよく、互いに干渉し
合うことはない。更に、エラーが他のドメイン・クラス
タ内のシステム・ユニットの動作のみに影響をあたえる
場合に、図４の制御バス４６０からのハードウエア・エ
ラー信号がシステム全体を破壊し得ないように、コンピ
ュータ内に「ハードウエア分離」を設けることが残って
いる。例えば、システム・ユニット４１０−０内のＥＣ
Ｃブロック９２４によって検出されたエラーは、４１０
−５のようなシステム・ユニットに影響を与えない。何
故なら、これらのハードウエア・ユニットはむしろ互い
に独立して動作しており、一方のユニットにおけるハー
ドウエア障害が他方で実行中の処理には全く影響を与え
ないからである。Thus, the computer 400 achieves “software separation” between the domain group and the cluster group. Different domains may, for example, run entirely different operating systems and do not interfere with each other. Further, if the error only affects the operation of system units in other domain clusters, the hardware error signal from control bus 460 of FIG. To provide "hardware separation". For example, the EC in the system unit 410-0
The error detected by C block 924 is 410
Does not affect system units like -5. This is because these hardware units operate rather independently of one another, and hardware failures in one unit have no effect on the processing being performed on the other.

【００６４】図１０は、図４のドメイン・フィルタ４８
０の４個の同一グローバル部分の１つを含む、図５の４
個のグローバル・アドレス・アービタ５５Ｇ２の１つを
詳細に示す。図１０におけるアービタ５５Ｇ２は、４系
統のグローバル・アドレス・バス（ＧＡＢ）５５Ｇ１の
内第１バス５５Ｇ１−０を制御すると仮定する。このア
ービタは、コンピュータ４００内の各システム・ユニッ
ト４１０上に配置された、図８のローカル・アドレス・
アービタ（ＬＡＡ）８２０からの４本のＧＡＢ−要求線
８２１の１本を受け取る。ＬＡＡ５５Ｌ４が、そのシス
テム・ユニット上のどのポートが、４系統のグローバル
・バスの各々に対する次のアクセスを得るに値するかを
決定したときはいつでも、その線８２１は、アービタ・
ロジック１０１０によって制御されるＧＡＢを通じて、
トランザクションを同報通信する要求をアサートする。
コンピュータ４００は４個のＧＡＢ５５Ｇ１を有するの
で、各ローカル・アービタ５５Ｌ４から４個のグローバ
ル・アービタ５５Ｇ２までに、４本の別個の線８２１が
通っている。FIG. 10 shows the domain filter 48 of FIG.
5, one of the four identical global parts of 0
One global address arbiter 55G2 is shown in detail. It is assumed that the arbiter 55G2 in FIG. 10 controls the first bus 55G1-0 of the four global address buses (GAB) 55G1. This arbiter is located on each system unit 410 in the computer 400 and is located on the local address
It receives one of the four GAB-request lines 821 from the arbiter (LAA) 820. Whenever the LAA55L4 determines which port on the system unit deserves the next access to each of the four global buses, its line 821 is asserted by the arbiter
Through GAB controlled by logic 1010,
Assert a request to broadcast a transaction.
Since computer 400 has four GABs 55G1, there are four separate lines 821 running from each local arbiter 55L4 to four global arbiters 55G2.

【００６５】調停ロジック１０１０は、多数の従来から
のアルゴリズムのいずれかを用いて、１６本の許可線１
０１３の内１本を立ち上げることによって、そのＧＡＢ
５５Ｇ１（図５）の転送サイクルを、１６個のシステム
・ユニット４１０の１つのＬＡＡ５５Ｌ４に割り当て
る。従来のシステムにおけると同様、許可信号は、図８
の線８２２を通じて、システム・ユニットのＬＡＡ５５
Ｌ４の各々に直接戻ってくる。この場合フィルタ・ロジ
ック１０２２を無視し、選択されたＬＡＳ５５Ｌ３から
発したアドレス・トランザクションは、そのＧＡＢ５５
Ｇ１を通じて、１６個の全システム・ユニット上の対応
するＬＡＳに伝搬する。グローバル・アドレス・ルータ
４５０の次の転送動作において、グローバル・アドレス
・アービタ５５Ｇ２は、アドレスをその対応するＧＡＢ
５５Ｇ１に通過させるために、ローカル・アドレス・ス
イッチ５５Ｌ３に信号を送るように、選択されたＬＡＡ
５５Ｌ４に命令する。成功したトランザクションのGRAN
T線１０１３は、全てのシステム・ユニットに、それら
のどれが当該ＧＡＢ５５Ｇ１上のトランザクションを発
したかを示す。受信側システム・ユニットは、トランザ
クションを受け取ったときに、トランザクション自体に
ある情報から、ソース・ユニットを識別する。ローカル
・データ・ルータ５４Ｌ２は、グローバル・データ経路
群５４Ｇ１の内どれが、成功したトランザクションによ
って要求されるあらゆるデータを搬送するかについて、
図５のデータ・アービタ５４Ｌ３，５４Ｇ２と交渉を行
う。The arbitration logic 1010 employs any of a number of conventional algorithms to control the 16 grant lines 1
By launching one of the 013, the GAB
The transfer cycle of 55G1 (FIG. 5) is assigned to one LAA 55L4 of the 16 system units 410. As in the conventional system, the permission signal
Through the line 822 of the system unit LAA 55
Return directly to each of the L4s. In this case, the filter logic 1022 is ignored and address transactions originating from the selected LAS 55L3 are
Propagate through G1 to the corresponding LAS on all 16 system units. In the next transfer operation of the global address router 450, the global address arbiter 55G2 stores the address in its corresponding GAB.
Selected LAA to signal local address switch 55L3 to pass through
Command 55L4. GRAN for successful transactions
T-line 1013 indicates to all system units which of them has issued a transaction on that GAB 55G1. When the receiving system unit receives the transaction, it identifies the source unit from information in the transaction itself. The local data router 54L2 determines which of the global data paths 54G1 will carry any data required by a successful transaction.
Negotiation is performed with the data arbiters 54L3 and 54G2 in FIG.

【００６６】本発明によるマルチ・ドメイン・コンピュ
ータでは、ドメイン・フィルタ４８０のグローバル部分
は、物理的に、各グローバル・アドレス・アービタ５５
Ｇ２を伴う。１６個の可能なシステム・ユニット４１０
の各々に１つある、クラスタ・レジスタ１０２１のバン
ク１０２０が、１６本の許可信号線１０１３を受ける。
個々のクラスタ・レジスタ１０２１−ｉは各々、１６個
のシステム・ユニット４１０−ｊ毎に、１つのビット位
置１０２１−ｉ−ｊを有する。例えば、第１レジスタ１
０２１−０の「ユニット−３」の位置における'1'の値
は、システム・ユニット４１０−３が、システム・ユニ
ット４１０−０と同じクラスタ内にあることを示す。以
下の表は、上述の例示構成に対する、レジスタ１０２１
の内容を示す。In a multi-domain computer according to the present invention, the global portion of the domain filter 480 is physically located at each global address arbiter 55
With G2. 16 possible system units 410
Each of the banks 1020 of the cluster register 1021 receives one of the 16 enable signal lines 1013.
Each individual cluster register 1021-i has one bit position 1021-ij for every 16 system units 410-j. For example, the first register 1
A value of '1' at the position of “unit-3” of 021-0 indicates that the system unit 410-3 is in the same cluster as the system unit 410-0. The following table shows the registers 1021 for the example configuration described above.
Indicates the contents of

【００６７】[0067]

【表４】 [Table 4]

【００６８】１６個の可能なシステム・ユニット全てに
レジスタが常に装備されている。レジスタ１０２１−８
ないし１０２１−Ｆ内の値は、図４では実装されていな
いシステム・ユニットに対応し、重要ではない。しかし
ながら、「１」を全ての診断ビット位置（即ち、レジス
タｉの位置ｉ）に割り当て、「０」をそれ以外の位置に
割り当てることにより、システム内に既にある他のいず
れのユニットとも干渉せずに、システム・ユニットをコ
ンピュータ４００の動作中に差し込むことができると共
に、単体の診断を直ちに実行することができる。All 16 possible system units are always equipped with registers. Register 1021-8
The values in 1021 -F correspond to system units not implemented in FIG. 4 and are not significant. However, by assigning a “1” to all diagnostic bit positions (ie, position i of register i) and assigning a “0” to other positions, it does not interfere with any other units already in the system. In addition, the system unit can be plugged in during the operation of the computer 400, and a single diagnosis can be immediately performed.

【００６９】フィルタ・ロジック１０２２は、レジスタ
１０２１内のクラスタ定義に応じて、許可線１０１３を
線１０２３に結合する。各線１０２３は、それに対応す
るシステム・ユニット４１０に、「グローバル・アドレ
ス有効」（ＶＡＬＩＤ）信号８２２として到達する。図
３の３００のような通常のシステムでは、VALID信号
は、単に、バス上の現トランザクションが有効であり、
全てのシステム・ユニットに同報通信されていることを
示すタイミング信号に過ぎない。一方、本システム４０
０では、異なるクラスタ内にある多数のシステム・ユニ
ットが同じアドレスを有することができ、発生元と同じ
クラスタ内にある受信側はそのトランザクションを受け
なければならず、他のクラスタ内にあるシステム・ユニ
ットは、いずれのトランザクションが当該システム・ユ
ニットに対応するアドレスを保持していても、それが発
生していることを完全に無視し続けなければならない。Filter logic 1022 couples enable line 1013 to line 1023 according to the cluster definition in register 1021. Each line 1023 arrives at its corresponding system unit 410 as a “global address valid” (VALID) signal 822. In a typical system, such as 300 in FIG. 3, the VALID signal simply indicates that the current transaction on the bus is valid,
It is merely a timing signal indicating that it is being broadcast to all system units. On the other hand, the system 40
0, multiple system units in different clusters can have the same address, and receivers in the same cluster as the origin must receive the transaction, and system units in other clusters The unit must continue to completely ignore what is happening, no matter which transaction holds the address corresponding to the system unit.

【００７０】通常の単一ドメイン・コンピュータでは、
図８のいずれのＬＡＡ５５Ｌ４からのHOLD信号８２５
も、コンピュータ全体の他のシステム・ユニット４１０
毎のＬＡＡ５５Ｌ４内のそのＧＡＢに向かう線８２６に
伝搬されるだけである。しかしながら、コンピュータ４
００では、各ＧＡＡチップ５５Ｇ２上にある他のフィル
タ・ロジックの組１０２６によって、クラスタ・レジス
タ１０２０によって定義されるように、HOLD信号８２５
は同じハードウエア群内の他のシステム・ユニットに属
する線８２６のみに到達することができる。ARBSTOP信
号８２４も同様に動作する。１つのシステム・ユニット
によってアサートされたSTOPは、他の全ＬＡＡに向かう
入来ARBSTOP線８２６に単に接続されるのではなく、レ
ジスタ１０２０によって指定された他のユニット群のみ
に到達する。このドメイン・フィルタ４８０のグローバ
ル部分は、他の制御信号に対するフィルタ・ロジックの
各組も同様に含んでいる。例えば、いずれかのシステム
・ユニットの図９のキャッシュ・コントローラ９３０に
よってアサートされたCANCEL信号９３１は、フィルタ・
ロジック１０２８が許す場合にのみ、入来CANCEL線９２
３を通じてトランザクションをキャンセルすることがで
きる。１０２２および１０２６〜１０２８のようなフィ
ルタ・ロジックは全て、線１０２５によってクラスタ・
レジスタ群１０２１に並列に接続されている。In a typical single domain computer,
HOLD signal 825 from any LAA 55L4 in FIG.
Also other system units 410 throughout the computer
It is only propagated on line 826 towards the GAB in each LAA 55L4. However, computer 4
At 00, the HOLD signal 825, as defined by the cluster register 1020, by another set of filter logic 1026 on each GAA chip 55G2.
Can only reach line 826 belonging to another system unit in the same hardware group. The ARBSTOP signal 824 operates similarly. A STOP asserted by one system unit only reaches the other group of units specified by register 1020, rather than simply being connected to the incoming ARBSTOP line 826 going to all other LAAs. The global portion of the domain filter 480 includes each set of filter logic for other control signals as well. For example, the CANCEL signal 931 asserted by the cache controller 930 of FIG.
Incoming CANCEL line 92 only if logic 1028 allows.
3, the transaction can be canceled. Filter logic such as 1022 and 1026-1028 are all clustered by line 1025.
They are connected in parallel to the register group 1021.

【００７１】制御ユニット１０２４は、クラスタ定義を
動的に再構成するために、線１１４４がレジスタ群１０
２１に異なる値をロードすることを許可する。実施態様
の選択として、各グローバル・アービタ５５Ｇ２は同一
の集積回路を占有し、その各々は、同じクラスタ・レジ
スタおよびフィルタ・ロジックの組を重複して含んでい
る。クラスタ・レジスタの組には全て、同じ格納値集合
がロードされる。The control unit 1024 sets the line 1144 to the register group 10 to dynamically reconfigure the cluster definition.
Allows 21 to be loaded with different values. As an option in the embodiment, each global arbiter 55G2 occupies the same integrated circuit, each of which contains the same set of cluster registers and filter logic in duplicate. All sets of cluster registers are loaded with the same set of stored values.

【００７２】図１４は、１０２２または１０２６〜１０
２８のような１組のドメイン・フィルタ・ロジックを実
施する回路１４００の詳細を示す。図１４はロジック１
０２２をパラダイム(paradigm)として用い、その場合の
回路１４００に対する信号指定を示す。図示を容易にす
るために、図１４は、ビット線１０２５のみでなく、ク
ラスタ・レジスタ１０２１自体も示している。FIG.
28 shows details of a circuit 1400 that implements a set of domain filter logic, such as 28. FIG. 14 shows logic 1.
022 is used as a paradigm, and a signal designation to the circuit 1400 in that case is shown. For ease of illustration, FIG. 14 shows not only the bit line 1025, but also the cluster register 1021 itself.

【００７３】線１０２３−０はそのハードウエア・ドメ
イン・クラスタ内のいずれかのシステム・ユニットがト
ランザクションを開始したときはいつでも、システム・
ユニット４１０−０へのVALID信号をアサートする。ユ
ニット４１０−０に関連するGRANT信号１０１３−０
は、レジスタ１０２１−０のビット１０２１−０−０が
「１」値を含み、当該ユニット４１０−０がそれ自体の
クラスタの中にあることを示す場合、ＡＮＤゲート１４
０１−００を満足する。次に、論理ＯＲゲート１４０２
−０が出力１０２３−１をアサートし、それをシステム
・ユニット４１０−０に戻す。システム・ユニット４１
０−１からのGRANT線１０１３−１がアサートされる
と、これら２個のユニットが同じクラスタ内にある場
合、ユニット４１０−０に向かう線１０２３も立ち上げ
る。同じクラスタ内にある場合、１０１３−１が立ち上
がったときに、レジスタ１０２１−０のビット１（簡略
化のためビット１０２１−０−１と呼ぶ）の値「１」が
ＡＮＤゲート１４０１−０１およびＯＲ１４０２−０を
満足する。このバンク内の残りの１４個のＡＮＤゲート
も、１０２１−０−Ｆを通じて、レジスタ・ビット群１
０２１−０−２に対して同様に動作する。Line 1023-0 indicates that a system unit has started a transaction whenever any system unit in the hardware domain cluster has begun a transaction.
Assert the VALID signal to unit 410-0. GRANT signal 1013-0 associated with unit 410-0
Indicates that if bit 1021-0-0 of register 1021-0 contains a "1" value, indicating that unit 410-0 is in its own cluster, AND gate 14
01-00 is satisfied. Next, a logical OR gate 1402
-0 asserts output 1023-1 and returns it to system unit 410-0. System unit 41
When GRANT line 1013-1 from 0-1 is asserted, if these two units are in the same cluster, line 1023 going to unit 410-0 also rises. If they are in the same cluster, when 1013-1 rises, the value “1” of bit 1 of register 1021-0 (referred to as bit 1021-0-1 for simplicity) is AND gate 1401-01 and OR 1402 −0 is satisfied. The remaining 14 AND gates in this bank are also connected to register bit group 1 through 1021-0-F.
The same operation is performed with respect to 021-0-2.

【００７４】ゲート１４０１−１０ないし１４０１−１
Ｆおよび１４０２−１は、同様に機能し、同じドメイン
・クラスタ内のシステム・ユニットがトランザクション
を申し出た場合にはいつでも、VALID信号１０２３−１
をシステム・ユニット４１０−１に対して生成する。１
４個の追加のゲート・バンクが、１０２３−Ｆを通じ
て、残りの線を処理する。通常、レジスタ群１０２１の
内容は対角行列(diagonal matrix)を形成するので、ビ
ット１０２１−ｉ−ｊは常にビット１０２１−ｊ−ｉと
同じ値を有する。また、各ユニットは通常それ自体のク
ラスタのメンバであるので、全ての主対角ビット１０２
１−ｉ−ｉは常に「１」である。Gates 1401-10 to 1401-1
F and 1402-1 function similarly, and whenever a system unit in the same domain cluster offers a transaction, the VALID signal 1023-1
Is generated for the system unit 410-1. 1
Four additional gate banks process the remaining lines through 1023-F. Normally, the contents of registers 1021 form a diagonal matrix, so that bits 1021-ij always have the same value as bits 1021-ji. Also, since each unit is usually a member of its own cluster, all the main diagonal bits 102
1-ii is always "1".

【００７５】図１１は、図４のコンフィギュレータ４２
０がコンピュータ４００内においてドメイン群およびク
ラスタ群を動的に設定する態様を示す。FIG. 11 shows the configurator 42 of FIG.
0 indicates a mode in which a domain group and a cluster group are dynamically set in the computer 400.

【００７６】図４の通常の制御およびサービス・ユニッ
ト４７０は、標準的なワークステーション１１２０と通
信する、既に使用可能な専用サービス・プロセッサ１１
１０の形態を取り、遠隔コンソールとして機能する。こ
れら２個のユニットは、ケーブルまたはその他のリンク
１１２２によって結合されている、標準的なアダプタ１
１１１，１１２１を通じて互いに通信する。コンソール
１１２０は、アダプタ１１２３によって、それ自体の入
出力装置（図示せず）に接続されてもよい。サービス・
プロセッサのＩ／Ｏアダプタ群１１１２は、コンピュー
タ４００内の多数の機能を感知し制御する。例えば、線
１１１３は、コンピュータ全体のために、給電および冷
却サブシステム１１３０にインターフェースする。線１
１１４は、図４の制御分配手段４４０内の多数の線に接
続する。The normal control and service unit 470 of FIG. 4 is a dedicated service processor 11
It takes the form of 10 and functions as a remote console. These two units are connected by a cable or other link 1122, a standard adapter 1
They communicate with each other through 111 and 1121. Console 1120 may be connected to its own input / output device (not shown) by adapter 1123. service·
Processor I / O adapters 1112 sense and control a number of functions within computer 400. For example, line 1113 interfaces to the power and cooling subsystem 1130 for the entire computer. Line 1
114 connects to a number of lines in the control distribution means 440 of FIG.

【００７７】コンピュータ４００内に実施されている通
常の機能の１つとして、１組の線１１１５を通じて、１
１１６で示すような種々の要素に、記憶されている検査
パターンを送ることにより、その論理回路に対して検査
を行うことができる機能がある。これらの線の従来の機
能は、K.P. Parker, THE BOUNDARY-SCAN HANDBOOK (Klu
wer Academic Publishers, 1992)のような引用文献に記
載されている、境界スキャン検査(boundary-scan test)
を実施するためにある。当業者は、通常このプロトコル
を「ＪＴＡＧ規格」と呼んでいる。One of the usual functions implemented in the computer 400 is through a set of lines 1115,
By sending a stored test pattern to various elements as indicated by reference numeral 116, there is a function capable of performing a test on the logic circuit. The traditional function of these lines is the KP Parker, THE BOUNDARY-SCAN HANDBOOK (Klu
wer Academic Publishers, 1992), a boundary-scan test
There is to implement. Those skilled in the art usually refer to this protocol as the "JTAG standard".

【００７８】コンフィギュレータ４２０は、追加機能に
対して、既存のJTAG線１１１５を選定する(coopt)。通
常、これらの線は、コンピュータ４００全体にわたる多
くのチップに従来からのアドレス線およびデータ線を提
供し、これらのチップの機能を検査することを目的とし
ている。ＬＡＳ５５Ｌ３およびＧＡＡ５５Ｇ２のための
チップの中では、図９の制御ロジック９４７、および図
１０の制御ロジック１０２４は、JTAG線１１４３，１１
４４上のある所定の信号の組み合わせを検出する。これ
らの線は、ドメインおよびクラスタ仕様を線１１４３に
搬送し、図５および図９に示すように、選択されたシス
テム・ユニット４１０のローカル・アドレス・ルータ群
５５Ｌ３内のフィルタ・レジスタ群９４０の内容をロー
ドする。また、線１１４２は、クラスタ仕様を線１１４
４に搬送し、図５および図１０に示すように、グローバ
ル・アドレス・アービタ群５５Ｇ０と関連するフィルタ
・レジスタ１０２０にロードする。JTAGまたは他のかか
る線を既に適所に有していないシステム群４００は、サ
ービス・プロセッサ１１１０からの専用線を用いて、容
易に制御線１１４３，１１４４として使用したり、構成
機能を実行するために他の線に切り替えることができ
る。これらの線は、単に、偶然この特定の実施態様にお
いて容易に使用可能であるに過ぎない。他の代替案とし
て、レジスタ９４０，１０２０をシステム・メモリ空間
内の小さなメモリ・ブロックとして扱うというものがあ
る。先に注記したように、コンピュータ４００は、'10
0000 0000'から'1F FFFF FFFF'までのかかる空間を、そ
の全範囲として有している。The configurator 420 selects (coopts) an existing JTAG line 1115 for the additional function. Typically, these lines provide conventional address and data lines to many chips throughout the computer 400 and are intended to test the function of these chips. In the chip for the LAS55L3 and GAA55G2, the control logic 947 of FIG. 9 and the control logic 1024 of FIG.
A predetermined signal combination on 44 is detected. These lines carry the domain and cluster specifications on line 1143 and, as shown in FIGS. 5 and 9, the contents of filter registers 940 in local address routers 55L3 of selected system unit 410. To load. Also, the line 1142 indicates the cluster specification by the line 114.
4 and load the filter register 1020 associated with the global address arbiter group 55G0 as shown in FIGS. Systems 400 that do not already have JTAG or other such lines in place may use dedicated lines from service processor 1110 to easily use as control lines 1143, 1144 or perform configuration functions. You can switch to another line. These lines simply happen to be readily usable in this particular embodiment. Another alternative is to treat the registers 940, 1020 as small blocks of memory in the system memory space. As noted above, the computer 400 has a '10
Such a space from 0000 0000 'to' 1F FFFF FFFF 'is included in the entire range.

【００７９】サービス・プロセッサ４７０の形態にはな
んらクリティカルな点はない。場合によっては、この機
能のために、物理的に別個の実体を有することなく、通
常のシステム自体の一部を使用することですら可能な場
合もある。実際、好適なコンピュータ４００は、必要で
あれば、システム・ユニット自体が、ドメイン・コンフ
ィギュレータの機能のいくつかを提供することを可能に
している。システム・ユニット４１０内で実行されるオ
ペレーティング・システム内の特権ソフトウエア(privi
leged software)も、現行の共有メモリ・ブロックを指
定し直すために、図９の共有メモリ・レジスタ９４５，
９４６に書き込むことができる。また、サービス・プロ
セッサも、システム・ユニット群がＩ／Ｏコントローラ
１１１２内のステータス・ワードの構成ビットをセット
することによって、選択的にレジスタ９４４に書き込む
ことを可能にし、これは制御線１１１４の１本に現れ
る。There is no critical point in the form of service processor 470. In some cases, it may even be possible to use part of the normal system itself for this function without having a physically separate entity. In fact, the preferred computer 400 allows the system unit itself to provide some of the functions of the domain configurator, if necessary. Privileged software in the operating system (privi
leged software), to re-designate the current shared memory block, the shared memory register 945, FIG.
946. The service processor also allows the system units to selectively write to the register 944 by setting a configuration bit in the status word in the I / O controller 1112, which can control one of the control lines 1114. Appears in books.

【００８０】図１２は、コンピュータ４００内のドメイ
ン群およびクラスタ群を動的に構成する方法１２００を
記載したものである。図１２において右向きのブロック
は遠隔コンソール１１２０内で実行され、左向きのブロ
ックは、先に述べた実施例の図１１のサービス・プロセ
ッサ１１１０において実行される。ブロック群１２１０
は、構成プロセスを設定する。ブロック１２１１はオペ
レータの命令に応答して、構成モードを開始する。ブロ
ック１２１２はレジスタ９４０，１０２０をデフォルト
値に初期化する。好ましくは、全てのレジスタ９４２
は、それらをそれ自体のドメインに置く位置に「１」ビ
ット、その他の位置に「０」を受ける。全てのレジスタ
９４４〜９４６は、共有メモリがエクスポートされない
ことを示す値を受ける。レジスタ群１０２０は、好まし
くは、対角ストライプ(diagonal stripe)の１ビットを
除いて「０」を含む。これは、各システム・ユニットが
それ自体クラスタ内にあることを示す。FIG. 12 describes a method 1200 for dynamically configuring domains and clusters in a computer 400. In FIG. 12, the right-pointing blocks are executed in the remote console 1120, and the left-pointing blocks are executed in the service processor 1110 of FIG. 11 of the previously described embodiment. Block group 1210
Sets up the configuration process. Block 1211 initiates configuration mode in response to an operator command. Block 1212 initializes registers 940 and 1020 to default values. Preferably, all registers 942
Receives a "1" bit at the location where they are placed in its own domain and a "0" at other locations. All registers 944-946 receive a value indicating that the shared memory is not exported. Register group 1020 preferably includes "0" except for one bit of a diagonal stripe. This indicates that each system unit is itself in a cluster.

【００８１】ブロック群１２２０は、図４のドメイン・
フィルタ４８０の構成を設定する。ブロック１２２１に
おいて、遠隔コンソールにおけるオペレータが構成すべ
き特定のドメインを選択し、当該ドメインに属するシス
テム・ユニット４１０の番号をブロック１２２２におい
て入力する。次に、サービス・プロセッサ１１１０が線
１１１５，１１４２に信号を送出し、ブロック１２２３
において、図９のドメイン・マスク・レジスタ９４２に
適正な値をロードする。ブロック１２２４は、各ドメイ
ンをそれ自体のクラスタにするために適切なレジスタ群
１０２０を設定する。このステップはいずれの時点で実
行してもよいが、本実施例では、ドメイン群がクラスタ
に組み込まれる場合にも、クラスタ・レジスタを設定す
ることが必要である。ブロック１２２５は、オペレータ
が構成すべき追加のドメインを指定し終わったなら、制
御をブロック１２２１に戻す。その他の場合、ブロック
１２２６は、設定すべきマルチ・ドメイン・クラスタが
未だあるか否かについて問い合わせを行う。The block group 1220 corresponds to the domain in FIG.
The configuration of the filter 480 is set. At block 1221, the operator at the remote console selects a particular domain to configure and enters the number of the system unit 410 belonging to that domain at block 1222. Next, service processor 1110 sends a signal on lines 1115 and 1142 and block 1223
, The appropriate value is loaded into the domain mask register 942 of FIG. Block 1224 sets the appropriate registers 1020 to make each domain its own cluster. This step may be performed at any time, but in this embodiment, it is necessary to set the cluster register even when the domain group is incorporated into the cluster. Block 1225 returns control to block 1221 once the operator has specified additional domains to configure. Otherwise, block 1226 queries whether there are any more multi-domain clusters to configure.

【００８２】未だある場合、ブロック１２３０はいずれ
か所望の共有メモリを設定する。ブロック１２３１にお
いて、オペレータは、メモリをエクスポートするシステ
ム・ユニット４１０の１つを選択し、ブロック１２３２
において、当該メモリをインポートするドメインを選択
する。（システム・ユニットは、その物理的に実施され
ているメモリが他のシステム・ユニットに使用可能な場
合、メモリを「エクスポート」し、他のシステム・ユニ
ットは、それがインポート側ユニット上に位置している
かのように、そのメモリを「インポート」する。）ブロ
ック１２３３において、図９に関連して説明したよう
に、適切なレジスタ群９４４にロードする。ブロック１
２３４では、図１０に関連して説明したように、レジス
タ群１０２０内の適切なビットをセットする。ブロック
１２３５において、共有メモリ範囲の基底アドレスの値
をオペレータから受け取る。ブロック１２３６では、こ
れを適正なＳＭＢレジスタ群９４５に入力する。ブロッ
ク１２３７では、対応する制限アドレス値を受け取り、
ブロック１２３８においてこれをＳＭＬＲ群９４６にロ
ードする。オペレータが追加のクラスタを定義したい場
合、ブロック１２２６は制御をブロック１２３１に戻
す。その他の場合、手順１２００は終了する。図１２に
示す一連のステップには大多数の変形が可能である。同
様に、コンピュータ上の他のタスクに関するルーチン１
２００のタイミングも重要ではない。更に、コンピュー
タ４００内の特権ソフトウエアが、オペレータの代わり
にルーチン１２００を実行してもよい。破線１２０１
は、再構成が、コンピュータ・ハードウエアに手作業の
変化を加えることなく、オペレータまたはソフトウエア
のいずれかによって、繰り返し実行可能であることを象
徴的に示している。If so, block 1230 sets up any desired shared memory. At block 1231, the operator selects one of the system units 410 to export memory and returns to block 1232
In, a domain for importing the memory is selected. (The system unit "exports" the memory if its physically implemented memory is available to other system units, and the other system unit is located on the importing unit. As such, the memory is "imported.") At block 1233, the appropriate registers 944 are loaded, as described in connection with FIG. Block 1
At 234, the appropriate bits in registers 1020 are set, as described in connection with FIG. At block 1235, the value of the base address of the shared memory range is received from the operator. In block 1236, this is input to the appropriate SMB register group 945. At block 1237, the corresponding restricted address value is received,
This is loaded into SMLRs 946 at block 1238. If the operator wishes to define additional clusters, block 1226 returns control to block 1231. Otherwise, procedure 1200 ends. Most modifications can be made to the series of steps shown in FIG. Similarly, routine 1 for other tasks on the computer
The timing of 200 is not important either. Further, privileged software within computer 400 may execute routine 1200 on behalf of the operator. Broken line 1201
Symbolically indicates that the reconfiguration can be performed repeatedly, either by the operator or by software, without any manual changes to the computer hardware.

【００８３】システム・ユニット群４１０は、任意にド
メイン内に組み込むことができるが、明らかにドメイン
およびクラスタは全て、少なくとも１つのプロセッサが
実装されている少なくとも１つのシステム・ユニット、
およびメモリを含むシステム・ユニットを含んでいなけ
ればならない。ドメインまたはクラスタは、殆ど常に、
その１枚以上のシステム・ボード上に、いくらかのＩ／
Ｏ設備を含んでいる。これらの資源がどのようにしてド
メインまたはクラスタ内の種々のシステム・ボード間で
分配されるかは、しかしながら、任意である。方法１２
００は、システム４００全体およびそのオペレーティン
グ・システム（群）の正常動作の間に、ドメイン群およ
びクラスタ群を構成することができる。複雑性および些
細な間違いの可能性を回避するために、オペレーティン
グ・システム群のいずれもがブートされる前に、システ
ムが特殊状態にある場合にのみ再構成を許可することが
賢明である。The system units 410 can be arbitrarily incorporated within a domain, but obviously all domains and clusters are comprised of at least one system unit, at least one processor implemented,
And a system unit that includes memory. Domains or clusters are almost always
Some I / O on the one or more system boards
Includes O equipment. How these resources are distributed among the various system boards in a domain or cluster, however, is arbitrary. Method 12
00 can configure domains and clusters during normal operation of the entire system 400 and its operating system (s). To avoid complexity and the possibility of trivial mistakes, it is advisable to allow reconfiguration only if the system is in a special state before any of the operating systems are booted.

【００８４】図１３は、典型的なトランザクション１３
００の簡略図であり、正常動作の間、即ち、図１２のブ
ロック１２４２がコンピュータ４００の構成を完了した
後の、コンピュータ４００におけるドメイン群およびク
ラスタ群の効果を強調したものである。トランザクショ
ン１３００は、コンピュータ４００が１６個のシステム
・ユニットの最大の補集合(complement)を含むことを想
定している。トランザクションは線１３０１にて開始す
る。FIG. 13 shows a typical transaction 13
12 is a simplified diagram of FIG. 10, highlighting the effects of the domains and clusters on computer 400 during normal operation, ie, after block 1242 of FIG. Transaction 1300 assumes that computer 400 contains the largest complement of 16 system units. The transaction starts at line 1301.

【００８５】ブロック１３１０は、図１３の多数の列で
象徴化されているように、全てのシステム・ユニット４
１０上で実行される。これらは、異なる２個のシステム
・ユニット間または同じユニット内のいずれかにおい
て、トランザクション要求を開始する。要求群は、１個
以上のシステム・ユニットのいずれか１個以上のポート
がブロック群１３１１のいずれかにおいてトランザクシ
ョンを要求したときにはいつでも、非同期にかつ同時に
進行する。ブロック群１３１２において、ローカル・ア
ービタ５５Ｌ４は、多数のトラフィック等化優先アルゴ
リズムのいずれかに基づいて、システム・ユニット上の
要求元ポートを１個選択し、処理を進める。Block 1310 represents all system units 4 as symbolized by the multiple columns of FIG.
10 is executed. They initiate a transaction request either between two different system units or within the same unit. Requests proceed asynchronously and simultaneously whenever any one or more ports of one or more system units request a transaction in any of blocks 1311. In the block group 1312, the local arbiter 55L4 selects one requesting port on the system unit based on any of a number of traffic equalization priority algorithms, and proceeds.

【００８６】ブロック群１３２０は、全てのトランザク
ションについてアドレスを送信する。線１３２１−０な
いし１３２１−Ｆで示すように、各ブロック１３２２
は、全てのシステム・ユニット４１０−０ないし４１０
−Ｆからトランザクション要求を受け取り、システム・
ユニットへのバス群５５Ｇ１の１つに許可を与える。４
個のグローバル・アドレス・アービタ５５Ｇ２の各々は
標準的な公正方法を用いて、ブロック１３２２を並列に
実行し、その特定バス５５Ｇ１を競合するトランザクシ
ョン間で割り当てる。次に、ブロック１３２３は、ライ
ン１３２４によって示すように、そのブロック１３２２
によって選択されたシステム・ユニットからのアドレス
を、１６個のシステム・ユニット全てに同報通信する。
ここでも、本実施態様における４系統のバス５５Ｇ１の
各々は、かかる他のバス全てと同時に、別個のアドレス
を同報通信することができる。The block group 1320 sends addresses for all transactions. As indicated by lines 1321-0 through 1321-F, each block 1322
Are all system units 410-0 through 410
-F receives a transaction request from the
Permission is given to one of the bus groups 55G1 to the unit. 4
Each of the global address arbiters 55G2 executes block 1322 in parallel and allocates its particular bus 55G1 among competing transactions using standard fairness methods. Next, block 1323, as indicated by line 1324,
Broadcasts the address from the selected system unit to all 16 system units.
Also in this case, each of the four buses 55G1 in this embodiment can simultaneously broadcast a separate address simultaneously with all of the other buses.

【００８７】ステップ１３３０において、各バス５５Ｇ
１上のトランザクションを濾過し、適切なシステム・ユ
ニット４１０−０ないし４１０−Ｆのみがトランザクシ
ョンに作用することを許可されるようにする。別個のブ
ロック群１３３０が各システム・ユニット毎の各グロー
バル・アドレス・バス毎に存在するので、本実施例で
は、４ｘ１６＝６４個のブロック１３３０がある。各ブ
ロック１３３０は、レジスタ群１０２０から同時に、そ
のシステム・ユニットが、そのバス上のトランザクショ
ンの送出側ユニットと同じクラスタ内にあるか否かにつ
いて判定を行う。（単一のドメインはそれ自体が、レジ
スタ群１０２０においてクラスタとして定義されること
を思い出されたい。）ない場合、システム・ユニットは
このトランザクションを無視し、制御は出力１３０２に
渡る。その他の場合、制御は１３４０に渡る。At step 1330, each bus 55G
1 so that only the appropriate system units 410-0 through 410-F are allowed to act on the transaction. Since there is a separate block group 1330 for each global address bus for each system unit, there are 4 × 16 = 64 blocks 1330 in this embodiment. Each block 1330 determines from the register group 1020 simultaneously whether the system unit is in the same cluster as the transaction sending unit on the bus. (Recall that a single domain is itself defined as a cluster in registers 1020.) If not, the system unit ignores the transaction and control passes to output 1302. Otherwise, control passes to 1340.

【００８８】ブロック群１３４０の別個の組が、各シス
テム・ユニット内の各グローバル・アドレス・バスに対
して現れる。即ち、４ｘ１６＝６４組のブロックが現れ
る。ブロック群１３４１は、トランザクションが線９２
２に沿って図９の比較器９４１に移動する際に、ＧＡＢ
５５Ｇ１上のトランザクション自体からソース・ユニッ
トの番号を読み出す。ソース・ユニットが、それが位置
するユニットと同じドメイン内にないことを、ドメイン
・マスク・レジスタ９４２が明らかにした場合、ブロッ
ク１３４１は制御をブロック１３４２に渡す。共有メモ
リ・レジスタ９４４が、そのシステム・ユニットがメモ
リをソース・ユニットと共有していることを検出した場
合、ブロック１３４２からブロック１３４３に移動す
る。線９２２上を搬送されているトランザクションのア
ドレスが、レジスタ９４５内に格納されている基底アド
レスを超えることを、比較器９４１が示す場合、ブロッ
ク１３４４は、そのアドレスが、レジスタ９４６内に格
納されている共有メモリの上限より下にあるか否かにつ
いて検査を行う。そのシステム・ユニットが現トランザ
クションには関わっていないことを示す各ブロック１３
３０の各組について、終了１３４５において、その場所
におけるトランザクションを終了する。しかし、いずれ
かのフィルタ・ブロック・チェーン１３３０〜１３４０
が同じドメイン、または同じクラスタおよび適切なアド
レス範囲を感知した場合、線１３４６は、当該システム
・ユニットに対するブロック１３５０に制御を渡す。A separate set of blocks 1340 appears for each global address bus in each system unit. That is, 4 × 16 = 64 sets of blocks appear. The block group 1341 has a transaction of line 92
When moving along comparator 2 to the comparator 941 of FIG.
Read the source unit number from the transaction itself on 55G1. If the domain mask register 942 determines that the source unit is not in the same domain as the unit in which it is located, block 1341 passes control to block 1342. If the shared memory register 944 detects that the system unit is sharing memory with the source unit, it moves from block 1342 to block 1343. If comparator 941 indicates that the address of the transaction being carried on line 922 exceeds the base address stored in register 945, block 1344 indicates that the address is stored in register 946. A check is made to see if the shared memory is below the upper limit. Each block 13 indicating that the system unit is not involved in the current transaction
For each set of 30, at transaction 1345, the transaction at that location is terminated. However, any of the filter block chains 1330-1340
If it senses the same domain, or the same cluster and appropriate address range, line 1346 passes control to block 1350 for that system unit.

【００８９】ブロック１３５０は、要求元システム・ユ
ニットから、目標ユニット内の適正な宛先への実際のト
ランザクションを実行する。これには、データ・ルータ
４４０を通じてのあらゆる要求データの転送が含まれ
る。（先に注記したように、多くの異なるタイプのトラ
ンザクションがある）。点１３０２は、トランザクショ
ンの完了を印すものである。いずれの所与の時点におい
ても、フローチャート１３００の同一ブロックまたは異
なるブロックにおいて、いくつかの異なるトランザクシ
ョンが進行中である可能性があり、その各々が他とは独
立して進行する。Block 1350 performs the actual transaction from the requesting system unit to the proper destination in the target unit. This includes the transfer of any requested data through the data router 440. (There are many different types of transactions, as noted above). Point 1302 marks the completion of the transaction. At any given time, in the same or different blocks of flowchart 1300, several different transactions may be in progress, each of which proceeds independently of the others.

【００９０】図１５は、好適実施例に追加して、他のタ
イプのハードウエア障害がそれ自体のドメイン・クラス
タ外のシステム・ユニットに影響を与えるのを防止する
ことを可能にする、追加のドメイン・フィルタを示す。
ドメイン・フィルタ４８０は、これまで述べたように、
システム・ユニット４１０または図５のアドレス・ルー
タ５５Ｇ０内のエラーの影響が、同じドメイン・クラス
タ内にはない他のシステム・ユニットに及ぶのを制限す
る。FIG. 15 shows, in addition to the preferred embodiment, an additional type of hardware failure that can be prevented from affecting system units outside its own domain cluster. Indicates a domain filter.
The domain filter 480, as described above,
Limiting the effects of errors in system unit 410 or address router 55G0 of FIG. 5 to other system units that are not in the same domain cluster.

【００９１】図５との関連およびその他において説明し
たように、トランザクションは、１つのシステム・ユニ
ットから、グローバル・データ・ルータ５４Ｇ０上の他
のシステム・ユニットへのデータ転送を伴う場合があ
る。グローバル・データ・アービタは、システム・ユニ
ット全てから従来の信号１５１０を受け取る。例えば、
システム・ユニット４１０−０のローカル・データ・ア
ービタ５４Ｌ３からの線１５１０−０は、当該ユニット
４１０−０から、図４のシステム・ユニット４１０−０
ないし４１０−Ｆの特定の１つに転送を要求する場合が
ある。線１５０１−１は、システム・ユニット４１０−
０ないし４１０−Ｆのどれが、ユニット４１０−０から
の転送を受け取るか等を指定する。調停出力１５２０
は、データ線１５３０の１本からのデータが、線１５３
０の他の１本に流れることを可能にすることによって、
データ経路を確立する。例えば、ロジック５４Ｇ２が、
データをユニット４１０−０から４１０−１に搬送す
る、線１５１０−０の要求を許可した場合、FROM-0線１
５２１はデータ・バス５４Ｇ１を線１５３０−０に結合
し、TO-1線１５４０−１は直接TO-1線１５８０−１に結
合され、線１５３０−１がデータをユニット４１０−１
に送出可能とする。As described in connection with FIG. 5 and elsewhere, a transaction may involve a transfer of data from one system unit to another system unit on global data router 54G0. The global data arbiter receives conventional signals 1510 from all of the system units. For example,
The line 1510-0 from the local data arbiter 54L3 of the system unit 410-0 is connected from the unit 410-0 to the system unit 410-0 of FIG.
To request a transfer to a particular one of the STAs 410-F. Line 1501-1 is connected to system unit 410-
0-410-F specifies which ones will receive the transfer from unit 410-0, and so on. Arbitration output 1520
Indicates that data from one of the data lines 1530 is
By allowing it to flow to the other one of the 0
Establish a data path. For example, the logic 54G2 is
If the request on line 1510-0, which carries data from unit 410-0 to unit 410-1, is granted, FROM-0 line 1
521 couples data bus 54G1 to line 1530-0, TO-1 line 1540-1 is directly coupled to TO-1 line 1580-1, and line 1530-1 couples data to unit 410-1.
Can be sent to

【００９２】正常状態の下では、この構成はコンピュー
タ４００のドメイン構造には透過的である。しかしなが
ら、誤ってデータを違うシステム・ユニット（同じドメ
イン・クラスタ内のものではないシステム・ユニット）
に送出する障害は、他のクラスタにおけるシステム・ユ
ニットの動作を中断させる可能性がある。例えば、図４
のユニット４１０−０が同じドメインＳ１内のユニット
４１０−３にデータを送ろうとしたが、エラー信号がそ
れを代わりに（または付加的に）ユニット４１０−７に
送った場合を想定する。このような障害は、ドメイン・
フィルタ４８０によって実施されている分離を迂回し、
ドメインＳ１がドメインＳ２，Ｓ３の動作に影響を及ぼ
す可能性がある。これを「トランスグレッション・エラ
ー(transgression error)」と呼ぶ。Under normal conditions, this configuration is transparent to the computer 400 domain structure. However, erroneously transferring data to a different system unit (a system unit that is not in the same domain cluster)
Failures to send to other clusters can disrupt the operation of system units in other clusters. For example, FIG.
Suppose that unit 410-0 has attempted to send data to unit 410-3 in the same domain S1, but an error signal has instead (or additionally) sent it to unit 410-7. Such failures can be
Bypassing the separation implemented by filter 480;
The domain S1 may affect the operation of the domains S2 and S3. This is called a "transgression error".

【００９３】別のフィルタ・ロジック１５５０が、実行
しようとしたクラスタ外データ転送を通知することによ
って、この可能性を根絶する。図１０のレジスタ群１０
２０と同一の、別の組のクラスタ・レジスタ１５６０が
コンピュータ４００のクラスタ定義のコピーを保持し、
線１５６５を通じてそれらをロジック１５５０に渡す。
ロジック１５５０は図１４のフィルタ・ロジック１４０
０と同様に、ＡＮＤ／ＯＲ回路で構築する。ロジック１
５５０は、２組の出力を生成する。出力１５７０は、図
８および図１０に示した信号８２４と同じ種類のARBSTO
P信号を生成する。これらは、不適切なデータ転送を開
始したソース・システム・ユニットを停止させる。出力
１５８０は、不適切な要求を行ったソース・ユニットと
同じクラスタにはないあらゆるシステム・ユニットに、
この転送が影響を及ぼすのを防止する。上述の例を続け
ると、システム・ユニット群４１０、要求線１５１０等
における障害のために、データ経路５４Ｇ１が正しくな
い組の線１５３０を活性化する可能性がある。しかしな
がら、データ・ルータ・フィルタ・ロジック１５５０
は、レジスタ１５６０内のビットによって定義されてい
るように、ユニット４１０−０から適正な宛先は、同じ
ドメイン・クラスタ内のユニット４１０−１，４１０−
２，４１０−３のみであることを検出する。４１０−７
を宛先として指定するTO-7のような不適切な信号１５４
０は、ARBSTOP-0線１５７０−０を活性化し、このユニ
ット４１０−０が違法の転送を行おうとしたことを示
し、このユニットを停止する。即ち、ARBSTOP信号はソ
ース・ユニット、および同じドメイン・クラスタ内の他
のユニットに到達するので、ドメイン・クラスタＣＡ内
のエラーは、当該ドメイン・クラスタＣＡ内のシステム
・ユニットに影響を与えるに過ぎない。Another filter logic 1550 eliminates this possibility by signaling an attempted out-of-cluster data transfer. Register group 10 in FIG.
20, another set of cluster registers 1560 holds a copy of the cluster definition of computer 400;
Pass them to logic 1550 via line 1565.
Logic 1550 is the filter logic 140 of FIG.
Like the case of 0, it is constructed by an AND / OR circuit. Logic 1
550 produces two sets of outputs. Output 1570 is an ARBSTO of the same type as signal 824 shown in FIGS.
Generate a P signal. These stop the source system unit that initiated the improper data transfer. Output 1580 is sent to any system unit that is not in the same cluster as the source unit that made the improper request.
Prevent this transfer from affecting. Continuing the above example, a failure in system units 410, request line 1510, etc. may cause data path 54G1 to activate the incorrect set of lines 1530. However, the data router filter logic 1550
Is the correct destination from unit 410-0, as defined by the bits in register 1560, from units 410-1, 410- in the same domain cluster.
It is detected that there is only 2,410-3. 410-7
154, such as TO-7, which specifies the destination as a destination
A 0 activates the ARBSTOP-0 line 1570-0, indicates that this unit 410-0 has attempted an illegal transfer, and stops this unit. That is, since the ARBSTOP signal reaches the source unit and other units in the same domain cluster, an error in the domain cluster CA only affects the system unit in the domain cluster CA. .

【００９４】また、ロジック１５５０は、クラスタ・レ
ジスタ群１５６０内の定義を用いて、FROM信号を発行し
たユニットと同じクラスタ内にない宛先ユニットに、い
ずれのTO信号１５４０が到達するのも禁止する。本例で
は、TO線１５４０−０ないし１５４０−３のいずれかが
アサートされると、対応するTO線１５８０−０ないし１
５８０−３に渡され、対応するシステム・ユニット４１
０−０ないし４１０−３をイネーブルし、線１５３０−
０ないし１５３０−３上のデータを受け取る。一方、線
１５２０−０上のFROM信号およびTO信号１５５０−７の
同時発生、即ち、異なるクラスタ内のユニットに向けた
ものは、ロジック１５５０によって阻止される。したが
って、対応するTO線１５８０−７は休止状態のままであ
り、データ経路５４Ｇ１はデータをシステム・ユニット
４１０−７に渡さない。このように、フィルタ・ロジッ
クは、トランスグレッション・エラーが発生すると、デ
ータ転送を開始したユニットにARBSTOPを送出すること
によって、このユニットを遮断し、そのTO線を禁止する
ことにより、この転送が宛先ユニットにいかなる影響を
及ぼすことも防止するので、当該ユニットには実際に何
の転送も許可されることはない。The logic 1550 uses the definition in the cluster register group 1560 to prohibit any TO signal 1540 from reaching a destination unit that is not in the same cluster as the unit that issued the FROM signal. In this example, when any of the TO lines 1540-0 to 1540-3 is asserted, the corresponding TO line 1580-0 to 1580-1 is output.
580-3 and the corresponding system unit 41
Enable 0-0 through 410-3 and enable line 1530-
Receive the data on 0-1530-3. On the other hand, the coincidence of the FROM signal and TO signal 1550-7 on line 1520-0, that is, for units in different clusters, is blocked by logic 1550. Accordingly, the corresponding TO line 1580-7 remains dormant and data path 54G1 does not pass data to system unit 410-7. Thus, when a transgression error occurs, the filter logic shuts down this unit by sending an ARBSTOP to the unit that initiated the data transfer, and inhibits its TO line, thereby allowing the transfer to proceed to the destination unit. The unit is not actually allowed to transfer anything, as it prevents any effect on the unit.

【００９５】図１５は、図４のクロスポイント・データ
・ルータ４４０の一データ経路を示すに過ぎない。追加
のデータ経路も同様に機能し、単に追加の組のフィルタ
・ロジック１５５０を必要とするだけである。更に、各
データ経路は、更に障害の分離や冗長性のために、多数
の部分に分割してもよく、図１５の動作に影響を与える
ことはない。FIG. 15 only shows one data path of the crosspoint data router 440 of FIG. The additional data path functions similarly and simply requires an additional set of filter logic 1550. Further, each data path may be divided into a number of parts for further fault isolation and redundancy, without affecting the operation of FIG.

[Brief description of the drawings]

【図１】従来技術のバス指向マルチプロセッサ・デジタ
ル・コンピュータの概念概略図。FIG. 1 is a conceptual schematic diagram of a prior art bus-oriented multiprocessor digital computer.

【図２】多数のシステム・ユニットを有するコンピュー
タの同様の概略図。FIG. 2 is a similar schematic diagram of a computer having multiple system units.

【図３】本発明の概念にしたがって、図２のコンピュー
タをシステム・ドメインおよびクラスタに分割した場合
の構成図。FIG. 3 is a block diagram showing a case where the computer of FIG. 2 is divided into a system domain and a cluster according to the concept of the present invention.

【図４】本発明がいかにして図２のコンピュータを図３
のドメインおよびクラスタに分割するかを示す図。FIG. 4 illustrates how the present invention allows the computer of FIG.
FIG. 7 is a diagram showing whether the domain is divided into domains and clusters.

【図５】他のコンピュータ・システム・ユニットの関連
部分を含む、図４の完全に構成したシステム・ユニット
のブロック図。FIG. 5 is a block diagram of the fully configured system unit of FIG. 4, including relevant portions of other computer system units.

【図６】図５のポート・コントローラの詳細図。FIG. 6 is a detailed view of the port controller of FIG. 5;

【図７】図５のメモリ・コントローラの詳細図。FIG. 7 is a detailed view of the memory controller in FIG. 5;

【図８】図５のローカル・アドレス・アービタの詳細
図。FIG. 8 is a detailed view of a local address arbiter of FIG. 5;

【図９】本発明によるドメイン・フィルタのローカル部
分を含む、図５のローカル・アドレス・ルータの詳細
図。FIG. 9 is a detailed view of the local address router of FIG. 5, including the local part of the domain filter according to the present invention.

【図１０】ドメイン・フィルタのグローバル部分を含
む、図５のグローバル・アドレス・アービタの詳細図。FIG. 10 is a detailed view of the global address arbiter of FIG. 5, including the global portion of the domain filter.

【図１１】図４のドメイン・コンフィギュレータを示す
図。FIG. 11 is a diagram showing a domain configurator of FIG. 4;

【図１２】本発明にしたがってコンピュータをクラスタ
化システム・ドメインに構成する方法を示すフロー・チ
ャート。FIG. 12 is a flow chart illustrating a method of configuring a computer in a clustered system domain according to the present invention.

【図１３】本発明のドメイン・フィルタリングを強調し
た、トランザクション処理のフロー・チャート。FIG. 13 is a flow chart of a transaction process emphasizing domain filtering of the present invention.

【図１４】ドメイン・フィルタ内で使用する論理回路の
詳細図。FIG. 14 is a detailed diagram of a logic circuit used in the domain filter.

【図１５】ドメイン・フィルタの任意の更に別のグロー
バル部分を含む、図５のグローバル・データ・アービタ
を示す詳細図。FIG. 15 is a detailed diagram illustrating the global data arbiter of FIG. 5 including any further global portions of the domain filter.

[Explanation of symbols]

１００コンピュータ１１０プロセッサ１２０メモリ・ボード１３０Ｉ／Ｏボード１４０データ・バス１５０アドレス・バス１６０制御分配バス１７０サービス・ユニット２００アーキテクチャ２１０システム・ユニット２４０，２５０高速ルータ２６０制御分配バス３００コンピュータ３１０システム・ボード３４０データ・ルータ３５０アドレス・ルータ３６０制御分配手段３７０システム・コントローラ４００コンピュータ４１０システム・ユニット４４０データ・ルータ４５０アドレス・ルータ４６０制御バス４８０ドメイン・フィルタ５１０プロセッサ・サブシステム５１１マイクロプロセッサ５１２キャッシュ５２０メモリ・サブシステム５２１メモリ５２２パック／アンパック・モジュール５３０入出力サブシステム５３１システムＩ／Ｏバス５３２Ｉ／Ｏアダプタ Reference Signs List 100 computer 110 processor 120 memory board 130 I / O board 140 data bus 150 address bus 160 control distribution bus 170 service unit 200 architecture 210 system unit 240, 250 high-speed router 260 control distribution bus 300 computer 310 system board 340 data router 350 address router 360 control distribution means 370 system controller 400 computer 410 system unit 440 data router 450 address router 460 control bus 480 domain filter 510 processor subsystem 511 microprocessor 512 cache 520 memory Subsystem 521 Memory 522 Pack / Unpack module Lumpur 530 O subsystem 531 system I / O bus 532 I / O adapter

───────────────────────────────────────────────────── フロントページの続き (71)出願人 597004720 2550 ＧａｒｃｉａＡｖｅｎｕｅ，ＭＳＰＡＬ１−521，ＭｏｕｎｔａｉｎＶｉｅｗ，Ｃａｌｉｆｏｒｎｉａ 94043− 1100，ＵｎｉｔｅｄＳｔａｔｅｓｏｆＡｍｅｒｉｃａ (72)発明者アンドリュー・ジェイ・マククロクリンアメリカ合衆国カリフォルニア州92123, サン・ディエゴ，ウォーカー・ドライブ 2831 (72)発明者ニコラス・イー・アネシャンズレーアメリカ合衆国カリフォルニア州92129, サン・ディエゴ，ペナノヴァ・ストリート 11382 ────────────────────────────────────────────────── ─── Continuation of the front page (71) Applicant 597004720 2550 Garcia Avenue, MS PAL1-521, Mountain View, California 94043-1100, United States of America, United States of America 72. San Diego, Walker Drive 2831 (72) Inventor Nicholas e Anne Shansley 11382 Pennova Street, San Diego, California 92129, USA

Claims

[Claims]

1. A multi-processor computer having hardware domains variably configurable by instructions from an operator, comprising: a plurality of separate system units for executing a series of transactions, each system comprising: A processor unit which is physically removable and replaceable individually within the computer, each of which generates an address within a predetermined global range; data at a set of addresses within the predetermined global range. And a memory unit for storing and / or receiving at least one I / O adapter for generating and / or receiving a set of addresses within the predetermined global range, the system unit being coupled to the system unit. From any of the above system units A global address router for transferring the assigned address to another unit of the system unit; and a global data router for transferring data from any of the system units to another unit of the system unit. A router and the system unit from any of the system units.
A control signal distributor for communicating a plurality of control signals to other ones of the units and for affecting all operations of the system units in response to a condition occurring in any of the system units; and the computer Electronically divided into a plurality of software configurable hardware domains, each domain comprising any subset of the system units independent of any physical reconnection of the system units in the computer A configurator; a computer controller responsive to the instructions that specifies to the domain configurator which of the system units belongs to each of the hardware domains; and all of the system units Of the system unit A domain filter for electronically inhibiting at least some of the control signals emitted in one of the mains from affecting one of the system units outside the one domain. Multiprocessor computer equipped.

2. The computer according to claim 1,
The domain filter is coupled to at least one of the global routers, and a transaction on the one global router occurring in one of the domains of the system unit is:
A computer that prohibits being received on some of the system units outside the one domain.

3. The computer according to claim 2,
The computer wherein the one global router is the global address router.

4. The computer according to claim 3,
The global address router is connected to the system
A computer coupled to all of the units and having a plurality of paths for carrying a plurality of transactions simultaneously between different subsets of the system units.

5. The computer according to claim 2, wherein
The computer wherein the one global router is the global data router.

6. The computer according to claim 5,
The computer wherein the global data router is coupled to all of the system units and has a plurality of paths for carrying data relating to a plurality of transactions between different subsets of the system units simultaneously.

7. The computer according to claim 2,
The domain filter is coupled to both the global address router and the global data router so that both addresses and data generated in one of the domains of the system unit are transmitted to the system unit. A computer that prohibits being received on some of the units outside of the one domain.

8. The computer of claim 1, wherein the domain configurator further couples a plurality of the hardware domains and removes from any physical reconnection of the system units in the computer. Forming a domain cluster consisting of any subset of the independent domains; the computer controller responding to another one of the commands, and telling the domain configurator which of the system units is the domain cluster; And wherein the domain filter determines that the at least some control signals generated in one of the system units in one of the system units are out of the one of the system units. But the domain
A computer that allows you to affect what is in a cluster.

9. The computer according to claim 8, wherein
Computer wherein one of the domains in the domain cluster includes physical memory accessible by a different domain in the domain cluster within the same predetermined shared address range.

10. The computer of claim 1, wherein the domain filter comprises: a connection identifying which of the system units generated the current address in the address router; and the system unit. Responsive to at least one domain mask register for each of the system units, specifying which of the domains belongs to which of the domains; the source identification connection; and the domain mask register; Gate logic for disconnecting the system unit from all of the system units that are not in the same domain where the system unit generated the current address.

11. The computer of claim 10, wherein the domain filters each identify which of the system units belong to a domain cluster and respond to a current one of the transactions. A register, and a connection for sending a valid transaction signal to each of the system units in the common cluster for any of the transactions originating from one of the system units belonging to the domain cluster; Including computer.

12. The computer of claim 11, wherein the domain filter includes a shared address register indicating a shared address range between different system units in the domain cluster.

13. A computer having a plurality of system units, a global address router, a global data router, a control signal distributor, and a domain filter, comprising: A method for partitioning into independent hardware domains, comprising: (a) initiating a configuration mode; and (b) designation data defining a subset of the system units to be included in one of the hardware domains. (C) loading the specified data into a domain filter, causing one of the system units in the domain to respond to a control signal in the distributor, and Causing the distributor to be unresponsive to another of the units; and (d) the system. For another specified data defining a different subset of units, the method comprising the steps of repeating steps (b) and (c), the.

14. The method according to claim 13, wherein said step (c) is also responsive to said designated data loading said domain filter, wherein said step (c) includes: , The system
Responding to an address on the global address router originating from one of the units within the one domain, and providing the system unit in the first domain to the first domain of the system unit Not responding to addresses on the global address router originating from at least some that are not.

15. The method of claim 14, further comprising: (f) receiving second specified data defining a cluster of domains of the domain; and (g) storing the second data in the domain. Loading into a filter and responding to an address on the global address router emanating from one of the system units in the cluster of the domain and from one of the system units in the cluster of the domain Causing the data to be transmitted.

16. The method of claim 15, further comprising: (h) address sharing physically present in one of the domains and accessible to other domains in a cluster of the domains. Receiving third specified data defining a range; and (i) loading the third data into the domain filter and assigning the third data to one of the system units in the cluster of the domain. Responding to addresses on the global address router originating from within the cluster of domains but within the sharing range.

17. The method of claim 16, wherein the shared range is less than an entire address range of a memory physically located in at least one of the system units in the domain cluster.

18. The method of claim 13, wherein the method is performed after step (d), and (j) transmitting a transaction from one of the system units in the first domain to the global address. Broadcasting all of the system units, both inside and outside the first domain, via a router; and (k) filtering the transaction at each of the system units, Enabling response of the transaction to those of the units that are in the first domain and disabling the response to the transaction to other of the system units that are outside of the first domain; Disabling.

19. The method according to claim 18, wherein steps (j) and (k) are performed after step (e).

20. The method of claim 18, wherein step (k) does not disable all of the system units outside the first domain.

21. The method according to claim 18, wherein the plurality of systems are located in different domains of the domain.
The method wherein the unit physically comprises a memory having addresses in each range, wherein the addresses in each range at least partially overlap.

22. A system unit for a multiprocessor computer having a global address router, a global data router, and a control signal distributor, wherein the control signal distributor comprises a plurality of the system units. Interconnecting the other, said computer also having a computer controller, said system
A unit coupled to both of the global routers for receiving at least one processor unit for generating an address within a predetermined global range; and a unit coupled to the global router and configured to receive an address within the predetermined global range. Means for receiving at least one memory unit for storing data at a set of addresses; and at least one coupled to the global router for generating and / or receiving a set of addresses within the predetermined global range. Means for receiving an input / output adapter, and means for generating a control signal at the distributor, coupled to at least one of the aforementioned means, wherein the control signal is indicative of an error condition within the system unit; Displays the error status in the other system unit. Means for receiving control signals; means for filtering the control signals so that only control signals from selectable ones of the other units can affect the operation of the system unit; Means connectable to a computer controller and selecting said one from said other units.

23. The system unit according to claim 22, wherein said filter means comprises: a domain mask register holding data specifying said selectable one of said other units; and said selectable one. Gate means for passing certain signals from and blocking said certain signals from others of said system unit.

24. The system unit according to claim 23, further comprising the step of storing variable data in the domain mask.
A system unit that has means for loading registers.

25. The system unit according to claim 22, wherein said filter means stores data specifying a memory physically implemented in any of said system units in said computer. A shared memory register that is accessible to and maintained in a portion of said global range.

26. The system unit of claim 25, further comprising means for loading variable data into said shared memory register.

27. The system unit according to claim 25, wherein said filter means specifies an address range consisting only of a part of said memory physically mounted on any one of said system units. A system unit including at least one other shared memory register for holding

28. A system unit among a plurality of system units for a multiprocessor computer, wherein the multiprocessor computer sends an address issued in any of the system units to the system unit. A global address router for forwarding to all other units, wherein each address of said address has a source identifier indicating which of said plurality of system units issued said respective address. A router and a global router for transferring data from any of the system units to all other of the system units.
A data router, transmitting a plurality of control signals from any of the system units to all other ones of the system units, and responding to a condition occurring in any of the system units; A control signal distributor that affects the overall operation of the unit; and electronically divides the computer into a plurality of software configurable hardware domains, each one of the system units in the computer. A domain configurator composed of any subset of the system units, independent of physical reconnection of the system units, and in response to the command, which of the system units belong to each of the hardware domains Is the computer code that specifies the domain configurator. A controller, coupled to all of the system units, wherein at least some of the control signals emitted in one of the domains of the system units are outside of the one domain. A domain filter for electronically inhibiting any of the following: wherein said one system unit is at least one subsystem connected to said global address router, said subsystem comprising: Combining the address between a system and any other of the system units, coupled to the global data router, and transferring transaction data between the subsystem and any other of the system units. Subsystem that executes a transaction A processor subsystem selected from the group consisting of a processor subsystem that stores data within the global range, an input / output subsystem that communicates with an input / output adapter, and a processor subsystem coupled to the distributor. , At least one of the control signals
A generator, at least one receiver for the control signal, and a domain for receiving from the computer controller a value indicating which of the plurality of system units belongs to the same domain as the one system unit. A writable mask register; coupled to the domain mask register, generating an inhibit signal if the source identifier indicates that each address did not originate from within the same domain;
A comparator coupling the inhibit signal to the at least one subsystem and rendering it unresponsive to each of the addresses.

29. The system unit of claim 28, wherein the domain configurator further combines a plurality of the hardware domains and is independent of any physical reconnection of the system unit in the computer. At least one writable, indicating which of the plurality of system units belongs to the same domain cluster as the one system unit, forming a domain cluster comprising an arbitrary subset of the domains. A shared memory mask register, wherein the comparator is further coupled to the domain mask register, wherein each of the addresses originated at some of the system units outside the same domain cluster. The source identifier indicates that Generating said inhibit signal, the system unit.

30. The system unit of claim 31, wherein said at least one subsystem includes said memory subsystem storing data within a portion of said global address range.
The unit further comprises at least one shared memory address register for receiving from the computer controller a value defining a shared range of memory addresses within the portion of the global range, the comparator comprising: A system unit responsive to the shared memory address register for inhibiting the memory subsystem if the respective address is outside the shared range.

31. The system unit of claim 28, wherein said at least one generator of said control signal forms part of a local address arbiter requesting and receiving access to said global address router. System unit to do.

32. The system unit of claim 33, wherein said at least one receiver of said control signal further forms part of said local address arbiter.

33. The system unit of claim 28, wherein said one system unit comprises at least two different ones of said subsystems.
unit.

34. The system unit of claim 33, wherein said one system unit includes all three of said subsystems.

35. The system unit of claim 28, wherein said processor subsystem includes a plurality of separate microprocessors.