JP2936868B2

JP2936868B2 - Message packet routing method for array processor

Info

Publication number: JP2936868B2
Application number: JP4034860A
Authority: JP
Inventors: Ichiro Kuroda (一朗黒田)
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1992-02-21
Filing date: 1992-02-21
Publication date: 1999-08-23
Anticipated expiration: 2014-08-23
Also published as: JPH05233567A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to an array processor that performs interprocessor communication by linking processors together.

[0002]

[Prior Art] Conventionally, array processors have been implemented in one of two ways: 1) a scheme in which communication takes place only between adjacent processors, and 2) a scheme in which a router is used to provide communication between remote processors.

[0003] The scheme that communicates only between adjacent processors requires relatively little hardware for interprocessor communication. When remote processors communicate, however, adjacent-processor transfers must be repeated at every processor on the path from the sending processor to the receiving processor, so both the communication time and the load placed on each processor along the path become problems. Array processors that communicate only between adjacent processors are described in S. Y. Kung, "Wavefront Array Processors - Concepts to Implementation", IEEE Computer, July 1987, pp. 18-33. The router-based scheme, by contrast, requires more hardware for interprocessor communication, but because communication from the sending processor to the receiving processor passes through routers, the communication time is relatively short and the load on other processors is not a problem.

[0004] The routing method used in a router-based scheme depends on how the processors are interconnected. In a two-dimensional mesh, for example, routing is performed by specifying the number of steps (hops) in the east-west direction and the number of steps (hops) in the north-south direction. A system that performs this kind of routing is described in Section 5.3.3 of W. J. Dally, "A VLSI Architecture for Concurrent Data Structures", Kluwer Academic Press, 1987.

[0005]

[Problems to Be Solved by the Invention] However, because the routes used by conventional routing methods are fixed, some applications concentrate traffic on a single route, creating a communication bottleneck that can hinder parallelization. In addition, broadcasting data to a plurality of processors required a separate communication for each destination processor. One conceivable remedy is a rewritable routing table, which lets a program set the routing freely to suit the processing and lets a source processor broadcast data to a plurality of processors with a single message transmission. The problem with this approach is that the size of the routing table grows as the number of processors increases.

[0006] An object of the present invention is to provide a routing method for an array processor that suppresses growth in the size of the routing table as the number of processors increases.

[0007]

[Means for Solving the Problems] The first invention is a method of routing a message packet from a source processor to one or more destination processors in an array processor composed of a plurality of arithmetic processors and a low-dimensional mesh interconnection network for communication between the processors. Each arithmetic processor in the network is provided with means for transferring a message packet, either generated by the arithmetic processor or received over one of a plurality of connected links, to one or more designated destinations among the connected links and the arithmetic processor, and with routing information storage means, rewritable by the arithmetic processor, that designates from the address information in the message packet the one or more links or the arithmetic processor to which the packet is to be transferred. The method is characterized in that the array processor is divided into a plurality of clusters, and all or some of the addresses in the routing information storage means of all or some of the processors in a cluster are assigned to processors in other clusters, thereby enabling messages to be transferred to processors outside the cluster.

[0008] The second invention is a method of routing a message packet from a source processor to one or more destination processors in an array processor composed of a plurality of arithmetic processors and a low-dimensional mesh interconnection network for communication between the processors. Each arithmetic processor in the network is provided with means for transferring a message packet, either generated by the arithmetic processor or received over one of a plurality of connected links, to one or more designated destinations among the connected links and the arithmetic processor, and with routing information storage means, rewritable by the arithmetic processor, that designates from the address information in the message packet the one or more links or the arithmetic processor to which the packet is to be transferred. The method is characterized in that the array processor is divided into a plurality of clusters and one address is assigned to each cluster. When a sending processor sends a message packet to a destination processor belonging to a different cluster, a cluster address and a local address within the cluster are attached to the packet, and routing by the cluster address transfers the packet to a gateway processor that holds that cluster address within the cluster to which the destination processor belongs. When the gateway processor receives a message packet designated by the cluster address, it resets the packet so that routing is performed by the local address and retransmits it, thereby transferring the packet to the destination processor.

[0009] The third invention is characterized in that, in the routing method of the second invention, a plurality of processors in each cluster that connect to the outside of the cluster are made gateway processors having the same cluster address, and the routing information is set so that processors outside the cluster can route to the respective gateway processors without their routes crossing one another, whereby a plurality of messages are transferred to processors in the cluster simultaneously.

[0010]

[Operation] The principle of the present invention is as follows. In the message packet routing method of the first invention, the destination address information attached to a message packet is used to access the routing table and obtain the transfer destination. Each routing table entry has one bit for every possible transfer destination (the connected links and the processor) that sets whether or not to transfer there, so transfer to a plurality of links or arithmetic processors is possible.

[0011] In a cluster in which each processor is assigned a unique address, the routing tables of a contiguous set of processors bordering an adjacent cluster are set so that the transfer destination for some of the addresses points toward the adjacent cluster. A packet specifying such an address is then not delivered to the processor with that address in the sending processor's own cluster, but can instead be transferred to the processor with the corresponding address in the adjacent cluster.

[0012] With this method, a processor array can be built by joining together a plurality of clusters, each of which is limited in size by the size of the routing table.

[0013] In the message packet routing method of the second invention, the destination address information attached to a message packet is likewise used to access the routing table and obtain the transfer destination. Each routing table entry has one bit for every possible transfer destination (the connected links and the processor) that sets whether or not to transfer there, so transfer to a plurality of links or arithmetic processors is possible. The routing table also specifies transfer destinations for both cluster addresses and intra-cluster local addresses. A cluster address is assigned to some or all of the processors in a cluster.

[0014] When a sending processor transfers a message packet to a destination processor belonging to a different cluster, it attaches a cluster address and an intra-cluster local address to the message. Routing is first performed by the cluster address, which carries the packet to a processor (hereinafter called a gateway processor) in the cluster to which the destination processor belongs. The gateway processor takes the message packet into the processor, extracts the intra-cluster local address from the packet, sets the packet to be routed by that address, and retransmits it. The message thereby reaches the destination processor.

[0015] With this method, dividing N² processors into M² clusters reduces the routing table size required by each processor from N² entries to (N/M)² + M² entries, that is, the number of intra-cluster local addresses, (N/M)², plus the number of cluster addresses, M².
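The arithmetic in this paragraph can be checked with a short sketch. The function names and the example values N = 16, M = 4 are illustrative assumptions and do not appear in the specification; the sketch also assumes N is divisible by M so that all clusters are the same size.

```python
def flat_table_entries(n: int) -> int:
    """Entries per routing table when every one of the N*N processors has its own address."""
    return n * n


def clustered_table_entries(n: int, m: int) -> int:
    """Entries per routing table when N*N processors are split into M*M equal clusters.

    Each table needs one entry per local address inside a cluster, (N/M)^2,
    plus one entry per cluster address, M^2.
    """
    if n % m != 0:
        raise ValueError("this sketch assumes N is divisible by M")
    return (n // m) ** 2 + m ** 2


# Hypothetical example: a 16 x 16 array divided into 4 x 4 clusters.
flat = flat_table_entries(16)
clustered = clustered_table_entries(16, 4)
print(flat, clustered)
```

For this example the per-processor table shrinks from 256 entries to 32.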

[0016] In the message packet routing method of the third invention, when a plurality of processors in each cluster that border the outside of the cluster are made gateway processors having the same cluster address, a message packet transferred to the cluster would be delivered to every gateway processor. Therefore, letting L be the number of gateway processors, the processors outside the cluster are divided into L sets, each contiguous with one of the L gateway processors, so that each set can route to its own gateway processor without the routes crossing one another. Up to L message transfers into the cluster can then be carried out simultaneously.

[0017]

[Embodiments] Embodiments of the present invention will now be described with reference to the drawings.

[0018] FIG. 1 shows an embodiment of the first invention: an array processor in which unit processor nodes 10, each connected to four links, are connected in a two-dimensional grid, and which is divided into two clusters, a first and a second, each also a two-dimensional grid.

[0019] FIG. 2 shows the configuration of the unit processor node 10. The unit processor consists of an arithmetic processor 21, a processor port 22, a first communication port 23, a second communication port 24, a third communication port 25, a fourth communication port 26, a control circuit 27, and a routing table 28.

[0020] The first communication port 23, second communication port 24, third communication port 25, and fourth communication port 26 each receive message packets sent from the other unit processor node connected to them, extract the destination address information from the message packet, and send it to the control circuit 27. The processor port 22 extracts the destination address information from a message packet written by the arithmetic processor 21 and sends it to the control circuit 27.

[0021] The control circuit 27 looks up the routing table 28 with the destination address it receives and sends a message transfer control signal to whichever of the first communication port 23, second communication port 24, third communication port 25, and fourth communication port 26 is to forward the packet, or to the processor port 22. The routing table is a readable and writable memory in which each address word consists of 5 bits, one bit for each of the five destinations (the first communication port 23, second communication port 24, third communication port 25, fourth communication port 26, and processor port 22), and whose address space equals the number of processors plus a fixed number of additional entries. The routing table 28 can be set and modified by the arithmetic processor 21.
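A minimal sketch of such a table follows, assuming a particular bit order and the P/N/E/S/W destination names used later for FIG. 4; the specification fixes neither the bit positions nor which communication port faces which direction, so `Dest`, `make_table`, and `lookup` are hypothetical names chosen for illustration.

```python
from enum import IntFlag


class Dest(IntFlag):
    """One bit per possible transfer destination (bit order is an assumption)."""
    P = 0b00001  # processor port: deliver to the local arithmetic processor
    N = 0b00010  # link toward the node above
    E = 0b00100  # link toward the node to the right
    S = 0b01000  # link toward the node below
    W = 0b10000  # link toward the node to the left


def make_table(num_processors: int, extra_entries: int) -> list:
    """Rewritable table: one 5-bit word per address, all bits initially clear."""
    return [Dest(0)] * (num_processors + extra_entries)


def lookup(table: list, dest_addr: int) -> list:
    """Return the names of every destination whose bit is set for this address."""
    word = table[dest_addr]
    return [d.name for d in (Dest.P, Dest.N, Dest.E, Dest.S, Dest.W) if d in word]


# Hypothetical settings: 16 processors plus 4 additional addresses.
table = make_table(16, 4)
table[2] = Dest.E            # forward address 2 to the right-hand neighbor
table[5] = Dest.P | Dest.S   # deliver locally and also forward downward
print(lookup(table, 2), lookup(table, 5))
```

Because several bits of one word can be set at once, a single lookup can fan a packet out to more than one destination, which is how a one-message broadcast becomes possible.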

[0022] On receiving a message transfer control signal, the first communication port 23, second communication port 24, third communication port 25, and fourth communication port 26 send the received message packet out to the adjacent unit processor node connected to them. On receiving a message transfer control signal, the processor port 22 receives and stores the incoming message so that the arithmetic processor 21 can read it.

[0023] FIG. 3 shows an example of message transfer. FIG. 3 depicts the array processor of FIG. 1, connected in a two-dimensional grid. In FIG. 3, the P inside each unit processor denotes the arithmetic processor. The first cluster of the array processor shown in FIG. 3 consists of a first unit processor 31 having local address 1, a second unit processor 32 having local address 2, a third unit processor 33 having local address 3, a fourth unit processor 34 having local address 4, and so on. The second cluster consists of a fifth unit processor 35 having local address 3, a sixth unit processor 36 having local address 1, a seventh unit processor 37 having local address 2, and so on. Of these, the third unit processor 33 and the fourth unit processor 34 are the nodes in the first cluster that can transfer to another cluster.

[0024] FIG. 4 shows the routing table of each processor: (a) is the routing table for the first, third, fourth, and sixth unit processors, (b) is the routing table for the second and seventh unit processors, and (c) is the routing table for the fifth unit processor. In FIG. 4, the P column is the bit that specifies transfer to the arithmetic processor, the N column upward, the E column to the right, the S column downward, and the W column to the left. The message packet sent out from the arithmetic processor of the third unit processor 33 carries the out-of-cluster local address 2 and is transferred to a processor in a different cluster. First, the value E at address 2 in the routing table of the third unit processor 33 (FIG. 4(a)) sends the packet to the fourth unit processor 34. At the fourth unit processor 34, the value E at address 2 in its routing table (FIG. 4(a)) sends the packet to the fifth unit processor 35, which belongs to the second cluster. At the fifth unit processor 35, the value N at address 2 in its routing table (FIG. 4(c)) sends the packet to the sixth unit processor 36. At the sixth unit processor 36, the value E at address 2 in its routing table (FIG. 4(a)) sends the packet to the seventh unit processor 37. At the seventh unit processor 37, the value P at address 2 in its routing table (FIG. 4(b)) causes the packet to be written to the processor port of the seventh unit processor 37.

[0025] The message transfer path is thus determined by the routing table settings of the unit processors shown in FIG. 4. Although the packet sent from the third unit processor 33 in the first cluster carries local address 2, it is not transferred to the second unit processor 32, which has local address 2 in the first cluster, but to the seventh unit processor 37, which has local address 2 in the second cluster.
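The hop-by-hop walk described for FIG. 3 and FIG. 4 can be traced with a small simulation. This is only a sketch: it encodes just the links and the address-2 table entries named in the text, stores a single direction letter per entry instead of a 5-bit word, and the names `LINKS`, `TABLES`, and `route` are hypothetical.

```python
# Links used in the walk-through: (node, direction) -> neighboring node.
LINKS = {
    (33, "E"): 34,
    (34, "E"): 35,  # crosses from the first cluster into the second
    (35, "N"): 36,
    (36, "E"): 37,
}

# Routing-table entries for address 2 only, as stated for FIG. 4.
TABLES = {
    33: {2: "E"},  # table (a)
    34: {2: "E"},  # table (a)
    35: {2: "N"},  # table (c)
    36: {2: "E"},  # table (a)
    37: {2: "P"},  # table (b): deliver to the local arithmetic processor
}


def route(source: int, address: int, max_hops: int = 16) -> list:
    """Follow table entries from `source` until some node delivers to its processor port."""
    path = [source]
    node = source
    for _ in range(max_hops):
        direction = TABLES[node][address]
        if direction == "P":
            return path
        node = LINKS[(node, direction)]
        path.append(node)
    raise RuntimeError("packet was not delivered within max_hops")


path = route(33, 2)
print(path)
```

The trace never visits unit processor 32, even though it also holds local address 2, because no table on the path points toward it for that address.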

[0026] FIG. 5 shows an embodiment of the second invention: an array processor in which unit processor nodes 10, each connected to four links, are connected in a two-dimensional grid, and which is divided into four clusters, a first, second, third, and fourth, each also a two-dimensional grid. The configuration of the unit processor node 10 is the same as that shown in FIG. 2.

[0027] FIG. 6 shows an example of message transfer. FIG. 6 depicts the array processor of FIG. 5, connected in a two-dimensional grid. In FIG. 6, the P inside each unit processor denotes the arithmetic processor. The first cluster of the array processor shown in FIG. 6 consists of a first unit processor 31, a second unit processor 32, a third unit processor 33, and so on; the second cluster consists of a fourth unit processor 34, a fifth unit processor 35, a sixth unit processor 36, and so on; and the third cluster consists of a seventh unit processor 37, an eighth unit processor 38, and so on. Of these, the fourth unit processor 34 is the gateway node of the second cluster, and the seventh unit processor 37 is the gateway node of the third cluster.

[0028] FIG. 7 shows the routing table of each processor: (a) is the routing table for the first, second, third, and fourth unit processors, (b) is the routing table for the fifth and sixth unit processors, (c) is the routing table for the seventh unit processor, and (d) is the routing table for the eighth unit processor. In FIG. 7, the P column is the bit that specifies transfer to the arithmetic processor, the N column upward, the E column to the right, the S column downward, and the W column to the left.

[0029] The message packet sent out from the arithmetic processor of the first unit processor 31 carries cluster address 200 and intra-cluster local address 2. Because it is a message transfer to a processor in a different cluster, routing is first performed by the cluster address. First, the value E at address 200 in the routing table of the first unit processor 31 (FIG. 7(a)) sends the packet to the second unit processor 32. At the second unit processor 32, the value E at address 200 in its routing table (FIG. 7(a)) sends the packet to the third unit processor 33. At the third unit processor 33, the value E at address 200 in its routing table (FIG. 7(a)) sends the packet to the fourth unit processor 34. At the fourth unit processor 34, the value E at address 200 in its routing table (FIG. 7(a)) sends the packet to the fifth unit processor 35.

[0030] At the fifth unit processor 35, the value S at address 200 in its routing table (FIG. 7(b)) sends the packet to the sixth unit processor 36. At the sixth unit processor 36, the value S at address 200 in its routing table (FIG. 7(b)) sends the packet to the seventh unit processor 37.

[0031] At the seventh unit processor 37, the value P at address 200 in its routing table (FIG. 7(c)) causes the packet to be written to the processor port of the seventh unit processor 37, which is the gateway processor of the third cluster. The arithmetic processor of the seventh unit processor resets the message packet so that routing is performed by the intra-cluster local address 2 in the packet, and transmits it. The message packet sent out from the arithmetic processor of the seventh unit processor 37 is sent to the eighth unit processor 38 by the value S at address 2 in the routing table of the seventh unit processor 37. At the eighth unit processor 38, the value P at address 2 in its routing table (FIG. 7(d)) causes the packet to be written to the processor port of the eighth unit processor 38, which is the destination processor.

[0032] The message transfer path is thus determined by the routing table settings of the unit processors shown in FIG. 7.
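The two-stage transfer of FIG. 6 and FIG. 7 (route by cluster address to the gateway, then re-send by local address) can be sketched as follows. Only the links and table entries named in the text are encoded, each entry holds a single direction letter rather than a 5-bit word, and `LINKS`, `TABLES`, `forward`, and `send` are hypothetical names.

```python
# Links used in the walk-through: (node, direction) -> neighboring node.
LINKS = {
    (31, "E"): 32, (32, "E"): 33, (33, "E"): 34, (34, "E"): 35,
    (35, "S"): 36, (36, "S"): 37, (37, "S"): 38,
}

# Table entries named in the text: 200 is the cluster address, 2 the local address.
TABLES = {
    31: {200: "E"}, 32: {200: "E"}, 33: {200: "E"}, 34: {200: "E"},  # table (a)
    35: {200: "S"}, 36: {200: "S"},                                  # table (b)
    37: {200: "P", 2: "S"},  # table (c): gateway of the third cluster
    38: {2: "P"},            # table (d): destination processor
}


def forward(source: int, key: int, max_hops: int = 16) -> list:
    """Follow table entries for `key` until a node delivers to its processor port."""
    path = [source]
    node = source
    for _ in range(max_hops):
        direction = TABLES[node][key]
        if direction == "P":
            return path
        node = LINKS[(node, direction)]
        path.append(node)
    raise RuntimeError("packet was not delivered within max_hops")


def send(source: int, cluster_addr: int, local_addr: int):
    """Stage 1 routes by cluster address; the gateway then re-sends by local address."""
    to_gateway = forward(source, cluster_addr)
    gateway = to_gateway[-1]
    to_destination = forward(gateway, local_addr)
    return to_gateway + to_destination[1:], gateway


full_path, gateway = send(31, 200, 2)
print(full_path, gateway)
```

Nodes outside the third cluster need only the single entry for address 200, which is where the table-size saving comes from.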

[0033] Next, an embodiment of the third invention will be described. The division of the array processor into clusters is the same as in FIG. 5, and the configuration of the unit processor node is the same as that shown in FIG. 2.

[0034] FIG. 8 shows an example of message transfer. FIG. 8 depicts the array processor of FIG. 5, connected in a two-dimensional grid. In FIG. 8, the P inside each unit processor denotes the arithmetic processor. The first cluster of the array processor shown in FIG. 8 consists of a first unit processor 31, a second unit processor 32, a third unit processor 33, a ninth unit processor 39, a tenth unit processor 40, an eleventh unit processor 41, and so on; the second cluster consists of a fourth unit processor 34, a fifth unit processor 35, a sixth unit processor 36, a twelfth unit processor 42, and so on; and the third cluster consists of a seventh unit processor 37, an eighth unit processor 38, a thirteenth unit processor 43, a fourteenth unit processor 44, and so on. Of these, the fourth unit processor 34 and the twelfth unit processor 42 are the gateway nodes of the second cluster, and the seventh unit processor 37 and the thirteenth unit processor 43 are the gateway nodes of the third cluster.

[0035] FIGS. 9 and 10 show the routing table of each processor. FIG. 9(a) is the routing table for the first, second, third, fourth, ninth, tenth, and eleventh unit processors; FIG. 9(b) is the routing table for the fifth, sixth, and twelfth unit processors; FIG. 9(c) is the routing table for the seventh unit processor; FIG. 9(d) is the routing table for the eighth unit processor; FIG. 10(e) is the routing table for the thirteenth unit processor; and FIG. 10(f) is the routing table for the fourteenth unit processor.

[0036] In FIGS. 9 and 10, the P column is the bit that specifies transfer to the arithmetic processor, the N column upward, the E column to the right, the S column downward, and the W column to the left.

[0037] The message packet sent out from the arithmetic processor of the first unit processor 31 carries cluster address 200 and intra-cluster local address 2. Because it is a message transfer to a processor in a different cluster, routing is first performed by the cluster address. First, the value E at address 200 in the routing table of the first unit processor 31 (FIG. 9(a)) sends the packet to the second unit processor 32. At the second unit processor 32, the value E at address 200 in its routing table (FIG. 9(a)) sends the packet to the third unit processor 33. At the third unit processor 33, the value E at address 200 in its routing table (FIG. 9(a)) sends the packet to the fourth unit processor 34. At the fourth unit processor 34, the value E at address 200 in its routing table (FIG. 9(a)) sends the packet to the fifth unit processor 35.

[0038] At the fifth unit processor 35, the value S at address 200 in its routing table (FIG. 9(b)) sends the packet to the sixth unit processor 36. At the sixth unit processor 36, the value S at address 200 in its routing table (FIG. 9(b)) sends the packet to the seventh unit processor 37.

[0039] At the seventh unit processor 37, the value P at address 200 in its routing table (FIG. 9C) causes the packet to be written to the processor port of the seventh unit processor 37, which is a gateway processor of the third cluster. The arithmetic processor of the seventh unit processor 37 resets the packet so that routing is performed by the intra-cluster local address 2 contained in the message packet, and retransmits it. The message packet sent from the arithmetic processor of the seventh unit processor 37 is forwarded to the eighth unit processor 38 by the value S at address 2 in the routing table of the seventh unit processor 37. At the eighth unit processor 38, the value P at address 2 in its routing table (FIG. 9D) causes the packet to be written to the processor port of the eighth unit processor 38, which is the destination processor.
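The two-phase transfer from unit processor 31 to unit processor 38 can be traced with a short simulation. This is a minimal sketch under stated assumptions: `TABLES`, `LINKS`, and `route` are hypothetical names, each table holds only the entries quoted in the walk-through, and the link map contains only the hops the text describes.

```python
# Routing table entries quoted in the text (FIGS. 9A to 9D), keyed by unit processor.
TABLES = {
    31: {200: "E"}, 32: {200: "E"}, 33: {200: "E"}, 34: {200: "E"},
    35: {200: "S"}, 36: {200: "S"},
    37: {200: "P", 2: "S"},   # gateway: cluster address 200 ends here
    38: {2: "P"},             # destination for local address 2
}

# Neighbor reached over each link, as described hop by hop in the text.
LINKS = {
    (31, "E"): 32, (32, "E"): 33, (33, "E"): 34, (34, "E"): 35,
    (35, "S"): 36, (36, "S"): 37, (37, "S"): 38,
}


def route(source, cluster_addr, local_addr):
    """Return the list of unit processors visited by one message packet."""
    node, path = source, [source]
    address, by_cluster = cluster_addr, True
    while True:
        direction = TABLES[node][address]
        if direction != "P":
            # Forward over the designated link.
            node = LINKS[(node, direction)]
            path.append(node)
        elif by_cluster:
            # Gateway: the arithmetic processor switches to local-address
            # routing and retransmits the packet.
            address, by_cluster = local_addr, False
        else:
            # Written to the processor port of the destination processor.
            return path
```

With these tables, `route(31, 200, 2)` visits processors 31 through 38 in order, switching from cluster address 200 to local address 2 at processor 37.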

[0040] Meanwhile, the message packet sent from the arithmetic processor of the ninth unit processor 39 carries cluster address 200 and intra-cluster local address 3. Because it is a message transfer to a processor in a different cluster, routing is first performed by the cluster address. First, the value E at address 200 in the routing table of the ninth unit processor 39 (FIG. 9A) sends the packet to the tenth unit processor 40. At the tenth unit processor 40, the value E at address 200 in its routing table (FIG. 9A) sends the packet to the eleventh unit processor 41. At the eleventh unit processor 41, the value E at address 200 in its routing table (FIG. 9A) sends the packet to the twelfth unit processor 42, and at the twelfth unit processor 42, the value S at address 200 in its routing table (FIG. 9B) sends the packet to the thirteenth unit processor 43.

[0041] At the thirteenth unit processor 43, the value P at address 200 in its routing table (FIG. 10E) causes the packet to be written to the processor port of the thirteenth unit processor 43, which is a gateway processor of the third cluster. The arithmetic processor of the thirteenth unit processor 43 resets the packet so that routing is performed by the intra-cluster local address 3 contained in the message packet, and transmits it. The message packet sent from the arithmetic processor of the thirteenth unit processor 43 is forwarded to the fourteenth unit processor 44 by the value S at address 3 in the routing table of the thirteenth unit processor 43. At the fourteenth unit processor 44, the value P at address 3 in its routing table (FIG. 10F) causes the packet to be written to the processor port of the fourteenth unit processor 44, which is the destination processor.

[0042] In this way, the path of each message transfer is determined by the settings of the routing tables of the unit processors shown in FIGS. 9 and 10.
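To check that the table settings of FIGS. 9 and 10 keep the two transfers apart, both packets can be traced against the same tables. The sketch below is illustrative only: `FIG`, `PANEL`, `LINKS`, and `trace` are hypothetical names, each panel lists only the entries quoted in the description, and the link map is limited to the hops the description mentions.

```python
# Entries quoted in the description, grouped by figure panel.
FIG = {
    "9A": {200: "E"},
    "9B": {200: "S"},
    "9C": {200: "P", 2: "S"},
    "9D": {2: "P"},
    "10E": {200: "P", 3: "S"},
    "10F": {3: "P"},
}

# Which panel applies to which unit processor.
PANEL = {
    31: "9A", 32: "9A", 33: "9A", 34: "9A", 39: "9A", 40: "9A", 41: "9A",
    35: "9B", 36: "9B", 42: "9B",
    37: "9C", 38: "9D", 43: "10E", 44: "10F",
}

# Links traversed in the two walk-throughs.
LINKS = {
    (31, "E"): 32, (32, "E"): 33, (33, "E"): 34, (34, "E"): 35,
    (35, "S"): 36, (36, "S"): 37, (37, "S"): 38,
    (39, "E"): 40, (40, "E"): 41, (41, "E"): 42,
    (42, "S"): 43, (43, "S"): 44,
}


def trace(source, cluster_addr, local_addr):
    """Return (gateway, path) for one inter-cluster transfer."""
    node, path, gateway = source, [source], None
    address = cluster_addr
    while True:
        direction = FIG[PANEL[node]][address]
        if direction != "P":
            node = LINKS[(node, direction)]
            path.append(node)
        elif gateway is None:
            # First P: the gateway switches routing to the local address.
            gateway, address = node, local_addr
        else:
            # Second P: delivery at the destination processor.
            return gateway, path


gw_a, path_a = trace(31, 200, 2)
gw_b, path_b = trace(39, 200, 3)
shared = set(path_a) & set(path_b)  # processors used by both transfers
```

Under these assumptions the first packet enters the third cluster through gateway 37 and the second through gateway 43, and `shared` is empty, so the two transfers pass through no common unit processor.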

【００４３】[0043]

[Effects of the Invention] As described above, according to the first invention, array clusters are formed in the array processor according to the size of the routing table, and the transfer destinations of some processors in a cluster are set to processors in other clusters, so that a plurality of clusters can be joined to form a large-scale array processor.

[0044] According to the second invention, a message packet can be sent along an arbitrary path in the array processor by rewriting the cluster address and local address entries in the routing tables.

[0045] Further, according to the third invention, message packets can be sent simultaneously from a plurality of unit processors to a plurality of unit processors within one cluster.

[Brief description of the drawings]

FIG. 1 is a diagram showing cluster division in the first invention.

FIG. 2 is a diagram showing the configuration of a unit processor.

FIG. 3 is a diagram showing an example of message transfer according to the first invention.

FIG. 4 is a diagram showing the routing table of each unit processor in the example of FIG. 3.

FIG. 5 is a diagram showing cluster division in the second invention.

FIG. 6 is a diagram showing an example of message transfer according to the second invention.

FIG. 7 is a diagram showing the routing table of each unit processor in the example of FIG. 6.

FIG. 8 is a diagram showing an example of message transfer according to the third invention.

FIG. 9 is a diagram showing the routing table of each unit processor in the example of FIG. 8.

FIG. 10 is a diagram showing the routing table of each unit processor in the example of FIG. 8.

[Explanation of symbols]

10 unit processor node
21 arithmetic processor
22 processor port
23 first communication port
24 second communication port
25 third communication port
26 fourth communication port
27 control circuit
28 routing table
31 first unit processor
32 second unit processor
33 third unit processor
34 fourth unit processor
35 fifth unit processor
36 sixth unit processor
37 seventh unit processor
38 eighth unit processor
39 ninth unit processor
40 tenth unit processor
41 eleventh unit processor
42 twelfth unit processor
43 thirteenth unit processor
44 fourteenth unit processor

Claims

(57) [Claims]

1. A message packet routing method for an array processor comprising a plurality of arithmetic processors and a low-dimensional mesh interconnection network for communication between the processors, in which a message packet is routed from a source processor to one or more destination processors, the array processor having, for each arithmetic processor in the network, means for forwarding a message packet, generated by the arithmetic processor or received on one of a plurality of connected links, to one or more destinations designated from among them, and routing information storage means, rewritable by the arithmetic processor, which designates from the address information in the message packet one or more links or the arithmetic processor to which the message packet is to be forwarded, wherein the array processor is divided into a plurality of clusters, and all or some of the addresses in the routing information storage means of all or some of the processors in a cluster are assigned to processors in other clusters, thereby enabling message transfer to processors in other clusters.

2. A message packet routing method for an array processor comprising a plurality of arithmetic processors and a low-dimensional mesh interconnection network for communication between the processors, in which a message packet is routed from a source processor to one or more destination processors, the array processor having, for each arithmetic processor in the network, means for forwarding a message packet, generated by the arithmetic processor or received on one of a plurality of connected links, to one or more destinations designated from among them, and routing information storage means, rewritable by the arithmetic processor, which designates from the address information in the message packet one or more links or the arithmetic processor to which the message packet is to be forwarded, wherein the array processor is divided into a plurality of clusters and one address is assigned to each cluster; when a source processor sends a message packet to a destination processor belonging to a different cluster, the packet includes the cluster address and the local address within the cluster and is transferred, by routing according to the cluster address, to a gateway processor having that cluster address in the cluster to which the destination processor belongs; and when the gateway processor receives a message packet in which the cluster address is designated, it resets the packet so that routing is performed by the local address and retransmits the packet, whereby the message is transferred to the destination processor.

3. The message packet routing method for an array processor according to claim 2, wherein, in each cluster, a plurality of processors in the cluster that are connected to the outside of the cluster are gateway processors having the same cluster address, and routing information is set so that the processors outside the cluster can each route to a gateway processor without their paths intersecting, whereby a plurality of message transfers to processors in the cluster are performed simultaneously.