JP2003196252A

JP2003196252A - Multi-processor system and its method of configuration

Info

Publication number: JP2003196252A
Application number: JP2001400867A
Authority: JP
Inventors: Akira Fujimaki; 朗藤巻; Hisao Hayakawa; 尚夫早川
Original assignee: Japan Science and Technology Corp
Current assignee: Japan Science and Technology Agency
Priority date: 2001-12-28
Filing date: 2001-12-28
Publication date: 2003-07-11

Abstract

<P>PROBLEM TO BE SOLVED: To provide a new multi-processor system capable of realizing a much larger data transfer capacity for an interconnection network and its method of configuration. <P>SOLUTION: An interconnection network is configured of a logic circuit based on a pulse logic whose operating time is hardly affected by wiring length. Especially, the interconnection network is configured of a plurality of routers and a line connecting those routers, and the time-division multiple transmission of packets is executed by a rate converting circuit, and an ultra-high speed logic circuit constituted of single magnetic flux quantum (magnetic flux quantized in an ultra-conductive loop including Josephson junction) is adopted in the rate converting circuit so that the interconnection network having a large transfer capacity can be realized. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、高性能計算機やサ
ーバーなどに好適なマルチプロセッサシステムとその構
成方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a multiprocessor system suitable for a high-performance computer, a server, etc., and a method of configuring the same.

【０００２】[0002]

【従来の技術】高性能計算機を構成するためのマルチプ
ロセッサ技術の研究、開発は、従来より、盛んに行われ
てきている。マルチプロセッサは、複数のプロセッサを
並列にに配置して、各プロセッサにプログラムを分散さ
せたりデータを分割したりして多重処理させ、システム
全体として高速化を図る方式である。2. Description of the Related Art Research and development of multiprocessor technology for constructing a high-performance computer have been actively carried out. The multiprocessor is a method in which a plurality of processors are arranged in parallel and a program is distributed to each processor or data is divided to perform multiple processing to increase the speed of the entire system.

【０００３】従来技術における典型的なマルチプロセッ
サのシステム構成例を図５の（ａ）、（ｂ）、（ｃ）に
示す。図中、１は独立したプロセッサユニットＰＵ、２
は独立したメモリＭ、３はＰＵごとのローカルメモリＬ
Ｍ、４はプロセッサユニットＰＵ間、あるいはプロセッ
サユニットＰＵとメモリＭ間などを結合して、データや
制御情報の転送をスイッチ制御する相互結合網である。A typical multiprocessor system configuration in the prior art is shown in FIGS. 5 (a), 5 (b) and 5 (c). In the figure, 1 is an independent processor unit PU, 2
Is an independent memory M, 3 is a local memory L for each PU
M and 4 are mutual coupling networks that couple the processor units PU, or between the processor units PU and the memory M, and switch control the transfer of data and control information.

【０００４】図５の（ａ）は、複数のメモリＭが独立し
て配置され、各プロセッサが各メモリＭのアドレス空間
を共有して相互結合網によりメモリＭを介してプロセッ
サユニットＰＵ間を結合する方式である。相互結合網は
メモリＭのアドレス空間をスイッチにより切り替えるた
め、各プロセッサから任意のメモリＭへのアクセス時間
は、アドレスによらず一定となる。図５の（ｂ）は、そ
れぞれのプロセッサユニットＰＵがローカルメモリＬＭ
と対で構成され、そのローカルメモリＬＭが他のプロセ
ッサユニットＰＵに付随したローカルメモリＬＭと相互
結合網を介して結合する方式である。図５の（ｃ）は、
プロセッサユニットＰＵがそれぞれ独自にメモリＭを持
ち、プロセッサユニットＰＵが相互結合網を介して直接
他のマイクロプロセッサと結びついているものである。
この場合、共有メモリは存在しない。相互結合網は、ル
ーターなどの、接続要求元からの制御情報により接続先
を自動選択して信号経路をルーティングするものである
ことができる。In FIG. 5A, a plurality of memories M are arranged independently, each processor shares the address space of each memory M, and the processor units PU are connected via the memories M by an interconnection network. It is a method to do. Since the mutual connection network switches the address space of the memory M by a switch, the access time from each processor to the arbitrary memory M is constant regardless of the address. In FIG. 5B, each processor unit PU has a local memory LM.
Is a system in which the local memory LM is connected to the local memory LM attached to another processor unit PU through an interconnection network. FIG. 5C shows
Each processor unit PU has its own memory M, and the processor unit PU is directly connected to another microprocessor via an interconnection network.
In this case, there is no shared memory. The interconnection network may be a router or the like that automatically selects a connection destination according to control information from a connection request source and routes a signal path.

【０００５】このようなマルチプロセッサのシステムを
実装する場合、従来は、複数のマイクロプロセッサ及び
メモリすべてを１つのプリント基板上に実装し、やはり
同じプリント基板上に実装したスイッチ専用集積回路Ｉ
ＣもしくはルーターＩＣを介して、各マイクロプロセッ
サ間あるいはマイクロプロセッサとメモリ間を相互結合
し、データの授受を行わせるようにしているのが一般的
であった。また、搭載するマイクロプロセッサの数が多
い場合には、基板を複数に分割して、マイクロプロセッ
サを分散実装することも行われていた。In the case of implementing such a multiprocessor system, conventionally, a plurality of microprocessors and memories are all mounted on one printed board, and an integrated circuit I dedicated to a switch is also mounted on the same printed board.
It has been common to mutually couple the respective microprocessors or between the microprocessor and the memory via C or a router IC so as to exchange data. Further, when the number of installed microprocessors is large, the board is divided into a plurality of pieces and the microprocessors are distributed and mounted.

【０００６】[0006]

【発明が解決しようとする課題】従来技術では、プリン
ト基板上の１本の配線のデータ転送容量は、プリント基
板の特性から数百Ｍｂｐｓが限度となっている。一方、
標準的なマイクロプロセッサの動作周波数は、すでに１
ＧＨｚを超えて２〜３ＧＨｚにも達する状況にあり、マ
イクロプロセッサの信号処理能力と基板のデータ転送容
量との乖離は大きなものとなっている。とりわけ複数の
マイクロプロセッサを結合させて並列動作させる構成の
マルチプロセッサシステムでは、マイクロプロセッサ間
の相互結合網が恒常的に混雑してデータ転送に遅れが生
じ、システムの処理性能向上を制約する大きな要因とな
っている。このように、マルチプロセッサシステムにお
ける相互結合網のデータ転送容量の不足は、今後ますま
す深刻な問題となることが予想される。In the prior art, the data transfer capacity of one wiring on the printed circuit board is limited to several hundred Mbps due to the characteristics of the printed circuit board. on the other hand,
The standard microprocessor operating frequency is already 1
The frequency exceeds GHz to reach 2-3 GHz, and the difference between the signal processing capacity of the microprocessor and the data transfer capacity of the board is large. In particular, in a multiprocessor system in which a plurality of microprocessors are connected to operate in parallel, the mutual connection network between the microprocessors is constantly crowded, causing a delay in data transfer, which is a major factor limiting the improvement in system processing performance. Has become. As described above, the lack of data transfer capacity of the interconnection network in the multiprocessor system is expected to become a serious problem in the future.

【０００７】また、プリント基板では、基板上の配線の
充放電時間がデータ転送能力を制約する重要な要因とな
る。今後、プリント基板の特性が改善されて、データ転
送能力が向上して行くことが考えられるが、その場合で
も本質的に大幅な向上を望むことは難しい。すなわち、
従来のマイクロプロセッサやメモリ、相互結合網のスイ
ッチの各インタフェースは、信号の１，０変化を中間レ
ベルに設定された閾値で検出するレベル論理にしたがっ
ている。たとえば図６に示すように、マイクロプロセッ
サからデータを転送する場合、データ転送が正常に行わ
れるためには、マイクロプロセッサの出力とそれにつな
がる相互結合網のスイッチもしくはメモリの入力は同じ
電位になる必要がある。しかし、このレベル論理による
出力側の電位と入力側の電位とが同じ電位になるには、
その間の配線が充放電される必要があり、その充放電時
間は配線の長さに比例する。そのため、たとえば１ＧＨ
ｚの信号を伝えられる距離は、数ｃｍ程度に制限されて
しまうという問題があった。Further, in the printed circuit board, the charging / discharging time of the wiring on the circuit board becomes an important factor that limits the data transfer capability. It is conceivable that the characteristics of the printed circuit board will be improved and the data transfer capability will be improved in the future, but even in that case, it is essentially difficult to expect a significant improvement. That is,
The interfaces of the conventional microprocessor, memory, and switch of the interconnection network follow the level logic for detecting 1,0 change of the signal with the threshold value set to the intermediate level. For example, as shown in FIG. 6, when data is transferred from a microprocessor, the output of the microprocessor and the input of the switch or memory of the interconnection network connected to the microprocessor must have the same potential in order for the data transfer to be performed normally. There is. However, in order for the potential on the output side and the potential on the input side to become the same potential by this level logic,
The wiring in the meantime needs to be charged / discharged, and the charging / discharging time is proportional to the length of the wiring. Therefore, for example, 1GH
There is a problem that the distance for transmitting the z signal is limited to about several cm.

【０００８】本発明は、以上の事情に鑑みて、従来技術
の問題点を解消しようとするものであり、相互結合網に
ついてより大きなデータ転送容量を実現できる新しいマ
ルチプロセッサシステムとその構成方法を提供するもの
である。In view of the above circumstances, the present invention is intended to solve the problems of the prior art, and provides a new multiprocessor system capable of realizing a larger data transfer capacity for an interconnection network and a method of configuring the same. To do.

【０００９】[0009]

【課題を解決するための手段】本発明は、上記の課題を
解決するため、マルチプロセッサシステムの相互結合網
にパルス論理にしたがう論理回路を採用する。パルス論
理は、図１（ａ）に示すように、入力パルスの有無で応
答し、レベル論理よりも配線の充放電時間の影響を著し
く小さくできる。本発明では、特にパルス論理回路とし
て、超高速の単一磁束量子論理回路（ＳＦＱ回路）を使
用するところに特徴がある。具体的には、マルチプロセ
ッサシステムの複数のマイクロプロセッサもしくはメモ
リを実装した基板上に、単一磁束量子論理回路で構成し
た相互結合網を形成することにより、大きな転送容量を
もつ相互結合網を有したマルチプロセッサ構成法を提供
する。In order to solve the above problems, the present invention employs a logic circuit according to pulse logic in an interconnection network of a multiprocessor system. As shown in FIG. 1A, the pulse logic responds to the presence or absence of an input pulse, and the influence of the charge / discharge time of the wiring can be made significantly smaller than that of the level logic. The present invention is particularly characterized in that an ultra-high speed single magnetic flux quantum logic circuit (SFQ circuit) is used as the pulse logic circuit. Specifically, by forming an interconnected network composed of a single magnetic flux quantum logic circuit on a substrate on which multiple microprocessors or memories of a multiprocessor system are mounted, an interconnected network with a large transfer capacity is provided. A multiprocessor configuration method is provided.

【００１０】単一磁束量子論理回路は、参考文献［吉田
二朗：単一磁束量子素子を用いた論理回路技術，応用物
理第６７巻第４号 1998 pp.410-416］などに紹介さ
れている。図１（ｂ）に、単一磁束量子論理回路の例と
して、ＲＳフリップフロップ回路を示す。図示の回路
は、二つのジョセフソン接合Ｊ₁，Ｊ₂をもつ超伝導ル
ープで構成される。はじめに回路の閉ループに周回電流
が流れていないリセット状態において、ジョセフソン接
合Ｊ₁側のＳ端子に入力パルスを加えると、Ｊ₁を流れ
る電流が臨海電流値を超えた時点でＪ₁が超伝導状態か
らスイッチし、Ｊ ₁に電圧が発生して閉ループを時計回
りに電流が流れる。Ｓ端子の入力パルスがなくなり、Ｊ
₁を流れる電流が小さくなるとＪ₁は超伝導状態に戻
り、閉ループを流れる電流が永久電流となって磁束の最
小単位Φ₀に量子化された磁束が保存される。この状態
がフリップフロップのセット状態となる。ここで、ジョ
セフソン接合Ｊ₂側のＲ端子にパルスが入力されるとＪ
₂が超伝導状態からスイッチし、Ｊ₂に発生した電圧に
より反時計回りに電流が流れて、先の時計回りの電流を
打消し、リセット状態に戻る。そして、このときのＪ₂
の電圧変化がＯｕｔ端子から出力される。Single flux quantum logic circuits are described in the reference [Yoshida
Jiro: Logic circuit technology using single-flux quantum devices, applications
Science Vol. 67, No. 4, 1998 pp.410-416]
Has been. FIG. 1B shows an example of a single magnetic flux quantum logic circuit and
Then, the RS flip-flop circuit is shown. Circuit shown
Is the two Josephson junction J₁, J₂With superconductivity
Group. First the circulating current in the closed loop of the circuit
In the reset state where no
Go J₁When an input pulse is applied to the S terminal on the₁Flow through
When the current exceeds the critical current value, J₁Is superconducting
Switch, J ₁Voltage is generated and the closed loop is rotated clockwise
Current flows. Input pulse of S terminal disappeared, J
₁If the current flowing through₁Returns to the superconducting state
The current flowing through the closed loop becomes a permanent current and
Small unit Φ₀The quantized magnetic flux is stored in. This state
Becomes the set state of the flip-flop. Where Jo
Sefson junction J₂When a pulse is input to the R terminal on the
₂Switches from the superconducting state, J₂To the voltage generated in
The current flows more counterclockwise and the previous clockwise current
Cancel and return to reset state. And J at this time₂
The voltage change is output from the Out terminal.

【００１１】[0011]

【発明の実施の形態】以下、本発明の好適な実施の態様
について説明する。BEST MODE FOR CARRYING OUT THE INVENTION Preferred embodiments of the present invention will be described below.

【００１２】図２は、本発明によるマルチプロセッサシ
ステムの実装構成例を示す。図中、１０は基板であり、
１１は基板１０上に配置された全部で８個のプロセッサ
モジュールであり、それぞれがプロセッサユニットＰＵ
とメモリＭを含む。１２は単一磁束量子論理回路で構成
した相互結合網のルーター、１３はルーター間を結合す
るジョセフソン伝送線路、１４はプロセッサモジュール
とルーター間を結合するジョセフソン伝送線路である。
ルーター１２は、ネットワーク間でデータをパケット形
式により指定された宛先に振り分けるデータ転送手段で
あり、図示の例では、２個のプロセッサモジュールに対
して１個ずつ設けられ、全体で４個のルーターがリング
状に結合されて相互結合網を構成している。しかし、こ
のような相互結合網の構成には、多くの変形が可能であ
る。FIG. 2 shows a mounting configuration example of the multiprocessor system according to the present invention. In the figure, 10 is a substrate,
Reference numeral 11 denotes a total of eight processor modules arranged on the substrate 10, each of which is a processor unit PU.
And memory M. Reference numeral 12 is a router of an interconnection network composed of a single magnetic flux quantum logic circuit, 13 is a Josephson transmission line connecting the routers, and 14 is a Josephson transmission line connecting the processor module and the router.
The router 12 is a data transfer means that distributes data to destinations specified by a packet format between networks. In the illustrated example, one router is provided for each of two processor modules, and a total of four routers are provided. They are connected in a ring shape to form an interconnection network. However, many variations are possible in the configuration of such an interconnection network.

【００１３】図２において、基板１０上に配置された複
数のプロセッサモジュール１１のプロセッサユニットＰ
ＵとメモリＭは、相互結合網の一つないし二つのルータ
ー１２を経由して、任意のプロセッサモジュールのプロ
セッサユニットＰＵやメモリＭとデータを交換し、全体
で１つの高性能計算機としての機能を発揮する。なお、
図示されたマルチプロセッサシステムの形態は例示的な
ものに過ぎず、図５の（ａ）、（ｂ）、（ｃ）に示され
るような他の任意の形態をとることができ、またプロセ
ッサモジュールの個数も任意でよい。In FIG. 2, a processor unit P of a plurality of processor modules 11 arranged on a substrate 10
The U and the memory M exchange data with the processor unit PU and the memory M of an arbitrary processor module via one or two routers 12 of the mutual connection network, and have a function as one high-performance computer as a whole. Demonstrate. In addition,
The form of the multiprocessor system shown is merely exemplary, and may take any other form as shown in FIGS. 5 (a), (b), (c), and may be a processor module. The number of may be arbitrary.

【００１４】ルーターを構成する単一磁束量子論理回路
は、ジョセフソン接合を含む超伝導ループ内で量子化さ
れた磁束（磁束量子と呼ぶ）ｌ個の有無を２値信号に対
応させて演算を行う。単一磁束量子論理回路では、磁束
という一種の電磁波を信号としているため、配線として
用いるジョセフソン伝送線路のバンド幅は線路１本あた
りで１００Ｇｂｐｓに達するほか、論理ゲートも１００
ＧＨｚ以上で動作し得る。本発明は、このような特徴を
有する単一磁束量子論理回路を前述した種々のマルチプ
ロセッサ構成における相互結合網として基板上に作製
し、超高速でのデータ伝送を可能にする。A single-flux-quantum logic circuit which constitutes a router performs an operation by correlating the presence or absence of l quantized magnetic fluxes (called flux quanta) in a superconducting loop including Josephson junctions with binary signals. To do. Since a single magnetic flux quantum logic circuit uses a kind of electromagnetic wave called magnetic flux as a signal, the Josephson transmission line used as a wiring has a bandwidth of 100 Gbps per line and a logic gate of 100.
It can operate above GHz. According to the present invention, the single-flux-quantum logic circuit having such characteristics is manufactured on the substrate as an interconnection network in the various multiprocessor configurations described above, and enables data transmission at an extremely high speed.

【００１５】マルチプロセッサにおいて相互結合網の抱
える大きな問題は、パケットの透過率である。例えば２
つのプロセッサが同時に１つのメモリアドレスにアクセ
スしようとした場合、衝突（メモリ・コンフリクト）が
起こり、通常は片方のパケットが廃棄されるので、透過
率を低下させる。これは、マルチプロセッサ全体の性能
向上を阻害する大きな要因となる。従来の相互結合網で
は、透過率は65％程度で頭打ちになる。将来のマイクロ
プロセッサにおける外部とのインターフェイスでは、信
号路１本でのバンド幅が３ＧＨｚ程度にまで拡大される
可能性があるが、このバンド幅は単一磁束量子論理回路
のバンド幅に比べるとはるかに低いものである。したが
って、３ＧＨｚ程度で入力もしくは出力される信号をパ
ケット化して、複数チャンネルの並列入力パケットを、
単一磁束量子論理回路で構成したレート変換回路を用い
て、時分割データ通信の手法により各チャンネルのパケ
ットを少しずつずらして一つの時系列信号に時分割多重
（マルチプレキシング）すれば、高速で宛先の異なる複
数のデータ交換を同時に実行することが可能となる。時
分割多重で伝送されたパケット信号は、伝送先で時分割
多重の逆の動作をするレート変換回路によりパケットの
分配（デマルチプレキシング）を行い、複数チャンネル
のパケットを空間分離して宛先に配信する。このような
レート変換回路を用いるとともに、ルーター同士をジョ
セフソン線路１２で接続することにより、複数のプロセ
ッサモジュール間の信号伝送時間を著しく短縮すること
ができる。A major problem of the interconnection network in the multiprocessor is packet transparency. Eg 2
If two processors try to access one memory address at the same time, a collision (memory conflict) occurs and one packet is usually dropped, thus reducing the transparency. This is a major factor that hinders the performance improvement of the entire multiprocessor. In the conventional interconnection network, the transmittance reaches a peak at around 65%. In the interface with the outside in the future microprocessor, the bandwidth in one signal path may be expanded to about 3 GHz, but this bandwidth is much larger than that of the single flux quantum logic circuit. It is very low. Therefore, a signal input or output at about 3 GHz is packetized, and parallel input packets of a plurality of channels are
By using a rate conversion circuit composed of a single magnetic flux quantum logic circuit, by gradually shifting the packets of each channel by the method of time division data communication and time division multiplexing (multiplexing) into one time series signal, high speed is achieved. It is possible to simultaneously perform a plurality of data exchanges with different destinations. A packet signal transmitted by time division multiplexing is distributed (demultiplexing) at the transmission destination by a rate conversion circuit that operates in reverse of time division multiplexing, and packets of multiple channels are spatially separated and delivered to the destination. To do. By using such a rate conversion circuit and connecting the routers by the Josephson line 12, the signal transmission time between the plurality of processor modules can be significantly shortened.

【００１６】図３は、レート変換回路の説明図である。
プロセッサユニットＰＵから出力される信号の周波数は
数ＧＨｚ程度に留まるが、単一磁束量子論理回路は数十
ＧＨｚのバンド幅を持つ。したがって、図に示すように
低速で並列に入ってきたパケットＡ，Ｂ，Ｃ，Ｄの入力
を、単一磁束量子論理回路による時分割レート変換回路
１５で時間をずらしながら高速時系列に並び替えること
をしても、単一磁束量子論理回路の高速性により十分に
処理可能である。このパケットの高レート化によりパケ
ットの廃棄率を低減し、結果としてパケットの透過率を
大幅に向上することが可能となる。FIG. 3 is an explanatory diagram of the rate conversion circuit.
The frequency of the signal output from the processor unit PU remains at around several GHz, but the single-flux quantum logic circuit has a bandwidth of several tens GHz. Therefore, as shown in the figure, the inputs of the packets A, B, C, and D that come in parallel at a low speed are rearranged into a high-speed time series while shifting the time by the time division rate conversion circuit 15 by the single magnetic flux quantum logic circuit. Even if it does, it can be sufficiently processed due to the high speed of the single flux quantum logic circuit. By increasing the packet rate, the packet discard rate can be reduced, and as a result, the packet transparency rate can be significantly improved.

【００１７】図４は、レート変換回路をどこの部分に設
けるかの例を示したものである。同図（ａ）はレート変
換回路をプロセッサユニットＰＵもしくはメモリＭ側に
設けたもので単一磁束量子論理回路を用いない例、同図
（ｂ）はレート変換回路を単一磁束量子論理回路による
相互結合網側に持たせた例である。FIG. 4 shows an example of where the rate conversion circuit is provided. The figure (a) is an example in which the rate conversion circuit is provided on the processor unit PU or the memory M side and the single flux quantum logic circuit is not used. The figure (b) shows the rate conversion circuit by the single flux quantum logic circuit. This is an example of having the mutual connection network side.

【００１８】図４（ａ）の例において、２０はマルチプ
ロセッサを構成する単位のプロセッサモジュールであ
る。２１はモジュール内のプロセッサユニットＰＵもし
くはメモリＭで、システム内でデータのアクセス元ある
いはアクセス先となる。２２はモジュール内に組み込ま
れたレート変換回路であり、プロセッサユニットＰＵも
しくはメモリＭと同じレベル論理に基づいて動作し、デ
ータの速度変換を行う。２３は単一磁束量子論理回路Ｓ
ＦＱで構成した相互結合網側のＳＦＱ転送回路であり、
ネットワークとモジュールとの間で、ルーター動作を行
う。２４は同様に単一磁束量子論理回路ＳＦＱで構成し
たＳＦＱネットワークであり、モジュール間を結合する
ＳＦＱ線路で構成される。この例の場合、プロセッサユ
ニットＰＵもしくはメモリＭの出力の帯域幅は３ＧＨｚ
程度であり、レート変換回路２２はレベル論理に基づい
て動作するので、単一磁束量子論理回路ＳＦＱを用いる
ときのように時分割の多重度を上げることはできない。
このため、例えば２値の入力信号を多値化する等の手法
により時間当たりの情報量を増加させるならば、出力可
能な通信容量を単一磁束量子論理回路での通信容量程度
にまで増やすことは可能である。ただし、多値信号をス
イッチあるいはルーター等で処理することは極めて難し
いので、レート変換回路２２から出力される多値信号
を、ＳＦＱ転送回路２３側で、再び２値信号に変換し、
図３に示したような形の時系列データとして処理する必
要がある。この手法は、マイクロプロセッサの抱える大
きな問題の１つであるインターフェイスのピン数による
限界（ピンネック）を緩和する有効な手段とも成り得
る。In the example of FIG. 4A, reference numeral 20 is a processor module as a unit constituting a multiprocessor. Reference numeral 21 denotes a processor unit PU or memory M in the module, which is an access source or an access destination of data in the system. Reference numeral 22 is a rate conversion circuit incorporated in the module, which operates based on the same level logic as that of the processor unit PU or the memory M, and performs speed conversion of data. 23 is a single magnetic flux quantum logic circuit S
It is an SFQ transfer circuit on the side of the mutual coupling network composed of FQ,
Performs router operation between the network and the module. Reference numeral 24 is an SFQ network similarly composed of a single magnetic flux quantum logic circuit SFQ, and is composed of an SFQ line connecting modules. In the case of this example, the bandwidth of the output of the processor unit PU or the memory M is 3 GHz.
Since the rate conversion circuit 22 operates based on the level logic, it is not possible to increase the degree of time division multiplexing as in the case of using the single flux quantum logic circuit SFQ.
For this reason, if the amount of information per unit time is increased by a method such as multileveling a binary input signal, the communication capacity that can be output should be increased to the communication capacity of a single magnetic flux quantum logic circuit. Is possible. However, since it is extremely difficult to process a multilevel signal with a switch or a router, the multilevel signal output from the rate conversion circuit 22 is converted into a binary signal again on the SFQ transfer circuit 23 side,
It is necessary to process as time-series data of the form shown in FIG. This method can also be an effective means of relaxing the limit (pin neck) due to the number of pins of the interface, which is one of the major problems of microprocessors.

【００１９】一方、図４（ｂ）は、レート変換回路の機
能を相互結合網のルーターに置くもので、図４（ａ）の
ＳＦＱ転送回路２３の機能と合わせて、ＳＦＱレート変
換／転送回路２５として構成するものである。この例の
場合は、ピンネックの問題を解消できないものの、マイ
クロプロセッサのインターフェイス部に大きな変更を要
求しないで済むと言う利点がある。しかし、信号は、相
互結合網側の単一磁束量子論理回路を用いたルーター
で、時系列データに多重化するレート変換が必要とな
る。On the other hand, FIG. 4B shows that the function of the rate conversion circuit is placed in the router of the interconnection network, and the SFQ rate conversion / transfer circuit is combined with the function of the SFQ transfer circuit 23 of FIG. 4A. It is configured as 25. In the case of this example, although the problem of the pin neck cannot be solved, there is an advantage that it is not necessary to make a large change to the interface unit of the microprocessor. However, the signal is a router that uses a single-flux-quantum logic circuit on the side of the mutual coupling network, and requires rate conversion to be multiplexed with time-series data.

【００２０】さらに、単一磁束量子論理回路を用いて信
号伝送を行う他の利点は、伝搬波形の自己修復作用を持
つこと、すなわち、信号が伝搬中は波形が崩れないこと
である。これは、通常では放射などのエネルギーの散逸
により正確な情報伝送が行えなくなるような曲がった基
板もしくはフレキシブル基板であっても、単一磁束量子
論理回路では無歪で信号を送れることを意味する。従っ
て、そういった基板に相互結合網として単一磁束量子論
理回路を用いれば、より高密度にあるいはよりフレキシ
ブルにマイクロプロセッサを実装することが可能とな
る。このことは、マルチプロセッサシステムの性能を向
上させることにつながる。Furthermore, another advantage of performing signal transmission using a single-flux-quantum logic circuit is that it has a self-healing action of the propagation waveform, that is, the waveform does not collapse during the propagation of the signal. This means that even if the substrate is a curved substrate or a flexible substrate where accurate information transmission cannot be performed due to dissipation of energy such as radiation, a single flux quantum logic circuit can send a signal without distortion. Therefore, by using a single magnetic flux quantum logic circuit as an interconnection network on such a substrate, it becomes possible to mount the microprocessor in higher density or more flexibly. This leads to improved performance of multiprocessor systems.

【００２１】[0021]

【発明の効果】以上に説明したとおり、本発明により、
マルチプロセッサにおける相互結合網の転送容量を大幅
に向上させることが可能となる。この結果、マルチプロ
セッサは、より高速な信号処理もしくはより高い計算能
力を得ることができ、計算機のさらなる高性能化を図る
ことができる。As described above, according to the present invention,
It is possible to significantly improve the transfer capacity of the interconnection network in the multiprocessor. As a result, the multiprocessor can obtain faster signal processing or higher computing capacity, and can further improve the performance of the computer.

[Brief description of drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明によるマルチプロセッサシステムの実装
構成例を示す概要図である。FIG. 2 is a schematic diagram showing an example of a mounting configuration of a multiprocessor system according to the present invention.

【図３】図３は、レート変換回路の説明図である。FIG. 3 is an explanatory diagram of a rate conversion circuit.

【図４】レート変換回路の挿入例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of inserting a rate conversion circuit.

【図５】従来技術における典型的なマルチプロセッサの
システム構成例を示す概要図である。FIG. 5 is a schematic diagram showing an example of a typical multiprocessor system configuration in a conventional technique.

【図６】従来の論理回路のレベル論理の説明図である。FIG. 6 is an explanatory diagram of level logic of a conventional logic circuit.

[Explanation of symbols]

１：プロセッサユニットＰＵ２：メモリＭ３：ローカルメモリＬＭ４：相互結合網１０：基板１１：プロセッサモジュール１２：単一磁束量子論理回路で構成した相互結合網のル
ーター１３：ルーター間を結合するジョセフソン伝送線路１４：プロセッサモジュールとルーター間を結合するジ
ョセフソン伝送線路１５：レート変換回路1: Processor unit PU 2: Memory M 3: Local memory LM 4: Mutual coupling network 10: Substrate 11: Processor module 12: Router of mutual coupling network composed of single flux quantum logic circuit 13: Joseph coupling between routers Son transmission line 14: Josephson transmission line 15 for coupling between the processor module and the router: Rate conversion circuit

Claims

[Claims]

1. A multiprocessor system comprising a plurality of modules equipped with a microprocessor and a memory, wherein the modules are interconnected by an interconnected network, wherein the interconnected network is constructed using pulse logic. Characteristic multiprocessor system.

2. The multiprocessor system according to claim 1, wherein the mutual coupling network is configured by using a single magnetic flux quantum logic circuit.

3. The interconnection network comprises a router provided for each module, a line connecting between the router and the microprocessor and memory in the module, and a line connecting the routers. The multiprocessor system according to claim 1 or 2.

4. The multiprocessor system according to claim 2, wherein the interconnection network includes a rate conversion circuit.

5. The multiprocessor system according to claim 4, wherein the rate conversion circuit has a function of time-division multiplexing packets input in parallel and converting the packets into a high-speed time series signal.

6. The interconnection network comprises a line connecting between a microprocessor and a memory in a module and a router,
The multiprocessor system according to claim 3, wherein the line connecting the routers is a Josephson line.

7. A multiprocessor system comprising a plurality of modules equipped with a microprocessor and a memory, wherein the modules are interconnected using a logic circuit according to pulse logic.

8. The logic circuit according to the pulse logic is constituted by a single flux quantum logic circuit.
The multiprocessor configuration method described in.

9. The modules are interconnected by a router provided for each module, a line connecting between the router and the microprocessor and memory in the module, and a line connecting the routers. The multiprocessor configuration method according to claim 7 or 8.

10. The multiprocessor configuration method according to claim 7, wherein rate conversion is performed when mutual coupling is performed.

11. The multiprocessor configuration method according to claim 10, wherein the rate conversion is performed by time-division multiplexing packets input in parallel and converting the packets into a high-speed time series signal.

12. The method for constructing a multiprocessor according to claim 9, wherein a line connecting between the router and the microprocessor and memory in the module and a line connecting between the routers are composed of Josephson lines. .