JP2007510376A

JP2007510376A - Non-blocking and deterministic unicast packet scheduling

Info

Publication number: JP2007510376A
Application number: JP2006538294A
Authority: JP
Inventors: コンダ，ベンカット
Original assignee: チークテクノロジーズ，インク．
Priority date: 2003-10-30
Filing date: 2004-10-29
Publication date: 2007-04-19
Also published as: EP1690159A2; WO2005045633A2; IL175337A0; CA2544219A1; WO2005045633A3

Abstract

【課題】
【解決手段】相互接続ネットワークを介してユニキャストパケットをスケジューリングするシステムであって、それぞれがｒ_２個の入力キューを有するｒ_１個の入力ポートと、それぞれがｒ_１個の出力キューを有するｒ_２個の出力ポートと、少なくとも式（Ｉ）

のスピードアップとサブネットワークを有する相互接続ネットワークとを有するシステムであって、各サブネットワークは、各サブネットワークは総数少なくともｒ_１個の第１内部リンクに対する各入力ポートに接続される、少なくとも１個の第１内部リンクを備え、各サブネットワークはさらに総数少なくともｒ_２個の第２内部リンクに対する各出力ポートに接続される、少なくとも１個の第２内部リンクを備える。本システムは、決定論的な方式で、かつパケットのセグメント化および再組立を要求することなく、ｒ_１≦ｒ_２のときは、スイッチングすべき各スイッチング時間において、多くともてｒ_１回のスイッチング時間だけ、多くともｒ_１個のパケットをスケジューリングし、ｒ_２≦ｒ_１のときは、スイッチングｓｙべき各スイッチング時間において、多くともｒ_１回のスイッチング時間だけ、多くともｒ_２個のパケットをスケジューリングすることで、本発明による厳密にノンブロッキング方式で動作する。本システムはさらに、１００％スループット、処理保存的、公正に、それでいてなお決定論的に動作し、それにより出力ポートを決して輻輳させない。本システムは調停に対して１度だけの繰り返しで、相互接続ネットワークにおいて数学的に最小のスピードアップで実行する。本システムは、相互接続ネットワークにおけるパケット再順序付けの問題と、パケットの内部バッファリングが全くなく動作し、従って真にカットスルーかつ分散した方式で動作する。一実施形態において、本システムは、唯一のサブネットワークで、かつそのサブネットワークを介して２倍のスイッチングレートで、厳密にノンブロッキング方式で動作する。別の実施形態において、本システムは、相互接続ネットワークにおいて、少なくとも式（II）

のスピードアップで、再配置可能なノンブロッキング方式で動作する。入力ポートの数ｒ_１が出力ポートの数ｒ_２に等しく、ｒ_１＝ｒ_２＝ｒのとき、少なくとも式（III）

のスピードアップを有する相互接続ネットワークが、スイッチングすべき各スイッチング時間において、多くともｒ回のスイッチング時間だけ決定論的な方式で、多くともｒ個のパケットをスケジューリングすることで、本発明による厳密にノンブロッキングで決定論的方式で動作する。そして相互接続ネットワーク内におきて、少なくとも式（IV）

のスピードアップで、システムは再配置可能なノンブロッキングで決定論的方式で動作する。本システムは、入力ポートから出力ポートへのパケットに対するエンドツーエンド保証帯域幅およびレイテンシを提供する。全ての実施形態において、相互接続ネットワークは、クロスバーネットワーク、共有メモリネットワーク、クロスネットワーク、ハイパーキューブネットワーク、任意の内部ノンブロッキング相互接続ネットワークまたはネットワークのネットワークであってよい。
【選択図】図１Ｈ【Task】
The present invention provides a system for scheduling a unicast packet via the interconnect network, and r ₁ input ports each having r ₂ inputs queues, r, each having _one output queue r ₂ output ports and at least the formula (I)

And an interconnect network with sub-networks, each sub-network having at least _one sub-network connected to each input port for a total of at least r ₁ first internal links And each sub-network further comprises at least one second internal link connected to each output port for a total of at least r ₂ second internal links. The system is deterministic and does not require packet segmentation and reassembly, and when r ₁ ≦ r ₂ , at most r ₁ switching at each switching time to be switched. only time, schedules r ₁ one packet at most, when the r ₂ ≦ r _1, at each switching time should switching sy, with r ₁ times the switching time much, schedules r ₂ pieces of packets at most Thus, the operation is strictly non-blocking according to the present invention. The system further operates 100% throughput, conservatively, fairly, yet still deterministic, thereby never congesting the output port. The system runs with only one iteration for arbitration and with minimal mathematical speedup in the interconnect network. The system operates without any packet reordering issues in the interconnect network and no internal buffering of the packets, and thus operates in a truly cut-through and distributed manner. In one embodiment, the system operates in a strictly non-blocking manner with only one sub-network and twice the switching rate through that sub-network. In another embodiment, the system comprises at least formula (II) in an interconnect network

It works in a non-blocking manner that can be rearranged at a speedup of. When the number of input ports r ₁ is equal to the number of output ports r ₂ and r ₁ = r ₂ = r, at least the formula (III)

An interconnected network with a speedup of strictly schedules at most r packets at each switching time to be switched in a deterministic manner at most r switching times. It works in a non-blocking and deterministic manner. And at least the formula (IV)

With speedup, the system operates in a relocatable non-blocking, deterministic manner. The system provides end-to-end guaranteed bandwidth and latency for packets from input ports to output ports. In all embodiments, the interconnect network may be a crossbar network, a shared memory network, a cross network, a hypercube network, any internal non-blocking interconnect network, or a network of networks.
[Selection] Figure 1H

Description

関連出願に対する相互参照
本願は、２００３年１０月３０日出願の米国仮特許出願番号６０／５１６，０５７に関し、それに対する優先権を主張する。本願は、本願と同一の出願人に譲渡され、同時出願された、Venkat Kondaによる「NONBLOCKING AND DETERMINISTIC UNICAST PACKET SCHEDULING」と題する関連米国特許出願、代理人整理番号Ｖ−０００５に対するＰＣＴ出願であり、その全体を参照により組み込む。本願は、本願と同一の出願人に譲渡された２００１年９月２７日出願のＶｅｎｋａｔＫｏｎｄａによる「REARRANGEABLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS」と題する関連米国特許出願番号０９／９６７，８１５およびその一部継続出願である２００３年９月６日出願のＰＣＴ出願番号ＰＣＴ／ＵＳ０３／２７９７１に関し、その全体を参照により組み込む。本願は、本願と同一の出願人に譲渡され、Venkat Kondaによる「STRICTLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS」と題する２００１年９月２７日出願の関連米国特許出願番号０９／９６７，１０６およびその一部継続出願である２００３年９月６日出願のＰＣＴ出願番号ＰＣＴ／ＵＳ０３／２７９７２に関し、その全体を参照により組み込む。 This application claims and claims priority to US Provisional Patent Application No. 60 / 516,057, filed Oct. 30, 2003. This application is a related US patent application entitled "NONBLOCKING AND DETERMINISTIC UNICAST PACKET SCHEDULING" by Venkat Konda, assigned to the same applicant as the present application, and a PCT application for agent serial number V-0005. Incorporated by reference in its entirety. This application is related to US patent application Ser. No. 09 / 967,815 entitled “REARRANGEABLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS” by Venkat Konda, filed Sep. 27, 2001, assigned to the same applicant as the present application. With regard to PCT application number PCT / US03 / 27971, filed on September 6, 2003, which is a continuation-in-part application, the entirety is incorporated by reference. This application is assigned to the same applicant as this application and is related to US patent application Ser. No. 09 / 967,106 filed Sep. 27, 2001 and entitled “STRICTLY NON-BLOCKING MULTICAST MULTI-STAGE NETWORKS” by Venkat Konda. With regard to PCT application number PCT / US03 / 27972, filed on September 6, 2003, which is a part continuation application, the entirety is incorporated by reference.

本願は、２００３年９月６日出願の米国仮特許出願番号６０／５００，７９０およびその米国特許出願番号１０／９３３，８９９、ならびに、その２００４年９月５日出願のＰＣＴ出願番号０４／２９０４３に関し、その全体を参照により組み込む。本願は、２００３年９月６日出願の米国仮特許出願番号６０／５００，７８９およびその米国特許出願番号１０／９３３，９００、およびその２００４年９月５日出願のＰＣＴ出願番号０４／２９０２７に関し、その全体を参照により組み込む。 This application includes US Provisional Patent Application No. 60 / 500,790 filed September 6, 2003 and US Patent Application No. 10 / 933,899, and PCT Application No. 04/29043 filed September 5, 2004. Is incorporated by reference in its entirety. This application is related to US Provisional Patent Application No. 60 / 500,789, filed Sep. 6, 2003, and US Patent Application No. 10 / 933,900, and PCT Application No. 04/29027, filed Sep. 5, 2004. , Incorporated by reference in its entirety.

本願は、２００３年１０月３０日出願の関連米国仮特許出願番号６０／５１６，２６５およびその米国特許出願、代理人整理番号Ｖ−０００６、および同時出願されたそのＰＣＴ出願、代理人整理番号Ｓ−０００６に関し、その全体を参照により組み込む。本願は、２００３年１０月３０日出願の米国仮特許出願番号６０／５１６，１６３、米国特許出願、代理人整理番号Ｖ−０００９、および同時出願されたそのＰＣＴ出願、代理人整理番号Ｓ−０００９に関し、その全体を参照により組み込む。本願は、２００３年１０月３０日出願の米国仮特許出願番号６０／５１５，９８５、米国特許出願、代理人整理番号Ｖ−００１０、および同時出願されたそのＰＣＴ出願、代理人整理番号Ｓ−００１０に関し、その全体を参照により組み込む。 This application is related to US Provisional Patent Application No. 60 / 516,265, filed Oct. 30, 2003, and its US Patent Application, Attorney Docket No. V-0006, and its concurrently filed PCT application, Attorney Docket No. S. -0006 is incorporated by reference in its entirety. This application includes US Provisional Patent Application No. 60 / 516,163, filed Oct. 30, 2003, US Patent Application, Attorney Docket No. V-0009, and its concurrently filed PCT application, Attorney Docket No. S-0009. Is incorporated by reference in its entirety. This application includes US Provisional Patent Application No. 60 / 515,985, filed Oct. 30, 2003, US Patent Application, Attorney Docket No. V-0010, and its simultaneously filed PCT application, Attorney Docket No. S-0010. Is incorporated by reference in its entirety.

今日のＡＴＭスイッチおよびＩＰルータは通常、多種の相互接続ネットワークを使用し、入力ポート（「入口ポート」とも呼ばれる）から所望の出力ポート（「出口ポート」とも呼ばれる）へパケットをスイッチングする。相互接続ネットワークを介してパケットをスイッチングするには、入力ポートもしくは出力ポート、または入力ポートと出力ポートの双方でキューイングされる。パケットは１つまたは複数の出力ポートを宛先としてよい。１つの出力ポートのみを宛先とするパケットをユニキャストパケットと呼び、１つを超える出力ポートを宛先とするパケットをマルチキャストパケットと呼び、全ての出力ポートを宛先とするパケットをブロードキャストパケットと呼ぶ。 Today's ATM switches and IP routers typically use a variety of interconnected networks to switch packets from an input port (also referred to as an “ingress port”) to a desired output port (also referred to as an “egress port”). To switch packets through the interconnect network, they are queued at the input or output port, or both the input and output ports. The packet may be destined for one or more output ports. A packet destined for only one output port is called a unicast packet, a packet destined for more than one output port is called a multicast packet, and a packet destined for all output ports is called a broadcast packet.

出力キュー（ＯＱ）スイッチは出力ポートでのみキューを採用する。出力キュースイッチにおいては、入力ポートにおいてパケットを受信すると即座に宛先出力ポートキューにスイッチングする。パケットは、即座に出力ポートキューに転送されるので、ｒ＊ｒの出力キュースイッチにおいては、相互接続ネットワークにおいはｒのスピードアップを必要とする。入力キュー（ＩＱ）スイッチは、入力ポートでのみキューを採用する。入力キュースイッチは、相互接続ネットワークにおいて、１のみのスピードアップを必要とする。あるいは、ＩＱスイッチにおいてスピードアップは必要ない。しかしながら、入力キュースイッチは行頭（ＨＯＬ）ブロッキングを除去しない。これは、入力キューの行頭にあるパケットの宛先出力ポートがスイッチング時間においてビジーである場合、キュー内の次のパケットの宛先ポートが空いていても、これも遮断してしまうことを意味する。 The output queue (OQ) switch employs a queue only at the output port. In the output queue switch, when a packet is received at the input port, the packet is immediately switched to the destination output port queue. Since packets are immediately transferred to the output port queue, r * r output queue switches require r speedups in the interconnect network. An input queue (IQ) switch employs a queue only at the input port. The input queue switch requires only one speedup in the interconnect network. Alternatively, no speed up is required in the IQ switch. However, the input queue switch does not remove head of line (HOL) blocking. This means that if the destination output port of the packet at the head of the input queue is busy at the switching time, even if the destination port of the next packet in the queue is free, this is also blocked.

入出力複合キュー（ＣＩＯＱ）スイッチは、その入力ポートと出力ポートの双方でキューを採用する。このようなスイッチは、相互接続ネットワークにおいて１からｒの間のスピードアップを採用することにより、ＯＱとＩＱスイッチ双方で最良の結果を出す。仮想出力キュー（ＶＯＱ）スイッチと呼ばれる別種のスイッチは、各入力ポートに、各出力ポートのうち１つを宛先とするパケットにそれぞれ対応するｒ個のキューを持つ設計となっている。ＶＯＱスイッチはＨＯＬブロッキングを除去する。 An input / output composite queue (CIOQ) switch employs queues at both its input and output ports. Such switches give the best results with both OQ and IQ switches by employing speedups between 1 and r in the interconnect network. Another type of switch called a virtual output queue (VOQ) switch is designed to have r queues corresponding to packets destined for one of the output ports at each input port. The VOQ switch removes HOL blocking.

ＶＯＱスイッチは、近年非常に注目を集めている。「The iSLIP Scheduling Algorithm for Input-Queued Switches」と題するNick Mckeownの論文、IEEE/ACM Transactions on Networking、１９９９年４月、７巻２号は、本発明の背景として参照によりここに組込まれる。この論文には、１８８頁から１９０頁の序章において、クロスバーを基礎とする相互接続ネットワークのための数多くのスケジューリングアルゴリズムが説明されている。 The VOQ switch has attracted much attention in recent years. Nick Mckeown's paper entitled “The iSLIP Scheduling Algorithm for Input-Queued Switches”, IEEE / ACM Transactions on Networking, April 1999, Vol. In this article, in the introductory pages from page 188 to page 190, a number of scheduling algorithms for crossbar-based interconnection networks are described.

背景として参照によりここに組込まれる、Nick Mckeownに権利が付与された「Combined Unicast and Multicast Scheduling」と題する米国特許６，２１２，１８２号は、各入力ポートにおいてｒ個のユニキャストキューと１つのマルチキャストキューを用いるＶＯＱスイッチング技術について説明している。各スイッチング時間で、繰り返し調停が実行し、１個のパケットを各出力ポートにスイッチングする。 US Pat. No. 6,212,182 entitled “Combined Unicast and Multicast Scheduling” entitled “Nick Mckeown”, incorporated herein by reference as a background, describes r unicast queues and one multicast at each input port. A VOQ switching technique using a queue is described. Arbitrary arbitration is performed at each switching time, and one packet is switched to each output port.

背景として参照してここに組込まれるPrabhakarらに権利が付与された「Switching System and Methods of Operation of Switching System」と題する米国特許６，３５１，４６６号には、各入力ポートにｒ個のユニキャストキューと、各出力ポートに１つのマルチキャストキューを有し、パケットレイテンシの正確な制御を含む出力キュースイッチのようなクロスバー相互接続ネットワークのＶＯＱスイッチング技術に、少なくとも４のスピードアップが必要であると説明されている。 US Pat. No. 6,351,466 entitled “Switching System and Methods of Operation of Switching System”, which is entitled to Prabhakar et al., Incorporated herein by reference, has r unicasts for each input port. VOQ switching technology for crossbar interconnect networks such as queues and output queue switches that have precise control of packet latencies with one multicast queue for each output port requires at least 4 speedups Explained.

しかしながら、先行技術のスイッチファブリックには多くの問題がある。第１に、マルチキャストパケットに対するＨＯＬブロッキングが除去されない。第２に、相互接続内の数学的に最小のスピードアップが未知である。第３に、相互接続ネットワークにおけるスピードアップの使用は、出力ポートをフラッドさせ、これが出力ポートにおける不必要なパケット輻輳、及びパケットをエグレスポート外に送信するためのレート減少を招く。第４に、任意にファンアウトされるマルチキャストパケットは、出力ポートに対してノンブロッキング方式ではスケジューリングできない。第５に、各スイッチング時間でパケット調停が繰り返し実行され、スイッチング時間、コストおよび電力の点で高くつく。最後に第６として、現行技術は、スケジューリングを最長で非決定論的方式で実行するため、入力ポートと出力ポートでのセグメント化と再組立を必要とする。 However, there are many problems with prior art switch fabrics. First, HOL blocking for multicast packets is not removed. Second, the mathematical minimum speedup within the interconnect is unknown. Third, the use of speedup in the interconnect network floods the output port, which results in unnecessary packet congestion at the output port and a reduced rate for sending packets out of the egress port. Fourthly, a multicast packet that is arbitrarily fanned out cannot be scheduled in a non-blocking manner with respect to the output port. Fifth, packet arbitration is repeatedly performed at each switching time, which is expensive in terms of switching time, cost and power. Finally, sixthly, current technology requires segmentation and reassembly at the input and output ports to perform scheduling in the longest non-deterministic manner.

各入力ポートにユニキャストパケットを備える、複数の入力ポート、複数の出力ポート、及び複数の入力キューを有する相互接続ネットワークを介してマルチレートマルチキャストパケットをスケジューリングするシステムは、本発明に従って、各入力ポートから各出力ポートへの入力キューと多くとも同数のパケットをスケジューリングすることにより、ノンブロッキング方式で動作する。システムは１００％スループットで、処理保存的、公正ながら決定論的に動作し、それにより出力ポートを輻輳させることは決してない。本システムは、相互作用ネットワークにおける数学的に最小のスピードアップにより、調停を１度だけの繰り返しで実行する。システムは、相互接続ネットワークにおけるパケットの再順序付け問題と、パケットの内部バッファリングが全くなく、従って真にカットスルーで分散的な方式で動作する。別の実施形態において、各出力ポートも複数の出力キューを備え、各パケットは、宛先出力ポート内の出力キューまで、パケットのサイズに違いがあっても、ノンブロッキングで決定論的方式で、パケットのセグメント化及び再組立の必要なしに転送される。一実施形態において、スケジューリングは、相互接続ネットワークにおいて少なくとも２のスピードアップを用いて、厳密なノンブロッキング方式で実行される。別の実施形態において、スケジューリングは相互接続ネットワークにおいて、少なくとも１のスピードアップを用いて、再配置可能なノンブロッキング方式で実行される。本システムは、パケットのため、入力ポートから出力ポートまで、エンドツーエンド保証帯域幅とレイテンシトを提供する。全ての実施形態において、相互接続ネットワークは、クロスバーネットワークや、共有メモリネットワーク、クロス（Ｃｌｏｓ）ネットワーク、ハイパーキューブネットワーク、任意の内部ノンブロッキング相互接続ネットワーク、またはネットワークのネットワークであってよい。 A system for scheduling multi-rate multicast packets via an interconnection network having a plurality of input ports, a plurality of output ports, and a plurality of input queues, comprising unicast packets at each input port, according to the present invention, By scheduling at most as many packets as there are input queues to each output port, it operates in a non-blocking manner. The system operates at 100% throughput, is conservative, fair but deterministic, thereby never congesting the output port. The system performs arbitration with only one iteration with minimal mathematical speedup in the interaction network. The system operates in a truly cut-through and distributed manner with no packet reordering issues in the interconnect network and no internal packet buffering. In another embodiment, each output port also comprises a plurality of output queues, each packet being non-blocking and deterministic in the non-blocking and deterministic manner, up to the output queue in the destination output port, even if the packet size is different. Transfer without the need for segmentation and reassembly. In one embodiment, scheduling is performed in a strictly non-blocking manner with at least two speedups in the interconnect network. In another embodiment, scheduling is performed in a relocatable non-blocking manner with at least one speedup in the interconnected network. The system provides end-to-end guaranteed bandwidth and latency for packets from the input port to the output port. In all embodiments, the interconnect network may be a crossbar network, a shared memory network, a cross (Clos) network, a hypercube network, any internal non-blocking interconnect network, or a network of networks.

本発明は、トラフィックの性質に関わらず、ユニキャストパケットを備え、ユニキャストパケットは入力ポートに到着するスイッチファブリックにおける、ノンブロッキングで決定論的スケジューリングの設計および動作に関する。具体的には、本発明は、パケットスケジューリングシステムにおける以下の問題に関する。即ち、１）パケットの厳密に再配置可能なノンブロッキングスケジューリング、２）入力ポートから出力ポートまで（必要ならば出力ポートの特定の出力キューまで）決定論的に、即ち、出力ポートを輻輳させることなく、パケットをスイッチングすること、３）パケットのセグメント化と再組立（ＳＡＲ）の実装を必要としないこと、４）調停は、１度の繰り返しだけで行われること、５）相互接続ネットワークにおいて、数学的に最小のスピードアップを使用すること、６）パケットのサイズが可変な場合であっても、１００％スループットで公正な方式で動作することである。 The present invention relates to the design and operation of non-blocking, deterministic scheduling in a switch fabric comprising unicast packets, regardless of the nature of the traffic, where the unicast packets arrive at the input port. Specifically, the present invention relates to the following problems in a packet scheduling system. 1) strictly relocatable non-blocking scheduling of packets, 2) from input port to output port (if necessary, to a specific output queue of the output port) deterministically, ie without congesting the output port Switching packets, 3) not requiring packet segmentation and reassembly (SAR) implementation, 4) arbitration is done only once, 5) mathematics in the interconnect network 6) to operate in a fair manner with 100% throughput even if the packet size is variable.

入力ポートのパケットが１つより多い出力ポートを宛先としている場合、１対多のパケット転送が必要であり、マルチキャストパケットと呼ばれる。入力ポートのパケットがただ１つの出力ポートを宛先としている場合、１対１のパケット転送が必要であり、ユニキャストパケットと呼ばれる。入力ポートのパケットが全ての出力ポートを宛先としている場合、１対全のパケット転送が必要であり、ブロードキャストパケットと呼ばれる。相互接続ネットワークを介して転送されるセットのマルチキャストパケットは、マルチキャスト割当と呼ばれる。 When more than one output port packet is destined for an output port, one-to-many packet transfer is required, which is called a multicast packet. When a packet of an input port is destined for only one output port, a one-to-one packet transfer is required, which is called a unicast packet. When a packet of an input port is destined for all output ports, a one-to-all packet transfer is required, which is called a broadcast packet. A set of multicast packets transferred over an interconnect network is called a multicast assignment.

ここで説明する種類のスイッチファブリックは、入力ポートの仮想出力キュー（ＶＯＱ）を使用する。一実施形態において、各入力ポートで受信したパケットは、出力ポートにあるのと同数のキューに配置される。各キューは、出力ポートの１つのみを宛先とするパケットを保持する。スイッチファブリックは、出力ポートに出力キューがあることも、ないこともある。出力キューがある場合、一実施形態において、入力ポートにあるのと同数のキューが各出力ポートに存在する。パケットは、各出力キューが１つの入力ポートのみからスイッチングされたパケットを保持するように、出力キューにスイッチングされる。 The type of switch fabric described here uses a virtual output queue (VOQ) of input ports. In one embodiment, packets received at each input port are placed in as many queues as there are at output ports. Each queue holds packets destined for only one of the output ports. The switch fabric may or may not have an output queue at the output port. If there are output queues, in one embodiment there are as many queues at each output port as there are at the input ports. Packets are switched to output queues such that each output queue holds packets switched from only one input port.

ここで説明した種類のスイッチファブリックのなかには、全ての入力ポート内の各入力キューが一定レートのユニキャストパケットを備え、出力ポート内で同等の帯域幅を配分する。一定レートのマルチキャストパケットを有し、すべての入力ポートの中に各入力キューを持つノンブロッキングで決定論的スイッチファブリックが、出力ポートにおいて同等の帯域幅を配分することは、先に参照により組込んだ米国特許出願、代理人整理番号Ｖ−０００６号及びＰＣＴ出願、代理人整理番号Ｓ−０００６号で詳しく説明されている。一定レートのマルチキャストパケットを有し、すべての入力ポート内に各入力キューを持つノンブロッキングで決定論的スイッチファブリックが、出力ポートにおいて同一の帯域幅を配分することは、先に参照により組込まれた米国特許出願、代理人整理番号Ｖ−０００９号及びそのＰＣＴ出願、代理人整理番号Ｓ−０００９号で詳しく説明されている。一定レートのマルチキャストパケットを有し、すべての入力ポート内に各入力キューを持つノンブロッキングで決定論的スイッチファブリックが、出力ポートに異なる帯域幅を配分し、先に参照により組み込んだ米国特許出願、代理人整理番号Ｖ−００１０、およびそのＰＣＴ出願、代理人整理番号Ｓ−００１０で詳しく説明されている。 In a switch fabric of the type described here, each input queue in all input ports has a uniform rate of unicast packets and allocates equal bandwidth in the output ports. The non-blocking, deterministic switch fabric with a constant rate of multicast packets and with each input queue in all input ports allocated equal bandwidth at the output ports was previously incorporated by reference. It is described in detail in US patent application, agent serial number V-0006 and PCT application, agent serial number S-0006. The non-blocking, deterministic switch fabric with constant rate multicast packets and each input queue in every input port allocates the same bandwidth at the output port is the US previously incorporated by reference This is described in detail in the patent application, agent serial number V-0009 and its PCT application, agent serial number S-0009. A non-blocking, deterministic switch fabric with a constant rate of multicast packets, with each input queue in every input port, allocates different bandwidth to the output ports, previously incorporated by reference This is described in detail in Person Reference Number V-0010 and its PCT application, Agent Reference Number S-0010.

図１Ａを参照すると、２つの４×４クロスバーネットワーク１３１−１３２から成る相互ネットワークである中間ステージ１３０を経由し、４個の入力ポート１５１−１５４から成る入力ステージ１１０と、４個の出力ポート１９１−１９４から成る出力ステージ１２０とを有する例示的なスイッチファブリック１０が示されている。各入力ポート１５１−１５４は、それぞれ入口リンク１４１−１４４を介してパケットを受信する。各出力ポート１９１−１９４は、それぞれ出口リンク２０１−２０４を介してパケットを送信する。各クロスバーネットワーク１３１−１３２は、８個のリンク（以下、「第１内部リンク」と呼ぶ）ＦＬ１−ＦＬ８を介して、４個の入力ポート１５１−１５４の各々に接続され、さらに８個のリンク（以下、「第２内部リンク」と呼ぶ）ＳＬ１−ＳＬ８を介して、４個の出力ポート１９１−１９４の各々に接続されている。図１Ａのスイッチファブリック１０において、入口リンク１４１−１４４、第１内部リンクＦＬ１−ＦＬ８、第２内部リンクＳＬ１−ＳＬ８、および出口リンク２０１−２０４の各々は、同一レートで動作する。 Referring to FIG. 1A, an input stage 110 consisting of four input ports 151-154 and four output ports go through an intermediate stage 130 which is a mutual network consisting of two 4 × 4 crossbar networks 131-132. An exemplary switch fabric 10 is shown having an output stage 120 comprised of 191-194. Each input port 151-154 receives a packet via an ingress link 141-144, respectively. Each output port 191-194 transmits a packet via an exit link 201-204, respectively. Each crossbar network 131-132 is connected to each of four input ports 151-154 via eight links (hereinafter referred to as "first internal links") FL1-FL8, It is connected to each of the four output ports 191 to 194 via links (hereinafter referred to as “second internal links”) SL1 to SL8. In the switch fabric 10 of FIG. 1A, each of the inlet links 141-144, the first internal links FL1-FL8, the second internal links SL1-SL8, and the outlet links 201-204 operate at the same rate.

出力ポート１９１−１９４を宛先とするパケットが各入力ポート１５１−１５４でそれぞれ入力キュー１７１−１７４内に置かれるように、各入力ポート１５１−１５４において、入口リンク１４１−１４４を介して受信されるパケットを、存在する出力ポートと同数の入力キュー１７１−１７４（４個）に、宛先出力ポートに従ってソートする。一実施形態において、図１Ａのスイッチファブリック１０に示すように、パケットは入力キューに置かれる前に、優先度キュー１６１−１６４内に置かれてもよい。各優先度キュー１６１−１６４は、優先度［１-ｆ］に対応するパケットを保持するｆ個のキューを含む。例えば、出力ポート１９１を宛先とするパケットは、そのパケットの優先度［１-ｆ］に基づく優先度キュー１６１内に置かれ、最も高い優先度のパケットを最初に入力キュー１７１内に置いた後に、次に高い優先度のパケットを置く。優先度キュー１６１−１６４の使用は、スイッチファブリック１０の動作に無関係なので、図１Ａのスイッチファブリック１０は、別の実施形態では優先度キュー１６１−１６４なしで実装することもできる（優先度キューの使用は本発明で説明する全ての実施形態に無関係なので、全ての実施形態は、優先度キューなしにノンブロッキングで決定論的方式で、実装することもできる）。 Packets destined for output ports 191-194 are received at each input port 151-154 via the ingress links 141-144 so that the packets are destined for each input port 151-154 at the respective input queues 171-174. The packets are sorted according to the destination output port into the same number of input queues 171-174 (4) as the existing output ports. In one embodiment, as shown in switch fabric 10 of FIG. 1A, packets may be placed in priority queues 161-164 before being placed in the input queue. Each priority queue 161-164 includes f queues that hold packets corresponding to the priority [1-f]. For example, a packet destined for the output port 191 is placed in the priority queue 161 based on the priority [1-f] of the packet, and the packet having the highest priority is first placed in the input queue 171. Put the next highest priority packet. Because the use of priority queues 161-164 is independent of the operation of switch fabric 10, switch fabric 10 of FIG. 1A may be implemented without priority queues 161-164 in another embodiment (priority queues Since use is irrelevant to all embodiments described in this invention, all embodiments can also be implemented in a non-blocking, deterministic manner without priority queues).

また、ネットワークは、入力ステージ１１０、出力ステージ１２０、および中間ステージ１３０の各々に結合し、入力ポート１５１−１５４から出力ポート１９１−１９４へパケットをスイッチングするスケジューラを含む。スケジューラは、中間ステージ１３０内の相互接続ネットワークを介するパスに対して利用可能な宛先のリストをメモリ内に維持している。 The network also includes a scheduler that couples to each of input stage 110, output stage 120, and intermediate stage 130 and switches packets from input ports 151-154 to output ports 191-194. The scheduler maintains a list of available destinations in memory for paths through the interconnect network in the intermediate stage 130.

一実施形態において、図１Ａに示すように、各出力ポート１９１−１９４は、存在する入力ポート（４個）と同数の出力キュー１８１−１８４から成り、入力ポート１５１−１５４からスイッチングされるパケットが各出力ポート１９１−１９４内の出力キュー１８１−１８４内にそれぞれ置かれるようになっている。図１Ａのスイッチファブリック１０における４個の入力ポート１５１−１５４内の各入力キュー１７１−１７４は、入力ポート１５１の入力キュー１７１内にＡ１−Ａ４を有し、出力ポートにスイッチングされる状態にある入力ポート１６４の４番目の入力キュー１７４内にＰ１−Ｐ４を有する例示的な４個のパケットを示す。４個の入力ポート１５１−１５４内の１６個の入力キュー全てにおける行頭パケットは、各々Ａ１−Ｐ１により指定される。 In one embodiment, as shown in FIG. 1A, each output port 191-194 consists of the same number of output queues 181-184 as the existing input ports (4), and packets switched from the input ports 151-154 are transmitted. These are placed in output queues 181 to 184 in the output ports 191 to 194, respectively. Each of the input queues 171 to 174 in the four input ports 151 to 154 in the switch fabric 10 of FIG. 1A has A1 to A4 in the input queue 171 of the input port 151 and is in a state of being switched to the output port. Illustrative four packets with P1-P4 in the fourth input queue 174 of input port 164 are shown. The head-of-line packets in all 16 input queues in the four input ports 151-154 are designated by A1-P1, respectively.

表１は、図１Ａのスイッチファブリック１０における入力キューと出力キューとの間の例示的なパケット割当を示す。Ｉ｛１，１｝で示される入力ポート１５１内の入力キュー１７１内のパケットは、Ｏ｛１，１｝で示される出力ポート１９１内の出力キュー１８１に対して割当てられてスイッチングされる。Ｉ｛１，２｝で示される入力ポート１５１内の入力キュー１７２内のパケットは、Ｏ｛２，１｝で示される出力ポート１９２内の出力キュー１８１に対して割当てられてスイッチングされる。同様に残りの１６個の入力キューにおけるパケットは、表１に示すように、残りの１６個の出力キューに割り当てられる。別の実施形態において、入力キューから出力キューへの割当は表１と異なってもよいが、本発明に従うと、各出力ポート内の出力キューにパケットをスイッチングするよう割り当てられた各入力ポート内の入力キューは１つのみであり、逆も真である。 Table 1 shows exemplary packet assignments between input queues and output queues in the switch fabric 10 of FIG. 1A. Packets in the input queue 171 in the input port 151 indicated by I {1,1} are allocated and switched to the output queue 181 in the output port 191 indicated by O {1,1}. The packet in the input queue 172 in the input port 151 indicated by I {1, 2} is assigned to the output queue 181 in the output port 192 indicated by O {2, 1} and switched. Similarly, packets in the remaining 16 input queues are assigned to the remaining 16 output queues as shown in Table 1. In another embodiment, the assignment of input queues to output queues may be different from Table 1, but according to the present invention, in each input port assigned to switch packets to the output queue in each output port. There is only one input queue and vice versa.

本発明によると、全て１６個のパケットＡ１−Ｐ１は４回のスイッチング時（以降、“ファブリックスイッチングサイクル”と呼ぶ）に、ノンブロッキング方式で、中間ステージ１３０内の相互接続ネットワーク経由で入力ポートから出力ポートにスイッチングされる。各スイッチング時間において、多くとも１個のパケットが各入力ポートからスイッチングされ、多くとも１個のパケットが各出力ポートにスイッチングされる。各入力ポートは４個のユニキャストパケットのみを受信することができるため、図１Ａのスイッチファブリック１０において入力ポート衝突は決して生じない。任意の入力ポートからの各入力キューは、各出力ポート内の唯一の指定出力キューにのみスイッチングされるため、図１Ａのスイッチファブリック１０において出力ポート衝突は決して生じない。従って、調停の３つのステップ、即ち入力ポートによる要求の生成、出力ポートによる許可の発行および入力ポートによる許可の承認、が要求される。ここで出願人は、図１Ａのスイッチファブリック１０内の出力ポート１９１−１９４にスイッチングすべき１６個のパケットＡ１−Ｐ１の決定論的およびノンブロッキングなスケジューリングに関する問題は、図１Ｃに示す３ステージクロスネットワーク１４のノンブロッキングなスケジューリングに関係するという重要な見方をしている。 According to the present invention, all 16 packets A1-P1 are output from the input port via the interconnection network in the intermediate stage 130 in a non-blocking manner at the time of switching four times (hereinafter referred to as “fabric switching cycle”). Switched to the port. At each switching time, at most one packet is switched from each input port and at most one packet is switched to each output port. Since each input port can only receive 4 unicast packets, input port collisions never occur in the switch fabric 10 of FIG. 1A. Since each input queue from any input port is switched to only one designated output queue in each output port, output port collisions never occur in the switch fabric 10 of FIG. 1A. Accordingly, three steps of arbitration are required: generation of a request by the input port, issue of permission by the output port, and approval of permission by the input port. Applicant now considers the problem with deterministic and non-blocking scheduling of 16 packets A1-P1 to be switched to output ports 191-194 in the switch fabric 10 of FIG. 1A is the three-stage cross network shown in FIG. 1C. It has an important view that it relates to 14 non-blocking scheduling.

図１Ｃを参照すると、中間ステージ１３０を経由して入力ステージ１１０と出力ステージ１２０との間の通信要求を満たす１０個のスイッチの時間−空間−時間（ＴＳＴ）構成で動作する例示的な対称型３ステージクロスネットワーク１４が示されている。ここで、入力ステージ１１０は、４個の４×２スイッチＩＳ１−ＩＳ４から成り、出力ステージ１２０は４個の２×４スイッチＯＳ１−ＯＳ４から成り、中間ステージ１３０は２個の４×４スイッチＭＳ１−ＭＳ２から成る。入力ステージ１１０内のスイッチそれぞれに対する入口リンクと、出力ステージ１２０内のスイッチそれぞれに対する出口リンクの数はｎで表され、入力ステージ１１０と出力ステージ１２０におけるスイッチの数はrで表される。２個の中間スイッチＭＳ１−ＭＳ２それぞれは、ｒ個のリンクを介してｒ個の入力スイッチそれぞれに接続され（例えば、リンクＦＬ１−ＦＬ４は入力スイッチＩＳ１−ＩＳ４それぞれから、中間スイッチＭＳ１に接続される）、ｒ個の第２内部リンクを介して出力スイッチそれぞれに接続される（例えば、リンクＳＬ１−ＳＬ４は、中間スイッチＭＳ１から出力スイッチＯＳ１−ＯＳ４それぞれに接続される）。ネットワークは、１６個の入口リンク、即ちＩ｛１，１｝−Ｉ｛４，４｝および、１６個の出口リンクＯ｛１，１｝−Ｏ｛４，４｝を有する。図１Ｃの３ステージクロスネットワーク１４において、図１Ａのスイッチファブリック１０とちょうど同じように、表１に示すように、全ての１６個の入力リンクは１６個の出力リンクに割り当てられる。図１Ｃのネットワーク１４は、中間ステージ１３０内のスイッチ数が

個に等しいとき、ユニキャスト接続要求に対して厳密なノンブロッキング方式で動作する（本発明の背景として参照により組み込んだ、Charles Closによる「A Study of Non-Blocking Switching Networks」、１９５３年１月、The Bell System Technical Journal発行、第３２巻、Ｎｏ，１、４０６−４２４頁を参照）。 Referring to FIG. 1C, an exemplary symmetric type operating in a 10-switch time-space-time (TST) configuration that satisfies the communication requirements between the input stage 110 and the output stage 120 via the intermediate stage 130. A three stage cross network 14 is shown. Here, the input stage 110 includes four 4 × 2 switches IS1 to IS4, the output stage 120 includes four 2 × 4 switches OS1 to OS4, and the intermediate stage 130 includes two 4 × 4 switches MS1. -Consist of MS2. The number of inlet links for each switch in the input stage 110 and the number of outlet links for each switch in the output stage 120 is represented by n, and the number of switches in the input stage 110 and the output stage 120 is represented by r. Each of the two intermediate switches MS1-MS2 is connected to each of the r input switches via r links (for example, the links FL1-FL4 are connected to the intermediate switch MS1 from each of the input switches IS1-IS4). ), Connected to each of the output switches via r second internal links (for example, the links SL1-SL4 are connected from the intermediate switch MS1 to each of the output switches OS1-OS4). The network has 16 ingress links: I {1,1} -I {4,4} and 16 egress links O {1,1} -O {4,4}. In the three-stage cross network 14 of FIG. 1C, all 16 input links are assigned to 16 output links, as shown in Table 1, just like the switch fabric 10 of FIG. 1A. In the network 14 of FIG. 1C, the number of switches in the intermediate stage 130 is

Behave in a strict non-blocking manner for unicast connection requests (“A Study of Non-Blocking Switching Networks” by Charles Clos, incorporated by reference as the background of the present invention, January 1953, The (See Bell System Technical Journal, Vol. 32, No. 1, pages 406-424).

本発明によると、中間ステージ１３０内の２個の４×４クロスバーネットワーク１３１−１３２、即ち、２のスピードアップを有する一実施形態において、図１Ａのスイッチファブリック１０が厳密なノンブロッキング方式で動作する。その厳密にノンブロッキングで決定論的スイッチングの実装に使用する詳細な方法は、本発明の開示内容の観点から当業者には自明である、多くの様々な方法のいずれであってもよい。そのような調停およびスケジューリング方法の１つは、以下の図１Ｂにおいて述べる。 In accordance with the present invention, in one embodiment having two 4x4 crossbar networks 131-132 in the intermediate stage 130, ie, two speedups, the switch fabric 10 of FIG. 1A operates in a strictly non-blocking manner. . The detailed method used to implement the strictly non-blocking deterministic switching may be any of a number of different methods that will be apparent to those skilled in the art in view of the present disclosure. One such arbitration and scheduling method is described in FIG. 1B below.

表２は、一実施形態おいて、４回のスイッチング時それぞれにおける表１のパケット要求、許可と承認、に対するパケットのスケジュールを示し、これは図１Ｂの調停およびスケジューリング方法４０のスケジューリング部を使用して計算される。図１Ｄから図１Ｈは、各スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。図１Ｄは、パケットＡ１、Ｋ１、およびＰ１が出力キューにスイッチングされる第１のスイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＡ１は、クロスバーネットワーク１３１経由で、出力ポート１９１の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＦ１は、クロスバーネットワーク１３１経由で、出力ポート１９２の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＫ１は、クロスバーネットワーク１３２経由で、出力ポート１９３の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＰ１は、クロスバーネットワーク１３２経由で、出力ポート１９４の出力キュー１８４にスイッチングされる。最初のスイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートが唯一１個のパケットを受信することは明らかである。 Table 2 shows, in one embodiment, the packet schedule for the packet requests, grants and grants of Table 1 at each of the four switching times, using the scheduling part of the arbitration and scheduling method 40 of FIG. 1B. Is calculated. 1D to 1H show the state of the switch fabric 10 of FIG. 1A after each switching time. FIG. 1D shows the state of the switch fabric 10 of FIG. 1A after a first switching time in which packets A1, K1, and P1 are switched to the output queue. The packet A1 from the input port 151 is switched to the output queue 181 of the output port 191 via the crossbar network 131. The packet F1 from the input port 152 is switched to the output queue 182 of the output port 192 via the crossbar network 131. The packet K1 from the input port 153 is switched to the output queue 183 of the output port 193 via the crossbar network 132. The packet P1 from the input port 154 is switched to the output queue 184 of the output port 194 via the crossbar network 132. It is clear that at the first switching time, only one packet is switched from each input port and each output port receives only one packet.

図１ＥはパケットＤ１、Ｅ１、およびＯ１が出力キューにスイッチングされる第２スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＤ１は、クロスバーネットワーク１３１経由で、出力ポート１９４の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＥ１は、クロスバーネットワーク１３１経由で、出力ポート１９１の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＪ１は、クロスバーネットワーク１３２経由で、出力ポート１９２の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＯ１は、クロスバーネットワーク１３２経由で、出力ポート１９３の出力キュー１８４にスイッチングされる。再び、第２スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートが唯一１個のパケットを受信する。 FIG. 1E shows the state of the switch fabric 10 of FIG. 1A after a second switching time when packets D1, E1, and O1 are switched to the output queue. The packet D1 from the input port 151 is switched to the output queue 181 of the output port 194 via the crossbar network 131. The packet E1 from the input port 152 is switched to the output queue 182 of the output port 191 via the crossbar network 131. The packet J1 from the input port 153 is switched to the output queue 183 of the output port 192 via the crossbar network 132. The packet O1 from the input port 154 is switched to the output queue 184 of the output port 193 via the crossbar network 132. Again, at the second switching time, only one packet is switched from each input port, and each output port receives only one packet.

図１ＦはパケットＣ１、Ｈ１、Ｉ１、およびＮ１が出力キューにスイッチングされる第３スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＣ１は、クロスバーネットワーク１３１経由で、出力ポート１９３の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＨ１は、クロスバーネットワーク１３１経由で、出力ポート１９４の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＩ１は、クロスバーネットワーク１３２経由で、出力ポート１９１の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＮ１は、クロスバーネットワーク１３２経由で、出力ポート１９２の出力キュー１８４にスイッチングされる。再び、第３スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートが唯一１個のパケットを受信する。 FIG. 1F shows the state of the switch fabric 10 of FIG. 1A after a third switching time when packets C1, H1, I1, and N1 are switched to the output queue. The packet C1 from the input port 151 is switched to the output queue 181 of the output port 193 via the crossbar network 131. The packet H1 from the input port 152 is switched to the output queue 182 of the output port 194 via the crossbar network 131. The packet I1 from the input port 153 is switched to the output queue 183 of the output port 191 via the crossbar network 132. The packet N1 from the input port 154 is switched to the output queue 184 of the output port 192 via the crossbar network 132. Again, at the third switching time, only one packet is switched from each input port and each output port receives only one packet.

図１Ｇは、パケットＢ２、Ｇ２、およびＬ１が出力キューにスイッチングされる第４スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＢ２は、クロスバーネットワーク１３２経由で、出力ポート１９２の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＧ１は、クロスバーネットワーク１３１経由で、出力ポート１９３の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＬ１は、クロスバーネットワーク１３１経由で、出力ポート１９４の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＭ１は、クロスバーネットワーク１３２経由で、出力ポート１９１の出力キュー１８４にスイッチングされる。第４スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートが唯一１個のパケットを受信することは明らかである。 FIG. 1G shows the state of the switch fabric 10 of FIG. 1A after a fourth switching time in which packets B2, G2, and L1 are switched to the output queue. The packet B2 from the input port 151 is switched to the output queue 181 of the output port 192 via the crossbar network 132. The packet G1 from the input port 152 is switched to the output queue 182 of the output port 193 via the crossbar network 131. The packet L1 from the input port 153 is switched to the output queue 183 of the output port 194 via the crossbar network 131. The packet M1 from the input port 154 is switched to the output queue 184 of the output port 191 via the crossbar network 132. It is clear that at the fourth switching time, only one packet is switched from each input port and each output port receives only one packet.

図１Ｈは、パケットが、第１スイッチング時間においてＡ１、Ｆ１、Ｋ１およびＰ１がスイッチされるのと同じように、Ａ２、Ｆ２、Ｋ２、およびＰ２が出力キューにスイッチングされる第５スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＡ２は、クロスバーネットワーク１３１経由で、出力ポート１９１の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＦ２は、クロスバーネットワーク１３１経由で、出力ポート１９２の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＫ２は、クロスバーネットワーク１３２経由で、出力ポート１９３の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＰ２は、クロスバーネットワーク１３２経由で、出力ポート１９４の出力キュー１８４にスイッチングされる。従って図１Ｂの調停およびスケジューリング方法４０は、最初のファブリックスイッチングサイクルに対するスケジューリングが実行された後に、再スケジューリングする必要はない。従って、任意の特定の入力キューから宛先出力キューへのパケットは、同じパスに沿ってスイッチングされ、入力ポートがそれらを受信するのと同じ順番で移動し、従ってパケットの再順序付け問題は決して生じない。 FIG. 1H shows the packet after the fifth switching time when A2, F2, K2, and P2 are switched to the output queue in the same way that A1, F1, K1, and P1 are switched at the first switching time. The state of switch fabric 10 of Drawing 1A is shown. The packet A2 from the input port 151 is switched to the output queue 181 of the output port 191 via the crossbar network 131. The packet F2 from the input port 152 is switched to the output queue 182 of the output port 192 via the crossbar network 131. The packet K2 from the input port 153 is switched to the output queue 183 of the output port 193 via the crossbar network 132. The packet P2 from the input port 154 is switched to the output queue 184 of the output port 194 via the crossbar network 132. Accordingly, the arbitration and scheduling method 40 of FIG. 1B does not need to be rescheduled after scheduling for the first fabric switching cycle has been performed. Thus, packets from any particular input queue to the destination output queue are switched along the same path and move in the same order as the input ports receive them, thus never causing packet reordering problems .

ファブリックスイッチングサイクルにおいて、全ての１６個のパケットＡ１−Ｐ１が宛先出力ポートにスイッチングされるので、スイッチは本発明に従ってノンブロッキングであり１００％スループットで動作する。図１Ａのスイッチファブリック１０は、スイッチング時間において、パケットが少なくとも１個ある限り、各出力ポートは、その出力ポートを宛先とする入力キューのいずれか１つから、少なくとも１個のパケットを受信するよう動作するので、以後そのスイッチファブリックは、「処理保存的システム」と呼ばれる。スイッチファブリックは、ノンブロッキングである場合、直接的に処理保存的であることは簡単に分かる。本発明によると、図１Ａのスイッチファブリック１０は、各入力キューの行頭のパケットはどれも、各入力ポートでの入力キュー数（４）と同数のスイッチング回数より多く維持されないよう動作するため、以後そのスイッチファブリックは、「公正なシステム」と呼ばれる。仮想出力キューは行頭で使用されるため、ブロッキングも除去される。 Since all 16 packets A1-P1 are switched to the destination output port in the fabric switching cycle, the switch is non-blocking and operates at 100% throughput according to the present invention. The switch fabric 10 of FIG. 1A is such that each output port receives at least one packet from any one of the input queues destined for that output port as long as there is at least one packet at the switching time. In operation, the switch fabric is hereinafter referred to as a “process conservative system”. It can easily be seen that the switch fabric is directly process conservative when it is non-blocking. In accordance with the present invention, the switch fabric 10 of FIG. 1A operates so that none of the packets at the beginning of each input queue are maintained more than the same number of switching times as the number of input queues (4) at each input port. The switch fabric is called a “fair system”. Since the virtual output queue is used at the beginning of a line, blocking is also removed.

本発明によると、相互接続ネットワークにおいて２のスピードアップを使用してスイッチング時間において２個のパケットをスイッチング可能である場合でも、図１Ｂの調停およびスケジューリング方法４０を使用して、図１Ａのスイッチファブリック１０は、各出力ポートがスイッチ時に、多くとも１個のパケットを受信するように動作する。そのスピードアップは、相互接続ネットワークをノンブロッキング方式で動作させるためだけに厳密に使用され、かつ、出力ポートを決して輻輳させないように使用される。従って、図１Ａのスイッチファブリック１０内のパケットをスイッチングするための図１Ｃの調停およびスケジューリング方法４０は、決定論的である。各入口リンク１４１−１４４は各出口リンク２０１−２０４が送信するレートと同一レートで、即ち、各スイッチング時間において１個のパケットで、パケットを受信する。１個のパケットのみが各スイッチング時間で決定論的に各入力ポート１５１−１５４からスイッチされ、１個のパケットのみが各出力ポート１９１−１９４にスイッチングされるため、図１Ａのスイッチファブリック１０は決して出力ポートを輻輳させない。 In accordance with the present invention, the arbitration and scheduling method 40 of FIG. 1B can be used to switch the switch fabric of FIG. 1A even when two packets can be switched in switching time using two speedups in the interconnect network. 10 operates so that each output port receives at most one packet when switching. The speedup is used strictly to operate the interconnect network in a non-blocking manner and is used to never congest the output port. Accordingly, the arbitration and scheduling method 40 of FIG. 1C for switching packets in the switch fabric 10 of FIG. 1A is deterministic. Each ingress link 141-144 receives packets at the same rate as each egress link 201-204 transmits, ie, one packet at each switching time. Since only one packet is deterministically switched from each input port 151-154 at each switching time and only one packet is switched to each output port 191-194, the switch fabric 10 of FIG. Do not congest the output port.

本発明による決定論的なスイッチングの大きな利点は、パケットが、高くともピークレートで入力ポートからスイッチングされることである。このことは、パケットが、高くともピークレートで出力ポートにおいて受信されることも意味する。それは、出力ポートでトラフィック管理が必要なく、パケットは出力ポートから決定論的に送信されることを意味する。従って、トラフィック管理は、図１Ａのスイッチファブリック１０内の入力ポートでのみ必要となる。 A major advantage of deterministic switching according to the present invention is that packets are switched from the input port at the highest peak rate. This also means that packets are received at the output port at the highest peak rate. That means no traffic management is required at the output port and packets are sent deterministically from the output port. Accordingly, traffic management is only required at the input port in the switch fabric 10 of FIG. 1A.

図１Ａのスイッチファブリック１０の別の重要な特性は、特別な入力キューに属する全てのパケットが、宛先出力ポート内の同じ出力キューにスイッチングされることである。出願人は、出力キューによる３つのキーとなる利点を指摘する。１）スイッチング時間において、１バイトまたは一定数のバイトが入力ポートから出力ポートにスイッチングされる。あるいは、スイッチファブリックのスイッチング時間は可変であるため、スイッチファブリックの設計段階で柔軟なパラメータである。２）従って、たとえパケットＡ１−Ｐ１が任意の長さであり、サイズが可変であっても、入力キュー内の各パケットが宛先出力ポート内の同じ出力キューにスイッチングされるため、完全なパケットがスイッチング時間においてスイッチングされる必要はない。あるいは、出力キューの第２の利点は、長いパケットを入力ポート内で物理的にセグメント化する必要がなく、出力ポート内で再配置する必要がないということである。パケットは、物理的にセグメント化されずに、１セグメントずつ出力キューに論理的にスイッチングされる（パケットセグメントのサイズはスイッチング時間によって決定される）。各パケット内のパケットセグメントも、入力キューから宛先出力キューへの同一パスを介してスイッチングされる。３）出力キューの第３の利点は、パケットとパケットセグメントは、入力ポートがそれらを受信するのと同じ順番でスイッチングされ、決してパケット再順序付けの問題が生じないことである。 Another important characteristic of the switch fabric 10 of FIG. 1A is that all packets belonging to a special input queue are switched to the same output queue in the destination output port. Applicants point out three key advantages with output queues. 1) In the switching time, one byte or a certain number of bytes are switched from the input port to the output port. Alternatively, since the switching time of the switch fabric is variable, it is a flexible parameter at the design stage of the switch fabric. 2) Therefore, even if the packets A1-P1 are of arbitrary length and the size is variable, each packet in the input queue is switched to the same output queue in the destination output port, so that the complete packet There is no need to be switched in the switching time. Alternatively, the second advantage of the output queue is that long packets do not need to be physically segmented within the input port and need not be relocated within the output port. Packets are not physically segmented, but are logically switched to the output queue one segment at a time (the size of the packet segment is determined by the switching time). The packet segments within each packet are also switched over the same path from the input queue to the destination output queue. 3) A third advantage of the output queue is that packets and packet segments are switched in the same order that the input ports receive them, and never encounter packet reordering issues.

図１Ｉは、長いパケットをスイッチングするスイッチファブリック１６を示す。各入力キュー内には１個のパケットがあり、これにより１６個の全ての入力キューには１６個のパケットを作る。即ち、入力ポート１５１内の入力キュー１７１にはパケット｛Ａ１−Ａ４｝が、入力ポート１５１内の入力キュー１７２にはパケット｛Ｂ１−Ｂ４｝が、入力ポート１５１内の入力キュー１７３にはパケット｛Ｃ１−Ｃ４｝などが作られ、入力ポート１５４内の入力キュー１７４にはパケット｛Ｐ１−Ｐ４｝がある。これらの１６個のパケットそれぞれは、４個の等しいサイズのパケットセグメントから成る。例えば、パケット｛Ａ１−Ａ４｝は４個のパケットセグメント、即ちＡ１、Ａ２、Ａ３およびＡ４から成る。パケットのサイズが完全にパケットセグメントのサイズの４倍でない場合、４番目のパケットのサイズはより短い。しかしながら、４個のパケットセグメントのうちどれも、パケットセグメントのサイズの最大値よりは長くない。パケットセグメントのサイズはスイッチング時間によって決定される。即ち、各スイッチング時間において、唯一のパケットセグメントが任意の入力ポートから任意の出力ポートにスイッチングされる。より長いパケットのサイズ除いて、図１Ｉのスイッチファブリック１６の図は図１Ａのスイッチファブリック１０の図と同じである。 FIG. 1I shows a switch fabric 16 that switches long packets. There is one packet in each input queue, which creates 16 packets in all 16 input queues. That is, the packet {A1-A4} is stored in the input queue 171 in the input port 151, the packet {B1-B4} is stored in the input queue 172 in the input port 151, and the packet {B1 is stored in the input queue 173 in the input port 151. C1-C4} and the like are created, and the input queue 174 in the input port 154 has a packet {P1-P4}. Each of these 16 packets consists of 4 equally sized packet segments. For example, packet {A1-A4} consists of four packet segments, namely A1, A2, A3 and A4. If the size of the packet is not completely four times the size of the packet segment, the size of the fourth packet is shorter. However, none of the four packet segments is longer than the maximum packet segment size. The size of the packet segment is determined by the switching time. That is, at each switching time, a unique packet segment is switched from any input port to any output port. Except for the longer packet size, the diagram of switch fabric 16 of FIG. 1I is the same as the diagram of switch fabric 10 of FIG. 1A.

図１Ｊから図１Ｍは、各ファブリックスイッチングサイクル後の図１Ｉのスイッチファブリック１６の状態を示す。図１Ｊは、全ての行頭パケットセグメントＡ１−Ｐ１が出力ポートにスイッチングされる第１ファブリックスイッチングサイクル後の、図１Ｉのスイッチファブリック１６の状態を示す。図１Ｄ−１Ｇに示すように、これらのパケットセグメントは、図１Ｂの調停およびスケジューリング方法４０を使用して、図１Ａのスイッチファブリック１０においてパケットＡ１−Ｐ１が出力キューにスイッチングされるのと全く同じ方法で、出力キューにスイッチングされる。図１Ｋは、全ての行頭パケットセグメントＡ１−Ｐ１が出力キューにスイッチング第２ファブリックスイッチングサイクル後の図１Ｉのスイッチファブリック１６の状態を示す。図１Ｌは、全ての行頭パケットセグメントＡ３−Ｐ３が出力キューにスイッチングされる第３ファブリックスイッチングサイクル後の図１Ｉのスイッチファブリック１６の状態を示す。図１Ｍは、全ての行頭パケットセグメントＡ１−Ｐ１が出力キューにスイッチングされる第４ファブリックスイッチングサイクル後の図１Ｉのスイッチファブリック１６の状態を示す。第１、第２、第３、および第４のファブリックスイッチングサイクルの各々において、図１Ｄ−１Ｇに示すように、パケットＡ１−Ｐ１が、図１Ａのスイッチファブリック１０において、出力キューにスイッチングされるのと全く同じ方法で、パケットセグメントは出力キューにスイッチングされる。明らかに、全てのパケットセグメントは、各々の入力ポートが受信するのと同じ順序でスイッチングされる。従って、パケット再順序付けの問題はない。パケットはさらに１００％スループット、処理保存的、および公正な方式でスイッチングされる。 1J through 1M show the state of the switch fabric 16 of FIG. 1I after each fabric switching cycle. FIG. 1J shows the state of the switch fabric 16 of FIG. 1I after the first fabric switching cycle in which all leading packet segments A1-P1 are switched to the output ports. As shown in FIGS. 1D-1G, these packet segments are exactly the same as packets A1-P1 are switched to output queues in the switch fabric 10 of FIG. 1A using the arbitration and scheduling method 40 of FIG. 1B. In the manner switched to the output queue. FIG. 1K shows the state of switch fabric 16 of FIG. 1I after the second fabric switching cycle when all leading packet segments A1-P1 are switched to the output queue. FIG. 1L shows the state of switch fabric 16 of FIG. 1I after a third fabric switching cycle in which all leading packet segments A3-P3 are switched to the output queue. FIG. 1M shows the state of the switch fabric 16 of FIG. 1I after the fourth fabric switching cycle in which all leading packet segments A1-P1 are switched to the output queue. In each of the first, second, third, and fourth fabric switching cycles, packets A1-P1 are switched to output queues in switch fabric 10 of FIG. 1A, as shown in FIGS. 1D-1G. The packet segment is switched to the output queue in exactly the same way. Obviously, all packet segments are switched in the same order that each input port receives. Thus, there is no problem of packet reordering. Packets are further switched in a 100% throughput, process conservative and fair manner.

図１Ｊ−１Ｍにおいて、パケットは論理的にセグメント化され、出力ポートにスイッチングされる。一実施形態において、タグビット「１」がさらに各パケットセグメントの特別に指定されたビット位置にパディングされ、そのパケットセグメントが各パケット内で最初のパケットセグメントであることを表す。「１」のタグビットを読むことで、出力ポートはパケットセグメントＡ１−Ｐ１が、新規パケット内の最初のパケットセグメントであることを認識する。同様に各パケットセグメントは、「０」でパディングされる最後のパケットセグメントを除いて、指定ビット位置に「１」のタグビットでパディングされる（例えば、図１Ｉのスイッチファブリック１６内のパケットセグメントにおいて、パケットセグメントＡ１−Ｐ１、Ａ２−Ｐ２およびＡ３−Ｐ３は「１」のタグビットでパディングされ、パケットセグメントＡ４−Ｐ４は「０」のタグビットでパディングされる）。タグビットが「０」として検出されるとき、出力ポートは次に新規パケットのパケットセグメントまたは新規パケットを期待する。パケット内に１個のみのパケットセグメントがある場合、それは入力ポートによって「０」のタグビットで表される。出力ポートが「０」の指定タグビットを有する２つの連続パケットセグメントを受信した場合、出力ポートは、２番目のパケットセグメントを新規パケットの唯一のパケットセグメントであると判定する。 In FIGS. 1J-1M, packets are logically segmented and switched to output ports. In one embodiment, the tag bit “1” is further padded to a specially designated bit position in each packet segment to indicate that the packet segment is the first packet segment in each packet. By reading the tag bit of “1”, the output port recognizes that the packet segment A1-P1 is the first packet segment in the new packet. Similarly, each packet segment is padded with a tag bit of “1” at the specified bit position, except for the last packet segment padded with “0” (eg, in the packet segment in switch fabric 16 of FIG. 1I). Packet segments A1-P1, A2-P2 and A3-P3 are padded with tag bits of “1” and packet segments A4-P4 are padded with tag bits of “0”). When the tag bit is detected as “0”, the output port then expects a packet segment or new packet for the new packet. If there is only one packet segment in the packet, it is represented by a tag bit of “0” by the input port. If the output port receives two consecutive packet segments with a designated tag bit of “0”, the output port determines that the second packet segment is the only packet segment of the new packet.

図１Ｉのスイッチファブリック１６において、パケットは４セグメントの長さである。しかしながら、一般にパケットは任意の長さである。加えて、同一キュー内の異なるパケットは異なるサイズでありうる。双方の場合において、図１Ｂの調停およびスケジューリング方法は、スイッチファブリックをノンブロッキング方式で動作させ、パケットは１００％スループット、処理保存的、および公正な方式でスイッチされる。さらに、入力ポート内でパケットを物理的にセグメント化し、出力ポート内で再組立する必要はない。スイッチファブリックのスイッチング時間は柔軟な設計パラメータでもあり、各スイッチング時間において１バイトずつ、または数バイトずつパケットをスイッチングするように設定される。 In the switch fabric 16 of FIG. 1I, the packet is 4 segments long. However, in general, packets are of arbitrary length. In addition, different packets in the same queue can be of different sizes. In both cases, the arbitration and scheduling method of FIG. 1B operates the switch fabric in a non-blocking manner, and packets are switched in a 100% throughput, process conservative, and fair manner. In addition, the packets need not be physically segmented within the input port and reassembled within the output port. The switching time of the switch fabric is also a flexible design parameter, and is set to switch packets one byte or several bytes at each switching time.

図１Ｂは、一実施形態において図１Ａのスケジューラにより実行される調停およびスケジューリング方法４０の高レベルフローチャートを示す。この実施形態による、多くともｒ個の要求が、アクト４１で各入力ポートから生成される。各入力ポートは、各入力キューから多くとも１個の要求を備えた、ｒ個の入力キューを有するので、各入力ポートからの多くともｒ個の要求が存在することになる。これらｒ個の要求の各々は異なる出力ポートにも宛てられる。アクト４２において、各出力ポートは多くともｒ個の許可を発行することになる。このとき、各要求は関連する出力キューに対応する。各入力ポートは、唯一の要求を生成するので、各出力ポートが、多くともｒ個の要求を各入力ポートから１個受信することは簡単に分かる。そして各出力ポートは許可を、ｒ個の受信された要求全てに発行できる。アクト４３において、各入力ポートは多くともｒ個の許可を承認する。各出力ポートは多くともｒ個の許可を各入力ポートに対して１個発行するので、各入力ポートは多くともｒ個の許可を受信する。そして各入力ポートはｒ個の許可全てを承認する。 FIG. 1B illustrates a high-level flowchart of the arbitration and scheduling method 40 performed by the scheduler of FIG. 1A in one embodiment. At most r requests according to this embodiment are generated from each input port at act 41. Since each input port has r input queues with at most one request from each input queue, there will be at most r requests from each input port. Each of these r requests is also addressed to a different output port. In act 42, each output port will issue at most r permissions. Each request then corresponds to an associated output queue. Since each input port generates a unique request, it can easily be seen that each output port receives at most r requests from each input port. Each output port can then issue a grant to all r received requests. In Act 43, each input port accepts at most r permissions. Each output port issues at most r permissions to each input port, so each input port receives at most r permissions. Each input port then approves all r permissions.

アクト４４において、全てのｒ^２個の要求が、以前スケジューリングしたパケットのパスを再配置せずにスケジューリングされる。本発明によると、全てのｒ^２個の要求は中間ステージ１３０における少なくとも２のスピードアップで厳密にノンブロッキング方式でスケジューリングされる。要求生成の調停、許可の発行、承認の生成が、１度だけの繰り返しで実行されることに留意されたい。アクト４４の後に、制御はアクト４５に戻る。アクト４５において、入力ポートで新規の要求または異なる要求があるかどうかがチェックされる。結果が「ノー」の場合、制御はアクト４５に戻る。新規要求はあるが同一の要求であり、出力キュー要求に対して同じ入力キューを有する場合、同じスケジュールが使用され、次のｒ^２個の要求をスイッチングする。入力ポートから新規の要求および異なる要求があるとき、制御はアクト４５からアクト４１に移る。そしてアクト４１−４５がループ内で実行される。 At act 44, all r ² requests are scheduled without relocating the previously scheduled packet path. In accordance with the present invention, all r ² requests are scheduled in a strictly non-blocking manner with at least two speedups in the intermediate stage 130. Note that request generation arbitration, permission issuance, and approval generation are performed in a single iteration. After act 44, control returns to act 45. In act 45, it is checked whether there are new or different requests at the input port. If the result is “no”, control returns to Act 45. If there is a new request but the same request and has the same input queue for the output queue request, the same schedule is used to switch the next r ² requests. When there is a new request and a different request from the input port, control passes from act 45 to act 41. Acts 41-45 are then executed in the loop.

図１Ｃのネットワーク１４は、中間ステージ１３０のスイッチの数が

に等しいとき、ユニキャスト接続要求に対して再配置可能なノンブロッキング方式で動作することもできる。同様に本発明によると、中間ステージ１３０内で４×４クロスバーネットワーク１３１を１つだけ、即ち少なくとも１のスピードアップを有する別の実施形態において、図１Ｎのスイッチファブリック１８は、再配置可能なノンブロッキング方式で動作する。 In the network 14 of FIG. 1C, the number of switches in the intermediate stage 130 is

It is also possible to operate in a non-blocking manner that can be rearranged for unicast connection requests. Similarly, according to the present invention, in another embodiment having only one 4 × 4 crossbar network 131 in the intermediate stage 130, ie at least one speedup, the switch fabric 18 of FIG. 1N can be relocated. Operates in a non-blocking manner.

厳密にノンブロッキングなネットワークにおいて、全ての入力キューの行頭でパケットが一度にスケジューリングされると、以前スケジューリングしたパケットのパスを乱すことなく、ネットワークを介して入力キューから宛先出力キューまで、パケットのパスをスケジューリングすることが常に可能であり、１個より多くののそのようなパスが利用可能な場合、残りのパケットのスケジューリングを気にすることなく、任意のパスを選択できる。再配置可能なノンブロッキングなネットワークにおいて、全ての入力キューの行頭でパケットが一度にスケジューリングされると、入力キューから宛先出力キューへのパケットのパスをスケジューリングすることは、再配置が必要な場合は、以前スケジューリングしたパケットのパスをスケジューラが再配置することで満足されるよう保証される。本発明によると、図１Ａのスイッチファブリック１０が厳密にノンブロッキング方式で動作するのに対して、図１Ｎのスイッチファブリック１８は再配置可能なノンブロッキング方式で動作する。 In a strictly non-blocking network, once packets are scheduled at the beginning of all input queues, the packet path from the input queue to the destination output queue is routed through the network without disturbing the previously scheduled packet path. Scheduling is always possible, and if more than one such path is available, any path can be selected without worrying about scheduling the remaining packets. In a relocatable non-blocking network, once a packet is scheduled at the beginning of all input queues, scheduling the path of the packet from the input queue to the destination output queue is Satisfaction is guaranteed by relocating the path of previously scheduled packets by the scheduler. According to the present invention, the switch fabric 10 of FIG. 1A operates strictly in a non-blocking manner, whereas the switch fabric 18 of FIG. 1N operates in a relocatable non-blocking manner.

図２Ａを参照すると、スイッチファブリック２０は出力キューを有さない。そうでなければ、図２Ａのスイッチファブリック２０は、図１Ａのスイッチファブリック１０の図と全く同じである。本発明によると、スイッチファブリック２０は、入力ポートおよび出力ポート内にＳＡＲが必要であるのを除いて、図１Ａのスイッチファブリック１０について開示したのと全ての点で同じように、厳密にノンブロッキングで決定論的方式で動作する。パケットは、スイッチング時間によって決定されたように入力ポート内でセグメント化され、出力ポートにスイッチングされて、別個に再組立される必要がある。しかしながら、図１Ｂの調停およびスケジューリング方法４０は、図２Ａのスイッチファブリック２０内のパケットをスイッチングするためにも使用される。ここでさらに、スケジューリングは全ての１６個の行頭パケット上で同時に実行され、仮想的に１６個の出力キューがあると仮定すると、パケットは４回のスイッチング時間でスイッチングされる。しかしながら、スイッチング時間の間、パケットは出力キューの代わりに、宛先出力ポートにスイッチングされる。図２Ｂ−２Ｆは、図１Ｂの調停およびスケジューリング方法を使用し、表１に示すパケット要求をスケジューリングすることによる、各スイッチング時間後の図２Ａのスイッチファブリック２０の状態を示す。 Referring to FIG. 2A, the switch fabric 20 does not have an output queue. Otherwise, the switch fabric 20 of FIG. 2A is exactly the same as the view of the switch fabric 10 of FIG. 1A. In accordance with the present invention, the switch fabric 20 is strictly non-blocking, in all respects similar to that disclosed for the switch fabric 10 of FIG. 1A, except that a SAR is required in the input and output ports. Operates in a deterministic manner. The packet needs to be segmented within the input port as determined by the switching time, switched to the output port, and reassembled separately. However, the arbitration and scheduling method 40 of FIG. 1B is also used to switch packets within the switch fabric 20 of FIG. 2A. Further here, scheduling is performed simultaneously on all 16 first-line packets, assuming that there are virtually 16 output queues, the packets are switched in 4 switching times. However, during the switching time, packets are switched to the destination output port instead of the output queue. 2B-2F show the state of the switch fabric 20 of FIG. 2A after each switching time by scheduling the packet requests shown in Table 1 using the arbitration and scheduling method of FIG. 1B.

図２Ｂは、パケットＡ１、Ｆ１、Ｋ１、およびＰ１が出力キューにスイッチングされる最初のスイッチング時間後の図２Ａのスイッチファブリック２０の状態を示す。入力ポート１５１からのパケットＡ１は、クロスバーネットワーク１３１経由で、出力ポート１９１にスイッチングされる。入力ポート１５２からのパケットＦ１は、クロスバーネットワーク１３１経由で、出力ポート１９２にスイッチングされる。入力ポート１５３からのパケットＫ１は、クロスバーネットワーク１３２経由で、出力ポート１９３にスイッチングされる。入力ポート１５４からのパケットＰ１は、クロスバーネットワーク１３２経由で、出力ポート１９４にスイッチングされる。第１スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートは唯一１個のパケットを受信することは明らかである。 FIG. 2B shows the state of the switch fabric 20 of FIG. 2A after the initial switching time when packets A1, F1, K1, and P1 are switched to the output queue. The packet A1 from the input port 151 is switched to the output port 191 via the crossbar network 131. The packet F1 from the input port 152 is switched to the output port 192 via the crossbar network 131. The packet K1 from the input port 153 is switched to the output port 193 via the crossbar network 132. The packet P1 from the input port 154 is switched to the output port 194 via the crossbar network 132. It is clear that at the first switching time, only one packet is switched from each input port and each output port receives only one packet.

図２Ｃは、パケットＤ１、Ｅ１、Ｊ１およびＯ１が出力キューにスイッチングされる第２スイッチング時間後の、図２Ａのスイッチファブリック２０の状態を示す。入力ポート１５１からのパケットＤ１は、クロスバーネットワーク１３１経由で、出力ポート１９４にスイッチングされる。入力ポート１５２からのパケットＥ１は、クロスバーネットワーク１３１経由で、出力ポート１９１にスイッチングされる。入力ポート１５３からのパケットＪ１は、クロスバーネットワーク１３２経由で、出力ポート１９２にスイッチングされる。入力ポート１５４からのパケットＯ１は、クロスバーネットワーク１３２経由で、出力ポート１９３にスイッチングされる。再び、第２スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートは唯一１個のパケットを受信する。 FIG. 2C shows the state of the switch fabric 20 of FIG. 2A after a second switching time in which packets D1, E1, J1 and O1 are switched to the output queue. The packet D1 from the input port 151 is switched to the output port 194 via the crossbar network 131. The packet E1 from the input port 152 is switched to the output port 191 via the crossbar network 131. The packet J1 from the input port 153 is switched to the output port 192 via the crossbar network 132. The packet O1 from the input port 154 is switched to the output port 193 via the crossbar network 132. Again, at the second switching time, only one packet is switched from each input port, and each output port receives only one packet.

図２Ｄは、パケットＣ１、Ｈ１、Ｉ１およびＮ１が出力キューにスイッチングされる第３スイッチング時間後の図２Ａのスイッチファブリック２０の状態を示す。入力ポート１５１からのパケットＣ１は、クロスバーネットワーク１３１経由で、出力ポート１９３にスイッチングされる。入力ポート１５２からのパケットＨ１は、クロスバーネットワーク１３１経由で、出力ポート１９４にスイッチングされる。入力ポート１５３からのパケットＩ１は、クロスバーネットワーク１３２経由で、出力ポート１９１にスイッチングされる。入力ポート１５４からのパケットＮ１は、クロスバーネットワーク１３２経由で、出力ポート１９２にスイッチングされる。再び、第３スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートは唯一１個のパケットを受信する。 FIG. 2D shows the state of the switch fabric 20 of FIG. 2A after a third switching time when packets C1, H1, I1, and N1 are switched to the output queue. The packet C1 from the input port 151 is switched to the output port 193 via the crossbar network 131. The packet H1 from the input port 152 is switched to the output port 194 via the crossbar network 131. The packet I1 from the input port 153 is switched to the output port 191 via the crossbar network 132. The packet N1 from the input port 154 is switched to the output port 192 via the crossbar network 132. Again, at the third switching time, only one packet is switched from each input port, and each output port receives only one packet.

図２Ｅは、パケットＢ１、Ｇ１、Ｌ１およびＭ１が出力キューにスイッチングされる第４スイッチング時間後の図２Ａのスイッチファブリック２０の状態を示す。入力ポート１５１からのパケットＢ１は、クロスバーネットワーク１３２経由で、出力ポート１９２にスイッチングされる。入力ポート１５２からのパケッＧ１は、クロスバーネットワーク１３１経由で、出力ポート１９３にスイッチングされる。入力ポート１５３からのパケットＬ１は、クロスバーネットワーク１３１経由で、出力ポート１９４にスイッチングされる。入力ポート１５４からのパケットＭ１は、クロスバーネットワーク１３２経由で、出力ポート１９１にスイッチングされる。第４スイッチング時間において、各入力ポートから唯一１個のパケットがスイッチングされ、各出力ポートは唯一１個のパケットを受信するは明らかである。 FIG. 2E shows the state of the switch fabric 20 of FIG. 2A after a fourth switching time in which packets B1, G1, L1 and M1 are switched to the output queue. The packet B1 from the input port 151 is switched to the output port 192 via the crossbar network 132. The packet G1 from the input port 152 is switched to the output port 193 via the crossbar network 131. The packet L1 from the input port 153 is switched to the output port 194 via the crossbar network 131. The packet M1 from the input port 154 is switched to the output port 191 via the crossbar network 132. In the fourth switching time, it is clear that only one packet is switched from each input port and each output port receives only one packet.

図２Ｆは、Ａ１、Ｆ１、Ｋ１およびＰ１が、第１スイッチング時間においてスイッチングされるのと全く同じように、パケットＡ２、Ｆ２、Ｋ２およびＰ２が出力キューにスイッチングされる第５スイッチング時間後の、図２Ａのスイッチファブリック２０の状態を示す。入力ポート１５１からのパケットＡ２は、クロスバーネットワーク１３１経由で、出力ポート１９１にスイッチングされる。入力ポート１５２からのパケットＦ２は、クロスバーネットワーク１３１経由で、出力ポート１９２にスイッチングされる。入力ポート１５３からのパケットＫ２は、クロスバーネットワーク１３２経由で、出力ポート１９３にスイッチングされる。入力ポート１５４からのパケットＰ２は、クロスバーネットワーク１３２経由で、出力ポート１９４にスイッチングされる。 FIG. 2F shows a fifth switching time after packets A2, F2, K2 and P2 are switched to the output queue, just as A1, F1, K1 and P1 are switched in the first switching time. The state of the switch fabric 20 of FIG. 2A is shown. The packet A2 from the input port 151 is switched to the output port 191 via the crossbar network 131. The packet F2 from the input port 152 is switched to the output port 192 via the crossbar network 131. The packet K2 from the input port 153 is switched to the output port 193 via the crossbar network 132. The packet P2 from the input port 154 is switched to the output port 194 via the crossbar network 132.

また、図１Ｂの調停およびスケジューリング方法４０は、図２Ａのスイッチファブリック２０を厳密にノンブロッキング方式で動作させ、パケットは、１００％スループット、処理保存的、および公正な方式で動作する。スイッチファブリックのスイッチング時間は、柔軟な設計パラメータでもあり、各スイッチング時間において１バイトずつまたは数バイトずつパケットをスイッチングするよう設定することができる。しかしながら、スイッチファブリック２０にＳＡＲ必要となり、パケットが入力ポート内で物理的にセグメント化され、出力ポート内で再組立される必要があることになる。それにも関わらず、スイッチファブリック２０において、パケットおよびパケットセグメントは、入力ポートが受信するのと同じ順序で、出力ポートにスイッチングされる。実際、ＳＡＲを除き、図１Ｂの調停およびスケジューリング方法４０は、図１Ａのスイッチファブリック１０に関して説明したのと全ての点で、同じ方法でスイッチファブリック２０を動作させる。 Also, the arbitration and scheduling method 40 of FIG. 1B operates the switch fabric 20 of FIG. 2A strictly in a non-blocking manner, and the packet operates in a 100% throughput, process conservative and fair manner. The switching time of the switch fabric is also a flexible design parameter and can be set to switch packets one byte or several bytes at each switching time. However, SAR is required for the switch fabric 20 and the packets need to be physically segmented in the input port and reassembled in the output port. Nevertheless, in switch fabric 20, packets and packet segments are switched to output ports in the same order that the input ports receive. Indeed, except for the SAR, the arbitration and scheduling method 40 of FIG. 1B operates the switch fabric 20 in the same manner in all respects as described with respect to the switch fabric 10 of FIG. 1A.

スイッチファブリックがノンブロッキングに動作するための中間ステージにおける２のスピードアップは、２つの方法で実現される。即ち、１）並列処理および２）スイッチングレートの２倍化である。並列処理は、例えば図１Ａのスイッチファブリック１０で示すように、中間ステージ内の２個の相互接続ネットワークを並列に使用することで実現される。スイッチングレートの２倍化は、入力ポートおよび出力ポート内の各クロックに対して、単一の相互接続ネットワーク、第１および第２の内部リンクを２倍のクロックレートで動作させることで実現される。第１クロックにおいて、単一の相互接続ネットワークは、２個の並列相互接続ネットワークで実装された等価なスイッチファブリックの第１相互接続、例えば図１Ａのスイッチファブリック１０内の相互接続ネットワーク１３１として、スイッチングに対して動作する。同様に第２クロックにおいて、単一の相互ネットワークは、第２相互接続ネットワーク、例えば図１Ａのスイッチファブリック１０内の相互接続ネットワーク１３２として動作する。従って、相互接続ネットワークのクロックスピードと、第１および第２の内部リンクとにおいて２倍のレートがこの実装に必要となる。図１Ｂの調停およびスケジューリング方法４０は、並列処理または２倍のレートによるスピードアップを実装して、双方のスイッチファブリックを、本発明において説明したように、全ての点でノンブロッキングで決定論的方式で動作させる。 The two speedups in the intermediate stage for the switch fabric to operate non-blocking are realized in two ways. That is, 1) parallel processing and 2) doubling of the switching rate. Parallel processing is realized by using two interconnected networks in the intermediate stage in parallel, as shown for example by the switch fabric 10 in FIG. 1A. Switching rate doubling is achieved by operating a single interconnect network, first and second internal links at twice the clock rate for each clock in the input and output ports. . At the first clock, the single interconnect network switches as the first interconnect of an equivalent switch fabric implemented with two parallel interconnect networks, for example, the interconnect network 131 in the switch fabric 10 of FIG. 1A. Works against. Similarly, at the second clock, the single interconnect network operates as an interconnect network 132 within the second interconnect network, eg, switch fabric 10 of FIG. 1A. Therefore, twice the rate is required for this implementation in the interconnect network clock speed and the first and second internal links. The arbitration and scheduling method 40 of FIG. 1B implements parallel processing or double rate speedup so that both switch fabrics are non-blocking and deterministic in all respects as described in the present invention. Make it work.

図３Ａを参照すると、図３Ａはスイッチファブリック３０を示す。中間ステージ１３０内の唯一のクロスバー相互接続ネットワークでのクロックスピードを２スピードアップさせることと、第１および第２の内部リンクを２スピードアップさせることを除いて、図１Ａのスイッチファブリック１０の図と同じである。図１Ａのネットワークの別の実施形態において、中間ステージ内の各相互接続ネットワークは共有メモリネットワークである。図３Ｂは、スイッチファブリック５０を示す。中間ステージ１３０内の２個の共有メモリ相互接続ネットワークをスピードアップさせることを除いて、図１Ａのスイッチファブリック１０と同じである。図３Ｃは、スイッチファブリック６０を示す。中間ステージ１３０内の唯一の共有メモリ相互接続ネットワークのクロックスピードを２スピードアップさせることと、第１および第２の内部リンクを２スピードアップさせることを除いて、図３Ａのスイッチファブリック３０と全く同じである。 Referring to FIG. 3A, FIG. 3A shows the switch fabric 30. The diagram of the switch fabric 10 of FIG. 1A except that the clock speed on the only crossbar interconnect network in the intermediate stage 130 is increased by 2 and the first and second internal links are increased by 2 Is the same. In another embodiment of the network of FIG. 1A, each interconnect network in the intermediate stage is a shared memory network. FIG. 3B shows the switch fabric 50. Same as the switch fabric 10 of FIG. 1A except that it speeds up the two shared memory interconnect networks in the intermediate stage 130. FIG. 3C shows the switch fabric 60. Exactly the same as switch fabric 30 of FIG. 3A, except that the clock speed of the only shared memory interconnect network in intermediate stage 130 is increased by 2 and the first and second internal links are increased by 2 It is.

同様に、図３Ｄは、スイッチファブリック７０を示す。２スピードアップに、中間ステージ１３０内の２個のハイパーキューブ相互接続ネットワークを２スピードアップさせることを除いて、図１Ａのスイッチファブリック１０と全く同じである。図３Ｅは、スイッチファブリック６０を示す。中間ステージ１３０内の唯一のハイパーキューブベースの相互接続ネットワークのクロックレートを２スピードアップさせることと、第１および第２の内部リンクを２スピードアップさせることを除いて、図３Ａのスイッチファブリック３０と全く同じである。 Similarly, FIG. 3D shows the switch fabric 70. Exactly the same as switch fabric 10 of FIG. 1A, except that two speedups speeds up the two hypercube interconnect networks in intermediate stage 130. FIG. 3E shows the switch fabric 60. The switch fabric 30 of FIG. 3A, except for speeding up the clock rate of the only hypercube-based interconnect network in the intermediate stage 130 by 2 and speeding up the first and second internal links by 2 Exactly the same.

図１Ａのスイッチファブリック１０、図１Ｉの１６、図１Ｎの１８、図２Ａの２０、図３Ａの３０、図３Ｂの５０、図３Ｃの６０、図３Ｄの７０、および図３Ｅの８０において、入力ポート１１０の数と出力ポート１２０の数が、一般に各ステージに対する変数ｒで表される。中間ステージにおけるスピードアップはｓで表される。中間ステージにおけるスピードアップは、並列処理、即ち２個の相互接続ネットワーク（図４Ａ、図４Ｃおよび図４Ｅに示す）、または１個の相互接続ネットワークにおける２倍のスイッチングレート（図４Ｂ、図４Ｄおよび図４Ｆに示す）のいずれかにより実現される。各入力ポートのサイズ１５１−{１５１＋ｒ}は、一般にｒ＊ｓの記法で表され（各入力ポートがｒ個の入力キューを有し、ｓ個の相互接続ネットワークとｓ個の第１内部リンクで接続されるということである）、各出力スイッチのサイズ１９１−{１９１＋ｒ}は、一般にｓ＊ｒの記法で表される（各出力ポートがｒ個の出力キューを有し、ｓ個の相互接続ネットワークとｓ個の第２内部リンクで接続されるということである）。同様に、中間ステージ１３０内の各相互接続ネットワークのサイズはｒ＊ｒと表される。ここで説明した相互接続ネットワークは、クロスバーネットワークや、共有メモリネットワーク、各々がクロスバーまたは共有メモリネットワークであるサブネットワークのネットワーク、３ステージクロスネットワーク、ハイパーキューブ、任意の内部ノンブロッキング相互接続ネットワーク、またはネットワークのネットワークであってよい。３ステージスイッチファブリックは、Ｖ（ｓ，ｒ）の記法で表される。 Inputs at switch fabric 10 of FIG. 1A, 16 of FIG. 1I, 18 of FIG. 1N, 20 of FIG. 2A, 30 of FIG. 3A, 50 of FIG. 3B, 60 of FIG. 3C, 70 of FIG. The number of ports 110 and the number of output ports 120 are generally represented by a variable r for each stage. The speed up in the intermediate stage is represented by s. The speed up in the intermediate stage is parallel processing, ie two interconnect networks (shown in FIGS. 4A, 4C and 4E), or twice the switching rate in one interconnect network (FIGS. 4B, 4D and 4). (Shown in FIG. 4F). Each input port size 151- {151 + r} is generally expressed in the notation of r * s (each input port has r input queues, s interconnect networks and s first internal links). The size of each output switch 191- {191 + r} is generally expressed in the notation of s * r (each output port has r output queues and s interconnects). It is connected to the network with s second internal links). Similarly, the size of each interconnect network in the intermediate stage 130 is denoted r * r. The interconnect network described here can be a crossbar network, a shared memory network, a network of sub-networks, each of which is a crossbar or shared memory network, a three-stage cross network, a hypercube, any internal non-blocking interconnect network, or It may be a network of networks. A three-stage switch fabric is represented by the notation of V (s, r).

存在する出力キュー１８１−{１８１＋ｒ}と同数の入力キュー１７１−{１７１＋ｒ}がある必要はないが、対称ネットワークにおいてそれらは同数である。ｓ個の中間ステージ相互接続ネットワーク１３１−１３２それぞれは、ｒ個の第１内部リンクを介して、ｒ個の入力ポートのそれぞれに接続され、ｒ個の第２内部リンクを介して、出力ポートそれぞれに接続される。第１の内部リンクＦＬ１−ＦＬｒそれぞれ、および第２内部リンクＳＬ１−ＳＬｒそれぞれは、新規パケットにより使用可能か、または別のパケットにより既に取得されている場合に利用可能でないかのいずれかである。 There need not be as many input queues 171-{171 + r} as there are output queues 181-{181 + r}, but they are the same in a symmetric network. Each of the s intermediate stage interconnect networks 131-132 is connected to each of the r input ports via r first internal links, and each of the output ports via r second internal links. Connected to. Each of the first internal links FL1-FLr and each of the second internal links SL1-SLr are either usable by new packets or not available if already acquired by another packet.

図１Ａのスイッチファブリック１０は、図４Ａの一般的な対称スイッチファブリックの例であり、中間ステージ１３０内の２個のクロスバー相互接続ネットワークを使用することで２のスピードアップを提供する。図４Ｂを参照すると、一般的な対称スイッチファブリックが示される。中間ステージ１３０内の唯一のクロスバー相互接続ネットワークのクロックスピードを２スピードアップすることと、第１および第２の内部リンクを２スピードアップすることを除いて、図４Ａのスイッチファブリック３０と同じである The switch fabric 10 of FIG. 1A is an example of the general symmetric switch fabric of FIG. 4A and provides two speedups by using two crossbar interconnect networks in the intermediate stage 130. Referring to FIG. 4B, a typical symmetric switch fabric is shown. Same as the switch fabric 30 of FIG. 4A except that the clock speed of the only crossbar interconnect network in the intermediate stage 130 is increased by 2 and the first and second internal links are increased by 2 is there

図４Ｃは、一般的な対称スイッチファブリックを示す。中間ステージ１３０内の２個の共有メモリ相互接続ネットワークを使用することで２のスピードアップを提供する。図４Ｄは、一般的な対称スイッチファブリックを示す。これは、中間ステージ１３０内の唯一の共有メモリ相互接続ネットワークのクロックスピードを２スピードアップし、第１および第２の内部リンクを２スピードアップすることを使用することで、２のスピードアップを提供する。 FIG. 4C shows a typical symmetric switch fabric. Using two shared memory interconnect networks in the intermediate stage 130 provides a 2 speedup. FIG. 4D shows a typical symmetric switch fabric. This provides a speedup of 2 by using 2 speedup of the clock speed of the only shared memory interconnect network in the intermediate stage 130 and 2 speedup of the first and second internal links. To do.

図４Ｅは、一般的な対称スイッチファブリックを示す。中間ステージ１３０内の２個の３ステージクロス相互接続ネットワークを使用することで、２のスピードアップを提供する。図４Ｆは、一般的な対称スイッチファブリックを示す。中間ステージ１３０内の唯一の３ステージクロス相互接続ネットワークを２スピードアップさせることと、第１および第２の内部リンクを２スピードアップさせることを使用することで、２のスピードアップを提供する。 FIG. 4E shows a typical symmetric switch fabric. Using two three-stage cross-interconnect networks in the intermediate stage 130 provides two speedups. FIG. 4F shows a typical symmetric switch fabric. Using two speedups of the only three-stage cross-connect network in the intermediate stage 130 and two speedups of the first and second internal links provides two speedups.

一般に、中間ステージ１３０内の相互接続ネットワークは、任意の相互接続ネットワーク、即ち、ハイパーキューブや、バッチャバニアン相互接続ネットワーク、任意の内部ノンブロッキング相互接続ネットワーク、またはネットワークのネットワークであってよい。一実施形態において、相互接続ネットワーク１３１および１３２は、２つの異種のネットワークのであってよい。例えば、相互接続ネットワーク１３１は、クロスバーネットワークであってよく、相互接続ネットワーク１３２は、共有メモリネットワークであってよい。本発明によると、中間ステージにおいて使用される相互接続ネットワークの種類とは無関係に、中間ステージ内の少なくとも２のスピードアップが、図１Ｂの調停およびスケジューリング方法４０を使用して、厳密にノンブロッキング方式でスイッチファブリックを動作させる。そして中間ステージ内の少なくとも１のスピードアップが、そのスイッチファブリックを再配置可能なノンブロッキング方式で動作させる。 In general, the interconnection network in the intermediate stage 130 may be any interconnection network, i.e., a hypercube, batcher vanian interconnection network, any internal non-blocking interconnection network, or a network of networks. In one embodiment, interconnect networks 131 and 132 may be two disparate networks. For example, the interconnect network 131 may be a crossbar network and the interconnect network 132 may be a shared memory network. In accordance with the present invention, regardless of the type of interconnect network used in the intermediate stage, at least two speedups in the intermediate stage are strictly non-blocking using the arbitration and scheduling method 40 of FIG. 1B. Operate the switch fabric. And at least one speedup in the intermediate stage causes the switch fabric to operate in a non-blocking manner that can be repositioned.

スイッチファブリック内のスピードアップは、相互接続ネットワークの内部スピードアップに関係しないことに留意されたい。例えば、クロスバーネットワークおよび共有メモリネットワークは完全接続のトポロジーであり、何ら追加的な内部スピードアップなくても、内部的にノンブロッキングである。例えば、図１Ａのスイッチファブリック１０または図３Ｂのスイッチファブリック５０のいずれかにおける相互接続ネットワーク１３１−１３２は、クロスバーネットワークまたは共有メモリネットワークであり、ノンブロッキング方式で動作するために相互接続ネットワーク１３１−１３２のいずれかに必要となるスピードアップはない。しかしながら、相互接続ネットワーク１３１−１３２が３ステージクロスネットワークである場合、各３ステージクロスネットワークには、厳密にノンブロッキング方式で動作可能であるために、内部での２のスピードアップが必要となる。中間ステージ相互接続ネットワーク１３１−１３２が３ステージクロスネットワークであるスイッチファブリックにおいて、スイッチファブリックの２のスピードアップは、１３１および１３２のような２個の異なる３ステージクロスネットワークの形で提供される。加えて、各３ステージクロスネットワーク１３１および１３２は、それ自体が内部的に厳密にノンブロッキングであるために、さらに２のスピードアップが必要である。明らかに、スイッチファブリックのスピードアップは、相互接続ネットワークの内部のスピードアップとは異なる。 Note that the speedup within the switch fabric is not related to the internal speedup of the interconnect network. For example, crossbar networks and shared memory networks are fully connected topologies and are internally non-blocking without any additional internal speedup. For example, the interconnect network 131-132 in either the switch fabric 10 of FIG. 1A or the switch fabric 50 of FIG. 3B is a crossbar network or a shared memory network, and the interconnect networks 131-132 to operate in a non-blocking manner. There is no speedup required for either. However, when the interconnection network 131-132 is a three-stage cross network, each three-stage cross network can operate strictly in a non-blocking system, and therefore requires an internal speedup of 2. In a switch fabric where the intermediate stage interconnect network 131-132 is a three stage cross network, a two speed up of the switch fabric is provided in the form of two different three stage cross networks, such as 131 and 132. In addition, each three-stage cross network 131 and 132 requires a further 2 speedup because it is strictly strictly non-blocking internally. Clearly, the switch fabric speedup is different from the internal speedup of the interconnect network.

同様に、中間ステージ１３１および１３２内の相互接続ネットワークがハイパーキューブネットワークである場合、ある実施形態においてｄランクのハイパーキューブ（２^ｄ個のノードを備える）において、それがノンブロッキングネットワークであるために、ｄの内部のスピードアップが必要である。本発明によると、中間ステージの相互接続ネットワーク１３１または１３２は、スイッチファブリックが図１Ｂの調停およびスケジューリング方法４０を使用して、中間ステージ内で２のスピードアップで、厳密にノンブロッキング方式で動作可能であり、また、中間ステージ内で少なくとも１のスピードアップで、再配置可能なノンブロッキング方式で動作可能であるために、任意の内部ノンブロッキング相互接続ネットワークであることができる。 Similarly, if the interconnection network in the intermediate stages 131 and 132 is a hypercube network, in one embodiment in a d-rank hypercube (comprising 2 ^d nodes), because it is a non-blocking network, It is necessary to increase the internal speed of d. In accordance with the present invention, the intermediate stage interconnect network 131 or 132 can operate in a strictly non-blocking manner, with the switch fabric speeding up 2 in the intermediate stage using the arbitration and scheduling method 40 of FIG. 1B. Yes, and can be any internal non-blocking interconnect network because it can operate in a relocatable non-blocking manner with at least one speedup in the intermediate stage.

図４Ｇを参照すると、中間ステージ相互接続ネットワーク１３１または１３２が図３Ｄのスイッチファブリック７０および図３Ｅのスイッチファブリック８０内にある一実施形態における、４×４ポート（２ランク）のハイパーキューブベース相互接続ネットワークの詳細図が示されている。４ノードハイパーキューブには４個のノード、即ち００、０１、１０および１１がある。ノード００は、ノード０１に双方向リンクＡで接続される。ノード０１は、ノード１１に双方向リンクＢで接続される。ノード１１は、ノード１０に双方向リンクＣで接続される。ノード１０は、ノード００に双方向リンクＤで接続される。４個のノードそれぞれは、スイッチファブリックの入力ポートおよび出力ポートに接続される。ノード００は、第１内部リンクＦＬ１と第２内部リンクＳＬ１に接続される。ノード０１は、第１内部リンクＦＬ２と第２内部リンクＳＬ１に接続される。ノード１０は、第１内部リンクＦＬ３と第２内部リンクＳＬ３に接続される。ノード１１は、第１内部リンクＦＬ４と第２内部リンクＳＬ４に接続される。図４Ｇに示すハイパーキューブネットワーク１３１または１３２が内部的にノンブロッキングであるために、一実施形態において、スイッチファブリックの入口リンク（または出口リンク）と同じレートで、あるいはハイパーキューブネットワークのスケジューリング体系に左右される幾らか要因のスピードアップによって、リンクＡ、Ｂ、Ｃ、Ｄを双方向に動作させる必要がある。本発明に従うと、ハイパーキューブが内部的にノンブロッキング方式で動作すること、スイッチファブリックが図１Ｂの調停およびスケジューリング方法４０を使用して、少なくとも２のスピードアップで厳密にノンブロッキング方式で動作可能であること、およびスイッチファブリックが中間ステージで少なくとも１のスピードアップで、再配置可能なノンブロッキング方式で動作可能であることが必要になる。 Referring to FIG. 4G, a 4 × 4 port (2 rank) hypercube based interconnect in one embodiment where the intermediate stage interconnect network 131 or 132 is in the switch fabric 70 of FIG. 3D and the switch fabric 80 of FIG. 3E. A detailed view of the network is shown. There are four nodes in the 4-node hypercube: 00, 01, 10 and 11. Node 00 is connected to node 01 by bidirectional link A. The node 01 is connected to the node 11 by a bidirectional link B. The node 11 is connected to the node 10 by a bidirectional link C. The node 10 is connected to the node 00 by a bidirectional link D. Each of the four nodes is connected to an input port and an output port of the switch fabric. The node 00 is connected to the first internal link FL1 and the second internal link SL1. The node 01 is connected to the first internal link FL2 and the second internal link SL1. The node 10 is connected to the first internal link FL3 and the second internal link SL3. The node 11 is connected to the first internal link FL4 and the second internal link SL4. Because the hypercube network 131 or 132 shown in FIG. 4G is internally non-blocking, in one embodiment, at the same rate as the switch fabric ingress link (or egress link) or depending on the hypercube network scheduling scheme. It is necessary to operate links A, B, C, and D in both directions due to some increase in speed. According to the present invention, the hypercube operates internally in a non-blocking manner, and the switch fabric can operate strictly in a non-blocking manner at a speedup of at least 2 using the arbitration and scheduling method 40 of FIG. 1B. And the switch fabric must be able to operate in a relocatable non-blocking manner with at least one speedup in the intermediate stage.

図４Ａ−４Ｆは、同数の第１内部リンクと第２内部リンクを示すが、対称スイッチファブリックの場合のように、本発明は非対称スイッチファブリックに拡張される。一般に、（ｒ_１＊ｒ_２）の非対称スイッチファブリックであって、その非対称スイッチファブリックは、各入力ポートがｒ_２個の入力キューを有するｒ_１個の入力ポートと、各出力ポートがｒ_１個の出力キューを有するｒ_２個の出力ポートと、ｓ個のサブネットワークを備えた、少なくとも

のスピードアップを有する相互接続ネットワークとを備え、各サブネットワークは総数少なくともｒ_１個の第１内部リンクに対する各入力ポートに接続される少なくとも１個の第１内部リンクを備え、各サブネットワークはさらに総数少なくともｒ_２個の第２内部リンクに対する各出力ポートに接続される少なくとも１個の第２内部リンクを備える。その非対称スイッチファブリックは、ｒ_１≦ｒ_２のときに、スイッチングすべき各スイッチング時間において、多くともｒ_２回のスイッチング時間だけ決定論的な方式で、かつパケットのセグメント化および再組立を要求することなく、多くともｒ_１個のパケットをスケジューリングすることで、本発明による厳密にノンブロッキング方式で動作する。別の実施形態においては、スイッチファブリックは、ｒ_２≦ｒ_１のときに、スイッチングすべき各スイッチング時間において、多くともｒ_１回のスイッチング時間だけ決定論的な方式で、かつパケットのセグメント化および再組立を要求することなく、多くともｒ_２個のパケットをスケジューリングすることで、本発明による厳密にノンブロッキング方式で動作する。 4A-4F show the same number of first and second internal links, but as in the case of a symmetric switch fabric, the invention extends to an asymmetric switch fabric. In general, be asymmetric switch fabric (r 1 _{* r} _2), the asymmetric switch fabric, and r ₁ input ports each input port having r ₂ inputs queues, each output port is _one r With at least r ₂ output ports having s output queues and s sub-networks

Each subnetwork comprising at least _one first internal link connected to each input port for a total of at least r ₁ first internal links, each subnetwork further comprising: A total of at least one second internal link connected to each output port for at least r ₂ second internal links. The asymmetric switch fabric requires packet segmentation and reassembly in a deterministic manner at most r ₂ switching times at each switching time to be switched when r ₁ ≦ r ₂ Without scheduling at most r ₁ packets, it operates in a strictly non-blocking manner according to the present invention. In another embodiment, the switch fabric is deterministic in each switching time to be switched when r ₂ ≦ r ₁ , and at most r ₁ switching time, and packet segmentation and By scheduling at most r ₂ packets without requiring reassembly, it operates in a strictly non-blocking manner according to the present invention.

そのような一般的な非対称スイッチファブリックはＶ（ｓ，ｒ_１，ｒ_２）として表される。一実施形態において、システムは調停に対して、相互接続ネットワークにおいて数学的に最小のスピードアップで一回のみの繰り返しを実行する。システムは１００％スループット、処理保存的、公正、それでいて決定論的に動作し、それによって出力ポートを決して輻輳させない。図１Ｂの調停およびスケジューリング方法４０は、Ｖ（ｓ，ｒ_１，ｒ_２）のスイッチファブリックにおいてパケットをスケジューリングするためにも使用される。 Such a general asymmetric switch fabric is represented as V (s, r ₁ , r ₂ ). In one embodiment, the system performs only one iteration for arbitration with minimal mathematical speedup in the interconnect network. The system operates 100% throughput, conservative, fair, yet deterministic, thereby never congesting the output port. The arbitration and scheduling method 40 of FIG. 1B is also used to schedule packets in a V (s, r ₁ , r ₂ ) switch fabric.

図１Ｂの調停およびスケジューリング方法４０は、一般的なＶ（ｓ，ｒ_１，ｒ_２）スイッチファブリックもノンブロッキング方式で動作させ、パケットは１００％スループット、処理保存的、公正な方式でスイッチングされる。スイッチファブリックのスイッチング時間は、柔軟な設計パラメータでもあり、各スイッチング時間において１バイトずつまたは数バイトずつパケットをスイッチングするよう設定されることもできる。さらに、本発明において説明したようにＳＡＲの必要もない。出力キューなしの実施形態において、パケットは入力ポート内で物理的にセグメント化され、出力ポートで再組立される必要がある。 In the arbitration and scheduling method 40 of FIG. 1B, a general V (s, r ₁ , r ₂ ) switch fabric is also operated in a non-blocking manner, and packets are switched in a 100% throughput, processing conservative and fair manner. The switching time of the switch fabric is also a flexible design parameter and can be set to switch packets one byte or several bytes at each switching time. Furthermore, there is no need for SAR as described in the present invention. In embodiments without an output queue, the packets must be physically segmented within the input port and reassembled at the output port.

同様に、一実施形態において、非対称スイッチファブリックＶ（ｓ，ｒ_１，ｒ_２）は、相互接続ネットワークにおいて、少なくとも

のスピードアップで再配置可能なノンブロッキング方式で動作する。これは、ｒ_１≦ｒ_２のときに、スイッチングすべき各スイッチング時間において、多くともｒ_２回のスイッチング時間だけ決定論的な方式で、かつパケットのセグメント化および再組立を要求することなく、多くともｒ_１個のパケットをスケジューリングすることで動作する。別の実施形態において、非対称スイッチファブリックＶ（ｓ，ｒ_１，ｒ_２）は、相互接続ネットワークにおいて、少なくとも

のスピードアップで再配置可能なノンブロッキング方式で動作する。これは、ｒ_２≦ｒ_１のときに、スイッチングすべき各スイッチング時間において、多くともｒ_１回のスイッチング時間だけ決定論的な方式で、かつパケットのセグメント化および再組立を要求することなく、多くともｒ_２個のパケットをスケジューリングすることで動作する。 Similarly, in one embodiment, the asymmetric switch fabric V (s, r ₁ , r ₂ ) is at least in the interconnect network

It operates in a non-blocking method that can be relocated at a speedup of. This is a deterministic method for each switching time to be switched when r ₁ ≦ r ₂ and at most r ₂ switching times, and without requiring packet segmentation and reassembly, It works by scheduling at most r ₁ packets. In another embodiment, the asymmetric switch fabric V (s, r ₁ , r ₂ ) is at least in the interconnect network

It operates in a non-blocking method that can be relocated at a speedup of. This is in a deterministic manner at each switching time to be switched when r ₂ ≦ r ₁ , at most r ₁ switching time, and without requiring packet segmentation and reassembly, It works by scheduling r ₂ packets at most.

非対称スイッチファブリックＶ（ｓ，ｒ_１，ｒ_２）であって、その非対称スイッチファブリックは、各入力ポートがｒ_２個の入力キューを有するｒ_１個の入力ポートと、ｒ_２個の出力ポートと、ｓ個のサブネットワークを備えた、少なくとも

のスピードアップを有する相互接続ネットワークとを備え、各サブネットワークは、少なくとも総数ｒ_１個の第１内部リンクに対する各入力ポートに接続された、少なくとも１つの第１内部リンクを備え、各サブネットワークは、さらに少なくとも総数ｒ_２個の第２内部リンクに対する各出力ポートに接続された、少なくとも１つの第２内部リンクを備える。その非対称スイッチファブリックは、本発明に従い、スイッチングすべき各スイッチング時間において、多くともｒ_２回のスイッチング時間だけ決定論的な方式で、多くともｒ_１個のパケットをスケジューリングし、かつパケットのセグメント化および再組立を要求することで、厳密にノンブロッキング方式で動作する。図１Ｂの調停およびスケジューリング方法４０は、出力キューを使用せずに、Ｖ（ｓ，ｒ_１，ｒ_２）スイッチファブリックにおいてパケットをスイッチングするためにも使用される。 An asymmetric switch fabric V (s, r ₁ , r ₂ ), wherein the asymmetric switch fabric includes r ₁ input ports each having r ₂ input queues, r ₂ output ports, and , With at least s subnetworks

Each subnetwork comprising at least one first internal link connected to each input port for at least a total of r ₁ first internal links, wherein each subnetwork is , And at least one second internal link connected to each output port for at least a total of r ₂ second internal links. The asymmetric switch fabric according to the present invention schedules at most r ₁ packets and segments the packets in a deterministic manner at most r ₂ switching times at each switching time to be switched. And by requiring reassembly, it operates strictly in a non-blocking manner. The arbitration and scheduling method 40 of FIG. 1B is also used to switch packets in a V (s, r ₁ , r ₂ ) switch fabric without using an output queue.

のスピードアップを有する相互接続ネットワークとを備え、各サブネットワークは、少なくとも総数ｒ_１個の第１内部リンクに対する各入力ポートと接続された少なくとも１つの第１内部リンクを備え、各サブネットワークは、さらに少なくとも総数ｒ_２個の第２内部リンクに対する各出力ポートに接続された少なくとも１つの第２内部リンクを備える。その非対称スイッチファブリックは、本発明に従い、スイッチングすべき各スイッチング時間において、多くともｒ_２回のスイッチング時間だけ決定論的な方式で、多くともｒ_１個のパケットをスケジューリングし、かつパケットのセグメント化および再組立を要求することで、再配置可能なノンブロッキング方式で動作する。 An asymmetric switch fabric V (s, r ₁ , r ₂ ), wherein the asymmetric switch fabric includes r ₁ input ports each having r ₂ input queues, r ₂ output ports, and , With at least s subnetworks

Each sub-network comprising at least one first internal link connected to each input port for at least a total number r ₁ first internal links, each sub-network comprising: And at least one second internal link connected to each output port for at least a total of r ₂ second internal links. The asymmetric switch fabric according to the present invention schedules at most r ₁ packets and segments the packets in a deterministic manner at most r ₂ switching times at each switching time to be switched. And by requiring reassembly, it operates in a non-blocking manner that can be repositioned.

出願人は、本発明で説明した全てのスイッチファブリックは、入力ポートから出力ポートへ、レートおよびレイテンシの保証を提供することを指摘している。エンドツーエンド保証帯域幅、即ち、任意の入力ポートから任意の出力ポートへの帯域幅が、表１に示す入力キューから出力キューへの割当に基づいて与えられる。保証された一定のレイテンシが、複数の入力ポートから任意の出力ポートへのパケットに与えられる。各入力ポートはその宛先出力ポート内の割当出力キューにパケットをスイッチングするため、ある入力ポートからのパケットは、同じ出力ポートにスイッチングする第２の入力ポートからの別のパケットを妨害せず、従って入力ポート全てからのパケットのレイテンシ保証を強制する。スイッチファブリックのスイッチング時間は、各フローにおけるパケットのレイテンシを決定し、さらに各パケット内のパケットセグメントのレイテンシも決定する。 Applicants note that all switch fabrics described in the present invention provide rate and latency guarantees from input ports to output ports. End-to-end guaranteed bandwidth, ie, bandwidth from any input port to any output port, is given based on the input queue to output queue allocation shown in Table 1. Guaranteed constant latency is provided for packets from multiple input ports to any output port. Since each input port switches packets to the assigned output queue in its destination output port, a packet from one input port does not interfere with another packet from a second input port that switches to the same output port, and therefore Enforce packet latency guarantees from all input ports. The switching time of the switch fabric determines the latency of packets in each flow, and further determines the latency of packet segments in each packet.

図５Ａは、図１Ｂの調停およびスケジューリング方法４０のアクト４４の実装を示す。ｒ^２個のパケットのスケジューリングがアクト４４で実行される。アクト４４Ａにおいて、スケジューリングすべきパケットがまだあるかどうかがチェックされる。スケジューリングすべきパケットがまだある場合、即ち、ｒ^２個のパケット全てがスケジューリングされない場合、制御はアクト４４Ｂに移る。アクト４４Ｂにおいて、中間ステージ内の２個の相互接続ネットワークのうち１個を介するオープンパスが、ｒ個のスケジューリング時間を介して検索することで選択される。パケットは、アクト４４Ｃにおいて、選択されたパス、選択されたスケジューリング時間を介してスケジューリングされる。４４Ｄにおいて、選択された第１内部リンクと第２内部リンクは選択済とマーク付けされ、他のパケットはどれも、同じスケジューリング時間においてこれらのリンクを選択できない。その後、制御はアクト４４Ａに戻り、従ってアクト４４Ａ、４４Ｂ、４４Ｃ、および４４Ｄは、ループ内で実行されて各パケットをスケジューリングする。 FIG. 5A shows an implementation of act 44 of the arbitration and scheduling method 40 of FIG. 1B. Scheduling r ² pieces of packets are executed by act 44. In act 44A, it is checked whether there are more packets to schedule. If there are more packets to schedule, ie if all r ² packets are not scheduled, control passes to act 44B. In act 44B, an open path through one of the two interconnected networks in the intermediate stage is selected by searching through r scheduling times. The packet is scheduled at act 44C via the selected path and the selected scheduling time. At 44D, the selected first and second internal links are marked as selected, and no other packet can select these links at the same scheduling time. Control then returns to act 44A so that act 44A, 44B, 44C, and 44D are executed in a loop to schedule each packet.

図５Ｂは、図５Ａのアクト４４の一変形の低レベルフローチャートである。アクト４４Ａは、スケジューリングすべき新規パケット要求がある場合、制御をアクト４４Ｂに移す。アクト４４Ｂ１は新規パケット要求をｃに割り当てる。アクト４４Ｂ２において、sched_time_1がインデックス変数ｉに割り当てられる。その後、アクト４４Ｂ３はｉがスケジューリング時間ｒ以下であるかどうかをチェックする。結果が「イエス」ならば、制御はアクト４４Ｂ４に移る。アクト４４Ｂ４において、別のインデックス変数ｊが相互接続ネットワーク１に設定される。アクト４４Ｂ５はｊが相互接続ネットワーク１または２のいずれであるかどうかをチェックする。結果が「イエス」ならば制御はアクト４４Ｂ６に移る。アクト４４Ｂ６は、パケット要求ｃがスケジューリング時間ｉにおいて、相互接続ネットワークｊに対する利用可能な第１内部リンクを有しないかどうかをチェックする。結果が「ノー」の場合、アクト４４Ｂ７は、スケジューリング時間ｉにおいて相互接続ネットワークｊが、パケット要求ｃの宛先出力ポートに対する利用可能な第２内部リンクを有しないかどうかをチェックする。結果が「ノー」の場合、制御はアクト４４Ｃに移る。アクト４４Ｃにおいて、パケット要求ｃはスケジューリング時間ｉにおいて、相互接続ネットワークｊを介してスケジューリングされ、その後、アクト４４Ｄにおいて第１および第２の内部リンクが、スケジューリング時間ｉにおける相互接続ネットワークｊに対応して、使用済とマーク付けされる。その後、制御はアクト４４Ａに移る。アクト４４Ｂ６またはアクト４４Ｂ７のいずれかにおいて、結果が「イエス」の場合、制御はアクト４４Ｂ９に移り、ｊが１増加され、制御はアクト４４Ｂ５に移る。アクト４４Ｂ５において結果が「ノー」の場合、制御はアクト４４Ｂ１０に移る。アクト４４Ｂ１０はｉを１増加し、制御はアクト４４Ｂ３に移る。アクト４４Ｂ３の結果は、決して「ノー」にならず、これはｒ個のスケジューリング時間において、パケット要求ｃはスケジューリングされるよう保証されることを意味する。アクト４４Ｂは２個のループを備える。内部ループはアクト４４Ｂ５、４４Ｂ６、４４Ｂ７、および４４Ｂ９から成る。外部ループはアクト４４Ｂ３、４４Ｂ４、４４Ｂ５、４４Ｂ６、４４Ｂ７、４４Ｂ９、および４４Ｂ１０から成る。アクト４４は、全てのｒ^２個のパケット要求がスケジューリングされるまで、全てのパケットに対して繰り返される。 FIG. 5B is a low-level flowchart of a variation of Act 44 of FIG. 5A. Act 44A transfers control to Act 44B if there is a new packet request to be scheduled. Act 44B1 assigns the new packet request to c. In Act 44B2, sched_time_1 is assigned to the index variable i. Thereafter, act 44B3 checks whether i is less than or equal to scheduling time r. If the result is “yes”, control transfers to act 44B4. In Act 44B4, another index variable j is set in the interconnection network 1. Act 44B5 checks whether j is either interconnect network 1 or 2. If the result is “yes”, control transfers to act 44B6. Act 44B6 checks whether packet request c does not have a first internal link available for interconnect network j at scheduling time i. If the result is “no”, act 44B7 checks whether the interconnection network j does not have a second internal link available to the destination output port of packet request c at scheduling time i. If the result is “no”, control is transferred to Act 44C. At act 44C, the packet request c is scheduled via interconnect network j at scheduling time i, after which the first and second internal links correspond to interconnect network j at scheduling time i at act 44D. , Marked as used. Thereafter, control is transferred to Act 44A. In either Act 44B6 or Act 44B7, if the result is “yes”, control is transferred to Act 44B9, j is incremented by 1, and control is transferred to Act 44B5. If the result is “NO” in Act 44B5, control is transferred to Act 44B10. Act 44B10 increases i by 1, and control passes to Act 44B3. The result of act 44B3 is never “no”, which means that in r scheduling times, packet request c is guaranteed to be scheduled. The act 44B includes two loops. The inner loop consists of Acts 44B5, 44B6, 44B7, and 44B9. The outer loop consists of Acts 44B3, 44B4, 44B5, 44B6, 44B7, 44B9, and 44B10. Act 44, until all r ² pieces of packet requests are scheduled, is repeated for all the packets.

以下の方法は、図５Ａのスケジューリング方法４４の一実装に対する擬似コードを示す。この擬似コードは、図４Ａ−４Ｆにおけるスイッチファブリック内の中間ステージ１３０の２のスピードアップ（２個の相互接続ネットワーク、またはクロックスピードおよびリンクスピードの２のスピードアップによる）を使用して、ｒ^２個のパケットを厳密にノンブロッキング方式でスケジューリングする。 The following method shows pseudo code for one implementation of scheduling method 44 of FIG. 5A. This pseudo code uses r ² speedups of intermediate stage 130 in the switch fabric in FIGS. 4A-4F (due to 2 interconnected networks or 2 speedups of clock speed and link speed). Scheduling of packets strictly in a non-blocking manner.

スケジューリング方法の擬似コード：
ステップ１：for スケジューリングすべき各パケット要求 do{
ステップ２： c=パケットスケジュール要求；
ステップ３： for i=shed_time_１ to sched_time_r do{
ステップ４： for j=inter_conn_net_１ to inter_conn_net_２ do{
ステップ５： if(cがjに対する利用可能な第１内部リンクを有さない) continue；
ステップ６： elseif(jがcの宛先出力ポートに対する利用可能な第２内部リンクを有さない) continue；
ステップ７： else{
スケジューリング時間iにおいて、相互接続ネットワークjを介してcをスケジューリング；
相互接続ネットワークjへ、およびそこから使用されたリンクを利用不可としてマークづけ；
}
}
}
} Pseudo code for scheduling method:
Step 1: For each packet request to be scheduled do {
Step 2: c = Request packet schedule;
Step 3: for i = shed_time_1 to sched_time_r do {
Step 4: for j = inter_conn_net_1 to inter_conn_net_2 do {
Step 5: if (c has no first internal link available for j) continue;
Step 6: elseif (j has no second internal link available for destination output port of c) continue;
Step 7: else {
Scheduling c via interconnection network j at scheduling time i;
Marking links used to and from interconnection network j as unavailable;
}
}
}
}

ステップ１は、各パケットをスケジューリングするループを開始する。ステップ２は、現在のパケット要求を「ｃ」としてラベル付けする。ステップ３は、第２ループを開始し、全てのｒスケジューリング時間を通る。ステップ４は、第３ループを開始し、２個の相互接続ネットワークを通る。パケット要求ｃの入力ポートが、ステップ５において、スケジューリング時間ｉにおいて相互接続ネットワークｊに対して利用可能な第１内部リンクを有さない場合、制御はステップ４に移り、次の相互接続ネットワークをｉとして選択する。ステップ６は、パケット要求ｃの宛先出力ポートが、スケジューリング時間ｉにおける相互接続ネットワークｊから利用可能な第２内部リンクを有しないかどうかをチェックし、もしそうならば制御はステップ４に移り、次の相互接続ネットワークをｉとして選択する。ステップ７において、パケット要求ｃが、スケジューリング時間ｉにおける相互接続ネットワークｊを介してセットアップされる。スケジューリング時間ｉにおける相互接続ネットワークｊに対する第１および第２の内部リンクが、将来のパケット要求のために利用不可としてマーク付けされる。これらのステップは、利用可能な第１および第２の内部リンクが見つかるまで、全てのｒ回のスケジューリング時間において、全ての２個の相互接続ネットワークに対して繰り返される。本発明に従って、ｒ回のスケジューリング時間の１つにおける１個の相互接続ネットワークを常に発見することができ、それを通してパケット要求ｃがスケジューリングされる。スケジューリング方法を実行するステップの数がｓ＊ｒに比例すると理解するのは簡単である。ここで、ｓは２に等しいスピードアップであり、ｒはスケジューリング時間の回数で、従ってそのスケジューリング方法は時間計算量Ｏ（ｓ＊ｒ）である。 Step 1 starts a loop that schedules each packet. Step 2 labels the current packet request as “c”. Step 3 starts the second loop and goes through all r scheduling times. Step 4 starts the third loop and goes through the two interconnected networks. If the input port of packet request c does not have a first internal link available to interconnect network j at scheduling time i at step 5, control passes to step 4 where the next interconnect network is assigned i. Choose as. Step 6 checks whether the destination output port of packet request c does not have a second internal link available from interconnect network j at scheduling time i, and if so, control passes to step 4 and then Select i as the interconnection network. In step 7, a packet request c is set up via the interconnect network j at scheduling time i. The first and second internal links for interconnect network j at scheduling time i are marked unavailable for future packet requests. These steps are repeated for all two interconnected networks at all r scheduling times until an available first and second internal link is found. In accordance with the present invention, one interconnect network at one of the r scheduling times can always be found, through which the packet request c is scheduled. It is easy to understand that the number of steps to execute the scheduling method is proportional to s * r. Here, s is a speedup equal to 2, r is the number of scheduling times, and therefore the scheduling method is the time complexity O (s * r).

表３は、上記の擬似コードのステップ１−８が、ある特別な実装における図５Ｂに示した方法のフローチャートをどのように実装するかを示す。 Table 3 shows how steps 1-8 of the pseudo code above implement the flowchart of the method shown in FIG. 5B in one particular implementation.

厳密にノンブロッキングなスイッチファブリックのスケジューリングにおいて、入力キューから出力キューへのパケット要求をスケジューリングするため、相互接続ネットワークを介するパスを発見し、既にスケジューリングされたパケットのパスを阻害することなく要求を満足することが常に可能であり、１つ以上のそのようなパスが利用可能な場合、任意のそれらを残りのパケット要求のスケジューリングを気にすることなく選択できる。厳密にノンブロッキングなネットワークにおいて、スイッチハードウェアのコストは増加するが、パケットをスケジューリングするのに要求される時間は、再配置可能なノンブロッキングなスイッチファブリックと比較して減少する。中間ステージ内の２のスピードアップを有する厳密にノンブロッキングなスイッチファブリックの実施形態は、時間計算量Ｏ（ｓ＊ｒ）である図５Ａのスケジューリング方法４４を使用し、図１Ａのスイッチファブリック１０および図１Ｉのスイッチファブリック１６において示されている。 In the scheduling of strictly non-blocking switch fabrics, packet requests from the input queue to the output queue are scheduled so that the path through the interconnect network is discovered and the request is satisfied without obstructing the path of the already scheduled packet It is always possible, and if one or more such paths are available, any of them can be selected without worrying about scheduling the remaining packet requests. In strictly non-blocking networks, the cost of switch hardware is increased, but the time required to schedule packets is reduced compared to a relocatable non-blocking switch fabric. A strictly non-blocking switch fabric embodiment with 2 speedups in the intermediate stage uses the scheduling method 44 of FIG. 5A with time complexity O (s * r), and the switch fabric 10 and FIG. It is shown in the 1I switch fabric 16.

再配置可能なノンブロッキングなスイッチファブリックにおいて、スイッチハードウェアのコストは、パケットをスケジューリングするのに要求される時間の増加の代償として削減される。スケジューリング時間は、再配置可能なノンブロッキングネットワークにおいて増加する。なぜならば、新規パケットのスケジューリングに加えて、再配置を実装するために乱された、既にスケジューリングされたパケットのパスを再度スケジューリングする必要があるからである。この理由のため、新規パケットをスケジューリングするときは、既にスケジューリングされたパケットを再配置する必要性を最小化または排除するのが望ましい。再配置の必要性が排除されるとき、そのネットワークは、中間ステージ相互接続ネットワークの数およびスケジューリング方法に左右されて、厳密にノンブロッキングである。中間ステージにおいてスピードアップを使用しない再配置可能なノンブロッキングスイッチファブリックの一実施形態が、図１Ｎのスイッチファブリック１８において示されている。要求生成、許可発行、および承認の生成の調停は、スイッチファブリックが厳密にノンブロッキング方式で動作するか、または再配置可能なノンブロッキング方式で動作するか否かに関わらず、唯一の繰り返しで実行されることは留意されたい。 In a relocatable non-blocking switch fabric, switch hardware costs are reduced at the cost of increased time required to schedule packets. Scheduling time increases in relocatable non-blocking networks. This is because, in addition to scheduling new packets, it is necessary to reschedule the paths of already scheduled packets that have been disturbed to implement relocation. For this reason, when scheduling new packets, it is desirable to minimize or eliminate the need to relocate already scheduled packets. When the need for relocation is eliminated, the network is strictly non-blocking, depending on the number of intermediate stage interconnect networks and the scheduling method. One embodiment of a relocatable non-blocking switch fabric that does not use speedup in an intermediate stage is shown in switch fabric 18 of FIG. 1N. Arbitration of request generation, authorization issuance, and approval generation is performed in a single iteration, regardless of whether the switch fabric operates strictly in non-blocking or relocatable non-blocking. Please note that.

出願人は、出力キューにキューイングされたスイッチに関する２、３の考察を行っている。出願人は、出力キューにキューイングされたスイッチは、入力ポートで受信したパケットを即座に宛先出力キューに送信することで、出力ポートを輻輳させることを指摘する。例えば、ｒ＊ｒのＯＱスイッチにおいて、全ての入力ポートが同じ出力ポートに入る場合、出力ポートは、受信するように設計された出力ポートよりもｒ回多くパケットを受信する。出力ポートの輻輳は、以下の不必要な問題を生み出す。１）追加的なパケットの優先度付けおよび管理が出力ポート内で要求される、２）レート保証の実装が極度に困難、３）出力ポートから送信される各パケットのレートが減少する、４）出力ポートにおけるトラフィック輻輳を排除することが、入力ポート内でパケットをランダムに落とすことを要求する、５）これらの全ての要因が、追加的なトラフィック管理コスト、電力およびメモリの要求につながる。本質的に出力キューイングは、ファブリックを越えてパケットを転送することで、局所的にのみパケットスイッチングを解消するが、ネットワーク装置における１００％スループットでのトラフィックの決定論的なフローという目標は達成できない。 Applicants have made a few considerations regarding switches queued in the output queue. Applicant points out that a switch queued in the output queue congests the output port by immediately sending packets received at the input port to the destination output queue. For example, in an r * r OQ switch, if all input ports enter the same output port, the output port will receive r more packets than the output port designed to receive. Output port congestion creates the following unnecessary problems: 1) Additional packet prioritization and management is required in the output port 2) Extremely difficult to implement rate guarantee 3) The rate of each packet transmitted from the output port is reduced 4) Eliminating traffic congestion at the output port requires that packets be dropped randomly within the input port 5) All these factors lead to additional traffic management costs, power and memory requirements. Inherently, output queuing eliminates packet switching only locally by forwarding packets across the fabric, but the goal of deterministic flow of traffic at 100% throughput in a network device cannot be achieved. .

出願人はここで、ＶＯＱスイッチファブリック内の出力ポートを潜在的に輻輳させうる方法を説明する。図６Ａおよび図６Ｂは、スピードアップを完全に使用した、各スイッチング時間後の図１Ａのスイッチファブリック１０の状態を示す。即ち、中間ステージ内の相互接続ネットワークにおけるスピードアップを使用し、パケットにより出力ポートが送信可能なレートの２倍でパケットを送信する。図６Ａは、パケットＡ１、Ｄ１、Ｅ１、Ｆ１、Ｊ１、Ｋ１、Ｏ１およびＰ１が出力キューにスイッチングされる第１スイッチング時間後の、図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＡ１は、クロスバーネットワーク１３１経由で、出力ポート１９１の出力キュー１８１にスイッチングされる。入力ポート１５１からのパケットＤ１は、クロスバーネットワーク１３２経由で、出力ポート１９４の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＥ１は、クロスバーネットワーク１３２経由で、出力ポート１９１の出力キュー１８２にスイッチングされる。入力ポート１５２からのパケットＦ１は、クロスバーネットワーク１３１経由で、出力ポート１９２の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＪ１は、クロスバーネットワーク１３２経由で、出力ポート１９２の出力キュー１８３にスイッチングされる。入力ポート１５３からのパケットＫ１は、クロスバーネットワーク１３１経由で、出力ポート１９３の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＰ１は、クロスバーネットワーク１３１経由で、出力ポート１９４の出力キュー１８４にスイッチングされる。入力ポート１５４からのパケットＯ１は、クロスバーネットワーク１３２経由で、出力ポート１９３の出力キュー１８４にスイッチングされる。明らかに、第１スイッチング時間において、各入力ポートから２個のパケットがスイッチングされ、各出力ポートは２個のパケットを受信する。 Applicants now describe how the output ports in the VOQ switch fabric can potentially be congested. 6A and 6B show the state of the switch fabric 10 of FIG. 1A after each switching time, fully using speedup. That is, using speedup in the interconnect network in the intermediate stage, packets are sent at twice the rate that the output port can send by packets. FIG. 6A shows the state of the switch fabric 10 of FIG. 1A after a first switching time when packets A1, D1, E1, F1, J1, K1, O1, and P1 are switched to the output queue. The packet A1 from the input port 151 is switched to the output queue 181 of the output port 191 via the crossbar network 131. The packet D1 from the input port 151 is switched to the output queue 181 of the output port 194 via the crossbar network 132. The packet E1 from the input port 152 is switched to the output queue 182 of the output port 191 via the crossbar network 132. The packet F1 from the input port 152 is switched to the output queue 182 of the output port 192 via the crossbar network 131. The packet J1 from the input port 153 is switched to the output queue 183 of the output port 192 via the crossbar network 132. The packet K1 from the input port 153 is switched to the output queue 183 of the output port 193 via the crossbar network 131. The packet P1 from the input port 154 is switched to the output queue 184 of the output port 194 via the crossbar network 131. The packet O1 from the input port 154 is switched to the output queue 184 of the output port 193 via the crossbar network 132. Obviously, at the first switching time, two packets are switched from each input port and each output port receives two packets.

図６Ｂは、パケットＢ１、Ｃ１、Ｇ１、Ｈ１、Ｉ１、Ｌ１、Ｍ１およびＮ１が出力キューにスイッチングされる第２スイッチング時間後の、図１Ａのスイッチファブリック１０の状態を示す。入力ポート１５１からのパケットＢ１は、クロスバーネットワーク１３１経由で、出力ポート１９２の出力キュー１８１にスイッチングされる。入力ポート１５１からのパケットＣ１は、クロスバーネットワーク１３２経由で、出力ポート１９３の出力キュー１８１にスイッチングされる。入力ポート１５２からのパケットＧ１は、クロスバーネットワーク１３１経由で、出力ポート１９３の出力キュー１８２にスイッチングされる。入力ポート１５２からのパケットＨ１は、クロスバーネットワーク１３２経由で、出力ポート１９４の出力キュー１８２にスイッチングされる。入力ポート１５３からのパケットＩ１は、クロスバーネットワーク１３２経由で、出力ポート１９１の出力キュー１８３にスイッチングされる。入力ポート１５３からのパケットＬ１は、クロスバーネットワーク１３１経由で、出力ポート１９４の出力キュー１８３にスイッチングされる。入力ポート１５４からのパケットＭ１は、クロスバーネットワーク１３１経由で、出力ポート１９１の出力キュー１８４にスイッチングされる。入力ポート１５４からのパケットＮ１は、クロスバーネットワーク１３２経由で、出力ポート１９２の出力キュー１８４にスイッチングされる。再び、第２スイッチング時間において、各入力ポートから２個のパケットがスイッチングされ、各出力ポートは２個のパケットを受信する。 FIG. 6B shows the state of the switch fabric 10 of FIG. 1A after a second switching time in which packets B1, C1, G1, H1, I1, L1, M1, and N1 are switched to the output queue. The packet B1 from the input port 151 is switched to the output queue 181 of the output port 192 via the crossbar network 131. The packet C1 from the input port 151 is switched to the output queue 181 of the output port 193 via the crossbar network 132. The packet G1 from the input port 152 is switched to the output queue 182 of the output port 193 via the crossbar network 131. The packet H1 from the input port 152 is switched to the output queue 182 of the output port 194 via the crossbar network 132. The packet I1 from the input port 153 is switched to the output queue 183 of the output port 191 via the crossbar network 132. The packet L1 from the input port 153 is switched to the output queue 183 of the output port 194 via the crossbar network 131. The packet M1 from the input port 154 is switched to the output queue 184 of the output port 191 via the crossbar network 131. The packet N1 from the input port 154 is switched to the output queue 184 of the output port 192 via the crossbar network 132. Again, at the second switching time, two packets are switched from each input port, and each output port receives two packets.

しかしながら、出力ポートは、各スイッチング時間において１個のパケットを送信するのみであるといえる。さらに各入力ポートは、各スイッチング時間で１個のパケットのみを受信する。従って第３および第４スイッチング時間に対して、出力ポート内に十分な出力キュー領域がない場合、出力ポートは複数のパケットを受信できない。たとえ十分な領域がある場合でも、それは持続されることはできず、何らかの時点で出力キュー領域は満杯になり、出力ポートがクリアされるまで、入力ポートからのスイッチングは停止しなければならない。従って、スピードアップを完全に使用することは持続可能でなく、出力ポートにおける不必要な輻輳を生み出す。 However, it can be said that the output port only transmits one packet at each switching time. Furthermore, each input port receives only one packet at each switching time. Therefore, if there is not enough output queue area in the output port for the third and fourth switching times, the output port cannot receive a plurality of packets. Even if there is enough area, it cannot be sustained, and at some point the output queue area is full and switching from the input port must stop until the output port is cleared. Thus, full use of the speedup is not sustainable and creates unnecessary congestion at the output port.

さらに本発明に従って、スイッチファブリックがノンブロッキング方式で動作するために、中間ステージ１３０内で要求されるスピードアップの直接的な拡張が、パケットが出力ポートにスイッチングされる前に、追加された制御ビットの数によって比例的に調節される。例えば、１％という追加的な制御ビットが、入力ポートから出力ポートへスイッチングすべき、どのパケットまたはパケットセグメントにも追加される場合（これらの制御ビットは、入力ポートから出力ポートへのパケットをスイッチングするためだけに導入される、中間ステージ１３０において、スイッチファブリックのために要求されるスピードアップは、厳密にノンブロッキング方式で動作するためには２．０１であり、再配置可能なノンブロッキング方式で動作するためには１．０１である。 Furthermore, in accordance with the present invention, because the switch fabric operates in a non-blocking manner, a direct extension of the speedup required in the intermediate stage 130 can be achieved by adding additional control bits before the packet is switched to the output port. Proportionally adjusted by number. For example, if an additional control bit of 1% is added to any packet or packet segment that should be switched from input port to output port (these control bits switch packets from input port to output port The speedup required for the switch fabric in the intermediate stage 130, which is introduced only to do so, is 2.01 to operate strictly in a non-blocking manner and operates in a relocatable non-blocking manner 1.01 for this purpose.

同様に本発明に従って、パケットがセグメント化されて出力ポートにスイッチングされるとき、最後のパケットセグメントはそのパケットセグメントと同じであっても、また同じでなくてよい。あるいはパケットのサイズがパケットセグメントのサイズの完全な倍数でない場合、スイッチファブリックのスループットは１００％未満である。最後のパケットセグメントがそのパケットセグメントのサイズより頻繁に小さい場合、中間ステージにおけるスピードアップは、システムが１００％スループットで動作するために比例的に増加する必要がある。 Similarly, when a packet is segmented and switched to an output port according to the present invention, the last packet segment may or may not be the same as that packet segment. Alternatively, if the packet size is not a perfect multiple of the packet segment size, the switch fabric throughput is less than 100%. If the last packet segment is often smaller than the size of that packet segment, the speedup in the intermediate stage needs to increase proportionally for the system to operate at 100% throughput.

ノンブロッキングで決定論的スイッチファブリックに関する本発明は、任意の多数の入力キューに直接的に拡張できる。即ち、各入力ポート内の２個以上の入力キューは、宛先出力ポート内の２個以上の出力キューにスイッチングし、全ての入力ポート内の、異なるユニキャストフローまたはユニキャストマイクロフローのセットを保持する入力キューそれぞれは、レートおよびレイテンシ保証を有する１フローずつのＱｏＳを提供する。エンドツーエンド保証帯域幅は、即ち、入力ポートの異なる入力キューにおける、任意の宛先出力ポートへの複数のユニキャストフローに対して、提供することができる。さらに、保証された一定のレイテンシが、入力ポート内の複数の入力キューから任意の宛先出力ポートへのパケットフローに対して提供される。入力ポート内の各入力キューは異なるフローを保持するけれども、パケットを同じ宛先出力ポートにスイッチングするので、１個の入力キューからのより長いパケットは、同じ宛先出力ポートにスイッチングする同じ入力ポートの第２入力キューからの小さな別のパケットを妨害しない。従って入力ポートからのパケットフローのレイテンシ保証を強制する。ここで、スイッチファブリックのスイッチング時間は、各フローにおけるパケットのレイテンシを決定し、各パケットにおけるパケットセグメントのレイテンシも決定する。 The present invention for non-blocking and deterministic switch fabrics can be directly extended to any number of input queues. That is, two or more input queues in each input port switch to two or more output queues in the destination output port and maintain a set of different unicast flows or unicast microflows in all input ports Each incoming queue provides one flow of QoS with rate and latency guarantees. End-to-end guaranteed bandwidth can be provided for multiple unicast flows to any destination output port, ie in different input queues of the input port. In addition, guaranteed constant latency is provided for packet flows from multiple input queues within an input port to any destination output port. Although each input queue within an input port maintains a different flow, packets are switched to the same destination output port, so longer packets from one input queue are the same for the same input port that switches to the same destination output port. 2 Do not block small separate packets from the input queue. Therefore, the packet flow latency guarantee from the input port is enforced. Here, the switching time of the switch fabric determines the packet latency in each flow, and also determines the packet segment latency in each packet.

入力キューから出力ポートへ別個にスイッチングされるフローの数を増加することで、エンドツーエンド保証帯域幅とレイテンシは、細粒度のフローに対して提供することもできる。さらに、各フローは独立して形成することもでき、必要であれば、オーバーサブスクリプション下の所望のフローから予測的にパケットをテールドロップすることと、サービスプロバイダを提供して、レートおよびレイテンシ保証を個別フローに提供し、従って追加的な収入機会を可能とすることにより、形成することもできる。 By increasing the number of flows that are switched separately from the input queue to the output port, end-to-end guaranteed bandwidth and latency can also be provided for fine-grained flows. In addition, each flow can be formed independently, and if necessary, tail-drop packets from the desired flow under oversubscription and provide a service provider to guarantee rate and latency. Can be formed by providing a separate flow and thus allowing additional revenue opportunities.

ここで説明した実施形態、実装、および例の多数の修正および適合化は、本開示内容を参照することで当業者には明らかであろう。 Numerous modifications and adaptations of the embodiments, implementations, and examples described herein will be apparent to those of skill in the art upon reviewing the present disclosure.

また、本発明で説明した実施形態は、並列コンピュータ、ビデオサーバ、ロードバランサのアプリケーション、およびグリッドコンピューティングアプリケーションにおいて直接的に有用である。さらに、本発明で説明した実施形態は、回線交換タイムスロットおよび、パケット交換されたパケットまたはセルの双方をスイッチングするために、ハイブリッドスイッチおよびルータにおいて直接的に有用である。 The embodiments described in the present invention are also directly useful in parallel computers, video servers, load balancer applications, and grid computing applications. Furthermore, the embodiments described in the present invention are directly useful in hybrid switches and routers for switching both circuit-switched time slots and packet-switched packets or cells.

多数のそのような修正および適合化は、添付の請求項に包含されている。 Many such modifications and adaptations are encompassed by the appended claims.

本発明に従って、クロスバーベースの相互接続ネットワークにおける、短いパケットを含む入力および出力ユニキャストキューと２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric having input and output unicast queues with short packets and 2 speedups in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、入力ポートから出力ポートへパケットをスイッチングするために使用する、調停およびスケジューリング方法４０の高レベルフローチャートである。4 is a high level flowchart of an arbitration and scheduling method 40 used to switch packets from an input port to an output port in accordance with the present invention. 図１Ａのスイッチファブリック１０のスケジューリングにおける同様の３ステージネットワークの図である。1B is a diagram of a similar three-stage network in the scheduling of the switch fabric 10 of FIG. 1A. FIG. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図１Ａのスイッチファブリックの状態を示す図である。FIG. 1B shows the state of the switch fabric of FIG. 1A after non-blocking, deterministic packet switching in 5 consecutive switching times according to the present invention. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図１Ａのスイッチファブリックの状態を示す図である。FIG. 1B shows the state of the switch fabric of FIG. 1A after non-blocking, deterministic packet switching in 5 consecutive switching times according to the present invention. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図１Ａのスイッチファブリックの状態を示す図である。FIG. 1B shows the state of the switch fabric of FIG. 1A after non-blocking, deterministic packet switching in 5 consecutive switching times according to the present invention. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図１Ａのスイッチファブリックの状態を示す図である。FIG. 1B shows the state of the switch fabric of FIG. 1A after non-blocking, deterministic packet switching in 5 consecutive switching times according to the present invention. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図１Ａのスイッチファブリックの状態を示す図である。FIG. 1B shows the state of the switch fabric of FIG. 1A after non-blocking, deterministic packet switching in 5 consecutive switching times according to the present invention. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、長いパケットを含む入力および出力ユニキャストキューと２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric having input and output unicast queues with long packets and two speedups in a crossbar-based interconnect network in accordance with the present invention. 本発明に従って、４回連続のファブリックスイッチングサイクル後の、パケットのセグメント化および再組立なしのノンブロッキングで決定論的パケットスイッチング後の、図１Ｉのスイッチファブリック１６の状態を示す図である。FIG. 2 illustrates the state of the switch fabric 16 of FIG. 1I after non-blocking, deterministic packet switching without packet segmentation and reassembly after four consecutive fabric switching cycles in accordance with the present invention. 本発明に従って、４回連続のファブリックスイッチングサイクル後の、パケットのセグメント化および再組立なしのノンブロッキングで決定論的パケットスイッチング後の、図１Ｉのスイッチファブリック１６の状態を示す図である。FIG. 2 illustrates the state of the switch fabric 16 of FIG. 1I after non-blocking, deterministic packet switching without packet segmentation and reassembly after four consecutive fabric switching cycles in accordance with the present invention. 本発明に従って、４回連続のファブリックスイッチングサイクル後の、パケットのセグメント化および再組立なしのノンブロッキングで決定論的パケットスイッチング後の、図１Ｉのスイッチファブリック１６の状態を示す図である。FIG. 2 illustrates the state of the switch fabric 16 of FIG. 1I after non-blocking, deterministic packet switching without packet segmentation and reassembly after four consecutive fabric switching cycles in accordance with the present invention. 本発明に従って、４回連続のファブリックスイッチングサイクル後の、パケットのセグメント化および再組立なしのノンブロッキングで決定論的パケットスイッチング後の、図１Ｉのスイッチファブリック１６の状態を示す図である。FIG. 2 illustrates the state of the switch fabric 16 of FIG. 1I after non-blocking, deterministic packet switching without packet segmentation and reassembly after four consecutive fabric switching cycles in accordance with the present invention. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、入力および出力ユニキャストキューを有し、スピードアップがない例示的な４×４ポートスイッチファブリックの図である。FIG. 3 is an illustration of an exemplary 4 × 4 port switch fabric with input and output unicast queues and no speedup in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、入力ユニキャストキューと２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric with an input unicast queue and 2 speedups in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図２Ａのスイッチファブリック２０の状態を示す図である。2B shows the state of the switch fabric 20 of FIG. 2A after non-blocking, deterministic packet switching in 5 consecutive switching times, in accordance with the present invention. FIG. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図２Ａのスイッチファブリック２０の状態を示す図である。2B shows the state of the switch fabric 20 of FIG. 2A after non-blocking, deterministic packet switching in 5 consecutive switching times, in accordance with the present invention. FIG. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図２Ａのスイッチファブリック２０の状態を示す図である。2B shows the state of the switch fabric 20 of FIG. 2A after non-blocking, deterministic packet switching in 5 consecutive switching times, in accordance with the present invention. FIG. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図２Ａのスイッチファブリック２０の状態を示す図である。2B shows the state of the switch fabric 20 of FIG. 2A after non-blocking, deterministic packet switching in 5 consecutive switching times, in accordance with the present invention. FIG. 本発明に従って、５回連続のスイッチング時間におけるノンブロッキングで決定論的パケットスイッチング後の、図２Ａのスイッチファブリック２０の状態を示す図である。2B shows the state of the switch fabric 20 of FIG. 2A after non-blocking, deterministic packet switching in 5 consecutive switching times, in accordance with the present invention. FIG. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric having input and output unicast queues and a 2 speed up in link and clock speeds in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、共有メモリベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 3 is a diagram of an exemplary 4 × 4 port switch fabric with input and output unicast queues and two speedups in a shared memory based interconnect network in accordance with the present invention. 本発明に従って、共有メモリベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric having input and output unicast queues and a two speed up in link and clock speeds in a shared memory based interconnect network in accordance with the present invention. 本発明に従って、ハイパーキューブベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 4 is an illustration of an exemplary 4 × 4 port switch fabric with input and output unicast queues and two speedups in a hypercube based interconnect network in accordance with the present invention. 本発明に従って、ハイパーキューブベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する例示的な４×４ポートスイッチファブリックの図である。FIG. 3 is an illustration of an exemplary 4 × 4 port switch fabric having input and output unicast queues and a two speed up in link and clock speeds in a hypercube based interconnect network in accordance with the present invention. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 3 is a diagram of a typical r * r port switch fabric with input and output unicast queues and two speedups in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、クロスバーベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 4 is a diagram of a generic r * r port switch fabric with input and output unicast queues and a 2 speed up in link and clock speeds in a crossbar based interconnect network in accordance with the present invention. 本発明に従って、共有メモリベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 3 is a diagram of a generic r * r port switch fabric with input and output unicast queues and two speedups in a shared memory based interconnect network in accordance with the present invention. 本発明に従って、共有メモリベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 3 is a diagram of a generic r * r port switch fabric with input and output unicast queues and a 2 speed up in link and clock speeds in a shared memory based interconnect network in accordance with the present invention. 本発明に従って、３ステージのクロスネットワークベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 2 is a diagram of a typical r * r port switch fabric with input and output unicast queues and two speedups in a three stage cross network based interconnect network in accordance with the present invention. 本発明に従って、３ステージのクロスネットワークベースの相互接続ネットワークにおける、入力および出力ユニキャストキューと、リンクスピードおよびクロックスピードにおいて２のスピードアップとを有する一般的なｒ＊ｒポートスイッチファブリックの図である。FIG. 3 is a diagram of a typical r * r port switch fabric with input and output unicast queues and a two speed up in link and clock speeds in a three stage cross network based interconnect network in accordance with the present invention. . 図３Ｄのスイッチファブリック７０および図３Ｅのスイッチファブリック８０内の中間ステージ相互接続ネットワーク１３１または１３２の１の実施形態における４×４ポート（ランク２）のハイパーキューブベースの相互接続ネットワークを詳細に示す図である。FIG. 3 illustrates in detail a 4 × 4 port (rank 2) hypercube-based interconnect network in one embodiment of an intermediate stage interconnect network 131 or 132 in switch fabric 70 of FIG. 3D and switch fabric 80 of FIG. 3E. It is. 図１Ｃの調停およびスケジューリング方法４０のアクト４４に対する中間レベルの実装を示す図である。FIG. 1D illustrates an intermediate level implementation for act 44 of the arbitration and scheduling method 40 of FIG. 1C. 図５Ａのアクト４４の一変形に対する低レベルフローチャートである。5B is a low-level flowchart for one variation of Act 44 of FIG. 5A. ２回連続のスイッチング時間における、スピードアップを最大に使用してパケットをスイッチングした後の、図１Ａのスイッチファブリック１０の状態を示す図である。1B is a diagram illustrating the state of the switch fabric 10 of FIG. 1A after switching packets using maximum speedup in two consecutive switching times. FIG. ２回連続のスイッチング時間における、スピードアップを最大に使用してパケットをスイッチングした後の、図１Ａのスイッチファブリック１０の状態を示す図である。1B is a diagram illustrating the state of the switch fabric 10 of FIG. 1A after switching packets using maximum speedup in two consecutive switching times. FIG.

Claims

A system for scheduling unicast packets via an interconnection network having a plurality of input ports and a plurality of output ports, wherein each of the packets has a designated output port, the system comprising:
A plurality of input queues at each of the input ports, the input queue having an input queue having unicast packets;
Means for requesting service from the designated output port for each input port for at most as many packets as there are input queues at each input port;
Each output port allows a plurality of requests;
Means for each input port to approve at most as many permissions as the input queue;
Means for scheduling at most as many packets as there are input queues from each input port having an approved permission to each output port associated with the approved permission.

A plurality of output queues at each output port, wherein the output queues receive output unicast packets via the interconnect network;
Means for each output port to grant at most as many requests as the output queue;
Scheduling at most as many packets as the number of input queues from each input port with approved grants and at most as many as the number of output queues to each output port associated with approved grants The system of claim 1, further comprising:

The system of claim 1, wherein the interconnect network is a non-blocking interconnect network.

The system of claim 3, wherein the non-blocking interconnect network comprises at least two speedups.

The speed up is
Means of parallelism, ie means for physically replicating said interconnection network at least twice and connecting by means of separate links from said each input port and each said output port; or
Realized by means of speedup at least twice in the link bandwidth between the input port and the interconnect network, between the output port and the interconnect network, and also in the clock speed of the interconnect network The system according to claim 4, wherein:

5. The unicast packet path is always selectable through the non-blocking interconnect network by never changing the already selected path for another unicast packet. The system described in
The system, hereinafter referred to as the “strict non-blocking network”.

The system of claim 3, wherein the non-blocking interconnect network comprises at least one speedup.

The unicast packet path can be always selected via the non-blocking interconnection network by changing an already selected path of another unicast packet, if necessary. 7. The system according to claim 7,
The system, hereinafter referred to as the “relocatable non-blocking network”.

The system of claim 1, further comprising a memory coupled to the scheduling means for holding a schedule of the packets already scheduled.

The system of claim 2, further comprising a memory coupled to the scheduling means for holding a schedule of the packets already scheduled.

Arbitration, that is, executing the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port in only one iteration. The system of claim 1.

Arbitration, that is, executing the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port in only one iteration. The system according to claim 2.

The system of claim 1, wherein the packets are substantially the same size.

The system of claim 1, wherein head of line blocking at the input port is completely eliminated.

The scheduling means schedules at most one packet at each switching time from each input queue having an approved grant to each output port associated with the approved grant. The system of claim 1.

The scheduling means includes, at switching time, at most one packet from each input queue having an approved grant and at most one packet to each output queue associated with the approved grant. The system according to claim 2, wherein the system is scheduled.

As long as there is at least one packet, each output port is operative to receive at least one packet from any one of the input queues destined for that output port at switching time. The system of claim 1, wherein
Hereinafter, the system will be referred to as a “process conservation system”.

As long as there is at least one packet, each output port is operative to receive at least one packet from any one of the input queues destined for that output port at switching time. The system according to claim 2, wherein
Hereinafter, the system will be referred to as a “process conservation system”.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. Yes,
2. The system of claim 1, wherein the speedup is thereby utilized only to operate the interconnect network in a deterministic manner and the output port is never congested.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. Yes,
3. The system of claim 2, wherein the speedup is thereby utilized only to operate the interconnect network in a deterministic manner and the output port is never congested.

Packets from one of the input queues are always deterministically switched to the destination output port in the same path as they are received by the input port on the same path through the interconnect network, never Works to avoid packet sequencing issues,
2. The system of claim 1, wherein the switching time is variable at design time and provides room for selecting the switching time such that multiple bytes are switched at each switching time.

Packets from one of the input queues are sent to one of the output queues in the destination output port in the same order as they are received by the input port on the same path through the interconnect network. Always switched deterministically and does not require segmentation of the packets in the input port nor reassembly of the packets in the output port, and never operates to cause packet sequencing problems;
3. The system according to claim 2, wherein the switching time is variable at the time of design and provides a room for selecting the switching time so that a plurality of bytes are switched at each switching time.

2. The operation according to claim 1, wherein any of the packets at the head of each of the input queues is operated so as not to be held beyond the same number of switching times as the number of input queues in each of the input ports. System,
Hereinafter, the system will be referred to as a “fair system”.

3. The operation according to claim 2, wherein any one of the packets at the head of each of the input queues is operated so as not to be held more than the same number of switching times as the number of input queues in each of the input ports. System,
Hereinafter, the system will be referred to as a “fair system”.

The system of claim 1, wherein the interconnect network is a crossbar network, a shared memory network, a cross network, a hypercube network, or any internal non-blocking interconnect network, or a network of networks.

The system of claim 1, wherein the system operates at 100% throughput.

The system of claim 2, wherein the system operates at 100% throughput.

The system of claim 1, wherein the system provides end-to-end guaranteed bandwidth from any input port to any output port.

The system of claim 2, wherein the system provides end-to-end guaranteed bandwidth from any input port to any output port.

The system of claim 1, wherein the system provides guaranteed constant latency for packets from multiple input ports to any output port.

3. The system of claim 2, wherein the system provides guaranteed constant latency for packets from multiple input ports to any output port.

The system of claim 1, wherein the system does not require a buffer in the interconnect network and is therefore a cut-through architecture.

The system of claim 2, wherein the system does not require a buffer in the interconnect network and is therefore a cut-through architecture.

In a method for scheduling unicast packets via an interconnection network having a plurality of input ports and a plurality of output ports, each input port comprises a plurality of input queues, each packet having at least one designated output. Said method comprising a port, said method comprising:
Requesting service from each designated input port from the designated output port for at most as many packets as there are input queues in each input port;
Allowing a plurality of requests to each output port;
Approving each input port with at most as many permissions as the number of input queues;
Scheduling said packet at most as many as the number of input queues from said input port having an approved permission to each said output port associated with said approved permission. .

A plurality of output queues for each output port;
Granting requests for each output port at most as many as the number of output queues at the output port;
Scheduling at most as many packets as the number of input queues from each input port with approved grants and at most as many as the number of output queues to each output port associated with approved grants The method of claim 34, further comprising the step of:

35. Arbitration, that is, the request for the service by the input port, the permission of the request by the output port, and the approval of the permission by the input port are executed only once. The system described in.

36. The arbitration, that is, the request for the service by the input port, the permission of the request by the output port, and the approval of the permission by the input port are executed only once. The system described in.

The method of claim 34, wherein the packets are substantially the same size.

35. The method of claim 34, wherein head of line blocking is completely eliminated at the input port.

The scheduling means schedules at most one packet at each switching time from each input queue having an approved grant to each output port associated with the approved grant. 35. The method of claim 34.

The scheduling means includes, at switching time, at most one packet from each input queue having an approved grant and at most one packet to each output queue associated with the approved grant. 36. The method of claim 35, wherein:

Each output port operates to receive at least one packet from any one of the input queues destined for that output port as long as there is at least one packet at switching time. 35. The method of claim 34, wherein:

Each output port operates to receive at least one packet from any one of the input queues destined for that output port as long as there is at least one packet at switching time. 36. The method of claim 35.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. ,
35. The method of claim 34, whereby the speed up in the interconnect network is only utilized to operate the interconnect network in a deterministic manner, and the output port is never congested.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. ,
36. The method of claim 35, whereby the speedup is only used to operate the interconnect network in a deterministic manner and the output port is never congested.

Packets from one of the input queues are always deterministically switched to the destination output port in the same path as they are received by the input port on the same path through the interconnect network, never Works to avoid packet sequencing issues,
35. The method according to claim 34, wherein the number of times of switching becomes a variable at the time of design, giving room to select the number of times of switching so that a plurality of bytes are switched at each switching time.

Packets from one of the input queues are sent to one of the output queues in the destination output port in the same order as they are received by the input port on the same path through the interconnect network. Always switched deterministically and does not require segmentation of the packets in the input port nor reassembly of the packets in the output port, and never operates to cause packet sequencing problems;
36. The method of claim 35, wherein the number of times of switching becomes a variable at design time and provides room for selecting the number of times of switching so that a plurality of bytes are switched at each switching time.

35. The method of claim 34, wherein the packet at the beginning of each input queue is operative not to be held beyond the same number of switching times as the number of input queues at each input port. .

36. The method of claim 35, wherein the packet at the beginning of each input queue is operative not to be held beyond the same number of switching times as the number of input queues at each input port. .

The method of claim 34, wherein the method schedules with 100% throughput.

36. The method of claim 35, wherein the method schedules with 100% throughput.

The method of claim 34, wherein the method operates to provide end-to-end guaranteed bandwidth from any input port to any output port.

36. The method of claim 35, wherein the method operates to provide end-to-end guaranteed bandwidth from any input port to any output port.

The method of claim 34, wherein the method operates to provide a guaranteed constant latency of packets from multiple input ports to any output port.

36. The method of claim 35, wherein the method operates to provide a guaranteed constant latency of packets from multiple input ports to any output port.

A system for scheduling unicast packets over an interconnect network, the system comprising:
R ₁ input port and r ₂ output port, each packet having a designated output port;
R ₂ input queues comprising the packets at each of the r ₁ input ports;
said interconnect network comprising s ≧ 1 sub-networks, each sub-network having a total of at least one link connected to each input port for at least r ₁ first internal links (hereinafter, Each subnetwork further comprising at least one link connected to each output port for a total of at least r ₂ second internal links (hereinafter “second internal links”). Said interconnect network comprising a link);
Means for each input port to request service from the designated output port for at most r ₂ packets from each input port;
Each output port allows a plurality of requests;
Means for each of said input ports to approve permission of at most r ₂ packets;
Means for scheduling at most r ₁ packets to each of the output ports associated with the approved grant, having an approved grant at each switching time to be switched in at most r ₂ switchings; A system characterized by comprising.

R ₁ output queues at each of the r ₂ output ports, the output queues receiving unicast packets via the interconnect network;
s ≧ 1 subnetworks, each subnetwork having a total of at least one link connected to each input port for at least r ₁ first internal links (hereinafter “first internal links”) Each sub-network further comprises at least one link connected to each output port for a total of at least r ₂ second internal links (hereinafter referred to as “second internal links”). The interconnection network;
Means for allowing each output port to accept at most r ₁ packets;
When r ₁ ≦ r ₂ , at most r ₁ packets in each switching time to be switched in at most r ₂ times switching, and when r ₂ ≦ r ₁ , at most r ₁ in each switching time to be switched in the times of switching, scheduling and r ₂ pieces of packets at most, the most r ₂ pieces of packets of said associated with permissions the approved to each output port has permission approved 57. The system of claim 56, further comprising means for performing.

57. The system of claim 56, wherein the interconnect network is a non-blocking interconnect network.

The system

With sub-networks
Furthermore, a path can always be selected for a unicast packet by never changing a path already selected for another unicast packet via the non-blocking interconnection network. 59. The system of claim 58, wherein
Hereinafter, the interconnect network will be referred to as a “strict non-blocking network”.

The system comprises s ≧ 1 subnetworks,
Both the first internal link and the second internal link operate at least twice as fast as the peak rate of each packet received in the input queue;
The subnetwork is a system that operates at least twice as fast as the peak rate of each packet received in the input queue;
The system further ensures that a path is always selectable for a unicast packet by never changing the already selected path for another unicast packet via the non-blocking interconnection network. 59. The system of claim 58, comprising:
Hereinafter, the interconnect network will be referred to as a “strict non-blocking network”.

The system

With sub-networks
Both the first internal link and the second internal link operate at least as fast as the peak rate of each packet received in the input queue;
The sub-network is a system that operates at least as fast as the peak rate of each packet received in the input queue;
The system further allows a path to be always selected for a unicast packet by changing the already selected path of another unicast packet, if necessary, via the non-blocking interconnection network. 59. The system of claim 58, comprising:
Hereinafter, the interconnect network is referred to as a “relocatable non-blocking network”.

57. The system of claim 56, further comprising a memory coupled to the means for scheduling, wherein the system schedules the already scheduled packet.

58. The system of claim 57, further comprising a memory coupled to the means for scheduling, wherein the system schedules the already scheduled packets.

The arbitration, that is, the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port are executed only once. 56. The system according to 56.

The arbitration, that is, the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port are executed only once. 58. The system according to 57.

r ₁ = r ₂ = r and the scheduling means has each approved output at each switching time to be switched in at most r times of switching and each output port associated with the approved grant 57. The system of claim 56, wherein at most r packets are scheduled.

r ₁ = r ₂ = r and the scheduling means has each approved output at each switching time to be switched in at most r times of switching and each output port associated with the approved grant 58. The system of claim 57, wherein at most r packets are scheduled.

The system of claim 56, wherein the packets are substantially the same size.

57. The system of claim 56, wherein head-of-line blocking is completely eliminated at the input port.

The scheduling means schedules at most one packet at each switching time from each input queue having an approved grant to each output port associated with the approved grant. 57. The system of claim 56.

The scheduling means includes, at switching time, at most one packet from each input queue having an approved grant and at most one packet to each output queue associated with the approved grant. 58. The system of claim 57, wherein the system is scheduled.

As long as there is at least one packet, each output port is operative to receive at least one packet from any one of the input queues destined for that output port at switching time. 57. The system of claim 56, comprising:
Hereinafter, the system will be referred to as a “process conservation system”.

As long as there is at least one packet, each output port is operative to receive at least one packet from any one of the input queues destined for that output port at switching time. 58. The system of claim 57, wherein
Hereinafter, the system will be referred to as a “process conservation system”.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. Yes,
57. The system of claim 56, whereby the speedup is only used to operate the interconnect network in a deterministic manner and the output port is never congested.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. Yes,
58. The system of claim 57, whereby the speedup is only used to operate the interconnect network in a deterministic manner and the output port is never congested.

Packets from one of the input queues are always deterministically switched to the destination output port in the same path as they are received by the input port on the same path through the interconnect network, never Works to avoid packet sequencing issues,
57. The system of claim 56, whereby the number of switchings is a variable at design time and provides room for selecting the number of switchings such that multiple bytes are switched at each switching time.

Packets from one of the input queues are sent to one of the output queues in the destination output port in the same order as they are received by the input port on the same path through the interconnect network. Always switched deterministically and does not require segmentation of the packets in the input port nor reassembly of the packets in the output port, and never operates to cause packet sequencing problems;
58. The system according to claim 57, wherein the number of times of switching becomes a variable at the time of design, and gives a room for selecting the number of times of switching so that a plurality of bytes are switched at each switching time.

57. The operation of claim 56, wherein any packet at the beginning of each input queue is not retained beyond the same number of switching times as the number of input queues at each input port. System,
Hereinafter, the system will be referred to as a “fair system”.

58. The operation of claim 57, wherein any packet at the beginning of each input queue is not retained beyond the same number of switching times as the number of input queues at each input port. System,
Hereinafter, the system will be referred to as a “fair system”.

57. The system of claim 56, wherein the interconnect network is a crossbar network, shared memory network, cross network, hypercube network, or any internal non-blocking interconnect network, or network of networks.

The system of claim 56, wherein the system operates at 100% throughput.

58. The system of claim 57, wherein the system operates at 100% throughput.

57. The system of claim 56, wherein the system provides end-to-end guaranteed bandwidth from any input port to any output port.

58. The system of claim 57, wherein the system provides end-to-end guaranteed bandwidth from any input port to any output port.

57. The system of claim 56, providing guaranteed constant latency for packets from multiple input ports to any output port.

58. The system of claim 57, providing guaranteed constant latency for packets from multiple input ports to any output port.

57. The system of claim 56, wherein the system does not require a buffer in the interconnect network and is therefore a cut-through architecture.

58. The system of claim 57, wherein the system does not require a buffer in the interconnect network and is therefore a cut-through architecture.

A method for scheduling unicast packets over an interconnect network, comprising:
R ₁ input port and r ₂ output port, each packet having at least one designated output port;
R ₂ input queues comprising the packets at each of the r ₁ input ports;
s ≧ 1 subnetworks, each subnetwork having a total of at least one link connected to each input port for at least r ₁ first internal links (hereinafter “first internal links”) Each sub-network further comprises at least one link connected to each output port for a total of at least r ₂ second internal links (hereinafter referred to as “second internal links”). Said method comprising said interconnect network, said method comprising:
Requesting service to each of the input ports from the designated output port for at most r ₂ of the packets;
Allowing a plurality of requests to each output port;
Approving the request to each input port in at most r ₂ of the packets;
Approving permission for each of said input ports in at most r ₂ packets;
Scheduling at most r ₁ packets to each of the output ports associated with the approved grant having an approved grant at each switching time to be switched in at most r ₂ switchings; Said method comprising the steps of:

r ₁ output queue at each of the ₂ output ports, the output queue receiving multirate unicast packets via the interconnect network;
said interconnect network comprising s ≧ 1 sub-networks, each sub-network being connected for at least r ₁ first internal links in total for each input port (hereinafter referred to as Each sub-network is further connected to each output port for a total of at least r ₂ second internal links (hereinafter “second internal links”). An interconnect network comprising "internal links";
Allowing at most r ₁ packets for each output port;
When r ₁ ≦ r ₂ , at most r ₁ to the output port associated with the approved grant having an approved grant at each switching time to be switched in at most r ₂ times switching. The output associated with the approved grant having an approved grant at each switching time to be switched in at most r ₁ switching when r ₂ ≦ r ₁ the method of claim 89, further comprising the step of scheduling with r ₂ pieces of packet number of the port.

Arbitration, that is, executing the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port in only one iteration. 90. The method of claim 89.

Arbitration, that is, executing the request for the service by the input port, the permission for the request by the output port, and the approval of the permission by the input port in only one iteration. The method of claim 90.

r ₁ = r ₂ = r and the scheduling means has each approved output at each switching time to be switched in at most r times of switching and each output port associated with the approved grant 90. The method of claim 89, wherein at most r packets are scheduled.

r ₁ = r ₂ = r and the scheduling means has each approved output at each switching time to be switched in at most r times of switching and each output port associated with the approved grant The method of claim 90, wherein at most r packets are scheduled.

90. The method of claim 89, wherein the packets are substantially the same size.

90. The method of claim 89, wherein head of line blocking at the input port is completely eliminated.

The scheduling means schedules at most one packet at each switching time from each input queue having an approved grant to each output port associated with the approved grant. 90. The method of claim 89.

The scheduling means includes, at switching time, at most one packet from each input queue having an approved grant and at most one packet to each output queue associated with the approved grant. The method of claim 90, wherein scheduling is performed.

Each output port operates to receive at least one packet from any one of the input queues destined for that output port as long as there is at least one packet at switching time. 90. The method of claim 89, wherein:

Each output port operates to receive at least one packet from any one of the input queues destined for that output port as long as there is at least one packet at switching time. 92. The method of claim 90, wherein:

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. ,
90. The method of claim 89, whereby speedups in the interconnect network are only used to operate the interconnect network in a deterministic manner and the output port is never congested.

Each output port is operable to receive at most one packet in switching time, regardless of the speed up in the interconnect network, even if more than one packet is destined for that output port. ,
93. The method of claim 90, whereby the speedup is only used to operate the interconnect network in a deterministic manner, and the output port is never congested.

Packets from one of the input queues are always deterministically switched to the destination output port in the same path as they are received by the input port on the same path through the interconnect network, never Works to avoid packet sequencing issues,
90. The system of claim 89, wherein the number of switchings is a variable at design time and provides room for selecting the number of switchings so that multiple bytes are switched at each switching time.

Packets from one of the input queues are sent to one of the output queues in the destination output port in the same order as they are received by the input port on the same path through the interconnect network. Always switched deterministically and does not require segmentation of the packets in the input port nor reassembly of the packets in the output port, and never operates to cause packet sequencing problems;
The system according to claim 90, wherein the number of times of switching becomes a variable at the time of design, and gives a room for selecting the number of times of switching so that a plurality of bytes are switched at each switching time.

90. The method of claim 89, wherein the packets at the beginning of each input queue are not retained beyond the same number of switching times as the number of input queues at each input port. .

93. The method of claim 90, wherein the packet at the beginning of each input queue is operative not to be held beyond the same number of switching times as the number of input queues at each input port. .

90. The method of claim 89, wherein the method operates at 100% throughput.

The method of claim 90, wherein the method operates at 100% throughput.

90. The method of claim 89, wherein the method operates to provide end-to-end guaranteed bandwidth from any input port to any output port.

The method of claim 90, wherein the method operates to provide end-to-end guaranteed bandwidth from any input port to any output port.

90. The method of claim 89, wherein the method operates to provide a guaranteed constant latency of packets from multiple input ports to any output port.

The method of claim 90, wherein the method operates to provide a guaranteed constant latency of packets from multiple input ports to any output port.