JP2012113604A

JP2012113604A - Computer system, method, and i/o switch

Info

Publication number: JP2012113604A
Application number: JP2010263440A
Authority: JP
Inventors: Chihiro Yoshimura; 地尋吉村
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2010-11-26
Filing date: 2010-11-26
Publication date: 2012-06-14

Abstract

PROBLEM TO BE SOLVED: To reduce congestion in an I/O interface and interconnect in a server by distributing data transmission between the server and an I/O device to multiple I/O interfaces which the server has.SOLUTION: When CPUs 102A-D in a server 100 issue a load instruction or a store instruction for I/O devices 120A-D, a computer system adds a different offset value for each CPU to a destination address of a packet or a transaction occurring due to the load instruction or the store instruction. Also, an I/O switch having a switch exchange part subtracts a different offset value for each I/O interface from a destination address or a destination ID of a packet or a transaction issued by the server.

Description

本発明は、複数のプロセッサと複数のＩ／Ｏインタフェースを備えたサーバ、或いは計算機システムに関し、特にＩ／Ｏインタフェースのデータ転送負荷の分散技術に関する。 The present invention relates to a server or computer system having a plurality of processors and a plurality of I / O interfaces, and more particularly to a technique for distributing a data transfer load of an I / O interface.

近年、計算機システムの構成の柔軟性を高めるために、サーバとＩ／Ｏ（Ｉｎｐｕｔ／Ｏｕｔｐｕｔ）デバイスの間にＩ／Ｏスイッチを介在させる構成が用いられている。これにより、サーバとＩ／Ｏデバイスの間の接続をＩ／Ｏスイッチの設定を変更するのみで切替えることができ、構成変更を迅速かつ柔軟に行うことが出来る。ひいては、計算機システム上で動作するジョブの負荷変動に対してＩ／Ｏを増強する、Ｉ／Ｏデバイスの障害発生時に切替えを実現する、等の計算機システム運用上の利便性を得ることが出来る。 In recent years, a configuration in which an I / O switch is interposed between a server and an I / O (Input / Output) device has been used in order to increase the flexibility of the configuration of a computer system. As a result, the connection between the server and the I / O device can be switched only by changing the setting of the I / O switch, and the configuration can be changed quickly and flexibly. As a result, it is possible to obtain convenience in the operation of the computer system, such as increasing I / O against fluctuations in the load of jobs operating on the computer system, and realizing switching when a failure occurs in an I / O device.

通常、サーバは、プロセッサ(ＣＰＵ、ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、主記憶装置（メモリ）、及び、Ｉ／Ｏデバイスを接続するためのＩ／Ｏインタフェースを提供するＩ／Ｏハブ（ＩＯＨ、Ｉｎｐｕｔ／ＯｕｔｐｕｔＨｕｂ）から構成される。 Normally, a server is an I / O hub (IOH, Input / Output Hub) that provides an I / O interface for connecting a processor (CPU, Central Processing Unit), a main storage (memory), and an I / O device. ).

Ｉ／Ｏインタフェースに関する技術は、種々の業界標準技術を含め、非特許文献１、２、３や、特許文献１などに開示されており、例えば、ＰＣＩ−ＳＩＧ（ＰｅｒｉｐｈｅｒａｌＣｏｍｐｏｎｅｎｔＩｎｔｅｒｃｏｎｅｃｔ−ＳｐｅｃｉａｌＩｎｔｅｒｅｓｔＧｒｏｕｐ）のＰＣＩＥｘｐｒｅｓｓのような、業界標準規格で定められたインタフェースが用いられている（非特許文献１）。そして、近年市販されている装置では、ＩＯＨは１本、ないしは、複数本のＰＣＩＥｘｐｒｅｓｓを提供している。 Technologies related to the I / O interface are disclosed in Non-Patent Documents 1, 2, 3, and Patent Document 1 including various industry standard technologies. For example, PCI-SIG (Peripheral Component Interconnect-Special Interest Group). An interface defined by an industry standard such as PCI Express is used (Non-Patent Document 1). In recent years, commercially available apparatuses provide one or a plurality of PCI Express IOHs.

Ｉ／Ｏデバイスは、例えばサーバをＥｔｈｅｒｎｅｔ（登録商標）に接続するためのネットワークインタフェースカード（ＮＩＣ、ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）が代表例である。ＮＩＣを始めとするＩ／Ｏデバイスは前述したＩ／Ｏインタフェースを介してサーバと接続される。現在標準的に用いられているのは、１ＧｂｉｔＥｔｈｅｒｎｅｔや１０ＧｂｉｔＥｔｈｅｒｎｅｔのＮＩＣで、サーバとの間はＰＣＩＥｘｐｒｅｓｓで接続するものが多い。 A typical example of the I / O device is, for example, a network interface card (NIC, Network Interface Card) for connecting a server to Ethernet (registered trademark). An I / O device such as a NIC is connected to a server via the I / O interface described above. Currently, 1 Gbit Ethernet and 10 Gbit Ethernet NICs are used as standard, and many servers are connected to the server by PCI Express.

特開２０１０−７９８１６号公報JP 2010-79816 A

ＰＣＩＥｘｐｒｅｓｓＢａｓｅＳｐｅｃｉｆｉｃａｔｉｏｎＲｅｖｉｓｉｏｎ２．０、ＰＣＩ−ＳＩＧ、Ｄｅｃｅｍｂｅｒ２０、２００６．PCI Express Base Specification Revision 2.0, PCI-SIG, December 20, 2006. Ｍｕｌｔｉ−ＲｏｏｔＩ／ＯＶｉｒｔｕａｌｉｚａｔｉｏｎａｎｄＳｈａｒｉｎｇＳｐｅｃｉｆｉｃａｔｉｏｎＲｅｖｉｓｉｏｎ１．０、ＰＣＩ−ＳＩＧ、Ｍａｙ１２、２００８．Multi-Root I / O Virtualization and Sharing Specification Revision 1.0, PCI-SIG, May 12, 2008. Multi-Root I / O Virtualization and Sharing Specification Revision 1.0. ＳｉｎｇｌｅＲｏｏｔＩ／ＯＶｉｒｔｕａｌｉｚａｔｉｏｎａｎｄＳｈａｒｉｎｇＳｐｅｃｉｆｉｃａｔｉｏｎＲｅｖｉｓｉｏｎ１．０、ＰＣＩ−ＳＩＧ、Ｓｅｐｔｅｍｂｅｒ１１、２００７．Single Root I / O Virtualization and Sharing Specification Revision 1.0, PCI-SIG, September 11, 2007.

現在サーバで普及している、上述のＰＣＩＥｘｐｒｅｓｓに注目して検討すると、ＰＣＩＥｘｐｒｅｓｓではツリー状のトポロジーを採用していることにより、Ｉ／Ｏスイッチを導入しても構成に制約を受けることが有り、それに対する発明が従来なされてきている。 Considering the above-mentioned PCI Express, which is now widely used in servers, PCI Express adopts a tree-like topology, so that even if an I / O switch is introduced, the configuration is restricted. Yes, the invention for this has been made.

例えば、単一のＩ／Ｏデバイスを複数のサーバから共有することは非特許文献１で示されているＰＣＩＥｘｐｒｅｓｓの規格では不可能である。ＰＣＩＥｘｐｒｅｓｓで接続されるＩ／Ｏデバイスはツリー状のトポロジーではリーフ（末端）に当たるが、リーフからツリーを辿って到達できるのは単一のルート（サーバ）のみである。単一のＩ／Ｏデバイス（リーフ）から複数のサーバ（ルート）に到達することはできないという制約がある。 For example, sharing a single I / O device from a plurality of servers is impossible with the PCI Express standard disclosed in Non-Patent Document 1. An I / O device connected by PCI Express corresponds to a leaf (terminal) in a tree topology, but only a single route (server) can be reached from the leaf by following the tree. There is a restriction that a plurality of servers (root) cannot be reached from a single I / O device (leaf).

これに対して、例えば非特許文献２で示すＭＲ−ＩＯＶ（ＭｕｌｔｉＲｏｏｔＩ／ＯＶｉｒｔｕａｌｉｚａｔｉｏｎａｎｄＳｈａｒｉｎｇ）と呼ばれる、ＰＣＩＥｘｐｒｅｓｓの拡張規格がある。このＭＲ−ＩＯＶを実現するためには、Ｉ／Ｏデバイス、Ｉ／Ｏスイッチ、及び、サーバ上で動作するオペレーティングシステム等のシステムソフトウェアに対して新規な機構を必要とする。これにより、複数のサーバで単一のデバイスを共有することが実現されたが、ＭＲ−ＩＯＶに対応したデバイス（ＭＲ−ＩＯＶデバイス）を新規に開発する必要がある。 On the other hand, there is a PCI Express extended standard called MR-IOV (Multi Root I / O Virtualization and Sharing) shown in Non-Patent Document 2, for example. In order to realize this MR-IOV, a new mechanism is required for system software such as an I / O device, an I / O switch, and an operating system operating on a server. As a result, a single device is shared by a plurality of servers, but it is necessary to newly develop a device (MR-IOV device) corresponding to MR-IOV.

そこで、特許文献１に示される技術では、非特許文献３で示されるＳＲ−ＩＯＶ（ＳｉｎｇｌｅＲｏｏｔＩ／ＯＶｉｒｔｕａｌｉｚａｔｉｏｎａｎｄＳｈａｒｉｎｇ）と呼ばれる、ＰＣＩＥｘｐｒｅｓｓの拡張規格に準拠したデバイス（ＳＲ−ＩＯＶデバイス）を、複数のサーバで共有する技術を開示している。ＳＲ−ＩＯＶは単一のサーバ上で動作する複数の仮想マシン間で単一のＩ／Ｏデバイスを共有するための技術で、ＭＲ−ＩＯＶデバイスと比較してＳＲ−ＩＯＶデバイスは多く市場に流通している。 Therefore, in the technology disclosed in Patent Document 1, a device (SR-IOV device) compliant with the PCI Express extended standard called SR-IOV (Single Root I / O Virtualization and Sharing) shown in Non-Patent Document 3 is used. Discloses a technique shared by a plurality of servers. SR-IOV is a technology for sharing a single I / O device among multiple virtual machines running on a single server. Compared with MR-IOV devices, SR-IOV devices are more widely distributed in the market. is doing.

しかし、ＳＲ−ＩＯＶデバイスではＭＲ−ＩＯＶデバイスのように複数のサーバから共有されることはできない。そこで、特許文献１ではＩ／Ｏスイッチ内にアドレス変換等の機構を有することで、単一のＳＲ−ＩＯＶデバイスを複数のサーバから共有することを実現している。 However, SR-IOV devices cannot be shared from a plurality of servers like MR-IOV devices. Thus, in Patent Document 1, a single SR-IOV device is shared from a plurality of servers by having a mechanism such as address translation in the I / O switch.

また、計算機システムの処理能力を高めるために、ＳＭＰ（ＳｙｍｍｅｔｒｉｃＭｕｌｔｉｐｌｅＰｒｏｃｅｓｓｏｒ）サーバの利用が広まっている。ＳＭＰサーバは複数のＣＰＵを有しており、複数のＣＰＵで単一の主記憶空間を共有することを特徴としたサーバであり、各々のＣＰＵ上で個別のオペレーティングシステム等のシステムソフトウェアを動作させるのではなく、ＳＭＰサーバを構成する複数のＣＰＵに跨って単一のオペレーティングシステムを動作させて用いる。 In addition, the use of SMP (Symmetric Multiple Processor) servers has become widespread in order to increase the processing capacity of computer systems. The SMP server has a plurality of CPUs, and a plurality of CPUs share a single main storage space, and operates system software such as an individual operating system on each CPU. Instead, a single operating system is operated and used across a plurality of CPUs constituting the SMP server.

ＳＭＰサーバでは、複数のＣＰＵを有して処理性能を上げていることから、Ｉ／Ｏ性能もそれに見合うように高める必要があり、ＩＯＨを複数有し、複数のＣＰＵと複数のＩＯＨの間を接続するバスを、高速シリアル伝送によるポイントツーポイント接続にしている場合が多い。 Since the SMP server has multiple CPUs to improve processing performance, it is necessary to increase the I / O performance to match that, and there are multiple IOHs between the multiple CPUs and multiple IOHs. The bus to be connected is often point-to-point connection by high-speed serial transmission.

すなわち、複数のＣＰＵを有し、複数のＩＯＨから複数のＩ／Ｏインタフェースが提供されているＳＭＰサーバにおいて、あるＣＰＵから見た時に、当該ＣＰＵに直に接続されているＩＯＨから提供されているＩ／Ｏインタフェース（近いＩ／Ｏインタフェース）と、他のＣＰＵを経由して接続されているＩＯＨから提供されるＩ／Ｏインタフェース（遠いＩ／Ｏインタフェース）の２種類にＩ／Ｏインタフェースが分類できることになる。 In other words, in an SMP server that has a plurality of CPUs and is provided with a plurality of I / O interfaces from a plurality of IOHs, it is provided from an IOH that is directly connected to the CPU when viewed from a certain CPU. I / O interfaces are classified into two types: I / O interfaces (close I / O interfaces) and I / O interfaces (distant I / O interfaces) provided by IOH connected via other CPUs It will be possible.

ＳＭＰサーバ上では、オペレーティングシステム及びその他のシステムソフトウェアや、アプリケーションソフトウェアを構成する複数のスレッドが複数のＣＰＵ上で分散して動作している。すなわち、スレッド毎に近いＩ／Ｏインタフェースと遠いＩ／Ｏインタフェースが異なる。このような事象は、ＳＭＰサーバを構成する方式の一つであるＮＵＭＡ（ＮｏｎＵｎｉｆｏｒｍＭｅｍｏｒｙＡｒｃｈｉｔｅｃｔｕｒｅ）でもメモリとＣＰＵ間での近い、遠いが発生しているが、Ｉ／Ｏインタフェースに関しても同様の特性が生じている。 On the SMP server, an operating system, other system software, and a plurality of threads constituting application software operate in a distributed manner on a plurality of CPUs. That is, the near I / O interface and the far I / O interface are different for each thread. Such an event occurs near and far between the memory and the CPU even in NUMA (Non Uniform Memory Architecture), which is one of the methods for configuring the SMP server, but the I / O interface has similar characteristics. Has occurred.

特許文献１の発明では、単一のＩ／Ｏデバイスを複数のサーバで共有することを実現していた。しかし、個々のサーバはいずれもツリー状のトポロジーのルートであるため、サーバとＩ／Ｏデバイス間の経路は一経路しか存在しないことになる。すなわち、あるサーバが複数本のＩ／Ｏインタフェースを有していたとしても、ある特定のＩ／Ｏデバイスとの間でデータ転送をする際には、１本のＩ／Ｏインタフェースしか使われない。 In the invention of Patent Document 1, a single I / O device is shared by a plurality of servers. However, since each individual server is a root of a tree-like topology, there is only one path between the server and the I / O device. In other words, even if a server has multiple I / O interfaces, only one I / O interface is used when transferring data to or from a specific I / O device. .

しかし、上述のＳＭＰサーバでは、複数のＩ／Ｏインタフェースを持つため、特許文献１に記載のような動作では、ＳＭＰサーバが有するＩ／Ｏ性能を活かすことができない。具体的には、ＳＭＰサーバとＩ／ＯデバイスをＩ／Ｏスイッチを介して接続する時に、ＳＭＰサーバが複数のＩ／Ｏインタフェースを介してＩ／Ｏスイッチと接続されているにもかかわらず、前述したツリー状のトポロジーにより経路を決定するため、ある特定のＩ／Ｏデバイスに対してアクセスする際には、常に同じＩ／Ｏインタフェースが用いられてしまうという問題が発生する。 However, since the above-mentioned SMP server has a plurality of I / O interfaces, the operation described in Patent Document 1 cannot make use of the I / O performance of the SMP server. Specifically, when an SMP server and an I / O device are connected via an I / O switch, the SMP server is connected to the I / O switch via a plurality of I / O interfaces. Since the path is determined based on the tree-like topology described above, there is a problem that the same I / O interface is always used when accessing a specific I / O device.

これにより、特定のＩ／Ｏインタフェースの混雑が発生しやすくなってしまう。また、ＳＭＰサーバ内部で特定のＩ／Ｏインタフェースから転送を行うために、ＣＰＵ間を接続する内部インターコネクトの混雑が著しくなるという問題も生じる。特に内部インターコネクトは、Ｉ／Ｏのみならずキャッシュコヒーレンス等種々の処理を担っているため、内部インターコネクトの混雑はＳＭＰサーバ全体の性能を著しく低下させる要因となる。 As a result, congestion of a specific I / O interface is likely to occur. Further, since transfer is performed from a specific I / O interface inside the SMP server, there is a problem that congestion of internal interconnects connecting between CPUs becomes significant. In particular, since the internal interconnect is responsible for various processes such as cache coherence as well as I / O, congestion of the internal interconnect is a factor that significantly reduces the performance of the entire SMP server.

そこで、ＰＣＩＥｘｐｒｅｓｓのようなツリー状のトポロジーによるＩ／Ｏインタフェースを用いながらも、複数のＩ／Ｏインタフェースを有するＳＭＰサーバにおいて、Ｉ／Ｏの負担を各Ｉ／Ｏインタフェースに分散させることが課題となっている。 Therefore, while using an I / O interface with a tree-like topology such as PCI Express, it is an issue to distribute the I / O load to each I / O interface in an SMP server having a plurality of I / O interfaces. It has become.

本発明は、上記したサーバ等のＩ／Ｏインタフェース技術に存在する課題を解決する計算機システム、方法、及びＩ／Ｏスイッチを提供することを目的とする。 An object of the present invention is to provide a computer system, a method, and an I / O switch for solving the problems existing in the I / O interface technology such as the server described above.

上記の目的を達成するため、本発明においては、計算機システムであって、複数のＩ／Ｏインタフェースと、記憶空間を共有する複数の処理部を有するサーバと、サーバが利用するＩ／Ｏデバイスと、サーバのＩ／Ｏインタフェースと、Ｉ／Ｏデバイスを接続するＩ／Ｏスイッチとを備え、処理部は、処理部がＩ／Ｏデバイスに対する命令を発行したときに、処理部毎に異なるオフセット値を、命令に基づき発生するデータの宛先データに加算するアドレス変換機構を有し、Ｉ／Ｏスイッチは、サーバが発行した宛先データから、対応するオフセット値を減算するオフセット減算部を有する構成の計算機システムを提供する。 In order to achieve the above object, according to the present invention, a computer system includes a plurality of I / O interfaces, a server having a plurality of processing units sharing a storage space, and an I / O device used by the server. And an I / O interface of the server and an I / O switch for connecting the I / O device. When the processing unit issues a command for the I / O device, the processing unit has an offset value that is different for each processing unit. Is added to the destination data of the data generated based on the command, and the I / O switch has a configuration having an offset subtracting unit that subtracts the corresponding offset value from the destination data issued by the server Provide a system.

また、上記の目的を達成するため、本発明においては、複数のＩ／Ｏインタフェースと記憶空間を共有する複数の処理部を有するサーバが利用するＩ／Ｏデバイスと、Ｉ／Ｏインタフェースを接続するＩ／Ｏスイッチを備える計算機システムのデータ転送方法であって、処理部は、Ｉ／Ｏデバイスに対する命令を発行したときに、命令に基づき発生するデータの宛先データに、処理部毎に異なるオフセット値を与え、Ｉ／Ｏスイッチは、サーバが発行した宛先データから、対応するオフセット値を除いた後、Ｉ／Ｏデバイスに前記データを転送するデータ転送方法を提供する。 In order to achieve the above object, in the present invention, an I / O interface used by a server having a plurality of processing units sharing a storage space with a plurality of I / O interfaces is connected to the I / O interface. A data transfer method for a computer system having an I / O switch, wherein when a command is issued to an I / O device, a processing unit adds an offset value that differs for each processing unit to destination data of data generated based on the command The I / O switch provides a data transfer method for transferring the data to the I / O device after removing the corresponding offset value from the destination data issued by the server.

更に、上記の目的を達成するため、本発明においては、記憶空間を共有する複数の処理部を備えるサーバの複数のＩ／Ｏインタフェースと、Ｉ／Ｏデバイスとを接続するＩ／Ｏスイッチであって、サーバが発行したデータの宛先データから、Ｉ／Ｏインタフェース毎に異なるオフセット値を減算するオフセット減算部と、オフセット減算部の出力に接続されるスイッチ交換部と、オフセット減算部とスイッチ交換部を制御するＩ／Ｏコントローラを備える構成のＩ／Ｏスイッチを提供する。 Furthermore, in order to achieve the above object, the present invention is an I / O switch for connecting a plurality of I / O interfaces of a server having a plurality of processing units sharing a storage space and an I / O device. An offset subtracting unit that subtracts a different offset value for each I / O interface from destination data issued by the server, a switch exchanging unit connected to the output of the offset subtracting unit, an offset subtracting unit, and a switch exchanging unit An I / O switch having an I / O controller for controlling the I / O controller is provided.

すなわち、上記の課題を解決するために、本発明の好適な態様においては、複数のＩ／Ｏインタフェース、及び複数のＣＰＵを有するサーバと、サーバが利用する少なくとも１個以上のＩ／Ｏデバイスと、サーバとＩ／Ｏデバイス間を接続するＩ／Ｏスイッチとを備える計算機システムであって、サーバの複数ＣＰＵは単一の主記憶空間を共有するものであり、サーバとＩ／Ｏスイッチは複数のＩ／Ｏインタフェースで接続されており、サーバはアドレス変換手段を有しており、アドレス変換手段は、ＣＰＵがＩ／Ｏデバイスに対するロード命令ないしはストア命令を発行したときに、ＣＰＵ毎に異なるオフセット値をロード命令ないしはストア命令などの命令に起因して発生するパケットないしはトランザクションの宛先アドレスに加算するものであり、Ｉ／Ｏスイッチは、サーバが発行したパケットないしはトランザクションの宛先アドレス、ないしは宛先識別子（Ｉｄｅｎｔｉｆｉｅｒ：ＩＤ）から、Ｉ／Ｏインタフェース毎に対応するオフセット値を減算するオフセット減算部と、オフセット値を記憶するオフセット情報記憶部を有する構成の計算機システムを提供する。 That is, in order to solve the above problems, in a preferred aspect of the present invention, a server having a plurality of I / O interfaces and a plurality of CPUs, and at least one or more I / O devices used by the server, A computer system comprising an I / O switch for connecting between a server and an I / O device, wherein a plurality of CPUs of the server share a single main storage space, and a plurality of servers and I / O switches are provided. The server has an address conversion unit, and the address conversion unit has an offset different for each CPU when the CPU issues a load instruction or a store instruction to the I / O device. The value is added to the destination address of a packet or transaction generated due to an instruction such as a load instruction or a store instruction. The I / O switch includes an offset subtracting unit that subtracts an offset value corresponding to each I / O interface from a destination address of a packet or transaction issued by a server, or a destination identifier (Identifier: ID), and an offset. A computer system having an offset information storage unit for storing a value is provided.

本発明は、複数のＩ／Ｏインタフェースを有するサーバ等を用いる計算機システムにおいて、Ｉ／Ｏインタフェース、及び、サーバ内部で用いられている内部インターコネクトの混雑を緩和し、システム全体の性能を向上させる効果を有する。 The present invention reduces the congestion of I / O interfaces and internal interconnects used inside the server in a computer system using a server having a plurality of I / O interfaces, and improves the performance of the entire system. Have

第１の実施例に係る計算機システムの全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システムのＩ／Ｏスイッチにおけるオフセット減算部の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the offset subtraction part in the I / O switch of the computer system concerning a 1st Example. 第１の実施例に係る計算機システムのＩ／Ｏスイッチにおける宛先ポート検索部の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the destination port search part in the I / O switch of the computer system concerning a 1st Example. 第１の実施例に係る計算機システムのＩ／Ｏコントローラが有するオフセット情報の構成の一例を示す概念図である。It is a conceptual diagram which shows an example of a structure of the offset information which the I / O controller of the computer system concerning a 1st Example has. 第１の実施例に係る計算機システムのＩ／Ｏコントローラが有する宛先ポートテーブルの構成の一例を示す概念図である。It is a conceptual diagram which shows an example of a structure of the destination port table which the I / O controller of the computer system concerning a 1st Example has. 第１の実施例に係る計算機システムの主記憶空間の一例を示すメモリマップである。It is a memory map which shows an example of the main storage space of the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システムの主記憶空間のうちＥＣＡＭ空間の一例を示すメモリマップである。It is a memory map which shows an example of ECAM space among the main memory spaces of the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システムの初期化シーケンスの一例を示すフローチャートである。It is a flowchart which shows an example of the initialization sequence of the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システムのバスツリーの一例を示す概念図である。It is a conceptual diagram which shows an example of the bus tree of the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システム上でデータが転送される経路を示す概念図である。It is a conceptual diagram which shows the path | route by which data is transferred on the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システム上でデータが転送される経路を示す概念図である。It is a conceptual diagram which shows the path | route by which data is transferred on the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システム上でデータが転送される経路を示す概念図である。It is a conceptual diagram which shows the path | route by which data is transferred on the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システム上でデータが転送される経路を示す概念図である。It is a conceptual diagram which shows the path | route by which data is transferred on the computer system which concerns on a 1st Example. 第１の実施例に係る計算機システムの主記憶空間全体の一例を示すメモリマップである。It is a memory map which shows an example of the whole main storage space of the computer system concerning a 1st Example. 第１の実施例に係る計算機システム上でデータが転送される経路を示す概念図である。It is a conceptual diagram which shows the path | route by which data is transferred on the computer system which concerns on a 1st Example.

以下、本発明の一実施形態を添付図面に基づいて説明する。なお、実施の形態を説明するための全図において、同一の部材には原則として同一の符号を付し、同一の符号の繰り返しの説明は省略する。なお、本明細書において、サーバとは、処理部であるプロセッサ(ＣＰＵ、ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、記憶部であるメモリ、及び、インタフェース部であるＩ／Ｏハブ（ＩＯＨ）などから構成される計算機である。また、本明細書において、各システムの要素となる機能ブロックを、「機構」、「手段」、或いは「部」と表現する場合がある。例えば、アドレス変換機能を、「アドレス変換機構」、「アドレス変換部」、或いは「アドレス変換手段」と呼ぶ。このアドレス変換機構として、本明細書における計算機システムでは、オペレーティングシステムが用いる第１のアドレス変換機構とハイパーバイザ及びその他のシステムソフトウェアが用いる第２のアドレス変換機構がある。更に、ＣＰＵやＩ／Ｏデバイスが発するパケットないしはトランザクションをデータと、パケットないしはトランザクションの宛先アドレスを宛先アドレスや宛先ＩＤを宛先データと呼ぶ。 Hereinafter, an embodiment of the present invention will be described with reference to the accompanying drawings. Note that components having the same function are denoted by the same reference symbols throughout the drawings for describing the embodiment, and the repetitive description of the same symbols will be omitted. In this specification, a server is a computer that includes a processor (CPU, Central Processing Unit) that is a processing unit, a memory that is a storage unit, and an I / O hub (IOH) that is an interface unit. is there. Further, in this specification, functional blocks that are elements of each system may be expressed as “mechanism”, “means”, or “part”. For example, the address conversion function is referred to as “address conversion mechanism”, “address conversion unit”, or “address conversion means”. As this address translation mechanism, the computer system in this specification includes a first address translation mechanism used by the operating system and a second address translation mechanism used by the hypervisor and other system software. Furthermore, a packet or transaction issued by the CPU or I / O device is called data, and a destination address of the packet or transaction is called a destination address or destination ID.

図１は、第１の実施例に係る計算機システムの全体構成を示す図である。図１に示す計算機システムは、ＳＭＰサーバ１００、Ｉ／Ｏスイッチ１１０、Ｉ／Ｏデバイス１２０Ａ〜Ｄ、管理サーバ１３０から構成されている。なお、Ｉ／Ｏスイッチ１１０は、Ｉ／Ｏドロワの形に形成されても良いし、集積回路（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ：ＩＣ）の形で構成しても良い。 FIG. 1 is a diagram illustrating an overall configuration of a computer system according to the first embodiment. The computer system shown in FIG. 1 includes an SMP server 100, an I / O switch 110, I / O devices 120A to 120D, and a management server 130. The I / O switch 110 may be formed in the form of an I / O drawer or may be configured in the form of an integrated circuit (IC).

ＳＭＰサーバ１００内の構成は種々の態様が考えられるが、本実施例では近年広く利用されているブレードサーバにおいてＳＭＰサーバを実現する技術であるブレード間ＳＭＰを適用したＳＭＰサーバを例示する。ブレード間ＳＭＰはブレードサーバのブレード１０１Ａ〜Ｄ（なお、それぞれのブレードには＃０〜＃３の番号を付与している。例えば、ブレード１０１Ａはブレード＃０である。）の間を内部インターコネクト１０９で接続することで、複数のブレード１０１Ａ〜Ｄを１台のＳＭＰサーバ１００として利用する技術である。内部インターコネクト１０９としては、例えばＩｎｔｅｌ社のＱｕｉｃｋＰａｔｈＩｎｔｅｒｃｏｎｎｅｃｔや、ＨｙｐｅｒＴｒａｎｓｐｏｒｔが利用される。 Various configurations are conceivable in the SMP server 100. In this embodiment, an SMP server to which an inter-blade SMP, which is a technology for realizing an SMP server in blade servers widely used in recent years, is applied is illustrated. The inter-blade SMP includes blades 101A to 101D of blade servers (numbers # 0 to # 3 are assigned to the respective blades. For example, blade 101A is blade # 0). Is a technique for using a plurality of blades 101A-D as a single SMP server 100. As the internal interconnect 109, for example, QuickPath Interconnect of Hyper and HyperTransport are used.

ブレード１０１Ａ〜Ｄは何れも同じ内部構成をとる。ブレード１０１Ａ〜Ｄはそれぞれ、ＣＰＵ１０２Ａ〜Ｄ、メモリ１０３Ａ〜Ｄ、内部インターコネクト１０４Ａ〜Ｄ、ＩＯＨ１０５Ａ〜Ｄから構成される。 All of the blades 101A to 101D have the same internal configuration. Each of the blades 101A to 101D includes CPUs 102A to 102D, memories 103A to 103D, internal interconnects 104A to 104D, and IOH 105A to 105D.

なお、ブレード内の接続を担う内部インターコネクト１０４Ａ〜Ｄと、ブレード間を接続する内部インターコネクト１０９は一般的に同種のインターコネクト技術が用いられることが多い。ブレード１０１Ａ〜Ｄ各々のＣＰＵ１０２Ａ〜Ｄは少なくとも一個のＣＰＵで構成されている。メモリ１０３Ａ〜Ｄ各々の一部は、ＳＭＰサーバ１００の主記憶メモリとして利用される。 In general, the same type of interconnect technology is often used for the internal interconnects 104A to 104D that are responsible for connection within the blade and the internal interconnect 109 that connects the blades. Each of the CPUs 102A to 102D of the blades 101A to 101D is composed of at least one CPU. A part of each of the memories 103A to 103D is used as a main storage memory of the SMP server 100.

ＩＯＨ１０５Ａ〜ＤはＩ／Ｏインタフェース１１５Ａ〜Ｄ（ＰＣＩＥｘｐｒｅｓｓインタフェース）を提供する。ＣＰＵ１０２Ａ〜ＤがＩ／Ｏデバイス１２０Ａ〜Ｄとの間でデータ転送を行うために発行したロード命令やストア命令などの命令は、ＩＯＨ１０５Ａ〜ＤがＰＣＩＥｘｐｒｅｓｓのパケットに変換して、Ｉ／Ｏインタフェース１１５Ａ〜Ｄに出力する。また、Ｉ／Ｏデバイス１２０Ａ〜Ｄから送信されたＰＣＩＥｘｐｒｅｓｓパケットで指示されるメモリへの書込みや、メモリからの読出しを実現するために、ＩＯＨ１０５Ａ〜Ｄはメモリ１０３Ａ〜Ｄに直接アクセスする（ＤｉｒｅｃｔＭｅｍｏｒｙＡｃｃｅｓｓ：ＤＭＡ）。なお、ＳＭＰサーバ１００においては、単一の主記憶を共有するため、ＩＯＨ１０５Ａ〜Ｄがアクセスすることの可能なメモリは、当該ＩＯＨが存在するブレード上のメモリに限らない。例えばＩＯＨ１０５Ａは、他のブレード上に存在するメモリ１０３Ｃにアクセスすることも可能である。この場合、内部インターコネクト１０４Ａ、１０９、１０４Ｃを経由してアクセスすることになる。 The IOHs 105A to 105D provide I / O interfaces 115A to 115D (PCI Express interfaces). Instructions such as load instructions and store instructions issued by the CPUs 102A to 102D to transfer data to and from the I / O devices 120A to 120D are converted into PCI Express packets by the IOH 105A to D, and the I / O interface Output to 115A-D. In addition, the IOHs 105A to 105D directly access the memories 103A to 103D (Direct Memory) in order to realize writing to and reading from the memory indicated by the PCI Express packet transmitted from the I / O devices 120A to 120D. Access: DMA). Since the SMP server 100 shares a single main memory, the memory that can be accessed by the IOHs 105A to 105D is not limited to the memory on the blade in which the IOH exists. For example, the IOH 105A can access the memory 103C existing on another blade. In this case, access is made via the internal interconnects 104A, 109, and 104C.

ＳＭＰサーバ１００では、ブレード１０１Ａ〜Ｄに跨って単一のオペレーティングシステム、ハイパーバイザ、ないしは、その他のシステムソフトウェアを動作させ、その上でアプリケーションを動作させることができる。例えば、本実施例においては、ＳＭＰサーバ１００上で単一のハイパーバイザを動作させ、その上で仮想マシン（ＶｉｒｔｕａｌＭａｃｈｉｎｅ：ＶＭ）を動作させるものとする。そのために、メモリ１０３Ａ〜Ｄ上に分散して、ハイパーバイザのバイナリと、ＶＭイメージのバイナリが配置されている。 In the SMP server 100, a single operating system, hypervisor, or other system software can be operated across the blades 101A to 101D, and an application can be operated thereon. For example, in this embodiment, it is assumed that a single hypervisor is operated on the SMP server 100 and a virtual machine (Virtual Machine: VM) is operated thereon. For this purpose, the hypervisor binary and the VM image binary are arranged on the memories 103A to 103D.

Ｉ／Ｏスイッチ１１０は、ＳＭＰサーバ１００とＩ／Ｏデバイス１２０Ａ〜Ｄの接続関係を柔軟に切替えるためのスイッチを提供する。Ｉ／Ｏスイッチ１１０は、ＰＣＩＥｘｐｒｅｓｓスイッチで構成されるスイッチ交換部１１１、Ｉ／Ｏコントローラ１１２、オフセット減算部１１３Ａ〜Ｄ、宛先ポート検索部１１４Ａ〜Ｄから構成される。 The I / O switch 110 provides a switch for flexibly switching the connection relationship between the SMP server 100 and the I / O devices 120A to 120D. The I / O switch 110 includes a switch exchanging unit 111 configured with a PCI Express switch, an I / O controller 112, offset subtracting units 113A to 113D, and destination port searching units 114A to 114D.

スイッチ交換部１１１はサーバを接続するポート（以降、サーバ側ポートと称する）を複数個有するスイッチである。また、デバイスを接続するポート（以降、デバイス側ポートと称する）を少なくとも１ポート以上有する。 The switch exchange unit 111 is a switch having a plurality of ports (hereinafter referred to as server-side ports) for connecting servers. In addition, it has at least one port for connecting devices (hereinafter referred to as a device-side port).

このようなスイッチとしては、例えば、非特許文献２のＭＲ−ＩＯＶ規格に準拠したスイッチがある。また、ＭＲ−ＩＯＶ規格に準拠せずとも、複数のサーバ側ポートを有するスイッチであれば本実施例で利用することができる。 As such a switch, for example, there is a switch compliant with the MR-IOV standard of Non-Patent Document 2. Further, a switch having a plurality of server-side ports can be used in this embodiment without conforming to the MR-IOV standard.

Ｉ／Ｏコントローラ１１２は、Ｉ／Ｏスイッチ１１０内の構成要素、すなわちスイッチ交換部１１１、オフセット減算部１１３Ａ〜Ｄ、宛先ポート検索部１１４Ａ〜Ｄを制御するためのコントローラである。Ｉ／Ｏコントローラ１１２は具体的には、ＣＰＵやメモリをワンチップに集積したマイクロコントローラを利用すれば良い。Ｉ／Ｏコントローラ１１２上には、後で詳述するオフセット情報１５０、及び宛先ポートテーブル１５１が格納されている。 The I / O controller 112 is a controller for controlling the components in the I / O switch 110, that is, the switch exchange unit 111, the offset subtraction units 113A to 113D, and the destination port search units 114A to D. Specifically, the I / O controller 112 may use a microcontroller in which a CPU and a memory are integrated on a single chip. On the I / O controller 112, offset information 150 and a destination port table 151, which will be described in detail later, are stored.

管理サーバ１３０は、図１に示す計算機システム全体を管理するためのサーバである。管理サーバ１３０は、ハードウェアとしてはブレードサーバ等を用いて実現できる。また、管理に要する負荷が軽微であれば、マイクロコントローラを利用して実現することも可能である。 The management server 130 is a server for managing the entire computer system shown in FIG. The management server 130 can be realized using a blade server or the like as hardware. If the load required for management is light, it can be realized using a microcontroller.

管理サーバ１３０は管理ネットワーク１３１を経由して、ＳＭＰサーバ１００及びＩ／Ｏコントローラ１１２と接続されている。管理サーバはＳＭＰサーバ１００上で動作するハイパーバイザ、及び、Ｉ／Ｏコントローラ１１２と通信を行い、後述する各種パラメータをハイパーバイザ、ないしは、Ｉ／Ｏコントローラ１１２に指定することができる。また、管理サーバ１３０はＳＭＰサーバ１００の構成情報としてブレードの枚数、メモリ容量等を取得することができる。 The management server 130 is connected to the SMP server 100 and the I / O controller 112 via the management network 131. The management server communicates with the hypervisor operating on the SMP server 100 and the I / O controller 112 and can specify various parameters to be described later to the hypervisor or the I / O controller 112. Further, the management server 130 can acquire the number of blades, the memory capacity, and the like as the configuration information of the SMP server 100.

図１４に本実施例のＳＭＰサーバ１００の主記憶空間を示す。ブレード１０１Ａ〜Ｄが有するメモリ１０３Ａ〜Ｄは、図１４に示す単一の主記憶空間にマップされる。また、ＣＰＵ１０２Ａ〜ＤがＩ／Ｏデバイス１２０Ａ〜Ｄにアクセスする際に用いるＭＭＩＯ（ＭｅｍｏｒｙＭａｐｐｅｄＩｎｐｕｔ／Ｏｕｔｐｕｔ）空間等も配置されている。 FIG. 14 shows the main storage space of the SMP server 100 of this embodiment. The memories 103A to 103D included in the blades 101A to 101D are mapped to a single main storage space shown in FIG. In addition, a memory mapped input / output (MMIO) space used when the CPUs 102A to 102D access the I / O devices 120A to 120D are also arranged.

ＰＣＩＥｘｐｒｅｓｓでは、Ｉ／Ｏデバイス１２０Ａ〜Ｄは、図１４に示す主記憶空間の任意のアドレスに対して、メモリの書込み、ないしは、メモリの読出しを要求するパケットを投げることができる。 In PCI Express, the I / O devices 120A to 120A-D can throw a packet for requesting memory writing or memory reading to an arbitrary address in the main storage space shown in FIG.

また、ＰＣＩＥｘｐｒｅｓｓでは、Ｉ／Ｏデバイス１２０Ａ〜Ｄが有するＭＭＲ(ＭｅｍｏｒｙＭａｐｐｅｄＲｅｇｉｓｔｅｒ）を主記憶空間内にマップすることができ、これをＭＭＩＯ空間と称する。ＣＰＵ１０２Ａ〜ＤはＭＭＩＯ空間に対してロード命令、ないしは、ストア命令を発行することで、ＭＭＲからの読出し、ないしは、ＭＭＲへの書込みを行うことができる。 In PCI Express, an MMR (Memory Mapped Register) included in the I / O devices 120A to 120D can be mapped in the main storage space, which is referred to as an MMIO space. The CPUs 102A to 102D can read from the MMR or write to the MMR by issuing a load instruction or a store instruction to the MMIO space.

ＰＣＩＥｘｐｒｅｓｓのバストポロジーでは、Ｂｕｓ＃（バス番号、８ｂｉｔ）、Ｄｅｖ＃（デバイス番号、５ｂｉｔ）、Ｆｕｎ＃（ファンクション番号、３ｂｉｔ）の３種の番号の組合せ（合計１６ｂｉｔ）で、Ｉ／Ｏデバイスや経路を指定する。なお、Ｄｅｖ＃とＦｕｎ＃に関しては、Ｉ／Ｏデバイス内に複数の機能を有する時の指定に用いられるものである。本実施例においては、Ｉ／Ｏデバイス内の動作は影響しないので、Ｄｅｖ＃とＦｕｎ＃に関しては説明を割愛する。 In the PCI Express bus topology, an I / O device is a combination of three numbers (Bus # (bus number, 8 bits), Dev # (device number, 5 bits), Fun # (function number, 3 bits)) (16 bits in total). And specify the route. Note that Dev # and Fun # are used for designation when the I / O device has a plurality of functions. In the present embodiment, the operation in the I / O device has no effect, and thus description of Dev # and Fun # is omitted.

図９に、図１に示す計算機システムに対して本発明を適用せずに、ＰＣＩＥｘｐｒｅｓｓ規格にのみ則って利用する場合のバスツリーを示す。図９において、ＳＭＰサーバ１００から４本の線がＩ／Ｏスイッチ１１０に接続されているが、これは図１で示す物理的なコンポーネントとしてはＩ／Ｏインタフェース１１５Ａ〜Ｄと対応している。また、同様にＩ／ＯスイッチからＩ／Ｏデバイスに接続する線は、それぞれ図１のＩ／Ｏインタフェース１２１Ａ〜Ｄ、Ｉ／Ｏデバイスはそれぞれ図１のＩ／Ｏデバイス１２０Ａ〜Ｄに対応している。 FIG. 9 shows a bus tree when the present invention is not applied to the computer system shown in FIG. 1 and the computer system is used only in conformity with the PCI Express standard. In FIG. 9, four lines from the SMP server 100 are connected to the I / O switch 110, which corresponds to the I / O interfaces 115A to 115D as physical components shown in FIG. Similarly, the lines connecting the I / O switches to the I / O devices correspond to the I / O interfaces 121A to 121D in FIG. 1, respectively, and the I / O devices correspond to the I / O devices 120A to 120D in FIG. ing.

ＰＣＩＥｘｐｒｅｓｓではバスツリーの頂点（ルート）から、深さ優先方向にＢｕｓ＃を付与していく。Ｂｕｓ＃は図９に示すようにＩ／Ｏインタフェースに対して付与されていく。途中経路では単一のＢｕｓ＃に複数のブリッジ（ツリーの中継点）が接続されるが、末端（リーフ）ではブリッジ１個につき1個のＩ／Ｏデバイスしかつなげることが出来ない。そのため、Ｉ／Ｏデバイスが直に接続されるＩ／ＯインタフェースのＢｕｓ＃を、当該Ｉ／Ｏデバイスを識別する識別子として見做すことが出来る。例えば、Ｉ／Ｏインタフェース１２１ＡのＢｕｓ＃はＢｕｓ＃３であるが、Ｉ／Ｏデバイス１２０Ａを示す識別子としてＢｕｓ＃３を用いる。 In PCI Express, Bus # is assigned in the depth priority direction from the apex (root) of the bus tree. The Bus # is assigned to the I / O interface as shown in FIG. On the way, a plurality of bridges (tree relay points) are connected to a single Bus #, but at the end (leaf), one I / O device per bridge cannot be connected. Therefore, the Bus # of the I / O interface to which the I / O device is directly connected can be regarded as an identifier for identifying the I / O device. For example, the Bus # of the I / O interface 121A is Bus # 3, but Bus # 3 is used as an identifier indicating the I / O device 120A.

ＰＣＩＥｘｐｒｅｓｓでは、ＣＰＵ１０２Ａ〜Ｄから、Ｉ／Ｏデバイス１２０Ａ〜Ｄに対してパケットを発行する時、Ｉ／Ｏデバイス１２０Ａ〜Ｄが有するＭＭＲがマッピングされているＭＭＩＯ空間のアドレスを宛先として指定するＭＭＩＯリクエストと、Ｉ／Ｏデバイス１２０Ａ〜Ｄを示す識別子と、当該デバイス内の４ＫＢ空間（Ｃｏｎｆｉｇｕｒａｔｉｏｎ空間）のアドレスの組合せを宛先として指定するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストの２種類がある。 In PCI Express, when a packet is issued from the CPU 102A to D to the I / O devices 120A to 120D, the MMIO that specifies the address of the MMIO space to which the MMR of the I / O devices 120A to 120D is mapped as a destination There are two types of configuration requests: a request, an identifier indicating the I / O devices 120A to 120D, and a configuration request that designates a combination of addresses in a 4 KB space (Configuration space) in the device as a destination.

Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストは、Ｂｕｓ＃（さらにＩ／Ｏデバイスが複数の機能を有する場合にはＤｅｖ＃、Ｆｕｎ＃も組合せる）とＣｏｎｆｉｇｕｒａｔｉｏｎ空間アドレス（１２ｂｉｔ）の組合せで宛先を表現する。ＣＰＵ１０２Ａ〜ＤからＩ／Ｏデバイス１２０Ａ〜Ｄまでの間は、Ｂｕｓ＃を手掛かりにルーティングを行う。そして、Ｉ／Ｏデバイス１２０Ａ〜Ｄまで到達した後は、単一の機能だけを有するデバイスの場合にはＣｏｎｆｉｇｕｒａｔｉｏｎ空間アドレス、複数の機能を有するデバイスの場合にはＤｅｖ＃、Ｆｕｎ＃、及び、Ｃｏｎｆｉｇｕｒａｔｉｏｎ空間アドレスの組合せで、Ｉ／Ｏデバイス内で最終的に到達すべき箇所が決定される。 The Configuration request expresses the destination by a combination of Bus # (further combining Dev # and Fun # if the I / O device has a plurality of functions) and Configuration space address (12 bits). Routing from the CPUs 102A to 102D to the I / O devices 120A to 120D is performed using Bus # as a clue. Then, after reaching the I / O devices 120A to 120D, a configuration space address in the case of a device having only a single function, Dev #, Fun #, and configuration in the case of a device having a plurality of functions. The location to be finally reached in the I / O device is determined by the combination of the space addresses.

Ｂｕｓ＃を手掛かりにしたルーティングは、図９に示すような、Ｂｕｓ＃が深さ優先で付与されたツリー状のトポロジーであれば容易に行うことが出来る。ＳＭＰサーバ１００内では、宛先のＢｕｓ＃を見て、Ｂｕｓ＃１〜６３はＩ／Ｏインタフェース１１５Ａに、Ｂｕｓ＃６５〜１２７はＩ／Ｏインタフェース１１５Ｂに、Ｂｕｓ＃１２９〜１９１はＩ／Ｏインタフェース１１５Ｃに、Ｂｕｓ＃１９３〜２５５はＩ／Ｏインタフェース１１５Ｄに出力する。このように、各ブリッジには、そのブリッジの配下にあるＢｕｓ＃の範囲が記録されており、それを手掛かりにルーティングすることができる。Ｉ／Ｏスイッチ１１０内でも同様にルーティングを行うことで、Ｉ／Ｏデバイス１２０Ａ〜Ｄに到達することができる。 The routing using the Bus # as a clue can be easily performed in a tree-like topology in which the Bus # is given with depth priority as shown in FIG. In the SMP server 100, looking at the destination Bus #, Bus # 1 to 63 are the I / O interface 115A, Bus # 65 to 127 are the I / O interface 115B, and Bus # 129 to 191 are the I / O interface. In 115C, Bus # 193-255 outputs to the I / O interface 115D. In this way, the range of Bus # under the bridge is recorded in each bridge, and routing can be performed using this as a clue. By performing routing in the I / O switch 110 in the same manner, it is possible to reach the I / O devices 120A to 120D.

なお、ＣｏｎｆｉｇｕｒａｔｉｏｎリクエストとＭＭＩＯリクエストはＣＰＵ１０２Ａ〜Ｄから見ると、いずれも主記憶空間へのロード命令、ないしは、ストア命令として発行される。主記憶空間中のＭＭＩＯ空間へのロード命令、ないしは、ストア命令の発行がＭＭＩＯリクエストを発生させる。また、主記憶空間中にはＣｏｎｆｉｇｕｒａｔｉｏｎリクエストを発生させるための特別な空間（ＰＣＩＥｘｐｒｅｓｓＥｎｈａｎｃｅｄＣｏｎｆｉｇｕｒａｔｉｏｎＡｃｃｅｓｓＭｅｃｈａｎｉｓｍ空間、ＥＣＡＭ空間）が用意されており、この空間に対するロード命令、ないしは、ストア命令の発行がＣｏｎｆｉｇｕｒａｔｉｏｎリクエストを発生させる。いずれも、ＣＰＵ１０２Ａ〜Ｄから見ると、主記憶空間内のメモリへのアクセスと区別が無いことに留意されたい。 Note that the Configuration request and the MMIO request are both issued as a load instruction to the main storage space or a store instruction when viewed from the CPUs 102A to 102D. Issuing a load instruction or a store instruction to the MMIO space in the main memory space generates an MMIO request. In addition, a special space (PCI Express Enhanced Configuration Mechanism space, ECAM space) for generating a Configuration request is prepared in the main memory space, and a load instruction or a store instruction issuance for this space is issued. Generate a request. It should be noted that none of them is distinguished from access to the memory in the main storage space when viewed from the CPUs 102A to 102D.

ＭＭＩＯリクエストは、主記憶空間のアドレス（近年のサーバでは６４ｂｉｔが多い。旧来のサーバでは３２ｂｉｔのものも存在する。）で宛先を指定する。Ｂｕｓ＃を手掛かりにしたルーティングと同様に、図９に示す各ブリッジには、当該ブリッジの配下にあるアドレスの範囲が記録されており、これを手掛かりにルーティングを行っている。ルーティングの動作としてはＭＭＩＯリクエストも、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストも類似しているため、以降の説明ではＣｏｎｆｉｇｕｒａｔｉｏｎリクエストを事例とする。 The MMIO request designates a destination by an address of the main storage space (in recent servers, there are many 64 bits. In conventional servers, there are also 32 bits). Similar to the routing using Bus # as a clue, each bridge shown in FIG. 9 records the range of addresses under the bridge, and performs routing using this as a clue. Since the MMIO request and the configuration request are similar as routing operations, the configuration request is an example in the following description.

図１０に、図９のバスツリーにおいて、ＳＭＰサーバ１００からＩ／Ｏデバイス１２０Ｂ（Ｂｕｓ＃４）にアクセスする際の経路を示す。この経路は、Ｂｕｓ＃４宛のロード命令、ないしは、ストア命令がＣＰＵ１０２Ａ〜Ｄ（ブレード１０１Ａ〜Ｄ、ブレード番号では＃０〜＃３）のどこで実行されたとしても不変である。 FIG. 10 shows a path for accessing the I / O device 120B (Bus # 4) from the SMP server 100 in the bus tree of FIG. This path is unchanged regardless of where the load instruction or the store instruction addressed to Bus # 4 is executed in the CPUs 102A to 102D (blades 101A to D, blade numbers # 0 to # 3).

図１０で示した経路はバスツリーという論理的なレイヤーでのものであったが、物理的な経路で表現すると、図１１で示すような経路になる。図１１では、ＣＰＵ１０２Ａ（ブレード１０１Ａ、＃０）からＩ／Ｏデバイス１０２Ｂ（Ｂｕｓ＃４）にアクセスする経路を破線、ＣＰＵ１０２Ｃ（ブレード１０１Ｃ、＃２）からＩ／Ｏデバイス１０２Ｂ（Ｂｕｓ＃４）にアクセスする経路を実線で示している。図９に示すバスツリーではいずれも同じ経路を通るが、図１０に示す物理的な経路では、ロード命令、ないしは、ストア命令が発行されるＣＰＵが異なるため、途中までは異なる経路を通る。 The route shown in FIG. 10 is a logical layer called a bus tree. However, when expressed by a physical route, the route is as shown in FIG. In FIG. 11, the path from the CPU 102A (blade 101A, # 0) to the I / O device 102B (Bus # 4) is indicated by a broken line, and the CPU 102C (blade 101C, # 2) to the I / O device 102B (Bus # 4). The route to be accessed is indicated by a solid line. In the bus tree shown in FIG. 9, all pass the same route. However, in the physical route shown in FIG. 10, the CPU to which the load instruction or the store instruction is issued is different, so the route is different until halfway.

図１１において、ＣＰＵ１０２ＣからＩ／Ｏデバイス１０２Ｂへのアクセスでは内部インターコネクト１０９を経由している。内部インターコネクト１０９は、Ｉ／Ｏデバイスへのアクセスの他にも、キャッシュコヒーレンス等多くのトラフィックが流れており、ＳＭＰサーバ１００全体のパフォーマンスを左右するため、内部インターコネクト１０９のトラフィックは出来るだけ回避することが望ましい。また、ＣＰＵ１０２Ｃのみならず、ＣＰＵ１０２Ｂ、１０２Ｄも同時にＩ／Ｏデバイス１２０Ｂに対してアクセスしようとすると、内部インターコネクト１０９をより一層混雑させてしまう。 In FIG. 11, the CPU 102C accesses the I / O device 102B via the internal interconnect 109. In addition to access to I / O devices, the internal interconnect 109 has a lot of traffic such as cache coherence, which affects the overall performance of the SMP server 100. Therefore, avoid traffic on the internal interconnect 109 as much as possible. Is desirable. Further, if not only the CPU 102C but also the CPUs 102B and 102D try to access the I / O device 120B at the same time, the internal interconnect 109 is further congested.

この計算機システムでは、ブレード１０１ＣとＩ／Ｏスイッチ１１０間のＩ／Ｏインタフェース１１５Ｃも存在するので、ＣＰＵ１０２ＣからＩ／Ｏデバイス１０２Ｂへのアクセスは、Ｉ／Ｏインタフェース１１５Ｃを経由すれば、内部インターコネクト１０９の混雑を回避することができる。 In this computer system, since there is also an I / O interface 115C between the blade 101C and the I / O switch 110, access from the CPU 102C to the I / O device 102B is via the internal interconnect 109 via the I / O interface 115C. Can be avoided.

しかし、図９で示したバストポロジーでＩ／Ｏインタフェース１１５Ｃを経由するようなアクセスを行うためには、Ｉ／Ｏデバイス１０２ＢはＢｕｓ＃１２９〜１９１でなければならない。例えば、Ｉ／Ｏデバイス１０２ＢをＢｕｓ＃１３２に置く必要が出てくる。すると、今度は、ＣＰＵ１０２Ａ〜ＤからＩ／Ｏデバイス１０２Ｂへのアクセスは全てＩ／Ｏインタフェース１１５Ｃに集中することになり、ＣＰＵ１０２Ｃからのアクセスに起因する混雑は解消しても、ＣＰＵ１０２Ａ、Ｂ、Ｄからのアクセスに起因する混雑が発生することになる。 However, in order to perform access via the I / O interface 115C in the bus topology shown in FIG. 9, the I / O device 102B must be Bus # 129-191. For example, the I / O device 102B needs to be placed on the Bus # 132. In this case, all accesses from the CPUs 102A to 102D to the I / O device 102B are concentrated on the I / O interface 115C, and even if the congestion caused by the access from the CPU 102C is eliminated, the CPUs 102A, B, D Congestion due to access from the network will occur.

そこで、本実施例では、図１のオフセット演算部１１３Ａ〜Ｄ、宛先ポート検索部１１４Ａ〜Ｄ、Ｉ／Ｏコントローラ１１２、オフセット情報１５０、宛先ポートテーブル１５１、管理サーバ１３０、及び、ＳＭＰサーバ１００のメモリ１０３Ａ〜Ｄ上にロードされているハイパーバイザ、ないしは、その他のシステムソフトウェア、及び、ＥＰＴ（ＥｘｔｅｎｄｅｄＰａｇｅＴａｂｌｅ）、ないしは、その他のアドレス変換機構を用いて、上記の混雑を解消する手段を提供する。 Therefore, in this embodiment, the offset calculation units 113A to 113D, the destination port search units 114A to 114D, the I / O controller 112, the offset information 150, the destination port table 151, the management server 130, and the SMP server 100 in FIG. A hypervisor loaded on the memories 103A to 103D or other system software, and an EPT (Extended Page Table) or other address translation mechanism are used to provide means for solving the above congestion. .

ここで、ＥＰＴ等のアドレス変換機構による通常の手法により、各ブレード１０１Ａ、＃０〜１０１Ｄ、＃０３各々において、所望のオフセットが設定される。このアドレス変換機構は、各ブレード１０１Ａ、＃０〜１０１Ｄ、＃０３のＣＰＵがＩ／Ｏデバイスに対するロード命令ないしはストア命令などの命令を発行したときに、各ブレード１０１Ａ、＃０〜１０１Ｄ、＃０３各々のＣＰＵ毎に異なるオフセット値をロード命令ないしはストア命令に起因して発生するパケット、ないしはトランザクションの宛先アドレス等の宛先データに加算する。このオフセット値は、Ｉ／Ｏコントローラ１１２に予めオフセット情報１５０として記憶される。 Here, a desired offset is set in each of the blades 101A, # 0 to 101D, and # 03 by a normal method using an address translation mechanism such as EPT. This address translation mechanism is configured so that when the CPUs of the blades 101A, # 0 to 101D, and # 03 issue instructions such as a load instruction or a store instruction for the I / O device, the blades 101A, # 0 to 101D, and # 03 A different offset value for each CPU is added to destination data such as a packet generated due to a load instruction or a store instruction, or a destination address of a transaction. This offset value is stored in advance as offset information 150 in the I / O controller 112.

図２は、図１のＩ／Ｏスイッチ１１０が有するオフセット減算部１１３Ａ〜Ｄの詳細な構成を示すブロック図である。オフセット減算部１１３Ａ〜Ｄはサーバ側ポートに設けられる構成要素であり、サーバ側ポート１ポートに対して１個のオフセット減算部を設ける。本実施例では、サーバ側ポートはＩ／Ｏインタフェース１１５Ａ〜Ｄの４ポートであるため、その数に合わせてオフセット減算部１１３Ａ〜Ｄの４個を有する。 FIG. 2 is a block diagram showing a detailed configuration of the offset subtraction units 113A to 113D included in the I / O switch 110 of FIG. The offset subtracting units 113A to 113D are components provided in the server side port, and one offset subtracting unit is provided for one server side port. In this embodiment, since the server side ports are the four ports of the I / O interfaces 115A to 115D, there are four offset subtracting units 113A to 113D according to the number.

オフセット減算部１１３Ａ〜Ｄは、Ｉ／Ｏインタフェース１１５Ａ〜ＤでＳＭＰサーバ１００と接続され、Ｉ／Ｏインタフェース１１６Ａ〜Ｄでスイッチ交換部１１１と接続される。また、Ｉ／Ｏコントローラが有するオフセット情報１５０を、信号１１８として受け取っている。そして、このオフセット情報１５０は、Ｉ／Ｏインタフェース１１５Ａ〜Ｄから入力されるＳＭＰサーバ発、Ｉ／Ｏデバイス宛パケットの宛先を書き換えて、Ｉ／Ｏインタフェース１１６Ａ〜Ｄから出力するために利用される。 The offset subtraction units 113A to 113D are connected to the SMP server 100 via I / O interfaces 115A to 115D, and are connected to the switch exchange unit 111 via I / O interfaces 116A to 116D. Further, offset information 150 included in the I / O controller is received as a signal 118. This offset information 150 is used to rewrite the destination of the packet addressed to the I / O device from the SMP server input from the I / O interfaces 115A to 115D and output it from the I / O interfaces 116A to 116D. .

オフセット減算部１１３Ａ〜Ｄは、Ｉ／Ｏインタフェース１１６Ａ〜Ｄから入力されたＩ／Ｏデバイス発サーバ宛パケット２２０は素通しして、Ｉ／Ｏインタフェース１１５Ａ〜Ｄから出力する。 The offset subtracting units 113A to 113D pass the I / O device originating server-addressed packet 220 input from the I / O interfaces 116A to 116D and output from the I / O interfaces 115A to 115D.

一方、Ｉ／Ｏインタフェース１１５Ａ〜Ｄから入力されたサーバ発Ｉ／Ｏデバイス宛パケット２３０は、当該パケットの宛先（ＭＭＩＯリクエストのパケットの場合には宛先のアドレス、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストのパケットの場合には宛先のＢｕｓ#）を書き換えて、Ｉ／Ｏインタフェース１１６Ａ〜Ｄから出力する。 On the other hand, the server-originated I / O device-addressed packet 230 input from the I / O interfaces 115A to 115D is the destination of the packet (destination address in the case of a MMIO request packet, destination in the case of a Configuration request packet). (Bus #) is rewritten and output from the I / O interfaces 116A to 116D.

サーバ発Ｉ／Ｏデバイス宛パケット２３０は、パケット解析部２１１で解析される。パケット２３０のデータペイロード等はデータ２３３として、そのままパケット組当部２１２に送られる。宛先情報２３２は減算器２１４に入力される。なお、宛先情報２３２は前述した通り、６４ｂｉｔのアドレス（ＭＭＩＯリクエスト）と、８ｂｉｔのＢｕｓ＃（Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエスト）の場合がある。パケット２３０がＭＭＩＯリクエストであるかＣｏｎｆｉｇｕｒａｔｉｏｎリクエストであるかはパケット解析部２１１が解析し、リクエスト種情報２３１としてセレクタ２１３に提供する。 The server-originated I / O device-addressed packet 230 is analyzed by the packet analysis unit 211. The data payload and the like of the packet 230 are sent as data 233 to the packet allocation unit 212 as they are. The destination information 232 is input to the subtracter 214. Note that the destination information 232 may be a 64-bit address (MMIO request) and an 8-bit Bus # (Configuration request) as described above. The packet analysis unit 211 analyzes whether the packet 230 is an MMIO request or a configuration request, and provides the request type information 231 to the selector 213.

セレクタ２１３は、信号１１８でオフセット情報１５０を受け取り、オフセット情報１５０のうち、当該オフセット減算部１１３Ａ〜Ｄが担当しているＩ／Ｏインタフェース１１５Ａ〜Ｄのサーバ側ポート番号に対応するエントリを選択する。さらに、リクエスト種情報２３１を元に、ＭＭＩＯリクエストであればＭＭＩＯアドレスオフセットを選択して、オフセット値２３６として減算器２３４に入力する。 The selector 213 receives the offset information 150 by the signal 118, and selects an entry corresponding to the server side port number of the I / O interface 115A-D that the offset subtracting unit 113A-D is in charge of from the offset information 150. . Further, based on the request type information 231, if it is an MMIO request, an MMIO address offset is selected and input to the subtracter 234 as an offset value 236.

減算器２３４は、宛先情報２３２からオフセット値２３６を減じた結果を宛先情報２３４としてパケット組立部２１２に入力する。演算を式で示すと、（宛先情報２３４）＝（宛先情報２３２）−（オフセット値２３６）である。 The subtractor 234 inputs the result obtained by subtracting the offset value 236 from the destination information 232 as the destination information 234 to the packet assembling unit 212. Expressing the calculation as an expression, (destination information 234) = (destination information 232) − (offset value 236).

パケット組立部２１２は、宛先情報２３４とデータ２３３からパケット２３５を組み立てて、Ｉ／Ｏインタフェース１１６Ａ〜Ｄに出力する。 The packet assembling unit 212 assembles the packet 235 from the destination information 234 and the data 233 and outputs the packet 235 to the I / O interfaces 116A to 116D.

図３は、図１のＩ／Ｏスイッチ１１０が有する宛先ポート検索部１１４Ａ〜Ｄの詳細な構成を示すブロック図である。宛先ポート検索部１１４Ａ〜Ｄはデバイス側ポートに設けられる構成要素であり、デバイス側ポート1ポートに対して1個の宛先ポート検索部を設ける。本実施例では、デバイス側ポートはＩ／Ｏインタフェース１２１Ａ〜Ｄの４ポートであるため、その数に合わせて宛先ポート検索部１１４Ａ〜Ｄの４個を有する。 FIG. 3 is a block diagram illustrating a detailed configuration of the destination port search units 114A to 114D included in the I / O switch 110 of FIG. The destination port search units 114A to 114D are components provided in the device side port, and one destination port search unit is provided for one device side port. In this embodiment, since the device side ports are the four ports of the I / O interfaces 121A to 121D, there are four destination port search units 114A to 114D corresponding to the number of the device side ports.

宛先ポート検索部１１４Ａ〜Ｄは、Ｉ／Ｏインタフェース１１７Ａ〜Ｄから入力されたサーバ発Ｉ／Ｏデバイス宛パケット３２０は素通しして、Ｉ／Ｏインタフェース１２１Ａ〜Ｄから出力する。 The destination port searching units 114A to 114D pass through the server-originated I / O device-addressed packet 320 input from the I / O interfaces 117A to 117D and output from the I / O interfaces 121A to 121D.

一方、Ｉ／Ｏインタフェース１２１Ａ〜Ｄから入力されたＩ／Ｏデバイス発サーバ宛パケットは、当該パケットの宛先（Ｉ／Ｏデバイス発サーバ宛パケットは主記憶空間のアドレスを宛先としている）を元に、ＰＣＩＥｘｐｒｅｓｓスイッチが当該パケットを出力すべきＩ／Ｏインタフェース１１６Ａ〜Ｄを決定する。すなわち、Ｉ／Ｏデバイス発サーバ宛パケットの宛先アドレスを元に、宛先となるサーバ（ブレード）を決定する。 On the other hand, the packet addressed to the I / O device originating server input from the I / O interfaces 121A to 121D is based on the destination of the packet (the packet addressed to the I / O device originating server is addressed to the address of the main storage space). The PCI Express switch determines the I / O interfaces 116A to 116D to output the packet. In other words, the server (blade) as the destination is determined based on the destination address of the packet addressed to the I / O device originating server.

Ｉ／Ｏデバイス発サーバ宛パケット３３０は、パケット解析部３１１にて、宛先アドレス３３２を抽出する。宛先アドレス３３２はテーブル参照部３１２に入力される。一方、パケット３３０はそのまま改変なくパケット３３１として、パケット組立部３１３に入力される。 The packet analysis unit 311 extracts the destination address 332 of the packet 330 addressed to the I / O device originating server. The destination address 332 is input to the table reference unit 312. On the other hand, the packet 330 is directly input to the packet assembling unit 313 as a packet 331 without modification.

テーブル参照部３１２は、信号１１９で宛先ポートテーブル１５１を参照し、テーブル参照部３１２に入力されている宛先アドレス３３２に対応するエントリを検索する。すなわち、（ベースアドレス）≦（宛先アドレス３３２）＜（ベースアドレス＋サイズ）という条件を満たすエントリを検索し、そのエントリに対応する宛先ポート番号を、宛先ポート番号３３３として出力する。 The table reference unit 312 refers to the destination port table 151 using the signal 119 and searches for an entry corresponding to the destination address 332 input to the table reference unit 312. That is, an entry satisfying the condition (base address) ≦ (destination address 332) <(base address + size) is searched, and the destination port number corresponding to the entry is output as the destination port number 333.

なお、宛先アドレス３３２が何れのエントリにも対応しなかった場合には、デフォルトの宛先ポート番号を宛先ポート番号３３３として出力する。本実施例では、デフォルトの宛先ポート番号は０（Ｉ／Ｏインタフェース１１６Ａ、すなわち、ブレード１１０Ａ（＃０）に対応する）とする。 When the destination address 332 does not correspond to any entry, the default destination port number is output as the destination port number 333. In this embodiment, the default destination port number is 0 (corresponding to the I / O interface 116A, that is, the blade 110A (# 0)).

パケット組当部３１３は、パケット３３１に宛先ポート番号３３３を付与して、パケット３３４としてインタフェース１１３Ａ〜Ｄから出力する。 The packet assembling unit 313 assigns the destination port number 333 to the packet 331 and outputs the packet 334 from the interfaces 113A to 113D.

なお、宛先ポート番号はＭＲ−ＩＯＶ対応のＰＣＩＥｘｐｒｅｓｓスイッチを用いている場合には、ＭＲ−ＩＯＶ規格で定義されているＶＨ（ＶｉｒｔｕａｌＨｉｅｒａｒｃｈｙ）を用いることができる。また、パケットに宛先ポート番号（ＶＨ)を付与するための機構も、ＴＬＰＰｒｅｆｉｘとしてＭＲ−ＩＯＶ規格で定義されている。 As the destination port number, when a PCI Express switch compatible with MR-IOV is used, VH (Virtual Hierarchy) defined in the MR-IOV standard can be used. A mechanism for assigning a destination port number (VH) to a packet is also defined in the MR-IOV standard as TLP Prefix.

また、本実施例では、ＭＲ−ＩＯＶ規格でなくても、独自のルーティング機構を採用しているＰＣＩＥｘｐｒｅｓｓスイッチを利用しても良い。その場合、当該スイッチの定めるプロトコルで宛先ポート番号を付与する。 In the present embodiment, a PCI Express switch that employs a unique routing mechanism may be used instead of the MR-IOV standard. In that case, the destination port number is given by the protocol determined by the switch.

図４に、Ｉ／Ｏコントローラ１１２に記憶される、本実施例のオフセット情報１５０の構成の一例を示す。同図に示すように、オフセット情報１５０は、ＭＭＩＯアドレスオフセットテーブル４１０と、ＩＤオフセットテーブル４２０から構成される。いずれのテーブルも、サーバ側ポートの数と同数のエントリから構成される。すなわち、本実施例の場合は４ポートであるため、ＭＭＩＯアドレスオフセット４エントリと、ＩＤオフセット４エントリの合計８エントリでオフセット情報１５０は構成される。 FIG. 4 shows an example of the configuration of the offset information 150 of this embodiment stored in the I / O controller 112. As shown in the figure, the offset information 150 includes an MMIO address offset table 410 and an ID offset table 420. Each table includes the same number of entries as the number of server-side ports. That is, in this embodiment, since there are 4 ports, the offset information 150 is composed of a total of 8 entries including 4 entries for MMIO address offset and 4 entries for ID offset.

ＭＭＩＯアドレスオフセットは６４ｂｉｔのエントリで、サーバ側ポート番号に対応するポート（Ｉ／Ｏインタフェース１１５Ａ〜Ｄ）からＩ／Ｏスイッチ１１０に入力された、サーバ発Ｉ／Ｏデバイス宛パケット（ＭＭＩＯリクエスト）の宛先アドレスから減算すべきオフセット量を指示する。ハードウェアとしてはレジスタやフリップフロップとして実装される。 The MMIO address offset is a 64-bit entry, and the server-originated I / O device addressed packet (MMIO request) input to the I / O switch 110 from the port (I / O interface 115A to 115D) corresponding to the server-side port number. Indicates the offset amount to be subtracted from the destination address. As hardware, it is implemented as a register or flip-flop.

ＩＤオフセットは８ｂｉｔエントリで、ＭＭＩＯアドレスオフセットと同様に、サーバ側ポート番号に対応するポート（Ｉ／Ｏインタフェース１１５Ａ〜Ｄ）からＩ／Ｏスイッチ１１０に入力された、サーバ発Ｉ／Ｏデバイス宛パケット（Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエスト）の宛先Ｂｕｓ＃から減算すべきオフセット量を指示する。実装形態はＭＭＩＯアドレスオフセットと同様に、レジスタやフリップフロップを用いる。 The ID offset is an 8-bit entry, and, similar to the MMIO address offset, the packet addressed to the server-originated I / O device input to the I / O switch 110 from the port (I / O interface 115A to 115D) corresponding to the server-side port number The offset amount to be subtracted from the destination Bus # of the (Configuration request) is indicated. The mounting form uses registers and flip-flops in the same manner as the MMIO address offset.

図５に図１の宛先ポートテーブル１５１の構成を示す。宛先ポートテーブルは（有効ビット、ベースアドレス、サイズ、宛先ポート番号）を１エントリとして、最低１エントリ以上で構成される。有効ビットが立っている（有効ビット＝１）エントリのみが用いられる。 FIG. 5 shows the configuration of the destination port table 151 of FIG. The destination port table is composed of at least one entry, with (valid bit, base address, size, destination port number) as one entry. Only entries with a valid bit set (valid bit = 1) are used.

ベースアドレスは主記憶空間が６４ｂｉｔのＳＭＰサーバ１００の場合、６４ｂｉｔのベースアドレスとなる。サイズはベースアドレスを起点とした領域を示すサイズである。一般的に、主記憶空間上の領域のサイズは２の冪乗で表わされることが多いので、本実施例はサイズは２＾ＮバイトのＮを表す６ｂｉｔの数値とした。この場合、２＾０バイト〜２＾６３バイトのサイズを表現することができるため、６４ｂｉｔの主記憶空間で用いるのに十分な大きさである。 In the case of the SMP server 100 having a 64-bit main storage space, the base address is a 64-bit base address. The size is a size indicating an area starting from the base address. In general, since the size of the area on the main storage space is often expressed by a power of 2, in this embodiment, the size is a 6-bit numerical value representing N of 2 ^ N bytes. In this case, since a size of 2 ^ 0 bytes to 2 ^ 63 bytes can be expressed, the size is sufficient for use in a 64-bit main storage space.

宛先ポート番号は、ベースアドレスとサイズで示された領域に対するＩ／Ｏデバイス発サーバ宛パケットを出力すべきサーバ側ポートのポート番号を指定するものである。本実施例では、サーバ側ポートは４ポートであるため、宛先ポート番号は２ｂｉｔあれば表現可能である。 The destination port number specifies the port number of the server side port to which the packet addressed to the I / O device originating server for the area indicated by the base address and size is to be output. In this embodiment, since the server side port is 4 ports, the destination port number can be expressed by 2 bits.

図１２に、図１の計算機システムにおけるバスツリー状の経路の一例を示す。図１２では、図１１と同様にＣＰＵ１０２Ａ（ブレード１０１Ａ、＃０）からＩ／Ｏデバイス１０２Ｂ（Ｂｕｓ＃４）にアクセスする経路を破線、ＣＰＵ１０２Ｃ（ブレード１０１Ｃ、＃２）にアクセスする経路を実線で示している。 FIG. 12 shows an example of a bus tree-like route in the computer system of FIG. In FIG. 12, the path for accessing the I / O device 102B (Bus # 4) from the CPU 102A (blade 101A, # 0) from the CPU 102A (blade 101A, # 0) is indicated by a broken line, and the path for accessing the CPU 102C (blade 101C, # 2) is indicated by the solid line. Show.

破線で示されるブレード＃０発Ｂｕｓ＃４宛の経路は図１１と同じである。しかし、実線で示されるブレード＃２発Ｂｕｓ＃４宛の経路は図１１とは異なり、Ｂｕｓ＃１２９（Ｉ／Ｏインタフェース１１５Ｃ）を経由している。図１２で示す経路を、物理的な構成要素上で示したのが図１３である。これを見ると、図１１で問題となっていた内部インタフェース１０９やＢｕｓ＃１（Ｉ／Ｏインタフェース１１５Ａ）の混雑を回避することができる。 The route from blade # 0 to Bus # 4 indicated by the broken line is the same as in FIG. However, the path from blade # 2 to Bus # 4 indicated by the solid line is different from that in FIG. 11 and passes through Bus # 129 (I / O interface 115C). FIG. 13 shows the paths shown in FIG. 12 on physical components. Looking at this, it is possible to avoid congestion of the internal interface 109 and Bus # 1 (I / O interface 115A), which was a problem in FIG.

図１２で行われている動作を、図１に示す本実施例の構成要素の動作を用いて説明する。ブレード＃０のＣＰＵ１０２ＡがＢｕｓ＃４宛のＣｏｎｆｉｇｕｒａｔｉｏｎリクエストを発行する時には、Ｂｕｓ＃４に対応した主記憶空間のアドレスに対するロード命令、ストア命令を発行する。同様に、ブレード＃２のＣＰＵ１０２Ｃもロード命令、ストア命令を発行する。 The operation performed in FIG. 12 will be described using the operation of the components of this embodiment shown in FIG. When the CPU 102A of blade # 0 issues a configuration request addressed to Bus # 4, it issues a load instruction and a store instruction for the address of the main storage space corresponding to Bus # 4. Similarly, the CPU 102C of blade # 2 issues a load instruction and a store instruction.

この時、本実施例では、ＣＰＵ１０２Ａのロード命令、ストア命令はそのまま発行させるが、ＣＰＵ１０２Ｃのロード命令、ストア命令は、メモリ１０３Ｃ内のＥＰＴを用いてアドレス変換を行うことで、Ｂｕｓ＃４に対するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストをＢｕｓ＃１３２（Ｂｕｓ＃４＋オフセット＃１２８）に対するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストに変換する。 At this time, in this embodiment, the load instruction and the store instruction of the CPU 102A are issued as they are, but the load instruction and the store instruction of the CPU 102C are configured for the Bus # 4 by performing address conversion using the EPT in the memory 103C. The request is converted into a configuration request for Bus # 132 (Bus # 4 + offset # 128).

これにより、ＣＰＵ１０２Ｃのロード命令、ストア命令に起因して発行されたパケットは、Ｂｕｓ＃１３２を目指してルーティングするので、途中でＢｕｓ＃１２９を通る経路でルーティングされることになる。すなわち、ＳＭＰサーバ１１０とＩ／Ｏスイッチ１１０の間では、Ｉ／Ｏインタフェース１１５Ｃ（Ｂｕｓ＃１２９）を経由することになり、内部インターコネクト１０９、及び、Ｉ／Ｏインタフェース１１５Ａ（Ｂｕｓ＃１）の混雑を回避することができる。 As a result, the packet issued due to the load instruction and the store instruction of the CPU 102C is routed toward the Bus # 132, so that it is routed along the path passing through the Bus # 129. That is, the SMP server 110 and the I / O switch 110 are routed via the I / O interface 115C (Bus # 129), and the internal interconnect 109 and the I / O interface 115A (Bus # 1) are congested. Can be avoided.

次に、Ｉ／Ｏスイッチ１１０の内部では、本来Ｂｕｓ＃４に向かわなければならない当該パケット（Ｉ／Ｏスイッチ１１０に到達した段階ではＢｕｓ＃１３２宛となっている）の宛先をＢｕｓ＃４に戻す。そのために、オフセット減算部１１３Ｃで、Ｂｕｓ＃４（Ｂｕｓ＃１３２−オフセット＃１２８）に戻す処理を行う。その後、Ｂｕｓ＃３〜Ｂｕｓ＃６を配下に持つＢｕｓ＃２のブリッジにパケットをルーティングすることで、Ｂｕｓ＃４のＩ／Ｏデバイス１２０Ｂに対してパケットを到達させることができる。このようにして、図１２、及び、図１３に示すようなルーティングが実現できる。 Next, inside the I / O switch 110, the destination of the packet (addressed to Bus # 132 when it reaches the I / O switch 110) that should originally go to Bus # 4 is set to Bus # 4. return. For this purpose, the offset subtraction unit 113C performs processing to return to Bus # 4 (Bus # 132−offset # 128). After that, by routing the packet to the Bus # 2 bridge having Bus # 3 to Bus # 6 under control, the packet can reach the I / O device 120B of Bus # 4. In this way, routing as shown in FIGS. 12 and 13 can be realized.

なお、ＭＭＩＯリクエストの場合には、Ｂｕｓ＃をアドレスに読み替えることで同じ処理が実現できる。また、前記したのはいずれもサーバ発Ｉ／Ｏデバイスのリクエストを行うパケットであったが、リクエストに対するリプライのパケットをＩ／Ｏデバイスからサーバに、リクエストの経路を逆に辿って返す必要がある。 In the case of an MMIO request, the same processing can be realized by replacing Bus # with an address. In addition, all of the above-mentioned packets are requests for a server-originated I / O device, but it is necessary to return a reply packet to the request from the I / O device to the server by tracing back the request path. .

ＰＣＩＥｘｐｒｅｓｓでは、リクエストを行うパケットには、いずれもリクエストを発行したコンポーネントを識別するためのＲＩＤ（ＲｅｑｕｅｓｔｅｒＩＤ）が付与されている。図１０、及び、図１２に示すようなバスツリーでは、リクエストを発行するのはバスツリーの頂点であるが、ハードウェアの動作としては図１で示すＩＯＨ１０５Ａ〜Ｄでパケットが生成される。すなわち、ＲＩＤはリクエストを行うパケットを生成したＩＯＨ１０５Ａ〜Ｄを持つはずである。そして、Ｉ／Ｏデバイス１２０Ａ〜Ｄはリクエストを受け取ると、それに応答するためのリプライのパケットを生成するが、リプライのパケットの宛先は、リクエストのパケットのＲＩＤを用いる。そのため、リクエストに対するリプライのパケットは、Ｉ／Ｏデバイスからサーバに、リクエストの経路を逆に辿って返すことが出来る。 In PCI Express, RID (Requester ID) for identifying the component that issued the request is assigned to each requesting packet. In the bus tree as shown in FIG. 10 and FIG. 12, the request is issued at the top of the bus tree, but as hardware operation, packets are generated by the IOHs 105A to 105D shown in FIG. That is, the RID should have the IOHs 105A-D that generated the requesting packet. When receiving the request, the I / O devices 120A to 120D generate a reply packet for responding to the request, and the reply packet destination uses the RID of the request packet. Therefore, the reply packet for the request can be returned from the I / O device to the server by tracing the request path in reverse.

図１２及び図１３で示す動作を行うためには、ＣＰＵ１０２Ａ〜Ｄがロード命令、ないしは、ストア命令を発行する時に、ＭＭＩＯリクエストであればパケットの宛先アドレスを、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストであればパケットの宛先Ｂｕｓ＃を、所定のオフセット分ずらす必要がある。そのための動作を説明する。 In order to perform the operations shown in FIGS. 12 and 13, when the CPUs 102A to 102D issue a load instruction or a store instruction, the destination address of the packet is sent if it is an MMIO request, and the destination bus of the packet is sent if it is a Configuration request It is necessary to shift # by a predetermined offset. The operation for this will be described.

先に説明した様に、ＳＭＰサーバ１００は図１４で示した単一の主記憶空間を有する。ＭＭＩＯリクエスト、及び、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストは、ＣＰＵ１０２Ａ〜Ｄが、図１４の主記憶空間の中で「ＭＭＩＯ空間、等」とされている空間にロード命令、ないしは、ストア命令を発行したときに発生する。 As described above, the SMP server 100 has the single main storage space shown in FIG. The MMIO request and the Configuration request are generated when the CPUs 102A to 102D issue a load instruction or a store instruction to a space designated as “MMIO space, etc.” in the main storage space of FIG.

図６は前述した「ＭＭＩＯ空間、等」をさらに詳細に示したメモリマップである。この中には、ブレード＃０ＭＭＩＯ空間、ブレード＃１ＭＭＩＯ空間、ブレード＃２ＭＭＩＯ空間、ブレード＃３ＭＭＩＯ空間、及び、ＰＣＩＥｘｐｒｅｓｓＥｎｈａｎｃｅｄＣｏｎｆｉｇｕｒａｔｉｏｎＡｃｃｅｓｓＭｅｃｈａｎｉｓｍ空間（ＥＣＡＭ空間）がある。 FIG. 6 is a memory map showing the above-mentioned “MMIO space, etc.” in more detail. Among them, there are a blade # 0 MMIO space, a blade # 1 MMIO space, a blade # 2 MMIO space, a blade # 3 MMIO space, and a PCI Express Enhanced Configuration Mechanism space (ECAM space).

ＭＭＩＯリクエストは、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストと同様にバスツリー状でルーティングされる。図９で示すブリッジは、自分の配下にあるＢｕｓ＃を把握しており、それに基づいてＣｏｎｆｉｇｕｒａｔｉｏｎリクエストのルーティングが行われる。それに加えてブリッジは自分の配下にあるＭＭＩＯアドレスも把握しており、それに基づいてＣｏｎｆｉｇｕｒａｔｉｏｎリクエストのルーティングが行われる。 The MMIO request is routed in the form of a bus tree in the same manner as the configuration request. The bridge shown in FIG. 9 knows the Bus # under its control, and the configuration request is routed based on the Bus #. In addition, the bridge also knows the MMIO address under its control, and the configuration request is routed based on the MMIO address.

ここで、図６のブレード＃０ＭＭＩＯ空間は、ＩＯＨ１０５Ａの配下にあるＭＭＩＯアドレス空間であり、ブレード＃０ＭＭＩＯ空間宛のＭＭＩＯリクエストパケットはＩ／Ｏインタフェース１１５Ａから出力される。同様に、ブレード＃１ＭＭＩＯ空間はＩＯＨ１０５Ｂ、及び、Ｉ／Ｏインタフェース１１５Ｂに、ブレード＃２ＭＭＩＯ空間はＩＯＨ１０５Ｃ、及び、Ｉ／Ｏインタフェース１１５Ｃに、ブレード＃３ＭＭＩO空間はＩＯＨ１０５Ｄ、及び、Ｉ／Ｏインタフェース１１５Ｄに、それぞれ対応している。 Here, the blade # 0 MMIO space in FIG. 6 is an MMIO address space under the IOH 105A, and an MMIO request packet addressed to the blade # 0 MMIO space is output from the I / O interface 115A. Similarly, blade # 1 MMIO space is IOH 105B and I / O interface 115B, blade # 2 MMIO space is IOH 105C and I / O interface 115C, and blade # 3 MMIO space is IOH 105D and I / O interface 115D. , Respectively.

ここで、Ｉ／Ｏデバイス１２０Ａ〜Ｄが有するＭＭＲは、全てブレード＃０ＭＭＩＯ空間にマッピングするものとする。そのため、ＣＰＵ１０２Ａ〜Ｄが発行するＭＭＩＯリクエストは、全てブレード＃０ＭＭＩＯ空間のアドレスを宛先としたものになる。 Here, all the MMRs included in the I / O devices 120A to 120D are mapped to the blade # 0 MMIO space. Therefore, all MMIO requests issued by the CPUs 102A to 102D are addressed to the address of the blade # 0 MMIO space.

しかし、そのままでは、全てのＭＭＩＯリクエストがＩ／Ｏインタフェース１１５Ａに集中してしまうことになるので、アドレス変換を行うことで、ＣＰＵ１０２Ａ〜Ｄ毎に使用するＩ／Ｏインタフェース１１５Ａ〜Ｄを分散させる。 However, as it is, all MMIO requests are concentrated on the I / O interface 115A. Therefore, by performing address conversion, the I / O interfaces 115A to 115D used for the CPUs 102A to 102D are distributed.

ＣＰＵ１０２Ａがブレード＃０ＭＭＩＯ空間へのロード命令、ないしは、ストア命令を発行したときには、アドレス変換において、オフセット＋０を加える（すなわち、アドレス変換しない）。これにより、ＣＰＵ１０２Ａ由来のＭＭＩＯリクエストはＩ／Ｏインタフェース１１５Ａから出力される。 When the CPU 102A issues a load instruction to the blade # 0 MMIO space or a store instruction, an offset +0 is added in the address conversion (that is, the address is not converted). Thereby, the MMIO request derived from the CPU 102A is output from the I / O interface 115A.

ＣＰＵ１０２Ｂがブレード＃０ＭＭＩＯ空間へのロード命令、ないしは、ストア命令を発行したときには、アドレス変換において、オフセット＋αを加える（オフセット＋αは図６に示す）。これにより、ＣＰＵ１０２Ｂ由来のＭＭＩＯリクエストはブレード＃１ＭＭＩＯ空間に対するものとなり、Ｉ／Ｏインタフェース１１５Ｂから出力される。しかし、本来の宛先はオフセットを加える前のアドレスであるため、Ｉ／Ｏスイッチ１１０に到着後はオフセット演算部１１３Ｂでオフセット分を減じる（宛先アドレスからαを減じる）。 When the CPU 102B issues a load instruction to the blade # 0 MMIO space or a store instruction, an offset + α is added in the address conversion (the offset + α is shown in FIG. 6). Thus, the MMIO request derived from the CPU 102B is for the blade # 1 MMIO space and is output from the I / O interface 115B. However, since the original destination is the address before the offset is added, after the arrival at the I / O switch 110, the offset calculation unit 113B subtracts the offset (subtracts α from the destination address).

ＣＰＵ１０２Ｃがブレード＃０ＭＭＩＯ空間へのロード命令、ないしは、ストア命令を発行したときには、アドレス変換において、オフセット＋βを加える（オフセット＋βは図６に示す）。これにより、ＣＰＵ１０２Ｃ由来のＭＭＩＯリクエストはブレード＃２ＭＭＩＯ空間に対するものとなり、Ｉ／Ｏインタフェース１１５Ｃから出力される。しかし、本来の宛先はオフセットを加える前のアドレスであるため、Ｉ／Ｏスイッチ１１０に到着後はオフセット演算部１１３Ｃでオフセット分を減じる（宛先アドレスからβを減じる）。 When the CPU 102C issues a load instruction to the blade # 0 MMIO space or a store instruction, an offset + β is added in the address conversion (the offset + β is shown in FIG. 6). As a result, the MMIO request derived from the CPU 102C is for the blade # 2 MMIO space and is output from the I / O interface 115C. However, since the original destination is the address before the offset is added, after the arrival at the I / O switch 110, the offset calculation unit 113C subtracts the offset (subtracts β from the destination address).

ＣＰＵ１０２Ｄがブレード＃０ＭＭＩＯ空間へのロード命令、ないしは、ストア命令を発行したときには、アドレス変換において、オフセット＋γを加える（オフセット＋γは図６に示す）。これにより、ＣＰＵ１０２Ｄ由来のＭＭＩＯリクエストはブレード＃３ＭＭＩＯ空間に対するものとなり、Ｉ／Ｏインタフェース１１５Ｄから出力される。しかし、本来の宛先はオフセットを加える前のアドレスであるため、Ｉ／Ｏスイッチ１１０に到着後はオフセット演算部１１３Ｄでオフセット分を減じる（宛先アドレスからβを減じる）。 When the CPU 102D issues a load instruction to the blade # 0 MMIO space or a store instruction, an offset + γ is added in the address conversion (the offset + γ is shown in FIG. 6). As a result, the MMIO request derived from the CPU 102D is for the blade # 3 MMIO space and is output from the I / O interface 115D. However, since the original destination is the address before the offset is added, the offset is reduced by the offset calculation unit 113D after arrival at the I / O switch 110 (β is subtracted from the destination address).

以上のような動作において、ＭＭＩＯリクエストを、当該リクエストを発行したＣＰＵ１０２Ａ〜Ｄに応じて、Ｉ／Ｏインタフェース１１５Ａ〜Ｄに分散して出力させることで、負荷分散を実現することができるようになった。 In the operation as described above, load distribution can be realized by distributing MMIO requests to the I / O interfaces 115A to 115D according to the CPUs 102A to 102D that issued the requests. It was.

次に、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストの場合を説明する。Ｃｏｎｆｉｇｕｒａｉｔｏｎリクエストの宛先は、既に説明したように、Ｂｕｓ＃、Ｄｅｖ＃、Ｆｕｎ＃、Ｃｏｎｆｉｇｕｒａｔｉｏｎ空間アドレスの組合せで指定される。 Next, the case of a configuration request will be described. As described above, the destination of the Configuration request is specified by a combination of Bus #, Dev #, Fun #, and Configuration space address.

Ｃｏｎｇｉｇｕｒａｔｉｏｎリクエストは、図６のＥＣＡＭ空間に対してＣＰＵ１０２Ａ〜Ｄがロード命令、ないしは、ストア命令を発行することでパケットが発生する。ＥＣＡＭ空間はＣｏｎｆｉｇｕｒａｔｉｏｎ空間（１２ｂｉｔ空間＝４ＫＢ空間）に相当する領域を、Ｂｕｓ＃、Ｄｅｖ＃、Ｆｕｎ＃の個数分組合せたものである。すなわち、４ＫＢ×Ｆｕｎ＃個数（８個／Ｄｅｖ＃）×Ｄｅｖ＃個数（３２個／Ｂｕｓ＃）×Ｂｕｓ＃個数（２５６個／システム）の合計２５６ＭＢの空間である。 A packet is generated when the CPU 102A to 102D issues a load instruction or a store instruction to the ECAM space of FIG. The ECAM space is a combination of areas corresponding to the configuration space (12-bit space = 4 KB space) by the number of Bus #, Dev #, and Fun #. That is, a space of 256 MB in total of 4 KB × Fun # number (8 / Dev #) × Dev # number (32 / Bus #) × Bus # number (256 / system).

図７にＥＣＡＭ空間の詳細を示す。例えば、Ｂｕｓ＃３のＩ／Ｏデバイス１２０Ａに対するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストは、先頭アドレスｙを起点として、＋１００００Ｈ〜＋１ＦＦＦＦＨまでの１ＭＢの空間である（４ＫＢ×Ｆｕｎ＃個数×Ｄｅｖ＃個数＝１ＭＢ）。この１ＭＢ空間内でのアドレスは、Ｉ／Ｏデバイス１２０Ａが有するどのリソース（レジスタ）にアクセスしたいかによって異なる。 FIG. 7 shows details of the ECAM space. For example, the configuration request for the Bus # 3 I / O device 120A is a 1 MB space from the start address y to + 10000H to + 1FFFFH (4 KB × Fun # number × Dev # number = 1 MB). The address in the 1 MB space differs depending on which resource (register) the I / O device 120A has to access.

前述した、ＣＰＵ１０２Ｃ発のＢｕｓ＃４に対するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストをＢｕｓ＃１３２（Ｂｕｓ＃４＋オフセット＃１２８）に対するＣｏｎｆｉｇｕｒａｔｉｏｎリクエストに変換するのは、ＣＰＵ１０２ＣがＢｕｓ＃０〜Ｂｕｓ＃６３用空間にロード命令、ないしは、ストア命令を発行したときに、アドレスにオフセット＋８００００００Ｈを加えるアドレス変換を行うことで実現できる。 The CPU 102C converts the configuration request for the Bus # 4 from the CPU 102C into a Configuration request for the Bus # 132 (Bus # 4 + offset # 128). The CPU 102C loads the Bus # 0 to Bus # 63 space, This can be realized by performing address conversion by adding an offset + 8000000H to the address when a store instruction is issued.

以上のように、ＭＭＩＯリクエストの宛先アドレスにオフセットを加える、ないしは、Ｃｏｎｆｉｇｕｒａｔｉｏｎリクエストの宛先Ｂｕｓ＃にオフセットを加えることで、Ｉ／Ｏインタフェース１１５Ａ〜Ｄを分散して利用することは、ＣＰＵ１０２Ａ〜Ｄ上でロード命令、ないしは、ストア命令を発行するときのアドレス変換で実現できることを示した。 As described above, using the I / O interfaces 115A to 115D in a distributed manner by adding an offset to the destination address of the MMIO request or adding an offset to the destination Bus # of the Configuration request is performed on the CPUs 102A to D. This shows that it can be realized by address conversion when issuing a load instruction or a store instruction.

アドレス変換を実現する手段としては、ＣＰＵ１０２Ａ〜Ｄが有する種々の機構、ないしは、ハイパーバイザ等のシステムソフトウェアを利用することが出来る。一般的にＣＰＵでは、仮想アドレスから物理アドレスへのアドレス変換を行うためのＰＴ（ＰａｇｅＴａｂｌｅ）を提供しているが、これは一般的にはオペレーティングシステムが使うものであり、ＰＴで上記のアドレス変換を実現するためには、オペレーティングシステムに手を加える必要が出てくる。 As means for realizing the address translation, various mechanisms included in the CPUs 102A to 102D or system software such as a hypervisor can be used. In general, a CPU provides a PT (Page Table) for performing address conversion from a virtual address to a physical address. This is generally used by an operating system. In order to realize the conversion, it is necessary to modify the operating system.

その他の手段としては、ハイパーバイザでＳＰＴ（ＳｈａｄｏｗＰａｇｅＴａｂｌｅ）を用意して、オペレーティングシステムに手を加えないで、ハイパーバイザのレイヤーで上記のアドレス変換を実現することがある。また、一部のＣＰＵでＰＴに加えて二段目のアドレス変換機構として用意されているＥＰＴ（ＥｘｔｅｎｄｅｄＰａｇｅＴａｂｌｅ）を利用して、上記のアドレス変換を実現しても良い。 As another means, there is a case where an SPT (Shadow Page Table) is prepared in the hypervisor, and the above address conversion is realized in the hypervisor layer without changing the operating system. In addition to the PT, some CPUs may implement the above address conversion using an EPT (Extended Page Table) prepared as a second-stage address conversion mechanism.

次に、Ｉ／Ｏデバイス発サーバ宛のリクエストを行うパケットに関して述べる。上記は、いずれもサーバ発Ｉ／Ｏデバイス宛のリクエストと、それに対するリプライに関して述べてきた。以下では、その逆であるＩ／Ｏデバイス発サーバ宛のリクエストと、それに対するリプライに関して述べる。 Next, a packet for making a request addressed to the I / O device originating server will be described. All of the above has been described with respect to a request from a server to an I / O device and a reply to the request. In the following, a request directed to an I / O device originating server and a reply to the request will be described.

Ｉ／Ｏデバイス１２０Ａ〜Ｄは、図１４に示すＳＭＰサーバ１００の主記憶空間全体を読書きすることができる。しかし、本実施例で示す宛先ポート検索部１１４Ａ〜Ｄを用いないと、Ｉ／Ｏデバイス１２０Ａ〜Ｄが、ブレード＃０メモリ（メモリ１０３Ａ）、ブレード＃１メモリ（メモリ１０３Ｂ）、ブレード＃２メモリ（メモリ１０３Ｃ）、及び、ブレード＃３メモリ（メモリ１０３Ｄ）の何れに対するアクセスでも、全てＩＯＨ１０５Ａ、及び、Ｉ／Ｏインタフェース１１５Ａ経由になってしまう。これは、Ｉ／Ｏインタフェース１１５Ａ、及び、内部インターコネクト１０９の混雑を招く。 The I / O devices 120A to 120D can read and write the entire main storage space of the SMP server 100 shown in FIG. However, if the destination port search units 114A to 114D shown in the present embodiment are not used, the I / O devices 120A to 120D have the blade # 0 memory (memory 103A), the blade # 1 memory (memory 103B), and the blade # 2 memory. (Memory 103C) and blade # 3 memory (memory 103D) are all accessed via IOH 105A and I / O interface 115A. This causes congestion of the I / O interface 115A and the internal interconnect 109.

そこで、本実施例では、宛先ポートテーブル検索部１１４Ａ〜Ｄにおいて、Ｉ／Ｏデイバス１２０Ａ〜Ｄが発するリクエストのパケットの宛先アドレスから、サーバ側ポート番号を参照し、出力すべきＩ／Ｏインタフェース１１５Ａ〜Ｄを選択する。これにより、宛先ポートテーブル１５１が適切に設定されていれば、図１５に示すようにＩ／Ｏデバイス１２０Ａ〜Ｄが発するリクエストを、宛先のメモリ１０３Ａ〜Ｄまで最短の経路で伝送できることになるので、メモリアクセスのレイテンシ低減に寄与すると共に、Ｉ／Ｏインタフェース１１５Ａ〜Ｄ、及び、内部インターコネクト１９０の混雑緩和を実現できる。 Therefore, in this embodiment, the destination port table search units 114A to 114D refer to the server side port number from the destination address of the packet of the request issued by the I / O day bus 120A to D, and output the I / O interface 115A. Select ~ D. As a result, if the destination port table 151 is set appropriately, the requests issued by the I / O devices 120A to 120D can be transmitted to the destination memories 103A to 103D through the shortest path as shown in FIG. Thus, it is possible to reduce the latency of memory access and to reduce congestion of the I / O interfaces 115A to 115D and the internal interconnect 190.

次に図８に、図１に示す本実施例に基づく計算機システムを利用して、所望の効果を得るために必要な初期化のシーケンスを示す。 Next, FIG. 8 shows an initialization sequence necessary for obtaining a desired effect using the computer system based on the present embodiment shown in FIG.

ステップＳ８０１で、図１に示す計算機システムの中で、管理サーバ１３０を起動させる。図１の計算機システムは全体の動作を管理サーバ１３０が管理する。 In step S801, the management server 130 is activated in the computer system shown in FIG. The management server 130 manages the overall operation of the computer system of FIG.

ステップＳ８０２で、管理サーバ１３０は、管理ネットワーク１３１経由でＩ／Ｏスイッチ１１０に起動の指令を発行し、Ｉ／Ｏスイッチ１１０を起動させる。Ｉ／Ｏスイッチ１１０は起動すると、内部のＩ／Ｏコントローラ１１２によって、スイッチ交換部１１１の初期設定が行われる。Ｉ／Ｏコントローラ１１２は、オフセット減算部１１３Ａは、オフセット値を減算しないように初期化（ＭＭＩＯアドレスオフセットテーブル４１０、及び、ＩＤオフセットテーブル４２０の全てのエントリを０にする）される。また、同様にＩ／Ｏコントローラ１１２は、宛先ポートテーブル１５１の全てのエントリを無効にするため、全ての有効ビットを無効（０）にする。 In step S <b> 802, the management server 130 issues an activation command to the I / O switch 110 via the management network 131 to activate the I / O switch 110. When the I / O switch 110 is activated, the internal switching unit 111 is initially set by the internal I / O controller 112. The I / O controller 112 is initialized so that the offset subtraction unit 113A does not subtract the offset value (all entries in the MMIO address offset table 410 and the ID offset table 420 are set to 0). Similarly, the I / O controller 112 invalidates all valid bits (0) in order to invalidate all entries in the destination port table 151.

ステップＳ８０３で、管理サーバ１３０は、管理ネットワーク１３１経由でＩ／Ｏスイッチ１１０に構成情報収集の指令を発行し、Ｉ／Ｏスイッチ１１０から構成情報を受信する。構成情報収集の指令を受けたＩ／Ｏスイッチ１１０は、Ｉ／Ｏコントローラ１１２が、Ｉ／Ｏインタフェース１８０経由で、Ｉ／Ｏスイッチ１１０に接続されているＩ／Ｏデバイス１２０Ａ〜Ｄにアクセスし、Ｉ／Ｏデバイスの存在を検出する。そして、Ｉ／Ｏデバイス１２０Ａ〜Ｄが接続されていることを、管理サーバ１３０に構成情報として送信する。 In step S <b> 803, the management server 130 issues a configuration information collection command to the I / O switch 110 via the management network 131 and receives configuration information from the I / O switch 110. When the I / O switch 110 receives the configuration information collection command, the I / O controller 112 accesses the I / O devices 120A to 120D connected to the I / O switch 110 via the I / O interface 180. , Detecting the presence of an I / O device. Then, the fact that the I / O devices 120A to 120D are connected is transmitted to the management server 130 as configuration information.

ステップＳ８０４で、管理サーバ１３０は、Ｉ／Ｏスイッチ１１０から得た構成情報を計算機システムの管理者に提示し、Ｉ／ＯデバイスとＳＭＰサーバ１００の接続に関して指示を受ける。以降の説明では、Ｉ／Ｏデバイス１２０Ａ〜Ｄを全てＳＭＰサーバ１００に接続するという設定が管理者によってなされたものとして説明する。 In step S804, the management server 130 presents the configuration information obtained from the I / O switch 110 to the administrator of the computer system, and receives an instruction regarding the connection between the I / O device and the SMP server 100. In the following description, it is assumed that the setting for connecting all the I / O devices 120A to 120D to the SMP server 100 is made by the administrator.

ステップＳ８０５では、オフセット情報１５０（ＭＭＩＯアドレスオフセットテーブル４１０、及び、ＩＤオフセットテーブル４２０）に設定すべきオフセット値を算出する。本実施例で示す計算機システムでは、オフセット値は図６及び図７に示すようになる。 In step S805, an offset value to be set in the offset information 150 (MMIO address offset table 410 and ID offset table 420) is calculated. In the computer system shown in this embodiment, the offset values are as shown in FIGS.

すなわち、ＭＭＩＯアドレスオフセットテーブル４１０のサーバ側ポート番号０エントリ（Ｉ／Ｏインタフェース１１５Ａに対応し、オフセット減算部１１３Ａが使用）は０、サーバ側ポート番号１エントリ（Ｉ／Ｏインタフェース１１５Ｂに対応し、オフセット減算部１１３Ｂが使用）はα、サーバ側ポート番号２エントリ（Ｉ／Ｏインタフェース１１５Ｂに対応し、オフセット減算部１１３Ｂが使用）はβ、サーバ側ポート番号３エントリ（Ｉ／Ｏインタフェース１１５Ｃに対応し、オフセット減算部１１３Ｃが使用）はγ、となる。 That is, the server side port number 0 entry (corresponding to the I / O interface 115A and used by the offset subtraction unit 113A) of the MMIO address offset table 410 is 0, and the server side port number 1 entry (corresponding to the I / O interface 115B) Offset subtracting unit 113B) is α, server side port number 2 entry (corresponding to I / O interface 115B, offset subtracting unit 113B is used) β, server side port number 3 entry (corresponding to I / O interface 115C) The offset subtraction unit 113C uses γ.

また、ＩＤオフセットテーブル４２０のサーバ側ポート番号０エントリ（Ｉ／Ｏインタフェース１１５Ａに対応し、オフセット減算部１１３Ａが使用）は０、サーバ側ポート番号１エントリ（Ｉ／Ｏインタフェース１１５Ｂに対応し、オフセット減算部１１３Ｂが使用）は６４、サーバ側ポート番号２エントリ（Ｉ／Ｏインタフェース１１５Ｂに対応し、オフセット減算部１１３Ｂが使用）は１２８、サーバ側ポート番号３エントリ（Ｉ／Ｏインタフェース１１５Ｃに対応し、オフセット減算部１１３Ｃが使用）は２５６、となる。なお、これらのオフセット値は、後で説明するように、ブレード＃０〜＃３各々のＥＰＴに、対応して設定される必要がある。 Further, the server side port number 0 entry (corresponding to the I / O interface 115A and used by the offset subtracting unit 113A) of the ID offset table 420 is 0, and the server side port number 1 entry (corresponding to the I / O interface 115B is offset. 64 is used for the server side port number 2 (corresponding to the I / O interface 115B, and used by the offset subtractor 113B) is 128, and 3 entries for the server side port number (corresponding to the I / O interface 115C). , Used by the offset subtracting unit 113C) is 256. These offset values need to be set corresponding to the EPTs of the blades # 0 to # 3, as will be described later.

ステップＳ８０６では、宛先ポートテーブル１５１に設定すべきベースアドレス、サイズ、及び、宛先ポート番号を生成する。本実施例で示す計算機システムでは、ベースアドレス、及び、サイズは図１４に示すようになる。 In step S806, the base address, size, and destination port number to be set in the destination port table 151 are generated. In the computer system shown in this embodiment, the base address and size are as shown in FIG.

すなわち、宛先ポートテーブル１５１のエントリ０番はベースアドレスをＤ０、サイズをＲ０、宛先ポート番号を０（Ｉ／Ｏインタフェース１１５Ａを示す番号）とし、有効ビットを有効（１）とする。エントリ１番はベースアドレスをＤ１、サイズをＲ１、宛先ポート番号を１（Ｉ／Ｏインタフェース１１５Ｂを示す番号）とし、有効ビットを有効（１）とする。エントリ２番はベースアドレスをＤ２、サイズをＲ２、宛先ポート番号を２（Ｉ／Ｏインタフェース１１５Ｃを示す番号）とし、有効ビットを有効（１）とする。エントリ３番はベースアドレスをＤ３、サイズをＲ３、宛先ポート番号を３（Ｉ／Ｏインタフェース１１５Ｄを示す番号）とし、有効ビットを有効（１）とする。エントリ４〜５番は使わないので、ステップＳ８０２で無効にされたままにしておく。 That is, entry 0 in the destination port table 151 has a base address of D0, a size of R0, a destination port number of 0 (a number indicating the I / O interface 115A), and a valid bit of valid (1). For entry No. 1, the base address is D1, the size is R1, the destination port number is 1 (number indicating the I / O interface 115B), and the valid bit is valid (1). For entry No. 2, the base address is D2, the size is R2, the destination port number is 2 (number indicating the I / O interface 115C), and the valid bit is valid (1). For entry 3, the base address is D3, the size is R3, the destination port number is 3 (number indicating the I / O interface 115D), and the valid bit is valid (1). Since entries 4-5 are not used, they are left invalidated in step S802.

ステップＳ８０７では、ステップＳ８０５で算出したオフセット値を、管理ネットワーク１３１を経由して、Ｉ／Ｏコントローラ１１２上のオフセット情報１５０に書込む。また、ステップＳ８０６で生成した宛先ポートテーブルを、管理ネットワーク１３１を経由して、Ｉ／Ｏコントローラ１１２上の宛先ポートテーブル１５１に書込む。 In step S807, the offset value calculated in step S805 is written into the offset information 150 on the I / O controller 112 via the management network 131. Also, the destination port table generated in step S806 is written into the destination port table 151 on the I / O controller 112 via the management network 131.

ステップＳ８０８では、管理サーバ１３０は管理ネットワーク１３１を経由してＳＭＰサーバ１００を起動する指示を発行する。本実施例では、ＳＭＰサーバ１００はブレード１０１Ａ〜Ｄで構成されているため、ブレード１０１Ａ〜Ｄを起動させることになる。 In step S808, the management server 130 issues an instruction to activate the SMP server 100 via the management network 131. In this embodiment, since the SMP server 100 is composed of blades 101A to 101D, the blades 101A to 101D are activated.

ブレード１０１Ａ〜Ｄは、その起動の過程において、各種システムソフトウェア（ＢＩＯＳ、ＥＦＩ、ハイパーバイザ、オペレーティングシステム、等）が、ＩＯＨ１０５Ａ〜Ｄや、図９に示すバスツリー状のブリッジを設定することで、図９に示すようなバスツリーを形成する。 In the process of starting up the blades 101A-D, various system software (BIOS, EFI, hypervisor, operating system, etc.) set the IOHs 105A-D and the bus-tree bridge shown in FIG. A bus tree as shown in FIG. 9 is formed.

ステップＳ８０９では、管理サーバ１３０から管理ネットワーク１３１を経由して、ＳＭＰサーバ１００上で動作しているハイパーバイザにアドレス変換で用いるオフセット値を通知する。本実施例の場合には、アドレス変換手段としてＥＰＴを使用すると考えると、以下に示すようなオフセット値をハイパーバイザに通知し、ハイパーバイザは各ブレード１０１Ａ〜ＤのＥＰＴに対応する値を設定し、ＥＰＴを有効化する。なお、上述の通り、各ブレード１０１Ａ〜ＤのＥＰＴは通常通りの使用がなされる。 In step S809, the management server 130 notifies the hypervisor operating on the SMP server 100 of the offset value used for address conversion via the management network 131. In the case of this embodiment, assuming that EPT is used as the address conversion means, the following offset value is notified to the hypervisor, and the hypervisor sets a value corresponding to the EPT of each blade 101A-D. Enable EPT. As described above, the EPT of each blade 101A-D is used as usual.

ＣＰＵ１０２Ａ上で動作するハイパーバイザのスレッドが使うＥＰＴは、図６のブレード＃０ＭＭＩＯ空間（アドレスｘ〜（ｘ＋ｓ））に対するアクセスが発生したとき、オフセット値＋０を加算する（あるいは、オフセット値を加算しない）ようにＥＰＴを設定する。また、図７のＢｕｓ＃０〜６３用空間に対するアクセスが発生した時、オフセット値＋０を加算する（あるいは、オフセット値を加算しない）ようにＥＰＴを設定する。 The EPT used by the hypervisor thread operating on the CPU 102A adds the offset value +0 (or does not add the offset value) when an access to the blade # 0 MMIO space (addresses x to (x + s)) in FIG. 6 occurs. EPT is set as follows. Further, when an access to the space for Bus # 0 to 63 in FIG. 7 occurs, the EPT is set so that the offset value +0 is added (or the offset value is not added).

ＣＰＵ１０２Ｂ上で動作するハイパーバイザのスレッドが使うＥＰＴは、図６のブレード＃０ＭＭＩＯ空間（アドレスｘ〜（ｘ＋ｓ））に対するアクセスが発生したとき、オフセット値＋αを加算するようにＥＰＴを設定する。また、図７のＢｕｓ＃０〜６３用空間に対するアクセスが発生した時、オフセット値＋４００００００Ｈを加算するようにＥＰＴを設定する。 The EPT used by the thread of the hypervisor operating on the CPU 102B is set so that the offset value + α is added when an access to the blade # 0 MMIO space (address x to (x + s)) in FIG. 6 occurs. Further, when an access to the space for Bus # 0 to 63 in FIG. 7 occurs, the EPT is set so that the offset value + 4000000H is added.

ＣＰＵ１０２Ｃ上で動作するハイパーバイザのスレッドが使うＥＰＴは、図６のブレード＃０ＭＭＩＯ空間（アドレスｘ〜（ｘ＋ｓ））に対するアクセスが発生したとき、オフセット値＋βを加算するようにＥＰＴを設定する。また、図７のＢｕｓ＃０〜６３用空間に対するアクセスが発生した時、オフセット値＋８００００００Ｈを加算するようにＥＰＴを設定する。 The EPT used by the thread of the hypervisor operating on the CPU 102C is set so that the offset value + β is added when an access to the blade # 0 MMIO space (address x to (x + s)) in FIG. 6 occurs. Further, when an access to the space for Bus # 0 to 63 in FIG. 7 occurs, the EPT is set so that the offset value + 8000000H is added.

ＣＰＵ１０２Ｄ上で動作するハイパーバイザのスレッドが使うＥＰＴは、図６のブレード＃０ＭＭＩＯ空間（アドレスｘ〜（ｘ＋ｓ））に対するアクセスが発生したとき、オフセット値＋γを加算するようにＥＰＴを設定する。また、図７のＢｕｓ＃０〜６３用空間に対するアクセスが発生した時、オフセット値＋Ｃ００００００Ｈを加算するようにＥＰＴを設定する。 The EPT used by the hypervisor thread operating on the CPU 102D is set to add an offset value + γ when an access to the blade # 0 MMIO space (addresses x to (x + s)) in FIG. 6 occurs. Further, when an access to the space for Bus # 0 to 63 in FIG. 7 occurs, the EPT is set so that the offset value + C000000H is added.

本実施例によれば、ＳＭＰサーバと複数のＩ／Ｏデバイスとの間のデータ転送を、ＳＭＰサーバが有する複数のＩ／Ｏインタフェースに分散させることで、Ｉ／Ｏインタフェースとサーバ内部インターコネクトの混雑を緩和することができる。 According to the present embodiment, the data transfer between the SMP server and the plurality of I / O devices is distributed to the plurality of I / O interfaces of the SMP server, thereby congesting the I / O interface and the server internal interconnect. Can be relaxed.

以上、本発明を実施例に基づき具体的に説明したが、本発明は上述した実施例の形態に限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能であることは言うまでもない。上記した実施例は本発明のより良い理解のために詳細に説明したのであり、必ずしも説明の全ての構成を備えるものに限定されものではない。 Although the present invention has been specifically described above based on the embodiments, it is needless to say that the present invention is not limited to the above-described embodiments and can be variously modified without departing from the gist thereof. The above-described embodiments have been described in detail for better understanding of the present invention, and are not necessarily limited to those having all the configurations described above.

更に、上述した各構成、機能、処理部等は、それらの一部又は全部を、集積回路で設計する等によりハードウェアで実現しても良いし、それらの一部又は全部を実現するプログラムを作成することによりソフトウェアで実現しても良いことは言うまでもない。 Furthermore, each of the above-described configurations, functions, processing units, and the like may be realized by hardware by designing a part or all of them with an integrated circuit, or a program that realizes part or all of them. Needless to say, it can be realized by software.

本発明は、複数のＩ／Ｏインタフェースを有するＳＭＰサーバ等の計算機において、Ｉ／Ｏインタフェース、及び、計算機内部で用いられている内部インターコネクトの混雑を緩和し、計算機全体の性能を向上させる効果を有する。 The present invention has the effect of reducing the congestion of the I / O interface and the internal interconnect used inside the computer in a computer such as an SMP server having a plurality of I / O interfaces, and improving the performance of the entire computer. Have.

１００ＳＭＰサーバ
１０１Ａ〜Ｄブレード
１０２Ａ〜ＤＣＰＵ
１０３Ａ〜Ｄメモリ
１０４Ａ〜Ｄ、１０９内部インターコネクト
１０５Ａ〜ＤＩＯＨ（Ｉ／Ｏハブ）
１１０Ｉ／Ｏスイッチ
１１１スイッチ交換部
１１２Ｉ／Ｏコントローラ
１１３Ａ〜Ｄオフセット減算部
１１４Ａ〜Ｄ宛先ポート検索部
１２０Ａ〜ＤＩ／Ｏデバイス
１３０管理サーバ
１３１管理ネットワーク
１１５Ａ〜Ｄ、１２１Ａ〜ＤＩ／Ｏインタフェース
１５０オフセット情報
１５１宛先ポートテーブル。 100 SMP servers 101A-D Blades 102A-D CPU
103A to D Memory 104A to D, 109 Internal interconnect 105A to D IOH (I / O hub)
110 I / O switch 111 Switch exchange unit 112 I / O controller 113A-D Offset subtraction unit 114A-D Destination port search unit 120A-D I / O device 130 Management server 131 Management network 115A-D, 121A-D I / O Interface 150 Offset information 151 Destination port table.

Claims

A computer system,
A server having a plurality of I / O interfaces and a plurality of processing units sharing a storage space;
An I / O device used by the server;
Comprising the I / O interface of the server and an I / O switch for connecting the I / O device;
The processor is
When the processing unit issues an instruction to the I / O device, the processing unit has an address conversion mechanism that adds a different offset value for each processing unit to destination data of data generated based on the command,
The I / O switch is
An offset subtraction unit that subtracts the corresponding offset value from the destination data issued by the server;
A computer system characterized by that.

The computer system according to claim 1,
The I / O switch is
A destination port table for storing a correspondence relationship between a plurality of areas constituting a part of the storage space and the I / O interface;
The destination port table is referred to by the destination data address of the data issued by the I / O device, and the data is transferred to the I / O interface obtained as a reference result.
A computer system characterized by that.

The computer system according to claim 1,
The instruction is a load instruction or a store instruction issued by the processing unit to the I / O device.
A computer system characterized by that.

The computer system according to claim 1,
The destination data is a destination address of data issued by the server or a destination ID (Identifier).
A computer system characterized by that.

The computer system according to claim 1,
The server is an SMP (Symmetric Multiple Processor) server.
A computer system characterized by that.

The computer system according to claim 5,
The SMP server includes a plurality of blades, and each of the blades includes the processing unit, the I / O interface, a storage unit, and an internal interconnect, and the plurality of processing units and the plurality of the storage units include the internal unit. By connecting to each other through an interconnect, a single main storage space is formed as the storage space.
A computer system characterized by that.

A data transfer method for a computer system comprising an I / O device used by a server having a plurality of processing units sharing a storage space with a plurality of I / O interfaces, and an I / O switch for connecting the I / O interface. And
The processor is
When issuing an instruction to the I / O device, a different offset value is given for each processing unit to destination data of data generated based on the instruction,
The I / O switch is
Transferring the data to the I / O device after removing the corresponding offset value from the destination data issued by the server;
A data transfer method characterized by the above.

The data transfer method according to claim 7, comprising:
The I / O switch is
Storing a correspondence relationship between a plurality of areas constituting a part of the storage space and the I / O interface;
The correspondence relationship stored is referred to by destination data of data issued by the I / O device, and the data is transferred to the I / O interface obtained as a reference result.
A data transfer method characterized by the above.

The data transfer method according to claim 7, comprising:
The processor is
As the instruction, a load instruction or a store instruction is issued to the I / O device.
A data transfer method characterized by the above.

The data transfer method according to claim 7, comprising:
The destination data is a destination address or a destination ID of the data issued by the processing unit.
A data transfer method characterized by the above.

An I / O switch for connecting a plurality of I / O interfaces of a server including a plurality of processing units sharing a storage space and an I / O device,
An offset subtraction unit that subtracts a different offset value for each I / O interface from destination data of data issued by the server;
A switch exchange unit connected to the output of the offset subtraction unit;
An I / O controller for controlling the offset subtraction unit and the switch exchange unit;
I / O switch characterized by that.

The I / O switch according to claim 11, wherein
The I / O controller
An offset information storage unit for storing the offset value;
I / O switch characterized by that.

The I / O switch according to claim 11, wherein
A destination search unit for searching for an I / O interface to be a destination among a plurality of the I / O interfaces of the server to be a destination according to destination data of data issued by the I / O device;
The data is transferred to the I / O interface obtained as a search result via the switch exchange unit.
I / O switch characterized by that.

The I / O switch according to claim 13, wherein
The I / O controller
A destination port table that stores information indicating an area of the storage space of the server and a correspondence relationship with the I / O interface;
The destination port search unit
The I / O interface is searched with reference to the destination port table.
I / O switch characterized by that.

The I / O switch according to claim 11, wherein
The destination data is a destination address of the data issued by the server or a destination ID (Identifier).
I / O switch characterized by that.