JP2007226494A

JP2007226494A - Data transfer system

Info

Publication number: JP2007226494A
Application number: JP2006046296A
Authority: JP
Inventors: Noriyuki Terao; 典之寺尾; Junichi Ikeda; 純一池田; Koji Oshikiri; 幸治押切; Mitsuharu Takeo; 光治竹尾
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2006-02-23
Filing date: 2006-02-23
Publication date: 2007-09-06

Abstract

<P>PROBLEM TO BE SOLVED: To provide a data transfer system wherein a transfer rate is not reduced without using an excess lane number even when transaction (data transfer) from a plurality of end points to one switch simultaneously occurs. <P>SOLUTION: In this data transfer system, a lane number of a link of a high-speed serial bus forming a data transfer route from the switch 14 to a node (a route complex 13) positioned on an upper side of a tree structure is set as a lane number obtained by adding lane numbers satisfying maximum values of respective data transfer amounts transmitted through the respective data transfer routes from the end points 21-27 to the switch 14. Thereby, because a bottleneck by simultaneous operation can be dissolved, the excess lane number is not used, and the transfer rate is not reduced even when the plurality of the data transfer are overlapped. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、データ転送システムに関する。 The present invention relates to a data transfer system.

一般に、画像データその他のデータを扱うデジタル複写機、複合機（ＭＦＰ）等の情報処理装置では、デバイス間のインタフェースにＰＣＩバスが使用されている。しかし、パラレル方式のＰＣＩバスでは、レーシングやスキューなどの問題があり、高速・高画質の画像形成装置に使用するには、転送レートが低い段階にきており、最近では、ＰＣＩバスのようなパラレル方式のインタフェースに代えて、IEEE1394やUSB等の高速シリアルインタフェースの使用が検討されている。例えば、特許文献１によれば、内部インタフェースとして、IEEE1394やUSB等の高速シリアルインタフェースを使用することが提案されている。 In general, in an information processing apparatus such as a digital copying machine or a multifunction peripheral (MFP) that handles image data and other data, a PCI bus is used as an interface between devices. However, the parallel PCI bus has problems such as racing and skew, and the transfer rate has been low for use in high-speed and high-quality image forming apparatuses. The use of a high-speed serial interface such as IEEE1394 or USB is being considered in place of the parallel interface. For example, according to Patent Document 1, it is proposed to use a high-speed serial interface such as IEEE1394 or USB as an internal interface.

また、他の高速シリアルインタフェースとして、ＰＣＩバス方式の後継規格に当るPCI Express（登録商標）なるインタフェースも提案され、実用化の段階にきている（例えば、非特許文献１参照）。このPCI Expressシステムは、概略的には、例えば非特許文献１中の図１等に示されるようなルートコンプレックス−スイッチ（任意階層）−デバイス等のツリー構造（木構造）によるデータ通信網として構成されている。 As another high-speed serial interface, an interface called PCI Express (registered trademark), which is a successor to the PCI bus system, has been proposed and has been put to practical use (for example, see Non-Patent Document 1). This PCI Express system is schematically configured as a data communication network having a tree structure (tree structure) such as a root complex-switch (arbitrary hierarchy) -device as shown in FIG. Has been.

特開２００１−０１６３８２号公報JP 2001-016382 A “PCI Express 規格の概要”Interface誌、July’2003 里見尚志“Outline of PCI Express Standard” Interface, July’2003 Naoshi Satomi

ところで、IEEE1394やUSB等の高速シリアルインタフェースを用いたデータ転送システムでは、ノード間の転送路の性能は固定（レーン数１）である。一方、PCI Expressでは、レーン数を変更することで、各転送路の性能を変更することができる。ただし、性能とコストとは、トレードオフの関係になっている。 By the way, in a data transfer system using a high-speed serial interface such as IEEE1394 or USB, the performance of the transfer path between nodes is fixed (number of lanes 1). On the other hand, in PCI Express, the performance of each transfer path can be changed by changing the number of lanes. However, there is a trade-off between performance and cost.

しかしながら、PCI Expressでは、スイッチを介したノード間での平均転送レートは、経路中の最も細い転送路の性能（リンク幅・レーン数）に支配されるため、１個所のボトルネックがある場合には、その他の転送路をいくら広くしてもその効果は得られない。ボトルネックは、物理的なレーン数だけでなく、同時に動作する複数のトランザクション（データ転送）を考慮する必要がある。 However, in PCI Express, the average transfer rate between nodes via a switch is governed by the performance (link width and number of lanes) of the narrowest transfer path in the path, so there is a bottleneck at one location. However, no effect is obtained no matter how wide the other transfer paths are. The bottleneck needs to consider not only the number of physical lanes but also a plurality of transactions (data transfer) operating simultaneously.

本発明は、上記に鑑みてなされたものであって、一のスイッチに対する複数のエンドポイントからのトランザクション（データ転送）が同時に発生するような場合であっても、余分なレーン数を用いることなく、転送レートが低下することのないデータ転送システムを提供することを目的とする。 The present invention has been made in view of the above, and even when transactions (data transfer) from a plurality of endpoints to one switch occur simultaneously, an extra lane number is not used. An object of the present invention is to provide a data transfer system in which the transfer rate does not decrease.

また、本発明は、各エンドポイントからスイッチに対するデータ転送は同時に行なわれることがないデータ転送システムにおいて、コスト性能を最適化することを目的とする。 Another object of the present invention is to optimize cost performance in a data transfer system in which data transfer from each endpoint to a switch is not performed simultaneously.

上述した課題を解決し、目的を達成するために、請求項１にかかる発明は、木構造によるデータ通信網としてポイントツーポイントで送受信独立の通信チャネルが確立されるとともに任意のレーン数を選択可能な高速シリアルバスを用いることにより一のスイッチに対して複数のエンドポイントが接続されており、前記各エンドポイントから前記スイッチに対してデータ転送を行なうデータ転送システムにおいて、前記スイッチから前記木構造の上位側に位置するノードに至るデータ転送経路を形成する前記高速シリアルバスのリンクのレーン数を、前記エンドポイントから前記スイッチに至る各データ転送経路を送信される各データ転送量の最大値を満たすレーン数をそれぞれ加算したレーン数とする。 In order to solve the above-mentioned problems and achieve the object, the invention according to claim 1 is a tree-structured data communication network in which a point-to-point independent communication channel is established and any number of lanes can be selected. In a data transfer system in which a plurality of end points are connected to one switch by using a high-speed serial bus, and data is transferred from each end point to the switch. The number of lanes of the link of the high-speed serial bus that forms the data transfer path to the node located on the upper side satisfies the maximum value of each data transfer amount transmitted through each data transfer path from the end point to the switch The number of lanes is added to each lane number.

また、請求項２にかかる発明は、木構造によるデータ通信網としてポイントツーポイントで送受信独立の通信チャネルが確立されるとともに任意のレーン数を選択可能な高速シリアルバスを用いることにより、一のスイッチに対して複数のエンドポイントが接続されており、前記各エンドポイントから前記スイッチに対するデータ転送は同時に行なわれることがないデータ転送システムにおいて、前記各エンドポイントから前記スイッチに至るデータ転送経路および当該スイッチから前記木構造の上位側に位置するノードに至るデータ転送経路を形成する前記高速シリアルバスのリンクの全てのレーン数を、前記データ転送経路を送信されるデータ転送量の最大値を満たすレーン数に統一する。 Further, the invention according to claim 2 uses a high-speed serial bus in which a communication channel independent of transmission and reception is established point-to-point as a tree-structured data communication network, and an arbitrary number of lanes can be selected. In a data transfer system in which a plurality of end points are connected to each other, and data transfer from each end point to the switch is not performed simultaneously, a data transfer path from each end point to the switch and the switch The number of lanes satisfying the maximum value of the data transfer amount transmitted through the data transfer path is the number of all lanes of the link of the high-speed serial bus that forms the data transfer path from the node to the node positioned on the upper side of the tree structure. To unify.

また、請求項３にかかる発明は、請求項１または２記載のデータ転送システムにおいて、前記高速シリアルバスは、PCI Express 規格の高速シリアルバスである。 According to a third aspect of the present invention, in the data transfer system according to the first or second aspect, the high-speed serial bus is a PCI Express standard high-speed serial bus.

請求項１にかかる発明によれば、スイッチから木構造の上位側に位置するノードに至るデータ転送経路を形成する高速シリアルバスのリンクのレーン数を、エンドポイントからスイッチに至る各データ転送経路を送信される各データ転送量の最大値を満たすレーン数をそれぞれ加算したレーン数とすることにより、一のスイッチに対する複数のエンドポイントからのトランザクション（データ転送）が同時に発生するような場合であっても、同時動作によるボトルネックを解消することができるので、余分なレーン数を用いることなく、複数のデータ転送が重なった場合にも転送レートが低下することのないデータ転送システムを提供することができるという効果を奏する。 According to the first aspect of the present invention, the number of lanes of the link of the high-speed serial bus that forms the data transfer path from the switch to the node located on the upper side of the tree structure is determined, and each data transfer path from the end point to the switch is determined. This is the case where transactions (data transfer) from multiple endpoints to one switch occur simultaneously by setting the number of lanes to which the maximum value of each data transfer amount to be transmitted is added. However, since the bottleneck due to the simultaneous operation can be eliminated, it is possible to provide a data transfer system in which the transfer rate does not decrease even when a plurality of data transfers overlap without using an extra number of lanes. There is an effect that can be done.

また、請求項２にかかる発明によれば、各エンドポイントからスイッチに至るデータ転送経路および当該スイッチから木構造の上位側に位置するノードに至るデータ転送経路を形成する高速シリアルバスのリンクの全てのレーン数を、データ転送経路を送信されるデータ転送量の最大値を満たすレーン数に統一することにより、１つのデータ転送に関するデータ転送経路上のすべてのデータ転送路のレーン数を、所望の転送レートが得られるレーン数で等しくそろえることで、コスト性能を最適化することができるという効果を奏する。 According to the invention of claim 2, all the links of the high-speed serial bus that form the data transfer path from each end point to the switch and the data transfer path from the switch to the node located on the upper side of the tree structure The number of lanes of all data transfer paths on the data transfer path related to one data transfer is changed to a desired number of lanes by unifying the number of lanes to the number of lanes satisfying the maximum value of the data transfer amount transmitted through the data transfer path. By making the transfer rate equal to the number of lanes from which the transfer rate can be obtained, the cost performance can be optimized.

また、請求項３にかかる発明によれば、高速シリアルバスは、PCI Express 規格の高速シリアルバスであることにより、低電圧差動信号伝送、ポイントツーポイントで送受信独立の通信チャネル、パケット化されたスプリットトランザクション、リンク構成の違いによる高いスケーラビリティなどの特徴を有するという効果を奏する。 According to the invention of claim 3, the high-speed serial bus is a PCI Express standard high-speed serial bus, so that low-voltage differential signal transmission, point-to-point independent transmission and reception communication channels, and packetization It has the effect of having features such as high scalability due to differences in split transactions and link configurations.

以下に添付図面を参照して、この発明にかかるデータ転送システムの最良な実施の形態を詳細に説明する。 Exemplary embodiments of a data transfer system according to the present invention will be explained below in detail with reference to the accompanying drawings.

［第１の実施の形態］
本発明の第１の実施の形態を図１ないし図１８に基づいて説明する。以下では、PCI Expressの詳細について、［PCI Express規格の概要］〜［PCI Express のアーキテクチャの詳細］の欄で説明し、その後、本実施の形態のデータ転送システムについて、［データ転送システム］の欄で説明する。 [First Embodiment]
A first embodiment of the present invention will be described with reference to FIGS. In the following, details of PCI Express will be described in the columns [Outline of PCI Express Standard] to [Details of Architecture of PCI Express], and then the [Data Transfer System] column for the data transfer system of the present embodiment. I will explain it.

［PCI Express規格の概要］
まず、本実施の形態は高速シリアルバスの一つであるPCI Express（登録商標）を利用するものであり、本実施の形態の前提として当該PCI Express規格の概要について、非特許文献１の一部抜粋により説明する。ここに、高速シリアルバスとは、１本の伝送路を用いてシリアル（直列）伝送により高速（１００Ｍｂｐｓ程度以上）にデータをやり取りすることができるインタフェースを意味する。 [Outline of PCI Express standard]
First, this embodiment uses PCI Express (registered trademark), which is one of high-speed serial buses. As an assumption of this embodiment, an outline of the PCI Express standard is a part of Non-Patent Document 1. Explained with excerpts. Here, the high-speed serial bus means an interface capable of exchanging data at high speed (about 100 Mbps or more) by serial (serial) transmission using a single transmission line.

PCI Expressは、PCIの後継規格としてコンピュータ全般に通用する標準拡張バスとして規格化されたバスであり、概略的には、低電圧差動信号伝送、ポイントツーポイントで送受信独立の通信チャネル、パケット化されたスプリットトランザクション、リンク構成の違いによる高いスケーラビリティなどの特徴を持つ。 PCI Express is a standardized expansion bus that can be used for all computers as a successor to PCI. In general, low-voltage differential signal transmission, point-to-point independent communication channels, and packetization Split transactions and high scalability due to differences in link configuration.

図１に既存のPCIシステム、図２にPCI Expressシステムの各々の構成例を示す。既存のPCIシステムにあっては、ＣＰＵ１００やＡＧＰグラフィックス１０１やメモリ１０２が接続されたホストブリッジ１０３に対して、PCI-X（PCIの上位互換規格）デバイス１０４ａ，１０４ｂがPCI-Xブリッジ１０５ａを介して接続されたり、PCIデバイス１０４ｃ，１０４ｄが接続されたPCIブリッジ１０５ｂやPCIバススロット１０６が接続されたPCIブリッジ１０７がPCIブリッジ１０５ｃを介して接続されたりしたツリー構造（木構造）とされている。 FIG. 1 shows a configuration example of an existing PCI system, and FIG. 2 shows a configuration example of a PCI Express system. In an existing PCI system, PCI-X (PCI upward compatible standard) devices 104a and 104b connect a PCI-X bridge 105a to a host bridge 103 to which a CPU 100, AGP graphics 101, and memory 102 are connected. Or a PCI bridge 105b to which the PCI devices 104c and 104d are connected and a PCI bridge 107 to which the PCI bus slot 106 is connected are connected via the PCI bridge 105c (tree structure). Yes.

これに対して、PCI Expressシステムにあっては、ＣＰＵ１１０やメモリ１１１が接続されたルートコンプレックス１１２に対して、PCI Expressグラフィックス１１３がPCI Express１１４ａにより接続され、また、エンドポイント１１５ａやレガシーエンドポイント１１６ａがPCI Express１１４ｂにより接続されたスイッチ１１７ａがPCI Express１１４ｃにより接続され、さらには、エンドポイント１１５ｂやレガシーエンドポイント１１６ｂがPCI Express１１４ｄにより接続されたスイッチ１１７ｂやPCIバススロット１１８が接続されたPCIブリッジ１１９がPCI Express１１４ｅにより接続されたスイッチ１１７ｃがPCI Express１１４ｆにより接続されたツリー構造（木構造）とされている。 On the other hand, in the PCI Express system, the PCI Express graphics 113 is connected by the PCI Express 114a to the root complex 112 to which the CPU 110 and the memory 111 are connected, and the endpoint 115a and the legacy endpoint 116a. PCI Express 114b connects the switch 117a to which the PCI Express 114b is connected, and the PCI bridge 119 to which the switch 117b to which the endpoint 115b and the legacy endpoint 116b are connected by the PCI Express 114d and the PCI bus slot 118 are connected. The switch 117c connected by the Express 114e has a tree structure (tree structure) connected by the PCI Express 114f.

実際に想定されるPCI Expressプラットホーム例を図３に示す。図示例は、デスクトップ／モバイルへの適用例を示し、ＣＰＵ１２１がＣＰＵホストバス１２２により接続され、メモリ１２３が接続されたメモリハブ１２４（ルートコンプレックスに相当する）に対して、例えば、グラフィックス１２５がｘ１６のPCI Express１２６ａにより接続され、また、変換機能を有するＩ／Ｏハブ１２７がPCI Express１２６ｂにより接続されている。このＩ／Ｏハブ１２７には、例えば、Serial ATA１２８によりストレージ１２９が接続され、LPC１３０によりローカルＩ／Ｏ１３１が接続され、USB 2.0１３２やPCIバススロット１３３が接続されている。さらには、Ｉ／Ｏハブ１２７には、PCI Express１２６ｃによりスイッチ１３４が接続され、このスイッチ１３４には、各々、PCI Express１２６ｄ，１２６ｅ，１２６ｆによりモバイルドック１３５、ギガビットイーサネット１３６（イーサネットは登録商標）、アドインカード１３７が接続されている。 An example of an actually assumed PCI Express platform is shown in FIG. The illustrated example shows an application example to desktop / mobile. For example, graphics 125 is x16 with respect to a memory hub 124 (corresponding to a root complex) to which a CPU 121 is connected by a CPU host bus 122 and a memory 123 is connected. PCI Express 126a and an I / O hub 127 having a conversion function are connected by PCI Express 126b. For example, a storage 129 is connected to the I / O hub 127 by a Serial ATA 128, a local I / O 131 is connected by an LPC 130, and a USB 2.0 132 and a PCI bus slot 133 are connected. Further, a switch 134 is connected to the I / O hub 127 by a PCI Express 126c, and the mobile dock 135, Gigabit Ethernet 136 (Ethernet is a registered trademark), and an add-in are connected to the switch 134 by PCI Express 126d, 126e, and 126f, respectively. A card 137 is connected.

即ち、PCI Expressシステムでは、従来のPCI，PCI-X，AGPといったバスがPCI Expressで置き換わり、既存のPCI／PCI-Xデバイスを接続するためにブリッジが使用される。チップセット間の接続もPCI Express接続となり、IEEE1394，Serial ATA，USB 2.0などの既存のバスはＩ／ＯハブによりPCI Expressに接続される。 That is, in the PCI Express system, the conventional PCI, PCI-X, AGP bus is replaced with PCI Express, and a bridge is used to connect an existing PCI / PCI-X device. Connection between chipsets is also PCI Express connection, and existing buses such as IEEE1394, Serial ATA, and USB 2.0 are connected to PCI Express by an I / O hub.

［PCI Expressの構成要素］
Ａ．ポート（Port）／レーン（Lane）／リンク（Link）
図４に物理層の構造を示す。ポートは、物理的には同一半導体内にあり、リンクを形成するトランスミッタ／レシーバの集合で、論理的にはコンポーネント・リンク間を１対１で接続（ポイント・ツー・ポイント）するインタフェースを意味する。転送レートは、例えば片方向２．５Ｇｂｐｓとされている。レーンは、例えば０．８Ｖの差動信号ペアのセットで、送信側の信号ペア（２本）、受信側の信号ペア（２本）からなる。リンクは、２つのポートとその間を結ぶレーンの集まりであり、コンポーネント間のデュアルシンプレックス通信バスである。「ｘＮリンク」はＮ本のレーンから構成され、現在の規格では、Ｎ＝１，２，４，８，１６，３２が定義されている。図示例は、ｘ４リンク例である。例えば、図５に示すように、デバイスＡ，Ｂ間を結ぶこのレーン幅Ｎを可変することにより、スケーラブルなバンド幅を構成することが可能となる。 [Components of PCI Express]
A. Port / Lane / Link
FIG. 4 shows the structure of the physical layer. A port is a set of transmitters / receivers that are physically in the same semiconductor and form a link, and logically means an interface that connects component links in a one-to-one relationship (point-to-point). . The transfer rate is, for example, 2.5 Gbps in one direction. The lane is, for example, a set of 0.8 V differential signal pairs, and includes a transmission-side signal pair (two) and a reception-side signal pair (two). A link is a collection of lanes connecting two ports and the two ports, and is a dual simplex communication bus between components. The “xN link” is composed of N lanes, and N = 1, 2, 4, 8, 16, 32 are defined in the current standard. The illustrated example is an x4 link example. For example, as shown in FIG. 5, by changing the lane width N connecting the devices A and B, a scalable bandwidth can be configured.

Ｂ．ルートコンプレックス（Root Complex）
ルートコンプレックス１１２は、Ｉ／Ｏ構造の最上位に位置し、ＣＰＵやメモリサブシステムをＩ／Ｏに接続する。ブロック図などでは、図３に示すように、「メモリハブ」と記述されることが多い。ルートコンプレックス１１２（又は、１２４）は、１つ以上のPCI Expressポート（ルートポート）（図２中では、ルートコンプレックス１１２中の四角で示す）を持ち、各々のポートは独立したＩ／Ｏ階層ドメインを形成する。Ｉ／Ｏ階層ドメインは、単純なエンドポイントである場合（例えば、図２中のエンドポイント１１５ａ側の例）や、多数のスイッチやエンドポイントから形成される場合（例えば、図２中のエンドポイント１１５ｂやスイッチ１１７ｂ，１１５ｃ側の例）がある。 B. Root Complex
The root complex 112 is located at the highest level of the I / O structure, and connects the CPU and the memory subsystem to the I / O. In a block diagram or the like, as shown in FIG. 3, it is often described as “memory hub”. The root complex 112 (or 124) has one or more PCI Express ports (root ports) (indicated by squares in the root complex 112 in FIG. 2), and each port is an independent I / O hierarchical domain. Form. The I / O hierarchical domain is a simple endpoint (for example, the example of the endpoint 115a side in FIG. 2), or is formed from a large number of switches and endpoints (for example, the endpoint in FIG. 2). 115b and switches 117b and 115c side).

Ｃ．エンドポイント（End Point）
エンドポイント１１５は、タイプ００ｈのコンフィグレーション空間ヘッダを持つデバイス（具体的には、ブリッジ以外のデバイス）で、レガシーエンドポイントとPCI Expressエンドポイントとに分けられる。両者の大きな違いは、PCI ExpressエンドポイントはＢＡＲ（ベースアドレスレジスタ）でＩ／Ｏリソースを要求せず、このためＩ／Ｏリクエストを要求しない。また、PCI Expressエンドポイントは、ロックリクエストもサポートしていない。 C. End point
The endpoint 115 is a device having a configuration space header of type 00h (specifically, a device other than a bridge), and is divided into a legacy endpoint and a PCI Express endpoint. The major difference between the two is that the PCI Express endpoint does not request I / O resources in the BAR (Base Address Register), and therefore does not request an I / O request. PCI Express endpoints also do not support lock requests.

Ｄ．スイッチ（Switch）
スイッチ１１７（又は、１３４）は、２つ以上のポートを結合し、ポート間でのパケットルーティングを行う。コンフィグレーションソフトウェアからは、当該スイッチは、図６に示すように、仮想PCI-PCIブリッジ１４１の集合体として認識される。図中、両矢印はPCI Expressリンク１１４（又は、１２６）を示し、１４２ａ〜１４２ｄはポートを示す。このうち、ポート１４２ａはルートコンプレックスに近い方のアップストリームポートであり、ポート１４２ｂ〜１４２ｄはルートコンプレックスから遠い方のダウンストリームポートである。 D. Switch
The switch 117 (or 134) couples two or more ports and performs packet routing between the ports. From the configuration software, the switch is recognized as a collection of virtual PCI-PCI bridges 141 as shown in FIG. In the figure, double arrows indicate the PCI Express link 114 (or 126), and 142a to 142d indicate ports. Among these, the port 142a is an upstream port closer to the root complex, and the ports 142b to 142d are downstream ports farther from the root complex.

Ｅ．PCI Express１１４ｅ−PCIブリッジ１１９
PCI ExpressからPCI／PCI-Xへの接続を提供する。これにより、既存のPCI／PCI-XデバイスをPCI Expressシステム上で使用することができる。 E. PCI Express 114e-PCI bridge 119
Provides connection from PCI Express to PCI / PCI-X. Thereby, an existing PCI / PCI-X device can be used on the PCI Express system.

［階層アーキテクチャ］
従来のＰＣＩのアーキテクチャは、図７−１に示すように、プロトコルとシグナリングが密接に関連する構造であり階層という考え方はなかったが、PCI Expressでは、図７−２に示すように、一般的な通信プロトコルやInfiniBandのように、独立した階層構造とされ、各層に分けて仕様が定義されている。即ち、最上位のソフトウェア１５１、最下位の機構（メカニカル）部１５２間に、トランザクション層１５３、データリンク層１５４、物理層１５５を持つ構造とされている。これにより、各層のモジュール性が確保され、スケーラビリティを持たせることやモジュールの再利用が可能となる。例えば、新たな信号コーディング方式や伝送媒体を採用する場合、物理層を変更するだけでデータリンク層やトランザクション層は変更せずに対応できる。 [Hierarchical architecture]
As shown in FIG. 7A, the conventional PCI architecture has a structure in which protocols and signaling are closely related and there is no concept of hierarchy. In PCI Express, as shown in FIG. Like the standard communication protocol and InfiniBand, it has an independent hierarchical structure, and specifications are defined for each layer. In other words, a transaction layer 153, a data link layer 154, and a physical layer 155 are provided between the uppermost software 151 and the lowermost mechanism (mechanical) unit 152. Thereby, the modularity of each layer is ensured, and it becomes possible to provide scalability and reuse the module. For example, when adopting a new signal coding method or transmission medium, it is possible to cope with only changing the physical layer without changing the data link layer or the transaction layer.

PCI Expressのアーキテクチャの中心となるのは、トランザクション層１５３、データリンク層１５４、物理層１５５であり、各々図８を参照して説明する以下のような役割を持つ。 The core of the PCI Express architecture is a transaction layer 153, a data link layer 154, and a physical layer 155, each having the following roles described with reference to FIG.

Ａ．トランザクション層１５３
トランザクション層１５３は、最上位に位置し、トランザクションレイヤパケット（ＴＬＰ）の組み立て、分解機能を持つ。トランザクションレイヤパケット（ＴＬＰ）は、リード／ライト、各種イベントといったトランザクションの伝達に用いられる。また、トランザクション層１５３は、トランザクションレイヤパケット（ＴＬＰ）のためのクレジットを用いたフロー制御を行う。各層１５３〜１５５におけるトランザクションレイヤパケット（ＴＬＰ）の概要を図９に示す（詳細は、後述する）。 A. Transaction layer 153
The transaction layer 153 is located at the highest level and has a function of assembling and disassembling a transaction layer packet (TLP). The transaction layer packet (TLP) is used for transmission of transactions such as read / write and various events. The transaction layer 153 performs flow control using credits for transaction layer packets (TLP). An outline of a transaction layer packet (TLP) in each of the layers 153 to 155 is shown in FIG. 9 (details will be described later).

Ｂ．データリンク層１５４
データリンク層１５４の主な役割は、エラー検出／訂正（再送）によりトランザクションレイヤパケット（ＴＬＰ）のデータ完全性を保証することと、リンク管理である。データリンク層１５４間では、リンク管理やフロー制御のためのパケットのやり取りを行う。このパケットは、トランザクションレイヤパケット（ＴＬＰ）と区別するために、データリンクレイヤパケット（ＤＬＬＰ）と呼ばれる。 B. Data link layer 154
The main role of the data link layer 154 is to guarantee data integrity of the transaction layer packet (TLP) by error detection / correction (retransmission) and link management. Packets for link management and flow control are exchanged between the data link layers 154. This packet is called a data link layer packet (DLLP) to distinguish it from a transaction layer packet (TLP).

Ｃ．物理層１５５
物理層１５５は、ドライバ、入力バッファ、パラレル−シリアル／シリアル−パラレル変換器、ＰＬＬ、インピーダンス整合回路といったインタフェース動作に必要な回路を含んでいる。また、論理的な機能としてインタフェースの初期化・保守の機能を持つ。物理層１５５は、データリンク層１５４／トランザクション層１５３を実際のリンクで使用される信号技術から独立させる役目も持っている。 C. Physical layer 155
The physical layer 155 includes circuits necessary for interface operations such as a driver, an input buffer, a parallel-serial / serial-parallel converter, a PLL, and an impedance matching circuit. It also has interface initialization / maintenance functions as logical functions. The physical layer 155 also serves to make the data link layer 154 / transaction layer 153 independent of the signaling technology used in the actual link.

なお、PCI Expressのハードウェア構成上、エンベデッド・クロックという技術を採用しており、クロック信号はなく、クロックのタイミングはデータ信号中に埋め込まれており、受信側でデータ信号のクロス・ポイントを基にクロックを抽出する方式とされている。 The PCI Express hardware configuration employs a technology called embedded clock, there is no clock signal, the clock timing is embedded in the data signal, and the receiving side is based on the cross-point of the data signal. The system extracts the clock.

［コンフィグレーション空間］
PCI Expressは、従来のPCIと同様にコンフィグレーション空間を持つが、その大きさは従来のPCIが２５６バイトであるのに対して、図１０に示すように、４０９６バイトへと拡張されている。これにより、多数のデバイス固有レジスタセットを必要とするデバイス（ホストブリッジなど）に対しても、将来的に十分な空間が確保されている。PCI Expressでは、コンフィグレーション空間へのアクセスは、フラットなメモリ空間へのアクセス（コンフィグレーションリード／ライト）で行われ、バス／デバイス／機能／レジスタ番号はメモリアドレスにマップされている。 [Configuration space]
PCI Express has a configuration space like conventional PCI, but its size is expanded to 4096 bytes as shown in FIG. 10, whereas conventional PCI has 256 bytes. As a result, sufficient space is secured in the future even for devices (such as host bridges) that require a large number of device-specific register sets. In PCI Express, the configuration space is accessed by accessing a flat memory space (configuration read / write), and the bus / device / function / register number is mapped to a memory address.

当該空間の先頭２５６バイトは、PCIコンフィグレーション空間として、ＢＩＯＳや従来のＯＳからＩ／Ｏポートを使用した方法でもアクセスできる。従来のアクセスをPCI Expressでのアクセスに変換する機能は、ホストブリッジ上に実装される。００ｈから３ＦｈまではPCI2.3互換のコンフィグレーションヘッダとなっている。これにより、PCI Expressで拡張された機能以外であれば、従来のＯＳやソフトウェアをそのまま使用することができる。即ち、PCI Expressにおけるソフトウェア層は、既存のPCIと互換性を保ったロード・ストア・アーキテクチャ（プロセッサが直接Ｉ／Ｏレジスタをアクセスする方式）を継承している。しかし、PCI Expressで拡張された機能（例えば、同期転送やＲＡＳ（Reliability,Availability and Serviceability）などの機能）を使用するには、４ＫバイトのPCI Express拡張空間にアクセスできるようにする必要がある。 The first 256 bytes of the space can be accessed as a PCI configuration space by a method using an I / O port from a BIOS or a conventional OS. The function of converting the conventional access to the access by PCI Express is implemented on the host bridge. From 00h to 3Fh, it is a PCI2.3 compatible configuration header. As a result, a conventional OS and software can be used as they are except for functions extended by PCI Express. That is, the software layer in PCI Express inherits a load / store architecture (a method in which a processor directly accesses an I / O register) that is compatible with the existing PCI. However, in order to use functions extended by PCI Express (for example, functions such as synchronous transfer and RAS (Reliability, Availability and Serviceability)), it is necessary to make it possible to access a 4 Kbyte PCI Express expansion space.

なお、PCI Expressとしては様々なフォームファクタ（形状）が考えられるが、具体化している例としては、アドインカード、プラグインカード（Express Card）、Mini PCI Expressなどがある。 Although various form factors (shapes) can be considered as PCI Express, examples of specific examples include an add-in card, a plug-in card (Express Card), and Mini PCI Express.

［PCI Express のアーキテクチャの詳細］
PCI Express のアーキテクチャの中心となっているトランザクション層１５３、データリンク層１５４、物理層１５５について、各々詳細に説明する。 [PCI Express architecture details]
The transaction layer 153, data link layer 154, and physical layer 155, which are the core of the PCI Express architecture, will be described in detail.

Ａ．トランザクション層１５３
トランザクション層１５３の主な役割は、前述したように、上位のソフトウェア層１５１と下位のデータリンク層１５４との間でトランザクションレイヤパケット（ＴＬＰ）の組み立てと分解を行うことである。 A. Transaction layer 153
The main role of the transaction layer 153 is to assemble and disassemble transaction layer packets (TLP) between the upper software layer 151 and the lower data link layer 154 as described above.

ａ．アドレス空間とトランザクションタイプ
PCI Expressでは、従来のPCIでサポートされていたメモリ空間（メモリ空間とのデータ転送用）、Ｉ／Ｏ空間（Ｉ／Ｏ空間とのデータ転送用）、コンフィグレーション空間（デバイスのコンフィグレーションとセットアップ用）に加えて、メッセージ空間（PCI Expressデバイス間のインバンドでのイベント通知や一般的なメッセージ送信（交換）用…割り込み要求や確認は、メッセージを「仮想ワイヤ」として使用することにより伝達される）が追加され、４つのアドレス空間が定義されている。各々の空間に対してトランザクションタイプが定義されている（メモリ空間、Ｉ／Ｏ空間、コンフィグレーション空間は、リード／ライト、メッセージ空間は基本（ベンダ定義含む））。 a. Address space and transaction type
In PCI Express, memory space (for data transfer with memory space), I / O space (for data transfer with I / O space), and configuration space (device configuration and setup) supported by conventional PCI In addition to message space (in-band event notification between PCI Express devices and general message transmission (exchange) ... Interrupt requests and confirmations are communicated by using the message as a "virtual wire" And four address spaces are defined. Transaction types are defined for each space (memory space, I / O space, configuration space is read / write, and message space is basic (including vendor definition)).

ｂ．トランザクションレイヤパケット（ＴＬＰ）
PCI Expressは、パケット単位で通信を行う。図９に示したトランザクションレイヤパケット（ＴＬＰ）のフォーマットにおいて、ヘッダのヘッダ長は３ＤＷ（ＤＷはダブルワードの略；合計１２バイト）又は４ＤＷ（１６バイト）で、トランザクションレイヤパケット（ＴＬＰ）のフォーマット（ヘッダ長とペイロードの有無）、トランザクションタイプ、トラフィッククラス（ＴＣ）、アトリビュートやペイロード長などの情報が含まれる。パケット内の最大ペイロード長は１０２４ＤＷ（４０９６バイト）である。 b. Transaction layer packet (TLP)
PCI Express performs communication in units of packets. In the transaction layer packet (TLP) format shown in FIG. 9, the header length of the header is 3DW (DW is an abbreviation of double word; total 12 bytes) or 4DW (16 bytes), and the transaction layer packet (TLP) format ( Information such as header length and presence / absence of payload), transaction type, traffic class (TC), attribute, and payload length are included. The maximum payload length in the packet is 1024 DW (4096 bytes).

ＥＣＲＣは、エンドツーエンドのデータ完全性を保証するためのもので、トランザクションレイヤパケット（ＴＬＰ）部分の３２ビットＣＲＣである。これは、スイッチ内部などでトランザクションレイヤパケット（ＴＬＰ）にエラーが発生した場合、ＬＣＲＣ（リンクＣＲＣ）ではエラーを検出できないためである（エラーとなったＴＬＰでＬＣＲＣが再計算されるため）。 ECRC is an end-to-end data integrity guarantee and is a 32-bit CRC of the transaction layer packet (TLP) portion. This is because when an error occurs in the transaction layer packet (TLP) inside the switch or the like, the LCRC (link CRC) cannot detect the error (because the LCRC is recalculated with the TLP in error).

リクエストは、完了パケットが不要なものと必要なものとがある。 Some requests do not require a completion packet, and some requests.

ｃ．トラフィッククラス（ＴＣ）と仮想チャネル（ＶＣ）
上位のソフトウェアは、トラフィッククラス（ＴＣ）を使用することによりトラフィックの差別化（優先度をつける）を行うことができる。例えば、映像データをネットワークのデータよりも優先して転送する、といったことが可能となる。トラフィッククラス（ＴＣ）はＴＣ０からＴＣ７まで８つある。 c. Traffic class (TC) and virtual channel (VC)
Upper software can differentiate (prioritize) traffic by using a traffic class (TC). For example, video data can be transferred with priority over network data. There are eight traffic classes (TC) from TC0 to TC7.

仮想チャネル（ＶＣ：Vertual Channel）は、各々独立した仮想通信バス（同一のリンクを共用する複数の独立したデータ・フロー・バッファを使用するメカニズム）で、各々がリソース（バッファやキュー）を持ち、図１１に示すように、独立したフロー制御を行う。これにより、１つの仮想チャネルのバッファが満杯の状態（full）になっても、他の仮想チャネルの転送を行うことができる。つまり、物理的には１つのリンクを仮想的な複数のチャネルに分けることで、有効に使用することができる。例えば、図１１中に示すように、スイッチを経由してルートのリンクが複数のデバイスに分かれる場合、各デバイスのトラフィックの優先度を制御することができる。ＶＣ０は必須で、コストパフォーマンスのトレードオフに応じてその他の仮想チャネル（ＶＣ１〜ＶＣ７）が実装される。図１１中の実線矢印は、デフォルト仮想チャネル（ＶＣ０）を示し、破線矢印はその他の仮想チャネル（ＶＣ１〜ＶＣ７）を示している。 A virtual channel (VC) is an independent virtual communication bus (a mechanism that uses a plurality of independent data flow buffers sharing the same link), each having resources (buffers and queues) As shown in FIG. 11, independent flow control is performed. Thereby, even if the buffer of one virtual channel becomes full (full), the transfer of another virtual channel can be performed. In other words, it can be effectively used by physically dividing one link into a plurality of virtual channels. For example, as shown in FIG. 11, when a route link is divided into a plurality of devices via a switch, the priority of traffic of each device can be controlled. VC0 is indispensable, and other virtual channels (VC1 to VC7) are mounted in accordance with the cost performance trade-off. The solid line arrow in FIG. 11 indicates the default virtual channel (VC0), and the broken line arrow indicates the other virtual channels (VC1 to VC7).

トランザクション層内では、トラフィッククラス（ＴＣ）が仮想チャネル（ＶＣ）にマッピングされる。１つの仮想チャネル（ＶＣ）に対して１つ又は複数のトラフィッククラス（ＴＣ）をマッピングできる（仮想チャネル（ＶＣ）の数が少ない場合）。単純な例では、各トラフィッククラス（ＴＣ）から各仮想チャネル（ＶＣ）に１対１、全てのトラフィッククラス（ＴＣ）を仮想チャネルＶＣ０にマッピングする、といったことが考えられる。ＴＣ０−ＶＣ０のマッピングは、必須／固定で、それ以外のマッピングは上位のソフトウェアから制御される。ソフトウェアはトラフィッククラス（ＴＣ）を利用することで、トランザクションの優先度を制御することが可能となる。 Within the transaction layer, a traffic class (TC) is mapped to a virtual channel (VC). One or more traffic classes (TC) can be mapped to one virtual channel (VC) (when the number of virtual channels (VC) is small). In a simple example, it can be considered that each traffic class (TC) is mapped to each virtual channel (VC) on a one-to-one basis, and all traffic classes (TC) are mapped to the virtual channel VC0. The mapping of TC0-VC0 is essential / fixed, and the other mappings are controlled from the upper software. The software can control the priority of the transaction by using the traffic class (TC).

ｄ．フロー制御
受信バッファのオーバーフローを避け、伝送順序を確立するためにフロー制御（ＦＣ：Flow Control）が行われる。フロー制御は、リンク間のポイントツーポイントで行われ、エンドツーエンドではない。従って、フロー制御により最終的な相手（コンプリータ）にパケットが届いたことを確認することはできない。 d. Flow control Flow control (FC) is performed in order to avoid overflow of the reception buffer and establish the transmission order. Flow control is done point-to-point between links, not end-to-end. Therefore, it cannot be confirmed that the packet has reached the final partner (completer) by flow control.

PCI Expressのフロー制御は、クレジット・ベースで行われる（データ転送を始める前に、受け取り側のバッファの空き状況を確認し、オーバーフロー、アンダフローが発生しないメカニズム）。即ち、受信側はリンク初期化時にバッファ容量（クレジット値）を送信側に通知し、送信側はクレジット値と送信するパケットの長さとを比較し、一定の残りがある場合のみパケットを送信する。このクレジットには６種類ある。 PCI Express flow control is performed on a credit basis (mechanism to check the buffer availability on the receiving side before starting data transfer and prevent overflow and underflow). That is, the receiving side notifies the transmitting side of the buffer capacity (credit value) at the time of link initialization, and the transmitting side compares the credit value with the length of the packet to be transmitted, and transmits the packet only when there is a certain remaining. There are six types of credits.

フロー制御の情報交換はデータリンク層のデータリンクレイヤパケット（ＤＬＬＰ）を使用して行われる。フロー制御はトランザクションレイヤパケット（ＴＬＰ）のみに適用され、データリンクレイヤパケット（ＤＬＬＰ）には適用されない（ＤＬＬＰは常時送受信可能）。 Flow control information exchange is performed using data link layer packets (DLLP) in the data link layer. The flow control is applied only to the transaction layer packet (TLP) and not to the data link layer packet (DLLP) (DLLP can always be transmitted / received).

Ｂ．データリンク層１５４
データリンク層１５４の主な役割は、前述したように、リンク上の２つのコンポーネント間での信頼性の高いトランザクションレイヤパケット（ＴＬＰ）交換機能を提供することである。 B. Data link layer 154
The main role of the data link layer 154 is to provide a reliable transaction layer packet (TLP) exchange function between two components on the link, as described above.

ａ．トランザクションレイヤパケット（ＴＬＰ）の扱い
トランザクション層１５３から受け取ったトランザクションレイヤパケット（ＴＬＰ）に対しては、先頭に２バイトのシーケンス番号、末尾に４バイトのリンクＣＲＣ（ＬＣＲＣ）を付加して、物理層１５５に渡す（図９参照）。トランザクションレイヤパケット（ＴＬＰ）は、リトライバッファに保管され、相手から受信確認（ＡＣＫ）が届くまで再送される。トランザクションレイヤパケット（ＴＬＰ）の送信に失敗が続いた場合は、リンク異常であると判断して物理層１５５に対してリンクの再トレーニングを要求する。リンクのトレーニングが失敗した場合、データリンク層１５４の状態はインアクティブに遷移する。 a. Handling of transaction layer packet (TLP) For the transaction layer packet (TLP) received from the transaction layer 153, a 2-byte sequence number at the beginning and a 4-byte link CRC (LCRC) at the end are added to the physical layer. To 155 (see FIG. 9). The transaction layer packet (TLP) is stored in the retry buffer and retransmitted until a reception confirmation (ACK) is received from the partner. When the transmission of the transaction layer packet (TLP) continues to fail, it is determined that the link is abnormal, and the physical layer 155 is requested to retrain the link. If link training fails, the state of the data link layer 154 transitions to inactive.

物理層１５５から受け取ったトランザクションレイヤパケット（ＴＬＰ）は、シーケンス番号とリンクＣＲＣ（ＬＣＲＣ）が検査され、正常であればトランザクション層１５３に渡され、エラーがあった場合は再送を要求する。 The transaction layer packet (TLP) received from the physical layer 155 is inspected for the sequence number and the link CRC (LCRC). If normal, the transaction layer packet (TLP) is passed to the transaction layer 153. If there is an error, a retransmission is requested.

ｂ．データリンクレイヤパケット（ＤＬＬＰ）
トランザクションレイヤパケット（ＴＬＰ）は、物理層から送信されるときに自動的に図１２に示すようなデータリンクレイヤパケット（ＤＬＬＰ）に分割されて各レーンに送信される。データリンク層１５４が生成するパケットは、データリンクレイヤパケット（ＤＬＬＰ）と呼ばれ、データリンク層１５４間でやり取りされる。データリンクレイヤパケット（ＤＬＬＰ）には、
・Ack／Nak：ＴＬＰの受信確認、リトライ（再送）
・InitFC1／InitFC2／UpdateFC：フロー制御の初期化とアップデート
・電源管理のためのＤＬＬＰ
なる種類がある。 b. Data link layer packet (DLLP)
The transaction layer packet (TLP) is automatically divided into data link layer packets (DLLP) as shown in FIG. 12 and transmitted to each lane when transmitted from the physical layer. A packet generated by the data link layer 154 is called a data link layer packet (DLLP), and is exchanged between the data link layers 154. Data link layer packet (DLLP)
-Ack / Nak: TLP reception confirmation, retry (retransmission)
-InitFC1 / InitFC2 / UpdateFC: Flow control initialization and update-DLLLP for power management
There are different types.

図１２に示すように、データリンクレイヤパケット（ＤＬＬＰ）の長さは６バイトで、種類を示すＤＬＬＰタイプ（１バイト）、ＤＬＬＰの種類で固有の情報（３バイト）、ＣＲＣ（２バイト）から構成される。 As shown in FIG. 12, the length of the data link layer packet (DLLP) is 6 bytes. From the DLLP type (1 byte) indicating the type, the information specific to the type of DLLP (3 bytes), and CRC (2 bytes) Composed.

Ｃ．物理層−論理サブブロック１５６
図８中に示す物理層１５５の論理サブブロック１５６での主な役割は、データリンク層１５４から受け取ったパケットを電気サブブロック１５７で送信できる形式に変換することである。また、物理層１５５を制御／管理する機能も有する。 C. Physical layer-logical sub-block 156
The main role of the physical layer 155 in the logical sub-block 156 shown in FIG. 8 is to convert the packet received from the data link layer 154 into a format that can be transmitted by the electrical sub-block 157. It also has a function of controlling / managing the physical layer 155.

ａ．データ符号化とパラレル−シリアル変換
PCI Expressは、連続した“０”や“１”が続かないように（長い期間、クロス・ポイントが存在しない状態が続かないようにするため）、データ符号化に８Ｂ／１０Ｂ変換を用いる。変換されたデータは、図１３中に示すように、シリアル変換され、ＬＳＢからレーン上に送信される。ここに、レーンが複数ある場合は（図１３はｘ４リンクの場合を例示している）、符号化の前にデータがバイト単位で各レーンに割り振られる。この場合、一見パラレル・バスのようにみえるが、レーン毎に独立した転送を行うので、パラレル・バスで問題となるスキューが大幅に緩和される。 a. Data encoding and parallel-serial conversion
PCI Express uses 8B / 10B conversion for data encoding so that consecutive “0” s and “1” s do not continue (in order not to maintain a state where there is no cross point for a long period of time). The converted data is serial-converted and transmitted from the LSB onto the lane as shown in FIG. Here, when there are a plurality of lanes (FIG. 13 illustrates the case of x4 link), data is allocated to each lane in units of bytes before encoding. In this case, it looks like a parallel bus at first glance, but since the transfer is performed independently for each lane, the skew which is a problem with the parallel bus is greatly reduced.

ｂ．電源管理とリンクステート
リンクの消費電力を低く抑えるために、図１４に示すように、Ｌ０／Ｌ０ｓ／Ｌ１／Ｌ２というリンクステートが定義されている。 b. Power Management and Link State In order to keep the power consumption of the link low, a link state of L0 / L0s / L1 / L2 is defined as shown in FIG.

Ｌ０が通常モードで、Ｌ０ｓからＬ２へと低消費電力となるが、Ｌ０への復帰にも時間がかかるようになる。図１５に示すように、ソフトウェアによる電源管理に加えて、アクティブステート電源管理を積極的に行うことにより、消費電力を極力小さくすることが可能となる。 L0 is a normal mode, and power consumption is reduced from L0s to L2, but it takes time to return to L0. As shown in FIG. 15, by actively performing active state power management in addition to software power management, it is possible to reduce power consumption as much as possible.

Ｄ．物理層−電気サブブロック１５７
物理層１５５の電気サブブロック１５７での主な役割は、論理サブブロック１５６でシリアル化されたデータをレーン上に送信することと、レーン上のデータを受信して論理サブブロック１５６に渡すことである。 D. Physical layer—Electric sub-block 157
The main role of the physical layer 155 in the electrical sub-block 157 is to transmit the data serialized in the logical sub-block 156 onto the lane, and to receive the data on the lane and pass it to the logical sub-block 156. is there.

ａ．ＡＣカップリング
リンクの送信側では、ＡＣカップリング用のコンデンサが実装される。これにより、送信側と受信側のＤＣコモンモード電圧が同一である必要がなくなる。このため、送信側と受信側で異なる設計、半導体プロセス、電源電圧を使用することが可能となる。 a. AC coupling On the transmission side of the link, a capacitor for AC coupling is mounted. This eliminates the need for the DC common mode voltage on the transmission side and the reception side to be the same. For this reason, it is possible to use different designs, semiconductor processes, and power supply voltages on the transmission side and the reception side.

ｂ．デエンファシス
PCI Expressでは、前述したように、８Ｂ／１０Ｂエンコーディングによってできるだけ連続した“０”や“１”が続かないように処理されるが、連続した“０”や“１”が続くこともある（最大５回）。この場合、送信側はデエンファシス転送を行わなければならないことが規定されている。同一極性のビットが連続する場合は、２つ目のビットからは差動電圧レベル（振幅）を３．５±０．５ｄＢ落とすことで、受信側で受け取る信号のノイズ・マージンを稼ぐ必要がある。これを、デエンファシスという。伝送路の周波数依存性減衰のため、変化するビットの場合は高周波成分が多く、減衰により受信側の波形が小さくなるが、変化しないビットの場合は高周波成分が少なく、相対的に受信側の波形が大きくなる。このため、受信側での波形を一定とするためにデエンファシスを行う。 b. De-emphasis
In PCI Express, as described above, processing is performed so that continuous “0” and “1” do not continue as much as possible by 8B / 10B encoding, but continuous “0” and “1” may continue (maximum). 5 times). In this case, it is specified that the transmission side must perform de-emphasis transfer. When bits of the same polarity are consecutive, it is necessary to increase the noise margin of the signal received on the receiving side by dropping the differential voltage level (amplitude) from the second bit by 3.5 ± 0.5 dB. . This is called de-emphasis. Due to the frequency-dependent attenuation of the transmission line, there are many high-frequency components in the case of changing bits, and the waveform on the receiving side becomes small due to attenuation. Becomes larger. For this reason, de-emphasis is performed in order to make the waveform on the receiving side constant.

［データ転送システム］
本実施の形態のデータ転送システムは、プリンタのような画像出力装置やスキャナのような画像入力装置、さらには、これらを併せ持つデジタル複写機、ＭＦＰ（Multi Function Peripheral）のような画像形成装置等の画像形成システムに適用されるものであって、前述したようなPCI Express規格に準拠する高速シリアルバスを利用するものである。 [Data transfer system]
The data transfer system according to the present embodiment includes an image output device such as a printer, an image input device such as a scanner, a digital copier having both of them, and an image forming device such as an MFP (Multi Function Peripheral). The present invention is applied to an image forming system and uses a high-speed serial bus conforming to the PCI Express standard as described above.

本実施の形態のデータ転送システムが適用される画像形成システムの一例を図１６を参照して説明する。図１６は、本実施の形態のデータ転送システムが適用される画像形成システムの各機器・デバイス等の接続の概要を示すブロック図である。当該画像形成システム１は、シリアルデータ転送にPCI Express規格のバスシステムを用いている。即ち、画像形成システム１の各部を集中的に制御するＣＰＵ１１と、ＣＰＵ１１の作業エリアとなるメモリ１２がPCI Express規格のルートコンプレックス（Root Complex）１３に接続されている。また、ルートコンプレックス１３とPCI Express規格のスイッチ（或いはスイッチ網）１４とがPCI Express規格のシリアルバス１５を介して接続されている。PCI Express規格のスイッチ（或いはスイッチ網）１４には、PCI Express規格のエンドポイント（End Point）となる各種機器・デバイスが接続されている。即ち、画像データなどを記憶するハードディスク（ＨＤＤ）ユニット２１、画像メモリユニット２２、及び、メモリユニット２３、各種の画像処理を行なう画像処理ユニット２４、外部のネットワークなどと通信を行なう高速ネットワーク２５、画像入力エンジンとしてのスキャナ２６、画像出力エンジンとしてのプロッタ２７、画像入力エンジン、画像出力エンジンを併せ持つ他のＭＦＰ２８などである。 An example of an image forming system to which the data transfer system of this embodiment is applied will be described with reference to FIG. FIG. 16 is a block diagram showing an outline of connections of devices, devices, etc. of an image forming system to which the data transfer system of the present embodiment is applied. The image forming system 1 uses a PCI Express standard bus system for serial data transfer. That is, a CPU 11 that centrally controls each unit of the image forming system 1 and a memory 12 that is a work area of the CPU 11 are connected to a root complex 13 of the PCI Express standard. A root complex 13 and a PCI Express standard switch (or switch network) 14 are connected via a PCI Express standard serial bus 15. The PCI Express standard switch (or switch network) 14 is connected to various devices and devices serving as PCI Express standard end points. That is, a hard disk (HDD) unit 21 that stores image data and the like, an image memory unit 22 and a memory unit 23, an image processing unit 24 that performs various image processing, a high-speed network 25 that communicates with an external network, etc. A scanner 26 as an input engine, a plotter 27 as an image output engine, an image input engine, another MFP 28 having an image output engine, and the like.

このような構成によれば、高速シリアルバスであるPCI Expressシステムを利用しているので、基本的にデータ転送の高速化を図れるが、それに加えて、PCI Express規格のエンドポイント（End Point）となる各種機器・デバイス２１〜２７間でのデータ転送の一層の高速化を図ることができる。即ち、各種機器・デバイス２１〜２７間のPCI Expressシステムはルートコンプレックス１３を介することなくPCI Express規格のスイッチ（或いはスイッチ網）１４を最上位とする木構造で接続されており、各種機器・デバイス２１〜２７間でのデータ転送がルートコンプレックス１３を経ることなく行われるため、高速処理が可能となる。 According to such a configuration, since the PCI Express system, which is a high-speed serial bus, is used, the data transfer speed can be basically increased. In addition, the PCI Express standard end point (End Point) and Further speeding up of data transfer between the various devices / devices 21 to 27 can be achieved. That is, the PCI Express system between the various devices / devices 21 to 27 is connected through a tree structure having the PCI Express standard switch (or switch network) 14 as the highest level without going through the root complex 13. Since data transfer between 21 to 27 is performed without passing through the route complex 13, high-speed processing is possible.

ところで、図４および図５で説明したように、各ノード（各種機器・デバイス）２１〜２７間をスイッチ（或いはスイッチ網）１４を介して接続するPCI Express規格のシリアルバス１５は、各々性能の異なるもの（例えば、バス幅を示すレーン数がｘ８，ｘ４．ｘ２等）を使用可能とされており、各転送路の性能を変更することができるようになっている。しかしながら、PCI Expressでは、スイッチ（或いはスイッチ網）１４を介した各ノード（各種機器・デバイス）２１〜２７間での平均転送レートは、経路中の最も細い転送路の性能（リンク幅・レーン数）に支配されるため、１個所のボトルネックがある場合には、その他の転送路をいくら広くしてもその効果は得られない。 4 and 5, the PCI Express standard serial bus 15 that connects the nodes (various devices / devices) 21 to 27 via the switch (or switch network) 14 has performance characteristics. Different ones (for example, the number of lanes indicating the bus width is x8, x4.x2, etc.) can be used, and the performance of each transfer path can be changed. However, in PCI Express, the average transfer rate between the nodes (various devices / devices) 21 to 27 via the switch (or switch network) 14 is the performance of the narrowest transfer path in the path (link width and number of lanes). Therefore, if there is one bottleneck, the effect cannot be obtained no matter how wide the other transfer paths are.

図１７は、不必要に大きなレーン数を用いた接続例である。ここでは、同時に動作（データ転送）する複数のデータ転送経路が存在し、かつ１つ以上の同時に動作するシリアル転送路を共有する場合において、下記に示すデータ転送１とデータ転送２とを並行して処理する場合について考察する。
データ転送１：エンドポイント１とルートコンプレックス（メモリ）間
データ転送２：エンドポイント２とルートコンプレックス（メモリ）間 FIG. 17 is a connection example using an unnecessarily large number of lanes. Here, when there are a plurality of data transfer paths that operate simultaneously (data transfer) and one or more serial transfer paths that operate simultaneously are shared, data transfer 1 and data transfer 2 shown below are performed in parallel. Consider the case of processing.
Data transfer 1: Between endpoint 1 and root complex (memory) Data transfer 2: Between endpoint 2 and root complex (memory)

データ転送１における転送経路は、下記に示すような経路となっている。

エンドポイント１
↓ リンク１（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ４）
ルートコンプレックス
The transfer path in the data transfer 1 is as shown below.

Endpoint 1
↓ Link 1 (lane number x1)
Switch 1
↓ Link 5 (lane number x4)
Root complex

一方、データ転送２における転送経路は、下記に示すような経路となっている。

エンドポイント２
↓ リンク２（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ４）
ルートコンプレックス
On the other hand, the transfer path in the data transfer 2 is as shown below.

Endpoint 2
↓ Link 2 (lane number x1)
Switch 1
↓ Link 5 (lane number x4)
Root complex

このようにデータ転送１とデータ転送２とを並行して処理するような場合においては、リンク１とリンク２はともにレーン数ｘ１であるので、リンク５の部分で同時動作したとしてもリンク５のレーン数はｘ２で充分であるにも関わらずレーン数がｘ４となっており、コスト高になっている。なお、リンク５のレーン数がｘ２より小さい場合は、転送レートが低下する。 Thus, when data transfer 1 and data transfer 2 are processed in parallel, both link 1 and link 2 have the number of lanes x1, so even if the link 5 portion operates simultaneously, the link 5 Although the number of lanes is x2, the number of lanes is x4, which increases the cost. Note that when the number of lanes of the link 5 is smaller than x2, the transfer rate decreases.

そこで、本実施の形態においては、上述したように複数の独立したデータ転送を並行して処理するエンドポイント（各機器・デバイス）を接続する場合、上流のPCI Express規格のシリアルバス１５のリンク幅と下流のPCI Express規格のシリアルバス１５のリンク幅の組み合わせをデータ転送量の最大値で決定する。 Therefore, in the present embodiment, when connecting endpoints (each device / device) that processes a plurality of independent data transfers in parallel as described above, the link width of the upstream PCI Express standard serial bus 15 is connected. And the link width of the downstream PCI Express standard serial bus 15 is determined by the maximum data transfer amount.

図１８は、最適なレーン数を用いた接続例である。データ転送１における転送経路は、下記に示すような経路となっている。

エンドポイント１
↓ リンク１（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ２）
ルートコンプレックス
FIG. 18 is a connection example using the optimum number of lanes. The transfer path in the data transfer 1 is as shown below.

Endpoint 1
↓ Link 1 (lane number x1)
Switch 1
↓ Link 5 (lane number x2)
Root complex

一方、データ転送２における転送経路は、下記に示すような経路となっている。

エンドポイント２
↓ リンク２（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ２）
ルートコンプレックス
On the other hand, the transfer path in the data transfer 2 is as shown below.

Endpoint 2
↓ Link 2 (lane number x1)
Switch 1
↓ Link 5 (lane number x2)
Root complex

このようにデータ転送１とデータ転送２とを並行して処理するような場合においては、リンク１とリンク２はともにレーン数ｘ１であるので、リンク５の部分で同時動作を考慮した場合、リンク５のレーン数はｘ２とするのが良い。すなわち、同時に動作するシリアル転送路のレーン数が、それぞれのデータ転送経路中の最も小さいレーン数を加算したレーン数である。レーン数がｘ２より小さい場合は転送レートが低下し、レーン数がｘ２より大きい場合はコスト高になる。 When data transfer 1 and data transfer 2 are processed in parallel in this way, both link 1 and link 2 have the number of lanes x1, so when considering the simultaneous operation in the link 5 portion, the link The number of 5 lanes should be x2. That is, the number of lanes of the serial transfer path operating simultaneously is the number of lanes obtained by adding the smallest number of lanes in each data transfer path. When the number of lanes is smaller than x2, the transfer rate decreases, and when the number of lanes is larger than x2, the cost becomes high.

このように本実施の形態によれば、スイッチから木構造の上位側に位置するノード（スイッチまたはルートコンプレックス）に至るデータ転送経路を形成する高速シリアルバスのリンクのレーン数を、エンドポイントからスイッチに至る各データ転送経路を送信される各データ転送量の最大値を満たすレーン数をそれぞれ加算したレーン数とすることにより、一のスイッチに対する複数のエンドポイントからのトランザクション（データ転送）が同時に発生するような場合であっても、同時動作によるボトルネックを解消することができるので、余分なレーン数を用いることなく、複数のデータ転送が重なった場合にも転送レートが低下することのないデータ転送システムを提供することができる。 As described above, according to the present embodiment, the number of lanes of the high-speed serial bus link forming the data transfer path from the switch to the node (switch or root complex) located on the upper side of the tree structure is changed from the end point to the switch. Transactions (data transfer) from multiple endpoints to one switch occur simultaneously by adding the number of lanes that satisfy the maximum value of each data transfer amount sent through each data transfer path leading to Even in such a case, the bottleneck due to the simultaneous operation can be eliminated, so the data that does not decrease the transfer rate even when multiple data transfers overlap without using the extra lane number. A transfer system can be provided.

［第２の実施の形態］
次に、本発明の第２の実施の形態を図１９に基づいて説明する。なお、前述した第１の実施の形態と同じ部分は同じ符号で示し説明も省略する。 [Second Embodiment]
Next, a second embodiment of the present invention will be described with reference to FIG. The same parts as those in the first embodiment described above are denoted by the same reference numerals, and description thereof is also omitted.

第１の実施の形態においては、複数の独立したデータ転送を並行して処理するエンドポイント（各機器・デバイス）を接続する場合について説明したが、本実施の形態においては、データ転送を同時に発生することがないエンドポイント（各機器・デバイス）が接続されている場合について説明する。 In the first embodiment, a case has been described in which endpoints (each device / device) that process a plurality of independent data transfers in parallel are connected. However, in this embodiment, data transfer occurs simultaneously. A case where endpoints (devices / devices) that are not connected are connected will be described.

データ転送を同時に発生することがないエンドポイント（各機器・デバイス）が接続されている場合、同時動作を考慮する必要がないことから、上流のPCI Express規格のシリアルバス１５のリンク幅と下流のPCI Express規格のシリアルバス１５のリンク幅とは、データ転送量の最大値で決定すれば良い。すなわち、データ転送経路上の全てのレーン数を、データ転送量の最大値で等しくすれば良い。 When endpoints (each device / device) that do not generate data transfer at the same time are connected, it is not necessary to consider the simultaneous operation. Therefore, the link width of the upstream PCI Express standard serial bus 15 and the downstream The link width of the serial bus 15 of the PCI Express standard may be determined by the maximum data transfer amount. That is, the number of all lanes on the data transfer path may be equalized with the maximum data transfer amount.

図１９は、最適なレーン数を用いた接続例である。データ転送１における転送経路は、下記に示すような経路となっている。

エンドポイント１
↓ リンク１（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ１）
ルートコンプレックス
FIG. 19 is a connection example using the optimum number of lanes. The transfer path in the data transfer 1 is as shown below.

Endpoint 1
↓ Link 1 (lane number x1)
Switch 1
↓ Link 5 (lane number x1)
Root complex

一方、データ転送２における転送経路は、下記に示すような経路となっている。

エンドポイント２
↓ リンク２（レーン数ｘ１）
スイッチ１
↓ リンク５（レーン数ｘ１）
ルートコンプレックス
On the other hand, the transfer path in the data transfer 2 is as shown below.

Endpoint 2
↓ Link 2 (lane number x1)
Switch 1
↓ Link 5 (lane number x1)
Root complex

このようにデータ転送を同時に発生することがないエンドポイント（各機器・デバイス）が接続されている場合においては、リンク１とリンク２はともにレーン数ｘ１であるので、リンク５の部分で同時動作を考慮する必要がないことから、リンク５のレーン数はｘ１とすれば良い。 When endpoints (devices / devices) that do not generate data transfer at the same time are connected in this way, both link 1 and link 2 have the number of lanes x1, so the link 5 portion operates simultaneously. Therefore, the number of lanes in the link 5 may be set to x1.

このように本実施の形態によれば、各エンドポイントからスイッチに至るデータ転送経路および当該スイッチから木構造の上位側に位置するノード（スイッチまたはルートコンプレックス）に至るデータ転送経路を形成する高速シリアルバスのリンクの全てのレーン数を、データ転送経路を送信されるデータ転送量の最大値を満たすレーン数に統一することにより、１つのデータ転送に関するデータ転送経路上のすべてのデータ転送路のレーン数を、所望の転送レートが得られるレーン数で等しくそろえることで、コスト性能を最適化することができる。 As described above, according to the present embodiment, a high-speed serial forming a data transfer path from each end point to a switch and a data transfer path from the switch to a node (switch or root complex) positioned on the upper side of the tree structure. By unifying the number of all lanes of the bus link to the number of lanes that satisfy the maximum value of the data transfer amount to be transmitted on the data transfer route, the lanes of all data transfer routes on the data transfer route related to one data transfer Cost performance can be optimized by aligning the numbers equally with the number of lanes at which the desired transfer rate is obtained.

既存PCIシステムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the existing PCI system. PCI Expressシステムの構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration example of a PCI Express system. デスクトップ／モバイルでのPCI Expressプラットホームの構成例を示すブロック図である。It is a block diagram which shows the structural example of the PCI Express platform in desktop / mobile. ｘ４の場合の物理層の構造例を示す模式図である。It is a schematic diagram which shows the structural example of the physical layer in the case of x4. デバイス間のレーン接続例を示す模式図である。It is a schematic diagram which shows the example of lane connection between devices. スイッチの論理的構造例を示すブロック図である。It is a block diagram which shows the logical structural example of a switch. 既存のPCIのアーキテクチャを示すブロック図である。It is a block diagram which shows the architecture of the existing PCI. PCI Expressのアーキテクチャを示すブロック図である。It is a block diagram which shows the architecture of PCI Express. PCI Expressの階層構造を示すブロック図である。It is a block diagram which shows the hierarchical structure of PCI Express. トランザクションレイヤパケットのフォーマット例を示す説明図である。It is explanatory drawing which shows the format example of a transaction layer packet. PCI Expressのコンフィグレーション空間を示す説明図である。It is explanatory drawing which shows the configuration space of PCI Express. 仮想チャネルの概念を説明するための模式図である。It is a schematic diagram for demonstrating the concept of a virtual channel. データリンクレイヤパケットのフォーマット例を示す説明図である。It is explanatory drawing which shows the format example of a data link layer packet. ｘ４リンクでのバイトストライピング例を示す模式図である。It is a schematic diagram which shows the byte striping example in x4 link. Ｌ０／Ｌ０ｓ／Ｌ１／Ｌ２というリンクステートの定義について説明する説明図である。It is explanatory drawing explaining the definition of the link state of L0 / L0s / L1 / L2. アクティブステート電源管理の制御例を示すタイムチャートである。It is a time chart which shows the example of control of active state power management. 本発明の第１の実施の形態のデータ転送システムが適用される画像形成システムの各機器・デバイス等の接続の概要を示すブロック図である。1 is a block diagram showing an outline of connections of devices, devices, etc. of an image forming system to which a data transfer system according to a first embodiment of the present invention is applied. 不必要に大きなレーン数を用いた接続例を示す模式図である。It is a schematic diagram which shows the example of a connection using an unnecessarily large number of lanes. 最適なレーン数を用いた接続例を示す模式図である。It is a schematic diagram which shows the example of a connection using the optimal number of lanes. 本発明の第２の実施の形態の最適なレーン数を用いた接続例を示す模式図である。It is a schematic diagram which shows the example of a connection using the optimal number of lanes of the 2nd Embodiment of this invention.

Explanation of symbols

１データ転送システム
１５高速シリアルバス
１４スイッチ
２１〜２７エンドポイント 1 Data transfer system 15 High-speed serial bus 14 Switch 21-27 Endpoint

Claims

As a data communication network with a tree structure, a point-to-point independent transmission / reception channel is established, and multiple endpoints are connected to one switch by using a high-speed serial bus that can select any number of lanes. In the data transfer system for transferring data from the respective endpoints to the switch,
The number of lanes of the link of the high-speed serial bus that forms a data transfer path from the switch to a node positioned on the upper side of the tree structure, and each data transmitted through each data transfer path from the endpoint to the switch The number of lanes that satisfy the maximum transfer amount is added to each lane.
A data transfer system characterized by that.

A tree-structured data communication network establishes a point-to-point independent communication channel and uses a high-speed serial bus that can select any number of lanes to connect multiple endpoints to a single switch. In the data transfer system in which data transfer from each endpoint to the switch is not performed simultaneously,
The number of all lanes of the link of the high-speed serial bus that forms a data transfer path from each end point to the switch and a data transfer path from the switch to a node located on the upper side of the tree structure is determined by the data transfer. Unify routes to the number of lanes that meet the maximum amount of data transferred.
A data transfer system characterized by that.

The high-speed serial bus is a PCI Express standard high-speed serial bus.
The data transfer system according to claim 1 or 2, characterized in that