WO2012103736A1 - Data processing node, system and method - Google Patents

Data processing node, system and method Download PDF

Info

Publication number
WO2012103736A1
WO2012103736A1 PCT/CN2011/076735 CN2011076735W WO2012103736A1 WO 2012103736 A1 WO2012103736 A1 WO 2012103736A1 CN 2011076735 W CN2011076735 W CN 2011076735W WO 2012103736 A1 WO2012103736 A1 WO 2012103736A1
Authority
WO
WIPO (PCT)
Prior art keywords
data processing
interconnection
interconnect
interface
cpu
Prior art date
Application number
PCT/CN2011/076735
Other languages
French (fr)
Chinese (zh)
Inventor
刘建根
卢广
郑伟
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to CN2011800009706A priority Critical patent/CN102301363A/en
Priority to PCT/CN2011/076735 priority patent/WO2012103736A1/en
Publication of WO2012103736A1 publication Critical patent/WO2012103736A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake

Definitions

  • the present invention relates to computer systems, and more particularly to a data processing node, system and method. Background technique
  • the data processing node may include a plurality of CPUs.
  • how the CPUs between different nodes access each other is a core technology, that is, the interconnection between the CPUs is The core technology of the system.
  • the data processing nodes can realize the interconnection between CPUs by means of full interconnection, and the interconnection between CPUs can also be realized by the form of interconnection modules.
  • any two nodes in the system need to be directly connected through a high-speed interconnection interface, and the control function module must be integrated inside the node.
  • the control function module has the function of a node controller (NC).
  • NC node controller
  • the nodes in the system are connected by the interconnection module, and the nodes therein include a plurality of interconnected CPUs and NCs. .
  • Embodiments of the present invention provide a data processing node, system, and method to implement flexible expansion of a computer system. To achieve the above objective, the embodiment of the present invention adopts the following technical solutions:
  • a data processing node includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through an interconnect interface to form a CPU module, and the CPU module reserves at least one interconnect interface and An interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for interfacing with external interconnects of interconnect boards of other data processing nodes.
  • a data processing system comprising at least two of the above data processing nodes, an interconnection between the at least two nodes.
  • a data processing method for a data processing system includes a plurality of data processing nodes, the data processing node includes a plurality of processor CPUs, and a plurality of CPUs are completely interconnected through an interconnection interface to form a CPU module, wherein the CPU module reserves at least one interconnect interface and an internal interconnect interface of the interconnect board, the interconnect board includes at least one external interconnect interface, and is used for interconnecting with other data processing nodes.
  • the interconnect board is a field programmable gate array FPGA interconnect board
  • the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface
  • each FPGA chip provides at least An internal interconnection interface and an external interconnection interface
  • the data processing method includes: converting a data packet into a high-speed transmission signal, and transmitting the high-speed transmission signal to other data processing nodes connected to the data processing node; The high speed transmission signal is converted into a data packet that the CPU can process, and the data packet is sent to the CPU.
  • a data processing method is applied to a data processing system, where the data processing node includes a plurality of processor CPUs, and a plurality of CPUs are completely interconnected through an interconnection interface to form a CPU module, and the CPU module is reserved at least An interconnection interface is connected to an internal interconnection interface of the interconnection board, and the interconnection board includes at least one external interconnection interface for external interconnection with other data processing node interconnection boards a connection port, the interconnection board is a node control NC chip, and the NC chip includes an internal interconnection interface corresponding to the reserved interconnection interface, and an NC chip with the other data processing node
  • the external interconnection interface that is connected to the external interconnection interface, and the data processing method includes:
  • the NC chip routes the data packets sent by the CPU to its corresponding CPU.
  • the data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection improves the scalability of the minicomputer system.
  • Figure 1 (a) is a system architecture diagram of the full interconnection mode in the prior art
  • Figure 1 (b) shows the system architecture connected in the form of interconnect modules in the prior art
  • FIG. 2 is a schematic structural diagram of a data processing node according to an embodiment of the present invention.
  • FIG. 3( a ) is a schematic diagram of a data processing node architecture provided by another embodiment of the present invention
  • FIG. 3 ( b ) is a schematic structural diagram of an FPGA chip in the data processing node provided in FIG. 3 ( a )
  • FIG. 4 ( a ) is The schematic diagram of the CPU interconnection architecture in the 2P node provided by the embodiment of the present invention
  • FIG. 4(b) is a schematic diagram of the 2P node architecture provided by the embodiment of the present invention
  • FIG. 5(a) is a schematic diagram of a CPU interconnection architecture in a 4P node according to an embodiment of the present invention
  • FIG. 5(b) is a schematic diagram of a 4P node architecture according to an embodiment of the present invention
  • FIG. 6 is a schematic structural diagram of a data processing node according to still another embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of an 8P system according to an embodiment of the present invention.
  • FIG. 8 is a flowchart of a data processing method according to an embodiment of the present invention.
  • a data processing node includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module.
  • the CPU module reserves at least one interconnect interface connected to the interconnecting interface of the interconnect board, the interconnect board includes at least one external interconnect interface, and an external interconnect interface for interconnecting boards with other data processing nodes. connection.
  • the number of CPUs included in the data processing node is set according to the total number of interconnected interfaces of the CPU and the system architecture formed by the data processing node. It is ensured that at least one interconnect interface is reserved for connection to the interconnect board after each CPU in the node is fully interconnected. Specifically, when the CPU includes four fast interconnect channel (QPI) interfaces, when setting up this type of CPU in the data processing node, up to four can be set, and each CPU inside the node is performed. The interconnection needs to occupy three QPI interfaces, and a QPI interface is reserved for connection with the interconnect board for system expansion. In addition, the number of CPUs included in the node needs to be set according to the system architecture formed by the node. If the system architecture of the node is an 8P system (a system with 8 CPUs) and consists of 4 nodes, then each node in the system can set 2 CPUs.
  • QPI fast interconnect channel
  • the data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the external interconnection interface provided by the interconnection board between the nodes realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
  • the interconnect board is a Field Programmable Gate Array (FPGA) interconnect board.
  • FPGA Field Programmable Gate Array
  • a data processing node includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module, wherein the CPU module reserves at least one interconnect interface and an internal interconnect interface of the interconnect board, and the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface, and each FPGA The chip provides at least one internal interconnect interface and one external interconnect interface.
  • the external interconnect interface is used to connect with an external interconnect interface of the interconnecting boards in other data processing nodes.
  • the FPGA chip includes: a module of a physical layer initialization process.
  • the string de-serialization module 32 is configured to: when receiving a data packet sent by the CPU, convert the data packet into a high-speed transmission signal and send the data to another data processing node, and when receiving the high-speed transmission signal sent by another data processing node, The high speed transmission signal is converted to a data packet that the CPU can process and sent to the CPU.
  • the 1 condition causes the clock recovery circuit to be in error
  • the serial deserialization module converts the data packet into a high-speed transmission signal by using 8B/10B coding, and then sends the data packet to another data processing node, and receives the data transmission node and sends it to another data processing node.
  • the high-speed transmission signal is decoded by 10B/8B and converted into a data packet that the CPU can process.
  • a CPU with four interconnect interfaces is taken as an example for detailed description.
  • the data processing node includes multiple CPUs, if the CPU only includes four interconnect interfaces, the multiple CPUs are at most Four CPUs are now described in detail with 2P nodes and 4P nodes as examples.
  • 2P node includes 2 CPUs, respectively CPUO, CPU1, as shown in FIG. 4(a), CPUO and CPU1 are interconnected by a QPI interface to form a CPU module.
  • the CPU0 and The CPU 1 is interconnected by two QPI interfaces, that is, two interconnected channels are formed between the CPUO and the CPU 1, one of which is used for interconnecting communication and the other is used for redundant backup. As shown in FIG.
  • the CPU module reserves four QPI interfaces for external interconnection, and the CPU0 and the CPU1 respectively provide two QPI interfaces.
  • the four QPI interfaces reserved by the CPU module are respectively connected to the FPGA interconnect board through a dedicated plug-in Air max, wherein the FPGA interconnect board includes FPGA chips corresponding to the reserved four QPI interfaces, respectively fpgaO , Q) gal , Q) ga2 , fpga3 , each FPGA chip provides an internal interconnect interface for its corresponding QPI interface and provides an external interconnect interface for external connections.
  • the reserved four QPI interfaces are respectively provided by CPU0 and CPU1, and at least one reserved QPI interface of CPU0 and CPU1 is used for external interconnection.
  • 4P node including 4 CPUs, respectively CPUO, CPU1, CPU2, CPU3, as shown in Figure 5 (a), CPUO, CPU1, CPU2, CPU3 are fully interconnected to form a CPU module.
  • the CPU0-CPU3 needs to adopt three QPI interfaces to implement full interconnection.
  • the CPU module reserves four QPI interfaces for external interconnection, and one QPI interface is provided by CPU0, CPU1, CPU2, and CPU3, respectively.
  • the four QPI interfaces reserved by the CPU module are respectively connected to the FPGA interconnect board through a dedicated plug-in Air max, wherein the FPGA interconnect board includes FPGA chips corresponding to the reserved four QPI interfaces, respectively Ga0 , fpgal , ga2 , ga3 , each FPGA chip provides an internal interconnect interface for its corresponding QPI interface and provides an external interconnect interface for external connections.
  • the data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the external interconnection interface provided by the interconnection board between the nodes realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
  • interconnecting board may also be an NC chip.
  • a data processing node which includes an interconnecting board as an NC chip, and the following interconnecting board is an NC chip data processing. The node performs a detailed description.
  • a data processing node includes a plurality of processor CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module, the CPU module at least one interconnect interface for connecting to an internal interconnect interface of the NC chip, the NC chip including an interconnecting interface corresponding to the reserved interconnect interface And at least one external interconnect interface for connecting to an external interconnect interface of the NC chip of the other data processing node.
  • the NC chip integrates a routing module, and the routing module routes the data packet received by the NC chip to its corresponding CPU.
  • the data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Reserve an interconnection interface and the interconnection of the interconnection board
  • the connection port is connected, and the interconnection board includes at least one external interconnection interface for connecting with an external interconnection interface of the interconnection board of other data processing nodes, so the CPU module can also be extended with the interconnection board to extend to the multi-node system. .
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the external interconnection interface provided by the interconnection board between the nodes realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
  • the data processing system provided by the embodiment of the present invention includes at least two data processing nodes provided by the foregoing embodiments, where the at least two data processing nodes implement the at least two data processing through an external interconnection interface on the interconnect board thereof. Interconnection between nodes.
  • the at least two data processing nodes implement interconnection between the at least two data processing nodes through external interconnect ports on the interconnecting board, including:
  • the at least two data processing nodes implement full interconnection between the at least two data processing nodes via an external interconnect interface on their interconnect boards.
  • interconnection board may further be provided with an interconnection interface as a backup redundancy interface.
  • a 2P node is described as a data processing node, i.e., a basic node of the data processing system, in detail.
  • the four 2P nodes are extended to an 8P data processing system through a high speed transmission line.
  • the structure diagram of the 2P node is shown in FIG. 5( a ) and FIG. 5 ( b ), and details are not described herein again.
  • the 2P node reserves four QPI interfaces externally, which are provided by CPU0 and CPU1, respectively.
  • node 1-4 When node 1-4 implements partial node direct connection, and through the direct connection of the partial node to realize the indirect interconnection of the remaining nodes, node 1-4 still has an unused QPI interface, then, the remaining QPI The interface is used to implement full interconnection between nodes. In the implementation of the full interconnection between nodes, nodes 1-4 also have unused QPI interfaces, which can be used as redundant backups.
  • NC chip since the function of the NC chip has more powerful processing functions than the function of the FPGA chip, it is relatively powerful in the above, so the data processing system is composed of 2P nodes, which can realize a higher level of system expansion. I will not go into details here.
  • the data processing system provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and at least, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the external interconnection interface provided by the interconnection board between the nodes realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
  • the embodiment of the present invention provides a data processing method for a data processing system, where the data processing system includes a plurality of data processing nodes, and the data processing node includes multiple CPUs, and multiple CPUs pass each other.
  • the connection ports are all interconnected to form a CPU module, and the CPU module reserves at least one interconnection interface to be connected to the interconnection interface of the interconnection board, the interconnection board includes at least one external interconnection interface, and the other An external interconnect interface of the interconnecting board of the data processing node, the interconnecting board is a field programmable gate array FPGA interconnect board, and the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface Each FPGA chip provides at least one internal interconnection interface and one external interconnection interface, and the data processing method includes: converting the data packet into a high speed transmission signal, and transmitting the high speed transmission signal to the Other data processing nodes to which the data processing node is connected. Step 802,
  • the high-speed transmission signal converts the high-speed transmission signal into a data packet that the CPU can process, and transmits the data packet to the CPU.
  • the method further includes:
  • the FPGA chip completes the physical layer initialization process of the interconnection interface in conjunction with the CPU connected thereto.
  • the data processing method provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module Retaining at least one interconnect interface with an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, Therefore, the CPU module can also be extended to the multi-node system in conjunction with the interconnect board.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the external interconnection interface provided by the interconnection board between the nodes realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
  • the data processing method provided by the embodiment of the present invention is applied to a data processing system, where the data processing node includes a plurality of CPUs, and multiple CPUs are completely interconnected through an interconnection interface to form a CPU module, and the CPU module is Retaining at least one interconnect interface with an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes,
  • the interconnecting board is a node control NC chip, and the NC chip includes an interconnecting interface corresponding to the reserved interconnecting interface, and an external interconnecting interface with an NC chip of the other data processing node.
  • the data connection processing method includes: the NC chip routing the data packet sent by the CPU to its corresponding CPU.
  • the data processing method provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be separately used as One node is used. In addition, since the CPU module reserves at least one interconnection interface and the interconnection interface of the interconnection board, the interconnection board includes at least one external interconnection interface for mutual interaction with other data processing nodes. The external interface of the board is connected, so the CPU module can also be extended to the multi-node system in conjunction with the interconnect board.
  • each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board.
  • the interconnection, and the connection between the nodes provided by the interconnection board to the external interconnection interface, improves the scalability of the minicomputer system.
  • the data processing node, system and method provided by the embodiments of the present invention can be applied to a computer system.

Abstract

A data processing node, system and method is disclosed by the embodiments of the present invention which relates to the computer field and enables the flexible extension of the computer system. Said data processing node includes multiple Central Processing Units (CPUs), each CPU including multiple interconnect interfaces. Said CPUs are fully connected through the interconnect interfaces to form a CPU module. At least one interconnect interface is reserved in said CPU module to connect with the inner interconnect interface of an interconnect board. The interconnect board includes at least one outer interconnect interface which is used for connecting with the outer interconnect interfaces of the interconnect boards of other data processing nodes. The invention can be applied in the computer field.

Description

数据处理节点、 系统及方法 技术领域  Data processing node, system and method
本发明涉及计算机系统, 尤其涉及一种数据处理节点、 系统及方法。 背景技术  The present invention relates to computer systems, and more particularly to a data processing node, system and method. Background technique
数据处理节点可以包括多个 CPU, 在存在多个数据处理节点且节点中包 含多个 CPU的系统中,不同节点之间的 CPU如何进行相互访问是一个核心技 术, 即 CPU之间的互连为该系统的核心技术。 目前数据处理节点之间可以采 用全互连的方式实现 CPU之间的互连,还可以通过互连模块的形式实现 CPU 之间的互连。  The data processing node may include a plurality of CPUs. In a system in which a plurality of data processing nodes exist and the nodes include multiple CPUs, how the CPUs between different nodes access each other is a core technology, that is, the interconnection between the CPUs is The core technology of the system. At present, the data processing nodes can realize the interconnection between CPUs by means of full interconnection, and the interconnection between CPUs can also be realized by the form of interconnection modules.
如图 1 ( a )所示, 在采用全互连方式实现 CPU之间的互连时, 系统中任 意两节点之间都需要通过高速互连接口直接连接, 且节点内部必须集成有控 制功能模块, 该控制功能模块具有节点控制器(Node Controller, NC ) 的功 能。 如图 1 ( b )所示, 在通过互连模块的形式实现 CPU之间的互连时, 系统 中各节点之间通过互连模块进行连接, 其节点内部包括多个相互连接的 CPU 和 NC。  As shown in Figure 1 (a), when the interconnection between CPUs is implemented by the full interconnection method, any two nodes in the system need to be directly connected through a high-speed interconnection interface, and the control function module must be integrated inside the node. The control function module has the function of a node controller (NC). As shown in Figure 1 (b), when the interconnection between the CPUs is realized by the form of the interconnection module, the nodes in the system are connected by the interconnection module, and the nodes therein include a plurality of interconnected CPUs and NCs. .
在实现本发明实施例的过程中, 发明人发现, 现有技术至少存在以下问 题:  In the process of implementing the embodiments of the present invention, the inventors have found that the prior art has at least the following problems:
在采用如图 1 ( a )所示的全互连方式实现 CPU之间的互连时, 由于需要 在节点内集成控制功能模块, 导致每个节点的成本较高, 且由于受到高速互 连接口数量的限制, 系统的可扩展性较差, 灵活性低;  When the interconnection between the CPUs is realized by the full interconnection method as shown in FIG. 1(a), the cost of each node is high due to the need to integrate the control function modules in the nodes, and the high-speed interconnection interface is adopted. The number of restrictions, the system's scalability is poor, and the flexibility is low;
在通过如图 1 ( b )所示的互连模块的形式实现 CPU之间的互连时, 虽然 可实现系统的扩展, 但是系统中节点间的访问需要经过互连模块进行选择互 连, 导致节点间的访问延迟较大, 进而影响系统的运行效率。  When the interconnection between the CPUs is realized in the form of an interconnection module as shown in FIG. 1(b), although the expansion of the system can be realized, the access between the nodes in the system needs to be selectively interconnected through the interconnection module, resulting in The access delay between nodes is large, which affects the operating efficiency of the system.
发明内容 Summary of the invention
本发明实施例提供一种数据处理节点、 系统及方法, 以实现计算机系统 的灵活扩展。 为达到上述目的, 本发明实施例采用如下技术方案: Embodiments of the present invention provide a data processing node, system, and method to implement flexible expansion of a computer system. To achieve the above objective, the embodiment of the present invention adopts the following technical solutions:
一种数据处理节点, 包括多个 CPU, 每个 CPU包括多个互连接口, 所述 CPU之间通过互连接口全互连以形成 CPU模块, 所述 CPU模块至少预留一 个互连接口与互连板的对内互连接口连接 , 所述互连板包括至少一个对外互 连接口 , 用于与其他数据处理节点的互连板的对外互连接口连接。  A data processing node includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through an interconnect interface to form a CPU module, and the CPU module reserves at least one interconnect interface and An interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for interfacing with external interconnects of interconnect boards of other data processing nodes.
一种数据处理系统, 包括至少两个上述的数据处理节点, 所述至少两个 理节点间的互连。  A data processing system comprising at least two of the above data processing nodes, an interconnection between the at least two nodes.
一种数据处理方法, 用于数据处理系统, 所述数据处理系统包括多个数 据处理节点, 所述数据处理节点包括多个处理器 CPU, 多个 CPU之间通过互 连接口全互连以形成 CPU模块 ,所述 CPU模块至少预留一个互连接口与互连 板的对内互连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其 他数据处理节点的互连板的对外互连接口连接, 所述互连板为现场可编程门 阵列 FPGA互连板, 所述 FPGA互连板包括与所述预留的互连接口对应的 FPGA芯片, 每个 FPGA芯片至少提供一个对内互连接口和一个对外互连接 口, 所述数据处理方法, 包括: 数据包转换为高速传输信号, 将所述高速传输信号发送给与所述数据处理节 点连接的其他数据处理节点; 速传输信号, 将所述高速传输信号转换为 CPU能够处理的数据包, 将所述数 据包发送给 CPU。  A data processing method for a data processing system, the data processing system includes a plurality of data processing nodes, the data processing node includes a plurality of processor CPUs, and a plurality of CPUs are completely interconnected through an interconnection interface to form a CPU module, wherein the CPU module reserves at least one interconnect interface and an internal interconnect interface of the interconnect board, the interconnect board includes at least one external interconnect interface, and is used for interconnecting with other data processing nodes. Connected to the external interconnect interface, the interconnect board is a field programmable gate array FPGA interconnect board, the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface, and each FPGA chip provides at least An internal interconnection interface and an external interconnection interface, the data processing method includes: converting a data packet into a high-speed transmission signal, and transmitting the high-speed transmission signal to other data processing nodes connected to the data processing node; The high speed transmission signal is converted into a data packet that the CPU can process, and the data packet is sent to the CPU.
一种数据处理的方法, 应用在数据处理系统中, 所述数据处理节点包括 多个处理器 CPU, 多个 CPU之间通过互连接口全互连以形成 CPU模块, 所 述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所述互连 板包括至少一个对外互连接口 , 用于与其他数据处理节点的互连板的对外互 连接口连接, 所述互连板为节点控制 NC芯片, 所述 NC芯片包括与所述预留 的互连接口——对应的对内互连接口以及与所述其他数据处理节点的 NC 芯 片的对外互连接口相连接的对外互连接口, 所述数据处理方法, 包括: A data processing method is applied to a data processing system, where the data processing node includes a plurality of processor CPUs, and a plurality of CPUs are completely interconnected through an interconnection interface to form a CPU module, and the CPU module is reserved at least An interconnection interface is connected to an internal interconnection interface of the interconnection board, and the interconnection board includes at least one external interconnection interface for external interconnection with other data processing node interconnection boards a connection port, the interconnection board is a node control NC chip, and the NC chip includes an internal interconnection interface corresponding to the reserved interconnection interface, and an NC chip with the other data processing node The external interconnection interface that is connected to the external interconnection interface, and the data processing method includes:
所述 NC芯片将 CPU发送的数据包路由至其相应的 CPU。  The NC chip routes the data packets sent by the CPU to its corresponding CPU.
本发明实施例提供的数据处理节点, 包括 CPU模块和互连板两部分, 由 于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为一 个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内互 连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处理 节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩展 至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点内 每个 CPU通过全互连实现了节点内 CPU之间的互连 , 不同节点的 CPU通过 所述互连板实现不同节点的 CPU之间的互连, 提高了小型机系统的扩展性。 附图说明  The data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection improves the scalability of the minicomputer system. DRAWINGS
图 1 ( a ) 为现有技术中全互连方式的系统架构图;  Figure 1 (a) is a system architecture diagram of the full interconnection mode in the prior art;
图 1 ( b ) 为现有技术中通过互连模块形式连接的系统架构;  Figure 1 (b) shows the system architecture connected in the form of interconnect modules in the prior art;
图 2为本发明实施例提供过的数据处理节点架构示意图;  2 is a schematic structural diagram of a data processing node according to an embodiment of the present invention;
图 3 ( a ) 为本发明又一实施例提供过的数据处理节点架构示意图; 图 3 ( b ) 为图 3 ( a )提供的数据处理节点中 FPGA芯片的结构示意图; 图 4 ( a ) 为本发明实施例提供过的 2P节点中 CPU互连架构示意图; 图 4 ( b ) 为本发明实施例提供的 2P节点架构示意图;  3( a ) is a schematic diagram of a data processing node architecture provided by another embodiment of the present invention; FIG. 3 ( b ) is a schematic structural diagram of an FPGA chip in the data processing node provided in FIG. 3 ( a ); FIG. 4 ( a ) is The schematic diagram of the CPU interconnection architecture in the 2P node provided by the embodiment of the present invention; FIG. 4(b) is a schematic diagram of the 2P node architecture provided by the embodiment of the present invention;
图 5 ( a ) 为本发明实施例提供过的 4P节点中 CPU互连架构示意图; 图 5 ( b ) 为本发明实施例提供的 4P节点架构示意图;  5(a) is a schematic diagram of a CPU interconnection architecture in a 4P node according to an embodiment of the present invention; FIG. 5(b) is a schematic diagram of a 4P node architecture according to an embodiment of the present invention;
图 6为本发明再一实施例提供过的数据处理节点架构示意图;  6 is a schematic structural diagram of a data processing node according to still another embodiment of the present invention;
图 7为本发明实施例提供的 8P系统架构示意图;  FIG. 7 is a schematic structural diagram of an 8P system according to an embodiment of the present invention;
图 8为本发明实施例提供的数据处理方法的流程图。 具体实施方式 为了实现计算机系统的灵活扩展, 本发明实施例提供一种数据处理节点。 如图 2所示, 本发明实施例提供的数据处理节点, 包括多个 CPU, 每个 CPU包括多个互连接口, 所述 CPU之间通过互连接口全互连以形成 CPU模 块, 所述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所 述互连板包括至少一个对外互连接口 , 用于与其他数据处理节点的互连板的 对外互连接口连接。 FIG. 8 is a flowchart of a data processing method according to an embodiment of the present invention. detailed description In order to achieve flexible extension of the computer system, an embodiment of the present invention provides a data processing node. As shown in FIG. 2, a data processing node according to an embodiment of the present invention includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module. The CPU module reserves at least one interconnect interface connected to the interconnecting interface of the interconnect board, the interconnect board includes at least one external interconnect interface, and an external interconnect interface for interconnecting boards with other data processing nodes. connection.
值得说明的是,该数据处理节点中包含的 CPU个数根据 CPU的互连接口 总数及由该数据处理节点构成的系统架构设置。 以使得节点中每个 CPU全互 连之后确保还预留有至少一个互连接口用于与所述互连板连接。 具体的, 如 当 CPU包含 4个快速互连通道(QPI )接口时, 那么, 在数据处理节点中设 置此类型的 CPU时,最多可设置 4个,此时节点内部的每个 CPU之间进行互 连需要占用 3个 QPI接口, 预留出一个 QPI接口用于与所述互连板连接, 以 便系统的扩展, 此外, 还需要根据该节点构成的系统架构设置节点中包含的 CPU个数, 如果该节点所在的系统架构为 8P系统(包含 8个 CPU的系统), 由 4个节点组成 , 那么, 该系统中的每个节点设置 2个 CPU即可。  It is worth noting that the number of CPUs included in the data processing node is set according to the total number of interconnected interfaces of the CPU and the system architecture formed by the data processing node. It is ensured that at least one interconnect interface is reserved for connection to the interconnect board after each CPU in the node is fully interconnected. Specifically, when the CPU includes four fast interconnect channel (QPI) interfaces, when setting up this type of CPU in the data processing node, up to four can be set, and each CPU inside the node is performed. The interconnection needs to occupy three QPI interfaces, and a QPI interface is reserved for connection with the interconnect board for system expansion. In addition, the number of CPUs included in the node needs to be set according to the system architecture formed by the node. If the system architecture of the node is an 8P system (a system with 8 CPUs) and consists of 4 nodes, then each node in the system can set 2 CPUs.
本发明实施例提供的数据处理节点, 包括 CPU模块和互连板两部分, 由 于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为一 个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内互 连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处理 节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩展 至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点内 每个 CPU通过全互连实现了节点内 CPU之间的互连 , 不同节点的 CPU通过 所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供过 的对外互连接口实现连接, 不需要通过互连模块, 减少了由于互连模块导致 的时延。  The data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the external interconnection interface provided by the interconnection board between the nodes, realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
为了便于本领域技术人员的理解, 现就本发明又一实施例提供的数据处 理节点进行详细。 在本实施例中所述互连板为现场可编程门阵列 (Filed Programmable Gate Array , FPGA ) 互连板。 In order to facilitate the understanding of those skilled in the art, the data provided by another embodiment of the present invention is now provided. The node is detailed. In this embodiment, the interconnect board is a Field Programmable Gate Array (FPGA) interconnect board.
如图 3 ( a )所示,本发明又一实施例提供的数据处理节点,包括多个 CPU, 每个 CPU包括多个互连接口 ,所述 CPU之间通过互连接口全互连以形成 CPU 模块, 所述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所述 FPGA互连板包括与所述预留的互连接口对应的 FPGA芯片 ,每个 FPGA 芯片至少提供一个对内互连接口和一个对外互连接口。 其中, 所述对外互连 接口用于与其他数据处理节点中互连板的对外互连接口连接。  As shown in FIG. 3( a ), a data processing node according to another embodiment of the present invention includes a plurality of CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module, wherein the CPU module reserves at least one interconnect interface and an internal interconnect interface of the interconnect board, and the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface, and each FPGA The chip provides at least one internal interconnect interface and one external interconnect interface. The external interconnect interface is used to connect with an external interconnect interface of the interconnecting boards in other data processing nodes.
在本实施例中, 如图 3 ( b )所示, 所述 FPGA芯片, 包括: 议的物理层初始化过程的模块。  In this embodiment, as shown in FIG. 3(b), the FPGA chip includes: a module of a physical layer initialization process.
串解串模块 32: 用于在接收到 CPU发送的数据包时, 将所述数据包转换 为高速传输信号发送至其他数据处理节点, 并在接收其他数据处理节点发送 的高速传输信号时, 将所述高速传输信号转换为 CPU能够处理的数据包发送 给 CPU。  The string de-serialization module 32 is configured to: when receiving a data packet sent by the CPU, convert the data packet into a high-speed transmission signal and send the data to another data processing node, and when receiving the high-speed transmission signal sent by another data processing node, The high speed transmission signal is converted to a data packet that the CPU can process and sent to the CPU.
进一步的 , 为了避免所述串解串模块在发送高速传输信号时出现长期连 Further, in order to avoid the long-term connection of the string de-serialization module when transmitting a high-speed transmission signal
0、 连 1状况导致时钟恢复电路出错, 所述串解串模块采用 8B/10B编码将所 述数据包转换为高速传输信号后发送给其他数据处理节点, 并在接收到其他 数据处理节点发送的高速传输信号时将所述高速传输信号采用 10B/8B解码后 转换为 CPU能够处理的数据包。 这样数据在从一个所述数据处理节点到另一 个所述数据处理节点的传输过程中, 没有对数据进行任何其他的处理, 在任 何所述数据处理节点看来, 好像是直接与另一个所述数据处理节点相连的, 数据从一个所述数据处理节点被透传到另一个所述数据处理节点。 0, the 1 condition causes the clock recovery circuit to be in error, and the serial deserialization module converts the data packet into a high-speed transmission signal by using 8B/10B coding, and then sends the data packet to another data processing node, and receives the data transmission node and sends it to another data processing node. When the signal is transmitted at a high speed, the high-speed transmission signal is decoded by 10B/8B and converted into a data packet that the CPU can process. Thus, during the transmission of data from one of the data processing nodes to another of the data processing nodes, there is no other processing of the data, as seen by any of the data processing nodes, as if directly with another The data processing nodes are connected, and data is transparently transmitted from one of the data processing nodes to another of the data processing nodes.
现以包含 4个互连接口 (QPI ) 的 CPU为实例进行详细的说明, 在所述 数据处理节点包含多个 CPU时,若 CPU仅包含 4个互连接口,那么所述多个 CPU为至多 4个 CPU, 现在以 2P节点、 4P节点为例进行详细的说明。 (一) 2P节点: 包括 2个 CPU, 分别为 CPUO, CPUl , 如图 4 ( a )所示, CPUO, CPUl之间通过 QPI接口互连形成 CPU模块, 在本实施例中, 所述 CPUO与 CPU1分别采用 2个 QPI接口互连, 即所述 CPUO与所述 CPU1之间 形成有两个互连通道, 其中一条通道用于互连通信, 另一条通道用于冗余备 份。 如图 4 ( b )所示, 所述 CPU模块预留出 4个 QPI接口用于对外互连, 由 CPUO, CPUl分别提供 2个 QPI接口。所述 CPU模块预留的 4个 QPI接口分 别通过专用插件 Air max与 FPGA互连板连接, 其中, 所述 FPGA互连板包 括与所述预留的 4个 QPI接口对应的 FPGA芯片 ,分别为 fpgaO , Q)gal , Q)ga2 , fpga3 , 每个 FPGA芯片为其对应的 QPI接口提供一个对内互连接口, 并提供 用于外部连接的对外互连接口。 A CPU with four interconnect interfaces (QPI) is taken as an example for detailed description. When the data processing node includes multiple CPUs, if the CPU only includes four interconnect interfaces, the multiple CPUs are at most Four CPUs are now described in detail with 2P nodes and 4P nodes as examples. (1) 2P node: includes 2 CPUs, respectively CPUO, CPU1, as shown in FIG. 4(a), CPUO and CPU1 are interconnected by a QPI interface to form a CPU module. In this embodiment, the CPU0 and The CPU 1 is interconnected by two QPI interfaces, that is, two interconnected channels are formed between the CPUO and the CPU 1, one of which is used for interconnecting communication and the other is used for redundant backup. As shown in FIG. 4(b), the CPU module reserves four QPI interfaces for external interconnection, and the CPU0 and the CPU1 respectively provide two QPI interfaces. The four QPI interfaces reserved by the CPU module are respectively connected to the FPGA interconnect board through a dedicated plug-in Air max, wherein the FPGA interconnect board includes FPGA chips corresponding to the reserved four QPI interfaces, respectively fpgaO , Q) gal , Q) ga2 , fpga3 , each FPGA chip provides an internal interconnect interface for its corresponding QPI interface and provides an external interconnect interface for external connections.
在本实施例中,所述预留的 4个 QPI接口由 CPU0、 CPU1分别提供, CPU0, CPUl的至少一个预留 QPI接口用于对外互连。 In this embodiment, the reserved four QPI interfaces are respectively provided by CPU0 and CPU1, and at least one reserved QPI interface of CPU0 and CPU1 is used for external interconnection.
值得说明的是, 采用上述 2P节点的数据处理系统可以实现 4P、 8P的系 统扩展。  It is worth noting that the data processing system using the above 2P nodes can realize the system expansion of 4P and 8P.
(二) 4P节点: 包括 4个 CPU, 分别为 CPUO, CPUl , CPU2, CPU3 , 如图 5 ( a )所示, CPUO, CPUl , CPU2, CPU3之间进行全互连以形成 CPU 模块,在本实施例中,所述 CPU0-CPU3均需要采用 3个 QPI接口才能实现全 互连。 如图 5 ( b )所示, 所述 CPU模块预留出 4个 QPI接口用于对外互连, 由 CPU0、 CPU1、 CPU2、 CPU3分别提供 1个 QPI接口。 所述 CPU模块预留 的 4个 QPI接口分别通过专用插件 Air max与 FPGA互连板连接, 其中, 所 述 FPGA互连板包括与所述预留的 4个 QPI接口对应的 FPGA芯片 , 分别为 ga0 , fpgal , ga2 , ga3 , 每个 FPGA芯片为其对应的 QPI接口提供一个 对内互连接口, 并提供用于外部连接的对外互连接口。  (2) 4P node: including 4 CPUs, respectively CPUO, CPU1, CPU2, CPU3, as shown in Figure 5 (a), CPUO, CPU1, CPU2, CPU3 are fully interconnected to form a CPU module. In the embodiment, the CPU0-CPU3 needs to adopt three QPI interfaces to implement full interconnection. As shown in Figure 5 (b), the CPU module reserves four QPI interfaces for external interconnection, and one QPI interface is provided by CPU0, CPU1, CPU2, and CPU3, respectively. The four QPI interfaces reserved by the CPU module are respectively connected to the FPGA interconnect board through a dedicated plug-in Air max, wherein the FPGA interconnect board includes FPGA chips corresponding to the reserved four QPI interfaces, respectively Ga0 , fpgal , ga2 , ga3 , each FPGA chip provides an internal interconnect interface for its corresponding QPI interface and provides an external interconnect interface for external connections.
在本实施例中, 由于所述 CPU互连接口个数的限制, 在每个节点内部做 麥设置 4个 CPU, 且每个 CPU的 QPI接口均用于 CPU之间的互连, 故无法 进行冗余备份。 值得说明的是, 采用上述 4P节点的数据处理系统可以实现 8P、 16P的系 统扩展。 In this embodiment, due to the limitation of the number of CPU interconnection interfaces, four CPUs are set in each node, and the QPI interfaces of each CPU are used for interconnection between CPUs, so Redundant backup. It is worth noting that the data processing system using the above 4P node can realize the system expansion of 8P and 16P.
本发明实施例提供的数据处理节点, 包括 CPU模块和互连板两部分, 由 于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为一 个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内互 连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处理 节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩展 至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点内 每个 CPU通过全互连实现了节点内 CPU之间的互连 , 不同节点的 CPU通过 所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供过 的对外互连接口实现连接, 不需要通过互连模块, 减少了由于互连模块导致 的时延。  The data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the external interconnection interface provided by the interconnection board between the nodes, realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
值得说明的是,所述互连板还可以是 NC芯片,本发明再一实施例提供一 种数据处理节点,其包含的互连板为 NC芯片, 下面就互连板为 NC芯片的数 据处理节点进行伴细的说明。  It should be noted that the interconnecting board may also be an NC chip. In another embodiment of the present invention, a data processing node is provided, which includes an interconnecting board as an NC chip, and the following interconnecting board is an NC chip data processing. The node performs a detailed description.
如图 6所示, 本发明再一实施例提供过的数据处理节点, 包括多个处理 器 CPU, 每个 CPU包括多个互连接口 , 所述 CPU之间通过互连接口全互连 以形成 CPU模块,所述 CPU模块至少预留一个互连接口用于与 NC芯片的对 内互连接口连接,所述 NC芯片包括与所述预留的互连接口——对应的对内互 连接口, 以及至少一个对外互连接口,用于与其他数据处理节点的 NC芯片的 对外互连接口相连接。  As shown in FIG. 6, a data processing node according to another embodiment of the present invention includes a plurality of processor CPUs, each CPU includes a plurality of interconnect interfaces, and the CPUs are completely interconnected through interconnect interfaces to form a CPU module, the CPU module at least one interconnect interface for connecting to an internal interconnect interface of the NC chip, the NC chip including an interconnecting interface corresponding to the reserved interconnect interface And at least one external interconnect interface for connecting to an external interconnect interface of the NC chip of the other data processing node.
在本实施例中, 所述 NC芯片集成有路由模块, 所述路由模块将所述 NC 芯片接收到的数据包路由至其相应的 CPU。  In this embodiment, the NC chip integrates a routing module, and the routing module routes the data packet received by the NC chip to its corresponding CPU.
本发明实施例提供的数据处理节点, 包括 CPU模块和互连板两部分, 由 于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为一 个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内互 连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处理 节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩展 至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点内 每个 CPU通过全互连实现了节点内 CPU之间的互连 , 不同节点的 CPU通过 所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供过 的对外互连接口实现连接, 不需要通过互连模块, 减少了由于互连模块导致 的时延。 The data processing node provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module is at least Reserve an interconnection interface and the interconnection of the interconnection board The connection port is connected, and the interconnection board includes at least one external interconnection interface for connecting with an external interconnection interface of the interconnection board of other data processing nodes, so the CPU module can also be extended with the interconnection board to extend to the multi-node system. . Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the external interconnection interface provided by the interconnection board between the nodes, realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
本发明实施例提供的数据处理系统, 包括至少两个上述实施例提供的数 据处理节点, 所述至少两个数据处理节点通过其互连板上的对外互连接口实 现所述至少两个数据处理节点间的互连。  The data processing system provided by the embodiment of the present invention includes at least two data processing nodes provided by the foregoing embodiments, where the at least two data processing nodes implement the at least two data processing through an external interconnection interface on the interconnect board thereof. Interconnection between nodes.
具体的, 所述至少两个数据处理节点之间通过其互连板上的对外互连接 口实现所述至少两个数据处理节点间的互连, 包括:  Specifically, the at least two data processing nodes implement interconnection between the at least two data processing nodes through external interconnect ports on the interconnecting board, including:
如果所述至少三个数据处理节点之间通过其互连板上的对外互连接口实 现所述至少三个数据处理节点间的部分节点直连 , 并且通过所述部分节点的 直连实现剩余节点的间接互连。 或者  Performing partial node direct connection between the at least three data processing nodes through the external interconnection interface on the interconnect board between the at least three data processing nodes, and implementing the remaining nodes by directly connecting the partial nodes Indirect interconnection. Or
所述至少两个数据处理节点之间通过其互连板上的对外互连接口实现所 述至少两个数据处理节点间的全互连。  The at least two data processing nodes implement full interconnection between the at least two data processing nodes via an external interconnect interface on their interconnect boards.
进一步的, 所述互连板上还可以设有作为备份冗余接口的互连接口。 在本实施例中, 以 2P节点作为所述数据处理系统的数据处理节点即基本 节点进行详细的说明。  Further, the interconnection board may further be provided with an interconnection interface as a backup redundancy interface. In the present embodiment, a 2P node is described as a data processing node, i.e., a basic node of the data processing system, in detail.
如图 Ί所示, 所述 4个 2P节点之间通过高速传输线扩展到 8P的数据处 理系统。 其中所述 2P节点的结构示意图如图 5 ( a )和图 5 ( b )所示, 此处 不再赘述。 所述 2P节点对外预留有 4个 QPI接口, 分别由 CPU0、 CPU1提 供。  As shown in FIG. ,, the four 2P nodes are extended to an 8P data processing system through a high speed transmission line. The structure diagram of the 2P node is shown in FIG. 5( a ) and FIG. 5 ( b ), and details are not described herein again. The 2P node reserves four QPI interfaces externally, which are provided by CPU0 and CPU1, respectively.
在节点 1-4实现部分节点直连,并且通过所述部分节点的直连实现剩余节 点的间接互连时, 节点 1-4还存在未使用的 QPI接口, 那么, 将剩余的 QPI 接口用于实现节点间全互连。在实现了节点间的全互连, 节点 1-4还存在未使 用的 QPI接口, 可以用作冗余备份。 When node 1-4 implements partial node direct connection, and through the direct connection of the partial node to realize the indirect interconnection of the remaining nodes, node 1-4 still has an unused QPI interface, then, the remaining QPI The interface is used to implement full interconnection between nodes. In the implementation of the full interconnection between nodes, nodes 1-4 also have unused QPI interfaces, which can be used as redundant backups.
值得说明的是, 由于 NC芯片的功能具有比 FPGA芯片功能更强大的处 理功能, 所以在所述相比较强大, 所以在数据处理系统由 2P节点组成是, 其 能够实现更高级别的系统扩展, 此处不一一赘述。  It is worth noting that since the function of the NC chip has more powerful processing functions than the function of the FPGA chip, it is relatively powerful in the above, so the data processing system is composed of 2P nodes, which can realize a higher level of system expansion. I will not go into details here.
本发明实施例提供的数据处理系统, 包括 CPU模块和互连板两部分, 由 于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为一 个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内互 连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处理 节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩展 至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点内 每个 CPU通过全互连实现了节点内 CPU之间的互连 , 不同节点的 CPU通过 所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供过 的对外互连接口实现连接, 不需要通过互连模块, 减少了由于互连模块导致 的时延。  The data processing system provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and at least, the CPU module is at least Retaining an interconnect interface to connect to an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The CPU module can also be extended to use with multi-node systems in conjunction with interconnect boards. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the external interconnection interface provided by the interconnection board between the nodes, realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
如图 8所示, 本发明实施例提供数据处理方法, 用于数据处理系统, 所 述数据处理系统包括多个数据处理节点, 所述数据处理节点包括多个 CPU, 多个 CPU之间通过互连接口全互连以形成 CPU模块, 所述 CPU模块至少预 留一个互连接口与互连板的对内互连接口连接, 所述互连板包括至少一个对 外互连接口, 用于与其他数据处理节点的互连板的对外互连接口连接, 所述 互连板为现场可编程门阵列 FPGA互连板, 所述 FPGA互连板包括与所述预 留的互连接口对应的 FPGA芯片 , 每个 FPGA芯片至少提供一个对内互连接 口和一个对外互连接口, 所述数据处理方法, 包括: 将所述数据包转换为高速传输信号, 将所述高速传输信号发送给与所述数据 处理节点连接的其他数据处理节点。 步骤 802, As shown in FIG. 8, the embodiment of the present invention provides a data processing method for a data processing system, where the data processing system includes a plurality of data processing nodes, and the data processing node includes multiple CPUs, and multiple CPUs pass each other. The connection ports are all interconnected to form a CPU module, and the CPU module reserves at least one interconnection interface to be connected to the interconnection interface of the interconnection board, the interconnection board includes at least one external interconnection interface, and the other An external interconnect interface of the interconnecting board of the data processing node, the interconnecting board is a field programmable gate array FPGA interconnect board, and the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface Each FPGA chip provides at least one internal interconnection interface and one external interconnection interface, and the data processing method includes: converting the data packet into a high speed transmission signal, and transmitting the high speed transmission signal to the Other data processing nodes to which the data processing node is connected. Step 802,
的高速传输信号, 将所述高速传输信号转换为 CPU能够处理的数据包, 将所 述数据包发送给 CPU。 The high-speed transmission signal converts the high-speed transmission signal into a data packet that the CPU can process, and transmits the data packet to the CPU.
进一步的, 所述方法还包括:  Further, the method further includes:
在所述数据处理系统初始化时, 所述 FPGA芯片配合与其连接的 CPU完 成所述互连接口物理层初始化过程。  When the data processing system is initialized, the FPGA chip completes the physical layer initialization process of the interconnection interface in conjunction with the CPU connected thereto.
本发明实施例提供的数据处理的方法, 包括 CPU模块和互连板两部分, 由于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为 一个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内 互连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处 理节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩 展至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点 内每个 CPU通过全互连实现了节点内 CPU之间的互连, 不同节点的 CPU通 过所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供 过的对外互连接口实现连接, 不需要通过互连模块, 减少了由于互连模块导 致的时延。  The data processing method provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be used as a single node, and, in addition, the CPU module Retaining at least one interconnect interface with an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, Therefore, the CPU module can also be extended to the multi-node system in conjunction with the interconnect board. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the external interconnection interface provided by the interconnection board between the nodes, realizes the connection, and does not need to pass through the interconnection module, thereby reducing the delay caused by the interconnection module.
本发明实施例提供过的数据处理的方法, 应用在数据处理系统中, 所述 数据处理节点包括多个 CPU,多个 CPU之间通过互连接口全互连以形成 CPU 模块, 所述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所述互连板包括至少一个对外互连接口 , 用于与其他数据处理节点的互连板 的对外互连接口连接, 所述互连板为节点控制 NC芯片, 所述 NC芯片包括与 所述预留的互连接口——对应的对内互连接口以及与所述其他数据处理节点 的 NC芯片的对外互连接口相连接的对外互连接口 ,所述数据处理方法,包括: 所述 NC芯片将 CPU发送的数据包路由至其相应的 CPU。  The data processing method provided by the embodiment of the present invention is applied to a data processing system, where the data processing node includes a plurality of CPUs, and multiple CPUs are completely interconnected through an interconnection interface to form a CPU module, and the CPU module is Retaining at least one interconnect interface with an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for connecting to an external interconnect interface of the interconnecting boards of other data processing nodes, The interconnecting board is a node control NC chip, and the NC chip includes an interconnecting interface corresponding to the reserved interconnecting interface, and an external interconnecting interface with an NC chip of the other data processing node. The data connection processing method includes: the NC chip routing the data packet sent by the CPU to its corresponding CPU.
本发明实施例提供的数据处理的方法, 包括 CPU模块和互连板两部分, 由于 CPU模块是通过 CPU之间全互连形成的, 所以 CPU模块可以单独作为 一个节点使用, 此外, 由于 CPU模块至少预留一个互连接口与互连板的对内 互连接口连接, 所述互连板包括至少一个对外互连接口, 用于与其他数据处 理节点的互连板的对外互连接口连接, 所以 CPU模块也可以与互连板配合扩 展至多节点系统使用。 不管是作为独立的节点还是扩展为多节点系统, 节点 内每个 CPU通过全互连实现了节点内 CPU之间的互连, 不同节点的 CPU通 过所述互连板实现不同节点的 CPU之间的互连, 且在节点之间由互连板提供 过的对外互连接口实现连接, 提高了小型机系统的扩展性。 The data processing method provided by the embodiment of the present invention includes a CPU module and an interconnection board. Since the CPU module is formed by full interconnection between the CPUs, the CPU module can be separately used as One node is used. In addition, since the CPU module reserves at least one interconnection interface and the interconnection interface of the interconnection board, the interconnection board includes at least one external interconnection interface for mutual interaction with other data processing nodes. The external interface of the board is connected, so the CPU module can also be extended to the multi-node system in conjunction with the interconnect board. Whether as a stand-alone node or as a multi-node system, each CPU in the node realizes interconnection between CPUs in the node through full interconnection, and CPUs of different nodes realize CPU between different nodes through the interconnection board. The interconnection, and the connection between the nodes provided by the interconnection board to the external interconnection interface, improves the scalability of the minicomputer system.
本发明实施例提供过的数据处理节点、 系统及方法可以应用在计算机系 统。  The data processing node, system and method provided by the embodiments of the present invention can be applied to a computer system.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分步骤 是可以通过程序来指令相关的硬件完成, 所述的程序可以存储于一计算机可 读存储介质中, 如 ROM/RAM、 磁碟或光盘等。 以上所述, 仅为本发明的具体实施方式, 但本发明的保护范围并不局限 于此, 任何熟悉本技术领域的技术人员在本发明揭露的技术范围内, 可轻易 想到变化或替换, 都应涵盖在本发明的保护范围之内。 因此, 本发明的保护 范围应以所述权利要求的保护范围为准。  A person skilled in the art can understand that all or part of the steps of implementing the above embodiments can be completed by a program to instruct related hardware, and the program can be stored in a computer readable storage medium, such as ROM/RAM, magnetic. Disc or CD. The above is only the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.

Claims

权利 要求 书 Claim
1、 一种数据处理节点, 包括多个处理器 CPU, 每个 CPU包括多个互连接 口, 其特征在于, 所述 CPU之间通过互连接口全互连以形成 CPU模块, 所述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所述互连板包 括至少一个对外互连接口 , 用于与其他数据处理节点的互连板的对外互连接口 连接。  A data processing node, comprising a plurality of processor CPUs, each CPU comprising a plurality of interconnect interfaces, wherein the CPUs are completely interconnected by interconnecting interfaces to form a CPU module, the CPU module At least one interconnect interface is reserved for interconnection to the interconnect interface of the interconnect board, the interconnect board including at least one external interconnect interface for interfacing with external interconnect interfaces of interconnect boards of other data processing nodes.
2、 根据权利要求 1所述的数据处理节点, 其特征在于, 该数据处理节点中 包含的 CPU个数根据 CPU的互连接口总数及由该数据处理节点构成的系统架构 设置。  2. The data processing node according to claim 1, wherein the number of CPUs included in the data processing node is set according to a total number of interconnected interfaces of the CPU and a system architecture formed by the data processing node.
3、 根据权利要求 1或 2所述的数据处理节点, 其特征在于, 所述互连板为 现场可编程门阵列 FPGA互连板, 所述 FPGA互连板包括与所述预留的互连接 口对应的 FPGA芯片 , 每个 FPGA芯片至少提供一个对内互连接口和一个对外 互连接口。  The data processing node according to claim 1 or 2, wherein the interconnection board is a field programmable gate array FPGA interconnection board, and the FPGA interconnection board includes an interconnection with the reservation The FPGA chip corresponding to the interface, each FPGA chip provides at least one internal interconnection interface and one external interconnection interface.
4、 根据权利要求 3所述的数据处理节点, 其特征在于, 所述 FPGA芯片, 包括:  The data processing node according to claim 3, wherein the FPGA chip comprises:
初始化模块: 用于配合与所述 FPGA芯片连接的 CPU完成物理层初始化过 程的模块;  An initialization module: a module for performing a physical layer initialization process with a CPU connected to the FPGA chip;
串解串模块: 用于在接收到 CPU发送的数据包时, 将所述数据包转换为高 速传输信号发送至其他数据处理节点, 并在接收其他数据处理节点发送的高速 传输信号时, 将所述高速传输信号转换为 CPU能够处理的数据包发送给 CPU。  The string de-serialization module is configured to: when receiving a data packet sent by the CPU, convert the data packet into a high-speed transmission signal and send it to another data processing node, and when receiving the high-speed transmission signal sent by another data processing node, The high-speed transmission signal is converted to a data packet that the CPU can process and sent to the CPU.
5、 根据权利要求 1或 2所述的数据处理节点, 其特征在于, 所述互连板为 节点控制器 NC芯片,所述 NC芯片包括与所述预留的互连接口——对应的对内 互连接口以及与所述其他数据处理节点的 NC 芯片的对外互连接口相连接的对 外互连接口。  The data processing node according to claim 1 or 2, wherein the interconnection board is a node controller NC chip, and the NC chip includes a corresponding pair with the reserved interconnection interface. An internal interconnect interface and an external interconnect interface coupled to the external interconnect interface of the NC chip of the other data processing node.
6、 根据权利要求 5所述的数据处理节点, 其特征在于, 所述 NC芯片集成 有路由模块, 所述路由模块将所述 NC 芯片接收到的数据包路由至其相应的 CPU。 The data processing node according to claim 5, wherein the NC chip is integrated with a routing module, and the routing module routes the data packet received by the NC chip to its corresponding CPU.
7、 一种数据处理系统, 其特征在于, 包括至少两个权利要求 1-6任一项所 述的数据处理节点, 所述至少两个数据处理节点之间通过其互连板上的对外互 连接口实现所述至少两个数据处理节点间的互连。  A data processing system, comprising: at least two data processing nodes according to any one of claims 1-6, wherein the at least two data processing nodes communicate with each other through an interconnecting board thereof The connection port implements interconnection between the at least two data processing nodes.
8、 根据权利要求 7所述的数据处理系统, 其特征在于, 所述至少两个数据 间的互连, 包括:  8. The data processing system according to claim 7, wherein the interconnection between the at least two data comprises:
如果所述至少三个数据处理节点之间通过其互连板上的对外互连接口实现 所述至少三个数据处理节点间的部分节点直连, 并且通过所述部分节点的直连 实现剩余节点的间接互连; 或者,  Performing partial node direct connection between the at least three data processing nodes through the external interconnection interface on the interconnect board between the at least three data processing nodes, and implementing the remaining nodes by directly connecting the partial nodes Indirect interconnection; or,
所述至少两个数据处理节点之间通过其互连板上的对外互连接口实现所述 至少两个数据处理节点间的全互连。  The at least two data processing nodes implement full interconnection between the at least two data processing nodes via an external interconnect interface on their interconnect boards.
9、 根据权利要求 8所述的数据处理系统, 其特征在于, 所述互连板上还设 有作为备份冗余接口的互连接口。  9. The data processing system according to claim 8, wherein the interconnection board is further provided with an interconnection interface as a backup redundancy interface.
10、 一种数据处理方法, 用于数据处理系统, 所述数据处理系统包括多个 数据处理节点,其特征在于,所述数据处理节点包括多个处理器 CPU,多个 CPU 之间通过互连接口全互连以形成 CPU模块,所述 CPU模块至少预留一个互连接 口与互连板的对内互连接口连接, 所述互连板包括至少一个对外互连接口, 用 于与其他数据处理节点的互连板的对外互连接口连接, 所述互连板为现场可编 程门阵列 FPGA互连板, 所述 FPGA互连板包括与所述预留的互连接口对应的 FPGA芯片, 所述数据处理方法, 包括: 据包转换为高速传输信号, 将所述高速传输信号发送给与所述数据处理节点连 接的其他数据处理节点; 传输信号, 将所述高速传输信号转换为 CPU能够处理的数据包, 将所述数据包 发送给 CPU。 10. A data processing method for a data processing system, the data processing system comprising a plurality of data processing nodes, wherein the data processing node comprises a plurality of processor CPUs, and the plurality of CPUs are interconnected The interfaces are fully interconnected to form a CPU module, the CPU module at least one interconnect interface being interconnected with an interconnecting interface of the interconnect board, the interconnect board including at least one external interconnect interface for use with other data The external interconnect interface of the interconnecting board of the processing node, the interconnecting board is a field programmable gate array FPGA interconnect board, and the FPGA interconnect board includes an FPGA chip corresponding to the reserved interconnect interface. The data processing method includes: converting a packet into a high-speed transmission signal, transmitting the high-speed transmission signal to other data processing nodes connected to the data processing node; transmitting a signal, converting the high-speed transmission signal into a CPU capable Processed packet, the packet Sent to the CPU.
11、 根据权利要求 10所述的方法, 其特征在于, 还包括:  The method according to claim 10, further comprising:
在所述数据处理系统初始化时, 所述 FPGA芯片配合与其连接的 CPU完成 所述对内互连接口物理层初始化过程。  When the data processing system is initialized, the FPGA chip cooperates with a CPU connected thereto to complete the physical layer initialization process of the internal interconnection interface.
12、 一种数据处理的方法, 应用在数据处理系统中, 其特征在于, 所述数 据处理节点包括多个处理器 CPU, 多个 CPU之间通过互连接口全互连以形成 CPU模块,所述 CPU模块至少预留一个互连接口与互连板的对内互连接口连接, 所述互连板包括至少一个对外互连接口 , 用于与其他数据处理节点的互连板的 对外互连接口连接,所述互连板为节点控制 NC芯片 ,所述数据处理方法,包括: 所述 NC芯片将 CPU发送的数据包路由至其相应的 CPU。  12. A data processing method, applied in a data processing system, wherein the data processing node comprises a plurality of processor CPUs, and the plurality of CPUs are completely interconnected through an interconnection interface to form a CPU module. The CPU module reserves at least one interconnect interface connected to the interconnecting interface of the interconnect board, and the interconnect board includes at least one external interconnect interface for external interconnection of interconnect boards with other data processing nodes. An interface is connected, the interconnecting board is a node control NC chip, and the data processing method includes: the NC chip routing a data packet sent by the CPU to its corresponding CPU.
PCT/CN2011/076735 2011-06-30 2011-06-30 Data processing node, system and method WO2012103736A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2011800009706A CN102301363A (en) 2011-06-30 2011-06-30 Data processsing node, system and method
PCT/CN2011/076735 WO2012103736A1 (en) 2011-06-30 2011-06-30 Data processing node, system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2011/076735 WO2012103736A1 (en) 2011-06-30 2011-06-30 Data processing node, system and method

Publications (1)

Publication Number Publication Date
WO2012103736A1 true WO2012103736A1 (en) 2012-08-09

Family

ID=45360526

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/076735 WO2012103736A1 (en) 2011-06-30 2011-06-30 Data processing node, system and method

Country Status (2)

Country Link
CN (1) CN102301363A (en)
WO (1) WO2012103736A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104503871A (en) * 2014-12-29 2015-04-08 浪潮电子信息产业股份有限公司 Implementation method based on full-redundancy model of small computer system

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102520768B (en) * 2011-12-29 2014-11-26 曙光信息产业股份有限公司 Blade server motherboard and system
CN103853686A (en) * 2012-11-30 2014-06-11 英业达科技有限公司 Server
CN103034613A (en) * 2012-12-12 2013-04-10 深圳市华力特电气股份有限公司 Data communication method between processors and FPGA (field programmable gate array) equipment
CN103491079B (en) * 2013-09-12 2017-01-18 杭州华为数字技术有限公司 Message generating device, server and method
CN104536930A (en) * 2015-01-16 2015-04-22 浪潮(北京)电子信息产业有限公司 Communication connecting method and system
WO2016155443A1 (en) * 2015-04-03 2016-10-06 深圳市共创百业科技开发有限公司 Wearable device for use in assessing orientation of attacker
CN105022715A (en) * 2015-07-08 2015-11-04 浪潮(北京)电子信息产业有限公司 Server backplane interconnection method and system
CN108337175A (en) * 2018-01-31 2018-07-27 郑州云海信息技术有限公司 A kind of multipath server and its node communication method
US10489341B1 (en) * 2018-06-25 2019-11-26 Quanta Computer Inc. Flexible interconnect port connection

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5379440A (en) * 1990-12-20 1995-01-03 Motorola Inc. Parallel processor with array of clustered processing elements having inputs seperate from outputs and outputs limited to a maximum of two per dimension
CN1946076A (en) * 2005-10-07 2007-04-11 国际商业机器公司 Data processing system, method and interconnect fabric supporting multiple planes of processing nodes
CN1987871A (en) * 2005-12-21 2007-06-27 中国科学院微电子研究所 Two dimension gid layout structure of multiple processor chip

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1172248C (en) * 2001-10-26 2004-10-20 华为技术有限公司 Method of realizing multiple CPU totally interconnected using synchronous serial port
US7577727B2 (en) * 2003-06-27 2009-08-18 Newisys, Inc. Dynamic multiple cluster system reconfiguration
US20090037932A1 (en) * 2007-08-01 2009-02-05 Clark Michael T Mechanism for broadcasting system management interrupts to other processors in a computer system
JP5550089B2 (en) * 2009-03-30 2014-07-16 エヌイーシーコンピュータテクノ株式会社 Multiprocessor system, node controller, failure recovery method
CN101588175B (en) * 2009-06-24 2011-05-04 北京理工大学 FPGA array processing board

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5379440A (en) * 1990-12-20 1995-01-03 Motorola Inc. Parallel processor with array of clustered processing elements having inputs seperate from outputs and outputs limited to a maximum of two per dimension
CN1946076A (en) * 2005-10-07 2007-04-11 国际商业机器公司 Data processing system, method and interconnect fabric supporting multiple planes of processing nodes
CN1987871A (en) * 2005-12-21 2007-06-27 中国科学院微电子研究所 Two dimension gid layout structure of multiple processor chip

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104503871A (en) * 2014-12-29 2015-04-08 浪潮电子信息产业股份有限公司 Implementation method based on full-redundancy model of small computer system

Also Published As

Publication number Publication date
CN102301363A (en) 2011-12-28

Similar Documents

Publication Publication Date Title
WO2012103736A1 (en) Data processing node, system and method
US9292460B2 (en) Versatile lane configuration using a PCIe PIE-8 interface
Bainbridge et al. Chain: a delay-insensitive chip area interconnect
CN101753388B (en) Router and interface device suitable for the extending on and among sheets of polycaryon processor
JP3992148B2 (en) Electronic circuit boards for building large and scalable processor systems
CN101710314B (en) High-speed peripheral component interconnection switching controller and realizing method thereof
US9264346B2 (en) Resilient duplicate link aggregation emulation
US20130252543A1 (en) Low-latency interface-based networking
CN109634900B (en) AXI (advanced extensible interface) protocol-based multi-level low-delay interconnection structure
KR20150099759A (en) Die-stacked device with partitioned multi-hop network
JP4509827B2 (en) Computer system using serial connect bus and method of connecting multiple CPU units by serial connect bus
CN101739241A (en) On-chip multi-core DSP cluster and application extension method
JP2007538331A (en) Integrated circuit and buffering method
JP7266698B2 (en) Signal processing chip and signal processing system
EP3899740A1 (en) Alternative protocol selection
CN106575283A (en) Configuration of a cluster server using cellular automata
WO2009137996A1 (en) Memory device cascade method, memory system and memory device
JP2007534052A (en) Integrated circuit and transaction withdrawal method
US11537543B2 (en) Technique for handling protocol conversion
WO2009067855A1 (en) Method for implementing a computer system or local area network
US7206889B2 (en) Systems and methods for enabling communications among devices in a multi-cache line size environment and disabling communications among devices of incompatible cache line sizes
US20240020455A1 (en) Software-defined wafer-level switching system design method and apparatus
US7526631B2 (en) Data processing system with backplane and processor books configurable to support both technical and commercial workloads
US7404020B2 (en) Integrated fibre channel fabric controller
US20230101918A1 (en) Interconnect system

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201180000970.6

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11857452

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11857452

Country of ref document: EP

Kind code of ref document: A1