CN107239432A

CN107239432A - A kind of server with novel topological structure

Info

Publication number: CN107239432A
Application number: CN201710670978.2A
Authority: CN
Inventors: 王晓; 王鹏辉
Original assignee: Zhengzhou Yunhai Information Technology Co Ltd
Current assignee: Zhengzhou Yunhai Information Technology Co Ltd
Priority date: 2017-08-08
Filing date: 2017-08-08
Publication date: 2017-10-10

Abstract

The invention discloses a kind of server with novel topological structure, including at least one node, node includes Node Controller and at least N number of CPU, CPU and Node Controller are respectively provided with N number of QPI ports, any two CPU is connected by respective QPI ports, and any one CPU is connected by respective QPI ports with Node Controller；Wherein, N is the positive integer more than or equal to 2.As can be seen here, a node includes at least N number of CPU in server, CPU and Node Controller are respectively provided with N number of QPI ports, that is CPU is equal with the QPI ports on Node Controller and CPU number, and each CPU is connected with Node Controller, and connected between any two CPU also by QPI ports, and then improve QPI port resources utilization rate and data transmission efficiency.

Description

A kind of server with novel topological structure

Technical field

The present invention relates to computer application field, more particularly to a kind of server with novel topological structure.

Background technology

At present, divide by CPU interconnection architectures, server can be divided into 2 classes, a class is symmetric multiprocessor structure SMP, separately One class is nonuniformity storage access structure NUMA.

SMP structure service devices refer to that multiple CPU symmetrically work in server, no primary and secondary and subordinate relation, in server Memory, which is concentrated, to be placed, and each CPU shares identical physical memory, and the delay that each CPU accesses memory is equal, but such as Fruit can not just carry out the interconnection of more multi -CPU not against Node Controller, i.e. SMP structure services device scalability is limited.

Fig. 1 is server internal structural representation in the prior art, as shown in figure 1, NUMA structure services device 10 is with more Kind of implementation, is generally divided into multiple nodes 11, each node is made up of multiple CPU 102, and with it is independent it is local in Deposit, the interconnection architectures of CPU 102 are that 2 CPU102 or 4 CPU 102 and a Node Controller 101 are mutual in a node 11 Interconnection forms more massive system between connection, Node Controller 101.It can pass through Node Controller between each node 11 101 are attached, and its major function is to interconnect at a high speed between the node for realizing multi-CPU system.The characteristics of framework, visits for local cpu Ask that the speed of other CPU in node is significantly larger than the speed that local cpu accesses CPU in the other nodes in distal end.

Either limited bandwidth problem, can all occur in SMP structure services device or NUMA architecture server, and bandwidth refers to The data quantity that regular time can transmit, i.e., can transmit the ability of data in transmission pipeline.Fig. 2 is a section in Fig. 1 Point internal structure schematic diagram, is all that 2 CPU 102 share a Node Controller 101,1 node control as depicted in figs. 1 and 2 Under 2 CPU102 of the correspondence of device 101 processed, the identical quantity sizes of CPU 102 of same system, the required quantity of Node Controller 101 It is more, and QPI port resources can not be made full use of.And under Fig. 2 frameworks, on the inner opposite angle line of node 11 between CPU 102 , it is necessary to be forwarded by CPU forwardings or Node Controller during exchanging visit, cause delay, limit fractional bandwidth.

As can be seen here, how to overcome in server node caused by CPU exchanges visits that QPI port resource utilization rates are low and data The problem of efficiency of transmission is low is those skilled in the art's urgent problem to be solved.

The content of the invention

The embodiment of the present application provides a kind of server with novel topological structure, to solve QPI ends in the prior art Mouthful resource utilization is low and the problem of low data transmission efficiency.

In order to solve the above technical problems, the invention provides a kind of server with novel topological structure, including at least One node, the node is respectively provided with N number of including Node Controller and at least N number of CPU, the CPU and the Node Controller QPI ports, CPU described in any two is connected by the respective QPI ports, and any one of CPU is by respective The QPI ports be connected with the Node Controller；Wherein, N is the positive integer more than or equal to 2.

Preferably, the N is equal to 3.

Preferably, the server is that nonuniformity stores access structure NUMA servers.

Preferably, when the number of nodes is more than 1, connected between the Node Controller by network interface, so as to CPU between multiple nodes carries out data transmission.

Preferably, the network interface is NI EBIs.

Preferably, the Node Controller is standard controller NC.

Preferably, the CPU is Intel X86CPU.

Compared to prior art, the server provided by the present invention with novel topological structure, including at least one section Point, node includes Node Controller and at least N number of CPU, CPU and Node Controller is respectively provided with N number of QPI ports, any two CPU Connected by respective QPI ports, and any one CPU is connected by respective QPI ports with Node Controller；Wherein, N For the positive integer more than or equal to 2.As can be seen here, on the one hand, a node includes at least N number of CPU, CPU and section in server Base site controller is respectively provided with N number of QPI ports, i.e. CPU is equal with the QPI ports on Node Controller and CPU number, and each CPU is connected with Node Controller, caused service when overcoming QPI port resources waste in the prior art and data transfer Device delay issue, and then improve QPI port resources utilization rate and data transmission efficiency.On the other hand, any two CPU it Between also by QPI ports connect, further increase QPI port resources utilization rate and data transmission efficiency.

Brief description of the drawings

Fig. 1 is server internal structural representation in the prior art；

Fig. 2 is an intra-node structural representation in Fig. 1；

An intra-node structural representation in the server that Fig. 3 is provided by the embodiment of the present invention.

Embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out clear, complete Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.It is based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made Embodiment, belongs to the scope of protection of the invention.

The core of the present invention is to provide a kind of server with novel topological structure, can improve QPI port resources profit With rate and data transmission efficiency.

In order that those skilled in the art more fully understand the present invention program, with reference to the accompanying drawings and detailed description The present invention is described in further detail.

An intra-node structural representation in the server that Fig. 3 is provided by the embodiment of the present invention, as shown in figure 3, this Inventing a kind of server 10 with novel topological structure provided includes at least one node 11, and node 11 includes node control Device 101 processed and at least N number of CPU, CPU and Node Controller 101 are respectively provided with N number of QPI ports, and any two CPU passes through respective QPI ports are connected, and any one CPU is connected by respective QPI ports with Node Controller 101；Wherein, N be more than Or the positive integer equal to 2.

Topology refers to entity is abstracted into " point " unrelated with its size, shape, and the circuit of connection entity is abstracted into " line ", and then in graph form come the method for representing relation between these points and line, its object is to study between these point, lines Annexation.The figure of relation is referred to as topology diagram between expression Points And lines.Novel topological structure server is exactly to service Annexation in device 10 between the internal module of each node 11.The inside of server 10 includes at least one node 11, that is, takes Business device 10 inside be possible to multiple nodes 11, it is also possible in only one of which node 11, Fig. 3 be a node 11 in it is interior Portion's structure chart, includes Node Controller 101 and at least N number of CPU, and CPU and Node Controller 101 again in each node 11 Be respectively provided with N number of QPI ports, that is to say, that CPU is equal with the number of the QPI ports on Node Controller and CPU number, i.e., one Having in individual node 11 will be respectively to that should have several QPI ports on several CPU, Node Controller 101 and CPU.If for example, one There are 5 CPU in individual node 11, CPU QPI ports will be to that there should be 5, and the QPI ports of Node Controller 101 also can be to that should have 5.Any two CPU is connected by respective QPI ports, and any one CPU passes through respective QPI ports and node control Device 101 processed is connected, i.e. be all connection between all CPU in node 11, between all CPU and Node Controller 101 It is connection.This annexation in correspondence and node 11 on initial quantity, will solve QPI port resources waste with And the problem of server delays.Previously mentioned N is the positive integer more than or equal to 2.

In order that those skilled in the art is better understood from an intra-node annexation in the embodiment of the present application, with N is equal to exemplified by 3, and the present invention is described in further detail.

Table 1 is the connection relational table of CPU and other node cpus between CPU and in the node in the corresponding nodes of Fig. 3, As shown in table 1：

N refers to CPU number, while also referring to the QPI port numbers on CPU and Node Controller 101, when N is 3, i.e., There are 3 CPU in one node 11, CPU0 301, CPU1302 and CPU2 303, and CPU0 301, CPU1 are denoted as respectively 302nd, there are 3 QPI ports respectively on CPU2 303 and Node Controller 101, QPI port orders are denoted as P0, P1, P2 respectively.On 3 CPU are stated while be connected with a Node Controller 101, between 3 CPU and the company between CPU and Node Controller 101 The relation of connecing can refer to Fig. 3 and table 1, and this connected mode can realize the exchanging visit between 3 CPU in the node 11, can also be real The now access in the node between CPU and Node Controller 101, while the exchanging visit of CPU between different nodes can be realized.

Table 1

Annexation is simple, clear and easy between the shared Node Controller of 3 CPU, module in one node Realize.It is understood that N can be 3, can also be 4,5,6 etc., N be 3 be that one kind is preferably carried out mode, be one Application scenarios are planted, does not represent and there was only a kind of this mode.CPU0, CPU1, CPU2 and QPI port order P0, P1, P2 are It is artificial defined, is not changeless, can be changed according to personal habits.CPU naming method, QPI port are suitable Sequence can't influence the realization of the embodiment of the present application.

Strictly speaking, node 11 is really non-existent, is intended merely to study a question what is conveniently artificially delimited, and delimit Mode also without strict requirements, can arbitrarily be changed according to actual conditions.It is understood that CPU number and node Number can't influence the realization of the embodiment of the present application.

Server provided by the present invention with novel topological structure a, on the one hand, node is included extremely in server Few N number of CPU, CPU and Node Controller are respectively provided with N number of QPI ports, i.e. CPU and the QPI ports on Node Controller and CPU's Number is equal, and each CPU is connected with Node Controller, overcomes QPI port resources in the prior art and wastes and number According to caused server delays problem during transmission, and then improve QPI port resources utilization rate and data transmission efficiency.It is another Connected between aspect, any two CPU also by QPI ports, further increase QPI port resources utilization rate and data are passed Defeated efficiency.

On the basis of above-described embodiment, preferably embodiment, server is that (nonuniformity storage is accessed NUMA Structure) server.

NUMA architecture server uses distributed memory pattern, and each CPU accesses the speed of other CPU in local node Higher than the speed for accessing the CPU in some distant-end nodes.In other words, that is, access the time that CPU in different nodes is spent It is unequal, the characteristics of NUMA had both had the application programming pattern of simplicity and had been easily managed, there is expandability again, The scale of system can effectively be expanded.When NUMA architecture server, especially inside have more than 8 CPU interconnections, generally compare Other servers are more economical and performance is higher.NUMA major advantage is retractility, and NUMA architecture server is compared to other IA frame serverPC improves to some extent on scalability, and server scalability refers to when CPU number changes in server, service The phenomenon that the disposal ability of device changes therewith, all internal storage access of general IA frame serverPC are all delivered to identical shared drive Bus, this mode applies in general to the relatively small number of situation of CPU quantity, but is not suitable for a fairly large number of situations of CPU, because CPU can vie each other the access to shared drive bus, NUMA by limit the CPU quantity on any rambus and according to Interconnect to connect each node at a high speed by Node Controller, so as to avoid above mentioned problem.NUMA architecture server can be Integrated multiple CPU in one server, make system have higher transaction capabilities.It is understood that server can be with For NUMA architecture server, or qualified other servers, the type of server has no effect on the application implementation The realization of example.

On the basis of above-described embodiment, preferably embodiment, when the quantity of node 11 is more than 1, node control Connected between device 101 by network interface, so that the CPU 102 between multiple nodes 11 carries out data transmission.

There are in the server 10 multiple nodes 11, and local node CPU 102 is when needing to access distant-end node CPU, It is accomplished by first transferring data to local node control by the progress of Node Controller 101 data forwarding, i.e. local node CPU 102 The data received are given distant-end node controller by device 101 processed, local node controller 101 by network interface transfers again, most Distant-end node CPU is transferred data to by distant-end node controller afterwards, finally realizes that local node CPU 102 accesses distant-end node CPU.The process can realize the data transfer of CPU between different nodes.In the embodiment of the present application, network interface can be NI EBI, NI bus interface datas efficiency of transmission is high, it is of course also possible to be other qualified network interfaces, network connects The type of mouth has no effect on the realization of the embodiment of the present application.

On the basis of above-described embodiment, preferably embodiment, Node Controller 101 is that (standard is controlled NC Device).

Node Controller 101 plays an important role in the data transmission procedure of each node, and Node Controller is by each factory Family realizes its specification and concrete function, and on each node, its major function is that the overall situation for realizing multicomputer system is delayed for distribution Deposit and interconnected at a high speed between uniformity and node, realize the quick transmission access of data between CPU, the type of Node Controller and inside Implementation has no effect on the realization of the embodiment of the present application.

On the basis of above-described embodiment, preferably embodiment, CPU is IntelX86CPU.

X86 is the standard number abbreviation of an Intel universal computer series, also identifies a set of general computer instruction Set, X and CPU does not have any relation, and it is that a simple asterisk wildcard to all 86 systems is defined, and x86 instruction set is beautiful Intel Company of state is what its first piece of 16 bit CPU was specially developed.IntelX86CPU manufacturing process is advanced, low in energy consumption, stably Property strong, cost is low, monokaryon performance and power consumption control preferably, in a word, the clothes with topological structure that the embodiment of the present invention is provided Business device, IntelX86CPU has higher performance, preferably faster speed, compatibility.Certainly, CPU type can't shadow Ring the realization of the embodiment of the present application.

The server provided by the present invention with novel topological structure is described in detail above.Use herein Several examples are set forth to the principle and embodiment of the present invention, and the explanation of above example is only intended to help and understood The method and its core concept of the present invention；Simultaneously for those of ordinary skill in the art, according to the thought of the present invention, in tool It will change in body embodiment and application, in summary, this specification content should not be construed as to the present invention Limitation, those skilled in the art, on the premise of no creative work, to the present invention made modification, equivalent substitution, Improve etc., it should be included in the application.

It should also be noted that, in this manual, such as CPU0 and CPU1 or the like relational terms be used merely to by One operation makes a distinction with another operation, and not necessarily requires or imply these entities or exist between operating any This actual relation or order.

Claims

1. a kind of server with novel topological structure, it is characterised in that including at least one node, the node includes section Base site controller and at least N number of CPU, the CPU and the Node Controller are respectively provided with N number of QPI ports, CPU described in any two Connected by the respective QPI ports, and any one of CPU passes through the respective QPI ports and the node Controller is connected；

Wherein, N is the positive integer more than or equal to 2.

2. server according to claim 1, it is characterised in that the N is equal to 3.

3. server according to claim 1, it is characterised in that the server is that nonuniformity stores access structure NUMA servers.

4. server according to claim 1, it is characterised in that when the number of nodes is more than 1, the node control Connected between device by network interface, so that the CPU between multiple nodes carries out data transmission.

5. server according to claim 4, it is characterised in that the network interface is NI EBIs.

6. server according to claim 1, it is characterised in that the Node Controller is standard controller NC.

7. the server according to claim 1-6 any one, it is characterised in that the CPU is Intel X86CPU.