WO2016095613A1 - Multi-path server and signal processing method thereof - Google Patents

Multi-path server and signal processing method thereof Download PDF

Info

Publication number
WO2016095613A1
WO2016095613A1 PCT/CN2015/093586 CN2015093586W WO2016095613A1 WO 2016095613 A1 WO2016095613 A1 WO 2016095613A1 CN 2015093586 W CN2015093586 W CN 2015093586W WO 2016095613 A1 WO2016095613 A1 WO 2016095613A1
Authority
WO
WIPO (PCT)
Prior art keywords
pbi
signal
board
server
power
Prior art date
Application number
PCT/CN2015/093586
Other languages
French (fr)
Chinese (zh)
Inventor
程龙飞
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2016095613A1 publication Critical patent/WO2016095613A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/163Interprocessor communication

Definitions

  • the present invention relates to the field of server technologies, and in particular, to a multi-path server and a signal processing method thereof.
  • Multi-path server refers to the number of physical CPUs in the server.
  • the number of physical CPUs in the server determines the number of "multiple" servers in the server. For example, if the server contains 16 physical CPUs, it is a 16-way server. The more the number of physical CPUs in a multi-way server, the higher the ability to process data. Therefore, the 16-channel and 32-way servers become the development trend of multi-way servers.
  • the 8-way server is used to implement the key signal processing method of the 16-channel and above servers.
  • the 8-way server uses different switches to switch different basic boards (each base board has multiple CPUs).
  • the number of CPUs on the board is the unit of the smallest hard partition. In series, the number of switches needs to be connected.
  • the key signals are transmitted, the corresponding switch switch linkage needs to be controlled to achieve the implementation process.
  • the implementation process is extremely complicated and the reliability is poor. After multi-stage switching switches with internal resistance, the signal quality becomes very poor.
  • a multi-path server and a signal processing method thereof are provided to solve the problem that the scheme in the prior art is complicated and the signal quality of the poor reliability set is poor.
  • the present invention provides a multi-way server, including: multiple CPUs, and multiple south bridge management monitors Controller input and output PBI board, and key signal processing board, wherein one PBI board monitors at least two CPUs;
  • the PBI board is connected to a CPU that is monitored by itself;
  • each PBI board is connected to an input end of the key signal processing board, and an input end of each PBI board is connected to an output end of the key signal processing board, and the key signal processing board receives a key signal sent by the PBI board, And returning a response signal corresponding to the key signal to the PBI board;
  • the PBI board that receives the response signal sends the response signal to the CPU that it monitors.
  • the key signal processing board receives a key signal sent by the PBI board, and returns a corresponding signal to the PBI board Response signals, including:
  • the key signal processing board When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-path server, and simultaneously sends the PBI to all the PBIs. The board transmits a second type of power-on signal as a response signal corresponding to the first-type power-on signal, so that all CPUs in the multi-channel server receive the time difference of the second-type power-on signal Preset time requirement;
  • the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers. After the power-on signal is sent to the power supply, the fourth type power-on signal corresponding to the third-type power-on signal is sent to all the PBI boards in the sub-server, so that the sub-server is in the sub-server.
  • the time difference that all CPUs receive the power-on signal of the fourth type of power source meets a preset time requirement;
  • the first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  • the critical signal processing board receives a key signal transmitted by the PBI board and returns a corresponding signal to the PBI board Response signals, including:
  • the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time.
  • a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
  • the fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
  • the key signal processing board receives a key signal sent by the PBI board and returns a corresponding signal to the PBI board Response signals, including:
  • the key signal processing board receives the first system error signal sent by any one of the PBI boards, and sends the system error signal PBI to the multi-path server respectively.
  • the other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
  • the key signal processing board receives the third system error signal sent by any one of the PBI boards, determining and transmitting the third system according to the obtained sub-server information
  • the PBI boards of the error signals belong to other PBI boards of the same sub-server; respectively, to each of the PBI boards except the PBI board that transmits the third system error signal, as the third system error
  • the fourth system error signal of the response signal corresponding to the signal is divided into at least two sub-servers.
  • the key signal processing board receives a key signal sent by the PBI board, and returns to the PBI board
  • the response signal corresponding to the key signal further includes:
  • the key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
  • the present invention provides a signal processing method for a multi-way server, which is applied to a multi-path server, which includes a plurality of CPUs, a plurality of south bridge substrate management monitor input and output PBI boards, and key signals.
  • a processing board wherein one PBI board monitors at least two CPUs; the method includes:
  • the key signal processing board receives a key signal sent by the PBI board
  • the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, so that the PBI board that receives the response signal sends the response signal The CPU that is monitored by the PBI board.
  • the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
  • the key signal processing board When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-way server, and simultaneously sends the power signals to all the PBI boards. a second type of power-on signal as a response signal corresponding to the power-on signal of the first type of power source, so that all CPUs in the multi-way server receive the time difference of the second-type power-on signal to meet the preset time requirement;
  • the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers.
  • a fourth type power-on signal as a response signal corresponding to the power-on signal of the third-type power source is simultaneously sent to all PBI boards in the sub-server, so that the sub-power All CPUs in the server Receiving the time difference of the fourth type power supply power-on signal to meet a preset time requirement;
  • the first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  • the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
  • the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time.
  • a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
  • the fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
  • the key signal processing board returns the key signal to the PBI board according to the received critical signal
  • Corresponding response signals include:
  • the key signal processing board receives the first system error signal sent by any one of the PBI boards, and simultaneously sends the system error signal PBI to the multi-path server.
  • the other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
  • the key signal processing board acquires sub-server information of the multiplex server, and when receiving a third system error signal sent by any one of the PBI boards, according to the sub-server
  • the server information determines that each PBI board belonging to the same sub-server as the PBI board that transmits the third system error signal; and each PBI other than the PBI board that transmits the third system error signal to the sub-server
  • the board transmits a fourth system error signal as a response signal corresponding to the third system error signal.
  • the method further includes:
  • the key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
  • the multiplex server provided by the embodiment of the present invention includes: multiple CPUs, multiple PBI (PCH BMC IO, South Bridge Management Monitor Input and Output) boards, and a key signal processing board, wherein the BMC is Baseboard Management Controller, Baseboard Management Monitor.
  • Each PBI board monitors at least two CPUs, and each PBI board is connected to the CPU it monitors.
  • the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards, and the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards. It can be seen that each CPU is connected to a corresponding PBI board, and the PBI board is directly connected to the key signal processing board.
  • the multi-way server receives the key signals sent by the respective PBI boards through the key signal processing boards, and then simultaneously sends corresponding response signals to the corresponding PBI boards.
  • Each PBI board is directly connected to the key signal processing board, and does not require a switch to be connected in series, reducing the number of switchers, and having a simple structure and high reliability. Moreover, the number of switching switches through which the signal passes is reduced, and the signal quality is improved.
  • FIG. 1 is a schematic structural diagram of a multi-path server according to an embodiment of the present invention.
  • FIG. 2 is a schematic structural diagram of a PBI board according to an embodiment of the present invention.
  • FIG. 3 is a flowchart of a signal processing method of a multi-path server according to an embodiment of the present invention.
  • FIG. 1 a schematic structural diagram of a multi-path server according to an embodiment of the present invention is shown.
  • This embodiment uses a 32-way server as an example for description.
  • the multi-path server may be a 16-way, 8-way, 4-way server, and the number of PBI boards may be flexibly determined according to actual needs.
  • the multi-way server includes: 32 CPUs, at least 8 PBI boards, and 1 key signal processing board.
  • PBI1 monitors CPU1 ⁇ CPU4
  • PBI2 monitors CPU5 ⁇ CPU8
  • PBI3 monitors CPU9 ⁇ CPU12
  • PBI4 monitors CPU13 ⁇ CPU16
  • PBI8 monitors CPU29 ⁇ CPU32.
  • Each of the PBI boards is connected to a plurality of CPUs that are monitored by itself, CPUs 1 to 4 are connected to PBI 1, CPUs 5 to 8 are connected to PBI 2, and so on, and CPUs 29 to 32 are connected to PBI 8.
  • the PBI board connects to some of the CPUs that it monitors through its own I/O interface, and another part of the CPU that it monitors is connected to the PBI board through the backplane.
  • one PBI board can monitor 2 or 8 CPUs. If one PBI board monitors two CPUs, the 32-way server needs to be configured with 16 PBI boards; if one PBI board monitors eight CPUs, the 32-way servers need to be configured with four PBI boards.
  • Each PBI board includes an input end and an output end.
  • the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards 100, and the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards 100.
  • the PBI board After receiving the key signal sent by the CPU or monitoring the state of the CPU and generating the corresponding key signal, the PBI board directly sends the key signal to the key signal processing board 100, and the key signal processing board 100 sends the corresponding signal to the corresponding PBI board.
  • the response signal corresponding to the key signal.
  • the PBI board that receives the response signal sends the response signal to the corresponding CPU, thereby ensuring that each CPU receives the response signal to meet the requirement.
  • the key signals include, but are not limited to, a Powergood signal, a Caterr signal, a PMSYNC signal, a TSC (Time Stamp Counte) signal, an S4 signal, a Thermtrip signal, a Powerbutton signal, an MSMI signal, and the like.
  • the response signals corresponding to different key signals are also different, and the functions of different response signals are also different.
  • the critical signals between the CPUs in the multi-way server satisfy the requirements of the multi-way server by responding to the signal.
  • the response signal of the Powergood signal is still the Powergood signal, and the CPU starts after receiving the Powergood signal.
  • Multi-channel servers have very strict timing requirements for Powergood signals. For example, when multiple servers are powered on, the time difference of Powergood signals received by all CPUs must be monitored within 10 ns, which can maximize the system performance of 32-way servers. The time at which the Powergood signal is transmitted to each CPU is recorded as the initial time T0.
  • the PBI board detects that the CPU monitored by itself is powered on, and sends the first Powergood signal to the critical signal processing board (ie, the first type of power-on signal).
  • the critical signal processing board ie, the first type of power-on signal
  • the second Powergood signal ie, the second type power-on signal
  • the PBI board sends it to the CPU of its own monitoring.
  • the second Powergood signal is uniformly sent to each PBI board by the key signal processing board, and then sent by each PBI board to the CPU of its own monitoring, thereby ensuring that each CPU in the multi-path server receives the second Powergood signal.
  • the time difference is within a preset time (for example, 10 ns).
  • the time that the new online CPU receives the Powergood signal must be the preset time.
  • the preset time is T0 plus the preset period, where different The CPUs produced by the manufacturer, or different types of CPUs produced by the same manufacturer, may have different preset periods.
  • the preset period set by the Ivybridge processor can be N*864BCLKs; the preset period set by the Haswell processor can be N*384BCLKs. Where N is a positive integer and 1BCLK is 10 ns.
  • the Caterr signal indicates that the system has experienced a serious error.
  • the PMSYNC (Power Management Sync) signal is an energy saving control signal.
  • the S4 signal is a status signal of the multi-way server system.
  • the signal is sent by the South Bridge (PCH).
  • PCH South Bridge
  • CPLD Complex Programmable Logic Device
  • Thermtrip signal is a signal that is sent when the CPU overheats.
  • the Powerbutton signal is a signal generated when the power-on button on the server is triggered. This signal is used to notify the PCH that there is a button triggering action. It needs to be powered on or powered off.
  • the MSMI signal is exactly the same as the Caterr signal and is a signal added by the latest Intel platform.
  • the multi-path server provided by the embodiment of the present invention includes a plurality of CPUs, a plurality of PBI boards, and a key signal processing board.
  • Each PBI board monitors at least two CPUs, and each PBI board is connected to the CPU it monitors.
  • the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards, and the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards.
  • the key signal processing board After receiving the key signal sent by the PBI board, the key signal processing board returns a response signal corresponding to the key signal to the corresponding PBI board.
  • the PBI board that receives the response signal sends the response signal to the CPU that is monitored by itself, so that the response signal received by the CPU satisfies the requirements of the multi-path server.
  • the multi-channel server receives the key signals sent by the PBI boards through the key signal processing boards, and then sends corresponding response signals to the corresponding PBI boards, so that each CPU receives the response signal to meet the requirements of the multi-path server.
  • each PBI board is directly connected to the key signal processing board, and does not require a switch to be connected in series, reducing the number of switchers, and having a simple structure and high reliability. Moreover, the number of switching switches through which the signal passes is reduced, and the signal quality is improved.
  • the PBI boards are not connected in series through the conversion switch, and any plurality of PBI combinations can be realized to form one hard partition, that is, the hard partition is flexibly configured.
  • the CPU corresponding to PBI1 and PBI3 is configured as a hard partition, that is, configured as an 8-way server;
  • PBI2 and PBI5 are configured as one hard partition, that is, configured as an 8-way server.
  • the multi-path server because the PBI boards are connected in series through the switch, therefore, the adjacent PBI can only be configured as a hard partition when configuring the hard partition, and the hard partition cannot be flexibly configured.
  • FIG. 2 is a schematic diagram showing the internal structure of a PBI board according to an embodiment of the present invention, as shown in FIG. 2, each PBI The boards each include: a changeover switch 200, a level shifting chip 210, a first CPLD 220, and a south bridge chip 230.
  • the number of the changeover switches 200 is the same as the number of CPUs monitored by the PBI board. In this embodiment, four switchers are disposed in each PBI board.
  • the I/O interface of each CPU is connected to the first I/O interface of the level conversion chip 210 through the changeover switch 200, and the second I/O interface of the level conversion chip 210 is connected to the first I/O interface of the first CPLD 220.
  • the first input of the first CPLD 220 is connected to the output of the key signal processing board 100 as an input of the PBI board.
  • the first output of the first CPLD 220 is connected to the input of the critical signal processing board 100 as an output of the PBI board.
  • the second output of the first CPLD 220 is connected to the south bridge chip 230.
  • the signal output by the CPU is supplied to the level conversion chip 210 to be converted into a high level signal that the CPLD can process, and is converted to a high level signal to be more suitable for long distance transmission.
  • the level conversion chip 210 transmits the converted signal to the first CPLD 220.
  • the first CPLD 220 transmits the received signal to the key signal processing board 100.
  • the key signal processing board 100 returns a response signal corresponding to the received key signal to the first CPLD 220 in the corresponding PBI. Then, the corresponding changeover switch 200 is controlled by the first CPLD 220 to be closed, and the signal sent by the first CPLD 220 to the key signal processing board 100 is sent to the corresponding CPU.
  • SMM System Management Module
  • Each PBI board and each CPU it monitors are powered on separately; after each PBI board detects that the power of each CPU monitored by itself is powered on, the first Powergood signal is generated and sent to the key signal processing board, indicating that the PBI board is displayed. The CPU it monitors has been powered on.
  • each PBI board can generate a first Powergood signal that is generated when power to all of the CPUs that it controls is detected to be powered up.
  • each PBI board does not need to be powered on synchronously, and the key signal processing board avoids the unstable state caused by the power process of each PBI board by delay or debounce processing, so that the reliability of the system is higher.
  • the delay means that the key signal processing board does not process the key signal after receiving the key signal, and the key signal such as the delay preset time period is completely stable before processing.
  • the preset duration can be set according to the power-on stabilization time. Of course, it can also be set according to actual needs.
  • the key signal processing board After receiving the first Powergood signal sent by all the PBI boards, the key signal processing board sends the second Powergood signal to all PBI boards at the same time, and records the time at which the second Powergood signal is transmitted as the initial time T0.
  • the timing of receiving the second Powergood signal is consistent, thereby satisfying the timing requirement of the system for each CPU to receive the second Powergood signal.
  • BIOS Basic Input Output System
  • online online
  • offline offline
  • each two CPUs are divided into one group.
  • Each PBI board generates a first Powergood signal for each group of CPUs and sends them to the key signal processing boards.
  • the key signal processing board needs to return a second Powergood signal to the PBI board for each group of CPUs, ensuring that the number of second Powergood signals is the same as the number of first Powergood signals.
  • each PBI board monitors 4 CPUs, and two CPUs are a group, and each PBI board generates two first Powergood signals.
  • the critical signal processing board needs to return two second Powergood signals to each PBI board simultaneously.
  • the first CPLD in the PBI board sends the second Powergood signal sent by the key signal processing board to each CPU monitored by itself.
  • the key signal processing board can simultaneously send the second Powergood signal to each PBI board, and then the second Powergood signal is sent to the corresponding CPU by each PBI board, thereby ensuring that each CPU receives the second Powergood signal.
  • the timing requirements are met (ie, the time difference between each CPU receiving the second Powergood signal is within 10 ns).
  • the process of powering up the sub-server is similar to the above process.
  • one sub-server is a hard partition, and each sub-server can work alone.
  • a 32-way server can be divided into four 8-way sub-servers, or can be divided into one 16-way sub-server and two 8-way sub-servers.
  • the power-on process of the sub-server is as follows:
  • the SMM board notifies the critical signal processing board of the hard partition information of the multi-way server.
  • the hard partition information may include information that the multi-way server contains several hard partitions, and which CPUs each hard partition contains.
  • Each hard partitioned PBI board and its monitored CPUs are respectively powered on; the PBI board detects that each of the CPUs monitored by the CPU is powered on, and generates a first Powergood signal (ie, a third type of power-on signal) And sent to the key signal processing board, indicating that the PBI board and its monitored CPU have been powered on.
  • a first Powergood signal ie, a third type of power-on signal
  • the critical signal processing board determines the hard partition where the PBI board is located.
  • Determining, according to hard partition information from the SMM board, information of a hard partition where the PBI board that sends the first Powergood signal is located for example, determining which hard partition the PBI board that sends the first Powergood signal is located, and the hard partition includes Which PBI boards and other information.
  • the key signal processing board After receiving the first Powergood signal sent by all PBI boards in the hard partition, the key signal processing board sends a second Powergood signal to each PBI board in the hard partition at the same time (ie, the fourth type power supply) The power-on signal is recorded, and the time at which the second Powergood signal is transmitted is recorded as the initial time T0.
  • the first CPLD in the PBI board sends the second Powergood signal sent by the key signal processing board to each CPU monitored by itself.
  • each CPU in the hard partition receives the second Powergood signal satisfies the timing requirement. Moreover, each hard partition is separately powered and separately controlled, and there is no coupling relationship between them.
  • the PBI board After the PBI board detects that the CPU of the new online is powered on, it generates a first Powergood signal (ie, the fifth type of power-on signal) and sends the first Powergood signal to the key signal processing board.
  • a first Powergood signal ie, the fifth type of power-on signal
  • the key signal processing board After receiving the first Powergood signal sent by the PBI board, the key signal processing board sends a second Powergood signal (ie, a sixth type power supply power-on signal) to the PBI board at a preset time.
  • a second Powergood signal ie, a sixth type power supply power-on signal
  • the preset time is the initial time T0 plus a preset period.
  • the CPUs produced by different manufacturers, or different types of CPUs produced by the same manufacturer may have different preset periods.
  • the preset period of the Ivybridge processor is N*864BCLKs, corresponding to the corresponding Ivybridge processor.
  • the preset time is T0+N*864BCLKs;
  • the preset around the Haswell processor is N*384BCLKs, and the corresponding preset time of the corresponding Haswell processor is T0+N*384BCLKs, where N is a positive integer.
  • the following describes the working process of the multi-path server provided by the embodiment of the present invention by taking a system error signal (ie, a Caterr signal) as an example.
  • a system error signal ie, a Caterr signal
  • the PBI board When the PBI board receives the first Caterr signal sent by the CPU (ie, the first system error signal), the PBI board transmits the first Caterr signal to the key signal processing board.
  • the key signal processing board After receiving the first Caterr signal, the key signal processing board determines whether to respond to the first Caterr signal according to the flag bit corresponding to the Caterr signal. For example, when the flag corresponding to the Caterr signal is "0", the key signal processing board no longer responds to the Caterr signal sent by other CPUs; when the flag corresponding to the Caterr signal is "1", the key signal processing board can Respond to Caterr signals from other CPUs.
  • the key signal processing board When the key signal processing board receives the first Caterr signal, it changes the flag corresponding to the Caterr signal to a binary number "0". If the first Caterr signal changes from low to high, it indicates that the fault is eliminated. At this time, the flag corresponding to the Caterr signal is modified to a binary number "1".
  • the key signal processing board responds to the Caterr signal sent by the CPU. After the key signal processing board receives the Caterr signal sent by the CPU, it forwards the Caterr signal to other CPUs that are in the same hard partition as the CPU, that is, the same as the CPU. The Caterr signal lines of other CPUs in the partition are pulled low by the high level to shield the Caterr signals sent by other CPUs. If the CPU that issued the Caterr signal can fix the error by itself, the Caterr signal will disappear and the system will return to normal. If the CPU cannot repair itself, the system will hang.
  • Each PBI board sends the received second Caterr signal to the corresponding CPU.
  • each sub-server is a hard partition.
  • the transmission process of the Caterr signal is as follows:
  • the SMM board notifies the hard partition information of the multi-path server of the key signal processing board.
  • the PBI board When the PBI board receives the third Caterr signal (ie, the third system error signal) sent by the CPU, the PBI board transmits the third Caterr signal to the key signal processing board.
  • the third Caterr signal ie, the third system error signal
  • the key signal processing board After receiving the third Caterr signal, the key signal processing board determines a hard partition corresponding to the PBI board according to the hard partition information.
  • the key signal processing board determines the broadcast range of the response signal corresponding to the third Caterr signal, in other words, determines which PBI boards to send the response signal corresponding to the third Caterr signal.
  • the key signal processing board transmits a fourth Caterr signal (ie, a fourth system error signal) to the other PBI boards in the hard partition where the PBI board is located.
  • a fourth Caterr signal ie, a fourth system error signal
  • the key signal processing board can be sent to the corresponding CPU through the PBI, because the PBI boards are not connected in series through the switch, but directly deal with the key signals.
  • the board is connected so that the control process is simple and the broadcast time of the Caterr signal is greatly reduced.
  • the number of switches through which the signal passes is reduced, and the signal quality can be improved.
  • the present invention also provides an embodiment of a signal synchronization method for a multi-way server.
  • the multi-way server includes a plurality of CPUs, a plurality of PBI boards, and a key signal processing board, wherein one PBI board monitors at least two CPUs; each PBI board is connected to a plurality of CPUs monitored by itself; each PBI board includes an input At the end and the output end, the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards, and the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards.
  • the method may include the following steps:
  • the key signal processing board receives the key signal sent by the PBI board.
  • the key signal is sent by the CPU monitored by the PBI board to the PBI board, or is generated by the PBI board detecting the corresponding state of the CPU, for example, when the PBI board detects the power of each CPU that it monitors.
  • the first Powergood signal is generated; for example, when the CPU fails, the first Caterr signal is generated, and the first Caterr signal is sent to the PBI board, and sent by the PBI board to the key signal processing board.
  • the key signals include, but are not limited to, Powergood signals, Caterr signals, PMSYNC signals, TSC signals. No., S4 signal, Thermtrip signal, Powerbutton signal, MSMI signal.
  • the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, so that the PBI board that receives the response signal sends the response signal to the PBI. Board monitors the CPU.
  • the multi-way server can include 32 CPUs, 8 PBI boards, and a key signal processing board.
  • Each PBI board monitors a plurality of CPUs; an output end of each PBI board is connected to an input end of the key signal processing board, an input end of each PBI board is connected to an output end of the key signal processing board; and a key signal processing board receives a PBI
  • the key signal sent by the board generates a response signal corresponding to the key signal and returns the response signal to the corresponding PBI board.
  • the PBI board that receives the response signal distributes the response signal to the corresponding CPU.
  • each PBI board is directly connected to the key signal processing board, does not need to switch the switch to be connected in series, reduces the number of switching switches, has a simple structure and high reliability, and reduces the switching quality of the signal passing through the switching switch.
  • step S120 may include the following two situations:
  • the multi-way server is powered on as a server.
  • the first type of power-on signal is generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  • the multi-way server is divided into at least two sub-servers, and the process of powering any one of the sub-servers is as follows:
  • the key signal processing board After receiving the third type power-on signal sent by all the PBI boards in the sub-server, the key signal processing board simultaneously sends the third-type power-on signal to the PBI board in the sub-server as the third The fourth type of power-on signal corresponding to the response signal of the power-type power-on signal, so that all the CPUs in the sub-server receive the time difference of the power-on signal of the fourth-type power source to meet the preset time requirement.
  • the third type power-on power-on signal is generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  • the steps are S120 can include the following process:
  • the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and sends the new signal to the new one at a preset time.
  • the PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth type power-on signal, so that the new CPU receives the sixth-type power-on signal The time meets the preset time requirement.
  • the fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
  • step S120 may include the following two cases:
  • the key signal processing board After receiving the first system error signal sent by any one of the PBI boards, the key signal processing board simultaneously sends a second system error signal to each of the PBI boards except the system error signal PBI.
  • the second system error signal is the response signal corresponding to the first system error signal
  • the multi-way server is divided into at least two sub-servers.
  • the key signal processing board acquires the sub-server information of the multi-path server, and when receiving the third system error signal sent by any one of the PBI boards, determining, according to the sub-server information, the PBI board that sends the third system error signal Other PBI boards belonging to the same sub-server; simultaneously transmitting a fourth system error signal to each of the PBI boards except the PBI board transmitting the third system error signal; the fourth system error signal Is the response signal corresponding to the third system error signal.
  • the method further includes:
  • the key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
  • whether the state of the system error signal is valid can be obtained by detecting a flag bit corresponding to the system error signal.
  • the flag bit corresponding to the system error signal is a binary number “0”, indicating a system error. The signal is valid; if the flag corresponding to the system error signal is binary number "1", it indicates that the system error is invalid.
  • the flag bit is "1”, it indicates that the system error signal is valid; when the flag bit is "0", it indicates that the system error signal is invalid.
  • the present invention can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better.
  • Implementation Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium, including a plurality of instructions for causing a A computer device (which may be a personal computer, server, or network device, etc.) performs all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes various media that can store program codes, such as a read only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.
  • the invention may be described in the general context of computer-executable instructions executed by a computer, such as a program module.
  • program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types.
  • the invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are connected through a communication network.
  • program modules can be located in both local and remote computer storage media including storage devices.

Abstract

Disclosed are a multi-path server and a signal processing method thereof. The multi-path server comprises a plurality of CPUs, a plurality of PBI boards and a key signal processing board. Each PBI board monitors at least two CPUs. The output end of each PBI board is connected with the input end of the key signal processing board respectively; the input end of each PBI board is connected with the output end of the key signal processing board respectively. The multi-path server receives key signals transmitted by the various PBI boards by means of the key signal processing board; and then transmits response signals corresponding to the key signals to the corresponding PBI boards synchronously; thus, the response signals received by the CPUs meet the demand of the multi-path server. From the above contents, it can be known that each PBI board is directly connected with the key signal processing board, serial connection by switches is not required, the amount of the switches is reduced, the structure is simple, and reliability is high. Furthermore, the amount of the switches passed by the signals is reduced, and signal quality is improved.

Description

多路服务器及其信号处理方法Multi-path server and signal processing method thereof
本申请要求于2014年12月16日提交中国专利局、申请号为201410781480.X、发明名称为“多路服务器及其信号处理方法”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to Chinese Patent Application No. 201410781480.X, filed on Dec. 16, 2014, entitled "Multiple Servers and Signal Processing Methods", the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本发明涉及服务器技术领域,特别是涉及一种多路服务器及其信号处理方法。The present invention relates to the field of server technologies, and in particular, to a multi-path server and a signal processing method thereof.
背景技术Background technique
多路服务器是指服务器内物理CPU的数量有多个,服务器中物理CPU的数量决定服务器中“多路”的数量,例如,服务器中包含16个物理CPU,则为16路服务器。多路服务器中物理CPU的数量越多,其处理数据的能力越高,因此,16路和32路的服务器成为多路服务器的发展趋势。Multi-path server refers to the number of physical CPUs in the server. The number of physical CPUs in the server determines the number of "multiple" servers in the server. For example, if the server contains 16 physical CPUs, it is a 16-way server. The more the number of physical CPUs in a multi-way server, the higher the ability to process data. Therefore, the 16-channel and 32-way servers become the development trend of multi-way servers.
若实现32路服务器,必须实现32路内信号全互联,且对时序要求极为严格。例如,在多路服务器开机时,所有的CPU收到表示电源正常的Powergood信号的时间差必须监控在10ns以内,并记录该时刻为T0;在实现CPU热插拔、online或offline时,新上线的CPU(Intel Ivybridge处理器)收到Powergood的信号时间必须为T0+864bclks(若采用Intel Haswell处理器,则为T0+384bclks)。而Intel公司只给出了8路及以下的服务器中关键信号(例如,Caterr、PMSYNC、Powergood、TSC等信号)的处理方式,对于8路以上的服务器没有给出任何推荐方案。If a 32-way server is implemented, 32 internal signals must be fully interconnected, and the timing requirements are extremely strict. For example, when the multi-way server is powered on, the time difference between all the CPUs receiving the Powergood signal indicating that the power is normal must be monitored within 10 ns, and the time is recorded as T0; when the CPU is hot plugged, online or offline, the new line is online. The time that the CPU (Intel Ivybridge processor) receives Powergood must be T0+864bclks (or T0+384bclks if Intel Haswell processor is used). Intel only gives the key signals (such as Caterr, PMSYNC, Powergood, TSC, etc.) in the 8-way and below servers, and does not give any recommendation for more than 8 servers.
假设采用Intel给出的8路服务器对关键信号的处理方式实现16路及以上的服务器,由于8路服务器是通过切换开关将不同基本单板(每个基本单板上设置有多个CPU,所述单板上的CPU数量是最小硬分区的单位)串联在一起,需要切换开关数量多,当传递关键信号时,需要控制相应的切换开关联动才能实现,实现过程极其复杂、可靠性差,且经过多级带有内阻的切换开关后,信号质量变得很差。Assume that the 8-way server is used to implement the key signal processing method of the 16-channel and above servers. The 8-way server uses different switches to switch different basic boards (each base board has multiple CPUs). The number of CPUs on the board is the unit of the smallest hard partition. In series, the number of switches needs to be connected. When the key signals are transmitted, the corresponding switch switch linkage needs to be controlled to achieve the implementation process. The implementation process is extremely complicated and the reliability is poor. After multi-stage switching switches with internal resistance, the signal quality becomes very poor.
发明内容Summary of the invention
本发明实施例中提供了一种多路服务器及其信号处理方法,以解决现有技术中的方案复杂、可靠性差集信号质量差的问题。In the embodiment of the present invention, a multi-path server and a signal processing method thereof are provided to solve the problem that the scheme in the prior art is complicated and the signal quality of the poor reliability set is poor.
为了解决上述技术问题,本发明实施例公开了如下技术方案:In order to solve the above technical problem, the embodiment of the present invention discloses the following technical solutions:
第一方面,本发明提供一种多路服务器,包括:多个CPU、多个南桥基板管理监 控器输入输出PBI板,以及关键信号处理板,其中,一个PBI板监控至少两个CPU;In a first aspect, the present invention provides a multi-way server, including: multiple CPUs, and multiple south bridge management monitors Controller input and output PBI board, and key signal processing board, wherein one PBI board monitors at least two CPUs;
所述PBI板连接自身监控的CPU;The PBI board is connected to a CPU that is monitored by itself;
各个PBI板的输出端连接所述关键信号处理板的输入端,各个PBI板的输入端连接所述关键信号处理板的输出端,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号;An output end of each PBI board is connected to an input end of the key signal processing board, and an input end of each PBI board is connected to an output end of the key signal processing board, and the key signal processing board receives a key signal sent by the PBI board, And returning a response signal corresponding to the key signal to the PBI board;
接收到所述回应信号的PBI板将所述回应信号分别发送给自身监控的CPU。The PBI board that receives the response signal sends the response signal to the CPU that it monitors.
结合第一方面,在第一方面的第一种可能的实现方式中,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括:In conjunction with the first aspect, in a first possible implementation manner of the first aspect, the key signal processing board receives a key signal sent by the PBI board, and returns a corresponding signal to the PBI board Response signals, including:
当所述多路服务器作为一个服务器进行上电时,所述关键信号处理板接收到所述多路服务器内的全部PBI板分别发送的第一类电源上电信号后,同时向所述全部PBI板发送作为与所述第一类电源上电信号相对应的回应信号的第二类电源上电信号,以使所述多路服务器内的全部CPU接收到第二类电源上电信号的时间差满足预设时间要求;When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-path server, and simultaneously sends the PBI to all the PBIs. The board transmits a second type of power-on signal as a response signal corresponding to the first-type power-on signal, so that all CPUs in the multi-channel server receive the time difference of the second-type power-on signal Preset time requirement;
若所述多路服务器内的全部CPU划分成至少两个子服务器,当任意一个子服务器上电时,所述关键信号处理板接收到所述子服务器内的全部PBI板分别发送的所述第三类电源上电信号后,同时向所述子服务器内的全部PBI板分别发送作为与所述第三类电源上电信号相对应的第四类电源上电信号,以使所述子服务器内的全部CPU接收到所述第四类电源上电信号的时间差满足预设时间要求;If all the CPUs in the multi-path server are divided into at least two sub-servers, when any one of the sub-servers is powered on, the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers. After the power-on signal is sent to the power supply, the fourth type power-on signal corresponding to the third-type power-on signal is sent to all the PBI boards in the sub-server, so that the sub-server is in the sub-server. The time difference that all CPUs receive the power-on signal of the fourth type of power source meets a preset time requirement;
其中,所述第一类电源上电信号和所述第三类电源上电信号均由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
结合第一方面,在第一方面的第二种可能的实现方式中,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括:In conjunction with the first aspect, in a second possible implementation of the first aspect, the critical signal processing board receives a key signal transmitted by the PBI board and returns a corresponding signal to the PBI board Response signals, including:
当所述多路服务器中插入新的CPU时,所述关键信号处理板接收到所述新的CPU对应的PBI板发送的第五类电源上电信号后,在预设时刻向所述新的CPU对应的PBI板发送作为与所述第五类电源上电信号相对应的回应信号的第六类电源上电信号,以使所述新的CPU接收到所述第六类电源上电信号的时间满足预设时间要求;When a new CPU is inserted in the multi-path server, the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time. a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
所述第五类电源上电信号由所述PBI板检测到所述新的CPU上电完成时产生。The fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
结合第一方面,在第一方面的第三种可能的实现方式中,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括: In conjunction with the first aspect, in a third possible implementation of the first aspect, the key signal processing board receives a key signal sent by the PBI board and returns a corresponding signal to the PBI board Response signals, including:
若所述多路服务器作为一个服务器使用,所述关键信号处理板接收到任意一个PBI板发送的第一系统错误信号后,分别向所述多路服务器中除发送所述系统错误信号PBI之外的其它各个PBI板发送作为与所述第一系统错误信号对应的回应信号的第二系统错误信号;If the multi-path server is used as a server, the key signal processing board receives the first system error signal sent by any one of the PBI boards, and sends the system error signal PBI to the multi-path server respectively. The other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
若所述多路服务器划分成至少两个子服务器,所述关键信号处理板接收到任意一个PBI板发送的第三系统错误信号时,根据获得的所述子服务器信息确定与发送所述第三系统错误信号的PBI板属于同一个子服务器的其它各个PBI板;分别向所述子服务器中除发送所述第三系统错误信号的PBI板之外的其它各个PBI板发送作为与所述第三系统错误信号对应的回应信号的第四系统错误信号。If the multi-way server is divided into at least two sub-servers, and the key signal processing board receives the third system error signal sent by any one of the PBI boards, determining and transmitting the third system according to the obtained sub-server information The PBI boards of the error signals belong to other PBI boards of the same sub-server; respectively, to each of the PBI boards except the PBI board that transmits the third system error signal, as the third system error The fourth system error signal of the response signal corresponding to the signal.
结合第一方面的第三种可能的实现方式,在第一方面的第四种可能的实现方式,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,还包括:In conjunction with the third possible implementation of the first aspect, in a fourth possible implementation manner of the first aspect, the key signal processing board receives a key signal sent by the PBI board, and returns to the PBI board The response signal corresponding to the key signal further includes:
当检测到当前PBI板发送的第一系统错误信号或第三系统错误由有效状态变为无效状态时,所述关键信号处理板接收下一个第一系统错误信号或第三系统错误信号。The key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
第二方面,本发明提供一种多路服务器的信号处理方法,应用于多路服务器中,所述多路服务器包括多个CPU、多个南桥基板管理监控器输入输出PBI板,以及关键信号处理板,其中,一个PBI板监控至少两个CPU;所述方法包括:In a second aspect, the present invention provides a signal processing method for a multi-way server, which is applied to a multi-path server, which includes a plurality of CPUs, a plurality of south bridge substrate management monitor input and output PBI boards, and key signals. a processing board, wherein one PBI board monitors at least two CPUs; the method includes:
所述关键信号处理板接收所述PBI板发送的关键信号;The key signal processing board receives a key signal sent by the PBI board;
所述关键信号处理板根据接收到的所述关键信号向所述PBI板返回与所述关键信号相对应的回应信号,以使接收到所述回应信号的所述PBI板将所述回应信号发送给所述PBI板监控的CPU。The key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, so that the PBI board that receives the response signal sends the response signal The CPU that is monitored by the PBI board.
结合第二方面,在第二方面的第一种可能的实现方式中,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,包括:With reference to the second aspect, in a first possible implementation manner of the second aspect, the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
当所述多路服务器作为一个服务器进行上电时,所述关键信号处理板接收到所述多路服务器内的全部PBI板分别发送的第一类电源上电信号后,同时向全部PBI板发送作为与所述第一类电源上电信号相对应的回应信号的第二类电源上电信号,以使所述多路服务器内的全部CPU接收到第二类电源上电信号的时间差满足预设时间要求;When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-way server, and simultaneously sends the power signals to all the PBI boards. a second type of power-on signal as a response signal corresponding to the power-on signal of the first type of power source, so that all CPUs in the multi-way server receive the time difference of the second-type power-on signal to meet the preset time requirement;
若所述多路服务器内的全部CPU划分成至少两个子服务器,当任意一个子服务器上电时,所述关键信号处理板接收到所述子服务器内的全部PBI板分别发送的所述第三类电源上电信号后,同时向所述子服务器内的全部PBI板分别发送作为与所述第三类电源上电信号相对应的回应信号的第四类电源上电信号,以使所述子服务器内的全部CPU 接收到所述第四类电源上电信号的时间差满足预设时间要求;If all the CPUs in the multi-path server are divided into at least two sub-servers, when any one of the sub-servers is powered on, the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers. After the power-on signal of the power-like type, a fourth type power-on signal as a response signal corresponding to the power-on signal of the third-type power source is simultaneously sent to all PBI boards in the sub-server, so that the sub-power All CPUs in the server Receiving the time difference of the fourth type power supply power-on signal to meet a preset time requirement;
其中,所述第一类电源上电信号和所述第三类电源上电信号均由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
结合第二方面,在第二方面的第二种可能的实现方式中,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,包括:With reference to the second aspect, in a second possible implementation manner of the second aspect, the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
当所述多路服务器中插入新的CPU时,所述关键信号处理板接收到所述新的CPU对应的PBI板发送的第五类电源上电信号后,在预设时刻向所述新的CPU对应的PBI板发送作为与所述第五类电源上电信号相对应的回应信号的第六类电源上电信号,以使所述新的CPU接收到所述第六类电源上电信号的时间满足预设时间要求;When a new CPU is inserted in the multi-path server, the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time. a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
所述第五类电源上电信号由所述PBI板检测到所述新的CPU上电完成时产生。The fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
结合第二方面的第二种可能的实现方式,在第二方面的第三种可能的实现方式中,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号包括:In conjunction with the second possible implementation of the second aspect, in a third possible implementation of the second aspect, the key signal processing board returns the key signal to the PBI board according to the received critical signal Corresponding response signals include:
若所述多路服务器作为一个服务器使用,所述关键信号处理板接收到任意一个PBI板发送的第一系统错误信号后,同时向所述多路服务器中除发送所述系统错误信号PBI之外的其它各个PBI板发送作为与所述第一系统错误信号对应的回应信号的第二系统错误信号;If the multi-path server is used as a server, the key signal processing board receives the first system error signal sent by any one of the PBI boards, and simultaneously sends the system error signal PBI to the multi-path server. The other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
若所述多路服务器划分成至少两个子服务器,所述关键信号处理板获取所述多路服务器的子服务器信息,当接收到任意一个PBI板发送的第三系统错误信号时,根据所述子服务器信息确定与发送所述第三系统错误信号的PBI板属于同一个子服务器的其它各个PBI板;同时向所述子服务器中除发送所述第三系统错误信号的PBI板之外的其它各个PBI板发送作为与所述第三系统错误信号对应的回应信号的第四系统错误信号。If the multiplex server is divided into at least two sub-servers, the key signal processing board acquires sub-server information of the multiplex server, and when receiving a third system error signal sent by any one of the PBI boards, according to the sub-server The server information determines that each PBI board belonging to the same sub-server as the PBI board that transmits the third system error signal; and each PBI other than the PBI board that transmits the third system error signal to the sub-server The board transmits a fourth system error signal as a response signal corresponding to the third system error signal.
结合第二方面的第三种可能的实现方式,在第二方面的第四种可能的实现方式中,所述方法还包括:In conjunction with the third possible implementation of the second aspect, in a fourth possible implementation of the second aspect, the method further includes:
当检测到当前PBI板发送的第一系统错误信号或第三系统错误由有效状态变为无效状态时,所述关键信号处理板接收下一个第一系统错误信号或第三系统错误信号。The key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
由以上技术方案可见,本发明实施例提供的多路服务器包括:多个CPU、多个PBI(PCH BMC IO,南桥基板管理监控器输入输出)板和一个关键信号处理板,其中,BMC为Baseboard Management Controller,基板管理监控器。每个PBI板监控至少两个CPU,且每个PBI板与自身所监控的CPU连接。各个PBI板的输出端分别连接所述关键信号处理板的输入端,各个PBI板的输入端分别连接所述关键信号处理板的输出端。 由此可知,各个CPU分别与相应的PBI板连接,PBI板直接与关键信号处理板连接。所述多路服务器通过关键信号处理板接收各个PBI板发出的关键信号,然后,同时向相应的PBI板发送对应的回应信号。各个PBI板直接与关键信号处理板连接,不需要切换开关进行串联,减少切换开关的数量,结构简单、可靠性高。而且减少了信号所经过的切换开关的数量,提高了信号质量。The multiplex server provided by the embodiment of the present invention includes: multiple CPUs, multiple PBI (PCH BMC IO, South Bridge Management Monitor Input and Output) boards, and a key signal processing board, wherein the BMC is Baseboard Management Controller, Baseboard Management Monitor. Each PBI board monitors at least two CPUs, and each PBI board is connected to the CPU it monitors. The output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards, and the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards. It can be seen that each CPU is connected to a corresponding PBI board, and the PBI board is directly connected to the key signal processing board. The multi-way server receives the key signals sent by the respective PBI boards through the key signal processing boards, and then simultaneously sends corresponding response signals to the corresponding PBI boards. Each PBI board is directly connected to the key signal processing board, and does not require a switch to be connected in series, reducing the number of switchers, and having a simple structure and high reliability. Moreover, the number of switching switches through which the signal passes is reduced, and the signal quality is improved.
附图说明DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,对于本领域普通技术人员而言,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below, and it will be apparent to those skilled in the art that In other words, other drawings can be obtained based on these drawings without paying for creative labor.
图1为本发明实施例一种多路服务器的结构示意图;1 is a schematic structural diagram of a multi-path server according to an embodiment of the present invention;
图2为本发明实施例一种PBI板的结构示意图;2 is a schematic structural diagram of a PBI board according to an embodiment of the present invention;
图3为本发明实施例一种多路服务器的信号处理方法的流程图。FIG. 3 is a flowchart of a signal processing method of a multi-path server according to an embodiment of the present invention.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本发明中的技术方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。In order to make those skilled in the art better understand the technical solutions in the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the accompanying drawings in the embodiments of the present invention. The embodiments are only a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts shall fall within the scope of the present invention.
参见图1,示出了本发明实施例提供的一种多路服务器的结构示意图,本实施例以32路服务器为例进行说明。当然,在本发明的其它实施例中,多路服务器可以是16路、8路、4路的服务器,PBI板的数量可以根据实际需要灵活确定。Referring to FIG. 1, a schematic structural diagram of a multi-path server according to an embodiment of the present invention is shown. This embodiment uses a 32-way server as an example for description. Of course, in other embodiments of the present invention, the multi-path server may be a 16-way, 8-way, 4-way server, and the number of PBI boards may be flexibly determined according to actual needs.
如图1所示,所述多路服务器包括:32个CPU、至少8个PBI板和1个关键信号处理板。As shown in FIG. 1, the multi-way server includes: 32 CPUs, at least 8 PBI boards, and 1 key signal processing board.
本实施例以1个PBI板监控4个CPU为例进行说明,例如,PBI1监控CPU1~CPU4,PBI2监控CPU5~CPU8,PBI3监控CPU9~CPU12,PBI4监控CPU13~CPU16,依次类推,PBI8监控CPU29~CPU32。In this embodiment, four CPUs are monitored by one PBI board as an example. For example, PBI1 monitors CPU1~CPU4, PBI2 monitors CPU5~CPU8, PBI3 monitors CPU9~CPU12, PBI4 monitors CPU13~CPU16, and so on, PBI8 monitors CPU29~ CPU32.
每个PBI板均连接自身监控的多个CPU,CPU1~CPU4与PBI1连接,CPU5~CPU8与PBI2连接,依次类推,CPU29~CPU32与PBI8连接。 Each of the PBI boards is connected to a plurality of CPUs that are monitored by itself, CPUs 1 to 4 are connected to PBI 1, CPUs 5 to 8 are connected to PBI 2, and so on, and CPUs 29 to 32 are connected to PBI 8.
由于PBI板上的I/O接口的数量有限,PBI板通过自身的I/O接口连接自身监控的部分CPU,自身监控的另一部分CPU通过背板连接PBI板。Due to the limited number of I/O interfaces on the PBI board, the PBI board connects to some of the CPUs that it monitors through its own I/O interface, and another part of the CPU that it monitors is connected to the PBI board through the backplane.
当然,在本发明的其它实施例中,可以是1个PBI板监控2个或者8个CPU。如果1个PBI板监控2个CPU,则32路服务器需要配置16个PBI板;如果1个PBI板监控8个CPU,则32路服务器需要配置4个PBI板。Of course, in other embodiments of the invention, one PBI board can monitor 2 or 8 CPUs. If one PBI board monitors two CPUs, the 32-way server needs to be configured with 16 PBI boards; if one PBI board monitors eight CPUs, the 32-way servers need to be configured with four PBI boards.
各个PBI板均包括输入端和输出端,各个PBI板的输入端分别连接关键信号处理板100的输出端,各个PBI板的输出端分别连接关键信号处理板100的输入端。Each PBI board includes an input end and an output end. The input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards 100, and the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards 100.
PBI板接收到CPU发送的关键信号或者自身监控CPU的状态而产生相应的关键信号后,直接将关键信号发送给关键信号处理板100,由关键信号处理板100向相应的PBI板发送与所述关键信号相对应的回应信号。接收到回应信号的PBI板将所述回应信号发送给相应的CPU,从而保证各个CPU接收到回应信号满足要求。After receiving the key signal sent by the CPU or monitoring the state of the CPU and generating the corresponding key signal, the PBI board directly sends the key signal to the key signal processing board 100, and the key signal processing board 100 sends the corresponding signal to the corresponding PBI board. The response signal corresponding to the key signal. The PBI board that receives the response signal sends the response signal to the corresponding CPU, thereby ensuring that each CPU receives the response signal to meet the requirement.
其中,所述关键信号包括但不限于Powergood信号、Caterr信号、PMSYNC信号、TSC(Time Stamp Counte,时间戳计数器)信号、S4信号、Thermtrip信号、Powerbutton信号、MSMI信号等。The key signals include, but are not limited to, a Powergood signal, a Caterr signal, a PMSYNC signal, a TSC (Time Stamp Counte) signal, an S4 signal, a Thermtrip signal, a Powerbutton signal, an MSMI signal, and the like.
不同的关键信号对应的回应信号也不同,不同的回应信号的功能也不同。总体而言,通过回应信号使多路服务器中的各个CPU之间的关键信号满足多路服务器的要求。例如,Powergood信号的回应信号仍是Powergood信号,CPU收到Powergood信号后启动。The response signals corresponding to different key signals are also different, and the functions of different response signals are also different. In general, the critical signals between the CPUs in the multi-way server satisfy the requirements of the multi-way server by responding to the signal. For example, the response signal of the Powergood signal is still the Powergood signal, and the CPU starts after receiving the Powergood signal.
在多路服务器中,通过Powergood信号通知CPU:外界的各个电源已经完成上电。多路服务器对Powergood信号有非常严格的时序要求,例如,在多路服务器开机时,所有CPU收到的Powergood信号的时间差必须监控在10ns以内,这样能最大限度提高32路服务器的系统性能。并记录向各个CPU发送Powergood信号的时刻为初始时刻T0。In the multi-way server, the CPU is notified by the Powergood signal that the external power supplies have been powered on. Multi-channel servers have very strict timing requirements for Powergood signals. For example, when multiple servers are powered on, the time difference of Powergood signals received by all CPUs must be monitored within 10 ns, which can maximize the system performance of 32-way servers. The time at which the Powergood signal is transmitted to each CPU is recorded as the initial time T0.
例如,多路服务器作为一个服务器进行上电时,PBI板检测到自身监控的CPU上电后,向关键信号处理板发送第一Powergood信号(即,第一类电源上电信号)。当关键信号处理板接收到服务器内的全部PBI板分别发送的第一类Powergood信号后,同时向多路服务器内全部的PBI板返回第二Powergood信号(即,第二类电源上电信号)。PBI板接收到关键信号处理板发送的第二Powergood信号后,分别发送给自身监控的CPU。这样,由关键信号处理板将第二Powergood信号统一发送给各个PBI板,然后,再由各个PBI板分别发送给自身监控的CPU,从而保证多路服务器内的各个CPU接收到第二Powergood信号的时间差在预设时间(例如,10ns)内。 For example, when the multi-way server is powered on as a server, the PBI board detects that the CPU monitored by itself is powered on, and sends the first Powergood signal to the critical signal processing board (ie, the first type of power-on signal). After the key signal processing board receives the first type of Powergood signals sent by all the PBI boards in the server, the second Powergood signal (ie, the second type power-on signal) is returned to all PBI boards in the multi-way server. After receiving the second Powergood signal sent by the key signal processing board, the PBI board sends it to the CPU of its own monitoring. In this way, the second Powergood signal is uniformly sent to each PBI board by the key signal processing board, and then sent by each PBI board to the CPU of its own monitoring, thereby ensuring that each CPU in the multi-path server receives the second Powergood signal. The time difference is within a preset time (for example, 10 ns).
在CPU实现热插拔、online(上线)或offline(下线)时,新online的CPU接收到Powergood信号的时间必须是预设时刻,该预设时刻是T0加上预设周期,其中,不同厂商生产的CPU,或者,同一厂商生产的不同型号的CPU,规定的预设周期可能不相同。例如,Ivybridge处理器设定的预设周期可以是N*864BCLKs;Haswell处理器设定的预设周期可以是N*384BCLKs。其中,N为正整数,且1BCLK是10ns。When the CPU implements hot plugging, online (online) or offline (offline), the time that the new online CPU receives the Powergood signal must be the preset time. The preset time is T0 plus the preset period, where different The CPUs produced by the manufacturer, or different types of CPUs produced by the same manufacturer, may have different preset periods. For example, the preset period set by the Ivybridge processor can be N*864BCLKs; the preset period set by the Haswell processor can be N*384BCLKs. Where N is a positive integer and 1BCLK is 10 ns.
Caterr信号表示系统发生了严重错误。The Caterr signal indicates that the system has experienced a serious error.
PMSYNC(Power Management Sync,电源管理同步)信号是节能控制信号。The PMSYNC (Power Management Sync) signal is an energy saving control signal.
S4信号是多路服务器系统的一种状态信号,该信号由南桥芯片(PCH)发出,当CPU所在单板上的CPLD(Complex Programmable Logic Device,复杂可编程逻辑器件)检测到PCH发出S4以后,控制整个多路服务器系统上电。The S4 signal is a status signal of the multi-way server system. The signal is sent by the South Bridge (PCH). When the CPLD (Complex Programmable Logic Device) on the board where the CPU is located detects the PCH and sends the S4. Control the powering of the entire multi-way server system.
Thermtrip信号是CPU过热时发出的信号。The Thermtrip signal is a signal that is sent when the CPU overheats.
Powerbutton信号是触发服务器上的上电按钮时产生的信号,该信号用于通知PCH外界有按钮触发动作,需要上电开机或者下电关机。The Powerbutton signal is a signal generated when the power-on button on the server is triggered. This signal is used to notify the PCH that there is a button triggering action. It needs to be powered on or powered off.
MSMI信号和Caterr信号功能完全一样,是最新的Intel平台增加的一个信号。The MSMI signal is exactly the same as the Caterr signal and is a signal added by the latest Intel platform.
本发明实施例提供的多路服务器包括多个CPU、多个PBI板和一个关键信号处理板。每个PBI板监控至少两个CPU,且每个PBI板与自身所监控的CPU连接。各个PBI板的输出端分别连接所述关键信号处理板的输入端,各个PBI板的输入端分别连接所述关键信号处理板的输出端。关键信号处理板接收PBI板发送的关键信号后,向相应的所述PBI板返回与所述关键信号相对应的回应信号。接收到回应信号的PBI板将所述回应信号分别发送给自身监控的CPU,以使CPU接收到的回应信号满足多路服务器的要求。由上述内容可知,所述多路服务器通过关键信号处理板接收各个PBI板发出的关键信号,然后向相应的PBI板发送相应的回应信号,能够保证各个CPU接收到回应信号满足多路服务器的要求。而且,各个PBI板直接与关键信号处理板连接,不需要切换开关进行串联,减少切换开关的数量,结构简单、可靠性高。而且,减少了信号所经过的切换开关的数量,提高了信号质量。The multi-path server provided by the embodiment of the present invention includes a plurality of CPUs, a plurality of PBI boards, and a key signal processing board. Each PBI board monitors at least two CPUs, and each PBI board is connected to the CPU it monitors. The output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards, and the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards. After receiving the key signal sent by the PBI board, the key signal processing board returns a response signal corresponding to the key signal to the corresponding PBI board. The PBI board that receives the response signal sends the response signal to the CPU that is monitored by itself, so that the response signal received by the CPU satisfies the requirements of the multi-path server. It can be seen from the above that the multi-channel server receives the key signals sent by the PBI boards through the key signal processing boards, and then sends corresponding response signals to the corresponding PBI boards, so that each CPU receives the response signal to meet the requirements of the multi-path server. . Moreover, each PBI board is directly connected to the key signal processing board, and does not require a switch to be connected in series, reducing the number of switchers, and having a simple structure and high reliability. Moreover, the number of switching switches through which the signal passes is reduced, and the signal quality is improved.
此外,本发明实施例中各个PBI板之间没有通过转换开关串联,可以实现任意多个PBI组合构成一个硬分区,即灵活配置硬分区。例如,将PBI1和PBI3对应的CPU配置为一个硬分区,即配置为一个8路服务器;将PBI2和PBI5配置为一个硬分区,即配置为一个8路服务器。而现有技术中的多路服务器,由于PBI板之间通过转换开关串联,因此,在配置硬分区时只能将相邻的PBI配置为一个硬分区,不能实现灵活配置硬分区。In addition, in the embodiment of the present invention, the PBI boards are not connected in series through the conversion switch, and any plurality of PBI combinations can be realized to form one hard partition, that is, the hard partition is flexibly configured. For example, the CPU corresponding to PBI1 and PBI3 is configured as a hard partition, that is, configured as an 8-way server; PBI2 and PBI5 are configured as one hard partition, that is, configured as an 8-way server. In the prior art, the multi-path server, because the PBI boards are connected in series through the switch, therefore, the adjacent PBI can only be configured as a hard partition when configuring the hard partition, and the hard partition cannot be flexibly configured.
图2示出了本发明实施例一种PBI板的内部结构示意图,如图2所示,每个PBI 板均包括:转换开关200、电平转换芯片210、第一CPLD220、南桥芯片230。其中,转换开关200的数量与PBI板监控的CPU的数量一致,本实施例中每个PBI板内设置有四个转换开关。FIG. 2 is a schematic diagram showing the internal structure of a PBI board according to an embodiment of the present invention, as shown in FIG. 2, each PBI The boards each include: a changeover switch 200, a level shifting chip 210, a first CPLD 220, and a south bridge chip 230. The number of the changeover switches 200 is the same as the number of CPUs monitored by the PBI board. In this embodiment, four switchers are disposed in each PBI board.
每个CPU的I/O接口通过转换开关200连接电平转换芯片210的第一I/O接口,电平转换芯片210的第二I/O接口连接第一CPLD220的第一I/O接口。The I/O interface of each CPU is connected to the first I/O interface of the level conversion chip 210 through the changeover switch 200, and the second I/O interface of the level conversion chip 210 is connected to the first I/O interface of the first CPLD 220.
第一CPLD220的第一输入端作为PBI板的输入端连接关键信号处理板100的输出端。第一CPLD220的第一输出端作为PBI板的输出端连接关键信号处理板100的输入端。第一CPLD220的第二输出端连接南桥芯片230。The first input of the first CPLD 220 is connected to the output of the key signal processing board 100 as an input of the PBI board. The first output of the first CPLD 220 is connected to the input of the critical signal processing board 100 as an output of the PBI board. The second output of the first CPLD 220 is connected to the south bridge chip 230.
CPU输出的信号提供给电平转换芯片210转换成CPLD能处理的高电平的信号,转换成高电平信号后更适合远距离传输。The signal output by the CPU is supplied to the level conversion chip 210 to be converted into a high level signal that the CPLD can process, and is converted to a high level signal to be more suitable for long distance transmission.
电平转换芯片210将转换后的信号发送给第一CPLD220。第一CPLD220将接收到的信号发送给关键信号处理板100。The level conversion chip 210 transmits the converted signal to the first CPLD 220. The first CPLD 220 transmits the received signal to the key signal processing board 100.
关键信号处理板100向相应的PBI内的第一CPLD220返回与接收到的关键信号相对应的回应信号。然后,由第一CPLD220控制相应的转换开关200闭合,将第一CPLD220接收到关键信号处理板100发送的信号发送给相应的CPU。The key signal processing board 100 returns a response signal corresponding to the received key signal to the first CPLD 220 in the corresponding PBI. Then, the corresponding changeover switch 200 is controlled by the first CPLD 220 to be closed, and the signal sent by the first CPLD 220 to the key signal processing board 100 is sent to the corresponding CPU.
下面结合图1和图2,以不同的关键信号为例介绍本发明实施例提供的多路服务器的关键信号处理过程。The key signal processing process of the multi-path server provided by the embodiment of the present invention is described in the following with reference to FIG. 1 and FIG.
(1)32路服务器作为一个服务器上电时,Powergood信号的传输过程:(1) When the 32-way server is powered on as a server, the transmission process of Powergood signals:
11)SMM(System Management Module,系统管理模块)板通知关键信号处理板该多路服务器是32路服务器。SMM板用于获取多路服务器系统的分区信息。11) SMM (System Management Module) board notifies the key signal processing board that the multi-path server is a 32-way server. The SMM board is used to obtain partition information of the multi-path server system.
12)各个PBI板及其监控的各个CPU分别上电;各个PBI板检测到自己监控的各个CPU的电源均上电完成后,产生第一Powergood信号并发送给关键信号处理板,表明该PBI板及其监控的CPU已经上电完成。12) Each PBI board and each CPU it monitors are powered on separately; after each PBI board detects that the power of each CPU monitored by itself is powered on, the first Powergood signal is generated and sent to the key signal processing board, indicating that the PBI board is displayed. The CPU it monitors has been powered on.
在一个实施例中,每个PBI板可以产生一个第一Powergood信号,当检测到自身控制的全部CPU的电源都上电完成时产生。In one embodiment, each PBI board can generate a first Powergood signal that is generated when power to all of the CPUs that it controls is detected to be powered up.
各个PBI板的电源不需要同步上电,关键信号处理板通过延时或去抖处理避免各个PBI板上电过程带来的不稳定状态,使系统的可靠性更高。延时是指在关键信号处理板接收到关键信号后先不处理,延时预设时长等关键信号完全稳定后再进行处理。预设时长可以根据上电稳定时间设定,当然也可以根据实际需求设定。The power supply of each PBI board does not need to be powered on synchronously, and the key signal processing board avoids the unstable state caused by the power process of each PBI board by delay or debounce processing, so that the reliability of the system is higher. The delay means that the key signal processing board does not process the key signal after receiving the key signal, and the key signal such as the delay preset time period is completely stable before processing. The preset duration can be set according to the power-on stabilization time. Of course, it can also be set according to actual needs.
需要说明的是,各个PBI板将第一Powergood信号发送给关键信号处理板后, PBI板不做任何处理,直到接收到关键信号处理板发送的第二Powergood信号。It should be noted that after each PBI board sends the first Powergood signal to the key signal processing board, The PBI board does nothing until it receives the second Powergood signal sent by the critical signal processing board.
13)关键信号处理板接收到全部PBI板分别发送的第一Powergood信号后,在同一时刻分别向全部PBI板发送第二Powergood信号,并记录发送第二Powergood信号的时刻为初始时刻T0。13) After receiving the first Powergood signal sent by all the PBI boards, the key signal processing board sends the second Powergood signal to all PBI boards at the same time, and records the time at which the second Powergood signal is transmitted as the initial time T0.
此外,由于不同的单板之间传输信号具有离散性,以及信号的传输线路不同延时也不同,要求关键信号处理板到各个PBI板之间的传输线路的长度相等,这样能够保证各个PBI板收到第二Powergood信号的时刻保持一致,进而能够满足系统对各个CPU收到第二Powergood信号的时序要求。In addition, since the signals transmitted between different boards are discrete, and the transmission lines of the signals have different delays, the lengths of the transmission lines between the key signal processing boards and the PBI boards are required to be equal, so that each PBI board can be guaranteed. The timing of receiving the second Powergood signal is consistent, thereby satisfying the timing requirement of the system for each CPU to receive the second Powergood signal.
需要说明的是,在CPU热插拔时,目前的BIOS(Basic Input Output System,基本输入输出系统)只能做到两个CPU同时online(上线)或offline(下线),因此,在多路服务器中,需要将两个CPU划分成一组,例如,32路服务器一共有16组。It should be noted that when the CPU is hot swapped, the current BIOS (Basic Input Output System) can only achieve two CPUs simultaneously online (online) or offline (offline), so in multiple channels. In the server, you need to divide the two CPUs into one group. For example, there are 16 groups of 32-way servers.
为了使多路服务器上电与CPU热插拔时,对Powergood信号的处理过程相兼容。多路服务器上电时将每两个CPU划分成一组,每个PBI板针对每一组CPU产生一个第一Powergood信号,并分别发送给关键信号处理板。关键信号处理板需要针对每一组CPU向PBI板返回一个第二Powergood信号,保证第二Powergood信号的数量与第一Powergood信号的数量相同。例如,每个PBI板监控4个CPU,两个CPU为一组,则每个PBI板产生两个第一Powergood信号。而且,关键信号处理板需要同时向每个PBI板返回两个第二Powergood信号。In order to enable the multi-way server to be powered on and off with the CPU, the processing of the Powergood signal is compatible. When the multi-way server is powered on, each two CPUs are divided into one group. Each PBI board generates a first Powergood signal for each group of CPUs and sends them to the key signal processing boards. The key signal processing board needs to return a second Powergood signal to the PBI board for each group of CPUs, ensuring that the number of second Powergood signals is the same as the number of first Powergood signals. For example, each PBI board monitors 4 CPUs, and two CPUs are a group, and each PBI board generates two first Powergood signals. Moreover, the critical signal processing board needs to return two second Powergood signals to each PBI board simultaneously.
14)PBI板中的第一CPLD将关键信号处理板发送的第二Powergood信号分别发送给自身监控的各个CPU。14) The first CPLD in the PBI board sends the second Powergood signal sent by the key signal processing board to each CPU monitored by itself.
关键信号处理板可以同时向各个PBI板分别发送第二Powergood信号,然后,再由各个PBI板将所述第二Powergood信号分别发送给相应的CPU,从而保证各个CPU收到第二Powergood信号的时刻满足时序要求(即,各个CPU收到第二Powergood信号的时间差在10ns以内)。The key signal processing board can simultaneously send the second Powergood signal to each PBI board, and then the second Powergood signal is sent to the corresponding CPU by each PBI board, thereby ensuring that each CPU receives the second Powergood signal. The timing requirements are met (ie, the time difference between each CPU receiving the second Powergood signal is within 10 ns).
(2)当将32路服务器划分成多个子服务器时,子服务器上电的过程与上述过程类似。其中,一个子服务器即一个硬分区,每个子服务器可以单独工作。例如,32路服务器可以分为4个8路的子服务器,也可以分为1个16路的子服务器和2个8路的子服务器。(2) When the 32-way server is divided into multiple sub-servers, the process of powering up the sub-server is similar to the above process. Among them, one sub-server is a hard partition, and each sub-server can work alone. For example, a 32-way server can be divided into four 8-way sub-servers, or can be divided into one 16-way sub-server and two 8-way sub-servers.
子服务器的上电过程如下: The power-on process of the sub-server is as follows:
21)SMM板通知关键信号处理板该多路服务器的硬分区信息。硬分区信息可以包括多路服务器包含几个硬分区、每个硬分区包含哪些CPU等信息。21) The SMM board notifies the critical signal processing board of the hard partition information of the multi-way server. The hard partition information may include information that the multi-way server contains several hard partitions, and which CPUs each hard partition contains.
22)各个硬分区的PBI板及其监控的各个CPU分别上电;PBI板检测到自己监控的各个CPU的电源均上电完成后,产生第一Powergood信号(即,第三类电源上电信号)并发送给关键信号处理板,表明该PBI板及其监控的CPU已经上电完成。22) Each hard partitioned PBI board and its monitored CPUs are respectively powered on; the PBI board detects that each of the CPUs monitored by the CPU is powered on, and generates a first Powergood signal (ie, a third type of power-on signal) And sent to the key signal processing board, indicating that the PBI board and its monitored CPU have been powered on.
23)关键信号处理板接收到硬分区内的PBI板发送的第一Powergood信号之后,确定该PBI板所在的硬分区。23) After receiving the first Powergood signal sent by the PBI board in the hard partition, the critical signal processing board determines the hard partition where the PBI board is located.
可以根据来自SMM板的硬分区信息确定发送所述第一Powergood信号的PBI板所在硬分区的信息,例如,确定发送第一Powergood信号的PBI板位于哪个硬分区内,且该硬分区内还包括哪些PBI板等信息。Determining, according to hard partition information from the SMM board, information of a hard partition where the PBI board that sends the first Powergood signal is located, for example, determining which hard partition the PBI board that sends the first Powergood signal is located, and the hard partition includes Which PBI boards and other information.
24)关键信号处理板接收到所述硬分区内的全部PBI板发送的第一Powergood信号之后,在同一时刻向该硬分区内的各个PBI板分别发送第二Powergood信号(即,第四类电源上电信号),并记录发送第二Powergood信号的时刻为初始时刻T0。24) After receiving the first Powergood signal sent by all PBI boards in the hard partition, the key signal processing board sends a second Powergood signal to each PBI board in the hard partition at the same time (ie, the fourth type power supply) The power-on signal is recorded, and the time at which the second Powergood signal is transmitted is recorded as the initial time T0.
25)PBI板中的第一CPLD将关键信号处理板发送的第二Powergood信号分别发送给自身监控的各个CPU。25) The first CPLD in the PBI board sends the second Powergood signal sent by the key signal processing board to each CPU monitored by itself.
通过上述的过程,能够实现硬分区内的各个CPU收到第二Powergood信号的时间满足时序要求。而且,各个硬分区分别上电、分别控制,相互之间不存在耦合关系。Through the above process, it can be realized that the time when each CPU in the hard partition receives the second Powergood signal satisfies the timing requirement. Moreover, each hard partition is separately powered and separately controlled, and there is no coupling relationship between them.
(3)当CPU online/offline时,Powergood信号的传输过程(3) When the CPU is online/offline, the transmission process of Powergood signal
当CPU online/offline时,32路服务器作为一个服务器或划分成多个子服务器Powergood信号的传输过程相同,具体过程如下:When the CPU is online/offline, the transmission process of the 32-way server as a server or Powergood signal divided into multiple sub-servers is the same. The specific process is as follows:
31)需要online的CPU插入多路服务器系统。31) The CPU that requires online is plugged into the multi-way server system.
32)PBI板检测到新online的CPU上电完成后,产生第一Powergood信号(即,第五类电源上电信号),并将第一Powergood信号发送给关键信号处理板。32) After the PBI board detects that the CPU of the new online is powered on, it generates a first Powergood signal (ie, the fifth type of power-on signal) and sends the first Powergood signal to the key signal processing board.
33)关键信号处理板接收到PBI板发送的第一Powergood信号后,在预设时刻将第二Powergood信号(即,第六类电源上电信号)发送给所述PBI板。33) After receiving the first Powergood signal sent by the PBI board, the key signal processing board sends a second Powergood signal (ie, a sixth type power supply power-on signal) to the PBI board at a preset time.
所述预设时刻为所述初始时刻T0加上预设周期。其中,不同厂商生产的CPU,或者,同一厂商生产的不同型号的CPU,规定的预设周期可能不相同,例如,Ivybridge处理器对应的预设周期为N*864BCLKs,相应的Ivybridge处理器对应的 预设时刻为T0+N*864BCLKs;Haswell处理器对应的预设周围为N*384BCLKs,相应的Haswell处理器对应的预设时刻为T0+N*384BCLKs,其中,N为正整数。The preset time is the initial time T0 plus a preset period. Among them, the CPUs produced by different manufacturers, or different types of CPUs produced by the same manufacturer, may have different preset periods. For example, the preset period of the Ivybridge processor is N*864BCLKs, corresponding to the corresponding Ivybridge processor. The preset time is T0+N*864BCLKs; the preset around the Haswell processor is N*384BCLKs, and the corresponding preset time of the corresponding Haswell processor is T0+N*384BCLKs, where N is a positive integer.
34)由所述PBI板将接收到的第二Powergood信号发送给相应的CPU。34) transmitting, by the PBI board, the received second Powergood signal to the corresponding CPU.
通过上述的过程实现CPU online时,online的CPU接收到的第二Powergood信号满足预设时间要求。When the CPU online is implemented through the above process, the second Powergood signal received by the online CPU meets the preset time requirement.
下面以系统错误信号(即,Caterr信号)为例,介绍本发明实施例提供的多路服务器的工作过程。The following describes the working process of the multi-path server provided by the embodiment of the present invention by taking a system error signal (ie, a Caterr signal) as an example.
(4)若32路服务器作为一个服务器使用,则Caterr信号的传输过程如下:(4) If the 32-way server is used as a server, the transmission process of the Caterr signal is as follows:
41)PBI板接收到CPU发送的第一Caterr信号(即,第一系统错误信号)时,该PBI板将所述第一Caterr信号发送给关键信号处理板。41) When the PBI board receives the first Caterr signal sent by the CPU (ie, the first system error signal), the PBI board transmits the first Caterr signal to the key signal processing board.
42)关键信号处理板接收到的第一Caterr信号后,向除发送所述第一Caterr信号的PBI板之外的其它各个PBI板分别发送第二Caterr信号(即,第二系统错误信号)。42) After receiving the first Caterr signal from the key signal processing board, transmitting a second Caterr signal (ie, a second system error signal) to each of the PBI boards except the PBI board that transmits the first Caterr signal.
关键信号处理板接收到第一Caterr信号后,根据与Caterr信号对应的标志位确定是否响应该第一Caterr信号。例如,当与Caterr信号对应的标志位为“0”时,关键信号处理板不再响应其它CPU发来的Caterr信号;当与Caterr信号对应的标志位为“1”时,关键信号处理板能够响应其它CPU发来的Caterr信号。After receiving the first Caterr signal, the key signal processing board determines whether to respond to the first Caterr signal according to the flag bit corresponding to the Caterr signal. For example, when the flag corresponding to the Caterr signal is "0", the key signal processing board no longer responds to the Caterr signal sent by other CPUs; when the flag corresponding to the Caterr signal is "1", the key signal processing board can Respond to Caterr signals from other CPUs.
当关键信号处理板接收到第一个Caterr信号后,将与Caterr信号对应的标志位修改成二进制数“0”。若第一个Caterr信号由低电平变为高电平时,表明故障排除,此时,将与Caterr信号对应的标志位修改成二进制数“1”。When the key signal processing board receives the first Caterr signal, it changes the flag corresponding to the Caterr signal to a binary number "0". If the first Caterr signal changes from low to high, it indicates that the fault is eliminated. At this time, the flag corresponding to the Caterr signal is modified to a binary number "1".
关键信号处理板响应CPU发来的Caterr信号是指关键信号处理板接收到CPU发送的Caterr信号之后,将Caterr信号转发给与该CPU处于同一硬分区的其它CPU,即把与该CPU处于同一硬分区的其它CPU的Caterr信号线由高电平拉成低电平,从而屏蔽其它CPU发送的Caterr信号。如果发出Caterr信号的CPU能够将该错误自行修复,则该Caterr信号就会消失,系统恢复正常;如果CPU无法自行修复,系统将会出现挂死现象。The key signal processing board responds to the Caterr signal sent by the CPU. After the key signal processing board receives the Caterr signal sent by the CPU, it forwards the Caterr signal to other CPUs that are in the same hard partition as the CPU, that is, the same as the CPU. The Caterr signal lines of other CPUs in the partition are pulled low by the high level to shield the Caterr signals sent by other CPUs. If the CPU that issued the Caterr signal can fix the error by itself, the Caterr signal will disappear and the system will return to normal. If the CPU cannot repair itself, the system will hang.
43)各个PBI板将接收到的第二Caterr信号分别发送给相应的CPU。43) Each PBI board sends the received second Caterr signal to the corresponding CPU.
(5)若32路服务器划分成多个子服务器使用,每个子服务器即一个硬分区。Caterr信号的传输过程如下: (5) If the 32-way server is divided into multiple sub-servers, each sub-server is a hard partition. The transmission process of the Caterr signal is as follows:
51)SMM板通知关键信号处理板多路服务器的硬分区信息。51) The SMM board notifies the hard partition information of the multi-path server of the key signal processing board.
52)PBI板接收到CPU发送的第三Caterr信号(即,第三系统错误信号)时,该PBI板将所述第三Caterr信号发送给关键信号处理板。52) When the PBI board receives the third Caterr signal (ie, the third system error signal) sent by the CPU, the PBI board transmits the third Caterr signal to the key signal processing board.
53)关键信号处理板接收到第三Caterr信号后,根据所述硬分区信息确定该PBI板对应的硬分区。53) After receiving the third Caterr signal, the key signal processing board determines a hard partition corresponding to the PBI board according to the hard partition information.
即关键信号处理板确定第三Caterr信号对应的回应信号的广播范围,换言之,确定将与第三Caterr信号对应的回应信号发送给哪些PBI板。That is, the key signal processing board determines the broadcast range of the response signal corresponding to the third Caterr signal, in other words, determines which PBI boards to send the response signal corresponding to the third Caterr signal.
54)关键信号处理板将第四Caterr信号(即,第四系统错误信号)发送给所述PBI板所在硬分区中的其它PBI板。54) The key signal processing board transmits a fourth Caterr signal (ie, a fourth system error signal) to the other PBI boards in the hard partition where the PBI board is located.
55)所述硬分区中的其它PBI板接收到第四Caterr信号后,发送给自己监控的CPU。55) After receiving the fourth Caterr signal, the other PBI boards in the hard partition are sent to the CPU of their own monitoring.
利用本实施例提供的多路服务器,关键信号处理板接收到CPU发送的Caterr信号后,能够通过PBI发送给相应的CPU,由于PBI板之间未通过转换开关串联,而是直接与关键信号处理板连接,因此,控制过程简单,而且大大减少了Caterr信号的广播时间。另外,减少了信号所经过的切换开关的数量,能够提高信号质量。With the multi-path server provided by the embodiment, after receiving the Caterr signal sent by the CPU, the key signal processing board can be sent to the corresponding CPU through the PBI, because the PBI boards are not connected in series through the switch, but directly deal with the key signals. The board is connected so that the control process is simple and the broadcast time of the Caterr signal is greatly reduced. In addition, the number of switches through which the signal passes is reduced, and the signal quality can be improved.
相应于上述的多路服务器,本发明还提供了多路服务器的信号同步方法实施例。Corresponding to the multi-way server described above, the present invention also provides an embodiment of a signal synchronization method for a multi-way server.
参见图3,为本发明实施例提供的一种多路服务器的信号处理方法的流程示意图,该方法应用于多路服务器的关键信号处理板中。所述多路服务器包括多个CPU、多个PBI板和关键信号处理板,其中,一个PBI板监控至少两个CPU;每个PBI板均连接自身监控的多个CPU;各个PBI板均包括输入端和输出端,各个PBI板的输入端分别连接关键信号处理板的输出端,各个PBI板的输出端分别连接关键信号处理板的输入端。3 is a schematic flowchart of a signal processing method of a multi-path server according to an embodiment of the present invention. The method is applied to a key signal processing board of a multi-path server. The multi-way server includes a plurality of CPUs, a plurality of PBI boards, and a key signal processing board, wherein one PBI board monitors at least two CPUs; each PBI board is connected to a plurality of CPUs monitored by itself; each PBI board includes an input At the end and the output end, the input ends of the respective PBI boards are respectively connected to the output ends of the key signal processing boards, and the output ends of the respective PBI boards are respectively connected to the input ends of the key signal processing boards.
如图3所示,该方法可以包括以下步骤:As shown in FIG. 3, the method may include the following steps:
S110,关键信号处理板接收PBI板发送的关键信号。S110, the key signal processing board receives the key signal sent by the PBI board.
所述关键信号由所述PBI板所监控的CPU发送给所述PBI板,或者,由PBI板检测到CPU的相应状态后产生,例如,当PBI板检测到自己监控的各个CPU的电源均上电完成后,产生第一Powergood信号;又如,当CPU发生故障后,产生第一Caterr信号,并将第一Caterr信号发送给PBI板,由PBI板发送给关键信号处理板。The key signal is sent by the CPU monitored by the PBI board to the PBI board, or is generated by the PBI board detecting the corresponding state of the CPU, for example, when the PBI board detects the power of each CPU that it monitors. After the power is completed, the first Powergood signal is generated; for example, when the CPU fails, the first Caterr signal is generated, and the first Caterr signal is sent to the PBI board, and sent by the PBI board to the key signal processing board.
所述关键信号包括但不限于Powergood信号、Caterr信号、PMSYNC信号、TSC信 号、S4信号、Thermtrip信号、Powerbutton信号、MSMI信号。The key signals include, but are not limited to, Powergood signals, Caterr signals, PMSYNC signals, TSC signals. No., S4 signal, Thermtrip signal, Powerbutton signal, MSMI signal.
其中Powergood信号和Caterr信号的处理过程如上述过程所述,此处不再赘述。The processing of the Powergood signal and the Caterr signal is as described in the above process, and will not be described here.
S120,关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,以使接收到所述回应信号的PBI板将所述回应信号发送给所述PBI板监控的CPU。S120. The key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, so that the PBI board that receives the response signal sends the response signal to the PBI. Board monitors the CPU.
由上述实施例可见,多路服务器可以包括32个CPU、8个PBI板和一个关键信号处理板。每个PBI板监控多个CPU;各个PBI板的输出端均连接所述关键信号处理板的输入端,各个PBI板的输入端连接所述关键信号处理板的输出端;关键信号处理板接收PBI板发送的关键信号,产生与所述关键信号相对应的回应信号,并将所述回应信号返回给相应的PBI板。接收到回应信号的PBI板将所述回应信号分发给相应的CPU。由上述内容可知,各个PBI板直接与关键信号处理板连接,不需要切换开关进行串联,减少切换开关的数量,结构简单、可靠性高,而且减少了信号所经过的切换开关提高了信号质量。As can be seen from the above embodiments, the multi-way server can include 32 CPUs, 8 PBI boards, and a key signal processing board. Each PBI board monitors a plurality of CPUs; an output end of each PBI board is connected to an input end of the key signal processing board, an input end of each PBI board is connected to an output end of the key signal processing board; and a key signal processing board receives a PBI The key signal sent by the board generates a response signal corresponding to the key signal and returns the response signal to the corresponding PBI board. The PBI board that receives the response signal distributes the response signal to the corresponding CPU. It can be seen from the above that each PBI board is directly connected to the key signal processing board, does not need to switch the switch to be connected in series, reduces the number of switching switches, has a simple structure and high reliability, and reduces the switching quality of the signal passing through the switching switch.
若关键信号为多路服务器上电时的电源上电信号(Powergood信号),则步骤S120可以包括以下两种情况:If the key signal is a power-on signal (Powergood signal) when the multi-server is powered on, step S120 may include the following two situations:
其一,多路服务器作为一个服务器上电First, the multi-way server is powered on as a server.
所述关键信号处理板接收到所述多路服务器内的全部PBI板分别发送的第一类电源上电信号后,同时向全部PBI板发送作为与所述第一类电源上电信号相对应的回应信号的第二类电源上电信号,以使所述多路服务器内的全部CPU接收到第二类电源上电信号的时间差满足预设时间要求。Receiving, by the key signal processing board, the first type of power-on signal sent by all the PBI boards in the multi-path server, and simultaneously transmitting to the entire PBI board as corresponding to the power-on signal of the first type of power source. And responding to the second type of power-on signal of the signal, so that all the CPUs in the multi-way server receive the time difference of the second-type power-on signal to meet the preset time requirement.
其中,所述第一类电源上电信号由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The first type of power-on signal is generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
其二,所述多路服务器划分成至少两个子服务器,任意一个子服务器上电的过程如下:Second, the multi-way server is divided into at least two sub-servers, and the process of powering any one of the sub-servers is as follows:
所述关键信号处理板接收到所述子服务器内的全部PBI板分别发送的所述第三类电源上电信号后,同时向所述子服务器内的全部PBI板分别发送作为与所述第三类电源上电信号相对应的回应信号的第四类电源上电信号,以使所述子服务器内的全部CPU接收到所述第四类电源上电信号的时间差满足预设时间要求。After receiving the third type power-on signal sent by all the PBI boards in the sub-server, the key signal processing board simultaneously sends the third-type power-on signal to the PBI board in the sub-server as the third The fourth type of power-on signal corresponding to the response signal of the power-type power-on signal, so that all the CPUs in the sub-server receive the time difference of the power-on signal of the fourth-type power source to meet the preset time requirement.
其中,所述第三类电源上电信号均由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The third type power-on power-on signal is generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
若关键信号为CPU online/offline时的电源上电信号(即,Powergood信号),步骤 S120可以包括以下过程:If the key signal is the power-on signal (ie, Powergood signal) when the CPU is online/offline, the steps are S120 can include the following process:
当所述多路服务器中插入新的CPU时,所述关键信号处理板接收到所述新的CPU对应的PBI板发送的第五类电源上电信号后,并在预设时刻向所述新的CPU对应的PBI板发送作为与所述第五类电源上电信号相对应的回应信号的第六类电源上电信号,以使所述新的CPU接收到所述第六类电源上电信号的时间满足预设时间要求。When a new CPU is inserted in the multi-path server, the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and sends the new signal to the new one at a preset time. The PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth type power-on signal, so that the new CPU receives the sixth-type power-on signal The time meets the preset time requirement.
所述第五类电源上电信号由所述PBI板检测到所述新的CPU上电完成时产生。The fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
若关键信号为系统错误信号(即Caterr信号),则步骤S120可以包括以下两种情况:If the key signal is a system error signal (ie, a Caterr signal), step S120 may include the following two cases:
其一,若多路服务器作为一个服务器使用First, if the multi-way server is used as a server
关键信号处理板接收到任意一个PBI板发送的第一系统错误信号后,同时向所述多路服务器中除发送所述系统错误信号PBI之外的其它各个PBI板发送第二系统错误信号;所述第二系统错误信号是与所述第一系统错误信号相对应的所述回应信号After receiving the first system error signal sent by any one of the PBI boards, the key signal processing board simultaneously sends a second system error signal to each of the PBI boards except the system error signal PBI. The second system error signal is the response signal corresponding to the first system error signal
其二,多路服务器划分成至少两个子服务器使用Second, the multi-way server is divided into at least two sub-servers.
关键信号处理板获取所述多路服务器的子服务器信息,当接收到任意一个PBI板发送的第三系统错误信号时,根据所述子服务器信息确定与发送所述第三系统错误信号的PBI板属于同一个子服务器的其它各个PBI板;同时向所述子服务器中除发送所述第三系统错误信号的PBI板之外的其它各个PBI板发送第四系统错误信号;所述第四系统错误信号是与所述第三系统错误信号相对应的所述回应信号。The key signal processing board acquires the sub-server information of the multi-path server, and when receiving the third system error signal sent by any one of the PBI boards, determining, according to the sub-server information, the PBI board that sends the third system error signal Other PBI boards belonging to the same sub-server; simultaneously transmitting a fourth system error signal to each of the PBI boards except the PBI board transmitting the third system error signal; the fourth system error signal Is the response signal corresponding to the third system error signal.
可选地,关键信号为Caterr信号时,所述方法还包括:Optionally, when the key signal is a Caterr signal, the method further includes:
当检测到当前PBI板发送的第一系统错误信号或第三系统错误由有效状态变为无效状态时,所述关键信号处理板接收下一个第一系统错误信号或第三系统错误信号。The key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
其中,系统错误信号的状态是否有效可以通过检测与系统错误信号对应的标志位获得,在一个具体的实施例中,若与系统错误信号对应的标志位为二进制数“0”时,表明系统错误信号有效;若与系统错误信号对应的标志位为二进制数“1”时,表明系统错误无效。当然,也可以是标志位为“1”时,表示系统错误信号有效;标志位为“0”时,表示系统错误信号无效。Wherein, whether the state of the system error signal is valid can be obtained by detecting a flag bit corresponding to the system error signal. In a specific embodiment, if the flag bit corresponding to the system error signal is a binary number “0”, indicating a system error. The signal is valid; if the flag corresponding to the system error signal is binary number "1", it indicates that the system error is invalid. Of course, when the flag bit is "1", it indicates that the system error signal is valid; when the flag bit is "0", it indicates that the system error signal is invalid.
通过以上的方法实施例的描述,所属领域的技术人员可以清楚地了解到本发明可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而 前述的存储介质包括:只读存储器(ROM)、随机存取存储器(RAM)、磁碟或者光盘等各种可以存储程序代码的介质。Through the description of the above method embodiments, those skilled in the art can clearly understand that the present invention can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better. Implementation. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium, including a plurality of instructions for causing a A computer device (which may be a personal computer, server, or network device, etc.) performs all or part of the steps of the methods described in various embodiments of the present invention. And The foregoing storage medium includes various media that can store program codes, such as a read only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于装置或系统实施例而言,由于其基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。以上所描述的装置及系统实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。The various embodiments in the specification are described in a progressive manner, and the same or similar parts between the various embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for a device or system embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and the relevant portions can be referred to the description of the method embodiment. The apparatus and system embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie It can be located in one place or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment. Those of ordinary skill in the art can understand and implement without any creative effort.
本发明可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、组件、数据结构等等。也可以在分布式计算环境中实践本发明,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。The invention may be described in the general context of computer-executable instructions executed by a computer, such as a program module. Generally, program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are connected through a communication network. In a distributed computing environment, program modules can be located in both local and remote computer storage media including storage devices.
需要说明的是,在本文中,诸如“第一”和“第二”等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that, in this context, relational terms such as "first" and "second" are used merely to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply these There is any such actual relationship or order between entities or operations. Furthermore, the term "comprises" or "comprises" or "comprises" or any other variations thereof is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device that comprises a plurality of elements includes not only those elements but also Other elements, or elements that are inherent to such a process, method, item, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.
以上所述仅是本发明的具体实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。 The above is only a specific embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.

Claims (10)

  1. 一种多路服务器,其特征在于,包括:多个CPU、多个南桥基板管理监控器输入输出PBI板,以及关键信号处理板,其中,一个PBI板监控至少两个CPU;A multi-way server, comprising: a plurality of CPUs, a plurality of south bridge substrate management monitor input and output PBI boards, and a key signal processing board, wherein one PBI board monitors at least two CPUs;
    所述PBI板连接自身监控的CPU;The PBI board is connected to a CPU that is monitored by itself;
    各个PBI板的输出端连接所述关键信号处理板的输入端,各个PBI板的输入端连接所述关键信号处理板的输出端,所述关键信号处理板接收所述PBI板发送的关键信号,并向PBI板返回与所述关键信号相对应的回应信号;An output end of each PBI board is connected to an input end of the key signal processing board, and an input end of each PBI board is connected to an output end of the key signal processing board, and the key signal processing board receives a key signal sent by the PBI board, And returning a response signal corresponding to the key signal to the PBI board;
    接收到所述回应信号的PBI板将所述回应信号分别发送给自身监控的CPU。The PBI board that receives the response signal sends the response signal to the CPU that it monitors.
  2. 根据权利要求1所述的多路服务器,其特征在于,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括:The multiplex server according to claim 1, wherein said key signal processing board receives a key signal transmitted by said PBI board and returns a response signal corresponding to said key signal to said PBI board, including :
    当所述多路服务器作为一个服务器进行上电时,所述关键信号处理板接收到所述多路服务器内的全部PBI板分别发送的第一类电源上电信号后,同时向所述全部PBI板发送作为与所述第一类电源上电信号相对应的回应信号的第二类电源上电信号,以使所述多路服务器内的全部CPU接收到第二类电源上电信号的时间差满足预设时间要求;When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-path server, and simultaneously sends the PBI to all the PBIs. The board transmits a second type of power-on signal as a response signal corresponding to the first-type power-on signal, so that all CPUs in the multi-channel server receive the time difference of the second-type power-on signal Preset time requirement;
    若所述多路服务器内的全部CPU划分成至少两个子服务器,当任意一个子服务器上电时,所述关键信号处理板接收到所述子服务器内的全部PBI板分别发送的所述第三类电源上电信号后,同时向所述子服务器内的全部PBI板分别发送作为与所述第三类电源上电信号相对应的第四类电源上电信号,以使所述子服务器内的全部CPU接收到所述第四类电源上电信号的时间差满足预设时间要求;If all the CPUs in the multi-path server are divided into at least two sub-servers, when any one of the sub-servers is powered on, the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers. After the power-on signal is sent to the power supply, the fourth type power-on signal corresponding to the third-type power-on signal is sent to all the PBI boards in the sub-server, so that the sub-server is in the sub-server. The time difference that all CPUs receive the power-on signal of the fourth type of power source meets a preset time requirement;
    其中,所述第一类电源上电信号和所述第三类电源上电信号均由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  3. 根据权利要求1所述的多路服务器,其特征在于,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括:The multiplex server according to claim 1, wherein said key signal processing board receives a key signal transmitted by said PBI board and returns a response signal corresponding to said key signal to said PBI board, including :
    当所述多路服务器中插入新的CPU时,所述关键信号处理板接收到所述新的CPU对应的PBI板发送的第五类电源上电信号后,在预设时刻向所述新的CPU对应的PBI板发送作为与所述第五类电源上电信号相对应的回应信号的第六类电源上电信号,以使所述新的CPU接收到所述第六类电源上电信号的时间满足预设时间要求;When a new CPU is inserted in the multi-path server, the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time. a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
    所述第五类电源上电信号由所述PBI板检测到所述新的CPU上电完成时产生。The fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
  4. 根据权利要求1所述的多路服务器,其特征在于,所述关键信号处理板接收所 述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,包括:The multiplex server according to claim 1, wherein said key signal processing board receives Deriving a key signal transmitted by the PBI board and returning a response signal corresponding to the key signal to the PBI board, including:
    若所述多路服务器作为一个服务器使用,所述关键信号处理板接收到任意一个PBI板发送的第一系统错误信号后,分别向所述多路服务器中除发送所述系统错误信号PBI之外的其它各个PBI板发送作为与所述第一系统错误信号对应的回应信号的第二系统错误信号;If the multi-path server is used as a server, the key signal processing board receives the first system error signal sent by any one of the PBI boards, and sends the system error signal PBI to the multi-path server respectively. The other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
    若所述多路服务器划分成至少两个子服务器,所述关键信号处理板接收到任意一个PBI板发送的第三系统错误信号时,根据获得的所述子服务器信息确定与发送所述第三系统错误信号的PBI板属于同一个子服务器的其它各个PBI板;分别向所述子服务器中除发送所述第三系统错误信号的PBI板之外的其它各个PBI板发送作为与所述第三系统错误信号对应的回应信号的第四系统错误信号。If the multi-way server is divided into at least two sub-servers, and the key signal processing board receives the third system error signal sent by any one of the PBI boards, determining and transmitting the third system according to the obtained sub-server information The PBI boards of the error signals belong to other PBI boards of the same sub-server; respectively, to each of the PBI boards except the PBI board that transmits the third system error signal, as the third system error The fourth system error signal of the response signal corresponding to the signal.
  5. 根据权利要求4所述的多路服务器,其特征在于,所述关键信号处理板接收所述PBI板发送的关键信号,并向所述PBI板返回与所述关键信号相对应的回应信号,还包括:The multiplex server according to claim 4, wherein said key signal processing board receives a key signal transmitted by said PBI board, and returns a response signal corresponding to said key signal to said PBI board, include:
    当检测到当前PBI板发送的第一系统错误信号或第三系统错误由有效状态变为无效状态时,所述关键信号处理板接收下一个第一系统错误信号或第三系统错误信号。The key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
  6. 一种多路服务器的信号处理方法,其特征在于,应用于多路服务器中,所述多路服务器包括多个CPU、多个南桥基板管理监控器输入输出PBI板,以及关键信号处理板,其中,一个PBI板监控至少两个CPU;所述方法包括:A signal processing method for a multi-path server, which is characterized in that it is applied to a multi-path server, which includes a plurality of CPUs, a plurality of south bridge substrate management monitor input and output PBI boards, and a key signal processing board. Wherein, one PBI board monitors at least two CPUs; the method includes:
    所述关键信号处理板接收所述PBI板发送的关键信号;The key signal processing board receives a key signal sent by the PBI board;
    所述关键信号处理板根据接收到的所述关键信号向所述PBI板返回与所述关键信号相对应的回应信号,以使接收到所述回应信号的所述PBI板将所述回应信号发送给所述PBI板监控的CPU。The key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, so that the PBI board that receives the response signal sends the response signal The CPU that is monitored by the PBI board.
  7. 根据权利要求6所述的方法,其特征在于,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,包括:The method according to claim 6, wherein the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
    当所述多路服务器作为一个服务器进行上电时,所述关键信号处理板接收到所述多路服务器内的全部PBI板分别发送的第一类电源上电信号后,同时向全部PBI板发送作为与所述第一类电源上电信号相对应的回应信号的第二类电源上电信号,以使所述多路服务器内的全部CPU接收到第二类电源上电信号的时间差满足预设时间要求;When the multi-way server is powered on as a server, the key signal processing board receives the first type power-on signal sent by all the PBI boards in the multi-way server, and simultaneously sends the power signals to all the PBI boards. a second type of power-on signal as a response signal corresponding to the power-on signal of the first type of power source, so that all CPUs in the multi-way server receive the time difference of the second-type power-on signal to meet the preset time requirement;
    若所述多路服务器内的全部CPU划分成至少两个子服务器,当任意一个子服务器上电时,所述关键信号处理板接收到所述子服务器内的全部PBI板分别发送的所述第三 类电源上电信号后,同时向所述子服务器内的全部PBI板分别发送作为与所述第三类电源上电信号相对应的回应信号的第四类电源上电信号,以使所述子服务器内的全部CPU接收到所述第四类电源上电信号的时间差满足预设时间要求;If all the CPUs in the multi-path server are divided into at least two sub-servers, when any one of the sub-servers is powered on, the key signal processing board receives the third of the PBI boards respectively sent by the sub-servers. After the power-on signal of the power-like type, a fourth type power-on signal as a response signal corresponding to the power-on signal of the third-type power source is simultaneously sent to all PBI boards in the sub-server, so that the sub-power The time difference between all the CPUs in the server receiving the power-on signal of the fourth type of power meets the preset time requirement;
    其中,所述第一类电源上电信号和所述第三类电源上电信号均由所述PBI板检测到自身监控的各个CPU的电源均上电完成时产生。The first type of power-on signal and the third type of power-on signal are generated when the PBI board detects that the power of each CPU that is monitored by itself is powered on.
  8. 根据权利要求6所述的方法,其特征在于,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,包括:The method according to claim 6, wherein the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
    当所述多路服务器中插入新的CPU时,所述关键信号处理板接收到所述新的CPU对应的PBI板发送的第五类电源上电信号后,在预设时刻向所述新的CPU对应的PBI板发送作为与所述第五类电源上电信号相对应的回应信号的第六类电源上电信号,以使所述新的CPU接收到所述第六类电源上电信号的时间满足预设时间要求;When a new CPU is inserted in the multi-path server, the key signal processing board receives the fifth-type power-on signal sent by the PBI board corresponding to the new CPU, and then goes to the new one at a preset time. a PBI board corresponding to the CPU transmits a sixth type power-on signal as a response signal corresponding to the fifth-type power-on signal, so that the new CPU receives the sixth-type power-on signal Time meets the preset time requirement;
    所述第五类电源上电信号由所述PBI板检测到所述新的CPU上电完成时产生。The fifth type of power-on signal is generated when the PBI board detects that the new CPU is powered on.
  9. 根据权利要求8所述的方法,其特征在于,所述关键信号处理板根据接收到的所述关键信号向PBI板返回与所述关键信号相对应的回应信号,包括:The method according to claim 8, wherein the key signal processing board returns a response signal corresponding to the key signal to the PBI board according to the received key signal, including:
    若所述多路服务器作为一个服务器使用,所述关键信号处理板接收到任意一个PBI板发送的第一系统错误信号后,同时向所述多路服务器中除发送所述系统错误信号PBI之外的其它各个PBI板发送作为与所述第一系统错误信号对应的回应信号的第二系统错误信号;If the multi-path server is used as a server, the key signal processing board receives the first system error signal sent by any one of the PBI boards, and simultaneously sends the system error signal PBI to the multi-path server. The other respective PBI boards transmit a second system error signal as a response signal corresponding to the first system error signal;
    若所述多路服务器划分成至少两个子服务器,所述关键信号处理板获取所述多路服务器的子服务器信息,当接收到任意一个PBI板发送的第三系统错误信号时,根据所述子服务器信息确定与发送所述第三系统错误信号的PBI板属于同一个子服务器的其它各个PBI板;同时向所述子服务器中除发送所述第三系统错误信号的PBI板之外的其它各个PBI板发送作为与所述第三系统错误信号对应的回应信号的第四系统错误信号。If the multiplex server is divided into at least two sub-servers, the key signal processing board acquires sub-server information of the multiplex server, and when receiving a third system error signal sent by any one of the PBI boards, according to the sub-server The server information determines that each PBI board belonging to the same sub-server as the PBI board that transmits the third system error signal; and each PBI other than the PBI board that transmits the third system error signal to the sub-server The board transmits a fourth system error signal as a response signal corresponding to the third system error signal.
  10. 根据权利要求9所述的方法,其特征在于,所述方法还包括:The method of claim 9 wherein the method further comprises:
    当检测到当前PBI板发送的第一系统错误信号或第三系统错误由有效状态变为无效状态时,所述关键信号处理板接收下一个第一系统错误信号或第三系统错误信号。 The key signal processing board receives the next first system error signal or the third system error signal when it is detected that the first system error signal or the third system error transmitted by the current PBI board changes from the active state to the inactive state.
PCT/CN2015/093586 2014-12-16 2015-11-02 Multi-path server and signal processing method thereof WO2016095613A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410781480.X 2014-12-16
CN201410781480.XA CN104503947B (en) 2014-12-16 2014-12-16 Multipath server and its signal processing method

Publications (1)

Publication Number Publication Date
WO2016095613A1 true WO2016095613A1 (en) 2016-06-23

Family

ID=52945345

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/093586 WO2016095613A1 (en) 2014-12-16 2015-11-02 Multi-path server and signal processing method thereof

Country Status (2)

Country Link
CN (2) CN108345562B (en)
WO (1) WO2016095613A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108345562B (en) * 2014-12-16 2022-03-29 超聚变数字技术有限公司 Multi-path server and signal processing method thereof
CN106446311B (en) * 2015-08-10 2019-09-13 杭州华为数字技术有限公司 CPU warning circuit and alarm method
CN105468119A (en) * 2015-11-19 2016-04-06 浪潮电子信息产业股份有限公司 Hardware realization method of reset circuit with multiple control terminals
CN105549706B (en) * 2015-12-11 2018-12-07 华为技术有限公司 A kind of method, apparatus and system of hot restart server
TWI790110B (en) * 2022-01-27 2023-01-11 神雲科技股份有限公司 High-reliability server and multi-party key signal control method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880583A (en) * 2012-08-01 2013-01-16 浪潮(北京)电子信息产业有限公司 Device and method for configuring dynamic link of multi-way server
CN103257950A (en) * 2013-04-23 2013-08-21 浪潮电子信息产业股份有限公司 Double-step initialization chip configuring method capable of achieving dynamic link usability conversion
CN104503947A (en) * 2014-12-16 2015-04-08 华为技术有限公司 Multi-server and signal processing method thereof
EP1662387B1 (en) * 2004-11-26 2015-09-16 Fujitsu Limited Server, method for controlling data communication of server, computer product

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030070064A1 (en) * 2001-10-09 2003-04-10 Te-Yu Liang Circuit and method capable of adjusting the external clock of a CPU
US7246178B2 (en) * 2002-05-07 2007-07-17 Nortel Networks Limited Methods and systems for changing a topology of a network
CN100426159C (en) * 2005-09-06 2008-10-15 台达电子工业股份有限公司 Controller with dual-processor
US8122278B2 (en) * 2009-04-28 2012-02-21 Oracle America, Inc. Clock skew measurement for multiprocessor systems
JP5139399B2 (en) * 2009-10-21 2013-02-06 株式会社東芝 REPRODUCTION DEVICE AND REPRODUCTION DEVICE CONTROL METHOD
CN102147640A (en) * 2010-02-05 2011-08-10 英业达股份有限公司 Server with a plurality of main boards
CN102375775B (en) * 2010-08-11 2014-08-20 英业达股份有限公司 Computer system unrecoverable error indication signal detection circuit
WO2012163026A1 (en) * 2011-10-27 2012-12-06 华为技术有限公司 Server power-on method, device and system
CN103246634B (en) * 2013-04-26 2017-02-08 华为技术有限公司 Method and device for configuring working modes of multi-processor system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1662387B1 (en) * 2004-11-26 2015-09-16 Fujitsu Limited Server, method for controlling data communication of server, computer product
CN102880583A (en) * 2012-08-01 2013-01-16 浪潮(北京)电子信息产业有限公司 Device and method for configuring dynamic link of multi-way server
CN103257950A (en) * 2013-04-23 2013-08-21 浪潮电子信息产业股份有限公司 Double-step initialization chip configuring method capable of achieving dynamic link usability conversion
CN104503947A (en) * 2014-12-16 2015-04-08 华为技术有限公司 Multi-server and signal processing method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"SUPER X80BN-F Platform with X80BN-F Baseboard, X80BN-CPU CPU Board, X80BN- BR 1 Bridge Card, USER'S MANUAL, Revision 1.1a", XBOBN-F, 9 December 2011 (2011-12-09) *

Also Published As

Publication number Publication date
CN108345562B (en) 2022-03-29
CN108345562A (en) 2018-07-31
CN104503947A (en) 2015-04-08
CN104503947B (en) 2018-03-16

Similar Documents

Publication Publication Date Title
WO2016095613A1 (en) Multi-path server and signal processing method thereof
US7761726B2 (en) Method and apparatus for fault tolerant time synchronization mechanism in a scaleable multi-processor computer
EP2226700B1 (en) Clock supply method and information processing apparatus
CN101594235B (en) Method for managing blade server based on SMBUS
US8527803B2 (en) System and method for multiple backplane time synchronization
CN104794078B (en) Cabinet server system and automatic addressing method thereof
TW201421258A (en) I2C bus structure and availability checking method
CN105577430A (en) Node management method of high-end fault-tolerant server
WO2019100701A1 (en) Sas switch controller extension architecture and design method thereof
TW201421260A (en) I2C bus structure and address management method
CN111506156B (en) Time service method and system of processor array
US7627774B2 (en) Redundant manager modules to perform management tasks with respect to an interconnect structure and power supplies
CN102231700A (en) Exchange card switching information transmission method and exchange card hot backup system
CN110096114B (en) System and method for managing multiple ARM server nodes
CN115686872B (en) Memory resource processing equipment, method, device and medium based on BMC (baseboard management controller)
JP2008004062A (en) Expandable multi-computer switching device
TW201201013A (en) Method and multiple computer system with a failover support to manage shared resources
US9612931B2 (en) System and method for synchronously controlling LED on multiple control modules based on a register synchronized with synchronous packets
CN210983372U (en) Hardware arbitration device under double-partition mode
TWI489287B (en) Multiplexer device and switching method therefor
CN106843444B (en) Multi-architecture CPU energy consumption control system
CN205247367U (en) Low -power consumption cloud server storage type node means
US9990323B2 (en) Configuring a communication interconnect for electronic devices
US20080148093A1 (en) Expandable exchange apparatus and a backup system thereof
CN113704148A (en) Server system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15869123

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15869123

Country of ref document: EP

Kind code of ref document: A1