CN116627729A - External connection cable, external connection cable in-place detection device, startup self-checking method and system - Google Patents

External connection cable, external connection cable in-place detection device, startup self-checking method and system Download PDF

Info

Publication number
CN116627729A
CN116627729A CN202310504486.1A CN202310504486A CN116627729A CN 116627729 A CN116627729 A CN 116627729A CN 202310504486 A CN202310504486 A CN 202310504486A CN 116627729 A CN116627729 A CN 116627729A
Authority
CN
China
Prior art keywords
pin
input
expansion board
cable
output expansion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310504486.1A
Other languages
Chinese (zh)
Inventor
池金波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202310504486.1A priority Critical patent/CN116627729A/en
Publication of CN116627729A publication Critical patent/CN116627729A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2205Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested
    • G06F11/2221Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested to test input/output devices or peripheral units
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2284Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by power-on test, e.g. power-on self test [POST]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Power Sources (AREA)

Abstract

The application belongs to the technical field of heterogeneous servers, and particularly provides an external cable, an external cable in-place detection device, a startup self-checking method and a startup self-checking system, wherein the device comprises: the first device comprises a plurality of first ports, the first ports are in butt joint with a first joint of the external cable, the first ports comprise first pins, and the first pins are in butt joint with first in-place pins of the first joint; the first pin is electrically connected with a detection power supply; the first pin is connected with an internal control device of the first equipment; the second device comprises a plurality of second ports, the second ports are in butt joint with a second joint of the external boundary cable, the second ports comprise second pins, the second pins are in butt joint with second in-place pins of the second joint, and the second in-place pins are electrically connected with the first in-place pins; the second pin is grounded. The application can ensure the detection accuracy, and the abnormal external connection cables can be rapidly positioned in the heterogeneous sensor, so that all the external connection cables can be used after the power-on.

Description

External connection cable, external connection cable in-place detection device, startup self-checking method and system
Technical Field
The application belongs to the technical field of heterogeneous servers, and particularly relates to an external cable, an external cable in-place detection device, a startup self-checking method and a startup self-checking system.
Background
The remarkable advantage of heterogeneous computing is that the optimization of performance and cost is achieved by letting the most appropriate dedicated hardware do the most appropriate to adjust the power consumption; the heterogeneous server is used as a hardware carrier for heterogeneous computation, and optimal cost performance and efficiency under typical scenes such as deep learning, image recognition, voice recognition and the like are realized through different collocation ratios of the CPU and the GPU.
For the heterogeneous server design of a machine head (CPU server) +a machine tail (GPU server), the method has the advantages of small occupied space, flexible collocation and relatively high efficiency in assembly and transportation; the system has the defects that the machine head and the machine tail are interconnected by using cables, the cables are positioned outside the machine case and are high-speed PCIe buses, the interfaces are more, and the heterogeneous server is abnormal in operation once a certain cable is in poor contact. The existing detection method for the external connection cable is that a link state indicator lamp is designed near the tail cable, and when the cable is not inserted, the corresponding interface is not link, so that the indicator lamp is not on.
The existing in-place detection method has the defects that link state indicator lamps are effective only when the system completes the enumeration of PCIe devices in the BIOS stage after the system is started, if the lamps are not found to be on, the cable is plugged in again, which is equivalent to hot plug, the indicator lamps are on, but because the system completes the enumeration, BDF is distributed, and then the downlink devices corresponding to the plug-in cable cannot be actually used or can report errors; the machine head needs to be restarted, and the constraint cannot be formed on site operators.
Disclosure of Invention
Aiming at the problem of inaccurate in-place verification in the prior art, the application provides an external cable, an in-place detection device of the external cable, a startup self-checking method and a startup self-checking system, so as to solve the technical problems.
In a first aspect, the present application provides an external cable comprising: the cable comprises a cable, and a first connector and a second connector which are respectively arranged at two ends of the cable, wherein the first connector comprises a first in-place pin, the second connector comprises a second in-place pin, and the first in-place pin is electrically connected with the second in-place pin; the length of the first bit pin is the minimum length of all pin lengths of the first connector, and the length of the second bit pin is the minimum length of all pin lengths of the second connector.
In a second aspect, the present application provides an external cable in-place detection apparatus, where the heterogeneous server includes a first device and a second device, and the apparatus includes:
the first device comprises a plurality of first ports, the first ports are in butt joint with a first joint of the external cable, the first ports comprise first pins, and the first pins are in butt joint with first in-place pins of the first joint; the first pin is electrically connected with a detection power supply; the first pin is connected with an internal control device of the first equipment;
the second device comprises a plurality of second ports, the second ports are in butt joint with a second joint of the external boundary cable, the second ports comprise second pins, the second pins are in butt joint with second in-place pins of the second joint, and the second in-place pins are electrically connected with the first in-place pins; the second pin is grounded.
Further, the internal control device comprises a first BMC and a first input/output expansion board of the first device, a first pin of the first port is connected with a corresponding pin of the first input/output expansion board, and the first input/output expansion board is connected with the first BMC of the first device.
Further, the internal control device is connected with an input port of a second input/output expansion board, and an output end of the second input/output expansion board is connected with a digital display, wherein the digital display comprises, but is not limited to, a nixie tube.
Further, in a state that the first device and the second device are respectively inserted into two ends of the external connection cable, the first BMC passes through the specific external connection cable and the second BMC of the second device.
In a third aspect, the present application provides a boot self-test method, including:
the method comprises the steps that after the first BMC is powered on, register data of a first input/output expansion board are captured preferentially, wherein the register data comprise level states of all pins of the first input/output expansion board;
based on the corresponding relation between each pin of the first input/output expansion board and the first port, converting the level state of each pin of the first input/output expansion board into the connection state of each external cable;
and displaying the external cable number in an abnormal connection state through the coding control digital display.
Further, based on the correspondence between each pin of the first input/output expansion board and the first port, converting the level state of each pin of the first input/output expansion board into the connection state of each external connection cable, including:
establishing a one-to-one mapping relation between a first port number and the pin number of the first input/output expansion board based on the connection relation between each pin of the first input/output expansion board and the first port in advance, and binding a second port number with an association relation by the first port number;
a rule for converting the level state into the connection state is preset, wherein the rule comprises that a high level state corresponds to an abnormal state and a low level state corresponds to a normal state;
and converting the level state of each pin of the first input/output expansion board into the connection state of the external connection cable between the corresponding first port and the corresponding second port based on the one-to-one mapping relation and the rule.
Further, displaying the external cable number in the abnormal connection state by the encoding control digital display includes:
generating a unique identity number for an external cable between a first port and a second port in advance;
and writing the identity number of the external cable with the abnormal connection state into a register of the second input/output expansion board so that the second input/output expansion board controls the nixie tube to display the identity number based on a nixie tube control rule.
Further, after converting the level state of each pin of the first input/output expansion board into the connection state of each external connection cable based on the correspondence between each pin of the first input/output expansion board and the first port, the method further includes:
judging whether an external cable in an abnormal connection state exists or not:
if yes, suspending executing the starting program;
if not, executing the startup procedure, and sending startup prompt information to the second BMC of the second device, wherein the startup prompt information is used for informing the second BMC to execute the startup procedure.
In a fourth aspect, the present application further provides a power-on self-checking system, including:
the data grabbing module is used for preferentially grabbing register data of the first input/output expansion board after the first BMC is electrified, wherein the register data comprise level states of all pins of the first input/output expansion board;
the data conversion module is used for converting the level state of each pin of the first input/output expansion board into the connection state of each external cable based on the corresponding relation between each pin of the first input/output expansion board and the first port;
and the data display module is used for controlling the digital display to display the external cable number in the abnormal connection state through the code.
Further, the data conversion module includes:
the mapping establishing unit is used for establishing a one-to-one mapping relation between the first port number and the pin number of the first input/output expansion board in advance based on the connection relation between each pin of the first input/output expansion board and the first port, and binding the first port number with a second port number with an association relation;
a rule setting unit, configured to preset a rule for converting a level state into a connection state, where the rule includes that a high level state corresponds to an abnormal state and a low level state corresponds to a normal state;
and the data conversion unit is used for converting the level state of each pin of the first input/output expansion board into the connection state of the external connection cable between the corresponding first port and the corresponding second port based on the one-to-one mapping relation and the rule.
Further, the data display module includes:
the serial number generating unit is used for generating a unique identity number for the external cable between the first port and the second port in advance;
the number display unit is used for writing the identity number of the external cable with the abnormal connection state into a register of the second input/output expansion board so that the second input/output expansion board controls the nixie tube to display the identity number based on a nixie tube control rule.
Further, the system further comprises:
the state judging module is used for judging whether an external cable in an abnormal connection state exists or not;
the first execution module is used for suspending execution of the starting program if an external cable in an abnormal connection state exists;
the second execution module is used for executing the startup program if the external cable in the abnormal connection state does not exist, and sending startup prompt information to the second BMC of the second device, wherein the startup prompt information is used for notifying the second BMC to execute the startup program.
The external cable, the external cable on-site detection device, the startup self-checking method and the startup self-checking system have the advantages that the external cable is arranged in the on-site detection loop, the on-site detection loop is connected with the internal control device of the first equipment at the first pin of the first port of the first equipment end, the internal control device judges whether the external cable is on site or not based on the on-off state of the loop, the detection accuracy can be ensured, the abnormal external cable can be rapidly positioned in the heterogeneous sensor, and all the external cables can be ensured to be available after startup.
In addition, the application has reliable design principle, simple structure and very wide application prospect.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required to be used in the description of the embodiments or the prior art will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
Fig. 1 is a schematic structural view of an external connection cable in-place detecting device according to an embodiment of the present application.
Fig. 2 is a schematic structural view of an external connection cable according to an embodiment of the present application.
FIG. 3 is an exemplary flow chart of a power-on self-test method according to one embodiment of the application.
FIG. 4 is another exemplary flow chart of a power-on self-test method according to one embodiment of the application.
FIG. 5 is an exemplary block diagram of a power-on self-test system according to one embodiment of the application.
1, a golden finger; 2. an in-place pin; 3. a detection line; 4. I2C bus.
Detailed Description
In order to make the technical solution of the present application better understood by those skilled in the art, the technical solution of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present application without making any inventive effort, shall fall within the scope of the present application.
It should be noted that, without conflict, the embodiments of the present application and features of the embodiments may be combined with each other.
In the description of the present application, it should be understood that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or number of technical features indicated. Thus, a feature defining "a first", "a second", etc. may explicitly or implicitly include one or more such feature. In the description of the present application, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present application, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art in a specific case.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used herein in the description of the application is for the purpose of describing particular embodiments only and is not intended to be limiting of the application.
The following explains key terms appearing in the present application.
BMC, execution server remote management controller, english name Baseboard Management controller. The method can perform firmware upgrade, check machine equipment and other operations on the machine in a state that the machine is not started. Fully implementing IPMI functionality in a BMC requires a powerful 16-bit or 32-bit microcontroller and RAM for data storage, flash memory for non-volatile data storage, and firmware to provide basic remote manageability in terms of secure remote reboot, secure re-power-up, LAN alerting, and system health monitoring. In addition to the basic IPMI and system operation monitoring functions, the mBMC can also enable BIOS flash element selection and protection by storing the previous BIOS using one of the 2 flash memories. For example, when the system fails to boot after a remote BIOS upgrade, the remote administrator may switch back to the previously-working BIOS image to boot the system. Once BIOS is upgraded, the BIOS image can be locked, so as to effectively prevent virus from invading it.
The I2C bus is a simple, bi-directional two-wire synchronous serial bus developed by Philips corporation. It requires only two wires to transfer information between devices connected to the bus. The master device is used to initiate the bus transfer of data and generate a clock to open the transfer device, where any addressed device is considered a slave device. If the host computer is to send data to the slave device, the host computer firstly addresses the slave device, then actively sends the data to the slave device, and finally the host computer terminates the data transmission; if the host is to receive data from the slave, the slave is addressed by the master first, then the host receives data sent by the slave, and finally the host terminates the receiving process. In this case, the host is responsible for generating the timing clock and terminating the data transfer.
BIOS is an acronym of English "Basic Input Output System", and the Chinese name is "basic input output System" after being translated. On IBM PC compatible systems, is a industry standard firmware interface. The computer is a set of programs solidified on a ROM chip on a main board in the computer, which stores the most important programs of basic input and output, self-checking programs after starting up and system self-starting programs, and can read and write specific information of system settings from CMOS. Its main function is to provide the lowest, most direct hardware setup and control for the computer. In addition, the BIOS provides some system parameters to the operating system. The system hardware changes are hidden by the BIOS and the program uses the BIOS functions rather than directly controlling the hardware. Modern operating systems ignore the abstraction layer provided by the BIOS and directly control the hardware components.
A CPU central processing unit (central processing unit, abbreviated as CPU) is used as an operation and control core of the computer system, and is a final execution module for information processing and program running.
Graphics processors (English: graphics processing unit, abbreviated: GPU), also known as display cores, vision processors, display chips, are microprocessors that are dedicated to performing image and graphics related operations on personal computers, workstations, gaming machines, and some mobile devices (e.g., tablet computers, smartphones, etc.).
The application will be described in detail below with reference to the drawings in connection with embodiments.
The application also provides an external cable shown in fig. 1, which comprises the following specific structures: the cable comprises a cable, and a first connector and a second connector which are respectively arranged at two ends of the cable, wherein the first connector comprises a first in-place pin, the second connector comprises a second in-place pin, and the first in-place pin is electrically connected with the second in-place pin; the length of the first bit pin is the minimum length of all pin lengths of the first connector, and the length of the second bit pin is the minimum length of all pin lengths of the second connector.
The high-speed cable interconnection (taking MiniSAS-HD as an example) is used between the machine head and the machine tail, the number of cables is determined according to the machine type and configuration, the golden finger 1 at the two ends of the cable is respectively provided with an on-site pin 2, the length of the on-site pin 2 is shorter than that of other information pins, the on-site pin 2 is contacted with the corresponding pin of the connector only when the cable port is completely inserted into the board end connector, the on-site pins 2 of the golden finger at the two ends are communicated by a detection line (cable) 3, and meanwhile, the cable is provided with an I2C bus 4.
On the basis of the external cable, an external cable in-place detection device shown in fig. 2 is provided, the GPU service device of the heterogeneous server is defined as a first device, the CPU service device of the heterogeneous server is defined as a second device, and a plurality of external cables are connected between the first device and the second device. The concrete structure comprises:
the first device comprises a plurality of first ports, the first ports are in butt joint with a first joint of the external cable, the first ports comprise first pins, and the first pins are in butt joint with first in-place pins of the first joint; the first pin is electrically connected with a detection power supply; the first pin is connected with an internal control device of the first equipment; the second device comprises a plurality of second ports, the second ports are in butt joint with a second joint of the external boundary cable, the second ports comprise second pins, the second pins are in butt joint with second in-place pins of the second joint, and the second in-place pins are electrically connected with the first in-place pins; the second pin is grounded.
The external cable, the butt joint ports of the first equipment and the second equipment and the power supply on the circuit form a detection loop, when the external cable is inserted, the detection loop is a passage after the server is electrified, and if the external cable is not inserted, the detection loop is a circuit breaker. The internal control device of the first equipment can judge the in-place state of the external connection cable by monitoring the on-off state of the detection loop.
In one embodiment, the internal control device includes a first BMC and a first input output expansion board (IO Expander) of the first device, a first pin of the first port being connected to a corresponding pin of the first input output expansion board, the first input output expansion board being connected to the first BMC of the first device. Through taking the BMC as the core chip of the internal control device and matching with the first input/output expansion board, the accurate positioning of the cable which is not located outside the site can be realized, meanwhile, the hardware is not required to be greatly modified, and only one IO Expander is required to be added.
In the existing external connection cable positioning detection method, if the software detects the in-place condition by judging whether communication is performed by ping or not, a large amount of log alarm information can be generated when the external connection cable is out of place, and in the mode, operation and maintenance personnel are required to analyze the log, and then the abnormal external connection cable is positioned according to the equipment type corresponding to the external connection cable. The log alarming mode is not intuitive, and based on the log alarming mode, in one embodiment of the application, the internal control device is connected with the input port of the second input/output expansion board, and the output end of the second input/output expansion board is connected with a digital display, and the digital display comprises, but is not limited to, a nixie tube.
Specifically, a cable in-place pin of the high-speed connector at the machine head end is connected with GND, a cable in-place detection circuit is designed at the machine tail, an IO Expander pin is connected with the cable in-place pin of the high-speed connector, and the IO Expander is connected with the BMC at the machine tail through an I2C bus; when the external high-speed cable is not inserted or one end of the cable is not inserted, the pin corresponding to the IO Expander at the tail end is in a high level, when both ends of the high-speed cable are inserted, the pin corresponding to the IO Expander at the tail end is in a low level, and the BMC can acquire the connection condition of each cable by polling the IO Expander register through the I2C bus. The front window at the tail end of the machine is provided with two 8-bit nixie tubes for polling and displaying the serial numbers of the external cables which are not connected correctly, each field of the nixie tubes is controlled by an independent pin of the IO Expander, BMC codes 0-9 which needs to be displayed, and the numerical value is displayed by writing the coding value into the IO Expander register through the I2C bus, so that on-site operators can acquire the cable information of abnormal connection quickly and intuitively.
In addition, when the second device detects that the external cable is in place, the first device needs to be notified, so in one embodiment of the present application, in a state that the first device and the second device are respectively inserted into two ends of the external cable, the first BMC passes through the specific external cable and the second BMC of the second device. Specifically, the IPMB protocol is used for data interaction between the machine head and the machine tail, and the machine head BMC controls the starting time of the machine head by reading the in-place information of the external cable detected by the machine tail BMC.
The first input/output expansion board and the second input/output expansion board can use an IO Expander or CPLD GPIO.
The detection process of the external connection cable in-place detection device comprises the following steps: after the tail standby power supply is electrified, firstly starting the tail BMC, and controlling all indicator lamps of the tail to be extinguished before the tail BMC detects that all cables are in place; if the external cables are not connected, the nixie tube polls and displays the serial numbers of the cables which are not inserted, so that field personnel can identify abnormality, and the corresponding cable connection is checked according to the serial numbers of the nixie tube; and after all the cables are correctly connected, the front window nixie tube is extinguished, other indicator lamps normally display, and the tail of the machine is normally started. After the power supply of the machine head is powered on, firstly the machine head BMC is started, cable in-place information detected by the machine tail BMC is obtained through the IPMB bus, if the cable is not in place, all the indicator lamps of the machine head are turned off until all the cables detected by the machine tail BMC are in place, the machine head indicator lamps are normally displayed, then the machine head is started after waiting for the standby tail to start, and after the machine tail is started, the machine head is normally started.
The boot self-checking method provided by the embodiment of the application is executed by the computer equipment, and correspondingly, the boot self-checking system is operated in the computer equipment.
Fig. 3 is a schematic flow chart of a method of one embodiment of the application. The execution body of fig. 3 may be a power-on self-checking system. The order of the steps in the flow chart may be changed and some may be omitted according to different needs.
As shown in fig. 3, the method includes:
step 310, after the first BMC is powered on, preferentially grabbing register data of the first input/output expansion board, wherein the register data comprises level states of all pins of the first input/output expansion board;
step 320, converting the level state of each pin of the first input/output expansion board into the connection state of each external cable based on the corresponding relation between each pin of the first input/output expansion board and the first port;
and 330, displaying the external cable number in the abnormal connection state by the encoding control digital display.
In order to facilitate understanding of the present application, the power-on self-inspection method provided by the present application is further described below by using the principle of the power-on self-inspection method of the present application, and combining the process of performing power-on self-inspection on heterogeneous servers in the embodiment.
The external cable is a PCIe bus, which is different from a hot plug supporting bus such as an SAS/SATA, an optical port and a network port, the insertion of the external cable at any time before and after the startup has no influence on the system, and the PCIe bus between the tail and the machine head must ensure reliable connection before the startup, otherwise, the corresponding equipment cannot be identified. In particular. Referring to fig. 4, the power-on self-checking method includes:
s1, after the first BMC is powered on, the register data of the first input/output expansion board are captured preferentially, wherein the register data comprise the level states of all pins of the first input/output expansion board.
After the first BMC of the first device is powered on, a detection program is firstly executed, namely, data of the level states of all pins in a register of the first input/output expansion board are read through an I2C bus.
After the heterogeneous server is electrified, a detection power supply of the external cable in the position detection device supplies power for the detection loop, and at the moment, if the external cable is in a well-inserted state, the level state of each pin of the first input/output expansion board is required to be low level.
After the machine is electrified and before the machine is started, all the indicator lamps (power indicator lamps, health indicator lamps and the like) are kept off before the correct connection of the external cables is confirmed, so that field operators are warned of abnormality, and the investigation is continued.
S2, based on the corresponding relation between each pin of the first input/output expansion board and the first port, converting the level state of each pin of the first input/output expansion board into the connection state of each external connection cable.
Establishing a one-to-one mapping relation between a first port number and the pin number of the first input/output expansion board based on the connection relation between each pin of the first input/output expansion board and the first port in advance, and binding a second port number with an association relation by the first port number; a rule for converting the level state into the connection state is preset, wherein the rule comprises that a high level state corresponds to an abnormal state and a low level state corresponds to a normal state; and converting the level state of each pin of the first input/output expansion board into the connection state of the external connection cable between the corresponding first port and the corresponding second port based on the one-to-one mapping relation and the rule.
For example, when no external high-speed cable is inserted or one end of the cable is not inserted, a pin corresponding to the tail end IO Expander is at a high level, when both ends of the high-speed cable are inserted, a pin corresponding to the tail end IO Expander is at a low level, and the BMC can acquire the connection condition of each cable by polling the IO Expander register through the I2C bus.
S3, displaying the external cable number in an abnormal connection state through the coding control digital display.
Generating a unique identity number for an external cable between a first port and a second port in advance; and writing the identity number of the external cable with the abnormal connection state into a register of the second input/output expansion board so that the second input/output expansion board controls the nixie tube to display the identity number based on a nixie tube control rule.
For example, the front window at the tail end of the machine is provided with two 8-bit nixie tubes for polling and displaying the serial numbers of the external cables which are not connected correctly, each field of the nixie tubes is controlled by an independent pin of the IO Expander, BMC codes 0-9 which needs to be displayed, and the numerical values are displayed by writing coded values into the IO Expander register through the I2C bus, so that on-site operators can acquire the cable information of abnormal connection quickly and intuitively.
The on-site information of the two ends of the external connection cable is detected, and the serial number of the cable with abnormal connection is visually displayed to the outside through the nixie tube, so that a quick and accurate checking direction is provided for on-site operators, and the dependence on the quality and the proficiency of the operators is avoided.
S4, the first BMC of the first device sends the in-place detection result to the second BMC of the second device, and the first BMC and the second BMC negotiate to start up based on the detection result.
The first BMC judges whether an external cable in an abnormal connection state exists or not: if yes, suspending executing the starting program; if not, executing the startup procedure, and sending startup prompt information to the second BMC of the second device, wherein the startup prompt information is used for informing the second BMC to execute the startup procedure.
After the tail standby power supply is electrified, firstly starting the tail BMC, and controlling all indicator lamps of the tail to be extinguished before the tail BMC detects that all cables are in place; if the external cables are not connected, the nixie tube polls and displays the serial numbers of the cables which are not inserted, so that field personnel can identify abnormality, and the corresponding cable connection is checked according to the serial numbers of the nixie tube; and after all the cables are correctly connected, the front window nixie tube is extinguished, other indicator lamps normally display, and the tail of the machine is normally started. After the power supply of the machine head is powered on, firstly the machine head BMC is started, cable in-place information detected by the machine tail BMC is obtained through the IPMB bus, if the cable is not in place, all the indicator lamps of the machine head are turned off until all the cables detected by the machine tail BMC are in place, the machine head indicator lamps are normally displayed, then the machine head is started after waiting for the standby tail to start, and after the machine tail is started, the machine head is normally started.
Whether the cable is correctly connected is used as a constraint condition for controlling the starting-up of the machine head and the machine tail, so that a large number of repair worksheets caused by nonstandard operation in the processes of putting on the shelf and moving the server are avoided, and the robustness of the system and the convenience of putting on the shelf and maintaining are improved.
In some embodiments, the power-on self-test system 500 may include a plurality of functional modules consisting of computer program segments. The computer program of each program segment in the power-on self-test system 500 may be stored in a memory of a computer device and executed by at least one processor to perform the function of power-on self-test (as described in detail with reference to fig. 3).
In this embodiment, the power-on self-checking system 500 may be divided into a plurality of functional modules according to the functions performed by the power-on self-checking system, as shown in fig. 5. The functional module may include: a data grabbing module 510, a data conversion module 520 and a data display module 530. The module referred to in the present application refers to a series of computer program segments capable of being executed by at least one processor and of performing a fixed function, stored in a memory. In the present embodiment, the functions of the respective modules will be described in detail in the following embodiments.
The data grabbing module 510 is configured to grab register data of the first input/output expansion board after the first BMC is powered on, where the register data includes a level state of each pin of the first input/output expansion board;
the data conversion module 520 is configured to convert a level state of each pin of the first input/output expansion board into a connection state of each external cable based on a correspondence between each pin of the first input/output expansion board and the first port;
the data display module 530 is configured to control the digital display to display the external cable number in the abnormal connection state through the code.
Optionally, as an embodiment of the present application, the data conversion module includes:
the mapping establishing unit is used for establishing a one-to-one mapping relation between the first port number and the pin number of the first input/output expansion board in advance based on the connection relation between each pin of the first input/output expansion board and the first port, and binding the first port number with a second port number with an association relation;
a rule setting unit, configured to preset a rule for converting a level state into a connection state, where the rule includes that a high level state corresponds to an abnormal state and a low level state corresponds to a normal state;
and the data conversion unit is used for converting the level state of each pin of the first input/output expansion board into the connection state of the external connection cable between the corresponding first port and the corresponding second port based on the one-to-one mapping relation and the rule.
Optionally, as an embodiment of the present application, the data display module includes:
the serial number generating unit is used for generating a unique identity number for the external cable between the first port and the second port in advance;
the number display unit is used for writing the identity number of the external cable with the abnormal connection state into a register of the second input/output expansion board so that the second input/output expansion board controls the nixie tube to display the identity number based on a nixie tube control rule.
Optionally, as an embodiment of the present application, the system further includes:
the state judging module is used for judging whether an external cable in an abnormal connection state exists or not;
the first execution module is used for suspending execution of the starting program if an external cable in an abnormal connection state exists;
the second execution module is used for executing the startup program if the external cable in the abnormal connection state does not exist, and sending startup prompt information to the second BMC of the second device, wherein the startup prompt information is used for notifying the second BMC to execute the startup program.
Therefore, the external cable is arranged in the on-site detection loop, the first pin of the first port of the on-site detection loop at the first equipment end is connected with the internal control device of the first equipment, the internal control device judges whether the external cable is on site or not based on the on-off state of the loop, the detection accuracy can be ensured by the mode, the abnormal external cable can be rapidly positioned in the heterogeneous sensor, all the external cables can be ensured to be available after the power-on, and the technical effects achieved by the embodiment can be seen from the description above and are not repeated.
The same or similar parts between the various embodiments in this specification are referred to each other. In particular, for the terminal embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and reference should be made to the description in the method embodiment for relevant points.
In the several embodiments provided by the present application, it should be understood that the disclosed systems and methods may be implemented in other ways. For example, the system embodiments described above are merely illustrative, e.g., the division of the modules is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple modules or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with respect to each other may be through some interface, indirect coupling or communication connection of systems or modules, electrical, mechanical, or other form.
The modules described as separate components may or may not be physically separate, and components shown as modules may or may not be physical modules, i.e., may be located in one place, or may be distributed over a plurality of network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional module in each embodiment of the present application may be integrated into one processing module, or each module may exist alone physically, or two or more modules may be integrated into one module.
Although the present application has been described in detail by way of preferred embodiments with reference to the accompanying drawings, the present application is not limited thereto. Various equivalent modifications and substitutions may be made in the embodiments of the present application by those skilled in the art without departing from the spirit and scope of the present application, and it is intended that all such modifications and substitutions be within the scope of the present application/be within the scope of the present application as defined by the appended claims. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. An add-on cable, comprising: the cable comprises a cable, and a first connector and a second connector which are respectively arranged at two ends of the cable, wherein the first connector comprises a first in-place pin, the second connector comprises a second in-place pin, and the first in-place pin is electrically connected with the second in-place pin; the length of the first bit pin is the minimum length of all pin lengths of the first connector, and the length of the second bit pin is the minimum length of all pin lengths of the second connector.
2. An external cable in-place detection device applied to the external cable as claimed in claim 1, wherein the heterogeneous server comprises a first device and a second device, the device comprising:
the first device comprises a plurality of first ports, the first ports are in butt joint with a first joint of the external cable, the first ports comprise first pins, and the first pins are in butt joint with first in-place pins of the first joint; the first pin is electrically connected with a detection power supply; the first pin is connected with an internal control device of the first equipment;
the second device comprises a plurality of second ports, the second ports are in butt joint with a second joint of the external boundary cable, the second ports comprise second pins, the second pins are in butt joint with second in-place pins of the second joint, and the second in-place pins are electrically connected with the first in-place pins; the second pin is grounded.
3. The apparatus of claim 2, wherein the internal control device comprises a first BMC of the first device and a first input-output expansion board, a first pin of the first port being connected to a corresponding pin of the first input-output expansion board, the first input-output expansion board being connected to the first BMC of the first device.
4. The apparatus of claim 2, wherein the internal control device is connected to an input port of a second input output expansion board, and an output port of the second input output expansion board is connected to a digital display, the digital display including but not limited to a nixie tube.
5. The apparatus of claim 3, wherein the first BMC passes through a second BMC of a specific external cable and the second device in a state that both ends of the external cable are inserted into the first device and the second device, respectively.
6. A power-on self-checking method based on the external connection cable in-place detection device according to any one of claims 2 to 5, characterized by comprising:
the method comprises the steps that after the first BMC is powered on, register data of a first input/output expansion board are captured preferentially, wherein the register data comprise level states of all pins of the first input/output expansion board;
based on the corresponding relation between each pin of the first input/output expansion board and the first port, converting the level state of each pin of the first input/output expansion board into the connection state of each external cable;
and displaying the external cable number in an abnormal connection state through the coding control digital display.
7. The method of claim 6, wherein converting the level state of each pin of the first input output expansion board to the connection state of each external cable based on the correspondence of each pin of the first input output expansion board to the first port comprises:
establishing a one-to-one mapping relation between a first port number and the pin number of the first input/output expansion board based on the connection relation between each pin of the first input/output expansion board and the first port in advance, and binding a second port number with an association relation by the first port number;
a rule for converting the level state into the connection state is preset, wherein the rule comprises that a high level state corresponds to an abnormal state and a low level state corresponds to a normal state;
and converting the level state of each pin of the first input/output expansion board into the connection state of the external connection cable between the corresponding first port and the corresponding second port based on the one-to-one mapping relation and the rule.
8. The method of claim 6, wherein controlling the digital display to display the external cable number in the abnormal connection state by the code comprises:
generating a unique identity number for an external cable between a first port and a second port in advance;
and writing the identity number of the external cable with the abnormal connection state into a register of the second input/output expansion board so that the second input/output expansion board controls the nixie tube to display the identity number based on a nixie tube control rule.
9. The method of claim 6, wherein after converting the level state of each pin of the first input-output expansion board to the connection state of each external cable based on the correspondence of each pin of the first input-output expansion board to the first port, the method further comprises:
judging whether an external cable in an abnormal connection state exists or not:
if yes, suspending executing the starting program;
if not, executing the startup procedure, and sending startup prompt information to the second BMC of the second device, wherein the startup prompt information is used for informing the second BMC to execute the startup procedure.
10. A power-on self-test system, comprising:
the data grabbing module is used for preferentially grabbing register data of the first input/output expansion board after the first BMC is electrified, wherein the register data comprise level states of all pins of the first input/output expansion board;
the data conversion module is used for converting the level state of each pin of the first input/output expansion board into the connection state of each external cable based on the corresponding relation between each pin of the first input/output expansion board and the first port;
and the data display module is used for controlling the digital display to display the external cable number in the abnormal connection state through the code.
CN202310504486.1A 2023-05-06 2023-05-06 External connection cable, external connection cable in-place detection device, startup self-checking method and system Pending CN116627729A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310504486.1A CN116627729A (en) 2023-05-06 2023-05-06 External connection cable, external connection cable in-place detection device, startup self-checking method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310504486.1A CN116627729A (en) 2023-05-06 2023-05-06 External connection cable, external connection cable in-place detection device, startup self-checking method and system

Publications (1)

Publication Number Publication Date
CN116627729A true CN116627729A (en) 2023-08-22

Family

ID=87640913

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310504486.1A Pending CN116627729A (en) 2023-05-06 2023-05-06 External connection cable, external connection cable in-place detection device, startup self-checking method and system

Country Status (1)

Country Link
CN (1) CN116627729A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116990725A (en) * 2023-09-27 2023-11-03 成都电科星拓科技有限公司 Cable in-place signal indication system and method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116990725A (en) * 2023-09-27 2023-11-03 成都电科星拓科技有限公司 Cable in-place signal indication system and method
CN116990725B (en) * 2023-09-27 2023-12-12 成都电科星拓科技有限公司 Cable in-place signal indication system and method

Similar Documents

Publication Publication Date Title
CN108829567A (en) Support the monolithic NVMe hard disk backboard of dual lighting and the ignition method of hard disk backboard
US20080046706A1 (en) Remote Monitor Module for Computer Initialization
TW201109913A (en) Main system board error-detecting system and its pluggable error-detecting board
CN102880527B (en) Data recovery method of baseboard management controller
CN116627729A (en) External connection cable, external connection cable in-place detection device, startup self-checking method and system
CN112069766A (en) Method and device for reducing cables of hard disk backboard in server
CN104679626A (en) System and method for debugging and detecting BIOS (Basic Input / Output System)
CN211505789U (en) PCIE board card testing arrangement
CN107544883B (en) Diagnostic device, system and method of server
CN114003445A (en) I2C monitoring function test method, system, terminal and storage medium of BMC
CN115098342A (en) System log collection method, system, terminal and storage medium
CN116010141A (en) Method, device and medium for positioning starting abnormality of multipath server
CN115729872A (en) Computing device and detection method for PCIE cable connection
CN113849267A (en) Virtual display method, system, terminal and storage medium for display card
CN114253573A (en) PCIe device firmware batch upgrading method, system, terminal and storage medium
CN112596983A (en) Monitoring method for connector in server
TWM598968U (en) Out-of-band external control equipment and system
CN104572423A (en) Debugging system and debugging device and method thereof
CN211149437U (en) OCP3.0 hot plug framework
CN111338911A (en) Server hard disk indicator lamp control system and control method thereof
CN114253561B (en) System and method for automatically burning firmware
CN215987289U (en) Abnormal display device for server equipment
CN111324495B (en) Host machine starting-up detection method and system thereof
CN117608933A (en) Method, device, equipment and medium for identifying silk screen printing of PCIe card case of server
CN114857069A (en) Fan smelting tool plate

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination