CN215181972U - System for rapidly judging reason of abnormal shutdown of server - Google Patents

System for rapidly judging reason of abnormal shutdown of server Download PDF

Info

Publication number
CN215181972U
CN215181972U CN202121504327.4U CN202121504327U CN215181972U CN 215181972 U CN215181972 U CN 215181972U CN 202121504327 U CN202121504327 U CN 202121504327U CN 215181972 U CN215181972 U CN 215181972U
Authority
CN
China
Prior art keywords
bmc
port
cpld
signal
reason
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202121504327.4U
Other languages
Chinese (zh)
Inventor
岳远斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202121504327.4U priority Critical patent/CN215181972U/en
Application granted granted Critical
Publication of CN215181972U publication Critical patent/CN215181972U/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The utility model provides a system for judge server unusual shutdown reason fast, the utility model discloses a trigger the signal of shutting down and BMC remote control with the hardware button, after handling through CPLD, be connected to BMC's two different GP IO respectively, BMC distinguishes the log of record different grade type according to different GP IO, when taking place unusual shutdown problem, just can come the reason of analysis positioning problem through the trigger source of record in the BMC log. The utility model discloses a hardware description language realizes the filtration of signal, avoids the false triggering of signal, utilizes existing chip in the at utmost, practices thrift the cost, effectively improves failure diagnosis's ageing and accuracy, improves the competitiveness of customer satisfaction and product.

Description

System for rapidly judging reason of abnormal shutdown of server
Technical Field
The utility model relates to a data center technical field, especially a system for judge server unusual shutdown reason fast.
Background
With the advent of the era of big data, cloud computing and artificial intelligence, the internet traffic has grown dramatically, the amount of computation and the frequency of computation have increased, and in a server system, the traffic computation has increased, so that the carrying pressure of the server has increased, under such circumstances, a higher requirement has been put forward on the stability of the server, if the server is abnormally shut down due to unstable conditions during operation, the consequences are service interruption and data loss of the client, and the loss caused by abnormal shut down is unpredictable. To avoid this multiple occurrence, it is necessary to analyze the cause of the malfunctioning machine to find the factors that trigger the problem.
In a server system, a BMC is generally used to monitor and manage the health of a motherboard. Some important parameters of the mainboard core component, such as voltage, temperature, power consumption, fan rotating speed and the like, are monitored and recorded through the BMC. In the operation process, if the BMC monitors that some parameters are abnormal, an alarm log is recorded, alarm information is transmitted to a remote operation and maintenance server, a client can sense fault information, maintenance operation can be carried out in time, and large accidents are avoided. There are various reasons for abnormal shutdown of the server, such as abnormal triggering of a power on/off button, abnormal control of a BMC remote command, and the like. When the abnormal shutdown is recorded in the BMC, the recorded formats are consistent, and when the abnormal shutdown occurs, there is no way to distinguish whether the abnormal shutdown is triggered by a hardware startup and shutdown signal or by a BMC remote command, so that a relatively large obstruction is generated to problem analysis and judgment, and the fault processing efficiency is seriously influenced.
SUMMERY OF THE UTILITY MODEL
The utility model aims at providing a system for judge server unusual shutdown reason fast aims at solving the problem that can't distinguish the trigger source of shutting down when unusual among the prior art shuts down, realizes effectively improving failure diagnosis's ageing and accuracy, reduce cost.
In order to achieve the above technical purpose, the utility model provides a system for judge the reason of the unusual shutdown of server fast, the system includes:
the CPLD end is provided with an IN1 port, an IN2 port, an OUT1 port and an OUT2 port, the IN1 port is connected with a hardware key signal, and the IN2 port is connected with a BMC remote control output signal;
the BMC end is provided with GPIO1 and GPIO2 ports, the GPIO1 port is connected with the OUT1 port of the CPLD, and the GPIO2 port is connected with the OUT2 port of the CPLD.
Preferably, the CPLD filters the hardware key signal and the BMC remote control output signal.
Preferably, the BMC generates a log record according to signals received by the two GPIO ports.
Preferably, the BMC distinguishes and records different types of logs according to different GPIO ports.
The effects provided in the contents of the present invention are only the effects of the embodiments, not all the effects of the present invention, and one of the above technical solutions has the following advantages or advantageous effects:
compared with the prior art, the utility model discloses a trigger the signal of shutting down and BMC remote control with the hardware button, after handling through CPLD, be connected to BMC's two different GPIOs respectively, BMC distinguishes the log of record different grade type according to the GPIO of difference, when taking place unusual shutdown problem, just can analyze the reason of location problem through the trigger source of record in the BMC log. The utility model discloses a hardware description language realizes the filtration of signal, avoids the false triggering of signal, utilizes existing chip in the at utmost, practices thrift the cost, effectively improves failure diagnosis's ageing and accuracy, improves the competitiveness of customer satisfaction and product.
Drawings
Fig. 1 is a block diagram of a system structure for quickly determining a reason for abnormal shutdown of a server according to an embodiment of the present invention.
Detailed Description
In order to clearly illustrate the technical features of the present invention, the present invention is explained in detail by the following embodiments in combination with the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. In order to simplify the disclosure of the present invention, the components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and processes are omitted so as to not unnecessarily limit the invention.
The following describes in detail a system for quickly determining a reason for abnormal shutdown of a server according to an embodiment of the present invention with reference to the accompanying drawings.
As shown in fig. 1, the utility model discloses a system for judge the reason of unusual shutdown of server fast, the system includes:
the CPLD end is provided with an IN1 port, an IN2 port, an OUT1 port and an OUT2 port, the IN1 port is connected with a hardware key signal, and the IN2 port is connected with a BMC remote control output signal;
the BMC end is provided with GPIO1 and GPIO2 ports, the GPIO1 port is connected with the OUT1 port of the CPLD, and the GPIO2 port is connected with the OUT2 port of the CPLD.
Hardware circuit separation is designed, ports IN1 and IN2, ports OUT1 and OUT2 are arranged IN the CPLD, a hardware key signal is connected to the port IN1 of the CPLD, a BMC remote control output signal is connected to the port IN2 of the CPLD, and the hardware key signal and the BMC remote control output signal are filtered inside the CPLD, so that false triggering is avoided.
After the CPLD is processed, the CPLD is respectively output to two different GPIO ports of the BMC from two independent ports OUT1 and OUT2 of the CPLD, and the BMC generates log records according to signals received by the two GPIO ports, so that the BMC can distinguish and record different types of logs according to different GPIO ports. And setting the GPIO1 port as a log port for recording that the trigger source is abnormal of the on-off key, setting the GPIO2 port as a log port for recording that the trigger source is a BMC remote command, and determining the trigger source according to logs recorded by different GPIO ports when the abnormal trigger of the on-off key or the BMC remote command is triggered. Therefore, when an abnormal shutdown problem occurs, the reason of the problem can be analyzed and positioned through the trigger source recorded in the BMC log.
The utility model discloses a trigger the signal of shutting down and BMC remote control with the hardware button and trigger the signal of shutting down, after handling through CPLD, be connected to BMC's two different GPIOs respectively, BMC distinguishes the log of record different grade type according to the GPIO of difference, when taking place unusual shutdown problem, just can come the reason of analysis positioning problem through the trigger source of record in the BMC log. The utility model discloses a hardware description language realizes the filtration of signal, avoids the false triggering of signal, utilizes existing chip in the at utmost, practices thrift the cost, effectively improves failure diagnosis's ageing and accuracy, improves the competitiveness of customer satisfaction and product.
The above description is only exemplary of the present invention and should not be taken as limiting the scope of the present invention, as any modifications, equivalents, improvements and the like made within the spirit and principles of the present invention are intended to be included within the scope of the present invention.

Claims (4)

1. A system for rapidly determining a reason for abnormal shutdown of a server, the system comprising:
the CPLD end is provided with an IN1 port, an IN2 port, an OUT1 port and an OUT2 port, the IN1 port is connected with a hardware key signal, and the IN2 port is connected with a BMC remote control output signal;
the BMC end is provided with GPIO1 and GPIO2 ports, the GPIO1 port is connected with the OUT1 port of the CPLD, and the GPIO2 port is connected with the OUT2 port of the CPLD.
2. The system according to claim 1, wherein the CPLD filters the hardware key signal and the BMC remote control output signal.
3. The system according to claim 1, wherein the BMC generates a log record according to signals received by the two GPIO ports.
4. The system according to claim 1, wherein the BMC distinguishes and records different types of logs according to different GPIO ports.
CN202121504327.4U 2021-06-29 2021-06-29 System for rapidly judging reason of abnormal shutdown of server Active CN215181972U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202121504327.4U CN215181972U (en) 2021-06-29 2021-06-29 System for rapidly judging reason of abnormal shutdown of server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202121504327.4U CN215181972U (en) 2021-06-29 2021-06-29 System for rapidly judging reason of abnormal shutdown of server

Publications (1)

Publication Number Publication Date
CN215181972U true CN215181972U (en) 2021-12-14

Family

ID=79400996

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202121504327.4U Active CN215181972U (en) 2021-06-29 2021-06-29 System for rapidly judging reason of abnormal shutdown of server

Country Status (1)

Country Link
CN (1) CN215181972U (en)

Similar Documents

Publication Publication Date Title
US10931511B2 (en) Predicting computer network equipment failure
US10430260B2 (en) Troubleshooting method, computer system, baseboard management controller, and system
US9720761B2 (en) System fault detection and processing method, device, and computer readable storage medium
CN104639380A (en) Server monitoring method
CN107209511B (en) Monitoring control device
CN107704359B (en) Monitoring system of big data platform
US11853150B2 (en) Method and device for detecting memory downgrade error
CN113687969A (en) Alarm information generation method and device, electronic equipment and readable storage medium
CN104156297A (en) Warning method and device
CN112084087A (en) Industrial equipment state monitoring and operation and maintenance management method and system
WO2023179684A1 (en) Method and apparatus for monitoring state of central processing unit, and device and storage medium
CN115878356A (en) Disk failure prediction method and device
CN116820820A (en) Server fault monitoring method and system
CN215181972U (en) System for rapidly judging reason of abnormal shutdown of server
CN111625386A (en) Monitoring method and device for power-on overtime of system equipment
WO2018103185A1 (en) Fault processing method, computer system, baseboard management controller and system
WO2019141024A1 (en) Equipment unit state controlling method and device, and equipment unit
CN103995759B (en) High-availability computer system failure handling method and device based on core internal-external synergy
US11652831B2 (en) Process health information to determine whether an anomaly occurred
CN116126772A (en) UART serial port management system and method applied to ARM server
CN116126574A (en) System fault diagnosis method, device, equipment and storage medium
CN115543707A (en) Hard disk fault detection method, system and device, storage medium and electronic device
CN111274089B (en) Server abnormal behavior perception system based on bypass technology
CN115080362A (en) PCIE (peripheral component interface express) equipment speed reduction reporting method, system, equipment and storage medium
CN109189644B (en) Whole cabinet RMC, and method and system for automatically configuring number of newly added nodes of whole cabinet

Legal Events

Date Code Title Description
GR01 Patent grant
GR01 Patent grant