CN113505045A - Hard disk fault display method and device and server - Google Patents

Hard disk fault display method and device and server Download PDF

Info

Publication number
CN113505045A
CN113505045A CN202110554453.9A CN202110554453A CN113505045A CN 113505045 A CN113505045 A CN 113505045A CN 202110554453 A CN202110554453 A CN 202110554453A CN 113505045 A CN113505045 A CN 113505045A
Authority
CN
China
Prior art keywords
hard disk
state
logic chip
chip
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110554453.9A
Other languages
Chinese (zh)
Inventor
张贺
路广
孙启辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New H3C Information Technologies Co Ltd
Original Assignee
New H3C Information Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by New H3C Information Technologies Co Ltd filed Critical New H3C Information Technologies Co Ltd
Priority to CN202110554453.9A priority Critical patent/CN113505045A/en
Publication of CN113505045A publication Critical patent/CN113505045A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/325Display of status information by lamps or LED's
    • G06F11/326Display of status information by lamps or LED's for error or online/offline status
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3051Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The specification provides a hard disk fault display method, a hard disk fault display device and a server, and relates to the technical field of communication. A hard disk fault display method is applied to an expansion chip of a hard disk backboard, and comprises the following steps: acquiring out-of-band information fed back by a hard disk, wherein the out-of-band information is determined according to an out-of-band signal of the hard disk; receiving in-place information of the hard disk sent by the logic chip; sending the out-of-band information to a hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state; the method comprises the steps of receiving a control signal sent by the hard disk management card, updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information, enabling the logic chip to control the corresponding indicator light according to the state value of the indicator light register, and therefore, the hard disk faults in the server can be displayed accurately, and the monitoring and maintaining efficiency of the server is improved.

Description

Hard disk fault display method and device and server
Technical Field
The present disclosure relates to the field of communications technologies, and in particular, to a method and an apparatus for displaying hard disk failures, and a server.
Background
With the application of large data, cloud computing and the like on a network, the storage space requirement required to be configured on a single server is also increasing, and the number of hard disks provided in the server is gradually increasing as the largest source for providing the storage space. In order to monitor and manage a large number of hard Disks installed in the server, the number of connected hard Disks may be expanded by installing a hard disk backplane, and the hard Disks connected to the hard disk backplane may be monitored and managed by a hard disk management card, such as a RAID (Redundant array of Independent Disks) card or an HBA (Host Bus Adapter) card.
The hard disk backboard is provided with a backboard chip and a logic chip, the logic chip is internally provided with registers for recording the state information of each hard disk, and the logic chip controls the corresponding indicator lamps through the state values in the registers to display the state of the hard disk corresponding to the logic chip. The state value of the register in the logic chip can be determined by the hard disk management card acquiring the out-of-band signal fed back by each hard disk through the backplane chip.
Under the condition that the server works, the logic chip can detect that the hard disk is in place based on hardware connection and display the hard disk through the indicator lamp, but if the hard disk fails, an out-of-band signal cannot be output externally, and the hard disk management card cannot judge whether the current hard disk fails according to the out-of-band signal, so that a worker cannot determine the failure of the hard disk by observing the indicator lamp, and the efficiency of running and supervision of the server is reduced.
Disclosure of Invention
In order to overcome the problems in the related art, the specification provides a hard disk fault display method, a hard disk fault display device and a server.
In combination with the first aspect of the embodiments of the present specification, the present application provides a hard disk fault display method, which is applied to an expansion chip of a hard disk backplane, and includes:
acquiring out-of-band information fed back by a hard disk, wherein the out-of-band information is determined according to an out-of-band signal of the hard disk;
receiving in-place information of the hard disk sent by the logic chip;
sending the out-of-band information to a hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state;
and receiving a control signal sent by the hard disk management card, and updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information so that the logic chip controls the corresponding indicator light according to the state value of the indicator light register.
Further, the indicator lights comprise fault indicator lights, and the indicator light registers comprise fault registers corresponding to the fault indicator lights;
receiving the on-site state of the hard disk sent by the logic chip, comprising the following steps:
receiving the on-site information of the hard disk sent by the logic chip through a bus between the expansion chip and the logic chip;
updating the status value of the indicator light register in the logic chip according to the control signal and the bit information, comprising:
determining a state value of a fault register corresponding to the fault indicator lamp according to the control signal and the on-site information;
and issuing the determined state value of the fault register to the logic chip based on a bus between the expansion chip and the logic chip.
Optionally, determining a state value of a fault register corresponding to the fault indicator according to the control signal and the in-place information includes:
if the hard disk is determined to be in place according to the in-place information and the out-of-band signal which is not fed back by the hard disk is determined to be in place, the out-of-band signal of the hard disk is marked to be in an abnormal state;
if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the state as an abnormal state in a second hard disk state of the hard disk;
and determining the state value of the fault register corresponding to the fault indicator lamp according to the state of the second hard disk.
Optionally, determining a state value of a fault register corresponding to the fault indicator lamp according to the control signal and the in-place information, further comprising:
if the hard disk is determined not to be in place according to the in-place information and the out-of-band signal which is not fed back by the hard disk is determined or the hard disk is determined to be in place according to the in-place information and the out-of-band signal which is fed back by the hard disk is received, marking the out-of-band signal of the hard disk in a normal state;
if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the state as a normal state in a second hard disk state of the hard disk; and if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the second hard disk state of the hard disk as an abnormal state.
Optionally, the logic chip and the expansion chip are connected through an SGPIO bus;
the in-place information is sent to the extension chip by the logic chip through an SDataIn line in the SGPIO bus;
the status value of the indicator light register is sent by the expansion chip to the logic chip via the SDataOut line in the SGPIO bus.
In combination with the second aspect of the embodiments of the present specification, the present application provides a hard disk failure display device, which is applied to an expansion chip of a hard disk backplane, and includes:
the acquisition unit is used for acquiring the out-of-band information fed back by the hard disk, wherein the out-of-band information is determined according to the out-of-band signal of the hard disk;
the receiving unit is used for receiving the in-place information of the hard disk sent by the logic chip;
the sending unit is used for sending the out-of-band information to the hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state;
the receiving unit is also used for receiving a control signal sent by the hard disk management card;
and the control unit is used for updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information so that the logic chip controls the corresponding indicator light according to the state value of the indicator light register.
Further, the indicator lights comprise fault indicator lights, and the indicator light registers comprise fault registers corresponding to the fault indicator lights;
the receiving unit is specifically used for receiving the in-place information of the hard disk sent by the logic chip through a bus between the expansion chip and the logic chip;
a control unit comprising:
the determining module is used for determining the state value of a fault register corresponding to the fault indicator lamp according to the control signal and the in-place information;
and the issuing module is used for issuing the determined state value of the fault register to the logic chip based on a bus between the expansion chip and the logic chip.
Optionally, the determining module is specifically configured to mark the out-of-band signal of the hard disk as an abnormal state if the in-place information of the hard disk is determined and the out-of-band signal is not fed back by the hard disk; if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the state as an abnormal state in a second hard disk state of the hard disk; and determining the state value of the fault indicator lamp according to the state of the second hard disk.
Optionally, the determining module is further configured to mark the out-of-band signal of the hard disk as a normal state if it is determined that the hard disk is not in place and the out-of-band signal is not fed back by the hard disk according to the in-place information or it is determined that the hard disk is in place and the out-of-band signal is fed back by the hard disk according to the in-place information; if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the state as a normal state in a second hard disk state of the hard disk; and if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the second hard disk state of the hard disk as an abnormal state.
Optionally, the logic chip and the expansion chip are connected through an SGPIO bus;
the in-place information is sent to the expansion chip by the logic chip through a serial data input SDataIn line in the SGPIO bus;
the status value of the fault register is sent by the expansion chip to the logic chip via the serial data output SDataOut line in the SGPIO bus.
In combination with the third aspect of the embodiments of the present specification, the present application provides a server, including: the device comprises a hard disk management card, a hard disk back plate, a hard disk inserted on the hard disk back plate and an indicator lamp corresponding to the hard disk;
the hard disk backboard is provided with an extension chip, a logic chip and a machine-readable storage medium, wherein the logic chip is provided with an indicator light register corresponding to the hard disk, the machine-readable storage medium stores machine-executable instructions capable of being executed by the extension chip, and the extension chip is prompted by the machine-executable instructions to: the above-described method steps are implemented.
The technical scheme provided by the implementation mode of the specification can have the following beneficial effects:
in the embodiment of the specification, the logic chip sends the in-place information and the out-of-band information to the expansion chip, the out-of-band information is sent to the hard disk management card through the expansion chip, the hard disk state of the hard disk is determined through the out-of-band information through the hard disk management card, and a control signal carrying the hard disk state is returned to the expansion chip, so that the expansion chip determines the hard disk with the out-of-band signal abnormality according to the control signal and the in-place information, the problem that a worker is difficult to confirm the fault of the hard disk under the condition that the hard disk fault cannot feed back the out-of-band signal to the expansion chip is solved, and the efficiency of operation, maintenance and supervision of the server is improved. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the specification.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present specification and together with the description, serve to explain the principles of the specification.
FIG. 1 is a schematic diagram of a server according to the present application;
FIG. 2 is a flow chart of a hard disk failure presentation method according to the present application;
FIG. 3 is a schematic structural diagram of an SGPIO bus connection between an expansion chip and a logic chip according to the present application;
fig. 4 is a schematic diagram of signals transmitted on the SDataOut line and the SDataIn line to which the present application relates;
fig. 5 is a schematic structural diagram of a hard disk failure display apparatus according to the present application.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present specification.
As shown in fig. 1, a motherboard (not shown) is disposed in the server, a hard disk backplane and a hard disk management card, such as a RAID card or an HBA card, may be plugged into an expansion connector (not shown) of the motherboard, and the expansion connector may be based on PCIE (Peripheral Component Interconnect Express) or based on other bus protocols, which is not described herein again.
The hard disk management card can acquire the relevant information of the connected hard disk and manage the connected hard disk. In order to increase the number of hard disks that can be managed by the hard disk management card, a hard disk backplane may be provided, and the hard disk backplane may include an expansion chip and a logic chip. The logic chip may be pre-programmed with a register and an operation module, where the register is used to record status signals, such as in-place signals, obtained from the hard disk. The operation module can realize data analysis, packaging and the like among the expansion chips.
From the hardware structure aspect, the hard disk management card is connected with the hard disk backboard, and the hard disk backboard is connected to the hard disk. Specifically, the memory controller is connected to an expansion chip, and the expansion chip is connected to the logic chip and each hard disk. The server can also be provided with a plurality of indicator lamps which are connected with the logic chip, and the logic chip can control the on and off of the indicator lamps through the value of a specific register of the logic chip. Interaction among the memory controller, the expansion chip and the hard disk can be performed through out-of-band signals.
After the server is started, the expansion chip may send a request message to the hard disk based on the communication protocol. After the hard disk receives the request message, a feedback message may be sent to the expansion chip, and the expansion chip then feeds back to the storage controller, so that the storage controller may detect the state of the hard disk, and determine whether the hard disk has a fault, but if the hard disk is not in place, the expansion chip sends a notification message to the storage controller after the request message is sent out time, where the feedback performed by the expansion chip to the storage controller based on the out-of-band signal is referred to as out-of-band information, that is, the feedback message includes the feedback message received by the expansion chip from the hard disk and the notification message sent to the storage controller when the feedback message of the hard disk is not received. However, when a hard disk has a serious failure, the hard disk cannot send a feedback message to the storage controller, so that the failure of the hard disk cannot be sensed, and further, an indicator lamp on the server cannot be controlled to correctly show the state of the hard disk. In this case, when the storage controller can receive the feedback message, the storage controller may identify the failed hard disk based on the feedback message, but when the notification message is not received, the storage controller may consider that the hard disk is not in place. Therefore, the storage controller cannot control the lighting of the fault indicator lamp on the server, and when the on-site indicator lamp is on, the state of the hard disk displayed by the storage controller may be considered as that the hard disk is in a normal state by a worker, so that the worker cannot find the fault of the hard disk.
In order to enable the storage controller to more accurately interpret whether the hard disk has a fault or not so as to prompt the state of the hard disk of a worker, the application provides a hard disk fault display method, which is applied to an expansion chip of a hard disk backplane, and as shown in fig. 2, the hard disk fault display method includes:
and S100, acquiring out-of-band information fed back by the hard disk.
S101, receiving the on-site information of the hard disk sent by the logic chip.
After the extension chip sends the request message based on the out-of-band signal to the hard disk, the extension chip may receive a feedback message sent by the hard disk based on the out-of-band signal, and determine the out-of-band information for one hard disk, that is, determine the out-of-band signal according to the feedback message or without receiving the feedback message. And, the on-site signal of the hard disk is obtained through the connection between the Logic chip, such as a CPLD (Complex Programmable Logic Device), on the hard disk backplane and the hard disk, and the on-site information is generated according to the on-site signal. This in-place information may be recorded by a specific register in the logic chip, which is subsequently referred to as the in-place register. The logic chip may periodically obtain the in-place signal of the hard disk, or may obtain the in-place signal of the hard disk according to an interrupt or other manners.
For example, taking the hard disk 0 in fig. 1 as an example, after the logic chip acquires the in-place signal of the hard disk 0 and records the in-place signal in the corresponding in-place indicator, the in-place indicator corresponding to the hard disk 0 may be turned on based on the state value in the in-place register, so that the worker knows that the hard disk 0 is in place.
Thereafter, the logic chip may cause the expansion chip to acquire the in-place information recorded in the logic chip based on the bus connection with the expansion chip. The bus may be an SGPIO bus, and as shown in fig. 3, may include four lines, namely, a Sclock line, a Sload line, an SDataOut line, and an SDataIn line. The Sclock line is used for providing a clock signal, the Sload line is used for triggering transmission of an SDataOut line and an SDataIn line, the SDataOut line is used for sending data to the logic chip by the expansion chip, and the SDataIn line is used for sending data to the expansion chip by the logic chip. After one Sload line triggering, the logic chip and the expansion chip carry out bidirectional data transmission.
For a hard disk, 3 bits can be reserved for information transmission, and the reserved bits can be continuously transmitted on the SDataIn line by one Sload line rising edge trigger, and the form is shown in FIG. 4. In order for the expansion chip to acquire the bit information of the hard disk, the bit information recorded in the bit register in the logic chip may occupy one of 3 bits, that is, the bit information may be sent by the logic chip to the expansion chip through the sdataiin line in the SGPIO bus. The logic chip transmits the in-place information through the first bit corresponding to the hard disk 0 in the sdataiin line, and the expansion chip can know that the hard disk 0 is in the in-place state.
S102, sending the out-of-band information to a hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state.
After the expansion chip acquires the in-place information and the out-of-band information, the out-of-band information can be sent to the hard disk management card.
Then, the storage controller on the hard disk management card may determine whether the hard disk has a fault according to the out-of-band information, that is, the storage controller may determine the first hard disk state of the hard disk based on an SES (SCSI Enclosure Service) protocol according to the out-of-band information. In this first hard disk state, an out-of-band signal exception condition may be interpreted by the storage controller as the corresponding hard disk being out of place.
S103, receiving a control signal sent by the hard disk management card, and updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information, so that the logic chip controls the corresponding indicator light according to the state value of the indicator light register.
Specifically, step S103, updating the state value of the indicator register in the logic chip according to the control signal and the in-place information, includes:
and S1030, determining the state value of the fault register corresponding to the fault indicator lamp according to the control signal and the on-site information.
And S1031, issuing the determined state value of the fault register to the logic chip based on the bus between the expansion chip and the logic chip.
Optionally, step S1030, determining a state value of a fault register corresponding to the fault indicator according to the control signal and the in-place information, includes:
S1030A, if the hard disk is determined to be in place according to the in-place information and the out-of-band signal is not fed back by the hard disk, marking the out-of-band signal of the hard disk as an abnormal state.
S1030B, if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the second hard disk state as an abnormal state.
S1030C, determining the state value of the fault register corresponding to the fault indicator lamp according to the state of the second hard disk.
Optionally, step S1030, determining a state value of a fault register corresponding to the fault indicator according to the control signal and the in-place information, further including:
S1030D, if the hard disk is determined not to be in place and the out-of-band signal fed back by the hard disk is not present according to the in-place information or the hard disk is determined to be in place according to the in-place information and the out-of-band signal fed back by the hard disk is received, marking the out-of-band signal of the hard disk as a normal state.
S1030E, if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the second hard disk state as a normal state.
S1030F, if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the second hard disk state as an abnormal state.
The hard disk backplane is connected with two hard disks, namely a hard disk 1 and a hard disk 2, and the hard disks are respectively inserted into the slot position 1 and the slot position 2, and the slot position 0 is not inserted with the hard disks.
After the extension chip determines the bit information and the out-of-band information, the bit information and the out-of-band information may be recorded in the form of a table, as shown in table 1 below.
Slot position identification Out-of-band signal On-site information Without out-of-band signal failure
0 0 0 0
1 1 1 0
2 0 1 1
TABLE 1
Since the slot 0 and the slot 2 are not plugged with a hard disk, a feedback message cannot be sent to the expansion chip, and the slot 1 receives the feedback message. The in-place information of the slot 0 indicates that the slot 0 is not plugged with a hard disk, and the in-place information of the slot 1 and the slot 2 indicates that the hard disk is connected with the slot 1 and the slot 2. In table 1, out-of-band information flag is detected as 1, out-of-band information flag is not detected as 0, hard disk in-place flag is detected as 1, and hard disk in-place flag is not detected as 0.
And the expansion chip performs exclusive OR operation according to the out-of-band information and the in-place information, records an operation result and determines the fault that the out-of-band signal cannot be received under the condition that the hard disk is in place. That is, a fault without an out-of-band signal is indicated in table 1, where 0 indicates that no out-of-band signal fault exists and is in a normal state of the out-of-band signal, and 1 indicates that no out-of-band signal fault exists and is in an abnormal state of the out-of-band signal.
In the expansion chip, the following table 2 may also be recorded, and the table 2 is used to determine whether the hard disk actually has a fault. The out-of-band signal-free fault corresponds to an exclusive or operation result in table 1, and the SES state represents a fault determined based on out-of-band information. The confirmation of the actual failure may be an or operation with the SES status through the out-of-band information-free failure, so that the failure determined by the out-of-band signal and the failure in which the hard disk is in place and does not receive the out-of-band signal may be respectively exhibited in the actual failure. The SES state is the first hard disk state fed back by the storage controller according to the out-of-band information. Through the first hard disk state, the expansion chip can determine the possible faults of the hard disk which is in place and has normal out-of-band signals. The expansion chip determines the state of the second hard disk based on the state of the first hard disk fed back by the storage controller and whether the out-of-band signal determined by the expansion chip is abnormal or not so as to reflect the state of the hard disk more accurately.
For slot 0, it may be determined that the slot 0 is in the slot, and although the out-of-band signal is not detected (that is, the out-of-band signal is in the invalid state), the out-of-band signal is not detected because the hard disk is not set, so that a situation that the hard disk is connected but the out-of-band signal fails does not occur, that is, the hard disk is in the normal state.
For the slot 1, the expansion chip detects an out-of-band signal (that is, the out-of-band signal is in an effective state) and also detects an in-place signal, so that whether a fault exists in the hard disk can be normally judged by the storage controller on the hard disk management card according to the received out-of-band signal.
For the slot 2, the expansion chip detects the in-place signal, but does not detect the out-of-band signal (that is, the out-of-band signal is in an invalid state), it can be determined that the hard disk may have a hard disk fault that cannot be determined by the storage controller, and at this time, the result of the xor operation between the in-place information and the out-of-band information is 1, which indicates that the hard disk has a fault and is in an abnormal state.
Figure BDA0003076615040000101
Figure BDA0003076615040000111
TABLE 2
Finally, the expansion chip in table 1 and table 2 can determine that the hard disk actually has a fault, that is, the hard disk includes a case where the out-of-band signal is in an abnormal state but the first hard disk determined by the storage controller is in a normal state, and a case where the out-of-band signal is in a normal state but the first hard disk determined by the storage controller is in an abnormal state.
Specifically, the hard disk at the slot 0 is in place and the out-of-band signal is in a normal state, the state of the first hard disk fed back by the storage controller indicates that the hard disk is in a normal state, the expansion chip can determine that the state of the second hard disk is in a normal state, and the fault indicator lamp and the in-place indicator lamp corresponding to the slot 0 do not need to be turned on.
The hard disk on the slot position 1 is in place and the out-of-band signal is in a normal state, the state of the first hard disk fed back by the storage controller is in an abnormal state, the expansion chip can determine that the state of the second hard disk is in the abnormal state, and an in-place indicator lamp and a fault indicator lamp corresponding to the slot position 1 need to be respectively lightened.
The hard disk on the slot position 2 is in place and the out-of-band signal is in an abnormal state, the first hard disk running state fed back by the storage controller is in a normal state, the expansion chip can determine that the second hard disk state is in the abnormal state, and the fault indicator lamp and the in-place indicator lamp corresponding to the slot position 2 need to be lightened. The indicator light corresponding to the slot position is an indicator light corresponding to the hard disk.
In the embodiment of the specification, the logic chip sends the in-place information and the out-of-band information to the expansion chip, the out-of-band information is sent to the hard disk management card through the expansion chip, the hard disk state of the hard disk is determined through the out-of-band information through the hard disk management card, and a control signal carrying the hard disk state is returned to the expansion chip, so that the expansion chip determines the hard disk with the out-of-band signal abnormality according to the control signal and the in-place information, the problem that a worker is difficult to confirm the fault of the hard disk under the condition that the hard disk fault cannot feed back the out-of-band signal to the expansion chip is solved, and the efficiency of operation, maintenance and supervision of the server is improved.
Correspondingly, the present application provides a hard disk failure display device, is applied to the expansion chip of hard disk backplate, as shown in fig. 5, includes:
the acquisition unit is used for acquiring the out-of-band information fed back by the hard disk, wherein the out-of-band information is determined according to the out-of-band signal of the hard disk;
the receiving unit is used for receiving the in-place information of the hard disk sent by the logic chip;
the sending unit is used for sending the out-of-band information to the hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state;
the receiving unit is also used for receiving a control signal sent by the hard disk management card;
and the control unit is used for updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information so that the logic chip controls the corresponding indicator light according to the state value of the indicator light register.
Further, the indicator lights comprise fault indicator lights, and the indicator light registers comprise fault registers corresponding to the fault indicator lights;
the receiving unit is specifically used for receiving the in-place information of the hard disk sent by the logic chip through a bus between the expansion chip and the logic chip;
a control unit comprising:
the determining module is used for determining the state value of a fault register corresponding to the fault indicator lamp according to the control signal and the in-place information;
and the issuing module is used for issuing the determined state value of the fault register to the logic chip based on a bus between the expansion chip and the logic chip.
Optionally, the determining module is specifically configured to mark the out-of-band signal of the hard disk as an abnormal state if the in-place information of the hard disk is determined and the out-of-band signal is not fed back by the hard disk; if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the state as an abnormal state in a second hard disk state of the hard disk; and determining the state value of the fault indicator lamp according to the state of the second hard disk.
Optionally, the determining module is further configured to mark the out-of-band signal of the hard disk as a normal state if it is determined that the hard disk is not in place and the out-of-band signal is not fed back by the hard disk according to the in-place information or it is determined that the hard disk is in place and the out-of-band signal is fed back by the hard disk according to the in-place information; if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the state as a normal state in a second hard disk state of the hard disk; and if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the second hard disk state of the hard disk as an abnormal state.
Optionally, the logic chip and the expansion chip are connected through an SGPIO bus;
the in-place information is sent to the expansion chip by the logic chip through a serial data input SDataIn line in the SGPIO bus;
the status value of the fault register is sent by the expansion chip to the logic chip via the serial data output SDataOut line in the SGPIO bus.
Correspondingly, the present application provides a server, as shown in fig. 1, including: the device comprises a hard disk management card, a hard disk back plate, a hard disk inserted on the hard disk back plate and an indicator lamp corresponding to the hard disk;
the hard disk backboard is provided with an extension chip, a logic chip and a machine-readable storage medium, wherein the logic chip is provided with an indicator light register corresponding to the hard disk, the machine-readable storage medium stores machine-executable instructions capable of being executed by the extension chip, and the extension chip is prompted by the machine-executable instructions to: the above-described method steps are implemented.
Optionally, the logic chip and the expansion chip are connected through an SGPIO bus;
the in-place information is sent to the extension chip by the logic chip through an SDataIn line in the SGPIO bus;
the status value of the fault register is sent by the expansion chip to the logic chip via the SDataOut line in the SGPIO bus.
It will be understood that the present description is not limited to the precise arrangements described above and shown in the drawings, and that various modifications and changes may be made without departing from the scope thereof.
The above description is only for the purpose of illustrating the preferred embodiments of the present disclosure and is not to be construed as limiting the present disclosure, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (11)

1. A hard disk fault display method is characterized in that an expansion chip applied to a hard disk backboard comprises the following steps:
acquiring out-of-band information of a hard disk, wherein the out-of-band information is determined according to an out-of-band signal of the hard disk;
receiving in-place information of the hard disk sent by the logic chip;
sending the out-of-band information to a hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state;
and receiving a control signal sent by the hard disk management card, and updating the state value of an indicator light register in the logic chip according to the control signal and the in-place information, so that the logic chip controls a corresponding indicator light according to the state value of the indicator light register.
2. The method of claim 1, wherein the indicator light comprises a fault indicator light, and the indicator light register comprises a fault register corresponding to the fault indicator light;
the receiving of the on-site information of the hard disk sent by the logic chip includes:
receiving the on-site information of the hard disk sent by the logic chip through a bus between the expansion chip and the logic chip;
the updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information comprises:
determining a state value of a fault register corresponding to the fault indicator lamp according to the control signal and the in-place information;
and issuing the determined state value of the fault register to the logic chip based on a bus between the extension chip and the logic chip.
3. The method of claim 2, wherein determining the status value of the fault register corresponding to the fault indicator lamp according to the control signal and the in-place information comprises:
if the hard disk is determined to be in place according to the in-place information and the out-of-band signal which is not fed back by the hard disk is determined to be in place, the out-of-band signal of the hard disk is marked to be in an abnormal state;
if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the state as an abnormal state in a second hard disk state of the hard disk;
and determining the state value of the fault register corresponding to the fault indicator lamp according to the state of the second hard disk.
4. The method of claim 3, wherein determining the status value of the fault register corresponding to the fault indicator lamp according to the control signal and the in-place information further comprises:
if the hard disk is determined not to be in place according to the in-place information and the out-of-band signal which is not fed back by the hard disk is determined or the hard disk is determined to be in place according to the in-place information and the out-of-band signal which is fed back by the hard disk is received, marking the out-of-band signal of the hard disk in a normal state;
if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the state as a normal state in a second hard disk state of the hard disk; and if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the state as an abnormal state in the second hard disk state of the hard disk.
5. The method of claim 1, wherein the logic chip and the expansion chip are connected via an SGPIO bus;
the in-place information is sent to the expansion chip by the logic chip through a serial data input SDataIn line in the SGPIO bus;
the status value of the indicator light register is sent by the expansion chip to the logic chip through a serial data output SDataOut line in the SGPIO bus.
6. The utility model provides a hard disk fault display device which characterized in that, is applied to the extension chip of hard disk backplate, includes:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring out-of-band information of a hard disk, and the out-of-band information is determined according to an out-of-band signal of the hard disk;
the receiving unit is used for receiving the in-place information of the hard disk sent by the logic chip;
the sending unit is used for sending the out-of-band information to a hard disk management card so that the hard disk management card determines a first hard disk state according to the out-of-band information and generates a control signal carrying the first hard disk state;
the receiving unit is also used for receiving a control signal sent by the hard disk management card;
and the control unit is used for updating the state value of the indicator light register in the logic chip according to the control signal and the in-place information so as to enable the logic chip to control the corresponding indicator light according to the state value of the indicator light register.
7. The apparatus of claim 6, wherein the indicator light comprises a fault indicator light, and the indicator light register comprises a fault register corresponding to the fault indicator light;
the receiving unit is specifically configured to receive, through a bus based on the expansion chip and the logic chip, in-place information of a hard disk sent by the logic chip;
the control unit includes:
the determining module is used for determining a state value of a fault register corresponding to the fault indicator lamp according to the control signal and the in-place information;
and the issuing module is used for issuing the determined state value of the fault register to the fault register of the logic chip based on the bus between the expansion chip and the logic chip.
8. The apparatus according to claim 7, wherein the determining module is specifically configured to mark the out-of-band signal of the hard disk as an abnormal state if the in-place information indicates that the hard disk is in place and the out-of-band signal is not fed back by the hard disk; if the out-of-band signal of the hard disk is in an abnormal state and the first hard disk state determined according to the control signal is a normal state, marking the state as an abnormal state in a second hard disk state of the hard disk; and determining the state value of the fault indicator lamp according to the state of the second hard disk.
9. The apparatus of claim 8, wherein the determining module is further configured to mark the out-of-band signal of the hard disk as a normal state if it is determined that the hard disk is not in place and the out-of-band signal is not fed back by the hard disk according to the in-place information or it is determined that the hard disk is in place and the out-of-band signal is received from the hard disk according to the in-place information; if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is in a normal state, marking the state as a normal state in a second hard disk state of the hard disk; and if the out-of-band signal of the hard disk is in a normal state and the first hard disk state determined according to the control signal is an abnormal state, marking the state as an abnormal state in the second hard disk state of the hard disk.
10. The apparatus of claim 6, wherein the logic chip and the expansion chip are connected via an SGPIO bus;
the in-place information is sent to the extension chip by the logic chip through an SDataIn line in the SGPIO bus;
the status value of the fault register is sent by the expansion chip to the logic chip through the SDataOut line in the SGPIO bus.
11. A server, comprising: the device comprises a hard disk management card, a hard disk back plate, a hard disk inserted on the hard disk back plate and an indicator light corresponding to the hard disk;
an expansion chip, a logic chip and a machine-readable storage medium are arranged on the hard disk backboard, an indicator light register corresponding to the hard disk is arranged in the logic chip, wherein the machine-readable storage medium stores machine-executable instructions capable of being executed by the expansion chip, and the expansion chip is prompted by the machine-executable instructions to: carrying out the method steps of any one of claims 1 to 5.
CN202110554453.9A 2021-05-20 2021-05-20 Hard disk fault display method and device and server Pending CN113505045A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110554453.9A CN113505045A (en) 2021-05-20 2021-05-20 Hard disk fault display method and device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110554453.9A CN113505045A (en) 2021-05-20 2021-05-20 Hard disk fault display method and device and server

Publications (1)

Publication Number Publication Date
CN113505045A true CN113505045A (en) 2021-10-15

Family

ID=78008506

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110554453.9A Pending CN113505045A (en) 2021-05-20 2021-05-20 Hard disk fault display method and device and server

Country Status (1)

Country Link
CN (1) CN113505045A (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120133520A1 (en) * 2010-11-30 2012-05-31 Inventec Corporation Computer chassis system and hard disk status display method thereof
CN105975357A (en) * 2016-04-29 2016-09-28 浪潮电子信息产业股份有限公司 Fault location method and system
CN106776196A (en) * 2017-02-27 2017-05-31 郑州云海信息技术有限公司 A kind of server hard disc method for monitoring state and system
WO2017101217A1 (en) * 2015-12-17 2017-06-22 英业达科技有限公司 Method and system for controlling pcie hard drive status light
CN107832199A (en) * 2017-10-10 2018-03-23 曙光信息产业(北京)有限公司 Hard disk monitoring system based on CPLD
WO2018209889A1 (en) * 2017-05-16 2018-11-22 杭州海康威视数字技术股份有限公司 Extended storage device based on pcie bus
CN109408272A (en) * 2018-11-06 2019-03-01 郑州云海信息技术有限公司 A kind of storage fault handling method and device
CN110162446A (en) * 2019-04-13 2019-08-23 深圳市同泰怡信息技术有限公司 A kind of backboard hard disk ignition method based on BMC
CN110377142A (en) * 2019-06-27 2019-10-25 苏州浪潮智能科技有限公司 A kind of system and method for supporting server hard disc independence power-on and power-off
WO2019223235A1 (en) * 2018-05-23 2019-11-28 郑州云海信息技术有限公司 Expansion system and extension method for sgpio signal of server hard disk
CN111338911A (en) * 2018-12-19 2020-06-26 佛山市顺德区顺达电脑厂有限公司 Server hard disk indicator lamp control system and control method thereof

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120133520A1 (en) * 2010-11-30 2012-05-31 Inventec Corporation Computer chassis system and hard disk status display method thereof
WO2017101217A1 (en) * 2015-12-17 2017-06-22 英业达科技有限公司 Method and system for controlling pcie hard drive status light
CN105975357A (en) * 2016-04-29 2016-09-28 浪潮电子信息产业股份有限公司 Fault location method and system
CN106776196A (en) * 2017-02-27 2017-05-31 郑州云海信息技术有限公司 A kind of server hard disc method for monitoring state and system
WO2018209889A1 (en) * 2017-05-16 2018-11-22 杭州海康威视数字技术股份有限公司 Extended storage device based on pcie bus
CN107832199A (en) * 2017-10-10 2018-03-23 曙光信息产业(北京)有限公司 Hard disk monitoring system based on CPLD
WO2019223235A1 (en) * 2018-05-23 2019-11-28 郑州云海信息技术有限公司 Expansion system and extension method for sgpio signal of server hard disk
CN109408272A (en) * 2018-11-06 2019-03-01 郑州云海信息技术有限公司 A kind of storage fault handling method and device
CN111338911A (en) * 2018-12-19 2020-06-26 佛山市顺德区顺达电脑厂有限公司 Server hard disk indicator lamp control system and control method thereof
CN110162446A (en) * 2019-04-13 2019-08-23 深圳市同泰怡信息技术有限公司 A kind of backboard hard disk ignition method based on BMC
CN110377142A (en) * 2019-06-27 2019-10-25 苏州浪潮智能科技有限公司 A kind of system and method for supporting server hard disc independence power-on and power-off

Similar Documents

Publication Publication Date Title
CN107832199A (en) Hard disk monitoring system based on CPLD
CN106055438B (en) The method and system of memory bar exception on a kind of quick positioning mainboard
US7917664B2 (en) Storage apparatus, storage apparatus control method, and recording medium of storage apparatus control program
US7356431B2 (en) Method for testing an input/output functional board
US6202160B1 (en) System for independent powering of a computer system
US8082368B2 (en) Display device for indicating connection statuses of a communication channel provided between two systems and method thereof
TW440755B (en) Method and system for environmental sensing and control within a computer system
US6269412B1 (en) Apparatus for recording information system events
JPS62500269A (en) Power control network with reliable communication protocols
US6134668A (en) Method of selective independent powering of portion of computer system through remote interface from remote interface power supply
TWI615774B (en) Server system with bmc having hard disk drive monitoring function
CN110377956B (en) U.2 interface multi-hard-disk management back plate and method
US20090006902A1 (en) Methods, systems, and computer program products for reporting fru failures in storage device enclosures
CN101930393B (en) Testing device for SAS (Serial Attached SCSI) back plate
CN111176913A (en) Circuit and method for detecting Cable Port in server
US7380054B2 (en) Storage of fault and position data in nonvolatile memory on system components
US6943463B2 (en) System and method of testing connectivity between a main power supply and a standby power supply
TWI238933B (en) Computer system with dedicated system management buses
CN113961478A (en) Memory fault recording method and device
US7373208B2 (en) Control apparatus and control method
CN108304290A (en) Server power-up state monitors system and method, computer storage and equipment
CN108399116A (en) A kind of server power-up state monitoring system and method
CN113992501A (en) Fault positioning system, method and computing device
CN113505045A (en) Hard disk fault display method and device and server
US7519741B2 (en) Apparatus, system, and method for automating adapter replacement

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination