WO2020140428A1 - 一种服务器硬盘背板健康状态监测装置、方法及系统 - Google Patents

一种服务器硬盘背板健康状态监测装置、方法及系统 Download PDF

Info

Publication number
WO2020140428A1
WO2020140428A1 PCT/CN2019/098502 CN2019098502W WO2020140428A1 WO 2020140428 A1 WO2020140428 A1 WO 2020140428A1 CN 2019098502 W CN2019098502 W CN 2019098502W WO 2020140428 A1 WO2020140428 A1 WO 2020140428A1
Authority
WO
WIPO (PCT)
Prior art keywords
hard disk
controller
eeprom
programmable controller
voltage regulation
Prior art date
Application number
PCT/CN2019/098502
Other languages
English (en)
French (fr)
Inventor
王义晖
Original Assignee
郑州云海信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 郑州云海信息技术有限公司 filed Critical 郑州云海信息技术有限公司
Publication of WO2020140428A1 publication Critical patent/WO2020140428A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring

Definitions

  • the invention belongs to the technical field of servers, and particularly relates to a device, method and system for monitoring the health status of a server hard disk backplane.
  • the server hard disk backplane is an important part of the server storage subsystem. It is mainly used as a component to connect the hard disk.
  • the hard disk interface is connected to the server's Raid card, SAS card, or onboard interface in a certain way.
  • server functions With the development of server functions, the development of storage subsystems has become more and more diversified.
  • types in addition to the traditional backplanes connected to SATA hard drives and SAS hard drives, NVME hard drive backplanes and M.2 hard drives have also been developed.
  • backplanes such as backplanes for hard drives of ruler specifications.
  • Some backplanes also need to support the feature of mixed insertion, that is, connecting a variety of different types of backplanes to the same backplane.
  • the shape of the backplane has also gradually diversified. In addition to horizontal and vertical backplanes, there are also many irregular backplanes, and the number of hard drives on the backplane is increasing.
  • the programmable controller CPLD is generally used as the interface of the baseboard management controller BMC monitoring on the hard disk backplane.
  • the main function of the CPLD is to parse the SGPIO signal of the storage controller into specific error, locate, and active signals. Control the LED light of HDD slot by GPIO; convert SGPIO signal to I2C signal and communicate with mainboard BMC, it is the server out-of-band management system to know the status of hard disk on the backplane.
  • the BMC directly communicates with the temperature sensor on the backplane and reads the temperature on the backplane through I2C for monitoring.
  • the monitoring range of the hard disk backplane is limited, and only the temperature information of the hard disk backplane can be monitored, and other failure information of the hard disk backplane cannot be acquired and recorded, so that the health state of the hard disk backplane cannot be obtained, and there is a safe operation Hidden dangers.
  • the present invention provides a server hard disk backplane health monitoring device, which aims to solve the limited monitoring range of the hard disk backplane in the prior art, which can only monitor the temperature information of the hard disk backplane, but cannot Monitor and obtain other fault information of the hard disk backplane, so that the health status of the hard disk backplane cannot be obtained, and there is a hidden danger of safe operation.
  • a server hard disk backplane health monitoring device including a baseboard management controller, a programmable controller, a memory controller, and a hard disk slot, and the hard disk slot is installed with the A hard disk LED indicator connected to a programmable controller, the programmable controller is respectively connected to the baseboard management controller and the memory controller, and the server hard disk backplane health monitoring device further includes a temperature sensor, a voltage regulation module, and EEPROM, the temperature sensor, the voltage regulation module, and the EEPROM are respectively connected to the programmable controller;
  • the programmable controller accesses the temperature sensor, the voltage regulation module, and the EEPROM in a polling manner, respectively, to obtain the temperature parameters collected by the temperature sensor, the fault information in the internal register of the voltage regulation module, and the memory
  • the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the controller, and the temperature parameters collected by the acquired temperature sensor, the fault information in the internal register of the voltage regulation module, and the GPIO signal sent by the memory controller
  • the parsed error, locate, and active signal parameters are stored in the EEPROM;
  • the temperature parameter collected by the temperature sensor stored in the EEPROM When receiving the polling action instruction of the substrate management controller, the temperature parameter collected by the temperature sensor stored in the EEPROM, the fault information in the internal register of the voltage regulation module, and the memory controller The error, locate, and active signal parameters obtained by parsing the sent GPIO signal are fed back to the substrate management controller.
  • the temperature sensor is connected to the programmable controller through an I2C bus
  • the voltage regulation module is connected to the programmable controller through PMBUS
  • the EEPROM is connected to the programmable controller through an SPI bus The controller is connected.
  • the programmable controller and the baseboard management controller are connected via an I2C bus.
  • Another object of the present invention is to provide a server hard disk backplane health status monitoring method based on a server hard disk backplane health status monitoring device.
  • the method includes the following steps:
  • the programmable controller accesses the temperature sensor, the voltage regulation module, and the EEPROM in a polling manner, respectively, to obtain the temperature parameters collected by the temperature sensor, the fault information in the internal register of the voltage regulation module, and the memory controller Parameters of error, locate, and active signals obtained by parsing the sent GPIO signals;
  • the programmable controller stores the temperature parameters collected by the acquired temperature sensor, the fault information in the internal register of the voltage regulation module, and the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller.
  • EEPROM Electrically erasable read-only memory
  • the baseboard management controller accesses the programmable controller in a polling manner to generate a polling action instruction
  • the programmable controller When the programmable controller receives the polling action instruction of the substrate management controller, it stores the temperature parameters collected by the temperature sensor stored in the EEPROM, the fault information in the internal register of the voltage regulation module, and The error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller are fed back to the substrate management controller.
  • the method further includes the following steps:
  • An EEPROM is set on the hard disk backplane in advance, and the EEPROM is used to temporarily store the temperature parameters collected by the temperature sensor, the fault information in the internal register of the voltage regulation module, and the GPIO signal analysis sent by the memory controller The obtained error, locate, and active signal parameters.
  • the temperature sensor is connected to the programmable controller through an I2C bus
  • the voltage regulation module is connected to the programmable controller through PMBUS
  • the EEPROM is connected to the programmable controller through an SPI bus Controller connection;
  • the programmable controller and the baseboard management controller are connected through an I2C bus.
  • Another object of the present invention is to provide a server hard disk backplane health monitoring system based on a server hard disk backplane health monitoring method.
  • the system includes:
  • the first polling module built into the programmable controller, is used to access the temperature sensor, the voltage regulation module, and the EEPROM in a polling manner, respectively to obtain the temperature parameters collected by the temperature sensor, and the interior of the voltage regulation module
  • the fault information in the register and the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller;
  • a storage control module built into the programmable controller, is used by the programmable controller to collect the temperature parameters collected by the temperature sensor, the fault information in the internal register of the voltage regulation module, and the information sent by the memory controller
  • the error, locate, and active signal parameters obtained by GPIO signal analysis are stored in the EEPROM;
  • the second polling module built into the baseboard management controller, is used to access the programmable controller in a polling manner to generate a polling action instruction
  • the feedback module is built into the programmable controller and is used to collect the temperature parameter and voltage regulation module collected by the temperature sensor stored in the EEPROM when receiving the polling action instruction of the substrate management controller.
  • the fault information in the internal register and the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller are fed back to the substrate management controller.
  • the system further includes:
  • a preset module is used to set an EEPROM on the hard disk backplane in advance, the EEPROM is used to temporarily store the temperature parameters collected by the temperature sensor, the fault information in the internal register of the voltage regulation module and controlled by the memory The error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the device.
  • the temperature sensor is connected to the programmable controller through an I2C bus
  • the voltage regulation module is connected to the programmable controller through PMBUS
  • the EEPROM is connected to the programmable controller through an SPI bus Controller connection;
  • the programmable controller and the baseboard management controller are connected through an I2C bus.
  • the server hard disk backplane health monitoring device includes a baseboard management controller, a programmable controller, a memory controller, a hard disk slot, a temperature sensor, a voltage regulation module, and an EEPROM.
  • the polling method accesses the temperature sensor, the voltage regulation module, and the EEPROM, respectively obtains various hard disk operating parameters, and stores the acquired hard disk operating parameters in the EEPROM; when receiving the polling action of the substrate management controller
  • the hard disk operating parameters stored in the EEPROM are fed back to the baseboard management controller, so as to monitor and record the health status of the server hard disk backplane, and improve the quality of the server product and user experience.
  • FIG. 1 is a structural block diagram of a health status monitoring device for a server hard disk backplane provided by the present invention
  • FIG. 2 is an implementation flowchart of a method for monitoring the health status of a server hard disk backplane provided by the present invention
  • FIG. 3 is a structural block diagram of a health monitoring system for a server hard disk backplane provided by the present invention.
  • 1-baseboard management controller 2-programmable controller, 3-memory controller, 4-hard disk slot, 5-temperature sensor, 6-voltage regulation module, 7-EEPROM, 8-first polling module , 9- storage control module, 10- second polling module, 11- feedback module, 12- preset module.
  • FIG. 1 is a structural block diagram of a health status monitoring device for a server hard disk backplane provided by the present invention. For ease of description, only parts related to the embodiments of the present invention are shown in the figure.
  • the server hard disk backplane health monitoring device includes a baseboard management controller 1, a programmable controller 2, a memory controller 3, and a hard disk slot 4, the hard disk slot 4 is installed with the programmable controller 2 connected Hard disk LED indicator, the programmable controller 2 is respectively connected to the baseboard management controller 1 and the memory controller 3, and the server hard disk backplane health monitoring device further includes a temperature sensor 5, a voltage regulation module 6, and an EEPROM 7. The temperature sensor 5, the voltage regulation module 6, and the EEPROM 7 are respectively connected to the programmable controller 2;
  • the programmable controller 2 accesses the temperature sensor 5, the voltage regulation module 6, and the EEPROM 7 in a polling manner, respectively, and obtains the temperature parameters collected by the temperature sensor 5 and the fault in the internal register of the voltage regulation module 6, respectively Information and the parameters of the error, locate, and active signals parsed by the GPIO signal sent by the memory controller 3, and the temperature parameters collected by the temperature sensor 5 acquired, the fault information in the internal register of the voltage regulation module 6, and the The error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3 are stored in the EEPROM;
  • the temperature parameter collected by the temperature sensor 5 stored in the EEPROM, the fault information in the internal register of the voltage regulation module 6 and the The error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3 are fed back to the substrate management controller 1.
  • the temperature sensor 5 is connected to the programmable controller 2 through an I2C bus
  • the voltage regulation module 6 is connected to the programmable controller 2 through PMBUS
  • the EEPROM is connected to the programmable controller through an SPI bus Programming controller 2 connection;
  • the programmable controller 2 and the baseboard management controller 1 are connected through an I2C bus.
  • the programmable controller 2 has more interfaces and communicates with the voltage regulation module 6 through PMBUS to monitor the power supply of the hard disk backplane.
  • an EEPROM memory is added to the hard disk backplane. Record and save the fault conditions on the main board for the baseboard management controller 1 to query, plus the temperature sensor 5 directly connected to the programmable controller 2 to realize all-round monitoring and recording of the hard disk backplane, so as to further Improve the server experience.
  • FIG. 2 shows an implementation flowchart of a server hard disk backplane health monitoring method based on a server hard disk backplane health monitoring device provided by the present invention, which specifically includes the following steps:
  • step S101 the programmable controller 2 accesses the temperature sensor 5, the voltage adjustment module 6, and the EEPROM in a polling manner, respectively, and obtains the temperature parameters collected by the temperature sensor 5 and the internal registers of the voltage adjustment module 6, respectively Fault information and error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3;
  • step S102 the programmable controller 2 analyzes the acquired temperature parameter of the temperature sensor 5, the fault information in the internal register of the voltage regulation module 6, and the error obtained by parsing the GPIO signal sent by the memory controller 3. locate and active signal parameters are stored in the EEPROM;
  • step S103 the board management controller 1 accesses the programmable controller 2 in a polling manner to generate a polling action instruction
  • step S104 when the programmable controller 2 receives the polling action instruction of the substrate management controller 1, the temperature parameter and voltage collected by the temperature sensor 5 stored in the EEPROM are adjusted The fault information in the internal register of the module 6 and the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3 are fed back to the substrate management controller 1.
  • An EEPROM is provided on the hard disk backplane in advance, and the EEPROM is used to temporarily store the temperature parameters collected by the temperature sensor 5, the fault information in the internal register of the voltage regulation module 6, and the information sent by the memory controller 3. Parameters of error, locate, and active signals obtained by GPIO signal analysis.
  • FIG. 3 shows a structural block diagram of a server hard disk backplane health monitoring system provided by the present invention. For ease of explanation, only parts related to the embodiments of the present invention are shown in the figure.
  • the server hard disk backplane health monitoring system includes:
  • the first polling module 8 built in the programmable controller 2, is used to access the temperature sensor 5, the voltage regulation module 6, and the EEPROM in a polling manner, respectively, to obtain temperature parameters collected by the temperature sensor 5 respectively 1.
  • the storage control module 9 is built in the programmable controller 2 and is used by the programmable controller 2 to collect the temperature parameters collected by the temperature sensor 5, the fault information in the internal register of the voltage regulation module 6 and the The error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3 are stored in the EEPROM;
  • the second polling module 10 is built into the baseboard management controller 1 and is used to access the programmable controller 2 in a polling manner to generate a polling action instruction;
  • the feedback module 11 is built in the programmable controller 2 and is used to collect the temperature parameter collected by the temperature sensor 5 stored in the EEPROM when receiving the polling action instruction of the substrate management controller 1 1.
  • the fault information in the internal register of the voltage regulation module 6 and the error, locate, and active signal parameters obtained by parsing the GPIO signal sent by the memory controller 3 are fed back to the substrate management controller 1.
  • the preset module 12 is used to preset an EEPROM on the hard disk backplane, and the EEPROM is used to temporarily store the temperature parameters collected by the temperature sensor 5, the fault information in the internal register of the voltage adjustment module 6, and Parameters of error, locate, and active signals obtained by parsing the GPIO signal sent by the memory controller 3.
  • the server hard disk backplane health monitoring device includes a baseboard management controller 1, a programmable controller 2, a memory controller 3, a hard disk slot 4, a temperature sensor 5, a voltage regulation module 6, and an EEPROM,
  • the programmable controller 2 respectively accesses the temperature sensor 5, the voltage regulation module 6, and the EEPROM in a polling manner, respectively obtains various hard disk operating parameters, and stores the obtained hard disk operating parameters in the EEPROM; when receiving When polling the operation command of the baseboard management controller 1, the hard disk operating parameters stored in the EEPROM are fed back to the baseboard management controller 1, so as to monitor and record the health status of the server hard disk backplane, and improve the quality and quality of the server products. user experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)
  • Programmable Controllers (AREA)

Abstract

一种服务器硬盘背板健康状态监测装置、方法及系统,装置包括基板管理控制器、可编程控制器、存储器控制器、硬盘插槽、包括温度传感器、电压调节模块以及EEPROM,可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取各种硬盘运行参数,并将获取到的硬盘运行参数存储到EEPROM中;当接收到基板管理控制器的轮询动作指令时,将存储在EEPROM中的硬盘运行参数反馈给基板管理控制器,从而实现对服务器硬盘背板的健康状态监测和记录,提升服务器产品的品质和用户体验。

Description

一种服务器硬盘背板健康状态监测装置、方法及系统
本申请要求于2019年1月4日提交中国专利局、申请号为201910007155.0、发明名称为“一种服务器硬盘背板健康状态监测装置、方法及系统”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明属于服务器技术领域,尤其涉及一种服务器硬盘背板健康状态监测装置、方法及系统。
背景技术
服务器硬盘背板是服务器存储子系统的重要组成部分,主要作为连接硬盘的部件,将硬盘接口按照一定的方式与服务器的Raid卡、SAS卡或者板载接口相连。随着服务器功能的发展,存储子系统的发展也越来越多样化,按照种类来讲,除了传统的连接SATA硬盘、SAS硬盘的背板,还发展出了NVME硬盘背板、M.2硬盘背板、ruler规格硬盘的背板等多种形态,有的背板还需要支持混插的特性,即将多种不同种类的背板连接到同一块背板上。背板的形态也逐渐多样化,现在除了横插和竖插的背板外,还出现了很多不规则的背板,背板上硬盘的数量也越来越多。
目前,硬盘背板上一般使用可编程控制器CPLD作为基板管理控制器BMC监控的接口使用,CPLD负责的主要功能是:将存储控制器的SGPIO信号解析为具体的error、locate、active信号后,通过GPIO控制HDD slot的LED灯;将SGPIO信号转换为I2C信号,与主板BMC进行通信,是服务器带外管理系统可以知道背板上硬盘的状态。而BMC直接与背板上的温度sensor进行通信,通过I2C来读取背板上的温度进行监控。
从上可知,对硬盘背板的监测范围有限,只能监控硬盘背板的温度信息,无法监测获取并记录硬盘背板的其他故障信息,从而无法获取硬盘背板的健康状态,存在安全运行的隐患。
发明内容
针对现有技术中的缺陷,本发明提供了一种服务器硬盘背板健康状态监测装置,旨在解决现有技术中对硬盘背板的监测范围有限,只能监控硬盘背板的温度信息,无法监测获取并记录硬盘背板的其他故障信息,从而无法获取硬盘背板的健康状态,存在安全运行的隐患的问题。
本发明所提供的技术方案是:一种服务器硬盘背板健康状态监测装置,包括基板管理控制器、可编程控制器、存储器控制器以及硬盘插槽,所述硬盘插槽上安装有与所述可编程控制器连接的硬盘LED指示灯,所述可编程控制器分别与所述基板管理控制器和存储器控制器连接,所述服务器硬盘背板健康状态监测装置还包括温度传感器、电压调节模块以及EEPROM,所述温度传感器、电压调节模块、EEPROM分别与所述可编程控制器连接;
所述可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数,并将获取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
当接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
作为一种改进的方案,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接。
作为一种改进的方案,所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
本发明的另一目的在于提供一种基于服务器硬盘背板健康状态监测装置的服务器硬盘背板健康状态监测方法,所述方法包括下述步骤:
可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数;
所述可编程控制器将获取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
基板管理控制器以轮询的方式对所述可编程控制器进行访问,生成轮询动作指令;
当所述可编程控制器接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
作为一种改进的方案,所述方法还包括下述步骤:
预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数。
作为一种改进的方案,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接;
所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
本发明的另一目的在于提供一种基于服务器硬盘背板健康状态监测方法的服务器硬盘背板健康状态监测系统,所述系统包括:
第一轮询模块,内置于可编程控制器内,用于分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存 储器控制器发送的GPIO信号解析得到的error、locate、active信号参数;
存储控制模块,内置于可编程控制器内,用于所述可编程控制器将获取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
第二轮询模块,内置于基板管理控制器内,用于以轮询的方式对所述可编程控制器进行访问,生成轮询动作指令;
反馈模块,内置于可编程控制器内,用于当接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
作为一种改进的方案,所述系统还包括:
预先设置模块,用于预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数。
作为一种改进的方案,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接;
所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
在本发明实施例中,服务器硬盘背板健康状态监测装置包括基板管理控制器、可编程控制器、存储器控制器、硬盘插槽、包括温度传感器、电压调节模块以及EEPROM,可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取各种硬盘运行参数,并将获取到的硬盘运行参数存储到EEPROM中;当接收到基板管理控制器的轮询动作指令时,将存储在EEPROM中的硬盘运行参数反馈给基板管理控制器,从而实现对服务器硬盘背板的健康状态监测和记录,提升服务器产品的品质和用户体验。
附图说明
为了更清楚地说明本发明具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍。在所有附图中,类似的元件或部分一般由类似的附图标记标识。附图中,各元件或部分并不一定按照实际的比例绘制。
图1是本发明提供的服务器硬盘背板健康状态监测装置的结构框图;
图2是本发明提供的服务器硬盘背板健康状态监测方法的实现流程图;
图3是本发明提供的服务器硬盘背板健康状态监测系统的结构框图;
其中,1-基板管理控制器,2-可编程控制器,3-存储器控制器,4-硬盘插槽,5-温度传感器,6-电压调节模块,7-EEPROM,8-第一轮询模块,9-存储控制模块,10-第二轮询模块,11-反馈模块,12-预先设置模块。
具体实施方式
下面将结合附图对本发明技术方案的实施例进行详细的描述。以下实施例仅用于更加清楚地说明本发明的、技术方案,因此只作为示例,而不能以此来限制本发明的保护范围。
图1是本发明提供的服务器硬盘背板健康状态监测装置的结构框图,为了便于说明,图中仅给出与本发明实施例相关的部分。
服务器硬盘背板健康状态监测装置包括基板管理控制器1、可编程控制器2、存储器控制器3以及硬盘插槽4,所述硬盘插槽4上安装有与所述可编程控制器2连接的硬盘LED指示灯,所述可编程控制器2分别与所述基板管理控制器1和存储器控制器3连接,所述服务器硬盘背板健康状态监测装置还包括温度传感器5、电压调节模块6以及EEPROM 7,所述温度传感器5、电压调节模块6、EEPROM 7分别与所述可编程控制器2连接;
所述可编程控制器2分别以轮询的方式对所述温度传感器5、电压调节模块6、EEPROM 7进行访问,分别获取温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3 发送的GPIO信号解析得到的error、locate、active信号参数,并将获取到的温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
当接收到所述基板管理控制器1的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器1。
在该实施例中,温度传感器5通过I2C总线与所述可编程控制器2连接,所述电压调节模块6通过PMBUS与所述可编程控制器2连接,所述EEPROM通过SPI总线与所述可编程控制器2连接;
可编程控制器2与所述基板管理控制器1之间通过I2C总线连接。
在本发明实施例中,可编程控制器2具有较多的接口,通过PMBUS与电压调节模块6通信连接,实现对硬盘背板的供电情况进行监控,同时在硬盘背板上加入一个EEPROM存储器,将主板上的故障情况进行记录,并保存,供基板管理控制器1查询,加上与可编程控制器2直接连接的温度传感器5,实现对硬盘背板的全方位监控和记录,从而进一步的提升服务器的使用体验。
图2示出了本发明提供的基于服务器硬盘背板健康状态监测装置的服务器硬盘背板健康状态监测方法的实现流程图,其具体包括下述步骤:
在步骤S101中,可编程控制器2分别以轮询的方式对所述温度传感器5、电压调节模块6、EEPROM进行访问,分别获取温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数;
在步骤S102中,可编程控制器2将获取到的温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
在步骤S103中,基板管理控制器1以轮询的方式对所述可编程控制器 2进行访问,生成轮询动作指令;
在步骤S104中,当所述可编程控制器2接收到所述基板管理控制器1的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器1。
其中,在执行上述步骤S101之前还需要执行下述步骤:
预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数。
图3示出了本发明提供的服务器硬盘背板健康状态监测系统的结构框图,为了便于说明,图中仅给出了与本发明实施例相关的部分。
服务器硬盘背板健康状态监测系统包括:
第一轮询模块8,内置于可编程控制器2内,用于分别以轮询的方式对所述温度传感器5、电压调节模块6、EEPROM进行访问,分别获取温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数;
存储控制模块9,内置于可编程控制器2内,用于所述可编程控制器2将获取到的温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
第二轮询模块10,内置于基板管理控制器1内,用于以轮询的方式对所述可编程控制器2进行访问,生成轮询动作指令;
反馈模块11,内置于可编程控制器2内,用于当接收到所述基板管理控制器1的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号 参数反馈给所述基板管理控制器1。
其中,预先设置模块12,用于预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器5采集到的温度参数、电压调节模块6内部寄存器中的故障信息以及由所述存储器控制器3发送的GPIO信号解析得到的error、locate、active信号参数。
在本发明实施例中,服务器硬盘背板健康状态监测装置包括基板管理控制器1、可编程控制器2、存储器控制器3、硬盘插槽4、包括温度传感器5、电压调节模块6以及EEPROM,可编程控制器2分别以轮询的方式对所述温度传感器5、电压调节模块6、EEPROM进行访问,分别获取各种硬盘运行参数,并将获取到的硬盘运行参数存储到EEPROM中;当接收到基板管理控制器1的轮询动作指令时,将存储在EEPROM中的硬盘运行参数反馈给基板管理控制器1,从而实现对服务器硬盘背板的健康状态监测和记录,提升服务器产品的品质和用户体验。
以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围,其均应涵盖在本发明的权利要求和说明书的范围当中。

Claims (9)

  1. 一种服务器硬盘背板健康状态监测装置,包括基板管理控制器、可编程控制器、存储器控制器以及硬盘插槽,所述硬盘插槽上安装有与所述可编程控制器连接的硬盘LED指示灯,所述可编程控制器分别与所述基板管理控制器和存储器控制器连接,其特征在于,所述服务器硬盘背板健康状态监测装置还包括温度传感器、电压调节模块以及EEPROM,所述温度传感器、电压调节模块、EEPROM分别与所述可编程控制器连接;
    所述可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数,并将获取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
    当接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
  2. 根据权利要求1所述的服务器硬盘背板健康状态监测装置,其特征在于,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接。
  3. 根据权利要求1或2所述的服务器硬盘背板健康状态监测装置,其特征在于,所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
  4. 一种基于权利要求1所述的服务器硬盘背板健康状态监测装置的服务器硬盘背板健康状态监测方法,其特征在于,所述方法包括下述步骤:
    可编程控制器分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模 块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数;
    所述可编程控制器将获取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
    基板管理控制器以轮询的方式对所述可编程控制器进行访问,生成轮询动作指令;
    当所述可编程控制器接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
  5. 根据权利要求4所述的服务器硬盘背板健康状态监测方法,其特征在于,所述方法还包括下述步骤:
    预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数。
  6. 根据权利要求4所述的服务器硬盘背板健康状态监测方法,其特征在于,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接;
    所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
  7. 一种基于权利要求4所述的服务器硬盘背板健康状态监测方法的服务器硬盘背板健康状态监测系统,其特征在于,所述系统包括:
    第一轮询模块,内置于可编程控制器内,用于分别以轮询的方式对所述温度传感器、电压调节模块、EEPROM进行访问,分别获取温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数;
    存储控制模块,内置于可编程控制器内,用于所述可编程控制器将获 取到的温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数存储到所述EEPROM中;
    第二轮询模块,内置于基板管理控制器内,用于以轮询的方式对所述可编程控制器进行访问,生成轮询动作指令;
    反馈模块,内置于可编程控制器内,用于当接收到所述基板管理控制器的轮询动作指令时,将存储在所述EEPROM中的所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数反馈给所述基板管理控制器。
  8. 根据权利要求7所述的服务器硬盘背板健康状态监测系统,其特征在于,所述系统还包括:
    预先设置模块,用于预先在所述硬盘背板上设置一个EEPROM,所述EEPROM用于暂时存储所述温度传感器采集到的温度参数、电压调节模块内部寄存器中的故障信息以及由所述存储器控制器发送的GPIO信号解析得到的error、locate、active信号参数。
  9. 根据权利要求8所述的服务器硬盘背板健康状态监测系统,其特征在于,所述温度传感器通过I2C总线与所述可编程控制器连接,所述电压调节模块通过PMBUS与所述可编程控制器连接,所述EEPROM通过SPI总线与所述可编程控制器连接;
    所述可编程控制器与所述基板管理控制器之间通过I2C总线连接。
PCT/CN2019/098502 2019-01-04 2019-07-31 一种服务器硬盘背板健康状态监测装置、方法及系统 WO2020140428A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910007155.0 2019-01-04
CN201910007155.0A CN109857602A (zh) 2019-01-04 2019-01-04 一种服务器硬盘背板健康状态监测装置、方法及系统

Publications (1)

Publication Number Publication Date
WO2020140428A1 true WO2020140428A1 (zh) 2020-07-09

Family

ID=66893856

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/098502 WO2020140428A1 (zh) 2019-01-04 2019-07-31 一种服务器硬盘背板健康状态监测装置、方法及系统

Country Status (2)

Country Link
CN (1) CN109857602A (zh)
WO (1) WO2020140428A1 (zh)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109857602A (zh) * 2019-01-04 2019-06-07 郑州云海信息技术有限公司 一种服务器硬盘背板健康状态监测装置、方法及系统
CN110553706A (zh) * 2019-07-23 2019-12-10 中科智水(北京)科技有限公司 一种无人值守自动称重控制器及系统
CN112000284A (zh) * 2020-08-14 2020-11-27 苏州浪潮智能科技有限公司 一种支持存储服务器硬盘保护的硬件系统及方法、程序
CN112213980A (zh) * 2020-10-21 2021-01-12 苏州浪潮智能科技有限公司 一种单片机故障诊断板卡及方法
CN113010108A (zh) * 2021-02-26 2021-06-22 山东英信计算机技术有限公司 一种存储系统及其硬盘管理装置
CN113032216B (zh) * 2021-03-26 2023-04-25 山东英信计算机技术有限公司 一种监控方法、装置、设备和介质
CN117591378B (zh) * 2024-01-17 2024-04-05 苏州元脑智能科技有限公司 一种服务器的温度控制方法、系统、设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101901181A (zh) * 2010-02-09 2010-12-01 浪潮(北京)电子信息产业有限公司 硬盘状态监测方法和系统
CN104572724A (zh) * 2013-10-21 2015-04-29 研祥智能科技股份有限公司 主机工作状态的监测系统和监测方法
US20150143203A1 (en) * 2013-11-15 2015-05-21 SK Hynix Inc. Semiconductor device and method of operating the same
CN109117342A (zh) * 2018-08-13 2019-01-01 郑州云海信息技术有限公司 一种服务器及其硬盘健康状态监测系统
CN109857602A (zh) * 2019-01-04 2019-06-07 郑州云海信息技术有限公司 一种服务器硬盘背板健康状态监测装置、方法及系统

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101901181A (zh) * 2010-02-09 2010-12-01 浪潮(北京)电子信息产业有限公司 硬盘状态监测方法和系统
CN104572724A (zh) * 2013-10-21 2015-04-29 研祥智能科技股份有限公司 主机工作状态的监测系统和监测方法
US20150143203A1 (en) * 2013-11-15 2015-05-21 SK Hynix Inc. Semiconductor device and method of operating the same
CN109117342A (zh) * 2018-08-13 2019-01-01 郑州云海信息技术有限公司 一种服务器及其硬盘健康状态监测系统
CN109857602A (zh) * 2019-01-04 2019-06-07 郑州云海信息技术有限公司 一种服务器硬盘背板健康状态监测装置、方法及系统

Also Published As

Publication number Publication date
CN109857602A (zh) 2019-06-07

Similar Documents

Publication Publication Date Title
WO2020140428A1 (zh) 一种服务器硬盘背板健康状态监测装置、方法及系统
US10606725B2 (en) Monitor peripheral device based on imported data
US9804937B2 (en) Backup backplane management control in a server rack system
US10402207B2 (en) Virtual chassis management controller
JP3831377B2 (ja) コンピュータ・システムにおける電力障害を解析する方法および装置
US8935437B2 (en) Peripheral component health monitoring apparatus
US20050276092A1 (en) Virtual mass storage device for server management information
US20140351470A1 (en) Methods and systems for an interposer board
DE102015115533A1 (de) Kontrollstrategie für eine Laufwerksanordnung
US20120137159A1 (en) Monitoring system and method of power sequence signal
CN110377142A (zh) 一种支持服务器硬盘独立上下电的系统及方法
US20060075292A1 (en) Storage system
US20120137027A1 (en) System and method for monitoring input/output port status of peripheral devices
US20120131361A1 (en) Remote controller and method for remotely controlling motherboard using the remote controller
US20170156238A1 (en) Controlling air flow in a server rack
TW201417536A (zh) 伺服器自動管理方法及系統
CN111209241A (zh) 整机柜服务器的管理系统
CN117251333A (zh) 一种硬盘信息获取方法、装置、设备及存储介质
JP2023071968A (ja) ストレージシステム及びストレージシステムの稼働モードを切り替えるための方法
CN113311754A (zh) 一种基于gd32单片机的电源模块的bmc管理系统
US9003068B2 (en) Service channel for connecting a host computer to peripheral devices
TWI611290B (zh) 伺服器機櫃監控方法
CN218824636U (zh) 一种用于服务器硬盘背板的电源检测装置
CN210038709U (zh) 一种电源监控管理扣板
CN111338907A (zh) 一种pcie设备的远程状态监测系统及方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19907046

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19907046

Country of ref document: EP

Kind code of ref document: A1