TW201723839A - Method, system, and server for detecting system status - Google Patents

Publication number
TW201723839A
Authority
TW
Taiwan
Prior art keywords
fault
event
priority order
fault event
module
Prior art date
Application number
TW104142449A
Other languages
Chinese (zh)
Inventor
韓應賢
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司

Abstract

A method for detecting system status, applied to a desktop server, comprises four steps. First, a current fault event related to the system status is read through a complex programmable logic device (CPLD). Second, the CPLD determines whether the current fault event is already stored in the desktop server. Third, when the current fault event is stored in the desktop server, the fault priority corresponding to the event is looked up and an LED health light matching that fault priority is lit. Fourth, when the current fault event is not stored, the event is classified into a corresponding fault priority.

Description

System status detection method, system, and server

The invention belongs to the field of computer technology and relates to a detection method and system, in particular to a method, system, and server for detecting system status.

A server generally has a complete set of standard components such as a chassis, power supply, motherboard, and storage. Servers from HP and other vendors alike therefore include a baseboard management controller (BMC), and system status information is displayed through the BMC on the health light of the panel.

Servers and storage devices are evolving rapidly. New technologies and products keep emerging, and fault symptoms are equally varied. In the most common hardware failures, such as crashes and system blue screens, the hard disk, motherboard, memory, data cables, or other components may each be the cause. Users, and even technical service personnel, generally find these faults hard to diagnose accurately. Existing systems rely on the BMC's scan chain or xregister mechanism to provide round-the-clock feedback on the system status and show it on the health light, so the display of status information depends on BMC support. However, some customer-specific models, such as desktop servers, omit the BMC for cost reasons while still needing to retain the basic functions that a BMC provides.

Therefore, how to provide a method, system, and server for detecting system status that deliver the basic functions of a BMC without using a BMC chip, so as to meet customer needs, has become a technical problem that practitioners in this field urgently need to solve.

In view of the above shortcomings of the prior art, the object of the present invention is to provide a method, system, and server for detecting system status that solve the prior-art problem of having to deliver the basic functions of a BMC, in order to meet customer needs, without using a BMC chip.

To achieve the above and other related objects, one aspect of the present invention provides a method for detecting system status, applied to a desktop server, comprising the following steps: (a) reading, through a complex programmable logic device (CPLD), a currently occurring fault event related to the system status; (b) determining, through the CPLD, whether the currently occurring fault event is already pre-stored in the desktop server; (c) when the result of step (b) is yes, looking up, through the CPLD, a fault priority corresponding to the currently occurring fault event, and lighting, in a predetermined alarm manner according to that fault priority, an LED health light that matches the fault priority; (d) when the result of step (b) is no, classifying the currently occurring fault event into a corresponding fault priority.

In an embodiment of the present invention, the fault priorities are defined as follows: faults related to system power-on are defined as a first fault priority; faults that occur while system processes are running are defined as a second fault priority; faults related to system hardware heat dissipation that cause the system to shut down are defined as a third fault priority; and faults related to system hardware heat dissipation under which the system keeps running are defined as a fourth fault priority.

In an embodiment of the present invention, the fault events corresponding to the first fault priority include at least one of a memory power fault event, a processor power fault event, and a processor power control error fault event; when at least one fault event corresponding to the first fault priority occurs, the red light of the LED health light blinks at a frequency of 4 Hz.

In an embodiment of the present invention, the fault events corresponding to the second fault priority include a fault event, reported by a processor, that is related to a system process; when such a processor-reported fault event occurs, the red light of the LED health light stays on.

In an embodiment of the present invention, the fault events corresponding to the third fault priority include at least one of a system fan fault event, a system temperature sensor overheat fault event, and a processor level-one overheat fault event; when at least one fault event corresponding to the third fault priority occurs, the yellow light of the LED health light blinks at a frequency of 1 Hz.

In an embodiment of the present invention, the fault events corresponding to the fourth fault priority include one of a processor power overheat fault event and a processor level-two overheat fault event; when at least one fault event corresponding to the fourth fault priority occurs, the yellow light of the LED health light stays on.
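The four priorities and their LED behaviour described above can be summarised as a lookup table. The following is a minimal Python sketch for illustration only; the patent implements this logic in CPLD code, and the names `FaultPriority`, `LedPattern`, `LED_PATTERNS`, and `describe` are hypothetical.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Optional


class FaultPriority(IntEnum):
    """The four fault priorities; a smaller number is a higher priority."""
    FIRST = 1   # faults related to system power-on
    SECOND = 2  # faults that occur while system processes are running
    THIRD = 3   # thermal faults that cause the system to shut down
    FOURTH = 4  # thermal faults under which the system keeps running


@dataclass(frozen=True)
class LedPattern:
    color: str
    blink_hz: Optional[float]  # None means the light stays on


# Priority-to-pattern mapping taken from the embodiments above.
LED_PATTERNS = {
    FaultPriority.FIRST: LedPattern("red", 4.0),
    FaultPriority.SECOND: LedPattern("red", None),
    FaultPriority.THIRD: LedPattern("yellow", 1.0),
    FaultPriority.FOURTH: LedPattern("yellow", None),
}


def describe(priority: FaultPriority) -> str:
    """Return a human-readable description of the LED pattern."""
    pattern = LED_PATTERNS[priority]
    if pattern.blink_hz is None:
        return f"{pattern.color} steady"
    return f"{pattern.color} blinking at {pattern.blink_hz:g} Hz"
```

A table of this kind keeps the alarm behaviour in one place, so adding a priority would only require one new entry.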

Another aspect of the present invention provides a system for detecting system status, applied to a desktop server. The detection system includes a reading module, a search module, an operation module, and a division module. The reading module reads a currently occurring fault event related to the system status. A processing module, electrically connected to the reading module, the search module, the operation module, and the division module, determines whether the currently occurring fault event is already pre-stored in the desktop server. If so, it invokes the search module, which looks up a fault priority corresponding to the currently occurring fault event, and the operation module, which lights, in a predetermined alarm manner according to that fault priority, an LED health light matching the fault priority. If not, it invokes the division module, which classifies the currently occurring fault event into a corresponding fault priority.

In an embodiment of the present invention, the detection system further includes a storage module connected to the division module; the storage module stores the currently occurring fault event after the event has been classified into a corresponding fault priority.

Yet another aspect of the present invention provides a server that includes the system status detection system described above.

In an embodiment of the present invention, the server is a desktop server.

As described above, the method, system, and server for detecting system status of the present invention have the following beneficial effects: they display status information without a BMC chip, monitor the status of the whole system, and use the different colors of the health light to tell users and testers where the system has gone wrong and how to fix it. This improves system working efficiency and meets the needs of a variety of customers.

The specific embodiments adopted by the present invention are further described through the following embodiments and drawings.

1‧‧‧System status detection system

11‧‧‧Reading module

12‧‧‧Processing module

13‧‧‧Search module

14‧‧‧Operation module

15‧‧‧Division module

16‧‧‧Storage module

2‧‧‧Server

Figure 1 is a flowchart of the system status detection method of the present invention in an embodiment.

Figure 2 is a schematic structural diagram of the system status detection system of the present invention in an embodiment.

Figure 3 is a schematic structural diagram of the server of the present invention in an embodiment.

The embodiments of the present invention are described below through specific examples, and those skilled in the art can readily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other, different embodiments, and the details in this specification can be modified or changed in various ways, based on different viewpoints and applications, without departing from the spirit of the present invention. Note that, where they do not conflict, the following embodiments and the features in them can be combined with one another.

Note that the drawings provided in the following embodiments illustrate the basic concept of the present invention only schematically. They show only the components related to the present invention and are not drawn according to the number, shape, and size of the components in an actual implementation. In an actual implementation, the form, number, and proportion of each component may vary freely, and the component layout may be more complex.

Embodiment 1:

This embodiment provides a method for detecting system status, applied to a desktop server. The method includes the following steps: reading, through a complex programmable logic device (CPLD), a currently occurring fault event related to the system status; determining, through the CPLD, whether the currently occurring fault event is already pre-stored in the desktop server; and if so, looking up, through the CPLD, a fault priority corresponding to the currently occurring fault event, and lighting, in a predetermined alarm manner according to that fault priority, an LED health light that matches the fault priority.

If the result of the above determination is no, the currently occurring fault event is classified into a corresponding fault priority.
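The branch between a pre-stored event and a new event can be sketched as a single function. This is an illustrative Python model, not the CPLD code the patent describes; `handle_fault_event`, `prestored`, `classify`, and `light_led` are hypothetical names, and the two callbacks stand in for the classification and LED logic.

```python
from typing import Callable, Dict, Tuple


def handle_fault_event(
    event: str,
    prestored: Dict[str, int],
    classify: Callable[[str], int],
    light_led: Callable[[int], None],
) -> Tuple[int, bool]:
    """Return (priority, led_was_lit) for one currently occurring fault event."""
    if event in prestored:
        # Known event: look up its priority and light the matching LED.
        priority = prestored[event]
        light_led(priority)
        return priority, True
    # Unknown event: only classify it into a fault priority.
    return classify(event), False
```

Passing the classification and LED actions as callbacks keeps the decision itself separate from how a particular board drives its health light.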

The system status detection method provided by this embodiment is described in detail below with reference to the drawings. The method is applied to the desktop server. Because a desktop server has no BMC, the method relies on the design of the CPLD code to display system status information on the health light without BMC management, prompting the user to carry out the corresponding repair and inspection.

Refer to Figure 1, a flowchart of the system status detection method in an embodiment. As shown in Figure 1, the method includes the following steps:

S1: read, through the CPLD, a currently occurring fault event related to the system status. The CPLD evolved from PAL and GAL devices and is a large-scale integrated circuit. It is a digital integrated circuit whose logic functions users construct themselves according to their needs. The basic design approach uses an integrated development software platform and methods such as schematics and hardware description languages to generate the target file, and then transfers the code (the CPLD code) to the target chip through a download cable ("in-system" programming) to realize the designed digital system. In this embodiment, the CPLD consists mainly of programmable logic macrocells (MC; Macro Cell) arranged around a central programmable interconnect matrix. The macrocell structure is relatively complex and has a complex I/O unit interconnect structure, so the user can generate a specific circuit structure as needed to perform the function of detecting system status.

S2: determine, through the CPLD, whether the currently occurring fault event is already pre-stored in the desktop server; if so, execute step S3; if not, execute step S4. In this embodiment, every fault event has a corresponding fault priority, defined as follows: faults related to system power-on are defined as a first fault priority; faults that occur while system processes are running are defined as a second fault priority; faults related to system hardware heat dissipation that cause the system to shut down are defined as a third fault priority; and faults related to system hardware heat dissipation under which the system keeps running are defined as a fourth fault priority. In this embodiment, the first fault priority ranks above the second, the first and second rank above the third, and the first, second, and third rank above the fourth. Accordingly, the fault events related to the system status that are pre-stored in the desktop server are arranged according to these fault priorities.
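One use of this ordering is deciding which priority to act on when several faults are active at once. The text does not spell out such arbitration, so the following Python sketch is an assumption: it picks the most urgent (lowest-numbered) priority among the active events. The event names and the `most_urgent_priority` helper are hypothetical.

```python
from typing import Dict, Iterable, Optional

# Hypothetical pre-stored table: event name -> fault priority (1 is most urgent).
EVENT_PRIORITY: Dict[str, int] = {
    "memory_power_fault": 1,
    "processor_reported_fault": 2,
    "fan_fault": 3,
    "processor_power_overheat": 4,
}


def most_urgent_priority(
    active_events: Iterable[str],
    table: Dict[str, int] = EVENT_PRIORITY,
) -> Optional[int]:
    """Return the most urgent priority among known active events, or None."""
    priorities = [table[event] for event in active_events if event in table]
    # Assumed rule: first > second > third > fourth, so the minimum wins.
    return min(priorities, default=None)
```

Events missing from the table are ignored here because they would take the classification branch instead.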

S3: if the currently occurring fault event is found among the events pre-stored in the desktop server, look up, through the CPLD, the fault priority corresponding to the currently occurring fault event.

Faults related to system power-on belong to the first fault priority, and the corresponding fault events are called first fault events. In this embodiment, the first fault events include a memory power fault event, a processor power fault event, and/or a processor power control error fault event.

Faults that occur while system processes are running belong to the second fault priority, and the corresponding fault events are called second fault events. In this embodiment, the second fault events include a fault event, reported by a processor, that is related to a system process.

Faults related to system hardware heat dissipation that cause the system to shut down belong to the third fault priority, and the corresponding fault events are called third fault events. The third fault events include a system fan fault event, a system temperature sensor overheat fault event, and/or a processor level-one overheat fault event.

In this embodiment, a processor level-one overheat fault event means that the processor temperature is detected to exceed a preset first overheat threshold.

Faults related to system hardware heat dissipation under which the system keeps running belong to the fourth fault priority, and the corresponding fault events are called fourth fault events. The fourth fault events include a processor power overheat fault event and/or a processor level-two overheat fault event.

In this embodiment, a processor level-two overheat fault event means that the processor temperature is detected to exceed a preset second overheat threshold. The first overheat threshold is greater than the second overheat threshold.
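The two-threshold rule can be illustrated with a short Python sketch. The threshold values below are placeholders chosen only for the example (the text gives no numbers), and `classify_cpu_temperature` is a hypothetical helper, not part of the patent.

```python
from typing import Optional

# Placeholder thresholds in degrees Celsius; the source does not specify values.
FIRST_OVERHEAT_C = 100.0   # level-one threshold (third priority, system shuts down)
SECOND_OVERHEAT_C = 85.0   # level-two threshold (fourth priority, system keeps running)


def classify_cpu_temperature(
    temp_c: float,
    first: float = FIRST_OVERHEAT_C,
    second: float = SECOND_OVERHEAT_C,
) -> Optional[str]:
    """Map a processor temperature to an overheat event name, or None."""
    if first <= second:
        raise ValueError("the first threshold must be greater than the second")
    if temp_c > first:
        return "processor_level_one_overheat"
    if temp_c > second:
        return "processor_level_two_overheat"
    return None
```

Checking the higher threshold first ensures that a temperature above both thresholds is reported as the more severe level-one event.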

S5: light, in a predetermined alarm manner according to the corresponding fault priority, the LED health light that matches the fault priority. In this embodiment, the LED health light includes a red light, a yellow light, and a green light.

For example, when one or more fault events corresponding to the first fault priority (first fault events) occur, the red light of the LED health light blinks at a frequency of 4 Hz.

When a second fault event occurs, that is, a processor-reported fault event related to a system process, the red light of the LED health light stays on.

When one or more fault events corresponding to the third fault priority (third fault events) occur, the yellow light of the LED health light blinks at a frequency of 1 Hz.

When one or more fault events corresponding to the fourth fault priority (fourth fault events) occur, the yellow light of the LED health light stays on.
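In programmable logic, blink rates such as 4 Hz and 1 Hz are commonly derived from a reference clock with a counter. The arithmetic can be sketched in Python as follows. The 1 MHz clock and the 50% duty cycle are assumptions made for this example, since the text specifies neither, and both function names are hypothetical.

```python
def half_period_ticks(clock_hz: int, blink_hz: int) -> int:
    """Clock ticks the LED spends in each on or off phase (50% duty assumed)."""
    ticks, remainder = divmod(clock_hz, 2 * blink_hz)
    if ticks == 0 or remainder != 0:
        raise ValueError("clock_hz must be a positive multiple of 2 * blink_hz")
    return ticks


def led_is_on(tick: int, half_period: int) -> bool:
    """The LED is on during even-numbered half periods, off during odd ones."""
    return (tick // half_period) % 2 == 0
```

With an assumed 1 MHz clock, `half_period_ticks(1_000_000, 4)` gives 125000 ticks per phase for the 4 Hz pattern, and `half_period_ticks(1_000_000, 1)` gives 500000 for the 1 Hz pattern.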

S4: if the currently occurring fault event is not found among the events pre-stored in the desktop server, the CPLD classifies it into a corresponding fault priority according to user requirements.

S6: the CPLD stores the currently occurring fault event that was not previously pre-stored in the desktop server.

The system status detection method of this embodiment displays status information without a BMC chip, monitors the status of the whole system, and uses the different colors of the health light to tell users and testers where the system has gone wrong and how to fix it. This improves system working efficiency and meets the needs of a variety of customers.

Embodiment 2:

This embodiment provides a system for detecting system status, applied to a desktop server. The detection system includes a reading module, a search module, an operation module, and a division module.

The reading module reads a currently occurring fault event related to the system status.

A processing module, electrically connected to the reading module, the search module, the operation module, and the division module, determines whether the currently occurring fault event is already pre-stored in the desktop server. If so, it invokes the search module, which looks up a fault priority corresponding to the currently occurring fault event, and the operation module, which lights, in a predetermined alarm manner according to that fault priority, an LED health light matching the fault priority. If not, it invokes the division module, which classifies the currently occurring fault event into a corresponding fault priority.

The system status detection system provided by this embodiment is described in detail below with reference to the drawings. The detection system is applied to a desktop server. Because a desktop server has no BMC, the system relies on the design of the CPLD code to display system status information on the health light without BMC management, prompting the user to carry out the corresponding repair and inspection.

Refer to Figure 2, a schematic structural diagram of the system status detection system in an embodiment. As shown in Figure 2, the system status detection system 1 includes a reading module 11, a processing module 12, a search module 13, an operation module 14, a division module 15, and a storage module 16.

The reading module 11 reads a currently occurring fault event related to the system status.

The processing module 12, connected to the reading module 11, determines whether the currently occurring fault event is already pre-stored in the desktop server; if so, it invokes the search module 13 and the operation module 14; if not, it invokes the division module 15 and the storage module 16. In this embodiment, every fault event has a corresponding fault priority, defined as follows: faults related to system power-on are defined as a first fault priority; faults that occur while system processes are running are defined as a second fault priority; faults related to system hardware heat dissipation that cause the system to shut down are defined as a third fault priority; and faults related to system hardware heat dissipation under which the system keeps running are defined as a fourth fault priority. In this embodiment, the first fault priority ranks above the second, the first and second rank above the third, and the first, second, and third rank above the fourth. Accordingly, the fault events related to the system status that are pre-stored in the desktop server are arranged according to these fault priorities.

The search module 13, electrically connected to the processing module 12, looks up the fault priority corresponding to the currently occurring fault event when that event is found among the events pre-stored in the desktop server.

Faults related to system power-on belong to the first fault priority, and the corresponding fault events are called first fault events. In this embodiment, the first fault events include a memory power fault event, a processor power fault event, and/or a processor power control error fault event.

Faults that occur while system processes are running belong to the second fault priority, and the corresponding fault events are called second fault events. In this embodiment, the second fault events include a fault event, reported by a processor, that is related to a system process.

Faults related to system hardware heat dissipation that cause the system to shut down belong to the third fault priority, and the corresponding fault events are called third fault events. The third fault events include a system fan fault event, a system temperature sensor overheat fault event, and/or a processor level-one overheat fault event.

In this embodiment, a processor level-one overheat fault event means that the processor temperature is detected to exceed a preset first overheat threshold.

Faults related to system hardware heat dissipation under which the system keeps running belong to the fourth fault priority, and the corresponding fault events are called fourth fault events. The fourth fault events include a processor power overheat fault event and/or a processor level-two overheat fault event.

In this embodiment, a processor level-two overheat fault event means that the processor temperature is detected to exceed a preset second overheat threshold. The first overheat threshold is greater than the second overheat threshold.

The operation module 14, electrically connected to the processing module 12, lights, in a predetermined alarm manner according to the corresponding fault priority, an LED health light that matches the fault priority. In this embodiment, the LED health light includes a red light, a yellow light, and a green light.

For example, when one or more fault events corresponding to the first fault priority (first fault events) occur, the operation module 14 makes the red light of the LED health light blink at a frequency of 4 Hz.

When a second fault event occurs, that is, a processor-reported fault event related to a system process, the operation module 14 keeps the red light of the LED health light on.

When one or more fault events corresponding to the third fault priority (third fault events) occur, the operation module 14 makes the yellow light of the LED health light blink at a frequency of 1 Hz.

當一件或多件(至少一)上述的第四故障優先順序對應的故障事件,即第二故障事件發生時,所述操作模組14令所述LED健康燈的黃燈常亮。 When one or more (at least one) fault events corresponding to the fourth fault priority sequence, that is, the second fault event occurs, the operation module 14 causes the yellow light of the LED health light to be always on.
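The four alarm patterns above amount to a small lookup table, sketched below. The names `LedPattern`, `LED_PATTERNS`, and `select_pattern` are hypothetical. The rule that the lowest-numbered priority wins when several priorities are active at once is an assumption, since the text does not say how simultaneous faults of different priorities are resolved. The behavior of the green light is also not specified here, so the sketch returns `None` when no fault is active.

```python
from typing import Iterable, NamedTuple, Optional


class LedPattern(NamedTuple):
    color: str
    blink_hz: Optional[float]  # None means steadily on


# Fault priority -> alarm pattern, as described in the embodiment.
LED_PATTERNS = {
    1: LedPattern("red", 4.0),      # first priority: red, blinking at 4 Hz
    2: LedPattern("red", None),     # second priority: red, steadily on
    3: LedPattern("yellow", 1.0),   # third priority: yellow, blinking at 1 Hz
    4: LedPattern("yellow", None),  # fourth priority: yellow, steadily on
}


def select_pattern(active_priorities: Iterable[int]) -> Optional[LedPattern]:
    """Pick the pattern for the most severe active priority (assumed rule)."""
    active = [p for p in active_priorities if p in LED_PATTERNS]
    if not active:
        return None  # no fault active; green-light behavior is not specified
    return LED_PATTERNS[min(active)]
```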

The dividing module 15, electrically connected to the processing module 12, is used when the currently occurring fault event is not found among the fault events pre-stored in the desktop server: it assigns the currently occurring fault event to a corresponding fault priority according to user requirements.

The storage module 16, electrically connected to the dividing module 15, stores the currently occurring fault event that was not pre-stored in the desktop server.
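The interplay of lookup, dividing, and storage can be sketched as below. This is a software illustration under stated assumptions, not the CPLD logic itself: the class `FaultTable`, the event-name strings, and the `classify` callback (standing in for the user-defined assignment) are all hypothetical.

```python
from typing import Callable, Dict, Tuple


class FaultTable:
    """Known fault events and the fault priority each one belongs to."""

    def __init__(self, known: Dict[str, int]) -> None:
        self._priority_by_event = dict(known)

    def handle(self, event: str, classify: Callable[[str], int]) -> Tuple[int, bool]:
        """Return (priority, was_known) for a currently occurring fault event."""
        if event in self._priority_by_event:
            # Pre-stored event: look up its priority so the LED can be lit.
            return self._priority_by_event[event], True
        # Unknown event: assign a priority per user requirement, then store it.
        priority = classify(event)
        if priority not in (1, 2, 3, 4):
            raise ValueError(f"invalid fault priority: {priority}")
        self._priority_by_event[event] = priority
        return priority, False
```

Returning `was_known` lets the caller light the LED only for pre-stored events, while a newly assigned event is simply recorded and then recognized on its next occurrence.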

This embodiment further provides a server 2. Refer to the third figure, which is a schematic structural diagram of the server in one embodiment. As shown in the third figure, the server 2 includes the system status detection system 1 described above. In this embodiment, the functions of the system status detection system 1 are implemented by a complex programmable logic device. Specifically, the server 2 may be a desktop server in this embodiment.

In summary, the system status detection method, system, and server of the present invention display status information without relying on a BMC chip. They detect the status of the entire system and use the different colors of the health light to tell users and testers where the system has failed and how to resolve it. This greatly improves system working efficiency and meets the needs of various customers. The present invention therefore effectively overcomes the shortcomings of the prior art and has high industrial utility value.

The above detailed description of the preferred embodiments is intended to describe the features and spirit of the present invention more clearly, not to limit the scope of the present invention to the preferred embodiments disclosed above. On the contrary, the intent is to cover various changes and equivalent arrangements within the scope of the claims of the present invention.

Claims (10)

1. A system status detection method, applied to a desktop server, comprising the following steps: (a) reading, through a complex programmable logic device (CPLD), a currently occurring fault event related to the system status; (b) determining, through the CPLD, whether the currently occurring fault event is pre-stored in the desktop server; (c) when the result of step (b) is yes, looking up, through the CPLD, a fault priority corresponding to the currently occurring fault event, and lighting, in a predetermined alarm manner according to the corresponding fault priority, an LED health light matching the fault priority; and (d) when the result of step (b) is no, assigning the currently occurring fault event to a corresponding fault priority.

2. The system status detection method of claim 1, wherein, among the fault priorities, a fault related to system power-on is defined as a first fault priority; a fault occurring while a system process is running is defined as a second fault priority; a fault related to system hardware heat dissipation that causes the system to shut down is defined as a third fault priority; and a fault related to system hardware heat dissipation in which the system remains running is defined as a fourth fault priority.

3. The system status detection method of claim 2, wherein the fault events corresponding to the first fault priority include at least one of a memory power fault event, a processor power fault event, and a processor power control error fault event; and when at least one fault event corresponding to the first fault priority occurs, the red light of the LED health light blinks at 4 Hz.

4. The system status detection method of claim 2, wherein the fault events corresponding to the second fault priority include a fault event related to a system process and reported by a processor; and when the fault event related to the system process and reported by the processor occurs, the red light of the LED health light is steadily on.

5. The system status detection method of claim 2, wherein the fault events corresponding to the third fault priority include at least one of a system fan fault event, a system temperature sensor overheat fault event, and a processor level-1 overheat fault event; and when at least one fault event corresponding to the third fault priority occurs, the yellow light of the LED health light blinks at 1 Hz.

6. The system status detection method of claim 2, wherein the fault events corresponding to the fourth fault priority include at least one of a processor power-supply overheat fault event and a processor level-2 overheat fault event; and when at least one fault event corresponding to the fourth fault priority occurs, the yellow light of the LED health light is steadily on.

7. A system status detection system, applied to a desktop server, comprising: a reading module for reading a currently occurring fault event related to the system status; a lookup module; an operation module; a dividing module; and a processing module, electrically connected to the reading module, the lookup module, the operation module, and the dividing module, for determining whether the currently occurring fault event is pre-stored in the desktop server; if so, the processing module invokes the lookup module, which looks up a fault priority corresponding to the currently occurring fault event, and the operation module, which lights, in a predetermined alarm manner according to the corresponding fault priority, an LED health light matching the fault priority; if not, the processing module invokes the dividing module, which assigns the currently occurring fault event to a corresponding fault priority.

8. The system status detection system of claim 7, further comprising a storage module connected to the dividing module, wherein the storage module stores the currently occurring fault event after the currently occurring fault event has been assigned to the corresponding fault priority.

9. A server, comprising the system status detection system of any one of claims 7 to 8.

10. The server of claim 9, wherein the server is a desktop server.
TW104142449A 2015-12-17 2015-12-17 Method, system, and server for detecting system status TW201723839A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW104142449A TW201723839A (en) 2015-12-17 2015-12-17 Method, system, and server for detecting system status

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW104142449A TW201723839A (en) 2015-12-17 2015-12-17 Method, system, and server for detecting system status

Publications (1)

Publication Number Publication Date
TW201723839A true TW201723839A (en) 2017-07-01

Family

ID=60047352

Family Applications (1)

Application Number Title Priority Date Filing Date
TW104142449A TW201723839A (en) 2015-12-17 2015-12-17 Method, system, and server for detecting system status

Country Status (1)

Country Link
TW (1) TW201723839A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI803628B (en) * 2019-04-29 2023-06-01 安圖斯科技股份有限公司 Warning light control method and electronic device
