TWI715005B - Monitor method for demand of a bmc - Google Patents

Monitor method for demand of a bmc Download PDF

Info

Publication number
TWI715005B
TWI715005B TW108112080A TW108112080A TWI715005B TW I715005 B TWI715005 B TW I715005B TW 108112080 A TW108112080 A TW 108112080A TW 108112080 A TW108112080 A TW 108112080A TW I715005 B TWI715005 B TW I715005B
Authority
TW
Taiwan
Prior art keywords
management controller
baseboard management
resident program
processing module
resident
Prior art date
Application number
TW108112080A
Other languages
Chinese (zh)
Other versions
TW202038093A (en
Inventor
林裕敦
楊順傑
Original Assignee
神雲科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 神雲科技股份有限公司 filed Critical 神雲科技股份有限公司
Priority to TW108112080A priority Critical patent/TWI715005B/en
Publication of TW202038093A publication Critical patent/TW202038093A/en
Application granted granted Critical
Publication of TWI715005B publication Critical patent/TWI715005B/en

Links

Images

Abstract

A monitor method for demand of a BMC is implemented by a computer device including a processing module and a BMC, the BMC executing a plurality of demand, the method comprising: (A1) the BMC generates a demand execution table, the demand execution table includes a plurality of demand codes corresponding to the resident demand, and a surviving value corresponding to each demand, each surviving value is updated when the corresponding demand is executed. (A2) The processing module determines, according to the demand execution table, whether there is at least one abnormal demand in the demand. (A3) When it is determined that there is at least one abnormal demand, the processing module notifies the BMC to restart the abnormal demand.

Description

用於監控基板管理控制器之常駐程序的方法 Method for monitoring resident program of substrate management controller

本發明是有關於一種系統檢測方法,特別是指一種能自動分辨出基板管理控制器當前之常駐程序是否正常執行的方法。 The present invention relates to a system detection method, in particular to a method that can automatically distinguish whether the current resident program of the baseboard management controller is normally executed.

目前在電腦裝置中的基板管理控制器(BMC,Baseboard management controller)在開機運行階段偶爾會發生重要的常駐程序(demand)無法正常運作的情況,此時,雖然基板管理控制器的運行燈號是正常的,但實際上已無法提供完整的服務,使得BMC的監控功能無法確實運作。因此,在電腦裝置完成開機後,管理者透過電腦裝置發送訊號並發現常駐程序已無法正常運作時,僅能將基板管理控制器重新開機,才能使基板管理控制器的常駐程序恢復正常運作。 At present, the baseboard management controller (BMC, Baseboard management controller) in the computer device may occasionally encounter the situation that important resident programs (demand) cannot operate normally during the startup stage. At this time, although the operating light of the baseboard management controller is Normal, but in fact it has been unable to provide a complete service, making the BMC monitoring function unable to operate reliably. Therefore, after the computer device is completely booted, when the administrator sends a signal through the computer device and finds that the resident program is no longer operating normally, the baseboard management controller can only be restarted to restore the resident program of the baseboard management controller to normal operation.

有鑑於此,故如何提供一種可有效於電腦裝置開機階段監控基板管理控制器所執行之常駐程序是否正常運作並解決無法正常運作之常駐程序的問題,即為本創作所欲解決之首要課題。 In view of this, how to provide a method that can effectively monitor whether the resident program executed by the baseboard management controller is operating normally during the booting phase of the computer device and solve the problem of the resident program that cannot operate normally is the primary problem that this author intends to solve.

因此,本發明的目的,即在提供可有效於電腦裝置開機階段監控基板管理控制器所執行之常駐程序是否正常運作並解決無法正常運作常駐程序的方法。 Therefore, the object of the present invention is to provide a method that can effectively monitor whether the resident program executed by the baseboard management controller is operating normally during the booting phase of the computer device and solve the problem of the resident program not operating normally.

於是,本發明用於監控基板管理控制器之常駐程序的方法,藉由一電腦裝置來實施,該電腦裝置包含一處理模組及一電連接該處理模組的基板管理控制器,該基板管理控制器用於執行多個常駐程序,該用於監控基板管理控制器之常駐程序的方法包含一步驟(A1)、一步驟(A2),以及一步驟(A3)。 Therefore, the method of the present invention for monitoring the resident program of a substrate management controller is implemented by a computer device, the computer device includes a processing module and a substrate management controller electrically connected to the processing module, the substrate management The controller is used to execute multiple resident programs, and the method for monitoring the resident programs of the baseboard management controller includes one step (A1), one step (A2), and one step (A3).

該步驟(A1)是藉由該基板管理控制器,產生一常駐程序執行表,該常駐程序執行表包含多個對應該等常駐程序的程序代碼,以及每一常駐程序所對應的一存活值,每一存活值在所對應之常駐程序被該基板管理控制器正常執行時會被更新。 The step (A1) is to generate a resident program execution table by the baseboard management controller. The resident program execution table includes a plurality of program codes corresponding to the resident programs and a survival value corresponding to each resident program. Each survival value will be updated when the corresponding resident program is normally executed by the baseboard management controller.

該步驟(A2)是藉由該處理模組,讀取該常駐程序執行表中該等常駐程序所對應之存活值,並根據前次所讀取到之該等常駐程序所對應之存活值與當前所讀取到之該等常駐程序所對應之存活值判定該等常駐程序中是否存在至少一運作異常的常駐程序,其中每一運作異常的常駐程序在前次被讀取到之存活值相同於其在當前被讀取到之存活值。 This step (A2) uses the processing module to read the survival values corresponding to the resident programs in the resident program execution table, and based on the survival values corresponding to the resident programs read last time and The survival value corresponding to the resident programs currently read determines whether there is at least one resident program with abnormal operation in the resident programs, wherein each resident program with abnormal operation has the same survival value read last time For its survival value currently read.

該步驟(A3)是當該處理模組判定出存在該至少一運作異常的常駐程序時,藉由該處理模組,通知該基板管理控制器,以致該基板管理控制器重新啟動每一運作異常的常駐程序。 The step (A3) is that when the processing module determines that there is at least one resident program of the abnormal operation, the processing module informs the baseboard management controller, so that the baseboard management controller restarts each abnormal operation Resident program.

本發明之功效在於:藉由該電腦裝置根據所產生的常駐程序執行表中的每一常駐程序及其對應之存活值,並檢測每一常駐程序之狀態是否為正常執行,而當檢測出存在該至少一運作異常的常駐程序無法正常執行時,則將該至少一運作異常的常駐程序重新啟動,有效地達成在無需將該基板管理控制器重新開機之情況下,便可使該至少一運作異常的常駐程序重新正常執行,進而使該基板管理控制器正常運作。 The effect of the present invention is that the computer device executes each resident program in the generated resident program table and its corresponding survival value, and detects whether the status of each resident program is normally executed, and when it is detected When the at least one abnormally operating resident program cannot be executed normally, restart the at least one abnormally operating resident program, effectively achieving the at least one operating without restarting the baseboard management controller The abnormal resident program is re-executed normally, and the baseboard management controller operates normally.

1:電腦裝置 1: computer device

11:基板管理控制器 11: baseboard management controller

12:記憶模組 12: Memory module

121:基本輸入輸出程式 121: Basic Input Output Program

13:處理模組 13: Processing module

14:平台路徑控制器 14: Platform Path Controller

91~93:步驟 91~93: Step

21~23:步驟 21~23: Steps

31~36:步驟 31~36: Step

41~45:步驟 41~45: Steps

51~52:步驟 51~52: Steps

61~64:步驟 61~64: Step

71~73:步驟 71~73: Steps

本發明的其他的特徵及功效,將於參照圖式的實施方式中清楚地呈現,其中:圖1是一方塊圖,說明一執行本發明用於監控基板管理控制器之常駐程序的方法的一實施例的電腦裝置;圖2是一流程圖,說明該實施例的一資料讀取程序;圖3是一流程圖,說明該實施例的一常駐程序重啟程序;圖4是一流程圖,說明該實施例的一第一基板管理控制器重啟程序; 圖5是一流程圖,說明該實施例的一第一異常記錄產生程序;圖6是一流程圖,說明該實施例的一溝通介面重啟程序;圖7是一流程圖,說明該實施例的一第二基板管理控制器重啟程序;及圖8是一流程圖,說明該實施例的一第二異常記錄產生程序。 Other features and effects of the present invention will be clearly presented in the embodiments with reference to the drawings, in which: FIG. 1 is a block diagram illustrating a method for executing the resident program of the present invention for monitoring the baseboard management controller Figure 2 is a flowchart illustrating a data reading procedure of this embodiment; Figure 3 is a flowchart illustrating a resident program restart procedure of this embodiment; Figure 4 is a flowchart illustrating A first baseboard management controller restart program of this embodiment; Fig. 5 is a flowchart illustrating a first abnormal record generation procedure of this embodiment; Fig. 6 is a flowchart illustrating a communication interface restart procedure of this embodiment; Fig. 7 is a flowchart illustrating the procedure of this embodiment A second BMC restart procedure; and FIG. 8 is a flowchart illustrating a second abnormal record generation procedure of this embodiment.

在本發明被詳細描述之前,應當注意在以下的說明內容中,類似的元件是以相同的編號來表示。 Before the present invention is described in detail, it should be noted that in the following description, similar elements are represented by the same numbers.

參閱圖1,執行本發明用於監控基板管理控制器之常駐程序的方法之一實施例之一電腦裝置1,該電腦裝置1包含一基板管理控制器11(BMC,Baseboard management controller)、一儲存有一基本輸入輸出程式121(BIOS,Basic Input/Output System)的記憶模組12、一處理模組13,以及一電連接該基板管理控制器11、該記憶模組12及該處理模組13的平台路徑控制器14(PCH,Platform Controller Hub)。 Referring to FIG. 1, a computer device 1 is one of the embodiments of the method for executing a resident program for monitoring a baseboard management controller of the present invention. The computer device 1 includes a baseboard management controller 11 (BMC, Baseboard management controller), a storage There is a basic input/output program 121 (BIOS, Basic Input/Output System) memory module 12, a processing module 13, and an electrical connection to the baseboard management controller 11, the memory module 12 and the processing module 13 Platform path controller 14 (PCH, Platform Controller Hub).

值得特別說明的是,在該實施例中,該平台路徑控制器14係透過多個通用型之輸入輸出接腳(GPIO,General-purpose input/output)連接該基板管理控制器11。該等通用型之輸入輸出接腳之其中任一者皆可用於重啟對應的該常駐程式,或用於重啟該 基板管理控制器11。 It is worth noting that in this embodiment, the platform path controller 14 is connected to the baseboard management controller 11 through a plurality of general-purpose input/output (GPIO) pins. Any one of these universal input and output pins can be used to restart the corresponding resident program, or to restart the Baseboard management controller 11.

特別地,在該實施例中,該等通用型之輸入輸出接腳之其中一第一條通用型之輸入輸出接腳連接至該基板管理控制器11的通用型之輸入輸出接腳,該平台路徑控制器14可透過該第一條通用型之輸入輸出接腳傳輸一指示出需重啟相關於一溝通介面的第一重啟訊號。該等通用型之輸入輸出接腳之其中一第二條通用型之輸入輸出接腳連接至該基板管理控制器11的重新啟動接腳(Reset Pin),該平台路徑控制器14可透過該第二條通用型之輸入輸出接腳傳輸一指示出需重啟該基板管理控制器11的第二重啟訊號以強制該基板管理控制器11進行自身的重新啟動,也就是說,以該第二重啟訊號觸發基板管理控制器11進行自身的重新啟動。而在另一實施例中,該等通用型之輸入輸出接腳中之該第一條通用型之輸入輸出接腳及該第二條通用型之輸入輸出接腳以外的至少一第三通用型之輸入輸出接腳,還分別連接至該基板管理控制器11的通用型之輸入輸出接腳,該平台路徑控制器14可透過每一第三條通用型之輸入輸出接腳傳輸一指示出需重啟相關於一對應之該常駐程序的第三重啟訊號。 Particularly, in this embodiment, one of the first universal input and output pins of the universal input and output pins is connected to the universal input and output pins of the baseboard management controller 11, and the platform The path controller 14 can transmit a first restart signal indicating that a communication interface needs to be restarted through the first universal input/output pin. One of the general-purpose input and output pins, the second general-purpose input and output pin is connected to the reset pin of the baseboard management controller 11, and the platform path controller 14 can pass through the first Two general-purpose input and output pins transmit a second restart signal indicating that the baseboard management controller 11 needs to be restarted to force the baseboard management controller 11 to restart itself, that is, with the second restart signal The baseboard management controller 11 is triggered to restart itself. In another embodiment, among the universal input and output pins, at least one third universal type other than the first universal input and output pin and the second universal input and output pin The input and output pins are also respectively connected to the general-purpose input and output pins of the baseboard management controller 11. The platform path controller 14 can transmit an indication of the demand through every third general-purpose input and output pins. The restart is related to a third restart signal corresponding to the resident program.

在該實施例中,該處理模組13係經由該溝通介面,例如為一智慧型平台管理介面(IPMI,Intelligent Platform Management Interface)且透過該平台路徑控制器14與該基板管 理控制器11進行訊息、指令或資料的傳輸,但不以此為限。值得特別說明的是,該溝通介面還可以是例如為可透過基體電路匯流排(I2C,Inter-Integrated Circuit)或系統管理匯流排(SMBus,System Management Bus)傳送的智慧型平台管理介面、鍵盤控制器規格介面(KCS,keyboard controller style)、單一區塊傳輸介面(BT,block transfer),但不以此為限。 In this embodiment, the processing module 13 passes through the communication interface, for example, an Intelligent Platform Management Interface (IPMI) and communicates with the substrate management interface through the platform path controller 14 The management controller 11 transmits messages, instructions or data, but not limited to this. It is worth noting that the communication interface can also be, for example, a smart platform management interface and keyboard control that can be transmitted through a base circuit bus (I2C, Inter-Integrated Circuit) or a system management bus (SMBus, System Management Bus). Device specification interface (KCS, keyboard controller style), single block transfer interface (BT, block transfer), but not limited to this.

在該實施例中,該電腦裝置1之實施態樣例如為一個人電腦、一伺服器或一雲端主機,但不以此為限。 In this embodiment, the implementation aspect of the computer device 1 is, for example, a personal computer, a server, or a cloud host, but it is not limited to this.

在該實施例中,該記憶模組12之實施態樣例如為一揮發性記憶體、一非揮發性記憶體或其組合,但不以此為限。 In this embodiment, the implementation of the memory module 12 is, for example, a volatile memory, a non-volatile memory or a combination thereof, but it is not limited thereto.

在該實施例中,該處理模組13之實施態樣例如為一中央處理器(CPU,Central Processing Unit),但不以此為限。該處理模組13亦可與該平台路徑控制器14共同存在於一系統單晶片(SoC,System on a chip)的實施樣態,但不以此為限。 In this embodiment, the implementation of the processing module 13 is, for example, a central processing unit (CPU), but it is not limited to this. The processing module 13 and the platform path controller 14 can also coexist in a system on a chip (SoC) implementation, but not limited to this.

以下將藉由本發明用於監控基板管理控制器之常駐程序的方法之該實施例來說明該電腦裝置1之該基板管理控制器11、該記憶模組12、該處理模組13,以及該平台路徑控制器14各元件的運作細節,其中用於執行該用於監控基板管理控制器之常駐程序的方法之相關程式係整合於該基本輸入輸出程式121中並於開機時由該處理模組13所載入執行,本發明用於監控基板管理控制器之常駐 程序的方法包含一資料讀取程序、一常駐程序重啟程序、一第一基板管理控制器重啟程序、一第一異常記錄產生程序、一溝通介面重啟程序、一第二基板管理控制器重啟程序,以及一第二異常記錄產生程序。 Hereinafter, the baseboard management controller 11, the memory module 12, the processing module 13, and the platform of the computer device 1 will be described by the embodiment of the method for monitoring the resident program of the baseboard management controller of the present invention The operation details of the components of the path controller 14, wherein the relevant program for executing the method for monitoring the resident program of the baseboard management controller is integrated in the basic input and output program 121 and used by the processing module 13 when it is turned on The loaded execution, the present invention is used to monitor the resident of the baseboard management controller The program method includes a data reading program, a resident program restart program, a first BMC restart program, a first abnormal record generation program, a communication interface restart program, and a second BMC restart program, And a second abnormal record generating program.

參閱圖2,該資料讀取程序係讀取相關於一常駐程序執行表的資料,並包含一步驟91、一步驟92,以及一步驟93。 Referring to FIG. 2, the data reading program reads data related to a resident program execution table, and includes a step 91, a step 92, and a step 93.

在該步驟91中,當該電腦裝置1中之該基板管理控制器11初始化時,該基板管理控制器11產生該常駐程序執行表,並儲存該常駐程序執行表。其中,該常駐程序執行表包含多個對應該等常駐程序的程序代碼,以及每一常駐程序所對應的一存活值(heartbeat)。其中,每一存活值在所對應之常駐程序於正常執行時會被該基板管理控制器11週期性地更新。 In step 91, when the baseboard management controller 11 in the computer device 1 is initialized, the baseboard management controller 11 generates the resident program execution table and stores the resident program execution table. Wherein, the resident program execution table includes a plurality of program codes corresponding to the resident programs, and a heartbeat corresponding to each resident program. Among them, each survival value is periodically updated by the baseboard management controller 11 when the corresponding resident program is executed normally.

在該步驟92中,該處理模組13產生一相關於該常駐程序執行表的讀取請求,並透過該平台路徑控制器14傳送至該基板管理控制器11。 In the step 92, the processing module 13 generates a read request related to the resident program execution table, and transmits it to the baseboard management controller 11 through the platform path controller 14.

在該步驟93中,該處理模組13判定有無接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料。當該處理模組13判定出有接收到該基板管理控制器11回應傳送的相關於該常駐程序執行表的資料,則進行該常駐程序重啟程序;當該處理模組13判定出沒有接收到該基板管理控制器11回應傳送的相關於該常駐 程序執行表的資料,則進行該溝通介面重啟程序。需進行該溝通介面重啟程序即表示有可能是因為該溝通介面故障而造成該基板管理控制器11無法正確的接收經由該溝通介面傳送的該相關於該常駐程序執行表的讀取請求或是該基板管理控制器11無法正確的經由該溝通介面回應傳送該相關於該常駐程序執行表的資料。值得特別說明的是,該處理模組13係週期性地傳送該相關於該常駐程序執行表的讀取請求並判斷是否有在一傳送週期內接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,進而接著讀取所接收的相關於該常駐程序執行表的資料。 In the step 93, the processing module 13 determines whether it has received the data related to the resident program execution table sent by the baseboard management controller 11. When the processing module 13 determines that it has received the data related to the resident program execution table sent by the baseboard management controller 11 in response, it performs the resident program restart procedure; when the processing module 13 determines that the resident program execution table has not been received The baseboard management controller 11 responds to the resident For the data in the program execution table, the communication interface restarts the program. The need to restart the communication interface means that the BMC 11 may not be able to correctly receive the read request related to the resident program execution table transmitted through the communication interface or the read request related to the resident program execution table or the communication interface failure. The baseboard management controller 11 cannot correctly send the data related to the resident program execution table through the communication interface. It is worth noting that the processing module 13 periodically transmits the read request related to the resident program execution table and determines whether there is a transmission period related to the baseboard management controller 11 that has been received. The data of the resident program execution table is read, and then the received data related to the resident program execution table is read.

參閱圖3,該常駐程序重啟程序係重啟所有運作異常的常駐程序,並包含一步驟21、一步驟22,以及一步驟23。 Referring to FIG. 3, the resident program restarting procedure is to restart all resident programs that operate abnormally, and includes a step 21, a step 22, and a step 23.

在該步驟21中,該處理模組13讀取相關於該常駐程序執行表的資料,以獲得該等常駐程序所分別對應之存活值,且根據前次所讀取到之該等常駐程序所分別對應之存活值與當前所讀取到之該等常駐程序所分別對應之存活值判定該等常駐程序中是否存在至少一運作異常的常駐程序,其中每一運作異常的常駐程序在前次被讀取到之存活值相同於其在當前被讀取到之存活值。當該處理模組13判定出存在該至少一運作異常的常駐程序時,進行流程步驟22;當該處理模組13判定出不存在任何運作異常的常駐程序時,流程結束。值得特別說明的是,該處理模組13係週期性地讀取相關 於該常駐程序執行表的資料,以獲得並儲存前次所讀取到之該等常駐程序所分別對應之存活值與當前所讀取到之該等常駐程序所分別對應之存活值。 In the step 21, the processing module 13 reads the data related to the resident program execution table to obtain the survival values corresponding to the resident programs, and according to the resident program values read last time Respectively corresponding to the survival value and the survival value corresponding to the currently read resident programs to determine whether there is at least one resident program with abnormal operation in the resident programs, wherein each resident program with abnormal operation was previously The survival value read is the same as the survival value currently read. When the processing module 13 determines that there is at least one resident program with an abnormal operation, step 22 is performed; when the processing module 13 determines that there is no resident program with an abnormal operation, the process ends. It is worth noting that the processing module 13 periodically reads the relevant The data in the resident program execution table is used to obtain and store the survival values corresponding to the resident programs previously read and the survival values corresponding to the resident programs currently read.

在該步驟22中,對於每一運作異常的常駐程序,該處理模組13產生一指示出該運作異常的常駐程序運作異常的通知訊息並透過該平台路徑控制器14傳送至該基板管理控制器11。值得特別說明的是,在另一實施例之該步驟22中,對於每一運作異常的常駐程序,該處理模組13觸發該平台路徑控制器14產生指示出重啟該運作異常的常駐程序的該第三重啟訊號,並經由該第三條通用型之輸入輸出接腳傳送至該基板管理控制器11。其中,該處理模組13是藉由產生並傳送一指示出重啟該運作異常的常駐程序的第三重啟命令至該平台路徑控制器14,以觸發該平台路徑控制器14產生並傳送指示出重啟該運作異常的常駐程序的該第三重啟訊號。此外,該處理模組13也可以直接將指示出重啟該運作異常的常駐程序的一參數寫入該平台路徑控制器14的一指示出重啟該運作異常的常駐程序的暫存器,以觸發該平台路徑控制器14產生並傳送指示出重啟該運作異常的常駐程序的該第三重啟訊號。 In this step 22, for each resident program of abnormal operation, the processing module 13 generates a notification message indicating the abnormal operation of the resident program and transmits it to the baseboard management controller through the platform path controller 14. 11. It is worth noting that in the step 22 of another embodiment, for each resident program of abnormal operation, the processing module 13 triggers the platform path controller 14 to generate the instruction to restart the resident program of the abnormal operation The third restart signal is transmitted to the baseboard management controller 11 through the third universal input/output pin. Wherein, the processing module 13 generates and transmits a third restart command indicating restarting the resident program of the abnormal operation to the platform path controller 14, so as to trigger the platform path controller 14 to generate and transmit instructions to restart The third restart signal of the resident program that operates abnormally. In addition, the processing module 13 can also directly write a parameter indicating the restart of the resident program of the abnormal operation into a register of the platform path controller 14 indicating the restart of the resident program of the abnormal operation to trigger the The platform path controller 14 generates and transmits the third restart signal indicating restarting the resident program of the abnormal operation.

在該子步驟23中,對於每一通知訊息,該基板管理控制器11在接收到該通知訊息後,重新啟動該通知訊息所對應的該運作異常的常駐程序。值得特別說明的是,在另一實施例之該步驟23 中,對於每一第三重啟訊號,該基板管理控制器11在接收到該第三重啟訊號後,重新啟動該第三重啟訊號所對應的該運作異常的常駐程序。 In the sub-step 23, for each notification message, the baseboard management controller 11 restarts the resident program of the abnormal operation corresponding to the notification message after receiving the notification message. It is worth noting that in another embodiment, step 23 For each third restart signal, after receiving the third restart signal, the BMC 11 restarts the resident program of the abnormal operation corresponding to the third restart signal.

參閱圖4,該第一基板管理控制器重啟程序係在執行該常駐程序重啟程序後,若重新啟動每一運作異常的常駐程序後,仍無法使所有運作異常的常駐程序正常運作時,即會重啟該基板管理控制器11,該第一基板管理控制器重啟程序包含一步驟31、一步驟32、一步驟33、一步驟34、一步驟35,以及一步驟36。 Referring to Figure 4, the first baseboard management controller restart procedure is executed after the resident program restart procedure, if after restarting each abnormal resident program, all the resident programs that are abnormally functioning are still unable to operate normally. To restart the BMC 11, the first BMC restart procedure includes a step 31, a step 32, a step 33, a step 34, a step 35, and a step 36.

在該步驟31中,對於每一運作異常的常駐程序,在該處理模組13傳送對應的該通知訊息且經過一第一預設時間後,對於每一運作異常的常駐程序,該處理模組13產生一相關於該運作異常的常駐程序的第一確認請求並透過該平台路徑控制器14傳送至該基板管理控制器11。特別地,該第一預設時間係為預設該至少一運作異常的常駐程序重啟完成所需的時間。值得特別說明的是,在另一實施例之該步驟31中,對於每一運作異常的常駐程序,在該處理模組13觸發傳送對應的該第三重啟訊號且經過該第一預設時間後,對於每一運作異常的常駐程序,該處理模組13產生相關於該運作異常的常駐程序的該第一確認請求並透過該平台路徑控制器14傳送至該基板管理控制器11。 In this step 31, for each resident program of abnormal operation, after the processing module 13 transmits the corresponding notification message and a first preset time has elapsed, for each resident program of abnormal operation, the processing module 13 generates a first confirmation request related to the resident program of the abnormal operation and transmits it to the baseboard management controller 11 through the platform path controller 14. In particular, the first predetermined time is a predetermined time required for the restart of the at least one abnormally operating resident program. It is worth noting that in the step 31 of another embodiment, for each resident program with abnormal operation, after the processing module 13 triggers the transmission of the corresponding third restart signal and the first preset time has elapsed For each resident program of abnormal operation, the processing module 13 generates the first confirmation request related to the resident program of the abnormal operation and transmits it to the baseboard management controller 11 through the platform path controller 14.

在該步驟32中,對於每一第一確認請求,該基板管理控 制器11在接收到該第一確認請求後,判定該第一確認請求所對應之運作異常的常駐程序是否回復運作正常的運作狀態,其中該基板管理控制器11係根據該第一確認請求所對應之該運作異常的常駐程序的該存活值是否隨時間改變來判定其運作狀態是否回復運作正常,當判定出其所對應的該存活值有隨時間改變,則表示該第一確認請求所對應之該運作異常的常駐程序已回復運作正常的運作狀態,反之,則為未回復運作正常的運作狀態,也就是運作異常的運作狀態。當該基板管理控制器11判定出該第一確認請求所對應之運作異常的常駐程序回復運作正常,則進行流程步驟33;當該基板管理控制器11判定出該第一確認請求所對應之運作異常的常駐程序仍為運作異常,則流程結束。 In this step 32, for each first confirmation request, the substrate management control After receiving the first confirmation request, the controller 11 determines whether the abnormal resident program corresponding to the first confirmation request returns to a normal operating state, wherein the baseboard management controller 11 is based on the first confirmation request. Whether the survival value of the resident program corresponding to the abnormal operation changes over time is used to determine whether its operating state returns to normal operation. When it is determined that the corresponding survival value has changed over time, it means that the first confirmation request corresponds to The resident program with abnormal operation has returned to a normal operating state, otherwise, it has not returned to a normal operating state, that is, an abnormal operating state. When the baseboard management controller 11 determines that the abnormal resident program corresponding to the first confirmation request is operating normally, it proceeds to step 33; when the baseboard management controller 11 determines the operation corresponding to the first confirmation request If the abnormal resident program still operates abnormally, the process ends.

在該步驟33中,該基板管理控制器11產生一相關於該步驟32中之該回復正常運作之運作異常的常駐程序的第一確認回覆並透過該平台路徑控制器14傳送至該處理模組13。 In the step 33, the baseboard management controller 11 generates a first confirmation response related to the abnormal resident program that returns to normal operation in the step 32 and transmits it to the processing module through the platform path controller 14 13.

在該步驟34中,該處理模組13判定在發送該第一確認請求後的一第二預設時間內是否有接收到該基板管理控制器11所回傳之回應於每一第一確認請求的一第一確認回覆。當該處理模組13判定出在該第二預設時間內有接收到該基板管理控制器11所回傳之對應每一第一確認請求的該第一確認回覆,則流程結束;當該處理模組13判定出在該第二預設時間內沒有接收到該基板管理控制 器11所回傳之對應每一第一確認請求的該第一確認回覆,則進行流程步驟35。 In the step 34, the processing module 13 determines whether a response from the baseboard management controller 11 is received within a second preset time after the first confirmation request is sent for each first confirmation request A first confirmation reply from. When the processing module 13 determines that the first confirmation reply corresponding to each first confirmation request returned by the baseboard management controller 11 is received within the second preset time, the process ends; when the processing The module 13 determines that the substrate management control is not received within the second preset time For the first confirmation reply corresponding to each first confirmation request returned by the device 11, the process step 35 is performed.

在該步驟35中,該處理模組13觸發該平台路徑控制器14產生指示出重啟該基板管理控制器11的該第二重啟訊號,並經由該第二條通用型之輸入輸出接腳傳送至該基板管理控制器11,並將每一不存在所對應之第一確認回覆的第一確認請求所相關的運作異常的常駐程序作為一待觀察常駐程序。其中,該處理模組13是藉由產生並傳送一指示出重啟該基板管理控制器11的第二重啟訊號至該平台路徑控制器14,以觸發該平台路徑控制器14產生並傳送指示出重啟該基板管理控制器11的該第二重啟訊號。此外,該處理模組13也可以直接將指示出重啟該基板管理控制器11的一參數寫入該平台路徑控制器14的一指示出重啟該基板管理控制器11的暫存器,以觸發該平台路徑控制器14產生並傳送指示出重啟該基板管理控制器11的該第二重啟訊號。其中,每一不存在所對應之第一確認回復的第一確認請求,係在該第二預設時間內沒有接收到該基板管理控制器11所回傳之該第一確認回覆所對應之第一確認請求。 In step 35, the processing module 13 triggers the platform path controller 14 to generate the second restart signal indicating to restart the baseboard management controller 11, and transmits it to the second universal input/output pin The baseboard management controller 11 regards the resident program of the abnormal operation related to each first confirmation request for which there is no corresponding first confirmation reply as a resident program to be observed. The processing module 13 generates and transmits a second restart signal indicating to restart the baseboard management controller 11 to the platform path controller 14 to trigger the platform path controller 14 to generate and transmit the instruction to restart The second restart signal of the baseboard management controller 11. In addition, the processing module 13 can also directly write a parameter indicating restarting the baseboard management controller 11 into a register of the platform path controller 14 indicating restarting the baseboard management controller 11 to trigger the The platform path controller 14 generates and transmits the second restart signal indicating to restart the baseboard management controller 11. Wherein, for each first confirmation request that does not have a corresponding first confirmation reply, the first confirmation request corresponding to the first confirmation reply returned by the baseboard management controller 11 is not received within the second preset time One confirmation request.

在該步驟36中,該基板管理控制器11在接收到該第二重啟訊號後,重新啟動該基板管理控制器11。 In this step 36, the baseboard management controller 11 restarts the baseboard management controller 11 after receiving the second restart signal.

參閱圖5,該第一異常記錄產生程序係在執行該第一基板管理控制器重啟程序後,若重新啟動該基板管理控制器11後,仍無 法使所有待觀察常駐程序正常運作時,即會產生相關於仍運作異常之待觀察常駐程序的一第一異常記錄,該第一異常記錄產生程序包含一步驟41、一步驟42、一步驟43、一步驟44,以及一步驟45。 Referring to FIG. 5, the first abnormal record generation program is executed after the first baseboard management controller restart program, if the baseboard management controller 11 is restarted, there is still no When the method makes all the resident programs to be observed operate normally, a first abnormality record related to the resident program to be observed that is still operating abnormally will be generated. The first abnormality record generation procedure includes a step 41, a step 42, and a step 43 , A step 44, and a step 45.

在該步驟41中,在該處理模組13傳送該第二重啟訊號且經過一第三預設時間後,對於每一待觀察常駐程序,該處理模組13產生一相關於該待觀察常駐程序的第二確認請求並透過該平台路徑控制器14傳送至該基板管理控制器11。特別地,該第三預設時間係預設為等待該基板管理控制器11重啟完成所需的時間。 In step 41, after the processing module 13 transmits the second restart signal and a third preset time has elapsed, for each resident program to be observed, the processing module 13 generates a resident program related to the resident program to be observed The second confirmation request is sent to the baseboard management controller 11 through the platform path controller 14. In particular, the third preset time is preset to be the time required to wait for the completion of the restart of the baseboard management controller 11.

在該步驟42中,對於每一第二確認請求,該基板管理控制器11在接收到該第二確認請求後,判定該第二確認請求所對應之待觀察常駐程序是否回復正常運作。當該基板管理控制器11判定出該第二確認請求所對應之待觀察常駐程序回復正常運作,則進行流程步驟43;當該基板管理控制器11判定出該第二確認請求所對應之待觀察常駐程序仍運作異常,則流程結束。 In this step 42, for each second confirmation request, after receiving the second confirmation request, the baseboard management controller 11 determines whether the resident program to be observed corresponding to the second confirmation request returns to normal operation. When the baseboard management controller 11 determines that the to-be-observed resident program corresponding to the second confirmation request is back to normal operation, proceed to step 43; when the baseboard management controller 11 determines that the to-be-observed corresponding to the second confirmation request If the resident program still operates abnormally, the process ends.

在該步驟43中,該基板管理控制器11產生一相關於該步驟42中之該有回應之待觀察常駐程序的第二確認回覆並透過該平台路徑控制器14傳送至該處理模組13。 In the step 43, the baseboard management controller 11 generates a second confirmation response related to the responsive to-be-observed resident program in the step 42 and transmits it to the processing module 13 through the platform path controller 14.

在該步驟44中,該處理模組13判定在傳送該第二確認請求後的一第四預設時間內是否有接收到該基板管理控制器11所回傳之回應於每一第二確認請求的一第二確認回覆。當該處理模組13 判定出在該第四預設時間內有接收到該基板管理控制器11所回傳對應每一第二確認請求的該第二確認回覆,則流程結束;當該處理模組13判定出在該第四預設時間內沒有接收到該基板管理控制器11所回傳之對應每一第二確認請求的該第二確認回覆,則進行流程步驟45。 In this step 44, the processing module 13 determines whether a response from the baseboard management controller 11 is received within a fourth preset time after the second confirmation request is sent for each second confirmation request A second confirmation reply from. When the processing module 13 It is determined that the second confirmation response corresponding to each second confirmation request from the baseboard management controller 11 is received within the fourth preset time, and the process ends; when the processing module 13 determines that the If the second confirmation reply corresponding to each second confirmation request returned by the baseboard management controller 11 is not received within the fourth preset time, the process step 45 is performed.

在該步驟45中,該處理模組13將每一不存在所對應之第二確認回覆的第二確認請求所相關的待觀察常駐程序作為一待紀錄常駐程序,並產生指示出每一待紀錄常駐程序運作異常的該第一異常紀錄(Logfile)。其中,每一不存在所對應之第二確認回覆的第二確認請求,係在該第四預設時間內沒有接收到該基板管理控制器11所回傳的該第二確認回覆所對應之第二確認請求。 In this step 45, the processing module 13 regards the to-be-observed resident program related to the second confirmation request corresponding to the second confirmation reply that does not exist as a to-be-recorded resident program, and generates an instruction for each to-be-recorded The first exception record (Logfile) of the abnormal operation of the resident program. Wherein, for each second confirmation request that does not have a corresponding second confirmation reply, the first confirmation reply corresponding to the second confirmation reply returned by the baseboard management controller 11 is not received within the fourth preset time 2. Confirm the request.

參閱圖6,該溝通介面重啟程序係於無法接收相關於該常駐程序執行表的資料,則重啟該溝通介面,並包含一步驟512,以及一步驟52。 Referring to FIG. 6, the communication interface restart procedure is to restart the communication interface because it cannot receive data related to the resident program execution table, and includes a step 512 and a step 52.

在該步驟51中,該處理模組13觸發該平台路徑控制器14產生指示出重啟該溝通介面的該第一重啟訊號,並經由該第一條通用型之輸入輸出接腳傳送至該基板管理控制器11。其中,該處理模組13是藉由產生並傳送一指示出重啟該溝通介面的第一重啟命令至該平台路徑控制器14,以觸發該平台路徑控制器14產生並傳送指示出重啟該溝通介面的該第一重啟訊號。此外,該處理模組13 也可以直接將指示出重啟該溝通介面的一參數寫入該平台路徑控制器14的一指示出重啟該溝通介面的暫存器,以觸發該平台路徑控制器14產生並傳送指示出重啟該溝通介面的該第一重啟訊號。 In step 51, the processing module 13 triggers the platform path controller 14 to generate the first restart signal indicating restart of the communication interface, and transmits it to the substrate management via the first universal input/output pin Controller 11. Wherein, the processing module 13 generates and transmits a first restart command indicating to restart the communication interface to the platform path controller 14, so as to trigger the platform path controller 14 to generate and transmit instructions to restart the communication interface Of the first restart signal. In addition, the processing module 13 It is also possible to directly write a parameter indicating restart of the communication interface into a register of the platform path controller 14 indicating restart of the communication interface to trigger the platform path controller 14 to generate and send instructions to restart the communication The first restart signal of the interface.

在該步驟52中,該基板管理控制器11在接收到該第一重啟訊號,重新啟動該溝通介面。 In step 52, the baseboard management controller 11 restarts the communication interface after receiving the first restart signal.

參閱圖7,該第二基板管理控制器重啟程序係在執行該溝通介面重啟程序後,還無法接收相關於該常駐程序執行表的資料,則重啟該基板管理控制器11,並包含一步驟61、一步驟62、一步驟63,以及一步驟64。 Referring to FIG. 7, the second BMC restart procedure is after the communication interface restart procedure is executed, and the data related to the resident program execution table cannot be received, then the BMC 11 is restarted, and includes a step 61 , A step 62, a step 63, and a step 64.

在該步驟61中,在該處理模組13觸發該平台路徑控制器14產生並傳送該第一重啟訊號且經過一第五預設時間後,該處理模組13再次產生另一相關於該常駐程序執行表的讀取請求並透過該平台路徑控制器14傳送至該基板管理控制器11。特別地,該第五預設時間係為預設等待該溝通介面重啟完成所需的時間。 In step 61, after the processing module 13 triggers the platform path controller 14 to generate and transmit the first restart signal and a fifth preset time has elapsed, the processing module 13 generates another signal related to the resident The read request of the program execution table is transmitted to the baseboard management controller 11 through the platform path controller 14. In particular, the fifth preset time is a preset time required to wait for the completion of the restart of the communication interface.

在該步驟62中,該處理模組13判定有無接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料。當該處理模組13判定出有接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,則回到流程步驟21;當該處理模組13判定出沒有接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,則進行流程步驟63。 In this step 62, the processing module 13 determines whether it has received the data related to the resident program execution table sent by the baseboard management controller 11. When the processing module 13 determines that it has received the data related to the resident program execution table sent by the substrate management controller 11, it returns to the process step 21; when the processing module 13 determines that the substrate management has not been received The data related to the resident program execution table sent by the controller 11 proceeds to step 63 of the process.

在該步驟63中,該處理模組13觸發該平台路徑控制器14產生指示出重啟該基板管理控制器11的該第二重啟訊號,並經由該第二條通用型之輸入輸出接腳傳送至該基板管理控制器11。其中,該處理模組13是藉由產生並傳送一指示出重啟該基板管理控制器11的第二重啟訊號至該平台路徑控制器14,以觸發該平台路徑控制器14產生並傳送指示出重啟該基板管理控制器11的該第三重啟訊號。此外,該處理模組13也可以直接將指示出重啟該基板管理控制器11的一參數寫入該平台路徑控制器14的指示出重啟該基板管理控制器11的該暫存器,以觸發該平台路徑控制器14產生並傳送指示出重啟該基板管理控制器11的該第二重啟訊號。 In step 63, the processing module 13 triggers the platform path controller 14 to generate the second restart signal indicating to restart the baseboard management controller 11, and transmits it to the second universal input/output pin The baseboard management controller 11. The processing module 13 generates and transmits a second restart signal indicating to restart the baseboard management controller 11 to the platform path controller 14 to trigger the platform path controller 14 to generate and transmit the instruction to restart The third restart signal of the baseboard management controller 11. In addition, the processing module 13 can also directly write a parameter indicating restarting the baseboard management controller 11 into the register of the platform path controller 14 indicating restarting the baseboard management controller 11 to trigger the The platform path controller 14 generates and transmits the second restart signal indicating to restart the baseboard management controller 11.

在該步驟64中,該基板管理控制器11在接收到該第二重啟訊號後,重新啟動該基板管理控制器11。 In this step 64, the baseboard management controller 11 restarts the baseboard management controller 11 after receiving the second restart signal.

參閱圖8,該第二異常記錄產生程序係在執行該第二基板管理控制器重啟程序後,還無法接收該常駐程序執行表時,產生相關於該溝通介面的一第二異常記錄,並包含一步驟71、一步驟72,以及一步驟73。 Referring to FIG. 8, the second abnormal record generating program generates a second abnormal record related to the communication interface when the resident program execution table cannot be received after the second baseboard management controller restart program is executed, and includes One step 71, one step 72, and one step 73.

在該步驟71中,在該處理模組13傳送該第二重啟訊號且經過一第六預設時間後,該處理模組13再次產生另一相關於該常駐程序執行表的讀取請求並透過該平台路徑控制器14傳送至該基板管理控制器11。特別地,該第六預設時間係為預設等待該基板管理 控制器11重啟完成所需的時間。 In step 71, after the processing module 13 transmits the second restart signal and a sixth preset time has elapsed, the processing module 13 again generates another read request related to the resident program execution table and transmits The platform path controller 14 is transmitted to the baseboard management controller 11. In particular, the sixth preset time is a preset waiting for the substrate management The time required for the controller 11 to restart.

在該步驟72中,該處理模組13判定有無接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料。當該處理模組13判定出有接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,則回到流程步驟21;當該處理模組13判定出沒有接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,也就是判斷是否有在該傳送週期內接收到該基板管理控制器11傳送的相關於該常駐程序執行表的資料,則進行流程步驟73。 In this step 72, the processing module 13 determines whether it has received the data related to the resident program execution table sent by the baseboard management controller 11. When the processing module 13 determines that it has received the data related to the resident program execution table sent by the substrate management controller 11, it returns to the process step 21; when the processing module 13 determines that the substrate management has not been received The data related to the resident program execution table sent by the controller 11, that is, to determine whether the data related to the resident program execution table sent by the baseboard management controller 11 is received in the transmission period, then proceed to step 73 .

在該步驟73中,該處理模組13產生相關於該溝通介面運作異常的該第二異常紀錄。 In the step 73, the processing module 13 generates the second abnormal record related to the abnormal operation of the communication interface.

綜上所述,本發明用於監控基板管理控制器之常駐程序的方法,藉由該處理模組13於電腦裝置1開機階段及開機完成後其中至少一者,週期性地讀取相關於該常駐程序執行表的資料,以檢測每一常駐程序之運作狀態是否為運作正常,並當檢測出存在該至少一處於運作異常的運作異常的常駐程序時,則先將該至少一運作異常的常駐程序重新啟動,並再次檢測該至少一運作異常的常駐程序是否回復運作正常之運作狀態,當再次檢測存在仍處於運作異常的該至少一待觀察常駐程序,才進行重新啟動該基板管理控制器11,透過該用於監控基板管理控制器之常駐程序的方法有效地達成當檢測到該至少一運作異常的常駐程序是單純的僅因為該至少一 運作異常的常駐程序本身的問題而不是該基板管理控制器11本身運作異常所造成該常駐程序無法處於運作正常的運作狀態,則無需直接將該基板管理控制器11整個重新啟動,便可使該至少一運作異常的常駐程序自動回復運作正常的運作狀態,此外,還可自動確認是否為該處理模組13無法經由該溝通介面與該基板管理控制器11進行溝通造成無法該基板管理控制器11的監控常駐程序運作狀態,並於確認後,自動透過該平台路徑控制器14將該溝通介面重新啟動,使得該處理模組13可繼續監控該基板管理控制器11的常駐程序。因此,確實能達成本發明的目的。 In summary, the method of the present invention for monitoring the resident program of a baseboard management controller uses the processing module 13 to periodically read information related to the computer device 1 during the boot phase and after the boot is completed. The data of the resident program execution table is used to detect whether the operating status of each resident program is operating normally, and when the at least one abnormally operating resident program is detected, first the at least one abnormally operating resident program The program is restarted, and it is checked again whether the at least one resident program with abnormal operation returns to a normal operating state, and when the at least one resident program still in abnormal operation is detected again, the baseboard management controller 11 is restarted , Through the method for monitoring the resident program of the baseboard management controller, it is effectively achieved that when the at least one abnormal operation of the resident program is detected simply because the at least one resident program The abnormal operation of the resident program itself is not caused by the abnormal operation of the baseboard management controller 11 itself. The resident program cannot be in a normal operating state, so there is no need to directly restart the baseboard management controller 11 to make the At least one resident program that operates abnormally automatically returns to a normal operating state. In addition, it can also automatically determine whether the processing module 13 cannot communicate with the baseboard management controller 11 through the communication interface, causing the baseboard management controller 11 to fail. The operating status of the resident program is monitored, and after confirmation, the communication interface is automatically restarted through the platform path controller 14 so that the processing module 13 can continue to monitor the resident program of the baseboard management controller 11. Therefore, the purpose of the invention can indeed be achieved.

惟以上所述者,僅為本發明的實施例而已,當不能以此限定本發明實施的範圍,凡是依本發明申請專利範圍及專利說明書內容所作的簡單的等效變化與修飾,皆仍屬本發明專利涵蓋的範圍內。 However, the above are only examples of the present invention. When the scope of implementation of the present invention cannot be limited by this, all simple equivalent changes and modifications made in accordance with the scope of the patent application of the present invention and the content of the patent specification still belong to Within the scope of the patent for the present invention.

21~23:步驟 21~23: Steps

Claims (9)

一種用於監控基板管理控制器之常駐程序的方法,藉由一電腦裝置來實施,該電腦裝置包含一處理模組及一電連接該處理模組的基板管理控制器,該基板管理控制器用於執行多個常駐程序,該用於監控基板管理控制器之常駐程序的方法包含以下步驟:(A1)藉由該基板管理控制器,產生一常駐程序執行表,該常駐程序執行表包含多個對應該等常駐程序的程序代碼,以及每一常駐程序所對應的一存活值,每一存活值在所對應之常駐程序被該基板管理控制器正常執行時會被更新;(A2)藉由該處理模組,讀取該常駐程序執行表中該等常駐程序所對應之存活值,並根據前次所讀取到之該等常駐程序所對應之存活值與當前所讀取到之該等常駐程序所對應之存活值判定該等常駐程序中是否存在至少一運作異常的常駐程序,其中每一運作異常的常駐程序在前次被讀取到之存活值相同於其在當前被讀取到之存活值;及(A3)當該處理模組判定出存在該至少一運作異常的常駐程序時,藉由該處理模組,通知該基板管理控制器,以致該基板管理控制器重新啟動每一運作異常的常駐程序。 A method for monitoring the resident program of a substrate management controller is implemented by a computer device. The computer device includes a processing module and a substrate management controller electrically connected to the processing module. The substrate management controller is used for A plurality of resident programs are executed. The method for monitoring the resident programs of a baseboard management controller includes the following steps: (A1) A resident program execution table is generated by the baseboard management controller, and the resident program execution table includes a plurality of pairs It should wait for the program code of the resident program and a survival value corresponding to each resident program. Each survival value will be updated when the corresponding resident program is normally executed by the baseboard management controller; (A2) by this process The module reads the survival values corresponding to the resident programs in the resident program execution table, and based on the survival values corresponding to the resident programs read last time and the resident programs currently read The corresponding survival value determines whether there is at least one abnormally functioning resident program in the resident programs, wherein the survival value of each abnormally functioning resident program in the previous read is the same as its current survival value. Value; and (A3) when the processing module determines that there is at least one resident program with an abnormal operation, the processing module informs the baseboard management controller, so that the baseboard management controller restarts each abnormal operation Resident program. 如請求項1所述的用於監控基板管理控制器之常駐程序的方法,該電腦裝置還包含一電連接該處理模組及該基板管 理控制器的平台路徑控制器,該處理模組係經由該平台路徑控制器與該基板管理控制器電連接,該處理模組還透過一溝通介面與該基板管理控制器進行訊號傳輸,其中,在該步驟(A2)之前,還包含以下步驟:(B1)藉由該處理模組,產生一相關於該常駐程序執行表的讀取請求並透過該平台路徑控制器傳送至該基板管理控制器;(B2)藉由該處理模組,判定有無接收到該基板管理控制器中的相關於該常駐程序執行表的資料;及(B3)當該處理模組判定出有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,進行該步驟(A2)。 The method for monitoring the resident program of a substrate management controller according to claim 1, wherein the computer device further includes an electrical connection between the processing module and the substrate tube The platform path controller of the management controller, the processing module is electrically connected to the baseboard management controller via the platform path controller, and the processing module also performs signal transmission with the baseboard management controller through a communication interface, wherein, Before this step (A2), it also includes the following steps: (B1) through the processing module, generate a read request related to the resident program execution table and send it to the baseboard management controller through the platform path controller ; (B2) by the processing module, determine whether the baseboard management controller has received the data related to the resident program execution table; and (B3) when the processing module determines that the baseboard management control has been received When the data related to the resident program execution table in the device, perform this step (A2). 如請求項2所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(B2)之後,還包含以下步驟:(C1)當該處理模組判定出沒有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,藉由該處理模組,觸發該平台路徑控制器產生並傳送一指示出重啟該溝通介面的第一重啟訊號至該基板管理控制器;及(C2)藉由該基板管理控制器,在接收到該第一重啟訊號後,重新啟動該溝通介面。 The method for monitoring the resident program of a baseboard management controller according to claim 2, wherein, after this step (B2), it further includes the following steps: (C1) when the processing module determines that the substrate has not been received When managing data related to the resident program execution table in the controller, the processing module triggers the platform path controller to generate and send a first restart signal indicating restarting the communication interface to the baseboard management controller ; And (C2) through the baseboard management controller, after receiving the first restart signal, restart the communication interface. 如請求項3所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(C2)之後,還包含以下步驟:(D1)藉由該處理模組,再次產生另一相關於該常駐 程序執行表的讀取請求並透過該平台路徑控制器傳送至該基板管理控制器;(D2)藉由該處理模組,判定有無接收到該基板管理控制器中的相關於該常駐程序執行表的資料;及(D3)當該處理模組判定出有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,進行該步驟(A2)。 The method for monitoring the resident program of a baseboard management controller according to claim 3, wherein, after the step (C2), the method further includes the following steps: (D1) using the processing module to generate another related Resident in this The read request of the program execution table is transmitted to the baseboard management controller through the platform path controller; (D2) through the processing module, it is determined whether the execution table related to the resident program in the baseboard management controller is received (D3) When the processing module determines that it has received the data related to the resident program execution table in the baseboard management controller, perform this step (A2). 如請求項4所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(D2)之後,還包含以下步驟:(E1)當該處理模組再次判定出沒有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,藉由該處理模組,觸發該平台路徑控制器產生並傳送一指示出重啟該基板管理控制器的第二重啟訊號至該基板管理控制器;及(E2)藉由該基板管理控制器,在接收到該第二重啟訊號後,重新啟動該基板管理控制器。 The method for monitoring the resident program of a baseboard management controller as described in claim 4, wherein after this step (D2), it further includes the following steps: (E1) when the processing module again determines that it has not received the When the data in the baseboard management controller is related to the resident program execution table, the processing module triggers the platform path controller to generate and transmit a second restart signal indicating to restart the baseboard management controller to the substrate Management controller; and (E2) through the baseboard management controller, after receiving the second restart signal, restart the baseboard management controller. 如請求項5所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(E2)之後,還包含以下步驟:(F1)藉由該處理模組,再次產生另一相關於該常駐程序執行表的讀取請求並透過該平台路徑控制器傳送至該基板管理控制器;(F2)藉由該處理模組,判定有無接收到該基板管理控制器中的相關於該常駐程序執行表的資料; (F3)當該處理模組判定出有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,進行該步驟(A2);及(F4)當該處理模組再次判定出沒有接收到該基板管理控制器中的相關於該常駐程序執行表的資料時,藉由該處理模組,產生一相關於該溝通介面運作異常的第二異常紀錄。 The method for monitoring the resident program of a baseboard management controller according to claim 5, wherein after this step (E2), it further includes the following steps: (F1) using the processing module to generate another related The read request in the resident program execution table is transmitted to the baseboard management controller through the platform path controller; (F2) through the processing module, it is determined whether the baseboard management controller has received the relevant Data of the program execution table; (F3) When the processing module determines that it has received the data related to the resident program execution table in the baseboard management controller, perform this step (A2); and (F4) when the processing module determines again When the data related to the resident program execution table in the baseboard management controller is not received, the processing module generates a second abnormal record related to the abnormal operation of the communication interface. 如請求項1所述的用於監控基板管理控制器之常駐程序的方法,該電腦裝置還包含一電連接該處理模組及該基板管理控制器的平台路徑控制器,該處理模組係經由該平台路徑控制器與該基板管理控制器電連接,其中,該步驟(A3)還包含以下步驟:(A31)當該處理模組判定出存在該至少一運作異常的常駐程序時,對於每一運作異常的常駐程序,藉由該處理模組,產生一指示出該運作異常的常駐程序運作異常的通知訊息並透過該平台路徑控制器傳送至該基板管理控制器;及(A32)對於每一通知訊息,藉由該基板管理控制器,在接收到該通知訊息後,重新啟動該通知訊息所對應的運作異常的常駐程序。 According to claim 1, the method for monitoring the resident program of a baseboard management controller, the computer device further includes a platform path controller electrically connected to the processing module and the baseboard management controller, and the processing module passes through The platform path controller is electrically connected to the baseboard management controller, wherein the step (A3) further includes the following steps: (A31) when the processing module determines that the at least one abnormal resident program exists, for each The resident program of abnormal operation, through the processing module, generates a notification message indicating the abnormal operation of the resident program, and transmits it to the baseboard management controller through the platform path controller; and (A32) for each The notification message is used by the baseboard management controller to restart the resident program of the abnormal operation corresponding to the notification message after receiving the notification message. 如請求項7所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(A32)之後,還包含以下步驟:(G1)對於每一運作異常的常駐程序,藉由該處理模組,產生一相關於該運作異常的常駐程序的第一確認請求 並透過該平台路徑控制器傳送至該基板管理控制器;(G2)藉由該處理模組,判定在一第二預設時間內是否有接收到該基板管理控制器回應於每一第一確認請求的一第一確認回覆;及(G3)當該處理模組判定出在該第二預設時間內沒有接收到該基板管理控制器所回傳之回應於每一第一確認請求的該第一確認回覆時,藉由該處理模組,觸發該平台路徑控制器產生並傳送一指示出重啟該基板管理控制器的第二重啟訊號至該基板管理控制器,並將每一不存在所對應之第一確認回覆的第一確認請求所相關的運作異常的常駐程序作為一待觀察常駐程序;及(G4)藉由該基板管理控制器,在接收到該第二重啟訊號後,重新啟動該基板管理控制器。 The method for monitoring the resident program of a baseboard management controller according to claim 7, wherein, after this step (A32), further includes the following steps: (G1) For each resident program of abnormal operation, by The processing module generates a first confirmation request related to the resident program of the abnormal operation And send to the baseboard management controller through the platform path controller; (G2) through the processing module, determine whether the baseboard management controller responds to each first confirmation within a second preset time Request a first confirmation response; and (G3) when the processing module determines that the response returned by the baseboard management controller is not received within the second preset time for the first confirmation request Upon a confirmation reply, the processing module is used to trigger the platform path controller to generate and transmit a second restart signal indicating to restart the baseboard management controller to the baseboard management controller, and each non-existence corresponds to The abnormal resident program related to the first confirmation request of the first confirmation reply is used as a resident program to be observed; and (G4) through the baseboard management controller, after receiving the second restart signal, restart the Baseboard management controller. 如請求項8所述的用於監控基板管理控制器之常駐程序的方法,其中,在該步驟(G4)之後,還包含以下步驟:(H1)對於每一待觀察常駐程序,藉由該處理模組,產生一相關於該待觀察常駐程序的第二確認請求並透過該平台路徑控制器傳送至該基板管理控制器;(H2)藉由該處理模組,判定在一第四預設時間內是否有接收到該基板管理控制器所回傳之回應於每一第二確認請求的一第二確認回覆;及(H3)當該處理模組判定出在該第四預設時間內沒有接收到該基板管理控制器所回傳之回應於每一第二確認請求的該第二確認回覆時,藉由該處理模組,將每一不存 在所對應之第二確認回覆的第二確認請求所相關的待觀察常駐程序作為一待紀錄常駐程序,產生一指示出每一待紀錄常駐程序運作異常的第一異常紀錄。 The method for monitoring the resident program of a baseboard management controller according to claim 8, wherein, after this step (G4), it further includes the following steps: (H1) For each resident program to be observed, by the processing The module generates a second confirmation request related to the resident program to be observed and transmits it to the baseboard management controller through the platform path controller; (H2) through the processing module, determines a fourth preset time Whether there is a second confirmation reply in response to each second confirmation request sent by the baseboard management controller; and (H3) when the processing module determines that it has not received the response within the fourth preset time When the response returned by the baseboard management controller is to the second confirmation response of each second confirmation request, through the processing module, each missing The resident program to be observed related to the second confirmation request of the corresponding second confirmation reply is used as a resident program to be recorded, and a first abnormal record indicating the abnormal operation of each resident program to be recorded is generated.
TW108112080A 2019-04-08 2019-04-08 Monitor method for demand of a bmc TWI715005B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW108112080A TWI715005B (en) 2019-04-08 2019-04-08 Monitor method for demand of a bmc

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW108112080A TWI715005B (en) 2019-04-08 2019-04-08 Monitor method for demand of a bmc

Publications (2)

Publication Number Publication Date
TW202038093A TW202038093A (en) 2020-10-16
TWI715005B true TWI715005B (en) 2021-01-01

Family

ID=74091042

Family Applications (1)

Application Number Title Priority Date Filing Date
TW108112080A TWI715005B (en) 2019-04-08 2019-04-08 Monitor method for demand of a bmc

Country Status (1)

Country Link
TW (1) TWI715005B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201118561A (en) * 2009-11-30 2011-06-01 Inventec Corp A event management system an a method of the server therefore
CN105579973A (en) * 2014-01-10 2016-05-11 株式会社日立制作所 Redundant system and method for managing redundant system
US20170116103A1 (en) * 2015-03-09 2017-04-27 Vapor IO Inc. Data center management via out-of-band, low-pin count, external access to local motherboard monitoring and control
TWI618380B (en) * 2015-10-14 2018-03-11 廣達電腦股份有限公司 Management methods, service controller devices and non-stransitory, computer-readable media

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201118561A (en) * 2009-11-30 2011-06-01 Inventec Corp A event management system an a method of the server therefore
CN105579973A (en) * 2014-01-10 2016-05-11 株式会社日立制作所 Redundant system and method for managing redundant system
US20170116103A1 (en) * 2015-03-09 2017-04-27 Vapor IO Inc. Data center management via out-of-band, low-pin count, external access to local motherboard monitoring and control
TWI618380B (en) * 2015-10-14 2018-03-11 廣達電腦股份有限公司 Management methods, service controller devices and non-stransitory, computer-readable media

Also Published As

Publication number Publication date
TW202038093A (en) 2020-10-16

Similar Documents

Publication Publication Date Title
US10055296B2 (en) System and method for selective BIOS restoration
JP6530774B2 (en) Hardware failure recovery system
WO2022198972A1 (en) Method, system and apparatus for fault positioning in starting process of server
TWI754317B (en) Method and system for optimal boot path for a network device
JP6034990B2 (en) Server control method and server control apparatus
US9846616B2 (en) Boot recovery system
US7953831B2 (en) Method for setting up failure recovery environment
US20100162045A1 (en) Method, apparatus and system for restarting an emulated mainframe iop
US11526411B2 (en) System and method for improving detection and capture of a host system catastrophic failure
WO2018095107A1 (en) Bios program abnormal processing method and apparatus
TWI261748B (en) Policy-based response to system errors occurring during OS runtime
WO2021057795A1 (en) System starting method and apparatus, node device and computer-readable storage medium
TWI739127B (en) Method, system, and server for providing the system data
TWI518680B (en) Method for maintaining file system of computer system
TW201734779A (en) Boot status notification method and server system using the same
TWI715005B (en) Monitor method for demand of a bmc
WO2017072904A1 (en) Computer system and failure detection method
TWI554876B (en) Method for processing node replacement and server system using the same
CN112084049A (en) Method for monitoring resident program of baseboard management controller
CN107450894B (en) Method for informing startup phase and server system
TWI777664B (en) Booting method of embedded system
JP7389877B2 (en) Network optimal boot path method and system
TWI726434B (en) Control method for solving abnormal operation of me
TWI840907B (en) Computer system and method for detecting deviations, and non-transitory computer readable medium
TWI298137B (en)

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees