TWI668567B - Server and method for restoring a baseboard management controller automatically - Google Patents

Server and method for restoring a baseboard management controller automatically

Info

Publication number
TWI668567B
Authority
TW
Taiwan
Prior art keywords
management controller
control
chipset
determines
control chipset
Prior art date
Application number
TW107112539A
Other languages
Chinese (zh)
Other versions
TW201944239A (en)
Inventor
丁偉雄
Original Assignee
神雲科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 神雲科技股份有限公司 filed Critical 神雲科技股份有限公司
Priority to TW107112539A priority Critical patent/TWI668567B/en
Application granted granted Critical
Publication of TWI668567B publication Critical patent/TWI668567B/en
Publication of TW201944239A publication Critical patent/TW201944239A/en

Landscapes

  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

A server includes a baseboard management controller, a memory module, and a control chipset. In response to execution of a basic input/output system (BIOS) program, the control chipset performs a power-on self-test (POST). During the POST, the control chipset transmits an inquiry command to the baseboard management controller and determines whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command. When the control chipset determines that no reply command has been received, it increments a count value by one and determines whether the count value is greater than a preset value. When the control chipset determines that the count value is greater than the preset value, it causes a first image file stored in the baseboard management controller to be updated to a second image file.

Description

Server and method for automatically repairing a baseboard management controller

The present invention relates to an automatic repair method, and in particular to a server and a method for automatically repairing a baseboard management controller.

An existing baseboard management controller (BMC) is used in a server and supports the Intelligent Platform Management Interface (IPMI) industry standard. It monitors the status of hardware devices on the server motherboard, such as ambient temperature, fan speed, and power supply conditions. However, if the baseboard management controller becomes abnormal, for example because of a system crash or a hardware problem, it can no longer monitor the hardware devices on the server motherboard.

In the prior art, a control chipset can execute a basic input/output system (BIOS) program to detect whether the baseboard management controller is abnormal and, upon detecting an abnormality, transmit a reset command to the baseboard management controller to reset it, thereby achieving automatic repair. In some cases, however, resetting the baseboard management controller still does not restore normal operation. Personnel must then be dispatched to service the baseboard management controller, which costs labor and time.

Accordingly, an object of the present invention is to provide a method for automatically repairing a baseboard management controller that reduces the labor cost and time required to service the baseboard management controller.

The method for automatically repairing a baseboard management controller of the present invention is implemented by a control chipset executing a basic input/output system program. The control chipset is electrically connected to a baseboard management controller that stores a first image file, and to a memory module that stores a second image file associated with the baseboard management controller. The method comprises the following steps:

(A) the control chipset performs a power-on self-test;

(B) the control chipset transmits an inquiry command to the baseboard management controller;

(C) the control chipset determines whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command;

(D) when the control chipset determines that no reply command has been received, the control chipset increments a count value by one and determines whether the count value is greater than a preset value; and

(E) when the control chipset determines that the count value is greater than the preset value, the control chipset causes the first image file stored in the baseboard management controller to be updated to the second image file.

Another object of the present invention is to provide a server that reduces the labor cost and time required to service a baseboard management controller.

The server of the present invention comprises a baseboard management controller, a memory module, and a control chipset electrically connected to the baseboard management controller and the memory module.

The baseboard management controller stores a first image file.

The memory module stores a second image file associated with the baseboard management controller.

In response to execution of a basic input/output system program, the control chipset performs a power-on self-test. During the power-on self-test, it transmits an inquiry command to the baseboard management controller and determines whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command. When the control chipset determines that no reply command has been received, it increments a count value by one and determines whether the count value is greater than a preset value. When the control chipset determines that the count value is greater than the preset value, it causes the first image file stored in the baseboard management controller to be updated to the second image file.

The effect of the present invention is as follows. The control chipset executes the basic input/output system program to determine whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command. When the control chipset determines that no reply command has been received and that the count value is greater than the preset value, resetting the baseboard management controller has failed to restore normal operation. The control chipset then updates the first image file stored in the baseboard management controller to the second image file. This achieves automatic repair, saves labor, and shortens repair time.

Referring to Figure 1, an embodiment of the server of the present invention comprises a baseboard management controller (BMC) 1, a first memory module 23, a second memory module 24, and a control chipset 3 electrically connected to the baseboard management controller 1, the first memory module 23, and the second memory module 24.

The baseboard management controller 1 stores a first image file 11. The first image file 11 is a program that carries out the functions of the baseboard management controller 1, including monitoring the status of hardware devices on the server motherboard, such as ambient temperature, fan speed, and power supply conditions. When the baseboard management controller 1 receives a reset command from the control chipset 3, the baseboard management controller 1 restarts.

The first memory module 23 stores a basic input/output system (BIOS) program 21, and the second memory module 24 stores a second image file 22 associated with the baseboard management controller 1. In this embodiment, the first memory module 23 is, for example, a read-only memory (ROM), and the second memory module 24 may be an external storage device such as a USB memory device, an M.2 hard disk (M.2 HDD), or a PXE server, but is not limited to these.

The control chipset 3 comprises a Platform Controller Hub (PCH) 31 and a central processing unit (CPU) 32. After the server is powered on, the control chipset 3 executes the BIOS program 21 to start the server hardware and peripheral devices and to perform a power-on self-test (POST).

Referring to Figures 1 and 2, the interaction among the baseboard management controller 1, the first memory module 23, the second memory module 24, and the control chipset 3 is described below in conjunction with an embodiment of the method for automatically repairing the baseboard management controller 1 of the present invention.

In step 201, in response to execution of the BIOS program 21, the control chipset 3 performs a power-on self-test (POST).

In step 202, in response to execution of the BIOS program 21, the control chipset 3 transmits an inquiry command to the baseboard management controller 1.

In step 203, in response to execution of the BIOS program 21, the control chipset 3 determines whether it has received a reply command that comes from the baseboard management controller 1 and responds to the inquiry command. When the control chipset 3 determines that no reply command has been received, the flow proceeds to step 204; when the control chipset 3 determines that the reply command has been received, the flow proceeds to step 209.

In step 204, in response to execution of the BIOS program 21, the control chipset 3 increments a count value by one and determines whether the count value is greater than a preset value. When the control chipset 3 determines that the count value is greater than the preset value, the flow proceeds to step 205; when the control chipset 3 determines that the count value is less than or equal to the preset value, the flow proceeds to step 207. In practice, the count value may be implemented, for example, as a parameter included in the BIOS program 21, with an initial value of 0.
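The decision made in step 204 can be sketched as a small pure function. This is only an illustration under stated assumptions: the names `Action` and `on_missing_reply` are hypothetical and do not come from the patent, and real firmware would keep the counter in a BIOS parameter rather than pass it around.

```python
from enum import Enum
from typing import Tuple


class Action(Enum):
    RESET_BMC = "reset"      # continue with step 207
    REFLASH_BMC = "reflash"  # continue with step 205


def on_missing_reply(count: int, preset: int) -> Tuple[int, Action]:
    """Step 204: add one to the count, then compare it with the preset value."""
    count += 1
    if count > preset:
        return count, Action.REFLASH_BMC
    return count, Action.RESET_BMC


# Usage: with a preset value of 2, the first two misses lead to a reset
# and the third miss leads to an image update.
count = 0
history = []
for _ in range(3):
    count, action = on_missing_reply(count, preset=2)
    history.append(action)
```

Because the comparison is "greater than" and the counter starts at 0, a preset value of N allows N reset attempts before the image update is triggered.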

In step 205, in response to execution of the BIOS program 21, the control chipset 3 initializes the count value (resets it to zero) and causes the first image file 11 stored in the baseboard management controller 1 to be updated to the second image file 22. A count value greater than the preset value indicates that the baseboard management controller 1 has been reset multiple times without returning to normal operation, so the control chipset 3 causes the baseboard management controller 1 to automatically update the image file it runs, thereby achieving automatic repair. The control chipset 3 updates the image file of the baseboard management controller 1 through a baseboard management controller software tool (BMC tool). In this embodiment, the server may include other external storage devices in addition to the second memory module 24. When the control chipset 3 determines that the count value is greater than the preset value, it first searches all external storage devices for the second memory module 24 that stores the second image file 22, and only then causes the first image file 11 stored in the baseboard management controller 1 to be updated to the second image file 22 stored in the second memory module 24 thus found.
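The search described in step 205 can be sketched as a scan over the attached storage devices. This is a minimal sketch under assumptions: the device names, the file name `bmc_second.img`, and the function `find_device_with_image` are all hypothetical, and the patent does not say what happens when no device holds the image, so returning `None` here is only one possible choice.

```python
from typing import Mapping, Optional, Set


def find_device_with_image(
    devices: Mapping[str, Set[str]], image_name: str
) -> Optional[str]:
    """Return the first device whose file listing contains image_name."""
    for device, files in devices.items():
        if image_name in files:
            return device
    return None  # assumption: the patent leaves this case unspecified


# Usage: a hypothetical server with a USB device and an M.2 disk attached.
attached = {
    "usb0": {"notes.txt"},
    "m2_0": {"bmc_second.img", "readme.txt"},
}
source = find_device_with_image(attached, "bmc_second.img")
missing = find_device_with_image(attached, "other.img")
```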

Following step 205, in step 206 the baseboard management controller 1 restarts, and the flow returns to step 202. The baseboard management controller 1 restarts once it has finished updating the first image file 11, that is, once the first image file 11 has been updated to the second image file 22.

In step 207, in response to execution of the BIOS program 21, the control chipset 3 transmits a reset command to the baseboard management controller 1, causing the baseboard management controller 1 to restart. The reset command is an IPMI command.
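The patent states only that the reset command is an IPMI command and does not name a specific one. As a hedged illustration, the IPMI specification defines Cold Reset and Warm Reset commands under the Application network function; the sketch below builds the argument list for an `ipmitool raw` invocation of either one. The helper name `ipmitool_raw_args` is hypothetical, the choice of Cold Reset is an assumption, and a BIOS would normally talk to the BMC over a system interface rather than through `ipmitool`, so this shows the command bytes rather than the firmware path.

```python
from typing import List, Sequence

# Values from the IPMI specification (Application network function).
NETFN_APP = 0x06
CMD_COLD_RESET = 0x02
CMD_WARM_RESET = 0x03


def ipmitool_raw_args(netfn: int, cmd: int, data: Sequence[int] = ()) -> List[str]:
    """Build (but do not run) an `ipmitool raw <netfn> <cmd> [data...]` command line."""
    return ["ipmitool", "raw", f"0x{netfn:02x}", f"0x{cmd:02x}"] + [
        f"0x{byte:02x}" for byte in data
    ]


# Usage: the argument list for a BMC cold reset (assumed choice of command).
cold_reset = ipmitool_raw_args(NETFN_APP, CMD_COLD_RESET)
```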

Following step 207, in step 208 the baseboard management controller 1 restarts, and the flow returns to step 202.

In step 209, in response to execution of the BIOS program 21, the control chipset 3 counts a preset time and determines whether the power-on self-test has been completed. When the control chipset 3 determines that the power-on self-test has not yet been completed, the flow returns to step 202. In other words, during the power-on self-test the control chipset 3 reissues the inquiry command periodically, once every preset time, to confirm that the baseboard management controller 1 is still operating normally (that is, still alive). When the control chipset 3 determines that the power-on self-test has been completed, the flow proceeds to step 210.
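The periodic inquiry of steps 202, 203, and 209 can be sketched as a loop with injected callbacks, which keeps the example runnable without hardware. All names here (`poll_bmc_during_post`, `query_bmc`, `post_finished`, `wait`) and the 5-second interval are hypothetical; the patent does not specify the preset time.

```python
from typing import Callable, List


def poll_bmc_during_post(
    query_bmc: Callable[[], bool],
    post_finished: Callable[[], bool],
    wait: Callable[[float], None],
    interval_s: float,
) -> bool:
    """Query the BMC once per interval until POST ends or a reply is missing.

    Returns True if POST finished with every inquiry answered (go to step 210),
    or False as soon as one inquiry goes unanswered (go to step 204).
    """
    while True:
        if not query_bmc():      # steps 202-203: send inquiry, check for reply
            return False
        wait(interval_s)         # step 209: count the preset time
        if post_finished():      # step 209: has POST completed?
            return True


# Usage with stand-in callbacks: the BMC answers three times, POST ends on the third check.
replies = iter([True, True, True])
post_checks = iter([False, False, True])
waits: List[float] = []
healthy = poll_bmc_during_post(
    lambda: next(replies), lambda: next(post_checks), waits.append, 5.0
)
```

Passing `wait` in as a parameter lets a test record the delays instead of sleeping; production code could pass `time.sleep`.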

In step 210, in response to execution of the BIOS program 21, the control chipset 3 initializes the count value and hands control of the system over to the operating system.

Note that steps 201 to 205, step 207, and steps 209 to 210 of this embodiment of the method for automatically repairing the baseboard management controller 1 are all carried out by the control chipset 3 executing the BIOS program 21. In other words, these steps are programmed into the BIOS program 21.
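To see how steps 201 to 210 fit together, the flow can be simulated end to end against a stand-in BMC. This is a sketch under assumptions, not the patented firmware: `FakeBmc` and `run_post_with_bmc_check` are hypothetical names, the fake BMC replies only when its image is good, a reset never fixes it, and an image update always does.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FakeBmc:
    """Hypothetical stand-in for the BMC: it replies only when its image is good."""
    image_ok: bool
    resets: int = 0
    reflashes: int = 0

    def reply_to_query(self) -> bool:
        return self.image_ok

    def reset(self) -> None:
        self.resets += 1          # a reset alone does not repair the image

    def reflash(self) -> None:
        self.reflashes += 1
        self.image_ok = True      # assumption: the second image always works


def run_post_with_bmc_check(bmc: FakeBmc, preset: int, post_intervals: int) -> List[int]:
    """Walk the flow and return the sequence of step numbers visited."""
    trace = [201]
    count = 0
    remaining = post_intervals    # answered inquiries left before POST completes
    while True:
        trace += [202, 203]       # send inquiry, check for reply
        if bmc.reply_to_query():
            trace.append(209)     # wait the preset time, check POST progress
            remaining -= 1
            if remaining <= 0:
                trace.append(210)  # the counter would be cleared here before OS handover
                return trace
            continue
        trace.append(204)
        count += 1
        if count > preset:
            trace.append(205)
            count = 0
            bmc.reflash()
            trace.append(206)     # BMC restarts after the update
        else:
            trace.append(207)
            bmc.reset()
            trace.append(208)     # BMC restarts after the reset


# Usage: a BMC with a bad image, one reset allowed before the image update.
bmc = FakeBmc(image_ok=False)
trace = run_post_with_bmc_check(bmc, preset=1, post_intervals=1)
```

In this run the first miss triggers a reset, the second miss triggers the image update, and the third inquiry is answered, after which the flow reaches step 210.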

In summary, in the method for automatically repairing a baseboard management controller of the present invention, the control chipset 3 reissues the inquiry command periodically during the power-on self-test to confirm that the baseboard management controller 1 is still alive. When no reply command is received from the baseboard management controller 1, the control chipset 3 determines whether the count value is greater than the preset value, which shows whether repeated resets have failed to restore the baseboard management controller 1 to normal operation. When the count value is greater than the preset value, the control chipset 3 causes the first image file 11 stored in the baseboard management controller 1 to be updated to the second image file 22. This achieves automatic repair, saves labor, and shortens repair time, so the object of the present invention is indeed achieved.

The foregoing is merely an embodiment of the present invention and does not limit the scope in which the invention may be practiced. All simple equivalent changes and modifications made in accordance with the claims and the specification of the present invention remain within the scope covered by this patent.

1‧‧‧Baseboard management controller

11‧‧‧First image file

23‧‧‧First memory module

24‧‧‧Second memory module

21‧‧‧Basic input/output system program

22‧‧‧Second image file

3‧‧‧Control chipset

31‧‧‧Platform Controller Hub

32‧‧‧Central processing unit

201~210‧‧‧Steps

Other features and effects of the present invention will be clearly presented in the embodiments described with reference to the drawings, in which: Figure 1 is a block diagram illustrating an embodiment of the server of the present invention; and Figure 2 is a flowchart illustrating an embodiment of the method for automatically repairing a baseboard management controller of the present invention.

Claims (8)

1. A method for automatically repairing a baseboard management controller, implemented by a control chipset executing a basic input/output system program, the control chipset being electrically connected to a baseboard management controller storing a first image file and to a memory module storing a second image file associated with the baseboard management controller, the method comprising the following steps: (A) the control chipset performs a power-on self-test; (B) the control chipset transmits an inquiry command to the baseboard management controller; (C) the control chipset determines whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command; (D) when the control chipset determines that no reply command has been received, the control chipset increments a count value by one and determines whether the count value is greater than a preset value; (E) when the control chipset determines that the count value is less than or equal to the preset value, the control chipset transmits a reset command to the baseboard management controller, causing the baseboard management controller to restart, and steps (B) to (C) are repeated; and (F) when the control chipset determines that the count value is greater than the preset value, the control chipset causes the first image file stored in the baseboard management controller to be updated to the second image file.
2. The method for automatically repairing a baseboard management controller according to claim 1, further comprising, after step (C), the following steps: (G) when the control chipset determines that the reply command has been received, the control chipset counts a preset time and determines whether the power-on self-test has been completed; and (H) when the control chipset determines that the power-on self-test has not yet been completed, the control chipset continues the power-on self-test and steps (B) to (C) are repeated.
3. The method for automatically repairing a baseboard management controller according to claim 2, further comprising, after step (G), the following step: (I) when the control chipset determines that the power-on self-test has been completed, the control chipset initializes the count value.
4. The method for automatically repairing a baseboard management controller according to claim 1, wherein in step (F), when the control chipset determines that the count value is greater than the preset value, the control chipset also initializes the count value.
5. The method for automatically repairing a baseboard management controller according to claim 1, wherein step (F) comprises the following sub-steps: (F-1) when the control chipset determines that the count value is greater than the preset value, the control chipset searches for the memory module that stores the second image file; and (F-2) the control chipset causes the first image file stored in the baseboard management controller to be updated to the second image file stored in the memory module thus found.
6. A server, comprising: a baseboard management controller storing a first image file; a memory module storing a second image file associated with the baseboard management controller; and a control chipset electrically connected to the baseboard management controller and the memory module, wherein the control chipset, in response to execution of a basic input/output system program, performs a power-on self-test and, during the power-on self-test, transmits an inquiry command to the baseboard management controller and determines whether it has received a reply command that comes from the baseboard management controller and responds to the inquiry command; when the control chipset determines that no reply command has been received, the control chipset increments a count value by one and determines whether the count value is greater than a preset value; when the control chipset determines that the count value is less than or equal to the preset value, the control chipset transmits a reset command to the baseboard management controller, causing the baseboard management controller to restart, and the control chipset again transmits another inquiry command to the baseboard management controller and determines whether it has received another reply command that comes from the baseboard management controller and responds to the other inquiry command; and when the control chipset determines that the count value is greater than the preset value, the control chipset causes the first image file stored in the baseboard management controller to be updated to the second image file.
7. The server according to claim 6, wherein when the control chipset determines that the reply command has been received, the control chipset counts a preset time and determines whether the power-on self-test has been completed, and when the control chipset determines that the power-on self-test has not yet been completed, the control chipset continues the power-on self-test, again transmits another inquiry command to the baseboard management controller, and determines whether it has received another reply command that comes from the baseboard management controller and responds to the other inquiry command.
8. The server according to claim 6, wherein when the control chipset determines that the count value is greater than the preset value, the control chipset also initializes the count value.
TW107112539A 2018-04-12 2018-04-12 Server and method for restoring a baseboard management controller automatically TWI668567B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW107112539A TWI668567B (en) 2018-04-12 2018-04-12 Server and method for restoring a baseboard management controller automatically

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW107112539A TWI668567B (en) 2018-04-12 2018-04-12 Server and method for restoring a baseboard management controller automatically

Publications (2)

Publication Number Publication Date
TWI668567B true TWI668567B (en) 2019-08-11
TW201944239A TW201944239A (en) 2019-11-16

Family

ID=68316493

Family Applications (1)

Application Number Title Priority Date Filing Date
TW107112539A TWI668567B (en) 2018-04-12 2018-04-12 Server and method for restoring a baseboard management controller automatically

Country Status (1)

Country Link
TW (1) TWI668567B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI709036B (en) * 2019-10-03 2020-11-01 神雲科技股份有限公司 Method of recovering the bios configuration parameter and server system
TWI742430B (en) * 2019-09-17 2021-10-11 神雲科技股份有限公司 Method of recovering firmware of baseboard management controller automatically
CN113687843A (en) * 2020-05-18 2021-11-23 佛山市顺德区顺达电脑厂有限公司 Method for automatically recovering firmware of baseboard management controller

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11126518B1 (en) 2020-03-16 2021-09-21 Quanta Computer Inc. Method and system for optimal boot path for a network device
TWI760839B (en) * 2020-09-04 2022-04-11 宇瞻科技股份有限公司 Building method of system restore mechanism and system booting and restoring method

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6219828B1 (en) * 1998-09-30 2001-04-17 International Business Machines Corporation Method for using two copies of open firmware for self debug capability
US20090240934A1 (en) * 2008-03-21 2009-09-24 Asustek Computer Inc. Computer system with dual boot-program area and method of booting the same
TW201500911A (en) * 2013-06-26 2015-01-01 Inventec Corp Debug device and debug method
TW201704994A (en) * 2015-07-30 2017-02-01 神雲科技股份有限公司 Technology for updating a server image file
TW201709081A (en) * 2015-08-18 2017-03-01 神雲科技股份有限公司 Automatic image recovery method and server system
TW201715396A (en) * 2015-10-23 2017-05-01 神雲科技股份有限公司 Server and error detecting method thereof
TW201715331A (en) * 2015-10-16 2017-05-01 神雲科技股份有限公司 Server and method for auto repairing a baseboard management controller
TW201717001A (en) * 2015-11-05 2017-05-16 廣達電腦股份有限公司 Unified firmware managment system, non-transitory computer-readable storage medium and method for unified firmware managment

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI742430B (en) * 2019-09-17 2021-10-11 神雲科技股份有限公司 Method of recovering firmware of baseboard management controller automatically
TWI709036B (en) * 2019-10-03 2020-11-01 神雲科技股份有限公司 Method of recovering the bios configuration parameter and server system
CN113687843A (en) * 2020-05-18 2021-11-23 佛山市顺德区顺达电脑厂有限公司 Method for automatically recovering firmware of baseboard management controller
CN113687843B (en) * 2020-05-18 2024-04-19 佛山市顺德区顺达电脑厂有限公司 Method for automatically recovering firmware of baseboard management controller

Also Published As

Publication number Publication date
TW201944239A (en) 2019-11-16

Similar Documents

Publication Publication Date Title
TWI668567B (en) Server and method for restoring a baseboard management controller automatically
TWI571736B (en) Method and system of automatic debug information collection
WO2022198972A1 (en) Method, system and apparatus for fault positioning in starting process of server
US10585755B2 (en) Electronic apparatus and method for restarting a central processing unit (CPU) in response to detecting an abnormality
TWI754317B (en) Method and system for optimal boot path for a network device
TWI632462B (en) Switching device and method for detecting i2c bus
CN104636221B (en) Computer system fault processing method and device
US20090249319A1 (en) Testing method of baseboard management controller
WO2018095107A1 (en) Bios program abnormal processing method and apparatus
TWI576706B (en) Method for early boot phase and the related device
JP5296036B2 (en) DMI redundancy in multiprocessor computer systems
TWI598729B (en) Server and method for auto repairing a baseboard management controller
CN103577298A (en) Baseboard management controller monitoring system and method
CN110471800B (en) Server and method for automatically overhauling substrate management controller
TWI582699B (en) Boot Status Notification Method and Server System Using the Same
TWI553490B (en) Method and system for remote system configuration management and non-transitory computer-readable storage medium
US10824493B2 (en) Disambiguation of error logging during system reset
CN105912414A (en) Method and system for server management
TWI537721B (en) Baseboard management control system and method
TW201430702A (en) Method and system for updating firmware
TW201324115A (en) Computer system and boot managing method of computer system
TWI554876B (en) Method for processing node replacement and server system using the same
TWI715005B (en) Monitor method for demand of a bmc
TWI726434B (en) Control method for solving abnormal operation of me
JP7389877B2 (en) Network optimal boot path method and system

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees