TWI602054B - Method of providing error status data for computer device - Google Patents

Method of providing error status data for computer device Download PDF

Info

Publication number
TWI602054B
TWI602054B TW105110454A TW105110454A TWI602054B TW I602054 B TWI602054 B TW I602054B TW 105110454 A TW105110454 A TW 105110454A TW 105110454 A TW105110454 A TW 105110454A TW I602054 B TWI602054 B TW I602054B
Authority
TW
Taiwan
Prior art keywords
error
status data
error status
control system
management control
Prior art date
Application number
TW105110454A
Other languages
Chinese (zh)
Other versions
TW201737079A (en
Inventor
郭明義
Original Assignee
神雲科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 神雲科技股份有限公司 filed Critical 神雲科技股份有限公司
Priority to TW105110454A priority Critical patent/TWI602054B/en
Application granted granted Critical
Publication of TWI602054B publication Critical patent/TWI602054B/en
Publication of TW201737079A publication Critical patent/TW201737079A/en

Links

Landscapes

  • Debugging And Monitoring (AREA)

Description

用於電腦裝置的錯誤狀態資料提供方法 Error status data providing method for computer device

本發明是有關於電腦裝置的錯誤狀態資料,特別是指一種用於電腦裝置的錯誤狀態資料提供方法。 The present invention relates to error status information of a computer device, and more particularly to a method for providing error status data for a computer device.

目前作為伺服器使用的一電腦裝置通常包括一基板管理控制系統(baseboard management controller system),該基板管理控制系統被用來提供該電腦裝置的錯誤狀態資料,以協助管理者管控此電腦裝置。 A computer device currently used as a server typically includes a baseboard management controller system that is used to provide error status information for the computer device to assist the administrator in managing the computer device.

當此基板管理控制系統接收到來自於該中央處理單元的一錯誤通知,如致命錯誤(CATERR)通知時,基板管理控制系統讀取該電腦裝置的一中央處理單元之內部暫存器中所儲存的該錯誤狀態資料。然而,實際上,由於該電腦裝置在一發生異常如,發生致命錯誤(CATERR)的情況下,就會重新啟動而因此清除了該中央處理單元所保留且對應該異常情況的錯誤狀態資料。值得注意的是,在該電腦裝置重新啟動時不會影響此基板管理控制系統運作。 於是,若該中央處理單元在該電腦裝置在重新啟動後才接收到來自此基板管理控制系統的該錯誤通知,則此基板管理控制系統所讀取並儲存的該中央處理單元之內部暫存器中所儲存的該錯誤狀態資料例如機器檢查架構錯誤狀態(Machine Check Architecture error status)資料並非對應於發生該異常情況時的錯誤狀態資料,而是對應於該電腦裝置重新啟動後之狀態的錯誤狀態資料,因此,該電腦裝置之管理者恐無法根據此基板管理控制系統所提供的錯誤狀態資料正確地分析出該電腦裝置發生異常的原因。 When the substrate management control system receives an error notification from the central processing unit, such as a fatal error (CATERR) notification, the substrate management control system reads the internal temporary storage of a central processing unit of the computer device. The error status information. However, in practice, since the computer device is restarted in the event of an abnormality such as a fatal error (CATERR), the error state data retained by the central processing unit and corresponding to the abnormal condition is cleared. It is worth noting that the operation of the baseboard management control system will not be affected when the computer device is restarted. Therefore, if the central processing unit receives the error notification from the substrate management control system after the computer device is restarted, the internal processing unit of the central processing unit read and stored by the substrate management control system The error status data stored in the machine check structure error status (Machine Check Architecture error status) data does not correspond to the error status data when the abnormal condition occurs, but corresponds to the error status of the state after the computer device is restarted. Therefore, the administrator of the computer device cannot correctly analyze the cause of the abnormality of the computer device based on the error status data provided by the baseboard management control system.

因此,本發明之目的,即在提供一種錯誤狀態資料提供方法。 Accordingly, it is an object of the present invention to provide an error status data providing method.

於是,本發明錯誤狀態資料提供方法,藉由一電腦裝置所包括的一基板管理控制系統來實施,該電腦裝置還包括一電連接該基板管理控制系統的中央處理單元,該錯誤狀態資料提供方法包含以下步驟:(A)讀取並儲存該中央處理單所儲存的錯誤狀態資料;(B)判定該錯誤狀態資料是否含有多個特定錯誤的至少一者;(C)當判定出該錯誤狀態資料不含該至少一特定錯誤時,繼續執行步驟(A);及 (D)當判定出該錯誤狀態資料含有該至少一特定錯誤時,在接收到一來自一使用端的資料請求後,傳送先前於步驟(A)所儲存的該錯誤狀態資料至該使用端。 Therefore, the error state data providing method of the present invention is implemented by a substrate management control system included in a computer device, the computer device further comprising a central processing unit electrically connected to the substrate management control system, and the error state data providing method The method comprises the steps of: (A) reading and storing the error status data stored in the central processing unit; (B) determining whether the error status data contains at least one of a plurality of specific errors; (C) determining the error status. If the data does not contain the at least one specific error, proceed to step (A); and (D) When it is determined that the error status data contains the at least one specific error, after receiving a data request from a user terminal, transmitting the error status data previously stored in step (A) to the user terminal.

本發明之功效在於,藉由該基板管理控制系統讀取並儲存該電腦裝置發生該至少一特定錯誤時的該錯誤狀態資料,且在該錯誤狀態資料含有該至少一特定錯誤的情況下,在接收到來自該使用端的資料請求後,傳送該電腦裝置發生該至少一特定錯誤時的該錯誤狀態資料至該使用端,以避免該電腦裝置發生該至少一特定錯誤時的該錯誤狀態資料在被該基板管理控制系統儲存前即被清除,進而使得管理者可根據該使用端所接收的該錯誤資料分析出該電腦裝置發生錯誤的原因。 The effect of the present invention is that the error management data when the computer device generates the at least one specific error is read and stored by the substrate management control system, and in the case that the error state data contains the at least one specific error, After receiving the data request from the user end, transmitting the error status data of the at least one specific error to the user terminal to the user end, so as to prevent the error status data when the computer device generates the at least one specific error is The substrate management control system is cleared before being stored, so that the administrator can analyze the cause of the error of the computer device according to the error data received by the user terminal.

1‧‧‧電腦裝置 1‧‧‧Computer equipment

11‧‧‧基板管理控制系統 11‧‧‧Base management control system

111‧‧‧非揮發性記憶模組 111‧‧‧Non-volatile memory module

112‧‧‧通訊模組 112‧‧‧Communication module

113‧‧‧處理模組 113‧‧‧Processing module

12‧‧‧中央處理單元 12‧‧‧Central Processing Unit

2‧‧‧使用端 2‧‧‧Use side

100‧‧‧通訊網路 100‧‧‧Communication network

30~34‧‧‧步驟 30~34‧‧‧Steps

本發明之其他的特徵及功效,將於參照圖式的實施方式中清楚地呈現,其中:圖1是一方塊圖,說明執行本發明錯誤狀態資料提供方法之實施例的一電腦裝置所包括的一基板管理控制系統電連接該電腦裝置所包括的一中央處理單元,並經由一通訊網路連接一使用端;圖2是一流程圖,說明本發明錯誤狀態資料提供方法之實施例;及 圖3是一流程圖,說明本發明錯誤狀態資料提供方法之另一實施例。 Other features and advantages of the present invention will be apparent from the embodiments of the present invention, which is illustrated in the accompanying drawings. FIG. 1 is a block diagram illustrating a computer device including an embodiment of the method for providing error status data of the present invention. A substrate management control system is electrically connected to a central processing unit included in the computer device, and is connected to a user terminal via a communication network; FIG. 2 is a flowchart illustrating an embodiment of the error state data providing method of the present invention; Figure 3 is a flow chart showing another embodiment of the error status data providing method of the present invention.

參閱圖1,本發明錯誤狀態資料提供方法之實施例,藉由一電腦裝置1所包括的一基板管理控制系統11來實施。該基板管理控制系統11經由一通訊網路100連接一使用端2。該電腦裝置1還包括一電連接該基板管理控制系統11的中央處理單元12,在本實施例中,該電腦裝置1例如為一伺服器,且該基板管理控制系統11例如包括一非揮發性記憶模組111、一連接該通訊網路100的通訊模組112、及一電連接該非揮發性記憶模組111與該通訊模組112的處理模組113,而且該中央處理單元12例如為一Intel公司所生產的處理器。 Referring to FIG. 1, an embodiment of a method for providing error status data according to the present invention is implemented by a substrate management control system 11 included in a computer device 1. The baseboard management control system 11 is connected to a user terminal 2 via a communication network 100. The computer device 1 further includes a central processing unit 12 electrically connected to the substrate management control system 11. In the embodiment, the computer device 1 is, for example, a server, and the substrate management control system 11 includes, for example, a non-volatile The memory module 111, a communication module 112 connected to the communication network 100, and a processing module 113 electrically connected to the non-volatile memory module 111 and the communication module 112, and the central processing unit 12 is, for example, an Intel The processor produced by the company.

參閱圖1與圖2,本發明錯誤狀態資料提供方法之實施例包含以下步驟。 Referring to Figures 1 and 2, an embodiment of the error status data providing method of the present invention comprises the following steps.

在步驟31中,該基板管理控制系統11的處理模組113經由一平台環境控制介面(Platform Environmental Control Interface,簡稱PECI)讀取並儲存該中央處理單元12之內部暫存器(圖未示)中所儲存的錯誤狀態資料,其中該錯誤狀態資料與該電腦裝置1相關。在本實施例中,該錯誤狀態資料包含機器檢查架構 錯誤狀態資料。此外,該基板管理控制系統11的處理模組113係藉由將先前已儲存於該非揮發性記憶模組111的先前錯誤狀態資料更新為目前所讀取到的該錯誤狀態資料,以儲存該錯誤狀態資料。 In step 31, the processing module 113 of the substrate management control system 11 reads and stores the internal register of the central processing unit 12 via a platform environmental control interface (PECI) (not shown). The error status data stored in the file, wherein the error status data is associated with the computer device 1. In this embodiment, the error status data includes a machine check architecture. Error status data. In addition, the processing module 113 of the substrate management control system 11 stores the error status data that has been previously stored in the non-volatile memory module 111 to the currently read error status data to store the error. Status data.

在步驟32中,該基板管理控制系統11的處理模組113判定其所儲存的該錯誤狀態資料(亦即,儲存於該非揮發性記憶模組111的該錯誤狀態資料)是否含有多個特定錯誤的至少一者。該等特定錯誤為符合一致命錯誤(CATERR)、一不可修正週邊元件介面錯誤(Uncorrectable PCI error)、一致命週邊元件介面錯誤(Fatal PCI error)、一系統管理中斷超時(SMI timeout)、一同位元錯誤(PERR),及一系統錯誤(SERR)之錯誤種類的其中至少一者。當判定出該錯誤狀態資料含有該至少一特定錯誤時,流程進行至步驟33。否則,流程進行至步驟34。 In step 32, the processing module 113 of the substrate management control system 11 determines whether the error status data stored therein (that is, the error status data stored in the non-volatile memory module 111) contains a plurality of specific errors. At least one of them. These specific errors are consistent with a fatal error (CATERR), an uncorrectable PCI error, a fatal peripheral error (Fatal PCI error), a system management interrupt timeout (SMI timeout), together with At least one of a bit error (PERR), and a system error (SERR) error type. When it is determined that the error status data contains the at least one specific error, the flow proceeds to step 33. Otherwise, the flow proceeds to step 34.

在步驟33中,在該基板管理控制系統11的處理模組113經由該通訊模組112接收到一來自該使用端2的資料請求後,該基板管理控制系統11的處理模組113經由該通訊模組112傳送先前於步驟31所儲存的該錯誤狀態資料至該使用端2。在本發明之其他實施例中,該錯誤狀態資料提供方法在步驟31之前還包含一步驟30(見圖3),該基板管理控制系統11的處理模組113判定該電腦裝置1在一目前時間前的一參考時間期間內是否曾重新啟動。當判定 出該電腦裝置1在該目前時間前的該參考時間期間內曾重新啟動時,流程進行至步驟33。否則,流程進行至步驟31。 In step 33, after the processing module 113 of the substrate management control system 11 receives a data request from the user terminal 2 via the communication module 112, the processing module 113 of the substrate management control system 11 communicates via the communication module 112. The module 112 transmits the error status data previously stored in step 31 to the user terminal 2. In other embodiments of the present invention, the error status data providing method further includes a step 30 (see FIG. 3) before the step 31, and the processing module 113 of the baseboard management control system 11 determines that the computer device 1 is in a current time. Whether it was restarted during the previous reference time period. When judged When the computer device 1 is restarted during the reference time period before the current time, the flow proceeds to step 33. Otherwise, the flow proceeds to step 31.

在步驟34中,該基板管理控制系統11的處理模組113計數一預設時間期間後,繼續執行步驟31。 In step 34, after the processing module 113 of the substrate management control system 11 counts a preset time period, the process proceeds to step 31.

值得特別說明的是,在實際運用時,現有的基板管理控制系統11在檢測到該電腦裝置1不正常運作時,會儲存相關於該電腦裝置1的一系統事件日誌(System Event Log,簡稱SEL),以協助管理者了解該電腦裝置1運作異常的原因。然而,管理者除了參考該系統事件日誌外,還須參考如,機器檢查架構錯誤狀態資料等資訊以通盤了解該電腦裝置1運作異常的原因。當該錯誤狀態資料含有該至少一特定錯誤時,該電腦裝置1將無法正常運作,因而該系統事件日誌中將會含有一些異常資訊,當管理者從該系統事件日誌獲知該電腦裝置1因帶有一些異常資訊而不正常運作時,管理者即會利用該使用端2對該基板管理控制系統11發出對於儲存於該基板管理控制系統11內部的該非揮發性記憶模組111的該錯誤狀態資料的該資料請求,該基板管理控制系統11依據該資料請求回傳其(該基板管理控制系統11)內部的該非揮發性記憶模組111儲存的該錯誤狀態資料,該管理者藉此以獲得在該至少一特定錯誤發生時即被該基板管理控制系統11所儲存的該錯誤狀態資料。 It should be particularly noted that, in actual use, the existing substrate management control system 11 stores a system event log (System Event Log, SEL for short) related to the computer device 1 when detecting that the computer device 1 is not operating normally. ) to assist the manager in understanding the cause of the abnormal operation of the computer device 1. However, in addition to referencing the system event log, the administrator must refer to information such as the machine check architecture error status data to understand the reason why the computer device 1 is abnormal. When the error status data contains the at least one specific error, the computer device 1 will not operate normally, and thus the system event log will contain some abnormal information, when the administrator learns from the system event log that the computer device 1 is brought When there is some abnormal information and does not operate normally, the administrator uses the user terminal 2 to issue the error status data to the substrate management control system 11 for the non-volatile memory module 111 stored in the substrate management control system 11. According to the data request, the substrate management control system 11 requests to return the error status data stored by the non-volatile memory module 111 inside the substrate management control system 11 according to the data request, and the manager obtains the The error status data stored by the substrate management control system 11 when the at least one specific error occurs.

在管理者利用該使用端2發出該資料請求,獲得在該至少一特定錯誤發生時即被該基板管理控制系統11所儲存的該錯誤狀態資料後,該基板管理控制系統11才會繼續執行步驟31~步驟32(見圖2)或步驟30~步驟32(見圖3),換言之,在該基板管理控制系統11接收到儲存於該基板管理控制系統11內部的該非揮發性記憶模組111的該錯誤狀態資料的該資料請求前,該基板管理控制系統11不會週期性的至該中央處理單元12內部的暫存器讀取任何錯誤狀態資料,也不會將該中央處理單元12內部的暫存器所儲存的錯誤狀態資料儲存至該基板管理控制系統11內部的該非揮發性記憶模組111,以免覆蓋掉或於儲存過程中破壞該至少一特定錯誤發生時即被儲存於該基板管理控制系統11內部的該非揮發性記憶模組111的錯誤狀態資料。 When the administrator sends the data request by using the terminal 2 to obtain the error state data stored by the substrate management control system 11 when the at least one specific error occurs, the substrate management control system 11 continues to perform the steps. 31~Step 32 (see FIG. 2) or Step 30~Step 32 (see FIG. 3), in other words, the substrate management control system 11 receives the non-volatile memory module 111 stored in the substrate management control system 11 Before the data of the error status data is requested, the baseboard management control system 11 does not periodically read any error status data to the internal register of the central processing unit 12, nor does it internal to the central processing unit 12. The error state data stored in the temporary storage device is stored in the non-volatile memory module 111 inside the substrate management control system 11 so as not to be overwritten or destroyed during storage. The at least one specific error is stored in the substrate management. The error status data of the non-volatile memory module 111 inside the control system 11.

在該錯誤狀態資料不含有該至少一特定錯誤的情況下,該基板管理控制系統11計數該預設時間期間如,50ms後,即會繼續執行步驟31~步驟32(見圖2)或步驟30~步驟32(見圖3),由於該電腦裝置1因發生該至少一特定錯誤而重新啟動需耗費超過50ms的時間期間,因此,因應該電腦裝置1之重新啟動而清除包含該至少一特定錯誤的該錯誤狀態資料所需耗費的時間期間亦超過50ms,故該基板管理控制系統11藉由每50ms即週期性地自動讀取該中央處理單元12之內部暫存器中所儲存的錯誤狀態資料,藉此, 可避免在該至少一特定錯誤發生時的錯誤狀態資料在被該基板管理控制系統11儲存前即被清除。再者,由於該基板管理控制系統11在該至少一特定錯誤發生時即被該基板管理控制系統11所儲存的該錯誤狀態資料已傳送至該使用端2後,才會繼續執行步驟31~步驟32(見圖2)或步驟30~步驟32(見圖3)「亦即,在該至少一特定錯誤發生時即被該基板管理控制系統11所儲存的該錯誤狀態資料傳送至該使用端2前,該基板管理控制系統11的處理模組113不會執行步驟31~步驟32(見圖2)或步驟30~步驟32(見圖3)」,藉此,可避免在該至少一特定錯誤發生時即被儲存於該基板管理控制系統11內部的該非揮發性記憶模組111的該錯誤狀態資料被該基板管理控制系統11後續所讀取的錯誤狀態資料覆蓋或破壞。在本發明的其他實施例中,該基板管理控制系統11除了藉由週期性地讀取該中央處理單元12內部的暫存器中所儲存的該錯誤狀態資料,以獲得該錯誤狀態資料外,當該基板管理控制系統11接收到來自於該中央處理單元12的一錯誤通知,如致命錯誤(CATERR)通知時,該基板管理控制系統11也會讀取該中央處理單元12內部的暫存器中所儲存的該錯誤狀態資料,以便獲得該錯誤狀態資料。 In the case that the error state data does not contain the at least one specific error, the substrate management control system 11 counts the preset time period, for example, after 50 ms, the steps 31 to 32 (see FIG. 2) or step 30 are continued. ~ Step 32 (see FIG. 3), since the computer device 1 restarts due to the occurrence of the at least one specific error, it takes a time period of more than 50 ms, so that the at least one specific error is cleared due to the restart of the computer device 1. The time period required for the error status data also exceeds 50 ms, so the baseboard management control system 11 automatically reads the error status data stored in the internal register of the central processing unit 12 periodically every 50 ms. With this, It is possible to avoid that the error status data at the time when the at least one specific error occurs is cleared before being stored by the baseboard management control system 11. Furthermore, since the substrate management control system 11 transmits the error status data stored by the substrate management control system 11 to the user terminal 2 when the at least one specific error occurs, the step 31 to the step are continued. 32 (see FIG. 2) or step 30 to step 32 (see FIG. 3) "that is, the error status data stored by the substrate management control system 11 is transmitted to the user terminal 2 when the at least one specific error occurs. Before, the processing module 113 of the substrate management control system 11 does not perform steps 31 to 32 (see FIG. 2) or steps 30 to 32 (see FIG. 3), thereby avoiding at least one specific error. The error state data of the non-volatile memory module 111 stored in the substrate management control system 11 at the time of occurrence is overwritten or destroyed by the error state data read by the substrate management control system 11. In other embodiments of the present invention, the substrate management control system 11 obtains the error status data by periodically reading the error status data stored in the temporary register of the central processing unit 12 to obtain the error status data. When the substrate management control system 11 receives an error notification from the central processing unit 12, such as a fatal error (CATERR) notification, the baseboard management control system 11 also reads the internal register of the central processing unit 12. The error status data stored in the file is obtained in order to obtain the error status data.

綜上所述,本發明錯誤狀態資料提供方法,藉由該基板管理控制系統11週期性地讀取並儲存該中央處理單元12內部的暫存器的該錯誤狀態資料,且在該錯誤狀態資料含有該至少一特定 錯誤的情況下,該基板管理控制系統11暫時停止週期性地至該中央處理單元12內部的暫存器讀取並儲存任何錯誤狀態資料的動作,在接收到來自該使用端2的資料請求後,傳送在該至少一特定錯誤發生時的該錯誤狀態資料至該使用端2,以確保該管理者可獲得在該至少一特定錯誤發生時的該錯誤狀態資料,故確實能達成本發明之目的。 In summary, the error status data providing method of the present invention, the substrate management control system 11 periodically reads and stores the error status data of the temporary register in the central processing unit 12, and the error status data is Containing at least one specific In the case of an error, the baseboard management control system 11 temporarily stops the action of periodically reading and storing any error status data to the temporary register inside the central processing unit 12, after receiving the data request from the use terminal 2 Transmitting the error status data to the user terminal 2 when the at least one specific error occurs to ensure that the manager can obtain the error status data when the at least one specific error occurs, so that the purpose of the present invention can be achieved. .

惟以上所述者,僅為本發明之實施例而已,當不能以此限定本發明實施之範圍,凡是依本發明申請專利範圍及專利說明書內容所作之簡單的等效變化與修飾,皆仍屬本發明專利涵蓋之範圍內。 However, the above is only the embodiment of the present invention, and the scope of the invention is not limited thereto, and all the equivalent equivalent changes and modifications according to the scope of the patent application and the patent specification of the present invention are still The scope of the invention is covered.

31~34‧‧‧步驟 31~34‧‧‧Steps

Claims (7)

一種錯誤狀態資料提供方法,藉由一電腦裝置所包括的一基板管理控制系統來實施,該電腦裝置還包括一電連接該基板管理控制系統的中央處理單元,該錯誤狀態資料提供方法包含以下步驟:(A)讀取並儲存該中央處理單元所儲存的錯誤狀態資料;(B)判定該錯誤狀態資料是否含有多個特定錯誤的至少一者;(C)當判定出該錯誤狀態資料不含該至少一特定錯誤時,繼續執行步驟(A);及(D)當判定出該錯誤狀態資料含有該至少一特定錯誤時,在接收到一來自一使用端的資料請求後,傳送先前於步驟(A)所儲存的該錯誤狀態資料至該使用端,其中,在將先前於步驟(A)所儲存的該錯誤狀態資料傳送至該使用端前,該基板管理控制系統不會重覆執行步驟(A)至步驟(B)。 An error state data providing method is implemented by a substrate management control system included in a computer device, the computer device further comprising a central processing unit electrically connected to the substrate management control system, and the error state data providing method comprises the following steps : (A) reading and storing the error status data stored by the central processing unit; (B) determining whether the error status data contains at least one of a plurality of specific errors; (C) determining that the error status data does not contain When the at least one specific error is continued, the step (A) is continued; and (D) when it is determined that the error status data contains the at least one specific error, after receiving a data request from a user end, transmitting the previous step ( A) storing the error status data to the use end, wherein the substrate management control system does not repeat the step before transmitting the error status data previously stored in step (A) to the use end ( A) to step (B). 如請求項1所述的錯誤狀態資料提供方法,其中,在該步驟(C)中,當判定出該錯誤狀態資料不含該至少一特定錯誤時,該基板管理控制系統計數一預設時間期間後,重複步驟(A)至步驟(B)一次。 The error status data providing method according to claim 1, wherein in the step (C), when it is determined that the error status data does not include the at least one specific error, the substrate management control system counts a preset time After the period, repeat steps (A) to (B) once. 如請求項1所述的錯誤狀態資料提供方法,該基板管理控制系統包括一用於儲存該錯誤狀態資料的非揮發性記憶模組,其中,在步驟(A)中,該基板管理控制系統將先前 已儲存於該非揮發性記憶模組的先前錯誤狀態資料更新為在步驟(A)所讀取到的該錯誤狀態資料,以儲存該錯誤狀態資料。 The method of providing an error status data according to claim 1, wherein the baseboard management control system includes a non-volatile memory module for storing the error status data, wherein in step (A), the baseboard management control system previously The previous error status data stored in the non-volatile memory module is updated to the error status data read in step (A) to store the error status data. 如請求項1所述的錯誤狀態資料提供方法,在步驟(D)之後還包含一步驟(E),重覆步驟(A)至步驟(B)。 The error status data providing method according to claim 1, further comprising a step (E) after the step (D), repeating the steps (A) to (B). 如請求項1所述的錯誤狀態資料提供方法,其中:在步驟(A)中,該錯誤狀態資料包含機器檢查架構錯誤狀態(Machine Check Architecture error status)資料;及在步驟(B)中,該等特定錯誤包含一致命錯誤(CATERR)、一不可修正週邊元件介面錯誤(Uncorrectable PCI error)、一致命週邊元件介面錯誤(Fatal PCI error)、一系統管理中斷超時(SMI timeout)、一同位元錯誤(PERR),及一系統錯誤(SERR)之錯誤種類的其中至少一者。 The error status data providing method according to claim 1, wherein: in the step (A), the error status data includes a Machine Check Architecture error status data; and in the step (B), the Specific errors include a fatal error (CATERR), an uncorrectable PCI error, a fatal peripheral error (Fatal PCI error), a system management interrupt timeout (SMI timeout), a homobit At least one of the error (PERR), and a system error (SERR) error type. 如請求項1所述的錯誤狀態資料提供方法,其中,在步驟(A)中,該基板管理控制系統係經由一平台環境控制介面來讀取該中央處理單元之該錯誤狀態資料。 The error status data providing method according to claim 1, wherein in the step (A), the substrate management control system reads the error status data of the central processing unit via a platform environment control interface. 如請求項1所述的錯誤狀態資料提供方法,在步驟(A)之前,還包含以下步驟:(F)該基板管理控制系統判定該電腦裝置在一目前時間前的一參考時間期間內是否曾重新啟動;(G)當判定出該電腦裝置在該目前時間前的該參考時間期間內曾重新啟動時,在接收到來自該使用端的另一資 料請求後,傳送先前於步驟(A)所儲存的先前錯誤狀態資料至該使用端;(H)當判定出該電腦裝置在該目前時間前的該參考時間期間內不曾重新啟動時,步驟(A)被執行。 The error status data providing method according to claim 1, before the step (A), further comprising the following steps: (F) the substrate management control system determines whether the computer device has been in a reference time period before the current time Restarting; (G) receiving another resource from the user terminal when it is determined that the computer device was restarted during the reference time period before the current time After the request is made, the previous error status data previously stored in step (A) is transmitted to the use end; (H) when it is determined that the computer device has not been restarted during the reference time period before the current time, the step ( A) is executed.
TW105110454A 2016-04-01 2016-04-01 Method of providing error status data for computer device TWI602054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW105110454A TWI602054B (en) 2016-04-01 2016-04-01 Method of providing error status data for computer device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW105110454A TWI602054B (en) 2016-04-01 2016-04-01 Method of providing error status data for computer device

Publications (2)

Publication Number Publication Date
TWI602054B true TWI602054B (en) 2017-10-11
TW201737079A TW201737079A (en) 2017-10-16

Family

ID=61011313

Family Applications (1)

Application Number Title Priority Date Filing Date
TW105110454A TWI602054B (en) 2016-04-01 2016-04-01 Method of providing error status data for computer device

Country Status (1)

Country Link
TW (1) TWI602054B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102968354A (en) * 2012-11-13 2013-03-13 浪潮电子信息产业股份有限公司 Intel Brickland-EX platform-based same-frequency lock-step mode automatic switching method
TW201351133A (en) * 2012-06-13 2013-12-16 Hon Hai Prec Ind Co Ltd Method and system for reading system event
TW201423390A (en) * 2012-12-06 2014-06-16 Inventec Corp Computer system and operating method thereof
TWI512490B (en) * 2014-10-27 2015-12-11 Quanta Comp Inc System for retrieving console messages and method thereof and non-transitory computer-readable medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201351133A (en) * 2012-06-13 2013-12-16 Hon Hai Prec Ind Co Ltd Method and system for reading system event
US9141464B2 (en) * 2012-06-13 2015-09-22 Shenzhen Treasure City Technology Co., Ltd. Computing device and method for processing system events of computing device
CN102968354A (en) * 2012-11-13 2013-03-13 浪潮电子信息产业股份有限公司 Intel Brickland-EX platform-based same-frequency lock-step mode automatic switching method
TW201423390A (en) * 2012-12-06 2014-06-16 Inventec Corp Computer system and operating method thereof
TWI512490B (en) * 2014-10-27 2015-12-11 Quanta Comp Inc System for retrieving console messages and method thereof and non-transitory computer-readable medium

Also Published As

Publication number Publication date
TW201737079A (en) 2017-10-16

Similar Documents

Publication Publication Date Title
JP6333410B2 (en) Fault processing method, related apparatus, and computer
CN106936616B (en) Backup communication method and device
US10990468B2 (en) Computing system and error handling method for computing system
US10275330B2 (en) Computer readable non-transitory recording medium storing pseudo failure generation program, generation method, and generation apparatus
US11687395B2 (en) Detecting and recovering from fatal storage errors
US20170149925A1 (en) Processing cache data
TW201417536A (en) Method and system for automatically managing servers
US20140143597A1 (en) Computer system and operating method thereof
CN112667422A (en) Memory fault processing method and device, computing equipment and storage medium
TWI518680B (en) Method for maintaining file system of computer system
US11182252B2 (en) High availability state machine and recovery
JP6599725B2 (en) Information processing apparatus, log management method, and computer program
CN115705261A (en) Memory fault repairing method, CPU, OS, BIOS and server
JP5999254B2 (en) Management apparatus, method and program
TWI602054B (en) Method of providing error status data for computer device
CN116719657A (en) Firmware fault log generation method, device, server and readable medium
US20110271138A1 (en) System and method for handling system failure
CN107451035B (en) Error state data providing method for computer device
US11797368B2 (en) Attributing errors to input/output peripheral drivers
TWI587128B (en) Method of automatically providing error status data for computer device
WO2024000535A1 (en) Partition table update method and apparatus, and electronic device and storage medium
CN112084049B (en) Method for monitoring resident program of baseboard management controller
CN115292086A (en) Method and device for remotely controlling intelligent terminal, terminal equipment and storage medium
TW201926073A (en) Server having storage device and operation method thereof