TWI767378B - Error type determination system and method thereof - Google Patents

Error type determination system and method thereof Download PDF

Info

Publication number
TWI767378B
TWI767378B TW109137256A TW109137256A TWI767378B TW I767378 B TWI767378 B TW I767378B TW 109137256 A TW109137256 A TW 109137256A TW 109137256 A TW109137256 A TW 109137256A TW I767378 B TWI767378 B TW I767378B
Authority
TW
Taiwan
Prior art keywords
error type
error
information
type information
processing unit
Prior art date
Application number
TW109137256A
Other languages
Chinese (zh)
Other versions
TW202217567A (en
Inventor
於寶在
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司 filed Critical 英業達股份有限公司
Priority to TW109137256A priority Critical patent/TWI767378B/en
Publication of TW202217567A publication Critical patent/TW202217567A/en
Application granted granted Critical
Publication of TWI767378B publication Critical patent/TWI767378B/en

Links

Images

Landscapes

  • Eye Examination Apparatus (AREA)
  • Radio Transmission System (AREA)
  • Measurement Of Radiation (AREA)
  • Programmable Controllers (AREA)

Abstract

An error type determination system and method are provided in the present invention. The error type determination system includes a server operating unit, a basic input/output system, a logic processing module, and a management control module. The server operating unit is provided to send an error occurred signal which includes a first error information when executes a executing program and generates an error. When the basic input/output system receives the error occurred signal, the basic input/output system figures out a corresponding error type information according to an error type comparison table and the first error information, and then sends an error type signal which includes the corresponding error type information. When the logic processing module receives the error type signal, the logic processing module figures out and stores the corresponding error type information, and then transmits the corresponding error type information to the management control module.

Description

錯誤類型判斷系統及其方法Error type judgment system and method

本發明係有關於一種錯誤類型判斷系統及其方法,尤其是指一種用於判斷執行發生錯誤時之錯誤類型之錯誤類型判斷系統及其方法。The present invention relates to an error type judging system and method, in particular to an error type judging system and method for judging the error type when an execution error occurs.

隨著網路科技的進步,伺服器在人們的生活中扮演著不可或缺的角色,一般而言,伺服器的系統在運作的過程中,無論是自關機狀態進入開機狀態、自休眠狀態回復至工作狀態或是正常運作的狀態下,會有一定的機率產生不可校正的錯誤(Uncorrectable Error, UCE ERROR),進而導致當機之問題。With the advancement of network technology, servers play an indispensable role in people's lives. Generally speaking, during the operation of the server system, whether it is from the shutdown state to the power-on state, or from the sleep state to recover In the working state or in the normal operation state, there will be a certain probability of uncorrectable errors (Uncorrectable Error, UCE ERROR), which will lead to the problem of machine crash.

其中,現有技術中,由於產線人員無法精準的確定此不可校正的錯誤的類型,因而無法有效了解是甚麼原因造成此不可校正的錯誤,因此時常需要請基本輸入輸出系統(Basic Input/Output System, BIOS)之負責部門人員、基板管理控制器(Baseboard Management Controller, BMC)之負責部門人員以及其他硬體之負責部門人員至產線進行除錯(debug),因而造成所有人員時間之浪費並影響到生產之效率,因此,現有技術仍具備改善之空間。Among them, in the prior art, since the production line personnel cannot accurately determine the type of the uncorrectable error, they cannot effectively understand what causes the uncorrectable error. Therefore, the Basic Input/Output System (Basic Input/Output System) is often required. , BIOS), the Baseboard Management Controller (BMC), and other hardware departments to debug the production line, thus causing a waste of time and affecting all personnel To the efficiency of production, therefore, the existing technology still has room for improvement.

有鑒於在先前技術中,現有之伺服器在運作過程中若產生錯誤時,現場人員無法確定錯誤之類型而產生有大量人員的不方便以及生產效率降低之問題,本發明透過提供一種用於判斷執行發生錯誤時之錯誤類型之錯誤類型判斷系統及其方法,以解決先前技術中所述之問題。Considering that in the prior art, if an error occurs in the operation of the existing server, the on-site personnel cannot determine the type of error, resulting in the inconvenience of a large number of personnel and the reduction of production efficiency, the present invention provides a method for judging An error type judging system and method thereof are implemented to solve the problems described in the prior art.

本發明為解決先前技術之問題,所採用之必要技術手段為提供一種錯誤類型判斷系統,係包含一伺服運作單元、一基本輸入輸出系統(Basic Input/Output System, BIOS)、一邏輯處理模組以及一管理控制模組。伺服運作單元係用以在執行一執行程式並發生一第一錯誤資訊時,發送出一包含該第一錯誤資訊之錯誤發生信號。基本輸入輸出系統係包含一第一儲存單元以及一第一處理單元,第一儲存單元係儲存有一包含有複數個第二錯誤資訊與複數個分別對應於各第二錯誤資訊之錯誤類型資訊之對應關係之錯誤類型比對表。In order to solve the problems of the prior art, the necessary technical means adopted by the present invention is to provide an error type judgment system, which includes a servo operation unit, a Basic Input/Output System (BIOS), and a logic processing module and a management control module. The servo operation unit is used for sending an error occurrence signal including the first error information when an execution program is executed and a first error information occurs. The basic input output system includes a first storage unit and a first processing unit, and the first storage unit stores a correspondence including a plurality of second error messages and a plurality of error type messages corresponding to the second error messages respectively Error type comparison table for relationship.

第一處理單元電性連接於第一儲存單元,通信連接於伺服運作單元,用以在接收到錯誤發生信號時,依據錯誤類型比對表找出第一錯誤資訊所對應之該些第二錯誤資訊中之一者,再依據錯誤類型比對表與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者,並將上述所對應之該些錯誤類型資訊中之一者定義為一對應錯誤類型資訊,藉以透過一資訊傳輸協議傳送出一包含有對應錯誤類型資訊之錯誤類型信號。The first processing unit is electrically connected to the first storage unit, and is communicatively connected to the servo operation unit for finding the second errors corresponding to the first error information according to the error type comparison table when an error occurrence signal is received One of the information, and then according to the error type comparison table and one of the second error information corresponding to the above to find out the corresponding one of the error type information, and the corresponding one of the above error type information. One of the error type information is defined as a corresponding error type information, so that an error type signal including the corresponding error type information is transmitted through an information transmission protocol.

邏輯處理模組係包含一第二儲存單元以及一第二處理單元,第二處理單元係電性連接於第二儲存單元,通信連接於第一處理單元,用以在接收到錯誤類型信號時,解析出對應錯誤類型資訊,將對應錯誤類型資訊儲存於第二儲存單元,並用以傳送出對應錯誤類型資訊。管理控制模組係通信連接於第二處理單元,用以接收對應錯誤類型資訊。The logic processing module includes a second storage unit and a second processing unit, the second processing unit is electrically connected to the second storage unit, and is communicatively connected to the first processing unit, for receiving an error type signal, The corresponding error type information is parsed out, the corresponding error type information is stored in the second storage unit, and the corresponding error type information is transmitted. The management control module is communicatively connected to the second processing unit for receiving corresponding error type information.

在上述必要技術手段的基礎下,本發明所衍生之一附屬技術手段為管理控制模組係更用以將一錯誤類型檢查指令傳送至第二處理單元,藉以觸發第二處理單元將對應錯誤類型資訊傳送至管理控制模組。此外,邏輯處理模組為一複雜可程式邏輯裝置(Complex Programmable Logic Device, CPLD),管理控制模組為一基板管理控制器(Baseboard Management Controller, BMC),資訊傳輸協議為一串列通用型輸入輸出(Serial General Purpose Input/Output, SGPIO)協議。On the basis of the above-mentioned necessary technical means, an auxiliary technical means derived from the present invention is that the management control module is further used to transmit an error type check command to the second processing unit, thereby triggering the second processing unit to detect the corresponding error type The information is sent to the management control module. In addition, the logic processing module is a Complex Programmable Logic Device (CPLD), the management control module is a Baseboard Management Controller (BMC), and the information transmission protocol is a serial general-purpose input Output (Serial General Purpose Input/Output, SGPIO) protocol.

本發明為解決先前技術之問題,所採用之必要技術手段為另外提供一種錯誤類型判斷方法,係利用上述之錯誤類型判斷系統加以實施。錯誤類型判斷方法中,先利用基本輸入輸出系統之第一處理單元判斷是否接收到伺服運作單元執行上述執行程式並發生第一錯誤資訊時所發送出之包含第一錯誤資訊之錯誤發生信號。在上述步驟之判斷結果為是時,利用基本輸入輸出系統之第一處理單元依據錯誤類型比對表找出第一錯誤資訊所對應之該些第二錯誤資訊中之一者,再依據錯誤類型比對表與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者,並將上述所對應之該些錯誤類型資訊中之一者定義為對應錯誤類型資訊,藉以透過資訊傳輸協議傳送出包含有對應錯誤類型資訊之錯誤類型信號。In order to solve the problems of the prior art, the necessary technical means adopted by the present invention is to provide an error type judgment method, which is implemented by using the above error type judgment system. In the error type judging method, the first processing unit of the basic input output system is used to judge whether an error occurrence signal including the first error information is sent when the servo operation unit executes the execution program and generates the first error information. When the judgment result of the above step is yes, use the first processing unit of the basic input output system to find out one of the second error information corresponding to the first error information according to the error type comparison table, and then according to the error type Find out one of the corresponding error type information by comparing the table with one of the above-mentioned corresponding second error information, and define one of the above-mentioned corresponding error type information as Corresponding error type information, so as to transmit the error type signal including the corresponding error type information through the information transmission protocol.

接著利用邏輯處理模組之第二處理單元接收錯誤類型信號,解析出對應錯誤類型資訊,將對應錯誤類型資訊儲存於第二儲存單元,並傳送出對應錯誤類型資訊。最後利用管理控制模組接收對應錯誤類型資訊,藉以顯示出對應錯誤類型資訊。其中,在上述第一個步驟之判斷結果為否時,係重複執行上述第一個步驟。Then, the second processing unit of the logic processing module receives the error type signal, parses out the corresponding error type information, stores the corresponding error type information in the second storage unit, and transmits the corresponding error type information. Finally, the management control module is used to receive the corresponding error type information, so as to display the corresponding error type information. Wherein, when the judgment result of the above-mentioned first step is no, the above-mentioned first step is repeatedly executed.

在上述必要技術手段的基礎下,本發明所衍生之一附屬技術手段為邏輯處理模組為一複雜可程式邏輯裝置(Complex Programmable Logic Device, CPLD),管理控制模組為一基板管理控制器(Baseboard Management Controller, BMC),資訊傳輸協議為一串列通用型輸入輸出(Serial General Purpose Input/Output, SGPIO)協議。On the basis of the above necessary technical means, an auxiliary technical means derived from the present invention is that the logic processing module is a Complex Programmable Logic Device (CPLD), and the management control module is a baseboard management controller ( Baseboard Management Controller, BMC), the information transmission protocol is a serial general purpose input/output (Serial General Purpose Input/Output, SGPIO) protocol.

承上所述,在採用本發明所提供之錯誤類型判斷系統及其方法後,由於預先將錯誤類型比對表建立於基本輸入輸出系統,因此在伺服器運作過程中發生錯誤時,基本輸入輸出系統即可立即識別出是何種錯誤類型,並可直接傳送至邏輯處理模組而觸發邏輯處理模組儲存並傳送至管理控制模組,使得現場人員即可透過管理控制模組獲知發生錯誤時的錯誤類型,從而可快速發現發生錯誤的來源,因而可有效降低其他人員的不方便,並可有效提升在發生錯誤時的處理效率。Based on the above, after using the error type judgment system and method provided by the present invention, since the error type comparison table is established in the basic input output system in advance, when an error occurs during the operation of the server, the basic input output The system can immediately identify the type of error, and can directly transmit it to the logic processing module to trigger the logic processing module to store and transmit it to the management control module, so that the on-site personnel can know when an error occurs through the management control module. Therefore, the source of the error can be quickly found, which can effectively reduce the inconvenience of other personnel, and can effectively improve the processing efficiency when errors occur.

下面將結合示意圖對本發明的具體實施方式進行更詳細的描述。根據下列描述和申請專利範圍,本發明的優點和特徵將更清楚。需說明的是,圖式均採用非常簡化的形式且均使用非精準的比例,僅用以方便、明晰地輔助說明本發明實施例的目的。The specific embodiments of the present invention will be described in more detail below with reference to the schematic diagrams. The advantages and features of the present invention will become more apparent from the following description and the scope of the claims. It should be noted that the drawings are all in a very simplified form and use inaccurate scales, and are only used to facilitate and clearly assist the purpose of explaining the embodiments of the present invention.

請參閱第一圖,第一圖係顯示本發明較佳實施例所提供之錯誤類型判斷系統之方塊圖。如圖所示,本發明所提供之錯誤類型判斷系統1,係包含一伺服運作單元11、一基本輸入輸出系統(Basic Input/Output System, BIOS)12、一邏輯處理模組13以及一管理控制模組14。其中,本發明較佳實施例中,錯誤類型判斷系統1係應用於一伺服器(圖未示),而伺服運作單元11例如可為伺服器內如中央處理器(Central Processing Unit, CPU)、微控制器(Microcontroller Unit, MCU)或其他具有處理功能之處理器,也可為下述基本輸入輸出系統12內的處理模組,其係視實務上之設計而定。Please refer to the first figure. The first figure is a block diagram of an error type determination system provided by a preferred embodiment of the present invention. As shown in the figure, the error type determination system 1 provided by the present invention includes a servo operation unit 11 , a basic input/output system (BIOS) 12 , a logic processing module 13 and a management control Module 14. Among them, in a preferred embodiment of the present invention, the error type determination system 1 is applied to a server (not shown in the figure), and the servo operation unit 11 may be, for example, a central processing unit (CPU), The microcontroller (Microcontroller Unit, MCU) or other processors with processing functions can also be the processing modules in the basic input output system 12 described below, depending on the practical design.

基本輸入輸出系統12係包含一第一儲存單元121以及一第一處理單元122,第一儲存單元121例如可為現有之具有儲存資料功能之記憶體,第一儲存單元121係儲存有一包含有複數個第二錯誤資訊與複數個分別對應於各第二錯誤資訊之錯誤類型資訊之對應關係之錯誤類型比對表1211。The basic input output system 12 includes a first storage unit 121 and a first processing unit 122. The first storage unit 121 can be, for example, an existing memory with a function of storing data, and the first storage unit 121 stores a memory including a plurality of The error type comparison table 1211 of the corresponding relationship between the second error information and the plurality of error type information corresponding to the second error information respectively.

舉例來說,本發明較佳實施例之第一儲存單元121係以bit位元方式儲存第二錯誤資訊,因此第二錯誤資訊例如是0x10000000、0x20000000與0x40000000。另外,上述錯誤類型資訊例如是多位元修正錯誤記憶體錯誤(Multi Bit ECC Memory Error)、普通數據奇偶校檢錯誤(Parity Error, PERR)與系統錯誤(System Error, SERR),但其他實施例中不限於此。另外,本發明較佳實施例之錯誤類型比對表1211所儲存之對應關係例如可為下表。 錯誤類型資訊 第二錯誤資訊 多位元修正錯誤記憶體錯誤 0x10000000 普通數據奇偶校檢錯誤 0x20000000 系統錯誤 0x40000000 For example, the first storage unit 121 of the preferred embodiment of the present invention stores the second error information in the form of bits, so the second error information is, for example, 0x10000000, 0x20000000 and 0x40000000. In addition, the above-mentioned error type information is, for example, a multi-bit correction error memory error (Multi Bit ECC Memory Error), a normal data parity error (Parity Error, PERR), and a system error (System Error, SERR), but other embodiments is not limited to this. In addition, the corresponding relationship stored in the error type comparison table 1211 of the preferred embodiment of the present invention can be, for example, the following table. Error type information Second error message multi-bit bugfix memory bug 0x10000000 Normal data parity error 0x20000000 system error 0x40000000

第一處理單元122例如可為現有具有處理功能之處理器,電性連接於第一儲存單元121,通信連接於伺服運作單元11,另外,第一處理單元122也可與伺服運作單元11整合為上述之處理模組而設置於基本輸入輸出系統12內,其係視實務上之設計而定。其中,本發明較佳實施例所述之通信連接皆為有線通信連接,在其他實施例中可為無線通信連接,其係視實務上之設計而定。The first processing unit 122 can be, for example, an existing processor with processing functions, electrically connected to the first storage unit 121, and communicatively connected to the servo operation unit 11. In addition, the first processing unit 122 can also be integrated with the servo operation unit 11 as a The above-mentioned processing module is arranged in the basic input output system 12, which depends on the practical design. Wherein, the communication connections described in the preferred embodiments of the present invention are all wired communication connections, and in other embodiments, they may be wireless communication connections, which depend on practical designs.

邏輯處理模組13例如可為一複雜可程式邏輯裝置(Complex Programmable Logic Device, CPLD)。邏輯處理模組13係包含一第二儲存單元131以及一第二處理單元132,第二儲存單元131例如可為現有之具有儲存資料之功能之記憶體。第二處理單元132例如可為現有具有處理功能之處理器,並電性連接於第二儲存單元131,通信連接於第一處理單元122。The logic processing module 13 can be, for example, a complex programmable logic device (Complex Programmable Logic Device, CPLD). The logic processing module 13 includes a second storage unit 131 and a second processing unit 132 , and the second storage unit 131 can be, for example, an existing memory with the function of storing data. The second processing unit 132 can be, for example, an existing processor with processing functions, and is electrically connected to the second storage unit 131 and communicatively connected to the first processing unit 122 .

管理控制模組14例如為一基板管理控制器(Baseboard Management Controller, BMC),並通信連接於第二處理單元132。The management control module 14 is, for example, a Baseboard Management Controller (BMC), and is communicatively connected to the second processing unit 132 .

伺服運作單元11係用以在執行一執行程式並發生一第一錯誤資訊時,發送出一包含第一錯誤資訊之錯誤發生信號S1。其中,上述執行程式例如是開機程式、運作作業系統程式或是其他運作程式,而本發明較佳實施例中,第一錯誤資訊例如是0x40000000。The servo operation unit 11 is used for sending an error occurrence signal S1 including the first error information when an execution program is executed and a first error information is generated. Wherein, the above-mentioned execution program is, for example, a boot program, an operating system program or other operating programs, and in a preferred embodiment of the present invention, the first error message is, for example, 0x40000000.

第一處理單元122在接收到錯誤發生信號S1時,依據錯誤類型比對表1211找出第一錯誤資訊所對應之該些第二錯誤資訊中之一者(本發明較佳實施例即找出所對應之0x40000000),再依據錯誤類型比對表1211與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者(本發明較佳實施例即找出0x40000000是對應於該些錯誤類型資訊中之系統錯誤),並將上述所對應之該些錯誤類型資訊中之一者定義為一對應錯誤類型資訊1311(即將系統錯誤定義為對應錯誤類型資訊1311),藉以透過一資訊傳輸協議傳送出一包含有對應錯誤類型資訊1311之錯誤類型信號S2。When receiving the error occurrence signal S1, the first processing unit 122 finds out one of the second error messages corresponding to the first error message according to the error type comparison table 1211 (a preferred embodiment of the present invention is to find out corresponding 0x40000000), and then according to the error type comparison table 1211 and one of the above-mentioned corresponding second error information to find out one of the corresponding error type information (a preferred embodiment of the present invention That is to find out that 0x40000000 corresponds to the system error in the error type information), and define one of the above-mentioned corresponding error type information as a corresponding error type information 1311 (that is, define the system error as the corresponding error type information 1311), so as to transmit an error type signal S2 including the corresponding error type information 1311 through an information transmission protocol.

其中,上述之資訊傳輸協議例如為一串列通用型輸入輸出(Serial General Purpose Input/Output, SGPIO)協議,即第一儲存單元以bit位元儲存方式係為了因應串列通用型輸入輸出協議的傳送方式,再具體而言,本發明較佳實施例即是基本輸入輸出系統12透過串列通用型輸入輸出的腳位連接於邏輯處理模組13,而本發明較佳實施例即採用基本輸入輸出系統12之串列通用型輸入輸出的腳位將錯誤類型信號S2傳送至邏輯處理模組13之第二處理單元132。Wherein, the above-mentioned information transmission protocol is, for example, a serial general purpose input/output (SGPIO) protocol, that is, the first storage unit is stored in bits in order to respond to the serial general purpose input/output protocol. The transmission method, and more specifically, the preferred embodiment of the present invention is that the basic input and output system 12 is connected to the logic processing module 13 through the serial general-purpose input and output pins, and the preferred embodiment of the present invention uses the basic input The serial general-purpose input and output pins of the output system 12 transmit the error type signal S2 to the second processing unit 132 of the logic processing module 13 .

第二處理單元132在接收到錯誤類型信號S2時,解析出對應錯誤類型資訊1311,從而得知此次發生錯誤的錯誤類型資訊為系統錯誤,並將對應錯誤類型資訊1311儲存於第二儲存單元131。此外,第二處理單元132並以一包含有對應錯誤類型資訊1311之錯誤告知信號S3傳送至管理控制模組14的方式將對應錯誤類型資訊1311傳送至管理控制模組14。When the second processing unit 132 receives the error type signal S2, it parses out the corresponding error type information 1311, so as to know that the error type information that has occurred this time is a system error, and stores the corresponding error type information 1311 in the second storage unit 131. In addition, the second processing unit 132 transmits the corresponding error type information 1311 to the management control module 14 in a manner of transmitting an error notification signal S3 including the corresponding error type information 1311 to the management control module 14 .

管理控制模組14接收到錯誤告知信號S3後,即可解析出對應錯誤類型資訊1311,進而可透過顯示裝置顯示出對應錯誤類型資訊1311,也就是說,現場產線人員可即時透過管理控制模組14獲知此次發生錯誤之錯誤類型為何。After the management control module 14 receives the error notification signal S3, it can analyze the corresponding error type information 1311, and then can display the corresponding error type information 1311 through the display device. Group 14 learns what type of error occurred this time.

另外,本發明較佳實施例中,管理控制模組14係先將一錯誤類型檢查指令S4傳送至第二處理單元132,第二處理單元132才進一步將包含有對應錯誤類型資訊1311之錯誤告知信號S3傳送至管理控制模組14。也就是說,本發明較佳實施例中,現場產線人員可在發生錯誤之後,再觸發管理控制模組14發送錯誤類型檢查指令S4。其他實施例中,可為只要一發生錯誤,邏輯處理模組13之第二處理單元132即主動將對應錯誤類型資訊1311發送至管理控制模組14,其係視實務上之設計而定。In addition, in the preferred embodiment of the present invention, the management control module 14 first transmits an error type check command S4 to the second processing unit 132, and then the second processing unit 132 further informs the error including the corresponding error type information 1311 The signal S3 is sent to the management control module 14 . That is to say, in a preferred embodiment of the present invention, the on-site production line personnel can trigger the management control module 14 to send the error type check instruction S4 after an error occurs. In other embodiments, as soon as an error occurs, the second processing unit 132 of the logic processing module 13 can actively send the corresponding error type information 1311 to the management control module 14, which depends on practical design.

請參閱第二圖,第二圖係顯示本發明較佳實施例所提供之錯誤類型判斷方法之流程圖。本發明較佳實施例係還提供一種錯誤類型判斷方法,並且是利用第一圖所示之錯誤類型判斷系統加以實施,並包含以下步驟S101至步驟S104。Please refer to the second figure. The second figure is a flow chart of the method for judging the error type provided by the preferred embodiment of the present invention. A preferred embodiment of the present invention also provides an error type judging method, which is implemented by using the error type judging system shown in the first figure, and includes the following steps S101 to S104.

步驟S101:利用基本輸入輸出系統12之第一處理單元122判斷是否接收到伺服運作單元11執行執行程式並發生第一錯誤資訊時所發送出之包含第一錯誤資訊之錯誤發生信號S1。Step S101 : Use the first processing unit 122 of the BIOS 12 to determine whether the error occurrence signal S1 including the first error information is received when the servo operation unit 11 executes the program and generates the first error information.

步驟S102:利用基本輸入輸出系統12之第一處理單元122依據錯誤類型比對表1211找出第一錯誤資訊所對應之該些第二錯誤資訊中之一者,再依據錯誤類型比對表1211與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者,並將上述所對應之該些錯誤類型資訊中之一者定義為對應錯誤類型資訊1311,藉以透過資訊傳輸協議傳送出包含有對應錯誤類型資訊1311之錯誤類型信號S2。Step S102: Use the first processing unit 122 of the BIOS 12 to find out one of the second error messages corresponding to the first error message according to the error type comparison table 1211, and then according to the error type comparison table 1211 Find one of the corresponding error type information with one of the second error information corresponding to the above, and define one of the corresponding error type information as the corresponding error type The information 1311 is used to transmit the error type signal S2 including the corresponding error type information 1311 through the information transmission protocol.

步驟S103:利用邏輯處理模組13之第二處理單元132接收錯誤類型信號S2,解析出對應錯誤類型資訊1311,將對應錯誤類型資訊1311儲存於第二儲存單元131,並傳送出對應錯誤類型資訊1311。Step S103: Use the second processing unit 132 of the logic processing module 13 to receive the error type signal S2, parse out the corresponding error type information 1311, store the corresponding error type information 1311 in the second storage unit 131, and transmit the corresponding error type information 1311.

步驟S104:利用管理控制模組14接收對應錯誤類型資訊1311,藉以顯示出對應錯誤類型資訊1311。Step S104 : use the management control module 14 to receive the corresponding error type information 1311 so as to display the corresponding error type information 1311 .

其中,各步驟其他的詳細說明皆已在上述數個段落中提及,故不多加贅述。Wherein, other detailed descriptions of each step have been mentioned in the above paragraphs, so they are not repeated here.

綜上所述,在採用本發明所提供之錯誤類型判斷系統及其方法後,由於預先將錯誤類型比對表建立於基本輸入輸出系統,因此在伺服器運作過程中發生錯誤時,基本輸入輸出系統即可立即識別出是何種錯誤類型,可直接傳送至邏輯處理模組而觸發邏輯處理模組儲存並傳送至管理控制模組,使得現場人員即可獲知發生錯誤時的錯誤類型,從而可快速發現發生錯誤的來源,因而可有效降低其他人員的不方便,並可有效提升在發生錯誤時的處理效率。To sum up, after using the error type judgment system and method provided by the present invention, since the error type comparison table is established in the basic input and output system in advance, when an error occurs during the operation of the server, the basic input and output The system can immediately identify what type of error it is, and can directly transmit it to the logic processing module to trigger the logic processing module to store and transmit it to the management control module, so that the on-site personnel can know the type of error when the error occurs, so that the Quickly find the source of the error, which can effectively reduce the inconvenience of other personnel, and can effectively improve the processing efficiency when errors occur.

藉由以上較佳具體實施例之詳述,係希望能更加清楚描述本發明之特徵與精神,而並非以上述所揭露的較佳具體實施例來對本發明之範疇加以限制。相反地,其目的是希望能涵蓋各種改變及具相等性的安排於本發明所欲申請之專利範圍的範疇內。Through the detailed description of the preferred embodiments above, it is hoped that the features and spirit of the present invention can be described more clearly, and the scope of the present invention is not limited by the preferred embodiments disclosed above. On the contrary, the intention is to cover various modifications and equivalent arrangements within the scope of the claimed scope of the present invention.

1:錯誤類型判斷系統 11:伺服運作單元 12:基本輸入輸出系統 121:第一儲存單元 1211:錯誤類型比對表 122:第一處理單元 13:邏輯處理模組 131:第二儲存單元 1311:對應錯誤類型資訊 132:第二處理單元 14:管理控制模組 S1:錯誤發生信號 S2:錯誤類型信號 S3:錯誤告知信號 S4:錯誤類型檢查指令 S101-S104:步驟 1: Error type judgment system 11: Servo operation unit 12: Basic Input Output System 121: The first storage unit 1211: Wrong type comparison table 122: first processing unit 13: Logic processing module 131: Second storage unit 1311: Corresponding error type information 132: Second processing unit 14: Management control module S1: Error occurrence signal S2: Wrong type signal S3: Error notification signal S4: Error type check instruction S101-S104: Steps

第一圖係顯示本發明較佳實施例所提供之錯誤類型判斷系統之方塊圖;以及 第二圖係顯示本發明較佳實施例所提供之錯誤類型判斷方法之流程圖。 The first figure is a block diagram showing an error type judgment system provided by a preferred embodiment of the present invention; and The second figure is a flow chart showing a method for judging an error type provided by a preferred embodiment of the present invention.

1:錯誤類型判斷系統 11:伺服運作單元 12:基本輸入輸出系統 121:第一儲存單元 1211:錯誤類型比對表 122:第一處理單元 13:邏輯處理模組 131:第二儲存單元 1311:對應錯誤類型資訊 132:第二處理單元 14:管理控制模組 S1:錯誤發生信號 S2:錯誤類型信號 S3:錯誤告知信號 S4:錯誤類型檢查指令 1: Error type judgment system 11: Servo operation unit 12: Basic Input Output System 121: The first storage unit 1211: Wrong type comparison table 122: first processing unit 13: Logic processing module 131: Second storage unit 1311: Corresponding error type information 132: Second processing unit 14: Management control module S1: Error occurrence signal S2: Wrong type signal S3: Error notification signal S4: Error type check instruction

Claims (9)

一種錯誤類型判斷系統,係包含: 一伺服運作單元,係用以在執行一執行程式並發生一第一錯誤資訊時,發送出一包含該第一錯誤資訊之錯誤發生信號; 一基本輸入輸出系統(Basic Input/Output System, BIOS),係包含: 一第一儲存單元,係儲存有一包含有複數個第二錯誤資訊與複數個分別對應於各第二錯誤資訊之錯誤類型資訊之對應關係之錯誤類型比對表;以及 一第一處理單元,電性連接於該第一儲存單元,通信連接於該伺服運作單元,用以在接收到該錯誤發生信號時,依據該錯誤類型比對表找出該第一錯誤資訊所對應之該些第二錯誤資訊中之一者,再依據該錯誤類型比對表與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者,並將上述所對應之該些錯誤類型資訊中之一者定義為一對應錯誤類型資訊,藉以透過一資訊傳輸協議傳送出一包含有該對應錯誤類型資訊之錯誤類型信號; 一邏輯處理模組,係包含: 一第二儲存單元;以及 一第二處理單元,係電性連接於該第二儲存單元,通信連接於該第一處理單元,用以在接收到該錯誤類型信號時,解析出該對應錯誤類型資訊,將該對應錯誤類型資訊儲存於該第二儲存單元,並用以傳送出該對應錯誤類型資訊;以及 一管理控制模組,係通信連接於該第二處理單元,用以接收該對應錯誤類型資訊。 An error type judgment system, including: a servo operation unit for sending an error occurrence signal including the first error information when an execution program is executed and a first error information occurs; A Basic Input/Output System (BIOS), including: a first storage unit storing an error type comparison table including a plurality of second error information and a plurality of error type information corresponding to the second error information respectively; and a first processing unit, electrically connected to the first storage unit, and communicatively connected to the servo operation unit, for finding out the source of the first error information according to the error type comparison table when the error occurrence signal is received corresponding to one of the second error information, and then find out the corresponding one of the error type information according to the error type comparison table and one of the above-mentioned corresponding second error information , and define one of the corresponding error type information as a corresponding error type information, so as to transmit an error type signal including the corresponding error type information through an information transmission protocol; A logic processing module, including: a second storage unit; and a second processing unit, electrically connected to the second storage unit and communicatively connected to the first processing unit, for parsing out the corresponding error type information when receiving the error type signal, and the corresponding error type information is stored in the second storage unit and used to transmit the corresponding error type information; and A management control module is communicatively connected to the second processing unit for receiving the corresponding error type information. 如請求項1所述之錯誤類型判斷系統,其中,該管理控制模組係更用以將一錯誤類型檢查指令傳送至該第二處理單元,藉以觸發該第二處理單元將該對應錯誤類型資訊傳送至該管理控制模組。The error type determination system according to claim 1, wherein the management control module is further configured to transmit an error type check command to the second processing unit, so as to trigger the second processing unit to the corresponding error type information sent to the management control module. 如請求項1所述之錯誤類型判斷系統,其中,該邏輯處理模組為一複雜可程式邏輯裝置(Complex Programmable Logic Device, CPLD)。The error type judgment system according to claim 1, wherein the logic processing module is a Complex Programmable Logic Device (CPLD). 如請求項1所述之錯誤類型判斷系統,其中,該管理控制模組為一基板管理控制器(Baseboard Management Controller, BMC)。The error type judgment system according to claim 1, wherein the management control module is a baseboard management controller (BMC). 如請求項1所述之錯誤類型判斷系統,其中,該資訊傳輸協議為一串列通用型輸入輸出(Serial General Purpose Input/Output, SGPIO)協議。The error type judgment system according to claim 1, wherein the information transmission protocol is a serial general purpose input/output (SGPIO) protocol. 一種錯誤類型判斷方法,係利用如請求項1所述之錯誤類型判斷系統加以實施,並包含以下步驟: (a)   利用該基本輸入輸出系統之該第一處理單元判斷是否接收到該伺服運作單元執行該執行程式並發生該第一錯誤資訊時所發送出之包含該第一錯誤資訊之該錯誤發生信號; (b)  在該步驟(a)之判斷結果為是時,利用該基本輸入輸出系統之該第一處理單元依據該錯誤類型比對表找出該第一錯誤資訊所對應之該些第二錯誤資訊中之一者,再依據該錯誤類型比對表與上述所對應之該些第二錯誤資訊中之一者找出所對應之該些錯誤類型資訊中之一者,並將上述所對應之該些錯誤類型資訊中之一者定義為該對應錯誤類型資訊,藉以透過該資訊傳輸協議傳送出包含有該對應錯誤類型資訊之該錯誤類型信號; (c)   利用該邏輯處理模組之該第二處理單元接收該錯誤類型信號,解析出該對應錯誤類型資訊,將該對應錯誤類型資訊儲存於該第二儲存單元,並傳送出該對應錯誤類型資訊;以及 (d)  利用該管理控制模組接收該對應錯誤類型資訊,藉以顯示出該對應錯誤類型資訊; 其中,在該步驟(a)之判斷結果為否時,係重複執行該步驟(a)。 An error type judging method is implemented using the error type judging system as described in claim 1, and includes the following steps: (a) Using the first processing unit of the basic input output system to determine whether the error occurrence signal including the first error information sent by the servo operation unit when the execution program is executed and the first error information is generated is received ; (b) when the judgment result of step (a) is yes, use the first processing unit of the basic input output system to find out the second errors corresponding to the first error information according to the error type comparison table One of the information, and then according to the error type comparison table and one of the second error information corresponding to the above to find out the corresponding one of the error type information, and the corresponding one of the above error type information. One of the error type information is defined as the corresponding error type information, so that the error type signal including the corresponding error type information is transmitted through the information transmission protocol; (c) using the second processing unit of the logic processing module to receive the error type signal, parse out the corresponding error type information, store the corresponding error type information in the second storage unit, and transmit the corresponding error type information; and (d) using the management control module to receive the corresponding error type information, so as to display the corresponding error type information; Wherein, when the judgment result of the step (a) is no, the step (a) is repeatedly executed. 如請求項6所述之錯誤類型判斷方法,其中,該邏輯處理模組為一複雜可程式邏輯裝置(Complex Programmable Logic Device, CPLD)。The error type judgment method according to claim 6, wherein the logic processing module is a Complex Programmable Logic Device (CPLD). 如請求項6所述之錯誤類型判斷方法,其中,該管理控制模組為一基板管理控制器(Baseboard Management Controller, BMC)。The method for judging an error type according to claim 6, wherein the management control module is a baseboard management controller (BMC). 如請求項6所述之錯誤類型判斷方法,其中,該資訊傳輸協議為一串列通用型輸入輸出(Serial General Purpose Input/Output, SGPIO)協議。The error type determination method according to claim 6, wherein the information transmission protocol is a serial general purpose input/output (SGPIO) protocol.
TW109137256A 2020-10-27 2020-10-27 Error type determination system and method thereof TWI767378B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109137256A TWI767378B (en) 2020-10-27 2020-10-27 Error type determination system and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109137256A TWI767378B (en) 2020-10-27 2020-10-27 Error type determination system and method thereof

Publications (2)

Publication Number Publication Date
TW202217567A TW202217567A (en) 2022-05-01
TWI767378B true TWI767378B (en) 2022-06-11

Family

ID=82558788

Family Applications (1)

Application Number Title Priority Date Filing Date
TW109137256A TWI767378B (en) 2020-10-27 2020-10-27 Error type determination system and method thereof

Country Status (1)

Country Link
TW (1) TWI767378B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201423390A (en) * 2012-12-06 2014-06-16 Inventec Corp Computer system and operating method thereof
CN107357694A (en) * 2016-05-10 2017-11-17 佛山市顺德区顺达电脑厂有限公司 Error event reporting system and its method during startup self-detection
WO2019062218A1 (en) * 2017-09-27 2019-04-04 郑州云海信息技术有限公司 Design method for implementing backplane lighting for multiple nvme hard disks
CN109947612A (en) * 2019-03-26 2019-06-28 苏州浪潮智能科技有限公司 A kind of method and device reading BIOS POST code by setting BMC SDR

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201423390A (en) * 2012-12-06 2014-06-16 Inventec Corp Computer system and operating method thereof
CN107357694A (en) * 2016-05-10 2017-11-17 佛山市顺德区顺达电脑厂有限公司 Error event reporting system and its method during startup self-detection
WO2019062218A1 (en) * 2017-09-27 2019-04-04 郑州云海信息技术有限公司 Design method for implementing backplane lighting for multiple nvme hard disks
CN109947612A (en) * 2019-03-26 2019-06-28 苏州浪潮智能科技有限公司 A kind of method and device reading BIOS POST code by setting BMC SDR

Also Published As

Publication number Publication date
TW202217567A (en) 2022-05-01

Similar Documents

Publication Publication Date Title
TWI229796B (en) Method and system to implement a system event log for system manageability
US9436548B2 (en) ECC bypass using low latency CE correction with retry select signal
CN100440157C (en) Detecting correctable errors and logging information relating to their location in memory
JP2017517060A (en) Fault processing method, related apparatus, and computer
WO2022228499A1 (en) Pcie fault self-repairing method, apparatus and device, and readable storage medium
US7774638B1 (en) Uncorrectable data error containment systems and methods
US11687395B2 (en) Detecting and recovering from fatal storage errors
WO2024082844A1 (en) Fault detection apparatus and detection method for random access memory
CN116049249A (en) Error information processing method, device, system, equipment and storage medium
TWI767378B (en) Error type determination system and method thereof
US20230366951A1 (en) Power failure monitoring device and power failure monitoring method
US9106258B2 (en) Early data tag to allow data CRC bypass via a speculative memory data return protocol
US8108736B2 (en) Multi-partition computer system, failure handling method and program therefor
CN112256467B (en) Error type judging system and method thereof
CN114003416B (en) Memory error dynamic processing method, system, terminal and storage medium
US8726102B2 (en) System and method for handling system failure
CN115509786A (en) Method, device, equipment and medium for reporting fault
US20200174875A1 (en) Secure forking of error telemetry data to independent processing units
TWI738627B (en) Smart network interface controller system and method of detecting error
JPWO2007096987A1 (en) Error control device
TW202024916A (en) Servo method, servo system, main board and computer readable storage medium
CN117116332B (en) Multi-bit error processing method, device, server and storage medium
US11797368B2 (en) Attributing errors to input/output peripheral drivers
US11449383B2 (en) Methods for providing and identifying fatal error information for system-on-chip product
CN116136805A (en) Memory channel fault detection method and device, memory system and computer system