TWI518519B - Server system and node replacement method - Google Patents

Server system and node replacement method Download PDF

Info

Publication number
TWI518519B
TWI518519B TW102144929A TW102144929A TWI518519B TW I518519 B TWI518519 B TW I518519B TW 102144929 A TW102144929 A TW 102144929A TW 102144929 A TW102144929 A TW 102144929A TW I518519 B TWI518519 B TW I518519B
Authority
TW
Taiwan
Prior art keywords
node
server system
preset time
hardware
module
Prior art date
Application number
TW102144929A
Other languages
Chinese (zh)
Other versions
TW201523276A (en
Inventor
盧盈志
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司 filed Critical 英業達股份有限公司
Priority to TW102144929A priority Critical patent/TWI518519B/en
Publication of TW201523276A publication Critical patent/TW201523276A/en
Application granted granted Critical
Publication of TWI518519B publication Critical patent/TWI518519B/en

Links

Landscapes

  • Debugging And Monitoring (AREA)

Description

伺服器系統與節點替換方法 Server system and node replacement method

本發明有關於一種伺服器系統(例如機櫃式之資料中心)與節點替換方法,且特別是有關於一種可快速進行節點替換程序的伺服器系統與節點替換方法。 The invention relates to a server system (for example, a cabinet type data center) and a node replacement method, and in particular to a server system and node replacement method capable of quickly performing a node replacement procedure.

隨著科技的發展,透過網際網路能夠使得世界各地的電腦進行連結。一台電腦透過網路連線便能夠與另一台電腦進行資料的交換、存取等動作。在客戶端與伺服器系統架構上,客戶端與伺服器便是透過網路來進行溝通。 With the development of technology, computers around the world can be connected through the Internet. A computer can exchange data and access data with another computer through a network connection. On the client and server system architecture, the client and server communicate through the network.

一般來說,伺服器系統可配置有多個節點,且每一個節點同時運行多個虛擬機器(virtual machine,VM),藉以提供給每一使用者獨立的運作環境。並且,每個節點可視為各自獨立的計算機,亦即各節點具有記憶體、儲存空間、運算能力與網路連結功能。因此,各節點可以運行獨自的作業系統,且各節點之間也可以透過網路設備進行溝通與資料傳輸。 In general, a server system can be configured with multiple nodes, and each node runs multiple virtual machines (VMs) at the same time, thereby providing each user with a separate operating environment. Moreover, each node can be regarded as a separate computer, that is, each node has a memory, a storage space, a computing capability, and a network connection function. Therefore, each node can run a separate operating system, and each node can also communicate and transmit data through the network device.

在伺服器系統架構完畢後,必須對其內的節點完成部署,亦即安裝各節點所需的作業系統、軟體封裝資料與 韌體封裝資料,以使伺服器系統運作並提供服務給使用者使用。然而,當伺服器系統中的任意一組節點的硬體產生損壞時,將會導致對應於此硬體的節點無法正常運作,此時檢測人員由於無法精確得知是哪一節點中的特定硬體發生損壞,而僅能依序將節點拔離與插入伺服器系統來進行檢測,造成檢測時間上的浪費。 After the server system architecture is completed, the nodes in it must be deployed, that is, the operating system and software package information required for installing each node. Firmware package information to enable the server system to operate and provide services to the user. However, when the hardware of any one of the nodes in the server system is damaged, the node corresponding to the hardware cannot be operated normally. At this time, the inspector cannot accurately know which specific node is in the node. The body is damaged, and the node can only be pulled out and inserted into the server system for detection, resulting in waste of detection time.

有鑑於以上的問題,本揭露提出一種伺服器系統與節點替換方法,其可自動地依據節點的識別碼與節點中的硬體的硬體組態資訊來判斷特定節點或此節點中的特定硬體是否需要被替換,以讓檢測人員可快速方便地將節點進行替換。 In view of the above problems, the present disclosure proposes a server system and node replacement method, which can automatically determine a specific node or a specific hard in the node according to the identification code of the node and the hardware configuration information of the hardware in the node. Whether the body needs to be replaced, so that the tester can quickly and easily replace the node.

根據本揭露一實施例中的一種節點替換方法,此節點替換方法適用於伺服器系統。此節點替換方法的步驟流程如下所述。偵測節點是否插入伺服器系統,並於偵測到節點插入伺服器系統時產生第一提示訊號,其中此第一提示訊號用以指示節點不能拔離伺服器系統。偵測節點的第一識別碼與節點中的硬體的第一硬體組態資訊。依據第一識別碼與第一硬體組態資訊以及節點插入伺服器系統之前的第二識別碼與第二硬體組態資訊,判斷節點或節點中的硬體是否已被替換。若判斷出節點與節點中的硬體皆未被替換,則關閉節點的電源,以執行節點替換程序。若判斷出節點或節點中的 硬體被替換,則對節點安裝作業系統、軟體封裝資料與韌體封裝資料至少其中之一。 According to a node replacement method in an embodiment of the disclosure, the node replacement method is applicable to a server system. The flow of steps for this node replacement method is as follows. Detecting whether the node is inserted into the server system, and generating a first prompt signal when detecting that the node is inserted into the server system, wherein the first prompt signal is used to indicate that the node cannot be pulled out of the server system. The first identification code of the node and the first hardware configuration information of the hardware in the node are detected. Determining whether the hardware in the node or node has been replaced according to the first identification code and the first hardware configuration information and the second identification code and the second hardware configuration information before the node is inserted into the server system. If it is determined that the hardware in the node and the node are not replaced, the power of the node is turned off to perform the node replacement procedure. If it is judged in the node or node If the hardware is replaced, at least one of the operating system, the software package data, and the firmware package information is installed on the node.

在一實施例中,於對節點安裝作業系統、軟體封裝資料與韌體封裝資料至少其中之一的步驟之後,更包括以下所述的步驟流程。持續偵測節點中的硬體的狀況,以判斷硬體是否發生錯誤。若判斷出硬體產生不可修復之錯誤,則關閉此節點的電源,以執行節點替換程序。若判斷出硬體產生可修復之錯誤次數達到預設之門檻值,則對此節點進行正常關機程序,並據以執行節點替換程序。 In an embodiment, after the step of installing at least one of the operating system, the software package data and the firmware package data on the node, the step flow described below is further included. The status of the hardware in the node is continuously detected to determine whether the hardware has an error. If it is determined that the hardware has an unrepairable error, turn off the power of this node to perform the node replacement procedure. If it is determined that the number of times the hardware generates a repairable error reaches the preset threshold, the node is normally shut down and the node replacement procedure is performed accordingly.

在一實施例中,節點替換程序包括以下所述的步驟流程。將節點設定為初始模式。產生第二提示訊號,其中此第二提示訊號用以指示節點能拔離伺服器系統。偵測節點是否拔離伺服器系統。偵測節點或另一節點是否插入伺服器系統。若偵測到節點或另一節點插入伺服器系統,則產生第一提示訊號,並且接續偵測節點的第一識別碼與節點中的硬體的第一硬體組態資訊之後的步驟。 In an embodiment, the node replacement procedure includes the step flow described below. Set the node to the initial mode. A second prompt signal is generated, wherein the second prompt signal is used to indicate that the node can be pulled out of the server system. Check if the node is pulled out of the server system. Detects whether a node or another node is plugged into the server system. If a node or another node is detected to be inserted into the server system, a first prompt signal is generated, and the step subsequent to detecting the first identification code of the node and the first hardware configuration information of the hardware in the node is performed.

承接上述之實施例,於偵測節點是否拔離伺服器系統的步驟中,更包括以下所述的步驟流程。設定第一預設時間並開始計時。判斷節點是否已拔離伺服器系統。若判斷出節點尚未拔離伺服器系統,則重置第一預設時間並重新計時。若判斷出節點拔離伺服器系統且已經過第一預設時間,則執行偵測節點或另一節點是否插入伺服器系統之後的步 驟。 In the above-mentioned embodiment, in the step of detecting whether the node is disconnected from the server system, the step flow described below is further included. Set the first preset time and start timing. Determine if the node has been removed from the server system. If it is determined that the node has not been pulled out of the server system, the first preset time is reset and the time is re-timed. If it is determined that the node is pulled out of the server system and the first preset time has elapsed, then the step of detecting whether the node or another node is inserted into the server system is performed Step.

承接上述之實施例,於偵測節點或另一節點是否插入伺服器系統的步驟中,更包括以下所述的步驟流程。設定第一預設時間與第二預設時間並開始計時,其中第二預設時間接續於第一預設時間後。判斷節點是否插入伺服器系統。若判斷出節點尚未插入伺服器系統,則重置第一預設時間並重新計時。若判斷出節點插入伺服器系統且已經過第一預設時間,則接著判斷節點於第二預設時間中是否仍持續插入伺服器系統。若判斷出節點於第二預設時間中拔離伺服器系統,則接續將節點設定為初始模式之後的步驟。若判斷出節點於經過第二預設時間後仍未拔離伺服器系統,則產生第一提示訊號,並且接續偵測節點的第一識別碼與節點中的硬體的第一硬體組態資訊之後的步驟。 In the above-mentioned embodiment, in the step of detecting whether the node or another node is inserted into the server system, the step flow described below is further included. Setting a first preset time and a second preset time and starting timing, wherein the second preset time is subsequent to the first preset time. Determine if the node is plugged into the server system. If it is determined that the node has not been inserted into the server system, the first preset time is reset and the time is re-timed. If it is determined that the node is inserted into the server system and the first preset time has elapsed, then it is determined whether the node continues to be inserted into the server system during the second preset time. If it is determined that the node is disconnected from the server system in the second preset time, the step after the node is set to the initial mode is continued. If it is determined that the node has not been pulled out of the server system after the second preset time, the first prompt signal is generated, and the first identification code of the detecting node and the first hardware configuration of the hardware in the node are connected. The steps after the news.

根據本揭露一實施例中的一種伺服器系統,此伺服器系統包括節點、偵測模組、提示模組以及處理模組。節點具有硬體。偵測模組通訊連接節點,此偵測模組用以偵測節點是否插入或拔離伺服器系統,以及用以偵測節點的第一識別碼與硬體的第一硬體組態資訊。提示模組通訊連接偵測模組,此提示模組用以於偵測模組偵測到節點插入伺服器系統時產生第一提示訊號。其中,此第一提示訊號用以指示節點不能拔離伺服器系統。處理模組通訊連接於偵測模組與節點之間,此處理模組用以依據第一識別碼與第一硬體組態資 訊以及節點插入伺服器系統之前的第二識別碼與第二硬體組態資訊來判斷節點或硬體是否已被替換。其中,若處理模組判斷出節點與硬體皆未被替換,則關閉節點的電源,以執行節點替換程序,若處理模組判斷出節點或硬體被替換,則對節點安裝作業系統、軟體封裝資料與韌體封裝資料至少其中之一。 According to an embodiment of the present disclosure, a server system includes a node, a detection module, a prompt module, and a processing module. The node has hardware. The detection module communication connection node is configured to detect whether the node is inserted or removed from the server system, and to detect the first identification code of the node and the first hardware configuration information of the hardware. The prompt module communication connection detection module is configured to generate a first prompt signal when the detection module detects that the node is inserted into the server system. The first prompt signal is used to indicate that the node cannot be disconnected from the server system. The processing module communication is connected between the detecting module and the node, and the processing module is configured to use the first identification code and the first hardware configuration And the second identification code and the second hardware configuration information before the node is inserted into the server system to determine whether the node or the hardware has been replaced. Wherein, if the processing module determines that the node and the hardware are not replaced, the power of the node is turned off to perform the node replacement procedure, and if the processing module determines that the node or the hardware is replaced, the operating system and the software are installed on the node. At least one of the package data and the firmware package information.

在一實施例中,於處理模組對節點安裝作業系統、軟體封裝資料與韌體封裝資料至少其中之一之後,處理模組更持續判斷硬體是否發生錯誤。若處理模組判斷出硬體產生不可修復之錯誤,則關閉節點的電源,以執行節點替換程序。若處理模組判斷出硬體產生可修復之錯誤次數達到預設之門檻值,則對節點進行正常關機程序,並據以執行節點替換程序。 In an embodiment, after the processing module installs at least one of the operating system, the software package data, and the firmware package data, the processing module further determines whether the hardware has an error. If the processing module determines that the hardware has an unrepairable error, power off the node to perform a node replacement procedure. If the processing module determines that the number of times the hardware generates a repairable error reaches a preset threshold, the node is normally shut down and the node replacement procedure is performed accordingly.

在一實施例中,於伺服器系統執行節點替換程序時,處理模組將節點設定為初始模式。接著,提示模組產生第二提示訊號,此第二提示訊號用以指示節點能拔離伺服器系統。再來,偵測模組偵測節點是否拔離伺服器系統,並於偵測到節點拔離伺服器系統後接續偵測節點或另一節點是否插入伺服器系統。若偵測模組偵測到節點或另一節點插入伺服器系統,則提示模組產生第一提示訊號,並且偵測模組接續偵測節點的第一識別碼與硬體的第一硬體組態資訊之後的處理程序。 In one embodiment, the processing module sets the node to the initial mode when the server system executes the node replacement procedure. Then, the prompting module generates a second prompt signal, which is used to indicate that the node can be pulled out of the server system. Then, the detection module detects whether the node is pulled out of the server system, and detects whether the node is disconnected from the server system and then detects whether the node or another node is inserted into the server system. If the detecting module detects that the node or another node is inserted into the server system, the prompting module generates a first prompt signal, and the detecting module continues to detect the first identifier of the node and the first hardware of the hardware. The handler after configuring the information.

承接上述之實施例,伺服器系統更包括計時模組,此計時模組通訊連接偵測模組。於偵測模組偵測節點是否拔離伺服器系統時,計時模組會設定第一預設時間並開始計時。若偵測模組偵測出節點於第一預設時間內仍未拔離伺服器系統,則計時模組將重置第一預設時間並重新計時。若偵測模組偵測出節點拔離伺服器系統且已經過第一預設時間,則接續偵測節點或另一節點是否插入伺服器系統之後的處理程序。 According to the above embodiment, the server system further includes a timing module, and the timing module is connected to the detection module. When the detection module detects whether the node is pulled out of the server system, the timing module sets the first preset time and starts timing. If the detecting module detects that the node has not been pulled out of the server system within the first preset time, the timing module resets the first preset time and re-times. If the detecting module detects that the node is pulled out of the server system and has passed the first preset time, the processing procedure after detecting whether the node or another node is inserted into the server system is connected.

承接上述之實施例,伺服器系統更包括計時模組,此計時模組通訊連接偵測模組。於偵測模組偵測節點或另一節點是否插入伺服器系統時,計時模組會設定第一預設時間與第二預設時間並開始計時,其中第二預設時間接續於第一預設時間後。若偵測模組偵測出節點於第一預設時間內尚未插入伺服器系統,則計時模組重置第一預設時間並重新計時。若偵測模組偵測出節點插入伺服器系統且已經過第一預設時間,則接續偵測節點於第二預設時間中是否仍持續插入伺服器系統。若偵測模組偵測出節點於第二預設時間中拔離伺服器系統,則接續處理模組將節點設定為初始模式之處理程序。若偵測模組偵測出節點於經過第二預設時間後仍未拔離伺服器系統,則提示模組產生第一提示訊號,並且偵測模組接續偵測節點的第一識別碼與硬體的第一硬體組態資訊之後的處理程序。 According to the above embodiment, the server system further includes a timing module, and the timing module is connected to the detection module. When the detecting module detects whether the node or another node is inserted into the server system, the timing module sets the first preset time and the second preset time and starts timing, wherein the second preset time is continued from the first preset Set the time. If the detecting module detects that the node has not been inserted into the server system within the first preset time, the timing module resets the first preset time and re-times. If the detecting module detects that the node is inserted into the server system and has passed the first preset time, the connection detecting node continues to be inserted into the server system during the second preset time. If the detecting module detects that the node is pulled out of the server system in the second preset time, the connecting processing module sets the node as the processing mode of the initial mode. If the detecting module detects that the node has not been pulled out of the server system after the second preset time, the prompting module generates a first prompt signal, and the detecting module continues to detect the first identification code of the node and The handler after the first hardware configuration information of the hardware.

綜合以上所述,本揭露提供一種伺服器系統與節點替換方法,其透過偵測節點的識別碼與節點中的硬體的硬體組態資訊,來判斷節點或節點中的硬體的硬體是否影被替換,進而選擇性地執行節點替換程序或對此節點安裝作業系統、軟體封裝資料或韌體封裝資料。此外,本發明之伺服器系統與節點替換方法更可於節點安裝作業系統、軟體封裝資料或韌體封裝資料後,持續地偵測節點中的硬體狀況,並於節點中的硬體產生錯誤時透過提示訊號來讓檢測人員得知可以進行節點替換程序。 In summary, the present disclosure provides a server system and a node replacement method for determining hardware of a hardware in a node or a node by detecting an identification code of a node and hardware configuration information of a hardware in the node. Whether the shadow is replaced, and optionally performs a node replacement procedure or installs an operating system, software package data, or firmware package data for the node. In addition, the server system and the node replacement method of the present invention can continuously detect the hardware condition in the node and install the error in the hardware in the node after the node installs the operating system, the software package data or the firmware package data. The prompt signal is used to let the inspector know that the node replacement procedure can be performed.

以上之關於本揭露內容之說明及以下之實施方式之說明係用以示範與解釋本發明之精神與原理,並且提供本發明之專利申請範圍更進一步之解釋。 The above description of the disclosure and the following description of the embodiments of the present invention are intended to illustrate and explain the spirit and principles of the invention, and to provide further explanation of the scope of the invention.

1‧‧‧伺服器系統 1‧‧‧Server system

10‧‧‧節點 10‧‧‧ nodes

12‧‧‧偵測模組 12‧‧‧Detection module

14‧‧‧提示模組 14‧‧‧Tips module

16‧‧‧處理模組 16‧‧‧Processing module

18‧‧‧計時模組 18‧‧‧Time Module

S200~S214、S300~S308、S400~S404、S500~S506‧‧‧步驟流程 S200~S214, S300~S308, S400~S404, S500~S506‧‧‧Step flow

第1圖係為根據本揭露一實施例之伺服器系統的功能方塊圖。 1 is a functional block diagram of a server system in accordance with an embodiment of the present disclosure.

第2A圖係為根據本揭露一實施例之伺服器系統的節點替換方法的步驟流程圖。 2A is a flow chart showing the steps of a node replacement method of a server system according to an embodiment of the present disclosure.

第2B圖係為根據本揭露另一實施例之伺服器系統的節點替換方法的步驟流程圖。 2B is a flow chart showing the steps of the node replacement method of the server system according to another embodiment of the present disclosure.

第3圖係為根據本揭露一實施例之節點替換程序的步驟流程圖。 Figure 3 is a flow chart showing the steps of a node replacement procedure in accordance with an embodiment of the present disclosure.

第4圖係為根據第3圖中的步驟S304的詳細步驟流程圖。 Fig. 4 is a flow chart showing the detailed steps of step S304 in Fig. 3.

第5圖係為根據第3圖中的步驟S306的詳細步驟流程圖。 Fig. 5 is a flow chart showing the detailed steps in accordance with step S306 in Fig. 3.

以下在實施方式中詳細敘述本發明之詳細特徵以及優點,其內容足以使任何熟習相關技藝者了解本發明之技術內容並據以實施,且根據本說明書所揭露之內容、申請專利範圍及圖式,任何熟習相關技藝者可輕易地理解本發明相關之目的及優點。以下之實施例係進一步詳細說明本發明之觀點,但非以任何觀點限制本發明之範疇。 The detailed features and advantages of the present invention are set forth in the Detailed Description of the Detailed Description of the <RTIgt; </ RTI> <RTIgt; </ RTI> </ RTI> </ RTI> <RTIgt; The objects and advantages associated with the present invention can be readily understood by those skilled in the art. The following examples are intended to describe the present invention in further detail, but are not intended to limit the scope of the invention.

請參照第1圖,第1圖係為根據本揭露一實施例之伺服器系統的功能方塊圖。如第1圖所示,伺服器系統1包括節點10、偵測模組12、提示模組14、處理模組16與計時模組18,其中偵測模組12通訊連接於節點10提示模組14、處理模組16與計時模組18之間,且節點10又與處理模組16通訊連接。本發明所述之通訊連接可以是以實體線路連接而實現,或是以無線通訊連接而實現,本發明在此不加以限制。於實務上,伺服器系統1可以為一種機櫃式之資料中心(container data center),但不以此為限。以下將分別就伺服器系統1中的各部功能模組作詳細的說明。 Please refer to FIG. 1. FIG. 1 is a functional block diagram of a server system according to an embodiment of the present disclosure. As shown in FIG. 1, the server system 1 includes a node 10, a detection module 12, a prompt module 14, a processing module 16, and a timing module 18, wherein the detection module 12 is communicably connected to the node 10 to prompt the module. 14. The processing module 16 is connected to the timing module 18, and the node 10 is in communication with the processing module 16. The communication connection of the present invention may be implemented by a physical line connection or by a wireless communication connection, and the present invention is not limited thereto. In practice, the server system 1 can be a rack-type data center, but not limited thereto. The function modules of each part in the server system 1 will be described in detail below.

節點10具有至少一硬體,所述的硬體可以包括 有基板管理控制器(baseboard management controller,BMC)、網路介面控制器(network interface controller,NIC,亦稱網路卡)、硬碟(hard disk drive,HDD)、DIMM(Dual In-line Memory Module)與中央處理器(CPU)等,但不以此為限。此外,雖然第1圖僅繪示有一組節點,然而本發明的伺服器系統在此不加以限制節點的個數。 The node 10 has at least one hardware, and the hardware may include Baseboard management controller (BMC), network interface controller (NIC, also known as network card), hard disk drive (HDD), DIMM (Dual In-line Memory Module) ) and the central processing unit (CPU), etc., but not limited to this. Further, although FIG. 1 only shows a set of nodes, the server system of the present invention does not limit the number of nodes here.

偵測模組12用以偵測節點10是否有插入或拔離伺服器系統1,以及用以偵測節點10的第一識別碼與節點10中硬體的第一硬體組態資訊(hardware configuration information)。於實務上,節點10的識別碼可以為一種通用唯一識別碼(universally unique identifier,UUID),但不以此為限。一般來說,此種通用唯一識別碼是由一串16位元組(亦稱128位元)的16進位數字所構成,用以讓每一個節點10都能有唯一的辨識資訊,再者,此UUID能由SMBIOS(System Management BIOS)Type1資料結構之UUID欄位取得;節點10的硬體組態資訊能被計算而得其唯一4位元組之硬體簽字(Hardware Signature),BIOS(Basic Input Output System)將於其POST(Power On Self Test)時取得硬體組態資訊並進得將硬體簽字存於ACPI(Advanced Configuration Power Management Interface)FACS(Firmware ACPI Control Structure)表之Hardware Signature欄位,此Hardware Signature欄位可用來快速決定是否硬體組態資訊不同;再者,可藉由ping節點10上BMC之 NIC來偵測節點10是否插入或拔除。 The detecting module 12 is configured to detect whether the node 10 is inserted or removed from the server system 1 and to detect the first identification code of the node 10 and the first hardware configuration information of the hardware in the node 10 (hardware) Configuration information). In practice, the identifier of the node 10 may be a universally unique identifier (UUID), but is not limited thereto. In general, the universal unique identification code is composed of a string of 16-bit 16-bit (also known as 128-bit) hexadecimal digits, so that each node 10 can have unique identification information. This UUID can be obtained from the UUID field of the SMBIOS (System Management BIOS) Type1 data structure; the hardware configuration information of the node 10 can be calculated to obtain the unique 4-bit hardware signature (Hardware Signature), BIOS (Basic) Input Output System) will obtain the hardware configuration information in its POST (Power On Self Test) and store the hardware signature in the Hardware Signature field of the ACPI (Advanced Configuration Power Management Interface) FACS (Firmware ACPI Control Structure) table. The Hardware Signature field can be used to quickly determine whether the hardware configuration information is different; in addition, by pinging the node 10 on the BMC The NIC detects if the node 10 is inserted or removed.

提示模組14用以於偵測模組12偵測到節點10插入伺服器系統1時產生一組第一提示訊號,此第一提示訊號用以指示節點10不能拔離伺服器系統1。此外,在某些情況中,提示模組14產生一組第二提示訊號,此第二提示訊號用以指示節點10能拔離伺服器系統1。於實務上,提示模組14可以為一種顯示模組(例如發光二極體、顯示面板、七段顯示器等電子顯示元件)或是發聲模組(例如喇叭、蜂鳴器等電子發聲元件),本發明在此不加以限制。若提示模組14為顯示模組的話,則提示訊號係以影像或是光線之型式呈現給使用者;若提示模組14為發聲模組的話,則提示訊號係以聲音之型式呈現給使用者。 The prompting module 14 is configured to generate a set of first prompt signals when the detecting module 12 detects that the node 10 is inserted into the server system 1. The first prompt signal is used to indicate that the node 10 cannot be pulled out of the server system 1. In addition, in some cases, the prompting module 14 generates a second set of prompt signals, which are used to indicate that the node 10 can be pulled out of the server system 1. In practice, the prompting module 14 can be a display module (such as a light-emitting diode, a display panel, a seven-segment display, etc.) or a sounding module (such as an electronic sounding component such as a speaker or a buzzer). The invention is not limited herein. If the prompting module 14 is a display module, the prompting signal is presented to the user in the form of image or light; if the prompting module 14 is a sounding module, the prompting signal is presented to the user in the form of sound. .

處理模組16用以依據節點10的識別碼與節點10中硬體的硬體組態資訊以及節點10插入伺服器系統1之前的第二識別碼與第二硬體組態資訊來判斷節點10或節點10中的硬體是否已被替換;需知,【節點10插入伺服器系統1之前的第二識別碼與第二硬體組態資訊】是【節點10之前次插入伺服器系統1時之其識別碼與硬體組態資訊】,再者,要是新的節點10插入伺服器系統,則其第二識別碼與第二硬體組態資訊皆是空的。計時模組18用以設定至少一組預設時間,並且開始計時。於某些情況中,計時模組18會在計時的過程中將計時歸零,以重新開始計時。 The processing module 16 is configured to determine the node 10 according to the identification code of the node 10 and the hardware configuration information of the hardware in the node 10 and the second identification code and the second hardware configuration information before the node 10 is inserted into the server system 1. Or whether the hardware in the node 10 has been replaced; it should be noted that [the second identification code and the second hardware configuration information before the node 10 is inserted into the server system 1] is [the node 10 was inserted into the server system 1 before. The identification code and the hardware configuration information], and if the new node 10 is inserted into the server system, the second identification code and the second hardware configuration information are all empty. The timing module 18 is configured to set at least one set of preset times and start timing. In some cases, the timing module 18 will zero the timing during the timing to restart the timing.

為了更加清楚說明本發明之伺服器系統1與節點替換方法的實際運作方式,請一併參照第1圖與第2A圖,第2A圖係為根據本揭露一實施例之伺服器系統的節點替換方法的步驟流程圖。如第2A圖所示,在步驟S200中,偵測模組12會偵測節點10是否插入伺服器系統1,並於偵測到節點10插入伺服器系統時,提示模組14會產生一組第一提示訊號,並進入步驟S202。若偵測模組12未偵測到節點10插入伺服器系統1,則繼續重複執行步驟S200,直至偵測模組12偵測到節點10插入伺服器系統1為止。 In order to more clearly illustrate the actual operation mode of the server system 1 and the node replacement method of the present invention, please refer to FIG. 1 and FIG. 2A together. FIG. 2A is a node replacement of the server system according to an embodiment of the present disclosure. Step flow chart of the method. As shown in FIG. 2A, in step S200, the detecting module 12 detects whether the node 10 is inserted into the server system 1, and when detecting that the node 10 is inserted into the server system, the prompting module 14 generates a group. The first prompt signal, and proceeds to step S202. If the detecting module 12 does not detect that the node 10 is inserted into the server system 1, the step S200 is continuously executed until the detecting module 12 detects that the node 10 is inserted into the server system 1.

在步驟S202中,偵測模組12會接著偵測節點10的第一識別碼與此節點10中的其中一硬體的第一硬體組態資訊。在步驟S204中,處理模組16會依據第一識別碼與第一硬體組態資訊以及節點10插入伺服器系統1之前的第二識別碼與第二硬體組態資訊,判斷節點10或節點10中的硬體是否已被替換。若處理模組16判斷出節點10或節點10中的硬體已被替換,則執行步驟S206;若處理模組16判斷出節點10與節點10中的硬體皆未被替換,則執行步驟S208。需知,在節點10或其硬體沒被替換下,也可強制重新對此節點10安裝作業系統、軟體封裝資料或韌體封裝資料(未繪示於圖式),其可應用於實際之情境如下:當節點10只是因其上之硬體因接觸不良而產生硬體錯誤,此時可將節點10拔除,然後使其硬體接觸良好,然後再將節點10重新插回伺服器系統1。 In step S202, the detection module 12 then detects the first identification code of the node 10 and the first hardware configuration information of one of the hardware in the node 10. In step S204, the processing module 16 determines the node 10 according to the first identification code and the first hardware configuration information and the second identification code and the second hardware configuration information before the node 10 is inserted into the server system 1. Whether the hardware in node 10 has been replaced. If the processing module 16 determines that the hardware in the node 10 or the node 10 has been replaced, step S206 is performed; if the processing module 16 determines that the hardware in the node 10 and the node 10 are not replaced, step S208 is performed. . It should be noted that, when the node 10 or its hardware is not replaced, the operating system, the software package data or the firmware package data (not shown in the figure) may be forced to be re-installed on the node 10, which may be applied to the actual application. The situation is as follows: When the node 10 only generates a hardware error due to poor contact on the hardware, the node 10 can be removed, and then the hardware is in good contact, and then the node 10 is reinserted back into the server system 1 .

在步驟S206中,處理模組16會對節點10安裝作業系統(operating system,OS)、軟體封裝資料(software package data)與韌體封裝資料(firmware package data)至少其中之一。在步驟S208中,處理模組16會關閉(power off)節點10的電源,以執行節點替換程序。 In step S206, the processing module 16 installs at least one of an operating system (OS), software package data, and firmware package data for the node 10. In step S208, the processing module 16 powers off the power of the node 10 to perform a node replacement procedure.

請參照第2B圖,第2B圖係為根據本揭露另一實施例之伺服器系統的節點替換方法的步驟流程圖。如第2B圖所示,在對節點安裝作業系統、軟體封裝資料與韌體封裝資料至少其中之一的步驟(即步驟S206)之後,偵測模組12或是另一組監控模組(未繪示於圖式)可以持續地偵測節點中的硬體的狀況,以使處理模組16判斷硬體是否發生錯誤(即步驟S210)。並且,當處理模組16判斷硬體發生錯誤時,於步驟S211中,處理模組16更判斷硬體發生的錯誤是可修復之錯誤或不可修復之錯誤。若偵測模組12或是另一組監控模組偵測出節點10當中的硬體產生不可修復之錯誤(un-correct error),則執行步驟S212;若偵測模組12或是另一組監控模組偵測出節點10當中的硬體產生可修復之錯誤(correct error)次數達到預設之門檻值(default threshold value),則執行步驟S214。 Please refer to FIG. 2B , which is a flow chart of the steps of the node replacement method of the server system according to another embodiment of the disclosure. As shown in FIG. 2B, after the step of installing at least one of the operating system, the software package data, and the firmware package data (ie, step S206), the detection module 12 or another group of monitoring modules (not The figure is shown to continuously detect the condition of the hardware in the node, so that the processing module 16 determines whether the hardware has an error (ie, step S210). Moreover, when the processing module 16 determines that an error occurs in the hardware, in step S211, the processing module 16 further determines that the error generated by the hardware is a repairable error or an unrepairable error. If the detection module 12 or another group of monitoring modules detects that the hardware in the node 10 generates an un-correct error, step S212 is performed; if the detection module 12 or another The group monitoring module detects that the number of correct errors of the hardware in the node 10 reaches a default threshold value, and then performs step S214.

在步驟S212中,由於節點10當中的硬體產生不可修復之錯誤,亦即此時節點10已經損壞而無法正常運作,則處理模組16會關閉節點10的電源,以執行節點替換程序。在步驟S214中,由於節點10當中的硬體產生可修復之錯誤 次數達到預設之門檻值(例如,一小時內產生10次以上之可修復之錯誤次數),亦即此時節點10快要損壞而即將無法正常運作,則處理模組16會對節點10進行正常關機(shutdown)程序,並據以執行節點替換程序。 In step S212, since the hardware in the node 10 generates an unrepairable error, that is, the node 10 is damaged and cannot operate normally, the processing module 16 turns off the power of the node 10 to execute the node replacement procedure. In step S214, a repairable error is generated due to the hardware in the node 10. The number of times reaches the preset threshold (for example, the number of times the repair can be repaired more than 10 times in one hour), that is, when the node 10 is about to be damaged and is about to be able to operate normally, the processing module 16 will normally perform the node 10. Shut down the program and execute the node replacement program accordingly.

請參照第3圖,第3圖係為根據本揭露一實施例之節點替換程序的步驟流程圖。如第3圖所示,在步驟S300中,處理模組16會將節點10設定為初始模式。於本實施例中,初始模式係為動態主機設定協定(dynamic host configuration protocol,DHCP)模式。在實際的操作中,當處理模組16判斷出節點10能拔離伺服器系統1時,會自動地將節點10的基板管理控制器設定回DHCP模式,以取得新的一組基板管理控制器的網際網路協議位址(internet protocol address,IP address)。 Please refer to FIG. 3, which is a flow chart of the steps of the node replacement procedure according to an embodiment of the present disclosure. As shown in FIG. 3, in step S300, the processing module 16 sets the node 10 to the initial mode. In this embodiment, the initial mode is a dynamic host configuration protocol (DHCP) mode. In actual operation, when the processing module 16 determines that the node 10 can be detached from the server system 1, the baseboard management controller of the node 10 is automatically set back to the DHCP mode to obtain a new set of substrate management controllers. Internet protocol address (IP address).

在步驟S302中,提示模組14會產生一組第二提示訊號,此第二提示訊號用以指示節點10能拔離伺服器系統1。在步驟S304中,偵測模組12會偵測節點10是否拔離伺服器系統1。若偵測模組12偵測出節點10仍未拔離伺服器系統1,則繼續執行步驟S304;若偵測模組12偵測出節點10拔離伺服器系統1,則執行步驟S306。在步驟S306中,偵測模組12會接續偵測節點10或另一組節點是否插入伺服器系統1。若偵測模組12偵測出有一組節點(節點10或另一組節點)插入伺服器系統1,則執行步驟S308;若偵測模組12偵測出仍未 有節點插入伺服器系統1,則繼續執行步驟S306。在步驟S308中,提示模組14會產生第一提示訊號,並接續執行步驟S202。 In step S302, the prompting module 14 generates a second set of prompt signals, which are used to indicate that the node 10 can be pulled out of the server system 1. In step S304, the detecting module 12 detects whether the node 10 is pulled out of the server system 1. If the detecting module 12 detects that the node 10 has not been removed from the server system 1, the process proceeds to step S304; if the detecting module 12 detects that the node 10 is pulled out of the server system 1, step S306 is performed. In step S306, the detection module 12 will continue to detect whether the node 10 or another group of nodes is inserted into the server system 1. If the detecting module 12 detects that a group of nodes (node 10 or another group of nodes) is inserted into the server system 1, step S308 is performed; if the detecting module 12 detects that it is still not detected When a node is inserted into the server system 1, step S306 is continued. In step S308, the prompting module 14 generates a first prompt signal, and then performs step S202.

請參照第4圖,第4圖係為根據第3圖中的步驟S304的詳細步驟流程圖。如第4圖所示,在提示模組14產生第二提示訊號(即步驟S302)之後,計時模組18會設定一組第一預設時間(例如一分鐘)並且開始計時。在步驟S402中,會判斷節點10是否已拔離伺服器系統1。若判斷出節點10於第一預設時間內仍未拔離伺服器系統1,則執行步驟S404;若判斷出節點10拔離伺服器系統1且已經過第一預設時間,則執行步驟S306。在步驟S404中,計時模組18會重置第一預設時間並重新計時,並接續步驟S402的判斷程序。 Please refer to FIG. 4, which is a flow chart of detailed steps according to step S304 in FIG. As shown in FIG. 4, after the prompting module 14 generates the second prompt signal (ie, step S302), the timing module 18 sets a set of first preset time (for example, one minute) and starts timing. In step S402, it is determined whether the node 10 has been detached from the server system 1. If it is determined that the node 10 has not been pulled out of the server system 1 within the first preset time, step S404 is performed; if it is determined that the node 10 is pulled out of the server system 1 and the first preset time has elapsed, step S306 is performed. . In step S404, the timing module 18 resets the first preset time and re-times, and continues the determination process of step S402.

此外,步驟S402中所執行的判斷節點10是否已拔離伺服器系統1的步驟可以係藉由偵測模組12、處理模組16或節點10的網路介面控制器來達成,本發明在此不加以限制,例如可藉由ping節點10上BMC之NIC來偵測節點10是否拔離。藉此,藉由第4圖的判斷機制可以避免因網路不穩或接觸不良所造成的誤判節點10已拔離伺服器系統1的情況,換句話說,第4圖的判斷機制為一種de-bounce機制。 In addition, the step of determining whether the node 10 has been pulled out of the server system 1 in step S402 can be achieved by using the detection module 12, the processing module 16 or the network interface controller of the node 10, and the present invention is This is not limited, for example, by pinging the NIC of the BMC on the node 10 to detect whether the node 10 is detached. Thereby, the judgment mechanism of FIG. 4 can avoid the situation that the misjudged node 10 has been pulled out of the server system 1 due to network instability or poor contact. In other words, the judgment mechanism of FIG. 4 is a kind of de -bounce mechanism.

請參照第5圖,第5圖係為根據第3圖中的步驟S306的詳細步驟流程圖。如第5圖所示,於偵測模組12偵測節點10是否拔離伺服器系統1的步驟(即步驟S304)之後,計時模組18會設定一組第一預設時間與一組第二預設時間並開 始計時,其中第二預設時間接續於第一預設時間之後。舉例來說,第一預設時間為計時模組18開始計時的第一分鐘(即第0~60秒),而第二預設時間則為計時模組18開始計時的第二分鐘(即第61~120秒),需知,第一預設時間與第二預設時間是可不同的。 Please refer to FIG. 5, which is a flow chart of detailed steps according to step S306 in FIG. As shown in FIG. 5, after the detecting module 12 detects whether the node 10 is pulled out of the server system 1 (ie, step S304), the timing module 18 sets a set of first preset time and a group of Two preset times and open The timing is started, wherein the second preset time is subsequent to the first preset time. For example, the first preset time is the first minute when the timing module 18 starts counting (ie, 0 to 60 seconds), and the second preset time is the second minute when the timing module 18 starts counting (ie, the first 61~120 seconds), it should be noted that the first preset time and the second preset time may be different.

在步驟S502中,會判斷節點10是否插入伺服器系統1。若判斷出節點10仍未插入伺服器系統1,則執行步驟S504;若判斷出節點10插入伺服器系統1,則執行步驟S506。在步驟S504中,計時模組18會重置第一預設時間並重新計時,並接續步驟S502的判斷程序,所以步驟S502及步驟S504的判斷機制為一種de-bounce機制。此外,步驟S502所執行的判斷程序可以係藉由偵測模組12、處理模組16或節點10的網路介面控制器來達成,本發明在此不加以限制,例如可藉由ping節點10上BMC之NIC來偵測節點10是否插入。 In step S502, it is judged whether or not the node 10 is inserted into the server system 1. If it is determined that the node 10 is still not inserted into the server system 1, step S504 is performed; if it is determined that the node 10 is inserted into the server system 1, step S506 is performed. In step S504, the timing module 18 resets the first preset time and re-times, and continues the determination process of step S502, so the determination mechanism of step S502 and step S504 is a de-bounce mechanism. In addition, the determining process performed in step S502 can be implemented by the detection module 12, the processing module 16, or the network interface controller of the node 10. The present invention is not limited herein, for example, by the ping node 10. The NIC of the upper BMC detects whether the node 10 is inserted.

在步驟S506中,若判斷出節點10插入伺服器系統1且已經過第一預設時間,則接著判斷節點10於第二預設時間中是否仍持續插入伺服器系統1。若判斷節點10於第二預設時間中仍持續插入伺服器系統1,表示節點10與所插入伺服器系統1中的位置皆正確無誤,則執行步驟S308之後的步驟;若判斷節點10於第二預設時間中拔離伺服器系統1,表示節點10與所插入伺服器系統1中的位置可能有錯誤或插錯節點10而被拔離,則執行步驟S300之後的步驟,以將正 確的節點10插入至伺服器系統1中的正確位置,所以步驟S506的判斷機制為一種人為防呆(fool-proofing)機制。 In step S506, if it is determined that the node 10 is inserted into the server system 1 and has passed the first preset time, it is then determined whether the node 10 continues to be inserted into the server system 1 during the second preset time. If it is determined that the node 10 continues to be inserted into the server system 1 during the second preset time, indicating that the positions of the node 10 and the inserted server system 1 are correct, the steps subsequent to step S308 are performed; If the position of the node 10 and the inserted server system 1 may be wrong or the node 10 is inserted and disconnected, the steps after step S300 are performed to The exact node 10 is inserted into the correct position in the server system 1, so the decision mechanism of step S506 is a human-fool-proofing mechanism.

此外,步驟S506中所執行的判斷程序可以係藉由偵測模組12、處理模組16或節點10的網路介面控制器來達成,本發明在此不加以限制,例如可藉由ping節點10上BMC之NIC來偵測節點10是否持續插入。藉此,藉由第5圖的判斷機制除了可以避免因網路不穩或接觸不良所造成的誤判節點10已插入伺服器系統1的情況之外,更能讓使用者有機會當誤差節點時,有機會將此節點拔除並插入正確之節點,換句話說,第5圖的判斷機制為一種de-bounce機制與防呆(fool-proofing)機制之組合。 In addition, the determining process performed in step S506 can be implemented by the detection module 12, the processing module 16, or the network interface controller of the node 10. The present invention is not limited herein, for example, by using a ping node. 10 on the BMC NIC to detect whether the node 10 continues to insert. Thereby, the judgment mechanism of FIG. 5 can prevent the user from having the opportunity to be the error node, in addition to the case that the misjudged node 10 has been inserted into the server system 1 due to network instability or poor contact. There is a chance to unplug this node and insert it into the correct node. In other words, the judgment mechanism of Figure 5 is a combination of a de-bounce mechanism and a fool-proofing mechanism.

綜合以上所述,本發明實施例提供一種伺服器系統與節點替換方法,其透過偵測節點的識別碼與節點中的硬體的硬體組態資訊,來判斷節點或節點中的硬體的硬體是否已被替換或新的節點是否被加入,進而選擇性地執行節點替換程序或對此節點安裝作業系統、軟體封裝資料或韌體封裝資料,甚至在節點或硬體沒被替換下,也可強制重新對此節點安裝作業系統、軟體封裝資料或韌體封裝資料。此外,本發明之伺服器系統與節點替換方法更可於節點安裝作業系統、軟體封裝資料或韌體封裝資料後,持續地偵測節點中的硬體狀況,並於節點中的硬體產生錯誤時透過提示訊號來讓使用者得知可以進行節點替換程序。藉此,本發明之伺服器 系統與節點替換方法可以自動地執行節點是否需要換置的處理程序,使用者僅需依據提示訊號來將節點插入或拔離伺服器系統,而不需要去做其他的檢測程序,十分具有實用性。 In summary, the embodiment of the present invention provides a server system and a node replacement method, which determine the hardware of a node or a node by detecting the identification code of the node and the hardware configuration information of the hardware in the node. Whether the hardware has been replaced or whether a new node has been added, thereby selectively performing a node replacement procedure or installing an operating system, software package data or firmware package data for the node, even if the node or hardware is not replaced, It is also possible to force the installation of operating system, software package data or firmware package data for this node. In addition, the server system and the node replacement method of the present invention can continuously detect the hardware condition in the node and install the error in the hardware in the node after the node installs the operating system, the software package data or the firmware package data. The prompt signal is used to let the user know that the node replacement procedure can be performed. Thereby, the server of the present invention The system and node replacement method can automatically execute the processing procedure of whether the node needs to be replaced. The user only needs to insert or disconnect the node from the server system according to the prompt signal, and does not need to perform other detection procedures, which is very practical. .

雖然本發明以上述之實施例揭露如上,然其並非用以限定本發明。在不脫離本發明之精神和範圍內,所為之更動與潤飾,均屬本發明之專利保護範圍。關於本發明所界定之保護範圍請參考所附之申請專利範圍。 Although the present invention has been disclosed above in the above embodiments, it is not intended to limit the invention. It is within the scope of the invention to be modified and modified without departing from the spirit and scope of the invention. Please refer to the attached patent application for the scope of protection defined by the present invention.

Claims (10)

一種節點替換方法,適用於一伺服器系統,該節點替換方法包括:該伺服器系統偵測一節點是否插入,並於偵測到該節點插入時產生一第一提示訊號,該第一提示訊號用以指示該節點不能拔離該伺服器系統;該伺服器系統偵測該節點的一第一識別碼與該節點中的一硬體的一第一硬體組態資訊;該伺服器系統依據該第一識別碼與該第一硬體組態資訊以及該節點插入該伺服器系統之前的一第二識別碼與一第二硬體組態資訊,判斷該節點或該節點中的該硬體是否已被替換;當該伺服器系統判斷出該節點與該節點中的該硬體皆未被替換,則該伺服器系統關閉該節點的電源,且提示該節點能拔離該伺服器系統,並開始計時一第一預設時間,判斷於該第一預設時間內該節點是否已拔離;以及若該伺服器系統判斷該節點或該節點中的該硬體於該第一預設時間內被替換,則對該節點安裝一作業系統、一軟體封裝資料與一韌體封裝資料至少其中之一。 A node replacement method is applicable to a server system. The node replacement method includes: the server system detects whether a node is inserted, and generates a first prompt signal when the node is detected to be inserted, the first prompt signal The server system is configured to detect that the node cannot be disconnected from the server system; the server system detects a first identification code of the node and a first hardware configuration information of a hardware in the node; the server system is based on Determining the hardware in the node or the node by the first identification code and the first hardware configuration information and a second identification code and a second hardware configuration information before the node is inserted into the server system Whether it has been replaced; when the server system determines that the node and the hardware in the node are not replaced, the server system turns off the power of the node, and prompts the node to pull out the server system, And starting to count a first preset time, determining whether the node has been disconnected during the first preset time; and if the server system determines that the node or the hardware in the node is at the first preset time Inner In other words, the operating system for the installation of a node, a software and a firmware data package at least one packaging data. 如請求項1所述之節點替換方法,其中於對該節點安裝該作業系統、該軟體封裝資料與該韌體封裝資料至少其中之一的步驟之後,更包括: 持續偵測該節點中的該硬體的狀況,以判斷該硬體是否發生錯誤;當判斷硬體發生錯誤時,該伺服器系統判斷該硬體發生的錯誤是可修復之錯誤或不可修復之錯誤;若判斷出該硬體產生不可修復之錯誤,則關閉該節點的電源,以執行一節點替換程序;以及若判斷出該硬體產生可修復之錯誤次數達到預設之門檻值,則對該節點進行正常關機程序,並據以執行該節點替換程序。 The node replacement method of claim 1, wherein after the step of installing at least one of the operating system, the software package data and the firmware package data on the node, the method further comprises: Continuously detecting the status of the hardware in the node to determine whether the hardware has an error; when determining that the hardware has an error, the server system determines that the error occurred by the hardware is a repairable error or is not repairable. Error; if it is determined that the hardware generates an irreparable error, the power of the node is turned off to perform a node replacement procedure; and if it is determined that the number of repairable errors of the hardware reaches a preset threshold, then The node performs a normal shutdown procedure and executes the node replacement procedure accordingly. 如請求項2所述之節點替換方法,其中該節點替換程序包括:將該節點設定為一初始模式;產生一第二提示訊號,該第二提示訊號用以指示該節點能拔離該伺服器系統;偵測該節點是否拔離該伺服器系統;偵測該節點或另一該節點是否插入該伺服器系統;以及若偵測到該節點或另一該節點插入該伺服器系統,則產生該第一提示訊號,並且接續偵測該節點的該第一識別碼與該節點中的該硬體的該第一硬體組態資訊之後的步驟。 The node replacement method of claim 2, wherein the node replacement procedure comprises: setting the node to an initial mode; generating a second prompt signal, the second prompt signal indicating that the node can be disconnected from the server a system; detecting whether the node is disconnected from the server system; detecting whether the node or another node is inserted into the server system; and generating the node or another node to insert the server system, generating The first prompt signal, and the step subsequent to detecting the first identification code of the node and the first hardware configuration information of the hardware in the node. 如請求項3所述之節點替換方法,其中於偵測該節點是否拔離該伺服器系統的步驟中,更包括:開始計時該第一預設時間;判斷該節點是否已拔離該伺服器系統;若判斷出該節點尚未拔離該伺服器系統,則重置該第一預設時間並重新計時;以及若判斷出該節點拔離該伺服器系統且已經過該第一預設時間,則執行偵測該節點或另一該節點是否插入該伺服器系統之後的步驟。 The node replacement method of claim 3, wherein the step of detecting whether the node is disconnected from the server system further comprises: starting to time the first preset time; determining whether the node has been disconnected from the server a system; if it is determined that the node has not been detached from the server system, resetting the first preset time and re-clocking; and if it is determined that the node is disconnected from the server system and the first preset time has elapsed, Then, the step of detecting whether the node or another node is inserted into the server system is performed. 如請求項3所述之節點替換方法,其中於偵測該節點或另一該節點是否插入該伺服器系統的步驟中,更包括:設定一第二預設時間並開始計時該第一預設時間及該第二預設時間,其中該第二預設時間接續於該第一預設時間後;判斷該節點是否插入該伺服器系統;若判斷出該節點尚未插入該伺服器系統,則重置該第一預設時間並重新計時;若判斷出該節點插入該伺服器系統且已經過該第一預設時間,則接著判斷該節點於該第二預設時間中是否仍持續插入該伺服器系統; 若判斷出該節點於該第二預設時間中拔離該伺服器系統,則接續將該節點設定為該初始模式之後的步驟;以及若判斷出該節點於經過該第二預設時間後仍未拔離該伺服器系統,則產生該第一提示訊號,並且接續偵測該節點的該第一識別碼與該節點中的該硬體的該第一硬體組態資訊之後的步驟。 The node replacement method of claim 3, wherein the step of detecting whether the node or another node is inserted into the server system further comprises: setting a second preset time and starting timing the first preset Time and the second preset time, wherein the second preset time is subsequent to the first preset time; determining whether the node is inserted into the server system; if it is determined that the node has not been inserted into the server system, Setting the first preset time and re-clocking; if it is determined that the node is inserted into the server system and the first preset time has elapsed, then determining whether the node continues to insert the servo during the second preset time System If it is determined that the node is disconnected from the server system in the second preset time, the node is subsequently set to the step after the initial mode; and if it is determined that the node is still after the second preset time After the server system is not pulled out, the first prompt signal is generated, and the step after detecting the first identifier of the node and the first hardware configuration information of the hardware in the node is performed. 一種伺服器系統,包括:一節點,具有一硬體;一偵測模組,通訊連接該節點,用以偵測該節點是否插入或拔離該伺服器系統,以及用以偵測該節點的一第一識別碼與該硬體的一第一硬體組態資訊;一提示模組,通訊連接該偵測模組,用以於該偵測模組偵測到該節點插入該伺服器系統時產生一第一提示訊號,該第一提示訊號用以指示該節點不能拔離該伺服器系統;以及一處理模組,通訊連接於該偵測模組與該節點之間,用以依據該第一識別碼與該第一硬體組態資訊以及該節點插入該伺服器系統之前的一第二識別碼與一第二硬體組態資訊來判斷該節點或該硬體是否已被替換;一計時模組,通訊連接該偵測模組,用以當該處理模組判斷出該節點與該硬體皆未被替換,且該處理模組關閉 該節點的電源時,該開始計時一第一預設時間,使該處理模組於該第一預設時間內判斷該節點是否已拔離,當該處理模組判斷該節點於該第一預設時間內未已拔離時,重新計時該第一預設時間,當該處理模組判斷該節點或該硬體於該第一預設時間內被替換,則對該節點安裝一作業系統、一軟體封裝資料與一韌體封裝資料至少其中之一。 A server system includes: a node having a hardware; a detection module communicatively coupled to the node for detecting whether the node is inserted or removed from the server system, and for detecting the node a first identification code and a first hardware configuration information of the hardware; a prompting module, the communication module is connected to the detecting module, and the detecting module detects that the node is inserted into the server system a first prompt signal is generated, the first prompt signal is used to indicate that the node cannot be disconnected from the server system, and a processing module is connected between the detecting module and the node, according to the Determining whether the node or the hardware has been replaced by the first identification code and the first hardware configuration information and a second identification code and a second hardware configuration information before the node is inserted into the server system; a timing module, the communication module is connected to the detection module, and the processing module determines that the node and the hardware are not replaced, and the processing module is closed When the power of the node is started, the first timing is started, so that the processing module determines whether the node has been detached during the first preset time, and when the processing module determines that the node is in the first pre- When the set time has not been pulled out, the first preset time is re-timed. When the processing module determines that the node or the hardware is replaced within the first preset time, an operation system is installed on the node, At least one of a software package material and a firmware package material. 如請求項6所述之伺服器系統,其中於該處理模組對該節點安裝該作業系統、該軟體封裝資料與該韌體封裝資料至少其中之一之後,該處理模組更持續判斷該硬體是否發生錯誤,若判斷出該硬體產生不可修復之錯誤,則關閉該節點的電源,以執行該節點替換程序,若判斷出該硬體產生可修復之錯誤次數達到預設之門檻值,則對該節點進行正常關機程序,並據以執行該節點替換程序。 The server system of claim 6, wherein after the processing module installs at least one of the operating system, the software package data, and the firmware package data for the node, the processing module further determines the hard If there is an error in the body, if it is determined that the hardware has an unrepairable error, the power of the node is turned off to execute the node replacement procedure, and if it is determined that the number of times the hardware generates a repairable error reaches a preset threshold, Then the node is normally shut down and the node replacement procedure is executed accordingly. 如請求項7所述之伺服器系統,其中於該伺服器系統執行該節點替換程序時,該處理模組將該節點設定為一初始模式,接著該提示模組產生一第二提示訊號,該第二提示訊號用以指示該節點能拔離該伺服器系統,再來該偵測模組偵測該節點是否拔離該伺服器系統,並於偵測到該節點拔離該伺服器系統後接續偵測該節點或另一該節點是否插入該伺服器系統,若該偵測模組偵測到該節點或另一該節點插入該伺服器系統,則該提示模組產生該第一提示訊 號,並且該偵測模組接續偵測該節點的該第一識別碼與該硬體的該第一硬體組態資訊之後的處理程序。 The server system of claim 7, wherein when the server system executes the node replacement program, the processing module sets the node to an initial mode, and then the prompting module generates a second prompt signal. The second prompt signal is used to indicate that the node can be pulled out of the server system, and then the detecting module detects whether the node is pulled out of the server system, and after detecting that the node is disconnected from the server system, Continuing to detect whether the node or another node is inserted into the server system, and if the detecting module detects that the node or another node is inserted into the server system, the prompting module generates the first prompt message. And the detection module continues to detect the first identification code of the node and the processing procedure after the first hardware configuration information of the hardware. 如請求項8所述之伺服器系統,其中於該偵測模組偵測該節點是否拔離該伺服器系統時,該計時模組開始計時該第一預設時間,若該偵測模組偵測出該節點於該第一預設時間內仍未拔離該伺服器系統,則該計時模組重置該第一預設時間並重新計時,若該偵測模組偵測出該節點拔離該伺服器系統且已經過該第一預設時間,則接續偵測該節點或另一該節點是否插入該伺服器系統之後的處理程序。 The server system of claim 8, wherein when the detecting module detects whether the node is removed from the server system, the timing module starts timing the first preset time, if the detecting module Detecting that the node has not been removed from the server system within the first preset time, the timing module resets the first preset time and re-times, if the detection module detects the node After the server system is removed and the first preset time has elapsed, the processing procedure after the node or another node is inserted into the server system is detected. 如請求項8所述之伺服器系統,其中於該偵測模組偵測該節點或另一該節點是否插入該伺服器系統時,該計時模組設定一第二預設時間並開始計時該第一預設時間與該第二預設時間,其中該第二預設時間接續於該第一預設時間後,若該偵測模組偵測出該節點於該第一預設時間內尚未插入該伺服器系統,則該計時模組重置該第一預設時間並重新計時,若該偵測模組偵測出該節點插入該伺服器系統且已經過該第一預設時間,則接續偵測該節點於該第二預設時間中是否仍持續插入該伺服器系統,若該偵測模組偵測出該節點於該第二預設時間中拔離該伺服器系統,則接續該處理模組將該節點設定為該初始模式之處理程序,若該偵測模組偵測出該節點於經過該第二預設時間後仍未拔離該伺服器系統,則該提示模組產 生該第一提示訊號,並且該偵測模組接續偵測該節點的該第一識別碼與該硬體的該第一硬體組態資訊之後的處理程序。 The server system of claim 8, wherein the timing module sets a second preset time and starts timing when the detecting module detects whether the node or another node is inserted into the server system. The first preset time and the second preset time, wherein the second preset time is subsequent to the first preset time, if the detecting module detects that the node has not been in the first preset time Inserting the server system, the timing module resets the first preset time and re-times. If the detecting module detects that the node is inserted into the server system and has passed the first preset time, then Continuing to detect whether the node continues to be inserted into the server system during the second preset time. If the detecting module detects that the node is disconnected from the server system during the second preset time, the connection is continued. The processing module sets the node as a processing procedure of the initial mode, and if the detecting module detects that the node has not been detached from the server system after the second preset time, the prompting module Production The first prompt signal is generated, and the detecting module continues to detect the first identifier of the node and the processing program after the first hardware configuration information of the hardware.
TW102144929A 2013-12-06 2013-12-06 Server system and node replacement method TWI518519B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW102144929A TWI518519B (en) 2013-12-06 2013-12-06 Server system and node replacement method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW102144929A TWI518519B (en) 2013-12-06 2013-12-06 Server system and node replacement method

Publications (2)

Publication Number Publication Date
TW201523276A TW201523276A (en) 2015-06-16
TWI518519B true TWI518519B (en) 2016-01-21

Family

ID=53935644

Family Applications (1)

Application Number Title Priority Date Filing Date
TW102144929A TWI518519B (en) 2013-12-06 2013-12-06 Server system and node replacement method

Country Status (1)

Country Link
TW (1) TWI518519B (en)

Also Published As

Publication number Publication date
TW201523276A (en) 2015-06-16

Similar Documents

Publication Publication Date Title
TWI684859B (en) Method for remote system recovery
CN106648958B (en) Basic input output system replys management system and its method and program product
US10127032B2 (en) System and method for unified firmware management
JP6383839B2 (en) Method, storage device and system used for remote KVM session
TWI632462B (en) Switching device and method for detecting i2c bus
US10114952B2 (en) System, apparatus and method for performing secure memory training and management in a trusted environment
US8874953B2 (en) System and method of cloud testing and remote monitoring for integrated circuit components in system validation
TWI576706B (en) Method for early boot phase and the related device
TW201637400A (en) Method for cabling connections and cabling connection system
US9703937B2 (en) Method to prevent operating system digital product key activation failures
US9712382B2 (en) Retrieving console messages after device failure
TWI611289B (en) Server and error detecting method thereof
TWI604304B (en) Electronic apparatus and detection method using the same
TW201314574A (en) Method and device for updating BIOS program for computer system
TWI501090B (en) Server system and node detection method
TWI553490B (en) Method and system for remote system configuration management and non-transitory computer-readable storage medium
US9792168B2 (en) System and method for cloud remediation of a client with a non-bootable storage medium
TW201734779A (en) Boot status notification method and server system using the same
JP5689783B2 (en) Computer, computer system, and failure information management method
TWI518519B (en) Server system and node replacement method
TW201411339A (en) Receiving an update module by accessing a network site
TW201502806A (en) Error coordination message for a blade device having a logical processor in another system firmware domain
JP6217086B2 (en) Information processing apparatus, error detection function diagnosis method, and computer program
JP2018110345A (en) Setting program, setting method, and setting device
JP5683354B2 (en) Monitoring device and monitoring method

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees