TWI505674B - Server system and a data transferring method thereof - Google Patents

Server system and a data transferring method thereof Download PDF

Info

Publication number
TWI505674B
TWI505674B TW102126952A TW102126952A TWI505674B TW I505674 B TWI505674 B TW I505674B TW 102126952 A TW102126952 A TW 102126952A TW 102126952 A TW102126952 A TW 102126952A TW I505674 B TWI505674 B TW I505674B
Authority
TW
Taiwan
Prior art keywords
server
module
management
server system
node
Prior art date
Application number
TW102126952A
Other languages
Chinese (zh)
Other versions
TW201505400A (en
Inventor
Li Zhang
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Corp filed Critical Inventec Corp
Priority to TW102126952A priority Critical patent/TWI505674B/en
Publication of TW201505400A publication Critical patent/TW201505400A/en
Application granted granted Critical
Publication of TWI505674B publication Critical patent/TWI505674B/en

Links

Landscapes

  • Debugging And Monitoring (AREA)
  • Small-Scale Networks (AREA)

Description

伺服器系統和其資料傳送方法Server system and its data transmission method

本發明係關於一種伺服器系統,尤其係關於一種可提升頻寬使用效率之伺服器系統和其資料傳送方法。The present invention relates to a server system, and more particularly to a server system capable of improving bandwidth usage efficiency and a data transmission method thereof.

近幾年來,隨著科技的快速發展,電腦系統所具備的功能則愈益強大。為了能夠有效監控主機板上各個元件的運作情形,許多主機板廠商便利用基板管理控制器(Baseboard Management Control,BMC)來監控系統的各式運作,並將監控之結果傳送至一管理模組。In recent years, with the rapid development of technology, the functions of computer systems have become more powerful. In order to effectively monitor the operation of various components on the motherboard, many motherboard manufacturers facilitate the use of Baseboard Management Control (BMC) to monitor various operations of the system and transmit the results of the monitoring to a management module.

一般而言,基板管理控制器會週期性地輪詢(polling)主機板上不同之感測器以監視主機板上硬體當前的工作狀態,並根據管理模組發送給基板管理控制器之詢問訊息,將監控結果傳送給管理模組以進行進一步之處理。換言之,在此種傳送模式下,當管理模組發送詢問訊息時,基板管理控制器並不會同時傳送監控結果給管理模組;反之,當基板管理控制器傳送監控結果給管理模組時,管理模組也不會同時發送詢問訊息。因此如此之傳送方式不利於頻寬之使用效率,因此仍有進一步改良的空間。In general, the baseboard management controller periodically polls different sensors on the motherboard to monitor the current working state of the hardware on the motherboard, and sends an inquiry to the baseboard management controller according to the management module. The message is transmitted to the management module for further processing. In other words, in the transmission mode, when the management module sends the inquiry message, the baseboard management controller does not simultaneously transmit the monitoring result to the management module; otherwise, when the baseboard management controller transmits the monitoring result to the management module, The management module will not send the inquiry message at the same time. Therefore, such a transmission method is not conducive to the efficiency of use of the bandwidth, so there is still room for further improvement.

鑑於上述頻寬使用效率不佳,本發明根據一預設條件自動傳送監控結果給管理模組,來解決頻寬佔用之問題。In view of the inefficient use of the above bandwidth, the present invention automatically transmits the monitoring result to the management module according to a preset condition to solve the problem of bandwidth occupation.

本發明之一態樣係在提供一種伺服器系統。此伺服器系統包括多個伺服器節點以及至少一管理模組。其中每個伺服器節點包含一節點控制模組,此節點控制模組可採集對應伺服器節點的運行狀態資訊。每一節點控制模組可分別通過一上行資料通道與管理模組通訊連接。其中,每一節點控制模組可將對應伺服器節點的運行狀態資訊封裝成一資料封包,並依據一預設條件自動將資料封包沿上行資料通道發送給管理模組,管理模組接收並解析資料封包,並依據對應伺服器節點的運行狀態資訊管理伺服器節點的運行。One aspect of the present invention is to provide a server system. The server system includes a plurality of server nodes and at least one management module. Each of the server nodes includes a node control module, and the node control module can collect the running status information of the corresponding server node. Each node control module can communicate with the management module through an uplink data channel. Each node control module can encapsulate the running status information of the corresponding server node into a data packet, and automatically send the data packet to the management module along the uplink data channel according to a preset condition, and the management module receives and parses the data. The packet is managed, and the operation of the server node is managed according to the running status information of the corresponding server node.

在一實施例中,預設條件為一固定時間週期,其中每經過此固定時間週期,每一節點控制模組會將資料封包沿上行資料通道發送給管理模組。In an embodiment, the preset condition is a fixed time period, and each time the fixed time period passes, each node control module sends the data packet to the management module along the uplink data channel.

在一實施例中,預設條件為一資訊變化,當些伺服器節點其中之一的運行狀態資訊發生變化時,該伺服器節點對應之節點控制模組將資料封包沿上行資料通道發送給該至少一管理模組。In an embodiment, the preset condition is an information change. When the running status information of one of the server nodes changes, the node control module corresponding to the server node sends the data packet to the uplink data channel. At least one management module.

在一實施例中,管理模組更包含一糾錯模組,以當伺服器節點之一出現資訊推送錯誤時,檢測出該伺服 器節點,其中糾錯模組通過檢測伺服器節點之一對應之節點控制模組是否停止該資訊推送動作,以判斷其是否出現資訊推送錯誤。In an embodiment, the management module further includes an error correction module to detect the servo when one of the server nodes has an information push error. And the error correction module determines whether the information push error occurs by detecting whether the node control module corresponding to one of the server nodes stops the information push action.

在一實施例中,更包含一下行資料通道,管理模組通過該下行資料通道發送一命令給該些伺服器節點,其中該命令為開關機命令。In an embodiment, the data channel is further included, and the management module sends a command to the server nodes through the downlink data channel, wherein the command is a power on/off command.

在一實施例中,節點控制模組可將對應伺服器節點的運行狀態資訊分類封裝為復數個資料封包,而一系統管理控制器耦接此管理模組,管理模組將所接收之復數個資料封包進行二次封裝,並發送給該系統管理控制器,其中運行狀態資訊為溫度資訊。In an embodiment, the node control module can encapsulate the running state information of the corresponding server node into a plurality of data packets, and a system management controller is coupled to the management module, and the management module receives the plurality of data modules. The data packet is secondarily packaged and sent to the system management controller, wherein the running status information is temperature information.

在一實施例中,系統管理控制器可為一機櫃管理控制器(Racks Management Controller,RMC),管理模組可為一風扇管理控制板(Fan Controller Board,FCB),節點控制模組可為一基板管理控制器(Baseboard Management Controller,BMC)其中,風扇管理控制板更電性連接一用於對此些伺服器節點散熱的風扇模組,並根據溫度資訊調整風扇模組中各風扇之轉速。In an embodiment, the system management controller can be a Racks Management Controller (RMC), the management module can be a Fan Controller Board (FCB), and the node control module can be a A baseboard management controller (BMC), wherein the fan management control board is further electrically connected to a fan module for dissipating heat from the server nodes, and adjusts the speed of each fan in the fan module according to the temperature information.

在一實施例中,管理模組為一機櫃管理控制器,節點控制模組為一基板管理控制器,而每一伺服器節點更分別連接一風扇,該基板管理控制器根據溫度資訊控制風扇之轉速。In one embodiment, the management module is a cabinet management controller, the node control module is a baseboard management controller, and each server node is further connected to a fan, and the baseboard management controller controls the fan according to the temperature information. Rotating speed.

本發明之另一態樣係在提供一種伺服器系統資料傳送方法。其中伺服器系統至少包括複數個伺服器節點以及至少一管理模組。首先採集此些伺服器節點的運行狀態資訊,並將此些運行狀態資訊封裝成複數個資料封包。接著,依據一預設條件 自動將此些資料封包沿一上行資料通道發送給管理模組。最後,接收並解析該些資料封包,其中管理模組依據該些伺服器節點的運行狀態資訊管理該些伺服器節點的運行。Another aspect of the present invention is to provide a server system data transfer method. The server system includes at least a plurality of server nodes and at least one management module. First, the running status information of the server nodes is collected, and the running status information is encapsulated into a plurality of data packets. Then, based on a predetermined condition The data packets are automatically sent to the management module along an uplink data channel. Finally, the data packets are received and parsed, wherein the management module manages the running of the server nodes according to the running status information of the server nodes.

綜上所述,本發明藉由根據一預設條件來讓節點控制模組主動將對應伺服器的運行狀態資訊傳送給管理模組,管理模組不須發送一詢問訊息給節點控制模組,因此不會佔用發送詢問訊息之頻寬,在頻寬使用上更有效率。In summary, the present invention allows the node control module to actively transmit the running status information of the corresponding server to the management module according to a preset condition, and the management module does not need to send an inquiry message to the node control module. Therefore, it does not occupy the bandwidth of sending the inquiry message, and is more efficient in bandwidth usage.

100和200和300‧‧‧伺服器系統100 and 200 and 300‧‧‧ server systems

101‧‧‧伺服器節點101‧‧‧ server node

102‧‧‧管理模組102‧‧‧Management module

103‧‧‧上行資料通道103‧‧‧Upstream data channel

104‧‧‧下行資料通道104‧‧‧Down data channel

105‧‧‧系統管理控制器105‧‧‧System Management Controller

1011‧‧‧節點控制模組1011‧‧‧node control module

1012‧‧‧伺服器1012‧‧‧Server

1021和2021和3011‧‧‧糾錯模組1021 and 2021 and 3011‧‧‧ error correction modules

201和301‧‧‧機櫃管理控制器201 and 301‧‧‧Cabinet Management Controller

202‧‧‧風扇管理控制板202‧‧‧Fan Management Control Board

203和303‧‧‧基板管理控制器203 and 303‧‧‧Base Management Controller

204和304‧‧‧主機板204 and 304‧‧‧ motherboards

205‧‧‧風扇模組205‧‧‧Fan module

2041和3041‧‧‧運算處理單元2041 and 3041‧‧‧Operation Processing Unit

3042‧‧‧風扇3042‧‧‧fan

2043和3043‧‧‧溫度感測器2043 and 3043‧‧‧ Temperature Sensors

401和402和403‧‧‧步驟401 and 402 and 403‧‧ steps

第1圖所示為根據本發明一實施例的伺服器系統概略圖示。Figure 1 is a schematic illustration of a server system in accordance with an embodiment of the present invention.

第2圖所示為根據本發明另一實施例的伺服器系統概略圖示。Figure 2 is a schematic illustration of a server system in accordance with another embodiment of the present invention.

第3圖所示為根據本發明再一實施例的伺服器系統概略圖示。Figure 3 is a schematic illustration of a server system in accordance with yet another embodiment of the present invention.

第4圖所示為根據本發明一實施例伺服器系統資料傳送方法。FIG. 4 is a diagram showing a data transmission method of a server system according to an embodiment of the present invention.

以下為本發明較佳具體實施例以所附圖示加以詳細說明,下列之說明及圖示使用相同之參考數字以表示相同或類似元件,並且在重複描述相同或類似元件時則予省略。The following description of the preferred embodiments of the invention is in the

第1圖所示為根據本發明一實施例的伺服器系統概略圖示。本發明之伺服器系統100包括:多個伺服器節點101以 及至少一管理模組102。其中每個伺服器節點101更包含有一節點控制模組1011以及一對應伺服器1012,此節點控制模組1011可採集此對應伺服器1012,亦即此節點,的運行狀態資訊。此外,此些節點控制模組1011可分別通過一上行資料通道103來耦接管理模組102,藉以將對應伺服器1012的運行狀態資訊傳送給管理模組102。傳統上,是由管理模組102先發送一詢問訊息(request)給節點控制模組1011後,再由節點控制模組1011將對應伺服器1012的運行狀態資訊傳送給管理模組102,然如此之傳送方式不利於頻寬之使用效率。因此本發明是讓節點控制模組1011依據一預設條件將對應伺服器1012的運行狀態資訊傳送給管理模組102,也就是說,只要預設條件滿足,節點控制模組1011即自動將對應伺服器1012的運行狀態資訊自動傳送給管理模組102,管理模組102不須先發送一詢問訊息給節點控制模組1011,因此,不會佔用頻寬,在頻寬使用上更有效率。Figure 1 is a schematic illustration of a server system in accordance with an embodiment of the present invention. The server system 100 of the present invention includes: a plurality of server nodes 101 to And at least one management module 102. Each of the server nodes 101 further includes a node control module 1011 and a corresponding server 1012. The node control module 1011 can collect the running status information of the corresponding server 1012, that is, the node. In addition, the node control module 1011 can be coupled to the management module 102 through an uplink data channel 103, so as to transmit the running status information of the server 1012 to the management module 102. Traditionally, after the management module 102 first sends a query message to the node control module 1011, the node control module 1011 transmits the running status information of the corresponding server 1012 to the management module 102. The transmission mode is not conducive to the efficiency of bandwidth usage. Therefore, the present invention allows the node control module 1011 to transmit the running status information of the corresponding server 1012 to the management module 102 according to a preset condition, that is, as long as the preset condition is satisfied, the node control module 1011 automatically responds. The running status information of the server 1012 is automatically transmitted to the management module 102. The management module 102 does not need to send an inquiry message to the node control module 1011 first. Therefore, the bandwidth is not occupied and the bandwidth is more efficient.

其中,節點控制模組1011會週期性地輪詢(polling)對應伺服器1012上不同之感測器(圖中未繪示出)以監視伺服器1012當前的運行狀態,並將對應伺服器1012的運行狀態資訊分類封裝成多個資料封包。當預設條件滿足時,節點控制模組1011會主動將此資料封包沿上行資料通道103發送給管理模組102。當管理模組102接收到節點控制模組1011上傳之資料封包後,會解析此些資料封包,並依據資料封包所載之運行狀態資訊管理各伺服器1012的運行。在一實施例中,此預設條件為一固定時間週期,也就是說,每當一固定時間週期經過後,節點控制模組1011會主動將伺服器1012當前的運行狀態沿上行資料通道103發送給 管理模組102。而在另一實施例中,此預設條件為一資訊變化,例如:伺服器1012運行狀態之改變情況,也就是說,每當節點控制模組1011監測到對應伺服器1012當前的運行狀態發生一劇烈之變化,例如:不能運作,此時節點控制模組1011會主動將伺服器1012當前的運行狀態沿上行資料通道103發送給管理模組102。在實作上,可設定一門檻值,當運行狀態之變化率超過該門檻值時,此節點控制模組1011會主動將伺服器1012當前的運行狀態發送給管理模組102。例如,門檻值設定為2℃,當節點控制模組1011監測到對應伺服器1012當前的溫度變化超過2℃,節點控制模組1011會主動將伺服器1012當前的運行狀態發送給管理模組102。此外,本發明之管理模組102更包含一糾錯模組1021,其中當此些伺服器節點101之其中之一出現資訊推送錯誤時,藉由糾錯模組1021可檢測出此出錯之伺服器節點。在一實施例中,糾錯模組1021是藉由檢測每一伺服器節點101對應之節點控制模組1011是否停止資訊推送之動作,來判斷其是否出現資訊推送錯誤。伺服器系統100更包含一下行資料通道104,管理模組102可通過此下行資料通道104發送命令給該些伺服器節點,如命令節點控制模組1011控制對應伺服器1012進行開機或關機。The node control module 1011 periodically polls different sensors (not shown) on the corresponding server 1012 to monitor the current running state of the server 1012, and the corresponding server 1012. The operational status information classification is encapsulated into multiple data packets. When the preset condition is met, the node control module 1011 will actively send the data packet to the management module 102 along the uplink data channel 103. After the management module 102 receives the data packet uploaded by the node control module 1011, the data packet is parsed, and the running of each server 1012 is managed according to the running status information contained in the data packet. In an embodiment, the preset condition is a fixed time period, that is, the node control module 1011 actively sends the current running state of the server 1012 along the uplink data channel 103 every time a fixed time period elapses. give Management module 102. In another embodiment, the preset condition is an information change, for example, a change in the running state of the server 1012, that is, whenever the node control module 1011 detects that the current running state of the corresponding server 1012 occurs. A drastic change, for example, cannot be operated. At this time, the node control module 1011 actively sends the current running state of the server 1012 to the management module 102 along the uplink data channel 103. In practice, a threshold may be set. When the rate of change of the operating state exceeds the threshold, the node control module 1011 actively sends the current running state of the server 1012 to the management module 102. For example, the threshold value is set to 2 ° C. When the node control module 1011 detects that the current temperature change of the corresponding server 1012 exceeds 2 ° C, the node control module 1011 actively sends the current running status of the server 1012 to the management module 102. . In addition, the management module 102 of the present invention further includes an error correction module 1021, wherein when one of the server nodes 101 has an information push error, the error correction module 1021 can detect the error servo. Node. In an embodiment, the error correction module 1021 determines whether a information push error occurs by detecting whether the node control module 1011 corresponding to each server node 101 stops the information push operation. The server system 100 further includes a data channel 104. The management module 102 can send commands to the server nodes through the downlink data channel 104. For example, the command node control module 1011 controls the corresponding server 1012 to be powered on or off.

此外,本案之伺服器系統更包括一系統管理控制器105,耦接此管理模組102。其中若一伺服器系統具有多個管理模組102,此系統管理控制器105可耦接此多個管理模組102以進行整個伺服器系統之整合管理控制。其中每一管理模組102會將節點控制模組1011上傳且分類封裝後多個資料封包進行二次封 裝,再上傳給系統管理控制器105。其中系統管理控制器105可根據此類別封裝後之資料封包,判別是由那一管理模組102上傳之資訊包,藉以進行整個伺服器系統之管理。例如,在一實施例中,對應伺服器1012包含主機板以及散熱風扇,節點控制模組1011週期性地輪詢(polling)對應伺服器1012上不同之感測器以分別監視主機板以及散熱風扇當前的運行狀態,並將主機板以及散熱風扇的運行狀態資訊分類封裝成資料封包後傳送給管理模組102。管理模組102會將主機板的資料封包以及散熱風扇的資料封包進行二次封裝,並包括對應管理模組102之資訊後,再上傳給系統管理控制器105。系統管理控制器105即可根據二次封裝後之資訊確定是哪一管理模組102上傳之運行狀態資訊,來進行對應之管理。In addition, the server system of the present invention further includes a system management controller 105 coupled to the management module 102. If the server system has multiple management modules 102, the system management controller 105 can be coupled to the plurality of management modules 102 for integrated management control of the entire server system. Each management module 102 uploads the node control module 1011 and classifies and encapsulates multiple data packets for secondary sealing. Installed and uploaded to the system management controller 105. The system management controller 105 can determine the information packet uploaded by the management module 102 according to the encapsulated data packet of the category, so as to manage the entire server system. For example, in an embodiment, the corresponding server 1012 includes a motherboard and a cooling fan, and the node control module 1011 periodically polls different sensors on the corresponding server 1012 to separately monitor the motherboard and the cooling fan. The current running state, and the operating state information of the motherboard and the cooling fan are packaged into data packets and transmitted to the management module 102. The management module 102 repackages the data package of the motherboard and the data package of the cooling fan, and includes information corresponding to the management module 102, and then uploads the information to the system management controller 105. The system management controller 105 can determine which management module 102 uploads the operating status information according to the information after the secondary encapsulation, and performs corresponding management.

在一實施例中,第1圖所示之系統管理控制器105,例如為一機櫃管理控制器(Racks Management Controller,RMC)。管理模組102,例如為一風扇管理控制板(Fan Controller Board,FCB)。而節點控制模組1011,例如為一設在主機板上之基板管理控制器(Baseboard Management Controller,BMC)。第2圖所示為根據本發明此一實施例的伺服器系統概略圖示。伺服器系統200包括機櫃管理控制器201、風扇管理控制板202、基板管理控制器203以及一具有多個風扇之風扇模組205。其中,基板管理控制器203設在一主機板204上。主機板204上例如包含運算處理單元2041和溫度感測器2043。而風扇管理控制板202電性連接風扇模組205,風扇模組205則用於對該複數個伺服器節點進行散熱,其中風扇管理控制板202根據一資訊,例如,每一主機板上運算 處理單元2041之溫度資訊來調整風扇模組205中對應風扇之轉速,藉以控制對應運算處理單元2041之工作溫度。值得注意的是,在本實施例中,僅以主機板204上設置有運算處理單元2041來說明本發明之應用,然本發明之應用與結構不限於上述之實施例。根據本實施例,基板管理控制器203對每一個主機板204的溫度感測器2043以輪詢(Polling)方式反覆讀取所測得的工作溫度值,藉以獲得運算處理單元2041工作溫度值,並將所測得的工作溫度值封裝成一資料封包。In an embodiment, the system management controller 105 shown in FIG. 1 is, for example, a Racks Management Controller (RMC). The management module 102 is, for example, a fan controller board (FCB). The node control module 1011 is, for example, a Baseboard Management Controller (BMC) provided on the motherboard. Figure 2 is a schematic illustration of a server system in accordance with this embodiment of the invention. The server system 200 includes a cabinet management controller 201, a fan management control board 202, a substrate management controller 203, and a fan module 205 having a plurality of fans. The substrate management controller 203 is disposed on a motherboard 204. The motherboard 204 includes, for example, an arithmetic processing unit 2041 and a temperature sensor 2043. The fan management control board 202 is electrically connected to the fan module 205, and the fan module 205 is configured to dissipate heat from the plurality of server nodes. The fan management control board 202 is configured according to an information, for example, on each motherboard. The temperature information of the processing unit 2041 adjusts the rotation speed of the corresponding fan in the fan module 205, thereby controlling the operating temperature of the corresponding operation processing unit 2041. It should be noted that in the present embodiment, the application of the present invention is described only by the operation processing unit 2041 provided on the motherboard 204. However, the application and structure of the present invention are not limited to the above embodiments. According to the embodiment, the substrate management controller 203 repeatedly reads the measured operating temperature value in a polling manner for the temperature sensor 2043 of each motherboard 204 to obtain the operating temperature value of the arithmetic processing unit 2041. And measuring the measured operating temperature value into a data packet.

當預設條件滿足時,基板管理控制器203會主動將此資料封包發送給風扇管理控制板202。當風扇管理控制板202接收到基板管理控制器203上傳之資料封包後,會解析此些資料封包,並依據資料封包所載之運算處理單元2041之工作溫度值控制風扇模組205之運轉狀態。在一實施例中,此預設條件為一固定時間週期,也就是說,每當一固定時間週期經過後,基板管理控制器203會主動將運算處理單元2041工作溫度值封裝成之資料封包發送給風扇管理控制板202以進行風扇模組205運轉狀態之控制。而在另一實施例中,此預設條件為一資訊變化,例如:運行狀態之改變情況,也就是說,每當基板管理控制器203監測到一主機板上之運算處理單元2041的運行狀態發生一劇烈之變化且超過一門檻值,例如:突然溫度上升超過設定之門檻值,此時基板管理控制器203會主動將此突發狀態發送給風扇管理控制板202進行即刻處理。在一實施例中,門檻值設定為2℃,當基板管理控制器203監測運算處理單元2041當前的溫度變化超過2℃,基板管理控制器203會主動將運算處理單元2041當前的運 行狀態發送給風扇管理控制板202,進行對應之處理,例如提升風扇模組205之運轉速度,藉以即時降溫。When the preset condition is met, the baseboard management controller 203 actively sends the data packet to the fan management control board 202. After the fan management control board 202 receives the data packet uploaded by the substrate management controller 203, the data packet is parsed, and the operating state of the fan module 205 is controlled according to the operating temperature value of the arithmetic processing unit 2041 contained in the data packet. In an embodiment, the preset condition is a fixed time period, that is, the substrate management controller 203 actively encapsulates the operating temperature value of the operation processing unit 2041 into a data packet transmission every time a fixed time period elapses. The fan management control board 202 is controlled to perform the operation state of the fan module 205. In another embodiment, the preset condition is an information change, for example, a change in the operating state, that is, each time the baseboard management controller 203 monitors the running state of the operation processing unit 2041 on a motherboard. A drastic change occurs and exceeds a threshold. For example, if the sudden temperature rise exceeds the set threshold, the baseboard management controller 203 will actively send the burst status to the fan management control board 202 for immediate processing. In an embodiment, the threshold value is set to 2 ° C. When the substrate management controller 203 monitors that the current temperature change of the arithmetic processing unit 2041 exceeds 2 ° C, the substrate management controller 203 actively takes the current operation of the arithmetic processing unit 2041. The row status is sent to the fan management control board 202 for corresponding processing, such as increasing the operating speed of the fan module 205, so as to instantly cool down.

另一方面,風扇管理控制板202更耦接一機櫃管理控制器201。風扇管理控制板202會將基板管理控制器203上傳之運算處理單元2041工作溫度值進行二次封裝以包括對應風扇管理控制板202之資訊,再上傳給機櫃管理控制器201。其中機櫃管理控制器201可根據此類別封裝後之資訊包,判別是由那一風扇管理控制板202上傳之資訊封包,藉以進行對應之管理。此外,風扇管理控制板202更包括一糾錯模組2021,藉以在基板管理控制器203上傳風扇管理控制板202資料封包出現資訊推送錯誤時,檢測出此出錯之基板管理控制器203。其中,糾錯模組2021是藉由檢測每一基板管理控制器203是否停止資訊推送之動作,來判斷其是否出現資訊推送錯誤。On the other hand, the fan management control board 202 is further coupled to a cabinet management controller 201. The fan management control board 202 performs secondary encapsulation on the operating temperature value of the operation processing unit 2041 uploaded by the baseboard management controller 203 to include information corresponding to the fan management control board 202, and then uploads the information to the rack management controller 201. The cabinet management controller 201 can determine the information packet uploaded by the fan management control board 202 according to the information package encapsulated in the category, so as to perform corresponding management. In addition, the fan management control board 202 further includes an error correction module 2021, so that when the substrate management controller 203 uploads the fan management control board 202 data packet occurrence information push error, the faulty substrate management controller 203 is detected. The error correction module 2021 determines whether or not an information push error occurs by detecting whether each of the substrate management controllers 203 stops the information push operation.

在另一實施例中,第1圖所示之管理模組102,例如為一機櫃管理控制器(Racks Management Controller,RMC)。而節點控制模組1011,例如為一設在主機板上之基板管理控制器(Baseboard Management Controller,BMC)。第3圖所示為根據本發明此一實施例的伺服器系統概略圖示。伺服器系統300包括機櫃管理控制器301以及基板管理控制器303。其中,基板管理控制器303設在一主機板304上。主機板304上例如包含運算處理單元3041、風扇3042和溫度感測器3043。值得注意的是,在本實施例中,僅以主機板304上設置有運算處理單元3041和風扇3042來說明本發明之應用,然本發明之應用與結構不限於上述之實施例。根據本實施例,基板管理控制器303根據一資訊,例如, 每一主機板上運算處理單元3041之溫度資訊來調整對應風扇3042之轉速,藉以控制運算處理單元3041之工作溫度。其中,基板管理控制器303對每一個主機板304的溫度感測器3043以輪詢(Polling)方式反覆讀取所測得的運算處理單元3041工作溫度值,並據此溫度資訊來調整風扇3042之轉速。並在預設條件滿足時,基板管理控制器303會主動將此資料封包發送給機櫃管理控制器301,進行後續之處理。在一實施例中,此預設條件為一固定時間週期,也就是說,每當一固定時間週期經過後,基板管理控制器303會主動將運算處理單元3041工作溫度值和風扇3042之運轉狀態分類封裝成資料封包發送給發送給機櫃管理控制器301以進行後續之控制。此外,機櫃管理控制器301更包括一糾錯模組3011,藉以在基板管理控制器303上傳資料封包給機櫃管理控制器301出現資訊推送錯誤時,檢測出此出錯之基板管理控制器303。其中,糾錯模組3011是藉由檢測每一基板管理控制器303是否停止資訊推送之動作,來判斷此基板管理控制器303是否出現資訊推送錯誤。In another embodiment, the management module 102 shown in FIG. 1 is, for example, a Racks Management Controller (RMC). The node control module 1011 is, for example, a Baseboard Management Controller (BMC) provided on the motherboard. Figure 3 is a schematic illustration of a server system in accordance with this embodiment of the present invention. The server system 300 includes a cabinet management controller 301 and a baseboard management controller 303. The substrate management controller 303 is disposed on a motherboard 304. The motherboard 304 includes, for example, an arithmetic processing unit 3041, a fan 3042, and a temperature sensor 3043. It should be noted that in the present embodiment, the application processing unit 3041 and the fan 3042 are provided on the motherboard 304 to explain the application of the present invention. However, the application and structure of the present invention are not limited to the above embodiments. According to the embodiment, the substrate management controller 303 is based on an information, for example, The temperature information of the arithmetic processing unit 3041 on each motherboard adjusts the rotation speed of the corresponding fan 3042, thereby controlling the operating temperature of the arithmetic processing unit 3041. The substrate management controller 303 repeatedly reads the measured operating temperature value of the arithmetic processing unit 3041 in a polling manner for the temperature sensor 3043 of each motherboard 304, and adjusts the fan 3042 according to the temperature information. The speed of rotation. When the preset condition is met, the baseboard management controller 303 actively sends the data packet to the rack management controller 301 for subsequent processing. In an embodiment, the preset condition is a fixed time period, that is, the substrate management controller 303 actively takes the operation processing unit 3041 operating temperature value and the operating state of the fan 3042 every time a fixed time period elapses. The classification is encapsulated into data packets and sent to the cabinet management controller 301 for subsequent control. In addition, the rack management controller 301 further includes an error correction module 3011 for detecting the faulty substrate management controller 303 when the substrate management controller 303 uploads the data packet to the rack management controller 301 for an information push error. The error correction module 3011 determines whether the substrate management controller 303 has an information push error by detecting whether each of the substrate management controllers 303 stops the information push operation.

第4圖所示為根據本發明一實施例伺服器系統資料傳送方法,請同時參閱第1圖和第4圖。首先,於步驟401,採集伺服器節點的運行狀態資訊,並將運行狀態資訊封裝成複數個資料封包。在一實施例中,伺服器系統至少包括複數個伺服器節點以及至少一管理模組,每個伺服器節點101更包含有一節點控制模組1011以及一對應伺服器1012,此節點控制模組1011可採集此對應伺服器1012,亦即此節點,的運行狀態資訊並封裝成複數個資料封包。接著,於步驟402,依據一預設條件自動將此些 資料封包沿一上行資料通道發送給管理模組。在一實施例中,節點控制模組1011通過一上行資料通道103來耦接管理模組102,並依據一預設條件將複數個資料封包傳送給管理模組102,其中,預設條件例如為一固定時間週期或一運行狀態資訊變化,也就是說,每當一固定時間週期經過或運行狀態資訊變化,節點控制模組1011會主動將伺服器1012當前的運行狀態沿上行資料通道103發送給管理模組102。最後於步驟403,管理模組接收並解析資料封包,以依據此些伺服器節點的運行狀態資訊管理該些伺服器節點的運行。FIG. 4 is a diagram showing a data transmission method of a server system according to an embodiment of the present invention. Please refer to FIG. 1 and FIG. 4 at the same time. First, in step 401, the running state information of the server node is collected, and the running state information is encapsulated into a plurality of data packets. In an embodiment, the server system includes at least a plurality of server nodes and at least one management module. Each server node 101 further includes a node control module 1011 and a corresponding server 1012. The node control module 1011 The corresponding server 1012, that is, the running status information of the node, can be collected and encapsulated into a plurality of data packets. Then, in step 402, the automatic conditions are automatically determined according to a preset condition. The data packet is sent to the management module along an uplink data channel. In an embodiment, the node control module 1011 is coupled to the management module 102 via an uplink data channel 103, and transmits a plurality of data packets to the management module 102 according to a preset condition, wherein the preset condition is, for example, A fixed time period or an operating state information change, that is, each time a fixed time period passes or the operating state information changes, the node control module 1011 actively sends the current running state of the server 1012 along the uplink data channel 103. Management module 102. Finally, in step 403, the management module receives and parses the data packet to manage the operation of the server nodes according to the running status information of the server nodes.

綜上所述,本發明藉由設定一預設條件來讓節點控制模組主動將對應伺服器的運行狀態資訊傳送給管理模組,管理模組不須發送一詢問訊息給節點控制模組,因此不會佔用發送詢問訊息之頻寬,僅在傳送伺服器運行狀態資訊給管理模組時佔用頻寬,因此在頻寬使用上更有效率。In summary, the present invention allows the node control module to actively transmit the running status information of the corresponding server to the management module by setting a preset condition, and the management module does not need to send an inquiry message to the node control module. Therefore, it does not occupy the bandwidth of sending the inquiry message, and only occupies the bandwidth when transmitting the server running status information to the management module, so it is more efficient in bandwidth usage.

雖然本發明已以實施方式揭露如上,然其並非用以限定本發明,任何熟習此技藝者,在不脫離本發明之精神和範圍內,當可作各種之更動與潤飾,因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。Although the present invention has been disclosed in the above embodiments, it is not intended to limit the present invention, and the present invention can be modified and modified without departing from the spirit and scope of the present invention. The scope is subject to the definition of the scope of the patent application attached.

100‧‧‧伺服器系統100‧‧‧Server system

101‧‧‧伺服器節點101‧‧‧ server node

102‧‧‧管理模組102‧‧‧Management module

103‧‧‧上行資料通道103‧‧‧Upstream data channel

104‧‧‧下行資料通道104‧‧‧Down data channel

105‧‧‧系統管理控制器105‧‧‧System Management Controller

1011‧‧‧節點控制模組1011‧‧‧node control module

1012‧‧‧伺服器1012‧‧‧Server

1021‧‧‧糾錯模組1021‧‧‧Error Correction Module

Claims (24)

一種伺服器系統,至少包括:複數個伺服器節點,每個伺服器節點包含一節點控制模組,其中該節點控制模組可採集對應伺服器節點的運行狀態資訊;以及至少一管理模組,其中每一該些節點控制模組分別通過一上行資料通道與該至少一管理模組通訊連接;其中,每一該些節點控制模組將對應伺服器節點的運行狀態資訊封裝成一資料封包,並依據一預設條件自動將該資料封包沿該上行資料通道發送給該至少一管理模組,該至少一管理模組接收並解析該資料封包,並依據該對應伺服器節點的運行狀態資訊管理該伺服器節點的運行。A server system includes at least: a plurality of server nodes, each server node including a node control module, wherein the node control module can collect operating state information of a corresponding server node; and at least one management module, Each of the node control modules is respectively connected to the at least one management module through an uplink data channel; wherein each of the node control modules encapsulates the running status information of the corresponding server node into a data packet, and The data packet is automatically sent to the at least one management module along the uplink data channel according to a preset condition, and the at least one management module receives and parses the data packet, and manages the data packet according to the running state information of the corresponding server node. The operation of the server node. 如申請專利範圍第1項所述之伺服器系統,其中該預設條件為一固定時間週期,其中每經過該固定時間週期,每一該些節點控制模組將該資料封包沿該上行資料通道發送給該至少一管理模組。The server system of claim 1, wherein the preset condition is a fixed time period, wherein each of the node control modules encapsulates the data packet along the uplink data channel every time the fixed time period elapses Send to the at least one management module. 如申請專利範圍第1項所述之伺服器系統,其中該預設條件為一資訊變化,當該些伺服器節點其中之一的運行狀態資訊發生變化時,該伺服器節點對應之節點控制模組將該資料封包沿該上行資料通道發送給該至少一管理 模組。The server system of claim 1, wherein the preset condition is an information change, and when the operating state information of one of the server nodes changes, the node control mode corresponding to the server node The group sends the data packet to the at least one management along the uplink data channel Module. 如申請專利範圍第1項所述的伺服器系統,其中該管理模組更包含一糾錯模組,以當該些伺服器節點之一出現資訊推送錯誤時,檢測出該伺服器節點。The server system of claim 1, wherein the management module further comprises an error correction module for detecting the server node when an information push error occurs in one of the server nodes. 如申請專利範圍第4項所述的伺服器系統,其中該糾錯模組通過檢測該些伺服器節點之一對應之該節點控制模組是否停止該資訊推送動作,以判斷其是否出現資訊推送錯誤。The server system of claim 4, wherein the error correction module determines whether the information push is triggered by detecting whether the node control module corresponding to one of the server nodes stops the information push action error. 如申請專利範圍第1項所述之伺服器系統,更包含一下行資料通道,該管理模組通過該下行資料通道發送一命令給該些伺服器節點。For example, the server system described in claim 1 further includes a data channel, and the management module sends a command to the server nodes through the downlink data channel. 如申請專利範圍第6項所述之伺服器系統,其中該命令為開關機命令。The server system of claim 6, wherein the command is a power on/off command. 如申請專利範圍第1項所述之伺服器系統,其中每一該些節點控制模組將對應伺服器節點的運行狀態資訊分類封裝為復數個資料封包。The server system of claim 1, wherein each of the node control modules encapsulates the operating state information of the corresponding server node into a plurality of data packets. 如申請專利範圍第8項所述之伺服器系統,更包括一系統管理控制器耦接該至少一管理模組,該至少一管 理模組將所接收之復數個資料封包進行二次封裝,並發送給該系統管理控制器。The server system of claim 8, further comprising a system management controller coupled to the at least one management module, the at least one tube The module performs secondary encapsulation on the received plurality of data packets and sends them to the system management controller. 如申請專利範圍第9項所述之伺服器系統,其中該系統管理控制器為一機櫃管理控制器(Racks Management Controller,RMC),該管理模組為一風扇管理控制板(Fan Controller Board,FCB),該節點控制模組為一基板管理控制器(Baseboard Management Controller,BMC)。The server system of claim 9, wherein the system management controller is a Racks Management Controller (RMC), and the management module is a fan management control board (Fan Controller Board, FCB). The node control module is a Baseboard Management Controller (BMC). 如申請專利範圍第10項所述的伺服器系統,其中該運行狀態資訊為溫度資訊。The server system of claim 10, wherein the operating status information is temperature information. 如申請專利範圍第11項所述的伺服器系統,其中該至少一風扇管理控制板更電性連接一用於對該複數個伺服器節點散熱的風扇模組,並根據該溫度資訊調整該風扇模組中各風扇之轉速。The server system of claim 11, wherein the at least one fan management control board is further electrically connected to a fan module for dissipating heat to the plurality of server nodes, and adjusting the fan according to the temperature information. The speed of each fan in the module. 如申請專利範圍第1項所述之伺服器系統,其中該管理模組為一機櫃管理控制器,該節點控制模組為一基板管理控制器。The server system of claim 1, wherein the management module is a cabinet management controller, and the node control module is a baseboard management controller. 如申請專利範圍第13項所述之伺服器系統,其中該運行狀態資訊為溫度資訊。The server system of claim 13, wherein the operating status information is temperature information. 如申請專利範圍第14項所述之伺服器系統,其中該每一伺服器節點更分別連接一風扇,該基板管理控制器根據該溫度資訊控制該些風扇之轉速。The server system of claim 14, wherein each of the server nodes is further connected to a fan, and the baseboard management controller controls the rotation speed of the fans according to the temperature information. 一種伺服器系統資料傳送方法,其中該伺服器系統至少包括複數個伺服器節點以及至少一管理模組,該方法包括:採集該些伺服器節點的運行狀態資訊,並將該些運行狀態資訊封裝成複數個資料封包;依據一預設條件自動將該些資料封包沿一上行資料通道發送給該至少一管理模組;以及接收並解析該些資料封包,其中該至少一管理模組依據該些伺服器節點的運行狀態資訊管理該些伺服器節點的運行。A server system data transmission method, wherein the server system includes at least a plurality of server nodes and at least one management module, the method comprising: collecting operation state information of the server nodes, and packaging the operation state information Forming a plurality of data packets; automatically transmitting the data packets to the at least one management module along an uplink data channel according to a preset condition; and receiving and parsing the data packets, wherein the at least one management module is configured according to the The running status information of the server node manages the operation of the server nodes. 如申請專利範圍第16項所述之伺服器系統資料傳送方法,其中該預設條件為一固定時間週期,依據該固定時間週期自動將該些資料封包沿該上行資料通道發送給該至少一管理模組。The server system data transmission method according to claim 16, wherein the preset condition is a fixed time period, and the data packets are automatically sent to the at least one management along the uplink data channel according to the fixed time period. Module. 如申請專利範圍第16項所述之伺服器系統資料傳送方法,其中該預設條件為一資訊變化,當該些伺服器節點其中之一的運行狀態資訊發生變化時,自動將該對應資料封包沿該上行資料通道發送給該至少一管理模組。The method for transmitting data of a server system according to claim 16, wherein the preset condition is an information change, and when the running status information of one of the server nodes changes, the corresponding data is automatically encapsulated. And sending the at least one management module along the uplink data channel. 如申請專利範圍第16項所述之伺服器系統資料傳送方法,更包括:對該些伺服器節點進行糾錯,以當該些伺服器節點其中之一出現資訊推送錯誤時,檢測出該伺服器節點。The server system data transmission method as described in claim 16 further includes: correcting the server nodes to detect the servo when an information push error occurs in one of the server nodes. Node. 如申請專利範圍第19項所述之伺服器系統資料傳送方法,更包括:檢測該些伺服器節點之一是否停止該資訊推送動作,以判斷其是否出現資訊推送錯誤。The method for transmitting a server system data according to claim 19, further comprising: detecting whether one of the server nodes stops the information pushing action to determine whether a information pushing error occurs. 如申請專利範圍第16項所述之伺服器系統資料傳送方法,更包含:通過一下行資料通道由該至少一管理模組發送一命令給該些伺服器節點。The server system data transmission method of claim 16, further comprising: sending, by the at least one management module, a command to the server nodes by using a downlink data channel. 如申請專利範圍第21項所述之伺服器系統資料傳送方法,其中該命令為開關機命令。The server system data transmission method according to claim 21, wherein the command is a power on/off command. 如申請專利範圍第16項所述之伺服器系統資料傳送方法,更包括:分類封裝該些伺服器節點的運行狀態資訊為復數個資料封包。For example, the server system data transmission method described in claim 16 further includes: classifying and packaging the running status information of the server nodes into a plurality of data packets. 如申請專利範圍第23項所述之伺服器系統資料傳送方法,更包括:二次封裝該復數個資料封包並發送給一系統管理控制器。The method for transmitting a server system data according to claim 23, further comprising: sub-packaging the plurality of data packets and sending the data to a system management controller.
TW102126952A 2013-07-26 2013-07-26 Server system and a data transferring method thereof TWI505674B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW102126952A TWI505674B (en) 2013-07-26 2013-07-26 Server system and a data transferring method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW102126952A TWI505674B (en) 2013-07-26 2013-07-26 Server system and a data transferring method thereof

Publications (2)

Publication Number Publication Date
TW201505400A TW201505400A (en) 2015-02-01
TWI505674B true TWI505674B (en) 2015-10-21

Family

ID=53019057

Family Applications (1)

Application Number Title Priority Date Filing Date
TW102126952A TWI505674B (en) 2013-07-26 2013-07-26 Server system and a data transferring method thereof

Country Status (1)

Country Link
TW (1) TWI505674B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI633416B (en) * 2017-06-30 2018-08-21 神雲科技股份有限公司 Server fan control system and control method

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI611290B (en) * 2015-09-04 2018-01-11 神雲科技股份有限公司 Method for monitoring server racks
US10298479B2 (en) 2016-05-09 2019-05-21 Mitac Computing Technology Corporation Method of monitoring a server rack system, and the server rack system
TWI587128B (en) 2016-05-11 2017-06-11 神雲科技股份有限公司 Method of automatically providing error status data for computer device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7589624B2 (en) * 2004-10-29 2009-09-15 Nec Corporation Component unit monitoring system and component unit monitoring method
TW201114214A (en) * 2009-10-14 2011-04-16 Inventec Corp Method for detecting node status
TW201222273A (en) * 2010-11-29 2012-06-01 Inventec Corp Computer system and method for managing computer device
TW201321943A (en) * 2011-11-17 2013-06-01 Hon Hai Prec Ind Co Ltd Fan control system and method
TW201324345A (en) * 2011-12-14 2013-06-16 Inventec Corp Server system and address configuration method for power distribution units

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7589624B2 (en) * 2004-10-29 2009-09-15 Nec Corporation Component unit monitoring system and component unit monitoring method
TW201114214A (en) * 2009-10-14 2011-04-16 Inventec Corp Method for detecting node status
TW201222273A (en) * 2010-11-29 2012-06-01 Inventec Corp Computer system and method for managing computer device
TW201321943A (en) * 2011-11-17 2013-06-01 Hon Hai Prec Ind Co Ltd Fan control system and method
TW201324345A (en) * 2011-12-14 2013-06-16 Inventec Corp Server system and address configuration method for power distribution units

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI633416B (en) * 2017-06-30 2018-08-21 神雲科技股份有限公司 Server fan control system and control method

Also Published As

Publication number Publication date
TW201505400A (en) 2015-02-01

Similar Documents

Publication Publication Date Title
US20150019711A1 (en) Server system and a data transferring method thereof
TWI505674B (en) Server system and a data transferring method thereof
US8229596B2 (en) Systems and methods to interface diverse climate controllers and cooling devices
US8538584B2 (en) Apparatus and method for controlling environmental conditions in a data center using wireless mesh networks
US8600560B2 (en) Apparatus and method for controlling computer room air conditioning units (CRACs) in data centers
US7135826B2 (en) Fan control system
US8656003B2 (en) Method for controlling rack system using RMC to determine type of node based on FRU's message when status of chassis is changed
US20150127814A1 (en) Monitoring Server Method
US20120136484A1 (en) Data center
US20140069626A1 (en) Temperature control system and temperature control method thereof
US10303574B1 (en) Self-generated thermal stress evaluation
US10797959B2 (en) LLDP based rack management controller
US8788874B2 (en) Container system and monitoring method for container system
US20130138997A1 (en) Rack system
US20120078422A1 (en) Interfacing climate controllers and cooling devices
US20140362526A1 (en) Server system and heat-dissipation method of the same
US20140317267A1 (en) High-Density Server Management Controller
CN102495786B (en) Server system
JP6709086B2 (en) Communication control device and communication control method
TW201306728A (en) Managing system for heat dissipation of server group
US20080218360A1 (en) Transmission apparatus, transmission method and recording medium with recorded transmission program
US10284134B2 (en) Method for controlling a fan module of a server rack and controller unit for implementing the same
CN111324503A (en) Machine frame management device, method and computer readable storage medium
JP6591880B2 (en) COMMUNICATION DEVICE AND ITS CONTROL METHOD
WO2023125574A1 (en) Speed regulation control method, apparatus, storage medium, and electronic apparatus

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees