CN105868077B - A method of obtaining whole machine cabinet server node monitoring information - Google Patents

A method of obtaining whole machine cabinet server node monitoring information


Publication number
CN105868077B
Authority
CN
China
Prior art keywords
node
cmd
bmc
plate
whole machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610222967.3A
Other languages
Chinese (zh)
Other versions
CN105868077A (en)
Inventor
苏孝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd
Priority to CN201610222967.3A
Publication of CN105868077A
Application granted
Publication of CN105868077B
Legal status: Active
Anticipated expiration


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3058 Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/34 Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466 Performance evaluation by tracing or monitoring
    • G06F11/3476 Data logging

Abstract

The invention discloses a method for obtaining monitoring information from the server nodes of a whole-rack server. The method is implemented as follows: the hardware is set up so that, in a whole-rack server composed of several server nodes, each server node has a built-in BMC chip, and a centralized monitoring and management unit (RMC) and node midplanes are installed in the whole-rack server; an OEM command module is added to the BMC; the data format of the node BMC's OEM CMD modules is defined, that is, the OEM command uses a CMD block format that allows multiple CMD modules to be obtained and sent at the same time; the node midplane sends OEM CMD module data to the BMC to obtain node information; and the RMC obtains the node information from the node midplane and uses it for real-time monitoring and cooling control. Compared with the prior art, the method simplifies communication between the node BMC and the node midplane. The node midplane can obtain a large amount of data from the node BMC in a single exchange, in particular information that changes in real time, which improves system response speed and makes the method practical.

Description

A method for obtaining monitoring information from whole-rack server nodes
Technical field
The present invention relates to the field of computer technology, and specifically to a practical method for obtaining monitoring information from whole-rack server nodes.
Background
With the development of computer technology, whole-rack servers are being used more and more widely, especially in large data centers. Information for the whole rack is generally managed and monitored centrally by an RMC (Rack Management Controller). In a whole-rack server system, the server nodes on each layer (usually 4U) are connected to a node midplane over I2C, and the node midplane of each layer is connected over I2C to the upper-level centralized monitoring and management system, the RMC. The node midplane sends commands to the BMC of each node on its layer to obtain node information, and the RMC sends commands to the node midplane of each layer to obtain node monitoring information indirectly. At present, the node midplane usually obtains sensor, network, FRU, and other information for the nodes on its layer through standard IPMI commands. The problem with standard IPMI commands is that each data format carries little information, while the node midplane needs a large amount of information from each node. The node midplane therefore has to send many IPMI commands to the BMC to obtain the node information, which makes the interaction between the two inefficient. For information that changes in real time and is used by the cooling strategy, such as CPU temperature, memory temperature, and inlet air temperature, slow acquisition often means the data is not collected in time, so the rack is cooled poorly and the power consumption of the fan wall and the nodes cannot be reduced effectively. A method that obtains whole-rack server node monitoring information quickly and efficiently is therefore a problem that designers and developers urgently need to solve.
Summary of the invention
The technical task of the present invention is to address the shortcomings above by providing a practical method for obtaining monitoring information from whole-rack server nodes.
A method for obtaining monitoring information from whole-rack server nodes, implemented as follows:
The hardware is set up: in a whole-rack server composed of several server nodes, each server node has a built-in BMC chip, and a centralized monitoring and management unit (RMC) and node midplanes are installed in the whole-rack server;
An OEM command module is added to the BMC;
The data format of the node BMC OEM CMD modules is defined, that is, the OEM command uses a CMD block format that supports obtaining and sending multiple CMD modules at the same time;
The node midplane sends OEM CMD module data to the BMC to obtain node information;
The RMC obtains the node information from the node midplane and uses it for real-time monitoring and cooling control.
The whole-rack server uses 4U servers. The server nodes on each layer are connected to the node midplane over I2C, and the node midplane of each layer in the whole-rack server is connected over I2C to the upper-level centralized monitoring and management system, the RMC.
The data format of the node BMC OEM CMD modules is defined as follows. The CMD modules comprise three parts: a sensor CMD, a network CMD, and a FRU CMD. The sensor CMD defines the data format for CPU temperature, node inlet and outlet air temperature, memory temperature, voltage, node power consumption, node power on/off state, and health status. The network CMD supports getting and setting the network information of the BMC shared NIC and the dedicated port. The FRU CMD supports getting and setting the Product Name, Product Serial, and Chassis Extra fields at the same time.
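The sensor CMD described above can be pictured as a fixed binary record. The following is a minimal sketch only: the patent names the quantities carried by the sensor CMD but does not give field widths, units, or byte order, so the layout in `SENSOR_FORMAT`, the field names, and the units are all assumptions made for illustration.

```python
import struct
from collections import namedtuple

# Hypothetical layout (not from the patent): little-endian, no padding.
# Four signed 8-bit temperatures in degrees Celsius, two unsigned 16-bit
# values (voltage in millivolts, power in watts), and two status bytes.
SENSOR_FORMAT = "<bbbbHHBB"

SensorData = namedtuple(
    "SensorData",
    "cpu_temp_c inlet_temp_c outlet_temp_c mem_temp_c "
    "voltage_mv power_w powered_on health",
)


def pack_sensor_data(sensors):
    """Serialize a SensorData record into the assumed 10-byte payload."""
    return struct.pack(SENSOR_FORMAT, *sensors)


def unpack_sensor_data(raw):
    """Parse the assumed 10-byte payload back into a SensorData record."""
    return SensorData(*struct.unpack(SENSOR_FORMAT, raw))
```

Packing every real-time quantity into one record is what lets a single exchange replace a series of per-sensor reads; the exact record layout would be fixed by the BMC firmware.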
The node midplane and the node BMC communicate over an IPMB communication interface.
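To make the IPMB link concrete, here is a sketch of how an IPMB request frame is commonly assembled: a two-byte connection header with its own checksum, followed by the requester address, sequence number, command byte, data, and a second checksum. Each checksum is the two's complement of the 8-bit sum of the bytes it covers. The patent does not state which network function or command code its OEM command uses, so the values `0x30` and `0x01` in the usage note are hypothetical.

```python
def ipmb_checksum(data):
    """Two's-complement checksum: the covered bytes plus it sum to 0 mod 256."""
    return (-sum(data)) & 0xFF


def build_ipmb_request(rs_sa, net_fn, rq_sa, rq_seq, cmd, data=b"",
                       rs_lun=0, rq_lun=0):
    """Build an IPMB request frame as a bytes object.

    rs_sa / rq_sa are the responder and requester slave addresses;
    net_fn and rq_seq are 6-bit fields, the LUNs are 2-bit fields.
    """
    if not 0 <= net_fn <= 0x3F or not 0 <= rq_seq <= 0x3F:
        raise ValueError("net_fn and rq_seq must fit in 6 bits")
    if not 0 <= rs_lun <= 3 or not 0 <= rq_lun <= 3:
        raise ValueError("LUNs must fit in 2 bits")
    header = bytes([rs_sa, (net_fn << 2) | rs_lun])
    body = bytes([rq_sa, (rq_seq << 2) | rq_lun, cmd]) + bytes(data)
    return (header + bytes([ipmb_checksum(header)])
            + body + bytes([ipmb_checksum(body)]))
```

For example, `build_ipmb_request(0x20, 0x30, 0x10, 1, 0x01, b"\x01\x00")` builds a request addressed to a BMC at slave address `0x20`; the addresses, network function, and command code here are placeholders rather than values taken from the patent.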
The node midplane and the node BMC communicate through CMD modules. A CMD module supports get and set commands, which the node midplane uses to obtain information from a node and to set node information. Each CMD message consists of three parts:
CMD Index, which distinguishes different CMDs;
CMD length, that is, the length of the data;
CMD data, that is, the data in the defined format. The CMD data part of each CMD module contains several items of node BMC monitoring and management information.
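The index, length, and data layout is a type-length-value encoding, and concatenating several blocks is what lets one OEM command carry multiple CMD modules. The sketch below assumes a one-byte index and a one-byte length, which the patent does not specify; the index constants are likewise hypothetical.

```python
# Hypothetical index values; the patent only says the index tells CMDs apart.
CMD_SENSOR = 0x01
CMD_NETWORK = 0x02
CMD_FRU = 0x03


def encode_cmd_block(index, data=b""):
    """Pack one CMD block as [index][length][data], one byte each for
    index and length (an assumed width)."""
    data = bytes(data)
    if not 0 <= index <= 0xFF:
        raise ValueError("index must fit in one byte")
    if len(data) > 0xFF:
        raise ValueError("data must be at most 255 bytes")
    return bytes([index, len(data)]) + data


def encode_cmd_blocks(blocks):
    """Concatenate several (index, data) pairs into one OEM payload."""
    return b"".join(encode_cmd_block(index, data) for index, data in blocks)
```

A get request for sensor and FRU data in a single exchange could then be written as `encode_cmd_blocks([(CMD_SENSOR, b""), (CMD_FRU, b"")])`, with empty data meaning "return everything" under this sketch's convention.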
The method of the present invention for obtaining monitoring information from whole-rack server nodes has the following advantages:
In the method, the node midplane obtains whole-rack node monitoring information in real time by means of OEM CMD modules in the BMC. This simplifies communication between the node BMC and the node midplane, greatly reduces the time the RMC needs to obtain node information, and improves acquisition efficiency. The node midplane can obtain a large amount of data from the node BMC in a single exchange, in particular information that changes in real time, which greatly improves system response speed. The RMC can quickly adjust fan speed according to real-time information such as CPU, memory, and inlet air temperature, which improves cooling and can also further reduce node power consumption. The method is practical and easy to adopt.
Description of the drawings
Figure 1 is a flow chart of the implementation of the present invention.
Detailed description of the embodiments
The invention is further described below with reference to the drawing and specific embodiments.
As shown in Figure 1, the present invention provides a method for obtaining monitoring information from whole-rack server nodes. Using the whole-rack server's centralized monitoring and management unit (RMC, Rack Management Controller) and the node midplane, and based on the OEM command module added to the node BMC, information is obtained from the node BMC more quickly and efficiently, for example server node power on/off state, temperature information, network information, FRU information, asset information, and node power consumption.
The method is implemented as follows:
The hardware is set up: in a whole-rack server composed of several server nodes, each server node has a built-in BMC chip, and a centralized monitoring and management unit (RMC) and node midplanes are installed in the whole-rack server;
An OEM command module is added to the BMC;
The data format of the node BMC OEM CMD modules is defined, that is, the OEM command uses a CMD block format that supports obtaining and sending multiple CMD modules at the same time;
The node midplane sends OEM CMD module data to the BMC to obtain node information;
The RMC obtains the node information from the node midplane and uses it for real-time monitoring and cooling control.
The whole-rack server uses 4U servers. The server nodes on each layer are connected to the node midplane over I2C, and the node midplane of each layer in the whole-rack server is connected over I2C to the upper-level centralized monitoring and management system, the RMC.
The data format of the node BMC OEM CMD modules is defined as follows. The CMD modules comprise three parts: a sensor CMD, a network CMD, and a FRU CMD. In the sensor CMD module, the data formats for CPU temperature, node inlet and outlet air temperature, memory temperature, voltage, node power consumption, node power on/off state, health status, and similar items are defined. The network CMD module supports getting and setting the network information of the BMC shared NIC and the dedicated port at the same time. The FRU CMD module, where FRU stands for field-replaceable unit, supports getting and setting fields such as Product Name, Product Serial, and Chassis Extra at the same time.
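As an illustration of getting and setting several FRU fields at once, the sketch below serializes the three named fields as length-prefixed ASCII strings in a fixed order. The encoding (one length byte per field, ASCII text, this field order) is an assumption made for the example; the patent names the fields but does not describe how they are laid out.

```python
# Field order is an assumption of this sketch, not taken from the patent.
FRU_FIELDS = ("product_name", "product_serial", "chassis_extra")


def pack_fru_fields(fields):
    """Serialize the FRU fields as [length][ASCII bytes] per field.

    Missing fields are written as empty strings.
    """
    out = bytearray()
    for name in FRU_FIELDS:
        value = fields.get(name, "").encode("ascii")
        if len(value) > 0xFF:
            raise ValueError(name + " is longer than 255 bytes")
        out.append(len(value))
        out += value
    return bytes(out)


def unpack_fru_fields(raw):
    """Parse a payload produced by pack_fru_fields back into a dict."""
    fields = {}
    offset = 0
    for name in FRU_FIELDS:
        if offset >= len(raw):
            raise ValueError("truncated FRU payload")
        length = raw[offset]
        end = offset + 1 + length
        if end > len(raw):
            raise ValueError("truncated FRU payload")
        fields[name] = raw[offset + 1:end].decode("ascii")
        offset = end
    return fields
```

The same payload shape can serve both directions: a set command carries the packed fields as its data, and a get response returns them.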
The node midplane and the node BMC communicate over an IPMB communication interface.
The node midplane and the node BMC communicate through CMD modules. A CMD module supports get and set commands, which the node midplane uses to obtain information from a node and to set node information. Each CMD message consists of three parts:
CMD Index, which distinguishes different CMDs;
CMD length, that is, the length of the data;
CMD data, that is, the data in the defined format. The CMD data part of each CMD module contains several items of node BMC monitoring and management information.
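On the receiving side, a response that carries several CMD modules can be split by walking the index and length headers. This is a sketch under the assumption that the index and the length are each one byte wide, which the patent does not state.

```python
def decode_cmd_blocks(payload):
    """Split a payload of back-to-back [index][length][data] blocks.

    Returns a dict mapping each CMD index to its data bytes. Raises
    ValueError if a header or data field runs past the end of the payload.
    """
    blocks = {}
    offset = 0
    while offset < len(payload):
        if offset + 2 > len(payload):
            raise ValueError("truncated CMD header")
        index = payload[offset]
        length = payload[offset + 1]
        start = offset + 2
        end = start + length
        if end > len(payload):
            raise ValueError("truncated CMD data")
        blocks[index] = bytes(payload[start:end])
        offset = end
    return blocks
```

Because each block states its own length, a receiver can skip an index it does not recognize and still parse the blocks that follow, which keeps the format extensible.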
The method proposed by the present invention for quickly and efficiently obtaining whole-rack server node monitoring information simplifies communication between the node BMC and the node midplane. The node midplane can obtain a large amount of data from the node BMC in a single exchange, in particular information that changes in real time, which improves system response speed. For whole-rack fan speed control, because the RMC can obtain CPU, memory, inlet air temperature, and similar information through the node midplane in one fast exchange, it can quickly adjust fan speed to the current thermal state when node load changes. This improves cooling and can also substantially reduce fan wall and node power consumption.
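The patent does not describe the fan control policy itself, only that the RMC adjusts fan speed from the collected temperatures. As a purely illustrative sketch, the code below maps the hottest reported CPU temperature to a fan duty cycle by linear interpolation; the thresholds, the duty range, and the choice to use CPU temperature alone are all assumptions, and a real policy would likely also weigh memory and inlet air temperature.

```python
def fan_duty_percent(temp_c, low_c=30.0, high_c=80.0,
                     min_duty=20, max_duty=100):
    """Linearly map a temperature to a fan duty cycle in percent.

    All thresholds are hypothetical defaults chosen for illustration.
    """
    if temp_c <= low_c:
        return min_duty
    if temp_c >= high_c:
        return max_duty
    fraction = (temp_c - low_c) / (high_c - low_c)
    return round(min_duty + fraction * (max_duty - min_duty))


def rack_fan_duty(node_readings):
    """Pick a duty cycle for the fan wall from per-node readings.

    node_readings is a non-empty list of dicts with a "cpu_temp_c" key.
    """
    hottest = max(reading["cpu_temp_c"] for reading in node_readings)
    return fan_duty_percent(hottest)
```

The faster the readings arrive, the more closely such a loop can track load changes, which is the benefit the batch CMD format is meant to provide.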
The embodiments above are only specific examples of the present invention. The scope of patent protection of the present invention includes, but is not limited to, these embodiments. Any appropriate change or substitution that is consistent with the claims of the method for obtaining monitoring information from whole-rack server nodes and is made by a person of ordinary skill in the art shall fall within the scope of patent protection of the present invention.

Claims (2)

1. A method for obtaining monitoring information from whole-rack server nodes, characterized in that it is implemented as follows:
Setting up the hardware: in a whole-rack server composed of several server nodes, each server node has a built-in BMC chip, and a centralized monitoring and management unit (RMC) and node midplanes are installed in the whole-rack server;
Adding an OEM command module to the BMC;
Defining the data format of the node BMC OEM CMD modules, that is, the OEM command uses a CMD block format that supports obtaining and sending multiple CMD modules at the same time;
The node midplane sends OEM CMD module data to the BMC to obtain node information;
The RMC obtains the node information from the node midplane and performs real-time monitoring and cooling control;
The data format of the node BMC OEM CMD modules is defined as follows: the CMD modules comprise three parts, a sensor CMD, a network CMD, and a FRU CMD; the sensor CMD defines the data format for CPU temperature, node inlet and outlet air temperature, memory temperature, voltage, node power consumption, and node power on/off state; the network CMD supports getting and setting the network information of the BMC shared NIC and the dedicated port; the FRU CMD supports getting and setting the Product Name, Product Serial, and Chassis Extra fields at the same time;
The node midplane and the node BMC communicate over an IPMB communication interface;
The node midplane and the node BMC communicate through CMD modules; a CMD module supports get and set commands, which the node midplane uses to obtain information from a node and to set node information; each CMD message consists of three parts:
CMD Index, which distinguishes different CMDs;
CMD length, that is, the length of the data;
CMD data, that is, the data in the defined format, where the CMD data part of each CMD module contains several items of node BMC monitoring and management information.
2. The method for obtaining monitoring information from whole-rack server nodes according to claim 1, characterized in that the whole-rack server uses 4U servers, the server nodes on each layer are connected to the node midplane over I2C, and the node midplane of each layer in the whole-rack server is connected over I2C to the upper-level centralized monitoring and management system, the RMC.
CN201610222967.3A 2016-04-12 2016-04-12 A method of obtaining whole machine cabinet server node monitoring information Active CN105868077B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610222967.3A CN105868077B (en) 2016-04-12 2016-04-12 A method of obtaining whole machine cabinet server node monitoring information

Publications (2)

Publication Number Publication Date
CN105868077A CN105868077A (en) 2016-08-17
CN105868077B true CN105868077B (en) 2018-09-25

Family

ID=56637476

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610222967.3A Active CN105868077B (en) 2016-04-12 2016-04-12 A method of obtaining whole machine cabinet server node monitoring information

Country Status (1)

Country Link
CN (1) CN105868077B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326050A (en) * 2016-08-18 2017-01-11 浪潮电子信息产业股份有限公司 Automatic monitoring and managing method for complete machine cabinet server
CN106339294A (en) * 2016-08-29 2017-01-18 浪潮电子信息产业股份有限公司 Voltage monitoring system and method
CN107979502A (en) * 2016-10-25 2018-05-01 郑州云海信息技术有限公司 The method and flow that plate compatibility different type node monitors in a kind of server
CN107977273A (en) * 2016-10-25 2018-05-01 郑州云海信息技术有限公司 The Memory Optimize Method of node information collection memory sharing in a kind of cabinet
CN106528308B (en) * 2016-11-25 2019-07-02 山东浪潮人工智能研究院有限公司 A kind of server sensor information acquisition method
CN106850814B (en) * 2017-02-15 2020-02-14 浪潮集团有限公司 Method for realizing sensor information acquisition by adding custom command support
CN107302465B (en) * 2017-08-18 2021-06-29 郑州云海信息技术有限公司 PCIe Switch server complete machine management method
CN107623591A (en) * 2017-08-28 2018-01-23 北京云集智造科技有限公司 A kind of server universal monitor method and device
CN109240891A (en) * 2018-09-26 2019-01-18 郑州云海信息技术有限公司 A kind of monitoring method and device of SR whole machine cabinet server
CN113204361A (en) * 2021-05-20 2021-08-03 山东英信计算机技术有限公司 Automatic configuration method and device for whole cabinet server
CN115150304B (en) * 2022-07-29 2023-06-02 苏州浪潮智能科技有限公司 Monitoring method, system, device and medium for server node IPv6 network

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105224756A (en) * 2015-10-14 2016-01-06 浪潮电子信息产业股份有限公司 A kind of method for designing obtaining SmartRack whole machine cabinet air quantity
CN105389242A (en) * 2015-10-14 2016-03-09 浪潮电子信息产业股份有限公司 Method for acquiring overall cabinet server information in batch

Also Published As

Publication number Publication date
CN105868077A (en) 2016-08-17

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant