CN104184604B - A kind of cloud platform architecture supervisory systems - Google Patents
- Publication number
- CN104184604B (application CN201310198963.2A; earlier publication CN104184604A)
- Authority
- CN
- China
- Prior art keywords
- node
- group
- module
- monitoring
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Remote Monitoring And Control Of Power-Distribution Networks (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention discloses a cloud platform infrastructure supervision system, comprising: a node and group management system for creating and maintaining a list of nodes and groups; an operating system deployment system for installing an operating system on nodes and groups newly added to the list; a node and group information monitoring system for monitoring, in real time, the software and hardware configuration information and operating status of the nodes and groups in the list; and a server alarm system for configuring and executing monitoring strategies and alarm strategies on nodes and groups on which an operating system has been deployed. The system frees staff from manual system deployment: once a new machine has been racked, a system can be deployed to it remotely and quickly, with no manual involvement in the process, which greatly reduces the consumption of human resources.
Description
Technical field
The present invention relates to the technical field of server cluster monitoring and management, and more particularly to a cloud platform infrastructure supervision system.
Background
Existing technology for supporting large-scale data center servers includes IPMI (Intelligent Platform Management Interface). IPMI information is transmitted through the baseboard management controller, BMC (Baseboard Management Controller).
Intel DCM (Intel Datacenter Manager) is a data center management platform released by Intel. The platform can dynamically allocate power according to the priority level of each server, and can use actually measured values to assess the data center's cooling equipment and analyze the power supply load. Its practical value lies in directly saving data center energy consumption and in effectively monitoring, managing and reporting on every node in the data center. DCM also exposes a series of interfaces to the software platforms built on it, so that software systems can call them directly.
Some existing data center management platforms can perform operations such as remote power on/off; others can monitor nodes and raise alarms. However, no single standalone system combines remote power on/off, remote one-click system deployment, and intelligent node monitoring and alarming.
Summary of the invention
An object of the present invention is to provide a cloud platform infrastructure supervision system that can carry out a series of operations on data center servers, from racking and networking through to monitoring and alarming, together with subsequent maintenance and service, and that can also perform server management and energy consumption control, thereby solving the foregoing problems in the prior art.
To achieve this object, the technical solution adopted by the present invention is as follows:
A cloud platform infrastructure supervision system, comprising:
a node and group management system for creating and maintaining a list of nodes and groups; specifically, it adds automatically discovered nodes and groups to the list, and also adds manually created groups to the list;
an operating system deployment system for installing an operating system on nodes and groups newly added to the list; it is also used to install an operating system on nodes and groups that are in the list and already have an operating system installed but need that operating system replaced or updated;
a node and group information monitoring system for monitoring, in real time, the software and hardware configuration information and operating status of the nodes and groups in the list;
a server alarm system for configuring and executing monitoring strategies and alarm strategies on nodes and groups on which an operating system has been deployed.
Preferably, the system further comprises:
a server intelligent energy consumption control system for performing intelligent energy consumption control on each node and each group;
a log management system for storing and backing up the logs on the nodes and groups, and also for storing and backing up the running logs and user operation logs of the cloud platform infrastructure supervision system itself; it also provides a log query service.
Preferably, the node and group management system comprises a group management module, a node management module and an automatic node discovery module. The group management module is used to create, maintain and manage groups. The node management module is used to manage each node or server; this management includes adding or deleting a node or server and adjusting the group to which a node or server belongs. The automatic node discovery module is used to automatically discover a new server or new node according to the IPMI protocol once the IPMI port of that server or node has been connected to a switch.
Preferably, when the operating system deployment system deploys an operating system, the underlying deployment is performed by a Windows Server deployment server.
Preferably, the node and group information monitoring system comprises a node monitoring module, a group monitoring module, a node information acquisition module and a node remote control module. The node monitoring module is used to monitor, in real time, the performance information, storage state, energy consumption state and alarm information of a node. The group monitoring module is used to monitor, in real time, changes to a group and the operating system type and operating status of the member nodes in the group. The node information acquisition module is used to obtain the information of a node from the underlying (low-level) interface and display it; the information of the node includes CPU information, memory information, mainboard BIOS information, fan information, air inlet temperature and network information. The node remote control module is used to perform remote power on/off and remote restart operations on a node.
Preferably, the server alarm system comprises an alarm strategy configuration module and an alarm strategy execution module. The alarm strategy configuration module is used to configure monitoring strategies and alarm strategies on a node, or in a group, on which an operating system has been deployed. The alarm strategy execution module is used to execute the monitoring strategies and the alarm strategies.
Preferably, the monitoring strategies include monitoring CPU utilization, memory usage and air inlet temperature. The alarm strategies include determining whether the CPU utilization and/or memory usage and/or air inlet temperature has reached a predetermined threshold, and sending an alarm if the threshold is met or exceeded.
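The threshold check described above can be illustrated with a minimal Python sketch. It is not taken from the patent: the class name `AlarmStrategy`, the function `evaluate`, the metric keys and the sample values are all hypothetical, and only the "meets or exceeds the threshold" rule comes from the text.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class AlarmStrategy:
    """Thresholds for the three monitored metrics (hypothetical field names)."""
    cpu_percent: float
    memory_percent: float
    inlet_temp_c: float


def evaluate(strategy: AlarmStrategy, sample: dict[str, float]) -> list[str]:
    """Return the metrics whose sampled value meets or exceeds its threshold."""
    thresholds = {
        "cpu_percent": strategy.cpu_percent,
        "memory_percent": strategy.memory_percent,
        "inlet_temp_c": strategy.inlet_temp_c,
    }
    # Metrics missing from the sample are skipped rather than treated as alarms.
    return [
        name for name, limit in thresholds.items()
        if name in sample and sample[name] >= limit
    ]


# Illustrative values only: a node at 80% CPU against a 50% CPU threshold.
strategy = AlarmStrategy(cpu_percent=50.0, memory_percent=90.0, inlet_temp_c=35.0)
alarms = evaluate(strategy, {"cpu_percent": 80.0, "memory_percent": 40.0, "inlet_temp_c": 22.0})
print(alarms)
```

Each metric is checked independently, which matches the "and/or" wording: any one metric reaching its threshold is enough to raise an alarm.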
Preferably, the server intelligent energy consumption control system comprises an intelligent power consumption strategy configuration module and an intelligent power consumption control module. The intelligent power consumption strategy configuration module is used to create and maintain power consumption strategies for use by the intelligent power consumption control module. The intelligent power consumption control module is used to monitor, in real time and according to the power consumption strategies, the power consumption of the nodes and groups; when the power consumption of a node meets or exceeds a predetermined threshold, it controls the node to reduce its load so that its power consumption falls below the predetermined threshold. The intelligent power consumption control module is also used to dynamically distribute the total power consumption assigned to a server group in the power consumption strategy among the servers or nodes in that group, so as to keep the load on the servers or nodes in the group balanced.
Preferably, the log management system comprises a DCM log recording module, an operation log recording module, an alarm log recording module and a log query module. The DCM log recording module is used to extract and store logs through the data center management interface provided by Intel DCM. The operation log recording module is used to log all operations performed by users of the cloud platform infrastructure supervision system. The alarm log recording module is used to record all alarm information. The log query module is used to provide users with a query service over all logs.
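As a rough illustration of how the three recording modules and the query module could share one store, the following Python sketch keeps entries in memory and filters them on request. The names `LogEntry`, `LogStore`, `record` and `query`, the category strings and the sample entries are assumptions made for illustration; the patent does not specify a storage format or query interface, and the backup function is not modelled here.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import datetime

# Categories named after the three recording modules in the text.
CATEGORIES = ("dcm", "operation", "alarm")


@dataclass(frozen=True)
class LogEntry:
    timestamp: datetime
    category: str
    source: str   # node, group or user name (hypothetical convention)
    message: str


class LogStore:
    """In-memory stand-in for log storage; a real system would persist and back up."""

    def __init__(self) -> None:
        self._entries: list[LogEntry] = []

    def record(self, category: str, source: str, message: str,
               timestamp: datetime | None = None) -> LogEntry:
        if category not in CATEGORIES:
            raise ValueError(f"unknown log category: {category}")
        entry = LogEntry(timestamp or datetime.now(), category, source, message)
        self._entries.append(entry)
        return entry

    def query(self, category: str | None = None, source: str | None = None,
              since: datetime | None = None) -> list[LogEntry]:
        """Return entries matching every filter that is not None, in insertion order."""
        return [
            e for e in self._entries
            if (category is None or e.category == category)
            and (source is None or e.source == source)
            and (since is None or e.timestamp >= since)
        ]


store = LogStore()
store.record("operation", "admin", "powered on node-01", datetime(2013, 5, 24, 9, 0))
store.record("alarm", "node-01", "CPU utilization reached threshold", datetime(2013, 5, 24, 9, 5))
store.record("dcm", "node-02", "power reading collected", datetime(2013, 5, 24, 9, 10))

alarm_entries = store.query(category="alarm")
recent_entries = store.query(since=datetime(2013, 5, 24, 9, 5))
```

Keeping all categories in one store is only one possible design; it makes the "query over all logs" service a single filter rather than a merge across separate stores.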
Preferably, the node is a single server or virtual machine, and the group comprises two or more servers and/or two or more nodes.
The beneficial effects of the invention are as follows:
The cloud platform infrastructure supervision system of the present invention frees staff from manual system deployment: once a new machine has been racked, a system can be deployed to it remotely and quickly, with no manual involvement in the process, which greatly reduces the consumption of human resources. The invention also manages the servers in a data center efficiently, can effectively control energy consumption and keep server load balanced, and can monitor operating conditions and raise alarms, which helps prevent failures and improves the reliability of the data center as a whole.
Brief description of the drawings
Fig. 1 is a structural diagram of the cloud platform infrastructure supervision system of the present invention;
Fig. 2 is an overview of the basic workflow of the cloud platform infrastructure supervision system of the present invention at run time;
Fig. 3 is a schematic diagram of the installation procedure of the cloud platform infrastructure supervision system itself and of the process by which it discovers servers.
Detailed description of the embodiments
To make the purpose, technical solution and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are only used to explain the present invention and are not intended to limit it.
As shown in Figs. 1-2, the invention discloses a cloud platform infrastructure supervision system, comprising:
A node and group management system for creating and maintaining a list of nodes and groups; specifically, it adds automatically discovered nodes and groups to the list, and also adds manually created groups to the list. The node and group management system comprises a group management module, a node management module and an automatic node discovery module. The group management module is used to create, maintain and manage groups. The node management module is used to manage each node or server; this management includes adding or deleting a node or server and adjusting the group to which a node or server belongs. The automatic node discovery module is used to automatically discover a new server or new node according to the IPMI protocol once the IPMI port of that server or node has been connected to a switch.
An operating system deployment system for installing an operating system on nodes and groups newly added to the list; it is also used to install an operating system on nodes and groups that are in the list and already have an operating system installed but need that operating system replaced or updated. When the operating system deployment system deploys an operating system, the underlying deployment is performed by a Windows Server deployment server.
A node and group information monitoring system for monitoring, in real time, the software and hardware configuration information and operating status of the nodes and groups in the list. It comprises a node monitoring module, a group monitoring module, a node information acquisition module and a node remote control module. The node monitoring module is used to monitor, in real time, the performance information, storage state, energy consumption state and alarm information of a node. The group monitoring module is used to monitor, in real time, changes to a group and the operating system type and operating status of the member nodes in the group. The node information acquisition module is used to obtain the information of a node from the underlying (low-level) interface and display it; the information of the node includes CPU information, memory information, mainboard BIOS information, fan information, air inlet temperature and network information. The node remote control module is used to perform remote power on/off and remote restart operations on a node.
A server alarm system for configuring and executing monitoring strategies and alarm strategies on nodes and groups on which an operating system has been deployed. It comprises an alarm strategy configuration module and an alarm strategy execution module. The alarm strategy configuration module is used to configure monitoring strategies and alarm strategies on a node, or in a group, on which an operating system has been deployed. The alarm strategy execution module is used to execute the monitoring strategies and the alarm strategies.
The system may also include:
A server intelligent energy consumption control system for performing intelligent energy consumption control on each node and each group. It comprises an intelligent power consumption strategy configuration module and an intelligent power consumption control module. The intelligent power consumption strategy configuration module is used to create and maintain power consumption strategies for use by the intelligent power consumption control module. The intelligent power consumption control module is used to monitor, in real time and according to the power consumption strategies, the power consumption of the nodes and groups; when the power consumption of a node meets or exceeds a predetermined threshold, it controls the node to reduce its load so that its power consumption falls below the predetermined threshold. The intelligent power consumption control module is also used to dynamically distribute the total power consumption assigned to a server group in the power consumption strategy among the servers or nodes in that group, so as to keep the load on the servers or nodes in the group balanced. The monitoring strategies include monitoring CPU utilization, memory usage and air inlet temperature. The alarm strategies include determining whether the CPU utilization and/or memory usage and/or air inlet temperature has reached a predetermined threshold, and sending an alarm if the threshold is met or exceeded.
A log management system for storing and backing up the logs on the nodes and groups, and also for storing and backing up the running logs and user operation logs of the cloud platform infrastructure supervision system itself; it also provides a log query service. It comprises a DCM log recording module, an operation log recording module, an alarm log recording module and a log query module. The DCM log recording module is used to extract and store logs through the data center management interface provided by Intel DCM. The operation log recording module is used to log all operations performed by users of the cloud platform infrastructure supervision system. The alarm log recording module is used to record all alarm information. The log query module is used to provide users with a query service over all logs. The node is a single server, and the server group is a group made up of two or more servers.
The initialization and deployment flow of the cloud platform infrastructure supervision system of the present invention is introduced below:
1. Preliminary preparation: installing the system. The cloud platform infrastructure supervision system of the present invention must be installed in advance on a standalone server (for example, a Super Cloud Server R6240-G9) or on a PC. That machine can then perform a series of remote operations on, and control, the nodes in the data center.
2. Racking new machines. Data center operations staff place the physical machines to be deployed in a rack and connect the cables. One switch is connected to the IPMI network port of each of the server's four nodes; the other LAN port of each node is connected to a second switch. The newly racked servers are then searched for remotely through the cloud platform infrastructure supervision system of the present invention. After the search, the newly racked physical machines can be added to the system in batches for subsequent maintenance and management.
3. System deployment. The basic information of the added physical machines is displayed in the cloud platform infrastructure supervision system of the present invention, and system deployment can then be carried out: select the operating system to be installed and drag it directly onto the node to be deployed to trigger one-click deployment. The deployment task completes without manual intervention. The system supports deploying most operating systems on the market, such as Windows, Linux and VMware virtual machines.
4. Group creation. Groups are set up for the nodes that have been successfully added to the cloud platform infrastructure supervision system of the present invention. The supported group types are Rack, Row, Room, DataCenter and Logical. Grouping makes the servers in the data center easier to manage and also allows them to be controlled more effectively.
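The list of nodes and groups maintained in this step can be sketched as a small Python registry. Only the five group types come from the text; the names `GroupKind` and `Inventory`, the method names, and the assumption that a node may belong to more than one group (for example a physical rack and a logical group) are hypothetical.

```python
from __future__ import annotations

from enum import Enum


class GroupKind(Enum):
    """The five group types named in the text."""
    RACK = "Rack"
    ROW = "Row"
    ROOM = "Room"
    DATACENTER = "DataCenter"
    LOGICAL = "Logical"


class Inventory:
    """Hypothetical in-memory list of nodes and groups."""

    def __init__(self) -> None:
        self.nodes: set[str] = set()
        self.group_kinds: dict[str, GroupKind] = {}
        self.members: dict[str, set[str]] = {}

    def add_node(self, node: str) -> None:
        self.nodes.add(node)

    def create_group(self, name: str, kind: GroupKind) -> None:
        if name in self.group_kinds:
            raise ValueError(f"group already exists: {name}")
        self.group_kinds[name] = kind
        self.members[name] = set()

    def assign(self, node: str, group: str) -> None:
        # Assumption: a node may be a member of several groups at once.
        if node not in self.nodes:
            raise KeyError(f"unknown node: {node}")
        if group not in self.members:
            raise KeyError(f"unknown group: {group}")
        self.members[group].add(node)

    def remove_node(self, node: str) -> None:
        # Deleting a node also removes it from every group it belonged to.
        self.nodes.discard(node)
        for group_members in self.members.values():
            group_members.discard(node)


inventory = Inventory()
for name in ("node-1", "node-2", "node-3", "node-4"):
    inventory.add_node(name)
inventory.create_group("rack-A", GroupKind.RACK)
inventory.create_group("web-tier", GroupKind.LOGICAL)
for name in ("node-1", "node-2", "node-3", "node-4"):
    inventory.assign(name, "rack-A")
inventory.assign("node-1", "web-tier")
inventory.remove_node("node-4")
```

Storing membership per group, rather than a single group field per node, is what lets physical and logical groupings coexist in this sketch.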
5. Monitoring and alarming. Monitoring strategies and alarm strategies are configured on nodes, or in groups, on which an operating system has been deployed. They can be configured on a single node or on a group.
The cloud platform infrastructure supervision system of the present invention supports alarm items including CPU utilization, memory usage and air inlet temperature. When a managed node triggers an alarm condition, for example when the CPU utilization of a node is 80% and the alarm threshold set on that node is to alert when CPU utilization exceeds 50%, the system displays the alarm information and, if necessary, sends an SMS message to the relevant data center personnel.
6. Intelligent energy consumption control. The cloud platform infrastructure supervision system of the present invention can perform intelligent energy consumption control on individual nodes and on groups. For example, suppose the power consumption of a node exceeds 300 W while the user has previously specified that the node's power consumption should stay below 280 W. The intelligent power consumption control in the system then takes effect and keeps the node's power consumption below 280 W. For a group of servers, the system can configure a power consumption strategy on the group directly, for example setting a power consumption threshold of 2000 W for a group of servers containing 8 nodes. The system then automatically distributes an appropriate share of power to each node in the group according to business demand, without affecting that demand, so as to keep the load balanced.
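The group example above (8 nodes sharing a 2000 W budget) can be made concrete with a short Python sketch. The patent does not state how the budget is divided, so the proportional scaling used here is an assumption chosen for illustration, as are the function name `allocate_power` and the per-node demand figures.

```python
from __future__ import annotations


def allocate_power(budget_w: float, demand_w: dict[str, float]) -> dict[str, float]:
    """Split a group power budget across nodes in proportion to their demand.

    Assumption: if total demand fits within the budget, every node simply
    receives its demand; otherwise all demands are scaled by the same factor
    so that the caps add up to the budget.
    """
    total = sum(demand_w.values())
    if total <= budget_w:
        return dict(demand_w)
    scale = budget_w / total
    return {node: watts * scale for node, watts in demand_w.items()}


# Hypothetical demand: four busy nodes at 350 W and four lighter ones at 275 W,
# 2500 W in total against the 2000 W group threshold from the text.
demand = {f"node-{i}": 350.0 for i in range(1, 5)}
demand.update({f"node-{i}": 275.0 for i in range(5, 9)})

caps = allocate_power(2000.0, demand)
for node, cap in caps.items():
    print(f"{node}: {cap:.0f} W")
```

With these figures the scale factor is 2000 / 2500 = 0.8, so the busier nodes are capped at about 280 W and the lighter ones at about 220 W. Proportional scaling keeps each node's share of the budget equal to its share of the demand, which is one simple reading of distributing power "according to business demand".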
By adopting the technical solution disclosed above, the present invention obtains the following beneficial effects:
The cloud platform infrastructure supervision system of the present invention frees staff from manual system deployment: once a new machine has been racked, a system can be deployed to it remotely and quickly, with no manual involvement in the process, which greatly reduces the consumption of human resources. The invention also manages the servers in a data center efficiently, can effectively control energy consumption and keep server load balanced, and can monitor operating conditions and raise alarms, which helps prevent failures and improves the reliability of the data center as a whole.
The above is only a preferred embodiment of the present invention. It should be noted that a person of ordinary skill in the art may make various improvements and modifications without departing from the principle of the present invention, and such improvements and modifications shall also be regarded as falling within the protection scope of the present invention.
Claims (1)
- 1. A cloud platform infrastructure supervision system, characterized by comprising:
  a node and group management system for creating and maintaining a list of nodes and groups, specifically: adding automatically discovered nodes and groups to the list, and also adding manually created groups to the list; the node and group management system comprises a group management module, a node management module and an automatic node discovery module; the group management module is used to create, maintain and manage groups; the node management module is used to manage each node or server, the management including adding or deleting a node or server and adjusting the group to which a node or server belongs; the automatic node discovery module is used to automatically discover a new server or new node according to the IPMI protocol after the IPMI port of the new server or new node has been connected to a switch;
  an operating system deployment system for installing an operating system on nodes and groups newly added to the list, and also for installing an operating system on nodes and groups that are in the list and already have an operating system installed but need that operating system replaced or updated;
  a node and group information monitoring system for monitoring, in real time, the software and hardware configuration information and operating status of the nodes and groups in the list; the node and group information monitoring system comprises a node monitoring module, a group monitoring module, a node information acquisition module and a node remote control module; the node monitoring module is used to monitor, in real time, the performance information, storage state, energy consumption state and alarm information of a node; the group monitoring module is used to monitor, in real time, changes to a group and the operating system type and operating status of the member nodes in the group; the node information acquisition module is used to obtain the information of a node from the underlying (low-level) interface and display it, the information of the node including CPU information, memory information, mainboard BIOS information, fan information, air inlet temperature and network information; the node remote control module is used to perform remote power on/off and remote restart operations on a node;
  a server alarm system for configuring and executing monitoring strategies and alarm strategies on nodes and groups on which an operating system has been deployed; the server alarm system comprises an alarm strategy configuration module and an alarm strategy execution module; the alarm strategy configuration module is used to configure monitoring strategies and alarm strategies on a node, or in a group, on which an operating system has been deployed; the alarm strategy execution module is used to execute the monitoring strategies and the alarm strategies; the monitoring strategies include monitoring CPU utilization, memory usage and air inlet temperature; the alarm strategies include determining whether the CPU utilization and/or memory usage and/or air inlet temperature has reached a predetermined threshold, and sending an alarm if the threshold is met or exceeded;
  a server intelligent energy consumption control system for performing intelligent energy consumption control on each node and each group; the server intelligent energy consumption control system comprises an intelligent power consumption strategy configuration module and an intelligent power consumption control module; the intelligent power consumption strategy configuration module is used to create and maintain power consumption strategies for use by the intelligent power consumption control module; the intelligent power consumption control module is used to monitor, in real time and according to the power consumption strategies, the power consumption of the nodes and groups, and, when the power consumption of a node meets or exceeds a predetermined threshold, to control the node to reduce its load so that its power consumption falls below the predetermined threshold; the intelligent power consumption control module is also used to dynamically distribute the total power consumption assigned to a server group in the power consumption strategy among the servers or nodes in that group, so as to keep the load on the servers or nodes in the group balanced;
  a log management system for storing and backing up the logs on the nodes and groups, and also for storing and backing up the running logs and user operation logs of the cloud platform infrastructure supervision system; it also provides a log query service; the log management system comprises a DCM log recording module, an operation log recording module, an alarm log recording module and a log query module; the DCM log recording module is used to extract and store logs through the data center management interface provided by Intel DCM; the operation log recording module is used to log all operations performed by users of the cloud platform infrastructure supervision system; the alarm log recording module is used to record all alarm information; the log query module is used to provide users with a query service over all logs;
  wherein, when the operating system deployment system deploys an operating system, the underlying deployment is performed by a Windows Server deployment server;
  and wherein the node is a single server or virtual machine, and the group comprises two or more servers and/or two or more nodes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310198963.2A CN104184604B (en) | 2013-05-24 | 2013-05-24 | A kind of cloud platform architecture supervisory systems |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104184604A CN104184604A (en) | 2014-12-03 |
CN104184604B true CN104184604B (en) | 2018-05-01 |
Family
ID=51965367
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310198963.2A Active CN104184604B (en) | 2013-05-24 | 2013-05-24 | A kind of cloud platform architecture supervisory systems |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104184604B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104883273B (en) * | 2015-05-05 | 2018-04-27 | 广州杰赛科技股份有限公司 | The processing method and system of service impact model in virtualization services management platform |
CN105183520B (en) * | 2015-09-21 | 2019-01-15 | 赵伟 | Computer software remote automation Method of Adjustment and system |
CN105553721A (en) * | 2015-12-15 | 2016-05-04 | 浪潮电子信息产业股份有限公司 | Cloud application stretching method, application management side and system |
CN108075920B (en) * | 2016-11-14 | 2019-02-05 | 视联动力信息技术股份有限公司 | A kind of management method and system regarding networked terminals |
CN106850295A (en) * | 2017-02-04 | 2017-06-13 | 郑州云海信息技术有限公司 | A kind of log collection monitoring method of privatization cloud platform |
CN107612748B (en) * | 2017-10-13 | 2021-03-09 | 苏州浪潮智能科技有限公司 | Multi-node server power consumption management system |
CN109728938A (en) * | 2018-12-11 | 2019-05-07 | 国云科技股份有限公司 | A kind of method of assessment system service level |
CN116450464B (en) * | 2023-06-13 | 2023-08-25 | 浙江睿数云联科技有限公司 | Operation and maintenance management method, system and equipment |
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1620020A (en) * | 2003-11-20 | 2005-05-25 | 国际商业机器公司 | Automatic configuration of the network devices via connection to specific switch ports |
CN201467145U (en) * | 2009-05-26 | 2010-05-12 | 深圳市汉普电子技术开发有限公司 | Remote management system and control device |
Also Published As
Publication number | Publication date |
---|---|
CN104184604A (en) | 2014-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104184604B (en) | A kind of cloud platform architecture supervisory systems | |
US8838286B2 (en) | Rack-level modular server and storage framework | |
CN107070726A (en) | A kind of integrated management approach based on MDC | |
CN104615112B (en) | Resource and environmental monitoring early warning system under network environment | |
CN102480749B (en) | Method, device and system for remotely collecting host process information | |
CN104023068B (en) | A kind of method that Passive Mode elastic calculation scheduling of resource is realized in load balancing | |
CN103543718A (en) | Internet of Things based intelligent IDC (Internet data center) computer room monitoring system | |
CN104463492A (en) | Operation management method of electric power system cloud simulation platform | |
CN201623722U (en) | Supervising platform for running and maintaining information security of electric power secondary system | |
CN107612748A (en) | A kind of multi node server power consumption management system | |
CN106095562A (en) | The method and apparatus of Portable Batch System | |
CN105893211A (en) | Method and system for monitoring | |
CN108282540A (en) | A kind of subway monitoring system and its monitoring method | |
CN104601673B (en) | Extensible high-availability server layered monitoring system | |
CN212183550U (en) | Novel urban rail transit integrated monitoring system based on cloud platform | |
CN103701889A (en) | Data center energy saving method on basis of cloud computing | |
CN107018026A (en) | Automate main website network monitoring system | |
CN107203255A (en) | Power-economizing method and device are migrated in a kind of network function virtualized environment | |
CN108877160A (en) | A kind of monitoring device and monitoring method of well lid | |
CN107360045A (en) | The monitoring method and device of a kind of storage cluster system | |
CN101860024A (en) | Implementation method for integrating provincial dispatch organization PAS system and local-level dispatch organization PAS systems in electric power system | |
CN106452966A (en) | Multi-gateway management realization method for OpenStack cloud desktop | |
CN108809702A (en) | A kind of device management method and device management platform | |
CN206096878U (en) | Machine room monitoring equipment | |
CN104601378A (en) | Virtual resource flexible scheduling implementation method combining application performance indicator monitoring data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||