CN108121639A - A design method for a cloud-platform-based data center integrated management system - Google Patents

A design method for a cloud-platform-based data center integrated management system

Info

Publication number
CN108121639A
Authority
CN
China
Prior art keywords
data
monitoring
management system
data center
total management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711395908.7A
Other languages
Chinese (zh)
Inventor
李俊山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201711395908.7A
Publication of CN108121639A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3003 Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3055 Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/32 Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/323 Visualisation of programs or trace data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/182 Distributed file systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer And Data Communications (AREA)

Abstract

A design method for a cloud-platform-based data center integrated management system, and the system itself, are proposed. The system supports automated management: it derives rules from historical statistics and uses them for operations such as automatic threshold setting. The monitoring agent supports a comparatively rich set of data acquisition modes that cover both IT equipment data and power and environment monitoring data, so that data center management is brought under a single designed integrated service framework, yielding an efficient and stable data center integrated management system.

Description

A design method for a cloud-platform-based data center integrated management system
Technical field
The present invention relates to the field of information technology, and more particularly to a design method for a cloud-platform-based data center integrated management system and to the data center integrated management system itself.
Background art
A modular data center (Module Data Center, MDC) is a new-generation data center deployment form based on cloud computing. To cope with the trend toward cloud computing, virtualization, centralization, and high-density servers, it adopts a modular design concept that minimizes the coupling between the infrastructure and the machine room environment. It integrates subsystems such as power supply and distribution, cooling, cabinets, airflow containment, structured cabling, and power and environment monitoring, which improves the overall operating efficiency of the data center and enables rapid deployment, elastic expansion, and green energy saving.
With the rapid development of the big data industry, data center development has also entered a new stage. The management system is an important part of a data center's internal configuration. A traditional management system is mainly based on power and environment monitoring and provides a variety of data interfaces through which it can access monitored objects such as UPSs, power distribution cabinets, precision air conditioners, access control systems, temperature and humidity sensors, smoke detectors, heat detectors, water leakage sensors, flip-up skylights, and network cameras.
The data center power and environment integrated management system mainly covers monitored objects such as UPSs, power distribution cabinets, precision air conditioners, access control systems, temperature and humidity sensors, smoke detectors, heat detectors, water leakage sensors, flip-up skylights, and network cameras. It is referred to below as the power and environment integrated management system, and its core device is the power and environment monitoring host.
The data center integrated management system mainly covers complete data center facility management, including IT equipment management (server management and virtualization management) and management of the power and environment integrated management system.
Currently, with the rapid development of cloud computing, big data, and the internet, information infrastructure has changed fundamentally, and monitoring and management requirements have shifted from the needs of individual systems to the need for an integrated, unified, and centrally managed platform. An application server is no longer an isolated computing module; instead, platforms such as cloud computing and big data pool computing and storage resources into large, uniformly monitored and managed resource pools that span data centers. A unified data center integrated management system is therefore needed that can monitor large-scale, distributed, cross-region virtual and physical resources.
Data center infrastructure is the core of the cloud computing architecture. It offers users computing resources including CPU, memory, storage, and network, which effectively reduces the cost and complexity of IT operations and maintenance. The cloud computing architecture is distributed, spans networks, and involves many resource types, which poses unprecedented challenges for resource management. Compared with a traditional server cluster architecture, it requires not only the management of physical resources such as Web servers and application servers but also the unified management of virtual resources such as CPU, memory, storage, network, and virtual machines.
Effective management of resources and services is a core requirement in cloud computing delivery. By layer of the cloud computing architecture, management can be divided into hardware platform management, virtualization platform management, middleware management, application management, and so on; by managed object, it can be divided into user management, storage management, network management, compute management, and so on.
Currently, with the rapid development of cloud computing technology, the number of cloud data centers keeps growing. A cloud data center contains a large number of servers, so monitoring the data center and the running state of its servers has become a very important part of the cloud platform. Efficient real-time monitoring helps keep the cloud platform stable and improves the availability of cloud services. With so many servers in today's cloud data centers, a traditional data center integrated management system struggles to guarantee real-time, efficient monitoring. Research into a real-time, efficient, low-overhead server data center integrated management system is therefore still very worthwhile.
A cloud computing platform includes not only the storage, compute, and network resources of its servers but also the various software resources on the platform and the more complex resources of a distributed environment. In addition, because resources are numerous and the network environment is complex, monitoring software must both cope with the large volume of monitoring data it generates and complete monitoring operations efficiently. Under this environment and these requirements, traditional monitoring programs have difficulty managing the running state of data center servers efficiently and comprehensively.
At present, the architecture of a server data center integrated management system generally follows one of two models: centralized or hierarchical.
A centralized architecture consists of two parts, the data center integrated management system server and the monitoring agents. In a centralized data center integrated management system, monitoring agents are distributed on every node that needs monitoring. Each agent collects the monitoring data of the resources it monitors, transmits that data to the data center integrated management system server, receives control instructions from the server, and acts on them. The data center integrated management system server runs on one specific machine; it receives the data sent by the monitoring agents, analyzes, processes, stores, and displays that data, and dynamically configures the monitoring agents.
A centralized architecture suffers from a single point of failure and from slow response when there are many nodes. The hierarchical architecture emerged to make up for these shortcomings. In a hierarchical architecture, monitoring agents are divided hierarchically into several groups; each group contains several monitoring nodes, one of which has the capacity to handle the affairs of the group.
Each group is equivalent to a centralized data center integrated management system environment: the local monitoring node plays the role of the data center integrated management system server, and a global monitoring node is responsible for monitoring each local monitoring node. This structure avoids large amounts of monitoring data being sent to one server at the same time and reduces network load. Although a single point of failure can still occur inside a group, faults are isolated between groups.
Compared with a hierarchical data center integrated management system, a centralized one has a simpler structure and lower latency and is easier to manage and deploy. However, if the machine hosting the data center integrated management system server fails, the entire system is paralyzed. In addition, when too many nodes need monitoring, the large volume of monitoring data transmitted over the network can cause slow system response and high network usage, which affects the operating efficiency of the distributed system and reduces the real-time performance of the data center integrated management system.
When there are many nodes, the hierarchical structure solves the slow response of the centralized model, but hierarchical programs are harder to deploy and implement, and because information is passed up through the layers, system latency is higher and real-time performance is worse.
In short, the centralized monitoring architecture stores monitoring data on a single node, the centralized data center integrated management system server, so a single point of failure may occur, and the transmission of large amounts of monitoring data may congest the network. The hierarchical architecture addresses the single-point-of-failure problem, but because of its layered structure, data for a given node must be passed layer by layer, which reduces access efficiency and makes deployment more complex.
The present invention uses the following technical terms:
MDC: modular data center (Module Data Center)
SOA: service-oriented architecture (Service-Oriented Architecture)
API: application programming interface
Summary of the invention
To solve the technical problems described above, the present invention proposes a design method for a cloud-platform-based data center integrated management system. The invention defines the design method itself, a distributed service layer framework, a monitoring agent module, a data collector function module, and the functional methods of the data center integrated management system server.
Specifically, the present invention proposes a design method for a cloud-platform-based data center integrated management system, comprising the following steps:
Step 1, design of the protocol connector: the protocol connector is divided into a connector server end and a connector client. Each connector client holds a remote view of the connector server end; the connector server end accepts connection requests from the various connector clients and processes them;
Step 2, design of the monitoring agent: the monitoring agent is the part of the data center integrated management system that carries out monitoring. Agents are distributed across the data center monitoring hosts and the server cluster, are responsible for collecting monitoring data, checking thresholds, and pushing data, and also perform the corresponding operations according to the instructions they receive;
Step 3, design of the data collector: the data collector collects the information sent by large numbers of monitoring agents and calls the database to store it in the cloud platform database. Data collectors are distributed on some of the nodes of the data center integrated management system and collect the monitoring information of their assigned monitoring agents;
Step 4, design of the data center integrated management system server: the server is the management center, monitoring center, and data processing center of the system and performs the following management functions: threshold-crossing management, data processing management, and system configuration management;
Step 5, design of the Web server: the Web server is the interface through which users interact with the data center integrated management system. It provides the interface for system configuration and must implement login authentication and data display;
Step 6, design of data storage: the data center integrated management system uses an HBase database for data storage, holding the system's monitoring parameters, monitoring data, and processed statistics.
Preferably, the protocol connector is an RMI-based protocol connector; the connector server end is responsible for establishing the RMI connector server, and the connector client is a Web browser;
Preferably, the data collected by the monitoring agent include, but are not limited to, real-time CPU load, memory usage, current disk usage, network utilization, disk read/write rate, and system information; the monitoring agent creates a separate entity class for each kind of monitored resource to hold its monitoring data and allows the data center integrated management system server to obtain the corresponding monitoring data;
Preferably, when a monitoring agent starts, it starts two threads: the server's data collection thread GatherThread and the thread ServerThread, which establishes the JMX server. After the collection thread starts, GatherThread collects the relevant data of the virtual machine or power and environment monitoring host on which it runs and compares the collected data with the thresholds. When a threshold is crossed, or when the transmission mode is push, the monitoring agent sends a JMX notification to inform the data collectors registered with it of its current state;
Preferably, the data center integrated management system server specifies which monitoring agent nodes each data collector collects from. When the system starts, the server uses JMX to distribute the registered monitoring agents evenly among the data collectors, and each data collector polls the monitoring agents in the list it has received for their data;
Preferably, the data center integrated management system server communicates with the data collectors at regular intervals. When the monitoring node hosting a data collector goes down, the server withdraws the monitoring agent information assigned to that data collector and reassigns it to data collectors that are still active;
Preferably, the threshold-crossing management performed by the data center integrated management system server is split into two parts, the data center integrated management system server side and the monitoring agent side. The monitoring agent side performs preliminary threshold detection and pushes monitoring data that crosses a threshold to the data center integrated management system server. After obtaining the monitoring information, the server parses the monitoring data, extracts the data items that exceed their thresholds, composes the alarm content, and promptly notifies administrators and users by web page notification or e-mail;
Preferably, the data center integrated management system server uses multiple threads to implement its functions: the CountThread thread performs scheduled statistical processing of monitoring data; the RegisterListener thread registers listeners with the monitoring agents and data collectors, and when a notification message is sent, the corresponding handler class processes it; the ServerThread thread implements the JMX server end and provides the connection interface for the Web server;
Preferably, the data center integrated management system reads data from the database in units of monitoring agents, and monitoring data is divided by type into three kinds: real-time monitoring data, monitoring parameters, and processed monitoring data.
Correspondingly, the present invention also proposes a cloud-platform-based data center integrated management system, implemented on a cloud computing platform, the system comprising:
A protocol connector: the protocol connector comprises a connector server end and a connector client. Each connector client holds a remote view of the connector server end; the connector server end accepts connection requests from the various connector clients and processes them;
At least one monitoring agent: the monitoring agent is the part of the data center integrated management system that carries out monitoring. Agents are distributed across the data center monitoring hosts and the server cluster, are responsible for collecting monitoring data, checking thresholds, and pushing data, and also perform the corresponding operations according to the instructions they receive;
At least one data collector, which collects the information sent by large numbers of monitoring agents and calls the database to store it in the cloud platform database; the at least one data collector is distributed on some of the nodes of the data center integrated management system and collects the monitoring information of its assigned monitoring agents;
A data center integrated management system server, which serves as the management center, monitoring center, and data processing center of the system and performs the following management functions: threshold-crossing management, data processing management, and system configuration management;
A Web server, which serves as the interface through which users interact with the data center integrated management system, provides the interface for system configuration, and implements login authentication and data display;
An HBase database, which is responsible for data storage and holds the monitoring parameters, monitoring data, and processed statistics of the data center integrated management system.
The system and method proposed above facilitate the unified management of a data center, especially in scenarios where physical and virtual resources must be monitored at the same time. Deploying the designed data center integrated management system on the cloud platform addresses the problem of data center integrated management system server failure and improves management stability. The method is also a useful reference for research on similar high-availability and unified management systems.
Brief description of the drawings
Fig. 1 is a schematic workflow diagram of the design method in an embodiment of the present invention;
Fig. 2 is a system architecture diagram in an embodiment of the present invention.
Detailed description of the embodiments
To describe the technical solutions in the embodiments of the present invention more clearly, the accompanying drawings needed for the embodiments are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
The present invention proposes a design method for a cloud-platform-based data center integrated management system. Referring to the method flowchart shown in Fig. 1, the method comprises:
Step 1, design of the protocol connector: the protocol connector is divided into a connector server end and a connector client. Each connector client holds a remote view of the connector server end; the connector server end accepts connection requests from the various connector clients and processes them;
The design of the following parts is described below: the protocol connector of the distributed service layer of the JMX-based data center integrated management system; the data center integrated management system server and the server within the management system; the monitoring agent and data collector of the agent layer; and data storage.
The distributed service layer is the part of the JMX specification that implements distributed management of programs, and it mainly comprises the protocol connector. The protocol connector consists of an agent end, a data collector end, and a server end; its main job is to make remote operations transparent and to provide a unified view of the services on different servers.
The data center integrated management system server is a JMX management client. Monitoring agent or data collector programs running on remote servers cannot interoperate with the server directly and must use a connection protocol; the connector server end and the connector client must use the same transport protocol.
The JMX distributed service layer includes protocol connectors based on the Simple Network Management Protocol (SNMP), on the Internet Inter-ORB Protocol (IIOP), on HyperText Markup Language (HTML), on the HyperText Transfer Protocol (HTTP), and on remote method invocation (RMI). The working process of a protocol connector is fully transparent. A protocol connector provides a point-to-point connection between a data collector and the data center integrated management system server, and each connector depends on a specific connection protocol.
The data center integrated management system supports browser access to monitoring agents and must also support a protocol by which the data center integrated management system server manages monitoring agents through remote method invocation. Implementing the distributed service layer mainly means implementing the protocol connector; this system uses an RMI-based protocol connector.
The distributed service layer is the part of the JMX specification that implements distributed management of programs, and it mainly comprises two parts: protocol connectors and protocol adaptors.
Protocol connector: consists of an agent end and a server end. Its main job is to make remote operations transparent and to provide a unified view of the services on different servers or power and environment monitoring hosts.
Protocol adaptor: has no client side; it runs on the server side and transmits data in a format that the client can understand. The main monitor, monitoring agents, and data collectors in the data center integrated management system all conform to the framework specification. The distributed service layer connections between the data center integrated management system server and the monitoring agents, and between the data collectors and the monitoring agents, are created with connectors. A connector is divided into a connector server end and a connector client, and calls between the server end and the client are transparent to the user. After the connector client connects to an agent through the remote method invocation connector, it obtains an interface identical to that of the connector server end and invokes agent-layer applications through this interface.
A data collector can obtain a monitoring agent's monitoring data through mechanisms such as remote procedure calls; the data center integrated management system server can set the monitoring parameters of a monitoring agent; and, when necessary, a monitoring agent can actively send notifications to the data center integrated management system server, for example to report a change in the state of a managed resource.
The protocol connector in this embodiment is divided into a connector server end and a connector client, and each connector client generally holds a remote view of the connector server end. The connector server end accepts connection requests from the various connector clients and processes them through the server. Once the connection between the two is established, the remote monitoring agent becomes transparent to the monitoring server.
For the RMI-based protocol connector, the connector server end is responsible for establishing the RMI connector server, and the connector client is a Web browser. Implementing the HTTP-based protocol connector therefore mainly means implementing the connector server end, in the following steps:
First, create an MBean server to serve as the container for MBeans. Second, create an ObjectName object that specifies the name and port of the MBean to be registered, and register the MBean in the MBean server. Third, create a protocol connector object and specify that the MBean's management interface is an HTML-type interface. Fourth, register the protocol connector object in the MBean server as well, start the connector, and wait for HTTP connections.
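The four steps above can be sketched with the standard `javax.management` API. This is only an illustration under stated assumptions: the HTML adaptor named in the text is not part of the standard JDK, so the sketch substitutes the standard RMI connector server; the class names `AgentConnectorSketch`, `HostStatus`, and `HostStatusMBean`, the object names, and the placeholder CPU value are all hypothetical.

```java
import javax.management.MBeanServer;
import javax.management.MBeanServerFactory;
import javax.management.ObjectName;
import javax.management.remote.JMXConnectorServer;
import javax.management.remote.JMXConnectorServerFactory;
import javax.management.remote.JMXServiceURL;

class AgentConnectorSketch {
    /** Management interface; a standard MBean interface must be public and named <Class>MBean. */
    public interface HostStatusMBean {
        double getCpuLoad();
    }

    /** Hypothetical managed resource exposing one read-only attribute, CpuLoad. */
    public static class HostStatus implements HostStatusMBean {
        @Override
        public double getCpuLoad() {
            return 0.42; // placeholder value, not a real measurement
        }
    }

    static final String AGENT_NAME = "monitor:type=HostStatus"; // hypothetical object name

    /** Steps 1 and 2: create the MBean server and register the MBean under an ObjectName. */
    static MBeanServer createAndRegister() throws Exception {
        MBeanServer mbs = MBeanServerFactory.createMBeanServer();
        mbs.registerMBean(new HostStatus(), new ObjectName(AGENT_NAME));
        return mbs;
    }

    /** Steps 3 and 4: create a connector server, register it in the MBean server, and start it. */
    static JMXConnectorServer startRmiConnector(MBeanServer mbs) throws Exception {
        // With this URL the RMI stub is encoded in the address, so no RMI registry is needed.
        JMXServiceURL url = new JMXServiceURL("service:jmx:rmi://");
        JMXConnectorServer cs = JMXConnectorServerFactory.newJMXConnectorServer(url, null, mbs);
        mbs.registerMBean(cs, new ObjectName("connectors:protocol=rmi"));
        cs.start(); // clients can now connect to cs.getAddress()
        return cs;
    }
}
```

A client would pass the address returned by `cs.getAddress()` to `JMXConnectorFactory.connect` to obtain its remote view of the agent.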
JMX offers two ways to create an MBean server. One uses the MBeanServerFactory factory class, which exists in every JMX implementation and contains the methods for creating MBean servers. To create an MBean server, call the createMBeanServer method provided by the factory class; it returns a new MBean server. When the server is no longer needed, call releaseMBeanServer to release it.
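As a minimal sketch of the factory-based lifecycle described above, the following code creates an MBean server, confirms that the factory keeps a reference to it, and then releases it. The class name `MBeanServerLifecycleSketch` and the default domain `"monitor"` are assumptions made for illustration.

```java
import javax.management.MBeanServer;
import javax.management.MBeanServerFactory;

class MBeanServerLifecycleSketch {
    /** Creates a server, checks that the factory tracks it, then releases it again. */
    static boolean createAndRelease() {
        // "monitor" is a hypothetical default domain for this server's object names.
        MBeanServer mbs = MBeanServerFactory.createMBeanServer("monitor");

        // createMBeanServer keeps a reference, so findMBeanServer(null) lists the new server.
        boolean tracked = MBeanServerFactory.findMBeanServer(null).contains(mbs);

        // Drop the factory's reference so the server can be garbage collected.
        MBeanServerFactory.releaseMBeanServer(mbs);
        boolean released = !MBeanServerFactory.findMBeanServer(null).contains(mbs);

        return tracked && released;
    }
}
```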
Step 2, design of the monitoring agent: the monitoring agent is the part of the data center integrated management system that carries out monitoring. Agents are distributed across the data center monitoring hosts and the server cluster, are responsible for collecting monitoring data, checking thresholds, and pushing data, and also perform the corresponding operations according to the instructions they receive;
The monitoring agent is the part of the data center integrated management system that carries out monitoring. Agents are distributed across the data center monitoring hosts and the server cluster, are responsible for collecting monitoring data, checking thresholds, and pushing data, and also perform the corresponding operations according to instructions from the monitoring data center and the server. The monitoring agent is modular: its function modules are independent of one another and use lower-layer APIs to implement their respective tasks.
The data center integrated management system is aimed at monitoring the running state of virtual machine servers in the cloud computing platform. Servers in the platform run different operating systems; by using a third-party cross-platform collection tool, the system can collect data from most operating systems in the platform. The monitoring agent collects the data items common to these operating systems, such as real-time CPU load, memory usage, current disk usage, network utilization, disk read/write rate, and system information.
The monitoring agent creates a separate entity class for each kind of monitored resource to hold its monitoring data, and adds get and set methods for the attributes of each entity so that the data center integrated management system server can obtain the corresponding monitoring data through these methods.
When a monitoring agent starts, it starts two threads: the server's data collection thread GatherThread and the thread ServerThread, which establishes the JMX server. After the collection thread starts, GatherThread collects the relevant data of the virtual machine or power and environment monitoring host on which it runs and compares the collected data with the thresholds. When a threshold is crossed, or when the transmission mode is push, the monitoring agent sends a JMX notification to inform the monitors registered with it of its current state.
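The threshold check and notification described above might look roughly like the following, using the standard `NotificationBroadcasterSupport` class. The class name `ThresholdAgentSketch`, the notification type strings, the single CPU threshold, and the sampler passed to the collection thread are assumptions for illustration; the patent does not specify them.

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.DoubleSupplier;
import javax.management.Notification;
import javax.management.NotificationBroadcasterSupport;

class ThresholdAgentSketch extends NotificationBroadcasterSupport {
    static final String TYPE_THRESHOLD = "monitor.threshold.exceeded"; // hypothetical type
    static final String TYPE_PUSH = "monitor.data.push";               // hypothetical type

    private final double cpuThreshold;
    private final boolean pushMode;
    private final AtomicLong sequence = new AtomicLong();

    ThresholdAgentSketch(double cpuThreshold, boolean pushMode) {
        this.cpuThreshold = cpuThreshold;
        this.pushMode = pushMode;
    }

    /** One pass of the collection loop; returns true if a notification was sent. */
    boolean onSample(double cpuLoad) {
        boolean exceeded = cpuLoad > cpuThreshold;
        if (!exceeded && !pushMode) {
            return false; // poll mode and within the threshold: stay silent
        }
        String type = exceeded ? TYPE_THRESHOLD : TYPE_PUSH;
        sendNotification(new Notification(type, this, sequence.incrementAndGet(),
                "cpuLoad=" + cpuLoad));
        return true;
    }

    /** Stand-in for GatherThread: samples periodically until interrupted. */
    Thread startGatherThread(DoubleSupplier sampler, long periodMillis) {
        Thread t = new Thread(() -> {
            try {
                while (!Thread.currentThread().isInterrupted()) {
                    onSample(sampler.getAsDouble());
                    Thread.sleep(periodMillis);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // exit quietly on shutdown
            }
        }, "GatherThread");
        t.setDaemon(true);
        t.start();
        return t;
    }
}
```

A data collector or monitor would subscribe with `addNotificationListener` and inspect the notification type to distinguish threshold alarms from routine pushes.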
Step 3, design of the data collector: the data collector is responsible for collecting the information transmitted by a large number of monitoring agents and for calling the database to store it in the cloud platform database; data collectors are distributed on some of the nodes of the data center's total management system and collect the monitoring information of the monitoring agents assigned to them;
Because the cloud platform database is better suited to concurrent writes of large volumes of data than to frequent writes of many small items, the data center's total management system includes a data collector, which is responsible for collecting the information transmitted by a large number of monitoring agents and for calling the database to store it in the cloud platform database.
Data collectors are distributed on some of the nodes of the system and collect the monitoring information of the monitoring agents assigned to them. Which monitoring agent nodes a data collector collects from is specified by the data center's total management system server. When the system starts, the server uses JMX to distribute the registered monitoring agents as evenly as possible among the data collectors; each data collector then polls the monitoring agents on the list it has received for their data.
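One simple way to distribute agents "as evenly as possible" is round-robin assignment. The patent does not say which scheme the server uses, so the following Python sketch is only one plausible reading; the function name and the agent and collector identifiers are hypothetical.

```python
from typing import Dict, List, Sequence


def assign_agents(
    agents: Sequence[str], collectors: Sequence[str]
) -> Dict[str, List[str]]:
    """Distribute agents over collectors round-robin.

    The resulting list sizes differ by at most one.
    """
    if not collectors:
        raise ValueError("at least one data collector is required")
    assignment: Dict[str, List[str]] = {c: [] for c in collectors}
    for index, agent in enumerate(agents):
        target = collectors[index % len(collectors)]
        assignment[target].append(agent)
    return assignment
```

For example, five agents spread over two collectors yield lists of three and two agents.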
The data center's total management system server communicates with the data collectors at regular intervals. When the node hosting a data collector goes down, the server withdraws the monitoring agent information that was assigned to that data collector and reassigns it to the data collectors that are still active.
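The withdraw-and-reassign step might look like the sketch below. The least-loaded selection rule and all names are assumptions for illustration; the patent only states that the agents of a failed collector are handed to active collectors.

```python
from typing import Dict, List


def reassign_on_failure(
    assignment: Dict[str, List[str]], failed: str
) -> Dict[str, List[str]]:
    """Return a new assignment with the failed collector's agents moved.

    Each orphaned agent goes to the surviving collector that currently
    has the fewest agents. The input mapping is left unmodified.
    """
    orphaned = list(assignment.get(failed, []))
    survivors = {
        collector: list(agents)
        for collector, agents in assignment.items()
        if collector != failed
    }
    if orphaned and not survivors:
        raise RuntimeError("no active data collector left to take over")
    for agent in orphaned:
        target = min(survivors, key=lambda c: len(survivors[c]))
        survivors[target].append(agent)
    return survivors
```

Detecting the failure itself (the periodic communication) is left out; in practice it could be a heartbeat with a timeout.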
In the JMX framework, the data collector accesses the monitoring agents through a connector (Connector) and periodically polls them for the latest information. The data center's total management system server can call a monitoring agent's methods in two ways: by remotely invoking the relevant methods of the monitoring agent to obtain the parameters, or by creating a proxy and reading the relevant parameters from it directly.
The data collector is responsible for collecting the monitoring data gathered by the monitoring agents assigned to it and for saving that data in the database. The data collector must therefore be configured with the address of the database so that it can access it. Like the monitoring agent, the data collector is implemented with two threads: a server thread and a data collection thread. After the data collection thread starts, it waits for the data center's total management system server to assign it the addresses of the monitoring agents it is to collect from. Once the data collector's monitoring agent list is not empty, the data collector polls the agents, accessing each one as a client to obtain its monitoring data.
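A single polling pass of the data collection thread can be modeled as below. The `fetch` and `store` callables stand in for the JMX client access and the database write, neither of which is specified in the source; skipping an unreachable agent is likewise an assumption.

```python
from typing import Callable, Dict, List


class DataCollector:
    """Illustrative model of one data collector's polling behavior."""

    def __init__(
        self,
        fetch: Callable[[str], Dict[str, float]],
        store: Callable[[str, Dict[str, float]], None],
    ):
        self.fetch = fetch    # stand-in for accessing an agent as a client
        self.store = store    # stand-in for the database write
        self.agents: List[str] = []

    def set_agents(self, addresses: List[str]) -> None:
        """Called when the server assigns agent addresses."""
        self.agents = list(addresses)

    def poll_once(self) -> int:
        """Poll every assigned agent once; return how many were saved."""
        saved = 0
        for address in self.agents:
            try:
                data = self.fetch(address)
            except ConnectionError:
                continue  # unreachable agent: retry on the next pass
            self.store(address, data)
            saved += 1
        return saved
```

With an empty agent list `poll_once` does nothing, which matches the collector waiting for the server to assign addresses before it begins polling.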
Step 4, design of the data center's total management system server: the server is the administrative center, monitoring center, and data processing center of the system, and performs the following management functions: threshold-violation management, data processing management, and system configuration management;
The data center's total management system server is the administrative center, monitoring center, and data processing center of the system, and is its most important part. The server has many tasks to complete; the following describes the design of its Map/Reduce-based data processing and its threshold handling.
Threshold-violation management is divided between the server side and the monitoring agent side. The monitoring agent side is responsible for preliminary threshold detection and pushes monitoring data that violates a threshold to the data center's total management system server. After the server obtains the monitoring information, it parses the monitoring data, identifies the data items that exceed their thresholds, composes the alarm content, and promptly notifies administrators and users by means such as web page notifications and e-mail.
After the data center's total management system server obtains monitoring data, the data collection module passes the monitoring information to the data processing module, which is responsible for operations such as threshold evaluation and data statistics. Once a data set has been gathered, each monitoring parameter in it is parsed, the threshold for that parameter is looked up in the threshold list, and the two values are compared to decide whether a threshold alarm must be sent. If an alarm is required, the module checks whether a corresponding preprocessing strategy exists. If one exists, the related event is triggered according to the strategy and the threshold alarm is sent; if not, the threshold alarm is sent directly. This continues until all monitored items have been compared.
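The server-side comparison loop described above can be sketched as follows. The function signature, the shape of the alarm tuples, and the treatment of thresholds as upper bounds are assumptions; the patent does not define what a preprocessing strategy looks like, so it is modeled here as a plain callable.

```python
from typing import Callable, Dict, List, Tuple

Alarm = Tuple[str, float, float]  # (item, observed value, threshold)


def evaluate_thresholds(
    sample: Dict[str, float],
    thresholds: Dict[str, float],
    strategies: Dict[str, Callable[[str, float], None]],
) -> List[Alarm]:
    """Compare every monitored item with its threshold.

    For each violation, run the item's preprocessing strategy if one is
    registered, then record an alarm. Items without a threshold are skipped.
    """
    alarms: List[Alarm] = []
    for item, value in sample.items():
        limit = thresholds.get(item)
        if limit is None or value <= limit:
            continue
        strategy = strategies.get(item)
        if strategy is not None:
            strategy(item, value)  # trigger the related event first
        alarms.append((item, value, limit))
    return alarms
```

The returned list is what a notification component would turn into web page notifications or e-mail.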
During data processing, the input is the set of monitoring data for a given server. Each monitoring record consists of a timestamp followed by JSON key-value pairs, in the following format:
Timestamp,{"item1":value1,"item2":value2,…,"itemN":valueN}
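A record in this format can be split into its timestamp and its JSON payload as sketched below. The timestamp encoding is not specified in the source, so the sketch keeps it as an opaque string; the function name and the sample values are hypothetical.

```python
import json
from typing import Dict, Tuple


def parse_record(line: str) -> Tuple[str, Dict[str, float]]:
    """Split 'Timestamp,{...}' into the timestamp and the item dictionary.

    Only the first comma separates the two parts; commas inside the JSON
    object are left to the JSON parser.
    """
    timestamp, payload = line.split(",", 1)
    return timestamp.strip(), json.loads(payload)
```

For instance, `parse_record('1513814400,{"cpu":12.5,"mem":63}')` yields the timestamp string and a dictionary with the two items.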
The Map/Reduce computation model operates on Map<Key,Value> pairs. In data processing, the incoming data is first decomposed by Map into sub-data of a prescribed format, and the sub-data is then aggregated by Reduce. In the first Map operation, the incoming key is null and the value is one monitoring record; in the Map output, the monitoring record is decomposed into pairs whose key is the monitoring item and whose value has the form {current timestamp, current data value}.
The Reduce operation runs after the Map operation completes: the Map results are reduced by monitoring data item, and the output is a set of statistics for each monitoring data item. After the Map/Reduce operations, the monitoring data of a given server or data center has been classified and aggregated, and further data processing can then be carried out along the time series.
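The Map and Reduce steps can be modeled in a few lines of single-process Python. This is not a Hadoop job, and the particular statistics computed in the reduce step (count, minimum, maximum, average) are assumptions, since the source only speaks of statistical results per monitoring item.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Record = Tuple[str, Dict[str, float]]   # (timestamp, {item: value})
Pair = Tuple[str, float]                # (timestamp, value)


def map_record(timestamp: str, values: Dict[str, float]) -> Iterator[Tuple[str, Pair]]:
    """Map: split one monitoring record into (item, (timestamp, value))."""
    for item, value in values.items():
        yield item, (timestamp, value)


def reduce_item(pairs: List[Pair]) -> Dict[str, float]:
    """Reduce: aggregate all (timestamp, value) pairs of one item."""
    values = [value for _, value in pairs]
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "avg": sum(values) / len(values),
    }


def run_map_reduce(records: Iterable[Record]) -> Dict[str, Dict[str, float]]:
    """Group the map output by item (the shuffle), then reduce each group."""
    grouped: Dict[str, List[Pair]] = defaultdict(list)
    for timestamp, values in records:
        for item, pair in map_record(timestamp, values):
            grouped[item].append(pair)
    return {item: reduce_item(pairs) for item, pairs in grouped.items()}
```

Because each pair keeps its timestamp, the grouped data can also be sorted by time for the subsequent time-series processing.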
The data center's total management system server is the most important part of the data center's total management system. It assigns each data collector the addresses of the monitoring agents it is to collect from, receives and processes notification messages from the monitoring agents, and is also responsible for the statistical processing of monitoring data. Implementing these functions requires multiple threads. The CountThread thread performs the periodic statistical processing of monitoring data. The RegisterListener thread registers listeners with the monitoring agents and data collectors; when a notification message is sent, the corresponding handler class processes it. The SeverThread thread implements the JMX server side and provides the connection interface for the Web server. When the server starts, it must also assign the monitoring agents to the data collectors so that each data collector can obtain the IP addresses of its monitoring agents and collect their data.
Step 5, design of the Web server: the Web server is the interface through which users interact with the data center's total management system; it provides an interface for system configuration and must implement login authentication and data display.
The Web server is the interface through which users interact with the data center's total management system. It must implement functions such as login authentication and data display, and must also provide an interface for system configuration.
In the Web server structure, the web pages form the presentation layer, which interacts with users. Request handling and data transmission form the business logic layer, which handles web page requests, defines the data transmission format, and calls the underlying API to fetch data from the database for display and to update the monitoring parameters stored in the database. The database operation layer manages the database connection and provides an API for the upper layers to call.
Step 6, design of the data storage: the data center's total management system stores its data in an HBase database, which holds the system's monitoring parameters, monitoring data, and processed statistics.
The data center's total management system stores its data in an HBase database, which holds the system's monitoring parameters, monitoring data, processed statistics, and so on.
The data center's total management system reads data from the database mainly on a per-agent basis, so the row key design focuses on the row keys of the monitoring data. Monitoring data is divided into three types: real-time monitoring data, monitoring parameters, and monitoring data that has undergone data processing. For real-time data, the row key combines each monitoring agent's unique AgentID with the acquisition time of the monitoring data in the form "AgentID_time", and the monitoring data is stored in the monitoring data column family. For monitoring parameters, which contain information such as thresholds, a single row key is sufficient. For processed monitoring data, the row key comprises the agent ID, the metric type, and the start time of the statistics period, in the form "AgentID_metric type_time".
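The two composite row key formats can be built with simple string formatting. The sketch assumes integer epoch-second timestamps and zero-pads them to ten digits so that keys of one agent sort in time order; both the time encoding and the padding are assumptions, since the source gives only the "AgentID_time" and "AgentID_metric type_time" patterns.

```python
def realtime_row_key(agent_id: str, collected_at: int) -> str:
    """Row key for real-time monitoring data: 'AgentID_time'."""
    return f"{agent_id}_{collected_at:010d}"


def stats_row_key(agent_id: str, metric_type: str, period_start: int) -> str:
    """Row key for processed data: 'AgentID_metricType_time'."""
    return f"{agent_id}_{metric_type}_{period_start:010d}"
```

Putting the AgentID first keeps all rows of one agent adjacent in a store that orders rows by key, such as HBase, which suits the per-agent access pattern described above.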
The key points of the design in the embodiment of the present invention are as follows. A well-reasoned design method is provided for a cloud-platform-based data center total management system. Deploying the designed system on a cloud platform solves the problem of failure of the data center's total management system server and improves the stability of management. The system supports automated management: rules are formulated from historical statistics to enable operations such as automatic threshold setting. The monitoring agents support a rich variety of data acquisition modes, covering both IT equipment data and power-and-environment monitoring data, so that the data center's total management system is fully contained within the designed integrated service framework, yielding an efficient and stable data center total management system.
Accordingly, the present invention also provides a cloud-platform-based data center total management system, implemented in a cloud computing platform. Referring to Fig. 2, the system includes:
A protocol connector: the protocol connector comprises a connector server end and connector clients; each connector client has a remote view of a connector server end, and the connector server end accepts connection requests from the various connector clients and processes them;
At least one monitoring agent: the monitoring agent is the part of the data center's total management system that carries out the monitoring; it is distributed across the data center's monitoring hosts and server cluster, is responsible for collecting monitoring data, checking thresholds, and pushing data, and also carries out the corresponding operations according to the instructions it receives;
At least one data collector, responsible for collecting the information transmitted by a large number of monitoring agents and for calling the database to store it in the cloud platform database; the at least one data collector is distributed on some of the nodes of the data center's total management system and collects the monitoring information of the monitoring agents assigned to it;
A data center total management system server which, as the administrative center, monitoring center, and data processing center of the system, performs the following management functions: threshold-violation management, data processing management, and system configuration management;
A Web server which, as the interface through which users interact with the data center's total management system, provides an interface for system configuration and implements login authentication and data display;
An HBase database, responsible for data storage, which holds the monitoring parameters, monitoring data, and processed statistics of the data center's total management system.
The method proposed by the invention facilitates the unified management of a data center, especially in scenarios where physical and virtual resources must be monitored at the same time. Deploying the designed data center total management system on a cloud platform solves the problem of failure of the data center's total management system server and improves the stability of management. The method also has reference value for research on similar systems, such as high-availability management and unified management systems.
The foregoing description of the disclosed embodiments enables those skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the present invention. Therefore, the present invention is not to be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A design method for a cloud-platform-based data center total management system, characterized by comprising the following steps:
Step 1, design of the protocol connector: the protocol connector is divided into a connector server end and connector clients; each connector client has a remote view of a connector server end, and the connector server end accepts connection requests from the various connector clients and processes them;
Step 2, design of the monitoring agent: the monitoring agent is the part of the data center's total management system that carries out the monitoring; it is distributed across the data center's monitoring hosts and server cluster, is responsible for collecting monitoring data, checking thresholds, and pushing data, and also carries out the corresponding operations according to the instructions it receives;
Step 3, design of the data collector: the data collector is responsible for collecting the information transmitted by a large number of monitoring agents and for calling the database to store it in the cloud platform database; data collectors are distributed on some of the nodes of the data center's total management system and collect the monitoring information of the monitoring agents assigned to them;
Step 4, design of the data center's total management system server: the server is the administrative center, monitoring center, and data processing center of the system, and performs the following management functions: threshold-violation management, data processing management, and system configuration management;
Step 5, design of the Web server: the Web server is the interface through which users interact with the data center's total management system; it provides an interface for system configuration and must implement login authentication and data display;
Step 6, design of the data storage: the data center's total management system stores its data in an HBase database, which holds the system's monitoring parameters, monitoring data, and processed statistics.
2. The design method according to claim 1, characterized in that the protocol connector is an RMI-based protocol connector, the connector server end is responsible for establishing the RMI connector server, and the connector client is a Web browser.
3. The design method according to claim 2, characterized in that the data collected by the monitoring agent includes, but is not limited to, real-time CPU load, memory usage, current disk usage, network utilization, disk read/write rate, and system information; the monitoring agent creates a separate entity class for each kind of monitored resource to hold its monitoring data, allowing the data center's total management system server to obtain the corresponding monitoring data.
4. The design method according to claim 3, characterized in that, when the monitoring agent starts, it starts two threads: the server's data acquisition thread GatherThread and the thread ServerThread that establishes the JMX server; once the acquisition thread is running, GatherThread collects the relevant data of the virtual machine or power-and-environment monitoring host on which it runs and compares the collected data with the thresholds; when a threshold is exceeded, or when the transmission mode is push, the monitoring agent sends a JMX notification that informs the data collector registered on it of its current state.
5. The design method according to claim 4, characterized in that which monitoring agent nodes a data collector collects from is specified by the data center's total management system server; when the system starts, the server uses JMX to distribute the registered monitoring agents evenly among the data collectors, and each data collector polls the monitoring agents on the list it has received for their data.
6. The design method according to claim 5, characterized in that the data center's total management system server communicates with the data collectors at regular intervals; when the node hosting a data collector goes down, the server withdraws the monitoring agent information that was assigned to that data collector and reassigns it to the data collectors that are still active.
7. The design method according to claim 6, characterized in that the threshold-violation management performed by the data center's total management system server is divided between the server side and the monitoring agent side; the monitoring agent side is responsible for preliminary threshold detection and pushes monitoring data that violates a threshold to the server; after the server obtains the monitoring information, it parses the monitoring data, identifies the data items that exceed their thresholds, composes the alarm content, and promptly notifies administrators and users by web page notification or e-mail.
8. The design method according to claim 7, characterized in that the data center's total management system server uses multiple threads to implement its functions: the CountThread thread performs the periodic statistical processing of monitoring data; the RegisterListener thread registers listeners with the monitoring agents and data collectors, and when a notification message is sent, the corresponding handler class processes it; the SeverThread thread implements the JMX server side and provides the connection interface for the Web server.
9. The design method according to claim 8, characterized in that the data center's total management system reads data from the database on a per-agent basis, and the monitoring data is divided into three types: real-time monitoring data, monitoring parameters, and monitoring data that has undergone data processing.
10. A cloud-platform-based data center total management system, implemented in a cloud computing platform, characterized in that the data center total management system includes:
A protocol connector: the protocol connector comprises a connector server end and connector clients; each connector client has a remote view of a connector server end, and the connector server end accepts connection requests from the various connector clients and processes them;
At least one monitoring agent: the monitoring agent is the part of the data center's total management system that carries out the monitoring; it is distributed across the data center's monitoring hosts and server cluster, is responsible for collecting monitoring data, checking thresholds, and pushing data, and also carries out the corresponding operations according to the instructions it receives;
At least one data collector, responsible for collecting the information transmitted by a large number of monitoring agents and for calling the database to store it in the cloud platform database; the at least one data collector is distributed on some of the nodes of the data center's total management system and collects the monitoring information of the monitoring agents assigned to it;
A data center total management system server which, as the administrative center, monitoring center, and data processing center of the system, performs the following management functions: threshold-violation management, data processing management, and system configuration management;
A Web server which, as the interface through which users interact with the data center's total management system, provides an interface for system configuration and implements login authentication and data display;
An HBase database, responsible for data storage, which holds the monitoring parameters, monitoring data, and processed statistics of the data center's total management system.
CN201711395908.7A 2017-12-21 2017-12-21 A kind of data center's total management system design method based on cloud platform Pending CN108121639A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711395908.7A CN108121639A (en) 2017-12-21 2017-12-21 A kind of data center's total management system design method based on cloud platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711395908.7A CN108121639A (en) 2017-12-21 2017-12-21 A kind of data center's total management system design method based on cloud platform

Publications (1)

Publication Number Publication Date
CN108121639A true CN108121639A (en) 2018-06-05

Family

ID=62230992

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711395908.7A Pending CN108121639A (en) 2017-12-21 2017-12-21 A kind of data center's total management system design method based on cloud platform

Country Status (1)

Country Link
CN (1) CN108121639A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109327335A (en) * 2018-10-07 2019-02-12 杭州安恒信息技术股份有限公司 A kind of cloud monitoring solution system and method
CN111800297A (en) * 2020-07-07 2020-10-20 浪潮云信息技术股份公司 Snmp-based intelligent monitoring method and system for cloud physical host
CN112104739A (en) * 2020-09-18 2020-12-18 江苏工程职业技术学院 Cloud computing remote data monitoring system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103559306A (en) * 2013-11-18 2014-02-05 电子科技大学 Query system and method for accessing data centers through cloud platform
CN107203454A (en) * 2017-05-23 2017-09-26 郑州云海信息技术有限公司 A kind of kernel internal memory monitoring method of power & environment supervision main frame

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
吴夫丹: "基于云平台的服务器监控系统设计", 《 中国优秀硕士学位论文全文数据库》 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180605