CN105871957B - Monitoring framework design method and monitoring server, agent unit, control server - Google Patents


Info

Publication number
CN105871957B
Authority
CN
China
Prior art keywords
monitoring
data
server
virtual machine
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510031593.2A
Other languages
Chinese (zh)
Other versions
CN105871957A (en)
Inventor
徐振佳
张丹枫
陈杰
冯亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tencent Computer Systems Co Ltd
Original Assignee
Shenzhen Tencent Computer Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Tencent Computer Systems Co Ltd
Priority to CN201510031593.2A
Publication of CN105871957A
Application granted
Publication of CN105871957B
Legal status: Active

Landscapes

  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

Embodiments of the invention disclose a monitoring framework design method, a monitoring server, an agent unit and a control server, which are used to realize a three-dimensional, comprehensive monitoring system. A monitoring framework design method according to an embodiment of the invention includes: a monitoring server obtains the monitoring data reported by the agent unit in each virtual machine, and obtains from a control server the monitoring data generated when the control server controls each virtual machine; the monitoring server stores the obtained monitoring data in a database; the monitoring server reads monitoring data from the database and performs anomaly analysis, raises an alarm for monitoring data in which an anomaly exists, and stores the anomalous monitoring data in the database.

Description

Monitoring framework design method and monitoring server, agent unit, control server
Technical field
The present invention relates to the field of computer technology, and in particular to a monitoring framework design method, a monitoring server, an agent unit and a control server.
Background art
A cloud computing infrastructure platform is a complex service platform characterized by diversity, heterogeneity and dynamic change. The normal operation of a cloud computing system depends on the support of a cloud monitoring system. The cloud monitoring system reflects the operating state of the cloud platform in real time and makes it possible to find and handle, in a timely manner, both problems that have already occurred on the cloud computing platform and potential ones, which is critical for managing and scheduling cloud computing system resources. How the monitoring framework is designed therefore plays a decisive role in the normal operation and maintenance of a cloud computing system. The prior art does not specify how a monitoring framework should be designed to meet the needs of a cloud computing system.
Summary of the invention
Embodiments of the invention provide a monitoring framework design method, a monitoring server, an agent unit and a control server, which are used to realize a three-dimensional, comprehensive monitoring system.
To solve the above technical problem, embodiments of the invention provide the following technical solutions:
In a first aspect, an embodiment of the invention provides a monitoring framework design method, comprising:
a monitoring server obtains the monitoring data reported by the agent unit in each virtual machine, and obtains from a control server the monitoring data generated when the control server controls each virtual machine;
the monitoring server stores the obtained monitoring data in a database;
the monitoring server reads monitoring data from the database and performs anomaly analysis, raises an alarm for monitoring data in which an anomaly exists, and stores the anomalous monitoring data in the database.
In a second aspect, an embodiment of the invention further provides a monitoring framework design method, comprising:
an agent unit monitors a virtual machine and generates monitoring data according to the operation data of the virtual machine, the agent unit being deployed in the virtual machine;
the agent unit reports the monitoring data to a monitoring server.
In a third aspect, an embodiment of the invention further provides a monitoring framework design method, comprising:
a control server generates monitoring data when controlling each virtual machine;
the control server sends the monitoring data to a monitoring server in response to a request from the monitoring server.
In a fourth aspect, an embodiment of the invention further provides a monitoring server, comprising:
a collection unit, configured to obtain the monitoring data reported by the agent unit in each virtual machine, and to obtain from a control server the monitoring data generated when the control server controls each virtual machine;
a storage unit, configured to store the obtained monitoring data in a database;
an alarm unit, configured to read monitoring data from the database, perform anomaly analysis, and raise an alarm for monitoring data in which an anomaly exists;
the storage unit being further configured to store the anomalous monitoring data in the database.
In a fifth aspect, an embodiment of the invention further provides an agent unit, the agent unit being deployed in a virtual machine and comprising a monitoring subunit and a transmission subunit, wherein
the monitoring subunit is configured to monitor the virtual machine and generate monitoring data according to the operation data of the virtual machine;
the transmission subunit is configured to report the monitoring data to a monitoring server.
In a sixth aspect, an embodiment of the invention further provides a control server, comprising a data generation unit and a transmission unit, wherein
the data generation unit is configured to generate monitoring data when controlling each virtual machine;
the transmission unit is configured to send the monitoring data to a monitoring server in response to a request from the monitoring server.
As can be seen from the above technical solutions, embodiments of the invention have the following advantages:
In embodiments of the invention, the monitoring server obtains the monitoring data reported by the agent unit in each virtual machine, and obtains from the control server the monitoring data generated when the control server controls each virtual machine. The monitoring server stores the obtained monitoring data in a database, reads monitoring data from the database and performs anomaly analysis, raises an alarm for monitoring data in which an anomaly exists, and stores the anomalous monitoring data in the database. Because the sources of the monitoring data obtained by the monitoring server include both the virtual machines and the control server that controls them, the monitoring server's coverage is broader: it covers the virtual machines themselves as well as the control server. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify the monitoring data in which an anomaly has occurred and raise an alarm for it, thereby realizing a three-dimensional, comprehensive monitoring system.
Brief description of the drawings
To describe the technical solutions in the embodiments of the invention more clearly, the drawings needed for the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the invention, and a person skilled in the art may derive other drawings from them.
Fig. 1 is a schematic flow block diagram of a monitoring framework design method provided by an embodiment of the invention;
Fig. 2 is a schematic flow block diagram of another monitoring framework design method provided by an embodiment of the invention;
Fig. 3 is a schematic flow block diagram of another monitoring framework design method provided by an embodiment of the invention;
Fig. 4-a is a schematic diagram of the design structure of a three-dimensional monitoring framework provided by an embodiment of the invention;
Fig. 4-b is a schematic diagram of the processing flow for monitoring data in the monitoring framework provided by an embodiment of the invention;
Fig. 5-a is a schematic structural diagram of a monitoring server provided by an embodiment of the invention;
Fig. 5-b is a schematic structural diagram of another monitoring server provided by an embodiment of the invention;
Fig. 5-c is a schematic structural diagram of a collection unit provided by an embodiment of the invention;
Fig. 6 is a schematic structural diagram of an agent unit provided by an embodiment of the invention;
Fig. 7 is a schematic structural diagram of a control server provided by an embodiment of the invention;
Fig. 8 is a schematic structural diagram of the monitoring framework design method provided by an embodiment of the invention as applied to a monitoring server;
Fig. 9 is a schematic structural diagram of the monitoring framework design method provided by an embodiment of the invention as applied to a virtual machine;
Fig. 10 is a schematic structural diagram of the monitoring framework design method provided by an embodiment of the invention as applied to a control server.
Detailed description
Embodiments of the invention provide a monitoring framework design method, a monitoring server, an agent unit and a control server, which are used to realize a three-dimensional, comprehensive monitoring system.
To make the objects, features and advantages of the invention clearer and easier to understand, the technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings in the embodiments. The embodiments disclosed below are only some, not all, of the embodiments of the invention. All other embodiments obtained by a person skilled in the art on the basis of the embodiments of the invention fall within the protection scope of the invention.
The terms "include" and "have" and any variants thereof in the description, the claims and the above drawings are intended to cover non-exclusive inclusion, so that a process, method, system, product or device comprising a series of units is not necessarily limited to those units, but may include other units that are not expressly listed or that are inherent to such a process, method, product or device.
Each is described in detail below.
In the monitoring framework design method of the invention, the monitoring server is used to implement the monitoring framework. The monitoring server establishes communication connections with each virtual machine and with the control server that controls the virtual machines. By monitoring the virtual machines and the control server, the monitoring server can raise accurate and timely alarms for faults that occur, which helps operation and maintenance personnel locate and resolve problems quickly and safeguards the quality of service for users. The monitoring framework design method provided by embodiments of the invention can realize a three-dimensional, comprehensive monitoring system. The methods implemented by the monitoring server, the virtual machine and the control server when carrying out the monitoring framework design are described in turn below.
One embodiment of the monitoring framework design method of the invention can be applied in a monitoring server. Referring to Fig. 1, the monitoring framework design method provided by an embodiment of the invention may include the following steps:
101. The monitoring server obtains the monitoring data reported by the agent unit in each virtual machine, and obtains from the control server the monitoring data generated when the control server controls each virtual machine.
In embodiments of the invention, so that the monitoring server can determine faults accurately and in a timely manner and raise alarms, an agent unit is deployed in each virtual machine. The agent unit collects the operation data of its virtual machine and generates the monitoring data for that virtual machine. The control server also generates monitoring data when it controls the virtual machines. In the monitoring framework of the invention, the agent unit in each virtual machine must be configured to report monitoring data to the monitoring server, and the monitoring server must be configured to request monitoring data from the control server. The monitoring server therefore obtains the monitoring data reported by the agent unit in each virtual machine, and obtains from the control server the monitoring data generated when the control server controls each virtual machine. Because the sources of the monitoring data obtained by the monitoring server include both the virtual machines and the control server that controls them, the monitoring server's coverage is broader: it covers the virtual machines themselves as well as the control server.
Further, to provide a finer-grained monitoring service, in some embodiments of the invention the step in which the monitoring server obtains the monitoring data reported by the agent unit in each virtual machine (step 101) may specifically include the following step:
A1. The monitoring server obtains the monitoring data that the agent unit in each virtual machine reports in real time according to the preset monitored items and collection period.
That is, so that the monitoring server can provide a finer-grained monitoring service, a configuration file must be preset for the agent unit deployed in each virtual machine, and the agent unit monitors the virtual machine as the configuration file requires. For example, the configuration file specifies the monitored items and the agent unit's collection period. The agent unit then monitors the operation data of the virtual machine in real time for each monitored item set in the configuration file, and reports multiple groups of monitoring data to the monitoring server according to the configured collection period. Specifically, the monitored items preset in the agent unit's configuration file may include the system layer and the process layer of the virtual machine: the agent unit collects the system layer data and process layer data of the virtual machine in which it resides and reports them to the monitoring server.
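The agent-side configuration described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `AgentConfig`, `collect_once` and `report_times`, the probe functions and the 300-second default period are all hypothetical and do not appear in the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class AgentConfig:
    """Hypothetical agent configuration file: monitored items and collection period."""
    monitored_items: List[str] = field(default_factory=lambda: ["system", "process"])
    collection_period_s: int = 300  # assumed value; the patent does not fix a period


def collect_once(config: AgentConfig,
                 probes: Dict[str, Callable[[], dict]],
                 vm_id: str,
                 now: int) -> List[dict]:
    """Build one report per configured monitored item for a single collection cycle."""
    reports = []
    for item in config.monitored_items:
        probe = probes.get(item)
        if probe is None:
            continue  # no probe registered for this monitored item
        reports.append({"vm_id": vm_id, "item": item, "timestamp": now, "values": probe()})
    return reports


def report_times(config: AgentConfig, start: int, end: int) -> List[int]:
    """Timestamps at which the agent would report, given its collection period."""
    return list(range(start, end, config.collection_period_s))


# Example probes standing in for real system-layer and process-layer collection.
example_probes = {
    "system": lambda: {"nic_rate_mbps": 1000},
    "process": lambda: {"cpu_pct": 12.5},
}
example_reports = collect_once(AgentConfig(), example_probes, "vm-1", now=0)
```

Each element of `example_reports` corresponds to one group of monitoring data that the agent unit would report to the monitoring server for one monitored item.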
It should be noted that the system layer of a virtual machine refers to the underlying hardware level of the virtual machine, and system layer data refers to the monitoring data obtained by monitoring the underlying infrastructure of the virtual machine. System layer data includes, but is not limited to, the following examples: network bandwidth usage, packet load, physical capacity, virtual machine memory capacity, network card rate, parsed dmesg information, the various system stacks, and input/output (Input/Output, I/O). The process layer of a virtual machine refers to the level of the key processes on the virtual machine, and process layer data refers to the monitoring data obtained by monitoring the running state of those key processes. Process layer data includes, but is not limited to, the following examples: central processing unit (Central Processing Unit, CPU) utilization, memory usage and disk usage.
Further, the step of obtaining from the control server the monitoring data generated when the control server controls each virtual machine (step 101) may specifically include the following step:
A2. The monitoring server obtains from the control server the monitoring data that the control server sends in real time according to the preset monitored items and collection period.
That is, so that the monitoring server can provide a finer-grained monitoring service, a configuration file must also be preset for the monitoring server, and the monitoring server requests monitoring data from the control server as the configuration file requires. For example, the configuration file specifies the monitored items and the collection period, and the monitoring server then periodically requests monitoring data from the control server according to those monitored items and that collection period. Specifically, the monitored items preset in the monitoring server's configuration file may include the module layer, the platform layer and the user layer in the control server: the monitoring server pulls module layer data, platform layer data and user layer data from the control server, and the control server returns module layer data, platform layer data and user layer data to the monitoring server in response to its requests.
It should be noted that a virtualization platform can be deployed in the control server. The virtualization platform handles the allocation, reclamation and circulation of resources, and deploying it in the control server allows resources to be managed centrally. The control server also deploys multiple modules through the virtualization platform; the modules are the components that make up the virtualization platform, and they interact with one another to carry out resource allocation and similar tasks. The platform layer in the control server refers to the level of the virtualization platform, and platform layer data refers to the monitoring data obtained when the control server monitors the virtualization platform. Platform layer data includes, but is not limited to, the following examples: data flow inside the virtualization platform and application programming interface (Application Programming Interface, API) calls. The module layer in the control server refers to the level of the individual modules contained in the virtualization platform, and module layer data refers to the monitoring data obtained when the control server monitors those modules. Module layer data includes, but is not limited to, the following examples: the state, elapsed time and liveness of the relevant modules inside the virtualization platform. The user layer in the control server refers to the level at which users use virtual machines, and user layer data refers to the monitoring data obtained when the control server monitors users' use of virtual machines. User layer data includes, but is not limited to, the following examples: the CPU utilization, the I/O and the network usage of the virtual machines that a user is using.
In some embodiments of the invention, depending on the monitored items that the agent unit needs to report, the step in which the monitoring server obtains the monitoring data reported by the agent unit in each virtual machine (step 101) specifically includes the following step:
B1. The monitoring server obtains the system layer data and process layer data reported by the agent unit in each virtual machine.
Here, the configuration file set in the agent unit requires the agent unit to report two monitored items: the system layer and the process layer. For example, a globally unique monitoring identifier (Identity, ID) is set for each monitored item according to its monitoring dimension, so that the system layer monitored item and the process layer monitored item use different monitoring IDs. The agent unit tags system layer data with the monitoring ID of the system layer and process layer data with the monitoring ID of the process layer. The monitoring server receives the system layer data and process layer data reported by the agent unit and distinguishes them by the monitoring ID of the system layer and the monitoring ID of the process layer.
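Tagging and identification by monitoring ID might look like the following sketch. The enum `MonitorId`, its numeric values and the helper names `tag` and `classify` are assumptions made for illustration; the patent only requires that each monitored item have a globally unique monitoring ID.

```python
from enum import Enum
from typing import Dict, List


class MonitorId(Enum):
    """Hypothetical globally unique monitoring IDs, one per monitored item."""
    SYSTEM = 1
    PROCESS = 2


def tag(monitor_id: MonitorId, payload: dict) -> dict:
    """Agent side: attach the monitoring ID of the layer to a data record."""
    return {"monitor_id": monitor_id.value, "payload": payload}


def classify(records: List[dict]) -> Dict[MonitorId, List[dict]]:
    """Server side: split received records into layers using their monitoring ID."""
    by_layer: Dict[MonitorId, List[dict]] = {m: [] for m in MonitorId}
    for rec in records:
        by_layer[MonitorId(rec["monitor_id"])].append(rec["payload"])
    return by_layer


received = [
    tag(MonitorId.SYSTEM, {"nic_rate_mbps": 1000}),
    tag(MonitorId.PROCESS, {"cpu_pct": 12.5}),
    tag(MonitorId.SYSTEM, {"bandwidth_pct": 40}),
]
layers = classify(received)
```

The same scheme extends naturally to the module layer, platform layer and user layer by adding further IDs.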
Further, in the implementation scenario of step B1, the step in which the monitoring server obtains the system layer data and process layer data reported by the agent unit in each virtual machine may specifically include the following:
A system collection unit and a process collection unit are set in the monitoring server. The system collection unit receives the system layer data reported by the agent unit in each virtual machine; the process collection unit receives the process layer data reported by the agent unit in each virtual machine.
That is, at the implementation level, a separate collection unit is designed in the monitoring server for the monitored item of each layer: a system collection unit for the system layer and a process collection unit for the process layer. To preserve flexibility, the system collection unit and the process collection unit each receive the monitoring data reported by every agent unit.
In some embodiments of the invention, depending on the monitored items that need to be obtained from the control server, the step of obtaining from the control server the monitoring data generated when the control server controls each virtual machine (step 101) specifically includes the following step:
B2. The monitoring server obtains from the control server the module layer data, platform layer data and user layer data generated when the control server controls each virtual machine through the virtualization platform. Multiple modules are deployed on the virtualization platform, and the user layer data includes the virtual machine operation data generated when users use the virtual machines.
Here, the configuration file set in the monitoring server requires the monitoring server to obtain three monitored items from the control server: the module layer, the platform layer and the user layer. For example, a globally unique monitoring ID is set for each monitored item according to its monitoring dimension, so that the module layer monitored item, the platform layer monitored item and the user layer monitored item use different monitoring IDs. The monitoring server tags module layer data with the monitoring ID of the module layer, platform layer data with the monitoring ID of the platform layer, and user layer data with the monitoring ID of the user layer. The monitoring server obtains from the control server the module layer data, platform layer data and user layer data generated when the control server controls each virtual machine through the virtualization platform, and distinguishes them by the monitoring IDs of the module layer, the platform layer and the user layer.
Further, in the implementation scenario of step B2, the step in which the monitoring server obtains this module layer data, platform layer data and user layer data from the control server may specifically include the following:
A module collection unit, a platform collection unit and a user collection unit are set in the monitoring server. The module collection unit pulls from the control server the module layer data generated when the control server controls each virtual machine through the virtualization platform; the platform collection unit pulls the platform layer data of the virtualization platform from the control server; the user collection unit pulls from the control server the user layer data generated when users use the virtual machines.
That is, at the implementation level, a separate collection unit is designed in the monitoring server for the monitored item of each layer: a module collection unit for the module layer, a platform collection unit for the platform layer and a user collection unit for the user layer. To preserve flexibility, the module collection unit, the platform collection unit and the user collection unit each receive the monitoring data returned by the control server.
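The per-layer collection units that pull from the control server can be sketched as below. The class `PullCollector`, the function `fake_control_server` and all sample field names are hypothetical stand-ins; the patent does not specify the request interface between the monitoring server and the control server.

```python
from typing import Callable, List


class PullCollector:
    """Hypothetical collection unit that pulls one layer's data from the control server."""

    def __init__(self, layer: str, fetch: Callable[[str], List[dict]]):
        self.layer = layer
        self._fetch = fetch  # stands in for a request to the control server

    def collect(self) -> List[dict]:
        # Label every returned row with the layer this collection unit serves.
        return [{"layer": self.layer, **row} for row in self._fetch(self.layer)]


def fake_control_server(layer: str) -> List[dict]:
    """Stand-in for the control server answering a pull request (sample data only)."""
    data = {
        "module": [{"name": "scheduler", "alive": True, "latency_ms": 12}],
        "platform": [{"api": "create_vm", "calls": 40}],
        "user": [{"user": "u1", "cpu_pct": 35.0}],
    }
    return data.get(layer, [])


collectors = [PullCollector(layer, fake_control_server)
              for layer in ("module", "platform", "user")]
pulled = [rec for c in collectors for rec in c.collect()]
```

Keeping one collection unit per layer means a new monitored item can be added by registering another collector without touching the existing ones, which is one reading of the flexibility the text mentions.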
102. The monitoring server stores the obtained monitoring data in a database.
In embodiments of the invention, after the monitoring server has obtained monitoring data from the agent units and from the control server, the monitoring data it holds includes both the monitoring data sent by the agent unit in each virtual machine and the monitoring data sent by the control server. A database (Data Base, DB) is deployed in the monitoring server, and the monitoring server stores all of the obtained monitoring data in it. The monitoring server can retrieve the stored monitoring data from the database whenever it is needed.
In some embodiments of the invention, for the application scenario of step B1 above, the step in which the monitoring server stores the obtained monitoring data in the database (step 102) may specifically include the following:
A system data sub-library and a process data sub-library are set in the monitoring server. The monitoring server stores system layer data in the system data sub-library and process layer data in the process data sub-library.
Likewise, for the application scenario of step B2 above, the step in which the monitoring server stores the obtained monitoring data in the database (step 102) includes:
A module data sub-library, a platform data sub-library and a user data sub-library are set in the monitoring server. The monitoring server stores module layer data in the module data sub-library, platform layer data in the platform data sub-library, and user layer data in the user data sub-library.
That is, monitoring data can be stored separately according to how finely the monitored items are defined. The database is divided by monitored item into five storage spaces, identified by the monitoring IDs of the stored monitoring data: the system data sub-library, the process data sub-library, the module data sub-library, the platform data sub-library and the user data sub-library. Each sub-library stores the monitoring data that corresponds to it, which makes it easy for the monitoring server to store monitoring data by category. When the monitoring server needs to retrieve the monitoring data of a particular monitored item, it can do so efficiently and quickly.
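The five sub-libraries can be illustrated with one table per monitored item. This sketch uses Python's built-in `sqlite3` module with an in-memory database purely for illustration; the table layout and the function names `open_store`, `store` and `load` are assumptions, and the patent does not name a particular database product or schema.

```python
import json
import sqlite3

# One storage space per monitored item, as described in the text above.
SUB_LIBRARIES = ("system", "process", "module", "platform", "user")


def open_store() -> sqlite3.Connection:
    """Create an in-memory database with one table per sub-library."""
    conn = sqlite3.connect(":memory:")
    for name in SUB_LIBRARIES:
        conn.execute(f"CREATE TABLE {name}_data (ts INTEGER, source TEXT, payload TEXT)")
    return conn


def store(conn: sqlite3.Connection, layer: str, ts: int, source: str, payload: dict) -> None:
    """Route a record to the sub-library matching its monitored item."""
    if layer not in SUB_LIBRARIES:
        raise ValueError(f"unknown monitored item: {layer}")
    conn.execute(f"INSERT INTO {layer}_data VALUES (?, ?, ?)",
                 (ts, source, json.dumps(payload)))


def load(conn: sqlite3.Connection, layer: str) -> list:
    """Read back every record of one monitored item, oldest first."""
    if layer not in SUB_LIBRARIES:
        raise ValueError(f"unknown monitored item: {layer}")
    rows = conn.execute(
        f"SELECT ts, source, payload FROM {layer}_data ORDER BY ts").fetchall()
    return [(ts, source, json.loads(payload)) for ts, source, payload in rows]
```

Because each monitored item has its own table, a query for one item never scans the records of the other four, which matches the efficiency argument made in the text.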
103. The monitoring server reads monitoring data from the database and performs anomaly analysis, raises an alarm for monitoring data in which an anomaly exists, and stores the anomalous monitoring data in the database.
In embodiments of the invention, the monitoring data that the monitoring server collects from the agent unit of each virtual machine and from the control server is all stored in the database. The monitoring server can read monitoring data from the database in real time and perform anomaly analysis on the data it reads, thereby identifying the anomalous monitoring data and raising an alarm. It should be noted that whether monitoring data is anomalous depends on the specific application scenario and on the specific monitoring service that the monitoring server provides to operation and maintenance personnel; different scenarios may use different data analysis methods for the anomaly analysis. The monitoring server may also raise alarms for anomalous monitoring data in various ways, for example with a high-priority message, or with an animation, audio or a specific program serving as the alarm. The specific alarm mode can be indicated by a preconfigured Alarm ID and is not limited here. In addition, after the monitoring server has identified anomalous monitoring data, it must also store that data in the database deployed in the monitoring server. It can be understood that, when the database stores the monitoring data obtained in step 102 and the anomalous monitoring data, it only needs to use different storage index tables so that the two classes of monitoring data are stored separately and can later be retrieved by the monitoring server.
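Since the text leaves the analysis method open, the following sketch substitutes a simple fixed-threshold rule as one possible form of anomaly analysis. The threshold rule, the function names `find_anomalies` and `analyse`, the record fields and the use of plain lists as stand-ins for the two storage index tables are all assumptions made for illustration.

```python
from typing import Callable, Dict, List


def find_anomalies(records: List[dict], thresholds: Dict[str, float]) -> List[dict]:
    """Assumed rule: a record is anomalous when its value exceeds the metric's threshold."""
    anomalies = []
    for rec in records:
        limit = thresholds.get(rec["metric"])
        if limit is not None and rec["value"] > limit:
            anomalies.append(rec)
    return anomalies


def analyse(raw_index: List[dict],
            anomaly_index: List[dict],
            thresholds: Dict[str, float],
            alert: Callable[[dict], None]) -> List[dict]:
    """Read stored data, alert on anomalies, and store them under a separate index."""
    found = find_anomalies(raw_index, thresholds)
    for rec in found:
        alert(rec)                 # alarm mode is configurable in the patent; a callback here
        anomaly_index.append(rec)  # kept apart from the raw monitoring data
    return found


raw = [
    {"source": "vm-1", "metric": "packet_loss_pct", "value": 7.5},
    {"source": "vm-1", "metric": "cpu_pct", "value": 20.0},
]
anomalous, sent = [], []
analyse(raw, anomalous, {"packet_loss_pct": 5.0, "cpu_pct": 90.0}, sent.append)
```

The raw list is left untouched while anomalous records are copied to a second list, mirroring the separate storage of the two classes of monitoring data.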
In some embodiments of the invention, after the anomalous monitoring data has been stored in the database in step 103, the monitoring framework design method provided by the invention may further include the following step:
C1. The monitoring server extracts the anomalous monitoring data from the database and presents it externally for fault analysis.
That is, after the monitoring server has identified anomalous monitoring data and raised an alarm, it also extracts the anomalous monitoring data from the database and presents it externally so that operation and maintenance personnel can locate and analyze the fault quickly. For example, the monitoring server can output the anomalous monitoring data to operation and maintenance personnel through an Operation Support System (OSS), so that they can locate and resolve the problem quickly and safeguard the quality of service for users. Specifically, a display unit can be deployed in the monitoring server to output the anomalous monitoring data externally. The display unit may include a system display unit, a process display unit, a module display unit, a platform display unit and a user display unit, each of which presents the data of its corresponding monitored item.
In some embodiments of the invention, it for executing the application scenarios of abovementioned steps A1 and A2, monitors in step 103 Server reads monitoring data from database and carries out anomaly analysis, comprising:
Monitoring server carries out anomaly analysis according to the multiple groups monitoring data got, determines in the presence of abnormal monitoring ?.
Further, under realization scene above-mentioned, there is abnormal monitoring data and alert in step 103 pair, specifically It may include steps of:
Monitoring server polymerize the multiple abnormal monitoring data for determining the same monitored item in the presence of exception, according to Polymerization result carries out concentration alarm.
That is, if the monitoring server receives multiple groups of monitoring data from an agent unit or the control server, it can analyze the multiple groups of monitoring data for the same monitored item side by side, which also allows the monitoring framework to monitor itself. Further, an alarm unit can be set up in the monitoring server to alarm on abnormal monitoring data. Take the network interface card (NIC) packet loss rate of a virtual machine as an example: the agent unit on the virtual machine collects and reports the value once every 5 minutes, so the monitoring server receives monitoring data from the agent unit every 5 minutes and stores one packet loss record in the database every 5 minutes. If the alarm unit did no aggregation, it would issue one alarm every 5 minutes, producing a large amount of redundant information. If the alarm unit instead aggregates the alarms and issues the merged information in a single alarm, unnecessary interference is avoided. To suppress alarm storms, the alarm unit can therefore aggregate multiple alarms for the same monitored item and carry the multiple abnormal monitoring data in the aggregation result, achieving a centralized alarm. Specifically, the alarm unit may include a system alarm unit, a process alarm unit, a module alarm unit, a platform alarm unit, and a user alarm unit, each of which alarms on the corresponding monitored items.
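The aggregation described above might look like the following sketch. The record layout `(monitor_id, ts, value)` and the fields of the resulting alarm are assumptions chosen for illustration; the text does not specify either.

```python
from collections import defaultdict


def aggregate_alarms(abnormal_records):
    """Group abnormal records by monitored item so each item yields a single alarm."""
    grouped = defaultdict(list)
    for monitor_id, ts, value in abnormal_records:
        grouped[monitor_id].append((ts, value))

    alarms = []
    for monitor_id, samples in grouped.items():
        samples.sort()  # order the abnormal samples by timestamp
        alarms.append({
            "monitor_id": monitor_id,
            "count": len(samples),
            "first_ts": samples[0][0],
            "last_ts": samples[-1][0],
            "samples": samples,  # the aggregated alarm carries every abnormal sample
        })
    return alarms
```

With the 5-minute packet loss example, three consecutive abnormal reports for the same item produce one alarm with `count` equal to 3 instead of three separate alarms.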
From the above description of the embodiments of the invention, it can be seen that the monitoring server obtains the monitoring data reported by the agent unit in each virtual machine and obtains from the control server the monitoring data that arises when the control server controls the virtual machines. The monitoring server stores the obtained monitoring data in the database, reads monitoring data from the database for anomaly analysis, alarms on abnormal monitoring data, and stores the abnormal monitoring data in the database. Because the monitoring data obtained by the monitoring server comes both from the virtual machines and from the control server that controls them, the monitoring server has broader coverage, spanning the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify abnormal monitoring data and alarm on it, realizing a three-dimensional, all-round monitoring system.
The monitoring framework design method of the invention was illustrated above from the monitoring server side. Next, another monitoring framework design method provided by an embodiment of the invention is introduced, which can be applied to the agent unit deployed in each virtual machine. Referring to Fig. 2, the monitoring framework design method provided by one embodiment of the invention may include the following steps:
201. The agent unit monitors the virtual machine and generates monitoring data from the operating data of the virtual machine; the agent unit is deployed in the virtual machine.
202. The agent unit reports the monitoring data to the monitoring server.
In embodiments of the invention, so that the monitoring server can identify faults and raise alarms accurately and promptly, one agent unit is deployed in each virtual machine. The agent unit collects the operating data of its own virtual machine, generates the monitoring data for that virtual machine, and actively reports it to the monitoring server. Active reporting avoids collecting monitoring information from a single point: information about all virtual machines is gathered by the agent units deployed on the individual virtual machines, rather than by one program on the monitoring server that fetches the monitoring data of every virtual machine.
In some embodiments of the invention, step 201, in which the agent unit monitors the virtual machine and generates monitoring data from the operating data of the virtual machine, may specifically include the following step:
The agent unit monitors each item of operating data of the virtual machine in real time according to the monitored items set in a configuration file, and generates multiple groups of monitoring data according to the collection period set in the configuration file.
That is, so that the monitoring server can provide a finer-grained monitoring service, a configuration file also needs to be preset for the agent unit deployed in the virtual machine, and the agent unit monitors the virtual machine as the configuration file requires. For example, the configuration file sets the monitored items and the collection period of the agent unit. The agent unit then monitors each item of operating data of the virtual machine in real time according to the configured monitored items and reports multiple groups of monitoring data to the monitoring server according to the configured collection period. Specifically, the monitored items preset in the agent unit's configuration file may cover the system layer and the process layer of the virtual machine: the agent unit collects the system layer data and process layer data of the virtual machine in which it resides and reports them to the monitoring server.
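A configuration-driven agent of this kind could be sketched as below. The JSON format, the key names `collect_period_s` and `items`, the monitoring IDs, and the `probes` mapping are hypothetical; the text only says that the configuration file sets the monitored items and the collection period.

```python
import json

# Hypothetical configuration: a collection period and the monitored items per layer.
CONFIG_TEXT = """
{
  "collect_period_s": 300,
  "items": [
    {"monitor_id": "sys.nic.loss", "layer": "system"},
    {"monitor_id": "proc.cpu", "layer": "process"}
  ]
}
"""


def load_config(text):
    """Parse the agent configuration from JSON text."""
    return json.loads(text)


def collect_once(config, probes, now):
    """Build one group of monitoring data: one record per configured monitored item."""
    return [
        {
            "monitor_id": item["monitor_id"],
            "layer": item["layer"],
            "ts": now,
            "value": probes[item["monitor_id"]](),  # call the probe for this item
        }
        for item in config["items"]
    ]


def collect_groups(config, probes, start, cycles):
    """Generate one group per collection period, simulating `cycles` periods."""
    period = config["collect_period_s"]
    return [collect_once(config, probes, start + i * period) for i in range(cycles)]
```

Here `probes` maps each monitoring ID to a zero-argument function that reads the current value, which keeps the collection loop independent of how any one metric is measured.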
It should be noted that the system layer of a virtual machine refers to its underlying hardware level. System layer data is the monitoring data obtained by monitoring the underlying infrastructure of the virtual machine, including but not limited to the following examples: network bandwidth usage, packet volume load, physical memory capacity, virtual machine memory capacity, NIC rate, parsed dmesg information, the various system stacks, and I/O. The process layer of a virtual machine refers to the level of the critical processes on the virtual machine. Process layer data is the monitoring data obtained by monitoring the running state of those critical processes, including but not limited to the following examples: CPU usage, memory usage, and disk usage.
Further, if the monitoring data includes system layer data and process layer data, step 202, in which the agent unit reports the monitoring data to the monitoring server, may specifically include the following step:
The agent unit sends the system layer data to the system collector unit set up in the monitoring server and sends the process layer data to the process collector unit set up in the monitoring server.
That is, at the implementation level, a dedicated collector unit needs to be designed in the monitoring server for the monitored items of each layer: a system collector unit for the system layer and a process collector unit for the process layer. To guarantee flexibility, the system collector unit and the process collector unit each receive the monitoring data reported by every agent unit.
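The per-layer routing on the agent side can be illustrated with a short sketch. The `Collector` class, the `report` function, and the record fields are assumed names for the example and are not defined by the text.

```python
class Collector:
    """Stand-in for a collector unit that accepts records for exactly one layer."""

    def __init__(self, layer):
        self.layer = layer
        self.received = []

    def receive(self, record):
        # Reject records that belong to a different monitoring layer.
        if record["layer"] != self.layer:
            raise ValueError(f"{self.layer} collector got a {record['layer']} record")
        self.received.append(record)


def report(records, collectors):
    """Agent-side push: send each record to the collector unit for its layer."""
    for record in records:
        collectors[record["layer"]].receive(record)
```

Keeping one collector per layer means a collector can be added, removed, or upgraded without touching the others, which is the flexibility the paragraph above refers to.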
From the above description of the embodiments of the invention, it can be seen that the agent unit in each virtual machine reports monitoring data to the monitoring server, and the monitoring server also obtains from the control server the monitoring data that arises when the control server controls the virtual machines. The monitoring server stores the obtained monitoring data in the database, reads monitoring data from the database for anomaly analysis, alarms on abnormal monitoring data, and stores the abnormal monitoring data in the database. Because the monitoring data obtained by the monitoring server comes both from the virtual machines and from the control server that controls them, the monitoring server has broader coverage, spanning the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify abnormal monitoring data and alarm on it, realizing a three-dimensional, all-round monitoring system.
The monitoring framework design method of the invention was illustrated above from the monitoring server side and the agent unit side. Next, another monitoring framework design method provided by an embodiment of the invention is introduced, which can be applied to the control server. Referring to Fig. 3, the monitoring framework design method provided by one embodiment of the invention may include the following steps:
301. The control server generates the monitoring data that arises when it controls each virtual machine.
302. The control server sends the monitoring data to the monitoring server in response to a request from the monitoring server.
In embodiments of the invention, the control server may also be called the central control server and is used to control each virtual machine. So that the monitoring server can identify faults and raise alarms accurately and promptly, the control server generates monitoring data when it controls virtual machines, and in the monitoring framework of the invention the monitoring server is configured to request this monitoring data from the control server. The monitoring server therefore obtains both the monitoring data reported by the agent unit in each virtual machine and, from the control server, the monitoring data that arises when the control server controls each virtual machine. By providing this data, the control server cooperates with the monitoring server to complete the design of the monitoring framework. The monitoring data provided by the control server satisfies the monitoring server's need for all-round, three-dimensional monitoring and is more targeted than the single source of monitoring data available to a monitoring server in the prior art.
In some embodiments of the invention, step 301, in which the control server generates the monitoring data that arises when it controls each virtual machine, may specifically include the following step:
The control server generates multiple groups of monitoring data according to the monitored items and the collection period set in a configuration file.
That is, so that the monitoring server can provide a finer-grained monitoring service, a configuration file also needs to be preset for the monitoring server, and the monitoring server requests monitoring data from the control server as the configuration file requires. For example, the configuration file sets the monitored items and the collection period, and the monitoring server then periodically requests monitoring data from the control server according to the configured monitored items and collection period. Specifically, the monitored items preset in the monitoring server's configuration file may cover the module layer, the platform layer, and the user layer in the control server: the monitoring server pulls module layer data, platform layer data, and user layer data from the control server, and the control server returns these data to the monitoring server in response to its requests.
It should be noted that a virtualization platform can be deployed in the control server. The virtualization platform handles resource allocation, reclamation, and circulation, and deploying it in the control server enables centralized management of resources. The control server also deploys multiple modules through the virtualization platform; the modules are the components that make up the virtualization platform, and they interact to complete tasks such as resource allocation. The platform layer in the control server refers to the virtualization platform level. Platform layer data is the monitoring data that the control server obtains by monitoring the virtualization platform, including but not limited to the following examples: data flow inside the virtualization platform and its APIs. The module layer in the control server refers to the level of the individual modules included in the virtualization platform. Module layer data is the monitoring data that the control server obtains by monitoring those modules, including but not limited to the following examples: the state, elapsed time, and liveness of the relevant modules inside the virtualization platform. The user layer in the control server refers to the level of users using virtual machines. User layer data is the monitoring data that the control server obtains by monitoring users' use of virtual machines, including but not limited to the following examples: the CPU usage, the I/O, and the network usage of the virtual machines that users use.
Further, if the monitoring data includes module layer data, platform layer data, and user layer data, step 302, in which the control server sends the monitoring data to the monitoring server in response to a request from the monitoring server, may specifically include the following step:
The control server sends the module layer data to the module collector unit set up in the monitoring server, sends the platform layer data to the platform collector unit set up in the monitoring server, and sends the user layer data to the user collector unit set up in the monitoring server.
That is, at the implementation level, a dedicated collector unit needs to be designed in the monitoring server for the monitored items of each layer: a module collector unit for the module layer, a platform collector unit for the platform layer, and a user collector unit for the user layer. To guarantee flexibility, the module collector unit, the platform collector unit, and the user collector unit each receive the monitoring data returned by the control server.
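Unlike the agent units, which push their data, the three collector units here pull from the control server. A minimal sketch of that request and response pattern follows; the class names, the `handle_request` method, and the in-memory lists are assumptions for illustration, and a real deployment would make the request over the network.

```python
class ControlServer:
    """Stand-in for the control server: it accumulates records per layer."""

    def __init__(self):
        self._data = {"module": [], "platform": [], "user": []}

    def record(self, layer, monitor_id, value):
        """Generate one monitoring record while controlling virtual machines."""
        self._data[layer].append({"monitor_id": monitor_id, "value": value})

    def handle_request(self, layer):
        """Return the pending records for the requested layer and clear them."""
        pending, self._data[layer] = self._data[layer], []
        return pending


class PullCollector:
    """Stand-in for a module, platform, or user collector unit."""

    def __init__(self, layer):
        self.layer = layer
        self.received = []

    def pull(self, control_server):
        """Request this collector's layer from the control server; return the batch size."""
        batch = control_server.handle_request(self.layer)
        self.received.extend(batch)
        return len(batch)
```

In practice each collector would call `pull` once per configured collection period.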
From the above description of the embodiments of the invention, it can be seen that the control server sends to the monitoring server the monitoring data that arises when it controls the virtual machines, and the monitoring server also obtains the monitoring data reported by the agent unit in each virtual machine. The monitoring server stores the obtained monitoring data in the database, reads monitoring data from the database for anomaly analysis, alarms on abnormal monitoring data, and stores the abnormal monitoring data in the database. Because the monitoring data obtained by the monitoring server comes both from the virtual machines and from the control server that controls them, the monitoring server has broader coverage, spanning the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify abnormal monitoring data and alarm on it, realizing a three-dimensional, all-round monitoring system.
To facilitate a better understanding and implementation of the above solutions of the embodiments of the invention, a corresponding application scenario is described in detail below.
The monitoring framework design method provided by the invention can realize a three-dimensional, all-round monitoring system. For example, an Infrastructure-as-a-Service (IaaS) cloud computing system has strict stability requirements. With the invention, the cloud computing system can be monitored closely and faults can be alarmed on accurately and promptly, so that operations and maintenance (O&M) personnel can quickly locate and resolve problems and guarantee quality of service for users.
The composition of the three-dimensional monitoring solution of the invention is described first. Fig. 4-a is a schematic diagram of the design structure of a three-dimensional monitoring framework provided by an embodiment of the invention. The three-dimensional monitoring solution subdivides the monitored items into five levels to monitor the cloud computing system closely: the user layer, the platform layer, the module layer, the process layer, and the system layer. Together, the monitored items of these five layers provide all-round monitoring of the virtualization environment.
Next, the processing flow of monitoring data in the monitoring framework is described in terms of the monitoring server, the control server, and the agent units in the virtual machines. Fig. 4-b is a schematic diagram of the processing flow of monitoring data in the monitoring framework provided by an embodiment of the invention. The monitoring server is divided into four main parts: the collector units, the database, the alarm unit, and the display unit.
1) The agent unit in a virtual machine is mainly responsible for collecting key information from the local system layer and process layer and reporting it to the corresponding central node. For example, system layer data is reported to the system collector unit and process layer data is reported to the process collector unit.
For example, a monitoring program (that is, the agent unit) is deployed on all virtual machines. The agent unit registers the monitoring ID of each monitored item and determines the collection and reporting frequency of each monitoring ID from the configuration file.
2) The system collector unit and the process collector unit are responsible for receiving the system layer data and process layer data reported by the agent units. The module collector unit, the platform collector unit, and the user collector unit pull module layer data, platform layer data, and user layer data from the control server.
Specifically, a globally unique monitoring ID is assigned to each monitored item across the five monitoring layers, which speeds up fault location in the system. The agent units and the collector units can implement timed tasks with APScheduler, a Python framework based on Quartz. It provides tasks triggered by date, by fixed time interval, and in crontab style, and it can persist tasks, which makes the collection of monitoring data flexible.
For the database, two kinds of database can be used: the in-memory database Redis and MySQL. The collector units write the monitoring data reported by the agent units into Redis in real time. Redis is an open-source key-value NoSQL database with high performance and high availability that can cope with highly concurrent access. The collector units periodically import the Redis data into MySQL, which stores all metadata and monitoring data and guarantees that the monitoring data is continuous. At the same time, HAProxy is used to scale the cluster horizontally, which overcomes the read and write bottleneck of a single MySQL machine and guarantees the stability of the system.
3) The alarm unit extracts monitoring data from each data sub-database, determines which monitoring data is abnormal, aggregates the abnormal monitoring data, and raises alarms in flexible and varied ways according to the alarm item configuration. For example, the alarm unit adds the parsed dmesg results to the alarm content so that faults can be located and analyzed quickly. The abnormal monitoring data may be user layer data, process layer data, or system layer data; anomalies in these data may make a virtual machine unusable. If platform layer data or module layer data is abnormal, users may be unable to operate on virtual machines, for example to shut them down or restart them. As a further example, the operations and maintenance (O&M) tools are implemented with the tornado web server framework. Tornado is non-blocking and fast, and functions can be added or removed easily, which separates the front end and back end of the OSS and guarantees the scalability and flexibility of the system.
4) The display unit obtains monitoring data in real time from the data sub-database of each monitored item, provides real-time monitoring of the state of the cloud computing system, and provides a variety of O&M tools that help O&M personnel operate quickly, repair faults, and analyze causes. The O&M tools can be tools inside the cloud computing system for handling faults, for example creating a trouble ticket that records a fault and tracks its resolution.
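The two-tier storage described above, in which a fast in-memory buffer is periodically imported into a persistent database, can be sketched as follows. To keep the example self-contained, a `deque` stands in for Redis and an in-memory SQLite database stands in for MySQL; these substitutions, the class name, and the table layout are assumptions and not part of the described system.

```python
import sqlite3
from collections import deque


class BufferedStore:
    """Write-through buffer: fast appends first, periodic batch import afterwards."""

    def __init__(self):
        self.buffer = deque()                  # stand-in for the in-memory database
        self.db = sqlite3.connect(":memory:")  # stand-in for the persistent database
        self.db.execute(
            "CREATE TABLE monitoring_data (monitor_id TEXT, ts INTEGER, value REAL)"
        )

    def write(self, monitor_id, ts, value):
        """Called by a collector unit in real time for every reported record."""
        self.buffer.append((monitor_id, ts, value))

    def flush(self):
        """Called periodically: move all buffered records into the persistent table."""
        batch = list(self.buffer)
        self.buffer.clear()
        with self.db:  # commit the whole batch as one transaction
            self.db.executemany("INSERT INTO monitoring_data VALUES (?, ?, ?)", batch)
        return len(batch)

    def count(self):
        """Number of records already imported into the persistent table."""
        return self.db.execute("SELECT COUNT(*) FROM monitoring_data").fetchone()[0]
```

In a deployment like the one described, `flush` would run as a timed task, so that real-time writes never wait on the slower persistent store.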
From the examples above it can be seen that the monitoring framework design method of the invention has broad coverage, spanning everything from the lowest-level system hardware through critical processes and platform modules up to the top-level application. The monitoring data is continuous, which makes it very convenient to analyze, and the framework has no single point, is extensible, monitors itself, and raises alarms promptly and accurately. With globally unique monitoring IDs, the fault type can be located conveniently by monitoring ID, and adding parsed dmesg information to the alarm content speeds up fault location and analysis. Agent units are deployed on all virtual machines, and active reporting avoids collecting monitoring information from a single point. Each monitoring layer connects to its own collector unit, which keeps each collector unit's function simple and makes collector units easy to remove, add, and upgrade, giving good extensibility. The alarm unit analyzes the continuous monitoring data in real time and aggregates alarm information by the monitoring ID of the monitored item, suppressing alarm storms. In addition, the monitoring framework design requires the monitoring data to be continuous: a monitoring result must be reported whether or not the monitored information is abnormal. If the corresponding monitoring data is missing, it can be determined that the virtual machine hosting the agent unit is abnormal, which realizes self-monitoring.
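The self-monitoring rule, under which a missing report is itself a sign of a fault, can be illustrated with a small gap check. The function name and the slot-based matching are assumptions for the sketch; it treats a report as covering the collection period in which its timestamp falls.

```python
def missing_slots(timestamps, period, start, end):
    """Return the start times of collection periods in [start, end) with no report."""
    # Map every report inside the window to the index of its collection period.
    reported = {(ts - start) // period for ts in timestamps if start <= ts < end}
    total = (end - start) // period
    return [start + i * period for i in range(total) if i not in reported]
```

For a 300-second period over the window 0 to 1200, reports at 2, 301, and 905 leave the period starting at 600 uncovered, so the virtual machine hosting that agent unit would be flagged for that interval.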
It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions, but those skilled in the art should understand that the invention is not limited by the described order of actions, because according to the invention some steps may be performed in other orders or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the invention.
To better implement the above solutions of the embodiments of the invention, related apparatus for implementing the above solutions is also provided below.
Referring to Fig. 5-a, a monitoring server 500 provided by an embodiment of the invention may include a collector unit 501, a storage unit 502, and an alarm unit 503, wherein:
The collector unit 501 is configured to obtain the monitoring data reported by the agent unit in each virtual machine, and to obtain from the control server the monitoring data that arises when the control server controls each virtual machine;
The storage unit 502 is configured to store the obtained monitoring data in a database;
The alarm unit 503 is configured to read monitoring data from the database for anomaly analysis and to alarm on abnormal monitoring data;
The storage unit 502 is further configured to store the abnormal monitoring data in the database.
In some embodiments of the invention, referring to Fig. 5-b, compared with the monitoring server shown in Fig. 5-a, the monitoring server 500 further includes a display unit 504, configured to extract the abnormal monitoring data from the database after the storage unit 502 has stored the abnormal monitoring data in the database, and to display it externally for fault analysis.
In some embodiments of the invention, the collector unit 501 is specifically configured to obtain the monitoring data that the agent unit in each virtual machine reports in real time according to the preset monitored items and collection period, and to obtain from the control server the monitoring data that the control server sends in real time according to the preset monitored items and collection period.
In some embodiments of the invention, the alarm unit 503 is specifically configured to perform anomaly analysis on the multiple groups of monitoring data obtained and determine which monitored items are abnormal, to aggregate the multiple abnormal monitoring data of the same abnormal monitored item, and to raise a centralized alarm according to the aggregation result.
In some embodiments of the invention, referring to Fig. 5-c, the collector unit 501 includes a system collector unit 5011, a process collector unit 5012, a module collector unit 5013, a platform collector unit 5014, and a user collector unit 5015, wherein:
The system collector unit 5011 is configured to receive the system layer data reported by the agent unit in each virtual machine;
The process collector unit 5012 is configured to receive the process layer data reported by the agent unit in each virtual machine;
The module collector unit 5013 is configured to pull from the control server the module layer data generated when the control server controls each virtual machine through the virtualization platform;
The platform collector unit 5014 is configured to pull the platform layer data of the virtualization platform from the control server;
The user collector unit 5015 is configured to pull from the control server the user layer data generated when users use virtual machines. Multiple modules are deployed on the virtualization platform, and the user layer data includes the virtual machine operating data that arises when a user uses a virtual machine.
In some embodiments of the invention, a system data sub-database and a process data sub-database are set up in the monitoring server, and the storage unit 502 is specifically configured to store the system layer data in the system data sub-database and the process layer data in the process data sub-database;
A module data sub-database, a platform data sub-database, and a user data sub-database are set up in the monitoring server, and the storage unit 502 is specifically configured to store the module layer data in the module data sub-database, the platform layer data in the platform data sub-database, and the user layer data in the user data sub-database.
From the above description of the embodiments of the invention, it can be seen that the monitoring server obtains the monitoring data reported by the agent unit in each virtual machine and obtains from the control server the monitoring data that arises when the control server controls the virtual machines. The monitoring server stores the obtained monitoring data in the database, reads monitoring data from the database for anomaly analysis, alarms on abnormal monitoring data, and stores the abnormal monitoring data in the database. Because the monitoring data obtained by the monitoring server comes both from the virtual machines and from the control server that controls them, the monitoring server has broader coverage, spanning the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify abnormal monitoring data and alarm on it, realizing a three-dimensional, all-round monitoring system.
Referring to Fig. 6, an agent unit 600 provided by an embodiment of the invention is deployed in a virtual machine and may include a monitoring subunit 601 and a transmission subunit 602, wherein:
The monitoring subunit 601 is configured to monitor the virtual machine and generate monitoring data from the operating data of the virtual machine;
The transmission subunit 602 is configured to report the monitoring data to the monitoring server.
In some embodiments of the invention, the monitoring subunit 601 is specifically configured to monitor each item of operating data of the virtual machine in real time according to the monitored items set in a configuration file, and to generate multiple groups of monitoring data according to the collection period set in the configuration file.
In some embodiments of the invention, the monitoring data includes system layer data and process layer data. The transmission subunit 602 is specifically configured to send the system layer data to the system collector unit set up in the monitoring server and to send the process layer data to the process collector unit set up in the monitoring server.
From the above description of the embodiments of the invention, it can be seen that the agent unit in each virtual machine reports monitoring data to the monitoring server, and the monitoring server also obtains from the control server the monitoring data that arises when the control server controls the virtual machines. The monitoring server stores the obtained monitoring data in the database, reads monitoring data from the database for anomaly analysis, alarms on abnormal monitoring data, and stores the abnormal monitoring data in the database. Because the monitoring data obtained by the monitoring server comes both from the virtual machines and from the control server that controls them, the monitoring server has broader coverage, spanning the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify abnormal monitoring data and alarm on it, realizing a three-dimensional, all-round monitoring system.
Referring to Fig. 7, a control server 700 provided in an embodiment of the present invention may include a data generating unit 701 and a transmission unit 702, wherein:
The data generating unit 701 is configured to generate the monitoring data when controlling each virtual machine;
The transmission unit 702 is configured to send the monitoring data to the monitoring server according to a request from the monitoring server.
In some embodiments of the invention, the data generating unit 701 is specifically configured to generate multiple groups of monitoring data according to the monitored items and the collection period set in a configuration file.
In some embodiments of the invention, the monitoring data includes module layer data, platform layer data, and user layer data. The transmission unit 702 is specifically configured to send the module layer data to the module collector unit provided in the monitoring server, to send the platform layer data to the platform collector unit provided in the monitoring server, and to send the user layer data to the user collector unit provided in the monitoring server.
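The request-driven transfer of the three data layers can be sketched in Python as follows. The class names, the `handle_request` method, and the item names are hypothetical; the patent only states that the control server sends each layer's data to the matching collector unit in response to a request from the monitoring server.

```python
from typing import Dict, List

LAYERS = ("module", "platform", "user")


class ControlServer:
    """Holds monitoring data generated while controlling the virtual machines."""

    def __init__(self) -> None:
        self._data: Dict[str, List[dict]] = {layer: [] for layer in LAYERS}

    def record(self, layer: str, item: str, value: float) -> None:
        self._data[layer].append({"layer": layer, "item": item, "value": value})

    def handle_request(self, layer: str) -> List[dict]:
        """Answer a monitoring-server request with the pending data of one layer."""
        pending, self._data[layer] = self._data[layer], []
        return pending


class LayerCollector:
    """Module, platform, or user collector unit on the monitoring server."""

    def __init__(self, layer: str, control: ControlServer) -> None:
        self.layer = layer
        self.control = control
        self.received: List[dict] = []

    def pull(self) -> int:
        """Request this layer's data from the control server and keep it."""
        batch = self.control.handle_request(self.layer)
        self.received.extend(batch)
        return len(batch)


if __name__ == "__main__":
    control = ControlServer()
    control.record("module", "scheduler_errors", 2.0)
    control.record("user", "vm_create_latency_s", 8.5)
    collectors = {layer: LayerCollector(layer, control) for layer in LAYERS}
    print({layer: unit.pull() for layer, unit in collectors.items()})
```

Separating the collectors by layer keeps module, platform, and user data apart from the moment of collection, which matches the per-layer storage described elsewhere in the document.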
As can be seen from the above description of the embodiments of the present invention, the control server sends to the monitoring server the monitoring data that exists when the control server controls each virtual machine, and the monitoring server also obtains the monitoring data reported by the agent unit in each virtual machine. The monitoring server stores the acquired monitoring data in a database, reads monitoring data from the database for anomaly analysis, issues alerts for monitoring data that is abnormal, and stores the abnormal monitoring data in the database. Because the monitoring data acquired by the monitoring server comes from both the virtual machines and the control server that controls them, the monitoring coverage of the monitoring server is broader: it spans the virtual machines themselves and the control server that controls each virtual machine. Anomaly analysis based on the monitoring data collected from the virtual machines and the control server can accurately identify the abnormal monitoring data and raise alerts for it, thereby realizing a three-dimensional, all-round monitoring system.
Fig. 8 is a schematic structural diagram of a monitoring server provided in an embodiment of the present invention. The monitoring server 800 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPU) 822 (for example, one or more processors), a memory 832, and one or more storage media 830 (for example, one or more mass storage devices) storing application programs 842 or data 844. The memory 832 and the storage medium 830 may provide transient or persistent storage. The program stored in the storage medium 830 may include one or more modules (not marked in the figure), and each module may include a series of instruction operations for the server. Further, the central processing unit 822 may be configured to communicate with the storage medium 830 and to execute, on the monitoring server 800, the series of instruction operations in the storage medium 830.
The monitoring server 800 may also include one or more power supplies 826, one or more wired or wireless network interfaces 850, one or more input/output interfaces 858, and/or one or more operating systems 841, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
The steps performed by the monitoring server in the above embodiments may be based on the monitoring server structure shown in Fig. 5-a, Fig. 5-b, and Fig. 5-c.
Fig. 9 is a schematic structural diagram of a virtual machine provided in an embodiment of the present invention. The virtual machine 900 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPU) 922 (for example, one or more processors), a memory 932, and one or more storage media 930 (for example, one or more mass storage devices) storing application programs 942 or data 944. The memory 932 and the storage medium 930 may provide transient or persistent storage. The program stored in the storage medium 930 may include one or more modules (not marked in the figure), and each module may include a series of instruction operations for the server. Further, the central processing unit 922 may be configured to communicate with the storage medium 930 and to execute, on the virtual machine 900, the series of instruction operations in the storage medium 930.
The virtual machine 900 may also include one or more power supplies 926, one or more wired or wireless network interfaces 950, one or more input/output interfaces 958, and/or one or more operating systems 941, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
The steps performed by the virtual machine in the above embodiments may be based on the agent unit structure shown in Fig. 6.
Fig. 10 is a schematic structural diagram of a control server provided in an embodiment of the present invention. The control server 1000 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPU) 1022 (for example, one or more processors), a memory 1032, and one or more storage media 1030 (for example, one or more mass storage devices) storing application programs 1042 or data 1044. The memory 1032 and the storage medium 1030 may provide transient or persistent storage. The program stored in the storage medium 1030 may include one or more modules (not marked in the figure), and each module may include a series of instruction operations for the server. Further, the central processing unit 1022 may be configured to communicate with the storage medium 1030 and to execute, on the control server 1000, the series of instruction operations in the storage medium 1030.
The control server 1000 may also include one or more power supplies 1026, one or more wired or wireless network interfaces 1050, one or more input/output interfaces 1058, and/or one or more operating systems 1041, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
The steps performed by the control server in the above embodiments may be based on the control server structure shown in Fig. 7.
It should also be noted that the apparatus embodiments described above are merely illustrative. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. In addition, in the drawings of the apparatus embodiments provided by the present invention, a connection between modules indicates that they have a communication connection, which may be implemented as one or more communication buses or signal lines. Those of ordinary skill in the art can understand and implement this without creative effort.
From the above description of the embodiments, those skilled in the art can clearly understand that the present invention may be implemented by software plus the necessary general-purpose hardware, and of course also by dedicated hardware, including application-specific integrated circuits, dedicated CPUs, dedicated memory, dedicated components, and the like. In general, any function performed by a computer program can easily be implemented with corresponding hardware, and the specific hardware structures used to implement the same function can vary, for example analog circuits, digital circuits, or dedicated circuits. For the present invention, however, a software implementation is the preferred embodiment in most cases. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, may be embodied in the form of a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, USB flash drive, removable hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk, or optical disc, and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute the methods described in the embodiments of the present invention.
In summary, the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that the technical solutions described in the above embodiments may still be modified, or some of their technical features may be equivalently replaced, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (25)

1. A monitoring framework design method, characterized by comprising:
A monitoring server obtaining the monitoring data reported respectively by the agent unit in each virtual machine, and obtaining from a control server the monitoring data that exists when the control server controls each virtual machine; wherein the obtaining from the control server of the monitoring data that exists when the control server controls each virtual machine comprises: obtaining from the control server the module layer data, platform layer data, and user layer data that exist when the control server controls each virtual machine through a virtualization platform, wherein multiple modules are deployed on the virtualization platform, and the user layer data includes virtual machine operation data that exists when a user uses a virtual machine;
The monitoring server storing the acquired monitoring data in a database;
The monitoring server reading monitoring data from the database for anomaly analysis, issuing alerts for monitoring data that is abnormal, and storing the abnormal monitoring data in the database.
2. The method according to claim 1, characterized in that, after the storing of the abnormal monitoring data in the database, the method further comprises:
The monitoring server extracting the abnormal monitoring data from the database and displaying it externally for fault analysis.
3. The method according to claim 1, characterized in that the monitoring server obtaining the monitoring data reported respectively by the agent unit in each virtual machine comprises:
The monitoring server obtaining the monitoring data reported in real time by the agent unit in each virtual machine according to preset monitored items and a preset collection period;
The obtaining from the control server of the monitoring data that exists when the control server controls each virtual machine comprises:
The monitoring server obtaining from the control server the monitoring data sent in real time by the control server according to preset monitored items and a preset collection period.
4. The method according to claim 3, characterized in that the monitoring server reading monitoring data from the database for anomaly analysis comprises:
The monitoring server performing anomaly analysis on the multiple groups of monitoring data acquired, and determining the monitored items that are abnormal;
The issuing of alerts for monitoring data that is abnormal comprises:
The monitoring server aggregating the multiple pieces of abnormal monitoring data of the same monitored item determined to be abnormal, and issuing a centralized alert according to the aggregation result.
5. The method according to any one of claims 1 to 3, characterized in that the monitoring server obtaining the monitoring data reported respectively by the agent unit in each virtual machine comprises:
The monitoring server obtaining the system layer data and process layer data reported respectively by the agent unit in each virtual machine.
6. The method according to claim 5, characterized in that the monitoring server obtaining the system layer data and process layer data reported respectively by the agent unit in each virtual machine comprises:
Providing a system collector unit and a process collector unit in the monitoring server, the system collector unit being configured to receive the system layer data reported respectively by the agent unit in each virtual machine, and the process collector unit being configured to receive the process layer data reported respectively by the agent unit in each virtual machine.
7. The method according to claim 5, characterized in that the monitoring server storing the acquired monitoring data in a database comprises:
Providing a system data sub-database and a process data sub-database in the monitoring server, the monitoring server storing the system layer data in the system data sub-database and storing the process layer data in the process data sub-database.
8. The method according to claim 1, characterized in that the monitoring server obtaining from the control server the module layer data, platform layer data, and user layer data that exist when the control server controls each virtual machine through the virtualization platform comprises:
Providing a module collector unit, a platform collector unit, and a user collector unit in the monitoring server, the module collector unit being configured to pull from the control server the module layer data generated when the control server controls each virtual machine through the virtualization platform; the platform collector unit being configured to pull from the control server the platform layer data of the virtualization platform; and the user collector unit being configured to pull from the control server the user layer data generated when a user uses a virtual machine.
9. The method according to claim 1, characterized in that the monitoring server storing the acquired monitoring data in a database comprises:
Providing a module data sub-database, a platform data sub-database, and a user data sub-database in the monitoring server, the monitoring server storing the module layer data in the module data sub-database, storing the platform layer data in the platform data sub-database, and storing the user layer data in the user data sub-database.
10. A monitoring framework design method, characterized by comprising:
An agent unit monitoring a virtual machine and generating monitoring data according to the operation data of the virtual machine, the agent unit being deployed in the virtual machine;
The agent unit reporting the monitoring data to a monitoring server, wherein the monitoring server also obtains from a control server the monitoring data that exists when the control server controls each virtual machine, stores the acquired monitoring data in a database, reads monitoring data from the database for anomaly analysis, issues alerts for monitoring data that is abnormal, and stores the abnormal monitoring data in the database; the monitoring server also obtaining from the control server the monitoring data that exists when the control server controls each virtual machine comprises: the monitoring server also obtaining from the control server the module layer data, platform layer data, and user layer data that exist when the control server controls each virtual machine through a virtualization platform, wherein multiple modules are deployed on the virtualization platform, and the user layer data includes virtual machine operation data that exists when a user uses a virtual machine.
11. The method according to claim 10, characterized in that the agent unit monitoring the virtual machine and generating monitoring data according to the operation data of the virtual machine comprises:
The agent unit monitoring each item of operation data of the virtual machine in real time according to the monitored items set in a configuration file, and generating multiple groups of monitoring data according to the collection period set in the configuration file.
12. The method according to claim 10 or 11, characterized in that the monitoring data includes system layer data and process layer data;
The agent unit reporting the monitoring data to the monitoring server comprises:
The agent unit sending the system layer data to the system collector unit provided in the monitoring server, and sending the process layer data to the process collector unit provided in the monitoring server.
13. A monitoring framework design method, characterized by comprising:
A control server generating the monitoring data when controlling each virtual machine; the monitoring data including module layer data, platform layer data, and user layer data; wherein the module layer data refers to the monitoring data obtained by the control server monitoring each module in the virtualization platform, the platform layer data refers to the monitoring data obtained by the control server monitoring the virtualization platform, and the user layer data refers to the monitoring data obtained by the control server monitoring users' use of virtual machines;
The control server sending the monitoring data to a monitoring server according to a request from the monitoring server; wherein the control server sending the monitoring data to the monitoring server according to the request from the monitoring server comprises: the control server sending the module layer data to the module collector unit provided in the monitoring server, sending the platform layer data to the platform collector unit provided in the monitoring server, and sending the user layer data to the user collector unit provided in the monitoring server.
14. The method according to claim 13, characterized in that the control server generating the monitoring data when controlling each virtual machine comprises:
The control server generating multiple groups of monitoring data according to the monitored items and the collection period set in a configuration file.
15. A monitoring server, characterized by comprising:
A collector unit, configured to obtain the monitoring data reported respectively by the agent unit in each virtual machine, and to obtain from a control server the monitoring data that exists when the control server controls each virtual machine; wherein the collector unit further includes a module collector unit, a platform collector unit, and a user collector unit, the module collector unit being configured to pull from the control server the module layer data generated when the control server controls each virtual machine through a virtualization platform; the platform collector unit being configured to pull from the control server the platform layer data of the virtualization platform; and the user collector unit being configured to pull from the control server the user layer data generated when a user uses a virtual machine; multiple modules are deployed on the virtualization platform, and the user layer data includes virtual machine operation data that exists when a user uses a virtual machine;
A storage unit, configured to store the acquired monitoring data in a database;
An alarm unit, configured to read monitoring data from the database for anomaly analysis and to issue alerts for monitoring data that is abnormal;
The storage unit being further configured to store the abnormal monitoring data in the database.
16. The monitoring server according to claim 15, characterized in that the monitoring server further comprises a display unit, configured to extract the abnormal monitoring data from the database after the storage unit stores the abnormal monitoring data in the database, and to display it externally for fault analysis.
17. The monitoring server according to claim 15, characterized in that the collector unit is specifically configured to obtain the monitoring data reported in real time by the agent unit in each virtual machine according to preset monitored items and a preset collection period, and to obtain from the control server the monitoring data sent in real time by the control server according to preset monitored items and a preset collection period.
18. The monitoring server according to claim 15, characterized in that the alarm unit is specifically configured to perform anomaly analysis on the multiple groups of monitoring data acquired and determine the monitored items that are abnormal, to aggregate the multiple pieces of abnormal monitoring data of the same monitored item determined to be abnormal, and to issue a centralized alert according to the aggregation result.
19. The monitoring server according to any one of claims 15 to 17, characterized in that the collector unit includes a system collector unit and a process collector unit, the system collector unit being configured to receive the system layer data reported respectively by the agent unit in each virtual machine, and the process collector unit being configured to receive the process layer data reported respectively by the agent unit in each virtual machine.
20. The monitoring server according to claim 19, characterized in that a system data sub-database and a process data sub-database are provided in the monitoring server, and the storage module is specifically configured to store the system layer data in the system data sub-database and to store the process layer data in the process data sub-database;
A module data sub-database, a platform data sub-database, and a user data sub-database are provided in the monitoring server, and the storage module is specifically configured to store the module layer data in the module data sub-database, to store the platform layer data in the platform data sub-database, and to store the user layer data in the user data sub-database.
21. An agent unit, characterized in that the agent unit is deployed in a virtual machine and comprises a monitoring subunit and a transmission subunit, wherein:
The monitoring subunit is configured to monitor the virtual machine and generate monitoring data according to the operation data of the virtual machine;
The transmission subunit is configured to report the monitoring data to a monitoring server; wherein the monitoring server also obtains from a control server the monitoring data that exists when the control server controls each virtual machine, stores the acquired monitoring data in a database, reads monitoring data from the database for anomaly analysis, issues alerts for monitoring data that is abnormal, and stores the abnormal monitoring data in the database; the monitoring server also obtaining from the control server the monitoring data that exists when the control server controls each virtual machine comprises: the monitoring server also obtaining from the control server the module layer data, platform layer data, and user layer data that exist when the control server controls each virtual machine through a virtualization platform, wherein multiple modules are deployed on the virtualization platform, and the user layer data includes virtual machine operation data that exists when a user uses a virtual machine.
22. The agent unit according to claim 21, characterized in that the monitoring subunit is specifically configured to monitor each item of operation data of the virtual machine in real time according to the monitored items set in a configuration file, and to generate multiple groups of monitoring data according to the collection period set in the configuration file.
23. The agent unit according to claim 21 or 22, characterized in that the monitoring data includes system layer data and process layer data;
The transmission subunit is specifically configured to send the system layer data to the system collector unit provided in the monitoring server, and to send the process layer data to the process collector unit provided in the monitoring server.
24. A control server, characterized by comprising a data generating unit and a transmission unit, wherein:
The data generating unit is configured to generate the monitoring data when controlling each virtual machine; wherein the monitoring data includes module layer data, platform layer data, and user layer data; the module layer data refers to the monitoring data obtained by the control server monitoring each module in the virtualization platform, the platform layer data refers to the monitoring data obtained by the control server monitoring the virtualization platform, and the user layer data refers to the monitoring data obtained by the control server monitoring users' use of virtual machines;
The transmission unit is configured to send the monitoring data to a monitoring server according to a request from the monitoring server; wherein the transmission unit is specifically configured to send the module layer data to the module collector unit provided in the monitoring server, to send the platform layer data to the platform collector unit provided in the monitoring server, and to send the user layer data to the user collector unit provided in the monitoring server.
25. The control server according to claim 24, characterized in that the data generating unit is specifically configured to generate multiple groups of monitoring data according to the monitored items and the collection period set in a configuration file.
CN201510031593.2A 2015-01-21 2015-01-21 Monitoring framework design method and monitoring server, agent unit, control server Active CN105871957B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510031593.2A CN105871957B (en) 2015-01-21 2015-01-21 Monitoring framework design method and monitoring server, agent unit, control server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510031593.2A CN105871957B (en) 2015-01-21 2015-01-21 Monitoring framework design method and monitoring server, agent unit, control server

Publications (2)

Publication Number Publication Date
CN105871957A CN105871957A (en) 2016-08-17
CN105871957B true CN105871957B (en) 2019-02-05

Family

ID=56623154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510031593.2A Active CN105871957B (en) 2015-01-21 2015-01-21 Monitoring framework design method and monitoring server, agent unit, control server

Country Status (1)

Country Link
CN (1) CN105871957B (en)

Also Published As

Publication number Publication date
CN105871957A (en) 2016-08-17

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant