CN107894944A - A kind of intelligent control method and system based under big data and cloud calculation service - Google Patents

A kind of intelligent control method and system based under big data and cloud calculation service Download PDF

Info

Publication number
CN107894944A
CN107894944A CN201711241870.8A CN201711241870A CN107894944A CN 107894944 A CN107894944 A CN 107894944A CN 201711241870 A CN201711241870 A CN 201711241870A CN 107894944 A CN107894944 A CN 107894944A
Authority
CN
China
Prior art keywords
warning information
operation system
center
monitoring strategies
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711241870.8A
Other languages
Chinese (zh)
Inventor
陈祥晓
陈国镇
刘畅
罗龙
李强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Three League Polytron Technologies Inc
Original Assignee
Three League Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Three League Polytron Technologies Inc filed Critical Three League Polytron Technologies Inc
Priority to CN201711241870.8A priority Critical patent/CN107894944A/en
Publication of CN107894944A publication Critical patent/CN107894944A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Alarm Systems (AREA)

Abstract

The invention discloses a kind of intelligent control method based under big data and cloud calculation service, including:Surveillance center sets the monitoring strategies of differentiation in units of operation system;When operation system triggers monitoring strategies, Surveillance center's generation warning information simultaneously sends warning information to Intelligent treatment center and intellectual analysis center;Intelligent treatment center is handled operation system according to the warning information and processing strategy, and the processing strategy includes expanding policy, take-back strategy and cooling strategy;Intellectual analysis central collection and statistical analysis warning information, and result is fed back into Surveillance center.The invention also discloses based on the intelligent monitor system under big data and cloud calculation service., can be according to the differentiation monitoring strategies that different business systems can customize using the present invention, and Intelligent treatment center and analysis center are provided, the alarm to appearance carries out intelligent processing method.

Description

A kind of intelligent control method and system based under big data and cloud calculation service
Technical field
The present invention relates to areas of information technology, more particularly to a kind of intelligent monitoring based under big data and cloud calculation service Method and based on the intelligent monitor system under big data and cloud calculation service.
Background technology
After VMware in 1998 propose the virtualization technology of X86-based, the development of virtualization technology tide is swift and violent, and short After short a few years, the concept of the cloud computing based on virtualization technology is suggested, and user can take on demand in cloud computing environment With resource, resource is used based on high in the clouds whenever and wherever possible under conditions of network is reachable.Pass through the development of more than 10 years, virtualization technology It is gradually ripe with cloud computing technology, and be most of user admissions, increasing business will be all run on beyond the clouds.And according to 《State Council cultivates the decision with development strategy new industry on accelerating》In the expected future of strategic new industry that refers to, At least more than 30% growth level will be reached in the five-year of Chinese cloud computing market.
On the other hand, under the fast-developing overall background of global IT application, big data has turned into important basic of country Strategic resource, just leading new round scientific and technical innovation.Issued according to Chinese information Communication Studies institute《Chinese big data hair in 2015 Open up survey report》It has been shown that, Chinese big data market scale in 2015 are up to 115.9 hundred million yuan, and speedup is up to 38%.Furthermore it is contemplated that 2016 to 2018 years Chinese big data market scales are also by the rapid growth of maintenance 40% or so.
Cloud computing and big data technology are the main trends of Information Technology Development so far at the beginning of 21 century, and due to cloud computing The flexible characteristic easily extended, it turns into the base node framework of big data data analysis and process, started in 2013, two kinds Technology has shown highly effective and close marriage relation, and this relation will be more close in future development.
Based on above technology trends, big data business can be not only supported on cloud computing platform, also has substantial amounts of biography Service operation unite above.Based under this pattern, it would be desirable to a kind of intelligent monitoring system is provided, to big data business and Cloud calculation service is monitored management, helps user to grasp service operation situation in real time.
Monitoring management mode on cloud computing platform common at present is mainly following several:
(1) basis monitoring:The CPU of physical server and virtual machine operations, internal memory operation, storage running, IO are read and write Monitored in real time, there is provided availability data, there is provided alarming mechanism, alerted according to fixed threshold triggers.
(2) advanced monitoring:The object of monitoring includes underlying resource, application layer, safety etc..Monitored object is comprehensive, that is, includes Underlying resource, also including the application monitoring above resource, while also the safety under virtualized environment is monitored.
Two above-mentioned technical schemes, it is more stiff, not enough intelligently.In scheme (1), simply to the resource of cloud computing environment Aspect is monitored management, and user can only be helped to find the bottleneck that resource uses in time, can prevent in certain degree big Data service and traditional operation system go wrong.And in scheme (2), although the scope of monitoring is wider, can monitor The underlying resource of service operation, the application running situation on upper strata can be also monitored, triggering warning information can also be in time provided and helped User solves the problems, such as service operation, but scheme (2) is not still optimal monitoring scheme, because each operation system is the bottom of to The sensitivity of layer resource is that different, i.e., different operation system is not using reaction caused by bottleneck to same resource Equally.For example in big data business, the collection of data and processing procedure have bigger requirement to CPU computings, IO read-writes, such as Fruit network goes wrong, and read-write postpones, then the data collection service of whole big data will receive serious influence.It is another Aspect, in scheme (1) and scheme (2), all without the monitoring and treatment mechanism for providing the slack resources to become more meticulous.
The content of the invention
The technical problems to be solved by the invention are, there is provided a kind of intelligence prison based under big data and cloud calculation service Method and system are controlled, may be according to the differentiation monitoring strategies that different business systems can customize, and intelligent place is provided Reason center and analysis center, the alarm to appearance carry out intelligent processing method.
In order to solve the above-mentioned technical problem, the invention provides a kind of intelligence prison based under big data and cloud calculation service Prosecutor method, including:Surveillance center sets the monitoring strategies of differentiation in units of operation system;When operation system triggers monitoring plan When slightly, Surveillance center's generation warning information simultaneously sends warning information to Intelligent treatment center and intellectual analysis center;At intelligence Reason center according to the warning information and processing strategy operation system is handled, it is described processing strategy include expanding policy, Take-back strategy and cooling strategy;Intellectual analysis central collection and statistical analysis warning information, and result is fed back into Surveillance center.
As the improvement of such scheme, the monitoring strategies include underlying resource basis monitoring strategies, upper-layer service application Monitoring strategies and the advanced monitoring strategies of slack resources.
As the improvement of such scheme, underlying resource basis monitoring strategies include:Surveillance center obtains virtual in real time The technical indicator of main frame, the technical indicator include central processor CPU utilization rate, fictitious host computer memory usage and disk I/O Utilization rate;The threshold value and logical relation of each technical indicator triggering alarm are set respectively.
As the improvement of such scheme, the upper-layer service application monitoring strategies include:Surveillance center obtains operation system In the application installed, and set and open business monitoring function.
As the improvement of such scheme, the advanced monitoring strategies of slack resources include:Set for the virtual machine of operation system Determine monitor control index, the monitor control index includes index single-point and left unused threshold value, single-point acquiring cycle, idle measurement period, index spare time Put ratio and logical relation;The idle result of virtual machine is judged according to monitor control index.
As the improvement of such scheme, the intellectual analysis central collection and statistical analysis warning information simultaneously feed back result Method to Surveillance center includes:The warning information of Surveillance center is obtained, the warning information includes:Alarm object, virtual machine Index parameter value and metrics-thresholds when the operation system at place, alarm triggered time, the type of alarm, triggering alert;Division is accused The alarm type of alert information, the alarm type include underlying resource basis, upper-layer service application, slack resources;According to alarm Type carries out law-analysing and anticipation to warning information.
Correspondingly, present invention also offers a kind of intelligent monitor system based under big data and cloud calculation service, including: Surveillance center, for setting the monitoring strategies of differentiation in units of operation system, and when operation system triggers monitoring strategies Generation warning information simultaneously sends warning information to Intelligent treatment center and intellectual analysis center;Intelligent treatment center, for root Operation system is handled according to the warning information and processing strategy, the processing strategy includes expanding policy, take-back strategy And cooling strategy;Intellectual analysis center, for collecting simultaneously statistical analysis warning information, and result is fed back into Surveillance center.
As the improvement of such scheme, the Surveillance center includes:Monitoring strategies setting unit, for using operation system as Unit sets the monitoring strategies of differentiation;Warning information generation unit, for the generation announcement when operation system triggers monitoring strategies Alert information;Warning information transmitting element, for warning information to be sent to Intelligent treatment center and intellectual analysis center.
As the improvement of such scheme, the monitoring strategies setting unit includes:Underlying resource basis monitoring strategies are set Unit, for obtaining the technical indicator of fictitious host computer in real time and setting the threshold value of each technical indicator triggering alarm and logic to close respectively System;Upper-layer service application monitoring strategies setting unit, for obtaining the application installed in operation system and setting unlatching business to supervise Control function;The advanced monitoring strategies setting unit of slack resources, for setting monitor control index and basis for the virtual machine of operation system Monitor control index judges the idle result of virtual machine.
As the improvement of such scheme, the intellectual analysis center includes:Acquiring unit, for obtaining the announcement of Surveillance center Alert information, the warning information include:Operation system, alarm triggered time, the class of alarm where alarm object, virtual machine Index parameter value and metrics-thresholds when type, triggering alarm;Division unit, for dividing the alarm type of warning information, the announcement Alert type includes underlying resource basis, upper-layer service application, slack resources;Analytic unit, for according to alarm type to alarm Information carries out law-analysing and anticipation;Feedback unit, for result to be fed back into Surveillance center.
Implement the present invention, have the advantages that:
1st, the present invention is the self-defined monitoring system using operation system as granularity, it is allowed to which user is in magnanimity operation system On cloud computing platform in units of operation system, the monitoring strategies with differentiation are set.
2nd, Surveillance center can link Intelligent treatment center, when operation system triggers the prison set in the process of running Control strategy, solves the problems, such as resource bottleneck automatically by system.
3rd, slack resources policing algorithm process support it is self-defined, can with user-defined counter single-point leave unused threshold value, collection period, Measurement period, idle ratio logical relation.
4th, monitoring system includes intellectual analysis center, analysis result reverse correlation Surveillance center, helps user setting prison Rational monitoring is carried out when controlling tactful according to the data at intellectual analysis center to set.
Brief description of the drawings
Fig. 1 is the flow chart of the invention based on the intelligent control method under big data and cloud calculation service;
Fig. 2 is the structural representation of the invention based on the intelligent monitor system under big data and cloud calculation service;
Fig. 3 is the structural representation of Tu2Zhong Surveillance center;
Fig. 4 is the structural representation at intellectual analysis center in Fig. 2.
Embodiment
To make the object, technical solutions and advantages of the present invention clearer, the present invention is made into one below in conjunction with accompanying drawing It is described in detail on step ground.Only this is stated, appearance in the text of the invention or the side such as the up, down, left, right, before and after that will appear from, inside and outside Position word, only on the basis of the accompanying drawing of the present invention, it is not the specific restriction to the present invention.
Referring to Fig. 1, Fig. 1 shows the implementation of the invention based on the intelligent control method under big data and cloud calculation service Example, it includes:
S101, Surveillance center set the monitoring strategies of differentiation in units of operation system;
It is high that the monitoring strategies include underlying resource basis monitoring strategies, upper-layer service application monitoring strategies and slack resources Level monitoring strategies.Specifically:
Underlying resource basis monitoring strategies include:Surveillance center obtains the technical indicator of fictitious host computer in real time, and divides The threshold value and logical relation of each technical indicator triggering alarm are not set.The technical indicator include central processor CPU utilization rate, Fictitious host computer memory usage and disk I/O utilization rate.
It should be noted that user can be respectively to " central processor CPU utilizes in the monitoring strategies of underlying resource basis The self-defined threshold value that triggering alarm is set of these three technical indicators of rate, fictitious host computer memory usage and disk I/O utilization rate ", and And can be with the logical relation of these three self-defined technical indicators.For example A virtual machines, user can be set when cpu busy percentage height In 80% and memory usage be higher than 90% when trigger virtual machine alarming mechanism at once.
The upper-layer service application monitoring strategies include:Surveillance center obtains the application installed in operation system, and sets Open business monitoring function.Specifically, service application monitoring is supported:MySQL, Oracle, web are applied.MySQL:Operation system, Triggered time, connection status, thread connection number, most frequent SQL, slow query SQL;Oracle:Operation system, triggered time, company Connect state, thread connection number, most frequent SQL, slow query SQL;Web is applied:Operation system, port numbers, connection status.
The advanced monitoring strategies of slack resources include:Monitor control index is set for the virtual machine of operation system and according to monitoring Index judges the idle result of virtual machine.The monitor control index includes the idle threshold value of index single-point, single-point acquiring cycle, idle system Count cycle, the idle ratio of index and logical relation;
The idle threshold value of index single-point is set for the virtual machine of operation system, i.e., judges that virtual machine is on certain acquisition time The no criterion to leave unused, corresponding specific parameter are exactly CPU, internal memory, the idle threshold value of storage.As setting target single-point is not busy Put threshold value:Cpu busy percentage≤85%, memory usage≤60%, space utilisation≤90%, then what each collection point obtained Data all can be contrasted and recorded with the idle threshold value of single-point set.
Single-point acquiring cycle, that is, the interval of acquisition time are set for the virtual machine of operation system.It is assumed to be business system The collection period that the A that unites is set is 30 minutes, then monitoring system can gather 4 void in an operation system A every 30 minutes The CPU of plan machine, internal memory, space utilisation are simultaneously contrasted with single-point threshold value of leaving unused.
For the idle measurement period of virtual machine setting of operation system, that is, it is arranged in a period of time, the statistics for situation of leaving unused. The idle measurement period for being assumed to be operation system A settings is 10 days, then monitoring system can be tactful with 10 days by what is set For a cycle, in this 10 days, all data acquisition and idle monitoring were carried out to operation system A virtual machine every 30 minutes, Draw idle result.
The idle ratio of index and logical relation are set for the virtual machine of operation system.CPU, internal memory, storage index are set The idle standard of item, it is assumed that be that the idle ratio that CPU is set is 70%, the idle ratio for being memory setting is 80%, is set for storage The idle ratio put be 90%, their logical relation be and.Hereinbefore, we are provided with for operation system A virtual machine The measurement period of 10 days and the collection period of 30 minutes, and it is provided with the idle threshold value of index single-point, respectively CPU for virtual machine Utilization rate≤85%, memory usage≤60%, space utilisation≤90%.Every virtual machine collects quantity in so 10 days For:10 × 24 × 60/30=480.The value of i.e. 480 cpu busy percentages, the value of 480 memory usages, 480 storages utilize The value of rate.Assuming that in this 10 day time, the number of cpu busy percentage≤85% collected accounts for total number and accounts for total number 480 Ratio more than 70%, and the number of memory usage≤60% collected accounts for the ratio that total number accounts for total number 480 Example is more than 80%, and the number of space utilisation≤60% collected accounts for total number and accounts for the ratio of total number 480 and surpasses Cross 90%, then the virtual machine is may determine that in this 10 days measurement period, for the virtual machine that leaves unused.
S102, when operation system trigger monitoring strategies when, Surveillance center generation warning information and by warning information send to Intelligent treatment center and intellectual analysis center;
S103, Intelligent treatment center are handled operation system according to the warning information and processing strategy;
It should be noted that Intelligent treatment center linkage surveillance center, virtual machine underlying resource using once there is bottleneck, Resource can be laterally added, and inherit automatically into affiliated operation system, one according to pre-set Intelligent treatment mode The virtual machine that last time laterally adds will be removed out operation system, and release by denier resource automatically using the condition for reaching release With the relation of other virtual machines in operation system.
Specifically, the processing strategy includes expanding policy, take-back strategy and cooling strategy.
Expanding policy:The underlying resource basis monitoring at linkage surveillance center, obtaining the resource of virtual machine under operation system makes Use situation.The quantity and resource specification for the virtual machine for needing laterally to add are set, and the virtual machine laterally added is based on former empty Plan machine is what masterplate was carried out.
Take-back strategy:Obtain the resource service condition of virtual machine under operation system.The recovery threshold of same index parameter is set Value, activation threshold value is run, by the virtual machine of automatic recovery expanding policy addition.
Cool time:Start to calculate cool time after addition virtual machine success, within cool time, do not perform recovery plan Slightly.
Way of recycling:Delete manually, automatic shutdown.
Using operation system A as example, set when virtual machine cpu busy percentage is higher than 80% and memory usage is higher than 90% When trigger virtual machine alarming mechanism at once, linkage Intelligent treatment center is handled.Assuming that virtual machine ACPU utilization rates It has been higher than 90%, then triggering alarm, and by Intelligent treatment center, using virtual machine A as masterplate, has laterally added into operation system A Add 2 virtual machines.In the process of running, virtual machine A and the average CPU utilization by it for two virtual machines of masterplate addition Less than threshold value (such as:20%), and cool time has been had been subjected to, then the resource of the two days virtual machines newly added can be reclaimed, returned The mode of receipts supports to delete manually and automatic shutdown.
S104, intellectual analysis central collection and statistical analysis warning information, and result is fed back into Surveillance center.
Intellectual analysis center is counted the warning information of all triggerings.Then according to certain rule carry out classification and Obtained conclusion reverse feedback, is referred to analysis center by analysis to Surveillance center, user when monitoring strategies are set The trend of offer is configured.
Specifically, the intellectual analysis central collection and statistical analysis warning information and result is fed back into Surveillance center Method includes:
A, the warning information of Surveillance center is obtained, the warning information includes:Business system where alarm object, virtual machine Index parameter value and metrics-thresholds when system, alarm triggered time, the type of alarm, triggering alarm;
B, the alarm type of warning information is divided according to the criterion of Surveillance center, the alarm type includes underlying resource base Plinth, upper-layer service application, slack resources, are specifically classified as follows shown in table:
C, law-analysing and anticipation are carried out to warning information according to alarm type.
Underlying resource basis:CPU, internal memory are analyzed respectively, store three dimensions and time, the relation of threshold value, and according to There is the tendency of bottleneck in resource in rule one week future of anticipation in past one month.
Upper-layer service application:The rule of MySQL, Oracle query statement is analyzed respectively, counts the TOP most frequencies in one month Numerous SQL and TOP are inquired about slowly.And according to the most frequent SQL occurred in the rule in past one month one week future of anticipation and slowly Query statement.
Slack resources:It is most to analyze virtual machine quantity of being left unused existing for the operation system of which type, and in time The idle virtual machine number change trend of statistics, and walking for quantity of being left unused in following one week is prejudged according to the rule in past one month Gesture.
From the foregoing, it will be observed that the present invention allows user on the cloud computing platform of magnanimity operation system in units of operation system, Monitoring strategies with differentiation are set;And when these operation systems trigger the monitoring strategies set in the process of running, Can then link Intelligent treatment center, by Command Center process problem (for example, when the alarm of triggering meets that Intelligent treatment center is triggered During condition, it will it is laterally automatic in the operation system to add resource, dynamically to ensure demand of the business to resource);Intelligence simultaneously Energy analysis center can collect and warning information caused by statistical analysis, so as to which reverse feedback is to user, helps user setting prison Rational monitoring is carried out when controlling tactful according to the data at intellectual analysis center to set.
Referring to Fig. 2, Fig. 2 shows the knot of the invention based on the intelligent monitor system 100 under big data and cloud calculation service Structure schematic diagram, it includes:
Surveillance center 1, for setting the monitoring strategies of differentiation in units of operation system, and when operation system triggering prison Warning information is generated when controlling tactful and sends warning information to Intelligent treatment center 2 and intellectual analysis center 3;
Intelligent treatment center 2, for being handled according to the warning information and processing strategy operation system, the place Reason strategy includes expanding policy, take-back strategy and cooling strategy;It should be noted that Intelligent treatment center linkage surveillance center 1, Virtual machine underlying resource can laterally add resource using once there is bottleneck according to pre-set Intelligent treatment mode, And inherit automatically into affiliated operation system, once resource will laterally be added last automatically using the condition for reaching release The virtual machine added removes out operation system, and releases the relation with other virtual machines in operation system.Specifically, the processing plan Slightly include expanding policy, take-back strategy and cooling strategy.Expanding policy:The underlying resource basis monitoring at linkage surveillance center 1, is obtained Take the resource service condition of virtual machine under operation system.The quantity and resource specification for the virtual machine for needing laterally to add are set, The virtual machine laterally added is to be carried out based on former virtual machine for masterplate.Take-back strategy:Obtain the money of virtual machine under operation system Source service condition.The recovery threshold value of same index parameter is set, runs activation threshold value, by the void of automatic recovery expanding policy addition Plan machine.Cool time:Start to calculate cool time after addition virtual machine success, within cool time, do not perform take-back strategy.Return Debit's formula:Delete manually, automatic shutdown.Using operation system A as example, set when virtual machine cpu busy percentage higher than 80% and Memory usage triggers virtual machine alarming mechanism at once when being higher than 90%, and linkage Intelligent treatment center 2 is handled.It is false If virtual machine ACPU utilization rates have been higher than 90%, then triggering alarm, and by Intelligent treatment center 2, using virtual machine A as masterplate, 2 virtual machines are laterally added into operation system A.In the process of running, virtual machine A and two void by it for masterplate addition The average CPU utilization of plan machine is less than threshold value (such as:20%), and cool time has been had been subjected to, then can reclaim and newly add The resource of two days virtual machines, the mode of recovery supports to delete manually and automatic shutdown.
Intellectual analysis center 3, for collecting simultaneously statistical analysis warning information, and result is fed back into Surveillance center 1.Intelligence Analysis center 3 is counted the warning information of all triggerings, is then classified and is analyzed according to certain rule, obtaining Conclusion reverse feedback to Surveillance center 1, user be referred to when monitoring strategies are set intellectual analysis center 3 offer Trend is configured.
As shown in figure 3, the Surveillance center 1 includes:
Monitoring strategies setting unit 11, for setting the monitoring strategies of differentiation in units of operation system;The monitoring Strategy includes underlying resource basis monitoring strategies, upper-layer service application monitoring strategies and the advanced monitoring strategies of slack resources.
Warning information generation unit 12, for generating warning information when operation system triggers monitoring strategies;
Warning information transmitting element 13, for warning information to be sent to Intelligent treatment center 2 and intellectual analysis center 3.
Further, the monitoring strategies setting unit 11 includes:
Underlying resource basis monitoring strategies setting unit 11, for obtaining the technical indicator of fictitious host computer in real time and setting respectively Put the threshold value and logical relation of each technical indicator triggering alarm;The technical indicator includes central processor CPU utilization rate, virtual Host memory utilization rate and disk I/O utilization rate.It should be noted that user can distinguish in the monitoring strategies of underlying resource basis To " central processor CPU utilization rate, fictitious host computer memory usage and disk I/O utilization rate ", these three technical indicators are self-defined The threshold value of triggering alarm is set, and can be with the logical relation of these three self-defined technical indicators.Such as to A virtual machines, user It can set and trigger virtual machine alarm machine at once when cpu busy percentage is higher than 80% and memory usage is higher than 90% System.
Upper-layer service application monitoring strategies setting unit 112, for obtaining the application installed in operation system and setting is opened Open business monitoring function;Specifically, service application monitoring is supported:MySQL, Oracle, web are applied.MySQL:Operation system, touch Send out time, connection status, thread connection number, most frequent SQL, slow query SQL;Oracle:Operation system, triggered time, connection State, thread connection number, most frequent SQL, slow query SQL;Web is applied:Operation system, port numbers, connection status.
The advanced monitoring strategies setting unit 113 of slack resources, for setting monitor control index simultaneously for the virtual machine of operation system The idle result of virtual machine is judged according to monitor control index.The monitor control index includes index single-point and left unused threshold value, single-point acquiring week Phase, idle measurement period, the idle ratio of index and logical relation.
The idle threshold value of index single-point is set for the virtual machine of operation system, i.e., judges that virtual machine is on certain acquisition time The no criterion to leave unused, corresponding specific parameter are exactly CPU, internal memory, the idle threshold value of storage.As setting target single-point is not busy Put threshold value:Cpu busy percentage≤85%, memory usage≤60%, space utilisation≤90%, then what each collection point obtained Data all can be contrasted and recorded with the idle threshold value of single-point set.
Single-point acquiring cycle, that is, the interval of acquisition time are set for the virtual machine of operation system.It is assumed to be business system The collection period that the A that unites is set is 30 minutes, then monitoring system can gather 4 void in an operation system A every 30 minutes The CPU of plan machine, internal memory, space utilisation are simultaneously contrasted with single-point threshold value of leaving unused.
For the idle measurement period of virtual machine setting of operation system, that is, it is arranged in a period of time, the statistics for situation of leaving unused. The idle measurement period for being assumed to be operation system A settings is 10 days, then monitoring system can be tactful with 10 days by what is set For a cycle, in this 10 days, all data acquisition and idle monitoring were carried out to operation system A virtual machine every 30 minutes, Draw idle result.
The idle ratio of index and logical relation are set for the virtual machine of operation system.CPU, internal memory, storage index are set The idle standard of item, it is assumed that be that the idle ratio that CPU is set is 70%, the idle ratio for being memory setting is 80%, is set for storage The idle ratio put be 90%, their logical relation be and.Hereinbefore, we are provided with for operation system A virtual machine The measurement period of 10 days and the collection period of 30 minutes, and it is provided with the idle threshold value of index single-point, respectively CPU for virtual machine Utilization rate≤85%, memory usage≤60%, space utilisation≤90%.Every virtual machine collects quantity in so 10 days For:10 × 24 × 60/30=480.The value of i.e. 480 cpu busy percentages, the value of 480 memory usages, 480 storages utilize The value of rate.Assuming that in this 10 day time, the number of cpu busy percentage≤85% collected accounts for total number and accounts for total number 480 Ratio more than 70%, and the number of memory usage≤60% collected accounts for the ratio that total number accounts for total number 480 Example is more than 80%, and the number of space utilisation≤60% collected accounts for total number and accounts for the ratio of total number 480 and surpasses Cross 90%, then the virtual machine is may determine that in this 10 days measurement period, for the virtual machine that leaves unused.
As shown in figure 4, the intellectual analysis center 3 includes:
Acquiring unit 31, for obtaining the warning information of Surveillance center 1, the warning information includes:It is alarm object, virtual Index parameter value and metrics-thresholds when the type of operation system, alarm triggered time, alarm where machine, triggering alert;
Division unit 32, for dividing the alarm type of warning information, the alarm type include underlying resource basis, on Layer service application, slack resources;
Analytic unit 33, for carrying out law-analysing and anticipation to warning information according to alarm type;Wherein, underlying resource Basis:CPU, internal memory, three dimensions of storage and time, the relation of threshold value are analyzed respectively, and according to the rule in past one month There is the tendency of bottleneck in resource in anticipation is following one week.Upper-layer service application:MySQL, Oracle query statement are analyzed respectively Rule, the TOP most frequent SQL and TOP counted in one month are inquired about slowly.And future one is prejudged according to the rule in past one month The most frequent SQL and slow query statement occurred in week.Slack resources:Analyze void of being left unused existing for the operation system of which type Plan machine quantity is most, and counts idle virtual machine number change trend in time, and according to the rule in past one month The tendency for quantity of being left unused in anticipation is following one week.
Feedback unit 34, for result to be fed back into Surveillance center 1.
From the foregoing, it will be observed that the invention has the advantages that:
1st, the present invention is the self-defined monitoring system using operation system as granularity, it is allowed to which user is in magnanimity operation system On cloud computing platform in units of operation system, the monitoring strategies with differentiation are set.
2nd, Surveillance center can link Intelligent treatment center, solve the problems, such as resource bottleneck automatically by system.
3rd, slack resources policing algorithm process support it is self-defined, can with user-defined counter single-point leave unused threshold value, collection period, Measurement period, idle ratio logical relation.
4th, monitoring system includes intellectual analysis center, analysis result reverse correlation Surveillance center, there is provided decision data.
Described above is the preferred embodiment of the present invention, it is noted that for those skilled in the art For, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications are also considered as Protection scope of the present invention.

Claims (10)

  1. A kind of 1. intelligent control method based under big data and cloud calculation service, it is characterised in that including:
    Surveillance center sets the monitoring strategies of differentiation in units of operation system;
    When operation system triggers monitoring strategies, Surveillance center's generation warning information simultaneously sends warning information into Intelligent treatment The heart and intellectual analysis center;
    Intelligent treatment center is handled operation system according to the warning information and processing strategy, and the processing strategy includes Expanding policy, take-back strategy and cooling strategy;
    Intellectual analysis central collection and statistical analysis warning information, and result is fed back into Surveillance center.
  2. 2. as claimed in claim 1 based on the intelligent control method under big data and cloud calculation service, it is characterised in that described Monitoring strategies include underlying resource basis monitoring strategies, upper-layer service application monitoring strategies and the advanced monitoring strategies of slack resources.
  3. 3. as claimed in claim 2 based on the intelligent control method under big data and cloud calculation service, it is characterised in that described Underlying resource basis monitoring strategies include:
    Surveillance center obtains the technical indicator of fictitious host computer in real time, and the technical indicator includes central processor CPU utilization rate, void Intend host memory utilization rate and disk I/O utilization rate;
    The threshold value and logical relation of each technical indicator triggering alarm are set respectively.
  4. 4. as claimed in claim 2 based on the intelligent control method under big data and cloud calculation service, it is characterised in that described Upper-layer service application monitoring strategies include:Surveillance center obtains the application installed in operation system, and sets unlatching business monitoring Function.
  5. 5. as claimed in claim 2 based on the intelligent control method under big data and cloud calculation service, it is characterised in that described The advanced monitoring strategies of slack resources include:
    Monitor control index is set for the virtual machine of operation system, the monitor control index includes the idle threshold value of index single-point, single-point acquiring Cycle, idle measurement period, the idle ratio of index and logical relation;
    The idle result of virtual machine is judged according to monitor control index.
  6. 6. as claimed in claim 1 based on the intelligent control method under big data and cloud calculation service, it is characterised in that described Result is simultaneously fed back to the method for Surveillance center and included by intellectual analysis central collection and statistical analysis warning information:
    The warning information of Surveillance center is obtained, the warning information includes:Operation system, announcement where alarm object, virtual machine Index parameter value and metrics-thresholds when alert triggered time, the type of alarm, triggering alarm;
    The alarm type of warning information is divided, the alarm type includes underlying resource basis, upper-layer service application, idle money Source;
    Law-analysing and anticipation are carried out to warning information according to alarm type.
  7. A kind of 7. intelligent monitor system based under big data and cloud calculation service, it is characterised in that including:
    Surveillance center, for setting the monitoring strategies of differentiation in units of operation system, and when operation system triggering monitoring plan Warning information is generated when slightly and sends warning information to Intelligent treatment center and intellectual analysis center;
    Intelligent treatment center, for being handled according to the warning information and processing strategy operation system, the processing plan Slightly include expanding policy, take-back strategy and cooling strategy;
    Intellectual analysis center, for collecting simultaneously statistical analysis warning information, and result is fed back into Surveillance center.
  8. 8. as claimed in claim 7 based on the intelligent monitor system under big data and cloud calculation service, it is characterised in that described Surveillance center includes:
    Monitoring strategies setting unit, for setting the monitoring strategies of differentiation in units of operation system;
    Warning information generation unit, for generating warning information when operation system triggers monitoring strategies;
    Warning information transmitting element, for warning information to be sent to Intelligent treatment center and intellectual analysis center.
  9. 9. as claimed in claim 8 based on the intelligent monitor system under big data and cloud calculation service, it is characterised in that described Monitoring strategies setting unit includes:
    Underlying resource basis monitoring strategies setting unit, for obtaining the technical indicator of fictitious host computer in real time and setting each skill respectively The threshold value and logical relation of art index triggering alarm;
    Upper-layer service application monitoring strategies setting unit, for obtaining the application installed in operation system and setting unlatching business to supervise Control function;
    The advanced monitoring strategies setting unit of slack resources, for setting monitor control index and according to monitoring for the virtual machine of operation system Index judges the idle result of virtual machine.
  10. 10. as claimed in claim 7 based on the intelligent monitor system under big data and cloud calculation service, it is characterised in that institute Stating intellectual analysis center includes:
    Acquiring unit, for obtaining the warning information of Surveillance center, the warning information includes:Where alarm object, virtual machine Operation system, the alarm triggered time, the type of alarm, triggering alarm when index parameter value and metrics-thresholds;
    Division unit, for dividing the alarm type of warning information, the alarm type includes underlying resource basis, upper-layer service Using, slack resources;
    Analytic unit, for carrying out law-analysing and anticipation to warning information according to alarm type;
    Feedback unit, for result to be fed back into Surveillance center.
CN201711241870.8A 2017-11-30 2017-11-30 A kind of intelligent control method and system based under big data and cloud calculation service Pending CN107894944A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711241870.8A CN107894944A (en) 2017-11-30 2017-11-30 A kind of intelligent control method and system based under big data and cloud calculation service

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711241870.8A CN107894944A (en) 2017-11-30 2017-11-30 A kind of intelligent control method and system based under big data and cloud calculation service

Publications (1)

Publication Number Publication Date
CN107894944A true CN107894944A (en) 2018-04-10

Family

ID=61807365

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711241870.8A Pending CN107894944A (en) 2017-11-30 2017-11-30 A kind of intelligent control method and system based under big data and cloud calculation service

Country Status (1)

Country Link
CN (1) CN107894944A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111884878A (en) * 2020-07-24 2020-11-03 樊馨 Data monitoring method based on block chain
CN112291112A (en) * 2020-10-27 2021-01-29 中国光大银行股份有限公司 Cloud computing service monitoring system and method
CN117527523A (en) * 2023-11-23 2024-02-06 广东堡塔安全技术有限公司 Cloud computing-based server security monitoring system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100102982A1 (en) * 2008-10-28 2010-04-29 Open Systems International, Inc. Holistic alarm monitoring
CN102096461A (en) * 2011-01-13 2011-06-15 浙江大学 Energy-saving method of cloud data center based on virtual machine migration and load perception integration
CN104378262A (en) * 2013-12-13 2015-02-25 国家计算机网络与信息安全管理中心 Intelligent monitoring analyzing method and system under cloud computing
CN104410535A (en) * 2014-12-23 2015-03-11 浪潮电子信息产业股份有限公司 Intelligent cloud resource monitoring and warning method
CN105260235A (en) * 2015-09-23 2016-01-20 浪潮集团有限公司 Method and device for scheduling resources on basis of application scenarios in cloud platform

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100102982A1 (en) * 2008-10-28 2010-04-29 Open Systems International, Inc. Holistic alarm monitoring
CN102096461A (en) * 2011-01-13 2011-06-15 浙江大学 Energy-saving method of cloud data center based on virtual machine migration and load perception integration
CN104378262A (en) * 2013-12-13 2015-02-25 国家计算机网络与信息安全管理中心 Intelligent monitoring analyzing method and system under cloud computing
CN104410535A (en) * 2014-12-23 2015-03-11 浪潮电子信息产业股份有限公司 Intelligent cloud resource monitoring and warning method
CN105260235A (en) * 2015-09-23 2016-01-20 浪潮集团有限公司 Method and device for scheduling resources on basis of application scenarios in cloud platform

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111884878A (en) * 2020-07-24 2020-11-03 樊馨 Data monitoring method based on block chain
CN112291112A (en) * 2020-10-27 2021-01-29 中国光大银行股份有限公司 Cloud computing service monitoring system and method
CN112291112B (en) * 2020-10-27 2022-11-15 中国光大银行股份有限公司 Cloud computing service monitoring system and method
CN117527523A (en) * 2023-11-23 2024-02-06 广东堡塔安全技术有限公司 Cloud computing-based server security monitoring system

Similar Documents

Publication Publication Date Title
JP5584780B2 (en) Data collection method, data collection apparatus, and network management device
CN107894944A (en) A kind of intelligent control method and system based under big data and cloud calculation service
CN100549982C (en) Memory management apparatus, method and system based on the dynamic property monitoring
CN104239183B (en) The method and device of monitoring system performance
CN106844161A (en) Abnormal monitoring and Forecasting Methodology and system in a kind of carrier state stream calculation system
CN107528870B (en) A kind of collecting method and its equipment
CN109491850A (en) A kind of disk failure prediction technique and device
US20130226922A1 (en) Identification of Complementary Data Objects
CN103684916A (en) Method and system for intelligent monitoring and analyzing under cloud computing
CN103384206A (en) Concurrent processing method and system for mass data
CN102081622A (en) Method and device for evaluating system health degree
CN104965861A (en) Monitoring device for data access
CN107786616A (en) Main frame intelligent monitor system based on high in the clouds
CN104572401A (en) Alarming method and alarming system
CN103914363B (en) A kind of internal memory monitoring method and relevant apparatus
US10089236B2 (en) Apparatus and method of performing agentless remote IO catching analysis, prediction, automation, and recommendation in a computer environment
US20220300345A1 (en) Entity and Method Performed therein for Handling Computational Resources
CN109446017A (en) A kind of alarm algorithm generation method, monitoring system and terminal device
CN104477776B (en) The crane remote hierarchical monitoring system of based role
CN104375954A (en) Implementing dynamic cache enabling and disabling based upon workload
CN113297046A (en) Early warning method and device for memory fault
CN109800130A (en) A kind of apparatus monitoring method, device, equipment and medium
CN110515938B (en) Data aggregation storage method, equipment and storage medium based on KAFKA message bus
CN104702654A (en) Video cloud storage system based storage and extraction performance balancing method and device
CN107277143A (en) A kind of resource matched management method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180410

RJ01 Rejection of invention patent application after publication