CN106027328B - Cluster monitoring method and system based on application container deployment - Google Patents

Cluster monitoring method and system based on application container deployment Download PDF

Info

Publication number
CN106027328B
CN106027328B CN201610321338.6A CN201610321338A CN106027328B CN 106027328 B CN106027328 B CN 106027328B CN 201610321338 A CN201610321338 A CN 201610321338A CN 106027328 B CN106027328 B CN 106027328B
Authority
CN
China
Prior art keywords
monitoring
host machine
monitored object
host
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610321338.6A
Other languages
Chinese (zh)
Other versions
CN106027328A (en
Inventor
包琼林
刘祥涛
姚子建
赵彦晖
孙淏添
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cubic Information Technology Co Ltd Is Moistened In Shenzhen
Original Assignee
Cubic Information Technology Co Ltd Is Moistened In Shenzhen
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cubic Information Technology Co Ltd Is Moistened In Shenzhen filed Critical Cubic Information Technology Co Ltd Is Moistened In Shenzhen
Priority to CN201610321338.6A priority Critical patent/CN106027328B/en
Publication of CN106027328A publication Critical patent/CN106027328A/en
Application granted granted Critical
Publication of CN106027328B publication Critical patent/CN106027328B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/12Network monitoring probes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • H04L43/067Generation of reports using time frame reporting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Testing And Monitoring For Control Systems (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

A cluster monitoring method and system based on application container deployment are provided, the method is applied to a host machine in an application management system, the host machine comprises a monitoring module, and the method comprises the following steps: the host machine acquires the monitoring state of a monitoring object through a monitoring module, wherein the monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine; a monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain health parameters of a monitored object; the host synchronizes the monitoring state and the health parameters of the monitored objects to the main control node, so that the main control node generates alarm information corresponding to the target monitored object when determining that the health parameters of the target monitored object in the monitored objects are lower than a first threshold value, and sends the alarm information to the cloud control platform.

Description

Cluster monitoring method and system based on application container deployment
Technical Field
The invention relates to the technical field of cloud, in particular to a cluster monitoring method and system based on application container deployment.
Background
When applications are deployed based on application containers, each application may correspond to a large number of application containers, and under the influence of a container scheduler, a host for installing the application containers is randomly selected by the scheduler, that is, the location and network address of each application container are randomly generated. Therefore, when monitoring the application containers deployed in the cluster, it is necessary to collect and summarize the monitoring information of each application container, and then analyze the monitoring information. Particularly, when a large number of application containers are deployed in a cluster, the whole process is complicated, and a larger workload is brought to a master host or a server used for dispatching cluster resources, so that the workload of the master host or the server is larger, and the purpose of load balancing cannot be achieved on the premise of not changing the existing system architecture and increasing the equipment cost.
In order to realize load balancing, after deployment of the application container is completed, a monitoring agent end needs to be additionally installed on each host, service configuration and service debugging are carried out on the basis of open source monitoring codes, then the monitoring agent end collects monitoring information and sends the monitoring information of the hosts to the main control node at regular time, so that the main control node can collect and analyze the monitoring information and judge whether alarm is needed or not. However, the current monitoring agent end mainly monitors the host machine, and does not monitor with finer granularity, so that the monitoring state of each monitored object cannot be targeted, and the monitored object with abnormal monitoring is inconvenient to maintain. And secondly, the later installation monitoring agent end increases the workload of application deployment personnel.
Disclosure of Invention
The invention provides a cluster monitoring method and system based on application container deployment, which can solve the problem that fine-grained cluster monitoring cannot be performed in the prior art.
A first aspect provides a cluster monitoring method based on application container deployment, where the method is applied to a host in an application management system, where the application management system includes a cloud control platform, a master control node, and a cluster, the cluster includes at least one host, and the host includes a monitoring module, and the method includes:
the host machine acquires a monitoring state of a monitoring object through the monitoring module, wherein the monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine;
the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain health parameters of the monitored object;
the host synchronizes the monitoring state and the health parameters of the monitored objects to the main control node, so that the main control node generates alarm information corresponding to a target monitored object when determining that the health parameters of the target monitored object in the monitored objects are lower than a first threshold value, and sends the alarm information to the cloud control platform.
In some possible designs, the host further includes a monitoring object recognition module, and before the host obtains the monitoring state of the monitoring object, the method further includes:
the host machine obtains at least one attribute information of the target application container, the attribute information of the target system service assembly, the attribute information of the target application and the attribute information of the host machine through the monitoring object identification module.
In some possible designs, the host obtains the monitoring state of the monitored object through the monitoring module, where the monitoring state includes at least one of:
the host machine acquires the monitoring state of the host machine according to the attribute information of the target application container;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target system service assembly;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
and the host machine acquires the monitoring state of the target system service assembly according to the attribute information of the host machine.
In some possible designs, the health analysis of the monitoring state by the monitoring module in the host according to a preset health analysis rule to obtain the health parameter of the monitored object at least includes one of the following items:
the host machine reduces the health parameter of the monitored object with the processor utilization rate higher than the second threshold value and lower than the third threshold value within the first time period by M1
Or, the host machine reduces the health parameter of the monitored object with the processor utilization rate of the monitored object higher than the third threshold value within the first time period by M2
Or, the host pair is within a second durationAnd reducing the health parameter of the monitored object with the memory usage rate of the monitored object higher than the fourth threshold and lower than the fifth threshold by M3
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fifth threshold value within the second time length by M4
Or, the host machine reduces the health parameters of the monitored object of which the hard disk utilization rate is higher than the sixth threshold and lower than the seventh threshold within the third time length by M5
Or, the host machine reduces M for the health parameters of the monitored object of which the hard disk utilization rate is higher than the seventh threshold value in the third time length6
Or, the host machine reduces the health parameter of the monitored object with the network bandwidth occupation ratio higher than the eighth threshold and lower than the ninth threshold within the fourth time length by M7
Or, the host machine reduces the network bandwidth occupation of the monitored object by M compared with the health parameter of the monitored object with the network bandwidth occupation higher than the ninth threshold value within the fourth time length8
Or, the host sets the health parameter of the monitoring object with abnormal operation as 0;
M1、M2、M3、M4、M5、M6、M7、M8、M9are all positive integers.
A second aspect provides a cluster monitoring method based on application container deployment, where the method is applied to a master control node in an application management system, where the application management system includes a cloud control platform, the master control node, and a cluster, where the cluster includes at least one host, and the host includes a monitoring module, and the method includes:
the master control node acquires a monitoring state of a monitoring object and health parameters of the monitoring object from the host, wherein the monitoring object comprises at least one of a target application container running on the host, a target system service component running on the host, a target application running on the host and the host; the health parameters are obtained by the monitoring module in the host machine through health analysis on the monitoring state according to preset health analysis rules;
when the master control node determines that the health parameter of a target monitoring object in the monitoring objects is lower than a first threshold value, alarm information corresponding to the target monitoring object is generated, and the alarm information is sent to the cloud control platform.
In some possible designs, when determining that a health parameter of a target monitoring object in the monitoring objects is lower than a first threshold, the master control node generates alarm information corresponding to the target monitoring object, including:
and the master control node determines the alarm level of the target monitored object according to the health parameter of the monitored object, performs level identification on the alarm information according to the alarm level, and generates alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time length.
In some possible designs, if the alarm level is a first level, the alarm information is information for indicating that the target monitored object is to be early warned;
if the alarm level is the second level, the alarm information is information for indicating to alarm the target monitoring object;
and if the alarm level is a third level, the alarm information is information for indicating that the target monitoring object is alarmed.
In some possible designs, the method further comprises:
the master control node sends the monitoring state of the host machine and the health parameters of the monitored object to the cloud control platform through an inquiry port, so that the cloud control platform graphically displays the monitoring state of the host machine and the health parameters of the monitored object according to the health parameters of the monitored object.
The third aspect of the present invention provides a cluster monitoring system based on application container deployment, which has the function of implementing the above cluster monitoring method based on application container deployment. The functions can be realized by hardware, and the functions can also be realized by executing corresponding software by hardware. The hardware or software includes one or more modules corresponding to the above functions, which may be software and/or hardware.
In one possible design, the cluster monitoring system deployed based on the application container comprises a cluster and a master control node, wherein the cluster comprises at least one host machine, and the host machine comprises a monitoring module;
the host machine acquires a monitoring state of a monitoring object through the monitoring module, wherein the monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine;
the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain health parameters of the monitored object;
the host synchronizes the monitoring state and the health parameters of the monitored object to the master control node;
the method comprises the steps that a main control node obtains a monitoring state of a monitored object and health parameters of the monitored object from a host, generates alarm information corresponding to a target monitored object when the health parameters of the target monitored object in the monitored object are lower than a first threshold value, and sends the alarm information to a cloud control platform.
In some possible designs, the host further includes a monitoring object recognition module, and before the host acquires the monitoring state of the monitoring object, the host acquires at least one of attribute information of the target application container, attribute information of the target system service component, attribute information of the target application, and attribute information of the host through the monitoring object recognition module.
In some possible designs, the host obtains the monitoring state of the monitored object through the monitoring module, where the monitoring state includes at least one of:
the host machine acquires the monitoring state of the host machine according to the attribute information of the target application container;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target system service assembly;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
and the host machine acquires the monitoring state of the target system service assembly according to the attribute information of the host machine.
In some possible designs, the health analysis of the monitoring state by the monitoring module in the host according to a preset health analysis rule to obtain the health parameter of the monitored object at least includes one of the following items:
the host machine reduces the health parameter of the monitored object with the processor utilization rate higher than the second threshold value and lower than the third threshold value within the first time period by M1
Or, the host machine reduces the health parameter of the monitored object with the processor utilization rate of the monitored object higher than the third threshold value within the first time period by M2
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fourth threshold and lower than the fifth threshold within the second time length by M3
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fifth threshold value within the second time length by M4
Or, the host machine reduces the health parameters of the monitored object of which the hard disk utilization rate is higher than the sixth threshold and lower than the seventh threshold within the third time length by M5
Or, the host machine reduces M for the health parameters of the monitored object of which the hard disk utilization rate is higher than the seventh threshold value in the third time length6
Or, the host machine reduces the health parameter of the monitored object with the network bandwidth occupation ratio higher than the eighth threshold and lower than the ninth threshold within the fourth time length by M7
Or, the host machine reduces the network bandwidth occupation of the monitored object by M compared with the health parameter of the monitored object with the network bandwidth occupation higher than the ninth threshold value within the fourth time length8
Or, the host sets the health parameter of the monitoring object with abnormal operation as 0;
M1、M2、M3、M4、M5、M6、M7、M8、M9are all positive integers.
In some possible designs, the master node is specifically configured to:
and determining the alarm level of the target monitored object according to the health parameter of the monitored object, performing level identification on the alarm information according to the alarm level, and generating alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time.
Compared with the prior art, in the scheme provided by the invention, the monitoring state of at least one monitoring object in the target application container, the target system service component, the target application and the host is obtained through the monitoring module deployed based on the application container, so that the monitoring of the application container level, the application level and the host level is realized, and the monitoring state with finer granularity is provided. And the health analysis is carried out on the monitoring state according to the preset health analysis rule, so that more detailed and more accurate monitoring state and health parameters can be provided for the main control node.
Drawings
Fig. 1 is a schematic flowchart of a cluster monitoring method based on application container deployment in this embodiment;
fig. 2 is another schematic flow chart of a cluster monitoring method based on application container deployment in this embodiment;
fig. 3 is another schematic flowchart of a cluster monitoring method based on application container deployment in this embodiment;
FIG. 4 is a schematic illustration of a monitoring status and health parameter in the present embodiment;
FIG. 5 is another schematic illustration of the monitoring of status and health parameters in the present embodiment;
fig. 6 is a schematic flowchart of a cluster monitoring method based on application container deployment in this embodiment;
fig. 7 is a schematic flowchart of the cluster monitoring system deployed based on the application container in this embodiment.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, rather than all embodiments, and all other embodiments obtained by those skilled in the art without any creative work based on the embodiments of the present invention belong to the protection scope of the present invention.
The terms "first," "second," and the like in the description and in the claims, and in the drawings described above, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprise," "include," and "have," and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, article, or apparatus that comprises a list of steps or modules is not necessarily limited to those steps or modules expressly listed, but may include other steps or modules not expressly listed or inherent to such process, method, article, or apparatus, the division of modules herein shown is merely a logical division and may be implemented in a practical application in a different manner, such that multiple modules may be combined or integrated into another system or certain features may be omitted or not implemented, and such that mutual or direct coupling or communicative coupling between the modules shown or discussed may be through interfaces, and indirect coupling or communicative coupling between the modules may be electrical or other similar, are not intended to be limiting herein. Moreover, the modules or sub-modules described as separate components may or may not be physically separated, may or may not be physical modules, or may not be separated into multiple circuit modules, and some or all of the modules may be selected according to actual needs to achieve the purpose of the embodiment of the present invention.
The embodiment of the invention provides a cluster monitoring method and system based on application container deployment, which are used for solving the problem that fine-grained cluster monitoring cannot be performed in the prior art. The details will be described below.
The application management system herein includes: the system comprises a cloud control platform, at least one main control node and at least one cluster, wherein each cluster comprises at least one host.
The main control node is used for storing the monitoring state and the health parameter sent by each monitoring module, providing a monitoring state query port for the cloud control platform and realizing the pushing of the alarm message for the abnormal monitoring state, and mainly comprises the following functional nodes:
(1) and the monitoring state query port is used for receiving a request of the cloud control platform, reading the monitoring state from the time sequence database and returning the result to the cloud control platform.
(2) And the monitoring alarm module observes the change condition of the health score from the time sequence database in an observer mode, and sends an alarm message to the message queue when the health parameters of the host, the system service assembly, the application or the application container are lower than a specified threshold (the threshold of each layer is preset by the cloud control platform and is stored in a configuration file of the main control node).
(3) The message pushing module is used for observing messages in the message queue in real time, and when the message queue is observed to have alarm messages, the message pushing module acquires user information of a host, a system service component, an application or an application container from a public service information query interface of the cloud control platform and sends the messages to the user.
The main control node, at least one host and the cloud control platform can form a cluster monitoring system based on application container deployment, and the monitoring module installed on each host forms the whole container management background.
In order to solve the above technical problems, embodiments of the present invention mainly provide the following technical solutions:
1. when the container management platform layout is carried out, the host machine is preassembled with a host machine daemon, the host machine daemon comprises a monitoring module, namely the monitoring module and an application container are integrated and released, a monitoring main control service is installed on a main control node and used for summarizing the monitoring state of each monitoring object and the health parameters of the monitoring object, which are acquired by the monitoring module, and whether the monitoring object needs to be alarmed or not can be judged according to the health parameters of the monitoring object.
2. And the host machine operation monitoring module is used for monitoring the host machine, the application container, the system service assembly and the like operated on the host machine.
Through the technical scheme, fine-grained cluster monitoring can be achieved, and cluster monitoring and application container publishing are integrated, so that monitoring modules do not need to be installed on each host machine in the later period, and the workload of application deployment personnel is effectively reduced.
Referring to fig. 1, fig. 2 and fig. 3, the following illustrates an application container deployment-based cluster monitoring method, where the method is applied to a host in an application management system, where the application management system includes a cloud control platform, a master control node and a cluster, the cluster includes at least one host, the host includes a monitoring module, and the monitoring module is a module in a host daemon preinstalled when the host leaves a factory, so that, based on the preinstalled host daemon, an operation and maintenance worker may directly perform relevant monitoring configuration on the cloud control platform without additionally installing a third-party monitoring agent, and the monitoring described in the following embodiments can be implemented, where the embodiment of the invention includes:
101. and the host machine acquires the monitoring state of the monitored object through the monitoring module.
The monitoring module is configured with acquisition parameters, such as service name, health check path, service port, etc.
The monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine. Monitoring parameters pre-configured by a user on the cloud control platform are pushed to each host by the main control node, so that the monitoring module at least collects and analyzes monitoring states of the application container, the application, the system service assembly and the host by using the monitoring parameters.
The system service component includes a cache, an Access service (full name: Access Server), a Nginx Server, a private warehouse (full name: Registry), and the like, and the specific content is not limited.
The monitoring state of the monitored object is as follows:
a. for the host, the monitoring state includes processor (CPU) utilization, memory utilization, hard disk utilization, network input/output, disk space quota, network bandwidth ratio, host state, health parameter, and the like.
b. For the application container, the monitoring state includes the utilization rate of the CPU of the application container, the utilization rate of the memory of the container, the utilization rate of the disk space of the container, the network bandwidth ratio of the container, the health parameters of the application container, and the like.
c. For the system service assembly, the monitoring state comprises the running state of the system service assembly, the CPU utilization rate of a host machine to which the system service assembly belongs, the memory utilization rate of the host machine to which the system service assembly belongs, the disk space utilization rate of the host machine to which the system service assembly belongs, the network bandwidth occupation ratio of the host machine to which the system service assembly belongs, the health parameters of the system service assembly and the like.
d. For the application, the monitoring state comprises application CPU utilization rate, application memory utilization rate, application disk space utilization rate, application network bandwidth ratio, health parameters of the application and the like, and the monitoring state of the application is obtained by summarizing corresponding application containers and carrying out mean value calculation.
When monitoring the monitoring objects, the standard value of the monitoring parameter of each monitoring object is set according to conditions such as hardware capability or actual requirements, and the specific value is not limited herein.
Optionally, the host further includes a monitoring object identification module, before the host acquires the monitoring state of the monitoring object, the host acquires attribute information of each monitoring object through the monitoring object identification module, for example, acquires at least one of attribute information of the target application container, attribute information of the target system service component, and attribute information of the host, so that the host may position the corresponding monitoring object according to the attribute information of each monitoring object, for example, position the target application container according to the attribute information of the target application container, position the target system service component according to the attribute information of the target system service component, position the target application according to the attribute information of the target application, and the like. The attribute information of each monitored object may include information such as identification information of the monitored object, and monitoring parameter items of the monitored object, and the specific text is not limited as long as positioning of the monitored object and fine-grained monitoring can be achieved.
Further, the host machine obtains the monitoring state of the monitored object through the monitoring module, and the monitoring state at least comprises one of the following items:
the host machine acquires the monitoring state of the host machine according to the attribute information of the target application container;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target system service assembly;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
and the host machine acquires the monitoring state of the target system service assembly according to the attribute information of the host machine.
102. And the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain the health parameters of the monitored object.
The preset health analysis rule is configured by a user on the cloud control platform and stored in a configuration file of the main control node, so that a monitoring module in the host machine analyzes and calculates the acquired monitoring state by using the preset health analysis rule.
The monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule, and the obtained health parameters of the monitored object at least comprise one of the following items:
the host machine reduces the health parameter of the monitored object with the processor utilization rate higher than the second threshold value and lower than the third threshold value within the first time period by M1
Or, the host machine reduces the health parameter of the monitored object with the processor utilization rate of the monitored object higher than the third threshold value within the first time period by M2
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fourth threshold and lower than the fifth threshold within the second time length by M3
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fifth threshold value within the second time length by M4
Or, the host machine reduces the health parameters of the monitored object of which the hard disk utilization rate is higher than the sixth threshold and lower than the seventh threshold within the third time length by M5
Or, the host machine reduces M for the health parameters of the monitored object of which the hard disk utilization rate is higher than the seventh threshold value in the third time length6
Or, the host machine reduces the health parameter of the monitored object with the network bandwidth occupation ratio higher than the eighth threshold and lower than the ninth threshold within the fourth time length by M7
Or, the host machine reduces the network bandwidth occupation of the monitored object by M compared with the health parameter of the monitored object with the network bandwidth occupation higher than the ninth threshold value within the fourth time length8
Or, the host sets the health parameter of the monitoring object with abnormal operation as 0;
wherein M is1、M2、M3、M4、M5、M6、M7、M8、M9Are all positive integers.
For example, when the monitored object is a host, the rules for analyzing and calculating the health parameters of the host are shown in table 1 below:
Figure BDA0000989627640000111
Figure BDA0000989627640000121
TABLE 1
For example, when the monitoring object is an application container, the rules for analyzing and calculating the health parameters of the application container are shown in table 2 below:
Figure BDA0000989627640000122
TABLE 2
The same rules for analyzing and calculating the health parameters of the system service components are not described in detail.
103. And the host synchronizes the monitoring state and the health parameters of the monitored object to the main control node.
After the analysis and calculation of the monitoring state are completed, the host can acquire the monitoring state of each monitored object and send the health parameters corresponding to the monitored objects to the master control node, so that the master control node can summarize the monitoring state of the monitored objects and the health parameters of the monitored objects and store the summarized information in the time sequence database.
It is understood that the monitoring state of the object is monitored. And the health parameters of the monitored object can fluctuate, so the host can synchronize the monitoring state obtained by real-time dynamic monitoring and the health parameters obtained by real-time dynamic analysis and calculation to the master control node, and the master control node can dynamically update the monitoring state of the monitored object and the health parameters of the monitored object.
104. When the master control node determines that the health parameter of a target monitoring object in the monitoring objects is lower than a first threshold value, alarm information corresponding to the target monitoring object is generated, and the alarm information is sent to the cloud control platform.
The alarm information includes a cluster name to which the host belongs, a host name or network address, a service name, an application name, a monitoring state, health parameters, and the like.
Specifically, after acquiring the monitoring state and the health parameter of each monitored object from each host, the master control node stores the monitoring state and the health parameter of each monitored object into the time sequence database, reads the corresponding monitoring state and the health parameter of each monitored object from the time sequence database after receiving an inquiry request of the cloud control platform through the inquiry port, and then returns the summarized monitoring state and health parameter to the cloud control platform through the inquiry port.
The monitoring alarm module in the main control node observes the change condition of the health parameters of each monitored object from the time sequence database in an observer mode, and sends alarm information to the message queue of the cache message when the health parameters of the host, the system service assembly, the application or the application container are lower than a specified threshold (the alarm threshold is set by the cloud control platform and is stored in the configuration file of the main control node). It is understood that the alarm threshold set for each health parameter of the monitored object may be the same, and may be different, depending on the actual application requirements or hardware capabilities, and the like, and is not limited herein. And then, the message pushing module in the master control node inquires the user to which the host or the application belongs according to the host or the application service name of the message, finds out the binding information such as the mobile phone or the micro-signal and the like, and respectively pushes the alarm information to the cloud control platform and the mobile phone short message or the micro-message according to the system configuration rule.
Optionally, the master control node may alarm when the health parameter at a certain time is lower than the first threshold, or may observe the health parameter of the monitored object in the time sequence database for a period of time, and determine whether to alarm if the average value is obtained, specifically as follows:
and the master control node determines the alarm level of the target monitored object according to the health parameter of the monitored object, performs level identification on the alarm information according to the alarm level, and generates alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time length.
Optionally, the alarm level is determined according to the size of the health parameter, and different alarm information may be generated according to the alarm level, which is specifically as follows:
if the alarm level is a first level, the alarm information is information for indicating that the target monitoring object is early-warned;
if the alarm level is the second level, the alarm information is information for indicating to alarm the target monitoring object;
and if the alarm level is a third level, the alarm information is information for indicating that the target monitoring object is alarmed.
As shown in fig. 4 and 5, after the master control node sends the monitoring state of the host and the health parameters of the monitoring object to the cloud control platform through the query port, the cloud control platform can graphically display the monitoring state of the host and the health parameters of the monitoring object according to the health parameters of the monitoring object, so that the running state of each monitoring object is clear at a glance, and a manager can know the real-time running state of each monitoring object at any time. The specific presentation is not limiting herein.
In addition, as shown in fig. 6, after receiving the alarm information sent by the main control node, the cloud control platform may display the alarm information in a user browser in a manner of pop-up window, voice prompt, and the like, for example, for the serious alarm information (third level), the cloud control platform may further perform message assembly on the alarm information in a manner of short message of a mobile phone, then send the alarm information to a mobile phone bound by the user through a short message gateway, may further perform message assembly on the alarm information in a manner of short message, and then push the alarm information to a short message bound by the user through a public number of the short message, so that the user bound to the mobile phone or the public number of the short message can timely and specifically process and repair the component with abnormal service, thereby improving the monitoring efficiency. The specific manner of pushing the alarm information is not limited herein.
In the embodiment of the invention, the monitoring state of at least one monitoring object in the target application container, the target system service component, the target application and the host is obtained by the monitoring module based on the application container deployment, so that the monitoring of the application container level, the application level and the host level is realized, and the monitoring state with finer granularity is provided. And the health analysis is carried out on the monitoring state according to the preset health analysis rule, so that more detailed and more accurate monitoring state and health parameters can be provided for the main control node.
In the above description, a cluster monitoring method based on application container deployment in the present invention is described, and a cluster monitoring system 70 for executing the above cluster monitoring method based on application container deployment is described below.
Referring to fig. 7, a cluster monitoring system 70 based on application container deployment is described, where the cluster monitoring system 70 based on application container deployment includes:
a cluster and a master control node 702, where the cluster includes at least one host 701, the host 701 includes a monitoring module 7011, and the monitoring module 7011 is a module that is installed integrally with an application container when the application container is deployed on the host 701;
the host 701 obtains a monitoring state of a monitoring object through the monitoring module 7011, where the monitoring object includes at least one of a target application container running on the host 701, a target system service component running on the host 701, a target application running on the host 701, and the host 701;
the monitoring module 7011 in the host 701 performs health analysis on the monitoring state according to a preset health analysis rule to obtain a health parameter of the monitored object;
the host 701 synchronizes the monitoring state and the health parameters of the monitored object to the master control node;
the master control node 702 acquires a monitoring state of a monitoring object and health parameters of the monitoring object from the host 701, generates alarm information corresponding to a target monitoring object when determining that the health parameters of the target monitoring object in the monitoring object are lower than a first threshold value, and sends the alarm information to the cloud control platform.
In the embodiment of the present invention, the monitoring module 7011 deployed based on the application container obtains the monitoring state of at least one monitoring object in the target application container, the target system service component, and the host 701, so as to implement monitoring at the application container level, the application level, and the host level, and provide a finer-grained monitoring state. And the health analysis is carried out on the monitoring state according to the preset health analysis rule, so that more detailed and more accurate monitoring state and health parameters can be provided for the main control node.
Optionally, in some embodiments of the present invention, the host 701 further includes a monitoring object recognition module 7012, and before the host 701 obtains the monitoring state of the monitoring object, the host 701 obtains at least one attribute information of the target application container, the attribute information of the target system service component, the attribute information of the target application, and the attribute information of the host 701 through the monitoring object recognition module 7012.
Optionally, in some embodiments of the present invention, the host 701 obtains the monitoring state of the monitored object through the monitoring module 7011, where the obtaining includes at least one of:
the host 701 acquires the monitoring state of the host 701 according to the attribute information of the target application container;
or the like, or, alternatively,
the host 701 acquires the monitoring state of the target application container according to the attribute information of the target system service component;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
the host 701 acquires the monitoring state of the target system service component according to the attribute information of the host.
Optionally, in some invention embodiments, the monitoring module 7011 in the host 701 performs health analysis on the monitoring state according to a preset health analysis rule, and the obtaining of the health parameter of the monitored object at least includes one of the following items:
the host machine 701 reduces the health parameter of the monitored object of which the processor utilization rate is higher than a second threshold and lower than a third threshold within a first time period by M1
Or, the host 701 decreases M for the health parameter of the monitored object whose processor usage rate of the monitored object is higher than the third threshold value in the first time period2
Or, the host 701 reduces M for the health parameters of the monitoring object whose memory usage rate is higher than the fourth threshold and lower than the fifth threshold within the second time period3
Or, the host 701 reduces M to the health parameter of the monitoring object whose memory usage rate is higher than the fifth threshold in the second time period4
Or, the host 701 reduces M for the health parameters of the monitored object whose hard disk usage rate is higher than the sixth threshold and lower than the seventh threshold within the third time period5
Or, the host 701 reduces M to the health parameter of the monitoring object whose hard disk usage rate is higher than the seventh threshold in the third time period6
Or, the host 701 reduces M for the health parameter of the monitoring object whose network bandwidth occupation ratio is higher than the eighth threshold and lower than the ninth threshold within the fourth time period7
Or, the host 701 decreases the network bandwidth occupation ratio of the monitoring object by M from the health parameter of the monitoring object with the network bandwidth occupation ratio higher than the ninth threshold within the fourth time period8
Or, the host 701 sets the health parameter of the monitoring object with abnormal operation to 0;
M1、M2、M3、M4、M5、M6、M7、M8、M9are all positive integers.
Optionally, in some embodiments of the present invention, the main control node 702 is specifically configured to:
and determining the alarm level of the target monitored object according to the health parameter of the monitored object, performing level identification on the alarm information according to the alarm level, and generating alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time.
The specific whole monitoring and alarming process is shown in fig. 6, which is not described herein.
In the above description, each embodiment is described with emphasis, and reference may be made to related descriptions of other embodiments for parts that are not described in detail in a certain embodiment.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The technical solutions provided by the present invention are described in detail above, and the principle and the implementation of the present invention are explained in this document by applying specific examples, and the descriptions of the above examples are only used to help understanding the method and the core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (13)

1. A cluster monitoring method based on application container deployment is characterized in that the method is applied to hosts in an application management system, the application management system comprises a cloud control platform, a master control node and a cluster, the cluster comprises at least one host, the host comprises a monitoring module and a monitoring object identification module, and the method comprises the following steps:
the host machine acquires the attribute information of the monitored object through the monitored object identification module;
the host machine positions the monitoring object through the monitoring module according to the attribute information of the monitoring object and acquires the monitoring state of the monitoring object, wherein the monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine;
the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain health parameters of the monitored object;
the host synchronizes the monitoring state and the health parameters of the monitored objects to the main control node, so that the main control node generates alarm information corresponding to a target monitored object when determining that the health parameters of the target monitored object in the monitored objects are lower than a first threshold value, and sends the alarm information to the cloud control platform.
2. The method according to claim 1, wherein the host obtains attribute information of the monitored object through the monitored object identification module, and the method comprises:
the host machine obtains at least one attribute information of the target application container, the attribute information of the target system service assembly, the attribute information of the target application and the attribute information of the host machine through the monitoring object identification module.
3. The method according to claim 2, wherein the host locates the monitoring object according to the attribute information of the monitoring object through the monitoring module, and obtains the monitoring state of the monitoring object, and the method comprises at least one of the following steps:
the host machine acquires the monitoring state of the host machine according to the attribute information of the target application container;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target system service assembly;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
and the host machine acquires the monitoring state of the target system service assembly according to the attribute information of the host machine.
4. The method according to claim 3, wherein the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule, and obtaining the health parameter of the monitored object at least comprises one of the following items:
the host machineReducing the health parameter of the monitored subject with the processor utilization rate higher than the second threshold value and lower than the third threshold value within the first time period by M1
Or, the host machine reduces the health parameter of the monitored object with the processor utilization rate of the monitored object higher than the third threshold value within the first time period by M2
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fourth threshold and lower than the fifth threshold within the second time length by M3
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fifth threshold value within the second time length by M4
Or, the host machine reduces the health parameters of the monitored object of which the hard disk utilization rate is higher than the sixth threshold and lower than the seventh threshold within the third time length by M5
Or, the host machine reduces M for the health parameters of the monitored object of which the hard disk utilization rate is higher than the seventh threshold value in the third time length6
Or, the host machine reduces the health parameter of the monitored object with the network bandwidth occupation ratio higher than the eighth threshold and lower than the ninth threshold within the fourth time length by M7
Or, the host machine reduces the network bandwidth occupation of the monitored object by M compared with the health parameter of the monitored object with the network bandwidth occupation higher than the ninth threshold value within the fourth time length8
Or, the host sets the health parameter of the monitoring object with abnormal operation as 0;
M1、M2、M3、M4、M5、M6、M7、M8、M9are all positive integers.
5. A cluster monitoring method based on application container deployment is characterized in that the method is applied to a main control node in an application management system, the application management system comprises a cloud control platform, the main control node and a cluster, the cluster comprises at least one host, the host comprises a monitoring module, and the method comprises the following steps:
the master control node acquires a monitoring state of a monitoring object and health parameters of the monitoring object from the host, wherein the monitoring object comprises at least one of a target application container running on the host, a target system service component running on the host, a target application running on the host and the host; the health parameters are obtained by the monitoring module in the host machine through health analysis on the monitoring state according to preset health analysis rules, and the monitoring state is obtained by the monitoring module after the monitoring module locates the monitoring object according to the attribute of the monitoring object obtained by the monitoring object identification module in the host machine;
when the master control node determines that the health parameter of a target monitoring object in the monitoring objects is lower than a first threshold value, alarm information corresponding to the target monitoring object is generated, and the alarm information is sent to the cloud control platform.
6. The method according to claim 5, wherein when determining that the health parameter of a target monitoring object in the monitoring objects is lower than a first threshold, the master node generates alarm information corresponding to the target monitoring object, including:
and the master control node determines the alarm level of the target monitored object according to the health parameter of the monitored object, performs level identification on the alarm information according to the alarm level, and generates alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time length.
7. The method according to claim 6, wherein if the alarm level is a first level, the alarm information is information indicating that the target monitored object is early-warned;
if the alarm level is the second level, the alarm information is information for indicating to alarm the target monitoring object;
and if the alarm level is a third level, the alarm information is information for indicating that the target monitoring object is alarmed.
8. The method of claim 5 or 6, further comprising:
the master control node sends the monitoring state of the host machine and the health parameters of the monitored object to the cloud control platform through an inquiry port, so that the cloud control platform graphically displays the monitoring state of the host machine and the health parameters of the monitored object according to the health parameters of the monitored object.
9. A cluster monitoring system based on application container deployment is characterized by comprising a cluster and a main control node, wherein the cluster comprises at least one host machine, and the host machine comprises a monitoring module and a monitoring object identification module;
the host machine acquires the attribute information of the monitored object through the monitored object identification module;
the host machine positions the monitoring object through the monitoring module according to the attribute information of the monitoring object and acquires the monitoring state of the monitoring object, wherein the monitoring object comprises at least one of a target application container running on the host machine, a target system service component running on the host machine, a target application running on the host machine and the host machine;
the monitoring module in the host machine performs health analysis on the monitoring state according to a preset health analysis rule to obtain health parameters of the monitored object;
the host synchronizes the monitoring state and the health parameters of the monitored object to the master control node;
the method comprises the steps that a main control node obtains a monitoring state of a monitored object and health parameters of the monitored object from a host, generates alarm information corresponding to a target monitored object when the health parameters of the target monitored object in the monitored object are lower than a first threshold value, and sends the alarm information to a cloud control platform.
10. The cluster monitoring system according to claim 9, wherein the host obtains attribute information of the monitored object through the monitored object identification module, and the attribute information includes:
the host machine obtains at least one attribute information of the target application container, the attribute information of the target system service assembly, the attribute information of the target application and the attribute information of the host machine through the monitoring object identification module.
11. The cluster monitoring system according to claim 10, wherein the host locates the monitoring object according to the attribute information of the monitoring object through the monitoring module, and obtains the monitoring state of the monitoring object, and the system comprises at least one of the following:
the host machine acquires the monitoring state of the host machine according to the attribute information of the target application container;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target system service assembly;
or the like, or, alternatively,
the host machine acquires the monitoring state of the target application container according to the attribute information of the target application;
or the like, or, alternatively,
and the host machine acquires the monitoring state of the target system service assembly according to the attribute information of the host machine.
12. The cluster monitoring system according to claim 11, wherein the monitoring module in the host performs health analysis on the monitoring state according to a preset health analysis rule, and obtaining the health parameter of the monitored object at least includes one of:
the host machine reduces the health parameter of the monitored object with the processor utilization rate higher than the second threshold value and lower than the third threshold value within the first time period by M1
Or, the host machine reduces the health parameter of the monitored object with the processor utilization rate of the monitored object higher than the third threshold value within the first time period by M2
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fourth threshold and lower than the fifth threshold within the second time length by M3
Or, the host machine reduces the health parameter of the monitored object of which the memory usage rate is higher than the fifth threshold value within the second time length by M4
Or, the host machine reduces the health parameters of the monitored object of which the hard disk utilization rate is higher than the sixth threshold and lower than the seventh threshold within the third time length by M5
Or, the host machine reduces M for the health parameters of the monitored object of which the hard disk utilization rate is higher than the seventh threshold value in the third time length6
Or, the host machine reduces the health parameter of the monitored object with the network bandwidth occupation ratio higher than the eighth threshold and lower than the ninth threshold within the fourth time length by M7
Or, the host machine reduces the network bandwidth occupation of the monitored object by M compared with the health parameter of the monitored object with the network bandwidth occupation higher than the ninth threshold value within the fourth time length8
Or, the host sets the health parameter of the monitoring object with abnormal operation as 0;
M1、M2、M3、M4、M5、M6、M7、M8、M9are all positive integers.
13. The cluster monitoring system of claim 12, wherein the master node is specifically configured to:
and determining the alarm level of the target monitored object according to the health parameter of the monitored object, performing level identification on the alarm information according to the alarm level, and generating alarm information corresponding to the target monitored object if the average health parameter of the monitored object is lower than the first threshold value within a preset time.
CN201610321338.6A 2016-05-13 2016-05-13 Cluster monitoring method and system based on application container deployment Active CN106027328B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610321338.6A CN106027328B (en) 2016-05-13 2016-05-13 Cluster monitoring method and system based on application container deployment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610321338.6A CN106027328B (en) 2016-05-13 2016-05-13 Cluster monitoring method and system based on application container deployment

Publications (2)

Publication Number Publication Date
CN106027328A CN106027328A (en) 2016-10-12
CN106027328B true CN106027328B (en) 2020-02-07

Family

ID=57097772

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610321338.6A Active CN106027328B (en) 2016-05-13 2016-05-13 Cluster monitoring method and system based on application container deployment

Country Status (1)

Country Link
CN (1) CN106027328B (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106899659B (en) * 2017-01-25 2020-05-05 广东神马搜索科技有限公司 Distributed system and management method and management device thereof
CN109495398A (en) * 2017-09-11 2019-03-19 中国移动通信集团浙江有限公司 A kind of resource regulating method and equipment of container cloud
CN109561121B (en) * 2017-09-26 2021-10-15 北京国双科技有限公司 Method and device for monitoring deployment
CN107659459A (en) * 2017-11-02 2018-02-02 郑州云海信息技术有限公司 Distributed type assemblies management software disaster tolerance management method, system, medium and equipment
CN107786392A (en) * 2017-11-13 2018-03-09 郑州云海信息技术有限公司 A kind of apparatus monitoring method and device
CN109960577A (en) * 2017-12-14 2019-07-02 中国电信股份有限公司 Cluster control method and server based on Nginx server
CN108156225B (en) * 2017-12-15 2021-02-05 南瑞集团有限公司 Micro-application monitoring system and method based on container cloud platform
CN109271172B (en) * 2018-09-17 2022-02-11 杭州安恒信息技术股份有限公司 Host performance expansion method and device of sweep cluster
CN110515793B (en) * 2019-07-23 2022-02-18 平安科技(深圳)有限公司 System performance monitoring method, device, equipment and storage medium
CN111277460B (en) * 2020-01-17 2022-02-25 江苏满运软件科技有限公司 ZooKeeper containerization control method and device, storage medium and electronic equipment
CN111786830B (en) * 2020-07-01 2023-06-13 浪潮云信息技术股份公司 Network quality simulation method in cloud computing environment
CN112131088B (en) * 2020-09-29 2024-04-09 北京计算机技术及应用研究所 High availability method based on health examination and container
CN112732412B (en) * 2021-04-06 2021-08-03 浙江华创视讯科技有限公司 Service configuration file processing method and device, storage medium and electronic equipment
CN113438136B (en) * 2021-08-27 2021-11-19 苏州浪潮智能科技有限公司 Application service monitoring method and device, electronic equipment and readable storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2775399A4 (en) * 2012-12-26 2015-04-29 Huawei Tech Co Ltd Resource management method of virtual machine system, virtual machine system, and apparatus
CN103986608B (en) * 2014-05-29 2018-05-04 浪潮电子信息产业股份有限公司 A kind of J2EE application virtualization management methods based on Anthem Linux application containers
CN104899126B (en) * 2015-06-12 2018-01-23 北京奇虎科技有限公司 The local method, apparatus and system monitored in real time is carried out to container in host
CN105119952B (en) * 2015-07-07 2018-12-21 北京京东尚科信息技术有限公司 The method and system of resource is distributed to automatic elastic under cloud platform
CN105024880A (en) * 2015-07-17 2015-11-04 哈尔滨工程大学 Elastic monitoring method for key task computer cluster
CN105118008A (en) * 2015-09-08 2015-12-02 中国航空无线电电子研究所 Channel-level health management method of integrated communication navigation and recognition system

Also Published As

Publication number Publication date
CN106027328A (en) 2016-10-12

Similar Documents

Publication Publication Date Title
CN106027328B (en) Cluster monitoring method and system based on application container deployment
CN108776934B (en) Distributed data calculation method and device, computer equipment and readable storage medium
US9280437B2 (en) Dynamically scalable real-time system monitoring
CN109672582B (en) Full path monitoring method, device, storage medium and device
US20170220663A1 (en) Log Event Summarization for Distributed Server System
US9369364B2 (en) System for analysing network traffic and a method thereof
WO2013186870A1 (en) Service monitoring system and service monitoring method
CN101707632A (en) Method for dynamically monitoring performance of server cluster and alarming real-timely
EP2800024A1 (en) System and methods for identifying applications in mobile networks
JP2022033685A (en) Method, apparatus, electronic device, computer readable storage medium and computer program for determining robustness
KR102136583B1 (en) Monitoring system and method of cloud environment
CN114095567A (en) Data access request processing method and device, computer equipment and medium
CN105471938B (en) Server load management method and device
CN112711512A (en) Prometheus monitoring method, device and equipment
CN104361517B (en) Financial system monitoring method, device and related system
US20170034019A1 (en) Application centric network experience monitoring
US10616081B2 (en) Application aware cluster monitoring
CN107168846A (en) The monitoring method and device of electronic equipment
JP6049136B2 (en) Network management system and method
KR102464688B1 (en) Method and apparatus for detrmining event level of monitoring result
CN103297480A (en) System and method for automatically detecting application service
US20210092159A1 (en) System for the prioritization and dynamic presentation of digital content
CN109766238B (en) Session number-based operation and maintenance platform performance monitoring method and device and related equipment
CN114756301A (en) Log processing method, device and system
US20160224990A1 (en) Customer health tracking system based on machine data and human data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant