CN111459763A - Cross-kubernets cluster monitoring system and method - Google Patents
Cross-kubernets cluster monitoring system and method Download PDFInfo
- Publication number
- CN111459763A CN111459763A CN202010258248.3A CN202010258248A CN111459763A CN 111459763 A CN111459763 A CN 111459763A CN 202010258248 A CN202010258248 A CN 202010258248A CN 111459763 A CN111459763 A CN 111459763A
- Authority
- CN
- China
- Prior art keywords
- component
- monitoring
- data
- cluster
- alcor
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 185
- 238000000034 method Methods 0.000 title claims abstract description 32
- 235000000332 black box Nutrition 0.000 claims abstract description 19
- 244000085682 black box Species 0.000 claims abstract description 19
- 238000013480 data collection Methods 0.000 claims abstract 4
- 238000004590 computer program Methods 0.000 claims description 16
- 238000003860 storage Methods 0.000 claims description 13
- 238000010586 diagram Methods 0.000 claims description 12
- 238000013500 data storage Methods 0.000 claims description 5
- 238000004364 calculation method Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000009434 installation Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- RPAJSBKBKSSMLJ-DFWYDOINSA-N (2s)-2-aminopentanedioic acid;hydrochloride Chemical compound Cl.OC(=O)[C@@H](N)CCC(O)=O RPAJSBKBKSSMLJ-DFWYDOINSA-N 0.000 description 1
- 241000380131 Ammophila arenaria Species 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/301—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is a virtual computing platform, e.g. logically partitioned systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3065—Monitoring arrangements determined by the means or processing involved in reporting the monitored data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3089—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
- G06F11/3093—Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3089—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
- G06F11/3096—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents wherein the means or processing minimize the use of computing system or of computing system component resources, e.g. non-intrusive monitoring which minimizes the probe effect: sniffing, intercepting, indirectly deriving the monitored data from other directly available data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/323—Visualisation of programs or trace data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/815—Virtual
Abstract
The invention provides a cross-kubernets cluster monitoring system and a method, which comprises the following steps: a plurality of open-sun Alcor clusters, proxeus-out and grafana-out components, the proxeus-out and grafana-out components being deployed outside the Alcor clusters; a promoter, an alert manager and a grafana monitoring component, a node-exporter, a process-exporter and a blackbox data acquisition component are installed in the Alcor cluster; the prometheus-out component synchronizes the monitoring data from the prometheus monitoring component; the grafana-out component exposes the monitoring data. The scheme solves the monitoring and data display of cross-cluster data collection.
Description
Technical Field
The invention relates to the technical field of kubernets cluster monitoring, in particular to a cross-kubernets cluster monitoring system and a cross-kubernets cluster monitoring method.
Background
Container technology is currently the hot door technology and is the leading technology. Since Docker's release, software deployment became easy, and truly realized that deployment was run everywhere. kubernets is an open-source Docker container cluster management system of a cross-host cluster, and provides a whole set of functions of resource scheduling, deployment and operation, service discovery, capacity expansion, capacity reduction and the like for containerized applications. The Kaiyang Alcor is a container cloud platform developed and packaged based on the original kubernets, massive applications and services are deployed in the container cloud platform, the situation that multiple clusters run simultaneously is inevitably formed as the application increases and the user demand complexity is improved, and the kubernets are monitored and data display is carried out through matching of promemeus, alert manager and grafana in the prior art. However, when multiple clusters occur, each cluster has a set of prometheus monitoring component, and no matter a system administrator or a tenant, it is inevitable to switch back and forth among the multiple clusters to observe complete monitoring data, and complexity of the Kaiyang Alcor in extracting the monitoring data is increased, so that the monitoring data in the multiple clusters are difficult to converge, and great challenges are brought to analysis and comparison of the monitoring data when faults occur in the later period. The prior art lacks a cross-cluster solution for monitoring, alarming and data displaying of massive applications and services on a plurality of clusters.
Disclosure of Invention
The embodiment of the invention provides a cross-kubernets cluster monitoring system and a cross-kubernets cluster monitoring method, which solve the technical problem that a cross-cluster solution for monitoring, alarming and displaying data of massive applications and services on a plurality of clusters is lacked in the prior art.
The embodiment of the invention provides a cross-kubernets cluster monitoring system, which comprises:
a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
The embodiment of the invention also provides a cross-kubernets cluster monitoring method, which comprises the following steps:
installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In the embodiment of the invention, a protemetheus monitoring component, an alert manager monitoring component and a grafana monitoring component are installed in each open-sun Alcor cluster, so that the acquisition pressure of monitoring information and the calculation pressure of monitoring items are separated from one set of protemetheus, the protemetheus in each open-sun Alcor cluster is only responsible for the data acquisition and monitoring item calculation of the cluster, and the pressure is controllable; deploying a proxy-out component and a grafana-out component outside a plurality of open-sun Alcor clusters, wherein the proxy-out is only responsible for synchronizing the monitoring data of all the clusters, and does not perform monitoring item calculation, so that the calculation pressure is reduced; the user can check the monitoring data of the containers deployed on different clusters through the grafana-out portal, and the user experience is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a structural diagram of a cross-kubernets cluster monitoring system according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Explanation of technical terms
Docker is an open source application container engine that allows developers to package their applications and dependencies into a portable container and then publish them on any popular L inux machine, as well as to implement virtualization.
kubernets, K8s for short, is an open-source application for managing containerization on multiple hosts in a cloud platform, and aims to make it simple and efficient to deploy containerized applications (powerfull), and provides a mechanism for application deployment, planning, updating, and maintenance.
Prometheus is a set of open source monitoring & alarm & time series database combinations. The monitoring data acquisition and storage device is used for acquiring and storing monitoring data of docker and kubernets.
grafana is an open source application written in go language, is mainly used for visualization display of large-scale index data, is the most popular time sequence data display tool in network architecture and application analysis, and supports most common time sequence databases at present. The monitoring data is used for graphically displaying the monitoring data stored by the prometheus.
The alert manager is an independent alarm module, receives alerts sent by clients such as Prometheus, and then processes the alerts by grouping, deleting duplicates, and the like, and sends the alerts to a correct receiver through a route; the alarm mode can be sent to different module responsible persons according to different rules. The alarm of Prometheus is divided into two parts. The alarm rules in the Prometheus server send alarms to Alertmanager. The alert manager then manages these alerts including methods of silence, suppression, aggregation and passage, such as email notification, call notification systems, and instant messaging platforms.
Apiserver is the bus and external API interface for open sun Alcor cluster message communication.
The Controller is the Controller of the open sun Alcor cluster state.
Scheduler is the Scheduler of the open sun Alcor cluster container service.
Coredns is an open Alcor cluster internal DNS resolution service.
Helm chart is a kubernets service template orchestration tool.
The main steps for setting up alerts and notifications are:
setting and configuring an Alertmanager;
configuring Prometheus and Alertmanager dialogue;
alarm rules are created in Prometheus.
Exporter: all programs that can provide monitoring sample data to Prometheus can be referred to as an Exporter.
The Node exporter is mainly used for exposing metrics to Prometeus, wherein the metrics comprises: cpu load, memory usage, network, etc.
Process-exporter is mainly used to expose metrics to Prometeus, where metrics includes: the state of the process running on the server.
Blackbox is primarily used to expose metrics to Prometheus, where metrics include: a server port status.
In an embodiment of the present invention, a method for monitoring a system across kubernets is provided, as shown in fig. 1, including:
a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
In an embodiment of the present invention, the prometheus monitoring component is specifically configured to:
acquiring monitoring data from an open anode Aclor cluster component (apiserver, controller, scheduler, codedns) and a cluster container Docker according to a set data acquisition rule, and acquiring monitoring data of an open anode Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component. Wherein Docker, kubernets are components of the open-market alcor cluster.
Presetting a monitoring data storage time limit, and storing the monitoring data according to the preset monitoring data storage time limit. Wherein the storage period of the monitoring data is 7 days;
and generating alarm information according to the monitoring data based on the set alarm item rule.
In an embodiment of the present invention, the grafana monitoring component and the grafana-out component are specifically configured to: and displaying the monitoring data through a preset data display diagram.
In an embodiment of the present invention, the prometheus-out component and the grafana-out component may be deployed 2 outside the plurality of open-sun Alcor clusters. Using a prometheus federation mechanism, the pometheus-out is configured to synchronize monitoring data from prometheus in all open-sun Alcor clusters for a preset monitoring data storage period (e.g., 30 days), and to add distinguishable tags to the data of each cluster. And configuring the grafana-out, and defining a data presentation general view applicable to all Kaiyang Alcor clusters.
Based on the same inventive concept, the embodiment of the present invention further provides a cross-kubernets cluster monitoring method, as described in the following embodiments.
Installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
In the embodiment of the invention, an implementation method of a cross-kubernets cluster monitoring system based on data acquisition in an open-sun Alcor environment is a cross-cluster monitoring and data display method of a large-scale container cloud platform, and specifically comprises the following installation steps:
writing a configuration file of a proxy and an alert manager of a proxy operator, and predefining a data acquisition rule and an alarm item rule so as to realize the installation of monitoring and data display components in each cluster in the form of the operator.
Writing a configuration file of the grafana, and predefining a data display diagram for later integration into an installer of the Kaiyang Alcor through palm charts.
Writing palm characters, integrating the installation of components of a promoter operator, a node-exporter, a process-exporter and a blackbox into an installation program of the open-sun Alcor, and automatically installing a cluster monitoring and data display component when the open-sun Alcor cluster is installed.
And IV, independently deploying 2 sets of proxy-out and grfana-out outside the cluster, modifying a configured proxy-out configuration file to form a federation with the proxy in all the clusters, acquiring monitoring data of the proxy in the cluster in real time, and uniformly displaying the data by the grfana-out.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In summary, the present invention provides a cross-kubernets cluster monitoring system and method, which relates to a computer technology, a docker container technology, a kubernets large-scale container management technology, a prometheus system monitoring and alarming technology, and a grafana data display technology, and the cross-kubernets cluster monitoring system and method enable a masculine Alcor cluster to operate healthily, enable cluster data to be displayed more clearly and specifically, enhance the effectiveness of monitoring information, reduce the time required for maintenance, and provide a data source and initial classification for big data analysis and cloud computing.
Specifically, the acquisition pressure of monitoring information and the calculation pressure of monitoring items are separated from a set of prometheus, prometheus in each set of open anode Alcor clusters is only responsible for data acquisition and monitoring item calculation of the cluster, the pressure is controllable, prometheus-out is only responsible for synchronizing the monitoring data of all the clusters, the monitoring item calculation is not carried out, and the calculation pressure is reduced; prometheus and alert manager in each set of clusters can be customized according to the characteristics of the clusters, and are not limited by other clusters; when configuration change is carried out, the whole amount of change can be carried out after verification in a certain cluster, so that the risk of configuration change is reduced; the prometheus in the cluster only keeps data for 7 days, so that the pressure of storage in the cluster is reduced, and in the case of fault loss, lost data can be found in prometheus-out, so that high availability of the data is ensured; a user can check monitoring data of containers deployed on different clusters through one access of the grafana-out, and user experience is improved.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes may be made to the embodiment of the present invention by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (10)
1. A cross-kubernets cluster monitoring system, comprising: a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
2. The system of claim 1, wherein the prometheus cluster monitoring component is specifically configured to:
and acquiring monitoring data from the open-sun Alcor cluster component and the cluster container Docker according to a set data acquisition rule, and acquiring the monitoring data of the open-sun Alcor cluster physical server from the node-exporter data acquisition component, the process-exporter data acquisition component and the blackbox data acquisition component.
3. The cross-kubernets cluster monitoring system of claim 1, wherein the prometheus monitoring component and the prometheus-out component are further configured to:
presetting a monitoring data storage time limit, and storing the monitoring data according to the preset monitoring data storage time limit.
4. The system of claim 3, wherein the monitoring data in the prometheus monitoring component is stored for a period of 7 days; the storage period of the monitoring data in the prometheus-out component is 30 days.
5. The system of claim 1, wherein the prometheus cluster monitoring component is specifically configured to:
and generating alarm information according to the monitoring data based on the set alarm item rule.
6. The system of claim 1, wherein the grapana monitoring component and the grapana-out component are specifically configured to: and displaying the monitoring data through a preset data display diagram.
7. A cross-kubernets cluster monitoring method is characterized by comprising the following steps:
installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
8. The method of claim 7, wherein the prometheus monitoring component obtains monitoring data from an open-sun Alcor cluster component and a cluster container Docker, and obtains monitoring data from a node-exporter data collection component, a process-exporter data collection component, and a blackbox data collection component, and comprises:
and acquiring monitoring data from the open-sun Alcor cluster component and the cluster container Docker according to a set data acquisition rule, and acquiring the monitoring data of the open-sun Alcor cluster physical server from the node-exporter data acquisition component, the process-exporter data acquisition component and the blackbox data acquisition component.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 7 to 8 when executing the computer program.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for executing the method of any one of claims 7 to 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010258248.3A CN111459763B (en) | 2020-04-03 | 2020-04-03 | Cross-kubernetes cluster monitoring system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010258248.3A CN111459763B (en) | 2020-04-03 | 2020-04-03 | Cross-kubernetes cluster monitoring system and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111459763A true CN111459763A (en) | 2020-07-28 |
CN111459763B CN111459763B (en) | 2023-10-24 |
Family
ID=71685848
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010258248.3A Active CN111459763B (en) | 2020-04-03 | 2020-04-03 | Cross-kubernetes cluster monitoring system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111459763B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112015753A (en) * | 2020-08-31 | 2020-12-01 | 南京易捷思达软件科技有限公司 | Monitoring system and method suitable for containerized deployment of open-source cloud platform |
CN112165502A (en) * | 2020-08-06 | 2021-01-01 | 中信银行股份有限公司 | Service discovery system, method and second server |
CN112162821A (en) * | 2020-09-25 | 2021-01-01 | 中国电力科学研究院有限公司 | Container cluster resource monitoring method, device and system |
CN112286628A (en) * | 2020-10-19 | 2021-01-29 | 烽火通信科技股份有限公司 | System for unifying nanotube Kubernetes heterogeneous applications and operation method |
CN112328456A (en) * | 2021-01-04 | 2021-02-05 | 北京电信易通信息技术股份有限公司 | Cluster resource monitoring system based on service discovery |
CN112511339A (en) * | 2020-11-09 | 2021-03-16 | 宝付网络科技(上海)有限公司 | Container monitoring alarm method, system, equipment and storage medium based on multiple clusters |
CN112698915A (en) * | 2020-12-31 | 2021-04-23 | 北京千方科技股份有限公司 | Multi-cluster unified monitoring alarm method, system, equipment and storage medium |
CN112711512A (en) * | 2020-12-29 | 2021-04-27 | 北京浪潮数据技术有限公司 | Prometheus monitoring method, device and equipment |
CN114003312A (en) * | 2021-10-29 | 2022-02-01 | 广东智联蔚来科技有限公司 | Big data service component management method, computer device and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108921551A (en) * | 2018-06-11 | 2018-11-30 | 西安纸贵互联网科技有限公司 | Alliance's block catenary system based on Kubernetes platform |
CN109245931A (en) * | 2018-09-19 | 2019-01-18 | 四川长虹电器股份有限公司 | The log management of container cloud platform based on kubernetes and the implementation method of monitoring alarm |
CN110086674A (en) * | 2019-05-06 | 2019-08-02 | 山东浪潮云信息技术有限公司 | A kind of application high availability implementation method and system based on container |
CN110247810A (en) * | 2019-07-09 | 2019-09-17 | 浪潮云信息技术有限公司 | A kind of system and method for collection vessel service monitoring data |
CN110262944A (en) * | 2019-06-21 | 2019-09-20 | 四川长虹电器股份有限公司 | The method that a kind of pair of K8s cluster container resource is monitored and is alerted |
US20190317824A1 (en) * | 2018-04-11 | 2019-10-17 | Microsoft Technology Licensing, Llc | Deployment of services across clusters of nodes |
-
2020
- 2020-04-03 CN CN202010258248.3A patent/CN111459763B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190317824A1 (en) * | 2018-04-11 | 2019-10-17 | Microsoft Technology Licensing, Llc | Deployment of services across clusters of nodes |
CN108921551A (en) * | 2018-06-11 | 2018-11-30 | 西安纸贵互联网科技有限公司 | Alliance's block catenary system based on Kubernetes platform |
CN109245931A (en) * | 2018-09-19 | 2019-01-18 | 四川长虹电器股份有限公司 | The log management of container cloud platform based on kubernetes and the implementation method of monitoring alarm |
CN110086674A (en) * | 2019-05-06 | 2019-08-02 | 山东浪潮云信息技术有限公司 | A kind of application high availability implementation method and system based on container |
CN110262944A (en) * | 2019-06-21 | 2019-09-20 | 四川长虹电器股份有限公司 | The method that a kind of pair of K8s cluster container resource is monitored and is alerted |
CN110247810A (en) * | 2019-07-09 | 2019-09-17 | 浪潮云信息技术有限公司 | A kind of system and method for collection vessel service monitoring data |
Non-Patent Citations (1)
Title |
---|
罗佳豪: "Prometheus监控Kubernetes系列1——监控框架", pages 1 - 7 * |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112165502A (en) * | 2020-08-06 | 2021-01-01 | 中信银行股份有限公司 | Service discovery system, method and second server |
CN112165502B (en) * | 2020-08-06 | 2022-11-25 | 中信银行股份有限公司 | Service discovery system, method and second server |
CN112015753A (en) * | 2020-08-31 | 2020-12-01 | 南京易捷思达软件科技有限公司 | Monitoring system and method suitable for containerized deployment of open-source cloud platform |
CN112015753B (en) * | 2020-08-31 | 2023-10-31 | 北京易捷思达科技发展有限公司 | Monitoring system and method suitable for containerized deployment of open source cloud platform |
CN112162821B (en) * | 2020-09-25 | 2022-04-26 | 中国电力科学研究院有限公司 | Container cluster resource monitoring method, device and system |
CN112162821A (en) * | 2020-09-25 | 2021-01-01 | 中国电力科学研究院有限公司 | Container cluster resource monitoring method, device and system |
CN112286628A (en) * | 2020-10-19 | 2021-01-29 | 烽火通信科技股份有限公司 | System for unifying nanotube Kubernetes heterogeneous applications and operation method |
CN112286628B (en) * | 2020-10-19 | 2022-05-17 | 烽火通信科技股份有限公司 | System for unifying nanotube Kubernetes heterogeneous applications and operation method |
CN112511339A (en) * | 2020-11-09 | 2021-03-16 | 宝付网络科技(上海)有限公司 | Container monitoring alarm method, system, equipment and storage medium based on multiple clusters |
CN112511339B (en) * | 2020-11-09 | 2023-04-07 | 宝付网络科技(上海)有限公司 | Container monitoring alarm method, system, equipment and storage medium based on multiple clusters |
CN112711512A (en) * | 2020-12-29 | 2021-04-27 | 北京浪潮数据技术有限公司 | Prometheus monitoring method, device and equipment |
CN112698915A (en) * | 2020-12-31 | 2021-04-23 | 北京千方科技股份有限公司 | Multi-cluster unified monitoring alarm method, system, equipment and storage medium |
CN112328456A (en) * | 2021-01-04 | 2021-02-05 | 北京电信易通信息技术股份有限公司 | Cluster resource monitoring system based on service discovery |
CN114003312A (en) * | 2021-10-29 | 2022-02-01 | 广东智联蔚来科技有限公司 | Big data service component management method, computer device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN111459763B (en) | 2023-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111459763B (en) | Cross-kubernetes cluster monitoring system and method | |
CN105653425B (en) | Monitoring system based on complex event processing engine | |
US11640465B2 (en) | Methods and systems for troubleshooting applications using streaming anomaly detection | |
CN107577805B (en) | Business service system for log big data analysis | |
US10146599B2 (en) | System and method for a generic actor system container application | |
CN108874558B (en) | Message subscription method of distributed transaction, electronic device and readable storage medium | |
EP3543866A1 (en) | Resource-efficient record processing in unified automation platforms for robotic process automation | |
CN111666189B (en) | Method and system for declaratively visually configuring Prometheus monitoring alarm | |
CN108762900A (en) | High frequency method for scheduling task, system, computer equipment and storage medium | |
CN105871957B (en) | Monitoring framework design method and monitoring server, agent unit, control server | |
US11743155B2 (en) | Systems and methods of monitoring and controlling remote assets | |
EP3671580A1 (en) | Analyzing device-related data to generate and/or suppress device-related alerts | |
CN110569113A (en) | Method and system for scheduling distributed tasks and computer readable storage medium | |
CN109245908A (en) | A kind of method and apparatus of principal and subordinate's cluster switching | |
CN108768790A (en) | Distributed search cluster monitoring method and device, computing device, storage medium | |
CN105553732B (en) | A kind of distributed network analogy method and system | |
Nguyen et al. | A low-cost two-tier fog computing testbed for streaming IoT-based applications | |
CN113422692A (en) | Method, device and storage medium for detecting and processing node faults in K8s cluster | |
US10331484B2 (en) | Distributed data platform resource allocator | |
WO2023138014A1 (en) | Intelligent operation and maintenance system oriented to computing-network integration scenario and use method thereof | |
CN114090378A (en) | Custom monitoring and alarming method based on Kapacitor | |
CN110784347A (en) | Node management method, system, equipment and storage medium for container cluster | |
EP3011456B1 (en) | Sorted event monitoring by context partition | |
CN115543543A (en) | Application service processing method, device, equipment and medium | |
Li | Online Experiment Platform: A Microservices-based Cloud Native Application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20221010 Address after: 25 Financial Street, Xicheng District, Beijing 100033 Applicant after: CHINA CONSTRUCTION BANK Corp. Address before: 25 Financial Street, Xicheng District, Beijing 100033 Applicant before: CHINA CONSTRUCTION BANK Corp. Applicant before: Jianxin Financial Science and Technology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |