CN111459763A - Cross-kubernets cluster monitoring system and method - Google Patents

Cross-kubernets cluster monitoring system and method Download PDF

Info

Publication number
CN111459763A
CN111459763A CN202010258248.3A CN202010258248A CN111459763A CN 111459763 A CN111459763 A CN 111459763A CN 202010258248 A CN202010258248 A CN 202010258248A CN 111459763 A CN111459763 A CN 111459763A
Authority
CN
China
Prior art keywords
component
monitoring
data
cluster
alcor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010258248.3A
Other languages
Chinese (zh)
Other versions
CN111459763B (en
Inventor
董黎阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
Original Assignee
China Construction Bank Corp
CCB Finetech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Construction Bank Corp, CCB Finetech Co Ltd filed Critical China Construction Bank Corp
Priority to CN202010258248.3A priority Critical patent/CN111459763B/en
Publication of CN111459763A publication Critical patent/CN111459763A/en
Application granted granted Critical
Publication of CN111459763B publication Critical patent/CN111459763B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/301Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is a virtual computing platform, e.g. logically partitioned systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3096Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents wherein the means or processing minimize the use of computing system or of computing system component resources, e.g. non-intrusive monitoring which minimizes the probe effect: sniffing, intercepting, indirectly deriving the monitored data from other directly available data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/323Visualisation of programs or trace data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/815Virtual

Abstract

The invention provides a cross-kubernets cluster monitoring system and a method, which comprises the following steps: a plurality of open-sun Alcor clusters, proxeus-out and grafana-out components, the proxeus-out and grafana-out components being deployed outside the Alcor clusters; a promoter, an alert manager and a grafana monitoring component, a node-exporter, a process-exporter and a blackbox data acquisition component are installed in the Alcor cluster; the prometheus-out component synchronizes the monitoring data from the prometheus monitoring component; the grafana-out component exposes the monitoring data. The scheme solves the monitoring and data display of cross-cluster data collection.

Description

Cross-kubernets cluster monitoring system and method
Technical Field
The invention relates to the technical field of kubernets cluster monitoring, in particular to a cross-kubernets cluster monitoring system and a cross-kubernets cluster monitoring method.
Background
Container technology is currently the hot door technology and is the leading technology. Since Docker's release, software deployment became easy, and truly realized that deployment was run everywhere. kubernets is an open-source Docker container cluster management system of a cross-host cluster, and provides a whole set of functions of resource scheduling, deployment and operation, service discovery, capacity expansion, capacity reduction and the like for containerized applications. The Kaiyang Alcor is a container cloud platform developed and packaged based on the original kubernets, massive applications and services are deployed in the container cloud platform, the situation that multiple clusters run simultaneously is inevitably formed as the application increases and the user demand complexity is improved, and the kubernets are monitored and data display is carried out through matching of promemeus, alert manager and grafana in the prior art. However, when multiple clusters occur, each cluster has a set of prometheus monitoring component, and no matter a system administrator or a tenant, it is inevitable to switch back and forth among the multiple clusters to observe complete monitoring data, and complexity of the Kaiyang Alcor in extracting the monitoring data is increased, so that the monitoring data in the multiple clusters are difficult to converge, and great challenges are brought to analysis and comparison of the monitoring data when faults occur in the later period. The prior art lacks a cross-cluster solution for monitoring, alarming and data displaying of massive applications and services on a plurality of clusters.
Disclosure of Invention
The embodiment of the invention provides a cross-kubernets cluster monitoring system and a cross-kubernets cluster monitoring method, which solve the technical problem that a cross-cluster solution for monitoring, alarming and displaying data of massive applications and services on a plurality of clusters is lacked in the prior art.
The embodiment of the invention provides a cross-kubernets cluster monitoring system, which comprises:
a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
The embodiment of the invention also provides a cross-kubernets cluster monitoring method, which comprises the following steps:
installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In the embodiment of the invention, a protemetheus monitoring component, an alert manager monitoring component and a grafana monitoring component are installed in each open-sun Alcor cluster, so that the acquisition pressure of monitoring information and the calculation pressure of monitoring items are separated from one set of protemetheus, the protemetheus in each open-sun Alcor cluster is only responsible for the data acquisition and monitoring item calculation of the cluster, and the pressure is controllable; deploying a proxy-out component and a grafana-out component outside a plurality of open-sun Alcor clusters, wherein the proxy-out is only responsible for synchronizing the monitoring data of all the clusters, and does not perform monitoring item calculation, so that the calculation pressure is reduced; the user can check the monitoring data of the containers deployed on different clusters through the grafana-out portal, and the user experience is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a structural diagram of a cross-kubernets cluster monitoring system according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Explanation of technical terms
Docker is an open source application container engine that allows developers to package their applications and dependencies into a portable container and then publish them on any popular L inux machine, as well as to implement virtualization.
kubernets, K8s for short, is an open-source application for managing containerization on multiple hosts in a cloud platform, and aims to make it simple and efficient to deploy containerized applications (powerfull), and provides a mechanism for application deployment, planning, updating, and maintenance.
Prometheus is a set of open source monitoring & alarm & time series database combinations. The monitoring data acquisition and storage device is used for acquiring and storing monitoring data of docker and kubernets.
grafana is an open source application written in go language, is mainly used for visualization display of large-scale index data, is the most popular time sequence data display tool in network architecture and application analysis, and supports most common time sequence databases at present. The monitoring data is used for graphically displaying the monitoring data stored by the prometheus.
The alert manager is an independent alarm module, receives alerts sent by clients such as Prometheus, and then processes the alerts by grouping, deleting duplicates, and the like, and sends the alerts to a correct receiver through a route; the alarm mode can be sent to different module responsible persons according to different rules. The alarm of Prometheus is divided into two parts. The alarm rules in the Prometheus server send alarms to Alertmanager. The alert manager then manages these alerts including methods of silence, suppression, aggregation and passage, such as email notification, call notification systems, and instant messaging platforms.
Apiserver is the bus and external API interface for open sun Alcor cluster message communication.
The Controller is the Controller of the open sun Alcor cluster state.
Scheduler is the Scheduler of the open sun Alcor cluster container service.
Coredns is an open Alcor cluster internal DNS resolution service.
Helm chart is a kubernets service template orchestration tool.
The main steps for setting up alerts and notifications are:
setting and configuring an Alertmanager;
configuring Prometheus and Alertmanager dialogue;
alarm rules are created in Prometheus.
Exporter: all programs that can provide monitoring sample data to Prometheus can be referred to as an Exporter.
The Node exporter is mainly used for exposing metrics to Prometeus, wherein the metrics comprises: cpu load, memory usage, network, etc.
Process-exporter is mainly used to expose metrics to Prometeus, where metrics includes: the state of the process running on the server.
Blackbox is primarily used to expose metrics to Prometheus, where metrics include: a server port status.
In an embodiment of the present invention, a method for monitoring a system across kubernets is provided, as shown in fig. 1, including:
a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
In an embodiment of the present invention, the prometheus monitoring component is specifically configured to:
acquiring monitoring data from an open anode Aclor cluster component (apiserver, controller, scheduler, codedns) and a cluster container Docker according to a set data acquisition rule, and acquiring monitoring data of an open anode Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component. Wherein Docker, kubernets are components of the open-market alcor cluster.
Presetting a monitoring data storage time limit, and storing the monitoring data according to the preset monitoring data storage time limit. Wherein the storage period of the monitoring data is 7 days;
and generating alarm information according to the monitoring data based on the set alarm item rule.
In an embodiment of the present invention, the grafana monitoring component and the grafana-out component are specifically configured to: and displaying the monitoring data through a preset data display diagram.
In an embodiment of the present invention, the prometheus-out component and the grafana-out component may be deployed 2 outside the plurality of open-sun Alcor clusters. Using a prometheus federation mechanism, the pometheus-out is configured to synchronize monitoring data from prometheus in all open-sun Alcor clusters for a preset monitoring data storage period (e.g., 30 days), and to add distinguishable tags to the data of each cluster. And configuring the grafana-out, and defining a data presentation general view applicable to all Kaiyang Alcor clusters.
Based on the same inventive concept, the embodiment of the present invention further provides a cross-kubernets cluster monitoring method, as described in the following embodiments.
Installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
In the embodiment of the invention, an implementation method of a cross-kubernets cluster monitoring system based on data acquisition in an open-sun Alcor environment is a cross-cluster monitoring and data display method of a large-scale container cloud platform, and specifically comprises the following installation steps:
writing a configuration file of a proxy and an alert manager of a proxy operator, and predefining a data acquisition rule and an alarm item rule so as to realize the installation of monitoring and data display components in each cluster in the form of the operator.
Writing a configuration file of the grafana, and predefining a data display diagram for later integration into an installer of the Kaiyang Alcor through palm charts.
Writing palm characters, integrating the installation of components of a promoter operator, a node-exporter, a process-exporter and a blackbox into an installation program of the open-sun Alcor, and automatically installing a cluster monitoring and data display component when the open-sun Alcor cluster is installed.
And IV, independently deploying 2 sets of proxy-out and grfana-out outside the cluster, modifying a configured proxy-out configuration file to form a federation with the proxy in all the clusters, acquiring monitoring data of the proxy in the cluster in real time, and uniformly displaying the data by the grfana-out.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In summary, the present invention provides a cross-kubernets cluster monitoring system and method, which relates to a computer technology, a docker container technology, a kubernets large-scale container management technology, a prometheus system monitoring and alarming technology, and a grafana data display technology, and the cross-kubernets cluster monitoring system and method enable a masculine Alcor cluster to operate healthily, enable cluster data to be displayed more clearly and specifically, enhance the effectiveness of monitoring information, reduce the time required for maintenance, and provide a data source and initial classification for big data analysis and cloud computing.
Specifically, the acquisition pressure of monitoring information and the calculation pressure of monitoring items are separated from a set of prometheus, prometheus in each set of open anode Alcor clusters is only responsible for data acquisition and monitoring item calculation of the cluster, the pressure is controllable, prometheus-out is only responsible for synchronizing the monitoring data of all the clusters, the monitoring item calculation is not carried out, and the calculation pressure is reduced; prometheus and alert manager in each set of clusters can be customized according to the characteristics of the clusters, and are not limited by other clusters; when configuration change is carried out, the whole amount of change can be carried out after verification in a certain cluster, so that the risk of configuration change is reduced; the prometheus in the cluster only keeps data for 7 days, so that the pressure of storage in the cluster is reduced, and in the case of fault loss, lost data can be found in prometheus-out, so that high availability of the data is ensured; a user can check monitoring data of containers deployed on different clusters through one access of the grafana-out, and user experience is improved.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes may be made to the embodiment of the present invention by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A cross-kubernets cluster monitoring system, comprising: a plurality of open-sun Alcor clusters, a progreus-out component, and a grafana-out component, the progreus-out component and the grafana-out component deployed outside the plurality of open-sun Alcor clusters;
a promoter monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component are installed in each open-sun Alcor cluster;
the prometheus monitoring component is used to: acquiring monitoring data from an open-sun Alcor cluster component and a cluster container Docker, acquiring monitoring data of an open-sun Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generating alarm information according to the monitoring data, and sending the alarm information to an alert manager monitoring component;
the alert manager monitoring component is used for: managing the alarm information;
the grafana monitoring component is configured to: acquiring monitoring data from a prometheus monitoring component for displaying;
the prometheus-out component is used to: synchronizing monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adding a data distinguishing label to the monitoring data of each open-sun Alcor cluster;
the grafana-out component is to: and acquiring monitoring data from a prometheus-out component for displaying.
2. The system of claim 1, wherein the prometheus cluster monitoring component is specifically configured to:
and acquiring monitoring data from the open-sun Alcor cluster component and the cluster container Docker according to a set data acquisition rule, and acquiring the monitoring data of the open-sun Alcor cluster physical server from the node-exporter data acquisition component, the process-exporter data acquisition component and the blackbox data acquisition component.
3. The cross-kubernets cluster monitoring system of claim 1, wherein the prometheus monitoring component and the prometheus-out component are further configured to:
presetting a monitoring data storage time limit, and storing the monitoring data according to the preset monitoring data storage time limit.
4. The system of claim 3, wherein the monitoring data in the prometheus monitoring component is stored for a period of 7 days; the storage period of the monitoring data in the prometheus-out component is 30 days.
5. The system of claim 1, wherein the prometheus cluster monitoring component is specifically configured to:
and generating alarm information according to the monitoring data based on the set alarm item rule.
6. The system of claim 1, wherein the grapana monitoring component and the grapana-out component are specifically configured to: and displaying the monitoring data through a preset data display diagram.
7. A cross-kubernets cluster monitoring method is characterized by comprising the following steps:
installing a proxy monitoring component, an alert manager monitoring component, a grafana monitoring component, a node-exporter data acquisition component, a process-export data acquisition component and a blackbox data acquisition component in each open-sun Alcor cluster in the plurality of open-sun Alcor clusters, and deploying the proxy-out component and the grafana-out component outside the plurality of open-sun Alcor clusters;
the method comprises the steps that a proxy monitoring component obtains monitoring data from a Kaiyang Alcor cluster component and a cluster container Docker, obtains monitoring data of a Kaiyang Alcor cluster physical server from a node-exporter data acquisition component, a process-exporter data acquisition component and a blackbox data acquisition component, generates alarm information according to the monitoring data, and sends the alarm information to an alertmager monitoring component;
the alert information is managed by the alert manager monitoring component;
the method comprises the following steps that a grafana monitoring component obtains monitoring data from a prometheus monitoring component for displaying;
a prometheus-out component synchronizes monitoring data from prometheus monitoring components in a plurality of open-sun Alcor clusters, and adds a data distinguishing tag to the monitoring data of each open-sun Alcor cluster;
the grafana-out component obtains the monitoring data from the prometheus-out component for displaying.
8. The method of claim 7, wherein the prometheus monitoring component obtains monitoring data from an open-sun Alcor cluster component and a cluster container Docker, and obtains monitoring data from a node-exporter data collection component, a process-exporter data collection component, and a blackbox data collection component, and comprises:
and acquiring monitoring data from the open-sun Alcor cluster component and the cluster container Docker according to a set data acquisition rule, and acquiring the monitoring data of the open-sun Alcor cluster physical server from the node-exporter data acquisition component, the process-exporter data acquisition component and the blackbox data acquisition component.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 7 to 8 when executing the computer program.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for executing the method of any one of claims 7 to 8.
CN202010258248.3A 2020-04-03 2020-04-03 Cross-kubernetes cluster monitoring system and method Active CN111459763B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010258248.3A CN111459763B (en) 2020-04-03 2020-04-03 Cross-kubernetes cluster monitoring system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010258248.3A CN111459763B (en) 2020-04-03 2020-04-03 Cross-kubernetes cluster monitoring system and method

Publications (2)

Publication Number Publication Date
CN111459763A true CN111459763A (en) 2020-07-28
CN111459763B CN111459763B (en) 2023-10-24

Family

ID=71685848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010258248.3A Active CN111459763B (en) 2020-04-03 2020-04-03 Cross-kubernetes cluster monitoring system and method

Country Status (1)

Country Link
CN (1) CN111459763B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112015753A (en) * 2020-08-31 2020-12-01 南京易捷思达软件科技有限公司 Monitoring system and method suitable for containerized deployment of open-source cloud platform
CN112165502A (en) * 2020-08-06 2021-01-01 中信银行股份有限公司 Service discovery system, method and second server
CN112162821A (en) * 2020-09-25 2021-01-01 中国电力科学研究院有限公司 Container cluster resource monitoring method, device and system
CN112286628A (en) * 2020-10-19 2021-01-29 烽火通信科技股份有限公司 System for unifying nanotube Kubernetes heterogeneous applications and operation method
CN112328456A (en) * 2021-01-04 2021-02-05 北京电信易通信息技术股份有限公司 Cluster resource monitoring system based on service discovery
CN112511339A (en) * 2020-11-09 2021-03-16 宝付网络科技(上海)有限公司 Container monitoring alarm method, system, equipment and storage medium based on multiple clusters
CN112698915A (en) * 2020-12-31 2021-04-23 北京千方科技股份有限公司 Multi-cluster unified monitoring alarm method, system, equipment and storage medium
CN112711512A (en) * 2020-12-29 2021-04-27 北京浪潮数据技术有限公司 Prometheus monitoring method, device and equipment
CN114003312A (en) * 2021-10-29 2022-02-01 广东智联蔚来科技有限公司 Big data service component management method, computer device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108921551A (en) * 2018-06-11 2018-11-30 西安纸贵互联网科技有限公司 Alliance's block catenary system based on Kubernetes platform
CN109245931A (en) * 2018-09-19 2019-01-18 四川长虹电器股份有限公司 The log management of container cloud platform based on kubernetes and the implementation method of monitoring alarm
CN110086674A (en) * 2019-05-06 2019-08-02 山东浪潮云信息技术有限公司 A kind of application high availability implementation method and system based on container
CN110247810A (en) * 2019-07-09 2019-09-17 浪潮云信息技术有限公司 A kind of system and method for collection vessel service monitoring data
CN110262944A (en) * 2019-06-21 2019-09-20 四川长虹电器股份有限公司 The method that a kind of pair of K8s cluster container resource is monitored and is alerted
US20190317824A1 (en) * 2018-04-11 2019-10-17 Microsoft Technology Licensing, Llc Deployment of services across clusters of nodes

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190317824A1 (en) * 2018-04-11 2019-10-17 Microsoft Technology Licensing, Llc Deployment of services across clusters of nodes
CN108921551A (en) * 2018-06-11 2018-11-30 西安纸贵互联网科技有限公司 Alliance's block catenary system based on Kubernetes platform
CN109245931A (en) * 2018-09-19 2019-01-18 四川长虹电器股份有限公司 The log management of container cloud platform based on kubernetes and the implementation method of monitoring alarm
CN110086674A (en) * 2019-05-06 2019-08-02 山东浪潮云信息技术有限公司 A kind of application high availability implementation method and system based on container
CN110262944A (en) * 2019-06-21 2019-09-20 四川长虹电器股份有限公司 The method that a kind of pair of K8s cluster container resource is monitored and is alerted
CN110247810A (en) * 2019-07-09 2019-09-17 浪潮云信息技术有限公司 A kind of system and method for collection vessel service monitoring data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
罗佳豪: "Prometheus监控Kubernetes系列1——监控框架", pages 1 - 7 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112165502A (en) * 2020-08-06 2021-01-01 中信银行股份有限公司 Service discovery system, method and second server
CN112165502B (en) * 2020-08-06 2022-11-25 中信银行股份有限公司 Service discovery system, method and second server
CN112015753A (en) * 2020-08-31 2020-12-01 南京易捷思达软件科技有限公司 Monitoring system and method suitable for containerized deployment of open-source cloud platform
CN112015753B (en) * 2020-08-31 2023-10-31 北京易捷思达科技发展有限公司 Monitoring system and method suitable for containerized deployment of open source cloud platform
CN112162821B (en) * 2020-09-25 2022-04-26 中国电力科学研究院有限公司 Container cluster resource monitoring method, device and system
CN112162821A (en) * 2020-09-25 2021-01-01 中国电力科学研究院有限公司 Container cluster resource monitoring method, device and system
CN112286628A (en) * 2020-10-19 2021-01-29 烽火通信科技股份有限公司 System for unifying nanotube Kubernetes heterogeneous applications and operation method
CN112286628B (en) * 2020-10-19 2022-05-17 烽火通信科技股份有限公司 System for unifying nanotube Kubernetes heterogeneous applications and operation method
CN112511339A (en) * 2020-11-09 2021-03-16 宝付网络科技(上海)有限公司 Container monitoring alarm method, system, equipment and storage medium based on multiple clusters
CN112511339B (en) * 2020-11-09 2023-04-07 宝付网络科技(上海)有限公司 Container monitoring alarm method, system, equipment and storage medium based on multiple clusters
CN112711512A (en) * 2020-12-29 2021-04-27 北京浪潮数据技术有限公司 Prometheus monitoring method, device and equipment
CN112698915A (en) * 2020-12-31 2021-04-23 北京千方科技股份有限公司 Multi-cluster unified monitoring alarm method, system, equipment and storage medium
CN112328456A (en) * 2021-01-04 2021-02-05 北京电信易通信息技术股份有限公司 Cluster resource monitoring system based on service discovery
CN114003312A (en) * 2021-10-29 2022-02-01 广东智联蔚来科技有限公司 Big data service component management method, computer device and storage medium

Also Published As

Publication number Publication date
CN111459763B (en) 2023-10-24

Similar Documents

Publication Publication Date Title
CN111459763B (en) Cross-kubernetes cluster monitoring system and method
CN105653425B (en) Monitoring system based on complex event processing engine
US11640465B2 (en) Methods and systems for troubleshooting applications using streaming anomaly detection
CN107577805B (en) Business service system for log big data analysis
US10146599B2 (en) System and method for a generic actor system container application
CN108874558B (en) Message subscription method of distributed transaction, electronic device and readable storage medium
EP3543866A1 (en) Resource-efficient record processing in unified automation platforms for robotic process automation
CN111666189B (en) Method and system for declaratively visually configuring Prometheus monitoring alarm
CN108762900A (en) High frequency method for scheduling task, system, computer equipment and storage medium
CN105871957B (en) Monitoring framework design method and monitoring server, agent unit, control server
US11743155B2 (en) Systems and methods of monitoring and controlling remote assets
EP3671580A1 (en) Analyzing device-related data to generate and/or suppress device-related alerts
CN110569113A (en) Method and system for scheduling distributed tasks and computer readable storage medium
CN109245908A (en) A kind of method and apparatus of principal and subordinate's cluster switching
CN108768790A (en) Distributed search cluster monitoring method and device, computing device, storage medium
CN105553732B (en) A kind of distributed network analogy method and system
Nguyen et al. A low-cost two-tier fog computing testbed for streaming IoT-based applications
CN113422692A (en) Method, device and storage medium for detecting and processing node faults in K8s cluster
US10331484B2 (en) Distributed data platform resource allocator
WO2023138014A1 (en) Intelligent operation and maintenance system oriented to computing-network integration scenario and use method thereof
CN114090378A (en) Custom monitoring and alarming method based on Kapacitor
CN110784347A (en) Node management method, system, equipment and storage medium for container cluster
EP3011456B1 (en) Sorted event monitoring by context partition
CN115543543A (en) Application service processing method, device, equipment and medium
Li Online Experiment Platform: A Microservices-based Cloud Native Application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20221010

Address after: 25 Financial Street, Xicheng District, Beijing 100033

Applicant after: CHINA CONSTRUCTION BANK Corp.

Address before: 25 Financial Street, Xicheng District, Beijing 100033

Applicant before: CHINA CONSTRUCTION BANK Corp.

Applicant before: Jianxin Financial Science and Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant