CN118295876A - Distributed monitoring method and system for visually monitoring node operation - Google Patents

Publication number
CN118295876A
CN118295876A CN202410487050.0A CN202410487050A CN118295876A CN 118295876 A CN118295876 A CN 118295876A CN 202410487050 A CN202410487050 A CN 202410487050A CN 118295876 A CN118295876 A CN 118295876A
Authority
CN
China
Prior art keywords
monitoring
module
node
index data
nodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410487050.0A
Other languages
Chinese (zh)
Inventor
姬如意
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Matrix Time Digital Technology Co Ltd
Original Assignee
Matrix Time Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matrix Time Digital Technology Co Ltd
Priority to CN202410487050.0A
Publication of CN118295876A

Landscapes

  • Debugging And Monitoring (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The application discloses a distributed monitoring method and system for visually monitoring node operation. The method comprises: first, determining the category of each monitoring node and selecting the corresponding category of sub-module in the monitoring module; next, determining the index types of each monitoring node, having the sub-module monitor, according to the monitoring configuration requirements, the index data that it acquires from the monitored node at a timed monitoring interval, and sending the index data to a calculation module; then, at the calculation module, receiving the index data and forming an index data chart through statistics and summarization; and finally, displaying the index data and the content of the data chart on a display module. By automating the monitoring, the application frees up the personnel who previously had to intervene manually and reduces system cost; compared with manual inspection, it finds data problems during system operation more accurately and promptly, which reduces risk and improves system availability.

Description

Distributed monitoring method and system for visually monitoring node operation
Technical Field
The application relates to the technical field of computers, in particular to a distributed monitoring method and system for visually monitoring node operation.
Background
In a distributed application, a group of computer nodes communicate with each other over a network and coordinate to perform a common task. Although the failure probability of any single node is low, the probability that some node fails becomes high once the number of nodes reaches a certain scale. A distributed application system must remain available when a node fails, and ensuring this requires monitoring the state of the nodes.
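The scaling effect described above can be made concrete under a simplifying assumption that is not stated in the application: nodes fail independently, and each fails with the same probability p during a given period. The probability that at least one of n nodes fails is then:

```latex
P(\text{at least one failure}) = 1 - (1 - p)^{n}
```

For illustration only, with assumed values p = 0.001 and n = 1000, this gives 1 - 0.999^1000, which is approximately 0.63. A failure somewhere in the system is then more likely than not, even though each node is individually reliable.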
At present, node state is monitored through logs, which technicians must analyze manually. Log analysis is tedious and unintuitive, it rarely provides early warning, and in most cases a problem is located only after it has occurred. A more intelligent and more intuitive monitoring system and method is therefore needed to help technicians monitor node state quickly.
Disclosure of Invention
Object of the invention: to solve the technical problems described in the background, the invention provides a distributed monitoring method and system for visually monitoring node operation, in which a different sub-module of the monitoring module is adapted to each kind of monitoring node in order to monitor node data. Automated monitoring frees up the personnel who previously had to intervene manually and reduces system cost; compared with manual inspection, it finds data problems during system operation more accurately and promptly, which reduces risk and improves system availability.
Technical solution: the invention discloses a distributed monitoring method for visually monitoring node operation, comprising the following steps:
(1) Determining the category of each monitoring node, and selecting the corresponding sub-module category in the monitoring module according to that category;
(2) Determining the index types of each monitoring node, and setting the monitoring configuration requirements of each monitoring node, which may differ from node to node; according to the monitoring configuration requirements, the sub-module in the monitoring module monitors the index data that it acquires from the monitored node at a timed monitoring interval, and sends the index data to the calculation module;
(3) The calculation module receives the index data from each monitoring module and forms an index data chart for each monitoring node through statistics and summarization; at the same time, the calculation module sends the received index data to the display module;
(4) The display module displays the received index data; at the same time, the display module obtains the user's display requirements and, according to them, retrieves content of the monitoring node's index data chart from the calculation module for display.
Further, the categories of monitoring node comprise Linux nodes and win (Windows) nodes; the sub-module categories in the monitoring module comprise a Linux monitoring sub-module and a win monitoring sub-module.
Further, the index types of a monitoring node include CPU usage, memory occupancy, disk occupancy and the number of task processes.
Further, setting the monitoring configuration requirements of each monitoring node means that the sub-module in the monitoring module selects, according to the category of the monitoring node and its index types, one or more kinds of index data from those index types to monitor.
Further, the timed monitoring interval is 0.1 to 10 seconds.
Further, monitoring the acquired index data means comparing the data parameters of the index data with preset data parameter thresholds and raising an alarm if a data parameter exceeds its preset threshold.
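The timed sampling and threshold comparison described above can be sketched as follows. This is a minimal illustration rather than the applicant's implementation: the function names, the index name "cpu_usage", and the injected `read_sample`, `alarm` and `sleep` callables are all assumptions introduced for the example; only the 0.1 to 10 second range and the "alarm when a threshold is exceeded" rule come from the text.

```python
import time
from typing import Callable, Dict, List

MIN_INTERVAL_S = 0.1   # lower bound of the timed monitoring interval given in the text
MAX_INTERVAL_S = 10.0  # upper bound given in the text

Sample = Dict[str, float]  # hypothetical shape: index name -> measured value


def validate_interval(seconds: float) -> float:
    """Reject sampling intervals outside the 0.1 to 10 second range."""
    if not MIN_INTERVAL_S <= seconds <= MAX_INTERVAL_S:
        raise ValueError(f"interval {seconds} s is outside 0.1 to 10 s")
    return seconds


def exceeded(sample: Sample, thresholds: Dict[str, float]) -> List[str]:
    """Return the names of indices whose value exceeds its preset threshold."""
    return sorted(
        name for name, value in sample.items()
        if name in thresholds and value > thresholds[name]
    )


def monitor(
    read_sample: Callable[[], Sample],
    thresholds: Dict[str, float],
    interval_s: float,
    rounds: int,
    alarm: Callable[[str, float], None],
    sleep: Callable[[float], None] = time.sleep,
) -> List[Sample]:
    """Sample `rounds` times, raising an alarm for every exceeded threshold."""
    validate_interval(interval_s)
    samples: List[Sample] = []
    for _ in range(rounds):
        sample = read_sample()
        for name in exceeded(sample, thresholds):
            alarm(name, sample[name])
        samples.append(sample)  # in the described system these go to the calculation module
        sleep(interval_s)
    return samples
```

Passing `sleep` and `alarm` as parameters keeps the loop testable without real delays or a real SMS or e-mail channel; a deployment would supply its own collectors and alarm transport.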
The invention also provides a distributed monitoring system for visually monitoring node operation. The system comprises a display module, a calculation module, monitoring nodes and monitoring modules; each monitoring node is connected to one monitoring module, each monitoring module is connected to the calculation module, and the calculation module is connected to the display module;
The monitoring node is used for sending its index data to the monitoring module;
The monitoring module is used for setting the monitoring configuration requirements of the monitoring node, monitoring, according to those requirements, the index data acquired from the monitoring node at a timed monitoring interval, and sending the index data to the calculation module;
The calculation module is used for receiving the index data from each monitoring module, forming an index data chart for each monitoring node through statistics and summarization, and sending the received index data to the display module;
The display module is used for displaying the received index data, obtaining the user's display requirements and, according to them, retrieving content of the monitoring node's index data chart from the calculation module for display.
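The two data paths between the calculation module and the display module (index data pushed as it arrives, chart content pulled on user request) can be sketched as below. The class and method names are hypothetical, and the "chart" is reduced to a plain list of values per index; the application does not specify interfaces or storage.

```python
from typing import Dict, List

Sample = Dict[str, float]  # hypothetical shape: index name -> measured value


class DisplayModule:
    """Shows live index data and, on request, chart content from the calculation module."""

    def __init__(self) -> None:
        self.latest: Dict[str, Sample] = {}

    def show_live(self, node_id: str, sample: Sample) -> None:
        # Push path: the calculation module forwards each received sample.
        self.latest[node_id] = dict(sample)

    def show_chart(self, calc: "CalculationModule", node_id: str, index_name: str) -> List[float]:
        # Pull path: triggered by a user's display requirement.
        return calc.chart(node_id, index_name)


class CalculationModule:
    """Accumulates per-node series and forwards raw samples to the display module."""

    def __init__(self, display: DisplayModule) -> None:
        self._display = display
        self._charts: Dict[str, Dict[str, List[float]]] = {}

    def receive(self, node_id: str, sample: Sample) -> None:
        series = self._charts.setdefault(node_id, {})
        for name, value in sample.items():
            series.setdefault(name, []).append(value)
        self._display.show_live(node_id, sample)

    def chart(self, node_id: str, index_name: str) -> List[float]:
        # Return a copy so callers cannot mutate the stored series.
        return list(self._charts.get(node_id, {}).get(index_name, []))
```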
Further, the monitoring module comprises a switching module, a Linux monitoring sub-module and a win monitoring sub-module; the switching module is connected to both the Linux monitoring sub-module and the win monitoring sub-module;
The switching module is used for selecting, according to the category of the monitoring node, the matching Linux monitoring sub-module or win monitoring sub-module and connecting it to the monitoring node;
The Linux monitoring sub-module and the win monitoring sub-module are each used for acquiring, according to the category of the monitoring node and its index types, one or more kinds of index data from the monitored node at a timed monitoring interval, monitoring that index data, and sending it to the calculation module.
Further, the monitoring nodes and the monitoring modules are equal in number and in one-to-one correspondence.
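A minimal sketch of the switching behaviour, assuming node categories are represented by the strings "linux" and "win" as in the text. The class names and the `build_modules` helper are hypothetical; the sub-modules are left as empty placeholders because the application does not describe how each one collects data.

```python
from typing import Dict


class LinuxMonitoringSubmodule:
    category = "linux"  # placeholder: real collection logic is not described in the text


class WinMonitoringSubmodule:
    category = "win"  # placeholder: real collection logic is not described in the text


class MonitoringModule:
    """One monitoring module per node; the switching step picks the matching sub-module."""

    def __init__(self, node_id: str, node_category: str) -> None:
        self.node_id = node_id
        self.submodule = self._switch(node_category)

    @staticmethod
    def _switch(node_category: str):
        if node_category == "linux":
            return LinuxMonitoringSubmodule()
        if node_category == "win":
            return WinMonitoringSubmodule()
        raise ValueError(f"unsupported node category: {node_category!r}")


def build_modules(nodes: Dict[str, str]) -> Dict[str, MonitoringModule]:
    """Create exactly one monitoring module per node (one-to-one correspondence)."""
    return {node_id: MonitoringModule(node_id, category)
            for node_id, category in nodes.items()}
```

Keying the result by node identifier makes the one-to-one correspondence explicit: the number of monitoring modules always equals the number of nodes.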
Beneficial effects: the invention adapts a different sub-module of the monitoring module to each kind of monitoring node in order to monitor node data. It separates data monitoring from data calculation, which reduces the complexity of the monitoring module and improves its usability. An independent calculation module is responsible for summarizing and calculating the data: because the data reported by the nodes are of many kinds and arrive mixed together, the calculation module organizes the data and then draws a data chart for each node from it. Finally, the index data and the content of the data charts are displayed on demand. Automated monitoring frees up the personnel who previously had to intervene manually and reduces system cost; compared with manual inspection, it finds data problems during system operation more accurately and promptly, which reduces risk and improves system availability.
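One way to picture how a calculation module might organize mixed reports before charting is shown below. This is an assumption-laden sketch: the record layout (node identifier, index name, timestamp, value) and the function name `organize` are invented for illustration, since the application does not define a data format.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical record layout: (node_id, index_name, timestamp_s, value)
Record = Tuple[str, str, float, float]
Series = List[Tuple[float, float]]  # (timestamp_s, value) pairs in time order


def organize(records: Iterable[Record]) -> Dict[str, Dict[str, Series]]:
    """Group interleaved records by node and index, ordering each series by time."""
    grouped: Dict[str, Dict[str, Series]] = defaultdict(lambda: defaultdict(list))
    for node_id, index_name, timestamp_s, value in records:
        grouped[node_id][index_name].append((timestamp_s, value))
    # Convert to plain dicts and sort each series so it is ready to plot.
    return {
        node_id: {name: sorted(points) for name, points in series.items()}
        for node_id, series in grouped.items()
    }
```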
Drawings
To illustrate the technical solutions of the embodiments of the present application more clearly, the drawings needed in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present application; a person skilled in the art can obtain other drawings from them without inventive effort.
FIG. 1 is a schematic diagram of the distributed monitoring system for visually monitoring node operation according to the present invention;
FIG. 2 is a flow chart of the distributed monitoring method for visually monitoring node operation according to the present invention.
Detailed Description
To make the objects, technical solutions and advantages of the present application clearer, the application is described in further detail below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by a person skilled in the art from these embodiments without inventive effort fall within the scope of the application.
At present, monitoring the many nodes of a distributed system requires manually reviewing and analyzing the logs of each node. Log files record many kinds of content, so manual querying is time-consuming and laborious, and problems are discovered late, which is highly detrimental to the operational stability of the whole system. The invention aims to solve this technical problem.
In view of this, an embodiment of the invention provides a distributed monitoring system for visually monitoring node operation. As shown in FIG. 1, the system comprises a display module 1, a calculation module 2, monitoring modules 3 and a plurality of monitoring nodes 4. The monitoring nodes 4 and the monitoring modules 3 are equal in number and in one-to-one correspondence: each monitoring node 4 is connected to one monitoring module 3, each monitoring module 3 is connected to the calculation module 2, and the calculation module 2 is connected to the display module 1.
Each monitoring node 4 is either a Linux node or a win node and is used for sending its index data to the monitoring module 3. The index data comprise one or more of CPU usage, memory occupancy, disk occupancy and the number of task processes.
The monitoring module 3 is used for setting the monitoring configuration requirements of the monitoring node 4, monitoring, according to those requirements, the index data acquired from the monitoring node 4 at a timed monitoring interval, and sending the index data to the calculation module 2. Setting the configuration requirements means that a sub-module in the monitoring module 3 selects, according to the category of the monitoring node 4 and its index types, one or more kinds of index data from those index types to monitor. The monitoring module 3 comprises a switching module 31, a Linux monitoring sub-module 32 and a win monitoring sub-module 33; the switching module 31 is connected to both the Linux monitoring sub-module 32 and the win monitoring sub-module 33. The switching module 31 is used for selecting, according to the category of the monitoring node 4, the matching Linux monitoring sub-module 32 or win monitoring sub-module 33 and connecting it to the monitoring node 4: if the monitoring node 4 is a Linux node, the switching module 31 selects the Linux monitoring sub-module 32, and if the monitoring node 4 is a win node, it selects the win monitoring sub-module 33. The Linux monitoring sub-module 32 and the win monitoring sub-module 33 are each used for monitoring, according to the category of the monitoring node 4 and its index types, one or more kinds of index data acquired from the monitored monitoring node 4 at a timed monitoring interval, and for sending the index data to the calculation module 2.
The calculation module 2 is used for receiving the index data from each monitoring module 3, forming an index data chart for each monitoring node 4 through statistics and summarization, and sending the received index data to the display module 1. The display module 1 is used for displaying the received index data, obtaining the user's display requirements and, according to them, retrieving content of the index data chart of the monitoring node 4 from the calculation module 2 for display. The system of this embodiment first adapts a different sub-module of the monitoring module to each kind of monitoring node and then monitors node data. It separates data monitoring from data calculation, which reduces the complexity of the monitoring module 3 and improves its usability. The independent calculation module 2 is responsible for summarizing and calculating the data: because the data reported by the nodes are of many kinds and arrive mixed together, the calculation module 2 organizes the data and then draws a data chart for each node from it. Finally, the index data and the content of the data charts are displayed on demand.
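The "statistics and summarization" performed by the calculation module 2 is not specified further in the application. As one plausible reading, the sketch below computes a few summary statistics and histogram bucket counts for a single index series; the function names, the choice of statistics and the bucket width are assumptions made for illustration.

```python
from collections import Counter
from statistics import mean
from typing import Dict, List


def summarize(values: List[float]) -> Dict[str, float]:
    """Summary statistics for one index series of one monitoring node."""
    if not values:
        raise ValueError("cannot summarize an empty series")
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
    }


def histogram(values: List[float], bucket_width: int) -> Dict[int, int]:
    """Count values per bucket; each key is the lower edge of its bucket."""
    if bucket_width <= 0:
        raise ValueError("bucket_width must be positive")
    counts = Counter(int(v // bucket_width) * bucket_width for v in values)
    return dict(sorted(counts.items()))
```

The histogram output maps directly onto bar heights, while the raw series plotted against time would give the line graph; either could serve as the index data chart mentioned in the text.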
As shown in FIG. 2, an embodiment of the invention further provides a distributed monitoring method for visually monitoring node operation. The method adapts a different sub-module of the monitoring module to each kind of monitoring node in order to monitor node data. It can monitor each node in a targeted manner and, compared with manually consulting logs, finds data problems during system operation more accurately and promptly. The method comprises the following steps:
(1) First, determine the category of each monitoring node. There are many monitoring nodes, and they are divided into Linux nodes and win nodes, so the category of each node must be determined first. Then select the corresponding sub-module category in the monitoring module according to the category of the node. The sub-module categories comprise a Linux monitoring sub-module and a win monitoring sub-module: if the monitoring node is a Linux node, the Linux monitoring sub-module is switched in and connected to the node; if it is a win node, the win monitoring sub-module is switched in and connected to the node.
(2) Next, determine the index types of each monitoring node. The index types include, but are not limited to, CPU usage, memory occupancy, disk occupancy and the number of task processes. For example, the index types of monitoring node A are CPU usage, memory occupancy, disk occupancy, number of task processes, throughput and so on, while those of monitoring node B are CPU usage, memory occupancy, disk occupancy, number of task processes, CPU temperature, concurrency count and so on. Then set the monitoring configuration requirements of each monitoring node, which may differ from node to node. A monitoring configuration requirement means that the sub-module in the monitoring module (the Linux monitoring sub-module or the win monitoring sub-module) selects, according to the category of the monitoring node and its index types, one or more kinds of index data from those index types to monitor.
For example, monitoring node A is a Linux node whose index types are CPU usage, memory occupancy, disk occupancy, number of task processes, throughput and so on; its Linux monitoring sub-module selects CPU usage and memory occupancy for monitoring. Monitoring node B is a win node whose index types are CPU usage, memory occupancy, disk occupancy, number of task processes, CPU temperature, concurrency count and so on; its win monitoring sub-module selects memory occupancy, disk occupancy, number of task processes and CPU temperature for monitoring. According to the monitoring configuration requirements, the sub-module in the monitoring module monitors the index data that it acquires from the monitored node at a timed monitoring interval and sends the index data to the calculation module. The timed monitoring interval may be chosen from 0.1 to 10 seconds. Monitoring the index data means comparing the data parameters of the index data with preset data parameter thresholds and, if a data parameter exceeds its preset threshold, raising an alarm, for example by short message (SMS) or e-mail;
(3) The calculation module receives the index data from each monitoring module and, through statistics and summarization, forms an index data chart for each monitoring node; the data chart may be a histogram, a line graph or another statistical chart. At the same time, the calculation module sends the received index data to the display module;
(4) The display module displays the received index data; the displayed index data are the monitoring data from the current timed monitoring interval and are therefore real-time. At the same time, the display module obtains the user's display requirements and, according to them, retrieves content of the monitoring node's index data chart from the calculation module for display; in other words, by operating a key the user can have the display module retrieve chart content from the calculation module and display it. Automated monitoring frees up the personnel who previously had to intervene manually and reduces system cost; compared with manual inspection, it finds data problems during system operation more accurately and promptly, which reduces risk and improves system availability.
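The node A and node B example from step (2) can be written down as data. The sketch below is illustrative only: the `MonitoringConfig` class, its field names and the snake_case index names are assumptions, while the node categories, available index types and selected indices are taken from the example in the text.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet


@dataclass(frozen=True)
class MonitoringConfig:
    """Monitoring configuration requirement for one node (hypothetical structure)."""
    node_id: str
    category: str               # "linux" or "win"
    available: FrozenSet[str]   # index types the node can report
    selected: FrozenSet[str]    # index types the sub-module chooses to monitor

    def __post_init__(self) -> None:
        if self.category not in ("linux", "win"):
            raise ValueError(f"unsupported category: {self.category!r}")
        if not self.selected:
            raise ValueError("at least one index must be selected")
        unknown = self.selected - self.available
        if unknown:
            raise ValueError(f"indices not reported by node: {sorted(unknown)}")

    def select(self, raw: Dict[str, float]) -> Dict[str, float]:
        """Keep only the selected indices from a raw report."""
        return {name: value for name, value in raw.items() if name in self.selected}


COMMON = frozenset({"cpu_usage", "memory_occupancy", "disk_occupancy", "task_process_count"})

# Node A: Linux node; the sub-module monitors CPU usage and memory occupancy.
NODE_A = MonitoringConfig("A", "linux", COMMON | {"throughput"},
                          frozenset({"cpu_usage", "memory_occupancy"}))

# Node B: win node; the sub-module monitors four of its indices.
NODE_B = MonitoringConfig("B", "win", COMMON | {"cpu_temperature", "concurrency_count"},
                          frozenset({"memory_occupancy", "disk_occupancy",
                                     "task_process_count", "cpu_temperature"}))
```

Validating the selection against the available index types at construction time catches a configuration that asks for an index the node cannot report, such as CPU temperature on node A.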

Claims (9)

1. A distributed monitoring method for visually monitoring node operation, characterized by comprising the following steps:
(1) Determining the category of each monitoring node, and selecting the corresponding sub-module category in the monitoring module according to that category;
(2) Determining the index types of each monitoring node, and setting the monitoring configuration requirements of each monitoring node, which may differ from node to node; according to the monitoring configuration requirements, the sub-module in the monitoring module monitors the index data that it acquires from the monitored node at a timed monitoring interval, and sends the index data to the calculation module;
(3) The calculation module receives the index data from each monitoring module and forms an index data chart for each monitoring node through statistics and summarization; at the same time, the calculation module sends the received index data to the display module;
(4) The display module displays the received index data; at the same time, the display module obtains the user's display requirements and, according to them, retrieves content of the monitoring node's index data chart from the calculation module for display.
2. The distributed monitoring method for visually monitoring node operation according to claim 1, wherein the categories of monitoring node comprise Linux nodes and win nodes, and the sub-module categories in the monitoring module comprise a Linux monitoring sub-module and a win monitoring sub-module.
3. The distributed monitoring method for visually monitoring node operation according to claim 1, wherein the index types of a monitoring node include CPU usage, memory occupancy, disk occupancy and the number of task processes.
4. The distributed monitoring method for visually monitoring node operation according to claim 3, wherein setting the monitoring configuration requirements of each monitoring node means that the sub-module in the monitoring module selects, according to the category of the monitoring node and its index types, one or more kinds of index data from those index types to monitor.
5. The distributed monitoring method for visually monitoring node operation according to claim 1, wherein the timed monitoring interval is 0.1 to 10 seconds.
6. The distributed monitoring method for visually monitoring node operation according to claim 1, wherein monitoring the acquired index data means comparing the data parameters of the index data with preset data parameter thresholds and raising an alarm if a data parameter exceeds its preset threshold.
7. A distributed monitoring system for visually monitoring node operation, characterized by comprising a display module, a calculation module, monitoring nodes and monitoring modules; each monitoring node is connected to one monitoring module, each monitoring module is connected to the calculation module, and the calculation module is connected to the display module;
The monitoring node is used for sending its index data to the monitoring module;
The monitoring module is used for setting the monitoring configuration requirements of the monitoring node, monitoring, according to those requirements, the index data acquired from the monitoring node at a timed monitoring interval, and sending the index data to the calculation module;
The calculation module is used for receiving the index data from each monitoring module, forming an index data chart for each monitoring node through statistics and summarization, and sending the received index data to the display module;
The display module is used for displaying the received index data, obtaining the user's display requirements and, according to them, retrieving content of the monitoring node's index data chart from the calculation module for display.
8. The distributed monitoring system for visually monitoring node operation according to claim 7, wherein the monitoring module comprises a switching module, a Linux monitoring sub-module and a win monitoring sub-module; the switching module is connected to both the Linux monitoring sub-module and the win monitoring sub-module;
The switching module is used for selecting, according to the category of the monitoring node, the matching Linux monitoring sub-module or win monitoring sub-module and connecting it to the monitoring node;
The Linux monitoring sub-module and the win monitoring sub-module are each used for acquiring, according to the category of the monitoring node and its index types, one or more kinds of index data from the monitored node at a timed monitoring interval, monitoring that index data, and sending it to the calculation module.
9. The distributed monitoring system for visually monitoring node operation according to claim 7, wherein the monitoring nodes and the monitoring modules are equal in number and in one-to-one correspondence.
CN202410487050.0A 2024-04-23 2024-04-23 Distributed monitoring method and system for visually monitoring node operation Pending CN118295876A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410487050.0A CN118295876A (en) 2024-04-23 2024-04-23 Distributed monitoring method and system for visually monitoring node operation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410487050.0A CN118295876A (en) 2024-04-23 2024-04-23 Distributed monitoring method and system for visually monitoring node operation

Publications (1)

Publication Number Publication Date
CN118295876A true CN118295876A (en) 2024-07-05

Family

ID=91687420

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410487050.0A Pending CN118295876A (en) 2024-04-23 2024-04-23 Distributed monitoring method and system for visually monitoring node operation

Country Status (1)

Country Link
CN (1) CN118295876A (en)

Legal Events

Date Code Title Description
PB01 Publication