CN105681103A

CN105681103A - Loongson-chip-based cluster resource monitoring realization method

Info

Publication number: CN105681103A
Application number: CN201610117765.2A
Authority: CN
Inventors: 柳玉巧; 陈乃阔
Original assignee: Shandong Chaoyue Numerical Control Electronics Co Ltd
Current assignee: Shandong Chaoyue Numerical Control Electronics Co Ltd
Priority date: 2016-03-03
Filing date: 2016-03-03
Publication date: 2016-06-15

Abstract

The invention relates to the technical field of the system resource monitoring method of the Loongson platform, especially to a Loongson-chip-based cluster resource monitoring realization method. According to the realization method, cluster system resource monitoring is realized on a Loongson platform. A survival situation of a node, resource usage situations of all nodes of the cluster, fan rotating speeds, processor temperatures, and main board temperatures of all nodes can be monitored; and thus a fault that may occur at a system can be predicted.

Description

A kind of cluster resource monitoring implementation method based on Loongson platform

Technical field

The present invention relates to the system resource monitoring method technical field of Loongson platform, in particular to a kind of cluster resource monitoring implementation method based on Loongson platform.

Background technology

Cluster is one group of computer, and they integrally externally provide network resource. In the view of user, cluster is a system, but not multiple computer system. Cluster has the advantages such as high scalability, high availability, high-performance. In the epoch of information fast development, the appearance of group system allows user that common hardware system is formed cluster, it is possible to increase new hardware according to actual needs at any time in the cluster, it is to increase the retractility of system and operability.

Cluster system resource monitoring is the core of cluster management, mainly the system resource of node is monitored. The data that group system obtains may be used for distribution and the utilization of cluster system resource, and user can also learn whether node breaks down or take measures on customs clearance in advance and take precautions against the generation of fault, the final reliability ensureing cluster.

In autonomous fields such as production domesticization computers, Loongson platform occupies critical role, therefore, it is achieved the cluster system resource monitoring in Loongson platform has significance.

Summary of the invention

In order to solve the problem of prior art, the present invention provides a kind of cluster resource monitoring implementation method based on Loongson platform, and it is in Loongson platform, it is achieved that cluster system resource is monitored.

The technical solution adopted in the present invention is as follows:

Based on a cluster resource monitoring implementation method for Loongson platform, comprise the following steps:

A, arranging monitoring agent, monitor node survival condition based on each node of cluster of Loongson platform, collecting node information, management node is responsible for collecting the information that each monitoring agent is collected;

B, in monitor node deploy database and mapping software;

C, the information collected by each node are stored in database;

D, the information collected is analyzed and show user.

Step D specifically comprises:

D1, analysis node survival data information, shuts down if node breaks down, then point out user to process malfunctioning node;

The temperature of processor of D2, analysis node, the temperature of mainboard, fan rotary speed parameter, according to analytical results, pre-examining system possibility produced problem, warning user takes the precautionary measures in time;

D3, resource information is carried out visualization processing, in the way of curve, disk or cylindricality figure, resource service condition is showed user intuitively.

Inventive design monitoring resource system takes centralized system structure, arranges a monitoring agent in the cluster on each node, and monitoring agent is responsible for obtaining the resource information of this node, and the monitoring order of response monitoring system.Management node (monitor node) is responsible for collecting the node resource information that each monitoring agent obtains, such as treater utilization ratio, internal memory behaviour in service. In addition, the relevant data of collection fan rotating speed, temperature of processor, mainboard temperature are for predicting that node may produced problem.

The useful effect that technical scheme provided by the invention is brought is:

A kind of cluster resource monitoring implementation method based on Loongson platform of the present invention, in Loongson platform, achieve cluster system resource monitoring, the mainly resource service condition of the survival condition of monitor node, the monitoring each node of cluster, the resource informations such as the treater utilization ratio of such as each node, network flow, disk utilization, monitor the fan rotating speed of each node, temperature of processor, mainboard temperature in addition, for the fault that pre-examining system may occur.

Accompanying drawing explanation

In order to the technical scheme being illustrated more clearly in the embodiment of the present invention, below the accompanying drawing used required in embodiment being described is briefly described, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.

Fig. 1 is the system network architecture figure of a kind of cluster resource monitoring implementation method based on Loongson platform of the present invention;

Fig. 2 is the method flowchart of a kind of cluster resource monitoring implementation method based on Loongson platform of the present invention.

Embodiment

For making the object, technical solutions and advantages of the present invention clearly, below in conjunction with accompanying drawing, embodiment of the present invention is described further in detail.

Embodiment one

The present embodiment designs the system structure of a kind of monitoring resource assembly Chinese style, arranges a monitoring agent in the cluster on each node, and monitoring agent is responsible for obtaining the resource information of this node, and the monitoring order of response monitoring system. Management node (monitor node) is responsible for collecting the node resource information that each monitoring agent obtains, such as treater utilization ratio, internal memory behaviour in service. In addition, the relevant data of collection fan rotating speed, temperature of processor, mainboard temperature are for predicting that node may produced problem. The system network architecture of cluster is as shown in Figure 1.

As shown in Figure 2, the concrete implementation step of the system of the present embodiment is as follows:

(1) arranging monitoring agent, monitor node survival condition based on each node of cluster of Loongson platform, collecting node information, management node is responsible for collecting the information that each monitoring agent is collected;

(2) in monitor node deploy database and mapping software;

(3) data such as the resource information collected by each node are stored in database;

(4) data analysis:

1. analysis node survival data information, the machine if node has been delayed, then remind user to repair malfunctioning node;

2. the data of the temperature of processor of analysis node, fan rotating speed, mainboard temperature, whether prediction can there is fault;

3. adopt and graphically show each node resource (cpu utilization ratio, internal memory utilization ratio etc.) service condition intuitively, facilitate user carry out analyzing to cluster resource service condition and utilize.

The foregoing is only the better embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment of doing, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims

1., based on a cluster resource monitoring implementation method for Loongson platform, comprise the following steps:

B, in monitor node deploy database and mapping software;

C, the information collected by each node are stored in database;

D, the information collected is analyzed and show user.

2. a kind of cluster resource monitoring implementation method based on Loongson platform according to claim 1, it is characterised in that, described step D specifically comprises:

3. a kind of cluster resource monitoring implementation method based on Loongson platform according to claim 1, it is characterised in that, described monitoring agent is responsible for obtaining the resource information of this node, and the monitoring order of response monitoring system.