CN111970146A - Monitoring platform and monitoring method for SRDC whole cabinet nodes - Google Patents

Monitoring platform and monitoring method for SRDC whole cabinet nodes Download PDF

Info

Publication number
CN111970146A
CN111970146A CN202010726244.3A CN202010726244A CN111970146A CN 111970146 A CN111970146 A CN 111970146A CN 202010726244 A CN202010726244 A CN 202010726244A CN 111970146 A CN111970146 A CN 111970146A
Authority
CN
China
Prior art keywords
equipment
alarm
information
monitoring
nodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010726244.3A
Other languages
Chinese (zh)
Other versions
CN111970146B (en
Inventor
胡汉顶
贾伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010726244.3A priority Critical patent/CN111970146B/en
Publication of CN111970146A publication Critical patent/CN111970146A/en
Application granted granted Critical
Publication of CN111970146B publication Critical patent/CN111970146B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0604Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H04L41/0609Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time based on severity or priority
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0213Standardised network management protocols, e.g. simple network management protocol [SNMP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0876Aspects of the degree of configuration automation
    • H04L41/0886Fully automatic configuration
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/12Network monitoring probes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Abstract

The invention provides a monitoring platform and a monitoring method of SRDC whole cabinet nodes, wherein the monitoring platform comprises an equipment detector, a data processing module and a data processing module, wherein the equipment detector is used for detecting equipment nodes which survive in a local area network and analyzing the types of the equipment nodes through port numbers of the equipment nodes; the configuration manager is used for configuring the equipment and adding the equipment nodes into the monitoring platform; the data acquisition unit is used for monitoring the equipment nodes and acquiring monitoring information at regular time, processing the acquired information and sending alarm information to the alarm processor; the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm. The operation and maintenance efficiency is improved, the fault occurrence rate is reduced, the operation and maintenance cost is reduced, and the stable operation of the whole SRDC cabinet is guaranteed.

Description

Monitoring platform and monitoring method for SRDC whole cabinet nodes
Technical Field
The invention relates to the technical field of monitoring operation and maintenance of a whole machine cabinet, in particular to a monitoring platform and a monitoring method for SRDC whole machine cabinet nodes.
Background
With the development of the internet and big Data, due to the defects of complex deployment and slow business online of the traditional IT equipment, the requirement of an SRDC (Single Rack Data center) all-in-one machine is gradually increased. The SRDC all-in-one machine is a whole machine cabinet product of various IT equipment integrating server nodes, a switch and magnetic arrays, can integrate and pre-install client application software, and realizes quick deployment of the IT equipment and quick online of services. With the increase of the product output quantity of the whole cabinet, the problem of how to monitor, operate and maintain the product in the new form comes along with the increase of the product output quantity of the whole cabinet, the traditional operation and maintenance mode is time-consuming and low in efficiency, and therefore the problem which needs to be solved at present is to develop a unified management platform which can monitor all hardware devices of the whole SRDC cabinet through one platform.
Disclosure of Invention
The invention provides a monitoring platform and a monitoring method for nodes of an SRDC whole cabinet, aiming at the problem that the operation and maintenance mode of the traditional SRDC whole cabinet is time-consuming and low in efficiency.
The technical scheme of the invention is as follows:
on one hand, the technical scheme of the invention provides a monitoring platform for the nodes of the whole SRDC cabinet, wherein the monitoring platform is communicated with all equipment nodes in the whole SRDC cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; the Trap monitor realizes passive real-time monitoring, analyzes and processes the monitored message into standardized data and sends the standardized data to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
Preferably, the device detector is configured to detect an IP address of a device node reachable by the network in the local area network at regular time, and analyze and determine the type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed. The device detector utilizes the network detection task and the port scanning task to discover and analyze the devices and types in the local area network.
Preferably, the configuration manager comprises an initial configuration module, an addition module and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added. The configuration manager realizes automatic configuration and automatic management of the equipment according to the detected equipment type.
Preferably, the data collector is configured to create a timing collection task to actively collect information and states of the device node components after receiving the message sent by the configuration manager, analyze and preprocess collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor. The data acquisition unit realizes active monitoring, acquires the information and the state of the equipment component at regular time by utilizing active polling, and carries out standardized processing on abnormal information and sends the abnormal information to the alarm processor.
Preferably, the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
On the other hand, the technical scheme of the invention provides a monitoring method of nodes of an SRDC whole cabinet, which is applied to a monitoring system, wherein the system comprises a monitoring platform and equipment nodes in the SRDC whole cabinet, wherein the equipment nodes and the monitoring platform form a local area network for communication; the method comprises the following steps:
detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
Preferably, the step of detecting a device node alive in the local area network and analyzing the type of the device node through the port number of the device node includes: the IP address of the equipment node which can be reached by the network in the local area network is detected at regular time, and the detected type of the equipment node is analyzed and judged through a TCP or UDP port opened by each equipment node; the method specifically comprises the following steps:
acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected;
performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list of the addresses where the targets can reach, and removing the to-be-detected list of the addresses where the targets can reach;
creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node;
and generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
Preferably, the step of adding the device node into the monitoring platform using the default initial protocol configuration of each type of device node comprises:
acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added.
Preferably, the steps of creating a monitoring task to monitor the device node and periodically collect monitoring information, processing the collected information, and sending alarm information include:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued.
Preferably, the step of receiving the alarm information and determining whether to perform the elimination of the alarm or the generation of the alarm or the upgrading or downgrading operation of the alarm includes:
after receiving the warning information, analyzing warning source, position and grade information;
judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display. And operation and maintenance personnel can position and repair the problems conveniently in time.
The automatic discovery, automatic configuration and automatic management of the equipment nodes in the cabinet are realized, non-invasive real-time monitoring and operation and maintenance of each equipment node are realized by utilizing active acquisition and Trap monitoring, and alarm and positioning information can be given in time when the equipment is abnormal, so that operation and maintenance personnel are prompted to carry out treatment and repair.
According to the technical scheme, the invention has the following advantages: the monitoring platform is communicated with each equipment node in the cabinet through a local area network, automatic discovery, automatic configuration and automatic management of each equipment node can be realized, active/passive monitoring is realized through active acquisition of component information and states of each equipment node and monitoring of Trap messages, and once abnormal information is detected, alarm information and positions can be displayed in a rolling mode through a cabinet touch display screen, so that operation and maintenance personnel can position and repair problems in time. The automatic discovery, the automatic configuration and the automatic nanotube of each equipment node of the whole SRDC cabinet can be realized, the non-invasive monitoring is carried out on the running state of the equipment in real time and high efficiency, the operation and maintenance efficiency is improved, the fault occurrence rate is reduced, the operation and maintenance cost is reduced, and the stable operation of the whole SRDC cabinet is guaranteed.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Therefore, compared with the prior art, the invention has prominent substantive features and remarkable progress, and the beneficial effects of the implementation are also obvious.
Drawings
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.
FIG. 1 is a schematic connection block diagram of a monitoring platform of one embodiment of the present invention.
FIG. 2 is a schematic flow diagram of a method of one embodiment of the invention.
Fig. 3 is a schematic diagram of a device detection flow according to an embodiment of the present invention.
Detailed Description
In order to make those skilled in the art better understand the technical solution of the present invention, the technical solution in the embodiment of the present invention will be clearly and completely described below with reference to the drawings in the embodiment of the present invention, and it is obvious that the described embodiment is only a part of the embodiment of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a monitoring platform for an SRDC complete cabinet node, where the monitoring platform communicates with each device node in the SRDC complete cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; the Trap monitor realizes passive real-time monitoring, analyzes and processes the monitored message into standardized data and sends the standardized data to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
In some embodiments, the device detector is configured to detect an IP address of a device node reachable by a network in a local area network at regular time, and analyze and determine a type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed. The device detector utilizes the network detection task and the port scanning task to discover and analyze the devices and types in the local area network.
In some embodiments, the configuration manager includes an initial configuration module, an addition module, and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added. The configuration manager realizes automatic configuration and automatic management of the equipment according to the detected equipment type.
In some embodiments, the data collector is configured to create a timing collection task to actively collect information and status of the device node component after receiving the message sent by the configuration manager, analyze and pre-process the collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor. The data acquisition unit realizes active monitoring, acquires the information and the state of the equipment component at regular time by utilizing active polling, and carries out standardized processing on abnormal information and sends the abnormal information to the alarm processor.
In some embodiments, the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
As shown in fig. 2, an embodiment of the present invention further provides a monitoring method for nodes of an entire SRDC cabinet, which is applied to a monitoring system, where the system includes a monitoring platform and each device node in the entire SRDC cabinet, where the monitoring platform and the monitoring platform form a local area network for communication; the method comprises the following steps:
s1: detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
s2: adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
s3: establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
s4: monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
s5: and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
In some embodiments, in the step of detecting the device node alive in the lan and analyzing the type of the device node through the port number of the device node, for example, the server supports IPMI and SNMP protocols, and UDP 623 and UDP 161 ports are opened. The magnetic array supports SMI-S and SNMP protocols, the ports of TCP 5989 and UDP 161 are opened, and the port of UDP 623 is not opened. And the switch only supports the SNMP protocol, and opens the UDP 161 port. Based on the method, the device detector detects the IP address of the device which can be reached by the network in the local area network at regular time, and analyzes and judges the detected device type through the TCP or UDP port opened by each device; as shown in fig. 3, the method specifically includes the following steps:
s11: acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected, namely a list A;
s12: performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list, namely a list B, of the addresses where the targets can reach, and removing the addresses where the targets can reach from the list A;
s13: creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node; the port scanning task timing performs port scanning on 3 of the UPD 623, UDP 161, and TCP 5989 of the IP addresses in the list B. And judging the device type according to the protocol port opened by each device. If the UDP 623 port is in an open state, the equipment is a server node; if the port of the TCP 5989 is opened, the device is a magnetic array; if only UDP 161 is started, the device is a switch;
s14: and generating a resource list to be added, namely a list C, by the IP and the type of the equipment node with the scanned port, and removing the IP from the list B.
In some embodiments, the step of adding the device node into the monitoring platform using the default initial protocol configuration for each type of device node comprises:
s21: acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
s22: adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
s23: and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added. For example, the IP and the type of the device to be added are obtained from the list C at regular time, the model and the serial number of the device are obtained by using the factory default protocol configuration of the corresponding device, the device is added to the monitoring platform, and the target address sent by the Trap of each device is automatically configured as the platform IP address and the SNMP protocol parameter. And after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment from the list C.
In some embodiments, the steps of creating a monitoring task to monitor the device node and periodically collect monitoring information, processing the collected information, and sending alarm information include:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued. It should be noted that, the corresponding standardized monitoring plug-in is selected according to the type and model of the device to periodically collect the component information and status of the corresponding device.
In some embodiments, the step of receiving the alarm information and determining whether to perform the alarm elimination or the alarm generation or the alarm escalation or downgrade operation comprises:
s51: after receiving the warning information, analyzing warning source, position and grade information;
s52: judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
s53: and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display. And operation and maintenance personnel can position and repair the problems conveniently in time.
The automatic discovery, automatic configuration and automatic management of the equipment nodes in the cabinet are realized, non-invasive real-time monitoring and operation and maintenance of each equipment node are realized by utilizing active acquisition and Trap monitoring, and alarm and positioning information can be given in time when the equipment is abnormal, so that operation and maintenance personnel are prompted to carry out treatment and repair.
Although the present invention has been described in detail by referring to the drawings in connection with the preferred embodiments, the present invention is not limited thereto. Various equivalent modifications or substitutions can be made on the embodiments of the present invention by those skilled in the art without departing from the spirit and scope of the present invention, and these modifications or substitutions are within the scope of the present invention/any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims (10)

1. A monitoring platform for nodes of an SRDC whole cabinet is characterized in that the monitoring platform is communicated with equipment nodes in the SRDC whole cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
2. The platform according to claim 1, wherein the device detector is configured to detect an IP address of a device node reachable through a network in the local area network at regular time, and analyze and determine a type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
3. The platform of claim 2, wherein the configuration manager comprises an initial configuration module, an addition module, and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added.
4. The monitoring platform for the nodes of the whole SRDC cabinet according to claim 3, wherein the data collector is configured to create a timing collection task to actively collect the information and the status of the node components of the device after receiving the message sent by the configuration manager, perform analysis and preprocessing on the collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor.
5. The monitoring platform of the SRDC whole cabinet node according to claim 4, wherein the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
6. A monitoring method of SRDC complete cabinet nodes is characterized in that the monitoring method is applied to a monitoring system, and the system comprises a monitoring platform and equipment nodes in an SRDC complete cabinet, wherein the equipment nodes and the monitoring platform form a local area network for communication; the method comprises the following steps:
detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
7. The method for monitoring the nodes of the SRDC complete equipment cabinet according to claim 6, wherein the step of detecting the equipment nodes living in the local area network and analyzing the types of the equipment nodes through the port numbers of the equipment nodes comprises the following steps: the IP address of the equipment node which can be reached by the network in the local area network is detected at regular time, and the detected type of the equipment node is analyzed and judged through a TCP or UDP port opened by each equipment node; the method specifically comprises the following steps:
acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected;
performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list of the addresses where the targets can reach, and removing the to-be-detected list of the addresses where the targets can reach;
creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node;
and generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
8. The method for monitoring the nodes of the SRDC complete equipment cabinet according to claim 7, wherein the step of adding the equipment nodes into the monitoring platform by using default initial protocol configuration of each type of equipment node comprises the following steps:
acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added.
9. The monitoring platform of the SRDC whole cabinet node according to claim 8, wherein the step of creating a monitoring task to monitor the equipment node and periodically collect monitoring information, and the step of processing the collected information and sending an alarm message comprises:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued.
10. The method for monitoring the nodes of the whole SRDC cabinet according to claim 9, wherein the step of receiving the alarm information and judging whether to perform the elimination of the alarm or the generation of the alarm or the upgrading or downgrading operation of the alarm comprises the following steps:
after receiving the warning information, analyzing warning source, position and grade information;
judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display.
CN202010726244.3A 2020-07-25 2020-07-25 Monitoring platform and monitoring method for SRDC whole cabinet nodes Active CN111970146B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010726244.3A CN111970146B (en) 2020-07-25 2020-07-25 Monitoring platform and monitoring method for SRDC whole cabinet nodes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010726244.3A CN111970146B (en) 2020-07-25 2020-07-25 Monitoring platform and monitoring method for SRDC whole cabinet nodes

Publications (2)

Publication Number Publication Date
CN111970146A true CN111970146A (en) 2020-11-20
CN111970146B CN111970146B (en) 2022-07-08

Family

ID=73362710

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010726244.3A Active CN111970146B (en) 2020-07-25 2020-07-25 Monitoring platform and monitoring method for SRDC whole cabinet nodes

Country Status (1)

Country Link
CN (1) CN111970146B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112911086A (en) * 2021-01-29 2021-06-04 卡莱特云科技股份有限公司 Classification control method and device for batch video processing equipment
CN113381881A (en) * 2021-05-25 2021-09-10 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN113688014A (en) * 2021-07-30 2021-11-23 济南浪潮数据技术有限公司 Alarm processing method, device, equipment and medium for SRDC whole cabinet
CN113709210A (en) * 2021-07-30 2021-11-26 济南浪潮数据技术有限公司 Device discovery method, device, system, electronic device and storage medium
CN113727210A (en) * 2021-08-06 2021-11-30 济南浪潮数据技术有限公司 Equipment information management method, system, storage medium and equipment
CN114124989A (en) * 2021-09-28 2022-03-01 山东中创软件商用中间件股份有限公司 Equipment monitoring method, device, equipment and storage medium
CN114785722A (en) * 2022-06-14 2022-07-22 武汉四通信息服务有限公司 Monitoring data processing method and device and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101640859A (en) * 2009-05-19 2010-02-03 深圳市永达电子股份有限公司 Equipment cabinet alarm device, monitoring platform, management system and method
CN105282772A (en) * 2015-09-10 2016-01-27 北京爱可生通信技术有限公司 Wireless network data communication equipment monitoring system and equipment monitoring method
CN109240891A (en) * 2018-09-26 2019-01-18 郑州云海信息技术有限公司 A kind of monitoring method and device of SR whole machine cabinet server
CN109361543A (en) * 2018-10-30 2019-02-19 郑州云海信息技术有限公司 A kind of whole machine cabinet monitoring method, device, terminal and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101640859A (en) * 2009-05-19 2010-02-03 深圳市永达电子股份有限公司 Equipment cabinet alarm device, monitoring platform, management system and method
CN105282772A (en) * 2015-09-10 2016-01-27 北京爱可生通信技术有限公司 Wireless network data communication equipment monitoring system and equipment monitoring method
CN109240891A (en) * 2018-09-26 2019-01-18 郑州云海信息技术有限公司 A kind of monitoring method and device of SR whole machine cabinet server
CN109361543A (en) * 2018-10-30 2019-02-19 郑州云海信息技术有限公司 A kind of whole machine cabinet monitoring method, device, terminal and storage medium

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112911086A (en) * 2021-01-29 2021-06-04 卡莱特云科技股份有限公司 Classification control method and device for batch video processing equipment
CN112911086B (en) * 2021-01-29 2023-10-10 卡莱特云科技股份有限公司 Classification control method and device for batch video processing equipment
CN113381881A (en) * 2021-05-25 2021-09-10 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN113381881B (en) * 2021-05-25 2022-12-09 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN113688014A (en) * 2021-07-30 2021-11-23 济南浪潮数据技术有限公司 Alarm processing method, device, equipment and medium for SRDC whole cabinet
CN113709210A (en) * 2021-07-30 2021-11-26 济南浪潮数据技术有限公司 Device discovery method, device, system, electronic device and storage medium
CN113688014B (en) * 2021-07-30 2024-02-09 济南浪潮数据技术有限公司 Alarm processing method, device, equipment and medium for SRDC whole cabinet
CN113727210A (en) * 2021-08-06 2021-11-30 济南浪潮数据技术有限公司 Equipment information management method, system, storage medium and equipment
CN113727210B (en) * 2021-08-06 2023-08-22 济南浪潮数据技术有限公司 Equipment information management method, system, storage medium and equipment
CN114124989A (en) * 2021-09-28 2022-03-01 山东中创软件商用中间件股份有限公司 Equipment monitoring method, device, equipment and storage medium
CN114785722A (en) * 2022-06-14 2022-07-22 武汉四通信息服务有限公司 Monitoring data processing method and device and computer readable storage medium
CN114785722B (en) * 2022-06-14 2022-09-30 武汉四通信息服务有限公司 Monitoring data processing method and device and computer readable storage medium

Also Published As

Publication number Publication date
CN111970146B (en) 2022-07-08

Similar Documents

Publication Publication Date Title
CN111970146B (en) Monitoring platform and monitoring method for SRDC whole cabinet nodes
US20200169482A1 (en) Monitoring and analysis of interactions between network endpoints
CN102447570B (en) Monitoring device and method based on health degree analysis
KR100748246B1 (en) Multi-step integrated security monitoring system and method using intrusion detection system log collection engine and traffic statistic generation engine
CN107491375A (en) Equipment detection and fault early warning system and method under a kind of cloud computing environment
EP1742416A1 (en) Methods, computer readable medium and system for analyzing and management of application traffic on networks
US20070297349A1 (en) Method and System for Collecting Information Relating to a Communication Network
CN103546343B (en) The network traffics methods of exhibiting of network traffic analysis system and system
CN110659109B (en) System and method for monitoring openstack virtual machine
US11356335B2 (en) Machine learning-based network analytics, troubleshoot, and self-healing system and method
CN102739802A (en) Service application-oriented IT contralized operation and maintenance analyzing system
EP2372954B1 (en) Method and system for collecting information relating to a communication network
CN104219091A (en) System and method for network operation fault detection
CN112714013B (en) Application fault positioning method in cloud environment
CN109995582A (en) Asset equipment management system and method based on real-time status
US11659449B2 (en) Machine learning-based network analytics, troubleshoot, and self-healing holistic telemetry system incorporating modem-embedded machine analysis of multi-protocol stacks
CN108234161A (en) For the access detection method and system of on-line off-line multitiered network framework
CN114039900A (en) Efficient network data packet protocol analysis method and system
CN105279651B (en) A kind of transaction data monitor processing method and system
CN114679378A (en) Log monitoring and analyzing method and system, storage medium and electronic device
CN112367212B (en) Virtual machine network quality monitoring method and system in cloud environment
CN101656632A (en) Virus monitoring method and virus monitoring device in large network
CN112884176B (en) Management system and method
CN112312376B (en) Method and system for remotely and interactively managing multifunctional electric meter
CN112910726A (en) Cloud environment flow monitoring method, device and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant