CN111970146A - Monitoring platform and monitoring method for SRDC whole cabinet nodes - Google Patents
Monitoring platform and monitoring method for SRDC whole cabinet nodes Download PDFInfo
- Publication number
- CN111970146A CN111970146A CN202010726244.3A CN202010726244A CN111970146A CN 111970146 A CN111970146 A CN 111970146A CN 202010726244 A CN202010726244 A CN 202010726244A CN 111970146 A CN111970146 A CN 111970146A
- Authority
- CN
- China
- Prior art keywords
- equipment
- alarm
- information
- monitoring
- nodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 136
- 238000000034 method Methods 0.000 title claims abstract description 23
- 238000012545 processing Methods 0.000 claims abstract description 29
- 238000001514 detection method Methods 0.000 claims description 21
- 238000004458 analytical method Methods 0.000 claims description 16
- 238000011084 recovery Methods 0.000 claims description 12
- 238000005096 rolling process Methods 0.000 claims description 7
- 230000005540 biological transmission Effects 0.000 claims description 3
- 238000004891 communication Methods 0.000 claims description 3
- 230000008030 elimination Effects 0.000 claims description 3
- 238000003379 elimination reaction Methods 0.000 claims description 3
- 230000036541 health Effects 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims 1
- 238000012423 maintenance Methods 0.000 abstract description 14
- 230000002159 abnormal effect Effects 0.000 description 7
- 230000008439 repair process Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 108010028984 3-isopropylmalate dehydratase Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000002071 nanotube Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0604—Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
- H04L41/0609—Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time based on severity or priority
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/02—Standardisation; Integration
- H04L41/0213—Standardised network management protocols, e.g. simple network management protocol [SNMP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0677—Localisation of faults
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0876—Aspects of the degree of configuration automation
- H04L41/0886—Fully automatic configuration
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0817—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/12—Network monitoring probes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Abstract
The invention provides a monitoring platform and a monitoring method of SRDC whole cabinet nodes, wherein the monitoring platform comprises an equipment detector, a data processing module and a data processing module, wherein the equipment detector is used for detecting equipment nodes which survive in a local area network and analyzing the types of the equipment nodes through port numbers of the equipment nodes; the configuration manager is used for configuring the equipment and adding the equipment nodes into the monitoring platform; the data acquisition unit is used for monitoring the equipment nodes and acquiring monitoring information at regular time, processing the acquired information and sending alarm information to the alarm processor; the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm. The operation and maintenance efficiency is improved, the fault occurrence rate is reduced, the operation and maintenance cost is reduced, and the stable operation of the whole SRDC cabinet is guaranteed.
Description
Technical Field
The invention relates to the technical field of monitoring operation and maintenance of a whole machine cabinet, in particular to a monitoring platform and a monitoring method for SRDC whole machine cabinet nodes.
Background
With the development of the internet and big Data, due to the defects of complex deployment and slow business online of the traditional IT equipment, the requirement of an SRDC (Single Rack Data center) all-in-one machine is gradually increased. The SRDC all-in-one machine is a whole machine cabinet product of various IT equipment integrating server nodes, a switch and magnetic arrays, can integrate and pre-install client application software, and realizes quick deployment of the IT equipment and quick online of services. With the increase of the product output quantity of the whole cabinet, the problem of how to monitor, operate and maintain the product in the new form comes along with the increase of the product output quantity of the whole cabinet, the traditional operation and maintenance mode is time-consuming and low in efficiency, and therefore the problem which needs to be solved at present is to develop a unified management platform which can monitor all hardware devices of the whole SRDC cabinet through one platform.
Disclosure of Invention
The invention provides a monitoring platform and a monitoring method for nodes of an SRDC whole cabinet, aiming at the problem that the operation and maintenance mode of the traditional SRDC whole cabinet is time-consuming and low in efficiency.
The technical scheme of the invention is as follows:
on one hand, the technical scheme of the invention provides a monitoring platform for the nodes of the whole SRDC cabinet, wherein the monitoring platform is communicated with all equipment nodes in the whole SRDC cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; the Trap monitor realizes passive real-time monitoring, analyzes and processes the monitored message into standardized data and sends the standardized data to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
Preferably, the device detector is configured to detect an IP address of a device node reachable by the network in the local area network at regular time, and analyze and determine the type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed. The device detector utilizes the network detection task and the port scanning task to discover and analyze the devices and types in the local area network.
Preferably, the configuration manager comprises an initial configuration module, an addition module and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added. The configuration manager realizes automatic configuration and automatic management of the equipment according to the detected equipment type.
Preferably, the data collector is configured to create a timing collection task to actively collect information and states of the device node components after receiving the message sent by the configuration manager, analyze and preprocess collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor. The data acquisition unit realizes active monitoring, acquires the information and the state of the equipment component at regular time by utilizing active polling, and carries out standardized processing on abnormal information and sends the abnormal information to the alarm processor.
Preferably, the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
On the other hand, the technical scheme of the invention provides a monitoring method of nodes of an SRDC whole cabinet, which is applied to a monitoring system, wherein the system comprises a monitoring platform and equipment nodes in the SRDC whole cabinet, wherein the equipment nodes and the monitoring platform form a local area network for communication; the method comprises the following steps:
detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
Preferably, the step of detecting a device node alive in the local area network and analyzing the type of the device node through the port number of the device node includes: the IP address of the equipment node which can be reached by the network in the local area network is detected at regular time, and the detected type of the equipment node is analyzed and judged through a TCP or UDP port opened by each equipment node; the method specifically comprises the following steps:
acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected;
performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list of the addresses where the targets can reach, and removing the to-be-detected list of the addresses where the targets can reach;
creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node;
and generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
Preferably, the step of adding the device node into the monitoring platform using the default initial protocol configuration of each type of device node comprises:
acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added.
Preferably, the steps of creating a monitoring task to monitor the device node and periodically collect monitoring information, processing the collected information, and sending alarm information include:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued.
Preferably, the step of receiving the alarm information and determining whether to perform the elimination of the alarm or the generation of the alarm or the upgrading or downgrading operation of the alarm includes:
after receiving the warning information, analyzing warning source, position and grade information;
judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display. And operation and maintenance personnel can position and repair the problems conveniently in time.
The automatic discovery, automatic configuration and automatic management of the equipment nodes in the cabinet are realized, non-invasive real-time monitoring and operation and maintenance of each equipment node are realized by utilizing active acquisition and Trap monitoring, and alarm and positioning information can be given in time when the equipment is abnormal, so that operation and maintenance personnel are prompted to carry out treatment and repair.
According to the technical scheme, the invention has the following advantages: the monitoring platform is communicated with each equipment node in the cabinet through a local area network, automatic discovery, automatic configuration and automatic management of each equipment node can be realized, active/passive monitoring is realized through active acquisition of component information and states of each equipment node and monitoring of Trap messages, and once abnormal information is detected, alarm information and positions can be displayed in a rolling mode through a cabinet touch display screen, so that operation and maintenance personnel can position and repair problems in time. The automatic discovery, the automatic configuration and the automatic nanotube of each equipment node of the whole SRDC cabinet can be realized, the non-invasive monitoring is carried out on the running state of the equipment in real time and high efficiency, the operation and maintenance efficiency is improved, the fault occurrence rate is reduced, the operation and maintenance cost is reduced, and the stable operation of the whole SRDC cabinet is guaranteed.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Therefore, compared with the prior art, the invention has prominent substantive features and remarkable progress, and the beneficial effects of the implementation are also obvious.
Drawings
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.
FIG. 1 is a schematic connection block diagram of a monitoring platform of one embodiment of the present invention.
FIG. 2 is a schematic flow diagram of a method of one embodiment of the invention.
Fig. 3 is a schematic diagram of a device detection flow according to an embodiment of the present invention.
Detailed Description
In order to make those skilled in the art better understand the technical solution of the present invention, the technical solution in the embodiment of the present invention will be clearly and completely described below with reference to the drawings in the embodiment of the present invention, and it is obvious that the described embodiment is only a part of the embodiment of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a monitoring platform for an SRDC complete cabinet node, where the monitoring platform communicates with each device node in the SRDC complete cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor; the Trap monitor realizes passive real-time monitoring, analyzes and processes the monitored message into standardized data and sends the standardized data to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
In some embodiments, the device detector is configured to detect an IP address of a device node reachable by a network in a local area network at regular time, and analyze and determine a type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed. The device detector utilizes the network detection task and the port scanning task to discover and analyze the devices and types in the local area network.
In some embodiments, the configuration manager includes an initial configuration module, an addition module, and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added. The configuration manager realizes automatic configuration and automatic management of the equipment according to the detected equipment type.
In some embodiments, the data collector is configured to create a timing collection task to actively collect information and status of the device node component after receiving the message sent by the configuration manager, analyze and pre-process the collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor. The data acquisition unit realizes active monitoring, acquires the information and the state of the equipment component at regular time by utilizing active polling, and carries out standardized processing on abnormal information and sends the abnormal information to the alarm processor.
In some embodiments, the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
As shown in fig. 2, an embodiment of the present invention further provides a monitoring method for nodes of an entire SRDC cabinet, which is applied to a monitoring system, where the system includes a monitoring platform and each device node in the entire SRDC cabinet, where the monitoring platform and the monitoring platform form a local area network for communication; the method comprises the following steps:
s1: detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
s2: adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
s3: establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
s4: monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
s5: and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
In some embodiments, in the step of detecting the device node alive in the lan and analyzing the type of the device node through the port number of the device node, for example, the server supports IPMI and SNMP protocols, and UDP 623 and UDP 161 ports are opened. The magnetic array supports SMI-S and SNMP protocols, the ports of TCP 5989 and UDP 161 are opened, and the port of UDP 623 is not opened. And the switch only supports the SNMP protocol, and opens the UDP 161 port. Based on the method, the device detector detects the IP address of the device which can be reached by the network in the local area network at regular time, and analyzes and judges the detected device type through the TCP or UDP port opened by each device; as shown in fig. 3, the method specifically includes the following steps:
s11: acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected, namely a list A;
s12: performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list, namely a list B, of the addresses where the targets can reach, and removing the addresses where the targets can reach from the list A;
s13: creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node; the port scanning task timing performs port scanning on 3 of the UPD 623, UDP 161, and TCP 5989 of the IP addresses in the list B. And judging the device type according to the protocol port opened by each device. If the UDP 623 port is in an open state, the equipment is a server node; if the port of the TCP 5989 is opened, the device is a magnetic array; if only UDP 161 is started, the device is a switch;
s14: and generating a resource list to be added, namely a list C, by the IP and the type of the equipment node with the scanned port, and removing the IP from the list B.
In some embodiments, the step of adding the device node into the monitoring platform using the default initial protocol configuration for each type of device node comprises:
s21: acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
s22: adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
s23: and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added. For example, the IP and the type of the device to be added are obtained from the list C at regular time, the model and the serial number of the device are obtained by using the factory default protocol configuration of the corresponding device, the device is added to the monitoring platform, and the target address sent by the Trap of each device is automatically configured as the platform IP address and the SNMP protocol parameter. And after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment from the list C.
In some embodiments, the steps of creating a monitoring task to monitor the device node and periodically collect monitoring information, processing the collected information, and sending alarm information include:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued. It should be noted that, the corresponding standardized monitoring plug-in is selected according to the type and model of the device to periodically collect the component information and status of the corresponding device.
In some embodiments, the step of receiving the alarm information and determining whether to perform the alarm elimination or the alarm generation or the alarm escalation or downgrade operation comprises:
s51: after receiving the warning information, analyzing warning source, position and grade information;
s52: judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
s53: and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display. And operation and maintenance personnel can position and repair the problems conveniently in time.
The automatic discovery, automatic configuration and automatic management of the equipment nodes in the cabinet are realized, non-invasive real-time monitoring and operation and maintenance of each equipment node are realized by utilizing active acquisition and Trap monitoring, and alarm and positioning information can be given in time when the equipment is abnormal, so that operation and maintenance personnel are prompted to carry out treatment and repair.
Although the present invention has been described in detail by referring to the drawings in connection with the preferred embodiments, the present invention is not limited thereto. Various equivalent modifications or substitutions can be made on the embodiments of the present invention by those skilled in the art without departing from the spirit and scope of the present invention, and these modifications or substitutions are within the scope of the present invention/any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.
Claims (10)
1. A monitoring platform for nodes of an SRDC whole cabinet is characterized in that the monitoring platform is communicated with equipment nodes in the SRDC whole cabinet through a local area network; the monitoring platform comprises an equipment detector, a configuration manager, a data acquisition unit, a Trap monitor and an alarm processor;
the device detector is used for detecting the device nodes which survive in the local area network and analyzing the types of the device nodes through the port numbers of the device nodes;
the configuration manager is used for adding the equipment nodes into the monitoring platform by utilizing the default initial protocol configuration of each type of equipment node;
the data acquisition unit is used for establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending alarm information to the alarm processor;
the Trap monitor is used for monitoring Trap messages sent to the monitoring platform by each equipment node, analyzing the Trap message and sending alarm information to the alarm processor;
and the alarm processor is used for receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
2. The platform according to claim 1, wherein the device detector is configured to detect an IP address of a device node reachable through a network in the local area network at regular time, and analyze and determine a type of the detected device node through a TCP or UDP port opened by each device node;
the equipment detector comprises a to-be-detected list generation module, a to-be-analyzed list generation module, a type judgment module and a to-be-added resource list generation module;
the detection list generating module is used for acquiring the IP address of the monitoring platform and further calculating the IP address to be detected in the local area network to generate a detection list;
the analysis to-be-detected list generating module is used for regularly carrying out network detection on the IP addresses in the detection to-be-detected list by using an fping tool, generating the target reachable addresses into the analysis to-be-detected list, and removing the target reachable addresses from the detection to-be-detected list;
the type judgment module is used for creating a scanning task to perform port scanning on a UPD UDP (user Datagram protocol) and a TCP (transmission control protocol) of an IP address in a to-be-analyzed list at regular time on a corresponding device node and judging the type of the device node according to a protocol port opened by each device node;
and the resource list to be added generation module is used for generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
3. The platform of claim 2, wherein the configuration manager comprises an initial configuration module, an addition module, and an output module;
the initial configuration module is used for acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
the adding module is used for adding the equipment nodes to the monitoring platform and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and the output module is used for issuing equipment addition information to the data acquisition unit after the addition is finished and deleting the IP and the type of the equipment node from the resource list to be added.
4. The monitoring platform for the nodes of the whole SRDC cabinet according to claim 3, wherein the data collector is configured to create a timing collection task to actively collect the information and the status of the node components of the device after receiving the message sent by the configuration manager, perform analysis and preprocessing on the collected data, convert the data into a standardized alarm message, and send the standardized alarm message to the alarm processor.
5. The monitoring platform of the SRDC whole cabinet node according to claim 4, wherein the alarm processor comprises an analysis module, a judgment processing module and a sending module;
the analysis module is used for analyzing the information of the warning source, the position and the grade after receiving the warning information sent by the data acquisition device and the Trap monitor;
the judging and processing module is used for judging and processing the information output by the analysis module, and if the information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and the sending module is used for sending the alarm information and the position to the touch display screen of the SRDC whole cabinet for rolling display.
6. A monitoring method of SRDC complete cabinet nodes is characterized in that the monitoring method is applied to a monitoring system, and the system comprises a monitoring platform and equipment nodes in an SRDC complete cabinet, wherein the equipment nodes and the monitoring platform form a local area network for communication; the method comprises the following steps:
detecting equipment nodes which survive in the local area network, and analyzing the types of the equipment nodes through port numbers of the equipment nodes;
adding the equipment nodes into the monitoring platform by utilizing default initial protocol configuration of each type of equipment nodes;
establishing a monitoring task to monitor the equipment node and regularly acquire monitoring information, processing the acquired information and sending warning information;
monitoring Trap messages sent to a monitoring platform by each equipment node, analyzing and processing the Trap message and sending alarm information;
and receiving the alarm information and judging whether to eliminate the alarm or generate the alarm or upgrade or downgrade the alarm.
7. The method for monitoring the nodes of the SRDC complete equipment cabinet according to claim 6, wherein the step of detecting the equipment nodes living in the local area network and analyzing the types of the equipment nodes through the port numbers of the equipment nodes comprises the following steps: the IP address of the equipment node which can be reached by the network in the local area network is detected at regular time, and the detected type of the equipment node is analyzed and judged through a TCP or UDP port opened by each equipment node; the method specifically comprises the following steps:
acquiring an IP address of a monitoring platform, and further calculating the IP address to be detected in the local area network to generate a list to be detected;
performing network detection on the IP addresses in the to-be-detected list at regular time by using an fping tool, generating a to-be-analyzed list of the addresses where the targets can reach, and removing the to-be-detected list of the addresses where the targets can reach;
creating a scanning task to perform port scanning on UPD UDP and TCP of an IP address in a list to be analyzed at regular time and corresponding equipment nodes, and judging the equipment node type according to a protocol port started by each equipment node;
and generating a resource list to be added according to the IP and the type of the equipment node after the port scanning is finished, and removing the IP from the list to be analyzed.
8. The method for monitoring the nodes of the SRDC complete equipment cabinet according to claim 7, wherein the step of adding the equipment nodes into the monitoring platform by using default initial protocol configuration of each type of equipment node comprises the following steps:
acquiring the IP and the type of the equipment node to be added from the resource list to be added at regular time, and acquiring the model and the serial number of the equipment by using corresponding equipment factory default protocol configuration;
adding the equipment nodes to the monitoring platform, and automatically configuring target addresses sent by Trap of each equipment node as monitoring platform IP addresses and SNMP protocol parameters;
and after the addition is finished, issuing equipment addition information, and deleting the IP and the type of the equipment node from the resource list to be added.
9. The monitoring platform of the SRDC whole cabinet node according to claim 8, wherein the step of creating a monitoring task to monitor the equipment node and periodically collect monitoring information, and the step of processing the collected information and sending an alarm message comprises:
and after receiving the sent adding message, creating a timing acquisition task to actively acquire the information and the state of the node part of the equipment, analyzing the running state of the equipment and the health state of each part according to the acquired information, and assembling the information into a standardized alarm message to be issued.
10. The method for monitoring the nodes of the whole SRDC cabinet according to claim 9, wherein the step of receiving the alarm information and judging whether to perform the elimination of the alarm or the generation of the alarm or the upgrading or downgrading operation of the alarm comprises the following steps:
after receiving the warning information, analyzing warning source, position and grade information;
judging and processing the analyzed information, and if the analyzed information is alarm information, judging whether to generate new alarm information or to degrade or upgrade the corresponding alarm level stored in the monitoring platform according to the alarm information stored in the monitoring platform; if the alarm recovery information is the alarm recovery information, clearing the corresponding alarm stored in the monitoring platform;
and sending the alarm information and the position to a touch display screen of the SRDC whole cabinet for rolling display.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010726244.3A CN111970146B (en) | 2020-07-25 | 2020-07-25 | Monitoring platform and monitoring method for SRDC whole cabinet nodes |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010726244.3A CN111970146B (en) | 2020-07-25 | 2020-07-25 | Monitoring platform and monitoring method for SRDC whole cabinet nodes |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111970146A true CN111970146A (en) | 2020-11-20 |
CN111970146B CN111970146B (en) | 2022-07-08 |
Family
ID=73362710
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010726244.3A Active CN111970146B (en) | 2020-07-25 | 2020-07-25 | Monitoring platform and monitoring method for SRDC whole cabinet nodes |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111970146B (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112911086A (en) * | 2021-01-29 | 2021-06-04 | 卡莱特云科技股份有限公司 | Classification control method and device for batch video processing equipment |
CN113381881A (en) * | 2021-05-25 | 2021-09-10 | 山东浪潮爱购云链信息科技有限公司 | Method and device for monitoring alarm processing of host |
CN113688014A (en) * | 2021-07-30 | 2021-11-23 | 济南浪潮数据技术有限公司 | Alarm processing method, device, equipment and medium for SRDC whole cabinet |
CN113709210A (en) * | 2021-07-30 | 2021-11-26 | 济南浪潮数据技术有限公司 | Device discovery method, device, system, electronic device and storage medium |
CN113727210A (en) * | 2021-08-06 | 2021-11-30 | 济南浪潮数据技术有限公司 | Equipment information management method, system, storage medium and equipment |
CN114124989A (en) * | 2021-09-28 | 2022-03-01 | 山东中创软件商用中间件股份有限公司 | Equipment monitoring method, device, equipment and storage medium |
CN114785722A (en) * | 2022-06-14 | 2022-07-22 | 武汉四通信息服务有限公司 | Monitoring data processing method and device and computer readable storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101640859A (en) * | 2009-05-19 | 2010-02-03 | 深圳市永达电子股份有限公司 | Equipment cabinet alarm device, monitoring platform, management system and method |
CN105282772A (en) * | 2015-09-10 | 2016-01-27 | 北京爱可生通信技术有限公司 | Wireless network data communication equipment monitoring system and equipment monitoring method |
CN109240891A (en) * | 2018-09-26 | 2019-01-18 | 郑州云海信息技术有限公司 | A kind of monitoring method and device of SR whole machine cabinet server |
CN109361543A (en) * | 2018-10-30 | 2019-02-19 | 郑州云海信息技术有限公司 | A kind of whole machine cabinet monitoring method, device, terminal and storage medium |
-
2020
- 2020-07-25 CN CN202010726244.3A patent/CN111970146B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101640859A (en) * | 2009-05-19 | 2010-02-03 | 深圳市永达电子股份有限公司 | Equipment cabinet alarm device, monitoring platform, management system and method |
CN105282772A (en) * | 2015-09-10 | 2016-01-27 | 北京爱可生通信技术有限公司 | Wireless network data communication equipment monitoring system and equipment monitoring method |
CN109240891A (en) * | 2018-09-26 | 2019-01-18 | 郑州云海信息技术有限公司 | A kind of monitoring method and device of SR whole machine cabinet server |
CN109361543A (en) * | 2018-10-30 | 2019-02-19 | 郑州云海信息技术有限公司 | A kind of whole machine cabinet monitoring method, device, terminal and storage medium |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112911086A (en) * | 2021-01-29 | 2021-06-04 | 卡莱特云科技股份有限公司 | Classification control method and device for batch video processing equipment |
CN112911086B (en) * | 2021-01-29 | 2023-10-10 | 卡莱特云科技股份有限公司 | Classification control method and device for batch video processing equipment |
CN113381881A (en) * | 2021-05-25 | 2021-09-10 | 山东浪潮爱购云链信息科技有限公司 | Method and device for monitoring alarm processing of host |
CN113381881B (en) * | 2021-05-25 | 2022-12-09 | 山东浪潮爱购云链信息科技有限公司 | Method and device for monitoring alarm processing of host |
CN113688014A (en) * | 2021-07-30 | 2021-11-23 | 济南浪潮数据技术有限公司 | Alarm processing method, device, equipment and medium for SRDC whole cabinet |
CN113709210A (en) * | 2021-07-30 | 2021-11-26 | 济南浪潮数据技术有限公司 | Device discovery method, device, system, electronic device and storage medium |
CN113688014B (en) * | 2021-07-30 | 2024-02-09 | 济南浪潮数据技术有限公司 | Alarm processing method, device, equipment and medium for SRDC whole cabinet |
CN113727210A (en) * | 2021-08-06 | 2021-11-30 | 济南浪潮数据技术有限公司 | Equipment information management method, system, storage medium and equipment |
CN113727210B (en) * | 2021-08-06 | 2023-08-22 | 济南浪潮数据技术有限公司 | Equipment information management method, system, storage medium and equipment |
CN114124989A (en) * | 2021-09-28 | 2022-03-01 | 山东中创软件商用中间件股份有限公司 | Equipment monitoring method, device, equipment and storage medium |
CN114785722A (en) * | 2022-06-14 | 2022-07-22 | 武汉四通信息服务有限公司 | Monitoring data processing method and device and computer readable storage medium |
CN114785722B (en) * | 2022-06-14 | 2022-09-30 | 武汉四通信息服务有限公司 | Monitoring data processing method and device and computer readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN111970146B (en) | 2022-07-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111970146B (en) | Monitoring platform and monitoring method for SRDC whole cabinet nodes | |
US20200169482A1 (en) | Monitoring and analysis of interactions between network endpoints | |
CN102447570B (en) | Monitoring device and method based on health degree analysis | |
KR100748246B1 (en) | Multi-step integrated security monitoring system and method using intrusion detection system log collection engine and traffic statistic generation engine | |
CN107491375A (en) | Equipment detection and fault early warning system and method under a kind of cloud computing environment | |
EP1742416A1 (en) | Methods, computer readable medium and system for analyzing and management of application traffic on networks | |
US20070297349A1 (en) | Method and System for Collecting Information Relating to a Communication Network | |
CN103546343B (en) | The network traffics methods of exhibiting of network traffic analysis system and system | |
CN110659109B (en) | System and method for monitoring openstack virtual machine | |
US11356335B2 (en) | Machine learning-based network analytics, troubleshoot, and self-healing system and method | |
CN102739802A (en) | Service application-oriented IT contralized operation and maintenance analyzing system | |
EP2372954B1 (en) | Method and system for collecting information relating to a communication network | |
CN104219091A (en) | System and method for network operation fault detection | |
CN112714013B (en) | Application fault positioning method in cloud environment | |
CN109995582A (en) | Asset equipment management system and method based on real-time status | |
US11659449B2 (en) | Machine learning-based network analytics, troubleshoot, and self-healing holistic telemetry system incorporating modem-embedded machine analysis of multi-protocol stacks | |
CN108234161A (en) | For the access detection method and system of on-line off-line multitiered network framework | |
CN114039900A (en) | Efficient network data packet protocol analysis method and system | |
CN105279651B (en) | A kind of transaction data monitor processing method and system | |
CN114679378A (en) | Log monitoring and analyzing method and system, storage medium and electronic device | |
CN112367212B (en) | Virtual machine network quality monitoring method and system in cloud environment | |
CN101656632A (en) | Virus monitoring method and virus monitoring device in large network | |
CN112884176B (en) | Management system and method | |
CN112312376B (en) | Method and system for remotely and interactively managing multifunctional electric meter | |
CN112910726A (en) | Cloud environment flow monitoring method, device and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |