CN113708967B - System monitoring disaster recovery early warning device and early warning method - Google Patents

System monitoring disaster recovery early warning device and early warning method

Info

Publication number
CN113708967B
Authority
CN
China
Prior art keywords
service
monitoring
service module
signal transmitting
early warning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110988772.0A
Other languages
Chinese (zh)
Other versions
CN113708967A (en
Inventor
梁亚萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sinochem Information Technology Co Ltd
Original Assignee
Sinochem Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sinochem Information Technology Co Ltd
Priority to CN202110988772.0A
Publication of CN113708967A
Application granted
Publication of CN113708967B

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04 Network management architectures or arrangements
    • H04L41/042 Network management architectures or arrangements comprising distributed management centres cooperatively managing the network
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/04 Processing captured monitoring data, e.g. for logfile generation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1073 Registration or de-registration
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/51 Discovery or management thereof, e.g. service location protocol [SLP] or web services

Abstract

The invention discloses a system monitoring disaster recovery early warning device and an early warning method. The device consists of a service registration center, a monitoring management center and a plurality of service modules. The service registration center is connected with the monitoring management center; a main monitoring signal transmitting device is arranged in each service module, and the monitoring management center records and manages the main monitoring signal transmitting device in each service module; a plurality of independently operating service units are arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit. Throughout the monitoring system, signals are actively sent from the bottom up: once a lower-level device has sent its signal, its work is done, and it need not concern itself with how the upper-level device operates, which simplifies network requests. If the upper-level service receives no information within the period, the lower-level module is considered to have failed; repeated request-and-confirm checks are unnecessary, which reduces network overhead and latency.

Description

System monitoring disaster recovery early warning device and early warning method
Technical Field
The invention belongs to the technical field of computers and information systems, and particularly relates to a system monitoring disaster recovery early warning device and method.
Background
At present, distributed system architecture has become an essential approach to system design. The microservice design style further reduces coupling within a system, and, as server resource costs continue to fall, cluster deployment has become the main way of deploying large-scale systems. Cluster deployment can greatly improve the running stability of a system, but the growing number of running nodes requires a complete method for the management, monitoring, disaster recovery and early warning of those nodes, so that running stability is ensured and the system can be operated and maintained in a timely manner when it fails. Enterprises currently use large numbers of virtual machines for system deployment and production, and dedicated devices are needed for cluster system monitoring, disaster recovery and early warning. Because of network-level restrictions inside an enterprise, the running state of each service must be monitored, the execution of the standards required of the virtual machines on which the services run must be monitored, and service disaster recovery must be carried out promptly.
Disclosure of Invention
The invention provides a system monitoring disaster recovery early warning device and method for overcoming the defects existing in the prior art.
The invention is realized by the following technical scheme: a system monitoring disaster recovery early warning device mainly comprises a service registration center which is responsible for managing each service module, a monitoring management center which is responsible for monitoring the running condition of each service module and a plurality of service modules; the service registration center is connected with the monitoring management center, a main monitoring signal transmitting device is arranged in each service module, the monitoring management center records and manages the main monitoring signal transmitting device in each service module, a plurality of independently operated service units are arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit; the service unit monitoring signal transmitting base device transmits a service unit monitoring signal to the main monitoring signal transmitting device of the service module.
The system monitoring disaster recovery early warning method of the invention specifically comprises the following steps:
(1) The monitoring management center sets basic information, and each monitoring module starts a current monitoring period;
(2) Check whether the monitoring management center has received state information from main monitoring signal transmitting device X; if not, transmit the basic information of the abnormal service module to the service registration center, which takes the abnormal service module offline and transmits the basic information of the service fault outside the system;
(3) If the monitoring management center has received state information from main monitoring signal transmitting device X, check whether that state information is normal;
(4) If the state information of main monitoring signal transmitting device X is normal, check whether device X has collected the base device monitoring signals of all service units in the service module; if it has, the current monitoring period is normal;
(5) If the state information of main monitoring signal transmitting device X is abnormal, transmit the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module to the monitoring management center, and transmit the basic information of the abnormal service module to the service registration center; after the service registration center takes the abnormal service module offline, the basic information of the service fault is transmitted outside the system;
(6) If main monitoring signal transmitting device X has not collected the base device monitoring signals of all service units in the service module, the failure state and basic information of the service units concerned are transmitted to the main monitoring signal transmitting device; the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module are then transmitted to the monitoring management center, the basic information of the abnormal service module is transmitted to the service registration center, the service registration center takes the abnormal service module offline, and the basic information of the service fault is transmitted outside the system.
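The decision made by the monitoring management center in steps (2) to (6) can be sketched as a single function evaluated once per monitoring period. This is only an illustrative sketch: the names `ModuleReport` and `evaluate_period`, the field names and the shape of the returned dictionary are assumptions and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class ModuleReport:
    """State information sent by a module's main monitoring signal transmitting device.

    All field names are hypothetical; the patent does not define a wire format here.
    """
    module_name: str
    normal: bool                      # overall state reported by the module
    expected_units: Set[str]          # service units the module should contain
    reporting_units: Set[str]         # units whose signal was collected this period
    failed_units: List[str] = field(default_factory=list)


def evaluate_period(module_name: str, report: Optional[ModuleReport]) -> dict:
    """Return the decision for one monitoring period, following steps (2)-(6)."""

    def abnormal(reason: str, failed) -> dict:
        # Every abnormal branch ends the same way: take the module offline at the
        # service registration center and notify outside the system.
        return {"module": module_name, "status": "abnormal", "reason": reason,
                "take_offline": True, "notify_external": True,
                "failed_units": sorted(failed)}

    if report is None:
        # Step (2): nothing was received within the period.
        return abnormal("no report received", [])
    if not report.normal:
        # Step (5): the module itself reported an abnormal state.
        return abnormal("abnormal state reported", report.failed_units)
    missing = report.expected_units - report.reporting_units
    if missing:
        # Step (6): some service unit signals were not collected.
        return abnormal("unit signals missing", missing)
    # Step (4): all unit signals collected, the period is normal.
    return {"module": module_name, "status": "normal", "reason": "",
            "take_offline": False, "notify_external": False, "failed_units": []}
```

In this sketch a missing report is modelled as `None`, which matches the idea that silence within the period is itself treated as a failure.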
The system monitoring disaster recovery early warning device comprises a service registration center and a monitoring management center; a main monitoring signal transmitting device is arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit. The service registration center manages the service modules and distributes service requests to them. The monitoring management center records and manages the main monitoring signal transmitting device in each service module, and is responsible for monitoring the running condition of each service module, sending online and offline notifications to the service registration center, and sending early warning and fault notifications outside the system. The main monitoring signal transmitting device is responsible for transmitting the monitoring signal of the whole service module and for receiving the monitoring signals of the service units in the service module. The service unit monitoring signal transmitting base device is responsible for transmitting the service unit monitoring signal to the main monitoring signal transmitting device of the service module.
The monitoring management center sets the early warning mechanism for the service registration center and the rules for sending service fault notifications outside the system. In each monitoring period the monitoring management center receives the service module running state notification from the downstream main monitoring signal transmitting device; if no notification is received within the period, or a notification of abnormal running state is received, the monitoring management center transmits the basic information and fault information of the abnormally running service module to the service registration center, and the service registration center takes the service module offline and sends the fault notification externally. The services are deployed as a cluster, and the service registration center sends requests only to nodes on which a normally running service module is located, so that requests do not have to be forwarded a second time.
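A minimal sketch of how a registration center might route requests only to nodes whose service module is online is given below. The class `ServiceRegistry`, its method names and the round-robin selection are assumptions made for illustration; the patent does not specify how requests are distributed among healthy nodes.

```python
from typing import Dict, Optional


class ServiceRegistry:
    """Hypothetical service registration center that skips offline nodes."""

    def __init__(self) -> None:
        # module name -> {node address: online flag}
        self._nodes: Dict[str, Dict[str, bool]] = {}
        # module name -> counter used for round-robin selection (an assumption)
        self._next: Dict[str, int] = {}

    def register(self, module: str, node: str) -> None:
        self._nodes.setdefault(module, {})[node] = True
        self._next.setdefault(module, 0)

    def set_online(self, module: str, node: str, online: bool) -> None:
        # Called when the monitoring management center sends an online or
        # offline notification for a module instance.
        if node in self._nodes.get(module, {}):
            self._nodes[module][node] = online

    def route(self, module: str) -> Optional[str]:
        # Only nodes currently marked online are candidates, so a request is
        # never sent to a failed node and then forwarded again.
        healthy = [n for n, ok in self._nodes.get(module, {}).items() if ok]
        if not healthy:
            return None
        index = self._next[module] % len(healthy)
        self._next[module] += 1
        return healthy[index]
```

Returning `None` when no node is online leaves the caller to decide how to report the outage, which is outside what the text describes.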
The main monitoring signal transmitting device is responsible for transmitting the overall running state of the service module to the monitoring management center in each monitoring period; if no state signal is transmitted within the monitoring period, the state of the service module is considered abnormal. Within the monitoring period, the main monitoring signal transmitting device collects the running states of all service units in the service module: when all of them are normal, the service module is normal; when any service unit in the service module is abnormal, the service module is considered abnormal, and the basic information of the abnormal service units is collected and sent to the monitoring management center. The information sent by the main monitoring signal transmitting device to the monitoring management center comprises the name of the service module, all physical IP addresses and port numbers of the service module, the names of the service units in the service module with their corresponding physical IP addresses and running port numbers, and the physical server time at which the information is sent.
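The message fields listed above and the "all units normal" aggregation rule can be sketched as follows. The dataclass and function names, the field names and the use of a Unix timestamp for the server time are assumptions; only the list of fields comes from the text.

```python
import time
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class UnitInfo:
    name: str
    ip: str
    port: int


@dataclass
class ModuleStatusMessage:
    module_name: str
    module_endpoints: List[Tuple[str, int]]  # all physical IPs and ports of the module
    units: List[UnitInfo]                    # unit names with their IP and running port
    abnormal_units: List[UnitInfo]           # basic information of abnormal units
    normal: bool                             # True only if every unit is normal
    sent_at: float                           # physical server time when sent


def build_module_message(module_name: str,
                         module_endpoints: List[Tuple[str, int]],
                         units: List[UnitInfo],
                         unit_states: Dict[str, bool],
                         now: Optional[float] = None) -> ModuleStatusMessage:
    """Aggregate unit states into the module-level message.

    A unit with no recorded state is treated as abnormal, mirroring the rule
    that silence within the period counts as a failure.
    """
    abnormal = [u for u in units if not unit_states.get(u.name, False)]
    return ModuleStatusMessage(
        module_name=module_name,
        module_endpoints=module_endpoints,
        units=units,
        abnormal_units=abnormal,
        normal=not abnormal,
        sent_at=time.time() if now is None else now,
    )
```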
The service unit monitoring signal transmitting base device transmits the running state of the service unit to the main monitoring signal transmitting device in each monitoring period; when the base device sends no state information within the monitoring period, the main monitoring signal transmitting device considers the running state of the service unit to be abnormal. The information sent by the service unit monitoring signal transmitting base device comprises the service unit name, the physical IP address on which the service unit runs, and the physical service port.
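On the receiving side, the main monitoring signal transmitting device only needs to remember when each service unit last reported and compare that with the monitoring period. The sketch below assumes explicit timestamps in seconds and a hypothetical class name `UnitHeartbeatTracker`; neither appears in the patent.

```python
from typing import Dict, Iterable, List, Optional, Tuple


class UnitHeartbeatTracker:
    """Tracks the last signal received from each service unit (illustrative only)."""

    def __init__(self, period_seconds: float, expected_units: Iterable[str]) -> None:
        self.period = period_seconds
        # None means the unit has never reported.
        self.last_seen: Dict[str, Optional[float]] = {u: None for u in expected_units}
        self.endpoints: Dict[str, Tuple[str, int]] = {}

    def record(self, unit_name: str, ip: str, port: int, now: float) -> None:
        # The signal carries the unit name, its physical IP and its service port.
        self.last_seen[unit_name] = now
        self.endpoints[unit_name] = (ip, port)

    def abnormal_units(self, now: float) -> List[str]:
        # A unit is abnormal if it never reported or its last signal is older
        # than one monitoring period. No request is sent back to the unit.
        return sorted(
            name for name, seen in self.last_seen.items()
            if seen is None or now - seen > self.period
        )
```

Because the check is purely local, the service unit sends once per period and never waits for a reply, which is the bottom-up, one-way signalling the text describes.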
The system monitoring disaster recovery early warning method also comprises a fault tolerance mechanism: an early warning notification rule and a service re-online rule are set in the monitoring management center. Under the early warning notification rule, when a service module fails to send a status report, or sends an abnormal status signal, to the monitoring management center within the set periods, an early warning notification is sent outside the system; this avoids system service state jitter caused by ordinary interference such as network fluctuation, and improves the fault tolerance of system early warning and the accuracy of early warning notifications. Under the service re-online rule, when a service module that previously failed to send a status report or sent an abnormal status signal then reports a normal state to the monitoring management center continuously for a set threshold number of monitoring periods, the service module is considered to be running normally, and the name of the re-onlined service module and its basic physical information are sent to the service registration center.
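One possible reading of the two rules is a pair of streak counters per service module: an external warning is raised only after a set number of consecutive abnormal periods, and the module is put back online only after a set number of consecutive normal periods. The sketch below follows that reading; the class name, the threshold parameters and the event strings are assumptions, and the patent text does not fix the exact counting scheme.

```python
from typing import Optional


class ModuleFaultTolerance:
    """Per-module warning and re-online rules (an interpretation, not the patent's text)."""

    def __init__(self, warn_threshold: int, recover_threshold: int) -> None:
        self.warn_threshold = warn_threshold        # abnormal periods before warning
        self.recover_threshold = recover_threshold  # normal periods before re-online
        self.online = True
        self.warned = False
        self.bad_streak = 0
        self.good_streak = 0

    def observe(self, period_ok: bool) -> Optional[str]:
        """Feed the result of one monitoring period; return an event name or None."""
        if period_ok:
            self.bad_streak = 0
            self.good_streak += 1
            if not self.online and self.good_streak >= self.recover_threshold:
                # Enough consecutive normal periods: report the module as online again.
                self.online = True
                self.warned = False
                return "re-online"
        else:
            self.good_streak = 0
            self.bad_streak += 1
            self.online = False  # an abnormal period takes the module offline
            if not self.warned and self.bad_streak >= self.warn_threshold:
                # Sustained failure rather than a single glitch: warn externally.
                self.warned = True
                return "warning"
        return None
```

With `warn_threshold=3` and `recover_threshold=2`, two abnormal periods followed by a normal one produce no warning, which is the jitter suppression the paragraph above is aiming for.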
The beneficial effects of the invention are as follows: the invention relates to a system monitoring disaster recovery early warning mechanism, and in particular discloses a system monitoring disaster recovery early warning device and method. Throughout the monitoring system, signals are actively sent from the bottom up: once a lower-level device has sent its signal, its work is done, and it need not concern itself with how the upper-level device operates, which simplifies network requests. If the upper-level service receives no information within the period, the lower-level module is considered to have failed; repeated request-and-confirm checks are unnecessary, which reduces network overhead and latency.
Drawings
FIG. 1 is a logic block diagram of a system monitoring disaster recovery early warning device of the present invention;
FIG. 2 is a flow chart of the operation of the system monitoring and disaster recovery warning device of the present invention.
Detailed Description
The invention is described in detail below with reference to the drawings and the detailed description.
As shown in fig. 1, the invention discloses a system monitoring disaster recovery early warning device, which mainly comprises a service registration center responsible for managing each service module, a monitoring management center responsible for monitoring the running condition of each service module and a plurality of service modules; the service registration center is connected with the monitoring management center, a main monitoring signal transmitting device is arranged in each service module, the monitoring management center records and manages the main monitoring signal transmitting device in each service module, a plurality of independently operated service units are arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit; the service unit monitoring signal transmitting base device transmits a service unit monitoring signal to the main monitoring signal transmitting device of the service module.
As shown in FIG. 2, the system monitoring disaster recovery early warning method of the present invention specifically includes the following steps:
(1) The monitoring management center sets basic information, and each monitoring module starts a current monitoring period;
(2) Check whether the monitoring management center has received state information from main monitoring signal transmitting device X; if not, transmit the basic information of the abnormal service module to the service registration center, which takes the abnormal service module offline and transmits the basic information of the service fault outside the system;
(3) If the monitoring management center has received state information from main monitoring signal transmitting device X, check whether that state information is normal;
(4) If the state information of main monitoring signal transmitting device X is normal, check whether device X has collected the base device monitoring signals of all service units in the service module; if it has, the current monitoring period is normal;
(5) If the state information of main monitoring signal transmitting device X is abnormal, transmit the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module to the monitoring management center, and transmit the basic information of the abnormal service module to the service registration center; after the service registration center takes the abnormal service module offline, the basic information of the service fault is transmitted outside the system;
(6) If main monitoring signal transmitting device X has not collected the base device monitoring signals of all service units in the service module, the failure state and basic information of the service units concerned are transmitted to the main monitoring signal transmitting device; the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module are then transmitted to the monitoring management center, the basic information of the abnormal service module is transmitted to the service registration center, the service registration center takes the abnormal service module offline, and the basic information of the service fault is transmitted outside the system.
The system monitoring disaster recovery early warning device comprises a service registration center and a monitoring management center; a main monitoring signal transmitting device is arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit. The service registration center manages the service modules and distributes service requests to them. The monitoring management center records and manages the main monitoring signal transmitting device in each service module, and is responsible for monitoring the running condition of each service module, sending online and offline notifications to the service registration center, and sending early warning and fault notifications outside the system. The main monitoring signal transmitting device is responsible for transmitting the monitoring signal of the whole service module and for receiving the monitoring signals of the service units in the service module. The service unit monitoring signal transmitting base device is responsible for transmitting the service unit monitoring signal to the main monitoring signal transmitting device of the service module.
The monitoring management center sets the early warning mechanism for the service registration center and the rules for sending service fault notifications outside the system. In each monitoring period the monitoring management center receives the service module running state notification from the downstream main monitoring signal transmitting device; if no notification is received within the period, or a notification of abnormal running state is received, the monitoring management center transmits the basic information and fault information of the abnormally running service module to the service registration center, and the service registration center takes the service module offline and sends the fault notification externally. The services are deployed as a cluster, and the service registration center sends requests only to nodes on which a normally running service module is located, so that requests do not have to be forwarded a second time.
The main monitoring signal transmitting device is responsible for transmitting the overall running state of the service module to the monitoring management center in each monitoring period; if no state signal is transmitted within the monitoring period, the state of the service module is considered abnormal. Within the monitoring period, the main monitoring signal transmitting device collects the running states of all service units in the service module: when all of them are normal, the service module is normal; when any service unit in the service module is abnormal, the service module is considered abnormal, and the basic information of the abnormal service units is collected and sent to the monitoring management center. The information sent by the main monitoring signal transmitting device to the monitoring management center comprises the name of the service module, all physical IP addresses and port numbers of the service module, the names of the service units in the service module with their corresponding physical IP addresses and running port numbers, and the physical server time at which the information is sent.
The service unit monitoring signal transmitting base device transmits the running state of the service unit to the main monitoring signal transmitting device in each monitoring period; when the base device sends no state information within the monitoring period, the main monitoring signal transmitting device considers the running state of the service unit to be abnormal. The information sent by the service unit monitoring signal transmitting base device comprises the service unit name, the physical IP address on which the service unit runs, and the physical service port.
The system monitoring disaster recovery early warning method also comprises a fault tolerance mechanism: an early warning notification rule and a service re-online rule are set in the monitoring management center. Under the early warning notification rule, when a service module fails to send a status report, or sends an abnormal status signal, to the monitoring management center within the set periods, an early warning notification is sent outside the system; this avoids system service state jitter caused by ordinary interference such as network fluctuation, and improves the fault tolerance of system early warning and the accuracy of early warning notifications. Under the service re-online rule, when a service module that previously failed to send a status report or sent an abnormal status signal then reports a normal state to the monitoring management center continuously for a set threshold number of monitoring periods, the service module is considered to be running normally, and the name of the re-onlined service module and its basic physical information are sent to the service registration center.
Finally, it should be noted that the above description only illustrates the technical solution of the present invention and does not limit its scope; those skilled in the art may make simple modifications and equivalent substitutions to the technical solution of the present invention without departing from its spirit and scope.

Claims (9)

1. A system monitoring disaster recovery early warning method, characterized in that: the method is carried out using a system monitoring disaster recovery early warning device, and the system monitoring disaster recovery early warning device consists of a service registration center, a monitoring management center and a plurality of service modules; the service registration center is connected with the monitoring management center, a main monitoring signal transmitting device is arranged in each service module, the monitoring management center records and manages the main monitoring signal transmitting device in each service module, a plurality of independently operating service units are arranged in each service module, and a service unit monitoring signal transmitting base device is arranged in each service unit; the service unit monitoring signal transmitting base device transmits a service unit monitoring signal to the main monitoring signal transmitting device of the service module;
the system monitoring disaster recovery early warning method specifically comprises the following steps:
(1) The monitoring management center sets basic information, and each monitoring module starts a current monitoring period;
(2) Check whether the monitoring management center has received state information from main monitoring signal transmitting device X; if not, transmit the basic information of the abnormal service module to the service registration center, which takes the abnormal service module offline and transmits the basic information of the service fault outside the system;
(3) If the monitoring management center has received state information from main monitoring signal transmitting device X, check whether that state information is normal;
(4) If the state information of main monitoring signal transmitting device X is normal, check whether device X has collected the base device monitoring signals of all service units in the service module; if it has, the current monitoring period is normal;
(5) If the state information of main monitoring signal transmitting device X is abnormal, transmit the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module to the monitoring management center, and transmit the basic information of the abnormal service module to the service registration center; after the service registration center takes the abnormal service module offline, the basic information of the service fault is transmitted outside the system;
(6) If main monitoring signal transmitting device X has not collected the base device monitoring signals of all service units in the service module, the failure state and basic information of the service units concerned are transmitted to the main monitoring signal transmitting device; the monitoring module failure state, the service module basic information and the set of basic information of the failed service units in the service module are then transmitted to the monitoring management center, the basic information of the abnormal service module is transmitted to the service registration center, the service registration center takes the abnormal service module offline, and the basic information of the service fault is transmitted outside the system.
2. The system monitoring disaster recovery early warning method according to claim 1, wherein: the system monitoring disaster-tolerant early warning device comprises a service registration center and a monitoring management center, wherein each service module is internally provided with a main monitoring signal transmitting device, and each service module is internally provided with a service unit monitoring signal transmitting basic device; the service registration center manages each service module and distributes service requests to each service module; the monitoring management center records and manages a main monitoring signal transmitting device in each service module and is responsible for monitoring the running condition of each service module, transmitting an online and offline notification to the service registration center and transmitting early warning and fault notification outside the system; the main monitoring signal transmitting device is responsible for transmitting monitoring signals in the whole service module and receiving monitoring signals of the service units in the service module; the service unit monitoring signal transmitting base device is responsible for transmitting the service unit monitoring signal to the main monitoring signal transmitting device of the service module.
3. The system monitoring disaster recovery early warning method according to claim 1, wherein: the monitoring management center sets an early warning mechanism for the service registration center and sends a service fault notification rule to the outside of the system; the monitoring management center receives the service module running state notification of the downstream main monitoring signal transmitting device in each monitoring period, if the notification is not received in the period or the abnormal running state notification of the service module is received, the monitoring management center transmits basic information and fault information of the abnormal running service module to the service registration center, and the service registration center goes off line for the service module and transmits the fault notification to the outside; the service adopts cluster mode deployment, and the service registration center only sends a request to the node where the service module which operates normally is located, so that the request is prevented from being forwarded for the second time.
4. The system monitoring disaster recovery early warning method according to claim 1, wherein: the main monitoring signal transmitting device is responsible for transmitting the overall running state of the service module to the monitoring management center in each monitoring period, and if no state signal is transmitted in the monitoring period, the state of the service module is considered abnormal; the main monitoring signal transmitting device collects the running states of all service units in the service module in the monitoring period; when all of these states are normal, the service module is normal; when any service unit in the service module is abnormal, the service module is considered abnormal, and the basic information of the abnormal service unit is collected and sent to the monitoring management center.
5. The system monitoring disaster recovery early warning method according to claim 1, wherein: the service unit monitoring signal transmitting base device transmits the running state of the service unit to the main monitoring signal transmitting device in the monitoring period, and when the service unit monitoring signal transmitting base device does not transmit state information in the monitoring period, the main monitoring signal transmitting device considers that the running state of the service unit is abnormal.
6. The system monitoring disaster recovery early warning method according to claim 3, wherein: the information sent by the main monitoring signal transmitting device to the monitoring management center comprises the name of the service module, all physical IP addresses and port numbers of the service module, the names of the service units in the service module with their corresponding physical IP addresses and operating port numbers, and the time on the physical server at the moment of sending.
7. The system monitoring disaster recovery early warning method according to claim 4, wherein: the information sent by the service unit monitoring signal transmitting base device comprises the service unit name, the physical IP address on which the service unit runs, and the physical service port.
8. The system monitoring disaster recovery early warning method according to claim 1, wherein: the method further comprises a fault tolerance mechanism, in which an early warning notification rule and a service re-online rule are set in the monitoring management center: when a service module fails to send a status report, or sends an abnormal status signal, to the monitoring management center within a set period, an early warning notification is sent outside the system; when a service module that previously failed to send a status report or sent an abnormal status signal then sends normal status reports to the monitoring management center continuously for a set threshold number of monitoring periods, the service module is considered to be running normally, and the name of the service module coming back online and its basic physical information are sent to the service registration center.
9. The system monitoring disaster recovery early warning method according to claim 1, wherein: a plurality of nodes may be deployed for the same service module, and the service registration center distributes service requests at random among the nodes of normally running service modules.
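
The monitoring flow described in claims 3 to 5, 8 and 9 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names (`ModuleReport`, `ServiceRegistry`, `MonitoringCenter`), the `recover_after` threshold, the `alerts` list standing in for external notifications, and the node addresses are all hypothetical and chosen only for illustration. The sketch assumes one report per (module, node) pair per monitoring period.

```python
import random
from dataclasses import dataclass


@dataclass
class ModuleReport:
    """Per-period report from a module's main monitoring signal transmitter."""
    module: str    # service module name
    node: str      # "ip:port" of the node running the module (assumed format)
    unit_ok: dict  # service unit name -> True if that unit is normal

    def healthy(self) -> bool:
        # The module counts as normal only if every service unit is normal.
        return all(self.unit_ok.values())


class ServiceRegistry:
    """Tracks which nodes of each service module may receive requests."""

    def __init__(self, seed=None):
        self.online = {}  # module name -> set of online nodes
        self._rng = random.Random(seed)

    def set_online(self, module, node, is_online):
        nodes = self.online.setdefault(module, set())
        if is_online:
            nodes.add(node)
        else:
            nodes.discard(node)

    def dispatch(self, module):
        # Random distribution over normally running nodes only.
        nodes = sorted(self.online.get(module, ()))
        if not nodes:
            raise LookupError(f"no online node for {module}")
        return self._rng.choice(nodes)


class MonitoringCenter:
    def __init__(self, registry, recover_after=3):
        self.registry = registry
        self.recover_after = recover_after  # hypothetical re-online threshold
        self.down = {}    # (module, node) -> consecutive normal periods
        self.alerts = []  # stands in for notifications sent outside the system

    def close_period(self, expected, reports):
        """Evaluate one monitoring period.

        expected: iterable of monitored (module, node) pairs
        reports:  list of ModuleReport objects received during the period
        """
        received = {(r.module, r.node): r for r in reports}
        for key in expected:
            report = received.get(key)
            if report is None or not report.healthy():
                # Missing or abnormal report: warn, take offline, reset streak.
                bad = [] if report is None else [
                    unit for unit, ok in report.unit_ok.items() if not ok]
                self.alerts.append((key, bad or "no report"))
                self.down[key] = 0
                self.registry.set_online(*key, False)
            elif key in self.down:
                # Normal report from an offline module: count toward recovery.
                self.down[key] += 1
                if self.down[key] >= self.recover_after:
                    del self.down[key]
                    self.registry.set_online(*key, True)


# Usage: two nodes of one module; node b stays silent for one period.
registry = ServiceRegistry(seed=0)
a, b = ("orders", "10.0.0.1:8080"), ("orders", "10.0.0.2:8080")
for key in (a, b):
    registry.set_online(*key, True)
center = MonitoringCenter(registry, recover_after=2)


def ok(key):
    return ModuleReport(*key, {"db": True, "api": True})


center.close_period([a, b], [ok(a)])  # b sent nothing, so it goes offline
```

In this sketch a silent or abnormal module is taken offline at the end of the period in which the problem is seen, and it returns to the dispatch pool only after `recover_after` consecutive normal periods, which is one plausible reading of the re-online rule in claim 8.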
CN202110988772.0A 2021-08-26 2021-08-26 System monitoring disaster recovery early warning device and early warning method Active CN113708967B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110988772.0A CN113708967B (en) 2021-08-26 2021-08-26 System monitoring disaster recovery early warning device and early warning method

Publications (2)

Publication Number Publication Date
CN113708967A CN113708967A (en) 2021-11-26
CN113708967B CN113708967B (en) 2024-04-16

Family

ID=78655252

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110988772.0A Active CN113708967B (en) 2021-08-26 2021-08-26 System monitoring disaster recovery early warning device and early warning method

Country Status (1)

Country Link
CN (1) CN113708967B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102651699A (en) * 2011-02-25 2012-08-29 华平信息技术股份有限公司 Tree-like layered structure monitoring system and monitoring method
KR101441435B1 (en) * 2013-03-21 2014-09-25 국방과학연구소 Operating method of distributed multi-layer system for weapon based on service-scale
CN107423192A (en) * 2017-06-14 2017-12-01 华迪计算机集团有限公司 A kind of system and method for multistage operation maintenance monitoring service
CN107634863A (en) * 2017-10-25 2018-01-26 北京百悟科技有限公司 Distributed monitoring device and method for domain name mapping disaster tolerance service
CN110581773A (en) * 2018-06-07 2019-12-17 北京怡合春天科技有限公司 automatic service monitoring and alarm management system
CN112073252A (en) * 2020-11-12 2020-12-11 北京优炫软件股份有限公司 Cloud platform monitoring method and system, electronic equipment and storage medium
CN112714035A (en) * 2019-10-25 2021-04-27 中兴通讯股份有限公司 Monitoring method and system
CN112769622A (en) * 2021-01-18 2021-05-07 孙冬英 Cluster service fault early warning system based on RPC service monitoring

Also Published As

Publication number Publication date
CN113708967A (en) 2021-11-26

Similar Documents

Publication Publication Date Title
CN1863081B (en) Managing system and method based on intelligent platform managing interface
US6038288A (en) System and method for maintenance arbitration at a switching node
US8775589B2 (en) Distributed network management system and method
CN1805400B (en) Notification of failures in a trunk network
US20140372805A1 (en) Self-healing managed customer premises equipment
CN103036719A (en) Cross-regional service disaster method and device based on main cluster servers
JP2015503864A (en) Technology to cope with changes in the state of interconnection nodes
CN112637368B (en) Distributed industrial data acquisition system and method
CN112468592B (en) Terminal online state detection method and system based on electric power information acquisition
CN103139248A (en) Rack system
CN102026042A (en) Keep-alive and self-healing method and device for advanced telecom computing architecture control surface
CN1452348A (en) Method for detecting problem in route protocol module of distribution router
CN112346925A (en) Process-level dual-computer hot standby redundancy system and method
CN102387210B (en) Distribution type file system monitoring method based on rapid synchronization network
CN112601216B (en) Zigbee-based trusted platform alarm method and system
CN106656584B (en) Distributed system invalid node judgment method
CN113708967B (en) System monitoring disaster recovery early warning device and early warning method
CN103138975B (en) Hosting method of multiple rack systems
CN102487332A (en) Fault processing method, apparatus thereof and system thereof
CN108234154B (en) Airborne switching network equipment fault monitoring method
JP4879823B2 (en) Supervisory control system
US9118540B2 (en) Method for monitoring a plurality of rack systems
US20130138803A1 (en) Method for monitoring a plurality of rack systems
CN112636999A (en) Port detection method and network monitoring system
CN109560962A (en) A kind of multinode collecting method of monocycle net type SCADA system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant