CN109857613B - Automatic operation and maintenance system based on collection cluster - Google Patents

Automatic operation and maintenance system based on collection cluster Download PDF

Info

Publication number
CN109857613B
CN109857613B CN201811589080.3A CN201811589080A CN109857613B CN 109857613 B CN109857613 B CN 109857613B CN 201811589080 A CN201811589080 A CN 201811589080A CN 109857613 B CN109857613 B CN 109857613B
Authority
CN
China
Prior art keywords
acquisition
inspection
collection
layer
management module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811589080.3A
Other languages
Chinese (zh)
Other versions
CN109857613A (en
Inventor
万明
张弛
刘嘉华
张伟
康睿
鄂龙慧
柏永飞
朱江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nari Information and Communication Technology Co
Original Assignee
Nari Information and Communication Technology Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nari Information and Communication Technology Co filed Critical Nari Information and Communication Technology Co
Priority to CN201811589080.3A priority Critical patent/CN109857613B/en
Publication of CN109857613A publication Critical patent/CN109857613A/en
Application granted granted Critical
Publication of CN109857613B publication Critical patent/CN109857613B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention discloses an automatic operation and maintenance system based on an acquisition cluster, which is divided into a display layer, a functional layer, an acquisition layer and a resource layer from top to bottom; the display layer is used for issuing an operation instruction to the functional layer; the functional layer calls various acquisition nodes, automatic configuration, daily inspection and fault detection are completed on software and hardware resources of the resource layer, the acquisition layer configures a multi-node acquisition cluster, data of the resource layer is acquired, acquisition results are fed back to the functional layer, and the resource layer is all software and hardware resources managed by the automatic operation and maintenance system. The automatic operation and maintenance system of the invention forms customized configuration and inspection tasks aiming at different configuration and inspection scenes, and realizes automatic acquisition and analysis of index parameters and automatic generation of inspection reports. The invention ensures good load capacity when facing a large number of acquisition tasks by means of the coordination work among a plurality of acquisition nodes, thereby ensuring the reliability and high efficiency of the acquisition process.

Description

Automatic operation and maintenance system based on collection cluster
Technical Field
The invention relates to an automatic operation and maintenance system based on an acquisition cluster, and belongs to the technical field of enterprise operation and maintenance management.
Background
With the expansion of enterprise business range and the enhancement of business capability, the hardware technical environment and the software technical environment related to the enterprise business system are gradually expanded. Operating systems that may be involved are Windows, Linux, Aix, Unix, etc.; databases that may be involved are Oracle, MongoDB, SQLserver, dreams, etc.; possible middleware involved are Weblogic, Tomcat, Jboss, etc.; the network devices relate to switches, routers, network firewalls, and the like. Some devices may differ in details in their manner of operation and technical principles due to differences in brands. The operation and maintenance work has a wide range of related aspects, the requirement on the professional ability of operation and maintenance personnel is high, and the operation and maintenance cost and the operation and maintenance difficulty are increased.
At present, the operation and maintenance mode of an enterprise still informs operation and maintenance personnel after a user operation service system fails, and then the operation and maintenance personnel take corresponding remedial measures. The traditional 'knife cultivation fire' type operation and maintenance mode has the defects of complex work, low efficiency, high personnel requirement, high error rate, difficulty in quick recovery in case of abnormity and the like. In the face of increasingly large operation and maintenance scales and increasingly complex operation and maintenance scenes, how to reduce operation and maintenance cost and improve operation and maintenance efficiency is an urgent issue to be solved.
The traditional operation and maintenance mode can not meet the current operation and maintenance scene, and at present, enterprises need to use specialized, standardized and streamlined means to realize the automatic management of operation and maintenance work, so that pure manual operation is converted into automatic operation and maintenance.
Disclosure of Invention
The technical problem to be solved by the invention is to overcome the defects of the prior art, and provide an automatic operation and maintenance system based on an acquisition cluster, construct an inspection knowledge base, form customized configuration and inspection tasks aiming at different configuration and inspection scenes, realize automatic acquisition and analysis of index parameters and automatic generation of inspection reports, and provide alarm processing and guidance.
In order to solve the technical problem, the invention provides an automatic operation and maintenance system based on an acquisition cluster, which comprises a presentation layer, a functional layer, an acquisition layer and a resource layer; the display layer provides a visual operation management interface, and a user issues an operation instruction to the functional layer through the visual operation management interface; the functional layer receives an operation instruction issued by a user, calls various acquisition nodes, and completes automatic configuration, daily inspection and fault check on software and hardware resources of the resource layer; the acquisition layer is provided with a multi-node acquisition cluster consisting of a plurality of acquisition nodes, the multi-node acquisition cluster receives an acquisition task issued by the functional layer, acquires data of the resource layer and feeds back an acquisition result to the functional layer; the resource layer is all software and hardware resources managed by the automatic operation and maintenance system.
The acquisition nodes are all configured with acquisition plug-ins supporting various network protocols, including SSH acquisition plug-ins, TELNET acquisition plug-ins, SNMP acquisition plug-ins, JMX acquisition plug-ins, WMI acquisition plug-ins, JDBC acquisition plug-ins, and HTTP acquisition plug-ins.
The aforementioned establishing of the corresponding acquisition index and acquisition protocol for all software and hardware resources managed by the automated operation and maintenance system includes: the method comprises the steps that a host system collects index system running time, operating system versions, CPU information, memory information, process information and port information; the database acquisition index comprises database health operation duration, database table space information, database user information and throughput; SNMP or SSH is used as an operating system information acquisition protocol; the database information collection protocol uses JDBC.
The multi-node collection cluster elects a Leader node according to an election rule, the Leader node directly receives a collection task sent by the functional layer, and sends the collection task according to the load condition of the managed collection node; after the distribution, the corresponding acquisition node calls a corresponding acquisition plug-in according to the received acquisition task and collects the related index information of the acquired object; and then returning the obtained acquisition result to the Leader node.
The Leader node issues the acquisition task, and if the issue fails, the task is issued once again, and if the two issues fail, the task is down for the acquisition node, and the acquisition service of the acquisition node is restarted.
The functional layer is provided with an inspection knowledge base and a resource allocation base, wherein the inspection knowledge base comprises inspection indexes, an acquisition protocol, an acquisition script and an acquisition result processing script; the resource configuration library comprises equipment information of the inspection equipment.
The functional layer is provided with a patrol knowledge base management module, a patrol object management module, a patrol script management module, a patrol operation management module, a patrol task scheduling module, a patrol operation execution module, a patrol report management module and an alarm management module; the functional layer forms customized configuration and inspection tasks aiming at different configurations and inspection scenes, realizes automatic acquisition and analysis of inspection index parameters and automatic generation of inspection reports, and provides alarm processing and guidance.
The alarm management module is configured with an alarm response rule and an alarm processing rule, the alarm response rule judges whether an alarm is generated according to the inspection acquisition result, and the alarm processing rule manages a processing mode if the alarm is generated.
The execution process of the inspection task is as follows:
1) taking out the detailed equipment information of the inspection equipment from the resource configuration library;
2) generating a specific inspection object in an inspection object management module;
3) selecting a polling mode in a polling knowledge base according to the equipment type, and generating a specific polling script in a polling script management module;
4) generating the inspection operation by combining the inspection object and the inspection script in the inspection operation management module;
5) customizing a polling mode at a polling task management module according to a trigger condition to generate a polling task;
6) the polling task scheduling module carries out task scheduling and distributes the polling tasks to the polling operation execution module for execution;
7) the inspection operation execution module executes inspection, acquires equipment operation information, generates alarm information by combining an alarm processing strategy configured by the alarm management module, and generates an inspection report according to the style requirement of an inspection result in the inspection report management module.
The inspection task comprises whether an inspection index is in a normal range in a certain period, including an inspection resource index, an acquisition period and an alarm range of the index.
The invention achieves the following beneficial effects:
the invention ensures good load capacity when facing a large number of acquisition tasks by means of the coordination work among a plurality of acquisition nodes, thereby ensuring the reliability and high efficiency of the operation and maintenance process.
Drawings
FIG. 1 illustrates a software environment and a hardware environment involved in a prior art enterprise business system;
FIG. 2 is an architecture diagram of an automated operation and maintenance system based on an acquisition cluster according to the present invention;
FIG. 3 is a flow chart of the automated operation and maintenance system of the present invention executing inspection tasks;
FIG. 4 is a flow chart of the operation of the multi-node collection cluster of the present invention;
FIG. 5 is a schematic diagram of a multi-node collection cluster according to the present invention.
Detailed Description
The invention is further described below. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present invention is not limited thereby.
The invention provides an automatic operation and maintenance system based on an acquisition cluster, which comprises a display layer, a functional layer, an acquisition layer and a resource layer from top to bottom as shown in fig. 2. The display layer provides a visual operation management interface, and a user issues an operation instruction to the functional layer through the visual operation management interface. And the functional layer receives an operation instruction issued by a user, realizes the functions of various automatic operation and maintenance systems, calls various acquisition nodes through a task scheduling and executing engine, and completes operation and maintenance tasks such as automatic configuration, daily inspection, fault inspection and the like on software and hardware resources in the resource library. In the acquisition layer, a multi-node acquisition cluster formed by a plurality of acquisition nodes, that is, acquisition servers, is configured, referring to fig. 5, each acquisition node is configured with an acquisition plug-in supporting various network protocols, such as: the method comprises the steps that an SSH acquisition plug-in, a TELNET acquisition plug-in, an SNMP acquisition plug-in, a JMX acquisition plug-in, a WMI acquisition plug-in, a JDBC acquisition plug-in and an HTTP acquisition plug-in are used for acquiring acquisition tasks issued by a functional layer, leader nodes of the acquisition clusters distribute the acquisition tasks to all acquisition nodes according to load balance, the acquisition nodes call corresponding acquisition plug-ins according to task contents, data of resource layers are acquired, and acquisition results are fed back to the functional layer. The resource layer is all software and hardware resources managed by the automation operation and maintenance system, the software environment includes an operating system, a database and middleware, and the hardware environment includes a host system, a network device and other hardware devices such as a storage device, a load balancer, a security device, and the like, which is specifically shown in fig. 1.
And establishing a corresponding acquisition index library and an acquisition protocol aiming at most of common software environments and hardware environments involved in the current operation and maintenance process. For example, the host system mainly has system running time, operating system version, CPU information, memory information, process information, port information and the like for collecting indexes; the database acquisition indexes mainly comprise database health operation duration, database table space information, database user information, throughput and the like; SNMP, SSH and the like can be used as an operating system information acquisition protocol; the database information collection protocol uses JDBC. Therefore, most operation and maintenance scenes can be met, and an extensible acquisition plug-in interface is provided, so that the application range of the operation and maintenance system can be expanded conveniently when the current operation and maintenance system cannot meet the requirements.
Referring to fig. 4, the multi-node collection cluster of the present invention works as follows: firstly, selecting a Leader node which has stronger stability and can work for a long time according to a selection rule; then, the Leader node directly receives the acquisition task sent by the functional layer, issues the acquisition task according to the load condition of the managed common acquisition node, and if the issue fails, the acquisition task is issued once again, and if the two issues fail, the acquisition node of the task goes down, and the acquisition service of the acquisition node is restarted; after the distribution, the corresponding acquisition node calls a corresponding acquisition plug-in according to the received acquisition task and collects the related index information of the acquired object; and then, each common acquisition node returns the acquired acquisition result to the Leader node, and the Leader node is displayed by the operation management interface after the functional layer analysis processing.
In the functional layer, according to the experience and method accumulated in the acquisition process of the software and hardware system in the operation and maintenance process, the invention constructs the inspection knowledge base. The system comprises a resource layer system, patrol indexes (such as CPU, memory, disk space and the like) of equipment, acquisition protocols (SSH, TELNET, SNMP, JMX, WMI, JDBC, HTTP and the like), acquisition scripts (Shell/Perl), acquisition result processing scripts and the like. And forming customized configuration and inspection plans aiming at different configuration and inspection scenes, realizing automatic acquisition and analysis of index parameters and automatic generation of inspection reports, and providing alarm processing and guidance. The inspection knowledge base can provide a universal inspection method for a type of inspection plan, for example, relevant index information of the oracle database can be acquired by a JDBC protocol, and CPU and memory information of a host can be acquired by SNMP and SSH protocols.
Fig. 3 is a task scheduling flowchart of the automated operation and maintenance system executing the inspection task, wherein a square is a system function module related to the flow, a column is database data related to the flow, and a waveform diagram represents intermediate information generated and used by the flow. Referring to fig. 3, a patrol knowledge base management module, a patrol object management module, a patrol script management module, a patrol operation management module, a patrol task scheduling module, a patrol operation execution module, a patrol report management module and an alarm management module are configured. The database comprises a resource configuration base and an inspection knowledge base.
In the process of one-time task scheduling, equipment detailed information of the inspection equipment is taken out from the resource configuration library, a specific inspection object is generated in the inspection object management module, and the inspection object comprises equipment inspection concerned information such as equipment type, equipment brand, equipment address, management port and the like. And selecting a proper inspection mode in the inspection knowledge base according to the equipment type, so that a specific inspection script is generated by the inspection script management module and comprises a remote connection script, a database drive, a connection string and the like. And generating the inspection operation by combining the inspection object and the inspection script in the inspection operation management module. The polling task management module customizes polling modes according to trigger conditions, such as generating polling tasks of types of timing polling, periodic polling and the like, wherein one polling task comprises whether a certain resource index is in a normal range in a certain period, including polling resource indexes, acquisition periods, alarm ranges of the indexes and the like. And in the face of various inspection tasks, the inspection task scheduling module performs task scheduling, and distributes the inspection tasks to the inspection operation execution module for execution. And executing the inspection to acquire the equipment operation information, generating alarm information by combining with an alarm processing strategy configured by an alarm management module, and generating an inspection report according to the style requirement on the inspection result in an inspection report management module.
The alarm management module is configured with an alarm response rule and an alarm processing rule, the alarm response rule judges whether an alarm is generated according to the inspection acquisition result, and the alarm processing rule manages the processing modes if the alarm is generated, such as alarm ignoring, mail alarm, short message alarm lamp and the like. And meanwhile, according to the acquisition result and the alarm condition, regularly making a report form, and generating an inspection report for maintenance work.
The function layer is also configured with the functions of an acquisition process management module, including: the module can select a proper acquisition protocol according to an acquisition object, configure acquisition parameters required by the protocol and start an acquisition process. And carrying out planned control on the acquisition tasks customized by the acquisition process management module in the customized inspection plan module, making an inspection plan meeting the requirements by customizing corresponding inspection resource indexes, acquisition periods, index alarm ranges and the like, and periodically inspecting and acquiring the operation and maintenance resources according to the plan.
The above description is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, several modifications and variations can be made without departing from the technical principle of the present invention, and these modifications and variations should also be regarded as the protection scope of the present invention.

Claims (7)

1. An automatic operation and maintenance system based on an acquisition cluster is characterized by comprising a presentation layer, a functional layer, an acquisition layer and a resource layer; the display layer provides a visual operation management interface, and a user issues an operation instruction to the functional layer through the visual operation management interface; the functional layer receives an operation instruction issued by a user, calls various acquisition nodes, and completes automatic configuration, daily inspection and fault check on software and hardware resources of the resource layer; the acquisition layer is provided with a multi-node acquisition cluster consisting of a plurality of acquisition nodes, the multi-node acquisition cluster receives an acquisition task issued by the functional layer, acquires data of the resource layer and feeds back an acquisition result to the functional layer; the resource layer is all software and hardware resources managed by the automatic operation and maintenance system;
the functional layer is configured with an inspection knowledge base and a resource configuration base, wherein the inspection knowledge base comprises inspection indexes, an acquisition protocol, an acquisition script and an acquisition result processing script; the resource configuration library comprises equipment information of the inspection equipment;
the functional layer is provided with a patrol knowledge base management module, a patrol object management module, a patrol script management module, a patrol operation management module, a patrol task scheduling module, a patrol operation execution module, a patrol report management module and an alarm management module; the functional layer forms customized configuration and inspection tasks aiming at different configurations and inspection scenes, realizes automatic acquisition and analysis of inspection index parameters and automatic generation of inspection reports, and provides alarm processing and guidance;
the execution process of the inspection task is as follows:
1) taking out the detailed equipment information of the inspection equipment from the resource configuration library;
2) generating a specific inspection object in an inspection object management module;
3) selecting a polling mode in a polling knowledge base according to the equipment type, and generating a specific polling script in a polling script management module;
4) generating the inspection operation by combining the inspection object and the inspection script in the inspection operation management module;
5) customizing a polling mode at a polling task management module according to a trigger condition to generate a polling task;
6) the polling task scheduling module carries out task scheduling and distributes the polling tasks to the polling operation execution module for execution;
7) the inspection operation execution module executes inspection, acquires equipment operation information, generates alarm information by combining an alarm processing strategy configured by the alarm management module, and generates an inspection report according to the style requirement of an inspection result in the inspection report management module.
2. The automated operation and maintenance system based on collection clusters according to claim 1, wherein each collection node is configured with collection plug-ins supporting various network protocols, including SSH collection plug-ins, TELNET collection plug-ins, SNMP collection plug-ins, JMX collection plug-ins, WMI collection plug-ins, JDBC collection plug-ins, and HTTP collection plug-ins.
3. The collection cluster-based automated operation and maintenance system according to claim 1, wherein establishing corresponding collection indexes and collection protocols for all software and hardware resources managed by the automated operation and maintenance system comprises: the method comprises the steps that a host system collects index system running time, operating system versions, CPU information, memory information, process information and port information; the database acquisition index comprises database health operation duration, database table space information, database user information and throughput; SNMP or SSH is used as an operating system information acquisition protocol; the database information collection protocol uses JDBC.
4. The automated operation and maintenance system based on the collection cluster according to claim 1, wherein the multi-node collection cluster elects a Leader node according to the election rule, the Leader node directly receives the collection task sent by the functional layer, and issues the collection task according to the load condition of the managed collection node; after the distribution, the corresponding acquisition node calls a corresponding acquisition plug-in according to the received acquisition task and collects the related index information of the acquired object; and then returning the obtained acquisition result to the Leader node.
5. The automated operation and maintenance system based on collection clusters according to claim 4, wherein the Leader node issues the collection task, and if the issue fails, the Leader node issues the collection task once again, and if the two issues fail, the collection node is down and the collection service of the collection node is restarted.
6. The automated operation and maintenance system based on collection clusters according to claim 1, wherein the alarm management module configures an alarm response rule and an alarm processing rule, the alarm response rule judges whether an alarm is generated according to the inspection collection result, and the alarm processing rule manages a processing mode if the alarm is generated.
7. The automated operation and maintenance system based on collection cluster according to claim 1, wherein a patrol task includes whether a patrol index is in a normal range in a period, including patrol resource index, collection period and alarm range of index.
CN201811589080.3A 2018-12-25 2018-12-25 Automatic operation and maintenance system based on collection cluster Active CN109857613B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811589080.3A CN109857613B (en) 2018-12-25 2018-12-25 Automatic operation and maintenance system based on collection cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811589080.3A CN109857613B (en) 2018-12-25 2018-12-25 Automatic operation and maintenance system based on collection cluster

Publications (2)

Publication Number Publication Date
CN109857613A CN109857613A (en) 2019-06-07
CN109857613B true CN109857613B (en) 2021-10-08

Family

ID=66892193

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811589080.3A Active CN109857613B (en) 2018-12-25 2018-12-25 Automatic operation and maintenance system based on collection cluster

Country Status (1)

Country Link
CN (1) CN109857613B (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110430081A (en) * 2019-08-13 2019-11-08 北京市天元网络技术股份有限公司 The intelligent method for inspecting and device of automatic editing based on instruction
CN110535710A (en) * 2019-09-09 2019-12-03 锐捷网络股份有限公司 Remote diagnosis method and system, the network equipment and Cloud Server of the network equipment
CN111026602A (en) * 2019-10-22 2020-04-17 烽火通信科技股份有限公司 Health inspection scheduling management method and device of cloud platform and electronic equipment
CN111241447A (en) * 2020-01-13 2020-06-05 浙江省北大信息技术高等研究院 Webpage data acquisition method, system and storage medium
CN111541575B (en) * 2020-04-30 2023-06-09 重庆富民银行股份有限公司 Automatic inspection method and system for closed source network equipment
CN111654411A (en) * 2020-05-14 2020-09-11 山东中创软件商用中间件股份有限公司 Inspection method and related device
CN112000310A (en) * 2020-07-01 2020-11-27 南方电网数字电网研究院有限公司 Electric power automation operation and maintenance system
CN112416902A (en) * 2020-11-20 2021-02-26 上海新炬网络信息技术股份有限公司 One-key inspection method for host and database
CN114765571A (en) * 2021-01-14 2022-07-19 中国石油天然气股份有限公司 Management method and device for operation and maintenance of data center and electronic equipment
CN113052993A (en) * 2021-03-16 2021-06-29 中国工商银行股份有限公司 Fault inspection method and device based on alarm information linkage
CN112990744B (en) * 2021-03-30 2022-07-12 杭州东方通信软件技术有限公司 Automatic operation and maintenance method and device for massive million-level cloud equipment
CN113051147A (en) * 2021-04-25 2021-06-29 中国建设银行股份有限公司 Database cluster monitoring method, device, system and equipment
CN113849244B (en) * 2021-09-13 2024-01-12 广州咨元信息科技有限公司 Configuration method for realizing custom service scene based on orchestration engine
CN117389843B (en) * 2023-12-13 2024-04-09 广州嘉为科技有限公司 Intelligent operation and maintenance system, method, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7624170B2 (en) * 2002-09-26 2009-11-24 International Business Machines Corporation Integrated storage appliance
CN104036025A (en) * 2014-06-27 2014-09-10 蓝盾信息安全技术有限公司 Distribution-base mass log collection system
CN108512691A (en) * 2018-02-07 2018-09-07 复旦大学 Cloud automatic early-warning O&M monitoring system based on Hadoop
CN108959037A (en) * 2018-07-13 2018-12-07 山东汇贸电子口岸有限公司 A kind of data center's automatic detecting method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104699759B (en) * 2015-02-10 2018-05-15 上海新炬网络信息技术股份有限公司 A kind of data base automatic operation and maintenance method
CN105490833A (en) * 2015-11-19 2016-04-13 国家电网公司 Inspection system and method of information system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7624170B2 (en) * 2002-09-26 2009-11-24 International Business Machines Corporation Integrated storage appliance
CN104036025A (en) * 2014-06-27 2014-09-10 蓝盾信息安全技术有限公司 Distribution-base mass log collection system
CN108512691A (en) * 2018-02-07 2018-09-07 复旦大学 Cloud automatic early-warning O&M monitoring system based on Hadoop
CN108959037A (en) * 2018-07-13 2018-12-07 山东汇贸电子口岸有限公司 A kind of data center's automatic detecting method and device

Also Published As

Publication number Publication date
CN109857613A (en) 2019-06-07

Similar Documents

Publication Publication Date Title
CN109857613B (en) Automatic operation and maintenance system based on collection cluster
CN106487596B (en) Distributed service tracking implementation method
US8516106B2 (en) Use tag clouds to visualize components related to an event
US6269396B1 (en) Method and platform for interfacing between application programs performing telecommunications functions and an operating system
CN104699759B (en) A kind of data base automatic operation and maintenance method
WO2020147419A1 (en) Monitoring method and apparatus, computer device and storage medium
US11012305B2 (en) Systems and methods for service mapping
US9419917B2 (en) System and method of semantically modelling and monitoring applications and software architecture hosted by an IaaS provider
CN110278097B (en) Server operation and maintenance system and method based on Android system
CN102868736B (en) A kind of cloud computing Monitoring framework design basis ground motion method and cloud computing treatment facility
US20060074946A1 (en) Point of view distributed agent methodology for network management
CN104731580A (en) Automation operation and maintenance system based on Karaf and ActiveMQ and implement method thereof
CN110046041B (en) Data acquisition method based on battery scheduling framework
Trihinas et al. Monitoring elastically adaptive multi-cloud services
US7251588B2 (en) System for metric introspection in monitoring sources
KR20130019366A (en) Efficiently collecting transction-separated metrics in a distributed enviornment
CN111026602A (en) Health inspection scheduling management method and device of cloud platform and electronic equipment
CN110874272A (en) Resource allocation method and device, computer readable storage medium and electronic device
WO2015192664A1 (en) Device monitoring method and apparatus
WO2018010176A1 (en) Method and device for acquiring fault information
US10122602B1 (en) Distributed system infrastructure testing
CN114244676A (en) Intelligent IT integrated gateway system
US9218205B2 (en) Resource management in ephemeral environments
CN107515864B (en) Method and equipment for monitoring workflow
Rathore et al. Maintaining SmartX multi‐view visibility for OF@ TEIN+ distributed cloud‐native edge boxes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant