CN111181775A - Integrated operation and maintenance management alarm method based on automatic host asset discovery - Google Patents

Integrated operation and maintenance management alarm method based on automatic host asset discovery Download PDF

Info

Publication number
CN111181775A
CN111181775A CN201911298627.9A CN201911298627A CN111181775A CN 111181775 A CN111181775 A CN 111181775A CN 201911298627 A CN201911298627 A CN 201911298627A CN 111181775 A CN111181775 A CN 111181775A
Authority
CN
China
Prior art keywords
asset
monitoring
alarm
maintenance
automatic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911298627.9A
Other languages
Chinese (zh)
Other versions
CN111181775B (en
Inventor
刘超
范渊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DBAPPSecurity Co Ltd
Original Assignee
DBAPPSecurity Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DBAPPSecurity Co Ltd filed Critical DBAPPSecurity Co Ltd
Priority to CN201911298627.9A priority Critical patent/CN111181775B/en
Publication of CN111181775A publication Critical patent/CN111181775A/en
Application granted granted Critical
Publication of CN111181775B publication Critical patent/CN111181775B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/087Inventory or stock management, e.g. order filling, procurement or balancing against orders
    • G06Q10/0875Itemisation or classification of parts, supplies or services, e.g. bill of materials
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Abstract

The invention provides an integrated operation and maintenance management alarm method based on automatic host asset discovery, which comprises the following steps: firstly, the method comprises the following steps: analysis detection of the flow engine; II, secondly: receiving and warehousing by the asset platform; thirdly, the method comprises the following steps: and (4) automatic operation and maintenance asset updating and monitoring alarm flow design. The invention only needs to utilize an asset platform to realize linkage and keep high identification of the flow engine, thus reducing most of daily operation and maintenance work and leaving a large amount of time without processing the complex and repeated work and time-consuming work. By using the method, higher working efficiency can be kept, and the working treatment is realized on one platform.

Description

Integrated operation and maintenance management alarm method based on automatic host asset discovery
Technical Field
The invention relates to an operation and maintenance management method, in particular to an integrated operation and maintenance management alarm method based on automatic host asset discovery.
Background
The high-speed development of the internet, the increase of host assets caused by the diversity of an information system and a service platform, and the loss and omission in the handover process of the host assets in the past cause great troubles to operation and maintenance personnel. A large amount of time and energy are consumed in the asset combing process, the assets of the flow detection host are put in storage and then checked, more accurate asset information and asset lists are provided, real-time change of the assets reduces information errors caused by untimely updating, the assets put in storage are automatically operated and maintained by an automatic operation and maintenance process, the tagged hosts are listed in the monitoring list, automatic distribution monitoring is carried out, alarm information is added, and automatic alarm recovery operation is achieved. Therefore, real-time operation and maintenance monitoring of assets can be achieved, the working pressure of operation and maintenance personnel is relieved, and misoperation is reduced.
Various companies and service platforms have host assets for providing services at places where network services are needed, network services are more and more at present, host assets for providing services are also more and more huge, and the most important problem of host operation and maintenance personnel is how to manage the host assets to ensure that the assets are not lost and the services are not affected. The invention mainly solves the adverse consequences caused by the problems of manpower negligence, alarm processing negligence and the like, realizes automatic inspection, monitoring and alarm recovery and reduces the artificial influence.
In the existing operation and maintenance management mode, assets and automatic operation and maintenance are separated, the asset management still depends on manual modification, the risk of human errors exists, the operation and maintenance platform is not automatically butted with the asset platform, the operation and maintenance platform is limited to the existing asset management, and comparison and modification of newly added assets and changed assets cannot be achieved.
Accordingly, there is a need for improvements in the art.
Disclosure of Invention
The invention aims to provide an efficient integrated operation and maintenance management alarm method based on automatic host asset discovery.
In order to solve the technical problem, the invention provides an integrated operation and maintenance management alarm method based on automatic host asset discovery, which comprises the following steps:
firstly, the method comprises the following steps: analysis detection of the flow engine;
II, secondly: receiving and warehousing by the asset platform;
thirdly, the method comprises the following steps: and (4) automatic operation and maintenance asset updating and monitoring alarm flow design.
As an improvement of the integrated operation and maintenance management alarm method based on automatic host asset discovery of the invention:
the first step comprises the following steps:
port mirroring is carried out on the inlet and outlet flow to a server bearing a flow engine, the flow engine carries out message analysis on the mirrored flow to obtain a source address and a destination address, the obtained addresses are compared to remove duplication, and a final result is obtained and then is recorded into a database of the flow engine.
As a further improvement of the integrated operation and maintenance management alarm method based on automatic host asset discovery of the invention:
the second step comprises the following steps:
after the flow engine stores the data in the warehouse, sending a receiving message to the asset platform, and the asset platform starts to record assets after receiving the message; and the asset platform installs the recorded assets into an internal network and an external network for classification, and updates the original asset list to obtain a new asset list.
As a further improvement of the integrated operation and maintenance management alarm method based on automatic host asset discovery of the invention:
the third step comprises the following steps:
1) after monitoring a new asset list is added in the automatic operation and maintenance module, the automatic operation and maintenance module sends an adding message to the flow design module;
2) the flow design module sends monitoring information to the monitoring alarm module according to the adding message;
3) the monitoring alarm module acquires monitoring data for the corresponding assets in the new asset list according to the monitoring information;
4) and when the monitoring data is equal to the threshold value, an alarm is triggered, and the monitoring module sends alarm information to the user.
As a further improvement of the integrated operation and maintenance management alarm method based on automatic host asset discovery of the invention:
in step three: the monitoring module tries to recover the alarm according to the recovery method, the recovery success informs the user of the recovery success, and if the recovery failure informs the user of the manual troubleshooting as soon as possible.
The integrated operation and maintenance management alarm method based on automatic host asset discovery has the technical advantages that:
the invention only needs to utilize an asset platform to realize linkage and keep high identification of the flow engine, thus reducing most of daily operation and maintenance work and leaving a large amount of time without processing the complex and repeated work and time-consuming work. By using the method, higher working efficiency can be kept, and the working treatment is realized on one platform.
Drawings
The following describes embodiments of the present invention in further detail with reference to the accompanying drawings.
Fig. 1 shows the contents of a traffic message obtained by the traffic engine in step one;
analyzing the message through an engine to obtain a source address and a destination address in the message;
FIG. 2 is a schematic diagram of the process of receiving and warehousing assets by the asset platform in step two;
the flow engine transmits the acquired host asset addresses (source address and destination address) to the asset platform, and after the asset platform receives the data, the received assets are identified according to the asset identification information configured by the platform;
FIG. 3 is a schematic flow chart of an integrated operation and maintenance management alarm method based on automatic host asset discovery according to the present invention;
FIG. 4 is a diagram illustrating an example of the configuration of the asset platform classifying asset information in step two;
FIG. 5 is an asset warehousing reference schematic;
FIG. 6 is a reference schematic diagram of a new process;
FIG. 7 is a schematic view of a node management reference;
FIG. 8 is a schematic diagram of batch upgrade force reference;
fig. 9 is a reference diagram of the creation flow.
Detailed Description
The invention will be further described with reference to specific examples, but the scope of the invention is not limited thereto.
Embodiment 1, an integrated operation and maintenance management alarm method based on automatic host asset discovery, as shown in fig. 1-3, includes three steps:
firstly, the method comprises the following steps: analysis detection of the flow engine;
port mirroring is carried out on the inlet and outlet flow to a server bearing a flow engine, the flow engine carries out message analysis on the mirrored flow to obtain a source address and a destination address, duplicate removal verification is carried out on the obtained source address and destination address to obtain final host assets, and the verified host assets (source address and destination address) are recorded into a database of the host assets to finish warehousing processing.
II, secondly: receiving and warehousing by the asset platform;
and after receiving the marking information, the asset platform starts to actively synchronize the flow engine to record the asset information (host assets) which is put in storage. And after the platform synchronization data is completed, classifying and warehousing the asset information synchronized by the platform according to the asset attribution information set by the platform, and updating the original asset list to obtain a new asset list. The asset platform is provided with a password library, the authority of the password library is only owned by super management, the asset platform acquires asset information by combining the password library and an asset list, and the detailed information is as follows: a memory; a cpu; a hard disk; a server model; a cpu model number; operating system information.
Specific asset platform operation process fig. 2;
after the asset platform finishes synchronously classifying assets, the asset platform tries to connect host assets by using commands, and the account and the user name are stored in the password library, wherein the account and the user name are formed according to internal employee use habits and default passwords, the process of trying to connect the host assets is a library collision operation, and the connected normal host assets are distributed into a set resource pool, and the effect is shown in fig. 5.
The asset platform calls the password library after receiving the asset data, information cannot be sent, the password library is independent and not affected, and the modification right and the content viewing right are owned by only an administrator.
Thirdly, the method comprises the following steps: an automatic operation and maintenance asset updating and monitoring alarm flow design module;
the asset platform generates a new asset list and then calls a monitoring interface to monitor, an automatic operation and maintenance module is connected into the asset platform and combined with the new asset list to achieve the function of managing all assets, a series of automatic upgrading, updating and software deploying operations are completed by using a process design module, a monitoring alarm agent end is deployed in the assets by using an automatic deploying process, resource monitoring and basic service monitoring are configured, and an alarm recovery process is compiled to achieve automatic alarm recovery. The automatic operation and maintenance module is used for importing the host assets into the node management through adding the process into the process template to install the agent client, and after the agent client is installed, the process template is used for issuing updating, upgrading and software deploying operation. A series of monitoring and deployment operations can be added in the automatic operation and maintenance module, the automatic operation and maintenance module sends the added information to the process design module after monitoring a certain service of the host asset, the process design module automatically establishes a monitoring process according to tasks issued by the automatic operation and maintenance module, sends the information back to the automatic operation and maintenance module and the monitoring and warning module, establishes a detailed monitoring and warning mechanism for monitoring and warning, and sends monitoring data back to the automatic operation and maintenance module for display.
The process design module issues tasks according to the automatic operation and maintenance module, the automatic operation and maintenance module generates an operation flow chart after issuing the tasks, the operation and maintenance steps are decomposed into single commands, a shell script is generated at the same time, the information is sent to the process design module through the automatic operation and maintenance module, the process design module arranges the process according to the information sent by the operation and maintenance module, the corresponding shell script is inserted into the corresponding process step, and the task is sent to the automatic operation and maintenance module after the process arrangement is completed. If the monitoring task is performed, the process design module sends the established steps to the monitoring module, and the monitoring module analyzes the process steps to establish corresponding monitoring.
Example (c): monitoring the use condition of a host memory, designing and adjusting a flow after a task is issued, firstly, sending a monitoring key value and configuration content of acquired memory information to the host, then setting data updating frequency, setting a monitoring alarm threshold value, sending a flow notification to a monitoring module after the flow arrangement is finished, and then setting corresponding monitoring by the monitoring module through the flow.
The monitoring and alarming module receives detailed monitoring information sent by an operation and maintenance platform, a monitoring task is newly established through the step of the process design module, a monitoring prototype of the monitoring task is monitored by referring to zabbix, the monitoring task obtains monitoring data through an agent terminal, meanwhile, a trigger is provided, an alarm is triggered when the monitoring data is equal to a threshold value set by the trigger, the monitoring module sends the alarm information to a user through the existing task information, after the alarm is triggered, the next process is executed, a recovery method is required to be brought when the task is issued, the monitoring module tries to recover the alarm according to the recovery methods, the user is informed of successful recovery if the recovery is successful, and the user is informed of manual troubleshooting as soon as possible if the recovery is failed.
The operation and maintenance module and the password library need manual maintenance, the process design and monitoring alarm module has two modes of automatic adjustment and manual intervention, data interaction is carried out between the modules, and message header state information is sent through kafka to change the state.
The process design module is an important ring in automatic operation and maintenance monitoring, the process design module is strict in requirement, no logic error exists, a remedy or interruption measure is provided for an intermediate process error report, the process design module can liberate most of manpower, and the asset movement can be correctly mastered only by inspecting the operation condition of the process.
As shown in fig. 8, the tasks are issued in batches, according to the upgrade flow adding step, the basic attribute setting of the clicked flow box can be performed to set the type, the name, the creation parameter, the setting parameter type, the prototype of the flow design module is the flow of program operation, and the flow is created according to the operation step of the program. As shown in fig. 9.
The operation flow chart of the whole system is shown in fig. 3.
Finally, it is also noted that the above-mentioned lists merely illustrate a few specific embodiments of the invention. It is obvious that the invention is not limited to the above embodiments, but that many variations are possible. All modifications which can be derived or suggested by a person skilled in the art from the disclosure of the present invention are to be considered within the scope of the invention.

Claims (5)

1. An integrated operation and maintenance management alarm method based on automatic host asset discovery is characterized in that: the method comprises the following steps:
firstly, the method comprises the following steps: analysis detection of the flow engine;
II, secondly: receiving and warehousing by the asset platform;
thirdly, the method comprises the following steps: and (4) automatic operation and maintenance asset updating and monitoring alarm flow design.
2. The integrated operation and maintenance management alarm method based on automatic host asset discovery according to claim 1, wherein:
the first step comprises the following steps:
port mirroring is carried out on the inlet and outlet flow to a server bearing a flow engine, the flow engine carries out message analysis on the mirrored flow to obtain a source address and a destination address, the obtained addresses are compared to remove duplication, and a final result is obtained and then is recorded into a database of the flow engine.
3. The integrated operation and maintenance management alarm method based on automatic host asset discovery according to claim 2, wherein:
the second step comprises the following steps:
after the flow engine stores the data in the warehouse, sending a receiving message to the asset platform, and the asset platform starts to record assets after receiving the message; and updating the original asset list to obtain a new asset list.
4. The integrated operation and maintenance management alarm method based on automatic host asset discovery according to claim 3, wherein:
the third step comprises the following steps:
1) after monitoring a new asset list is added in the automatic operation and maintenance module, the automatic operation and maintenance module sends an adding message to the flow design module;
2) the flow design module sends monitoring information to the monitoring alarm module according to the adding message;
3) the monitoring alarm module acquires monitoring data for the corresponding assets in the new asset list according to the monitoring information;
4) and when the monitoring data is equal to the threshold value, an alarm is triggered, and the monitoring module sends alarm information to the user.
5. The integrated operation and maintenance management alarm method based on automatic host asset discovery according to claim 4, wherein:
in step three: the monitoring module tries to recover the alarm according to the recovery method, the recovery success informs the user of the recovery success, and if the recovery failure informs the user of the manual troubleshooting as soon as possible.
CN201911298627.9A 2019-12-17 2019-12-17 Integrated operation and maintenance management alarm method based on automatic host asset discovery Active CN111181775B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911298627.9A CN111181775B (en) 2019-12-17 2019-12-17 Integrated operation and maintenance management alarm method based on automatic host asset discovery

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911298627.9A CN111181775B (en) 2019-12-17 2019-12-17 Integrated operation and maintenance management alarm method based on automatic host asset discovery

Publications (2)

Publication Number Publication Date
CN111181775A true CN111181775A (en) 2020-05-19
CN111181775B CN111181775B (en) 2023-01-31

Family

ID=70657366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911298627.9A Active CN111181775B (en) 2019-12-17 2019-12-17 Integrated operation and maintenance management alarm method based on automatic host asset discovery

Country Status (1)

Country Link
CN (1) CN111181775B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111722986A (en) * 2020-07-24 2020-09-29 杭州迪普科技股份有限公司 Software performance monitoring method and device
CN114595848A (en) * 2022-04-29 2022-06-07 武汉四通信息服务有限公司 Equipment supervision method and device
CN115840951A (en) * 2022-11-02 2023-03-24 长扬科技(北京)股份有限公司 Method and system for realizing network security based on full-flow asset discovery

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090327102A1 (en) * 2007-03-23 2009-12-31 Jatin Maniar System and method for providing real time asset visibility
CN104506348A (en) * 2014-12-12 2015-04-08 上海新炬网络信息技术有限公司 Method for automatically discovering and configuring monitoring object
CN107862392A (en) * 2017-10-23 2018-03-30 珠海许继芝电网自动化有限公司 A kind of Unit account of plant management-control method based on power distribution network intelligence O&M control platform
CN110083503A (en) * 2019-03-27 2019-08-02 上海德衡数据科技有限公司 Knowledge base information sensing method based on data center's O&M
CN110311931A (en) * 2019-08-02 2019-10-08 杭州安恒信息技术股份有限公司 Assets automatic discovering method and device
CN110413485A (en) * 2019-08-02 2019-11-05 上海数讯信息技术有限公司 A kind of one-stop Networked Control and Management System and method for based on Zabbix Open Source Platform

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090327102A1 (en) * 2007-03-23 2009-12-31 Jatin Maniar System and method for providing real time asset visibility
CN104506348A (en) * 2014-12-12 2015-04-08 上海新炬网络信息技术有限公司 Method for automatically discovering and configuring monitoring object
CN107862392A (en) * 2017-10-23 2018-03-30 珠海许继芝电网自动化有限公司 A kind of Unit account of plant management-control method based on power distribution network intelligence O&M control platform
CN110083503A (en) * 2019-03-27 2019-08-02 上海德衡数据科技有限公司 Knowledge base information sensing method based on data center's O&M
CN110311931A (en) * 2019-08-02 2019-10-08 杭州安恒信息技术股份有限公司 Assets automatic discovering method and device
CN110413485A (en) * 2019-08-02 2019-11-05 上海数讯信息技术有限公司 A kind of one-stop Networked Control and Management System and method for based on Zabbix Open Source Platform

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111722986A (en) * 2020-07-24 2020-09-29 杭州迪普科技股份有限公司 Software performance monitoring method and device
CN114595848A (en) * 2022-04-29 2022-06-07 武汉四通信息服务有限公司 Equipment supervision method and device
CN115840951A (en) * 2022-11-02 2023-03-24 长扬科技(北京)股份有限公司 Method and system for realizing network security based on full-flow asset discovery
CN115840951B (en) * 2022-11-02 2024-02-13 长扬科技(北京)股份有限公司 Method and system for realizing network security based on full-flow asset discovery

Also Published As

Publication number Publication date
CN111181775B (en) 2023-01-31

Similar Documents

Publication Publication Date Title
CN110647580B (en) Distributed container cluster mirror image management main node, slave node, system and method
CN111181775B (en) Integrated operation and maintenance management alarm method based on automatic host asset discovery
US9940373B2 (en) Method and system for implementing an operating system hook in a log analytics system
US6792456B1 (en) Systems and methods for authoring and executing operational policies that use event rates
CN100449548C (en) Method and system for synchronizing data base
US7587483B1 (en) System and method for managing computer networks
US20120166876A1 (en) Application integration testing
US20070282470A1 (en) Method and system for capturing and reusing intellectual capital in IT management
CN110088744B (en) Database maintenance method and system
CN104679574A (en) Virtual machine image management system in cloud computing
CN107800783B (en) Method and device for remotely monitoring server
CN110063042B (en) Database fault response method and terminal thereof
CN110971464A (en) Operation and maintenance automatic system suitable for disaster recovery center
US7512675B2 (en) Cleaning and removing duplicated unique identifiers from remote network nodes
US8244644B2 (en) Supply chain multi-dimensional serial containment process
CN112650688A (en) Automated regression testing method, associated device and computer program product
CN103026337B (en) The extraction of dispensing assembly and reconstruct
EP3514680B1 (en) Identification of changes in functional behavior and runtime behavior of a system during maintenance cycles
JP2003216457A (en) Error log collecting and analyzing agent system
CN109284331B (en) Certificate making information acquisition method based on service data resources, terminal equipment and medium
CN115567392B (en) Automatic deployment upgrading method for customer internal service system
KR100496958B1 (en) System hindrance integration management method
CN109656740A (en) A method of supporting timeout treatment task flow
US20040249828A1 (en) Automated infrastructure audit system
CN112988220A (en) Application configuration updating method and device, storage medium and server

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant