CN110557281B - Intelligent operation and maintenance method and device based on CMDB and alarm map - Google Patents

Info

Publication number
CN110557281B
Authority
CN
China
Prior art keywords
maintenance
alarm
area
strategy
intelligent
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910771332.2A
Other languages
Chinese (zh)
Other versions
CN110557281A (en)
Inventor
初宇飞
常成洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing MetarNet Technologies Co Ltd
Original Assignee
Beijing MetarNet Technologies Co Ltd
Application filed by Beijing MetarNet Technologies Co Ltd
Priority to CN201910771332.2A
Publication of CN110557281A
Application granted
Publication of CN110557281B

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08 Configuration management of networks or network elements
    • H04L41/0803 Configuration setting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The disclosure relates to an intelligent operation and maintenance method and device, electronic equipment, and a storage medium based on a CMDB and an alarm map. The method comprises the following steps: configuring the isolation region division of a dynamic cache region, the data communication rules among the dynamic cache region, an operation and maintenance strategy area, and an intelligent analysis area, and the data synchronization rules with an external system; synchronizing the configuration and resource change information of the IT environment from the configuration management database (CMDB) to each isolation region according to a preset dimension; storing a fault experience library, a manufacturer instruction library, and an alarm association rule library, and providing operation and maintenance strategy support for the intelligent analysis area; and synchronizing the alarm information of the network operation support system (OSS) in real time and, in combination with the information in the dynamic cache region and the operation and maintenance strategy area, generating a panoramic alarm map and a processing strategy group according to a preset algorithm, thereby realizing intelligent operation and maintenance of the data center. The intelligent operation and maintenance technology based on the CMDB and the alarm map supports the convergence of dedicated hardware and general-purpose hardware.

Description

Intelligent operation and maintenance method and device based on CMDB and alarm map
Technical Field
The present disclosure relates to the field of communications, and in particular to an intelligent operation and maintenance method and device based on a CMDB and an alarm map, an electronic device, and a computer-readable storage medium.
Background
With the development of Network Function Virtualization (NFV) technology, the communication network equipment of telecom operators is gradually moving from dedicated hardware (supplied by different equipment manufacturers) to general-purpose hardware (general-purpose servers based on the x86 architecture) deployed in the cloud. Traditional operation and maintenance functions such as alarm processing, fault diagnosis, and self-healing are built on CT (communication technology) operation and maintenance means, which cannot move further toward intelligent operation. An intelligent operation and maintenance technology that incorporates IT (information technology) concepts is therefore needed; that is, the traditional CT operation and maintenance system needs to become a CT + IT operation and maintenance system.
Accordingly, there is a need for one or more methods to address the above-mentioned problems.
It is to be noted that the information disclosed in the above background section is only for enhancement of understanding of the background of the present disclosure, and thus may include information that does not constitute prior art known to those of ordinary skill in the art.
Disclosure of Invention
An object of the present disclosure is to provide an intelligent operation and maintenance method and device, an electronic device, and a computer-readable storage medium based on a CMDB and an alarm map, thereby overcoming, at least to some extent, one or more of the problems caused by the limitations and disadvantages of the related art.
According to one aspect of the disclosure, an intelligent operation and maintenance method based on a CMDB and an alarm map is provided, which includes:
a resource configuration step of configuring the isolation region division of the dynamic cache region, configuring data communication rules among the dynamic cache region, the operation and maintenance strategy area, and the intelligent analysis area, and configuring data synchronization rules with an external system;
a dynamic caching step of synchronizing the configuration and resource change information of the IT environment from the configuration management database to each isolation region according to a preset dimension;
an operation and maintenance strategy support step of storing a fault experience library, a manufacturer instruction library, and an alarm association rule library, and providing operation and maintenance strategy support for the intelligent analysis area;
and an intelligent analysis step of synchronizing the alarm information of the network operation support system (OSS) in real time and, in combination with the information in the dynamic cache region and the operation and maintenance strategy area, generating a panoramic alarm map and a processing strategy group according to a preset algorithm, thereby realizing intelligent operation and maintenance of the data center.
In an exemplary embodiment of the present disclosure, the resource configuring step further includes:
dividing the dynamic cache region into isolation regions according to the space size, or according to the attributes of the resources, the configuration change time periods, and the IT infrastructure types to be collected.
In an exemplary embodiment of the present disclosure, the dynamic caching step further includes:
the dynamic cache region comprises a plurality of isolation regions, which are manually divided in advance through the configuration center.
In an exemplary embodiment of the present disclosure, the operation and maintenance policy support step further includes:
the fault experience library contains the existing alarm processing strategies;
the fault experience library and the alarm association rule library are initially built and imported from expert experience and historical experience data;
the manufacturer instruction library is built from the instructions provided by each communication equipment manufacturer, which are compiled into a unified mapping relationship within the library.
In an exemplary embodiment of the present disclosure, the intelligent analysis step further includes:
the intelligent analysis area interfaces with the network operation support system to exchange actions such as alarming and automatic fault handling; it interfaces with the dynamic cache region to obtain the latest resource and configuration data corresponding to a fault; and it interfaces with the operation and maintenance strategy area to retrieve historical experience, alarm association rules, and batch processing instruction templates.
In an exemplary embodiment of the present disclosure, the intelligent analysis step further includes:
when alarm information is transmitted from the OSS system to the intelligent analysis area, generating a panoramic alarm map from the dynamic resource and configuration data and the strategy rule data;
assigning a strategy to each branch node of the alarm map, where each strategy comprises a processing scheme and self-healing instructions;
and sending the strategies back to the OSS system, which determines the action to execute automatically in the next step, thereby realizing self-healing of the fault.
In an exemplary embodiment of the present disclosure, the intelligent analysis step further includes:
the extensibility of the alarm map can be set according to the actual situation, and this setting is made in the configuration center;
each generated alarm map is stored in the intelligent analysis area, forming a map warehouse;
after a certain number of maps have accumulated, new alarm association rules are derived through a built-in comparison algorithm and pushed to the operation and maintenance strategy area for storage.
In an exemplary embodiment of the present disclosure, the method further comprises:
strategy allocation and issuing: assigning a strategy to each branch node of the alarm map and sending the strategies back to the OSS system, which determines the action to execute automatically in the next step.
In an exemplary embodiment of the present disclosure, the policy allocating and issuing step further includes:
an additional issuing path can be configured, through which the strategy is also sent to the terminal of the operation and maintenance personnel, who perform a secondary judgment or adjust the execution strategy; the adjustable items include the automatic execution time and scope.
In an exemplary embodiment of the present disclosure, the method further comprises:
updating the operation and maintenance strategy area: the OSS system reports the alarm processing result to the intelligent analysis area, and the intelligent analysis area pushes the processing record and the verified strategy to the operation and maintenance strategy area for storage, thereby improving the fault experience library and the alarm association rule library.
In an exemplary embodiment of the present disclosure, the operation and maintenance policy area updating step further includes:
before a strategy is updated, it is automatically compared with the existing strategies in the fault experience library; only new content is entered, and a strategy similar to an existing one is not entered.
In one aspect of the present disclosure, an intelligent operation and maintenance device based on a CMDB and an alarm map is provided, which includes:
the configuration center is used for configuring the isolation area division of the dynamic cache area, configuring data communication rules among the dynamic cache area, the operation and maintenance strategy area and the intelligent analysis area and configuring data synchronization rules with an external system;
the dynamic cache region is used for synchronizing the configuration and resource change information of the IT environment from the configuration management database to each isolation region according to the preset dimensionality;
the operation and maintenance strategy area is used for storing the fault experience base, the manufacturer instruction base and the alarm association rule base and providing operation and maintenance strategy support for the intelligent analysis area;
and the intelligent analysis area is used for synchronizing the alarm information of the network operation support system OSS in real time, and generating a panoramic alarm map and a processing strategy group according to a preset algorithm by combining the information of the dynamic cache area and the operation and maintenance strategy area, so that the intelligent operation and maintenance of the data center are realized.
In one aspect of the present disclosure, there is provided an electronic device including:
a processor; and
a memory having computer readable instructions stored thereon which, when executed by the processor, implement a method according to any of the above.
In an aspect of the disclosure, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, realizes the method according to any one of the above.
The intelligent operation and maintenance method based on the CMDB and the alarm map in the exemplary embodiment of the disclosure configures the isolation region division of a dynamic cache region, the data communication rules among the dynamic cache region, an operation and maintenance strategy area, and an intelligent analysis area, and the data synchronization rules with an external system; synchronizes the configuration and resource change information of the IT environment from the configuration management database to each isolation region according to a preset dimension; stores a fault experience library, a manufacturer instruction library, and an alarm association rule library, and provides operation and maintenance strategy support for the intelligent analysis area; and synchronizes the alarm information of the network operation support system (OSS) in real time and, in combination with the information in the dynamic cache region and the operation and maintenance strategy area, generates a panoramic alarm map and a processing strategy group according to a preset algorithm, thereby realizing intelligent operation and maintenance of the data center. On one hand, the technology gives more accurate operation guidance and automation instructions for automatic fault handling at every stage, shortens the fault duration of telecom network equipment in cloud deployments, and helps improve user experience; on the other hand, it solves intelligent operation and maintenance problems such as intelligent troubleshooting and fault self-healing that arise after the communication network equipment of telecom operators is moved to general-purpose hardware and deployed in the cloud.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The above and other features and advantages of the present disclosure will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings.
FIG. 1 illustrates a flow diagram of an intelligent operation and maintenance method based on a CMDB and an alarm map according to an exemplary embodiment of the present disclosure;
FIG. 2 illustrates the software functional entity architecture of the intelligent operation and maintenance method and device based on the CMDB and the alarm map, and the data flow relationships between its modules, according to an exemplary embodiment of the present disclosure;
FIG. 3 illustrates a schematic block diagram of an intelligent operation and maintenance device based on a CMDB and an alarm map according to an exemplary embodiment of the present disclosure;
FIG. 4 schematically illustrates a block diagram of an electronic device according to an exemplary embodiment of the present disclosure; and
FIG. 5 schematically illustrates a computer-readable storage medium according to an exemplary embodiment of the present disclosure.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals denote the same or similar parts in the drawings, and thus, a repetitive description thereof will be omitted.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided to give a thorough understanding of embodiments of the disclosure. One skilled in the relevant art will recognize, however, that the embodiments of the disclosure can be practiced without one or more of the specific details, or with other methods, components, materials, devices, steps, and so forth. In other instances, well-known structures, methods, devices, implementations, materials, or operations are not shown or described in detail to avoid obscuring aspects of the disclosure.
The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. That is, these functional entities may be implemented in the form of software, or in one or more software-hardened modules, or in different networks and/or processor devices and/or microcontroller devices.
In this exemplary embodiment, an intelligent operation and maintenance method based on a CMDB and an alarm map is first provided. Referring to FIG. 1, the method may include the following steps:
a resource configuration step S110 of configuring the isolation region division of the dynamic cache region, configuring data communication rules among the dynamic cache region, the operation and maintenance strategy area, and the intelligent analysis area, and configuring data synchronization rules with an external system;
a dynamic caching step S120 of synchronizing the configuration and resource change information of the IT environment from the configuration management database to each isolation region according to a preset dimension;
an operation and maintenance strategy support step S130 of storing a fault experience library, a manufacturer instruction library, and an alarm association rule library, and providing operation and maintenance strategy support for the intelligent analysis area;
and an intelligent analysis step S140 of synchronizing the alarm information of the network operation support system (OSS) in real time and, in combination with the information in the dynamic cache region and the operation and maintenance strategy area, generating a panoramic alarm map and a processing strategy group according to a preset algorithm, thereby realizing intelligent operation and maintenance of the data center.
Hereinafter, the intelligent operation and maintenance method based on the CMDB and the alarm map in the present exemplary embodiment will be further described.
Fig. 2 is a diagram showing a data flow relationship between a software functional entity architecture and modules of the intelligent operation and maintenance method and apparatus based on the CMDB and the alarm map.
In the resource configuration step S110, the partition of the isolation region of the dynamic cache region may be configured, and the data communication rule between the dynamic cache region, the operation and maintenance policy region, and the intelligent analysis region, and the data synchronization rule with the external system may be configured.
In this exemplary embodiment, the resource configuration step further includes:
dividing the dynamic cache region into isolation regions according to the space size, or according to the attributes of the resources, the configuration change time periods, and the IT infrastructure types to be collected.
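The division of the dynamic cache region can be pictured with a minimal Python sketch. It assumes, purely for illustration, that the configuration center divides by IT infrastructure type; the names `IsolationRegion`, `build_regions`, and the type labels are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class IsolationRegion:
    """One isolation region of the dynamic cache (hypothetical structure)."""
    name: str
    infra_type: str                      # assumed division attribute
    records: list = field(default_factory=list)


def build_regions(infra_types):
    """Pre-divide the cache into one isolation region per infrastructure type."""
    return {t: IsolationRegion(name=f"region-{t}", infra_type=t) for t in infra_types}


# Illustrative infrastructure types only.
regions = build_regions(["server", "switch", "storage"])
```

The same shape would apply if the division key were a resource attribute or a configuration change time period instead of the infrastructure type.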
In the dynamic caching step S120, the configuration and resource change information of the IT environment may be synchronized from the configuration management database to each isolation region according to the preset dimension.
In this exemplary embodiment, the dynamic caching step further includes:
the dynamic cache region comprises a plurality of isolation regions, which are manually divided in advance through the configuration center.
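As a rough sketch of the dynamic caching step, the following Python routes CMDB change records into isolation regions keyed by a preset dimension. The record fields (`ci`, `infra_type`, `field`, `new`) and the function `sync_changes` are assumptions made for illustration; the patent does not specify a record format.

```python
from collections import defaultdict


def sync_changes(changes, dimension):
    """Route each CMDB change record to the isolation region keyed by `dimension`."""
    regions = defaultdict(list)
    for change in changes:
        regions[change[dimension]].append(change)
    return dict(regions)


# Hypothetical change records pulled from the CMDB.
changes = [
    {"ci": "vm-01", "infra_type": "server", "field": "ip", "new": "10.0.0.5"},
    {"ci": "sw-03", "infra_type": "switch", "field": "port", "new": "ge-0/0/1"},
    {"ci": "vm-02", "infra_type": "server", "field": "cpu", "new": "8"},
]
regions = sync_changes(changes, "infra_type")
```

Keeping each dimension value in its own region means a fault lookup only has to scan the region that matches the faulty resource.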
In the operation and maintenance policy support step S130, a failure experience library, a manufacturer instruction library, and an alarm association rule library may be stored and an operation and maintenance policy support may be provided for the intelligent analysis area.
In this exemplary embodiment, the operation and maintenance strategy support step further includes:
the fault experience library contains the existing alarm processing strategies;
the fault experience library and the alarm association rule library are initially built and imported from expert experience and historical experience data;
the manufacturer instruction library is built from the instructions provided by each communication equipment manufacturer, which are compiled into a unified mapping relationship within the library.
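One way to read the "unified mapping relationship" of the manufacturer instruction library is a table from a vendor-neutral action to each vendor's command template. The sketch below assumes that reading; the action name, vendor names, and command strings are invented placeholders, not real vendor commands.

```python
# Hypothetical unified mapping: neutral action -> per-vendor command template.
VENDOR_INSTRUCTIONS = {
    "restart_service": {
        "vendor_a": "service {name} restart",
        "vendor_b": "restart-svc --name {name}",
    },
}


def render_instruction(action, vendor, **params):
    """Translate a vendor-neutral action into the vendor's concrete instruction."""
    template = VENDOR_INSTRUCTIONS[action][vendor]
    return template.format(**params)


command = render_instruction("restart_service", "vendor_b", name="amf")
```

With such a table, a strategy can name the neutral action once and leave the vendor-specific wording to the library.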
In the intelligent analysis step S140, the alarm information of the network operation support system OSS may be synchronized in real time, and a panoramic alarm map and a processing policy group are generated according to a preset algorithm by combining the information of the dynamic cache area and the operation and maintenance policy area, so as to implement intelligent operation and maintenance of the data center.
In this exemplary embodiment, the intelligent analysis step further includes:
the intelligent analysis area interfaces with the network operation support system to exchange actions such as alarming and automatic fault handling; it interfaces with the dynamic cache region to obtain the latest resource and configuration data corresponding to a fault; and it interfaces with the operation and maintenance strategy area to retrieve historical experience, alarm association rules, and batch processing instruction templates.
In this exemplary embodiment, the intelligent analysis step further includes:
when alarm information is transmitted from the OSS system to the intelligent analysis area, generating a panoramic alarm map from the dynamic resource and configuration data and the strategy rule data;
assigning a strategy to each branch node of the alarm map, where each strategy comprises a processing scheme and self-healing instructions;
and sending the strategies back to the OSS system, which determines the action to execute automatically in the next step, thereby realizing self-healing of the fault.
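The patent leaves the "preset algorithm" unspecified. The Python sketch below swaps in a simple stand-in: alarms are linked along resource dependencies taken from the cache, and each node receives a strategy looked up by alarm type. All names, the dependency data, and the strategies are hypothetical.

```python
def build_alarm_map(alarms, depends_on):
    """Link each alarmed resource to the alarmed resources that depend on it."""
    alarmed = sorted({a["ci"] for a in alarms})
    alarm_map = {ci: [] for ci in alarmed}
    for ci in alarmed:
        for parent in depends_on.get(ci, []):
            if parent in alarm_map:
                alarm_map[parent].append(ci)  # parent fault -> dependent fault
    return alarm_map


def assign_strategies(alarms, experience, default="escalate to operator"):
    """Attach a handling strategy to every node of the alarm map."""
    return {a["ci"]: experience.get(a["type"], default) for a in alarms}


# Assumed inputs: dependencies from the cache, alarms from the OSS,
# strategies from the fault experience library.
depends_on = {"vm-01": ["host-1"], "vm-02": ["host-1"], "host-1": ["sw-03"]}
alarms = [
    {"ci": "host-1", "type": "power_fail"},
    {"ci": "vm-01", "type": "unreachable"},
    {"ci": "vm-02", "type": "unreachable"},
]
experience = {
    "power_fail": "switch to standby host",
    "unreachable": "wait for host recovery, then restart VM",
}

alarm_map = build_alarm_map(alarms, depends_on)
strategies = assign_strategies(alarms, experience)
# Nodes that are nobody's dependent are candidate root causes.
roots = [ci for ci in alarm_map if all(ci not in kids for kids in alarm_map.values())]
```

In this toy input the host is the only root, so its strategy would be the first candidate for automatic execution by the OSS.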
In this exemplary embodiment, the intelligent analysis step further includes:
the extensibility of the alarm map can be set according to the actual situation, and this setting is made in the configuration center;
each generated alarm map is stored in the intelligent analysis area, forming a map warehouse;
after a certain number of maps have accumulated, new alarm association rules are derived through a built-in comparison algorithm and pushed to the operation and maintenance strategy area for storage.
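The built-in comparison algorithm is not described in the patent. As one possible stand-in, the sketch below counts how many stored maps contain the same cause-effect pair and proposes pairs that recur often enough and are not yet known. The edge representation, `mine_rules`, and the threshold are assumptions.

```python
from collections import Counter


def mine_rules(warehouse, known_rules, min_count=2):
    """Propose (cause, effect) pairs that recur across stored alarm maps."""
    counts = Counter()
    for alarm_map in warehouse:
        counts.update(set(alarm_map))  # count each pair at most once per map
    return sorted(
        pair for pair, n in counts.items()
        if n >= min_count and pair not in known_rules
    )


# Hypothetical map warehouse: each map is a list of (cause type, effect type) edges.
warehouse = [
    [("power_fail", "vm_unreachable"), ("link_down", "bgp_flap")],
    [("power_fail", "vm_unreachable")],
    [("link_down", "bgp_flap"), ("power_fail", "vm_unreachable")],
]
known_rules = {("link_down", "bgp_flap")}
new_rules = mine_rules(warehouse, known_rules)
```

Pairs returned here would be the candidates pushed to the operation and maintenance strategy area.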
In this exemplary embodiment, the method further includes:
strategy allocation and issuing: assigning a strategy to each branch node of the alarm map and sending the strategies back to the OSS system, which determines the action to execute automatically in the next step.
In this exemplary embodiment, the strategy allocation and issuing step further includes:
an additional issuing path can be configured, through which the strategy is also sent to the terminal of the operation and maintenance personnel, who perform a secondary judgment or adjust the execution strategy; the adjustable items include the automatic execution time and scope.
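A minimal sketch of the optional operator path follows. It assumes the operator's adjustment is applied before the strategy reaches the OSS queue, which is one plausible ordering and not something the patent states; `Strategy`, `issue`, and the field names are hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Strategy:
    node: str
    instruction: str
    execute_after_s: int = 0   # delay before automatic execution
    scope: tuple = ()          # resources the instruction may touch


def issue(strategy, oss_queue, operator_review=None):
    """Send a strategy to the OSS, optionally letting an operator adjust it first."""
    if operator_review is not None:
        strategy = operator_review(strategy)
    oss_queue.append(strategy)
    return strategy


oss_queue = []
draft = Strategy(node="host-1", instruction="switch to standby host")
# The operator delays execution by five minutes and narrows the scope.
issued = issue(
    draft,
    oss_queue,
    operator_review=lambda s: replace(s, execute_after_s=300, scope=("vm-01",)),
)
```

Leaving `operator_review` unset corresponds to the fully automatic path.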
In this exemplary embodiment, the method further includes:
updating the operation and maintenance strategy area: the OSS system reports the alarm processing result to the intelligent analysis area, and the intelligent analysis area pushes the processing record and the verified strategy to the operation and maintenance strategy area for storage, thereby improving the fault experience library and the alarm association rule library.
In this exemplary embodiment, the operation and maintenance strategy area updating step further includes:
before a strategy is updated, it is automatically compared with the existing strategies in the fault experience library; only new content is entered, and a strategy similar to an existing one is not entered.
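The comparison before entry can be sketched as a similarity check. The patent does not say how similarity is measured; the example below assumes a text-similarity ratio from Python's standard `difflib.SequenceMatcher` with an arbitrary threshold of 0.8, and the library layout is hypothetical.

```python
from difflib import SequenceMatcher


def add_if_new(library, alarm_type, strategy, threshold=0.8):
    """Store a verified strategy unless a similar one already exists."""
    for existing in library.get(alarm_type, []):
        if SequenceMatcher(None, existing, strategy).ratio() >= threshold:
            return False  # similar strategy already present, skip entry
    library.setdefault(alarm_type, []).append(strategy)
    return True


# Hypothetical fault experience library keyed by alarm type.
library = {"power_fail": ["switch to standby host"]}
skipped = add_if_new(library, "power_fail", "switch to standby host.")
added = add_if_new(library, "power_fail", "replace power supply unit")
```

A stricter or looser threshold changes how aggressively near-duplicates are filtered out.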
In this exemplary embodiment, based on computer technology and alarm map technology, the configuration and resource change information of the IT infrastructure is correlated in real time with current alarms, processing experience and rules, processing means, and other information to generate a panoramic alarm map, so that faults can be located quickly. Operation guidance and automation instructions for automatic fault handling at every stage are given more accurately, the fault duration of telecom network equipment in cloud deployments is shortened, and user experience is improved. This solves intelligent operation and maintenance problems, such as intelligent troubleshooting and fault self-healing, that arise after the communication network equipment of telecom operators is moved to general-purpose hardware and deployed in the cloud.
It should be noted that although the various steps of the methods of the present disclosure are depicted in the drawings in a particular order, this does not require or imply that these steps must be performed in this particular order, or that all of the depicted steps must be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions, etc.
In addition, this exemplary embodiment also provides an intelligent operation and maintenance device based on the CMDB and the alarm map. The hardware of the device comprises a processor, a bus, memory (ROM, RAM, and cache, holding the intelligent operation and maintenance programs), an I/O interface (for connecting external devices), and a network adapter. The intelligent operation and maintenance processing system is stored in the ROM and interfaces with the operator's network management system (OSS) and the CMDB system through the I/O interface for data interaction.
Referring to fig. 3, the CMDB and alarm map based intelligent operation and maintenance device 200 may include: a configuration center 210, a dynamic cache area 220, an operation and maintenance policy area 230, and an intelligent analysis area 240. Wherein:
the configuration center 210 is configured to configure the partition of the dynamic cache region, configure data communication rules among the dynamic cache region, the operation and maintenance policy region, and the intelligent analysis region, and configure data synchronization rules with an external system;
a dynamic cache region 220, configured to synchronize configuration and resource change information of the IT environment from the configuration management database to each isolation region according to a preset dimension;
the operation and maintenance strategy area 230 is used for storing a fault experience base, a manufacturer instruction base and an alarm association rule base and providing operation and maintenance strategy support for the intelligent analysis area;
and the intelligent analysis area 240 is used for synchronizing the alarm information of the network operation support system OSS in real time, and generating a panoramic alarm map and a processing strategy group according to a preset algorithm by combining the information of the dynamic cache area and the operation and maintenance strategy area, so as to realize intelligent operation and maintenance of the data center.
The specific details of each of the above intelligent operation and maintenance device modules based on the CMDB and the alarm map have been described in detail in the corresponding intelligent operation and maintenance method based on the CMDB and the alarm map, and therefore are not described herein again.
It should be noted that although several modules or units of the intelligent operation and maintenance device 200 based on the CMDB and the alarm map are mentioned in the above detailed description, this division is not mandatory. According to embodiments of the present disclosure, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of one module or unit described above may be further divided among a plurality of modules or units.
In addition, in an exemplary embodiment of the present disclosure, an electronic device capable of implementing the above method is also provided.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or program product. Thus, various aspects of the invention may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects, all of which may generally be referred to herein as a "circuit," "module," or "system."
An electronic device 300 according to such an embodiment of the invention is described below with reference to fig. 4. The electronic device 300 shown in fig. 4 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present invention.
As shown in fig. 4, electronic device 300 is embodied in the form of a general purpose computing device. The components of electronic device 300 may include, but are not limited to: at least one processing unit 310, at least one storage unit 320, a bus 330 connecting different system components (including the storage unit 320 and the processing unit 310), and a display unit 340.
The storage unit 320 stores program code that is executable by the processing unit 310, such that the processing unit 310 performs the steps according to various exemplary embodiments of the present invention described in the "exemplary method" section of this specification. For example, the processing unit 310 may perform steps S110 to S140 as shown in fig. 1.
The storage unit 320 may include readable media in the form of volatile storage units, such as a random access memory (RAM) unit 3201 and/or a cache memory unit 3202, and may further include a read-only memory (ROM) unit 3203.
The storage unit 320 may also include a program/utility 3204 having a set (at least one) of program modules 3205, such program modules 3205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 330 may represent one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.
The electronic device 300 may also communicate with one or more external devices 370 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 300, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 300 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 350. Also, the electronic device 300 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) via the network adapter 360. As shown, network adapter 360 communicates with the other modules of electronic device 300 via bus 330. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with electronic device 300, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a USB flash drive, a removable hard disk, etc.) or on a network, and which includes several instructions to enable a computing device (which may be a personal computer, a server, a terminal device, or a network device, etc.) to execute the method according to the embodiments of the present disclosure.
In an exemplary embodiment of the present disclosure, there is also provided a computer-readable storage medium having stored thereon a program product capable of implementing the above-described method of the present specification. In some possible embodiments, aspects of the invention may also be implemented in the form of a program product comprising program code means for causing a terminal device to carry out the steps according to various exemplary embodiments of the invention described in the above-mentioned "exemplary methods" section of the present description, when said program product is run on the terminal device.
Referring to fig. 5, a program product 400 for implementing the above method according to an embodiment of the present invention is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on a terminal device, such as a personal computer. However, the program product of the present invention is not limited in this regard and, in the present document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
Furthermore, the above-described figures are merely schematic illustrations of processes involved in methods according to exemplary embodiments of the invention, and are not intended to be limiting. It will be readily understood that the processes shown in the above figures are not intended to indicate or limit the chronological order of the processes. In addition, it is also readily understood that these processes may be performed synchronously or asynchronously, e.g., in multiple modules.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is to be limited only by the terms of the appended claims.

Claims (18)

1. An intelligent operation and maintenance method based on a configuration management database and an alarm map is characterized by comprising the following steps:
a resource configuration step, namely configuring the isolation area division of the dynamic cache area, configuring data communication rules among the dynamic cache area, the operation and maintenance strategy area and the intelligent analysis area, and configuring data synchronization rules with an external system;
a dynamic caching step, namely synchronizing the configuration and resource change information of the IT environment from the configuration management database to each isolation area according to a preset dimension;
an operation and maintenance strategy support step, wherein a fault experience base, a manufacturer instruction base and an alarm association rule base are stored in an operation and maintenance strategy area, and operation and maintenance strategy support is provided for an intelligent analysis area;
and in the intelligent analysis step, the alarm information of the network operation support system OSS is synchronized in real time, and a panoramic alarm map and a processing strategy group are generated according to a preset algorithm by combining the information of the dynamic cache region and the operation and maintenance strategy region, so that the intelligent operation and maintenance of the data center are realized.
2. The method of claim 1, wherein the resource configuring step further comprises:
and partitioning the isolation region of the dynamic cache region according to the space size, the resources required to be acquired, the configuration change time segment and the attributes of the IT infrastructure type.
3. The method of claim 1, wherein the step of dynamically caching further comprises:
the dynamic cache region comprises a plurality of isolation regions, and the isolation regions are manually pre-divided by a configuration center.
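By way of a non-limiting illustration of claims 2 and 3, routing CMDB change records into manually pre-divided isolation areas may be sketched as follows. The area names, the `infra_type` field and the "unassigned" fallback are hypothetical; only one of the attributes named in claim 2 (IT infrastructure type) is used here, and the other attributes could be added as further predicates.

```python
from collections import defaultdict

# Hypothetical, manually configured partition:
# isolation area name -> predicate over a CMDB change record.
PARTITIONS = {
    "network": lambda rec: rec["infra_type"] in {"switch", "router"},
    "compute": lambda rec: rec["infra_type"] in {"server", "vm"},
}


def route_changes(records):
    """Place each CMDB change record into the first isolation area whose predicate matches."""
    areas = defaultdict(list)
    for rec in records:
        for name, matches in PARTITIONS.items():
            if matches(rec):
                areas[name].append(rec)
                break
        else:
            # No configured isolation area matched this record.
            areas["unassigned"].append(rec)
    return dict(areas)
```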
4. The method of claim 1, wherein the operation and maintenance policy support step further comprises:
the fault experience base comprises an existing alarm processing strategy;
the fault experience base and the alarm association rule base are initially created and imported from expert experience and historical experience data;
the manufacturer instruction base is provided by each communication equipment manufacturer and compiled into a unified mapping relation in the manufacturer instruction base.
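By way of a non-limiting illustration of the unified mapping relation of claim 4, a manufacturer instruction base may be sketched as a lookup from a unified action and a manufacturer to a command template. The action name, the manufacturer identifiers and the command strings below are placeholders invented for this sketch and do not reproduce any real equipment command syntax.

```python
# Hypothetical unified mapping:
# (unified action, manufacturer) -> manufacturer-specific command template.
VENDOR_INSTRUCTIONS = {
    ("restart_port", "vendor_a"): "port {port} reset",
    ("restart_port", "vendor_b"): "interface {port} shutdown; interface {port} no shutdown",
}


def render_instruction(action, vendor, **params):
    """Translate a unified action into the command text for one manufacturer."""
    template = VENDOR_INSTRUCTIONS.get((action, vendor))
    if template is None:
        raise KeyError(f"no instruction for action={action!r}, vendor={vendor!r}")
    return template.format(**params)
```

With such a mapping, the intelligent analysis area can reason in terms of unified actions and leave the manufacturer-specific wording to the instruction base.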
5. The method of claim 1, wherein the intelligent analysis step further comprises:
the intelligent analysis area interfaces with the network operation support system to carry out alarm and fault self-processing; interfaces with the dynamic cache region to obtain the latest resource and configuration data corresponding to the fault; and interfaces with the operation and maintenance strategy area to call historical experience, alarm association rules and batch processing instruction templates.
6. The method of claim 1, wherein the intelligent analysis step further comprises:
when the alarm information is transmitted to the intelligent analysis area from the OSS system, generating a panoramic alarm map according to dynamic resource and configuration data and strategy rule data;
distributing strategies for each branch node of the alarm map, wherein the distribution strategies comprise processing schemes and self-healing instructions;
and sending the strategy back to the OSS system, and determining the action to be automatically executed in the next step by the OSS system so as to realize the self-healing of the fault.
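By way of a non-limiting illustration of claim 6, generating an alarm map and distributing a strategy to each of its nodes may be sketched as follows. The claim does not specify the preset algorithm; this sketch assumes a single-parent dependency relation taken from the dynamic cache data, and the function names, the "root"/"derived" roles and the fallback "manual review" are hypothetical.

```python
def build_alarm_map(alarms, depends_on):
    """Build a tree-shaped alarm map.

    alarms: set of resource ids that are currently alarming.
    depends_on: resource id -> resource id it depends on (from the dynamic cache).
    Returns (roots, children), restricted to alarming resources.
    """
    children = {a: [] for a in alarms}
    roots = []
    for a in sorted(alarms):
        parent = depends_on.get(a)
        if parent in alarms:
            children[parent].append(a)   # alarm derived from an alarming parent
        else:
            roots.append(a)              # no alarming parent: candidate root cause
    return roots, children


def assign_strategies(roots, children, alarm_types, experience):
    """Attach a processing scheme to every node of the alarm map."""
    plan = {}
    for node in children:
        plan[node] = {
            "role": "root" if node in roots else "derived",
            "scheme": experience.get(alarm_types[node], "manual review"),
        }
    return plan
```

The resulting plan corresponds to the strategy group that is sent back to the OSS system, which then decides the next automatically executed action.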
7. The method of claim 1, wherein the intelligent analysis step further comprises:
the extensibility of the alarm map is set according to the actual situation, the setting being made in the configuration center;
alarm maps generated in each time are stored in an intelligent analysis area to form a map warehouse;
after a certain number of maps are accumulated, a new alarm association rule is combed out through a built-in comparison algorithm and is pushed to an operation and maintenance strategy area for storage.
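By way of a non-limiting illustration of claim 7, deriving new alarm association rules from the map warehouse may be sketched as follows. The claim does not disclose the built-in comparison algorithm; simple frequency counting of recurring cause-effect edges is used here as a stand-in, and the function name and the `min_support` threshold are assumptions.

```python
from collections import Counter


def mine_rules(map_warehouse, known_rules, min_support=3):
    """Propose (cause, effect) edges that recur across stored alarm maps.

    map_warehouse: list of alarm maps, each given as a list of (cause, effect) edges.
    known_rules: set of edges already stored in the alarm association rule base.
    """
    counts = Counter()
    for edges in map_warehouse:
        counts.update(set(edges))  # count each edge at most once per map
    return sorted(
        edge for edge, n in counts.items()
        if n >= min_support and edge not in known_rules
    )
```

Edges returned by this function would be pushed to the operation and maintenance strategy area as candidate association rules.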
8. The method of claim 1, wherein the method further comprises:
strategy distribution and issuing: and distributing strategies for each branch node of the alarm map, sending the strategies back to the OSS system, and determining the action to be automatically executed in the next step by the OSS system.
9. The method of claim 8, wherein the policy assigning and issuing step further comprises:
and adding an additional issuing path, one path being sent to the terminal of the operation and maintenance personnel, who perform a secondary judgment on, or adjustment of, the execution strategy, wherein the adjustable range comprises the automatic execution time and scope.
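By way of a non-limiting illustration of claim 9, the operator review path may be sketched as follows. The `Strategy` fields and the `issue` function are hypothetical; the sketch assumes that the operator may change only the execution time and the execution scope, and that any other requested change is ignored.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Strategy:
    node: str          # branch node of the alarm map
    instruction: str   # self-healing instruction to execute
    execute_at: str    # planned automatic execution time (ISO-8601 string)
    scope: tuple       # resources the instruction applies to


def issue(strategy, operator_review=None):
    """Return the strategy finally sent to the OSS system.

    operator_review: optional dict of adjustments from the operator terminal;
    only 'execute_at' and 'scope' are honored.
    """
    if operator_review is None:
        return strategy
    allowed = {k: v for k, v in operator_review.items() if k in {"execute_at", "scope"}}
    return replace(strategy, **allowed)
```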
10. The method of claim 1, wherein the method further comprises:
updating the operation and maintenance strategy area: the OSS system reports the alarm processing result to the intelligent analysis area, and the intelligent analysis area pushes the processing record and the verified strategy to the operation and maintenance strategy area for storage, thereby improving the fault experience base and the alarm association rule base.
11. The method of claim 10, wherein the operation and maintenance policy area updating step further comprises:
before updating the strategy, the existing strategy in the fault experience base is automatically compared, only the newly added content is recorded, and if the existing strategy is similar, the new content is not recorded.
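By way of a non-limiting illustration of claim 11, the comparison performed before updating the fault experience base may be sketched as follows. The claim does not define how similarity is measured; this sketch assumes strategies are stored as text and uses the string similarity ratio of Python's standard `difflib` module with a hypothetical threshold of 0.9.

```python
from difflib import SequenceMatcher


def record_if_new(experience_base, alarm_type, strategy, threshold=0.9):
    """Store a verified strategy only if no similar one exists for this alarm type.

    experience_base: dict mapping alarm type -> list of strategy texts.
    Returns True if the strategy was recorded, False if it was skipped.
    """
    for existing in experience_base.get(alarm_type, []):
        if SequenceMatcher(None, existing, strategy).ratio() >= threshold:
            return False  # a similar strategy is already recorded
    experience_base.setdefault(alarm_type, []).append(strategy)
    return True
```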
12. An intelligent operation and maintenance device based on a configuration management database and an alarm map, which is characterized by comprising:
the configuration center is used for configuring the isolation area division of the dynamic cache area, configuring data communication rules among the dynamic cache area, the operation and maintenance strategy area and the intelligent analysis area and configuring data synchronization rules with an external system;
the dynamic cache region is used for synchronizing the configuration and resource change information of the IT environment from the configuration management database to each isolation region according to the preset dimensionality;
the operation and maintenance strategy area is used for storing the fault experience base, the manufacturer instruction base and the alarm association rule base and providing operation and maintenance strategy support for the intelligent analysis area;
and the intelligent analysis area is used for synchronizing the alarm information of the network operation support system OSS in real time, and generating a panoramic alarm map and a processing strategy group according to a preset algorithm by combining the information of the dynamic cache area and the operation and maintenance strategy area, so that the intelligent operation and maintenance of the data center are realized.
13. The apparatus of claim 12, wherein the configuration center configures the coordination rules among the other three regions, including partition of the dynamic cache region, data call rules among the regions, and data synchronization rules with external systems.
14. The apparatus of claim 12, wherein the dynamic cache region is divided into a plurality of isolation regions, isolation conditions of the isolation regions are preset, the isolation regions are manually pre-divided by the configuration center, and the required dimension of each isolation region synchronizes configuration and resource change information of the corresponding dimension from the configuration management database CMDB.
15. The apparatus of claim 12, wherein the operation and maintenance policy area comprises a failure experience library, a manufacturer instruction library and an alarm association rule library; in the daily operation and maintenance process, successfully completed fault self-healing processing rules and updated alarm association rules are collected and filed, and the database can be updated through manual intervention.
16. The apparatus of claim 12, wherein the intelligent analysis area serves as an analysis hub for intelligent fault handling and interfaces with the OSS system to carry out alarm and fault self-processing; interfaces with the dynamic cache region to obtain the latest resource and configuration data corresponding to the fault; interfaces with the operation and maintenance strategy area to call historical troubleshooting rules, alarm association rules and batch processing instruction templates; finally generates a panoramic alarm map and distributes strategies for each branch node of the alarm map; and sends the strategies to the OSS system, tracks the fault processing result, and packages and pushes the correct strategy experience to the operation and maintenance strategy area for storage.
17. An electronic device, comprising:
a processor; and
a memory having computer readable instructions stored thereon which, when executed by the processor, implement the method of any of claims 1 to 11.
18. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1 to 11.
CN201910771332.2A 2019-08-21 2019-08-21 Intelligent operation and maintenance method and device based on CMDB and alarm map Active CN110557281B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910771332.2A CN110557281B (en) 2019-08-21 2019-08-21 Intelligent operation and maintenance method and device based on CMDB and alarm map

Publications (2)

Publication Number Publication Date
CN110557281A (en) 2019-12-10
CN110557281B (en) 2022-04-26

Family

ID=68737794

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910771332.2A Active CN110557281B (en) 2019-08-21 2019-08-21 Intelligent operation and maintenance method and device based on CMDB and alarm map

Country Status (1)

Country Link
CN (1) CN110557281B (en)






Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant