CN113516244B - Intelligent operation and maintenance method and device, electronic equipment and storage medium - Google Patents

Intelligent operation and maintenance method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN113516244B
CN113516244B CN202110850483.4A CN202110850483A CN113516244B CN 113516244 B CN113516244 B CN 113516244B CN 202110850483 A CN202110850483 A CN 202110850483A CN 113516244 B CN113516244 B CN 113516244B
Authority
CN
China
Prior art keywords
maintenance
data
historical
change
strategy set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110850483.4A
Other languages
Chinese (zh)
Other versions
CN113516244A (en
Inventor
宋广磊
吴统帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengjing Intelligent Technology Jiaxing Co ltd
Original Assignee
Shengjing Intelligent Technology Jiaxing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shengjing Intelligent Technology Jiaxing Co ltd filed Critical Shengjing Intelligent Technology Jiaxing Co ltd
Priority to CN202110850483.4A priority Critical patent/CN113516244B/en
Publication of CN113516244A publication Critical patent/CN113516244A/en
Application granted granted Critical
Publication of CN113516244B publication Critical patent/CN113516244B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition

Abstract

The application provides an intelligent operation and maintenance method, an intelligent operation and maintenance device, electronic equipment and a storage medium, wherein the intelligent operation and maintenance method comprises the following steps: acquiring business change data; based on the business change data, a change operation and maintenance strategy set is matched from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set. According to the intelligent operation and maintenance method, the intelligent operation and maintenance device, the electronic equipment and the storage medium, the change data of the operation and maintenance on the equipment are obtained, and the operation and maintenance strategy set is precisely matched in the historical knowledge base, so that the operation and maintenance events related to the service data are more effectively analyzed and positioned by utilizing the targeted and customized operation and maintenance strategies in the operation and maintenance strategy set, decisions are made, and the IT resource utilization rate and the system service stability are improved.

Description

Intelligent operation and maintenance method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of big data analysis technologies, and in particular, to an intelligent operation and maintenance method, an intelligent operation and maintenance device, an electronic device, and a storage medium.
Background
In the field of internet technology (Internet Technology, IT) operation and maintenance in big data age, artificial intelligence IT operation and maintenance (Artificial Intelligence Internet Technology Operations, AIOps), comprehensive monitoring, fault automatic positioning, root cause analysis, alarm and the like of hardware equipment are realized by combining big data and data mining technology, and a mature framework and a mature technical scheme exist.
Existing AIOps operation and maintenance systems mainly involve three aspects, including: the intelligent operation and maintenance algorithm unit is mainly used for analyzing and positioning operation and maintenance events and making decisions according to specific operation and maintenance scenes, business rules or expert experience and the like by using an artificial intelligent algorithm; an automatic operation and maintenance tool is mainly used for performing processing operations such as control, monitoring, online and the like on a system based on an operation and maintenance tool of a determination logic; the operation and maintenance big data platform is mainly used for collecting, processing, storing and displaying various operation and maintenance data and a perception and insight system.
The existing intelligent operation and maintenance algorithm unit autonomously perceives the change of operation service on each device, including the use and operation application of the device and the following changes of network environment, machine room and the like, are blank, so that uniform operation and maintenance strategies are implemented on each device, and resource waste is caused, and the key points of operation and maintenance and monitoring cannot be grasped.
Disclosure of Invention
The application provides an intelligent operation and maintenance method, an intelligent operation and maintenance device, electronic equipment and a storage medium, which are used for solving the defects of resource waste and poor operation and maintenance effect caused by implementing a unified operation and maintenance strategy on each equipment in the prior art.
In a first aspect, the present application provides an intelligent operation and maintenance method, including: acquiring business change data; based on the business change data, a change operation and maintenance strategy set is matched from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
According to the intelligent operation and maintenance method provided by the application, when the service variation data comprises application variation data and equipment process data, the acquiring the service variation data comprises the following steps: receiving the application change data, wherein the application change data is uploaded by engineering implementation personnel; and receiving the equipment process data sent by the process monitoring service unit in a heartbeat mode.
According to the intelligent operation and maintenance method provided by the application, before rule matching is carried out in a historical knowledge base based on the business change data so as to obtain a change operation and maintenance strategy set, the intelligent operation and maintenance method further comprises the following steps: and carrying out data mining on the historical operation and maintenance data to formulate association rules between the service data and the intelligent operation and maintenance strategy, and constructing the historical knowledge base.
According to the intelligent operation and maintenance method provided by the application, rule matching is performed in a historical knowledge base based on the business change data so as to obtain a change operation and maintenance strategy set, and the method comprises the following steps: based on the association rule between the service data and the intelligent operation and maintenance strategy, invoking an association rule mining algorithm to mine the historical knowledge base by utilizing the service variation data so as to acquire a frequent item set; and determining the change operation and maintenance strategy set according to the frequent item set.
According to the intelligent operation and maintenance method provided by the application, before the change operation and maintenance strategy set is matched from the historical knowledge base based on the business change data, the intelligent operation and maintenance method further comprises the following steps: and merging the business change data.
According to the intelligent operation and maintenance method provided by the application, the change operation and maintenance strategy set is compared with the current operation and maintenance strategy set, and a new operation and maintenance strategy set is determined according to the comparison result, and the intelligent operation and maintenance method comprises the following steps: under the condition that the comparison result of the changed operation and maintenance strategy set and the current operation and maintenance strategy set is the same, keeping the current operation and maintenance strategy set unchanged; and under the condition that the comparison results of the change operation and maintenance strategy set and the current operation and maintenance strategy set are different, determining the change operation and maintenance strategy set as the new operation and maintenance strategy set.
According to the intelligent operation and maintenance method provided by the application, the operation and maintenance of the equipment and the service are performed based on the new operation and maintenance strategy set, and the method comprises the following steps: collecting equipment state data, and merging equipment process data in the equipment state data; and calling a corresponding operation and maintenance strategy according to the merged process data of the equipment based on the new operation and maintenance strategy set so as to realize operation and maintenance of the equipment.
In a second aspect, the present application further provides an intelligent operation and maintenance device, including: the data acquisition unit is used for acquiring business change data; the policy calling unit is used for matching a change operation and maintenance policy set from the historical knowledge base based on the business change data; the strategy matching unit is used for comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set and determining a new operation and maintenance strategy set according to the comparison result; and the policy running unit is used for running and maintaining the equipment and the service based on the new running and maintenance policy set.
In a third aspect, the present application provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of any one of the intelligent operation and maintenance methods described above when the program is executed by the processor.
In a fourth aspect, the present application also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the intelligent operation and maintenance method as described in any of the above.
According to the intelligent operation and maintenance method, the intelligent operation and maintenance device, the electronic equipment and the storage medium, the change data of the operation and maintenance on the equipment are obtained, and the operation and maintenance strategy set is precisely matched in the historical knowledge base, so that the operation and maintenance strategies in the operation and maintenance strategy set are utilized, the operation and maintenance events related to the service data can be more effectively analyzed and positioned, decisions are made, and the IT resource utilization rate and the system service stability are improved.
Drawings
In order to more clearly illustrate the application or the technical solutions of the prior art, the following description will briefly explain the drawings used in the embodiments or the description of the prior art, and it is obvious that the drawings in the following description are some embodiments of the application, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of an intelligent operation and maintenance method provided by the application;
FIG. 2 is a second flow chart of the intelligent operation and maintenance method according to the present application;
FIG. 3 is a third flow chart of the intelligent operation and maintenance method according to the present application;
FIG. 4 is a schematic diagram of the structure of the intelligent operation and maintenance device provided by the application;
fig. 5 is a schematic structural diagram of an electronic device provided by the present application.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
It should be noted that in the description of embodiments of the present application, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element. The orientation or positional relationship indicated by the terms "upper", "lower", etc. are based on the orientation or positional relationship shown in the drawings, are merely for convenience of description and to simplify the description, and are not indicative or implying that the apparatus or elements in question must have a specific orientation, be constructed and operated in a specific orientation, and therefore should not be construed as limiting the present application. Unless specifically stated or limited otherwise, the terms "mounted," "connected," and "coupled" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art according to the specific circumstances.
The terms first, second and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged, as appropriate, such that embodiments of the present application may be implemented in sequences other than those illustrated or described herein, and that the objects identified by "first," "second," etc. are generally of a type, and are not limited to the number of objects, such as the first object may be one or more. Furthermore, in the description and claims, "and/or" means at least one of the connected objects, and the character "/", generally means that the associated object is an "or" relationship.
Under the current technology, AIOps operation and maintenance mainly can realize the following functions:
1) The system abnormality and the fault hidden danger are determined, the possible abnormality or fault hidden danger can be early warned in time, the probability of fault occurrence can be obviously reduced, and the pressure of IT operation and maintenance can be reduced;
2) The performance bottleneck of each application system is found and positioned in time, and the root cause of various faults is positioned rapidly, so that the stability of system service is improved.
3) Providing basis for the manager to make accurate decisions by determining the data connection between the online service and the equipment resource capacity; and meanwhile, resource early warning and resource utilization rate optimization can be provided, including the improvement of the existing IT resource utilization rate of enterprises.
The intelligent operation and maintenance method, the device, the electronic equipment and the storage medium provided by the application are based on a basic framework of AIOps operation and maintenance, and are different in that the application can perform operation and maintenance and monitoring more reasonably and effectively by timely grasping service changes of deployment and implementation on the equipment and calling a targeted and customized operation and maintenance strategy according to the changes of service data.
The following describes an intelligent operation and maintenance method, an intelligent operation and maintenance device, an electronic device and a storage medium provided by the application with reference to fig. 1 to 5.
Fig. 1 is a schematic flow chart of the intelligent operation and maintenance method provided by the present application, as shown in fig. 1, including but not limited to the following steps:
step 101: and acquiring business change data.
The business change data is data which can reflect the change related to the running business on each device and changes, such as network environment data, machine room data, application service data (such as related data corresponding to upgrading, deployment, implementation and the like), data related to application processes, device performance index data and the like. The present application is not particularly limited thereto.
The method for acquiring the service variation data can be a heartbeat mode, namely, the method for acquiring the service related information in real time, the method for acquiring the engineering implementation personnel reporting information and the like, and the application is not particularly limited.
Step 102: and based on the business change data, matching a change operation and maintenance strategy set from a historical knowledge base.
The change operation and maintenance policy set refers to a set of operation and maintenance policies matched from a pre-constructed historical knowledge base according to each group of business change data. The operation and maintenance strategy refers to a strategy of analyzing, positioning or deciding (such as anomaly detection, intelligent early warning and the like) according to operation and maintenance scenes, business rules or expert experience and the like of different business change data by adopting a corresponding artificial intelligent algorithm, and each operation and maintenance strategy can be narrowly defined as an intelligent operation and maintenance algorithm model.
Wherein, the history knowledge base stores related operation and maintenance strategies corresponding to various different business change data in advance.
In step 102, by matching the corresponding operation and maintenance policies from the historical knowledge base according to the preset matching rules for all the service variation data collected in step 101, an operation and maintenance policy set is built, and this operation and maintenance policy set is referred to herein as a change operation and maintenance policy set.
Step 103: and comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result.
After the change operation and maintenance policy set is matched from the historical knowledge base according to the currently collected business change data, the change operation and maintenance policy set needs to be compared with the current operation and maintenance policy set running in the system to determine whether the current operation and maintenance policy set needs to be updated or not.
For example: a new operation policy (the root cause of which is that new service data is present) appears in the change operation and maintenance policy set, but the operation policy is not loaded to the current operation and maintenance policy set, and the operation policy needs to be reloaded to form a new operation and maintenance policy set.
Step 104: and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
After the current operation and maintenance strategy set is updated through the steps, the new operation and maintenance strategy set is utilized to carry out processing such as control, monitoring and online on each device in the system, so that intelligent operation and maintenance is realized.
According to the intelligent operation and maintenance method provided by the application, the change data of the operation and maintenance service on the equipment are obtained, and the operation and maintenance strategy set is precisely matched in the historical knowledge base, so that the operation and maintenance event related to the service data is more effectively analyzed and positioned by utilizing the targeted and customized operation and maintenance strategy in the operation and maintenance strategy set, and decisions are made, and the IT resource utilization rate and the system service stability are improved.
Based on the foregoing embodiment, as an optional embodiment, in a case where the service variation data includes application variation data and device process data, the acquiring service variation data includes: receiving the application change data, wherein the application change data is uploaded by engineering implementation personnel; and receiving the device process data sent by the process monitoring service unit in a heartbeat mode based on the agent platform.
FIG. 2 is a second flow chart of the intelligent operation and maintenance method provided by the present application, as shown in FIG. 2, the method for obtaining service variation data according to the present application may have the following two aspects, which are all completed by the agent platform. Wherein the agent platform is a carrier for executing agent technology.
The Agent platform is a computer system packaged in a certain environment, is an abstract entity with autonomy, sociality, reactivity and initiative, can independently and autonomously operate in a certain environment, acts on the environment and is also influenced by the environment, and knowledge can be continuously obtained from the environment to improve the capability of the Agent platform.
On one hand, in order to timely grasp the change of application services on the equipment, the application realizes the online and integrated operation and maintenance and management, provides an open management function, and automatically uploads related application change data to the agent platform to realize the sharing of service data while engineering implementation personnel manage the equipment online, upgrade the application equipment or deploy and implement new applications.
On the other hand, the process monitoring service unit of the agent collects the process information of the equipment in real time so as to acquire the process data of the equipment: by implementing the process information of the monitoring device, the service corresponding to the application service process can be analyzed, so that the service content running on the device can be determined.
It should be noted that, because the change of the device process information related to the service content is slow, the application adopts the Heartbean to send the device process data to the agent platform in real time in a Heartbeat service mode, instead of frequent inquiry and call of the device process data by the agent platform.
In addition, the main idea of the intelligent operation and maintenance method provided by the application is as follows: according to different running services on the equipment, different operation and maintenance strategies are called, so that the AIOps operation and maintenance are more targeted, and equipment resources are more effectively utilized. Therefore, the intelligent operation and maintenance method is not only suitable for a distributed or centralized equipment operation and maintenance scene, but also suitable for equipment operation and maintenance of various scales; the intelligent operation and maintenance agent platform is transparent to the equipment, and normal operation can be realized by only remotely configuring the intelligent operation and maintenance agent platform to the corresponding equipment no matter whether the operation and maintenance equipment is a server, a camera or other terminals.
The intelligent operation and maintenance method provided by the application is based on the principle and technology of agent to realize intelligent operation and maintenance, completes real-time calculation based on big data through data sharing and interface service, is applied to the intelligent operation and maintenance field by combining with the technologies of data mining, deep learning and the like, and can effectively improve the effective utilization rate of equipment resources.
Based on the foregoing embodiment, as an optional embodiment, before performing rule matching in the historical knowledge base based on the service variation data to obtain the change operation and maintenance policy set, the method further includes: and carrying out data mining on the historical operation and maintenance data to formulate association rules between the service data and the intelligent operation and maintenance strategy, and constructing the historical knowledge base.
In the intelligent operation and maintenance method provided by the application, the historical knowledge base can be pre-constructed and stored in the system memory for the real-time call of the agent platform.
The construction process of the historical knowledge base can be as follows:
acquiring historical operation and maintenance data in the AIOps operation and maintenance implementation process, wherein the historical operation and maintenance data comprises historical service data and a historical operation and maintenance strategy set which is used for operating and maintaining the historical service data and is formed by historical operation and maintenance strategies;
and carrying out association rule mining on the historical service data and the historical operation and maintenance policies in the historical operation and maintenance data, establishing association rules between all types of service data and the operation and maintenance policies, and constructing a historical knowledge base.
After the historical knowledge base is constructed, the relevant operation and maintenance strategy set can be determined based on high-frequency problems and root cause analysis caused by the conditions of application service, machine room, network and the like of equipment operation in the AIOps operation and maintenance process, the association relation of the equipment, service and the operation and maintenance strategy set is established, and the historical knowledge base is continuously updated according to the relations.
According to the intelligent operation and maintenance method provided by the application, the historical knowledge base is pre-built, and in the AIOps operation and maintenance, the agent platform can timely match the corresponding operation and maintenance strategy set from the historical knowledge base according to the current business change data, so that the operation and maintenance and monitoring of equipment and service can be realized rapidly, the response speed is high, and the operation and maintenance effect is higher.
Based on the foregoing embodiment, as an optional embodiment, the performing rule matching in the historical knowledge base based on the service variation data to obtain a change operation and maintenance policy set includes: based on the association rule between the service data and the intelligent operation and maintenance strategy, invoking an association rule mining algorithm to mine the historical knowledge base by utilizing the service variation data so as to acquire a frequent item set; and determining the change operation and maintenance strategy set according to the frequent item set.
Fig. 3 is a third flow chart of the intelligent operation and maintenance method provided by the present application, as shown in fig. 3, in the intelligent operation and maintenance method provided by the present application, agent technology is adopted, and service variation data on a device is automatically obtained, and a correlation mining algorithm is used in a historical knowledge base, for example: the FP-Growth algorithm performs data mining on the operation and maintenance history data, and comprises the following steps:
and constructing association rules between the service data and the intelligent operation and maintenance strategies, mining, and acquiring frequent items by introducing confidence according to causal relations between the service data and the intelligent operation and maintenance strategies, so as to realize accurate matching of the intelligent operation and maintenance strategies which need to be started in a historical knowledge base, namely automatically matching a change operation and maintenance strategy set corresponding to the service change data.
After the current operation and maintenance strategy set is updated by utilizing the changed operation and maintenance strategy set, the related operation and maintenance strategies in the new operation and maintenance strategy set can be called for intelligent operation and maintenance and monitoring of the equipment through the equipment state data collected and processed in real time.
The intelligent operation and maintenance method provided by the application is used for completing high-frequency problems and root cause analysis caused under the conditions of application service, machine room, network and the like of equipment operation based on the association rule mining algorithm, thereby determining a corresponding operation and maintenance strategy set, further establishing the relationship among equipment, service and operation and maintenance strategy set, and providing operation and maintenance strategies in a targeted and customized manner according to different service data, so that the equipment operation and maintenance and monitoring can be realized more reasonably and effectively.
Based on the foregoing embodiment, as an optional embodiment, after the acquiring service variation data, the method further includes: and merging the business change data to obtain key business data so as to match the change operation and maintenance strategy set from the historical knowledge base by utilizing the key business data.
As shown in FIG. 2, the channels for acquiring the business change data are multiple, and redundant, repeated or invalid data may exist among the business data, so that the intelligent operation and maintenance method provided by the application performs merging and optimization on all acquired business data after equipment process information collected by the agent platform and application change data uploaded by engineering implementation personnel so as to extract key business and remove the repeated and invalid business data.
According to the intelligent operation and maintenance method provided by the application, the agent platform obtains the business change data, and through unified preprocessing of all the data, the data precision can be effectively improved, the operation pressure of the platform is reduced, and meanwhile, the operation and maintenance accuracy can be improved.
Based on the foregoing embodiments, as an optional embodiment, the comparing the change operation and maintenance policy set with the current operation and maintenance policy set, and determining a new operation and maintenance policy set according to the comparison result includes: under the condition that the comparison result of the changed operation and maintenance strategy set and the current operation and maintenance strategy set is the same, keeping the current operation and maintenance strategy set unchanged; and under the condition that the comparison results of the change operation and maintenance strategy set and the current operation and maintenance strategy set are different, determining the change operation and maintenance strategy set as the new operation and maintenance strategy set.
In the intelligent operation and maintenance method provided by the application, the agent platform is preloaded with the current operation and maintenance policy set, and the AIOps operation and maintenance of the equipment is realized based on the current operation and maintenance policy set.
In the AIOps operation and maintenance process, service change data including equipment process data and application change data uploaded by engineering implementation personnel are collected in real time.
After the business change data are combined and optimized, the historical knowledge base is subjected to associated mining based on the matching rule, the mined operation and maintenance strategies are processed by introducing confidence, and a change operation and maintenance strategy set is constructed.
Further, if the change operation and maintenance policy set is different from the current operation and maintenance policy set, the change operation and maintenance policy set is used as a new operation and maintenance policy set to be loaded to the agent platform for calling; if the changed operation and maintenance strategy set is the same as the current operation and maintenance strategy set, the new operation and maintenance strategy is not introduced, and the current operation and maintenance strategy set is not required to be updated at the moment.
In the intelligent operation and maintenance method provided by the application, in the AIOps implementation process, the association rule mining is carried out on the service data and the intelligent operation and maintenance strategy set, and a historical knowledge base is constructed; and the change of the equipment service is timely received in a heartbeat mode, and the intelligent operation and maintenance strategy set is automatically and accurately matched in the history knowledge base by combining the process information, so that more targeted monitoring and operation and maintenance are realized.
Based on the foregoing embodiment, as an optional embodiment, the operating the device and the service based on the new set of operation and maintenance policies includes: collecting equipment state data, and merging equipment process data in the equipment state data; and calling a corresponding operation and maintenance strategy according to the merged process data of the equipment based on the new operation and maintenance strategy set so as to realize operation and maintenance of the equipment.
With reference to fig. 3, the agent platform provided by the application has the following functions:
one is the collection of data, comprising: collecting equipment process data related to equipment performance index information and process information in real time, and calculating in real time; and receiving application change data related to the equipment application, which is reported by engineering implementation personnel.
And secondly, selecting a function, namely, after receiving application change data and equipment process data, carrying out service merging by the agent platform, then calling an association rule mining algorithm, and carrying out rule matching from a historical knowledge base to obtain a change operation and maintenance strategy set (comprising a set of intelligent operation and maintenance algorithm models). If the changed operation and maintenance strategy is different from the current operation and maintenance strategy set, loading and starting a new operation and maintenance strategy set; otherwise pass.
Thirdly, a monitoring function, namely collecting equipment state data in real time by the agent platform, and processing, cleaning and standardizing process related information; and collecting running state data of the equipment in real time, and calling an operation and maintenance strategy to diagnose the equipment.
According to the intelligent operation and maintenance method provided by the application, the agent technology is utilized to automatically acquire the change of the operation service on the equipment, a targeted and customized operation and maintenance strategy set is precisely matched in the historical knowledge base through the association rule, and is started and operated, and the intelligent operation and maintenance and monitoring of the equipment are carried out through the real-time collected and processed equipment state data, so that the intelligent operation and maintenance method is more reasonable and effective.
Fig. 4 is a schematic structural diagram of the intelligent operation and maintenance device provided by the present application, as shown in fig. 4, mainly including a data acquisition unit 41, a policy invoking unit 42, a policy matching unit 43, and a policy running unit 44, where:
the data acquisition unit 41 is mainly used for acquiring business change data; the policy invoking unit 42 is mainly configured to perform rule matching in the historical knowledge base based on the service variation data, so as to obtain a change operation and maintenance policy set; the policy matching unit 43 is mainly configured to compare the changed operation and maintenance policy set with the current operation and maintenance policy set, and determine a new operation and maintenance policy set according to the comparison result; the policy running unit 44 is mainly configured to perform operation and maintenance on the devices and services based on the new operation and maintenance policy set.
It should be noted that, when the intelligent operation and maintenance device provided in the embodiment of the present application is specifically executed, the intelligent operation and maintenance method described in any one of the above embodiments may be executed, which is not described in detail in this embodiment.
According to the intelligent operation and maintenance device provided by the application, the operation and maintenance strategy set is precisely matched in the historical knowledge base by acquiring the change data of the operation and maintenance service on the equipment, so that the operation and maintenance event related to the service data is more effectively analyzed and positioned by utilizing the targeted and customized operation and maintenance strategy in the operation and maintenance strategy set, decisions are made, and the IT resource utilization rate and the system service stability are improved.
Fig. 5 is a schematic structural diagram of an electronic device according to the present application, and as shown in fig. 5, the electronic device may include: processor 510, communication interface (Communications Interface) 520, memory 530, and communication bus 540, wherein processor 510, communication interface 520, memory 530 complete communication with each other through communication bus 540. Processor 510 may invoke logic instructions in memory 530 to perform the intelligent operation and maintenance method, which includes: acquiring business change data; matching a change operation and maintenance strategy set from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
Further, the logic instructions in the memory 530 described above may be implemented in the form of software functional units and may be stored in a computer-readable storage medium when sold or used as a stand-alone product. Based on this understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
In another aspect, the present application also provides a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions which, when executed by a computer, are capable of performing the intelligent operation and maintenance method provided by the above methods, the method comprising: acquiring business change data; based on the business change data, a change operation and maintenance strategy set is matched from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
In yet another aspect, the present application further provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, is implemented to perform the intelligent operation and maintenance method provided in the above embodiments, the method comprising: acquiring business change data; based on the business change data, a change operation and maintenance strategy set is matched from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
The apparatus embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. Those of ordinary skill in the art will understand and implement the present application without undue burden.
From the above description of the embodiments, it will be apparent to those skilled in the art that the embodiments may be implemented by means of software plus necessary general hardware platforms, or of course may be implemented by means of hardware. Based on this understanding, the foregoing technical solution may be embodied essentially or in a part contributing to the prior art in the form of a software product, which may be stored in a computer readable storage medium, such as ROM/RAM, a magnetic disk, an optical disk, etc., including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method described in the respective embodiments or some parts of the embodiments.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present application, and are not limiting; although the application has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (9)

1. An intelligent operation and maintenance method is characterized by comprising the following steps:
acquiring business change data;
based on the business change data, a change operation and maintenance strategy set is matched from a historical knowledge base;
comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set to determine a new operation and maintenance strategy set;
based on the new operation policy set, operating and maintaining equipment and services;
before rule matching is performed in the historical knowledge base based on the business change data to obtain a change operation and maintenance policy set, the method further comprises the following steps:
acquiring historical operation and maintenance data in an AIOps operation and maintenance implementation process, wherein the historical operation and maintenance data comprises historical service data and a historical operation and maintenance strategy set which is used for operating and maintaining the historical service data and is formed by historical operation and maintenance strategies;
performing data mining on the historical operation and maintenance data to formulate association rules between the service data and the intelligent operation and maintenance strategy, and constructing the historical knowledge base;
the business change data is data which reflects the change of running business on each device and changes, and comprises at least one of network environment data, machine room data, application service data, data related to application processes and device performance index data;
the change operation and maintenance strategy set refers to a set of operation and maintenance strategies matched from a pre-constructed historical knowledge base according to each group of business change data; the operation and maintenance strategy is a strategy for analyzing, positioning or deciding different business change data by a pointer.
2. The intelligent operation and maintenance method according to claim 1, wherein in the case where the service variation data includes application variation data and device process data, the acquiring service variation data includes:
receiving the application change data, wherein the application change data is uploaded by engineering implementation personnel;
and receiving the device process data sent by the process monitoring service unit in a heartbeat mode based on the agent platform.
3. The intelligent operation and maintenance method according to claim 1, wherein said performing rule matching in a historical knowledge base based on said business change data to obtain a change operation and maintenance policy set comprises:
based on the association rule between the service data and the intelligent operation and maintenance strategy, invoking an association rule mining algorithm to mine the historical knowledge base by utilizing the service variation data so as to acquire a frequent item set;
and determining the change operation and maintenance strategy set according to the frequent item set.
4. The intelligent operation and maintenance method according to claim 1, further comprising, before said matching the change operation and maintenance policy set from the historical knowledge base based on the business change data:
and merging the business change data.
5. The intelligent operation and maintenance method according to claim 1, wherein the comparing the changed operation and maintenance policy set with the current operation and maintenance policy set, and determining a new operation and maintenance policy set according to the comparison result, comprises:
under the condition that the comparison result of the changed operation and maintenance strategy set and the current operation and maintenance strategy set is the same, keeping the current operation and maintenance strategy set unchanged;
and under the condition that the comparison results of the change operation and maintenance strategy set and the current operation and maintenance strategy set are different, determining the change operation and maintenance strategy set as the new operation and maintenance strategy set.
6. The intelligent operation and maintenance method according to claim 1, wherein operating and maintaining equipment and services based on the new operation and maintenance policy set comprises:
collecting equipment state data, and merging equipment process data related to the equipment state data;
and calling a corresponding operation and maintenance strategy according to the merged process data of the equipment based on the new operation and maintenance strategy set so as to realize operation and maintenance of the equipment.
7. An intelligent operation and maintenance device, which is characterized by comprising:
the data acquisition unit is used for acquiring business change data; the business change data is data which reflects the change of running business on each device and changes, and comprises at least one of network environment data, machine room data, application service data, data related to application processes and device performance index data;
the policy calling unit is used for matching a change operation and maintenance policy set from the historical knowledge base based on the business change data; before rule matching is performed in the historical knowledge base based on the business change data to obtain a change operation and maintenance policy set, the method further comprises the following steps: acquiring historical operation and maintenance data in an AIOps operation and maintenance implementation process, wherein the historical operation and maintenance data comprises historical service data and a historical operation and maintenance strategy set which is used for operating and maintaining the historical service data and is formed by historical operation and maintenance strategies; performing data mining on the historical operation and maintenance data to formulate association rules between the service data and the intelligent operation and maintenance strategy and constructing the historical knowledge base; the change operation and maintenance strategy set refers to a set of operation and maintenance strategies matched from a pre-constructed historical knowledge base according to each group of business change data; the operation and maintenance strategy is a strategy for analyzing, positioning or deciding different business change data by a pointer;
the strategy matching unit is used for comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set to determine a new operation and maintenance strategy set;
and the policy running unit is used for running and maintaining the equipment and the service based on the new running and maintenance policy set.
8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the intelligent operation and maintenance method steps of any one of claims 1 to 6 when the computer program is executed by the processor.
9. A non-transitory computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the intelligent operation and maintenance method steps of any one of claims 1 to 6.
CN202110850483.4A 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium Active CN113516244B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110850483.4A CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110850483.4A CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN113516244A CN113516244A (en) 2021-10-19
CN113516244B true CN113516244B (en) 2023-12-05

Family

ID=78067628

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110850483.4A Active CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113516244B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114553726B (en) * 2022-02-23 2023-06-02 深圳市众功软件有限公司 Network security operation and maintenance method and system based on functions and resource levels
CN115037513A (en) * 2022-04-27 2022-09-09 锐捷网络股份有限公司 Security policy configuration method and device
CN115051930B (en) * 2022-05-23 2023-05-12 中电信数智科技有限公司 AISecOps combined middle station algorithm-based defect optimization method
CN116882978B (en) * 2023-08-01 2024-04-09 中国船舶科学研究中心 Deep sea submersible operation and maintenance support system based on product information frame

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843210A (en) * 2016-03-22 2016-08-10 清华大学 Power transformer defect information data mining method
CN111159256A (en) * 2019-12-31 2020-05-15 贵州电网有限责任公司 Distribution network information data mining method facing equipment operation and maintenance
CN111190876A (en) * 2019-12-31 2020-05-22 天津浪淘科技股份有限公司 Log management system and operation method thereof
CN111476381A (en) * 2020-04-08 2020-07-31 贵州爱信诺航天信息有限公司 Method and system for operation and maintenance service of innovative application system based on localization information technology

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200160191A1 (en) * 2018-11-19 2020-05-21 International Business Machines Corporation Semi-automated correction of policy rules

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843210A (en) * 2016-03-22 2016-08-10 清华大学 Power transformer defect information data mining method
CN111159256A (en) * 2019-12-31 2020-05-15 贵州电网有限责任公司 Distribution network information data mining method facing equipment operation and maintenance
CN111190876A (en) * 2019-12-31 2020-05-22 天津浪淘科技股份有限公司 Log management system and operation method thereof
CN111476381A (en) * 2020-04-08 2020-07-31 贵州爱信诺航天信息有限公司 Method and system for operation and maintenance service of innovative application system based on localization information technology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
智能运维技术在电信大视频业务中的应用研究;吉锋;刘丽霞;文韬;张晗;;信息通信技术(第01期);全文 *
高效、智能的IT运维体系建设;郑仕辉;;金融电子化(第12期);全文 *

Also Published As

Publication number Publication date
CN113516244A (en) 2021-10-19

Similar Documents

Publication Publication Date Title
CN113516244B (en) Intelligent operation and maintenance method and device, electronic equipment and storage medium
CN111047082B (en) Early warning method and device of equipment, storage medium and electronic device
KR102402845B1 (en) Real-Time Data Processing Method for Digital Twin based Construction Machine Intelligence
CN111629043B (en) Cross-platform health management system based on cloud mode
CN102739802A (en) Service application-oriented IT contralized operation and maintenance analyzing system
CN111290913A (en) Fault location visualization system and method based on operation and maintenance data prediction
CN112769605B (en) Heterogeneous multi-cloud operation and maintenance management method and hybrid cloud platform
CN115225536B (en) Virtual machine abnormality detection method and system based on unsupervised learning
CN106940678B (en) System real-time health degree evaluation and analysis method and device
CN112650200B (en) Method and device for diagnosing plant station equipment faults
CN114640581A (en) Computational power network processing method and device based on digital twins
CN111884859B (en) Network fault diagnosis method and device and readable storage medium
CN114706675A (en) Task deployment method and device based on cloud edge cooperative system
CN108023740A (en) The indicating risk method and apparatus of exception information in monitoring
CN116468423A (en) Operation and maintenance emergency coordination method, system and terminal equipment
CN115604082A (en) Fault diagnosis system based on AIOps
CN114374600A (en) Network operation and maintenance method, device, equipment and product based on big data
CN113065001A (en) Fault loss stopping method and device
CN112711508A (en) Intelligent operation and maintenance service system facing large-scale client system
CN112434079A (en) Secondary equipment abnormity discrimination decision method and device based on big data
CN105606353A (en) Mechanical fault diagnosis method and system
Sharma et al. The fundamentals and strategies of maintenance, repair, and overhaul (MRO) in Industry 4.0
CN114422332B (en) Network slice control method, device, processing equipment and storage medium
CN117971536A (en) Abnormal data processing method and device, electronic equipment and storage medium
CN114422392B (en) Intelligent risk monitoring system and risk monitoring method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant