CN113516244A - Intelligent operation and maintenance method and device, electronic equipment and storage medium - Google Patents


Info

Publication number
CN113516244A
Authority
CN
China
Prior art keywords
maintenance
data
service
strategy set
maintenance strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110850483.4A
Other languages
Chinese (zh)
Other versions
CN113516244B (en)
Inventor
宋广磊
吴统帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengjing Intelligent Technology Jiaxing Co ltd
Original Assignee
Shengjing Intelligent Technology Jiaxing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shengjing Intelligent Technology Jiaxing Co ltd
Priority to CN202110850483.4A
Publication of CN113516244A
Application granted
Publication of CN113516244B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/02 Knowledge representation; Symbolic representation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23 Updating
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/70 Software maintenance or management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/02 Knowledge representation; Symbolic representation
    • G06N5/022 Knowledge engineering; Knowledge acquisition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides an intelligent operation and maintenance method and device, an electronic device, and a storage medium. The method comprises: acquiring service change data; matching a changed operation and maintenance strategy set from a historical knowledge base based on the service change data; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and performing operation and maintenance on devices and services based on the new operation and maintenance strategy set. By acquiring change data for the services running on the devices, the method accurately matches an operation and maintenance strategy set in the historical knowledge base, so that the targeted, customized strategies in that set can be used to analyze, locate, and decide on operation and maintenance events related to the service data more effectively, improving IT resource utilization and the stability of system services.

Description

Intelligent operation and maintenance method and device, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of big data analysis, in particular to an intelligent operation and maintenance method, an intelligent operation and maintenance device, electronic equipment and a storage medium.
Background
In the field of information technology (IT) operation and maintenance in the big data era, artificial intelligence for IT operations (AIOps) combines big data and data mining technologies to provide comprehensive monitoring of hardware devices, automatic fault location, root cause analysis, alarming, and the like, and mature frameworks and technical solutions are available.
Existing AIOps operation and maintenance systems mainly involve three parts. The first is the intelligent operation and maintenance algorithm unit, which uses artificial intelligence algorithms to analyze, locate, and decide on operation and maintenance events according to specific operation and maintenance scenarios, business rules, expert experience, and the like. The second is the automated operation and maintenance tooling, which is built on deterministic logic and performs operations such as control, monitoring, and release on the system. The third is the operation and maintenance big data platform, which collects, processes, stores, and displays the various kinds of operation and maintenance data and serves as a perception and insight system.
The existing intelligent operation and maintenance algorithm unit has no means of autonomously sensing changes in the services running on each device, including changes in the purpose of the device and the applications it runs, and changes in the network environment, the machine room, and the like. As a result, a uniform operation and maintenance strategy is applied to every device, which wastes resources and fails to focus operation, maintenance, and monitoring on what matters.
Disclosure of Invention
The invention provides an intelligent operation and maintenance method, an intelligent operation and maintenance device, electronic equipment and a storage medium, which are used for solving the defects of resource waste and poor operation and maintenance effect caused by the fact that a unified operation and maintenance strategy is implemented on each equipment in the prior art.
In a first aspect, the present invention provides an intelligent operation and maintenance method, including: acquiring service change data; matching a change operation and maintenance strategy set from a historical knowledge base based on the service change data; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
According to the intelligent operation and maintenance method provided by the present invention, where the service change data comprises application change data and device process data, acquiring the service change data comprises: receiving the application change data, which is uploaded by engineering implementers; and receiving the device process data sent by a process monitoring service unit in heartbeat mode.
According to the intelligent operation and maintenance method provided by the present invention, before rule matching is performed in the historical knowledge base based on the service change data to obtain the changed operation and maintenance strategy set, the method further comprises: performing data mining on historical operation and maintenance data to formulate association rules between service data and intelligent operation and maintenance strategies, and constructing the historical knowledge base.
According to the intelligent operation and maintenance method provided by the present invention, performing rule matching in the historical knowledge base based on the service change data to obtain the changed operation and maintenance strategy set comprises: invoking an association rule mining algorithm, based on the association rules between service data and intelligent operation and maintenance strategies, to mine the historical knowledge base using the service change data and obtain frequent item sets; and determining the changed operation and maintenance strategy set from the frequent item sets.
According to the intelligent operation and maintenance method provided by the present invention, before the changed operation and maintenance strategy set is matched from the historical knowledge base based on the service change data, the method further comprises: merging the service change data.
According to the intelligent operation and maintenance method provided by the present invention, comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set and determining a new operation and maintenance strategy set according to the comparison result comprises: when the comparison shows that the changed operation and maintenance strategy set is the same as the current operation and maintenance strategy set, keeping the current operation and maintenance strategy set unchanged; and when the comparison shows that the two sets differ, determining the changed operation and maintenance strategy set as the new operation and maintenance strategy set.
According to the intelligent operation and maintenance method provided by the present invention, performing operation and maintenance on devices and services based on the new operation and maintenance strategy set comprises: acquiring device state data, and merging the device process data in the device state data; and, based on the new operation and maintenance strategy set, invoking the corresponding operation and maintenance strategy according to the merged device process data, so as to perform operation and maintenance on the devices.
In a second aspect, the present invention further provides an intelligent operation and maintenance device, including: the data acquisition unit is used for acquiring service change data; the strategy calling unit is used for matching out a change operation and maintenance strategy set from a historical knowledge base based on the service change data; the strategy matching unit is used for comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set and determining a new operation and maintenance strategy set according to the comparison result; and the strategy operation unit is used for operating and maintaining the equipment and the service based on the new operation and maintenance strategy set.
In a third aspect, the present invention provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the steps of the intelligent operation and maintenance method according to any one of the above aspects.
In a fourth aspect, the present invention further provides a non-transitory computer readable storage medium, on which a computer program is stored, the computer program, when executed by a processor, implementing the steps of the intelligent operation and maintenance method as described in any one of the above.
According to the intelligent operation and maintenance method, the intelligent operation and maintenance device, the electronic equipment and the storage medium, the operation and maintenance strategy set is accurately matched in the historical knowledge base by acquiring the change data of the operation service on the equipment, and the targeted and customized operation and maintenance strategy in the operation and maintenance strategy set is utilized, so that the operation and maintenance events related to the service data can be more effectively analyzed and positioned, the decision is made, and the IT resource utilization rate and the system service stability are improved.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart of an intelligent operation and maintenance method provided by the present invention;
FIG. 2 is a second schematic flow chart of the intelligent operation and maintenance method provided by the present invention;
FIG. 3 is a third schematic flow chart of the intelligent operation and maintenance method provided by the present invention;
FIG. 4 is a schematic structural diagram of an intelligent operation and maintenance device provided by the present invention;
FIG. 5 is a schematic structural diagram of an electronic device provided by the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that, in the description of the embodiments of the present invention, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element. Terms such as "upper" and "lower" indicate orientations or positional relationships based on those shown in the drawings; they are used only for convenience and simplicity of description, do not indicate or imply that the referenced devices or elements must have a specific orientation or be constructed and operated in a specific orientation, and therefore should not be construed as limiting the present invention. Unless expressly stated or limited otherwise, the terms "mounted," "coupled," and "connected" are to be interpreted broadly: a connection may, for example, be fixed, detachable, or integral; it may be mechanical or electrical; and it may be direct, indirect through an intervening medium, or an internal communication between two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific situation.
The terms "first," "second," and the like in the description and claims of the present application are used to distinguish between similar objects and not necessarily to describe a particular order or sequence. It should be understood that terms so used are interchangeable under appropriate circumstances, so that the embodiments of the invention can be practiced in orders other than those illustrated or described herein. Objects identified as "first," "second," and so on are generally of one class, and their number is not limited; for example, a first object may be one or more objects. In addition, "and/or" in the specification and claims means at least one of the connected objects, and the character "/" generally indicates that the objects before and after it are in an "or" relationship.
With current technology, AIOps operation and maintenance can mainly provide the following functions:
1) It identifies system anomalies and latent faults and gives timely early warning of possible anomalies or latent faults, which can significantly reduce the probability of failures and relieve pressure on IT operation and maintenance.
2) It finds and locates the performance bottlenecks of each application system in time and quickly locates the root cause of each fault, improving the stability of system services.
3) It determines the data relationship between online services and device resource capacity, giving managers a basis for accurate decisions; it can also provide resource early warning and optimize resource utilization, including improving the utilization of an enterprise's existing IT resources.
The intelligent operation and maintenance method, device, electronic device, and storage medium provided by the present invention build on the basic AIOps operation and maintenance framework, but differ from it in that different operation and maintenance strategies are invoked according to the services running on each device.
The intelligent operation and maintenance method, apparatus, electronic device and storage medium provided by the present invention are described below with reference to fig. 1 to 5.
Fig. 1 is a schematic flow diagram of an intelligent operation and maintenance method provided by the present invention, as shown in fig. 1, including, but not limited to, the following steps:
Step 101: Acquire service change data.
The service change data is data that reflects changes to the services running on each device, for example changed network environment data, machine room data, application service data (such as data on upgrades, deployment, and implementation), application process data, and device performance index data. The present invention does not specifically limit this.
The service change data may be acquired in heartbeat mode, by collecting service-related information in real time, by collecting information reported by engineering implementers, and so on; the present invention does not specifically limit the acquisition method.
Step 102: Match a changed operation and maintenance strategy set from the historical knowledge base based on the service change data.
An operation and maintenance strategy set is a set of operation and maintenance strategies matched from the pre-constructed historical knowledge base for each group of service change data. An operation and maintenance strategy is a strategy that applies an appropriate artificial intelligence algorithm to analyze, locate, or decide (for example, anomaly detection or intelligent early warning) according to the operation and maintenance scenario, business rules, or expert experience associated with different service change data. In a narrow sense, each operation and maintenance strategy can be regarded as an intelligent operation and maintenance algorithm model.
The historical knowledge base stores, in advance, the operation and maintenance strategies corresponding to the various kinds of service change data.
In step 102, for all the service change data collected in step 101, the corresponding operation and maintenance strategies are matched from the historical knowledge base according to preset matching rules and assembled into an operation and maintenance strategy set, referred to as the changed operation and maintenance strategy set.
Step 103: Compare the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determine a new operation and maintenance strategy set according to the comparison result.
After the changed operation and maintenance strategy set has been matched from the historical knowledge base according to the currently acquired service change data, it is compared with the current operation and maintenance strategy set running in the system to determine whether the current set needs to be updated.
For example, when the changed operation and maintenance strategy set contains a new operation and maintenance strategy (whose root cause is the appearance of new service data) that has not been loaded into the current operation and maintenance strategy set, that strategy needs to be loaded to form the new operation and maintenance strategy set.
Step 104: Perform operation and maintenance on devices and services based on the new operation and maintenance strategy set.
After the current operation and maintenance strategy set has been updated through the above steps, the new operation and maintenance strategy set is used to control, monitor, and bring online each device in the system, thereby realizing intelligent operation and maintenance.
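Steps 101 to 104 can be illustrated with a minimal Python sketch. The sketch rests on assumptions that are not in the text: the knowledge base is modeled as a plain dictionary from a service-change item to strategy names, strategies are identified by strings, and `apply` is a hypothetical hook that runs one strategy.

```python
from typing import Callable, Dict, FrozenSet, Iterable, Set

# Hypothetical knowledge base shape: service-change item -> strategy names.
KnowledgeBase = Dict[str, Set[str]]


def match_strategies(changes: Iterable[str], kb: KnowledgeBase) -> FrozenSet[str]:
    """Step 102: collect every strategy associated with any observed change."""
    matched: Set[str] = set()
    for item in changes:
        matched |= kb.get(item, set())
    return frozenset(matched)


def run_cycle(
    changes: Iterable[str],        # Step 101: service change data already acquired
    kb: KnowledgeBase,
    current: FrozenSet[str],
    apply: Callable[[str], None],  # hypothetical hook that runs one strategy
) -> FrozenSet[str]:
    changed = match_strategies(changes, kb)
    # Step 103: keep the current set if nothing differs, else adopt the changed set.
    new_set = current if changed == current else changed
    # Step 104: operate and maintain using every strategy in the new set.
    for name in sorted(new_set):
        apply(name)
    return new_set
```

Calling `run_cycle` once per batch of service change data and feeding its return value back in as `current` gives the update loop described above.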
According to the intelligent operation and maintenance method provided by the invention, the operation and maintenance strategy set is accurately matched in the historical knowledge base by acquiring the change data of the operation service on the equipment, so that the operation and maintenance events related to the service data are more effectively analyzed and positioned by utilizing the targeted and customized operation and maintenance strategy in the operation and maintenance strategy set, and the decision is made, so that the IT resource utilization rate and the system service stability are improved.
Based on the content of the foregoing embodiment, as an optional embodiment, in a case that the service change data includes application change data and device process data, the acquiring the service change data includes: receiving the application change data, wherein the application change data is uploaded by engineering implementers; and receiving the equipment process data sent by the process monitoring service unit in a heartbeat mode based on the agent platform.
Fig. 2 is a second schematic flow chart of the intelligent operation and maintenance method provided by the present invention. As shown in Fig. 2, the service change data of the present invention is acquired through the following two channels, both of which are handled by the agent platform. The agent platform is the carrier that executes the agent technology.
An agent is a computer system encapsulated in some environment. It is an abstract entity with autonomy, sociality, reactivity, and initiative: it can run independently and autonomously in its environment, acts on the environment and is influenced by it, and can continuously acquire knowledge from the environment to improve its own capability.
On the one hand, so that changes to the application services on a device are known in time, the present invention integrates operation and maintenance with go-live management and provides an open management function: when engineering implementers manage the go-live of a device, upgrade the applications on it, or deploy and implement new applications, the related application change data is automatically uploaded to the agent platform so that the service data is shared.
On the other hand, the agent's process monitoring service unit collects device process information in real time to obtain the device process data. By monitoring the device process information, the services corresponding to the application service processes can be analyzed, and the service content running on the device can thus be determined.
It should be noted that, because the device process information related to the service content changes slowly, the present invention uses a heartbeat service to send the device process data to the agent platform in real time, rather than having the agent platform frequently query and retrieve the device process data.
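The push-style heartbeat can be sketched as follows. This is only an illustration under stated assumptions: the class names `Heartbeat`, `AgentPlatform`, and `ProcessMonitor` are hypothetical, the process list comes from an injected callable rather than from the operating system, and networking and scheduling are left out.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class Heartbeat:
    device_id: str
    processes: List[str]
    sent_at: float


@dataclass
class AgentPlatform:
    """Receives pushed heartbeats instead of polling each device."""
    latest: Dict[str, Heartbeat] = field(default_factory=dict)

    def receive(self, beat: Heartbeat) -> bool:
        """Store the heartbeat; return True if the device's process set changed."""
        previous = self.latest.get(beat.device_id)
        self.latest[beat.device_id] = beat
        return previous is None or set(previous.processes) != set(beat.processes)


class ProcessMonitor:
    """Hypothetical process monitoring service unit running on one device."""

    def __init__(
        self,
        device_id: str,
        list_processes: Callable[[], Iterable[str]],  # injected process source
        platform: AgentPlatform,
        clock: Callable[[], float] = time.time,
    ) -> None:
        self.device_id = device_id
        self.list_processes = list_processes
        self.platform = platform
        self.clock = clock

    def beat(self) -> bool:
        """Push one heartbeat carrying the current process list."""
        beat = Heartbeat(self.device_id, list(self.list_processes()), self.clock())
        return self.platform.receive(beat)
```

In a deployment, `beat` would be called on a timer; because the process set changes slowly, most heartbeats report no change and the platform only needs to react when `receive` returns True.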
In addition, the main idea of the intelligent operation and maintenance method provided by the present invention is to invoke different operation and maintenance strategies according to the different services running on the devices, so that AIOps operation and maintenance is more targeted and device resources are used more effectively. The method is therefore suitable for both distributed and centralized device operation and maintenance scenarios and for deployments of any scale. The intelligent operation and maintenance agent platform is also agnostic to the device type: whether the managed device is a server, a camera, or another terminal, the platform works normally once it has been remotely configured on that device.
The intelligent operation and maintenance method provided by the present invention is service-driven. Based on agent principles and technology, it performs real-time computation over big data through data sharing and interface services, and combines this with technologies such as data mining and deep learning for use in intelligent operation and maintenance, which can effectively improve the utilization of device resources.
Based on the content of the foregoing embodiment, as an optional embodiment, before performing rule matching in a historical knowledge base based on the service change data to obtain a change operation and maintenance policy set, the method further includes: and carrying out data mining on the historical operation and maintenance data to formulate an association rule between the business data and the intelligent operation and maintenance strategy, and constructing the historical knowledge base.
In the intelligent operation and maintenance method provided by the invention, the historical knowledge base can be pre-constructed and stored in the system memory for real-time calling of the agent platform.
The historical knowledge base can be constructed as follows:
Historical operation and maintenance data from the AIOps operation and maintenance process is acquired. This data comprises historical service data and the historical operation and maintenance strategy sets, each composed of historical operation and maintenance strategies, that were used to operate and maintain that service data.
Association rule mining is performed on the historical service data and historical operation and maintenance strategies in the historical operation and maintenance data, association rules between each type of service data and the operation and maintenance strategies are established, and the historical knowledge base is constructed.
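The construction described above can be sketched in Python. The sketch assumes a much simpler rule form than a full association rule miner: each rule links one service data item to one strategy, and a rule is kept when its support and confidence reach thresholds. The record layout, the function name `build_knowledge_base`, and the threshold values are all hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Set, Tuple

# Hypothetical history record: (service data items, strategies that were used).
Record = Tuple[Set[str], Set[str]]


def build_knowledge_base(
    records: Iterable[Record],
    min_support: float = 0.2,
    min_confidence: float = 0.6,
) -> Dict[str, Dict[str, float]]:
    """Return {service item: {strategy: confidence}} for rules above both thresholds."""
    records = list(records)
    total = len(records)
    item_count: Counter = Counter()
    pair_count: Counter = Counter()
    for items, strategies in records:
        for item in items:
            item_count[item] += 1
            for strategy in strategies:
                pair_count[(item, strategy)] += 1

    kb: Dict[str, Dict[str, float]] = {}
    for (item, strategy), n in pair_count.items():
        support = n / total                # share of records containing both
        confidence = n / item_count[item]  # share of the item's records using the strategy
        if support >= min_support and confidence >= min_confidence:
            kb.setdefault(item, {})[strategy] = confidence
    return kb
```

Re-running the function on an extended history is one simple way to keep the knowledge base up to date as new operation and maintenance data accumulates.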
It should be noted that after the historical knowledge base is constructed, in the AIOps operation and maintenance process, a relevant operation and maintenance policy set may be determined based on high-frequency problems and root cause analysis caused by the application service, the machine room, the network, and the like operated by the device, association relationships between the device, the service, and the operation and maintenance policy set may be established, and the historical knowledge base may be continuously updated according to the relationships.
According to the intelligent operation and maintenance method provided by the present invention, because the historical knowledge base is constructed in advance, the agent platform can promptly match the corresponding operation and maintenance strategy set from the historical knowledge base according to the current service change data during AIOps operation and maintenance. Operation, maintenance, and monitoring of devices and services is thus achieved quickly, with fast response and a better operation and maintenance effect.
Based on the content of the foregoing embodiment, as an optional embodiment, the performing rule matching in a historical knowledge base based on the service change data to obtain a change operation and maintenance policy set includes: calling an association rule mining algorithm based on an association rule between the service data and the intelligent operation and maintenance strategy so as to mine the historical knowledge base by using the service change data and obtain a frequent item set; and determining the change operation and maintenance strategy set according to the frequent item set.
Fig. 3 is a third schematic flow chart of the intelligent operation and maintenance method provided by the present invention. As shown in Fig. 3, the method uses agent technology to acquire the service change data on a device automatically, and applies an association mining algorithm, for example the FP-Growth algorithm, in the historical knowledge base to mine the historical operation and maintenance data, as follows:
Association rules between service data and intelligent operation and maintenance strategies are constructed for mining, and frequent items are obtained by introducing a confidence measure based on the causal relationship between the service data and the intelligent operation and maintenance strategies. The intelligent operation and maintenance strategies to be started are thereby accurately matched in the historical knowledge base; that is, the changed operation and maintenance strategy set corresponding to the service change data is matched automatically.
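As an illustration of matching by support and confidence, the sketch below counts item sets by brute force over a list of historical transactions. It is not FP-Growth, which the text names only as an example algorithm; it merely shows the two measures at work. The transaction layout (service items and strategy names mixed in one set), the function names, and the thresholds are assumptions.

```python
from typing import FrozenSet, Iterable, List, Set


def support(itemset: FrozenSet[str], transactions: List[FrozenSet[str]]) -> float:
    """Fraction of transactions that contain every element of itemset."""
    if not transactions:
        return 0.0
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def match_changed_strategies(
    observed: Iterable[str],             # current service change items
    transactions: List[FrozenSet[str]],  # history: service items plus strategies
    strategies: Set[str],                # names that denote strategies
    min_support: float = 0.2,
    min_confidence: float = 0.7,
) -> Set[str]:
    """Strategies s where (observed + s) is frequent and observed -> s is confident."""
    antecedent = frozenset(observed)
    base = support(antecedent, transactions)
    if base == 0.0:
        return set()  # the observed combination never occurred in the history
    matched: Set[str] = set()
    for s in strategies:
        joint = support(antecedent | {s}, transactions)
        if joint >= min_support and joint / base >= min_confidence:
            matched.add(s)
    return matched
```

For a large history, a dedicated frequent-pattern algorithm avoids rescanning all transactions for every candidate strategy, which is the cost this brute-force version pays.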
After the current operation and maintenance strategy set is updated by the changed operation and maintenance strategy set, the related operation and maintenance strategies in the new operation and maintenance strategy set can be called to carry out intelligent operation, maintenance and monitoring on the equipment through the equipment state data collected and processed in real time.
The intelligent operation and maintenance method provided by the invention uses the association rule mining algorithm to analyze the high-frequency problems and root causes of the equipment for the application services it runs and for conditions such as the machine room and the network, and thereby determines the corresponding operation and maintenance strategy set. This establishes the relation among the equipment, the service and the operation and maintenance strategy set, so that operation and maintenance strategies can be provided in a targeted and customized manner according to different service data, and the operation, maintenance and monitoring of the equipment can be realized more reasonably and effectively.
Based on the content of the foregoing embodiment, as an optional embodiment, after the obtaining the service change data, the method further includes: merging the service change data to obtain key service data, and matching the change operation and maintenance strategy set from the historical knowledge base by using the key service data.
As shown in fig. 2, the present invention acquires the service change data through multiple channels, so redundant, repeated or invalid data may exist among the service data. Therefore, after obtaining the device process information collected by the agent platform and the application change data uploaded by the engineering implementers, the intelligent operation and maintenance method provided by the present invention merges and optimizes all the acquired service data, so as to extract the key service data and remove the repeated and invalid service data.
According to the intelligent operation and maintenance method provided by the invention, the agent platform acquires the service change data, and all the data are subjected to unified preprocessing, so that the data precision can be effectively improved, the operation pressure of the platform is reduced, and the operation and maintenance accuracy can be improved.
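The merging step can be sketched as follows. This is only an illustration under assumptions: the record fields (`service`, `timestamp`), the rule that a record without a service name is invalid, and the rule that the newest record for a service wins are all hypothetical choices, since the text does not specify how records are merged.

```python
def merge_service_changes(process_records, application_records):
    """Merge both acquisition channels, dropping invalid and repeated records."""
    merged = {}
    for record in list(process_records) + list(application_records):
        # Normalise the service name so "MySQL" and "mysql" count as repeats.
        name = (record.get("service") or "").strip().lower()
        if not name:
            continue  # invalid record: no service name (assumed rule)
        previous = merged.get(name)
        # Keep the most recent record for each service (assumed rule).
        if previous is None or record.get("timestamp", 0) >= previous.get("timestamp", 0):
            merged[name] = {**record, "service": name}
    return [merged[key] for key in sorted(merged)]


# Hypothetical inputs: process data from the agent and uploaded application changes.
process_records = [
    {"service": "MySQL", "timestamp": 1},
    {"service": "mysql", "timestamp": 3},
    {"service": "", "timestamp": 2},
]
application_records = [
    {"service": "nginx", "timestamp": 2},
    {"service": None},
]
key_service_data = merge_service_changes(process_records, application_records)
```

Here the two MySQL records collapse into one and the two records without a service name are dropped, which leaves two key service records for the matching step.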
Based on the content of the foregoing embodiment, as an optional embodiment, comparing the changed operation and maintenance policy set with the current operation and maintenance policy set, and determining a new operation and maintenance policy set according to the comparison result, includes: when the comparison result indicates that the changed operation and maintenance strategy set is the same as the current operation and maintenance strategy set, keeping the current operation and maintenance strategy set unchanged; and when the comparison result indicates that the changed operation and maintenance strategy set is different from the current operation and maintenance strategy set, determining the changed operation and maintenance strategy set as the new operation and maintenance strategy set.
In the intelligent operation and maintenance method provided by the invention, the agent platform is pre-loaded with the current operation and maintenance strategy set, and the AIOps operation and maintenance of the equipment is realized based on the current operation and maintenance strategy set.
In the process of AIOps operation and maintenance, business change data including equipment process data and application change data uploaded by engineering implementation personnel are collected in real time.
After the service change data is merged and optimized, association mining is performed on the historical knowledge base based on the matching rules, the mined operation and maintenance strategies are processed by introducing confidence, and the change operation and maintenance strategy set is constructed.
Further, if the changed operation and maintenance strategy set is different from the current operation and maintenance strategy set, loading the changed operation and maintenance strategy set as a new operation and maintenance strategy set to the agent platform for calling; if the changed operation and maintenance strategy set is the same as the current operation and maintenance strategy set, it indicates that a new operation and maintenance strategy is not introduced, and at this time, the current operation and maintenance strategy set does not need to be updated.
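The comparison rule described above amounts to a set-equality check. The following minimal sketch assumes that each strategy is identified by a name string and that the order of strategies in a set does not matter; the function name and return shape are hypothetical.

```python
def select_strategy_set(current, changed):
    """Return (strategy_set_to_use, reload_needed).

    The changed set replaces the current one only when the two differ.
    """
    current_set, changed_set = frozenset(current), frozenset(changed)
    if changed_set == current_set:
        return current_set, False  # nothing new was introduced, keep as is
    return changed_set, True       # load the changed set as the new set


same = select_strategy_set(["cpu_check", "disk_io"], ["disk_io", "cpu_check"])
different = select_strategy_set(["cpu_check"], ["cpu_check", "db_slow_query"])
```

Returning the reload flag alongside the set lets the caller skip restarting strategies when nothing has changed.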
In the intelligent operation and maintenance method provided by the invention, in the implementation process of AIOps, association rule mining is carried out on business data and an intelligent operation and maintenance strategy set, and a historical knowledge base is constructed; and then, the change of the equipment service is received in time in a heartbeat mode, and an intelligent operation and maintenance strategy set is independently and accurately matched in a historical knowledge base by combining process information, so that more targeted monitoring and operation and maintenance are realized.
Based on the content of the foregoing embodiment, as an optional embodiment, the performing operation and maintenance on the device and the service based on the new operation and maintenance policy set includes: acquiring equipment state data, and merging equipment process data in the equipment state data; and calling a corresponding operation and maintenance strategy according to the merged device process data based on the new operation and maintenance strategy set so as to realize the operation and maintenance of the device.
Referring to fig. 3, the agent platform provided by the present invention has the following three functions:
The first is a data collection function, which includes: collecting, and calculating in real time, the equipment process data related to the equipment performance index information and process information; and receiving the application change data related to the equipment applications and reported by the engineering implementers.
The second is a selection function: after receiving the application change data and the equipment process data, the agent platform merges the services, then calls the association rule mining algorithm and performs rule matching in the historical knowledge base to obtain the change operation and maintenance strategy set (including a set of intelligent operation and maintenance algorithm models). If the changed operation and maintenance strategy set is different from the current operation and maintenance strategy set, the new operation and maintenance strategy set is loaded and started; otherwise, no action is taken.
The third is a monitoring function: the agent platform collects the equipment state data in real time, and processes, cleans and standardizes the process-related information; it also collects the running state data of the equipment in real time and calls the operation and maintenance strategies to diagnose the equipment.
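As a rough illustration of the monitoring function, the sketch below calls each strategy in the active set against one snapshot of device state data. The strategy names, the state fields (`cpu_percent`, `disk_free_gb`), the thresholds and the registry are hypothetical; the text does not define how strategies are represented or invoked.

```python
from typing import Callable, Dict, Iterable, List, Optional, Tuple

# A strategy inspects device state and returns a finding, or None if healthy.
Strategy = Callable[[dict], Optional[str]]


def high_cpu(state: dict) -> Optional[str]:
    return "cpu above 90%" if state.get("cpu_percent", 0) > 90 else None


def low_disk(state: dict) -> Optional[str]:
    return "less than 5 GB free" if state.get("disk_free_gb", 100) < 5 else None


# Hypothetical registry mapping strategy names to diagnostic functions.
REGISTRY: Dict[str, Strategy] = {"high_cpu": high_cpu, "low_disk": low_disk}


def run_strategies(
    strategy_set: Iterable[str], registry: Dict[str, Strategy], device_state: dict
) -> List[Tuple[str, str]]:
    """Call every known strategy in the set and collect its findings."""
    findings = []
    for name in sorted(strategy_set):
        check = registry.get(name)
        if check is None:
            continue  # strategy not installed on this agent (assumed behaviour)
        result = check(device_state)
        if result is not None:
            findings.append((name, result))
    return findings


findings = run_strategies(
    {"high_cpu", "low_disk", "unknown"},
    REGISTRY,
    {"cpu_percent": 97, "disk_free_gb": 40},
)
```

Only the strategies in the active set are executed, which mirrors the idea that the agent runs a targeted set chosen for the services on the device instead of every available check.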
The intelligent operation and maintenance method provided by the invention utilizes agent technology to automatically acquire the change of the operation service on the equipment, accurately matches a targeted and customized operation and maintenance strategy set in a historical knowledge base through association rules, starts and operates the operation and maintenance strategy set, and intelligently operates and monitors the equipment through equipment state data collected and processed in real time, so that the method is more reasonable and effective.
Fig. 4 is a schematic structural diagram of the intelligent operation and maintenance device provided by the present invention. As shown in fig. 4, the intelligent operation and maintenance device mainly includes a data acquisition unit 41, a policy invoking unit 42, a policy matching unit 43, and a policy operating unit 44, where:
the data acquisition unit 41 is mainly used for acquiring service change data; the policy invoking unit 42 is mainly configured to perform rule matching in the historical knowledge base based on the service change data to obtain a change operation and maintenance policy set; the policy matching unit 43 is mainly configured to compare the changed operation and maintenance policy set with the current operation and maintenance policy set, and determine a new operation and maintenance policy set according to a comparison result; the policy operation unit 44 is mainly configured to operate and maintain the devices and services based on the new operation and maintenance policy set.
It should be noted that, during specific operation, the intelligent operation and maintenance device provided in the embodiment of the present invention may execute the intelligent operation and maintenance method described in any of the above embodiments, which is not described in detail in this embodiment.
According to the intelligent operation and maintenance device provided by the invention, the operation and maintenance strategy set is accurately matched in the historical knowledge base by acquiring the change data of the operation service on the equipment, so that the operation and maintenance events related to the service data are more effectively analyzed and positioned by utilizing the targeted and customized operation and maintenance strategy in the operation and maintenance strategy set, and the decision is made, so that the IT resource utilization rate and the system service stability are improved.
Fig. 5 is a schematic structural diagram of an electronic device provided in the present invention. As shown in fig. 5, the electronic device may include: a processor 510, a communications interface 520, a memory 530 and a communication bus 540, wherein the processor 510, the communications interface 520 and the memory 530 communicate with each other via the communication bus 540. The processor 510 may invoke logic instructions in the memory 530 to perform an intelligent operation and maintenance method comprising: acquiring service change data; matching a change operation and maintenance strategy set from a historical knowledge base; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
Furthermore, the logic instructions in the memory 530 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention also provides a computer program product, which includes a computer program stored on a non-transitory computer-readable storage medium, the computer program including program instructions, when the program instructions are executed by a computer, the computer being capable of executing the intelligent operation and maintenance method provided by the above methods, the method including: acquiring service change data; matching a change operation and maintenance strategy set from a historical knowledge base based on the service change data; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
In yet another aspect, the present invention further provides a non-transitory computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, performs the intelligent operation and maintenance method provided in the foregoing embodiments, the method including: acquiring service change data; matching a change operation and maintenance strategy set from a historical knowledge base based on the service change data; comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set, and determining a new operation and maintenance strategy set according to the comparison result; and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. An intelligent operation and maintenance method is characterized by comprising the following steps:
acquiring service change data;
matching a change operation and maintenance strategy set from a historical knowledge base based on the service change data;
comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set to determine a new operation and maintenance strategy set;
and carrying out operation and maintenance on the equipment and the service based on the new operation and maintenance strategy set.
2. The intelligent operation and maintenance method according to claim 1, wherein in a case that the service change data includes application change data and device process data, the obtaining service change data includes:
receiving the application change data, wherein the application change data is uploaded by engineering implementers;
and receiving the equipment process data sent by the process monitoring service unit in a heartbeat mode based on the agent platform.
3. The intelligent operation and maintenance method according to claim 1, before performing rule matching in a historical knowledge base based on the service change data to obtain a change operation and maintenance policy set, further comprising:
and carrying out data mining on the historical operation and maintenance data to formulate an association rule between the business data and the intelligent operation and maintenance strategy, and constructing the historical knowledge base.
4. The intelligent operation and maintenance method according to claim 3, wherein the performing rule matching in a historical knowledge base based on the service change data to obtain a change operation and maintenance policy set comprises:
calling an association rule mining algorithm based on an association rule between the service data and the intelligent operation and maintenance strategy so as to mine the historical knowledge base by using the service change data and obtain a frequent item set;
and determining the change operation and maintenance strategy set according to the frequent item set.
5. The intelligent operation and maintenance method according to claim 1, wherein before the matching out of the change operation and maintenance policy set from the historical knowledge base based on the service change data, further comprising:
and merging the service change data.
6. The intelligent operation and maintenance method according to claim 1, wherein the comparing the changed operation and maintenance policy set with the current operation and maintenance policy set, and determining a new operation and maintenance policy set according to a comparison result comprises:
in a case where the comparison result indicates that the changed operation and maintenance strategy set is the same as the current operation and maintenance strategy set, keeping the current operation and maintenance strategy set unchanged;
and in a case where the comparison result indicates that the changed operation and maintenance strategy set is different from the current operation and maintenance strategy set, determining the changed operation and maintenance strategy set as the new operation and maintenance strategy set.
7. The intelligent operation and maintenance method according to claim 1, wherein the operation and maintenance of the device and the service based on the new operation and maintenance policy set comprises:
acquiring equipment state data, and merging equipment process data related to the equipment state data;
and calling a corresponding operation and maintenance strategy according to the merged device process data based on the new operation and maintenance strategy set so as to realize the operation and maintenance of the device.
8. An intelligent operation and maintenance device, comprising:
the data acquisition unit is used for acquiring service change data;
the strategy calling unit is used for matching out a change operation and maintenance strategy set from a historical knowledge base based on the service change data;
the strategy matching unit is used for comparing the changed operation and maintenance strategy set with the current operation and maintenance strategy set to determine a new operation and maintenance strategy set;
and the strategy operation unit is used for operating and maintaining the equipment and the service based on the new operation and maintenance strategy set.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the intelligent operation and maintenance method steps according to any one of claims 1 to 7 when executing the computer program.
10. A non-transitory computer readable storage medium having a computer program stored thereon, wherein the computer program when executed by a processor implements the intelligent operation and maintenance method steps according to any one of claims 1 to 7.
CN202110850483.4A 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium Active CN113516244B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110850483.4A CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110850483.4A CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN113516244A true CN113516244A (en) 2021-10-19
CN113516244B CN113516244B (en) 2023-12-05

Family

ID=78067628

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110850483.4A Active CN113516244B (en) 2021-07-27 2021-07-27 Intelligent operation and maintenance method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113516244B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843210A (en) * 2016-03-22 2016-08-10 清华大学 Power transformer defect information data mining method
US20200160191A1 (en) * 2018-11-19 2020-05-21 International Business Machines Corporation Semi-automated correction of policy rules
CN111159256A (en) * 2019-12-31 2020-05-15 贵州电网有限责任公司 Distribution network information data mining method facing equipment operation and maintenance
CN111190876A (en) * 2019-12-31 2020-05-22 天津浪淘科技股份有限公司 Log management system and operation method thereof
CN111476381A (en) * 2020-04-08 2020-07-31 贵州爱信诺航天信息有限公司 Method and system for operation and maintenance service of innovative application system based on localization information technology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
吉锋;刘丽霞;文韬;张晗;: "智能运维技术在电信大视频业务中的应用研究", 信息通信技术, no. 01 *
郑仕辉;: "高效、智能的IT运维体系建设", 金融电子化, no. 12 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114553726A (en) * 2022-02-23 2022-05-27 深圳市众功软件有限公司 Network security operation and maintenance method and system based on function and resource level
CN115037513A (en) * 2022-04-27 2022-09-09 锐捷网络股份有限公司 Security policy configuration method and device
CN115051930A (en) * 2022-05-23 2022-09-13 中电信数智科技有限公司 Defect optimization method based on AISeOps combined with middleware algorithm
CN115051930B (en) * 2022-05-23 2023-05-12 中电信数智科技有限公司 AISecOps combined middle station algorithm-based defect optimization method
CN116882978A (en) * 2023-08-01 2023-10-13 中国船舶科学研究中心 Deep sea submersible operation and maintenance support platform based on product information frame
CN116882978B (en) * 2023-08-01 2024-04-09 中国船舶科学研究中心 Deep sea submersible operation and maintenance support system based on product information frame

Also Published As

Publication number Publication date
CN113516244B (en) 2023-12-05

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant