WO2023045417A1 - Fault knowledge graph construction method and apparatus - Google Patents

Fault knowledge graph construction method and apparatus Download PDF

Info

Publication number
WO2023045417A1
WO2023045417A1 PCT/CN2022/098679 CN2022098679W WO2023045417A1 WO 2023045417 A1 WO2023045417 A1 WO 2023045417A1 CN 2022098679 W CN2022098679 W CN 2022098679W WO 2023045417 A1 WO2023045417 A1 WO 2023045417A1
Authority
WO
WIPO (PCT)
Prior art keywords
alarm
relationship
work order
candidate set
fault
Prior art date
Application number
PCT/CN2022/098679
Other languages
French (fr)
Chinese (zh)
Inventor
姜磊
严浩
徐代刚
杜贤俊
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023045417A1 publication Critical patent/WO2023045417A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition

Definitions

  • Embodiments of the present disclosure relate to the communication field, and in particular, to a method and device for constructing a fault knowledge graph.
  • knowledge extraction is an essential process. It extracts knowledge from different data sources and data of different structures to form a data structure for fault knowledge map understanding, such as the Resource Description Framework (Resource Description Framework, Referred to as RDF) triples, and finally merged into the fault knowledge map.
  • RDF Resource Description Framework
  • the embodiment of the present disclosure provides a method and device for constructing a fault knowledge map, to at least solve the problem of relying on the mapping conversion of the rule base and the knowledge base in related technologies, which cannot quickly adapt to new business and new scenarios, and relying on manual knowledge extraction is not only relatively inefficient , the quality cannot be guaranteed.
  • a fault knowledge map construction method including:
  • the target candidate set is integrated into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  • a device for constructing a fault knowledge map including:
  • the first extraction module is configured to extract entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
  • the second extraction module is configured to extract entities and relationships from real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
  • the conflict detection module is configured to perform conflict detection according to the first candidate set and the second candidate set to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
  • the update module is configured to integrate the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  • a computer-readable storage medium where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
  • an electronic device including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
  • the entity and relationship extraction is performed on the rule base to obtain the first candidate set including the alarm relationship instance and the work order dispatch instance; the entity and relationship extraction is performed on the real-time data to obtain the alarm association relationship and the work order dispatch relationship the second candidate set; perform conflict detection according to the first candidate set and the second candidate set, and obtain a target candidate set including effective alarm correlation and work order dispatch knowledge; integrate the target candidate set into the
  • the fault knowledge graph constructed by the rule base can obtain an updated fault knowledge graph, which can solve the problem in related technologies that relies on the mapping conversion of the rule base and the knowledge base, and cannot quickly adapt to new business and new scenarios.
  • the combination of entity and relationship extraction updates the fault knowledge graph, enabling the fault knowledge graph to quickly adapt to new business and new scenarios.
  • FIG. 1 is a block diagram of a hardware structure of a mobile terminal in a method for constructing a fault knowledge map according to an embodiment of the present disclosure
  • FIG. 2 is a flow chart of a fault knowledge map construction method according to an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a fault knowledge graph RDF according to an embodiment of the present disclosure.
  • Fig. 4 is a schematic diagram of knowledge extraction in a fault knowledge map according to an embodiment of the present disclosure
  • Fig. 5 is a schematic diagram of an alarm correlation rule according to an embodiment of the present disclosure.
  • Fig. 6 is a schematic diagram of work order dispatching rules according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram (1) of real-time work order fault handling according to an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram (2) of real-time work order fault handling according to an embodiment of the present disclosure.
  • FIG. 9 is a schematic diagram (1) of knowledge entity vectorization according to an embodiment of the present disclosure.
  • Fig. 10 is a schematic diagram (2) of knowledge entity vectorization according to an embodiment of the present disclosure.
  • Fig. 11 is a flow chart of alarm correlation rule extraction according to an embodiment of the present disclosure.
  • FIG. 12 is a flowchart of real-time data extraction according to an embodiment of the present disclosure.
  • FIG. 13 is a flowchart of feature engineering according to an embodiment of the present disclosure.
  • FIG. 14 is a flowchart of conflict detection according to an embodiment of the present disclosure.
  • Fig. 15 is a block diagram of an apparatus for constructing a fault knowledge graph according to another embodiment of the present disclosure.
  • Fig. 1 is a block diagram of the hardware structure of the mobile terminal according to the fault knowledge map construction method of the embodiment of the present disclosure.
  • the mobile terminal may include one or more (only shown in Fig. 1 a) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a memory for communication Functional transmission device 106 and input and output device 108 .
  • a processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA
  • a memory 104 for storing data
  • the above-mentioned mobile terminal may also include a memory for communication Functional transmission device 106 and input and output device 108 .
  • FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal.
  • the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
  • the memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the fault knowledge map construction method in the embodiment of the present disclosure, and the processor 102 runs the computer program stored in the memory 104, thereby Executing various functional applications and slicing processing of the service chain address pool is to realize the above-mentioned method.
  • the memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the transmission device 106 is used to receive or transmit data via a network.
  • the specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal.
  • the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF Radio Frequency
  • a fault knowledge map construction method running on the above-mentioned mobile terminal or network architecture is provided, which is applied to the terminal, and the terminal accesses the current master node of the source area through a dual connection (Dual Connection, referred to as DC) (Master Node, referred to as MN for short) cell and current secondary node (Secondary Node, referred to as SN for short) cell
  • DC Dual Connection
  • MN Master Node
  • SN Secondary Node
  • Fig. 2 is the flow chart of the fault knowledge map construction method according to an embodiment of the present disclosure, as shown in Fig. 2, this process At least including but not limited to the following steps:
  • Step S202 extracting entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
  • Step S204 extracting entities and relationships from the real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
  • Step S206 performing conflict detection according to the first candidate set and the second candidate set, to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
  • Step S208 integrating the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  • step S202 may specifically include:
  • the association relationship between different alarms is extracted through key fields to form the alarm relationship instance.
  • parent-child alarms and rule description content are extracted according to the first key field; according to the first key field
  • Two key fields carry out left and right participle to described rule description content, obtain left participle and right participle; Described left participle, described right participle and parent-child alarm are carried out category matching respectively, to form described alarm correlation instance;
  • the relationship between different alarms in the order dispatching conditions is extracted through key fields to form the work order dispatching instance.
  • the semi-structured data in the rule base is converted into structured data.
  • the above step S204 may specifically include: performing correlation mining on the alarm data in the real-time data to obtain a correlation alarm; extracting the alarm content according to the key field from the work order data in the real-time data and fault handling content, wherein, the fault handling content includes at least the fault handling situation, the preliminary judgment of the cause of the fault, and the fault handling opinion; word segmentation processing is performed on the alarm content and the fault handling content to obtain one or more cause alarms; Determine the alarm vectors of the one or more cause alarms according to the similarity; match the alarm vectors with the correlation alarms to obtain the alarm association relationship and the work order dispatch relationship.
  • the above step S208 may specifically include: performing the following operations on each alarm association relationship and each work order dispatching relationship in the second candidate set to obtain valid alarm association relations and work order dispatching knowledge
  • the target candidate set formed, wherein, the ongoing alarm association relationship and work order dispatch relationship is called the current alarm association relationship and the current work order dispatch relationship: judging whether the current alarm association relationship exists in the first A candidate set; if the judgment result is no, determine that the current alarm association relationship is valid through machine learning; if the judgment result is yes, when the current alarm association relationship and the alarm in the first candidate set When the relationship instance does not match and there is no conflict, determine whether the data frequency of the current alarm association relationship is less than a preset threshold, and if the determination result is no, determine that the current alarm association relationship is the effective Alarm association relationship; judging whether the current work order dispatch relationship exists in the work order dispatch instance of the first candidate set; if the judgment result is no, correlating the current work order dispatch relationship with Alerts are merged into the ticket dispatch knowledge.
  • the alarm association relationship in the target candidate set is summarized according to the correlation; the work order dispatch relationship in the target candidate set is summarized according to the dispatch order.
  • the above step S208 may specifically include: using a key to represent an alarm, and using ontology mapping and entity mapping to establish the effective alarm association relationship of the target candidate set and the work order assignment in the fault knowledge graph.
  • single knowledge to find the corresponding node when the corresponding node is found, content conflict detection is carried out to obtain the detection result; when the detection result is that there is no content conflict, the effective alarm association relationship is sent to The single knowledge is hung on the corresponding node; if the corresponding node is not in the fault knowledge graph, a new parent node is created in the fault knowledge graph, and the effective alarm association relationship with the working The single-dispatch knowledge is hung on the newly-created parent node.
  • Fig. 3 is a schematic diagram of fault knowledge map RDF according to an embodiment of the present disclosure.
  • RDF fault knowledge map
  • simple RDF triples cannot assist effective intelligent operation and maintenance
  • the RDF of the base station-generated-alarm and base station-activated-cell service part is only a brief description of the occurrence of the alarm, and there is no effective guidance on how to deal with the alarm after it occurs, but it is necessary to establish the RDF of the A alarm-cause-B alarm part , indicating the alarm association relationship, so as to effectively guide the operation and maintenance.
  • the RDF of the fault knowledge graph is represented by triples.
  • the knowledge in the fault knowledge map is expressed as RDF triples, which can be represented by SPO (subject-verb-object).
  • SPO subject-verb-object
  • the same computer room A alarms within 5 minutes
  • the trigger B alarm is generated, where A is the S of SPO, the trigger is P of SPO, and B is 0 of SPO.
  • the location is the same as the computer room and the time is 5 minutes.
  • the knowledge extraction of the fault knowledge map mainly includes:
  • Entity extraction also known as named entity recognition, extracts entities from the rule base, knowledge base, alarm data, and work order data, such as alarm ID (Identify Document, identity), title, network element in the computer room where the alarm occurred, etc. Occurrence level;
  • Relationship extraction such as alarm A causes alarm B to occur, that is, the relationship between alarm A and alarm B, and alarm B is an important alarm that needs to be dispatched, that is, the relationship between the alarm and the work order;
  • Event extraction extracts the alarm data and work order data at runtime, and can obtain events, such as the time and location of the alarm, such as the handler of the work order, the processing time, and the processing result suggestion, to form relevant operation and maintenance knowledge.
  • knowledge is extracted from the rule base, knowledge base, and fault descriptions of operators and equipment manufacturers in the operation and maintenance guarantee, as well as the real-time alarm information and work order information of operators and equipment manufacturers, and the extracted results are integrated into operation and maintenance faults.
  • Knowledge graph the rule base and the knowledge base are fault operation and maintenance knowledge, and most fault solutions of the fault description are also written into the rule base and the knowledge base. For the convenience of explanation, they are collectively referred to as the rule base in this disclosure.
  • Fig. 4 is a schematic diagram of knowledge extraction in a fault knowledge map according to an embodiment of the present disclosure. As shown in Fig. 4, it is divided into a design domain and an execution domain, wherein:
  • Step S401 in the design domain, perform related design of knowledge extraction, including rule base extraction design, semi-structured template design, real-time data extraction design, vectorization design, machine learning design and threshold design. Specifically include:
  • FIG. 5 is a schematic diagram of the alarm association rules according to this embodiment.
  • the alarm association rules shown in FIG. 5 include rule names, association methods, Rule description, associated location, time window, parent alarm and child alarm.
  • the extraction of rules whether it is entity extraction or relationship extraction, mainly adopts the method of mapping, that is, it mainly needs to define key operation and maintenance fields.
  • the key fields are correlation rule name, correlation method, specialty, and correlation location.
  • Time window parent alarm, child alarm.
  • the name of the association rule is the entire content;
  • the association method is three values, including primary and secondary associations, frequency associations, and threshold associations;
  • the professional value is wireless/transmission/power supply/core network, or cross-professional;
  • the association position that is, the same network element association, the same computer room, or the same link;
  • the time window indicates the time difference range of parent and child alarms;
  • the value of the parent alarm is the detailed description of each alarm, the alarm title, alarm ID and alarm level, and the child alarm same.
  • Semi-structured data is shown in Figure 3.
  • the rule description and dispatch rules can be seen from the figure that structured data is easy to extract, but semi-structured data definition templates can also be extracted.
  • rule extraction templates are mainly definitions, fields and values of structured data, and fields and templates of semi-structured data.
  • the rule description can see that the template can be designed as follows:
  • XX is caused by (optional) XX;
  • the relevant associated information can be easily extracted.
  • Fig. 6 is a schematic diagram of a work order dispatching rule according to an embodiment of the present disclosure. As shown in Fig. 6, it includes the name, whether to enable it, the rule type, and dispatching conditions. Among the dispatching conditions, you can design:
  • Alarm project status XX or XX
  • alarm object device type XX
  • alarm level XX
  • alarm ID XX
  • FIG. 7 is a schematic diagram (1) of real-time work order fault handling according to an embodiment of the present disclosure.
  • Service college entrance examination among which: [Dynamic Ring]: Verify that the network management has an AC input power failure alarm; [Wireless]: Verify that the network management has a base station out of service alarm; [Transmission]: No relevant alarms have been queried; The status of the NE is normal; [Preliminary Judgment of Fault Cause]: It can be monitored, and the status of the NE is normal; [The network level of the faulty NE]: ENODEB; ;[Troubleshooting advice]: Check the power outage in the area or check the dynamic ring power supply; [Troubleshooting time]: xx, xx, xx, xx: xx: xx; [Troubleshooter]: xxx; [Troubleshooting status]: The fault has been cleared at xx:
  • the content of the preliminary judgment of the cause of the fault and the opinions on troubleshooting are unstructured data, which requires machine learning to learn in detail.
  • the word segmentation model adopts the standard Jieba (stuttered word segmentation).
  • Jieba sinuttered word segmentation
  • the words are divided into three words “equipment power failure", “AC input power failure” and “caused”. The two words are directly matched with the alarm title.
  • FIG. 8 is a schematic diagram (2) of real-time work order fault handling according to an embodiment of the present disclosure, as shown in FIG. Through communication and analysis with R & D, there is no abnormality in the 5G cell. It is judged that the AAU version is a single-mode version.
  • LTE cell or RRU are not separately reflected in the fault knowledge map, they exist in the resource knowledge map, and the fault knowledge map is based on alarms such as "LTE cell decommissioning" and related associations, as well as corresponding These methods of work order dispatching rules exist.
  • Machine learning uses classification models, and you can choose algorithms such as logistic regression (sigmoid)/decision tree and support vector machine (SVM).
  • logistic regression sigmoid
  • SVM support vector machine
  • the learned labels can be learned as defined in the rule base.
  • Classification learning is defined according to the standard accuracy (Accuracy)/precision (Precision)/recall (Recall). Here, specific values are defined. For example, the classification can be obtained if the accuracy rate reaches more than 80%. model, if it is less than 80%, it needs to be relearned.
  • the frequency of these data should not be too low.
  • the real-time data of only one month only has a single-digit frequency, and the number of samples is too small to be meaningful.
  • Step S402 the rule extraction module extracts knowledge about the operation and maintenance rules.
  • Alarm 1 base station out of service alarm (secondary alarm, ID: 100-1111);
  • Alarm 2 AC input power failure alarm (three-level alarm, 500-0003);
  • Alarm 2 causes alarm 1 (same time window of the computer room);
  • the alarms in the dispatch conditions can be extracted and integrated into the work order instances in the fault knowledge map.
  • the entity is alarm 1 and alarm 2, and the relationship is that alarm 2 will trigger alarm 1;
  • the work order dispatch is dispatched with alarm 1 and the reason is alarm 2.
  • Step S403 the real-time extraction module extracts real-time alarm and work order data.
  • Extracting real-time data can help discover knowledge that is not in the rule base.
  • the following is an example of A alarm-cause-B alarm in Figure 3.
  • the existing rules are generally mature knowledge. For example, in a 4G equipment room in a traditional network, there are dynamic ring (dynamic environment) network elements, wireless network elements, and transmission network elements. If the dynamic ring network elements fail, such as municipal power outages Power failure or power alarm may cause the base station in the same computer room to be out of service, and then cause the 4G cell to be out of service (LTE cell out of service).
  • dynamic ring network elements such as municipal power outages Power failure or power alarm may cause the base station in the same computer room to be out of service, and then cause the 4G cell to be out of service (LTE cell out of service).
  • a 5G equipment room has dynamic ring (dynamic environment) network elements, wireless network elements and transmission network elements. If the dynamic ring network element fails, If the municipal power outage causes a power failure or a power alarm, it may cause the 5G base station in the same computer room to be out of service, and then the 5G cell (CU/DU cell out of service).
  • dynamic ring dynamic environment
  • the alarm correlation has certain conditions, such as the correlation location and time window, so it is necessary to obtain the possibility of correlation through data mining first. Note that frequent set mining is not used, because they do not occur frequently, and correlation is used. Sex mining, such as Pearson coefficient, just get the correlation.
  • the alarm of the work order dispatch including the location of the occurrence, this is structured data
  • the "preliminary judgment of fault cause" is similar to the field content, which may be unstructured data;
  • fault recovery situation in the fault handling situation is similar to the content of the field. This content can be hard-coded whether there are words such as “clear” or “recovery”, which can assist the label indicating whether the fault handling and cause judgment are correct.
  • Fig. 9 is a schematic diagram (1) of knowledge entity vectorization according to an embodiment of the present disclosure. As shown in Fig. 9, since there are not many words, the vectorization uses one-hot model encoding, that is, the position of the word in the vector is 1, and other positions are 0.
  • the extracted data is used as alarm correlation candidates and dispatch rule candidates.
  • Step S404 the conflict detection module performs conflict detection on the knowledge.
  • conflict detection needs to be performed as follows:
  • the real-time data extraction information is taken, and the machine learning algorithm model is used to judge and obtain the correlation.
  • steps S403 and S404 entities are extracted, relationships are also extracted, and alarm work order events are also extracted, so that there is more information for fault knowledge map fusion.
  • Step S405 the summary module summarizes according to the knowledge correlation.
  • the main alarm is the key, and the set of all related alarms is the value, which is aggregated and summarized through hash;
  • the collection of dispatch order rules is the value summary.
  • Step S406 merging into the fault knowledge map.
  • the data preparation in this embodiment refers to the rule base and real-time data, including operation and maintenance data such as work orders and alarms and the corresponding rule base.
  • word segmentation and vectorization are used, and similarity learning makes the extracted knowledge unique.
  • the classification module may be required to use machine learning to judge the newly extracted relationship.
  • the summary module summarizes the extracted knowledge in order to prepare for the fusion of fault knowledge graphs.
  • Fig. 10 is a schematic diagram (2) of knowledge entity vectorization according to an embodiment of the present disclosure.
  • entity alarms are vectorized with one-hot encoding, and all possible associated alarms are vectorized, using one-hot One-hot encoding, that is, in an n-dimensional vector, only itself is 1, and the others are 0.
  • One-hot encoding is sparse, there are not many types of alarms, so you don't need to think too much about memory consumption.
  • Fig. 11 is a flow chart of alarm correlation rule extraction according to an embodiment of the present disclosure, as shown in Fig. 11 , including:
  • Step S1101 preparing an alarm association rule base and a keyword dictionary
  • Step S1102 obtaining alarm association rules
  • Step S1103 acquiring the content of the keywords "association” and “professional”;
  • Step S1104 obtaining the content of keywords "associated position” and "time window”;
  • Step S1105 obtaining the content of keywords "parent alarm” and "child alarm”;
  • Step S1107 perform left and right word segmentation on the "rule description content” with the keyword “cause”;
  • Step S1108 use the left word to perform category matching with the parent-child alarm
  • Step S1109 use the right word to match parent and child alarms
  • Step S1110 forming an alarm related RDF
  • Step S1111 extracting this article ends and proceeds to the next rule.
  • the associated location and time window are extracted, which will not only be used for subsequent fault knowledge map fusion, but may also be used for real-time data mining verification during the extraction process.
  • parent-child alarm here is the alarm entity, but the relationship that requires the main parent-child relationship does not necessarily clarify the root cause, that is, the parent does not necessarily cause the child, nor does the child cause the parent.
  • the left and right words are then matched with the father and son alarms.
  • category matching means that the parent-child alarm entity may be a subcategory of the left and right words.
  • base station decommissioning includes 4G base station decommissioning and 5G base station decommissioning.
  • Power outages in the dynamic ring can include mains power outages, AC input power outage alarms, Low output voltage alarm, etc., so a large category of matching is required.
  • the root cause RDF can be formed, that is, the alarm on the left causes the alarm on the right.
  • instances and relationships can also be extracted, as shown in Figure 6, by extracting the keywords "name” and "dispatch condition”, and getting specific alarms, you can get the work order RDF, namely XX Alarm dispatch XX work order.
  • the entities and relationships extracted from the rule base are relatively accurate and can be used as labels for learning.
  • Fig. 12 is a flowchart of real-time data extraction according to an embodiment of the present disclosure, as shown in Fig. 12 , including:
  • Step S1201 preparing alarm and work order data
  • Step S1202 performing correlation mining on the alarm
  • Step S1203 extract the alarm-related content of the work order through key fields, including occurrence time, location, etc.;
  • Step S1204 extracting the content of the fault handling condition of the work order through the key field
  • Step S1205 extracting "preliminary judgment of fault cause” and "fault handling opinion”
  • Step S1207 calculating the similarity to obtain a corresponding alarm vector
  • Step S1209 obtaining the alarm association relationship and the work order dispatch relationship.
  • the alarm content in the work order data is obtained.
  • the work order dispatching principle it may be that the related alarms that have been found in the early alarm processing have been combined to send a single alarm, or a single alarm. In either case, it is necessary to analyze whether there is any error in the work order dispatching process. True correlative root cause alerts.
  • Fig. 13 is a flowchart of feature engineering according to an embodiment of the present disclosure, as shown in Fig. 13 , including:
  • Step S1301 preparing an alarm association set
  • Step S1302 obtaining the current pair of alarms
  • Step S1303 obtaining the topological relationship of the devices where they occur;
  • Step S1304 obtaining the business relationship of the equipment where they occur;
  • Step S1305 obtain their professional relationship, optional six dimensions of wireless/data/transmission/core/dynamic ring/cross-network;
  • Step S1306, obtain their alarm levels, according to four levels of 1/2/3/4;
  • Step S1307 obtaining the content of "fault recovery status" of "fault handling status" in their work orders;
  • Step S1308 word segmentation, whether there is “alarm clearing” in the similarity calculation, and get the dimension value 1/0 according to whether there is;
  • step S1309 all features are obtained through normalization processing.
  • the alarm level is a dimension, and the value is filled in according to the alarm level. Generally speaking, the higher the alarm level, the more significant it is in the alarm root cause tree model.
  • Topological relationship For example, a pair of alarms A and B occur in the BBU and the base station respectively. Then their topological relationship will have two dimensions. Topological relationship A is the parent of B, and topological relationship B is not the parent of A. Here, the first Fill in 0 for one dimension, and 1 for the second dimension. If there is no topology-subordinate relationship between the two, fill in 0 for both, and the feature of business relationship is handled in the same way.
  • wireless/data/transmission/core/dynamic ring/cross-network For example, if it is wireless, then fill in 1 for the wireless dimension and 0 for other dimensions.
  • the degree of certainty can be used.
  • Each work order has a corresponding "failure recovery situation” content, which is in the field "failure handling situation”, which partly represents whether the reason judgment is correct (as shown in Figure 7 and Figure 8), but the content is non- Structured data, after word segmentation, whether similarity training is similar to "alarm clearing” or “alarm recovery”, if similar, it is 1, otherwise it is 0. Note that this field cannot be used as the label of the root cause in the association, but it can be used as a feature for training.
  • the machine learning knowledge base obtains the SPO classification model, learns the SPO (subject and guest) classification model through machine learning, and obtains the weight values of all dimensions of the above-mentioned features.
  • SPO subject and guest classification model
  • the machine learning knowledge base obtains the weight values of all dimensions of the above-mentioned features.
  • logistic regression decision tree random forest you can use logistic regression decision tree random forest etc. This process is not complicated and is relatively mature. For example, logistic regression has a mature model algorithm library.
  • Fig. 14 is a flow chart of conflict detection according to an embodiment of the present disclosure, as shown in Fig. 14 , including:
  • Step S1401 preparing a candidate set
  • Step S1402 obtaining a current candidate RDF
  • Step S1403 judging whether the candidate RDF is in the rule base, if the judging result is yes, go to step S1404, and if the judging result is no, go to step S1408;
  • Step S1404 matching with the rule base
  • Step S1405 judging whether they match, if the judging result is yes, go to step S1402, and if the judging result is no, go to step S1406;
  • Step S1406 judging whether there is a conflict, if the judging result is yes, go to step S1407, and if the judging result is no, go to step S1402;
  • Step S1407 judging whether the candidate has a low frequency, if the judging result is yes, go to step S1402, and if the judging result is no, go to step S1408;
  • Step S1408 determine that the candidate is valid through machine learning
  • Step S1409 all candidates are summed up and ready to be summed up.
  • rule base may not necessarily be correct. Although this situation is rare, it must be considered, such as possible changes in the network and link relationships.
  • work order rules also need to be checked. Of course, it is relatively simpler, mainly to see if there is no dispatch rule library. If there are no relevant alarms merged into a work order, a new work order dispatch knowledge can be created.
  • FIG. 15 is a block diagram of a device for constructing a fault knowledge graph according to another embodiment of the present disclosure. As shown in FIG. 15 , it includes:
  • the first extraction module 152 is configured to extract entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
  • the second extraction module 154 is configured to extract entities and relationships from the real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
  • the conflict detection module 156 is configured to perform conflict detection according to the first candidate set and the second candidate set, and obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
  • the update module 158 is configured to integrate the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  • the first extraction module 152 includes:
  • the first extraction sub-module is configured to extract the association relationship between different alarms through key fields for the alarm association rules in the rule base to form the alarm relationship instance;
  • the second extraction sub-module is configured to extract the relationship between different alarms in the order dispatching conditions through the key fields of the work order dispatching rules in the rule base to form the work order dispatching instance.
  • the first extraction sub-module is further set to
  • the left and right participle is performed on the description content of the rule to obtain the left participle and the right participle;
  • the left participle and the right participle are respectively matched with parent-child alarms to form the alarm association instance.
  • the device also includes:
  • a conversion module configured to convert the semi-structured data in the rule base into structured data.
  • the second extraction module 154 is further configured to
  • the work order data in the real-time data extract the alarm content and fault handling content according to the key fields, wherein the fault handling content includes at least the fault handling situation, the preliminary judgment of the fault cause, and the fault handling opinion;
  • the alarm vector is matched with the correlation alarm to obtain the alarm correlation and the work order dispatching relationship.
  • the conflict detection module 156 is further configured to
  • each alarm association relationship and each work order dispatch relationship in the second candidate set perform the following operations to obtain the target candidate set composed of effective alarm association relationships and work order dispatch knowledge, wherein, for the The alarm association relationship and work order dispatching relationship is called the current alarm association relationship and the current work order dispatching relationship:
  • the device also includes:
  • the first summary module is configured to summarize the alarm correlations in the target candidate set according to the correlation
  • the second summarizing module is configured to sum up the work order dispatching relationships in the target candidate set according to dispatching orders.
  • the update module 158 is further configured to
  • Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to execute the steps in any one of the above method embodiments when running.
  • the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
  • ROM read-only memory
  • RAM random access memory
  • mobile hard disk magnetic disk or optical disk and other media that can store computer programs.
  • Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
  • the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
  • each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Provided are a fault knowledge graph construction method and apparatus. The method comprises: performing entity and relationship extraction on a rule base, so as to obtain a first candidate set that comprises an alarm relationship instance and a work order distribution instance (S202); performing entity and relationship extraction on real-time data, so as to obtain a second candidate set of an alarm association relationship and a work order distribution relationship (S204); performing conflict detection according to the first candidate set and the second candidate set, so as to obtain a target candidate set that comprises a valid alarm association relationship and work order distribution knowledge (S206); and fusing the target candidate set into a fault knowledge graph that is constructed according to the rule base, so as to obtain an updated fault knowledge graph (S208). The problem in the related art of it being impossible to quickly adapt to new services and new scenarios when relying on the mapping conversion between a rule base and a knowledge base can be solved; and by means of combining the entity and relationship extraction of real-time data and the rule base, a fault knowledge graph is updated, such that the fault knowledge graph can quickly adapt to new services and new scenarios.

Description

一种故障知识图谱构建方法及装置A method and device for constructing a fault knowledge map
相关申请的交叉引用Cross References to Related Applications
本公开基于2021年09月24日提交的发明名称为“一种故障知识图谱构建方法及装置”的中国专利申请CN202111123124.5,并且要求该专利申请的优先权,通过引用将其所公开的内容全部并入本公开。This disclosure is based on the Chinese patent application CN202111123124.5 filed on September 24, 2021 with the title of "a method and device for constructing a fault knowledge map", and claims the priority of this patent application, and the disclosed content is incorporated by reference All are incorporated into this disclosure.
技术领域technical field
本公开实施例涉及通信领域,具体而言,涉及一种故障知识图谱构建方法及装置。Embodiments of the present disclosure relate to the communication field, and in particular, to a method and device for constructing a fault knowledge graph.
背景技术Background technique
在搭建故障知识图谱过程中,知识抽取是一个必备的过程,它从不同数据源、不同结构的数据中进行知识提取,形成故障知识图谱理解的数据结构,如资源描述框架(Resource Description Framework,简称为RDF)三元组,最终融合进入故障知识图谱中。In the process of building a fault knowledge map, knowledge extraction is an essential process. It extracts knowledge from different data sources and data of different structures to form a data structure for fault knowledge map understanding, such as the Resource Description Framework (Resource Description Framework, Referred to as RDF) triples, and finally merged into the fault knowledge map.
对运维保障来说仅仅从规则库和知识库抽取知识是不够的,特别是5G场景,很多新业务新场景的规则还没有形成,需要从实时的运维数据,如告警、日志和工单等运维数据进行分析,以人工专家经验,形成规则,注入电信运维故障知识图谱。依靠规则库和知识库的映射转换,无法快速适应新业务新场景,依靠人工进行知识抽取不仅效率相对比较低下,质量也无法得到保障。For O&M support, it is not enough to extract knowledge from the rule base and knowledge base. Especially in 5G scenarios, the rules for many new services and scenarios have not yet been formed, and real-time O&M data, such as alarms, logs, and work orders, must be obtained. Analyze the operation and maintenance data, use the experience of artificial experts to form rules, and inject them into the knowledge map of telecom operation and maintenance faults. Relying on the mapping and transformation of the rule base and knowledge base cannot quickly adapt to new business and new scenarios. Relying on manual knowledge extraction is not only relatively inefficient, but also cannot guarantee the quality.
针对相关技术中依靠规则库和知识库的映射转换,无法快速适应新业务新场景的问题,尚未提出解决方案。Aiming at the problem that the related technology relies on the mapping transformation of the rule base and the knowledge base and cannot quickly adapt to new business and new scenarios, no solution has been proposed yet.
发明内容Contents of the invention
本公开实施例提供了一种故障知识图谱构建方法及装置,以至少解决相关技术中依靠规则库和知识库的映射转换,无法快速适应新业务新场景,依靠人工进行知识抽取不仅效率相对比较低下,质量也无法得到保障的问题。The embodiment of the present disclosure provides a method and device for constructing a fault knowledge map, to at least solve the problem of relying on the mapping conversion of the rule base and the knowledge base in related technologies, which cannot quickly adapt to new business and new scenarios, and relying on manual knowledge extraction is not only relatively inefficient , the quality cannot be guaranteed.
根据本公开的一个实施例,提供了一种故障知识图谱构建方法,包括:According to an embodiment of the present disclosure, a fault knowledge map construction method is provided, including:
对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;Extract entities and relationships from the rule base to obtain the first candidate set including alarm relationship instances and work order dispatch instances;
对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;Extract entities and relationships from real-time data to obtain the second candidate set of alarm association relationship and work order dispatch relationship;
根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;Performing conflict detection according to the first candidate set and the second candidate set to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。The target candidate set is integrated into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
根据本公开的另一个实施例,还提供了一种故障知识图谱构建装置,包括:According to another embodiment of the present disclosure, a device for constructing a fault knowledge map is also provided, including:
第一抽取模块,设置为对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;The first extraction module is configured to extract entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
第二抽取模块,设置为对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;The second extraction module is configured to extract entities and relationships from real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
冲突检测模块,设置为根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;The conflict detection module is configured to perform conflict detection according to the first candidate set and the second candidate set to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
更新模块,设置为将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。The update module is configured to integrate the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
根据本公开的又一个实施例,还提供了一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided a computer-readable storage medium, where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
根据本公开的又一个实施例,还提供了一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided an electronic device, including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
本公开实施例,对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱,可以解决相关技术中依靠规则库和知识库的映射转换,无法快速适应新业务新场景的问题,通过实时数据与规则库的实体与关系抽取的结合,更新故障知识图谱,使得故障知识图谱能快速适应新业务新场景。In the embodiment of the present disclosure, the entity and relationship extraction is performed on the rule base to obtain the first candidate set including the alarm relationship instance and the work order dispatch instance; the entity and relationship extraction is performed on the real-time data to obtain the alarm association relationship and the work order dispatch relationship the second candidate set; perform conflict detection according to the first candidate set and the second candidate set, and obtain a target candidate set including effective alarm correlation and work order dispatch knowledge; integrate the target candidate set into the The fault knowledge graph constructed by the rule base can obtain an updated fault knowledge graph, which can solve the problem in related technologies that relies on the mapping conversion of the rule base and the knowledge base, and cannot quickly adapt to new business and new scenarios. Through real-time data and rule base The combination of entity and relationship extraction updates the fault knowledge graph, enabling the fault knowledge graph to quickly adapt to new business and new scenarios.
附图说明Description of drawings
图1是本公开实施例的故障知识图谱构建方法的移动终端的硬件结构框图;FIG. 1 is a block diagram of a hardware structure of a mobile terminal in a method for constructing a fault knowledge map according to an embodiment of the present disclosure;
图2是根据本公开一实施例的故障知识图谱构建方法的流程图;FIG. 2 is a flow chart of a fault knowledge map construction method according to an embodiment of the present disclosure;
图3是根据本公开一实施例的故障知识图谱RDF的示意图;FIG. 3 is a schematic diagram of a fault knowledge graph RDF according to an embodiment of the present disclosure;
图4是根据本公开一实施例的故障知识图谱中知识抽取的示意图;Fig. 4 is a schematic diagram of knowledge extraction in a fault knowledge map according to an embodiment of the present disclosure;
图5是根据本公开一实施例的告警关联规则的示意图;Fig. 5 is a schematic diagram of an alarm correlation rule according to an embodiment of the present disclosure;
图6是根据本公开一实施例的工单派单规则示意图;Fig. 6 is a schematic diagram of work order dispatching rules according to an embodiment of the present disclosure;
图7是根据本公开一实施例的实时工单故障处理情况的示意图(一);7 is a schematic diagram (1) of real-time work order fault handling according to an embodiment of the present disclosure;
图8是根据本公开一实施例的实时工单故障处理情况的示意图(二);FIG. 8 is a schematic diagram (2) of real-time work order fault handling according to an embodiment of the present disclosure;
图9是根据本公开一实施例的知识实体向量化的示意图(一);FIG. 9 is a schematic diagram (1) of knowledge entity vectorization according to an embodiment of the present disclosure;
图10是根据本公开一实施例的知识实体向量化的示意图(二);Fig. 10 is a schematic diagram (2) of knowledge entity vectorization according to an embodiment of the present disclosure;
图11是根据本公开一实施例的告警关联关系规则抽取的流程图;Fig. 11 is a flow chart of alarm correlation rule extraction according to an embodiment of the present disclosure;
图12是根据本公开一实施例的实时数据抽取的流程图;FIG. 12 is a flowchart of real-time data extraction according to an embodiment of the present disclosure;
图13是根据本公开一实施例的特征工程的流程图;FIG. 13 is a flowchart of feature engineering according to an embodiment of the present disclosure;
图14是根据本公开一实施例的冲突检测的流程图;FIG. 14 is a flowchart of conflict detection according to an embodiment of the present disclosure;
图15是根据本公开另一实施例的故障知识图谱构建装置的框图。Fig. 15 is a block diagram of an apparatus for constructing a fault knowledge graph according to another embodiment of the present disclosure.
具体实施方式Detailed ways
下文中将参考附图并结合实施例来详细说明本公开的实施例。Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings and in combination with the embodiments.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence.
本公开实施例中所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在移动终端上为例,图1是本公开实施例的故障知识图谱构建方法的移动终端的硬件结构框图,如图1所示,移动终端可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)和用于存储数据的存储器104,其中,上述移动终端还可以包括用于通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述移动终端的结构造成限定。例如,移动终端还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method embodiments provided in the embodiments of the present disclosure may be executed in mobile terminals, computer terminals or similar computing devices. Taking running on a mobile terminal as an example, Fig. 1 is a block diagram of the hardware structure of the mobile terminal according to the fault knowledge map construction method of the embodiment of the present disclosure. As shown in Fig. 1, the mobile terminal may include one or more (only shown in Fig. 1 a) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a memory for communication Functional transmission device 106 and input and output device 108 . Those skilled in the art can understand that the structure shown in FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal. For example, the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
存储器104可用于存储计算机程序,例如,应用软件的软件程序以及模块,如本公开实施例中的故障知识图谱构建方法对应的计算机程序,处理器102通过运行存储在存储器104内的计算机程序,从而执行各种功能应用以及业务链地址池切片处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至移动终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the fault knowledge map construction method in the embodiment of the present disclosure, and the processor 102 runs the computer program stored in the memory 104, thereby Executing various functional applications and slicing processing of the service chain address pool is to realize the above-mentioned method. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括移动终端的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其用于通过无线方式与互联网进行通讯。The transmission device 106 is used to receive or transmit data via a network. The specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal. In one example, the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet. In one example, the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
在本实施例中提供了一种运行于上述移动终端或网络架构的故障知识图谱构建方法,应用于终端,所述终端通过双连接(Dual Connection,简称为DC)接入源区域的当前主节点(Master Node,简称为MN)小区与当前辅节点(Secondary Node,简称为SN)小区,图2是根据本公开一实施例的故障知识图谱构建方法的流程图,如图2所示,该流程至少包括但不限于如下步骤:In this embodiment, a fault knowledge map construction method running on the above-mentioned mobile terminal or network architecture is provided, which is applied to the terminal, and the terminal accesses the current master node of the source area through a dual connection (Dual Connection, referred to as DC) (Master Node, referred to as MN for short) cell and current secondary node (Secondary Node, referred to as SN for short) cell, Fig. 2 is the flow chart of the fault knowledge map construction method according to an embodiment of the present disclosure, as shown in Fig. 2, this process At least including but not limited to the following steps:
步骤S202,对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;Step S202, extracting entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
步骤S204,对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;Step S204, extracting entities and relationships from the real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
步骤S206,根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;Step S206, performing conflict detection according to the first candidate set and the second candidate set, to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
步骤S208,将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。Step S208, integrating the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
通过上述步骤S202至S208,可以解决相关技术中依靠规则库和知识库的映射转换,无法快速适应新业务新场景的问题,通过实时数据与规则库的实体与关系抽取的结合,更新故障知识图谱,使得故障知识图谱能快速适应新业务新场景。Through the above steps S202 to S208, it is possible to solve the problem of relying on the mapping conversion of the rule base and the knowledge base in related technologies, which cannot quickly adapt to new business and new scenarios, and update the fault knowledge map through the combination of real-time data and entity and relationship extraction of the rule base , so that the fault knowledge map can quickly adapt to new business and new scenarios.
本公开实施例中,上述步骤S202具体可以包括:In the embodiment of the present disclosure, the above step S202 may specifically include:
对所述规则库中的告警关联规则,通过关键字段抽取不同告警之间的关联关系,组成所述告警关系实例,具体的,根据第一关键字段抽取父子告警与规则描述内容;根据第二关键 字段对所述规则描述内容进行左右分词,得到左分词与右分词;分别将所述左分词、所述右分词与父子告警进行大类匹配,以形成所述告警关联实例;For the alarm association rules in the rule base, the association relationship between different alarms is extracted through key fields to form the alarm relationship instance. Specifically, parent-child alarms and rule description content are extracted according to the first key field; according to the first key field Two key fields carry out left and right participle to described rule description content, obtain left participle and right participle; Described left participle, described right participle and parent-child alarm are carried out category matching respectively, to form described alarm correlation instance;
对所述规则库中的工单派单规则,通过关键字段抽取派单条件中的不同告警之间的关联关系,组成所述工单派单实例。For the work order dispatching rules in the rule base, the relationship between different alarms in the order dispatching conditions is extracted through key fields to form the work order dispatching instance.
在一可选的实施例中,在上述步骤S202之前,将所述规则库中的半结构化数据转换为结构化数据。In an optional embodiment, before the above step S202, the semi-structured data in the rule base is converted into structured data.
本公开实施例中,上述步骤S204具体可以包括:对所述实时数据中的告警数据进行相关性挖掘,得到相关性告警;对所述实时数据中的工单数据,根据关键字段抽取告警内容与故障处理内容,其中,所述故障处理内容至少包括故障处理情况、故障原因初步判断、故障处理意见;对所述告警内容与所述故障处理内容进行分词处理,得到一个或多个原因告警;根据相似度确定所述一个或多个原因告警的告警向量;将所述告警向量与所述相关性告警进行匹配,得到所述告警关联关系与所述工单派单关系。In the embodiment of the present disclosure, the above step S204 may specifically include: performing correlation mining on the alarm data in the real-time data to obtain a correlation alarm; extracting the alarm content according to the key field from the work order data in the real-time data and fault handling content, wherein, the fault handling content includes at least the fault handling situation, the preliminary judgment of the cause of the fault, and the fault handling opinion; word segmentation processing is performed on the alarm content and the fault handling content to obtain one or more cause alarms; Determine the alarm vectors of the one or more cause alarms according to the similarity; match the alarm vectors with the correlation alarms to obtain the alarm association relationship and the work order dispatch relationship.
本公开实施例中,上述步骤S208具体可以包括:对所述第二候选集中的每个告警关联关系与每个工单派单关系,执行以下操作,得到有效告警关联关系与工单派单知识组成的所述目标候选集,其中,对于正在执行的告警关联关系与工单派单关系称为当前告警关联关系与当前工单派单关系:判断所述当前告警关联关系是否存在于所述第一候选集中;在判断结果为否的情况下,通过机器学习确定所述当前告警关联关系有效;在判断结果为是的情况下,当所述当前告警关联关系与所述第一候选集中的告警关系实例不匹配且不存在冲突时,确定所述当前告警关联关系的数据频度是否小于预设阈值,在确定结果为否的情况下,通过机器学习确定所述当前告警关联关系为所述有效告警关联关系;判断所述当前工单派单关系是否存在于所述第一候选集的工单派单实例中;在判断结果为否的情况下,将所述当前工单派单关系与相关告警合并成所述工单派单知识。In the embodiment of the present disclosure, the above step S208 may specifically include: performing the following operations on each alarm association relationship and each work order dispatching relationship in the second candidate set to obtain valid alarm association relations and work order dispatching knowledge The target candidate set formed, wherein, the ongoing alarm association relationship and work order dispatch relationship is called the current alarm association relationship and the current work order dispatch relationship: judging whether the current alarm association relationship exists in the first A candidate set; if the judgment result is no, determine that the current alarm association relationship is valid through machine learning; if the judgment result is yes, when the current alarm association relationship and the alarm in the first candidate set When the relationship instance does not match and there is no conflict, determine whether the data frequency of the current alarm association relationship is less than a preset threshold, and if the determination result is no, determine that the current alarm association relationship is the effective Alarm association relationship; judging whether the current work order dispatch relationship exists in the work order dispatch instance of the first candidate set; if the judgment result is no, correlating the current work order dispatch relationship with Alerts are merged into the ticket dispatch knowledge.
在另一可选的实施例中,在上述步骤S208之前,对所述目标候选集中的告警关联关系按照相关性进行汇总;对所述目标候选集中的工单派单关系按照派单进行汇总。In another optional embodiment, before the above step S208, the alarm association relationship in the target candidate set is summarized according to the correlation; the work order dispatch relationship in the target candidate set is summarized according to the dispatch order.
本公开实施例中,上述步骤S208具体可以包括:使用key代表告警,通过本体映射和实体映射在所述故障知识图谱中为所述目标候选集的所述有效告警关联关系与所述工单派单知识寻找对应的节点;当找到对应的所述节点时,进行内容冲突检测,得到检测结果;当所述检测结果为不存在内容冲突时,将所述有效告警关联关系与所述工单派单知识挂在对应的所述节点上;如果对应的所述节点没有在所述故障知识图谱中,则在所述故障知识图谱中新建一个父节点,将所述有效告警关联关系与所述工单派单知识挂在新建的所述父节点上。In the embodiment of the present disclosure, the above step S208 may specifically include: using a key to represent an alarm, and using ontology mapping and entity mapping to establish the effective alarm association relationship of the target candidate set and the work order assignment in the fault knowledge graph. single knowledge to find the corresponding node; when the corresponding node is found, content conflict detection is carried out to obtain the detection result; when the detection result is that there is no content conflict, the effective alarm association relationship is sent to The single knowledge is hung on the corresponding node; if the corresponding node is not in the fault knowledge graph, a new parent node is created in the fault knowledge graph, and the effective alarm association relationship with the working The single-dispatch knowledge is hung on the newly-created parent node.
图3是根据本公开一实施例的故障知识图谱RDF的示意图,如图3所示,对于一个电信行业运维等故障知识图谱来说,简单的RDF三元组并不能协助有效的智能运维,基站-产生-告警与基站-开通-小区业务部分的RDF,只是简单描述的告警发生,并没有有效指导告警发生后应该如何有效的处理,而需要建立A告警-引起-B告警部分的RDF,指明告警关联关系,从而能够有效指导运维。Fig. 3 is a schematic diagram of fault knowledge map RDF according to an embodiment of the present disclosure. As shown in Fig. 3, for a fault knowledge map such as operation and maintenance in the telecom industry, simple RDF triples cannot assist effective intelligent operation and maintenance , the RDF of the base station-generated-alarm and base station-activated-cell service part is only a brief description of the occurrence of the alarm, and there is no effective guidance on how to deal with the alarm after it occurs, but it is necessary to establish the RDF of the A alarm-cause-B alarm part , indicating the alarm association relationship, so as to effectively guide the operation and maintenance.
本实施例中,故障知识图谱的RDF用三元组来表示。故障知识图谱中的知识表示为RDF三元组可以用SPO(主谓宾)来表示,以故障知识图谱的告警间根因关联关系为例,如一条关联根因规则,5分钟同机房A告警产生引发B告警,其中A即SPO的S,引发即SPO的P,B即SPO的0,位置同机房和时间5分钟即是属性,而某个时间段它们发生了,则是实例。In this embodiment, the RDF of the fault knowledge graph is represented by triples. The knowledge in the fault knowledge map is expressed as RDF triples, which can be represented by SPO (subject-verb-object). Taking the root cause correlation relationship between alarms in the fault knowledge map as an example, such as a correlation root cause rule, the same computer room A alarms within 5 minutes The trigger B alarm is generated, where A is the S of SPO, the trigger is P of SPO, and B is 0 of SPO. The location is the same as the computer room and the time is 5 minutes.
因此故障知识图谱的知识抽取主要包括:Therefore, the knowledge extraction of the fault knowledge map mainly includes:
实体抽取,它又称为命名实体识别,从规则库和知识库以及告警数据、工单数据中抽取实体,如告警ID(Identify Document,身份标识)、标题,告警发生的机房网元等,告警发生级别;Entity extraction, also known as named entity recognition, extracts entities from the rule base, knowledge base, alarm data, and work order data, such as alarm ID (Identify Document, identity), title, network element in the computer room where the alarm occurred, etc. Occurrence level;
关系抽取,如告警A导致告警B发生,即告警A和告警B的关联关系,告警B为重要告警需要派发工单,即告警和工单的关系;Relationship extraction, such as alarm A causes alarm B to occur, that is, the relationship between alarm A and alarm B, and alarm B is an important alarm that needs to be dispatched, that is, the relationship between the alarm and the work order;
事件抽取,对运行时的告警数据和工单数据进行抽取,能够得到事件,如告警的发生时间、地点,如工单的处理人、处理时长和处理结果建议等属性,形成相关运维知识。Event extraction extracts the alarm data and work order data at runtime, and can obtain events, such as the time and location of the alarm, such as the handler of the work order, the processing time, and the processing result suggestion, to form relevant operation and maintenance knowledge.
本实施例中,对运营商和设备商在运维保障的规则库、知识库和故障说明书,对运营商和设备商的实时告警信息和工单信息,进行知识抽取,抽取结果融入运维故障知识图谱。其中,规则库、知识库都是故障运维知识,故障说明书大多数故障解决方案也写入了规则库和知识库,为了阐述方便,本公开中统称为规则库。In this embodiment, knowledge is extracted from the rule base, knowledge base, and fault descriptions of operators and equipment manufacturers in the operation and maintenance guarantee, as well as the real-time alarm information and work order information of operators and equipment manufacturers, and the extracted results are integrated into operation and maintenance faults. Knowledge graph. Among them, the rule base and the knowledge base are fault operation and maintenance knowledge, and most fault solutions of the fault description are also written into the rule base and the knowledge base. For the convenience of explanation, they are collectively referred to as the rule base in this disclosure.
图4是根据本公开一实施例的故障知识图谱中知识抽取的示意图,如图4所示,分为设计域和执行域,其中:Fig. 4 is a schematic diagram of knowledge extraction in a fault knowledge map according to an embodiment of the present disclosure. As shown in Fig. 4, it is divided into a design domain and an execution domain, wherein:
步骤S401,设计域中,进行知识抽取的相关设计,包括规则库抽取设计,半结构化模板设计,实时数据抽取设计,向量化设计,机器学习设计以及阈值设计。具体包括:Step S401 , in the design domain, perform related design of knowledge extraction, including rule base extraction design, semi-structured template design, real-time data extraction design, vectorization design, machine learning design and threshold design. Specifically include:
1)规则库抽取设计;1) Rule base extraction design;
一般来说规则库的数据都是结构化数据和部分半结构化数据,图5是根据本实施例的告警关联规则的示意图,如图5所示的告警关联规则,包括规则名称、关联方式、规则描述、关联位置、时间窗口、父告警以及子告警。Generally speaking, the data in the rule base is structured data and partially semi-structured data. FIG. 5 is a schematic diagram of the alarm association rules according to this embodiment. The alarm association rules shown in FIG. 5 include rule names, association methods, Rule description, associated location, time window, parent alarm and child alarm.
对规则的抽取,无论是实体抽取还是关系抽取,主要是采用映射的方法,也即主要需要定义关键运维字段。The extraction of rules, whether it is entity extraction or relationship extraction, mainly adopts the method of mapping, that is, it mainly needs to define key operation and maintenance fields.
下面以告警关联规则举例进行说明。The following uses an example of alarm correlation rules for illustration.
对告警关联规则,关键字段为关联规则名称,关联方式,专业,关联位置。时间窗口,父告警,子告警。对字段值,关联规则名称为整个内容;关联方式为三个值,包括主次关联,频次关联,阈值关联;专业的值即为无线/传输/电源/核心网,或者是跨专业;关联位置,即是相同网元关联,还是相同机房,还是相同链路;时间窗口表示父子告警发生的时间差范围;父告警的值即为各告警的详细描述,告警标题、告警ID和告警级别,子告警同样。For alarm correlation rules, the key fields are correlation rule name, correlation method, specialty, and correlation location. Time window, parent alarm, child alarm. For field values, the name of the association rule is the entire content; the association method is three values, including primary and secondary associations, frequency associations, and threshold associations; the professional value is wireless/transmission/power supply/core network, or cross-professional; the association position , that is, the same network element association, the same computer room, or the same link; the time window indicates the time difference range of parent and child alarms; the value of the parent alarm is the detailed description of each alarm, the alarm title, alarm ID and alarm level, and the child alarm same.
同样,对派单规则,可以同样设计相应的关键字段和值的提取方式。这些关键字段会作为词典保存。Similarly, for dispatch order rules, the corresponding key fields and value extraction methods can also be designed. These key fields are saved as dictionaries.
本实施例中所有关键字段都是可以设计的,所以不存在这种情况,如字段为“关联方式”,而用“关联方法”等其它字面就绕开本专利。后面的关键字段都在这个范围内。All key fields in this embodiment can be designed, so there is no such situation, such as the field is "association method", and other literals such as "association method" just bypass this patent. The following key fields are all within this range.
2)半结构化模板设计;2) Semi-structured template design;
半结构化数据如图3所示的规则描述和派单规则从图上能够看到结构化数据很容易抽取,但半结构化数据定义模板也能够抽取。Semi-structured data is shown in Figure 3. The rule description and dispatch rules can be seen from the figure that structured data is easy to extract, but semi-structured data definition templates can also be extracted.
所以,规则抽取模板主要是定义,结构化数据的字段和值,和半结构化数据的字段和模板。Therefore, rule extraction templates are mainly definitions, fields and values of structured data, and fields and templates of semi-structured data.
如图5所示,规则描述能看到模板可以这样设计:As shown in Figure 5, the rule description can see that the template can be designed as follows:
由(可选)XX引起XX;XX is caused by (optional) XX;
则可以轻易提取到相关关联信息。The relevant associated information can be easily extracted.
图6是根据本公开一实施例的工单派单规则示意图,如图6所示,包括名称、是否启用,规则类型以及派单条件,派单条件中,可以设计:Fig. 6 is a schematic diagram of a work order dispatching rule according to an embodiment of the present disclosure. As shown in Fig. 6, it includes the name, whether to enable it, the rule type, and dispatching conditions. Among the dispatching conditions, you can design:
告警工程状态=XX or XX,告警对象设备类型=XX,告警级别=XX,告警ID=XX;Alarm project status = XX or XX, alarm object device type = XX, alarm level = XX, alarm ID = XX;
其中,“由”、“引起”、“告警工程状态”、“告警对象设备类型”、“告警级别”和“告警ID”为模板关键字段,这样设计后,已经把半结构化数据转换成结构化数据了。Among them, "caused by", "caused by", "alarm project status", "alarm object device type", "alarm level" and "alarm ID" are the key fields of the template. After this design, the semi-structured data has been converted into structured data.
3)实时数据抽取设计;3) Real-time data extraction design;
对于实时运行数据,工单和告警,图7是根据本公开一实施例的实时工单故障处理情况的示意图(一),如图7所示,故障处理情况为:经网管查询,存在基站退服高考,其中:【动环】:核实网管存在交流输入停电告警;【无线】:核实网管存在基站退服告警;【传输】:未查询到相关告警;【故障网元状态】:可监控,网元状态正常;【故障原因初步判断】:可监控,网元状态正常;【故障网元所处网络层级】:ENODEB;【故障原因初步判断】:初步判断为设备电源故障或交流输入停电导致;【故障处理意见】:查看区域停电情况或查看动环电源情况;【故障处理时间】:xxxx年xx月xx日xx:xx:xx;【故障处理人】:xxx;【故障恢复情况】:故障已清除,清除时间xxxx年xx月xx日xx:xx:xx。For real-time operation data, work orders and alarms, FIG. 7 is a schematic diagram (1) of real-time work order fault handling according to an embodiment of the present disclosure. As shown in FIG. Service college entrance examination, among which: [Dynamic Ring]: Verify that the network management has an AC input power failure alarm; [Wireless]: Verify that the network management has a base station out of service alarm; [Transmission]: No relevant alarms have been queried; The status of the NE is normal; [Preliminary Judgment of Fault Cause]: It can be monitored, and the status of the NE is normal; [The network level of the faulty NE]: ENODEB; ;[Troubleshooting advice]: Check the power outage in the area or check the dynamic ring power supply; [Troubleshooting time]: xx, xx, xx, xx: xx: xx; [Troubleshooter]: xxx; [Troubleshooting status]: The fault has been cleared at xx:xx:xx on xx month xx xxxx.
定义需要抽取的字段,如“故障原因初步判断”,即运维保障字段,包括工单主题、告警标题、网元名称、故障设备型号、故障地市、告警级别、告警ID、告警发生时间、告警派单时间、故障处理情况、故障原因、故障处理时长,故障处理人等字段,注意,这些字段在不同的运营商可能命名不同但意思相同,所以需要设计定义成字典模式,并在抽取时动态解析这些字段对应的内容。Define the fields that need to be extracted, such as "preliminary judgment of fault cause", that is, the operation and maintenance guarantee field, including work order subject, alarm title, network element name, faulty device model, faulty city, alarm level, alarm ID, alarm occurrence time, Fields such as alarm dispatch time, fault handling situation, fault cause, fault processing time, fault handler, etc., note that these fields may have different names but the same meaning in different operators, so it needs to be designed and defined as a dictionary mode, and when extracting Dynamically parse the content corresponding to these fields.
故障原因初步判断和故障处理意见的内容就是非结构化数据,需要机器学习来具体学习。The content of the preliminary judgment of the cause of the fault and the opinions on troubleshooting are unstructured data, which requires machine learning to learn in detail.
4)向量化设计;4) Vectorized design;
在抽取过程,对内容中非结构化数据,如故障处理情况的描述的解析,需要用到分词和向量化表示进行相似度学习,并对可能存在的关联父子关系(根因关系)进行机器学习,那么需要设计模型算法和评价指标,有监督学习,则需要定义标签。In the extraction process, the analysis of unstructured data in the content, such as the description of fault handling, requires the use of word segmentation and vectorized representation for similarity learning, and machine learning for possible parent-child relationships (root cause relationships) , then you need to design model algorithms and evaluation indicators, and for supervised learning, you need to define labels.
对非结构描述信息进行分词,分词模型采用标准的Jieba(结巴分词)。对如图7中字段“故障原因初步判断”的内容“初步判断为设备电源故障或交流输入停电导致”,分词得到“设备电源故障”、“交流输入停电”和“导致”三个词,前两个词和告警标题直接匹配,图8是根据本公开一实施例的实时工单故障处理情况的示意图(二),如图8所示,但字段“故障原因初步判断”的内容--“通过和研发交流分析,5G小区未出现异常,判断AAU版本为单模版本,应该是AAU版本异常导致RRU异常,RRU异常导致4G小区退服”--这些字样,一些分词如“导致”、“AAU版本异常”和“RRU异常”等没有问题,但其它如“5G小区异常”,“4G小区退服”,和标准的拼词,“CU/DU小区异常”,“LTE小区退服”就不一样,甚至因人而异,一些人会写成“退服”,另一些人会写成“退出服务”,因此需要在分词后进行向量化然后进行相似度学习,这样它们能够以向量化表示其唯一性,相似度学习的算法可以选择标准的NLP(自然语言处理)的TFIDF或LSI模型进行相似度学习。To segment the non-structural description information, the word segmentation model adopts the standard Jieba (stuttered word segmentation). For the content of the field "Preliminary Judgment of Fault Cause" in Figure 7, "preliminary judgment is caused by equipment power failure or AC input power failure", the words are divided into three words "equipment power failure", "AC input power failure" and "caused". The two words are directly matched with the alarm title. FIG. 8 is a schematic diagram (2) of real-time work order fault handling according to an embodiment of the present disclosure, as shown in FIG. Through communication and analysis with R & D, there is no abnormality in the 5G cell. It is judged that the AAU version is a single-mode version. There is no problem with "abnormal AAU version" and "abnormal RRU", but others such as "abnormal 5G cell", "out of service in 4G cell", and standard spelling, "abnormal in CU/DU cell" and "out of service in LTE cell" are Not the same, even varies from person to person, some people will write "withdrawal", others will write "exit service", so it needs to be vectorized after word segmentation and then similarity learning, so that they can represent other The algorithm for uniqueness and similarity learning can choose the standard NLP (Natural Language Processing) TFIDF or LSI model for similarity learning.
这里需要注意的是“LTE小区”或者RRU这些并未在故障知识图谱中单独体现,它们存在于资源知识图谱,故障知识图谱是以“LTE小区退服”这样的告警和相关关联关系,以及相应工单派单规则这些方式存在的。It should be noted here that the "LTE cell" or RRU are not separately reflected in the fault knowledge map, they exist in the resource knowledge map, and the fault knowledge map is based on alarms such as "LTE cell decommissioning" and related associations, as well as corresponding These methods of work order dispatching rules exist.
5)机器学习设计;5) Machine learning design;
如“AAU版本异常”是否导致“RRU异常”是规则库中没有的,也需要进行机器学习判断其关联关系,而“RRU异常”导致“LTE小区退服”和“交流输入停电”导致“基站退服”是规则库中有的,而且不冲突,可以不用再学习。For example, whether "AAU version abnormality" leads to "RRU abnormality" does not exist in the rule base, and machine learning is also required to determine its correlation, while "RRU abnormality" causes "LTE cell decommissioning" and "AC input power failure" causes "base station failure". "Retirement" is in the rule base, and there is no conflict, so you don't need to learn it.
机器学习采用分类模型,可以选择算法如逻辑回归(sigmoid)/决策树以及支持向量机(SVM)等。Machine learning uses classification models, and you can choose algorithms such as logistic regression (sigmoid)/decision tree and support vector machine (SVM).
学习的标签可以按规则库中已经定义了的来学习。The learned labels can be learned as defined in the rule base.
同时需要对学习效果的评估的定义,分类学习按标准的准确率(Accuracy)/精确率(Precision)/召回率(Recall)定义,这里定义具体数值,比如准确率到达80%以上即可得到分类模型,如果不到80%则需要重新学习。At the same time, the definition of the evaluation of the learning effect is required. Classification learning is defined according to the standard accuracy (Accuracy)/precision (Precision)/recall (Recall). Here, specific values are defined. For example, the classification can be obtained if the accuracy rate reaches more than 80%. model, if it is less than 80%, it needs to be relearned.
机器学习的特征如下:The characteristics of machine learning are as follows:
特征如下定义:Traits are defined as follows:
--关联告警的拓扑关联度,网元/板卡/端口;--Topological association degree of associated alarms, network element/board/port;
--关联告警的业务关系度,比如基站下面关联小区;--The degree of business relationship associated with the alarm, such as the associated cell under the base station;
--关联告警的重要程度,比如告警级别;--The importance of the associated alarm, such as the alarm level;
--关联告警的专业关系,比如动环-无线-传输之间的互相影响;--Professional relationships associated with alarms, such as the mutual influence between dynamic ring-wireless-transmission;
--关联告警的支持度/确信度;--Support/confidence of associated alarms;
--告警或故障是否清楚(故障处理情况中的“故障恢复情况”的内容)。--Whether the alarm or fault is clear (the content of "fault recovery situation" in the fault handling situation).
6)阈值设计。6) Threshold design.
包括数据挖掘的频繁次数,和工单处理中分析的关联告警出现次数。Including the frequency of data mining and the number of occurrences of associated alarms analyzed in work order processing.
这些数据频度不能太低,如只有1个月的实时数据只有个位数次数,样本数太少,没有抽取意义。The frequency of these data should not be too low. For example, the real-time data of only one month only has a single-digit frequency, and the number of samples is too small to be meaningful.
以上,即为设计域的内容。The above is the content of the design domain.
步骤S402,规则抽取模块对运维规则抽取知识。Step S402, the rule extraction module extracts knowledge about the operation and maintenance rules.
按照上述步骤S401定义的词典,对规则进行提取。The rules are extracted according to the dictionary defined in step S401 above.
对告警规则,提取出告警和告警关系,通过RDF三元组组成一条告警关联规则,以以下规则举例:For alarm rules, extract alarms and alarm relationships, and form an alarm correlation rule through RDF triples. Take the following rules as an example:
告警1:基站退服告警(二级告警,ID:100-1111);Alarm 1: base station out of service alarm (secondary alarm, ID: 100-1111);
告警2:交流输入停电告警(三级告警,500-0003);Alarm 2: AC input power failure alarm (three-level alarm, 500-0003);
RDF:告警2引起告警1(同机房时间窗口);RDF: Alarm 2 causes alarm 1 (same time window of the computer room);
告警和它们关联关系就会被融入到图3所示中的故障知识图谱的告警关系实例。Alarms and their associated relationships will be integrated into the alarm relationship instance of the fault knowledge graph shown in Figure 3.
对工单派单规则,提取派单条件中的告警,即可融入故障知识图谱中的工单实例。For work order dispatch rules, the alarms in the dispatch conditions can be extracted and integrated into the work order instances in the fault knowledge map.
在这一步,即对实体进行了抽取,也对关系进行了抽取。In this step, both entities and relationships are extracted.
对告警关联来说,实体就是告警1和告警2,关系就是告警2会引发告警1;For alarm correlation, the entity is alarm 1 and alarm 2, and the relationship is that alarm 2 will trigger alarm 1;
对工单派单来说,工单派单以告警1派发,原因是告警2。For work order dispatch, the work order dispatch is dispatched with alarm 1 and the reason is alarm 2.
步骤S403,实时抽取模块抽取实时告警、工单数据。Step S403, the real-time extraction module extracts real-time alarm and work order data.
对实时数据,如告警数据和工单数据进行抽取,能够协助发现规则库没有的知识,下面以图3中的A告警-引起-B告警举例说明。Extracting real-time data, such as alarm data and work order data, can help discover knowledge that is not in the rule base. The following is an example of A alarm-cause-B alarm in Figure 3.
现有规则一般都是成熟的知识,如传统网络中一个4G设备机房,有动环(动力环境)网 元,也有无线网元和传输网元,那么动环网元产生故障,如市政停电引发掉电或者电源告警,可能会导致同机房的基站退服,再会导致4G小区退服(LTE小区退服)。The existing rules are generally mature knowledge. For example, in a 4G equipment room in a traditional network, there are dynamic ring (dynamic environment) network elements, wireless network elements, and transmission network elements. If the dynamic ring network elements fail, such as municipal power outages Power failure or power alarm may cause the base station in the same computer room to be out of service, and then cause the 4G cell to be out of service (LTE cell out of service).
按照这种先验知识,5G网络建设好后,同样存在类似情况,一个5G设备机房,有动环(动力环境)网元,也有无线网元和传输网元,那么动环网元产生故障,如市政停电引发掉电或者电源告警,可能会导致同机房的5G基站退服,再会导致5G小区退服(CU/DU小区退服)。According to this prior knowledge, after the 5G network is built, there will be similar situations. A 5G equipment room has dynamic ring (dynamic environment) network elements, wireless network elements and transmission network elements. If the dynamic ring network element fails, If the municipal power outage causes a power failure or a power alarm, it may cause the 5G base station in the same computer room to be out of service, and then the 5G cell (CU/DU cell out of service).
这种情况,即使规则库没有相应知识,当有类似的告警和工单,运维人员也很容易根据4G网络规则库的先验知识来处理故障,并把这种5G运维知识导入到知识库中。In this case, even if the rule base has no corresponding knowledge, when there are similar alarms and work orders, the operation and maintenance personnel can easily handle the fault based on the prior knowledge of the 4G network rule base, and import this 5G operation and maintenance knowledge into the knowledge library.
大多数终端用户还没有升级到5G手机的情况下,存在5G基站反向开通4G,提供4G小区服务,这种情况下,5G网元的AAU(Active Antenna Unit)需要即支持5G小区,也要支持4G小区,即双模模式(如果只支持5G小区,就是单模模式),但可能软件bug或者软件升级的时间差,还是单模模式,与4G网元的RRU通信出现故障,则RRU会有异常,导致4G小区退服,即图3中的虚线对应的关系,而此时,它服务的5G小区正常。When most end users have not yet upgraded to 5G mobile phones, there are 5G base stations that activate 4G in reverse to provide 4G cell services. It supports 4G cells, that is, dual-mode mode (if it only supports 5G cells, it is single-mode mode), but there may be software bugs or software upgrade time differences, and it is still in single-mode mode. If the RRU communication with the 4G network element fails, the RRU will have Abnormal, resulting in the decommissioning of the 4G cell, that is, the relationship corresponding to the dotted line in Figure 3, and at this time, the 5G cell it serves is normal.
如图8所示,除非有非常丰富的研发知识,否则运维人员很不容易预先定义规则或者实时发现故障原因,即使在研发人员和运维人员协作后发现问题,也不会立即导入规则库,而且类似的问题也需要通过故障知识图谱融合后推理得到,因此需要通过对实时运维数据的学习,对知识的抽取能更快速更准确的发现运维关联知识。As shown in Figure 8, unless they have very rich R&D knowledge, it is not easy for operation and maintenance personnel to define rules in advance or find out the cause of failures in real time. , and similar problems also need to be reasoned through the fusion of fault knowledge graphs. Therefore, it is necessary to learn real-time operation and maintenance data and extract knowledge to discover operation and maintenance related knowledge more quickly and accurately.
而告警相关性是有一定条件的,如关联位置和时间窗口,因此需要先通过数据挖掘来获取相关性的可能性,注意,不用采用频繁集挖掘,因为它们是不频繁发生的,而采用相关性挖掘,如皮尔森系数,得到相关性即可。The alarm correlation has certain conditions, such as the correlation location and time window, so it is necessary to obtain the possibility of correlation through data mining first. Note that frequent set mining is not used, because they do not occur frequently, and correlation is used. Sex mining, such as Pearson coefficient, just get the correlation.
然后根据工单数据抽取如下信息:Then extract the following information based on the work order data:
工单派单的告警,包括发生的位置,此为结构化数据;The alarm of the work order dispatch, including the location of the occurrence, this is structured data;
故障处理情况中的“故障原因初步判断”类似字段内容,此内容可能为非结构化数据;In the fault handling situation, the "preliminary judgment of fault cause" is similar to the field content, which may be unstructured data;
故障处理情况中的“故障处理意见”类似字段内容,此内容可能为非结构化数据;The field content similar to "Troubleshooting Opinion" in the troubleshooting situation, this content may be unstructured data;
故障处理情况中的“故障恢复情况”类似字段内容,此内容可以硬编码是否有“清除”或“恢复”等字样,可以辅助表示此故障处理和原因判断是否正确的标签。The "fault recovery situation" in the fault handling situation is similar to the content of the field. This content can be hard-coded whether there are words such as "clear" or "recovery", which can assist the label indicating whether the fault handling and cause judgment are correct.
抽取完毕后,用挖掘出来的相关性告警来匹配上面的信息,如果有一条或多条,则保存抽取原始信息。After the extraction is complete, use the excavated correlation alarms to match the above information, and if there are one or more, save the extracted original information.
这些原始信息,如上有两处为非结构化数据,需要通过NLP(自然语言处理)进行分词和向量化表示。These raw information, such as the above two places, are unstructured data, which need to be segmented and vectorized through NLP (Natural Language Processing).
图9是根据本公开一实施例的知识实体向量化的示意图(一),如图9所示,由于词语并不多,所以向量化用独热模型编码,即该词在向量中它的位置为1,其它位置为0。Fig. 9 is a schematic diagram (1) of knowledge entity vectorization according to an embodiment of the present disclosure. As shown in Fig. 9, since there are not many words, the vectorization uses one-hot model encoding, that is, the position of the word in the vector is 1, and other positions are 0.
向量化后进行相似度学习,采用标准的LSI或TFIDF模型进行学习,即得到唯一性,即“LTE小区退服”和“4G小区退服”以及“LTE小区退出服务”是一个实体,则去重取唯一值。Carry out similarity learning after vectorization, use the standard LSI or TFIDF model to learn, that is, get the uniqueness, that is, "LTE cell decommissioning" and "4G cell decommissioning" and "LTE cell decommissioning service" are one entity, then go to Refetch unique values.
抽取后的数据作为告警相关性候选和派单规则候选。The extracted data is used as alarm correlation candidates and dispatch rule candidates.
步骤S404,冲突检测模块对知识进行冲突检测。Step S404, the conflict detection module performs conflict detection on the knowledge.
当规则库抽取和实时数据抽取完毕后,还需要进行冲突检测,按如下方式进行:After the rule base extraction and real-time data extraction are completed, conflict detection needs to be performed as follows:
如果实时数据抽取和规则库在告警和相关性上完全匹配,则取其一;If the real-time data extraction and the rule base completely match in terms of alarms and correlations, choose one;
如果实时数据抽取和规则库在告警匹配,但相关性冲突,则判断该告警的工单相关处理字段和原因字段出现次数是否大于阈值,如果大于,则使用机器学习算法模型进行判断,否则采取规则库;If the real-time data extraction and the rule base match the alarm, but the correlation conflicts, then judge whether the number of occurrences of the work order-related processing field and the reason field of the alarm is greater than the threshold. If it is greater than the threshold, use the machine learning algorithm model to judge, otherwise adopt the rule library;
如果实时数据抽取完全不在规则库,则取实时数据抽取信息,使用机器学习算法模型进行判断得到相关性。If the real-time data extraction is not in the rule base at all, the real-time data extraction information is taken, and the machine learning algorithm model is used to judge and obtain the correlation.
冲突检测不仅在告警关联规则库上进行检测,还要在工单派单规则库进行检测。因为可能发现了新的告警关联关系,如果以前的派单规则库没有,那么可能新增一条相关知识。Conflict detection is not only detected in the alarm correlation rule base, but also in the work order dispatch rule base. Because a new alarm correlation may be found, if there is no previous dispatch rule base, then a new related knowledge may be added.
抽取完毕,准备后续的融入故障知识图谱。After the extraction is completed, prepare for subsequent integration into the fault knowledge map.
步骤S403与S404,即对实体进行了抽取,也对关系进行了抽取,还对告警工单事件进行了抽取,对故障知识图谱融合有更多的信息。In steps S403 and S404, entities are extracted, relationships are also extracted, and alarm work order events are also extracted, so that there is more information for fault knowledge map fusion.
步骤S405,汇总模块按照知识相关性汇总。Step S405, the summary module summarizes according to the knowledge correlation.
为了更方便的融入故障知识图谱,需要把这些知识进行汇总。In order to integrate into the fault knowledge map more conveniently, it is necessary to summarize these knowledge.
汇总不是简单的排队入列,而是分成两部分汇总:The summary is not simply queued up, but divided into two parts:
按照相关性汇总,按照主告警进行汇总,主告警为key,它所有相关告警集合为value,通过hash聚合汇总;Summarize according to the correlation, and summarize according to the main alarm. The main alarm is the key, and the set of all related alarms is the value, which is aggregated and summarized through hash;
按照派单汇总,按照派单告警为key,派单规则集合为value汇总。According to the summary of the dispatch order, according to the dispatch order alarm as the key, the collection of dispatch order rules is the value summary.
这样汇总,后续更方便融入故障知识图谱。This summary makes it easier to integrate into the fault knowledge graph later.
步骤S406,融合进入故障知识图谱。Step S406, merging into the fault knowledge map.
当得到一个汇总后,需要融入故障知识图谱中。用key代表的告警,通过本体映射和实体映射在故障知识图谱中寻找节点,如果找到,则进行内容冲突检测,如何无问题挂上相应节点;如果父节点没有在图谱中,则在故障知识图谱中新建一个父节点。When a summary is obtained, it needs to be integrated into the fault knowledge map. For an alarm represented by a key, find a node in the fault knowledge graph through ontology mapping and entity mapping. Create a new parent node.
本实施例,通过对规则库和实时运维数据的抽取,当新类型新业务新拓扑的告警产生的时候,很容易融入在故障知识图谱中;通过抽取和冲突检测的事先判断,能够对后面融入故障知识图谱的准确性有较大的帮助;对5G切片复杂网络和业务的告警根因模型有比较好的参考作用。In this embodiment, through the extraction of rule base and real-time operation and maintenance data, when alarms of new types, new services and new topologies are generated, they can be easily integrated into the fault knowledge map; through the prior judgment of extraction and conflict detection, it is possible to Incorporating the accuracy of the fault knowledge map is of great help; it is a good reference for the alarm root cause model of complex networks and services in 5G slices.
本实施例中的数据准备,对规则库和实时数据,包括工单和告警等运维数据和相应的规则库。The data preparation in this embodiment refers to the rule base and real-time data, including operation and maintenance data such as work orders and alarms and the corresponding rule base.
先对规则库进行抽取,这个主要是结构化数据,相对比较容易。First extract the rule base, which is mainly structured data, which is relatively easy.
再对实时告警和实时工单进行抽取,中间会用到数据挖掘。Then extract real-time alarms and real-time work orders, and use data mining in the middle.
抽取过程中,会用到分词和向量化,相似度学习让抽取出来的知识唯一。During the extraction process, word segmentation and vectorization are used, and similarity learning makes the extracted knowledge unique.
学习后,进行冲突检测,冲突检测中可能需要分类模块对新抽取出来的关系用机器学习进行判断。After learning, perform conflict detection. In conflict detection, the classification module may be required to use machine learning to judge the newly extracted relationship.
最后汇总模块把抽取出来的知识进行按序汇总,为故障知识图谱融合做好准备。Finally, the summary module summarizes the extracted knowledge in order to prepare for the fusion of fault knowledge graphs.
这个就是整个系统的运行流程,如图4所示。This is the operation process of the whole system, as shown in Figure 4.
图10是根据本公开一实施例的知识实体向量化的示意图(二),如图10所示,实体告警向量化one-hot编码,对所有可能的关联告警,进行向量化,采用one-hot独热编码,即在一个n维向量中,只有自己是1,其它都是0。独热编码虽然具有稀疏性,但是本身告警类型并不算多,所以,可以不用太考虑内存消耗。Fig. 10 is a schematic diagram (2) of knowledge entity vectorization according to an embodiment of the present disclosure. As shown in Fig. 10 , entity alarms are vectorized with one-hot encoding, and all possible associated alarms are vectorized, using one-hot One-hot encoding, that is, in an n-dimensional vector, only itself is 1, and the others are 0. Although one-hot encoding is sparse, there are not many types of alarms, so you don't need to think too much about memory consumption.
从规则库抽取实体和关系,在准备好规则库和关键字词典后,对规则库进行解析。下面 以告警关联关系规则库为例进行说明。图11是根据本公开一实施例的告警关联关系规则抽取的流程图,如图11所示,包括:Extract entities and relations from the rule base, and analyze the rule base after preparing the rule base and keyword dictionary. The following uses the alarm correlation rule base as an example to illustrate. Fig. 11 is a flow chart of alarm correlation rule extraction according to an embodiment of the present disclosure, as shown in Fig. 11 , including:
步骤S1101,准备告警关联规则库和关键词字典;Step S1101, preparing an alarm association rule base and a keyword dictionary;
步骤S1102,获取告警关联规则;Step S1102, obtaining alarm association rules;
步骤S1103,获取关键字“关联方式”、“专业”的内容;Step S1103, acquiring the content of the keywords "association" and "professional";
步骤S1104,获取关键字“关联位置”、“时间窗口”内容;Step S1104, obtaining the content of keywords "associated position" and "time window";
步骤S1105,获取关键字“父告警”、“子告警”内容;Step S1105, obtaining the content of keywords "parent alarm" and "child alarm";
步骤S1106,获取关键字“规则描述”内容;Step S1106, obtaining the content of the keyword "rule description";
步骤S1107,对“规则描述内容”用关键字“引起”等进行左右分词;Step S1107, perform left and right word segmentation on the "rule description content" with the keyword "cause";
步骤S1108,用左词跟父子告警进行大类匹配;Step S1108, use the left word to perform category matching with the parent-child alarm;
步骤S1109,用右词跟父子告警进行大类匹配;Step S1109, use the right word to match parent and child alarms;
步骤S1110,形成告警关联RDF;Step S1110, forming an alarm related RDF;
步骤S1111,抽取本条结束继续下一条规则。Step S1111, extracting this article ends and proceeds to the next rule.
先通过关键字即列名获取相关内容,如关联方式和专业,它们的内容仅仅是被抽取出来,在后续故障知识图谱融合时会使用。First obtain relevant content through keywords, that is, column names, such as association methods and majors. Their content is only extracted and will be used in the subsequent fusion of fault knowledge graphs.
关联位置和时间窗口被抽取出来,不仅后续故障知识图谱融合会使用,在抽取过程中也可能会用来实时数据挖掘验证。The associated location and time window are extracted, which will not only be used for subsequent fault knowledge map fusion, but may also be used for real-time data mining verification during the extraction process.
然后取出来父子告警,这里就是告警实体了,但需要主要父子关系的关联并不一定明确了根因,即不一定父导致了子,也不一定子导致了父。取父子,除了告警合并外,在派单过程中,往往会派重要的“父告警”,“子告警”作为工单派单中的可能原因具有参考价值。Then take out the parent-child alarm, here is the alarm entity, but the relationship that requires the main parent-child relationship does not necessarily clarify the root cause, that is, the parent does not necessarily cause the child, nor does the child cause the parent. Taking father and son, in addition to alarm merging, in the process of dispatching, often important "parent alarms" and "child alarms" are of reference value as possible reasons in work order dispatching.
所以,父子还无法用根因RDF来描述,需要再解析规则描述。Therefore, the father and son cannot be described by the root cause RDF, and the rule description needs to be parsed.
规则内容取出来后,用关键字“引起”分词(注意,这里关键字“引起”仅为举例说明,关键词还可以为其它定义的词语,如“导致”等),“引起”左边的词是根因,右边的词是被触发的告警。After the rule content is taken out, use the keyword "cause" to segment the word (note that the keyword "cause" here is only an example, and the keyword can also be other defined words, such as "cause", etc.), and the word on the left of "cause" is the root cause, and the word on the right is the triggered alarm.
左右词再跟父子告警进行大类匹配。所谓大类匹配,即指父子告警实体可能只是左右词的一个子类,如基站退服包括了4G基站退服和5G基站退服,动环断电可以有市电停电、交流输入停电告警、输出电压过低告警等,所以需要大类匹配。The left and right words are then matched with the father and son alarms. The so-called category matching means that the parent-child alarm entity may be a subcategory of the left and right words. For example, base station decommissioning includes 4G base station decommissioning and 5G base station decommissioning. Power outages in the dynamic ring can include mains power outages, AC input power outage alarms, Low output voltage alarm, etc., so a large category of matching is required.
匹配后,即可组成根因RDF,即左侧告警引起右侧告警。After matching, the root cause RDF can be formed, that is, the alarm on the left causes the alarm on the right.
同样,对工单规则库,也可以同样抽取实例和关系,如图6所示,对关键字“名称”和“派单条件”内容抽取,得到具体告警,即可得到工单RDF,即XX告警派发XX工单。Similarly, for the work order rule base, instances and relationships can also be extracted, as shown in Figure 6, by extracting the keywords "name" and "dispatch condition", and getting specific alarms, you can get the work order RDF, namely XX Alarm dispatch XX work order.
一般来说规则库抽取出来的实体和关系都是比较准确的,可以拿来作为标签学习。Generally speaking, the entities and relationships extracted from the rule base are relatively accurate and can be used as labels for learning.
本实施例中,从实时数据抽取实体和关系,在准备好告警数据和工单数据后,要对实时数据进行解析。图12是根据本公开一实施例的实时数据抽取的流程图,如图12所示,包括:In this embodiment, entities and relationships are extracted from real-time data, and after the alarm data and work order data are prepared, the real-time data needs to be analyzed. Fig. 12 is a flowchart of real-time data extraction according to an embodiment of the present disclosure, as shown in Fig. 12 , including:
步骤S1201,准备告警和工单数据;Step S1201, preparing alarm and work order data;
步骤S1202,对告警进行相关性挖掘;Step S1202, performing correlation mining on the alarm;
步骤S1203,通过关键字段提取工单的告警相关内容,包括发生时间、位置等;Step S1203, extract the alarm-related content of the work order through key fields, including occurrence time, location, etc.;
步骤S1204,通过关键字段提取工单的故障处理情况的内容;Step S1204, extracting the content of the fault handling condition of the work order through the key field;
步骤S1205,提取“故障原因初步判断”、“故障处理意见”;Step S1205, extracting "preliminary judgment of fault cause" and "fault handling opinion";
步骤S1206,分词,得到一个或多个原因告警;Step S1206, word segmentation, to obtain one or more cause alarms;
步骤S1207,相似度计算得到对应的告警向量;Step S1207, calculating the similarity to obtain a corresponding alarm vector;
步骤S1208,跟挖掘出来的相关性告警进行匹配;Step S1208, matching with the excavated correlation alarm;
步骤S1209,得到告警关联关系和工单派单关系。Step S1209, obtaining the alarm association relationship and the work order dispatch relationship.
先通过相关性挖掘算法,如皮尔森系数等对告警进行挖掘,不采用频繁集挖掘,是因为可能告警很重要但发生并不频繁,也可能关联的告警发生很少,如图8所示的情况。Mining the alarms first through correlation mining algorithms, such as Pearson coefficient, etc., instead of using frequent set mining, is because the alarms may be important but occur infrequently, or the associated alarms may occur rarely, as shown in Figure 8 Condition.
得到告警关联关系后先保存下来,然后对工单数据进行抽取。After obtaining the alarm association relationship, save it first, and then extract the work order data.
根据关键字段,如告警、告警发生时间、发生位置、告警ID、告警级别等,获取工单数据中的告警内容。According to key fields, such as alarm, alarm occurrence time, occurrence location, alarm ID, alarm level, etc., the alarm content in the work order data is obtained.
根据工单派单原则,可能是前期告警处理时已经发现关联后的告警经过合并后派一条单,也可能是单独一条告警,无论是哪种情况,需要在工单派单处理中分析是否有真正的关联根因告警。According to the work order dispatching principle, it may be that the related alarms that have been found in the early alarm processing have been combined to send a single alarm, or a single alarm. In either case, it is necessary to analyze whether there is any error in the work order dispatching process. True correlative root cause alerts.
因此提取关键字段“故障处理情况”中的“故障原因初步判断”和“故障处理意见”等内容,获取相关性告警,可能是根因告警,可能不止一条。Therefore, extract the "preliminary judgment of fault cause" and "troubleshooting opinions" in the key field "fault handling situation" to obtain related alarms, which may be root cause alarms, and there may be more than one.
由于这里是人工填写,可能不是精确语言,所以需要用NLP进行分词和相似度学习得到相关告警唯一标识。Since this is filled in manually, it may not be an accurate language, so it is necessary to use NLP for word segmentation and similarity learning to obtain the unique identification of relevant alarms.
再和前面挖掘出来的相关告警进行匹配,如果匹配成功,得到初步的RDF候选,如告警关联关系和工单派单RDF。Then match with the relevant alarms excavated earlier. If the matching is successful, get preliminary RDF candidates, such as alarm correlation and work order dispatch RDF.
这些候选集合,要在后面进行冲突检测。These candidate sets are to be checked for conflicts later.
特征工程处理,以前面说的几个特征举例,告警相互间的网元拓扑关系/业务关系/专业关系/告警级别/频繁集的确信度/关联告警发生区间/关联告警的相似度,当然,特征不仅仅局限于这些特征。图13是根据本公开一实施例的特征工程的流程图,如图13所示,包括:Feature engineering processing, taking the above-mentioned features as an example, network element topology relationship/business relationship/professional relationship/alarm level/frequent set certainty/associated alarm occurrence interval/similarity of associated alarms between alarms, of course, Features are not limited to these features. Fig. 13 is a flowchart of feature engineering according to an embodiment of the present disclosure, as shown in Fig. 13 , including:
步骤S1301,准备告警关联集合;Step S1301, preparing an alarm association set;
步骤S1302,获取当前一对告警;Step S1302, obtaining the current pair of alarms;
步骤S1303,获取它们发生所在设备的拓扑关系;Step S1303, obtaining the topological relationship of the devices where they occur;
步骤S1304,获取它们发生所在设备的业务关系;Step S1304, obtaining the business relationship of the equipment where they occur;
步骤S1305,获取它们的专业关系,可选六个维度无线/数据/传输/核心/动环/跨网;Step S1305, obtain their professional relationship, optional six dimensions of wireless/data/transmission/core/dynamic ring/cross-network;
步骤S1306,获取它们的告警级别,按1/2/3/4四个级别;Step S1306, obtain their alarm levels, according to four levels of 1/2/3/4;
步骤S1307,获取它们的工单中“故障处理情况”的“故障恢复情况”内容;Step S1307, obtaining the content of "fault recovery status" of "fault handling status" in their work orders;
步骤S1308,分词,相似度计算是否有“告警清除”,按是否有得到该维度值1/0;Step S1308, word segmentation, whether there is "alarm clearing" in the similarity calculation, and get the dimension value 1/0 according to whether there is;
步骤S1309,归一化处理得到所有特征。In step S1309, all features are obtained through normalization processing.
要把除了告警级别的所有数据归一化处理,这里归一化就是把各维度的数据处理成1和0。It is necessary to normalize all the data except the alarm level, where the normalization is to process the data of each dimension into 1 and 0.
告警级别就是一个维度,值就按告警级别来填写,一般来说,告警级别越高,在告警根因树模型上越显著。The alarm level is a dimension, and the value is filled in according to the alarm level. Generally speaking, the higher the alarm level, the more significant it is in the alarm root cause tree model.
拓扑关系,比如一对告警A和B分别发生在BBU和基站,那么它们的拓扑关系,将会有两个维度,拓扑关系A是B的父,拓扑关系B不是A的父,在这里,第一个维度填写0,第二个维度填写1,如果两个没有拓扑上下级关系,则两项都填写为0,业务关系这个特征同样处理。Topological relationship. For example, a pair of alarms A and B occur in the BBU and the base station respectively. Then their topological relationship will have two dimensions. Topological relationship A is the parent of B, and topological relationship B is not the parent of A. Here, the first Fill in 0 for one dimension, and 1 for the second dimension. If there is no topology-subordinate relationship between the two, fill in 0 for both, and the feature of business relationship is handled in the same way.
专业关系,按无线/数据/传输/核心/动环/跨网六个维度处理,比如是无线的,那么无线 那个维度填写1,其它维度填写0。Professional relationship is handled according to the six dimensions of wireless/data/transmission/core/dynamic ring/cross-network. For example, if it is wireless, then fill in 1 for the wireless dimension and 0 for other dimensions.
对相关性数据挖掘的维度来说,采用确信度即可。For the dimension of correlation data mining, the degree of certainty can be used.
每条工单有相应的“故障恢复情况”内容,它在字段“故障处理情况中”,它部分程度代表了这个原因判断是否正确(如图7和图8所示),但是该内容为非结构化数据,需要分词后,用相似度训练与“告警清除”或“告警恢复”是否近似,如果近似,则为1,否则为0。注意,该字段并不能当作关联关系中根因的标签,但可以拿来当作一个特征进行训练。Each work order has a corresponding "failure recovery situation" content, which is in the field "failure handling situation", which partly represents whether the reason judgment is correct (as shown in Figure 7 and Figure 8), but the content is non- Structured data, after word segmentation, whether similarity training is similar to "alarm clearing" or "alarm recovery", if similar, it is 1, otherwise it is 0. Note that this field cannot be used as the label of the root cause in the association, but it can be used as a feature for training.
机器学习知识库得到SPO分类模型,通过机器学习来学习SPO(主和宾)的分类模型,得到上面所述特征所有维度的权重值。采用有监督学习分类学习算法,可以采用逻辑回归决策树随机森林等。这个过程并不复杂,都比较成熟,比如逻辑回归都有成熟的模型算法库。The machine learning knowledge base obtains the SPO classification model, learns the SPO (subject and guest) classification model through machine learning, and obtains the weight values of all dimensions of the above-mentioned features. Using supervised learning classification learning algorithm, you can use logistic regression decision tree random forest etc. This process is not complicated and is relatively mature. For example, logistic regression has a mature model algorithm library.
图14是根据本公开一实施例的冲突检测的流程图,如图14所示,包括:Fig. 14 is a flow chart of conflict detection according to an embodiment of the present disclosure, as shown in Fig. 14 , including:
步骤S1401,准备候选集合;Step S1401, preparing a candidate set;
步骤S1402,获取当前一条候选RDF;Step S1402, obtaining a current candidate RDF;
步骤S1403,判断候选RDF是否在规则库中,在判断结果为是的情况下,执行步骤S1404,在判断结果为否是情况下,执行步骤S1408;Step S1403, judging whether the candidate RDF is in the rule base, if the judging result is yes, go to step S1404, and if the judging result is no, go to step S1408;
步骤S1404,与规则库进行匹配;Step S1404, matching with the rule base;
步骤S1405,判断是否匹配,在判断结果为是的情况下,执行步骤S1402,在判断结果为否是情况下,执行步骤S1406;Step S1405, judging whether they match, if the judging result is yes, go to step S1402, and if the judging result is no, go to step S1406;
步骤S1406,判断是否冲突,在判断结果为是的情况下,执行步骤S1407,在判断结果为否是情况下,执行步骤S1402;Step S1406, judging whether there is a conflict, if the judging result is yes, go to step S1407, and if the judging result is no, go to step S1402;
步骤S1407,判断该候选是否频度低频,在判断结果为是的情况下,执行步骤S1402,在判断结果为否是情况下,执行步骤S1408;Step S1407, judging whether the candidate has a low frequency, if the judging result is yes, go to step S1402, and if the judging result is no, go to step S1408;
步骤S1408,通过机器学习确定该条候选有效;Step S1408, determine that the candidate is valid through machine learning;
步骤S1409,所有候选归结准备汇总。Step S1409, all candidates are summed up and ready to be summed up.
对于新的抽取出来的关联关系,无论是5G的告警,还是3G/4G/5G混合组网的产生的告警,都将和规则库进行对比是否有冲突。For the newly extracted association relationship, whether it is a 5G alarm or an alarm generated by a 3G/4G/5G hybrid network, it will be compared with the rule base to see if there is any conflict.
先检查候选集合中的告警是否存在,如果不存在,则用机器学习判断关联关系,得到新的关联关系。First check whether the alarms in the candidate set exist. If not, use machine learning to judge the association relationship and obtain a new association relationship.
如果告警存在,先检查和规则库完全匹配,则继续下一条。If the alarm exists, first check that it matches the rule base completely, and then continue to the next one.
如果不冲突,继续下一条,如果冲突则不一定是规则库一定正确,虽然这种情况很少,但要考虑,如可能组网发生变化,链路关系发生变化等。If there is no conflict, continue to the next one. If there is a conflict, the rule base may not necessarily be correct. Although this situation is rare, it must be considered, such as possible changes in the network and link relationships.
先判断是否发生次数很少是低频,则小概率事件不用考虑,否则要用机器学习重新判断。First judge whether the number of occurrences is low and low frequency, then the low-probability events do not need to be considered, otherwise, use machine learning to re-judge.
注意,匹配之后还要判断是否冲突,是因为告警关联是比较复杂的过程,可能有多层关联。Note that after matching, it is necessary to determine whether there is a conflict, because alarm correlation is a relatively complicated process, and there may be multiple layers of correlation.
同样,对工单一样,工单规则也需要检测,当然相对更简单,主要是看派单规则库是否没有,如果没有相关告警合并成一条工单,即可以新建一条工单派单知识。Similarly, as with work orders, work order rules also need to be checked. Of course, it is relatively simpler, mainly to see if there is no dispatch rule library. If there are no relevant alarms merged into a work order, a new work order dispatch knowledge can be created.
冲突检测完后,即可汇总,然后准备融入故障知识图谱。After the conflict is detected, it can be summarized and then ready to be integrated into the fault knowledge map.
根据本公开的另一个实施例,还提供了一种故障知识图谱构建装置,图15是根据本公开另一实施例的故障知识图谱构建装置的框图,如图15所示,包括:According to another embodiment of the present disclosure, a device for constructing a fault knowledge graph is also provided. FIG. 15 is a block diagram of a device for constructing a fault knowledge graph according to another embodiment of the present disclosure. As shown in FIG. 15 , it includes:
第一抽取模块152,设置为对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;The first extraction module 152 is configured to extract entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
第二抽取模块154,设置为对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;The second extraction module 154 is configured to extract entities and relationships from the real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
冲突检测模块156,设置为根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;The conflict detection module 156 is configured to perform conflict detection according to the first candidate set and the second candidate set, and obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
更新模块158,设置为将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。The update module 158 is configured to integrate the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
在一示例性实施例中,所述第一抽取模块152包括:In an exemplary embodiment, the first extraction module 152 includes:
第一抽取子模块,设置为对所述规则库中的告警关联规则,通过关键字段抽取不同告警之间的关联关系,组成所述告警关系实例;The first extraction sub-module is configured to extract the association relationship between different alarms through key fields for the alarm association rules in the rule base to form the alarm relationship instance;
第二抽取子模块,设置为对所述规则库中的工单派单规则,通过关键字段抽取派单条件中的不同告警之间的关联关系,组成所述工单派单实例。The second extraction sub-module is configured to extract the relationship between different alarms in the order dispatching conditions through the key fields of the work order dispatching rules in the rule base to form the work order dispatching instance.
在一示例性实施例中,所述第一抽取子模块,还设置为In an exemplary embodiment, the first extraction sub-module is further set to
根据第一关键字段抽取父子告警与规则描述内容;Extract parent-child alarm and rule description content according to the first key field;
根据第二关键字段对所述规则描述内容进行左右分词,得到左分词与右分词;According to the second key field, the left and right participle is performed on the description content of the rule to obtain the left participle and the right participle;
分别将所述左分词、所述右分词与父子告警进行大类匹配,以形成所述告警关联实例。The left participle and the right participle are respectively matched with parent-child alarms to form the alarm association instance.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
转换模块,设置为将所述规则库中的半结构化数据转换为结构化数据。A conversion module, configured to convert the semi-structured data in the rule base into structured data.
在一示例性实施例中,所述第二抽取模块154,还设置为In an exemplary embodiment, the second extraction module 154 is further configured to
对所述实时数据中的告警数据进行相关性挖掘,得到相关性告警;Carrying out correlation mining on the alarm data in the real-time data to obtain a correlation alarm;
对所述实时数据中的工单数据,根据关键字段抽取告警内容与故障处理内容,其中,所述故障处理内容至少包括故障处理情况、故障原因初步判断、故障处理意见;For the work order data in the real-time data, extract the alarm content and fault handling content according to the key fields, wherein the fault handling content includes at least the fault handling situation, the preliminary judgment of the fault cause, and the fault handling opinion;
对所述告警内容与所述故障处理内容进行分词处理,得到一个或多个原因告警;Perform word segmentation processing on the alarm content and the fault handling content to obtain one or more cause alarms;
根据相似度确定所述一个或多个原因告警的告警向量;determining an alarm vector of the one or more cause alarms according to the similarity;
将所述告警向量与所述相关性告警进行匹配,得到所述告警关联关系与所述工单派单关系。The alarm vector is matched with the correlation alarm to obtain the alarm correlation and the work order dispatching relationship.
在一示例性实施例中,所述冲突检测模块156,还设置为In an exemplary embodiment, the conflict detection module 156 is further configured to
对所述第二候选集中的每个告警关联关系与每个工单派单关系,执行以下操作,得到有效告警关联关系与工单派单知识组成的所述目标候选集,其中,对于正在执行的告警关联关系与工单派单关系称为当前告警关联关系与当前工单派单关系:For each alarm association relationship and each work order dispatch relationship in the second candidate set, perform the following operations to obtain the target candidate set composed of effective alarm association relationships and work order dispatch knowledge, wherein, for the The alarm association relationship and work order dispatching relationship is called the current alarm association relationship and the current work order dispatching relationship:
判断所述当前告警关联关系是否存在于所述第一候选集中;在判断结果为否的情况下,通过机器学习确定所述当前告警关联关系有效;在判断结果为是的情况下,当所述当前告警关联关系与所述第一候选集中的告警关系实例不匹配且不存在冲突时,确定所述当前告警关联关系的数据频度是否小于预设阈值,在确定结果为否的情况下,通过机器学习确定所述当前告警关联关系为所述有效告警关联关系;Judging whether the current alarm association relationship exists in the first candidate set; if the judgment result is no, determine that the current alarm association relationship is valid through machine learning; if the judgment result is yes, when the When the current alarm association relationship does not match the alarm relationship instance in the first candidate set and there is no conflict, determine whether the data frequency of the current alarm association relationship is less than a preset threshold, and if the determination result is no, pass Machine learning determines that the current alarm association relationship is the effective alarm association relationship;
判断所述当前工单派单关系是否存在于所述第一候选集的工单派单实例中;在判断结果为否的情况下,将所述当前工单派单关系与相关告警合并成所述工单派单知识。Judging whether the current work order dispatching relationship exists in the work order dispatching instance of the first candidate set; if the judgment result is no, merging the current work order dispatching relationship and related alarms into the Describe the work order dispatch knowledge.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第一汇总模块,设置为对所述目标候选集中的告警关联关系按照相关性进行汇总;The first summary module is configured to summarize the alarm correlations in the target candidate set according to the correlation;
第二汇总模块,设置为对所述目标候选集中的工单派单关系按照派单进行汇总。The second summarizing module is configured to sum up the work order dispatching relationships in the target candidate set according to dispatching orders.
在一示例性实施例中,所述更新模块158,还设置为In an exemplary embodiment, the update module 158 is further configured to
使用key代表告警,通过本体映射和实体映射在所述故障知识图谱中为所述目标候选集的所述有效告警关联关系与所述工单派单知识寻找对应的节点;Using a key to represent an alarm, searching for a corresponding node in the fault knowledge graph for the effective alarm association relationship of the target candidate set and the work order dispatch knowledge through ontology mapping and entity mapping;
当找到对应的所述节点时,进行内容冲突检测,得到检测结果;When the corresponding node is found, content conflict detection is performed to obtain a detection result;
当所述检测结果为不存在内容冲突时,将所述有效告警关联关系与所述工单派单知识挂在对应的所述节点上;When the detection result is that there is no content conflict, hang the effective alarm association relationship and the work order dispatch knowledge on the corresponding node;
当对应的所述节点没有在所述故障知识图谱中时,则在所述故障知识图谱中新建一个父节点,将所述有效告警关联关系与所述工单派单知识挂在新建的所述父节点上。When the corresponding node is not in the fault knowledge graph, a new parent node is created in the fault knowledge graph, and the effective alarm association relationship and the work order dispatch knowledge are linked to the newly created on the parent node.
本公开的实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to execute the steps in any one of the above method embodiments when running.
在一个示例性实施例中,上述计算机可读存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储计算机程序的介质。In an exemplary embodiment, the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
本公开的实施例还提供了一种电子装置,包括存储器和处理器,该存储器中存储有计算机程序,该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
在一个示例性实施例中,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。In an exemplary embodiment, the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
本实施例中的具体示例可以参考上述实施例及示例性实施方式中所描述的示例,本实施例在此不再赘述。For specific examples in this embodiment, reference may be made to the examples described in the foregoing embodiments and exemplary implementation manners, and details will not be repeated here in this embodiment.
显然,本领域的技术人员应该明白,上述的本公开的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本公开不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.
以上所述仅为本公开的优选实施例而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. For those skilled in the art, the present disclosure may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the principle of the present disclosure shall be included in the protection scope of the present disclosure.

Claims (11)

  1. 一种故障知识图谱构建方法,包括:A fault knowledge graph construction method, comprising:
    对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;Extract entities and relationships from the rule base to obtain the first candidate set including alarm relationship instances and work order dispatch instances;
    对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;Extract entities and relationships from real-time data to obtain the second candidate set of alarm association relationship and work order dispatch relationship;
    根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;Performing conflict detection according to the first candidate set and the second candidate set to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
    将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。The target candidate set is integrated into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  2. 根据权利要求1所述的方法,其中,对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集包括:The method according to claim 1, wherein the entity and relationship extraction is performed on the rule base, and the first candidate set including the alarm relationship instance and the work order dispatch instance obtained includes:
    对所述规则库中的告警关联规则,通过关键字段抽取不同告警之间的关联关系,组成所述告警关系实例;For the alarm association rules in the rule base, the association relationship between different alarms is extracted through key fields to form the alarm relationship instance;
    对所述规则库中的工单派单规则,通过关键字段抽取派单条件中的不同告警之间的关联关系,组成所述工单派单实例。For the work order dispatching rules in the rule base, the relationship between different alarms in the order dispatching conditions is extracted through key fields to form the work order dispatching instance.
  3. 根据权利要求2所述的方法,其中,对所述规则库中的告警规则,通过关键字段抽取不同告警之间的关联关系,组成告警关系实例包括:The method according to claim 2, wherein, for the alarm rules in the rule base, the association relationship between different alarms is extracted through the key field, and the alarm relationship instance comprises:
    根据第一关键字段抽取父子告警与规则描述内容;Extract parent-child alarm and rule description content according to the first key field;
    根据第二关键字段对所述规则描述内容进行左右分词,得到左分词与右分词;According to the second key field, the left and right participle is performed on the description content of the rule to obtain the left participle and the right participle;
    分别将所述左分词、所述右分词与父子告警进行大类匹配,以形成所述告警关联实例。The left participle and the right participle are respectively matched with parent-child alarms to form the alarm association instance.
  4. 根据权利要求2所述的方法,其中,在对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集之前,所述方法还包括:The method according to claim 2, wherein, before performing entity and relationship extraction on the rule base to obtain the first candidate set including alarm relationship instances and work order dispatch instances, the method further includes:
    将所述规则库中的半结构化数据转换为结构化数据。The semi-structured data in the rule base is converted into structured data.
  5. 根据权利要求1所述的方法,其中,对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集包括:The method according to claim 1, wherein the entity and relationship extraction is performed on the real-time data, and the second candidate set for obtaining the alarm association relationship and the work order dispatch relationship includes:
    对所述实时数据中的告警数据进行相关性挖掘,得到相关性告警;Carrying out correlation mining on the alarm data in the real-time data to obtain a correlation alarm;
    对所述实时数据中的工单数据,根据关键字段抽取告警内容与故障处理内容,其中,所述故障处理内容至少包括故障处理情况、故障原因初步判断、故障处理意见;For the work order data in the real-time data, extract the alarm content and fault handling content according to the key fields, wherein the fault handling content includes at least the fault handling situation, the preliminary judgment of the fault cause, and the fault handling opinion;
    对所述告警内容与所述故障处理内容进行分词处理,得到一个或多个原因告警;Perform word segmentation processing on the alarm content and the fault handling content to obtain one or more cause alarms;
    根据相似度确定所述一个或多个原因告警的告警向量;determining an alarm vector of the one or more cause alarms according to the similarity;
    将所述告警向量与所述相关性告警进行匹配,得到所述告警关联关系与所述工单派单关系。The alarm vector is matched with the correlation alarm to obtain the alarm correlation and the work order dispatching relationship.
  6. 根据权利要求1所述的方法,其中,根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的所述目标候选集包括:The method according to claim 1, wherein, performing conflict detection according to the first candidate set and the second candidate set, obtaining the target candidate set including effective alarm association relationship and work order dispatch knowledge includes:
    对所述第二候选集中的每个告警关联关系与每个工单派单关系,执行以下操作,得到有效告警关联关系与工单派单知识组成的所述目标候选集,其中,对于正在执行的告警关联关系与工单派单关系称为当前告警关联关系与当前工单派单关系:For each alarm association relationship and each work order dispatch relationship in the second candidate set, perform the following operations to obtain the target candidate set composed of effective alarm association relationships and work order dispatch knowledge, wherein, for the The alarm association relationship and work order dispatching relationship is called the current alarm association relationship and the current work order dispatching relationship:
    判断所述当前告警关联关系是否存在于所述第一候选集中;在判断结果为否的情况下,通过机器学习确定所述当前告警关联关系有效;在判断结果为是的情况下,当所述当前告警 关联关系与所述第一候选集中的告警关系实例不匹配且不存在冲突时,确定所述当前告警关联关系的数据频度是否小于预设阈值,在确定结果为否的情况下,通过机器学习确定所述当前告警关联关系为所述有效告警关联关系;Judging whether the current alarm association relationship exists in the first candidate set; if the judgment result is no, determine that the current alarm association relationship is valid through machine learning; if the judgment result is yes, when the When the current alarm association relationship does not match the alarm relationship instance in the first candidate set and there is no conflict, determine whether the data frequency of the current alarm association relationship is less than a preset threshold, and if the determination result is no, pass Machine learning determines that the current alarm association relationship is the effective alarm association relationship;
    判断所述当前工单派单关系是否存在于所述第一候选集的工单派单实例中;在判断结果为否的情况下,将所述当前工单派单关系与相关告警合并成所述工单派单知识。Judging whether the current work order dispatching relationship exists in the work order dispatching instance of the first candidate set; if the judgment result is no, merging the current work order dispatching relationship and related alarms into the Describe the work order dispatch knowledge.
  7. 根据权利要求1至6中任一项所述的方法,其中,在将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱之前,所述方法还包括:The method according to any one of claims 1 to 6, wherein, before incorporating the target candidate set into the fault knowledge graph constructed according to the rule base to obtain the updated fault knowledge graph, the method further include:
    对所述目标候选集中的告警关联关系按照相关性进行汇总;Summarizing the alarm correlations in the target candidate set according to the correlation;
    对所述目标候选集中的工单派单关系按照派单进行汇总。Summarize work order dispatch relationships in the target candidate set according to dispatch.
  8. 根据权利要求1至6中任一项所述的方法,其中,将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱包括:The method according to any one of claims 1 to 6, wherein incorporating the target candidate set into the fault knowledge graph constructed according to the rule base, and obtaining the updated fault knowledge graph comprises:
    使用key代表告警,通过本体映射和实体映射在所述故障知识图谱中为所述目标候选集的所述有效告警关联关系与所述工单派单知识寻找对应的节点;Using a key to represent an alarm, searching for a corresponding node in the fault knowledge graph for the effective alarm association relationship of the target candidate set and the work order dispatch knowledge through ontology mapping and entity mapping;
    当找到对应的所述节点时,进行内容冲突检测,得到检测结果;When the corresponding node is found, content conflict detection is performed to obtain a detection result;
    当所述检测结果为不存在内容冲突时,将所述有效告警关联关系与所述工单派单知识挂在对应的所述节点上;When the detection result is that there is no content conflict, hang the effective alarm association relationship and the work order dispatch knowledge on the corresponding node;
    当对应的所述节点没有在所述故障知识图谱中时,则在所述故障知识图谱中新建一个父节点,将所述有效告警关联关系与所述工单派单知识挂在新建的所述父节点上。When the corresponding node is not in the fault knowledge graph, a new parent node is created in the fault knowledge graph, and the effective alarm association relationship and the work order dispatch knowledge are linked to the newly created on the parent node.
  9. 一种故障知识图谱构建装置,其中,包括:A fault knowledge map construction device, including:
    第一抽取模块,设置为对规则库进行实体与关系抽取,得到包括告警关系实例与工单派单实例的第一候选集;The first extraction module is configured to extract entities and relationships from the rule base to obtain a first candidate set including alarm relationship instances and work order dispatch instances;
    第二抽取模块,设置为对实时数据进行实体与关系抽取,得到告警关联关系与工单派单关系的第二候选集;The second extraction module is configured to extract entities and relationships from real-time data to obtain a second candidate set of alarm association relationships and work order dispatch relationships;
    冲突检测模块,设置为根据所述第一候选集与所述第二候选集进行冲突检测,得到包括有效告警关联关系与工单派单知识的目标候选集;The conflict detection module is configured to perform conflict detection according to the first candidate set and the second candidate set to obtain a target candidate set including effective alarm correlation and work order dispatch knowledge;
    更新模块,设置为将所述目标候选集融入到根据所述规则库构建的故障知识图谱,得到更新后的故障知识图谱。The update module is configured to integrate the target candidate set into the fault knowledge graph constructed according to the rule base to obtain an updated fault knowledge graph.
  10. 一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行所述权利要求1至8任一项中所述的方法。A computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to perform the method described in any one of claims 1 to 8 when running.
  11. 一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行所述权利要求1至8任一项中所述的方法。An electronic device, comprising a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform the method described in any one of claims 1 to 8.
PCT/CN2022/098679 2021-09-24 2022-06-14 Fault knowledge graph construction method and apparatus WO2023045417A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111123124.5A CN115858796A (en) 2021-09-24 2021-09-24 Fault knowledge graph construction method and device
CN202111123124.5 2021-09-24

Publications (1)

Publication Number Publication Date
WO2023045417A1 true WO2023045417A1 (en) 2023-03-30

Family

ID=85652804

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/098679 WO2023045417A1 (en) 2021-09-24 2022-06-14 Fault knowledge graph construction method and apparatus

Country Status (2)

Country Link
CN (1) CN115858796A (en)
WO (1) WO2023045417A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116415002A (en) * 2023-04-17 2023-07-11 合肥工业大学 Power grid fault recovery error-proof checking method based on graph matching
CN116719946A (en) * 2023-08-10 2023-09-08 中国电信股份有限公司 Work order recommending method, work order recommending device, storage medium and electronic equipment
CN116756250A (en) * 2023-06-06 2023-09-15 电子科技大学 Processing method and device of fault related data, computer equipment and storage medium
CN116911386A (en) * 2023-09-12 2023-10-20 中国长江电力股份有限公司 Knowledge graph construction method of hydroelectric equipment based on knowledge context service-oriented scene
CN117131929A (en) * 2023-10-27 2023-11-28 北京华控智加科技有限公司 Operation and maintenance data management method and device
CN117192373A (en) * 2023-08-08 2023-12-08 浙江凌骁能源科技有限公司 Power battery fault analysis method, device, computer equipment and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116542634A (en) * 2023-06-21 2023-08-04 中国电信股份有限公司 Work order processing method, apparatus and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100223499A1 (en) * 2009-02-27 2010-09-02 Microsoft Corporation Fingerprinting event logs for system management troubleshooting
CN106250288A (en) * 2016-07-29 2016-12-21 浪潮软件集团有限公司 Root alarm analysis and identification method based on data mining
CN106878038A (en) * 2015-12-10 2017-06-20 华为技术有限公司 Fault Locating Method and device in a kind of communication network
US20200288295A1 (en) * 2019-03-08 2020-09-10 Rapidsos, Inc. Apparatus and method for emergency dispatch
CN112699681A (en) * 2020-12-17 2021-04-23 国网冀北电力有限公司信息通信分公司 Power communication system defect fault order dispatching method and device based on knowledge graph
CN112948436A (en) * 2021-03-01 2021-06-11 创新奇智(成都)科技有限公司 Alarm work order merging method, device, equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100223499A1 (en) * 2009-02-27 2010-09-02 Microsoft Corporation Fingerprinting event logs for system management troubleshooting
CN106878038A (en) * 2015-12-10 2017-06-20 华为技术有限公司 Fault Locating Method and device in a kind of communication network
CN106250288A (en) * 2016-07-29 2016-12-21 浪潮软件集团有限公司 Root alarm analysis and identification method based on data mining
US20200288295A1 (en) * 2019-03-08 2020-09-10 Rapidsos, Inc. Apparatus and method for emergency dispatch
CN112699681A (en) * 2020-12-17 2021-04-23 国网冀北电力有限公司信息通信分公司 Power communication system defect fault order dispatching method and device based on knowledge graph
CN112948436A (en) * 2021-03-01 2021-06-11 创新奇智(成都)科技有限公司 Alarm work order merging method, device, equipment and storage medium

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116415002A (en) * 2023-04-17 2023-07-11 合肥工业大学 Power grid fault recovery error-proof checking method based on graph matching
CN116415002B (en) * 2023-04-17 2023-11-14 合肥工业大学 Power grid fault recovery error-proof checking method based on graph matching
CN116756250A (en) * 2023-06-06 2023-09-15 电子科技大学 Processing method and device of fault related data, computer equipment and storage medium
CN117192373A (en) * 2023-08-08 2023-12-08 浙江凌骁能源科技有限公司 Power battery fault analysis method, device, computer equipment and storage medium
CN117192373B (en) * 2023-08-08 2024-05-07 浙江凌骁能源科技有限公司 Power battery fault analysis method, device, computer equipment and storage medium
CN116719946A (en) * 2023-08-10 2023-09-08 中国电信股份有限公司 Work order recommending method, work order recommending device, storage medium and electronic equipment
CN116719946B (en) * 2023-08-10 2023-11-28 中国电信股份有限公司 Work order recommending method, work order recommending device, storage medium and electronic equipment
CN116911386A (en) * 2023-09-12 2023-10-20 中国长江电力股份有限公司 Knowledge graph construction method of hydroelectric equipment based on knowledge context service-oriented scene
CN116911386B (en) * 2023-09-12 2023-11-28 中国长江电力股份有限公司 Knowledge graph construction method of hydroelectric equipment based on knowledge context service-oriented scene
CN117131929A (en) * 2023-10-27 2023-11-28 北京华控智加科技有限公司 Operation and maintenance data management method and device

Also Published As

Publication number Publication date
CN115858796A (en) 2023-03-28

Similar Documents

Publication Publication Date Title
WO2023045417A1 (en) Fault knowledge graph construction method and apparatus
US11782981B2 (en) Method, apparatus, server, and storage medium for incorporating structured entity
WO2020001373A1 (en) Method and apparatus for ontology construction
US11455306B2 (en) Query classification and processing using neural network based machine learning
WO2020108063A1 (en) Feature word determining method, apparatus, and server
JP2020027649A (en) Method, apparatus, device and storage medium for generating entity relationship data
WO2022048668A1 (en) Knowledge graph construction method and apparatus, check method and storage medium
CN113722493B (en) Text classification data processing method, apparatus and storage medium
US20210157983A1 (en) Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
CN113657100B (en) Entity identification method, entity identification device, electronic equipment and storage medium
CN110555205B (en) Negative semantic recognition method and device, electronic equipment and storage medium
US20220100772A1 (en) Context-sensitive linking of entities to private databases
CN112579789A (en) Equipment fault diagnosis method and device and equipment
US20190171947A1 (en) Methods and apparatus for semantic knowledge transfer
US11507747B2 (en) Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US20220100967A1 (en) Lifecycle management for customized natural language processing
CN115114419A (en) Question and answer processing method and device, electronic equipment and computer readable medium
CN108932225B (en) Method and system for converting natural language requirements into semantic modeling language statements
EP4222635A1 (en) Lifecycle management for customized natural language processing
CN112632223A (en) Case and event knowledge graph construction method and related equipment
CN117473054A (en) Knowledge graph-based general intelligent question-answering method and device
CN116821423A (en) Intelligent analysis and knowledge type fault processing auxiliary system and method for power distribution network
CN116467459A (en) Internet of things equipment fault reporting method and device, computer equipment and storage medium
US20220129418A1 (en) Method for determining blood relationship of data, electronic device and storage medium
CN107577760B (en) text classification method and device based on constraint specification

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22871478

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE