CN114186073A

CN114186073A - Operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query

Info

Publication number: CN114186073A
Application number: CN202111520430.2A
Authority: CN
Inventors: 顾昊旻; 陆宏波; 袁以友; 高德荃; 来风刚; 赵子岩; 徐浩; 曲延盛; 王云霄
Original assignee: State Grid Information and Telecommunication Co Ltd; Anhui Jiyuan Software Co Ltd; Information and Telecommunication Branch of State Grid Shandong Electric Power Co Ltd
Current assignee: State Grid Information and Telecommunication Co Ltd; Anhui Jiyuan Software Co Ltd; Information and Telecommunication Branch of State Grid Shandong Electric Power Co Ltd
Priority date: 2021-12-13
Filing date: 2021-12-13
Publication date: 2022-03-15

Abstract

The invention discloses an operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query, comprising the steps of: 1) establishing a knowledge-graph fault handling measure retrieval model based on a subgraph matching method; 2) ranking the result subgraphs by a similarity calculation that combines graph structure and semantic information, to obtain an optimal query result; 3) optimizing on the basis of a Top-k query model and accelerating queries with a distributed query method; 4) classifying the operation and maintenance alarm data and screening the related network element attributes; 5) regularizing the handling steps of each fault based on the large-scale intelligent operation and maintenance knowledge graph; 6) directly calling entity-relation-entity objects in an intelligent operation and maintenance decision analysis module built on the knowledge graph platform of steps 1), 2) and 3), finally forming a one-key operation and maintenance fault diagnosis analysis report. The invention addresses the usability and efficiency problems of the prior art through optimization of subgraph matching, the retrieval algorithm, and distributed processing.

Description

Operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query

Technical Field

The invention relates to the field of intelligent retrieval analysis, in particular to an operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query.

Background

With the continuous development of artificial intelligence, intelligent retrieval and analysis methods based on knowledge graphs are gradually being applied in fields such as search engines, education, medical care, and smart grids. Semantic information such as entities, attributes, and relations is extracted from the data of each field, a knowledge base is constructed through techniques such as knowledge fusion and knowledge processing, and the retrieval and analysis services required by users are then realized through matching analysis among entities. Because a knowledge graph is expressed in ontology terms and semantics and has a standard conceptual model, it can accommodate the large amount of multi-source heterogeneous operation data accumulated by a power grid system, including numbers, text, and images. Moreover, through semantic links the knowledge graph strengthens the associations among data, making data expression more standardized and more structured. It is therefore well suited to application scenarios such as intelligent question answering, intelligent retrieval, and decision support, and to the retrieval and analysis of power grid knowledge.

The operation and maintenance data of State Grid, which this method targets, are dispersed and large in scale, with data volume reaching the ZB level. The intelligent operation and maintenance knowledge graph built from them collects data from a network of complex structure and is characterized by dispersed data centers, a complex data network, and a large data scale, which makes it difficult for users to obtain satisfactory query results quickly. Given these characteristics, how to achieve fast and efficient knowledge graph queries is a problem the current system urgently needs to solve. Traditional knowledge graph query work generally models the query simply as a subgraph matching problem, which has several shortcomings in practical application.

First, most conventional knowledge graph query models require query results to match the user query exactly. Because knowledge graphs contain noisy data, such models can miss results that users are interested in, which leads to poor usability.

Second, to accelerate queries, traditional knowledge graph query algorithms generally rely on graph indexing, but the intelligent operation and maintenance knowledge graph in this project is large, and building a graph index incurs high time and space costs.

Finally, because the intelligent operation and maintenance knowledge graph is complex and large, the query process must be implemented in a distributed manner, but traditional distributed graph data processing platforms are not optimized for executing knowledge graph queries and suffer from low execution efficiency.

Disclosure of Invention

The invention aims to overcome the defects of the prior art, and provides an operation and maintenance fault diagnosis and analysis method based on sub-graph matching and distributed query, so that the usability problem and the efficiency problem in the prior art are solved through optimization in the directions of sub-graph matching, retrieval algorithm, distributed processing and the like.

The purpose of the invention is realized as follows: an operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query comprises the following steps:

step 1) establishing a knowledge-graph fault handling measure retrieval model based on a subgraph matching method: on an existing operation and maintenance knowledge graph, a retrieval model for operation and maintenance fault handling measures is constructed in five steps: defining the retrieval graph, matching subgraphs, dividing the retrieval into sub-retrievals, performing the sub-retrievals, and connecting the sub-retrieval results;

step 2) examining the topological features of the query graph and the result graphs in the knowledge graph, and ranking the result graphs by a similarity calculation based on graph structure and semantic information to obtain an optimal query result: a graph-structure similarity is calculated between the query graph and each result subgraph, and a semantic similarity is calculated between the graphs from their semantic feature descriptions;

the graph-structure similarity and the semantic similarity are linearly combined to obtain a final comprehensive score, Score, for each subgraph, and the result subgraphs are ranked by Score to obtain the optimal k result graphs;

step 3) optimizing on the basis of a Top-k query model and accelerating queries with a distributed query method: the computing power of the distributed environment is used to accelerate Top-k queries, and the execution efficiency of distributed knowledge graph queries on a distributed graph data processing platform is optimized in terms of both job scheduling and data storage;

step 4) classifying the operation and maintenance alarm data and screening the related network element attributes: from the problem information of different severity levels in a large volume of alarm data, important and critical alarms are captured first and the fault information is classified; when a fault occurs, its handling level and the affected services are preliminarily judged from the alarm classification, the network element attribution relation and the user capacity report of the performance system are looked up via the network element attribution relation, and the attribution relation, number of registered users, and coverage area attributes of the faulty network element are screened out;

step 5) regularizing the handling steps of each fault based on the large-scale intelligent operation and maintenance knowledge graph: the handling steps of each fault are turned into rules according to the information in a historical fault database;

step 6) directly calling entity-relation-entity objects in an intelligent operation and maintenance decision analysis module built on the knowledge graph platform of steps 1), 2) and 3), finally forming a one-key operation and maintenance fault diagnosis analysis report: the entity-relation-entity objects are determined from the large-scale intelligent operation and maintenance knowledge graph and a fault diagnosis description is output; the conversion of fault diagnosis knowledge is automated, directly calling the entity-relation-entity objects in the intelligent operation and maintenance decision analysis prototype module built on the knowledge graph platform to form the one-key fault diagnosis analysis report.

As a further limitation of the present invention, the step 1) specifically comprises:

step 1.1) defining a retrieval graph: a retrieval graph Q = (E_Q, R_Q) contains a point set E_Q and an edge set R_Q; each retrieval point corresponds to a specific entity description, and each edge represents the relationship between two points;

step 1.2) matching subgraphs: for a given knowledge graph G = (E_G, R_G, E_G) and a retrieval subgraph Q = (E_Q, R_Q), the purpose of subgraph matching is to find a matching subgraph φ(Q) of Q in G, where φ maps each point E_Q of Q to a point φ(E_G) of G and each edge R_Q of Q to an edge φ(R_G) of G; a subgraph of G that satisfies this mapping is defined as a matching subgraph φ(Q);

step 1.3) sub-retrieval division: the retrieval graph is divided into several sub-retrieval graphs with few vertices and a single edge feature to reduce the retrieval difficulty; each sub-retrieval graph is a two-layer tree consisting of a root node, one layer of child nodes, and the edges between them; the results of the sub-retrievals are obtained by layer-by-layer matching, and from them the result of the retrieval graph;

step 1.4) performing the sub-retrievals: the sub-retrieval graph from step 1.3) is decomposed into a minimum spanning tree; the data graph and the divided sub-retrieval graphs are taken as input, and a sub-retrieval result set D_i is initialized; if the matching point pair set T is empty, the root node generates a candidate matching point pair set T; once all nodes of the sub-retrieval graph Q are contained in T, the edges of the graph are checked against the matching criterion, and the results that meet it are stored in D_i; after all matching is complete, the final result set D_i is obtained;

step 1.5) connecting the sub-retrieval results: the results of all sub-retrievals obtained in step 1.4) are connected to generate matching subgraphs; the results of two sub-retrievals Q_i and Q_j are connected if and only if Q_i and Q_j share a common vertex; the basic process is as follows: a result set D is initialized; for the divided sub-retrieval set Q_i ∈ (Q₁, Q₂, …, Q_N), every Q_i is executed by the sub-retrieval method to obtain all sub-retrieval results; a hash join is performed on the sub-retrieval results, and results whose matching degree meets the threshold λ are stored in C and sorted by matching degree; the results in C are evaluated with an evaluation model to obtain their importance f; the result set C is returned and the retrieval is complete.
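Steps 1.3) and 1.4) can be illustrated with a minimal Python sketch. It assumes edges are (head, relation, tail) tuples, that the division is done greedily by vertex degree, and that matching is exact; the function names `split_into_stars` and `match_star` and the sample entities are hypothetical, and the minimum spanning tree decomposition and matching criterion of the patent are not modeled.

```python
from collections import defaultdict

# An edge is (head, relation, tail); query vertices such as "?dev" are placeholders.

def split_into_stars(query_edges):
    """Greedily cover the query edges with two-layer trees (a root plus its neighbors)."""
    remaining = list(dict.fromkeys(query_edges))  # drop duplicates, keep order
    stars = []
    while remaining:
        degree = defaultdict(int)
        for head, _, tail in remaining:
            degree[head] += 1
            degree[tail] += 1
        # The vertex touching the most uncovered edges becomes the next root.
        root = max(sorted(degree), key=lambda v: degree[v])
        stars.append((root, [e for e in remaining if root in (e[0], e[2])]))
        remaining = [e for e in remaining if root not in (e[0], e[2])]
    return stars


def match_star(root, star_edges, graph_edges):
    """Enumerate exact matches of one star in the data graph as {query vertex: entity}."""
    entities = sorted({e[0] for e in graph_edges} | {e[2] for e in graph_edges})
    results = []

    def extend(assignment, index):
        if index == len(star_edges):
            results.append(assignment)
            return
        head, relation, tail = star_edges[index]
        for g_head, g_relation, g_tail in graph_edges:
            if g_relation != relation:
                continue
            candidate = dict(assignment)
            # setdefault binds an unbound query vertex or returns its existing binding.
            if candidate.setdefault(head, g_head) != g_head:
                continue
            if candidate.setdefault(tail, g_tail) != g_tail:
                continue
            extend(candidate, index + 1)

    for entity in entities:  # every entity is a candidate for the root
        extend({root: entity}, 0)
    return results
```

Each star returned by `split_into_stars` can be passed to `match_star` independently, which is what makes the sub-retrievals easy to run in parallel.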

As a further limitation of the present invention, the step 2) specifically includes:

step 2.1) similarity calculation based on graph structure: the structures of the query graph and the result subgraph are analyzed quantitatively; by definition, node a in knowledge graph G₁ and node b in knowledge graph G₂ are similar if their neighbor nodes in the two graphs are similar; likewise, two edges are similar if their start points and end points are similar; the higher the similarity of the nodes and edges, the higher the degree of subgraph matching; the structural similarity is measured mainly by matrices formed from the node similarities and the edge similarities; if G₁ has i nodes and G₂ has j nodes, the node similarity matrix has size i × j; with x_ab denoting the similarity between node a of G₁ and node b of G₂, and y_cd denoting the similarity between edge c of G₁ and edge d of G₂, the score formulas for nodes and edges are as follows:

wherein SX is the node similarity score matrix of G₁ and G₂, and x_i(k) is the similarity of each pair of points in the two graphs after k iterations; SY is the edge similarity score matrix of G₁ and G₂, and y_i(k) is the similarity of each pair of edges in the two graphs after k iterations; the structural similarity score S_sim of the query graph and the result subgraph is obtained by averaging the point similarities and the edge similarities and adding the two averages, as follows:

wherein n₁ and n₂ are the numbers of nodes in G₁ and G₂, and m₁ and m₂ are the numbers of edges in G₁ and G₂, respectively;
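The formulas of step 2.1) are not reproduced in this text, so the following Python sketch is only one plausible reading, not the patent's own equations. It assumes a coupled iteration in which an edge pair scores the sum of its endpoint pairs, a node pair scores the sum of its incident edge pairs, both matrices are normalized each round, and S_sim is the mean node similarity plus the mean edge similarity. The function names and the iteration count are hypothetical.

```python
import math


def _normalize(matrix):
    """Scale a matrix to unit Frobenius norm (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(v * v for row in matrix for v in row))
    if norm == 0:
        return matrix
    return [[v / norm for v in row] for row in matrix]


def structural_similarity(n1, edges1, n2, edges2, iterations=20):
    """Assumed S_sim: mean node similarity plus mean edge similarity.

    Nodes are integers 0..n-1 and edges are (source, target) pairs.
    Both graphs are assumed to have at least one node.
    """
    m1, m2 = len(edges1), len(edges2)
    x = [[1.0] * n2 for _ in range(n1)]  # x[a][b]: node a of G1 vs node b of G2
    y = [[1.0] * m2 for _ in range(m1)]  # y[c][d]: edge c of G1 vs edge d of G2
    for _ in range(iterations):
        # Two edges are similar when their start points and end points are similar.
        new_y = [[x[s1][s2] + x[t1][t2] for (s2, t2) in edges2] for (s1, t1) in edges1]
        # Two nodes are similar when the edges leaving and entering them are similar.
        new_x = [[0.0] * n2 for _ in range(n1)]
        for c, (s1, t1) in enumerate(edges1):
            for d, (s2, t2) in enumerate(edges2):
                new_x[s1][s2] += y[c][d]
                new_x[t1][t2] += y[c][d]
        x, y = _normalize(new_x), _normalize(new_y)
    node_avg = sum(map(sum, x)) / (n1 * n2)
    edge_avg = sum(map(sum, y)) / (m1 * m2) if m1 and m2 else 0.0
    return node_avg + edge_avg
```

Under these assumptions the score is symmetric in the two graphs and lies in (0, 2] whenever both graphs have at least one edge.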

step 2.2) similarity calculation based on semantic information: for a given query graph G_s = (g₁, g₂, …, g_n) and result subgraph G_r = (r₁, r₂, …, r_n), where each r_i is a triple, the similarity between the query graph and the result subgraph is represented by the likelihood-estimated probability p(G_s|G_r); the similarity is judged from this probability and the result subgraphs are ranked accordingly; the semantic similarity score based on p(G_s|G_r) is calculated as follows:

wherein p(g_i|G_r) is the probability of generating the word g_i of the query graph G_s, expressed as the average of the probabilities p(g_i|r_j) with which g_i is generated by the individual triple models;
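A minimal Python sketch of step 2.2) follows. It assumes that p(G_s|G_r) is the product over query terms of p(g_i|G_r), that p(g_i|G_r) is the average of p(g_i|r_j) over the triples, and that each triple model is a smoothed relative frequency over the three elements of the triple. The smoothing constant `alpha`, the `vocab_size` parameter, and the function names are hypothetical; the patent text does not state how p(g_i|r_j) is estimated.

```python
import math


def term_prob(term, triple, vocab_size, alpha=0.1):
    """Assumed p(g_i | r_j): additively smoothed frequency of the term in one triple."""
    tokens = list(triple)
    return (tokens.count(term) + alpha) / (len(tokens) + alpha * vocab_size)


def semantic_log_likelihood(query_terms, result_triples, vocab_size, alpha=0.1):
    """log p(G_s | G_r), with p(g_i | G_r) averaged over the triples of G_r."""
    log_p = 0.0
    for term in query_terms:
        avg = sum(term_prob(term, t, vocab_size, alpha) for t in result_triples)
        avg /= len(result_triples)
        log_p += math.log(avg)  # a sum of logs avoids underflow in the product
    return log_p
```

Working in log space keeps the ranking unchanged, because the logarithm is monotonic, while avoiding very small products for long queries.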

step 2.3) obtaining a linearly weighted similarity score: the structural similarity score S_sim from step 2.1) and the semantic similarity score from step 2.2) are fused by linear weighting to obtain the final similarity score, as follows:

wherein η is an adjustable parameter in [0, 1] that sets the proportion of the two similarity scores in the comprehensive similarity score; the result subgraphs are ranked by the comprehensive similarity score to obtain the optimal query result, completing the fault retrieval.
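The fusion in step 2.3) can be sketched in Python as follows. The sketch assumes the form Score = η · S_sim + (1 − η) · S_sem, with both scores already on comparable scales; which term carries η is an assumption, since the formula itself is not reproduced in this text, and the names `combined_score` and `top_k` are hypothetical.

```python
import heapq


def combined_score(s_struct, s_sem, eta=0.5):
    """Assumed linear fusion: eta weights the structural score, (1 - eta) the semantic one."""
    if not 0.0 <= eta <= 1.0:
        raise ValueError("eta must lie in [0, 1]")
    return eta * s_struct + (1.0 - eta) * s_sem


def top_k(candidates, k, eta=0.5):
    """Return the names of the k best candidates.

    candidates: iterable of (name, structural score, semantic score).
    """
    scored = [(combined_score(s, m, eta), name) for name, s, m in candidates]
    return [name for _, name in heapq.nlargest(k, scored)]
```

Setting `eta=1.0` ranks purely by structure and `eta=0.0` purely by semantics, which makes it easy to see how sensitive a ranking is to the weighting.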

As a further limitation of the present invention, the step 3) specifically includes:

step 3.1) optimizing on the basis of a Top-k query model: the computing power of the distributed environment is used to accelerate queries, and the distances between entities in the knowledge graph are calculated in real time by a distributed breadth-first search; a query optimization method based on bounding is proposed to accelerate queries: the exact distance between entities is replaced by upper and lower bounds on that distance, and the optimal k result graphs are derived from the bounds, thereby reducing the query time; the knowledge graph query algorithm is implemented in a distributed environment, and the storage scheme of the knowledge graph in the distributed environment and the interaction scheme among query tasks ensure that the distributed query algorithm can be executed in a real environment;

step 3.2) optimizing the execution efficiency of distributed knowledge graph queries on the distributed graph data processing platform in terms of both job scheduling and data storage: the data loading time of distributed graph query tasks is optimized; tasks are scheduled to the computing nodes where their data reside by a data-locality-oriented task scheduling algorithm; and, through a shared-memory-based data graph reuse technique, the knowledge graph data in memory is reused by multiple query tasks.
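The bounding idea of step 3.1) can be illustrated on a single machine with the Python sketch below. It assumes that a smaller distance means a better result and that lower and upper distance bounds per candidate are already available; how the patent derives those bounds is not stated, and the names `bfs_distance` and `prune_with_bounds` are hypothetical. The distributed execution itself is not modeled.

```python
from collections import deque


def bfs_distance(adjacency, source, target):
    """Exact hop distance by breadth-first search, computed on demand (no index)."""
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbor in adjacency.get(node, ()):
            if neighbor == target:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    return float("inf")  # target is unreachable


def prune_with_bounds(bounds, k):
    """Keep only candidates that can still be among the k nearest.

    bounds: {candidate: (lower bound, upper bound)} on its distance.
    At least k candidates have a distance no larger than the k-th smallest
    upper bound, so a candidate whose lower bound exceeds it cannot be in the top k.
    """
    uppers = sorted(upper for _, upper in bounds.values())
    if len(uppers) <= k:
        return set(bounds)
    kth_upper = uppers[k - 1]
    return {c for c, (lower, _) in bounds.items() if lower <= kth_upper}
```

Only the survivors of `prune_with_bounds` would need an exact `bfs_distance` computation, which is where the saving in query time comes from.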

By adopting the above technical solution, the invention has the following beneficial effects compared with the prior art: 1) an improved retrieval method is designed on the basis of subgraph matching; by linearly combining the similarity based on graph structure with the similarity based on semantic information, the retrieval accuracy is effectively improved and the influence of noisy data is reduced; 2) the query process is implemented in a distributed manner, which reduces the query time; 3) on the distributed graph data processing platform, the execution efficiency of distributed knowledge graph queries is optimized in terms of both job scheduling and data storage, which reduces data I/O overhead and further shortens the overall query completion time.

Drawings

FIG. 1 is an overall block diagram of the present invention.

FIG. 2 is a conceptual diagram of a retrieval subgraph constructed by the present invention.

FIG. 3 is a conceptual diagram of the search subgraph partitioning of the present invention.

Detailed Description

The operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query as shown in fig. 1 comprises the following steps:

step 1.3) sub-retrieval division: because a retrieval graph may have too many vertices and edges, it is divided into several sub-retrieval graphs with few vertices and a single edge feature to reduce the retrieval difficulty; each sub-retrieval graph is a two-layer tree consisting of a root node, one layer of child nodes, and the edges between them; the results of the sub-retrievals are obtained by layer-by-layer matching, and from them the result of the retrieval graph; the constructed retrieval subgraph is shown in FIG. 2, and the division of the retrieval subgraph is shown in FIG. 3;

step 1.5) connecting the sub-retrieval results: the results of all sub-retrievals obtained in step 1.4) are connected to generate matching subgraphs; the results of two sub-retrievals Q_i and Q_j are connected if and only if Q_i and Q_j share a common vertex; the basic process is as follows: a result set D is initialized; for the divided sub-retrieval set Q_i ∈ (Q₁, Q₂, …, Q_N), every Q_i is executed by the sub-retrieval method to obtain all sub-retrieval results; a hash join is performed on the sub-retrieval results, and results whose matching degree meets the threshold λ are stored in C and sorted by matching degree; the results in C are evaluated with an evaluation model to obtain their importance f; the result set C is returned and the retrieval is complete.
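The hash join of step 1.5) can be sketched in Python as follows. The sketch assumes each sub-retrieval result is a dictionary from query vertex to matched entity and that all results of one sub-retrieval bind the same query vertices; the name `hash_join` and the sample entities are hypothetical, and the matching-degree threshold λ and the evaluation model are not modeled.

```python
from collections import defaultdict


def hash_join(left, right):
    """Join two lists of partial matches on the query vertices they share.

    Each partial match maps query vertices to graph entities. Two partial
    matches are merged only when they agree on every shared query vertex.
    """
    if not left or not right:
        return []
    shared = sorted(set(left[0]) & set(right[0]))
    if not shared:
        return []  # results are connected only when a common vertex exists
    # Build phase: bucket the left matches by their values on the shared vertices.
    buckets = defaultdict(list)
    for match in left:
        buckets[tuple(match[v] for v in shared)].append(match)
    # Probe phase: look up each right match and merge it with its partners.
    joined = []
    for match in right:
        for partner in buckets.get(tuple(match[v] for v in shared), []):
            joined.append({**partner, **match})
    return joined
```

Joining the results of N sub-retrievals then amounts to folding `hash_join` over them in an order where each new sub-retrieval shares a vertex with what has been joined so far.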

step 2.1) similarity calculation based on graph structure: the structures of the query graph and the result subgraph are analyzed quantitatively; by definition, node a in knowledge graph G₁ and node b in knowledge graph G₂ are similar if their neighbor nodes in the two graphs are similar; likewise, two edges are similar if their start points and end points are similar; the higher the similarity of the nodes and edges, the higher the degree of subgraph matching; the structural similarity is measured mainly by matrices formed from the node similarities and the edge similarities; if G₁ has i nodes and G₂ has j nodes, the node similarity matrix has size i × j; with x_ab denoting the similarity between node a of G₁ and node b of G₂, and y_cd denoting the similarity between edge c of G₁ and edge d of G₂, the score formulas for nodes and edges are as follows:

step 2.2) similarity calculation based on semantic information: for a given query graph G_s = (g₁, g₂, …, g_n) and result subgraph G_r = (r₁, r₂, …, r_n), where each r_i is a triple, the similarity between the query graph and the result subgraph is represented by the likelihood-estimated probability p(G_s|G_r); the similarity is judged from this probability and the result subgraphs are ranked accordingly; the semantic similarity score based on p(G_s|G_r) is calculated as follows:

step 3) optimizing on the basis of a Top-k query model and accelerating queries with a distributed query method: the computing power of the distributed environment is used to accelerate Top-k queries so that query requests are answered quickly, and the execution efficiency of distributed knowledge graph queries on a distributed graph data processing platform is optimized in terms of both job scheduling and data storage;

step 3.1) optimizing on the basis of a Top-k query model: the computing power of the distributed environment is used to accelerate queries so that query requests are answered quickly; to avoid building an index, the distances between entities in the knowledge graph are calculated in real time by a distributed breadth-first search, rather than pre-computing and storing the distance between every pair of entities; to accelerate queries further, a query optimization method based on bounding is proposed: the exact distance between entities is replaced by upper and lower bounds on that distance, and the optimal k result graphs are derived from the bounds, which effectively reduces the query time; the knowledge graph query algorithm is implemented in a distributed environment, and the storage scheme of the knowledge graph in the distributed environment and the interaction scheme among query tasks ensure that the distributed query algorithm can be executed in a real environment;

step 3.2) optimizing the execution efficiency of distributed knowledge graph queries on the distributed graph data processing platform in terms of both job scheduling and data storage: the execution performance of query tasks is improved by optimizing the data loading time of distributed query tasks; tasks are scheduled to the computing nodes where their data reside by a data-locality-oriented task scheduling algorithm, so as to avoid the impact of network I/O on query performance as far as possible; and, through a shared-memory-based data graph reuse technique, the knowledge graph data in memory is reused by multiple query tasks, which avoids the I/O overhead of loading the data graph repeatedly.
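The data-locality-oriented scheduling of step 3.2) can be sketched in Python as follows. The sketch assumes each task reads one graph partition, prefers the least-loaded node that already holds the partition, and falls back to the least-loaded node overall; the patent does not specify its scheduling algorithm at this level, and the name `schedule`, the load measure, and the tie-breaking rule are hypothetical.

```python
def schedule(tasks, partition_locations, nodes):
    """Assign each task to a node, preferring nodes that already hold its data.

    tasks: list of (task id, partition id) pairs.
    partition_locations: {partition id: [nodes holding that partition]}.
    nodes: list of all available compute nodes.
    """
    load = {node: 0 for node in nodes}
    assignment = {}
    for task, partition in tasks:
        # Nodes that already store the partition avoid a network transfer.
        local = [n for n in partition_locations.get(partition, []) if n in load]
        pool = local or nodes  # fall back to any node when no local copy exists
        chosen = min(pool, key=lambda n: (load[n], n))  # least loaded, ties by name
        assignment[task] = chosen
        load[chosen] += 1
    return assignment
```

A production scheduler would also weigh partition size and the cost of waiting for a busy local node against the cost of a remote read; this sketch ignores both.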

step 4) classifying the operation and maintenance alarm data and screening the related network element attributes: from the problem information of different severity levels in a large volume of alarm data, important and critical alarms are captured first and the fault information is classified; when a fault occurs, its handling level and the services it may affect are preliminarily judged from the alarm classification, the network element attribution relation and the user capacity report of the performance system are looked up via the network element attribution relation, and the attribution relation, number of registered users, and coverage area attributes of the faulty network element are screened out, providing the relevant information to support fault handling decisions;

step 5) regularizing the handling steps of each fault based on the large-scale intelligent operation and maintenance knowledge graph: the handling steps of each fault are turned into rules according to the information in a historical fault database; for example, the rules make clear which equipment must be queried after a critical alarm appears, and what specifically to query on equipment of different specialties.
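The alarm classification and attribute screening of step 4) can be sketched in Python as follows. The severity levels, the field names (`ne_id`, `severity`, `attribution`, `registered_users`, `coverage_area`), and the function name `triage` are all hypothetical; the patent names the three screened attributes but does not define a data format.

```python
# Hypothetical severity scale; a lower rank is handled first.
SEVERITY_ORDER = {"critical": 0, "major": 1, "minor": 2, "warning": 3}


def triage(alarms, ne_inventory):
    """Order alarms by severity and attach the screened network element attributes.

    alarms: list of dicts with "ne_id" and "severity".
    ne_inventory: {ne_id: {"attribution", "registered_users", "coverage_area"}}.
    """
    ranked = sorted(
        alarms,
        key=lambda a: SEVERITY_ORDER.get(a["severity"], len(SEVERITY_ORDER)),
    )
    report = []
    for alarm in ranked:
        element = ne_inventory.get(alarm["ne_id"], {})
        report.append({
            "ne_id": alarm["ne_id"],
            "severity": alarm["severity"],
            # The three attributes named in step 4); None when the element is unknown.
            "attribution": element.get("attribution"),
            "registered_users": element.get("registered_users"),
            "coverage_area": element.get("coverage_area"),
        })
    return report
```

Because Python's sort is stable, alarms of equal severity keep their arrival order in the report.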

Aiming at the characteristics of a cloud data center, where the intelligent operation and maintenance knowledge graph contains much noisy data and is large in scale, the invention provides an operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query. It addresses the usability and efficiency problems of the prior art through optimization of subgraph matching, the retrieval algorithm, and distributed processing, and provides support for intelligent operation and maintenance decision analysis.

The present invention is not limited to the above-mentioned embodiments, and based on the technical solutions disclosed in the present invention, those skilled in the art can make some substitutions and modifications to some technical features without creative efforts according to the disclosed technical contents, and these substitutions and modifications are all within the protection scope of the present invention.

Claims

1. An operation and maintenance fault diagnosis and analysis method based on subgraph matching and distributed query is characterized by comprising the following steps:

step 6) directly calling entity-relation-entity objects in an intelligent operation and maintenance decision analysis module built on the knowledge graph platform of steps 1), 2) and 3), finally forming a one-key operation and maintenance fault diagnosis analysis report: the entity-relation-entity objects are determined from the large-scale intelligent operation and maintenance knowledge graph and a fault diagnosis description is output; the conversion of fault diagnosis knowledge is automated, directly calling the entity-relation-entity objects in the intelligent operation and maintenance decision analysis prototype module built on the knowledge graph platform to form the one-key fault diagnosis analysis report.

2. The operation and maintenance fault diagnosis analysis method based on subgraph matching and distributed query according to claim 1, wherein the step 1) specifically comprises:

step 1.4) performing the sub-retrievals: the sub-retrieval graph from step 1.3) is decomposed into a minimum spanning tree; the data graph and the divided sub-retrieval graphs are taken as input, and a sub-retrieval result set D_i is initialized; if the matching point pair set T is empty, the root node generates a candidate matching point pair set T; once all nodes of the sub-retrieval graph Q are contained in T, the edges of the graph are checked against the matching criterion, and the results that meet it are stored in D_i; after all matching is complete, the final result set D_i is obtained;

3. The operation and maintenance fault diagnosis analysis method based on subgraph matching and distributed query according to claim 1, wherein the step 2) specifically comprises:

step 2.1) similarity calculation based on graph structure: the structures of the query graph and the result subgraph are analyzed quantitatively; by definition, node a in knowledge graph G₁ and node b in knowledge graph G₂ are similar if their neighbor nodes in the two graphs are similar; likewise, two edges are similar if their start points and end points are similar; the higher the similarity of the nodes and edges, the higher the degree of subgraph matching; the structural similarity is measured mainly by matrices formed from the node similarities and the edge similarities; if G₁ has i nodes and G₂ has j nodes, the node similarity matrix has size i × j; with x_ab denoting the similarity between node a of G₁ and node b of G₂, and y_cd denoting the similarity between edge c of G₁ and edge d of G₂, the score formulas for nodes and edges are as follows:

The calculation method is as follows:

4. The operation and maintenance fault diagnosis analysis method based on subgraph matching and distributed query according to claim 1, wherein the step 3) specifically comprises: