CN113157934A - Knowledge graph origin processing method and system, electronic device and storage medium - Google Patents
Knowledge graph origin processing method and system, electronic device and storage medium Download PDFInfo
- Publication number
- CN113157934A CN113157934A CN202110246378.XA CN202110246378A CN113157934A CN 113157934 A CN113157934 A CN 113157934A CN 202110246378 A CN202110246378 A CN 202110246378A CN 113157934 A CN113157934 A CN 113157934A
- Authority
- CN
- China
- Prior art keywords
- graph
- query
- knowledge
- queries
- origin
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000003860 storage Methods 0.000 title claims abstract description 13
- 238000003672 processing method Methods 0.000 title claims abstract description 11
- 238000000034 method Methods 0.000 claims abstract description 49
- 238000009795 derivation Methods 0.000 claims abstract description 25
- 230000008569 process Effects 0.000 claims abstract description 25
- 230000001364 causal effect Effects 0.000 claims abstract description 10
- 238000012545 processing Methods 0.000 claims description 24
- 238000004590 computer program Methods 0.000 claims description 9
- 238000013507 mapping Methods 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 abstract description 3
- 238000004891 communication Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a knowledge-graph origin processing method and system, electronic equipment and a storage medium, wherein the knowledge-graph origin processing method comprises the following steps: tracking derivation of query results on the knowledge graph based on cause origin and process tracing; wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries; upon an update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph. The present invention tracks derivations of query results on a knowledge graph by proposing dynamic provenance polynomial concepts and encoding relational edges that generate answers. Repeated calculation of the query result after the knowledge graph is updated every time is avoided, the query efficiency of the knowledge graph is greatly improved, and computing resources are saved; meanwhile, the accuracy of origin tracing of the query result is ensured.
Description
Technical Field
The present invention relates to the field of message passing technologies, and in particular, to a method and a system for processing an origin of a knowledge graph, an electronic device, and a storage medium.
Background
Knowledge graphs have increasingly become the mainstay of many knowledge-centric key applications. The use of large-scale knowledge graphs, which model interrelationships between various entities occurring in real life, has become common in many knowledge-centric applications. In addition to playing a key role in network search systems (e.g., google Knowledge Graph, Microsoft Bing Satori, etc.), they are also used in e-government, technical support, drug management, academic search, etc. scenarios.
While some knowledge-graphs are elaborated by hand, most of the large-scale knowledge-graphs that are in practical use are automatically constructed using one or more information extraction pipelines on various underlying data sources, based on the same or similar extraction techniques. Thus, a knowledge-graph may contain knowledge entities that are related by different mechanisms, or query results may be generated by a combination of facts from different sources. Therefore, it is not sufficient to track the fine-grained sources of a single entity in only one knowledge-graph, and we also need to track the traversability of the results of a single query. Therefore, it becomes important to establish a source trace for query results to determine how the results are produced. Result traceability is useful for assessing the trustworthiness of query results, generation of knowledge maps themselves, and providing answer interpretations.
In many applications, some queries are set as long-lasting, high-frequency queries whose query results are often displayed in a solid state based on efficiency and resource considerations. However, the knowledge graph itself is constantly changing due to changes in source data, improvements in extraction techniques, refinement/enrichment of information, and the like. Therefore, a problem is brought, namely how to efficiently and dynamically trace the source of the query result when the large-scale knowledge graph is queried, instead of recalculating the knowledge graph from the beginning every time the knowledge graph is updated, which becomes an important link for improving the working efficiency of the knowledge graph.
Disclosure of Invention
The invention provides a knowledge graph origin processing method and system, electronic equipment and a storage medium, which are used for solving the technical defects in the prior art.
The invention provides a knowledge graph origin processing method, which comprises the following steps:
tracking derivation of query results on the knowledge graph based on cause origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
upon an update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph.
Preferably, the method for processing the origin of the knowledge-graph, wherein the potential match represents that any sub-graph of the knowledge-graph can become an actual match of a query after inserting an edge, the potential match includes a first pattern potential match and a second pattern potential match, the first pattern potential match is a triple pattern of a newly added edge matching query, so that the sub-graph becomes an actual match of the query; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
Preferably, the method for processing the origin of the knowledge-graph comprises the following steps:
the registered queries are classified according to the type of potential match into regular queries representing queries with only a first pattern potential match and multi-mapped queries representing queries with a second pattern potential match.
Preferably, the method for processing the origin of the knowledge-graph comprises the following steps: obtaining information of sub-queries from a basic graph schema of a query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
The invention also provides a knowledge-graph origin processing system, comprising:
the query result tracking module is used for tracking derivation of a query result on the knowledge graph based on reason origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
and the origin polynomial updating module is used for updating the origin polynomial based on the updated knowledge graph when the knowledge graph is updated.
Preferably, the system for processing knowledge-graph provenance comprises a first module for matching a first pattern potential match with a second module for matching a second pattern potential match with a second pattern potential match, and a second module for matching a second pattern potential match with a second pattern potential match; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
Preferably, the system for processing the knowledge-graph origin comprises:
the query classification module is used for classifying the registered queries into conventional queries and multi-mapping queries according to the types of the potential matches, wherein the conventional queries represent the queries with only the first pattern potential matches, and the multi-mapping queries represent the queries with the second pattern potential matches.
Preferably, the system for processing the knowledge-graph origin comprises:
the sub-query module is used for obtaining the information of the sub-query from the basic graph mode of the query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
The present invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the method for processing knowledge-graph origins as described in any of the above when executing the program.
The present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the method of knowledge-graph provenance processing as described in any one of the above.
The invention provides a knowledge graph origin processing method, which tracks derivation of query results on a knowledge graph by proposing a dynamic origin polynomial concept and encoding relationship edges for generating answers. When the knowledge graph is updated, the origin polynomial is updated based on the updated knowledge graph, repeated calculation of query results after the knowledge graph is updated every time is avoided, query efficiency of the knowledge graph is greatly improved, and computing resources are saved; meanwhile, the accuracy of origin tracing of the query result is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a schematic flow diagram of a knowledge-graph provenance processing method provided by the present invention;
FIG. 2 is a schematic diagram of a knowledge-graph provenance processing system provided by the present invention;
FIG. 3 is a schematic structural diagram of an electronic device provided by the present invention;
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention discloses a knowledge graph origin processing method, which is shown in figure 1 and comprises the following steps:
s1, tracking derivation of the query result on the knowledge graph based on reason origin and process tracing;
at S1, the query origin includes two important categories, namely, a reason origin, which is a set of edges provided by a particular derivation of a particular result item, and a process trace back. On the other hand, the method is used for describing the interaction among relations in the derivation process of one answer by figurative coding retrospectively. In the origin of the cause, the expression mode is set to be expressed as that a polynomial and the edge of each single item represent binding to result in an answer, and the same or highly approximate result is obtained by combining a plurality of groups of different relational edges; for process tracing, more information about the production result needs to be captured, using the origin polynomial representation.
The basis of the construction method of the origin polynomial is to match various concepts of a specific query q. Intuitively, these potential matches for a query correspond to subgraphs of the knowledge-graph that partially match the given query graph pattern. For maintenance of potential matches for queries, only one relationship edge needs to be inserted to convert them to a full match.
Consider a query Q that contains n triplet patterns. Assuming that S ∈ G is a subgraph, it can match n-1 triplet patterns of query Q, and only 1 triplet pattern does not match. The latter new edge e unmatched triplet pattern matched Q is the new subgraph added to S, which becomes an actual match for Q, in other words S ∈ a (Q), the query result set of queries for a (Q), S is the potential match. Any sub-graph S of the knowledge-graph G, after inserting an edge, can become an actual match for the query Q, called a potential match.
Wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
at the time of the update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph, S2.
The potential match represents that any sub-graph of the knowledge graph can become an actual match of a query after inserting an edge, the potential match comprises a first pattern potential match and a second pattern potential match, the first pattern potential match is a triple pattern of a newly added edge matching query, and the sub-graph becomes an actual match of the query; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
The present invention tracks derivations of query results on a knowledge graph by proposing dynamic provenance polynomial concepts and encoding relational edges that generate answers. When the knowledge graph is updated, the origin polynomial is updated based on the updated knowledge graph, repeated calculation of query results after the knowledge graph is updated every time is avoided, query efficiency of the knowledge graph is greatly improved, and computing resources are saved; meanwhile, the accuracy of origin tracing of the query result is ensured. At the same time, automatic maintenance updates to the provenance polynomial can be implemented in the face of updates (insertion and deletion of facts) to the underlying knowledge-graph.
Specifically, the first pattern potential match is a 1:1 potential match. If the newly added edge e matches only one triplet pattern of query Q, such that S ═ S ue { e } becomes one actual match for Q, then sub-graph S is referred to as a 1:1 potential match, denoted by PM1: 1. The second pattern potential match is a 1: M potential match, and if a newly added edge e matches the three patterns of query Q, making S ═ S { e } an actual match for Q, then sub-graph S is called a 1: M potential match, denoted by PM1: M.
The method comprises the following steps:
the registered queries are classified according to the type of potential match into regular queries representing queries with only a first pattern potential match and multi-mapped queries representing queries with a second pattern potential match. Both queries are join queries, which differ only in the distribution of predicates in the ternary patterns. Queries with only 1:1 potential matches (no 1: m potential matches) are called regular queries, consisting of QregAnd (4) showing. A query with 1: m potential matches (possibly in addition to 1:1 potential matches) is called a multiple-mapping query, denoted by QmmAnd (4) showing.
The potential matches and queries are separated into different classes because they need to be processed in different ways. The query registration process policy is based on the following.
Introduction 1: PM1:1 may satisfy one and only one child query of size n-1 given parent query Q of size n.
2, leading: the PM1 m cannot satisfy a child query of size n-1 given a parent query Q of size n.
And 3, introduction: after a conditional edge insertion, PM1:m satisfies parent query Q if and only if PM1:m satisfies all children of Q.
The results of long-term queries, maintained for all registrations, are maintained using an originating polynomial, along with facts relevant to the query. By maintaining information for sub-queries at a time obtained from the basic graph schema of the query. In order to improve efficiency in all queries/sub-queries in the workload, the queried results are shared AND utilized, AND the sub-query execution plans generated by using an AND-OR graph are combined to form a single global execution tree. When the knowledge-graph is updated, the use of the screening and optimization paradigm to compute updated query results also facilitates fast recalculation of sub-query results.
Preferably, the above method comprises: obtaining information of sub-queries from a basic graph schema of a query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
Depending on the size of the subgraph and the connection points of the results, the sub-queries can be divided into the following different types:
the sub-query has a singleton sub-graph k equal to 0, and is generated when the triple pattern corresponding to the edge of the leaf node of the query graph is deleted.
Type II a subgraph contains only one triple pattern, i.e. k 1, in which case eventually all edges are identified as tie points using the triple pattern in SQ2, which may justify a large number of potential entities in the knowledge graph. To avoid this, this patent only identifies the connection point from a (SQ 1).
Both types III, SQ1 and SQ2 contain at least two triplet patterns, i.e., 2 ≦ k < n-2. For example, a deletion composer generates two sub-queries, each of which differs in size by 2, which results in at most | A (SQ1) | + | A (SQ2) | nexus.
Type IV in the last type, deleting a triplet pattern from a query does not disconnect the query graph, i.e., k is n-1, yielding 2 potential matches per match SQ 1.
The specific implementation process of the invention is as follows:
data set, using the common YAG02 data set.
The query set, for the YAG02 data set, was validated using RDF-3X.
The knowledge-graph samples, for each knowledge-graph, generate an insertion workload in the following manner. From the initial starting graph, a pair of unconnected vertices is randomly selected and connected with a randomly selected predicate.
Neo4j was selected to store a knowledge graph, baseline system. Origin computation customization is supported using an open-source tripleprofv.
The knowledge-graph origin processing system provided by the invention is described below, and the knowledge-graph origin processing system described below and the knowledge-graph origin processing method described above can be referred to correspondingly.
The embodiment of the invention discloses a knowledge graph origin processing system, which is shown in figure 2 and comprises the following components:
a query result tracking module 10, configured to track derivation of a query result on a knowledge graph based on cause origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
an origin polynomial updating module 20 for updating the origin polynomial based on the updated knowledge-graph upon an update of the knowledge-graph.
The potential match represents that any sub-graph of the knowledge graph can become an actual match of a query after inserting an edge, the potential match comprises a first pattern potential match and a second pattern potential match, the first pattern potential match is a triple pattern of a newly added edge matching query, and the sub-graph becomes an actual match of the query; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
The system of the present invention comprises:
the query classification module is used for classifying the registered queries into conventional queries and multi-mapping queries according to the types of the potential matches, wherein the conventional queries represent the queries with only the first pattern potential matches, and the multi-mapping queries represent the queries with the second pattern potential matches.
Further, the system of the present invention comprises:
the sub-query module is used for obtaining the information of the sub-query from the basic graph mode of the query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
Fig. 3 illustrates a physical structure diagram of an electronic device, which may include: a processor (processor)310, a communication Interface (communication Interface)320, a memory (memory)330 and a communication bus 340, wherein the processor 310, the communication Interface 320 and the memory 330 communicate with each other via the communication bus 340. Processor 310 may invoke logic instructions in memory 330 to perform a method of knowledge-graph origin processing, the method comprising:
s1, tracking derivation of the query result on the knowledge graph based on reason origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
at the time of the update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph, S2.
In addition, the logic instructions in the memory 330 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention also provides a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions which, when executed by a computer, the computer is capable of performing a method of knowledge-graph provenance processing, the method comprising:
s1, tracking derivation of the query result on the knowledge graph based on reason origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
at the time of the update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph, S2.
In yet another aspect, the present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program that when executed by a processor is implemented to perform a method of knowledge-graph provenance processing, the method comprising:
s1, tracking derivation of the query result on the knowledge graph based on reason origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
at the time of the update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph, S2.
The above-described system embodiments are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (10)
1. A knowledge-graph provenance processing method, comprising:
tracking derivation of query results on the knowledge graph based on cause origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
upon an update of the knowledge-graph, the origin polynomial is updated based on the updated knowledge-graph.
2. The method of claim 1, wherein the potential matches represent that any sub-graph of the knowledge-graph can be an actual match of a query after inserting an edge, and the potential matches include a first pattern potential match and a second pattern potential match, and the first pattern potential match is a triple pattern of a newly added edge matching query, so that the sub-graph becomes an actual match of the query; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
3. The method of knowledge-graph provenance processing according to claim 2, comprising:
the registered queries are classified according to the type of potential match into regular queries representing queries with only a first pattern potential match and multi-mapped queries representing queries with a second pattern potential match.
4. The method of knowledge-graph provenance processing according to claim 1, comprising: obtaining information of sub-queries from a basic graph schema of a query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
5. A knowledge-graph provenance processing system, comprising:
the query result tracking module is used for tracking derivation of a query result on the knowledge graph based on reason origin and process tracing;
wherein the causal origin is a set of relational edges provided by particular derivations of particular result items; the process traces back in a representation of a source polynomial representing a potential match to any set of queries;
and the origin polynomial updating module is used for updating the origin polynomial based on the updated knowledge graph when the knowledge graph is updated.
6. The system of claim 5, wherein the potential matches represent any sub-graph of the knowledge-graph that can be an actual match of a query after inserting an edge, the potential matches comprising a first pattern potential match and a second pattern potential match, the first pattern potential match being a triple pattern of a newly added edge matching query, such that the sub-graph becomes an actual match of the query; the second pattern potential matches are triplets of a newly added edge matching query, making the subgraph an actual match for the query.
7. The system of knowledge-graph provenance processing according to claim 6, comprising:
the query classification module is used for classifying the registered queries into conventional queries and multi-mapping queries according to the types of the potential matches, wherein the conventional queries represent the queries with only the first pattern potential matches, and the multi-mapping queries represent the queries with the second pattern potential matches.
8. The system of knowledge-graph provenance processing according to claim 5, comprising:
the sub-query module is used for obtaining the information of the sub-query from the basic graph mode of the query by maintaining each time;
the sub-queries represent: if the size of the given query is n, namely n triple patterns exist, deleting one triple pattern to generate two subgraphs in the generated sub-query at most; these two sub-graphs are denoted SQ1 and SQ2, without loss of generality, | SQ1| ═ k, | SQ2| ═ n-k-1 is set, where 0 ≦ k ≦ n-1.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the steps of the method of knowledge-graph origin processing according to any one of claims 1 to 4 are implemented when the program is executed by the processor.
10. A non-transitory computer readable storage medium having stored thereon a computer program, wherein the computer program when executed by a processor implements the steps of the method of knowledge-graph provenance processing of any of claims 1 to 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110246378.XA CN113157934A (en) | 2021-03-05 | 2021-03-05 | Knowledge graph origin processing method and system, electronic device and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110246378.XA CN113157934A (en) | 2021-03-05 | 2021-03-05 | Knowledge graph origin processing method and system, electronic device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113157934A true CN113157934A (en) | 2021-07-23 |
Family
ID=76884361
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110246378.XA Pending CN113157934A (en) | 2021-03-05 | 2021-03-05 | Knowledge graph origin processing method and system, electronic device and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113157934A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114579826A (en) * | 2022-04-27 | 2022-06-03 | 支付宝(杭州)信息技术有限公司 | Task processing method and device based on knowledge graph |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111061750A (en) * | 2019-12-17 | 2020-04-24 | Oppo广东移动通信有限公司 | Query processing method and device and computer readable storage medium |
-
2021
- 2021-03-05 CN CN202110246378.XA patent/CN113157934A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111061750A (en) * | 2019-12-17 | 2020-04-24 | Oppo广东移动通信有限公司 | Query processing method and device and computer readable storage medium |
Non-Patent Citations (1)
Title |
---|
GARIMA GAUR 等: "How and Why is An Answer (Still) Correct? Maintaining Provenance in Dynamic Knowledge Graphs", 《CIKM \'20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114579826A (en) * | 2022-04-27 | 2022-06-03 | 支付宝(杭州)信息技术有限公司 | Task processing method and device based on knowledge graph |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11995124B2 (en) | Query language interoperability in a graph database | |
CN110472068B (en) | Big data processing method, equipment and medium based on heterogeneous distributed knowledge graph | |
CN111382174B (en) | Multi-party data joint query method, device, server and storage medium | |
US9507875B2 (en) | Symbolic hyper-graph database | |
US11971867B2 (en) | Global column indexing in a graph database | |
CN110908997A (en) | Data blood margin construction method and device, server and readable storage medium | |
CN107251021B (en) | Filtering data lineage graph | |
CN111046237B (en) | User behavior data processing method and device, electronic equipment and readable medium | |
US9703817B2 (en) | Incremental information integration using a declarative framework | |
CN111026874A (en) | Data processing method and server of knowledge graph | |
Cheng et al. | Rule-based graph repairing: Semantic and efficient repairing methods | |
CN107391557B (en) | Block chain serial query method and system for setting out-of-chain fault table | |
CN106557307B (en) | Service data processing method and system | |
CN109753573B (en) | Processing method and device for constructing preset model based on graph database | |
US11720543B2 (en) | Enforcing path consistency in graph database path query evaluation | |
CN107463596B (en) | Block chain parallel query method and system for setting out-of-chain fault table | |
CN111814458A (en) | Rule engine system optimization method and device, computer equipment and storage medium | |
CN115203435A (en) | Entity relation generation method and data query method based on knowledge graph | |
US10983997B2 (en) | Path query evaluation in graph databases | |
CN114691356A (en) | Data parallel processing method and device, computer equipment and readable storage medium | |
CN113157934A (en) | Knowledge graph origin processing method and system, electronic device and storage medium | |
CN113704248A (en) | Block chain query optimization method based on external index | |
CN104050264A (en) | Method and device for generating SQL statement | |
CN116257636A (en) | Unified management method and device for enumerated data dictionary, electronic equipment and storage medium | |
CN114996246A (en) | Data cleaning method for checking multiple fields of table based on NiFi assembly |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210723 |