CN113392143B - Construction and processing method of reachability query index facing multiple relational graphs - Google Patents

Construction and processing method of reachability query index facing multiple relational graphs Download PDF

Info

Publication number
CN113392143B
CN113392143B CN202110792966.3A CN202110792966A CN113392143B CN 113392143 B CN113392143 B CN 113392143B CN 202110792966 A CN202110792966 A CN 202110792966A CN 113392143 B CN113392143 B CN 113392143B
Authority
CN
China
Prior art keywords
index
relation
node
reachable
relationship
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN202110792966.3A
Other languages
Chinese (zh)
Other versions
CN113392143A (en
Inventor
王潇杨
傅仙明
吴艳萍
陈晨
卢旭峰
张梦琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Gongshang University
Original Assignee
Zhejiang Gongshang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Gongshang University filed Critical Zhejiang Gongshang University
Priority to CN202110792966.3A priority Critical patent/CN113392143B/en
Publication of CN113392143A publication Critical patent/CN113392143A/en
Application granted granted Critical
Publication of CN113392143B publication Critical patent/CN113392143B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a construction and processing method of reachability query index facing multiple relational graphs. The conventional reachability query method is not applicable to the multiple relationship graph. The invention provides two new reachable models in a multi-relation graph, and for a necessary relation reachable model, the necessary relation between two nodes can be satisfied: a path can be found between two nodes, each edge in the path containing all the relationships in a given set of relationships. For the existence relation reachable model, the existence relation reachable between two nodes meets the following conditions: a path can be found between two nodes, and each edge in the path at least contains one relation in a given relation set. In consideration of the attributes of the necessary relationship reachable model and the existing relationship reachable model, the invention provides a corresponding index construction and processing method to effectively reduce the query time. To get the index faster, the present invention proposes a novel pruning strategy to speed up the construction of the index.

Description

Construction and processing method of reachability query index facing multiple relational graphs
Technical Field
The invention belongs to the technical field of multimedia data mining, and particularly relates to a construction and processing method of reachability query indexes facing a multi-relation graph.
Background
There are many applications for graph structures, such as social networks, communication networks, road networks, etc. that can be modeled as a graph. For example, in Twitter, each user can be regarded as a node in the graph, and the relationship between users is an edge between nodes. Different relationships may be involved in the edges, as two users may be connected by "focus", "comment", "forward", etc. Similarly, in a scientific collaboration network (e.g., DBLP), each node represents an author, an edge between two nodes represents collaboration between the authors, the edge may contain areas of author collaboration, and since the author's research area is not usually single, there may be multiple relationships on the edge. In graph analysis, the reachability query problem is a basic operation. I.e. given a graph (typically a directed graph) and two nodes u and v, it is checked whether there is a path from the starting point v to the end point u. Recently, many studies have attempted to determine reachability between nodes with constraints. Jin et al combines the chain structure with the jump idea and proposes a 3-hop structure to reduce the index size of the compact graph. Zhou et al propose a DAG reduction method to accelerate reachability queries in large graphs. Valstar et al propose a landmark-based index that can be used for large graphs, select a small number of landmarks, and calculate an index corresponding to each landmark in advance, and when performing query, first perform BFS from a source node to the landmark, and then acquire a shortcut of a target node by using the index. However, previous studies on reachability issues only considered the case of only one relationship on an edge, and did not discuss the case of multiple relationships on an edge. In consideration of the fact that the edges in the multi-relationship graph have various relationships, the invention provides a necessary relationship reachable and existing relationship reachable model, and aims to effectively identify whether the nodes in the multi-relationship graph have certain specific relationships.
Disclosure of Invention
In order to determine the relationship between two nodes, the invention provides two new reachable models on the multi-relationship graph, namely necessary relationship reachable and existing relationship reachable. Wherein the necessary relationship can be satisfied: a path can be found between the two nodes, and each edge in the path contains all the relationships in the given relationship set R; the existence relation can be satisfied: a path can be found between two nodes, and each edge in the path at least contains one relation in the given relation set R.
The invention respectively constructs the index structure of the reachable query of the necessary relationship and the existing relationship based on the thought of 2-hop coverage, thereby effectively shortening the time required by the query. Aiming at the construction process of the index, the invention develops three novel pruning strategies, avoids a large amount of invalid calculations and obviously shortens the time for constructing the index.
Unnecessary nodes and index items in the index construction process are filtered through three pruning strategies, which comprise the following steps:
node-based pruning strategy: if one node v has already calculated its corresponding index, then encounter node v can skip node v directly while calculating the index of other nodes;
pruning strategy based on index entry: if the reachable relation between a pair of nodes corresponding to one index item can be obtained through the current index, the index item is directly skipped;
index-based pruning strategy: because the index item with the largest number of relationships in the index items is always processed with priority when the necessary relationship index is constructed, and the index item with the smallest number of relationships in the index items is always processed with priority when the relationship index exists, if the reachable relationship between the corresponding nodes in one index item can not be obtained from the existing index query, the index item is directly inserted into the index.
The structure of the index is specifically as follows:
for all nodes v, there will be two corresponding sets in the necessary relationship index
Figure BDA0003161725500000021
And
Figure BDA0003161725500000022
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u may be reached when the subset of;
Figure BDA0003161725500000023
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Node v may be reached when it is a subset; based on the three pruning strategies in step (2), the necessary relationship index is extremely small, and deleting any one of the indexes can cause the index to fail to correctly answer the reachable query.
For all nodes v, there will be two corresponding sets in the relational index
Figure BDA0003161725500000024
And
Figure BDA0003161725500000025
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u can be reached when superset;
Figure BDA0003161725500000026
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Can reach node v when superset; based on the three pruning strategies in step (2), the existing relational index is extremely small, and deletion of any one of the indexes can cause the index to fail to correctly answer the reachable query.
In the construction process of the index structure, a priority is allocated to each node according to the degree of the node, namely the number of neighbors of the node, wherein the higher the degree is, the higher the priority of the node is; processing the nodes according to the sequence of the priorities of the nodes from high to low; the index entries in the index are ordered according to the priority of the nodes, so that no additional ordering of the index entries is required when reachable queries are performed.
In the technical scheme for solving the technical problem, the reachable query of the necessary relationship specifically comprises the following steps: traversing node v in the necessary relationship index given the start point v and end point u of the query, and the set of relationships R
Figure BDA0003161725500000027
And of node u
Figure BDA0003161725500000028
(a) If it is currently
Figure BDA0003161725500000029
If the existing node is u and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(b) if it is currently
Figure BDA00031617255000000210
If the existing node is v and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(c) compare the current
Figure BDA00031617255000000211
The priority and the current of the node in the index entry in (1)
Figure BDA00031617255000000212
If the former is higher than the latter, then access is performed
Figure BDA00031617255000000213
The next index entry in; if the former is lower than the latter, then access is made
Figure BDA00031617255000000214
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the superset of the given relationship set R, judging that the necessary relationship reachable path from v to u exists;
(d) if the conditions (a), (b) and (c) do not occur after the traversal is finished, the situation shows that the reachable path of the necessary relation does not exist.
The specific steps of performing reachable query on the existing relationship are as follows: traversing node v in the presence of relational index given starting point v and end point u of query, and relational set R
Figure BDA0003161725500000031
And of node u
Figure BDA0003161725500000032
(a) If it is currently
Figure BDA0003161725500000033
If the existing node is u and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(b) if it is currently
Figure BDA0003161725500000034
If the existing node is v and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(c) compare the current
Figure BDA0003161725500000035
The priority and the current of the node in the index entry in (1)
Figure BDA0003161725500000036
If the former is higher than the latter, then access is performed
Figure BDA0003161725500000037
The next index entry in; if the former is lower than the latter, then access is made
Figure BDA0003161725500000038
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the subset of the given relationship set R, judging that the existing relationship reachable path from v to u exists;
(d) and if the conditions (a), (b) and (c) do not occur after the traversal is finished, the reachable path of the existing relationship does not exist.
The invention also provides a computer device, which comprises a memory and a processor, wherein the memory stores computer readable instructions, and the computer readable instructions, when executed by the processor, cause the processor to execute the steps of the construction and processing method of the reachability query index facing to the multiple relation graph.
The present invention also provides a storage medium storing computer-readable instructions, which when executed by one or more processors, cause the one or more processors to perform the steps of the method for constructing and processing the reachability query index for multiple relational maps.
The invention has the beneficial effects that: building an index to answer reachability queries between nodes in a graph is a common solution to the reachability problem. However, since the relationship combinations among the nodes in the multi-relationship graph increase exponentially with the number of the relationships and the path length among the nodes, it is necessary to design a proper index structure. The invention provides an index structure based on 2-hop coverage, which can be used for answering the reachability of nodes in a graph under a given relationship. In addition, in order to ensure that the obtained index structure is extremely small and accelerate the construction process of the index, the invention provides three novel pruning strategies, so that the index can be quickly obtained. Therefore, the construction and processing method of the reachability query index oriented to the multiple relational graphs has great benefits for determining the relations between the nodes.
Drawings
FIG. 1 is a flow chart of a method for constructing and processing a reachability query index for a multiple relationship graph implemented in the present invention;
FIG. 2 is an exemplary diagram of a practical application scenario of the present invention;
FIG. 3 is a diagram of a multiple relationship diagram of an implementation of the present invention;
FIG. 4 is a diagram of the necessary relationship reachable query index constructed for the multiple relationship graph of FIG. 3;
FIG. 5 is a presence relationship reachable query index constructed on the multiple relationship graph of FIG. 3.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail below.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways than those specifically described and will be readily apparent to those of ordinary skill in the art without departing from the spirit of the present invention, and therefore the present invention is not limited to the specific embodiments disclosed below.
The method is used for the reachability and relation reachability query of necessary relation and the existence relation. The necessary relationships can be satisfied: a path can be found between the two nodes, and each edge in the path contains all the relationships in the given relationship set R; the existence relation can be satisfied: a path can be found between two nodes, and each edge in the path at least contains one relation in the given relation set R. The method comprises three novel pruning strategies and an efficient index-based reachability query algorithm. The implementation of each part is described in detail below.
Three novel pruning strategies refer to skipping invalid nodes and index entries, thereby significantly reducing index construction time and space, including the following:
node-based pruning strategy: if one node v has already calculated its corresponding index, then encounter node v can skip node v directly while calculating the index of other nodes;
and (3) proving that: when the index is built, for each calculated node, all reachable nodes and path information of other nodes which can reach the node are recorded in the index, so that skipping over the nodes which have already calculated the index in the later process does not affect the correctness of the index.
Pruning strategy based on index entry: if the reachable relation between a pair of nodes corresponding to one index item can be obtained through the current index, the index item is directly skipped;
and (3) proving that: the reachability of an index entry can already be derived from the current index, indicating that the index entry is redundant, so skipping the index does not affect the correctness of the index.
Index-based pruning strategy: because the index item with the most relation number in the index items is always processed preferentially when the necessary relation index is constructed, and the index item with the least relation number in the index items is always processed preferentially when the relation index exists, if the reachable relation between the corresponding nodes in one index item can not be obtained by the existing index query, the index item is directly inserted into the index;
and (3) proving that: when the necessary relationship index is constructed, the index item with the largest number of relationships in the index items is preferentially processed, so that the number of relationships corresponding to the index item entering the index later is less than the number of relationships in the index, the index item in the index cannot be replaced by the index item entering the index later, and the index item can be directly inserted into the index later without checking whether some index items need to be removed and then inserted. When the existing relationship index is constructed, the index item with the minimum relationship number in the index items is processed preferentially, so that the relationship number corresponding to the index item entering the index later is more than the relationship number in the index, the index item in the index cannot be replaced by the index item entering later, and the index item can be directly inserted into the index later without checking whether certain index items need to be removed and then inserted.
Respectively constructing an index structure of the reachable query of the necessary relation and the existing relation based on the 2-hop coverage, wherein the index structure is as follows:
for all nodes v, there will be two corresponding sets in the necessary relationship index
Figure BDA0003161725500000051
And
Figure BDA0003161725500000052
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u may be reached when the subset of;
Figure BDA0003161725500000053
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Is a subset ofNode v can be reached; based on the three pruning strategies in step (2), the necessary relationship index is extremely small, and deleting any one of the indexes can cause the index to fail to correctly answer the reachable query.
For all nodes v, there will be two corresponding sets in the relational index
Figure BDA0003161725500000054
And
Figure BDA0003161725500000055
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u can be reached when superset;
Figure BDA0003161725500000056
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Can reach node v when superset; based on the three pruning strategies in step (2), the existing relational index is extremely small, and deletion of any one of the indexes can cause the index to fail to correctly answer the reachable query.
In the construction process of the index structure, a priority is allocated to each node according to the degree of the node, namely the number of neighbors of the node, wherein the higher the degree is, the higher the priority of the node is; processing the nodes according to the sequence of the priorities of the nodes from high to low; the index entries in the index are ordered according to the priority of the nodes, so that no additional ordering of the index entries is required when reachable queries are performed.
Performing reachable query on the necessary relation according to the index, specifically: traversing node v in the necessary relationship index given the start point v and end point u of the query, and the set of relationships R
Figure BDA0003161725500000057
And of node u
Figure BDA0003161725500000058
(a) If it is currently
Figure BDA0003161725500000059
If the existing node is u and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(b) if it is currently
Figure BDA00031617255000000510
If the existing node is v and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(c) compare the current
Figure BDA00031617255000000511
The priority and the current of the node in the index entry in (1)
Figure BDA00031617255000000512
If the former is higher than the latter, then access is performed
Figure BDA00031617255000000513
The next index entry in; if the former is lower than the latter, then access is made
Figure BDA00031617255000000514
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the superset of the given relationship set R, judging that the necessary relationship reachable path from v to u exists;
(d) if the conditions (a), (b) and (c) do not occur after the traversal is finished, the situation shows that the reachable path of the necessary relation does not exist.
Performing reachable query on the existing relationship according to the index, specifically: traversing node v in the presence of relational index given starting point v and end point u of query, and relational set R
Figure BDA00031617255000000515
And of node u
Figure BDA00031617255000000516
(a) If it is currently
Figure BDA0003161725500000061
If the existing node is u and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(b) if it is currently
Figure BDA0003161725500000062
If the existing node is v and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(c) compare the current
Figure BDA0003161725500000063
The priority and the current of the node in the index entry in (1)
Figure BDA0003161725500000064
If the former is higher than the latter, then access is performed
Figure BDA0003161725500000065
The next index entry in; if the former is lower than the latter, then access is made
Figure BDA0003161725500000066
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the subset of the given relationship set R, judging that the existing relationship reachable path from v to u exists;
(d) and if the conditions (a), (b) and (c) do not occur after the traversal is finished, the reachable path of the existing relationship does not exist.
Fig. 1 is a flowchart of a method for constructing and processing a reachability query index for a multiple relationship graph in an embodiment of the present invention.
Figure 2 shows a Twitter network with six users. In the figure, there is a relationship between like and request on a path from Tom to Bob, which means Tom will like and forward Bob's tweet. The follow and comment relationship from Bob to Mick means that Bob has focused on Mick and Bob reviews Mick's tweet. For the required relationship reachable query, Jack is able to reach Alice under the required relationship constraint { like, follow } because there is a path from Jack to Alice (Jack, Mick, Alice) where each edge on the path contains the relationship { like, follow }. Similarly, for a presence relationship reachable query, Tom may reach Alice under the presence relationship constraint { follow, reload }. As illustrated by way of example, the two proposed models are designed for different scenarios, and the necessary relationships may require many strict constraints.
Fig. 3 is a diagram of multiple relationships simulating the real world. Given a given relational constraint { r } 1 r 2 Is reachable for the necessary relationship, from the starting point v 1 To the end point v 4 Can find a path<v 1 ,v 3 ,v 4 >(ii) a Reachable for presence relationship, from starting point v 1 To the end point v 5 Can find a path<v 1 ,v 3 ,v 4 ,v 5 >。
By the method of the invention, the necessary relation reachable query index and the existing relation reachable query index are respectively constructed for the multiple relation graphs in FIG. 3, as shown in FIG. 4 and FIG. 5.
Furthermore, the present invention performed extensive experiments on multiple relationship plots to evaluate the effectiveness and efficiency of the proposed method. To evaluate the performance of the proposed method, we performed experiments by changing the set of nodes and relationships of the inputs. The invention uses the algorithm time consumption to respectively measure the effectiveness and the efficiency of the proposed method. For each setting, the invention was run 100000 times and averaged. All programs are realized in standard c + +, and all experiments are carried out on a server equipped with an Intel Xeon 2.2GHz CPU and a 128GB main memory. Experiments show that for the necessary relation reachable query and the existing relation reachable query, the construction and processing method of the reachability query index for the multiple relation graphs, which is provided by the invention, is 25 times faster than the basic online algorithm on the maximum data set.
In one embodiment, a computer device is provided, which includes a memory and a processor, where the memory stores computer readable instructions, and the computer readable instructions, when executed by the processor, cause the processor to execute the steps in the method for constructing and processing the reachability query index for multiple relationship graph in the foregoing embodiments.
In one embodiment, a storage medium storing computer-readable instructions is provided, and the computer-readable instructions, when executed by one or more processors, cause the one or more processors to execute the steps of the method for constructing and processing the reachability query index for multiple relationship graph in the embodiments described above. The storage medium may be a nonvolatile storage medium.
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by associated hardware instructed by a program, which may be stored in a computer-readable storage medium, and the storage medium may include: read Only Memory (ROM), Random Access Memory (RAM), magnetic or optical disks, and the like.
The foregoing is only a preferred embodiment of the present invention, and although the present invention has been disclosed in the preferred embodiments, it is not intended to limit the present invention. Those skilled in the art can make numerous possible variations and modifications to the present teachings, or modify equivalent embodiments to equivalent variations, without departing from the scope of the present teachings, using the methods and techniques disclosed above. Therefore, any simple modification, equivalent change and modification made to the above embodiments according to the technical essence of the present invention are still within the scope of the protection of the technical solution of the present invention, unless the contents of the technical solution of the present invention are departed.

Claims (6)

1. A construction and processing method of reachability query index facing multiple relation graphs is characterized in that the method is used for necessary relation reachable and existing relation reachable queries, and the necessary relation reachable needs to meet the following requirements: a path can be found between the two nodes, and each edge in the path contains all the relationships in the given relationship set R; the existing relation can meet the following requirements: a path can be found between the two nodes, and each edge in the path at least comprises one relation in the given relation set R; the method comprises the following steps:
(1) respectively constructing an index structure of the reachable query of the necessary relation and the existing relation based on 2-hop coverage, wherein the index structure comprises reachable paths between any two points under any relation;
(2) unnecessary nodes and index items in the index construction process are filtered through three pruning strategies, which comprise the following steps:
node-based pruning strategy: if one node v has already calculated its corresponding index, then encounter node v can skip node v directly while calculating the index of other nodes;
pruning strategy based on index entry: if the reachable relation between a pair of nodes corresponding to one index item can be obtained through the current index, the index item is directly skipped;
index-based pruning strategy: because the index item with the most relation number in the index items is always processed preferentially when the necessary relation index is constructed, and the index item with the least relation number in the index items is always processed preferentially when the relation index exists, if the reachable relation between the corresponding nodes in one index item can not be obtained by the existing index query, the index item is directly inserted into the index;
the index structure is specifically as follows:
for all nodes v, there will be two corresponding sets in the necessary relationship index
Figure FDA0003627006580000011
And
Figure FDA0003627006580000012
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u may be reached when the subset of;
Figure FDA0003627006580000013
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Node v may be reached when it is a subset; based on the three pruning strategies in the step (2), the necessary relationship index is extremely small, and deletion of any one of the indexes can cause that the index cannot correctly answer the reachable query;
for all nodes v, there will be two corresponding sets in the relational index
Figure FDA0003627006580000014
And
Figure FDA0003627006580000015
each index item in the set corresponds to a node u and a relation set R u Indicating that the node v is R in the relation set R u Node u can be reached when superset;
Figure FDA0003627006580000016
each index item in the set corresponds to a node u and a relation set R u Indicating that the node u is R in the relation set R u Can reach node v when superset; based on the three pruning strategies in the step (2), the existing relational index is extremely small, and deletion of any one of the indexes can cause that the indexes cannot correctly answer the reachable query;
(3) based on the constructed index structure, given a starting point and an end point, reachable paths are found which satisfy necessary relationships or exist relationships.
2. The method according to claim 1, characterized in that in the construction process of the index structure, each node is assigned a priority according to the degree of the node, i.e. the number of neighbors of the node, the higher the degree is, the higher the priority of the node is; processing the nodes according to the sequence of the priorities of the nodes from high to low; the index entries in the index are ordered according to the priority of the nodes, so that no additional ordering of the index entries is required when reachable queries are performed.
3. The method according to claim 1, wherein the reachable query for the necessary relationship according to the index in (3) is specifically: traversing node v in the necessary relationship index given the start point v and end point u of the query, and the set of relationships R
Figure FDA0003627006580000021
And of node u
Figure FDA0003627006580000022
(a) If it is currently
Figure FDA0003627006580000023
If the existing node is u and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(b) if it is currently
Figure FDA0003627006580000024
If the existing node is v and the corresponding relation set is an index item of a superset of the given relation set R, judging that the reachable path of the necessary relation from v to u exists;
(c) compare the current
Figure FDA0003627006580000025
The priority and the current of the node in the index entry in (1)
Figure FDA0003627006580000026
If the former is higher than the latter, then access is performed
Figure FDA0003627006580000027
The next index entry in; if the former is lower than the latter, then access is made
Figure FDA0003627006580000028
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the superset of the given relationship set R, judging that the necessary relationship reachable path from v to u exists;
(d) if the conditions (a), (b) and (c) do not occur after the traversal is finished, the situation shows that the reachable path of the necessary relation does not exist.
4. The method according to claim 1, wherein the performing reachable query on the existing relationship according to the index in (3) specifically includes: traversing node v in the presence of relational index given starting point v and end point u of query, and relational set R
Figure FDA0003627006580000029
And of node u
Figure FDA00036270065800000210
(a) If it is currently
Figure FDA00036270065800000211
If the existing node is u and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(b) if it is currently
Figure FDA00036270065800000212
If the existing node is v and the corresponding relation set is an index item of the subset of the given relation set R, judging that the existing relation reachable path from v to u exists;
(c) compare the current
Figure FDA00036270065800000213
The priority and the current of the node in the index entry in (1)
Figure FDA00036270065800000214
If the former is higher than the latter, then access is performed
Figure FDA00036270065800000215
The next index entry in; if the former is lower than the latter, then access is made
Figure FDA00036270065800000216
The next index entry in; if the former and the latter are the same and the corresponding relationship set is an index item of the subset of the given relationship set R, judging that the existing relationship reachable path from v to u exists;
(d) and if the conditions (a), (b) and (c) do not occur after the traversal is finished, the reachable path of the existing relationship does not exist.
5. A computer device comprising a memory and a processor, the memory having stored therein computer readable instructions which, when executed by the processor, cause the processor to perform the steps of the method of constructing and processing a multiple relationship graph oriented reachability query index of any of claims 1-4.
6. A storage medium storing computer readable instructions which, when executed by one or more processors, cause the one or more processors to perform the steps of the method of constructing and processing a multiple relationship graph oriented reachability query index of any one of claims 1-4.
CN202110792966.3A 2021-07-14 2021-07-14 Construction and processing method of reachability query index facing multiple relational graphs Expired - Fee Related CN113392143B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110792966.3A CN113392143B (en) 2021-07-14 2021-07-14 Construction and processing method of reachability query index facing multiple relational graphs

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110792966.3A CN113392143B (en) 2021-07-14 2021-07-14 Construction and processing method of reachability query index facing multiple relational graphs

Publications (2)

Publication Number Publication Date
CN113392143A CN113392143A (en) 2021-09-14
CN113392143B true CN113392143B (en) 2022-08-02

Family

ID=77626027

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110792966.3A Expired - Fee Related CN113392143B (en) 2021-07-14 2021-07-14 Construction and processing method of reachability query index facing multiple relational graphs

Country Status (1)

Country Link
CN (1) CN113392143B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103399902A (en) * 2013-07-23 2013-11-20 东北大学 Generation and search method for reachability chain list of directed graph in parallel environment
EP2731023A1 (en) * 2012-11-12 2014-05-14 Software AG Method and system for processing graph queries
CN107402927A (en) * 2016-05-19 2017-11-28 上海斯睿德信息技术有限公司 A kind of enterprise's incidence relation topology method for building up and querying method based on graph model
CN110516119A (en) * 2019-08-27 2019-11-29 西南交通大学 A kind of organizational scheduling method, device and the storage medium of natural resources contextual data
CN111666468A (en) * 2020-05-11 2020-09-15 浙江工商大学 Method for searching personalized influence community in social network based on cluster attributes
CN112667862A (en) * 2021-01-06 2021-04-16 北京理工大学 Graph reachability query method based on shared bloom filter

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2731023A1 (en) * 2012-11-12 2014-05-14 Software AG Method and system for processing graph queries
CN103399902A (en) * 2013-07-23 2013-11-20 东北大学 Generation and search method for reachability chain list of directed graph in parallel environment
CN107402927A (en) * 2016-05-19 2017-11-28 上海斯睿德信息技术有限公司 A kind of enterprise's incidence relation topology method for building up and querying method based on graph model
CN110516119A (en) * 2019-08-27 2019-11-29 西南交通大学 A kind of organizational scheduling method, device and the storage medium of natural resources contextual data
CN111666468A (en) * 2020-05-11 2020-09-15 浙江工商大学 Method for searching personalized influence community in social network based on cluster attributes
CN112667862A (en) * 2021-01-06 2021-04-16 北京理工大学 Graph reachability query method based on shared bloom filter

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于机器学习的程序路径可达性判定方法研究与实现;杨阳;《北京邮电大学学报》;20210630;全文 *

Also Published As

Publication number Publication date
CN113392143A (en) 2021-09-14

Similar Documents

Publication Publication Date Title
EP2743845B1 (en) Graph traversal operator inside a column store
US8984019B2 (en) Scalable summarization of data graphs
US8396852B2 (en) Evaluating execution plan changes after a wakeup threshold time
US9928310B2 (en) In-memory graph pattern matching
Fan et al. Querying big graphs within bounded resources
US9892143B2 (en) Association index linking child and parent tables
CN109683811A (en) A kind of request processing method mixing memory key-value pair storage system
CN106951526B (en) Entity set extension method and device
US8812492B2 (en) Automatic and dynamic design of cache groups
Zhuge et al. Basic operations, completeness and dynamicity of cyber physical socio semantic link network CPSocio‐SLN
CN109522428A (en) A kind of external memory access method of the figure computing system based on index positioning
Haddadan et al. Repbublik: Reducing polarized bubble radius with link insertions
WO2013138441A1 (en) Systems, methods, and software for computing reachability in large graphs
CN110502532A (en) Optimization method, device, equipment and the storage medium of remote data base object
CN113918126B (en) AI modeling flow arrangement method and system based on graph algorithm
CN113392143B (en) Construction and processing method of reachability query index facing multiple relational graphs
US10241927B2 (en) Linked-list-based method and device for application caching management
CN110222055A (en) The single-wheel core value maintaining method of multiple edge update under a kind of Dynamic Graph
WO2023030461A1 (en) Distributed database detection method and apparatus
Kumazawa et al. Exploration strategies for balancing efficiency and comprehensibility in model checking with ant colony optimization
CN113779025A (en) Optimization method, system and application of classified data retrieval efficiency in block chain
Brisaboa et al. Rank-based strategies for cleaning inconsistent spatial databases
CN113051080A (en) Computation graph execution method and device and heterogeneous platform
Liu et al. An efficient method for top-k graph based node matching
US20230385337A1 (en) Systems and methods for metadata based path finding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220802