CN109325029A - RDF data storage and querying method based on sparse matrix - Google Patents

RDF data storage and querying method based on sparse matrix Download PDF

Info

Publication number
CN109325029A
CN109325029A CN201811004427.3A CN201811004427A CN109325029A CN 109325029 A CN109325029 A CN 109325029A CN 201811004427 A CN201811004427 A CN 201811004427A CN 109325029 A CN109325029 A CN 109325029A
Authority
CN
China
Prior art keywords
rdf
sparse matrix
sparse
matrix
algorithm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811004427.3A
Other languages
Chinese (zh)
Inventor
张小旺
张明月
冯志勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201811004427.3A priority Critical patent/CN109325029A/en
Publication of CN109325029A publication Critical patent/CN109325029A/en
Pending legal-status Critical Current

Links

Abstract

The present invention relates to the data storage and query process field of database engine, the characteristics of to make full use of real human's social activities relation data, by part, closely entity relationship is stored, and is improved search efficiency, is saved a large amount of memory spaces.The present invention, RDF data storage and querying method based on sparse matrix, steps are as follows: step 1: the character string type Hash of original RDF data is encoded into integer type;Step 2: RDF cube RDF Cube is constructed to the RDF data after coding;Step 3: RDF Cube is stored by way of establishing predicate index using a series of sparse matrixes;Step 4:, which parsing, and optimizes SPARQL query statement obtains optimal inquiry plan;Step 5: as a result the Join query execution based on sparse matrix exports.Present invention is mainly applied to store and query processing occasion.

Description

RDF data storage and querying method based on sparse matrix
Technical field
The present invention relates to the data storage and query process fields of database engine.Specifically, one kind is devised to be based on The memory module of sparse matrix stores a large amount of RDF datas, devises the Join operation based on sparse matrix multiplication and executes The inquiry of RDF data.
Background technique
Resource description framework (Resource Description Framework, RDF) is a kind of popular data Model indicates the information on Web in the form of triple (subject, predicate, object).One RDF data collection It can also be described as an oriented label figure, a triple means that a line, and subject and object indicate two Vertex, predicate indicate the label of this edge.SPARQL(SPARQL Protocol and RDF Query It Language) is the RDF graph data query for the standard that World Wide Web Consortium (World Wide Web Consortium, W3C) recommends Language.Extensive RDF data in real world, such as DBpedia (a very special semantic net exemplary applications) With YAGO (linked database) data, but there is a kind of important data characteristic --- " sparsities " for they.RDF data Sparsity mean that the abutment points on each vertex in RDF graph only occupy the very small part on vertex in entire figure.In fact, The sparsity of RDF data is ubiquitous.For example in social activities relationship situation in real life, have between men Different social activities relationships, and also include many complicated entity type data in these social activities relationships, than If place is associated with, character relation, time point being related to etc..Any two entity can be contacted by certain relationships Together.We are also able to reflect out the variation relation of human social activity by the association between data, such as outside somewhere Come the variation of population, traffic route usage amount, vacation trip etc. can be analyzed by human society relationship. But mankind's entirety social activities relationship has the characteristic that part is close and the overall situation is sparse again, such as, the area A is (for example, Shan Xi Sheng) relationship between internal any entity is just relatively closer, but compared to the area A and the area B (for example, Shaanxi Province and day Jinshi City) but there was only a small amount of associated connection between two areas.So it is whole for human social activity's relationship, in reality Entity between show the close situation in part.Such as Fig. 1, in DBpedia data (42966066 nodes in total), The maximum degree (out-degree and in-degree) for having more than 99.41% node is 43;(38734252 sections in total in YAGO data Point), the maximum degree more than 95.17% node is 39.SPARQL is implemented in basic chart-pattern (Basic Graph Pattern, BGP) and SPARQL algebraic operation on, Join operation be SPARQL query assessment core operation, because of BGP It is exactly the join of triple mode.And in RDF graph, the connection on two vertex can be counted by the product of adjacency matrix It calculates.
It is analyzed by the RDF data to real human's social activities relationship, we have invented one kind to be based on sparse square The memory module of battle array carries out memory-efficient data, and it is high to data progress to have invented a kind of join algorithm based on sparse matrix Effect inquiry.That is, the join algorithm based on sparse matrix is used to inquire the RDF data with " sparsity " feature.Cause And finding one kind can fully consider and using Deta sparseness come in efficient storage and inquiry human social activity's relationship The solution of RDF data is a meaningful project.
The most like prior art implementation with the present invention:
The method of the existing SPARQL inquiry for assessing RDF data can be divided into two classes: system based on relationship and System based on figure.RDF data is stored and indexed using relational approach based on the system of relationship.Such as Jena, Oracle, Sesame, 3store and SOR, these systems safeguard all triples using a big table with 3 column, respectively Corresponding to subject, predicate and object.Then SPARQL inquiry is converted into SQL query, and passes through the multiple of table It is inquired from connection.It is recently proposed many RDF triples that graph model is stored based on the method for figure, such as gStore, DipLODocus [RDF], TurboHOM++ and AMBER.SPARQL query processing is usually considered as son by these methods based on figure Figure matching, this helps to retain and query semantics information.All predicates and predicate value are mapped as bit string by gStore, Then it is organized as a VS*-tree tree.Since each layer of VS*-tree tree is all the summary of entire RDF graph, GStore can effectively handle SPARQL inquiry.DipLODocus [RDF] is started with mixing storage mode, in order to search point Submanifold and inquiry velocity is promoted by the help of cluster correlation data, first the graph structure of consideration RDF data and data analysis It is required that.TurboHOM++ develops TurboISO by the way that RDF graph is converted to general data figure, and AMBER by RDF data and SPARQL inquiry is expressed as scheming more.
The shortcomings that prior art:
The existing querying method to RDF data improves the performance of query assessment really to a certain extent.However, he Improvement in terms of handling the extensive RDF data from real world it is still limited, the reason is that they are in storing data It does not solve truthful data bring sparsity and causes to waste big quantity space when storing data.And it is being based on relationship A large amount of in system to will lead to the prolonged response time from connection, this is the potential bottleneck of system.Due to lacking concurrency, institute There is above system computationally all costly in pretreatment.In addition, when there is the querying method of time-consuming traversal will increase operation Between expense.Thus, it would be desirable to such a RDF query processing method based on sparse matrix, can either efficient storage it is a large amount of RDF data, and can quickly carry out query processing.
Summary of the invention
In order to overcome the deficiencies of the prior art, the present invention solves the sparsity of Fiel's meeting activity relationship data, proposes one Data model storage of the kind based on sparse matrix and search algorithm join based on sparse matrix.The storage mould that the present invention designs Formula, the characteristics of can make full use of real human's social activities relation data, by part, closely entity relationship is stored, Not only high-efficiency compact stores initial data in this way, but also reduces the range of inquiry data set, more improves inquiry effect Rate.Memory module of the invention can save a large amount of memory spaces, and data are expressed as sparse matrix and are only tieed up in RDF graph The mode of shield actual relationship is handled.Search algorithm of the invention is based on the matrix multiplication on sparse storage mode Join operation is carried out, can be obviously improved search efficiency.For this reason, the technical scheme adopted by the present invention is that being based on sparse matrix RDF data storage and querying method, steps are as follows:
Step 1: the character string type Hash of original RDF data is encoded into integer type;
Step 2: RDF cube RDF Cube is constructed to the RDF data after coding;
Step 3: by establishing predicate index, RDF Cube is stored in the form of a series of sparse matrixes;
Step 4:, which parsing, and optimizes SPARQL query statement obtains optimal inquiry plan;
Step 5: as a result the Join query execution based on sparse matrix exports.
Step 4 is specifically: for the SPARQL query statement of a given RDF data, being parsed into an inquiry Figure, be then ranked up further according to the statistical data of each edge in query graph, guarantee while with while connect in the case where reconstruct look into Ask executive plan.
Step 4 is further specifically: input is a SPARQL query statement, and form is a L=(e1, e2,...,en), all sides comprising this query statement initialize one first and are denoted as the point set of N to record the side of selection Point, and L is ranked up according to the statistical data on side all in L;Then it is chosen in the query statement L after sequence A line is added in Q, and the point on selected side is added in N, and update L;As long as L is not empty, the slave L circuited sequentially One statistical number of middle selection is small and is added in Q while related with having selected before, and the point ∈ N on selected side is simultaneously selected by this The point on side is added in N, updates L;Finally obtained inquiry plan Q is optimal inquiry plan.
Step 5 is specifically: the inquiry plan after given optimization is obtained by step 4, to each in inquiry plan A sparse matrix successively executes the Join algorithm based on sparse matrix and obtains final query result, wherein the sparse square of every two What battle array specifically executed is SMJoin algorithm.
Step 5 is further specifically: algorithm 1 is that each sparse matrix in the works after given optimization is carried out Join algorithm calculation process, exports final query result;Algorithm 2 is that given two sparse matrixes execute core SMJoin algorithm;
For algorithm 1, input is a given optimized inquiry plan Q, first from the inquiry meter after optimization It draws taking-up first in Q and is denoted as RtSparse matrix simultaneously updates Q, then successively takes out that next to be denoted as SpTable sparse from Q Matrix, two sparse matrix RtSMJoin operation is carried out with SpTable, and updates result to Rt, it is duplicate chosen from Q it is next A sparse matrix carries out SMJoin operation until Q is sky, and final query result is Rt
For algorithm 2, SMJoin describes two sparse matrixes A and B giving in algorithm 1 and carries out join operation, first The non-zero row of each of first Ergodic Matrices A is looked for from matrix B then to each nonzero term in the non-zero row of each of matrix A To matched nonzero term, the result after being matched if meeting condition is exported to matrix of consequence RtIn, final output The matrix of consequence of join operation is Rt
The features of the present invention and beneficial effect are:
The present invention is directed to the RDF graph data of extensive real human's social activities relationship, is proposed using the sparsity of data A kind of data model storage based on sparse matrix and search algorithm join based on sparse matrix solve in storage number According to when do not solve the problems, such as truthful data bring sparsity and cause to waste big quantity space when storing data, pass through solution The sparse characteristic of RDF data is to reach efficient storage and efficiently inquiry to extensive RDF data.Guaranteeing correctness of algorithm While, query responding time is reduced by optimisation strategy.From the point of view of the experimental result of the invention, as shown in fig. 6, being directed to Various types of social activities relational query sentences, our method is to the search efficiency of RDF graph data obviously than existing inquiry System (gStore and RDF-3X) is more preferable.
Detailed description of the invention:
Fig. 1 is that the present invention analyzes the sparsity of Fiel's meeting relationship resource RDF data.
A is DBpedia DataSet, and b is YAGO DataSet.
Fig. 2 is that the present invention is based on the general frames of the RDF query of sparse matrix processing.Including memory module, inquiry Optimization module and query execution module.
Fig. 3 is the memory module of the invention based on sparse matrix.
Fig. 4 is the citing of query optimization module of the invention.
Fig. 5 is that the present invention has carried out inquiry experiment on the generated data collection Watdiv of different scales.
The inquiry of a WatDiv:F snowflake type, the inquiry of b WatDiv:L line style;
The star-like inquiry of c WatDiv:S, the inquiry of d WatDiv:C complexity.
Fig. 6 is that the present invention can carry out inquiring experiment in Fiel on relational dataset DBpdedia and YAGO.
A is YAGO DataSet, and b is DBpedia DataSet.
Specific embodiment
One proposed the present invention be directed to the local close and whole sparse characteristic of Fiel's meeting activity relationship data Data model storage of the kind based on sparse matrix and the join querying method based on sparse matrix.
Step 1: the character string type Hash of original RDF data is encoded into integer type;
Step 2: RDF Cube (RDF cube) is constructed to the RDF data after coding;
Step 3: by establishing predicate index, RDF Cube is stored in the form of a series of sparse matrixes;
Step 4:, which parsing, and optimizes SPARQL query statement obtains optimal inquiry plan;
Step 5: as a result the Join query execution based on sparse matrix exports;
General frame figure is as shown in Fig. 2, there are three modules in total:
Memory module: including Step 1: step 2 and step 3, memory module are the forms based on sparse matrix to store And then RDF data construct RDF cube, then pass through foundation as shown in figure 3, RDF graph data are carried out Hash coding first Predicate indexes to carry out the storage of sparse matrix form;
Query optimization module: including step 4, the SPARQL query statement that inquired for one is high in order to obtain The inquiry reaction result of effect needs to optimize it according to memory module and search algorithm to obtain optimal inquiry plan. Our prioritization scheme is to reduce all intermediate result sizes generated in query process, as shown in figure 4, according to query graph The size on side be ranked up, reconstruct inquiry plan makes intermediate result minimum, as shown in algorithm 3;
Query execution: including step 5, the Join algorithm based on sparse matrix multiplication of design, such as 2 institute of algorithm 1 and algorithm Show, the inquiry plan after inputting an optimization obtains final query result by SMJoin algorithm;
The present invention is described in further detail below in conjunction with the accompanying drawings.
Referring to Fig. 1, the present invention analyzes the sparsity of RDF truthful data, (in total 42966066 in DBpedia data A node), the maximum degree (out-degree and in-degree) for having more than 99.41% node is 43;In YAGO data (in total 38734252 nodes), the maximum degree more than 95.17% node is 39.It is obvious that in the RDF data of real world In, most of point is only and the point of minimum a part is associated, whole that the sparse characteristic of the compact overall situation in part is presented.
Referring to fig. 2, the present invention is based on the general frames of the RDF query of sparse matrix processing.Including memory module, look into Ask optimization module and query execution module.Memory module has fully considered the sparsity of RDF data, proposes a kind of based on dilute Dredge the data model storage of matrix;Query optimization module is optimisation strategy under the premise of guaranteeing correct result, in order that subtracting Few query responding time;Query execution module is querying method of the present invention to RDF data, is proposed a kind of based on sparse square The Join algorithm of battle array multiplication.Wherein, algorithm 1 is to carry out Join for each sparse matrix in the works after given optimization Algorithm calculation process exports final query result.Algorithm 2 is that given two sparse matrixes execute core SMJoin algorithm.
Algorithm 1: query execution
Input: the inquiry plan Q after given optimization
Output: query result matrix table Rt
For algorithm 1, the SPARQL inquiry that SMJoin operation is carried out comprising multiple sparse matrixes is described.Input Be a given optimized inquiry plan Q, first from the inquiry plan Q after optimization take out first sparse square Battle array (is denoted as Rt) and update Q, it is then successively taken out from Q next sparse matrix (being denoted as SpTable), two sparse matrix Rt SMJoin operation is carried out with SpTable, and updates result to Rt, duplicate that next sparse matrix progress is chosen from Q SMJoin operation is until Q is sky.Final query result is Rt
Algorithm 2: specific two sparse matrixes execute SMJoin algorithm SMJoin (A, B)
Input: given two the sparse matrixes A and B chosen
Output: matrix of consequence Rt
For algorithm 2, SMJoin describes two sparse matrixes A and B giving in algorithm 1 and carries out join operation.It is first The non-zero row of each of first Ergodic Matrices A is looked for from matrix B then to each nonzero term in the non-zero row of each of matrix A To matched nonzero term, the result after being matched if meeting condition is exported to matrix of consequence RtIn.Final output The matrix of consequence of join operation is Rt
Referring to Fig. 3, the memory module of the invention based on sparse matrix, including Step 1: step 2 and step 3, this hair Bright memory module is the form based on sparse matrix to store RDF data, and RDF graph data are carried out Hash coding first, will The character string type of original RDF data is encoded into integer string;And then one RDF cubes is constructed to the RDF data after coding Body;The storage of sparse matrix form finally is carried out by establishing predicate index again, each predicate represents a sparse matrix Table, each table only store point associated therewith;
Referring to fig. 4, the citing of query optimization module of the invention, prioritization scheme is reduced to be generated in whole query process Intermediate result size.The SPARQL query statement of a given RDF data, is parsed into a query graph, then further according to The statistical data of each edge is ranked up in query graph, guarantee while with while connect in the case where reconstruct query execution plan.Tool Body inquiry plan generating algorithm is as shown in algorithm 3.
Algorithm 3: inquiry plan generates
Input: all side List:L=(e of SPARQL sentence1, e2..., en)
Output: the inquiry plan Q after optimization
For algorithm 3, describes to optimize a SPARQL query statement, ultimately generate optimal inquiry meter It draws.Input is a SPARQL query statement, and form is a L=(e1, e2..., en), it include this query statement All sides.Initialize a point set (being denoted as N) first to record the point on the side of selection, and according to side all in L Statistical data is ranked up L;Then it chooses a line in the query statement L after sequence to be added in Q, and will be selected The point on side be added in N, and update L.As long as L be not it is empty, chosen in the slave L circuited sequentially a statistical number it is small and and it It is preceding that (the point ∈ N on selected side) while related has been selected to be added in Q, and the point on the selected side is added in N, update L.This Sample both ensure that orderly selection, also ensure that each edge is relevant with side before.Finally obtained inquiry plan Q is most Excellent inquiry plan.
Referring to the data detail parameters of following table experimental situation of the invention:
#Dataset #Triples #(S∩O) #P
WatDiv100M 108 997 714 10 250 947 86
WatDiv200M 219 783 842 20 296 483 86
WatDiv300M 329 827 477 30 221 812 86
WatDiv400M 439 433 765 40 040 420 86
WatDiv500M 549 246 141 49 771 433 86
Yago 200 737 655 38 734 252 46
DBpedia 120 978 080 42 966 066 4 282
Social relationships generated data collection Watdiv and two RDF real data set DBpedias of the present invention in different scales With progress query processing experiment on YAGO.Wherein, the data set scale of generated data collection Watdiv is 100,000,000,200,000,000,300,000,000,400,000,000 With 500,000,000 or so;The triple quantity of DBpedia data is 120978080, and predicate is 4282, the quantity of subject and object There are 42966066;The triple quantity of YAGO data is 200737655, and predicate is 46, and the quantity of subject and object has 38734252.
The experimental situation of Fig. 6 is described below, the Installed System Memory for the gallery that this experiment uses is 72GB, and CPU parameter is Intel(R)Xeon(R)E5-2603v4@1.70GHz.Test the inquiry system that compares be currently a popular RDF-3X and gStore。
Referring to Fig. 5, the present invention and other systems have carried out inquiry experiment on the generated data collection Watdiv of different scales Compare.Inquiry for Watdiv data, we be categorized into four kinds of query types (snowflake type: F, line style: L, it is star-like: S And complexity: C) inquire classifying and compare.Wherein, for gStore querying method, inquiring data scale can only achieve 300,000,000 Left and right.Thus experimental result can be seen that the generated data collection for different scales, and the inquiry reaction time of the invention tests knot Fruit is all significantly lower than RDF-3X and gStore in the case where different type is inquired.
Referring to Fig. 6, the present invention and other systems compare the experiment effect of the query processing of true RDF data.By scheming As it can be seen that the inquiry reaction time of the invention is all significantly lower than the query processing time of RDF-3X and gStore.
Either generated data or truthful data, the present invention can be significant for the query statement of various social relationships Promotion search efficiency.The close characteristic of local relation that the present invention takes full advantage of in Fiel's meeting activity relationship data will Data carry out the storage of sparse matrix mode, have been obviously improved storage efficiency, and reduce the relation extents of inquiry, accelerate Query responding time.

Claims (5)

1. a kind of RDF data storage and querying method based on sparse matrix, characterized in that steps are as follows:
Step 1: the character string type Hash of original RDF data is encoded into integer type;
Step 2: RDF cube RDF Cube is constructed to the RDF data after coding;
Step 3: by establishing predicate index, RDF Cube is stored in the form of a series of sparse matrixes;
Step 4:, which parsing, and optimizes SPARQL query statement obtains optimal inquiry plan;
Step 5: as a result the Join query execution based on sparse matrix exports.
2. the RDF data storage based on sparse matrix and querying method as described in claim 1, characterized in that step 4 tool Body: for the SPARQL query statement of a given RDF data, being parsed into a query graph, then further according to looking into The statistical data of each edge is ranked up in inquiry figure, guarantee while with while connect in the case where reconstruct query execution plan.
3. as claimed in claim 2 based on sparse matrix RDF data storage and querying method, characterized in that step 4 into One step is specifically: input is a SPARQL query statement, and form is a L=(e1,e2,...,en), it is looked into comprising this All sides of sentence are ask, one is initialized first and is denoted as the point set of N to record the point on the side of selection, and own according in L The statistical data on side L is ranked up;Then it chooses a line in the query statement L after sequence to be added in Q, and by institute The point on the side of selection is added in N, and updates L;As long as L be not it is empty, chosen in the slave L circuited sequentially a statistical number it is small and and Selected and be added in Q while related before, it is selected while point ∈ N and this is selected while point be added in N, update L;Finally Obtained inquiry plan Q is optimal inquiry plan.
4. the RDF data storage based on sparse matrix and querying method as described in claim 1, characterized in that step 5 tool Body: the inquiry plan after given optimization is obtained by step 4, each of inquiry plan sparse matrix is successively held Join algorithm of the row based on sparse matrix obtains final query result, wherein what every two sparse matrix specifically executed is SMJoin algorithm.
5. as claimed in claim 4 based on sparse matrix RDF data storage and querying method, characterized in that step 5 into One step is specifically: algorithm 1 is to carry out Join algorithm calculation process for each sparse matrix in the works after given optimization, Export final query result;Algorithm 2 is that given two sparse matrixes execute core SMJoin algorithm;
For algorithm 1, input is a given optimized inquiry plan Q, first from the inquiry plan Q after optimization It takes out first and is denoted as RtSparse matrix simultaneously updates Q, then successively taken out from Q it is next be denoted as SpTable sparse matrix, two A sparse matrix RtSMJoin operation is carried out with SpTable, and updates result to Rt, duplicate that next sparse square is chosen from Q Battle array carries out SMJoin operation until Q is sky, and final query result is Rt
For algorithm 2, SMJoin describes two sparse matrixes A and B giving in algorithm 1 and carries out join operation, first time Go through the non-zero row of each of matrix A, then to each nonzero term in the non-zero row of each of matrix A, found from matrix B and its Matched nonzero term, the result after being matched if meeting condition are exported to matrix of consequence RtIn, final output join behaviour The matrix of consequence of work is Rt
CN201811004427.3A 2018-08-30 2018-08-30 RDF data storage and querying method based on sparse matrix Pending CN109325029A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811004427.3A CN109325029A (en) 2018-08-30 2018-08-30 RDF data storage and querying method based on sparse matrix

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811004427.3A CN109325029A (en) 2018-08-30 2018-08-30 RDF data storage and querying method based on sparse matrix

Publications (1)

Publication Number Publication Date
CN109325029A true CN109325029A (en) 2019-02-12

Family

ID=65263685

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811004427.3A Pending CN109325029A (en) 2018-08-30 2018-08-30 RDF data storage and querying method based on sparse matrix

Country Status (1)

Country Link
CN (1) CN109325029A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109992658A (en) * 2019-04-09 2019-07-09 智言科技(深圳)有限公司 A kind of SPARQL inquiring structuring method of Knowledge driving
CN110837510A (en) * 2019-11-13 2020-02-25 北京深演智能科技股份有限公司 Data processing method, device, storage medium and processor
CN110909111A (en) * 2019-10-16 2020-03-24 天津大学 Distributed storage and indexing method based on knowledge graph RDF data characteristics
CN110990638A (en) * 2019-10-28 2020-04-10 北京大学 Large-scale data query acceleration device and method based on FPGA-CPU heterogeneous environment
CN111400410A (en) * 2020-02-28 2020-07-10 湖南大学 Knowledge graph query method and device, computer equipment and storage medium
CN112905591A (en) * 2021-02-04 2021-06-04 成都信息工程大学 Data table connection sequence selection method based on machine learning
CN114356977A (en) * 2022-03-16 2022-04-15 湖南大学 Distributed RDF graph query method, device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521299A (en) * 2011-11-30 2012-06-27 华中科技大学 Method for processing data of resource description framework
CN104462609A (en) * 2015-01-06 2015-03-25 福州大学 RDF data storage and query method combined with star figure coding

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521299A (en) * 2011-11-30 2012-06-27 华中科技大学 Method for processing data of resource description framework
CN104462609A (en) * 2015-01-06 2015-03-25 福州大学 RDF data storage and query method combined with star figure coding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XIAOWANG ZHANG ETC.: "《gSMat: A Scalable Sparse Matrix-based Join》", 《ACM》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109992658A (en) * 2019-04-09 2019-07-09 智言科技(深圳)有限公司 A kind of SPARQL inquiring structuring method of Knowledge driving
CN109992658B (en) * 2019-04-09 2023-04-11 智言科技(深圳)有限公司 Knowledge-driven SPARQL query construction method
CN110909111A (en) * 2019-10-16 2020-03-24 天津大学 Distributed storage and indexing method based on knowledge graph RDF data characteristics
CN110909111B (en) * 2019-10-16 2023-07-14 天津大学 Distributed storage and indexing method based on RDF data characteristics of knowledge graph
CN110990638A (en) * 2019-10-28 2020-04-10 北京大学 Large-scale data query acceleration device and method based on FPGA-CPU heterogeneous environment
CN110990638B (en) * 2019-10-28 2023-04-28 北京大学 Large-scale data query acceleration device and method based on FPGA-CPU heterogeneous environment
CN110837510A (en) * 2019-11-13 2020-02-25 北京深演智能科技股份有限公司 Data processing method, device, storage medium and processor
CN110837510B (en) * 2019-11-13 2020-08-07 北京深演智能科技股份有限公司 Data processing method, device, storage medium and processor
CN111400410A (en) * 2020-02-28 2020-07-10 湖南大学 Knowledge graph query method and device, computer equipment and storage medium
CN112905591A (en) * 2021-02-04 2021-06-04 成都信息工程大学 Data table connection sequence selection method based on machine learning
CN114356977A (en) * 2022-03-16 2022-04-15 湖南大学 Distributed RDF graph query method, device, equipment and storage medium
CN114356977B (en) * 2022-03-16 2023-02-10 湖南大学 Distributed RDF graph query method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109325029A (en) RDF data storage and querying method based on sparse matrix
Ali et al. A survey of RDF stores & SPARQL engines for querying knowledge graphs
US9946780B2 (en) Interpreting relational database statements using a virtual multidimensional data model
CN110990638B (en) Large-scale data query acceleration device and method based on FPGA-CPU heterogeneous environment
US9798772B2 (en) Using persistent data samples and query-time statistics for query optimization
CN107291807B (en) SPARQL query optimization method based on graph traversal
Maccioni et al. Scalable pattern matching over compressed graphs via dedensification
US9053210B2 (en) Graph query processing using plurality of engines
CN105955999B (en) A kind of ThetaJoin inquiry processing method of extensive RDF graph
CN110909111A (en) Distributed storage and indexing method based on knowledge graph RDF data characteristics
CN109739882A (en) A kind of big data enquiring and optimizing method based on Presto and Elasticsearch
Wang et al. TSMH Graph Cube: A novel framework for large scale multi-dimensional network analysis
Saleem Storage, indexing, query processing, and benchmarking in centralized and distributed RDF engines: a survey
CN110825738A (en) Data storage and query method and device based on distributed RDF
Sridhar et al. RAPID: Enabling scalable ad-hoc analytics on the semantic web
CN111125199B (en) Database access method and device and electronic equipment
Sakr et al. Centralized RDF query processing
Cao et al. LogCEP-Complex event processing based on pushdown automaton
Li et al. Parallel search processing of tree-structured data in a big data environment
CN112835920A (en) Distributed SPARQL query optimization method based on hybrid storage mode
CN112148830A (en) Semantic data storage and retrieval method and device based on maximum area grid
Hammer et al. CubiST++: Evaluating ad-hoc CUBE queries using statistics trees
Li et al. Query optimization for massive RDF data based on Spark
Zhu et al. Hydb: Access optimization for data-intensive service
WO2023125718A1 (en) Data query method and system based on knowledge graph, and device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190212