CN106156319A - Telescopic distributed resource description framework data storage method and device - Google Patents

Telescopic distributed resource description framework data storage method and device Download PDF

Info

Publication number
CN106156319A
CN106156319A CN201610523135.5A CN201610523135A CN106156319A CN 106156319 A CN106156319 A CN 106156319A CN 201610523135 A CN201610523135 A CN 201610523135A CN 106156319 A CN106156319 A CN 106156319A
Authority
CN
China
Prior art keywords
memory node
data
memory
predicate
subject
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610523135.5A
Other languages
Chinese (zh)
Inventor
胡春明
汪细勖
师斌
李博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University filed Critical Beihang University
Priority to CN201610523135.5A priority Critical patent/CN106156319A/en
Publication of CN106156319A publication Critical patent/CN106156319A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the present invention provides a kind of telescopic distributed resource description framework data storage method and device, and described method includes: be assigned in the internal memory of at least two memory node store by all resource description framework RDF data;Data in the internal memory of each first memory node are backuped in the persistent storage device of the second memory node, described first memory node and described second memory node are any one memory node in described at least two memory node, and described second memory node is adjacent with described first memory node.Use the telescopic distributed resource description framework data storage method and device that the embodiment of the present invention provides, the efficiency of data storage and inquiry can be promoted, and ensure when certain memory node lost efficacy, can be rapidly from the adjacent storage nodes local recovery data of this memory node, it is ensured that data storage and inquiry service are persistently provided.

Description

Telescopic distributed resource description framework data storage method and device
Technical field
The present invention relates to computer realm, particularly relate to a kind of telescopic distributed resource description framework data storage Method and device.
Background technology
Semantic net is to add, by the document on WWW, the semanteme (Meta Data) that can be understood by machine, so that Whole the Internet becomes a general information medium of exchange.Resource description framework (Resource Description Framework, be called for short: RDF) be for describe the World Wide Web Consortium of Internet resources (World Wide Web Consortium, It is called for short: W3C) standard, for expressing the metadata about internet web page resource, the such as title of webpage, author, amendment day Phase, content and copyright information etc..Based on RDF data, semantic net is studied, always the study hotspot of scientific circles.And such as What high-performance, the easily storage of extension ground and inquiry magnanimity RDF data become the problem needing solution badly.
In prior art, for the RDF data of magnanimity, academia has been developed for the RDF data storage side of some maturations Method, these storage methods mostly use traditional centralized relevant database to store RDF data.But, along with RDF data Growth rapidly, traditional centralized relational data storage mode cannot meet storage and inquiry calculate demand, because of This, the storage of distributed RDF data has obtained increasing application with inquiry system.Such as, a kind of RDF data storage scheme In, use key assignments (key-value) to store the system Redis storage system as RDF data, by passing through many by RDF data Individual memory node stores, parallel processing query task, thus improves search efficiency.
But, this distributed RDF data storage method provided in prior art, when some data memory node loses During effect, data will be lost, and causes service disruption.
Summary of the invention
The embodiment of the present invention provides a kind of telescopic distributed resource description framework data storage method and device, uses In solving the problem that in resource description framework data storage method of the prior art, memory node inefficacy causes service disruption.
The embodiment of the present invention provides the peripheral hardware management and control detection method of a kind of intelligent terminal, including:
It is assigned in the internal memory of at least two memory node store by all resource description framework RDF data;
Data in the internal memory of each first memory node are backuped in the persistent storage device of the second memory node, institute State the first memory node and described second memory node is any one memory node in described at least two memory node, institute State the second memory node adjacent with described first memory node.
On the basis of above-described embodiment, described method also includes:
Monitor the storage state of described at least two memory node;
If the storage state of described first memory node was for losing efficacy, then, by the persistent storage of described second memory node In device, the data in the internal memory of described first memory node of backup, return in the internal memory of described second memory node.
On the basis of above-described embodiment, the described at least two that is assigned to by all resource description framework RDF data stores Before the internal memory of node stores, also include:
Obtain the subject of described RDF data, predicate and object;
According to described subject, predicate and object, described RDF data is set up index;
Described being assigned in the internal memory of at least two memory node by all RDF data stores, including:
According to concordance hash algorithm, the described RDF data establishing index is assigned to described at least two memory node Internal memory in store.
On the basis of above-described embodiment, described according to described subject, predicate and object, described RDF data is set up rope Draw, including the combination of following any one or more mode:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
The embodiment of the present invention also provides for a kind of telescopic distributed resource description framework data storage device, including:
Distribution module, for being assigned to all resource description framework RDF data in the internal memory of at least two memory node Store;Described at least two memory node includes the first memory node and the second memory node;
Backup module, for backuping to described second memory node by the data in the internal memory of described first memory node In persistent storage device, described second memory node is adjacent with described first memory node.
On the basis of above-described embodiment, described device also includes:
Monitoring module, for monitoring the storage state of described at least two memory node;
Data recovery module, for monitoring the storage state of described first memory node for losing efficacy when described monitoring module Time, by the data in the internal memory of described first memory node of backup in the persistent storage device of described second memory node, extensive Answer in the internal memory of described second memory node.
On the basis of above-described embodiment, described device also includes:
Acquisition module, deposits for all resource description framework RDF data being assigned at least two in described distribution module Before the internal memory of storage node stores, obtain the subject of described RDF data, predicate and object;
Memory storage management module, for setting up index according to described subject, predicate and object to described RDF data;
Described distribution module, for being assigned to institute according to concordance hash algorithm by the described RDF data establishing index State in the internal memory of at least two memory node and store.
On the basis of above-described embodiment, described memory storage management module, specifically for:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
The telescopic distributed resource description framework data storage method and device that the embodiment of the present invention provides, passes through All RDF data are assigned to each memory node, improve data storage and the efficiency accessed.Meanwhile, memory node is supported Be dynamically added and exit, and the data in each memory node are backuped to the adjacent storage nodes in concordance Hash ring In, it is ensured that when certain memory node lost efficacy, it is possible to rapidly from the adjacent storage nodes local recovery data of this memory node, protect Card system is not collapsed, and persistently provides data storage and inquiry service.When newly-increased memory node, data are assisted according to concordance Hash View Autonomic Migration Framework, newly-increased node can come into operation rapidly.Meanwhile, use Redis cluster as the memory storage system of bottom, And use SP_O, SO_P, OP_S, S_OP, P_SO, O_SP 6 index, and construct distributed RDF query on this basis Engine, it is achieved that the distributed RDF system of a quick search, and any kind of data query can be supported.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing The accompanying drawing used required in having technology to describe does one and introduces simply, it should be apparent that, the accompanying drawing in describing below is this Some bright embodiments, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to Other accompanying drawing is obtained according to these accompanying drawings.
Fig. 1 is the storage of the embodiment of the present invention telescopic distributed resource description framework data and the structure of inquiry system Schematic diagram;
Fig. 2 is the structural representation of the query plan tree that inquiry plan generation module generates in the embodiment of the present invention;
Fig. 3 is the flow process signal of the embodiment of the present invention telescopic distributed resource description framework data storage method Figure;
Fig. 4 is that the embodiment of the present invention telescopic distributed resource description framework data storage method carries out data distribution And the schematic flow sheet of backup;
Fig. 5 is that the embodiment of the present invention telescopic distributed resource description framework data storage method is in memory node The process schematic of distribution data;
Fig. 6 is the structural representation of the embodiment of the present invention telescopic distributed resource description framework data storage device Figure.
Detailed description of the invention
For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is The a part of embodiment of the present invention rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment obtained under not making creative work premise, broadly falls into the scope of protection of the invention.
The telescopic distributed resource description framework data storage method and device of embodiment of the present invention intelligent terminal, For realizing resource description framework (Resource Description Framework is called for short RDF) based on key assignments point of data Cloth storage and inquiry, improve data reading performance using redundancy, simultaneously by backing up data in adjacent storage nodes so that certain is deposited During storage node failure, data can be recovered from adjacent storage nodes, prevent system crash, service disruption, affect data storage With query performance.
Fig. 1 is the storage of the embodiment of the present invention telescopic distributed resource description framework data and the structure of inquiry system Schematic diagram.
Referring to Fig. 1, the telescopic distributed resource description framework data that the embodiment of the present invention provides stores and looks into Inquiry system includes main controlled node 10, and at least two is from node 20, and is used for realizing described main controlled node 10 with described from node 20 Between communication communication module 30.Described main controlled node 10 and can be that there is data-handling capacity and storage capacity from node 20 Equipment, such as computer.Described main controlled node 10 includes: RDF data resolver 11, dictionary 12, RDF data statistical module 13, RDF data sheer 14, query statement resolver 15, inquiry plan generation module 16 and Optimizing Queries module 17.Described from joint Point 20 includes local search processor 21, memory storage management module 22, backs up and recover module 23, disk storage management module 24 and monitoring module 25.
Specifically, described RDF data resolver 11 is responsible for extracting subject, meaning from the RDF data file of different-format Language and object (subject, predicate, object) tlv triple.RDF data source is generally from the forms such as NT, N3, TTL at present Data file.They describe RDF data figure in a different manner, but their basic element is all tlv triple.Described RDF Data parser 11 is capable of identify that the data file of these forms, and extracts tlv triple from every statement.One data literary composition The big then 10G of part, described RDF data resolver 11 can rapidly process these files.
Described dictionary 12 is for being mapped to ID by the tlv triple of string format, in order to encodes tlv triple and arranges Sequence, and save memory space.First, from described RDF data resolver 11, the tlv triple of string format, then root are received According to each tuple to dictionary is searched corresponding mark (Ientification is called for short ID).Should if there is the most directly returning ID, if there is no then generating a unique ID, and is stored in corresponding with described character string for described ID in dictionary, returns described ID。
Described RDF data type statistics module 13 is used for adding up the quantity of the tlv triple of various combination, and by statistical data It is saved in the internal memory of described main controlled node 10.Described statistical data provides help for the optimization of future query plan, with optimum Change the order of connection, reduce the generation of intermediate data.In view of subject (be called for short S), predicate (is called for short P), and object (is called for short O), subject and Predicate (is called for short SP), and the tlv triple of predicate and object (being called for short PO) type is the most conventional, and described data type is added up Module 13 stores identical S, identical P, identical O, identical SP or the quantity of identical PO, needs to determine that connection is suitable in query script During sequence, judge according to these statistical datas.The first connection that quantity is few, the rear connection that quantity is many, thus improve search efficiency.
Described RDF data sheer 14 is for being assigned to each according to concordance Hash rule from node 20 by data. Owing to concordance hash algorithm supports being dynamically added and exiting of node, therefore system can be realized by concordance hash algorithm Fault tolerance.First build an annular hash space, then RDF data is hashing onto in ring, then memory node is breathed out Wish in ring, according to clockwise, data are stored in adjacent memory node.When deleting a memory node, according to Clockwise, will be routed in next memory node by the data of abreviation point.When a newly-increased memory node, will be newly-increased Data between node and the most adjacent memory node thereof are all routed in newly-increased memory node.
Described query statement resolver 15 is for by query language and data acquisition protocols (Simple Protocol and RDF Query Language, is called for short: SPARQL) query statement resolves to a query graph, and identifies variable therein, often Amount and query pattern.
Described inquiry plan generation module 16 is for generating an inquiry plan according to query statement, and inquiry plan is one Operation tree, on tree, each node is an operator, and operator includes scan operation, attended operation.
Fig. 2 is the structural representation of the query plan tree that inquiry plan generation module generates in the embodiment of the present invention.Such as Fig. 2 Shown in, DIS represents scan operation, finds, from storage, the query pattern met, and DMJ represents and is attached two query patterns Operation, all of result is attached by last DHJ, generates final inquiry plan.
Described Optimizing Queries module 17, for according to the described inquiry plan generated and described statistical data, optimizes scanning And the order of connection, enable the operation executed in parallel on query plan tree, and reduce the generation of intermediate object program as far as possible.
The described described local search processor 21 from node 20 receives from described main controlled node 10 for basis Described inquiry plan performs relevant scan operation and attended operation, and responsible each mutually transmits middle junction between node 20 Really.Wherein scan operation represent according to described query pattern from storage find qualified tlv triple, attended operation represents Two query patterns are calculated cartesian product and obtains intermediate object program or final result.
The RDF tlv triple data that described memory storage management module 22 is used for receiving are with key assignments (key-value) Mode is stored in internal memory, and searches data rapidly according to search request.Currently for the possible situation of inquiry, set up altogether 6 Class indexes.Respectively: SP is key, O is value;SO is key, and P is value;OP is key, and S is value;S is key, and OP is value;P is key, and SO is Value;O is key, and SP is value.Data after index are stored in Redis data base with (String, Set<String>) character string structure In.
Described backup and recovery module 23 are for storing described according to concordance hash algorithm from the internal memory of node 20 Data backup to the adjacent persistent storage device from node, such as in disk.Data in disk with key value structure by disk Management module 24 is responsible for organization and management.Until all data have loaded, backup completes the most accordingly.Lose from node 20 when described Data can be recovered from the disk of node from adjacent rapidly, it is ensured that inquiry service is not interrupted during effect.
Described monitoring module 25 be used for monitoring each from node 20, once any one lost efficacy from node 20, fast notification Described main controlled node 10 recovers described inefficacy data from node.After described main controlled node 10 receives request, notify that other are deposited That lives recovers data from nodal parallel, it is achieved fault-tolerant.Specifically, described monitoring module 24 carries out failure detection and make use of The configuration management of Zookeeper and group membership management service.
Fig. 3 is the flow process signal of the embodiment of the present invention telescopic distributed resource description framework data storage method Figure.
Refer to Fig. 3, the telescopic distributed resource description framework data storage method that the embodiment of the present invention provides Including:
S101: be assigned in the internal memory of at least two memory node store by all resource description framework RDF data;
Described memory node is in the storage of above-mentioned telescopic distributed resource description framework data and inquiry system From node.Specifically, described memory node can be the electronic equipment with storage function, such as computer, mobile terminal etc..
Before performing described step S101, after main controlled node receives RDF data, first obtain described RDF data Subject, predicate and object;Then according to described subject, predicate and object, described RDF data is set up index.
Described being assigned in the internal memory of at least two memory node by all RDF data stores, including:
According to concordance hash algorithm, the described RDF data establishing index is assigned to described at least two memory node Internal memory in store.
Specifically, described according to described subject, predicate and object to described RDF data set up index, including following arbitrarily The combination of one or more modes:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
Use above-mentioned six kinds of modes to set up index, cover RDF data subject, predicate, object tlv triple according to key assignments knot The combination in any mode of structure so that the distributed resource description framework data storage method that the embodiment of the present invention provides can be propped up Hold any kind of data query.
S102: the data in the internal memory of each first memory node are backuped to the persistent storage device of the second memory node In, described first memory node and described second memory node are any one the storage joint in described at least two memory node Point, described second memory node is adjacent with described first memory node.
Further, described method also includes:
Monitor the storage state of described at least two memory node;
If the storage state of described first memory node was for losing efficacy, then, by the persistent storage of described second memory node In device, the data in the internal memory of described first memory node of backup, return in the internal memory of described second memory node.
Fig. 4 is that the embodiment of the present invention telescopic distributed resource description framework data storage method carries out data distribution And the schematic flow sheet of backup.
Referring to Fig. 3, when carrying out data distribution, main controlled node parses subject, predicate and object from RDF data file Tlv triple, sets up the index of above-mentioned 6 kinds of key assignmentses to every tlv triple, and according to concordance Hash agreement by RDF data It is distributed in multiple corresponding memory node preserve.After multiple memory nodes receive data, storage is a to internal memory, and according to Concordance Hash agreement finds this memory node adjacent storage nodes on concordance Hash ring, backs up data to this adjacent The persistent storage device of memory node, such as in disk.
Fig. 5 is that the embodiment of the present invention telescopic distributed resource description framework data storage method is in memory node The process schematic of distribution data.
As a example by Fig. 5, distribution, backup and the recovery process of data is illustrated below.
Refer to Fig. 5, when data are allocated, first, build an annular hash space, then by all RDF numbers According to being hashing onto in ring, then each memory node is hashing onto in ring, according to clockwise, data is stored to adjacent storage In node.Such as Fig. 3, illustratively, data 1-3 are stored in the internal memory of memory node A, and data 4-6 are stored in the interior of memory node B In depositing, data 7-9 are stored in the internal memory of memory node C, and data 10-12 are stored in the internal memory of memory node D, class successively Push away.Meanwhile, data 1-3 back up in the disk of adjacent storage nodes B of memory node A, and data 4-6 back up in memory node B Adjacent storage nodes C disk in, data 7-9 back up in the disk of memory node D, data 10-12 back up in storage joint In the disk of some A, the like.
When memory node D lost efficacy, and needed to delete memory node D, system is in the following manner to the number in memory node D According to recovering:
1) from the disk of memory node A, data 10-12 are recovered in internal memory;
2) all data 10-12 in memory node A disk and 1-3 are transferred in the disk of memory node B;
3) all data in the disk of memory node A are emptied;
4) all data in memory node C internal memory are backuped in the disk of memory node A.
Illustratively, please continue to refer to Fig. 4, when distributed system PC cluster ability and storage capacity are not enough, need newly-increased During memory node D, the data migration process in each memory node is as follows:
1) calculate the cryptographic Hash of newly-increased memory node D, determine that memory node is at Hash ring according to the cryptographic Hash of memory node D In position;
2) from the disk of memory node B, the data in the memory range of memory node D are returned in memory node D In depositing;
3) data in the memory range of memory node C will be transferred to by the data of the disk of memory node A storage joint In the disk of some D;
4) data in the disk of situation memory node A;
5) from the disk of memory node B, the data in the memory range of memory node D are transferred to the magnetic of memory node A In dish;
6) from the internal memory of memory node A, delete the data in the memory range of memory node D;
7) from the disk of memory node B, delete the data in the memory range of memory node D.
The telescopic distributed resource description framework data storage method that the embodiment of the present invention provides, passes through concordance All RDF data are assigned to each memory node by hash algorithm, improve the efficiency that data storage accesses.Meanwhile, support to deposit Store up being dynamically added and exiting of node, and the data in each memory node are backuped to the adjacent storage in concordance Hash ring In node, it is ensured that when certain memory node lost efficacy, it is possible to rapidly from the adjacent storage nodes local recovery number of this memory node According to, it is ensured that system is not collapsed, and persistently provides data storage and inquiry service.When newly-increased memory node, data are according to concordance Hash agreement Autonomic Migration Framework, newly-increased node can come into operation rapidly.Meanwhile, employing Redis cluster is deposited as the internal memory of bottom Storage system, and use SP_O, SO_P, OP_S, S_OP, P_SO, O_SP 6 index, and construct distributed on this basis RDF query engine, it is achieved that the distributed RDF system of a quick search, and any kind of data query can be supported.
Fig. 6 is the structural representation of the embodiment of the present invention telescopic distributed resource description framework data storage device Figure.
Refer to Fig. 6, the telescopic distributed resource description framework data storage device that the embodiment of the present invention provides, At least include distributing module 610 and backup module 620.
Described distribution module 610, for being assigned at least two memory node by all resource description framework RDF data Internal memory stores;Described at least two memory node includes the first memory node and the second memory node;
Described backup module 620, deposits for the data in the internal memory of described first memory node are backuped to described second In the persistent storage device of storage node, described second memory node is adjacent with described first memory node.
On the basis of above-described embodiment, further, described device also includes:
Monitoring module 630, for monitoring the storage state of described at least two memory node;
Data recovery module 640, for when described monitoring module monitors the storage state of described first memory node being During inefficacy, by the number in the internal memory of described first memory node of backup in the persistent storage device of described second memory node According to, return in the internal memory of described second memory node.
On the basis of above-described embodiment, further, described device also includes:
Acquisition module 650, for being assigned at least two in described distribution module by all resource description framework RDF data Before the internal memory of memory node stores, obtain the subject of described RDF data, predicate and object;
Memory storage management module 660, for setting up index according to described subject, predicate and object to described RDF data;
Described distribution module 610, for being assigned to the described RDF data establishing index according to concordance hash algorithm The internal memory of described at least two memory node stores.
On the basis of above-described embodiment, further, described memory storage management module 660, specifically for:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
The telescopic distributed resource description framework data storage device that the embodiment of the present invention provides is for realizing Stating the telescopic distributed resource description framework data storage method that embodiment of the method provides, it implements principle and skill Art effect is similar with embodiment of the method, does not repeats them here.
One of ordinary skill in the art will appreciate that: all or part of step realizing above-mentioned each method embodiment can be led to The hardware crossing programmed instruction relevant completes.Aforesaid program can be stored in a computer, mobile phone or other portable units In read/write memory medium.This program upon execution, performs to include the step of above-mentioned each method embodiment;And aforesaid storage is situated between Matter includes: the various media that can store program code such as ROM, RAM, magnetic disc or CD.
Last it is noted that various embodiments above is only in order to illustrate technical scheme, it is not intended to limit;To the greatest extent The present invention has been described in detail by pipe with reference to foregoing embodiments, it will be understood by those within the art that: it depends on So the technical scheme described in foregoing embodiments can be modified, or the most some or all of technical characteristic is entered Row equivalent;And these amendments or replacement, do not make the essence of appropriate technical solution depart from various embodiments of the present invention technology The scope of scheme.

Claims (8)

1. a telescopic distributed resource description framework data storage method, it is characterised in that including:
It is assigned in the internal memory of at least two memory node store by all resource description framework RDF data;
Data in the internal memory of each first memory node are backuped in the persistent storage device of the second memory node, described One memory node and described second memory node are any one memory node in described at least two memory node, described Two memory nodes are adjacent with described first memory node.
Method the most according to claim 1, it is characterised in that also include:
Monitor the storage state of described at least two memory node;
If the storage state of described first memory node was for losing efficacy, then, by the persistent storage device of described second memory node Data in the internal memory of described first memory node of backup, return in the internal memory of described second memory node.
Method the most according to claim 1 and 2, it is characterised in that described by the distribution of all resource description framework RDF data Before storing in the internal memory of at least two memory node, also include:
Obtain the subject of described RDF data, predicate and object;
According to described subject, predicate and object, described RDF data is set up index;
Described being assigned in the internal memory of at least two memory node by all RDF data stores, including:
According to concordance hash algorithm, the described RDF data establishing index is assigned in described at least two memory node Store in depositing.
Method the most according to claim 3, it is characterised in that described according to described subject, predicate and object to described RDF Data set up index, including the combination of following any one or more mode:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
5. a telescopic distributed resource description framework data storage device, it is characterised in that including:
Distribution module, for being assigned in the internal memory of at least two memory node carry out by all resource description framework RDF data Storage;Described at least two memory node includes the first memory node and the second memory node;
Backup module, for backuping to the lasting of described second memory node by the data in the internal memory of described first memory node Changing in memorizer, described second memory node is adjacent with described first memory node.
Device the most according to claim 5, it is characterised in that also include:
Monitoring module, for monitoring the storage state of described at least two memory node;
Data recovery module, is used for when described monitoring module monitors the storage state of described first memory node for losing efficacy, By the data in the internal memory of described first memory node of backup in the persistent storage device of described second memory node, return to In the internal memory of described second memory node.
7. according to the device described in claim 5 or 6, it is characterised in that also include:
Acquisition module, saves for all resource description framework RDF data being assigned at least two storage in described distribution module Before the internal memory of point stores, obtain the subject of described RDF data, predicate and object;
Memory storage management module, for setting up index according to described subject, predicate and object to described RDF data;
Described distribution module, described according to concordance hash algorithm the described RDF data establishing index is assigned to extremely The internal memory of few two memory nodes stores.
Device the most according to claim 7, it is characterised in that described memory storage management module, specifically for:
With subject and predicate as key, described RDF data is set up index for value by object;
With subject and object as key, described RDF data is set up index for value by predicate;
With predicate and object as key, described RDF data is set up index for value by subject;
With subject as key, described RDF data is set up index for value by predicate and object;
With predicate as key, described RDF data is set up index for value by subject and predicate;
With object as key, described RDF data is set up index for value by subject and predicate.
CN201610523135.5A 2016-07-05 2016-07-05 Telescopic distributed resource description framework data storage method and device Pending CN106156319A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610523135.5A CN106156319A (en) 2016-07-05 2016-07-05 Telescopic distributed resource description framework data storage method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610523135.5A CN106156319A (en) 2016-07-05 2016-07-05 Telescopic distributed resource description framework data storage method and device

Publications (1)

Publication Number Publication Date
CN106156319A true CN106156319A (en) 2016-11-23

Family

ID=58061342

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610523135.5A Pending CN106156319A (en) 2016-07-05 2016-07-05 Telescopic distributed resource description framework data storage method and device

Country Status (1)

Country Link
CN (1) CN106156319A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122442A (en) * 2017-04-24 2017-09-01 上海兴容通信技术有限公司 A kind of distributed data base and its access method
CN107169138A (en) * 2017-06-13 2017-09-15 电子科技大学 A kind of data distributing method of Based on Distributed memory database query engine
CN110096515A (en) * 2019-05-10 2019-08-06 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on triple
CN110110034A (en) * 2019-05-10 2019-08-09 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on figure
CN110134335A (en) * 2019-05-10 2019-08-16 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on key-value pair
CN113377577A (en) * 2021-06-10 2021-09-10 上海西信信息科技股份有限公司 Cloud platform architecture based on big data and data processing method thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425734A (en) * 2012-02-23 2013-12-04 富士通株式会社 Database, apparatus, and method for storing encoded triples
US20130332484A1 (en) * 2012-06-06 2013-12-12 Rackspace Us, Inc. Data Management and Indexing Across a Distributed Database
CN104778277A (en) * 2015-04-30 2015-07-15 福州大学 RDF (radial distribution function) data distributed type storage and querying method based on Redis

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425734A (en) * 2012-02-23 2013-12-04 富士通株式会社 Database, apparatus, and method for storing encoded triples
US20130332484A1 (en) * 2012-06-06 2013-12-12 Rackspace Us, Inc. Data Management and Indexing Across a Distributed Database
CN104778277A (en) * 2015-04-30 2015-07-15 福州大学 RDF (radial distribution function) data distributed type storage and querying method based on Redis

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122442A (en) * 2017-04-24 2017-09-01 上海兴容通信技术有限公司 A kind of distributed data base and its access method
CN107122442B (en) * 2017-04-24 2021-04-16 上海兴容信息技术有限公司 Distributed database and access method thereof
CN107169138A (en) * 2017-06-13 2017-09-15 电子科技大学 A kind of data distributing method of Based on Distributed memory database query engine
CN107169138B (en) * 2017-06-13 2020-07-14 电子科技大学 Data distribution method for distributed memory database query engine
CN110096515A (en) * 2019-05-10 2019-08-06 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on triple
CN110110034A (en) * 2019-05-10 2019-08-09 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on figure
CN110134335A (en) * 2019-05-10 2019-08-16 天津大学深圳研究院 A kind of RDF data management method, device and storage medium based on key-value pair
CN110134335B (en) * 2019-05-10 2022-08-12 天津大学深圳研究院 RDF data management method and device based on key value pair and storage medium
CN113377577A (en) * 2021-06-10 2021-09-10 上海西信信息科技股份有限公司 Cloud platform architecture based on big data and data processing method thereof

Similar Documents

Publication Publication Date Title
CN106156319A (en) Telescopic distributed resource description framework data storage method and device
Koloniari et al. Peer-to-peer management of XML data: issues and research challenges
US8166074B2 (en) Index data structure for a peer-to-peer network
CN103020315B (en) A kind of mass small documents storage means based on master-salve distributed file system
Ji et al. Big data processing: Big challenges and opportunities
CN104102710A (en) Massive data query method
Gajendran A survey on nosql databases
Cambazoglu et al. Scalability challenges in web search engines
CN104820717A (en) Massive small file storage and management method and system
CN106569896B (en) A kind of data distribution and method for parallel processing and system
CN103812939A (en) Big data storage system
CN103246749A (en) Matrix data base system for distributed computing and query method thereof
CN104239377A (en) Platform-crossing data retrieval method and device
Mami et al. Uniform access to multiform data lakes using semantic technologies
CN103617276A (en) Method for storing distributed hierarchical RDF data
CN103823846A (en) Method for storing and querying big data on basis of graph theories
Hu et al. Towards big linked data: a large-scale, distributed semantic data storage
Von der Weth et al. Multiterm keyword search in NoSQL systems
CN107682395A (en) A kind of big data cloud computing runtime and method
Chang et al. Integration and optimization of multiple big data processing platforms
Abu-Salih et al. Introduction to big data technology
CN107943412A (en) A kind of subregion division, the method, apparatus and system for deleting data file in subregion
CN116541427B (en) Data query method, device, equipment and storage medium
KR101955376B1 (en) Processing method for a relational query in distributed stream processing engine based on shared-nothing architecture, recording medium and device for performing the method
CN112417225A (en) Joint query method and system for multi-source heterogeneous data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161123

RJ01 Rejection of invention patent application after publication