CN108984617A - A kind of metadata catalog structure implementation method towards memory cloud - Google Patents

A kind of metadata catalog structure implementation method towards memory cloud Download PDF

Info

Publication number
CN108984617A
CN108984617A CN201810604826.7A CN201810604826A CN108984617A CN 108984617 A CN108984617 A CN 108984617A CN 201810604826 A CN201810604826 A CN 201810604826A CN 108984617 A CN108984617 A CN 108984617A
Authority
CN
China
Prior art keywords
fileid
node
pid
key
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810604826.7A
Other languages
Chinese (zh)
Inventor
侯迪
侯智琦
齐勇
王培健
赵文嘉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Jiaotong University
Original Assignee
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Jiaotong University filed Critical Xian Jiaotong University
Priority to CN201810604826.7A priority Critical patent/CN108984617A/en
Publication of CN108984617A publication Critical patent/CN108984617A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Metadata catalog structure implementation method towards memory cloud of the invention, using the form of directory tree, bibliographic structure is separated with directory node content, improve the variable flexibility of node content, each directory node individually adds abbreviation pid field, the pid field of child node is directed toward the fileid field of father node, and secondary index is added to pid field, when user search catalogue, utilize the advantage of pid secondary index, Key-Value and memory storage, catalogue recall precision is improved, carry efficiency is also greatly improved.

Description

A kind of metadata catalog structure implementation method towards memory cloud
Technical field
The present invention relates to computer fields, are related to a kind of metadata catalog structure implementation method towards memory cloud.
Background technique
Memory cloud (RAMCloud) is a kind of Key-Value type new types of data central store system, it is by thousands of Large scale system composed by the main memory of platform common server, whenever, all information be stored in these quick DRAM In (dynamic RAM, that is, the memory being commonly called as), memory instead of the hard disk in legacy system, and hard disk be only used as it is standby Part uses.The cluster of memory cloud is mainly made of primary server (Master) and backup server (Backup), wherein Master For storing and calculating;Backup is then the fast quick-recovery after restarting system.In addition, there are one coordinators in cluster (Coordinator), its NameNode node being functionally similar in Hadoop distributed file system is responsible for management configuration The functions such as information.
All data are stored in DRAM by RAMCloud, and performance can achieve the hard-disc storage system than current peak performance It unites taller 100~1000 times.In terms of access delay, a process in the application server is run in RAMCloud scheme Reading hundreds of byte datas only by network from the storage server of same data center needs 5~10 μ s, and reality is at present 0.5~10ms is spent as unified, being specifically dependent upon data is in server memory caching or in hard disk.Moreover, one Multicore storage server is per second can to service at least 1,000,000 read requests.And in hard-disk system it is same machine is per second can only 1000~10000 requests of service.It can be seen that memory cloud will be a big important breakthrough point in the following storage.
Traditional high-energy physics metadata by using disk storage, using its bibliographic structure of Mysq1 database purchase with And metadata information, it can satisfy the retrieval of the metadata catalog structure of the data volume generation of TB rank.And in the feelings of EB rank Under condition, since its bibliographic structure is deeper, retrieval performance is not obviously able to satisfy using the requirements for access to metadata catalog.
Summary of the invention
It is an object of the invention to overcome the shortage of prior art, it is real to provide a kind of metadata catalog structure towards memory cloud Existing method, bibliographic structure is separated with directory node content, overcomes existing memory cloud metadata catalog structure deeper, retrieval performance Lower problem.
The present invention uses following scheme to achieve the above object:
A kind of metadata catalog structure implementation method towards memory cloud, includes the following steps:
1) RAMCloud memory database is used, bibliographic structure tree, each node definition fileid field and pid word are defined Section, data field carry out assignment as needed, and the pid field of child node is directed toward the fileid field of father node;
2) TABLE_INDEX table and TABLE_DATA table is defined respectively to separate bibliographic structure and directory information content; TABLE_INDEX table is used for storing directory structure, and using pid and name as key, fileid is as value;TABLE_DATA table For storing node content, using fileid as key, node other information is packaged into unified structural body as value;
3) increase secondary index mechanism, define the index IndexKey::IndexKeyRange type in memory cloud, by Pid+name and individual pid forms compound Key, and VALUE is fileid and query result is the compound of fileid set VALUE, the range of definition search IndexLookup type, pass through IndexLookup chaining search subdirectory;
4) i.e. the KEY of TABLE_DATA table is the VALUE of TABLE_INDEX table, when some file of user search, first Understand catalog structure, the catalogue where retrieving file simultaneously obtains the fileid of destination node, then passes through TABLE_DATA Fileid is obtained file content as key by table.
Metadata catalog structure implementation method of the present invention towards memory cloud, uses the full memory of Key-Value non-relational Database RAMCloud redesigns traditional bibliographic structure tree, using the form of directory tree, by bibliographic structure and mesh Record node content is separated, and the variable flexibility of node content is improved, and each directory node individually adds Parent_fileid Field (abbreviation pid), the pid field of child node is directed toward the fileid field of father node, and adds secondary index to pid field, When user search catalogue, using the advantage of pid secondary index, Key-Value and memory storage, catalogue retrieval effect is improved Rate, carry efficiency are also greatly improved.
Using secondary index accelerate retrieval subdirectory, when user search directory information, using pid secondary index, The advantage of Key-Value and full memory improve catalogue recall precision.
Detailed description of the invention
Fig. 1 is abstract table structure figure of the invention
Fig. 2 is secondary index architecture diagram of the invention
Fig. 3 is the illustraton of model of the embodiment of the present invention
Specific embodiment
Present invention is further described in detail in the following with reference to the drawings and specific embodiments, but not as to limit of the invention It is fixed.
As shown in Figure 1, the metadata catalog structure implementation method of the invention towards memory cloud, the specific steps are as follows:
1) tradition mysql disk database is replaced using RAMCloud memory database;
It 2) is TABLE_ respectively by defining two metadata tables for bibliographic structure and directory information content separate design INDEX table and TABLE_DATA table;
3) TABLE_INDEX table is used for storing directory structure, and using pid and name as key, fileid is as value, together When ensure that the uniqueness of key and value;
4) TABLE_DATA table is for storing node content, using fileid as key, with name, nlink, atime ... Equal nodes other information is packaged into unified structural body as value;
5) increase secondary index mechanism simultaneously, define the index IndexKey::IndexKeyRange type in memory cloud, Original Key being made of pid+name is converted into the compound Key formed by pid+name and individual pid, original VALUE is converted into the compound VALUE by unique fileid and query result for fileid set by single fileid, defines model It encloses and searches IndexLookup type, pass through IndexLookup chaining search subdirectory, such as Fig. 2;
6) i.e. the KEY of TABLE_DATA table is the VALUE of TABLE_INDEX table, therefore when some file of user search When, first can catalog structure, catalogue where retrieving file simultaneously obtains the fileid of destination node, then passes through Fileid is obtained file content as key by TABLE_DATA table.
As shown in figure 3, by embodiment, the present invention will be further described in conjunction with Fig. 3.
With bibliographic structure/A ,/B/D ,/B/E, for/C.Root node/fileid be 1, pid 0, represent root node, on Grade directory node is sky, and the fileid of A is 2, pid 1, be connected to root node/, similarly, the fileid of B is 3, pid 1, C's Fileid is 3, pid 1, and the fileid of D is 5, pid 3, and the fileid of E is 6, pid 3, forms directory tree.
Each directory node has a fileid, the parent_fileid (abbreviation pid) of child node to be directed toward father's section Point, the pid of child node is directed toward the fileid of father node, and passes through this side using secondary index to pid addition secondary index Formula realizes the storage of bibliographic structure, accelerates retrieval subdirectory.
Finally it should be noted that: the above examples are only used to illustrate the technical scheme of the present invention rather than its limitations, to the greatest extent Pipe is described the invention in detail referring to above-described embodiment, it should be understood by those ordinary skilled in the art that: still may be used With modifications or equivalent substitutions are made to specific embodiments of the invention, and repaired without departing from any of spirit and scope of the invention Change or equivalent replacement, should all cover in present claims range.

Claims (1)

1. a kind of metadata catalog structure implementation method towards memory cloud, it is characterised in that include the following steps:
1) RAMCloud memory database is used, bibliographic structure tree, each node definition fileid field and pid field are defined, Data field carries out assignment as needed, and the pid field of child node is directed toward the fileid field of father node;
2) TABLE_INDEX table and TABLE_DATA table is defined respectively to separate bibliographic structure and directory information content;TABLE_ INDEX table is used for storing directory structure, and using pid and name as key, fileid is as value;TABLE_DATA table is for depositing Node content is put, using fileid as key, node other information is packaged into unified structural body as value;
3) increase secondary index mechanism, the index IndexKey::IndexKeyRange type in memory cloud is defined, by pid+ Name and individual pid forms compound Key, and VALUE is fileid and query result is the compound VALUE of fileid set, fixed Adopted range-based searching IndexLookup type passes through IndexLookup chaining search subdirectory;
4) i.e. the KEY of TABLE_DATA table is the VALUE of TABLE_INDEX table, when some file of user search, can be examined first Rope bibliographic structure, catalogue where retrieving file simultaneously obtain the fileid of destination node, then will by TABLE_DATA table Fileid obtains file content as key.
CN201810604826.7A 2018-06-13 2018-06-13 A kind of metadata catalog structure implementation method towards memory cloud Pending CN108984617A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810604826.7A CN108984617A (en) 2018-06-13 2018-06-13 A kind of metadata catalog structure implementation method towards memory cloud

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810604826.7A CN108984617A (en) 2018-06-13 2018-06-13 A kind of metadata catalog structure implementation method towards memory cloud

Publications (1)

Publication Number Publication Date
CN108984617A true CN108984617A (en) 2018-12-11

Family

ID=64541201

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810604826.7A Pending CN108984617A (en) 2018-06-13 2018-06-13 A kind of metadata catalog structure implementation method towards memory cloud

Country Status (1)

Country Link
CN (1) CN108984617A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104270412A (en) * 2014-06-24 2015-01-07 南京邮电大学 Three-level caching method based on Hadoop distributed file system
US20170039137A1 (en) * 2015-08-07 2017-02-09 Oracle International Corporation Distributed directory service for in-memory compression unit home location
CN106874383A (en) * 2017-01-10 2017-06-20 清华大学 A kind of decoupling location mode of metadata of distributed type file system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104270412A (en) * 2014-06-24 2015-01-07 南京邮电大学 Three-level caching method based on Hadoop distributed file system
US20170039137A1 (en) * 2015-08-07 2017-02-09 Oracle International Corporation Distributed directory service for in-memory compression unit home location
CN106874383A (en) * 2017-01-10 2017-06-20 清华大学 A kind of decoupling location mode of metadata of distributed type file system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
侯智琦: ""A High Energy Physical Metadata Directory Structure"", 《IEEE XPLORE》 *

Similar Documents

Publication Publication Date Title
Padhy et al. RDBMS to NoSQL: reviewing some next-generation non-relational database’s
Zhang et al. An efficient multi-dimensional index for cloud data management
CN111427847B (en) Indexing and querying method and system for user-defined metadata
CN105117417A (en) Read-optimized memory database Trie tree index method
CN100458784C (en) Researching system and method used in digital labrary
CN106708993A (en) Spatial data storage processing middleware framework realization method based on big data technology
CN102609490B (en) Column-storage-oriented B+ tree index method for DWMS (data warehouse management system)
Banane et al. Storing RDF data into big data NoSQL databases
El Alami et al. Supply of a key value database redis in-memory by data from a relational database
Weintraub et al. Needle in a haystack queries in cloud data lakes.
Anand et al. MongoDB and Oracle NoSQL: A technical critique for design decisions
CN107273443B (en) Mixed indexing method based on metadata of big data model
Qi Digital forensics and NoSQL databases
Kuhn et al. Coupling storage systems and self-describing data formats for global metadata management
Arora et al. An algorithm for transformation of data from MySQL to NoSQL (MongoDB)
CN108984617A (en) A kind of metadata catalog structure implementation method towards memory cloud
Gupta et al. An extended HDFS with an AVATAR NODE to handle both small files and to eliminate single point of failure
US10762139B1 (en) Method and system for managing a document search index
CN115248829A (en) Data storage method, data query method and device
CN105468599A (en) Metadata hierarchy management method for storage virtualization system
Chang et al. Resilient distributed computing platforms for big data analysis using Spark and Hadoop
CN108241624A (en) The generation method and device of a kind of query script
CN108153911B (en) Distributed cloud storage method of data
JP2006092409A (en) Composite database retrieval system, composite database retrieval method, and program therefor
CN117435559B (en) Metadata hierarchical management method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20181211