CN103761275A - Management method for metadata in distributed file system - Google Patents

Management method for metadata in distributed file system Download PDF

Info

Publication number
CN103761275A
CN103761275A CN201410009764.7A CN201410009764A CN103761275A CN 103761275 A CN103761275 A CN 103761275A CN 201410009764 A CN201410009764 A CN 201410009764A CN 103761275 A CN103761275 A CN 103761275A
Authority
CN
China
Prior art keywords
metadata
data server
meta
meta data
file system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410009764.7A
Other languages
Chinese (zh)
Other versions
CN103761275B (en
Inventor
闫宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Inspur Smart Computing Technology Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd filed Critical Inspur Electronic Information Industry Co Ltd
Priority to CN201410009764.7A priority Critical patent/CN103761275B/en
Publication of CN103761275A publication Critical patent/CN103761275A/en
Application granted granted Critical
Publication of CN103761275B publication Critical patent/CN103761275B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems

Abstract

The invention provides a management method for metadata in a distributed file system. The method includes the special management processes that the system is divided into a metadata server providing metadata services and a data server, and a user can acquire a file system view through the metadata sever to obtain a directory structure and a file list of the file system; when the user visits a metadata cluster, if the metadata to be accessed are not found in the metadata cluster, data are provided to the user through the data server; when a memory occupied by a metadata cache exceeds a threshold value, part of the memory is released so as to control memory occupancy of the metadata server. Compared with the prior art, the management method for the metadata in the distributed file system has the advantages that single-point problems are solved by the metadata by the aid of a cluster mode, the metadata are stored in the data server, further the system is started rapidly, and system availability is improved.

Description

The management method of metadata in a kind of distributed file system
Technical field
The present invention relates to the Technology On Data Encryption of cluster computer, more specifically say the management method of metadata in distributed file system.
Background technology
Along with development, cloud computing and the arrival of data age greatly of infotech, people need to process original more data, generally by distributed file system, store the data of magnanimity, distributed file system exists various problems at present, such as the Single Point of Faliure of meta data server, this problem generally solves by two-node cluster hot backup.In order to reduce load, the general method of using metadata and data separating, metadata spanned file is persisted to local disk, the start-up time of system is by linear with the scale of system like this, when file system scale exceedes 5,000 ten thousand, to reach dozens of minutes start-up time, the time that system is recovered from fault is long especially, have a strong impact on the availability of system, when meta data file damages accidentally, system will be thoroughly unavailable, need to take the technological means of various complexity to improve the security of meta data file, but meta data file still has the risk of loss.
Based on this, need a kind of management method that can effectively solve metadata loss, the metadata in distributed file system.
Summary of the invention
Technical assignment of the present invention is to solve the deficiencies in the prior art, and the management method of metadata in a kind of distributed file system is provided.
Technical scheme of the present invention realizes in the following manner, the management method of metadata in this kind of distributed file system, and its concrete management process is:
One, first system is divided into two parts: meta data server and data server, wherein meta data server provides Metadata Service, and user can obtain file system view by meta data server, obtains bibliographic structure and the listed files of file system; Data server provides data, services, and user's read-write operation provides service by data server;
Two, when system starts, by data server, converge and obtain metadata, data server is reported a small amount of metadata simultaneously, these metadata caches are on meta data server, a meta data server cluster of many machine compositions, metadata in metadata cluster has been preserved a part of metadata separately, and these metadata are not identical;
Three, when user's accesses meta-data cluster, if do not find the metadata that will access in metadata cluster, meta data server just sends broadcast to data server, when data server is received broadcast request, the state of inquiring about oneself, returns to meta data server by data, then returns to user, and be buffered in the internal memory of meta data server, for access next time;
When the internal memory four, taking when metadata cache exceeds threshold value, partial memory is discharged, to control the EMS memory occupation of meta data server.
Save data and metadata on described data server, in the extended attribute of meta-data preservation in local file system, when data server starts, sweep test metadata sends to meta data server, completion system starts, and then starts thread, the disk of scanning management, by meta-data preservation in internal memory, with the broadcast request of quick response element data server.
The detailed process of described step 2 is: when data server starts, managed disk is carried out to range traversal, only travel through two-stage catalogue, finish traversal, traversing result is sent to the startup that meta data server is completion system, if file directory level is more shallow, all concentrate on a two-stage, travel through the regular hour stop traversal, on the meta data server that metadata is sent, with the start-up time of control system, metadata arrives after meta data server cluster, and metadata cluster deposits and share the storage of bibliographic structure to metadata groupings.
When the internal memory that the metadata of the meta data server buffer memory in described step 4 takies exceeds threshold value, the internal memory that meta data server discharges is the metadata of least commonly using.
The beneficial effect that the present invention compared with prior art produced is:
Metadata in a kind of distributed file system of the present invention in the management method of metadata is used trunking mode to solve single-point problem, meta-data preservation is on data server, system converges to metadata on meta data server while starting, and scale is converged in control, to start rapidly system, improve system availability, together with metadata is kept at data, can greatly strengthen the security of metadata; In the extended attribute of meta-data preservation in local file system, together with data coupling, metadata no longer includes the risk of loss, guarantees consistance and the security of data, practical, is easy to promote.
Accompanying drawing explanation
Accompanying drawing 1 is the structural representation of the embodiment of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the management method of metadata in a kind of distributed file system of the present invention is described in detail below.
The invention provides the management method of metadata in a kind of distributed file system, its concrete management process is:
One, first system is divided into two parts: meta data server and data server, wherein meta data server provides Metadata Service, and user can obtain file system view by meta data server, obtains bibliographic structure and the listed files of file system; Data server provides data, services, and user's read-write operation provides service by data server;
Two, when system starts, by data server, converge and obtain metadata, data server is reported a small amount of metadata simultaneously, these metadata caches are on meta data server, a meta data server cluster of many machine compositions, metadata in metadata cluster has been preserved a part of metadata separately, and these metadata are not identical;
Three, when user's accesses meta-data cluster, if do not find the metadata that will access in metadata cluster, meta data server just sends broadcast to data server, when data server is received broadcast request, the state of inquiring about oneself, returns to meta data server by data, then returns to user, and be buffered in the internal memory of meta data server, for access next time;
When the internal memory four, taking when metadata cache exceeds threshold value, partial memory is discharged, to control the EMS memory occupation of meta data server.
The structure of invention is take metadata cluster as core, and in metadata cluster, each meta data server has been preserved a part of metadata, does mutually mirror image between meta data server, to tackle the fault of meta data server.
System is Preservation Metadata file not, metadata is by dynamically generating, when user uses file system, can not access all files, according to principle of locality, meta data server there is no need to preserve all metadata, as long as preserve part, uses, when the metadata of access does not exist, by broadcasting to data server request.Data server Preservation Metadata, metadata is shared on all data servers, system loading is shared on all nodes, system there is better retractility.
When data server starts, sweep test metadata, sends on meta data server the quick startup of completion system, then start thread, the disk that scanning is managed, reads metadata information in internal memory, with the broadcast request of quick response element data server.Meta-data preservation is in the extended attribute of local file system, together with deposit data, to guarantee consistance and the security of data.
Embodiment: as shown in Figure 1, the entirety of this file system forms meta data server cluster by 4 meta data servers and 5 data servers form.Metadata cluster has been shared the meta-data preservation work of dirA and dirB, and backups each other between two.
5 data servers have respectively been stored 100,000 files, when data server starts, managed disk is carried out to range traversal, only travel through two-stage catalogue, finish traversal, traversing result is sent to the startup that meta data server is completion system, if file directory level is more shallow, all concentrate on a two-stage, travel through the regular hour stop traversal, on the meta data server that metadata is sent, with the start-up time of control system.
Metadata arrives after meta data server cluster, metadata cluster is deposited metadata groupings, such as having two catalogue dirA and dirB, meta data server 1 and 2 is deposited dirA, meta data server 3 and 4 is deposited dirB, 1 and 2,3 and 4 backup each other, to tackle the damage of metadata.Meta data server cluster is shared the storage of bibliographic structure, effectively reduces the load of single meta data server, can support more massive application.When the metadata of user's access does not exist in meta data server, meta data server sends broadcast to data server, request metadata, and data server returns to metadata, then returns to user.
After data server starts, start thread, the disk that scanning is managed, meta-data preservation, in the extended attribute of local file system, is read out and is then kept in internal memory, with the broadcast request of quick response element data server.
User understands the state of metadata cluster, know metadata cluster how many nodes consists of, each node is safeguarded the metadata of which catalogue, mirror between node, thereby ask suitable meta data server, and can automatically switch in the meta data server of mirror image when the meta data server of request breaks down.
The internal memory taking when the metadata of meta data server buffer memory exceeds after threshold value, and meta data server discharges least conventional metadata, to control the use to internal memory.
The foregoing is only embodiments of the invention, within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (4)

1. a management method for metadata in distributed file system, is characterized in that its concrete management process is:
One, first system is divided into two parts: meta data server and data server, wherein meta data server provides Metadata Service, and user can obtain file system view by meta data server, obtains bibliographic structure and the listed files of file system; Data server provides data, services, and user's read-write operation provides service by data server;
Two, when system starts, by data server, converge and obtain metadata, data server is reported a small amount of metadata simultaneously, these metadata caches are on meta data server, a meta data server cluster of many machine compositions, metadata in metadata cluster has been preserved a part of metadata separately, and these metadata are not identical;
Three, when user's accesses meta-data cluster, if do not find the metadata that will access in metadata cluster, meta data server just sends broadcast to data server, when data server is received broadcast request, the state of inquiring about oneself, returns to meta data server by data, then returns to user, and be buffered in the internal memory of meta data server, for access next time;
When the internal memory four, taking when metadata cache exceeds threshold value, partial memory is discharged, to control the EMS memory occupation of meta data server.
2. the management method of metadata in a kind of distributed file system according to claim 1, it is characterized in that: save data and metadata on described data server, in the extended attribute of meta-data preservation in local file system, when data server starts, sweep test metadata sends to meta data server, and completion system starts, then start thread, the disk of scanning management, by meta-data preservation in internal memory, with the broadcast request of quick response element data server.
3. the management method of metadata in a kind of distributed file system according to claim 2, it is characterized in that: the detailed process of described step 2 is: when data server starts, managed disk is carried out to range traversal, only travel through two-stage catalogue, finish traversal, traversing result is sent to the startup that meta data server is completion system, if file directory level is more shallow, all concentrate on a two-stage, travel through the regular hour stop traversal, on the meta data server that metadata is sent, with the start-up time of control system, metadata arrives after meta data server cluster, metadata cluster deposits and shares the storage of bibliographic structure to metadata groupings.
4. the management method of metadata in a kind of distributed file system according to claim 2, it is characterized in that: when the internal memory that the metadata of the meta data server buffer memory in described step 4 takies exceeds threshold value, the internal memory that meta data server discharges is the metadata of least commonly using.
CN201410009764.7A 2014-01-09 2014-01-09 The management method of metadata in a kind of distributed file system Active CN103761275B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410009764.7A CN103761275B (en) 2014-01-09 2014-01-09 The management method of metadata in a kind of distributed file system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410009764.7A CN103761275B (en) 2014-01-09 2014-01-09 The management method of metadata in a kind of distributed file system

Publications (2)

Publication Number Publication Date
CN103761275A true CN103761275A (en) 2014-04-30
CN103761275B CN103761275B (en) 2018-04-13

Family

ID=50528512

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410009764.7A Active CN103761275B (en) 2014-01-09 2014-01-09 The management method of metadata in a kind of distributed file system

Country Status (1)

Country Link
CN (1) CN103761275B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105095495A (en) * 2015-08-21 2015-11-25 浪潮(北京)电子信息产业有限公司 Distributed file system cache management method and system
CN105426126A (en) * 2015-11-11 2016-03-23 浪潮(北京)电子信息产业有限公司 Method and device for constructing multi-channel constant rate IO (Input Output) cache of cloud storage client
CN107015989A (en) * 2016-01-27 2017-08-04 博雅网络游戏开发(深圳)有限公司 Data processing method and device
CN107633070A (en) * 2017-09-22 2018-01-26 郑州云海信息技术有限公司 Balance Control Scheme method, apparatus and storage medium without the MDS of configuration
CN107665253A (en) * 2017-09-22 2018-02-06 郑州云海信息技术有限公司 Configurable MDS balance control method, device and storage medium
CN108287664A (en) * 2018-01-02 2018-07-17 江苏科海智能系统有限公司 A kind of fast large based on NVM storage devices is according to system and its design method
CN109189480A (en) * 2018-07-02 2019-01-11 新华三技术有限公司成都分公司 File system starts method and device
CN109582649A (en) * 2018-12-03 2019-04-05 郑州云海信息技术有限公司 A kind of metadata storing method, device, equipment and readable storage medium storing program for executing
CN109583222A (en) * 2018-12-07 2019-04-05 浪潮(北京)电子信息产业有限公司 Distributed file system metadata service device recycles client jurisdiction method and device
CN109976667A (en) * 2016-01-26 2019-07-05 华为技术有限公司 A kind of mirror image management method, apparatus and system
CN110263061A (en) * 2019-06-17 2019-09-20 郑州阿帕斯科技有限公司 A kind of data query method and system
CN111078593A (en) * 2019-12-29 2020-04-28 浪潮电子信息产业股份有限公司 Object metadata access method, system and related device
CN114153374A (en) * 2021-08-04 2022-03-08 北京天德科技有限公司 Distributed storage system for storing metadata and data together

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101697526A (en) * 2009-10-10 2010-04-21 中国科学技术大学 Method and system for load balancing of metadata management in distributed file system
CN101997884A (en) * 2009-08-18 2011-03-30 升东网络科技发展(上海)有限公司 Distributed storage system and method
US20110191691A1 (en) * 2010-01-29 2011-08-04 Spears Joseph L Systems and Methods for Dynamic Generation and Management of Ancillary Media Content Alternatives in Content Management Systems
CN102546751A (en) * 2011-12-06 2012-07-04 华中科技大学 Hierarchical metadata cache control method of distributed file system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101997884A (en) * 2009-08-18 2011-03-30 升东网络科技发展(上海)有限公司 Distributed storage system and method
CN101697526A (en) * 2009-10-10 2010-04-21 中国科学技术大学 Method and system for load balancing of metadata management in distributed file system
US20110191691A1 (en) * 2010-01-29 2011-08-04 Spears Joseph L Systems and Methods for Dynamic Generation and Management of Ancillary Media Content Alternatives in Content Management Systems
CN102546751A (en) * 2011-12-06 2012-07-04 华中科技大学 Hierarchical metadata cache control method of distributed file system

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105095495B (en) * 2015-08-21 2019-01-25 浪潮(北京)电子信息产业有限公司 A kind of distributed file system buffer memory management method and system
CN105095495A (en) * 2015-08-21 2015-11-25 浪潮(北京)电子信息产业有限公司 Distributed file system cache management method and system
CN105426126A (en) * 2015-11-11 2016-03-23 浪潮(北京)电子信息产业有限公司 Method and device for constructing multi-channel constant rate IO (Input Output) cache of cloud storage client
CN105426126B (en) * 2015-11-11 2018-06-05 浪潮(北京)电子信息产业有限公司 The construction method and device of cloud storage client multichannel constant rate of speed IO cachings
CN109976667A (en) * 2016-01-26 2019-07-05 华为技术有限公司 A kind of mirror image management method, apparatus and system
CN109976667B (en) * 2016-01-26 2021-09-14 华为技术有限公司 Mirror image management method, device and system
CN107015989A (en) * 2016-01-27 2017-08-04 博雅网络游戏开发(深圳)有限公司 Data processing method and device
CN107633070A (en) * 2017-09-22 2018-01-26 郑州云海信息技术有限公司 Balance Control Scheme method, apparatus and storage medium without the MDS of configuration
CN107665253A (en) * 2017-09-22 2018-02-06 郑州云海信息技术有限公司 Configurable MDS balance control method, device and storage medium
CN107665253B (en) * 2017-09-22 2022-02-18 郑州云海信息技术有限公司 Configurable MDS balance control method and device and storage medium
CN108287664A (en) * 2018-01-02 2018-07-17 江苏科海智能系统有限公司 A kind of fast large based on NVM storage devices is according to system and its design method
CN109189480B (en) * 2018-07-02 2021-11-09 新华三技术有限公司成都分公司 File system starting method and device
CN109189480A (en) * 2018-07-02 2019-01-11 新华三技术有限公司成都分公司 File system starts method and device
CN109582649A (en) * 2018-12-03 2019-04-05 郑州云海信息技术有限公司 A kind of metadata storing method, device, equipment and readable storage medium storing program for executing
CN109582649B (en) * 2018-12-03 2022-02-18 郑州云海信息技术有限公司 Metadata storage method, device and equipment and readable storage medium
CN109583222A (en) * 2018-12-07 2019-04-05 浪潮(北京)电子信息产业有限公司 Distributed file system metadata service device recycles client jurisdiction method and device
CN110263061A (en) * 2019-06-17 2019-09-20 郑州阿帕斯科技有限公司 A kind of data query method and system
CN111078593A (en) * 2019-12-29 2020-04-28 浪潮电子信息产业股份有限公司 Object metadata access method, system and related device
CN111078593B (en) * 2019-12-29 2022-08-19 浪潮电子信息产业股份有限公司 Object metadata access method, system and related device
CN114153374A (en) * 2021-08-04 2022-03-08 北京天德科技有限公司 Distributed storage system for storing metadata and data together
CN114153374B (en) * 2021-08-04 2022-06-28 北京天德科技有限公司 Distributed storage system for jointly storing metadata and data

Also Published As

Publication number Publication date
CN103761275B (en) 2018-04-13

Similar Documents

Publication Publication Date Title
CN103761275A (en) Management method for metadata in distributed file system
US11301154B2 (en) Distributed storage method and device
CN107169083B (en) Mass vehicle data storage and retrieval method and device for public security card port and electronic equipment
WO2017167171A1 (en) Data operation method, server, and storage system
EP2863310B1 (en) Data processing method and apparatus, and shared storage device
US20160212206A1 (en) Deterministic database system and data transferring method thereof
US11245774B2 (en) Cache storage for streaming data
US10191915B2 (en) Information processing system and data synchronization control scheme thereof
CN104243425A (en) Content management method, device and system in content delivery network
CN106202112A (en) CACHE DIRECTORY method for refreshing and device
CN103518364A (en) Data update method for distributed storage system and server
CN102710763B (en) The method and system of a kind of distributed caching pond, burst and Failure Transfer
US9262511B2 (en) System and method for indexing streams containing unstructured text data
CN105159845A (en) Memory reading method
US20120278429A1 (en) Cluster system, synchronization controlling method, server, and synchronization controlling program
WO2016095149A1 (en) Data compression and storage method and device, and distributed file system
CN106326239A (en) Distributed file system and file meta-information management method thereof
CN104516974A (en) Management method and device for file system directory entry
CN108540510B (en) Cloud host creation method and device and cloud service system
US20230409527A1 (en) Method And System For Deleting Obsolete Files From A File System
CN103501319A (en) Low-delay distributed storage system for small files
CN102984256A (en) Processing method and system for metadata based on authorization manner
CN103365987A (en) Clustered database system and data processing method based on shared-disk framework
CN109165078B (en) Virtual distributed server and access method thereof
JP2012018607A (en) Distributed cache system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20180313

Address after: 510640 Guangdong city of Guangzhou province Whampoa Tianhe District Road No. 163 Xiping Yunlu 9 layer A tower by 01 unit

Applicant after: Guangdong Inspur Big Data Research Co., Ltd.

Address before: 250014 Shandong Province, Ji'nan City hi tech Development Zone, Nga Road No. 1036

Applicant before: Langchao Electronic Information Industry Co., Ltd.

GR01 Patent grant
GR01 Patent grant