CN106599308A - Distributed metadata management method and system - Google Patents

Publication number
CN106599308A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611247844.1A
Other languages
Chinese (zh)
Other versions
CN106599308B (en
Inventor
郭晓凤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongke Tianji (Xinjiang) Aerospace Information Co.,Ltd.
Original Assignee
郭晓凤
Application filed by 郭晓凤
Priority to CN201611247844.1A
Publication of CN106599308A
Application granted
Publication of CN106599308B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10: File systems; File servers
    • G06F16/16: File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G06F16/164: File meta data generation
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10: File systems; File servers
    • G06F16/17: Details of further file system functions
    • G06F16/1727: Details of free space management performed by the file system
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10: File systems; File servers
    • G06F16/18: File system types
    • G06F16/182: Distributed file systems
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00: Arrangements for program control, e.g. control units
    • G06F9/06: Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46: Multiprogramming arrangements
    • G06F9/50: Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083: Techniques for rebalancing the load in a distributed system

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a distributed metadata management method and system, and relates to the technical field of metadata management. The method and system combine two strategies: static load balancing when metadata is distributed, and dynamic load balancing while the system is running. Together these raise the utilization of metadata service resources, keep the system load balanced, and improve the scalability of the system. In addition, a delayed metadata movement scheme based on a directory redirection table avoids the large-scale metadata movement that a rename operation would otherwise cause, which keeps system efficiency stable.

Description

A distributed metadata management method and system
Technical field
The present invention relates to the technical field of metadata management, and in particular to a distributed metadata management method and system.
Background
At present, faced with ever-growing volumes of data, existing storage approaches are increasingly unable to meet demand because of performance and price constraints. The market needs data storage systems that offer large capacity, scalability, security, and high availability, and distributed storage has emerged in response to this demand.
To manage the storage nodes of a distributed storage system effectively, a distributed file system usually stores metadata and data separately, according to the storage and access characteristics of each. The metadata storage system is the bridge between users and the data storage servers, so efficient metadata management is essential to the performance and scalability of a distributed storage system, and distributed metadata management has become an important research topic. Existing metadata management strategies suffer from load imbalance, from large-scale metadata movement caused by rename operations, and from poor scalability of the metadata management system.
Summary of the invention
The object of the present invention is to provide a distributed metadata management method and system that solve the foregoing problems in the prior art.
To achieve this object, the present invention adopts the following technical solution:
A distributed metadata management method, comprising a static load balancing method for metadata and a dynamic load balancing method for metadata.
The static load balancing method is: distribute the metadata to metadata server nodes using a consistent hash function with virtual nodes together with a metadata server list. The metadata server list is a table that records the mapping from every virtual node to a metadata server, and each metadata server node stores a list of the virtual nodes held on that node.
The dynamic load balancing method is: migrate part of the metadata from overloaded nodes to underloaded nodes.
Preferably, the static load balancing method comprises the following steps:
A1. After the system starts, the metadata server manager generates the metadata server list from the information of each metadata server and the list-entry configuration information.
A2. Using the consistent hash function on the full path of a file, find the entry in the metadata server list and, from it, the corresponding target metadata server.
A3. According to the list of virtual nodes stored on that metadata server node, add the metadata information to the virtual node on the target metadata server.
Preferably, the number of entries that each metadata server occupies in the metadata server list is calculated using the following function:
where U_i is the number of times the i-th metadata server appears in the list, C is the number of entries in the list, and N is the total number of metadata servers.
Preferably, the consistent hash function is:
NameNode_Locator = Hash(f) mod NNT_Length,
where NameNode_Locator is the selected entry in the metadata server list, f is the full path name of the file, and NNT_Length is the total number of entries in the metadata server list.
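The locator formula can be sketched in Python as follows. The patent text does not name the hash function, so MD5 from the standard library is used here purely as an illustrative assumption, and the function name `namenode_locator` is hypothetical.

```python
import hashlib


def namenode_locator(full_path: str, nnt_length: int) -> int:
    """Return the index of the metadata server list entry for a file path.

    Implements NameNode_Locator = Hash(f) mod NNT_Length, with MD5 standing
    in for the unspecified Hash (an assumption, not the patent's choice).
    """
    if nnt_length <= 0:
        raise ValueError("nnt_length must be positive")
    digest = hashlib.md5(full_path.encode("utf-8")).digest()
    # Interpret the 16-byte digest as one large integer, then reduce it.
    return int.from_bytes(digest, "big") % nnt_length


index = namenode_locator("/data/logs/app.log", 7)
```

A stable digest is used instead of Python's built-in `hash`, because the built-in is randomized per process for strings and every client must compute the same entry for the same path.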
Preferably, the dynamic load balancing method comprises the following steps:
B1. Each metadata server periodically collects its load information and sends it to the metadata server manager.
B2. The metadata server manager periodically calculates the load balance degree of each metadata server. If the load balance degree of a metadata server exceeds the set threshold, that server is an overloaded node; if it falls short of the set threshold, that server is an underloaded node.
B3. The metadata server manager moves part of the metadata from the overloaded node to the underloaded node.
B4. The overloaded node and the underloaded node update their load information and send it to the metadata server manager.
Preferably, the load balance degree of the metadata server is calculated using the following formula:
$T_i = \eta_1 d_i + \eta_2 m_i$,
where
j_i is the load balance degree of node i at time t;
w_i is the load index of the i-th metadata server node at time t;
N is the number of metadata servers;
$\eta_1 + \eta_2 = 1$;
T_i is the load index of entry i in the metadata server list at time t, with n entries in total;
d_i is the operation response delay of entry i in the metadata server list at time t;
m_i is the number of metadata items in entry i of the metadata server list at time t.
Preferably, the dynamic load balancing method further comprises the step of:
calculating the overall load degree of the system and, if it exceeds the set threshold, adding metadata server nodes to the system. The overall load degree of the system is calculated using the following function:
where
E is the load index of the system;
N is the number of metadata server nodes;
w_i is the load index of the i-th metadata server node at time t.
Preferably, the method further comprises using a directory redirection table to delay metadata movement, which solves the metadata local consistency problem. Specifically:
A directory path redirection table is maintained on each metadata server. It stores the directory entries whose metadata information is not on the current metadata server.
Each entry in the directory path redirection table is a key-value pair <Hash(directory path), virtual node>. The key is the hash value of the directory path after renaming, and the value is the current storage location of the metadata that needs to be moved.
A distributed metadata management system, comprising a metadata server manager and metadata servers. The metadata server manager comprises a metadata server list maintenance module, a metadata server selection module, and a load balancing module. Each metadata server comprises a metadata processing module and a load measurement module.
The metadata server list maintenance module maintains the correct correspondence between virtual nodes and metadata server nodes.
The metadata server selection module completes the random distribution of metadata.
The load balancing module receives the load information of each metadata server, calculates the system load value, ranks the metadata servers by load, and moves metadata when the system load is unbalanced or the metadata server cluster needs to be adjusted.
The load measurement module collects the load information on the current server, calculates the load of each virtual node, derives from these the load of the current server, and sends the load information to the metadata server manager.
The metadata processing module comprises a read module, a write module, and a modify module. The read module obtains metadata and the write module stores metadata. The modify module handles metadata after a rename operation and maintains a directory redirection table, which stores the directory entries whose metadata information is not on the current metadata server.
Preferably, the system further comprises backup servers: a backup server for the metadata server manager and a backup server for the metadata servers. The backup server of the metadata server manager takes over the manager's work when the manager fails and restores its data. The backup server of a metadata server restores that node's data when the metadata server node fails.
The beneficial effects of the invention are as follows. The distributed metadata management method and system provided by the embodiments of the present invention combine static load balancing during metadata distribution with dynamic load balancing during system operation, which raises the utilization of metadata service resources, keeps the system load balanced, and improves the scalability of the system. In addition, the delayed metadata movement scheme based on the directory redirection table solves the problem of large-scale metadata movement caused by rename operations and keeps system efficiency stable.
Description of the drawings
Fig. 1 is an architecture diagram of a distributed file system containing the metadata management system of the invention;
Fig. 2 is an example of the NameNode list in the metadata management system provided by an embodiment of the invention;
Fig. 3 is the metadata read flow provided by an embodiment of the invention;
Fig. 4 is a diagram of the reliability assurance strategy of the metadata management system provided by an embodiment of the invention.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings. It should be understood that the specific embodiments described here only explain the invention and do not limit it.
Embodiment one
This embodiment of the invention provides a distributed metadata management method comprising a static load balancing method for metadata and a dynamic load balancing method for metadata.
The static load balancing method is: distribute the metadata to metadata server nodes using a consistent hash function with virtual nodes together with a metadata server list. The metadata server list is a table that records the mapping from every virtual node to a metadata server, and each metadata server node stores a list of the virtual nodes held on that node.
The dynamic load balancing method is: migrate part of the metadata from overloaded nodes to underloaded nodes.
In this embodiment, the architecture of the distributed file system that contains the distributed metadata management system is shown in Fig. 1. As Fig. 1 shows, the overall architecture comprises four parts:
the data storage server DN (DataNode), which is the storage node for application data and stores the data blocks produced by splitting files;
the metadata server NN (NameNode), which is the node that answers and updates metadata requests. It maintains the global namespace, including file and folder attributes, and it maintains the namespace tree and the mapping from the data blocks of each file to DNs. A cluster has one or more NNs;
the user Client, which supports reading, writing, and deleting files and creating and deleting directories in the file system. The Client exchanges control information (metadata) with NNs and data streams (application data) with DNs;
the NameNode management node NNManager, which periodically collects the status information of each NN and maintains the NameNode lists.
The tables involved are the following. The NameNode list NNT stores the NameNodes. The NameNode Personal list (NNP) stores the entry information belonging to the NN on which it resides. NNManager maintains and updates both NNT and NNP. The directory redirection table DPRT lists the directory information whose metadata is not on the current metadata server; one DPRT is maintained on each NN.
In the above method, when metadata is initially distributed, it is assigned to metadata server nodes by a consistent hash function optimized with virtual nodes, which ensures load balance at static distribution time.
As the system runs, the metadata server nodes may become unbalanced. Metadata migration then moves part of the metadata from overloaded nodes to underloaded nodes, balancing the load across the metadata server nodes. When the system stores enough metadata, the overall load degree of the system may exceed its threshold; adding metadata server nodes to the system then reduces the system load.
Thus, by combining static load balancing during metadata distribution with dynamic load balancing during system operation, this embodiment raises the utilization of metadata service resources, keeps the system load balanced, and improves the scalability of the system.
In a preferred embodiment of the invention, the static load balancing method comprises the following steps:
A1. After the system starts, the metadata server manager generates the metadata server list from the information of each metadata server and the list-entry configuration information.
A2. Using the consistent hash function on the full path of a file, find the entry in the metadata server list and, from it, the corresponding target metadata server.
A3. According to the list of virtual nodes stored on that metadata server node, add the metadata information to the virtual node on the target metadata server.
The metadata server manager maintains a metadata server list (referred to as the NameNode list or NNT), a table that records the mapping from every virtual node to a metadata server. After the system starts, the number of entries in the table, that is, the number of virtual nodes, stays constant. To make load adjustment among metadata servers more flexible and finer-grained, the number of entries should be sufficiently large within a reasonable range.
The number of entries that each metadata server occupies in the metadata server list is calculated using the following function:
where U_i is the number of times the i-th metadata server appears in the list, C is the number of entries in the list, and N is the total number of metadata servers.
Fig. 2 shows an example of a metadata server list. The list has 7 entries, which correspond to 4 metadata servers: A, B, C, and D.
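To make the shape of such a list concrete, here is a minimal Python sketch that builds a 7-entry list over servers A to D. The function that fixes each server's entry count is not reproduced in this text, so the sketch assumes the simplest even split (round-robin assignment); the name `build_nnt` is hypothetical, and the actual assignment shown in Fig. 2 may differ.

```python
from collections import Counter


def build_nnt(servers, entry_count):
    """Build a metadata server list (NNT) of entry_count virtual nodes.

    Assumption: entries are spread round-robin, so per-server counts differ
    by at most one. The patent's own allocation function is not shown here.
    """
    if not servers or entry_count < len(servers):
        raise ValueError("need at least one entry per server")
    return [servers[i % len(servers)] for i in range(entry_count)]


nnt = build_nnt(["A", "B", "C", "D"], 7)
counts = Counter(nnt)  # how many entries each server occupies
```

The list index plays the role of the virtual node number, so reassigning one index to a different server is the unit of later load adjustment.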
In practice, the above method can be implemented as follows:
The client obtains the NameNode list from the metadata server manager on its first access. Afterwards, whenever the NameNode list changes while the system is running, the metadata server manager sends the latest metadata server list to the client. When the client reads a file, it computes the number of the virtual node that stores the metadata from the hash value of the file's full path name, and then looks up in the NameNode list which metadata server stores the metadata.
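The client-side behaviour described above can be sketched as a small class that caches the list and resolves paths locally. The class and method names are hypothetical, MD5 is an assumed stand-in for the unspecified hash, and the network exchange with the manager is reduced to a plain method call.

```python
import hashlib


class Client:
    """Caches the NameNode list and resolves file paths to metadata servers."""

    def __init__(self, nnt):
        # Copy of the list obtained from the manager on first access.
        self.nnt = list(nnt)

    def on_nnt_update(self, new_nnt):
        # Called when the manager pushes a changed list.
        self.nnt = list(new_nnt)

    def locate(self, full_path):
        """Return (virtual node number, metadata server) for a file path."""
        digest = hashlib.md5(full_path.encode("utf-8")).digest()
        virtual_node = int.from_bytes(digest, "big") % len(self.nnt)
        return virtual_node, self.nnt[virtual_node]


client = Client(["A", "B", "C", "D", "A", "B", "C"])
vnode, server = client.locate("/home/user/report.txt")
```

Because the list length stays fixed after startup, a path keeps the same virtual node number across list updates; only the server behind that number can change.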
In a preferred embodiment of the invention, the consistent hash function is:
NameNode_Locator = Hash(f) mod NNT_Length,
where NameNode_Locator is the selected entry in the metadata server list, f is the full path name of the file, and NNT_Length is the total number of entries in the metadata server list.
In a preferred embodiment of the invention, the dynamic load balancing method comprises the following steps:
B1. Each metadata server periodically collects its load information and sends it to the metadata server manager.
B2. The metadata server manager periodically calculates the load balance degree of each metadata server. If the load balance degree of a metadata server exceeds the set threshold, that server is an overloaded node; if it falls short of the set threshold, that server is an underloaded node.
B3. The metadata server manager moves part of the metadata from the overloaded node to the underloaded node.
B4. The overloaded node and the underloaded node update their load information and send it to the metadata server manager.
The load balance degree of the metadata server is calculated using the following formula:
$T_i = \eta_1 d_i + \eta_2 m_i$,
where
j_i is the load balance degree of node i at time t;
w_i is the load index of the i-th metadata server node at time t;
N is the number of metadata servers;
$\eta_1 + \eta_2 = 1$;
T_i is the load index of entry i in the metadata server list at time t, with n entries in total;
d_i is the operation response delay of entry i in the metadata server list at time t;
m_i is the number of metadata items in entry i of the metadata server list at time t.
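As a worked example, the per-entry load index can be read as the weighted sum T_i = η1·d_i + η2·m_i with η1 + η2 = 1. The Python sketch below assumes that reading; the function name, the default weight, and the sample values are illustrative, and it assumes the delay and the count have already been scaled to comparable ranges, which the text does not specify.

```python
def entry_load_index(delay, item_count, eta1=0.5):
    """Weighted load index of one list entry: eta1 * d_i + eta2 * m_i.

    eta2 is derived as 1 - eta1 so that the two weights always sum to 1.
    """
    if not 0.0 <= eta1 <= 1.0:
        raise ValueError("eta1 must lie in [0, 1]")
    eta2 = 1.0 - eta1
    return eta1 * delay + eta2 * item_count


# Equal weights: 0.5 * 4.0 + 0.5 * 10 = 7.0
balanced = entry_load_index(4.0, 10, eta1=0.5)
# Count-heavy weights: 0.25 * 2.0 + 0.75 * 8 = 6.5
count_heavy = entry_load_index(2.0, 8, eta1=0.25)
```

Raising η1 makes the index more sensitive to response delay, while lowering it emphasises how much metadata the entry holds.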
In this embodiment, the dynamic load balancing method further comprises the step of:
calculating the overall load degree of the system and, if it exceeds the set threshold, adding metadata server nodes to the system. The overall load degree of the system is calculated using the following function:
where
E is the load index of the system;
N is the number of metadata server nodes;
w_i is the load index of the i-th metadata server node at time t.
In this embodiment, dynamic load balancing of metadata covers two cases.
First, when the load of a metadata server node exceeds the threshold set by the system, the most heavily loaded virtual node is selected on the most heavily loaded metadata server, and the metadata information on it is moved to the most lightly loaded metadata server. Second, when the load of the whole system exceeds the set threshold, the metadata server cluster at its current scale can no longer meet the system's needs, so metadata servers are added and the load is then rebalanced following the first case. After the adjustment, the metadata server manager updates the NameNode list and sends the latest list to the clients, the metadata servers, and the data storage servers.
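The first case, moving the heaviest virtual node from the heaviest server to the lightest one, can be sketched as below. The sketch assumes that a server's load is the sum of its virtual node loads (the text only says the server load is derived from them) and leaves out the threshold test, because the balance-degree formula is not reproduced here. All names are hypothetical.

```python
def pick_migration(server_loads):
    """Choose (virtual_node, source, target) for one migration step.

    server_loads maps server name -> {virtual node number: load}.
    Returns None when there is nothing sensible to move.
    """
    totals = {s: sum(v.values()) for s, v in server_loads.items()}
    source = max(totals, key=totals.get)  # most heavily loaded server
    target = min(totals, key=totals.get)  # most lightly loaded server
    if source == target or not server_loads[source]:
        return None
    vnode = max(server_loads[source], key=server_loads[source].get)
    return vnode, source, target


def migrate(server_loads, nnt):
    """Apply one migration and update the NameNode list in place."""
    plan = pick_migration(server_loads)
    if plan is None:
        return None
    vnode, source, target = plan
    server_loads[target][vnode] = server_loads[source].pop(vnode)
    nnt[vnode] = target  # the manager would then publish the new list
    return plan


loads = {"A": {0: 5.0, 4: 9.0}, "B": {1: 1.0, 3: 0.5}, "C": {2: 3.0}}
nnt = ["A", "B", "C", "B", "A"]
plan = migrate(loads, nnt)
```

Moving a whole virtual node keeps the hash-to-entry mapping untouched, so clients only need the refreshed list to find the metadata again.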
The distributed metadata management method provided by this embodiment may further comprise using a directory redirection table to delay metadata movement, which solves the metadata local consistency problem. Specifically:
A directory path redirection table is maintained on each metadata server. It stores the directory entries whose metadata information is not on the current metadata server.
Each entry in the directory path redirection table is a key-value pair <Hash(directory path), virtual node>. The key is the hash value of the directory path after renaming, and the value is the current storage location of the metadata that needs to be moved.
The directory redirection table is denoted DPRT.
As shown in Fig. 3, the above method can be carried out as follows:
When a client accesses a file, it computes the number of the virtual node that stores the metadata from the hash value of the file's full path name, and looks up in the metadata server list which server stores the metadata; this is the target metadata server. Because the system uses delayed metadata movement, the metadata being accessed may not yet be on the target server. So, when querying the target metadata information, the target metadata server first checks whether the DPRT it maintains has an entry for the hash value of the file's full path name. If it does, the metadata is not on the target metadata server: the query goes to the server where the metadata currently resides, the metadata information is then moved to the target metadata server, and the corresponding entry is deleted from the DPRT. If it does not, the target metadata information is queried directly on the target metadata server.
In this method, metadata is not moved at the moment a directory or file name is changed; it is moved only when it is accessed. Deferring movement in this way keeps system throughput stable when a rename would otherwise trigger large-scale metadata movement.
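The delayed-move read path can be illustrated with a small in-memory model. This is a sketch under several assumptions: the DPRT value is simplified to a reference to the server that currently holds the metadata (the text stores a virtual node), keys are hashes of full paths, MD5 stands in for the unspecified hash, and all class and function names are hypothetical.

```python
import hashlib


def path_hash(path):
    return hashlib.md5(path.encode("utf-8")).hexdigest()


class MetadataServer:
    def __init__(self, name):
        self.name = name
        self.store = {}  # path hash -> metadata held on this server
        self.dprt = {}   # path hash -> server that still holds the metadata

    def read(self, path):
        key = path_hash(path)
        holder = self.dprt.pop(key, None)
        if holder is not None:
            # Redirect entry found: fetch from the current holder, move the
            # metadata here, and drop the DPRT entry (already popped above).
            self.store[key] = holder.store.pop(key)
        return self.store[key]


def rename(old_path, new_path, old_target, new_target):
    """Record a redirect at rename time instead of moving the metadata."""
    meta = old_target.store.pop(path_hash(old_path))
    new_key = path_hash(new_path)
    if old_target is new_target:
        new_target.store[new_key] = meta
        return
    old_target.store[new_key] = meta       # metadata stays put, rekeyed
    new_target.dprt[new_key] = old_target  # new target learns where it is


a, b = MetadataServer("A"), MetadataServer("B")
a.store[path_hash("/d/f")] = {"size": 1}
rename("/d/f", "/e/f", a, b)
deferred = path_hash("/e/f") in b.dprt and path_hash("/e/f") not in b.store
first = b.read("/e/f")
```

The rename itself touches only one table entry, and the cost of the actual move is paid once, by the first reader of the renamed path.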
Embodiment two
This embodiment of the invention provides a distributed metadata management system, comprising a metadata server manager and metadata servers. The metadata server manager comprises a metadata server list maintenance module, a metadata server selection module, and a load balancing module. Each metadata server comprises a metadata processing module and a load measurement module.
The metadata server list maintenance module maintains the correct correspondence between virtual nodes and metadata server nodes.
The metadata server selection module completes the random distribution of metadata.
The load balancing module receives the load information of each metadata server, calculates the system load value, ranks the metadata servers by load, and moves metadata when the system load is unbalanced or the metadata server cluster needs to be adjusted.
The load measurement module collects the load information on the current server, calculates the load of each virtual node, derives from these the load of the current server, and sends the load information to the metadata server manager.
The metadata processing module comprises a read module, a write module, and a modify module. The read module obtains metadata and the write module stores metadata. The modify module handles metadata after a rename operation and maintains a directory redirection table, which stores the directory entries whose metadata information is not on the current metadata server.
The management method that the distributed metadata management system of the above structure applies to metadata has been described in detail in embodiment one and is not repeated here.
A distributed metadata management system with this structure provides static load balancing during metadata distribution and dynamic load balancing during system operation, which raise the utilization of metadata service resources, keep the system load balanced, and improve the scalability of the system. In addition, the delayed metadata movement scheme based on the directory redirection table solves the problem of large-scale metadata movement caused by rename operations and keeps system efficiency stable.
The distributed metadata management system provided by this embodiment may further comprise backup servers: a backup server for the metadata server manager and a backup server for the metadata servers. The backup server of the metadata server manager takes over the manager's work when the manager fails and restores its data. The backup server of a metadata server restores that node's data when the metadata server node fails.
The backup servers provide the reliability assurance of the metadata management system. In practice they can work as follows:
The backup server of the metadata server manager, backNNM, uses a redundancy mechanism; the backup server of the metadata servers, backNN, uses a log mechanism.
The primary NNM and backNNM run the same program at the same time and communicate over the network through a read/write module. In normal operation backNNM mainly monitors the primary NNM and analyzes its state through a message processing module. The primary NNM periodically sends heartbeat messages to backNNM to report its own state. If backNNM receives no heartbeat from the primary NNM for more than one period, the primary NNM is considered to have failed; backNNM takes over all of its work, serves the system, and recovers the primary NNM. Once recovered, the primary NNM sends a heartbeat to backNNM to report that it is back to normal and takes all work back from backNNM, which returns to the listening state. This strategy keeps the service uninterrupted. An NN (metadata server) likewise interacts with backNN through a communication module. After the NN and backNN establish a connection, they send and receive data, mainly log files and metadata images. backNN then uses a merge module to combine the log file and the metadata image in memory into a new metadata image file. This strategy may interrupt the service. See Fig. 4 for details.
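The heartbeat rule for the manager's backup can be sketched as a tiny state machine. Timestamps are passed in explicitly so the logic is deterministic; the class name, the method names, and the idea of polling `check` are assumptions made for illustration, and the actual takeover and recovery work is omitted.

```python
class BackupManager:
    """Minimal stand-in for backNNM: watches heartbeats from the primary."""

    def __init__(self, period):
        self.period = period        # expected heartbeat interval, in seconds
        self.last_heartbeat = 0.0   # time of the most recent heartbeat
        self.active = False         # True while serving in place of the primary

    def on_heartbeat(self, now):
        # A heartbeat means the primary is healthy (again): hand work back
        # and return to the listening state.
        self.last_heartbeat = now
        self.active = False

    def check(self, now):
        # More than one period without a heartbeat: treat the primary as
        # failed and take over its work.
        if now - self.last_heartbeat > self.period:
            self.active = True
        return self.active


backup = BackupManager(period=5.0)
backup.on_heartbeat(0.0)
states = [backup.check(4.0), backup.check(5.1)]
backup.on_heartbeat(6.0)
states.append(backup.check(7.0))
```

A real deployment would usually tolerate a few missed heartbeats before failing over, to avoid reacting to brief network delays; the single-period rule here follows the text.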
The technical solution disclosed by the invention achieves the following beneficial effects. The distributed metadata management method and system provided by the embodiments of the present invention combine static load balancing during metadata distribution with dynamic load balancing during system operation, which raises the utilization of metadata service resources, keeps the system load balanced, and improves the scalability of the system. In addition, the delayed metadata movement scheme based on the directory redirection table solves the problem of large-scale metadata movement caused by rename operations and keeps system efficiency stable.
Specifically, distributing metadata with a consistent hash function optimized with virtual nodes achieves static load balancing at distribution time. When the load balance degree of a metadata server node exceeds the set threshold, metadata migration achieves dynamic load balancing. When the overall load of the system exceeds the set threshold, adding metadata server nodes reduces the system load. When a rename operation would cause large-scale metadata movement, the delayed metadata movement scheme based on the directory redirection table keeps system efficiency stable.
The embodiments in this specification are described progressively; each embodiment focuses on its differences from the others, and the parts that are the same or similar can be cross-referenced.
Those skilled in the art should understand that the order of the method steps provided by the above embodiments may be adapted to actual conditions, and that the steps may also be performed concurrently where conditions allow.
All or part of the steps of the methods in the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a storage medium readable by a computer device and used to perform all or part of the steps of the methods in the above embodiments. Examples of the computer device include personal computers, servers, network devices, smart mobile terminals, smart home devices, wearable smart devices, and in-vehicle smart devices. Examples of the storage medium include RAM, ROM, magnetic disks, magnetic tape, optical discs, flash memory, USB flash drives, portable hard drives, memory cards, memory sticks, network server storage, and network cloud storage.
Finally, it should be noted that relational terms such as first and second are used here only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "comprise", "include", and any other variant are intended to cover non-exclusive inclusion, so that a process, method, article, or device that comprises a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to that process, method, article, or device. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that comprises the element.
The above are only preferred embodiments of the present invention. It should be noted that those of ordinary skill in the art can make improvements and modifications without departing from the principles of the invention, and such improvements and modifications shall also be regarded as falling within the scope of protection of the present invention.

Claims (10)

1. A distributed metadata management method, characterized by comprising: a static load balancing method for metadata and a dynamic load balancing method for metadata;
The static load balancing method for metadata is: using a consistent hash function with virtual nodes and a metadata server list, the metadata is distributed to the metadata server nodes; wherein the metadata server list is a table that records the mapping between all virtual nodes and the metadata servers, and each metadata server node stores a list of the virtual nodes stored on that node;
The dynamic load balancing method for metadata is: by means of metadata migration, part of the metadata is moved from an overloaded node to a lightly loaded node.
2. The distributed metadata management method according to claim 1, characterized in that the static load balancing method for metadata comprises the following steps:
A1, after the system starts, the metadata server manager generates the metadata server list according to the information of each metadata server and the list item configuration information;
A2, according to the full path of a file, the consistent hash function is used to find an item in the metadata server list and thereby the corresponding target metadata server;
A3, according to the list of virtual nodes stored on the metadata server node, the metadata information is added to a virtual node on the target metadata server.
3. The distributed metadata management method according to claim 2, characterized in that the number of items in which each metadata server appears in the metadata server list is calculated using the following function:
wherein U_i represents the number of times the i-th metadata server appears in the list, C represents the number of items in the list, and n represents the total number of metadata servers.
4. The distributed metadata management method according to claim 2, characterized in that the consistent hash function is:
NameNode_Locator = Hash(f) mod NNT_Length,
wherein NameNode_Locator represents the selected item in the metadata server list, f is the full path name of the file, and NNT_Length is the total number of items in the metadata server list.
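The lookup in claims 2 and 4 can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the patent does not name a concrete hash function, so MD5 is used here as a stand-in for Hash(f); the helper names `build_server_list`, `name_node_locator`, and `locate_server` are hypothetical; and, because the allocation formula of claim 3 is not reproduced in this text, the list simply gives every server the same number of entries.

```python
import hashlib


def build_server_list(servers, entries_per_server):
    """Build a metadata server list in which each entry (virtual node)
    maps to one metadata server.

    Assumption: every server receives the same number of entries; the
    patent's own allocation function (claim 3) is not reproduced here.
    """
    return [server for _ in range(entries_per_server) for server in servers]


def name_node_locator(full_path, nnt_length):
    """NameNode_Locator = Hash(f) mod NNT_Length.

    Assumption: MD5 stands in for the unspecified Hash function.
    """
    digest = hashlib.md5(full_path.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % nnt_length


def locate_server(full_path, server_list):
    """Return (list index, target metadata server) for a file's full path."""
    index = name_node_locator(full_path, len(server_list))
    return index, server_list[index]
```

Because the path is reduced modulo the list length rather than the server count, a server can be given more or fewer entries in the list without changing the hash function itself.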
5. The distributed metadata management method according to claim 1, characterized in that the dynamic load balancing method for metadata comprises the following steps:
B1, the metadata server periodically collects load information and sends it to the metadata server manager;
B2, the metadata server manager periodically calculates the load balance degree of the metadata server; if the load balance degree of the metadata server exceeds the set threshold, the metadata server is an overloaded node, and if the load balance degree of the metadata server does not reach the set threshold, the metadata server is a lightly loaded node;
B3, the metadata server manager moves part of the metadata from the overloaded node to the lightly loaded node;
B4, the overloaded node and the lightly loaded node update their load information and send it to the metadata server manager.
6. The distributed metadata management method according to claim 5, characterized in that the load balance degree of the metadata server is calculated using the following formulas:
$j_i = w_i - \sum_{k=1}^{n} w_k / n$,
$w_i = \sum_{k=1}^{n} T_i$,
$T_i = \eta_1 d_i + \eta_2 m_i$,
where
$j_i$ is the load balance index of node i at time t;
$w_i$ is the load index of the i-th metadata server node at time t;
n is the number of metadata servers;
$\eta_1 + \eta_2 = 1$;
$T_i$ is the load index at time t of item i in the metadata server list, n items in total;
$d_i$ is the operation response delay at time t of item i in the metadata server list;
$m_i$ is the number of metadata servers for item i in the metadata server list at time t.
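The classification in step B2 and the formulas of claim 6 can be sketched in Python as follows. This is a hedged reading, not the patent's implementation: the function names are hypothetical, the default weights of 0.5 are arbitrary, and the sketch assumes that a server's load $w_i$ is the sum of the per-item loads T of the list items that belong to that server, since the index in the claim's summation is ambiguous.

```python
def entry_load(delay, count, eta1=0.5, eta2=0.5):
    """T = eta1 * d + eta2 * m, with eta1 + eta2 = 1."""
    if abs(eta1 + eta2 - 1.0) > 1e-9:
        raise ValueError("eta1 and eta2 must sum to 1")
    return eta1 * delay + eta2 * count


def server_load(entries, eta1=0.5, eta2=0.5):
    """w_i: sum of T over the (delay, count) pairs of one server's list items.

    Assumption: only the items mapped to this server are summed.
    """
    return sum(entry_load(d, m, eta1, eta2) for d, m in entries)


def balance_degrees(server_loads):
    """j_i = w_i minus the mean of all w_k."""
    mean = sum(server_loads) / len(server_loads)
    return [w - mean for w in server_loads]


def classify(server_loads, threshold):
    """Step B2: above the threshold is overloaded, otherwise lightly loaded."""
    return [
        "overloaded" if j > threshold else "light"
        for j in balance_degrees(server_loads)
    ]
```

The manager would then pick metadata on an "overloaded" server and migrate it to a "light" one (step B3); which metadata to move is not specified in the claims and is left out of the sketch.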
7. The distributed metadata management method according to claim 6, characterized in that the dynamic load balancing method for metadata further comprises the step of:
calculating the overall load degree of the system and, if the overall load degree of the system exceeds the set threshold, adding metadata server nodes to the system; wherein the overall load degree of the system is calculated using the following function:
$E = \sum_{i=1}^{n} w_i / n$,
where
E is the load index of the system;
n is the number of metadata server nodes;
$w_i$ is the load index of the i-th metadata server node at time t.
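The scale-out check of claim 7 reduces to comparing the mean server load with a threshold. A minimal Python sketch, with hypothetical function names and a caller-supplied threshold (the patent does not give a value):

```python
def overall_load(server_loads):
    """E = (sum of w_i over all metadata server nodes) / n."""
    return sum(server_loads) / len(server_loads)


def needs_new_server(server_loads, threshold):
    """True when the overall load degree E exceeds the set threshold."""
    return overall_load(server_loads) > threshold
```

Note that E is the same mean that is subtracted from each $w_i$ in claim 6, so a manager that already computes the balance degrees has this value available at no extra cost.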
8. The distributed metadata management method according to claim 1, characterized by further comprising: a method of delaying metadata movement by means of a directory redirection table, which solves the metadata local consistency problem, specifically:
a directory path redirection table is maintained on each metadata server, and the directory path redirection table is used to store information about metadata that is not on the current metadata server;
each item in the directory path redirection table is a key-value pair <Hash(directory path), virtual node>, where the former is the hash value of the directory path after renaming and the latter is the current storage location of the metadata that needs to be moved.
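The redirection table of claim 8 can be pictured as a small key-value map. The Python sketch below is an illustration only: the class and method names are hypothetical, MD5 is an assumed stand-in for Hash, and the points at which an entry is consulted or removed are assumptions, since the claim defines only the table's contents.

```python
import hashlib


def path_hash(directory_path):
    """Hash of a directory path (assumption: MD5 as the hash function)."""
    return hashlib.md5(directory_path.encode("utf-8")).hexdigest()


class RedirectTable:
    """Per-server table of <Hash(directory path), virtual node> pairs."""

    def __init__(self):
        self._entries = {}

    def record_rename(self, new_directory_path, current_virtual_node):
        """After a rename, remember where the not-yet-moved metadata lives."""
        self._entries[path_hash(new_directory_path)] = current_virtual_node

    def resolve(self, directory_path, hashed_virtual_node):
        """Return the redirect target if one exists, else the hashed location."""
        return self._entries.get(path_hash(directory_path), hashed_virtual_node)

    def complete_move(self, new_directory_path):
        """Drop the entry once the delayed move has actually been performed."""
        self._entries.pop(path_hash(new_directory_path), None)
```

In this reading, a rename costs one table insert instead of an immediate bulk move, and lookups for the renamed directory are served from the old location until the deferred move completes.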
9. A distributed metadata management system, characterized by comprising: a metadata server manager and a metadata server, wherein the metadata server manager comprises a metadata server list maintenance module, a metadata server selection module, and a load balancing module; and the metadata server comprises a metadata processing module and a load measurement module;
the metadata server list maintenance module is responsible for maintaining the correct correspondence between virtual nodes and metadata server nodes;
the metadata server selection module is used to complete the random distribution of metadata;
the load balancing module is used to receive the load information of each metadata server, calculate the system load value, sort the metadata servers by load, and move metadata when the system load is unbalanced or the metadata server cluster needs adjustment;
the load measurement module is responsible for collecting the load information on the current server, calculating the load of each virtual node, calculating the load of the current server from those values, and sending the load information to the metadata server manager;
the metadata processing module comprises a read module, a write module, and a modification module for metadata; the read module is responsible for obtaining metadata, the write module is responsible for storing metadata, and the modification module is responsible for processing metadata after a rename operation and maintains a directory path redirection table, which is used to store information about metadata that is not on the current metadata server.
10. The distributed metadata management system according to claim 9, characterized by further comprising a backup server, wherein the backup server comprises a backup server of the metadata server manager and a backup server of the metadata server; the backup server of the metadata server manager is responsible for taking over the work of the metadata server manager when it fails and for recovering its data; and the backup server of the metadata server is responsible for recovering the data of a metadata server node when that node fails.
CN201611247844.1A 2016-12-29 2016-12-29 distributed metadata management method and system Active CN106599308B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611247844.1A CN106599308B (en) 2016-12-29 2016-12-29 distributed metadata management method and system

Publications (2)

Publication Number Publication Date
CN106599308A true CN106599308A (en) 2017-04-26
CN106599308B CN106599308B (en) 2020-01-31

Family

ID=58604033

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611247844.1A Active CN106599308B (en) 2016-12-29 2016-12-29 distributed metadata management method and system

Country Status (1)

Country Link
CN (1) CN106599308B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101841565A (en) * 2010-04-20 2010-09-22 中国科学院软件研究所 Database cluster system load balancing method and database cluster system
CN102333029A (en) * 2011-06-23 2012-01-25 北京新媒传信科技有限公司 Routing method in server cluster system
CN103354923A (en) * 2012-02-09 2013-10-16 华为技术有限公司 Method, device and system for data reconstruction
CN103179192A (en) * 2013-02-07 2013-06-26 杭州华三通信技术有限公司 Method, system and NAT (network address translation) for forwarding message about virtual server migration
CN106161120A (en) * 2016-10-08 2016-11-23 电子科技大学 The distributed meta-data management method of dynamic equalization load

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108989370B (en) * 2017-05-31 2020-11-06 华为技术有限公司 Data storage method, equipment and system in CDN system
CN108989370A (en) * 2017-05-31 2018-12-11 华为软件技术有限公司 Date storage method, equipment and system in a kind of CDN system
WO2019000949A1 (en) * 2017-06-28 2019-01-03 华为技术有限公司 Metadata storage method and system in distributed storage system, and storage medium
CN109218340A (en) * 2017-06-29 2019-01-15 上海云教信息技术有限公司 A kind of online data citation system
CN107528924A (en) * 2017-10-09 2017-12-29 郑州云海信息技术有限公司 A kind of distributed type assemblies Metadata Service dispositions method and system
CN107704212A (en) * 2017-10-31 2018-02-16 紫光华山信息技术有限公司 A kind of data processing method and device
CN108920613A (en) * 2018-06-28 2018-11-30 郑州云海信息技术有限公司 A kind of metadata management method, system and equipment and storage medium
CN109688187B (en) * 2018-09-07 2022-04-22 平安科技(深圳)有限公司 Flow load balancing method, device, equipment and readable storage medium
CN109688187A (en) * 2018-09-07 2019-04-26 平安科技(深圳)有限公司 Flow load balance method, apparatus, equipment and readable storage medium storing program for executing
CN109407977B (en) * 2018-09-25 2021-08-31 佛山科学技术学院 Big data distributed storage management method and system
CN109407977A (en) * 2018-09-25 2019-03-01 佛山科学技术学院 A kind of big data distributed storage management method and system
CN111078120A (en) * 2018-10-18 2020-04-28 深信服科技股份有限公司 Data migration method and system of distributed file system and related components
CN111078120B (en) * 2018-10-18 2023-11-03 深信服科技股份有限公司 Data migration method and system of distributed file system and related components
CN109726212A (en) * 2018-12-29 2019-05-07 杭州宏杉科技股份有限公司 Data-storage system and method
CN112256438A (en) * 2020-06-28 2021-01-22 腾讯科技(深圳)有限公司 Load balancing control method and device, storage medium and electronic equipment
CN112256438B (en) * 2020-06-28 2021-06-25 腾讯科技(深圳)有限公司 Load balancing control method and device, storage medium and electronic equipment
CN111737017A (en) * 2020-08-20 2020-10-02 北京东方通科技股份有限公司 Distributed metadata management method and system
CN111737017B (en) * 2020-08-20 2020-12-18 北京东方通科技股份有限公司 Distributed metadata management method and system
US20220357998A1 (en) * 2021-05-08 2022-11-10 Dell Products L.P. Multiple metric-based workload balancing between storage resources
CN115510004A (en) * 2022-11-22 2022-12-23 广东省信息安全测评中心 Government affair data resource naming method and management system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right
Effective date of registration: 20200514
Address after: Room 306, zone B, floor 3, Wanhe pharmaceutical company building, No. 8, Gaoxin Middle Road, Maling community, Yuehai street, Nanshan District, Shenzhen City, Guangdong Province
Patentee after: SHENZHEN SKYVISION TECHNOLOGY Co.,Ltd.
Address before: 100089 Beijing city Haidian District Dazhongsi Road No. 9 Beijing Science and technology building B block 117 in a constant cloud
Patentee before: Guo Xiaofeng
TR01 Transfer of patent right
Effective date of registration: 20210910
Address after: 071000 room 1001, unit 1, building 1, century Huating, Hengxiang North Street, Beishi District, Baoding City, Hebei Province
Patentee after: Guo Xiaofeng
Address before: Room 306, block B, 3 / F, Wanhe pharmaceutical company building, No.8, Gaoxin Zhongyi Road, Maling community, Yuehai street, Nanshan District, Shenzhen City, Guangdong Province
Patentee before: SHENZHEN SKYVISION TECHNOLOGY Co.,Ltd.
TR01 Transfer of patent right
Effective date of registration: 20211119
Address after: 834000 No. 4980-1-1, Jinxi Third Street, Baijiantan District, Karamay City, Xinjiang Uygur Autonomous Region - 104
Patentee after: Zhongke Tianji (Xinjiang) Aerospace Information Co.,Ltd.
Address before: 071000 room 1001, unit 1, building 1, century Huating, Hengxiang North Street, Beishi District, Baoding City, Hebei Province
Patentee before: Guo Xiaofeng
TR01 Transfer of patent right