WO2018233332A1

WO2018233332A1 - Distributed storage internal storage management method and system, and computer storage medium

Info

Publication number: WO2018233332A1
Application number: PCT/CN2018/079685
Authority: WO
Inventors: 江汛洋; 梁松涛; 李道兵; 许式伟
Original assignee: 上海七牛信息技术有限公司
Priority date: 2017-06-22
Filing date: 2018-03-20
Publication date: 2018-12-27
Also published as: CN107329695B; CN107329695A

Abstract

A distributed storage internal storage management method and system, and a computer storage medium, the method comprising: dividing an internal storage management module into a plurality of different size data block types (S110); writing a file to a first node, and setting a lease for the file, such that the file is bound to the first node (S120); in the internal storage, corresponding the file to a data block type, dividing the file in the internal storage into one or a plurality of data blocks of the same size, and simultaneously generating data block access context, the data block access context comprising data block navigation and the associated file coding (S130); storing the data blocks and the access context in the first node and updating the version (S140); when a file is written to the second node and the lease of the file is already held by the first node, then the first node receives the file forwarded by the second node, writes the file to the local storage of the first node, and updates the version (S150). The data writing is a data block-based operation, such that the file can support random writing in the distributed storage.

Description

Distributed storage memory management method, system and computer storage medium

Technical field

The present invention relates to the field of storage technologies, and in particular, to a distributed storage memory management method, system, and computer storage medium.

Background technique

Distributed storage systems distribute data across multiple independent devices. The traditional network storage system uses a centralized storage server to store all data. The storage server becomes a bottleneck of system performance, and is also the focus of reliability and security, and cannot meet the needs of large-scale storage applications. The distributed network storage system adopts a scalable system structure, uses multiple storage servers to share the storage load, and uses the location server to locate the storage information, which not only improves the reliability, availability and access efficiency of the system, but also is easy to expand.

In the distributed storage system, the existing memory management method cannot perform high-quality reading and writing operations on files by multiple machines.

Summary of the invention

The technical problem to be solved by the present invention is to provide an efficient distributed storage memory management method, system and computer storage medium.

The object of the present invention is achieved by the following technical solutions:

A distributed storage memory management method includes:

Divide the memory management module into a plurality of data block types of different sizes;

Writing a file at the first node, and setting a lease to the file, so that the file is bound to the first node;

In the memory, the file corresponds to a data block type, and the file is divided into one or more data blocks of the same size in the memory, and an access context of the data block is generated, and the access context includes the data block guide and the file to which it belongs. coding;

Storing the data block and the access context in the first node and updating the version;

When a file is written at the second node, the lease of the file has been held by the first node, the first node receives the file forwarded from the second node and writes the file to the local storage of the first node And update the version.

Further, it also includes:

Reading data from one or more data blocks at the first node;

If the data is in the first node, read it directly;

If the data is in the second node, the data is requested from the second node, and after the data is requested, the data is cached to the first node and the version is updated.

Further, after the requesting the data, caching the data to the first node and updating the version further includes:

After requesting the data, determine the data local storage version and the global version;

Load directly if the data version is the latest version;

If the data version is lower than the global version, the latest version of the data is requested, and the data is cached to the first node after requesting the latest version of the data.

Further, the method further includes: setting a working data block chain and an idle data block chain in the memory management module;

Operate the data block and insert the data block into the working data block chain or the idle data block chain according to the state;

Inserting the data block in the working data block chain into the free data block chain;

A data block is taken from the free blockchain chain and placed in the working data block chain.

Further, it also includes:

The data blocks in the free blockchain are released at preset intervals and at a preset ratio.

Further, the operating on the data block includes:

At least one of writing, reading, releasing, swiping into a disk, synchronizing to a network, deleting, updating a file size, or failing a version of a file in memory.

Further, the method further includes: the memory is read-only into the file part of the data block for operation.

A distributed storage memory management system comprising:

a memory management module for dividing a memory management module into a plurality of data block types of different sizes;

a writing module, configured to write a file at the first node, and set a lease to the file, so that the file is bound to the first node;

a memory processing module, configured to, in memory, correspond to a data block type, and divide the file into one or more data blocks of the same size in memory, and generate a data block access context, where the access context includes a data block Guidance and file encoding;

a storage module, configured to store the data block and the access context in the first node and update the version;

The writing module is further configured to: when a file is written in the second node, the lease of the file is already held by the first node, the first node receives the file forwarded from the second node, and the file is Write to the local storage of the first node and update the version.

Further, the system further includes:

a reading module, configured to read data from one or more data blocks respectively at the first node; directly read the data if the data is in the first node; and request data from the second node if the data is in the second node, requesting data The data is then cached to the first node and the version is updated.

Further, the reading module is further configured to: after requesting the data, determine the data local storage version and the global version; if the data version is the latest version, directly load; if the data version is lower than the global version, request the latest version of the data, request The latest version of the data is cached to the first node.

Further, the memory management module includes a working data block chain and a free data block chain; and is further configured to operate the data block, and insert the data block into the working data block chain or the idle data block chain according to the state;

The memory management module is further configured to insert a data block in the working data block chain and insert the data block into the free data block chain; and take a data block from the free data block chain into the working data block chain.

Further, the memory management module is further configured to release the data blocks in the idle data block chain by a preset period and at a preset ratio.

Further, the memory processing module is further configured to: write, read, release, brush into a disk, synchronize to a network, delete, update a file size, and disable a file in a memory. .

A computer storage medium storing a program, the program performing the steps of any of the above.

In the present invention, in a distributed storage, a single file is stored in a memory and divided into one or more data blocks of the same size, a file is written in the first node, and a lease is set on the file, so that the file is bound to the first node. The memory management module is divided into a plurality of data block types, a single file corresponds to a certain size data block type, and an access context of the data block is generated, and the access context includes a data block guide and a file encoding; the data block and the access context are stored. At the first node and updating the version; when the file is written at the second node, the lease of the file has been held by the first node, the first node receives the file forwarded from the second node, and writes the file to the first The node is stored locally and updated. Such data is written based on the operation of the data block, and because the file is written, it will be bound to the first node to set up a lease. If other nodes write the file, it will be merged into the first node, so that the file can support random write. There will be no problems. Can easily cope with concurrent write operations.

DRAWINGS

1 is a flowchart of a distributed storage memory management method according to an embodiment of the present invention;

2 is a schematic diagram of a method for requesting and releasing data blocks in an internal memory according to an embodiment of the present invention;

3 is a schematic diagram of a method for writing a file according to an embodiment of the present invention;

4 is a schematic diagram of a processing method when data is written to node A but the lease is held by node B according to an embodiment of the present invention;

FIG. 5 is a schematic diagram of a method for reading data from a node according to an embodiment of the present invention; FIG.

6 is a block diagram of a distributed storage memory management system in accordance with an embodiment of the present invention.

Detailed ways

Before discussing the exemplary embodiments in more detail, it should be noted that some exemplary embodiments are described as a process or method depicted as a flowchart. Although the flowcharts describe various operations as a sequential process, many of the operations can be implemented in parallel, concurrently or concurrently. In addition, the order of operations can be rearranged. The process may be terminated when its operation is completed, but may also have additional steps not included in the figures.

It should also be noted that in some alternative implementations, the functions/acts noted may occur in a different order than that illustrated in the drawings. For example, two figures shown in succession may in fact be executed substantially concurrently or sometimes in the reverse order, depending on the function/acts involved.

The invention will now be further described with reference to the drawings and preferred embodiments.

As shown in FIG. 1, a distributed storage memory management method includes steps S110-S150. among them:

S110: Divide the memory management module into a plurality of data block types of different sizes;

S120: Write a file at the first node, and set a lease on the file, so that the file is bound to the first node;

S130: In the memory, the file corresponds to a data block type, and the file is divided into one or more data blocks of the same size in the memory, and an access context of the data block is generated, where the access context includes a data block guide and File of the file;

S140: Store the data block and the access context in the first node and update the version;

S150: When a file is written in the second node, the lease of the file is already held by the first node, the first node receives the file forwarded from the second node, and writes the file to the first node. Store and update the version locally.

In this embodiment, in the distributed storage, the writing of the data is based on the operation of the data block, and because the file is written and bound to the first node for binding, if the other node writes the file, it will be merged into the first A node that enables files to support random writes without problems. Can easily cope with concurrent write operations. Consistency is achieved by using the lease concept to write the same file to a machine. Use the version number concept to implement cross-network read caching.

Optionally, the method further includes:

Reading data from one or more data blocks at the first node;

If the data is in the first node, read it directly;

Load directly if the data version is the latest version;

In this embodiment, since the reading of data is also based on the operation of the data block in the distributed storage, if the read node of the file holds the lease, that is, the data is stored in the node, the data is directly read, if the data is in other nodes, then Request data reading from other nodes to make the data uniform and support out-of-order reading. The version judgment mechanism is also set when reading data, so that the read data is the latest. Enables the file to support random reads based on the ability to support random writes.

Optionally, the method further includes:

Setting a working data block chain and an idle data block chain in the memory management module;

In this embodiment, since the file is divided into a plurality of data blocks, the data blocks are respectively operated, which is more convenient and flexible. The data blocks are added to the working data block chain and the idle data block chain, and can be performed between the working data block chain and the idle data block chain. Conversion makes it easy to manage and control multiple data blocks and files, making overall memory management more efficient.

Optionally, the method further includes: setting different number of blocks corresponding to the plurality of data blocks, thereby facilitating efficient and rational allocation of resources of the memory.

Optionally, the method further includes: releasing the data blocks in the idle data block chain by a preset period and at a preset ratio.

The memory management module releases the data blocks in the idle data block chain according to the preset period and according to the preset ratio, so as to avoid occupying a large amount of memory all the time. The preset period can be automatically set according to the system, or can be set by the user, and the preset ratio can be automatically set according to the system. It can also be set by the user.

The operation of the data block includes: at least one operation of writing, reading, releasing, brushing into a disk, synchronizing into a network, deleting, updating a file size, and version failure of a file in a memory. . A single block of data in memory can be manipulated in the following order: write, read, release, flush to disk, synchronize to the network, delete, update file size, version invalidation, and so on.

Optionally, the method further includes: the memory is read-only into the file part of the data block for operation, and the efficiency is improved.

In the distributed storage, the node roles are divided into node and mds (meta data service), node is responsible for specific reading and writing, and mds node is responsible for metadata management and coordination of node nodes.

As shown in FIG. 2, the method for requesting and releasing data blocks in the memory includes: a read module in the memory, a write module, buffer recovery, data synchronization to other nodes, read-ahead from other nodes, etc., and data block access is generated. Context, the context includes the data block index (guide) and the file ID shown; then enters the memory data block request; then combines the LRU algorithm (Least Recently Used) with the access context to trigger the data block release of the working data block chain The data block after the data release is put into the idle data block chain; the idle data block chain applies for the data block to the memory data block; the work data block chain also brushes the data into the physical storage as required, and the memory data block applies to the working data block. Chain and free blockchains trigger block release on a periodic basis.

In the entire distributed system, data flow is based on data blocks, and conflicts between various modules can be avoided.

As shown in FIG. 3, the writing of the file specifically includes:

Write a file to the node node.

The node node requests the mds node to write a file to the lease.

The node node divides the file into data block A, data block B, and may also include more data blocks. This embodiment only exemplifies two data blocks, but does not limit the number of data blocks. Multiple blocks are generated in memory and data blocks are written to different blocks. If there is no data block content in the local storage, the data blocks are loaded from other nodes according to the reading process.

Both Block A and Block B are asynchronously flushed to disk and updated by policy.

As shown in Figure 4, the data is written to node A, but the lease is handled by node B:

Write data to the node A node.

The node A node requests the mds node to write a file to the lease, but the lease has been held by the node B node.

The node A node divides the file into data block A and data block B, and may also include more data blocks. This embodiment only exemplifies two data blocks, but does not limit the number of data blocks.

Data block A and data block B forward the data block write content to the node B node.

The node B node then writes the data locally and updates the file version.

As shown in FIG. 5, the method for reading data from a node includes:

The data is read from the node A node, and the data is read from the data block A and the data block B respectively. Of course, more data blocks can be included. In this embodiment, only two data blocks are exemplified, but the number of data blocks is not limited. .

If the data is at Node B, then request data from Node B.

If the data block is not stored locally, the file local storage version and the global version are determined. If the file version is the latest version, the file is directly loaded into the local storage. If the file local storage version is lower than the global version, the latest version is requested from the node B node. Then node B requests the file to cache the data locally and records the version number.

If the data block is not stored locally, node B requests the file to cache the data locally and records the version number.

Another preferred embodiment of the present invention is a distributed storage memory management system including a memory management module 210, a write module 220, a memory processing module 230, and a storage module 240.

The memory management module 210 is configured to divide the memory management module into a plurality of data block types of different sizes.

The writing module 220 is configured to write a file at the first node, and set a lease for the file, so that the file is bound to the first node.

The memory processing module 230 is configured to: in the memory, associate the file with a data block type, and divide the file into one or more data blocks of the same size in the memory, and generate a data block access context, where the access context includes data. Block navigation and associated file encoding.

The storage module 240 is configured to store the data block and the access context in the first node and update the version.

In this embodiment, in the distributed storage, the writing of the data is based on the operation of the data block, and because the file is written and bound to the first node for binding, if the other node writes the file, it will be merged into the first A node that enables files to support random writes without problems. It is easy to deal with concurrent write operations. Consistency is achieved by using the lease concept to write the same file to a machine. Use the version number concept to implement cross-network read caching.

Optionally, the system further includes a reading module, configured to read data from the one or more data blocks respectively at the first node; if the data is in the first node, directly read; if the data is in the second node, Request data from the second node, request the data, cache the data to the first node, and update the version.

In this embodiment, since the reading of data is also based on the operation of the data block in the distributed storage, if the read node of the file holds the lease, that is, the data is stored in the node, the data is directly read, if the data is in other nodes, then Request data reading from other nodes to make the data uniform and support out-of-order reading. The version judgment mechanism is also set when reading data, so that the read data is the latest. Enables the file to support random reads based on the ability to support random writes. The memory processing module 230 includes a working data block chain and a free data block chain; and is further configured to operate on the data block, and insert the data block into the working data block chain or the idle data block chain according to the state; the memory management module further uses After releasing the data block in the working data block chain, insert the free data block chain; take a data block from the free data block chain and put it into the working data block chain.

Optionally, the memory processing module is further configured to set a different number of blocks corresponding to the plurality of data blocks, thereby facilitating efficient and rational allocation of resources.

Optionally, the memory processing module is further configured to release the data blocks in the idle data block chain by a preset period and at a preset ratio. The memory processing module releases the data blocks in the idle data block chain according to a preset period and according to a preset ratio, so as to avoid occupying a large amount of memory all the time. The preset period may be automatically set according to the system, or may be set by the user, and the preset ratio may be automatically according to the system. Settings can also be set by the user.

Optionally, the memory processing module is further configured to: write, read, release, flush to a disk, synchronize to the network, delete, update a file size, and disable a single data block in a file. operating. A single block of data in memory can be manipulated in the following order: write, read, release, flush to disk, synchronize to the network, delete, update file size, version invalidation, and so on.

Another preferred embodiment of the present invention is a computer storage medium having a program stored thereon, the program execution comprising the steps of any of the above embodiments.

The above is a further detailed description of the present invention in connection with the specific preferred embodiments, and the specific embodiments of the present invention are not limited to the description. It will be apparent to those skilled in the art that the present invention may be made without departing from the spirit and scope of the invention.

Claims

A distributed storage memory management method, comprising:

Divide the memory management module into a plurality of data block types of different sizes;

Writing a file at the first node, and setting a lease to the file, so that the file is bound to the first node;

In the memory, the file corresponds to a data block type, and the file is divided into one or more data blocks of the same size in the memory, and an access context of the data block is generated, and the access context includes the data block guide and the file to which it belongs. coding;

Storing the data block and the access context in the first node and updating the version;

When a file is written at the second node, the lease of the file has been held by the first node, the first node receives the file forwarded from the second node and writes the file to the local storage of the first node And update the version.
A distributed storage memory management method according to claim 1, further comprising:

Reading data from one or more data blocks at the first node;

If the data is in the first node, read it directly;

If the data is in the second node, the data is requested from the second node, and after the data is requested, the data is cached to the first node and the version is updated.
The distributed storage memory management method according to claim 2, wherein the requesting the data to cache the data to the first node and updating the version further comprises:

After requesting the data, determine the data local storage version and the global version;

Load directly if the data version is the latest version;

If the data version is lower than the global version, the latest version of the data is requested, and the data is cached to the first node after requesting the latest version of the data.
The distributed storage memory management method according to claim 1, further comprising: setting a working data block chain and an idle data block chain in the memory management module;

Operate the data block and insert the data block into the working data block chain or the idle data block chain according to the state;

Inserting the data block in the working data block chain into the free data block chain;

A data block is taken from the free blockchain chain and placed in the working data block chain.
The distributed storage memory management method according to claim 4, further comprising: releasing the data blocks in the idle data block chain by a preset period and at a preset ratio.
The distributed storage memory management method according to claim 4, wherein the operating the data block comprises:

At least one of writing, reading, releasing, swiping into a disk, synchronizing to a network, deleting, updating a file size, or failing a version of a file in memory.
A distributed storage memory management method according to claim 1, further comprising: a memory read-only file partial data block for operation.
A distributed storage memory management system, comprising:

a memory management module for dividing a memory management module into a plurality of data block types of different sizes;

a writing module, configured to write a file at the first node, and set a lease to the file, so that the file is bound to the first node;

a memory processing module, configured to, in memory, correspond to a data block type, and divide the file into one or more data blocks of the same size in memory, and generate a data block access context, where the access context includes a data block Guidance and file encoding;

a storage module, configured to store the data block and the access context in the first node and update the version;

The writing module is further configured to: when a file is written in the second node, the lease of the file is already held by the first node, the first node receives the file forwarded from the second node, and the file is Write to the local storage of the first node and update the version.
A distributed storage memory management system according to claim 8, wherein the system further comprises:

a reading module, configured to read data from one or more data blocks respectively at the first node; directly read the data if the data is in the first node; and request data from the second node if the data is in the second node, requesting data The data is then cached to the first node and the version is updated.
The distributed storage memory management system according to claim 9, wherein the reading module is further configured to: after requesting the data, determine the data local storage version and the global version; if the data version is the latest version, directly load If the data version is lower than the global version, request the latest version of the data, and request the latest version of the data to cache the data to the first node.
A distributed storage memory management system according to claim 8, wherein said memory management module comprises a working data block chain and a free data block chain; and is further configured to operate the data block and base the data block according to The state is inserted into the working data block chain or the free data block chain; the memory management module is further configured to insert the data block in the working data block chain into the free data block chain; and take a data block from the free data block chain and put it into the work. Data block chain.
A distributed storage memory management system according to claim 11, wherein the memory management module is further configured to release data blocks in the idle data block chain by a preset period and at a preset ratio.
The distributed storage memory management system according to claim 8, wherein the memory processing module is further configured to write, read, release, flush, and synchronize a single data block of the file in the memory to the disk. At least one of the operations in the network, deleting, updating the file size, and version invalidation.
A computer storage medium, characterized in that the computer storage medium can store a program, the program execution comprising the steps of any one of claims 1-7.