CN106021381A - Data access/storage method and device for cloud storage service system - Google Patents

Data access/storage method and device for cloud storage service system Download PDF

Info

Publication number
CN106021381A
CN106021381A CN201610311690.1A CN201610311690A CN106021381A CN 106021381 A CN106021381 A CN 106021381A CN 201610311690 A CN201610311690 A CN 201610311690A CN 106021381 A CN106021381 A CN 106021381A
Authority
CN
China
Prior art keywords
data
file
data block
read
metadata
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610311690.1A
Other languages
Chinese (zh)
Inventor
罗安宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sohu New Media Information Technology Co Ltd
Original Assignee
Beijing Sohu New Media Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sohu New Media Information Technology Co Ltd filed Critical Beijing Sohu New Media Information Technology Co Ltd
Priority to CN201610311690.1A priority Critical patent/CN106021381A/en
Publication of CN106021381A publication Critical patent/CN106021381A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • G06F16/1824Distributed file systems implemented using Network-attached Storage [NAS] architecture
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files

Abstract

Embodiments of the invention disclose a data caching method and device. The method comprises the following steps that a metadata cache region and a data file cache region are established in a client, wherein the metadata cache region is used for storing metadata of each data file, and metadata is information used for describing the data file; the data file cache region is used for storing a data block of each data file; and each data file is split into a plurality of data blocks with the fixed size, an independent lock is configured for each data block, and the data is updated to the cloud or the data is read to the local cache from the cloud in units of the data blocks. According to the technical scheme provided by the invention, the metadata and the data files are stored in the client through a cache mechanism, so that the IO (Input Output) frequency can be reduced, and furthermore, the data files are stored in a way of the data blocks, so that the concurrency can be increased to support an efficient read-write mechanism.

Description

The data access of a kind of cloud storage service system/storage method and device
Technical field
The present invention relates to cloud storage field, particularly relate to the data access of a kind of cloud storage service system/deposit Method for storing and device.
Background technology
Cloud storage is a kind of Network storage technology, refers to by cluster application, network technology or distributed The functions such as file system, by various types of storage devices a large amount of in network by application software collection Collaborative work altogether, the common system that data storage and Operational Visit function are externally provided.Simple next Saying, cloud storage is exactly that storage resources is placed on cloud the scheme for people's access, and user can be any Time, any place, the device through any network-connectable is connected on cloud access data easily.
At present, client generally loads the shared storage disk of cloud storage by the way of loading disk To this locality, to realize remote access.At present, conventional mode has NFS (Network File System, NFS) and user's space file system (File system in User space, FUSE);Its In, NFS is the one in the file system that FreeBSD supports, it allows between the computer in network By TCP/IP network shared resource.In the application of NFS, the client application of local NFS can To read and write the file being positioned on high in the clouds (cloud store-service end) nfs server pellucidly, just as accessing Local file is the same.FUSE be in Linux for some cyberspace of carry to local file system Module.
Present stage, the most network storage, such as: GlusterFS, HDFS, and Amazon AWS all has client file systems based on FUSE.Cloud will can be there is by these clients End network disk equipment be mounted to local directory.Wherein, s3fs-fuse is one is Amazon s3 The open source projects of cloud platform exploitation, achieves the file system of user's space based on FUSE.It provides To metadata and the management of data, but caching is lacked and further divides and manage by s3fs-fuse Reason, s3fs-fuse provides a single lock for a file, it means that for file in part During renewal, other parts cannot be modified, file data can only accessed sequentially and amendment, Lack concurrency.
Being similar to the other system of above-mentioned s3fs-fuse, single being locked into line pipe if used for file Reason, the problem that all can there is above-mentioned shortage concurrency, affect the effect that user accesses, revises high in the clouds data Rate, Consumer's Experience is not so good.
Summary of the invention
In order to solve in prior art, the present invention provides a kind of data cache method and device, client Hold by caching mechanism storage metadata and data file, so can reduce by IO number, by with number Store data file according to the mode of block, by increasing capacitance it is possible to increase concurrency, thus support to read and write mechanism efficiently.
Providing a kind of data cache method in the application first aspect, described method includes:
Metadata cache district and data file cache district, wherein, described metadata is set up in client Buffer area is for storing the metadata of each data file, and metadata refers to for describing data file Information;Described data file buffer area is for storing the data block of each data file;
Each data file is cut into the data block of multiple fixed size, for each data block configuration Independent lock, it is achieved in units of data block, data are updated to high in the clouds or data are read by high in the clouds To local cache.
Optionally, described in client, set up metadata cache district and data file cache district, including:
Metadata cache district is set up, so that metadata cache is in the content in the internal memory of client;
Data file buffer area is set up, so that data file is buffered in disk in the disk of client In.
Optionally, described method also includes:
Receiving the read request of operating system nucleus, described read request includes: filename to be read and treating The data segments read;
According to the content of described metadata cache district record, search with described in data file buffer area The data file that file name is corresponding;
Contrast the current version of data block to be read and the current version of file to be read;
If, the current version of data block less than the current version of file, then the more new data block from high in the clouds, After renewal, from data file caches, read data and return;
If the current version of data block is identical with the current version of file, then cache from data file Middle reading data also return.
Optionally, during more new data block, if, the entity tag ETag that network request returns Differ with the entity tag ETag of record in the metadata cache of data file, then update data file Current version and metadata cache, return described contrast operation after renewal;Described ETag refer to The mark of web resource association.
5, method according to claim 3, it is characterised in that the version of described data block is set For shaping atom variable.
Optionally, a data block several data blocks that pre-read is adjacent simultaneously are being read.
Optionally, described method also includes:
Receive operating system nucleus write request, described write request includes: file name to be written and Data to be written;
Obtain the lock of data block to be written, start this lock, write data in the buffer area of correspondence;
Write during, by written into data block upload to high in the clouds in advance;
At the end of all data blocks are uploaded, metadata corresponding in amendment metadata cache.
Optionally, described write during, by written into data block upload to high in the clouds in advance, Including:
Monitoring ablation process, it is judged that written into Data Position whether already more than the end of data block, And whether the state of uploading of this data block is designated to be uploaded, if it is, use asynchronous system in advance This data block is uploaded to high in the clouds, and is uploaded status indicator for upload.
Providing a kind of data buffer storage device in the application second aspect, described device includes:
Buffer area sets up unit, for setting up metadata cache district and data file cache in client District, wherein, described metadata cache district is for storing the metadata of each data file, and metadata is Refer to the information for describing data file;Described data file buffer area is used for storing each data file Data block;
Data block divides and buffer unit, for each data file is cut into multiple fixed size Data block, for the lock of each data block configuration independence, it is achieved data updated in units of data block Local cache is read by high in the clouds to high in the clouds or by data.
Optionally, described buffer area sets up unit, including:
Metadata cache district sets up subelement, for setting up metadata cache district in the internal memory of client, So that metadata cache is in the content;
Data file buffer area sets up subelement, delays for setting up data file in the disk of client Deposit district, so that data file is buffered in disk.
Optionally, described device also includes:
Read request response unit, for receiving the read request of operating system nucleus, described read request includes: Filename to be read and data segments to be read;
Search unit, for the content according to described metadata cache district record, cache in data file District searches the data file corresponding with described file name;
Contrast unit, for contrasting the current version of data block to be read and working as of file to be read Front version;
First reads unit, if for, the current version of data block is less than the current version of file, The then more new data block from high in the clouds, after renewal, reads data from data file caches and returns;
Second reads unit, if for, the current version of data block is identical with the current version of file, From data file caches, then read data and return.
Optionally, described first reads unit, including:
Update subelement, be used for during more new data block, if, the reality that network request returns Body label E Tag differs, then with the entity tag ETag of record in the metadata cache of data file Update current version and the metadata cache of data file, after renewal, return described contrast operation;Institute State the mark that ETag refers to associate with web resource.
Optionally, described device also includes:
Unit is set, is shaping atom variable for pre-setting the version of described data block.
Optionally, described device also includes:
Pre-read unit, reads section one data block of reading for reading unit or second first Several data blocks that pre-read is adjacent simultaneously.
Optionally, described device also includes:
Write request response unit, for receiving the write request of operating system nucleus, described write request includes: File name to be written and data to be written;
Data write unit, for obtaining the lock of data block to be written, starts this lock, data is write Enter in the buffer area of correspondence;
Data uploading unit, for write during, by written into data block upload in advance High in the clouds;
Metadata updates unit, at the end of all data blocks are uploaded, revises in metadata cache Corresponding metadata.
Optionally, described data uploading unit, including:
Upload subelement, be used for monitoring ablation process, it is judged that written into Data Position surpassed Cross the end of data block, and whether the state of uploading of this data block is designated to be uploaded, if it is, Use asynchronous system that this data block uploads to high in the clouds in advance, and uploaded status indicator for upload.
Relative to prior art, the advantage of the application technique scheme is:
Metadata cache district and data file cache district, wherein, described metadata is set up in client Buffer area is for storing the metadata of each data file, and metadata refers to for describing data file Information;Described data file buffer area is for storing the data block of each data file;By each data File is cut into the data block of multiple fixed size, for the lock of each data block configuration independence, it is achieved In units of data block, data are updated to high in the clouds or data are read local cache by high in the clouds.This This buffer structure that application proposes, is divided data file, is carried out with the form of data block Storage management so that data file can add concurrency, simultaneously by concurrent access and amendment For the respective version of data block and file record, can upgrade in time by the way of judging version data, It is thus possible to meet the requirement of data consistency.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present application or technical scheme of the prior art, below by right In embodiment or description of the prior art, the required accompanying drawing used is briefly described, it should be apparent that, Accompanying drawing in describing below is only some embodiments described in the application, skill common for this area From the point of view of art personnel, on the premise of not paying creative work, it is also possible to obtain it according to these accompanying drawings Its accompanying drawing.
The flow chart of a kind of data cache method that Fig. 1 provides for the application;
The schematic diagram of the data buffer storage structure that Fig. 2 provides for the application;
The flow chart of the method for reading data that Fig. 3 provides for the application;
The flow chart of the method for writing data that Fig. 4 provides for the application;
The structural representation of a kind of data buffer storage device that Fig. 5 provides for the application.
Detailed description of the invention
In order to make those skilled in the art be more fully understood that the present invention program, below in conjunction with the present invention Accompanying drawing in embodiment, is clearly and completely described the technical scheme in the embodiment of the present invention, Obviously, described embodiment is only a part of embodiment of the present invention rather than whole embodiments. Based on the embodiment in the present invention, those of ordinary skill in the art are not making creative work premise Lower obtained every other embodiment, broadly falls into the scope of protection of the invention.
See the flow chart that Fig. 1, Fig. 1 are a kind of data cache methods that the application provides, such as Fig. 1 Shown in, the method may comprise steps of 101 and step 102.
Step 101: set up metadata cache district and data file cache district in client, wherein, Described metadata cache district is for storing the metadata of each data file, and metadata refers to for describing The information of data file;Described data file buffer area is for storing the data block of each data file.
Step 102: each data file is cut into the data block of multiple fixed size, for each The lock of data block configuration independence, it is achieved in units of data block, data are updated to high in the clouds or by data Local cache is read by high in the clouds.
The buffer structure proposed for clearer explanation the application, carries the application below in conjunction with Fig. 2 The buffer structure gone out is further explained explanation.
As in figure 2 it is shown, the buffer structure that the application provides includes that metadata cache district and data file are delayed Deposit these two parts of district.In view of in an operating system, metadata access is more frequent, and data ratio itself Metadata takies the features such as bigger disk space, in order to preferably utilize FTP client FTP resource, the application The preferred implementation provided, this implementation includes:
Metadata cache district is set up, so that metadata cache is in the content in the internal memory of client; And set up data file buffer area in the disk of client, so that data file is buffered in disk.
The data cached file of data file buffer area, this data file is that data file is having visit beyond the clouds A corresponding local cache file is dynamically set up when asking request, and at filec descriptor final plant closure Time delete this cache file.
In this application, data file is not to cache with overall document form, but is split Becoming the data block of multiple fixed size, this fixed size can configure when program starts.In read-write During request, thus in units of data block, data are updated to high in the clouds or by data by high in the clouds Read local cache.
Each data block is required to configure a Read-Write Locks, and Read-Write Locks is used for anti-during more new data from high in the clouds The only read requests of client.It addition, data block also needs to be configured with current version, working as of data block Front version is set to integer atom variable.Shaping atom variable is without keeping line while locking Cheng Anquan, can realize adding 1 operation in a cpu instruction, and efficiency is high.Simultaneously, Mei Gehuan Depositing file and also have an extra current version variable, current file version is to be determined by ETag, ETag Being the value returned by high in the clouds, it is the MD5 value of file.
The application is exactly based on the size of the current version of data block and the current version of cache file and sentences Data block in disconnected buffer area is the most consistent with high in the clouds.
The above-mentioned buffering scheme provided based on the application, the application also provides for the digital independent scheme of correspondence With data writing scheme, below by embodiment, both schemes are explained respectively.
See the flow chart that Fig. 3, Fig. 3 are the method for reading data that the application provides, this digital independent Method is to realize, as it is shown on figure 3, the method includes based on the buffering scheme shown in above-mentioned Fig. 1 Following steps 301-step 305;
Step 301: receiving the read request of operating system nucleus, described read request includes: to be read Filename and data segments to be read.
Step 302: according to the content of described metadata cache district record, in data file buffer area Search the data file corresponding with described file name.
Step 303: contrast the current version of data block to be read and the current version of file to be read This.
Step 304: if the current version of data block is less than the current version of file, then from high in the clouds More new data block, after renewal, reads data from data file caches and returns.
Step 305: if the current version of data block is identical with the current version of file, then from number According to file cache reading data and returning.
When realizing, client can receive by the kernel requests of the system forwards such as Fuse or NFS, should Request includes the data segments that the filename that needs access accesses with needs;Then, client contrast needs The current version of data block to be accessed and the magnitude relationship of the current version of file, when working as of data block When front version is less than the current version of file, the most more new data block, at the same time it can also be to adjacent Data block initiates asynchronous refresh request.When the current version of data block and the current version of file are identical, Then directly read from caching and return data.
During more new data block, if the ETag of network request return and current version metadata In caching, the Etag of record is not inconsistent, then update current version and the metadata cache of file, again return to Step 303 contrasts.After data renewal terminates, then read and return from data file caches Data.
Etag is the MD5 value of data file on the line that high in the clouds returns, therefore, when data file becomes During change, Etag is bound to change.Further, when Etag changes, it is simply that automatically update The FileVersion of Installed System Memory storage, FileVersion is integer atom variable.So, it becomes possible to by right The concordance of the on-line off-line data of data is ensured than Etag.
During the read-write of data file, whenever version change being detected on line, file current Version adds 1, and the current version causing data block may be less than the current version of file.The most all literary compositions Part is data cached will detect inefficacy, and need when next time accesses to update to latest edition.
After data block updates, the current version of data block will be registered as the current version of file. Meanwhile, by update every time request of data return ETag all carry out with the ETag in metadata cache right Ratio, when inconsistent, shows that the data in high in the clouds are the most updated, now, and will by the current version of file Again add 1 repetition aforesaid operations, until local cache data is consistent with high in the clouds.
During the read-write of multithreading read-write or multi-client, it is necessary to consider the concordance of data Problem.When high in the clouds data are modified, this amendment should reflect in the data of client in time. To achieve it, metadata cache needs a less time-out time, to ensure that metadata is delayed Deposit and can upgrade in time.When accessing file, by documents ETag and the record in metadata cache ETag the most consistent.To again update in the case of inconsistencies file data caching and file unit Data buffer storage.
See the flow chart that Fig. 4, Fig. 4 are the method for reading data that the application provides, this digital independent Method is to realize based on the buffering scheme shown in above-mentioned Fig. 1, and as shown in Figure 4, the method includes Following steps 401-step 404;
Step 401: receiving the write request of operating system nucleus, described write request includes: to be written File name and data to be written.
Step 402: obtain the lock of data block to be written, starts this lock, writes data into correspondence In buffer area.
Step 403: write during, by written into data block upload to high in the clouds in advance.
When realizing, step 403 may include that monitoring ablation process, it is judged that written into data bit Put the end whether alreading more than data block, and whether the state of uploading of this data block be designated to be uploaded, If it is, use asynchronous system that this data block uploads to high in the clouds in advance, and uploaded state mark Know for upload.
Step 404: at the end of all data blocks are uploaded, first number corresponding in amendment metadata cache According to.
Similar with the read method flow process shown in above-mentioned Fig. 3, the wiring method that the application provides is first Receiving the kernel requests forwarded by Fuse or NFS, this request includes the filename that needs write And data, then, find the cache file of correspondence according to metadata cache district recorded content, if, The most then create new cache file.Caching number is write data into after the lock getting data block According in block, and it is 0,0 to be used for identifying state to be uploaded by the status indication of uploading of the data revised. If the position being ultimately written has been over the end of certain data block, and the uploading of this data block Status indication is 0, then this data block of asynchronous upload, and state of being uploaded changes to 1, and 1 is used for Mark has uploaded state.After the end of file is uploaded, revise metadata cache data.
Due to, it is achieved efficiently read-write needs the problem considered mainly reduce by IO number and increase concurrent Property.Therefore, the technical scheme that the application provides, in order to reduce IO, needs the data accessed and unit's number According to increasing by one layer of caching.In an operating system, metadata access is more frequent, and data itself are than unit Data take bigger disk space.Herein file metadata is separated, by unit with file data caching Data are placed in internal memory, and data are divided and are placed on disk.For file data, then enter according to file size One step cutting, is divided into several equal-sized data block, this cache way, is conducive to reading at file During, background thread meeting several data blocks that pre-read is adjacent simultaneously, improve data reading performance using redundancy. In the process of file write, the data block having been written into can be uploaded by background thread in advance.Due to backstage Carry out during the read-write operation of thread and client, data when client reads next data block simultaneously Already in local, therefore its performance can be close to local IP access.
The method provided based on above example, the embodiment of the present invention additionally provides the device of correspondence, under Face combines accompanying drawing to describe its operation principle in detail.
Device embodiment
See the structure chart that Fig. 5, Fig. 5 are a kind of data buffer storage devices that the present invention provides, such as Fig. 5 Shown in, this device can include with lower unit:
Buffer area sets up unit 501, for setting up metadata cache district and data file in client Buffer area, wherein, described metadata cache district is used for storing the metadata of each data file, unit's number According to the information referred to for describing data file;Described data file buffer area is used for storing each data The data block of file;
Data block divides and buffer unit 502, multiple fixing big for each data file being cut into Little data block, for the lock of each data block configuration independence, it is achieved by data in units of data block Update to high in the clouds or data are read local cache by high in the clouds.
Optionally, described buffer area sets up unit 501, including:
Metadata cache district sets up subelement, for setting up metadata cache district in the internal memory of client, So that metadata cache is in the content;
Data file buffer area sets up subelement, delays for setting up data file in the disk of client Deposit district, so that data file is buffered in disk.
Optionally, described device also includes:
Read request response unit, for receiving the read request of operating system nucleus, described read request includes: Filename to be read and data segments to be read;
Search unit, for the content according to described metadata cache district record, cache in data file District searches the data file corresponding with described file name;
Contrast unit, for contrasting the current version of data block to be read and working as of file to be read Front version;
First reads unit, if for, the current version of data block is less than the current version of file, The then more new data block from high in the clouds, after renewal, reads data from data file caches and returns;
Second reads unit, if for, the current version of data block is identical with the current version of file, From data file caches, then read data and return.
Optionally, described first reads unit, including:
Update subelement, be used for during more new data block, if, the reality that network request returns Body label E Tag differs, then with the entity tag ETag of record in the metadata cache of data file Update current version and the metadata cache of data file, after renewal, return described contrast operation;Institute State the mark that ETag refers to associate with web resource.
Optionally, described device also includes:
Unit is set, is shaping atom variable for pre-setting the version of described data block.
Optionally, described device also includes:
Pre-read unit, reads section one data block of reading for reading unit or second first Several data blocks that pre-read is adjacent simultaneously.
Optionally, described device also includes:
Write request response unit, for receiving the write request of operating system nucleus, described write request includes: File name to be written and data to be written;
Data write unit, for obtaining the lock of data block to be written, starts this lock, data is write Enter in the buffer area of correspondence;
Data uploading unit, for write during, by written into data block upload in advance High in the clouds;
Metadata updates unit, at the end of all data blocks are uploaded, revises in metadata cache Corresponding metadata.
Optionally, described data uploading unit, including:
Upload subelement, be used for monitoring ablation process, it is judged that written into Data Position surpassed Cross the end of data block, and whether the state of uploading of this data block is designated to be uploaded, if it is, Use asynchronous system that this data block uploads to high in the clouds in advance, and uploaded status indicator for upload.
It should be noted that one of ordinary skill in the art will appreciate that and realize in said method embodiment All or part of flow process, can be by computer program and complete to instruct relevant hardware, institute The program stated can be stored in a computer read/write memory medium, and this program is upon execution, it may include Flow process such as above-mentioned each method embodiment.Wherein, described storage medium can be magnetic disc, CD, read-only Store-memory body (Read-Only Memory, ROM) or random store-memory body (Random Access Memory, RAM) etc..
Each embodiment in this specification all uses the mode gone forward one by one to describe, phase between each embodiment As homophase part see mutually, each embodiment stress with other embodiments Difference.For device embodiment, owing to it is substantially similar to embodiment of the method, So describing fairly simple, relevant part sees the part of embodiment of the method and illustrates.Above institute The device embodiment described is only schematically, the wherein said unit illustrated as separating component and Module can be or may not be physically separate.Furthermore it is also possible to according to the actual needs Select some or all of unit therein and module to realize the purpose of the present embodiment scheme.This area Those of ordinary skill, in the case of not paying creative work, is i.e. appreciated that and implements.
The above is only the detailed description of the invention of the present invention, it is noted that common for the art For technical staff, under the premise without departing from the principles of the invention, it is also possible to make some improvements and modifications, These improvements and modifications also should be regarded as protection scope of the present invention.

Claims (16)

1. a data cache method, it is characterised in that described method includes:
Metadata cache district and data file cache district, wherein, described metadata is set up in client Buffer area is for storing the metadata of each data file, and metadata refers to for describing data file Information;Described data file buffer area is for storing the data block of each data file;
Each data file is cut into the data block of multiple fixed size, for each data block configuration Independent lock, it is achieved in units of data block, data are updated to high in the clouds or data are read by high in the clouds To local cache.
Method the most according to claim 1, it is characterised in that described foundation unit in client Data buffer area and data file cache district, including:
Metadata cache district is set up, so that metadata cache is in the content in the internal memory of client;
Data file buffer area is set up, so that data file is buffered in disk in the disk of client In.
Method the most according to claim 1, it is characterised in that described method also includes:
Receiving the read request of operating system nucleus, described read request includes: filename to be read and treating The data segments read;
According to the content of described metadata cache district record, search with described in data file buffer area The data file that file name is corresponding;
Contrast the current version of data block to be read and the current version of file to be read;
If, the current version of data block less than the current version of file, then the more new data block from high in the clouds, After renewal, from data file caches, read data and return;
If the current version of data block is identical with the current version of file, then cache from data file Middle reading data also return.
Method the most according to claim 3, it is characterised in that during more new data block, If, the entity tag ETag that network request returns and the reality of record in the metadata cache of data file Body label E Tag differs, then update current version and the metadata cache of data file, after renewal Return described contrast operation;Described ETag refers to the mark associated with web resource.
Method the most according to claim 3, it is characterised in that the version of described data block is set For shaping atom variable.
Method the most according to claim 3, it is characterised in that
Reading a data block several data blocks that pre-read is adjacent simultaneously.
Method the most according to claim 1, it is characterised in that described method also includes:
Receive operating system nucleus write request, described write request includes: file name to be written and Data to be written;
Obtain the lock of data block to be written, start this lock, write data in the buffer area of correspondence;
Write during, by written into data block upload to high in the clouds in advance;
At the end of all data blocks are uploaded, metadata corresponding in amendment metadata cache.
Method the most according to claim 7, it is characterised in that described during write, By written into data block upload to high in the clouds in advance, including:
Monitoring ablation process, it is judged that written into Data Position whether already more than the end of data block, And whether the state of uploading of this data block is designated to be uploaded, if it is, use asynchronous system in advance This data block is uploaded to high in the clouds, and is uploaded status indicator for upload.
9. a data buffer storage device, it is characterised in that described device includes:
Buffer area sets up unit, for setting up metadata cache district and data file cache in client District, wherein, described metadata cache district is for storing the metadata of each data file, and metadata is Refer to the information for describing data file;Described data file buffer area is used for storing each data file Data block;
Data block divides and buffer unit, for each data file is cut into multiple fixed size Data block, for the lock of each data block configuration independence, it is achieved data updated in units of data block Local cache is read by high in the clouds to high in the clouds or by data.
Device the most according to claim 9, it is characterised in that described buffer area sets up unit, Including:
Metadata cache district sets up subelement, for setting up metadata cache district in the internal memory of client, So that metadata cache is in the content;
Data file buffer area sets up subelement, delays for setting up data file in the disk of client Deposit district, so that data file is buffered in disk.
11. devices according to claim 9, it is characterised in that described device also includes:
Read request response unit, for receiving the read request of operating system nucleus, described read request includes: Filename to be read and data segments to be read;
Search unit, for the content according to described metadata cache district record, cache in data file District searches the data file corresponding with described file name;
Contrast unit, for contrasting the current version of data block to be read and working as of file to be read Front version;
First reads unit, if for, the current version of data block is less than the current version of file, The then more new data block from high in the clouds, after renewal, reads data from data file caches and returns;
Second reads unit, if for, the current version of data block is identical with the current version of file, From data file caches, then read data and return.
12. devices according to claim 11, it is characterised in that described first reads unit, Including:
Update subelement, be used for during more new data block, if, the reality that network request returns Body label E Tag differs, then with the entity tag ETag of record in the metadata cache of data file Update current version and the metadata cache of data file, after renewal, return described contrast operation;Institute State the mark that ETag refers to associate with web resource.
13. devices according to claim 11, it is characterised in that also include:
Unit is set, is shaping atom variable for pre-setting the version of described data block.
14. devices according to claim 11, it is characterised in that described device also includes:
Pre-read unit, reads section one data block of reading for reading unit or second first Several data blocks that pre-read is adjacent simultaneously.
15. devices according to claim 9, it is characterised in that described device also includes:
Write request response unit, for receiving the write request of operating system nucleus, described write request includes: File name to be written and data to be written;
Data write unit, for obtaining the lock of data block to be written, starts this lock, data is write Enter in the buffer area of correspondence;
Data uploading unit, for write during, by written into data block upload in advance High in the clouds;
Metadata updates unit, at the end of all data blocks are uploaded, revises in metadata cache Corresponding metadata.
16. devices according to claim 15, it is characterised in that described data uploading unit, Including:
Upload subelement, be used for monitoring ablation process, it is judged that written into Data Position surpassed Cross the end of data block, and whether the state of uploading of this data block is designated to be uploaded, if it is, Use asynchronous system that this data block uploads to high in the clouds in advance, and uploaded status indicator for upload.
CN201610311690.1A 2016-05-11 2016-05-11 Data access/storage method and device for cloud storage service system Pending CN106021381A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610311690.1A CN106021381A (en) 2016-05-11 2016-05-11 Data access/storage method and device for cloud storage service system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610311690.1A CN106021381A (en) 2016-05-11 2016-05-11 Data access/storage method and device for cloud storage service system

Publications (1)

Publication Number Publication Date
CN106021381A true CN106021381A (en) 2016-10-12

Family

ID=57100022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610311690.1A Pending CN106021381A (en) 2016-05-11 2016-05-11 Data access/storage method and device for cloud storage service system

Country Status (1)

Country Link
CN (1) CN106021381A (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106779936A (en) * 2016-12-08 2017-05-31 上海携程国际旅行社有限公司 Firm Order system and Firm Order method
CN107786670A (en) * 2017-11-10 2018-03-09 郑州云海信息技术有限公司 A kind of system and method that data and cloud storage R. concomitans are locally stored
CN108021339A (en) * 2017-11-03 2018-05-11 网宿科技股份有限公司 A kind of method of disk read-write, equipment and computer-readable recording medium
CN108427703A (en) * 2017-02-15 2018-08-21 谷歌有限责任公司 The system and method accessed the data file for being stored in data-storage system are provided
CN108959405A (en) * 2018-06-06 2018-12-07 深圳市远行科技股份有限公司 The strong consistency read method and terminal device of data
CN109710194A (en) * 2018-12-29 2019-05-03 武汉思普崚技术有限公司 The storage method and device of upper transmitting file
CN109886647A (en) * 2019-01-29 2019-06-14 广东华伦招标有限公司 Procurement business managing functional module method, apparatus, equipment and storage medium
CN110188080A (en) * 2019-05-17 2019-08-30 北京航空航天大学 Telefile Research of data access performance optimization based on client high-efficiency caching
CN110263060A (en) * 2019-06-06 2019-09-20 零搜科技(深圳)有限公司 A kind of ERP electronic accessories management method and computer equipment
CN110362599A (en) * 2018-04-08 2019-10-22 北京京东尚科信息技术有限公司 Date storage method, device, electronic equipment and computer-readable medium
CN110555001A (en) * 2019-09-05 2019-12-10 腾讯科技(深圳)有限公司 data processing method, device, terminal and medium
CN111198885A (en) * 2019-12-30 2020-05-26 北京奇艺世纪科技有限公司 Data processing method and device
CN107423404B (en) * 2017-07-27 2020-06-26 东软集团股份有限公司 Flow instance data synchronous processing method and device
CN111367865A (en) * 2020-03-10 2020-07-03 山东汇贸电子口岸有限公司 AWS 3-based file fragment uploading implementation method
CN111427966A (en) * 2020-06-10 2020-07-17 腾讯科技(深圳)有限公司 Database transaction processing method and device and server
CN112181916A (en) * 2020-09-14 2021-01-05 星辰天合(北京)数据科技有限公司 File pre-reading method and device based on user space file system FUSE, and electronic equipment
CN112685363A (en) * 2020-12-30 2021-04-20 湖南博匠信息科技有限公司 User file reading and writing method and device
CN112860644A (en) * 2021-02-05 2021-05-28 深圳市木浪云数据有限公司 Data storage method and device based on cloud storage
CN113032335A (en) * 2021-05-24 2021-06-25 北京世纪好未来教育科技有限公司 File access method, device, equipment and storage medium
CN113220211A (en) * 2020-01-21 2021-08-06 上海商汤智能科技有限公司 Data storage system, data access method and related device
CN113242321A (en) * 2021-07-12 2021-08-10 广东睿江云计算股份有限公司 Data transmission method for mobile storage device
CN113342837A (en) * 2021-08-06 2021-09-03 北京金山云网络技术有限公司 Data transmission method, device, electronic equipment and computer readable medium
CN115242817A (en) * 2022-07-21 2022-10-25 阿里巴巴(中国)有限公司 Data access processing method, device, equipment and storage medium
CN117311645A (en) * 2023-11-24 2023-12-29 武汉纺织大学 LSM storage metadata read amplification optimization method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004778A (en) * 2010-11-19 2011-04-06 清华大学 Text index online updating method in cloud environment
CN102035881A (en) * 2010-11-19 2011-04-27 清华大学 Data caching method of cloud storage system
CN102882923A (en) * 2012-07-25 2013-01-16 北京亿赛通科技发展有限责任公司 Secure storage system and method for mobile terminal
CN102882983A (en) * 2012-10-22 2013-01-16 南京云创存储科技有限公司 Rapid data memory method for improving concurrent visiting performance in cloud memory system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004778A (en) * 2010-11-19 2011-04-06 清华大学 Text index online updating method in cloud environment
CN102035881A (en) * 2010-11-19 2011-04-27 清华大学 Data caching method of cloud storage system
CN102882923A (en) * 2012-07-25 2013-01-16 北京亿赛通科技发展有限责任公司 Secure storage system and method for mobile terminal
CN102882983A (en) * 2012-10-22 2013-01-16 南京云创存储科技有限公司 Rapid data memory method for improving concurrent visiting performance in cloud memory system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
霍严梅 等: "并行文件系统研究综述", 《小型微型计算机系统》 *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106779936A (en) * 2016-12-08 2017-05-31 上海携程国际旅行社有限公司 Firm Order system and Firm Order method
CN106779936B (en) * 2016-12-08 2020-09-22 上海携程国际旅行社有限公司 Order verification system and order verification method
CN108427703A (en) * 2017-02-15 2018-08-21 谷歌有限责任公司 The system and method accessed the data file for being stored in data-storage system are provided
US11232080B2 (en) 2017-02-15 2022-01-25 Google Llc Systems and methods for providing access to a data file stored at a data storage system
CN107423404B (en) * 2017-07-27 2020-06-26 东软集团股份有限公司 Flow instance data synchronous processing method and device
CN108021339A (en) * 2017-11-03 2018-05-11 网宿科技股份有限公司 A kind of method of disk read-write, equipment and computer-readable recording medium
CN107786670A (en) * 2017-11-10 2018-03-09 郑州云海信息技术有限公司 A kind of system and method that data and cloud storage R. concomitans are locally stored
CN110362599A (en) * 2018-04-08 2019-10-22 北京京东尚科信息技术有限公司 Date storage method, device, electronic equipment and computer-readable medium
CN108959405A (en) * 2018-06-06 2018-12-07 深圳市远行科技股份有限公司 The strong consistency read method and terminal device of data
CN109710194A (en) * 2018-12-29 2019-05-03 武汉思普崚技术有限公司 The storage method and device of upper transmitting file
CN109886647A (en) * 2019-01-29 2019-06-14 广东华伦招标有限公司 Procurement business managing functional module method, apparatus, equipment and storage medium
CN110188080A (en) * 2019-05-17 2019-08-30 北京航空航天大学 Telefile Research of data access performance optimization based on client high-efficiency caching
CN110188080B (en) * 2019-05-17 2021-12-17 北京航空航天大学 Remote file data access performance optimization method based on client-side efficient cache
CN110263060A (en) * 2019-06-06 2019-09-20 零搜科技(深圳)有限公司 A kind of ERP electronic accessories management method and computer equipment
CN110555001A (en) * 2019-09-05 2019-12-10 腾讯科技(深圳)有限公司 data processing method, device, terminal and medium
CN110555001B (en) * 2019-09-05 2021-05-28 腾讯科技(深圳)有限公司 Data processing method, device, terminal and medium
CN111198885A (en) * 2019-12-30 2020-05-26 北京奇艺世纪科技有限公司 Data processing method and device
CN113220211A (en) * 2020-01-21 2021-08-06 上海商汤智能科技有限公司 Data storage system, data access method and related device
CN111367865A (en) * 2020-03-10 2020-07-03 山东汇贸电子口岸有限公司 AWS 3-based file fragment uploading implementation method
CN111427966A (en) * 2020-06-10 2020-07-17 腾讯科技(深圳)有限公司 Database transaction processing method and device and server
CN112181916A (en) * 2020-09-14 2021-01-05 星辰天合(北京)数据科技有限公司 File pre-reading method and device based on user space file system FUSE, and electronic equipment
CN112181916B (en) * 2020-09-14 2024-04-09 北京星辰天合科技股份有限公司 File pre-reading method and device based on user space file system FUSE, and electronic equipment
CN112685363A (en) * 2020-12-30 2021-04-20 湖南博匠信息科技有限公司 User file reading and writing method and device
CN112860644A (en) * 2021-02-05 2021-05-28 深圳市木浪云数据有限公司 Data storage method and device based on cloud storage
CN113032335A (en) * 2021-05-24 2021-06-25 北京世纪好未来教育科技有限公司 File access method, device, equipment and storage medium
CN113242321A (en) * 2021-07-12 2021-08-10 广东睿江云计算股份有限公司 Data transmission method for mobile storage device
CN113342837A (en) * 2021-08-06 2021-09-03 北京金山云网络技术有限公司 Data transmission method, device, electronic equipment and computer readable medium
CN115242817A (en) * 2022-07-21 2022-10-25 阿里巴巴(中国)有限公司 Data access processing method, device, equipment and storage medium
CN115242817B (en) * 2022-07-21 2023-10-24 阿里巴巴(中国)有限公司 Data access processing method, device, equipment and storage medium
CN117311645A (en) * 2023-11-24 2023-12-29 武汉纺织大学 LSM storage metadata read amplification optimization method
CN117311645B (en) * 2023-11-24 2024-02-06 武汉纺织大学 LSM storage metadata read amplification optimization method

Similar Documents

Publication Publication Date Title
CN106021381A (en) Data access/storage method and device for cloud storage service system
US11593319B2 (en) Virtualized data storage system architecture
US10853339B2 (en) Peer to peer ownership negotiation
JP4547264B2 (en) Apparatus and method for proxy cache
CN103116618B (en) Based on Telefile mirror method and the system of the lasting buffer memory of client
JP4547263B2 (en) Apparatus and method for processing data in a network
JP4824085B2 (en) System and method for caching a network file system
US9069792B1 (en) Method and system for persistently cached, copy-on-write view of revision control trees
US20040133570A1 (en) Shared file system
US20150293984A1 (en) Decoupled content and metadata in a distributed object storage ecosystem
US20070038697A1 (en) Multi-protocol namespace server
WO2011121746A1 (en) File server apparatus, method of controlling storage system, and program
CN102662992A (en) Method and device for storing and accessing massive small files
US20170091215A1 (en) Write-back cache transaction replication to object-based storage
CN105138275B (en) A kind of Lustre memory system datas sharing method
Srinivasan et al. EMFS: email-based personal cloud storage
US8612717B2 (en) Storage system
EP3436973A1 (en) File system support for file-level ghosting
US11341163B1 (en) Multi-level replication filtering for a distributed database
KR100785774B1 (en) Obeject based file system and method for inputting and outputting
Suralkar et al. Review of distributed file systems: Case studies
Jerger Modeling and Performance Prediction of HDF5 data on Objectstorage
Junping Analysis of key technologies of distributed file system based on big data [J]
CN117687970A (en) Metadata retrieval method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161012

RJ01 Rejection of invention patent application after publication