CN112732667A - Usability enhancing method and system for distributed file system - Google Patents

Usability enhancing method and system for distributed file system Download PDF

Info

Publication number
CN112732667A
CN112732667A CN202110054134.1A CN202110054134A CN112732667A CN 112732667 A CN112732667 A CN 112732667A CN 202110054134 A CN202110054134 A CN 202110054134A CN 112732667 A CN112732667 A CN 112732667A
Authority
CN
China
Prior art keywords
data
cache layer
file system
distributed file
storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110054134.1A
Other languages
Chinese (zh)
Inventor
潘钰
刘远
郭镔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Minglue Zhaohui Technology Co Ltd
Original Assignee
Beijing Minglue Zhaohui Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Minglue Zhaohui Technology Co Ltd filed Critical Beijing Minglue Zhaohui Technology Co Ltd
Priority to CN202110054134.1A priority Critical patent/CN112732667A/en
Publication of CN112732667A publication Critical patent/CN112732667A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/178Techniques for file synchronisation in file systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The application discloses a method and a system for enhancing the availability of a distributed file system, wherein the method for enhancing the availability comprises the following steps: an input step: writing any data into the fastDFS distributed file system through the client; creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system; the processing steps are as follows: processing the data through the cache layer; an output step: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client. The method solves the problem of data loss in the fastDFS distributed file system during data writing.

Description

Usability enhancing method and system for distributed file system
Technical Field
The invention belongs to the field of availability enhancement methods of distributed file systems, and particularly relates to an availability enhancement method of a distributed file system.
Background
The FastDFS distributed file system aims to realize high availability and high concurrency of file operation, is particularly suitable for storage operation of small and medium files, and is widely applied to various domestic large enterprises due to the three characteristics of unique lightweight, group storage and peer-to-peer structures.
FastDFS consists of three parts: a tracking server (trakcer server), a storage server (storageserver), and a client (client). The tracking server mainly performs scheduling work and plays a role in load balancing. And recording the state information of all storage groups and storage servers in the cluster in the memory, wherein the state information is a hub for interaction between the client and the data server. The system is responsible for managing all storage servers (storage servers) and groups (groups), each storage server is connected with a Tracker after being started, information such as the group to which the storage server belongs is informed, periodic heartbeat is kept, and the Tracker establishes a group mapping table according to heartbeat information of the storage servers. The storage server is responsible for storing files and file attributes (metadata). The client is used as an initiator of the service request to perform data interaction with the tracker server or the storage node through a proprietary interface by using a TCP/IP protocol.
The high availability of the distributed file system is always a high concern, and how to enhance the availability of the distributed file system has great significance in improving the system performance. Although the fastDFS distributed file system has the advantages of low requirement on machine configuration, high uploading speed and the like, in the fastDFS, in a time window from the completion of file writing to synchronization to other storage in a group, once a failure occurs to the source storage, user data loss may be caused, and the data loss is generally unacceptable for the storage system, meanwhile, when the fastDFS synchronizes the data of a newly added node, a storage node is selected in a polling manner, and the node may store a large amount of data or is in use, so that the synchronization efficiency is very low. It can be seen that the existing fastDFS distributed file system does not reasonably solve the above problems, and the synchronization mode and the synchronization mechanism thereof still need to be further improved to avoid the problems of low efficiency and poor stability which may be caused in the traditional mode.
A schematic diagram of a fastDFS distributed file system uploading mechanism, as shown in fig. 1:
in the prior art, when the client writes a file into a fastDFS distributed file system, the client writes the file into one storage server in a group, namely the file is considered to be successfully written, and after the storage server writes the file, the background thread synchronizes the file to other storage servers in the same group.
The original synchronization scheme of the system is to select an existing storage node in a round-robin manner, and perform file synchronization work on a new storage node one by using the existing storage node
Disclosure of Invention
The embodiment of the application provides a method for enhancing the availability of a distributed file system, which is used for at least solving the problem of subjective factor influence in the related technology.
The invention provides a method for enhancing the availability of a distributed file system, which comprises the following steps:
an input step: writing any data into the fastDFS distributed file system through the client;
creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
the processing steps are as follows: processing the data through the cache layer;
an output step: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
The usability enhancing method described above, wherein the step of creating a cache layer includes:
the cache tier is a pool of storage created on a cluster of SSDs.
The usability enhancing method described above, wherein the processing step includes:
the write-back mode cache layer processing steps are as follows: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent; and/or;
the read-only mode cache layer processing step comprises: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
The usability enhancing method described above, wherein the processing step includes:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
The usability enhancing method described above, wherein the processing step includes:
and selecting one storage node according to the weight to complete the synchronization task.
The invention also includes a system for enhancing the availability of a distributed file system, comprising:
an input module: writing any data into the fastDFS distributed file system through the client;
a module for creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
a processing module: processing the data through the cache layer;
an output module: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
The usability enhancing system, wherein the module for creating a cache layer includes:
the cache tier is a pool of storage created on a cluster of SSDs.
The usability enhancing system as described above, wherein the processing module includes:
the write-back mode cache layer processing unit: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent; and/or;
the read-only mode cache layer processing unit: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
The usability enhancing system as described above, wherein the processing module includes:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
The usability enhancing system as described above, wherein the processing module includes:
and selecting one storage node according to the weight to complete the synchronization task.
The invention also provides a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements a method of enhancing the availability of a distributed file system as described in any of the above when executing the computer program.
The invention also provides a storage medium having a computer program stored thereon, which program, when executed by a processor, performs a method of enhancing the availability of a distributed file system as defined in any one of the preceding claims.
The invention has the beneficial effects that:
1. the method solves the problem of data loss in the fastDFS distributed file system during data writing.
2. The invention also solves the synchronization problem of the newly added nodes and avoids the problems of low efficiency and poor stability possibly caused by the traditional mode.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application.
In the drawings:
FIG. 1 is a flow diagram of a method of availability enhancement for a distributed file system;
FIG. 2 is a flow chart illustrating the substeps of step S3 in FIG. 1;
FIG. 3 is a schematic diagram of a fastDFS distributed file system upload mechanism;
FIG. 4 is a schematic diagram of a fastDFS distributed file system;
FIG. 5 is a schematic diagram of a distributed file system availability enhancement system;
fig. 6 is a block diagram of a computer device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application will be described and illustrated below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments provided in the present application without any inventive step are within the scope of protection of the present application.
It is obvious that the drawings in the following description are only examples or embodiments of the present application, and that it is also possible for a person skilled in the art to apply the present application to other similar contexts on the basis of these drawings without inventive effort. Moreover, it should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another.
Reference in the specification to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the specification. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those of ordinary skill in the art will explicitly and implicitly appreciate that the embodiments described herein may be combined with other embodiments without conflict.
Unless defined otherwise, technical or scientific terms referred to herein shall have the ordinary meaning as understood by those of ordinary skill in the art to which this application belongs. Reference to "a," "an," "the," and similar words throughout this application are not to be construed as limiting in number, and may refer to the singular or the plural. The present application is directed to the use of the terms "including," "comprising," "having," and any variations thereof, which are intended to cover non-exclusive inclusions; for example, a process, method, system, article, or apparatus that comprises a list of steps or modules (elements) is not limited to the listed steps or elements, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus. Reference to "connected," "coupled," and the like in this application is not intended to be limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. The term "plurality" as referred to herein means two or more. "and/or" describes an association relationship of associated objects, meaning that three relationships may exist, for example, "A and/or B" may mean: a exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. Reference herein to the terms "first," "second," "third," and the like, are merely to distinguish similar objects and do not denote a particular ordering for the objects.
The present invention is described in detail with reference to the embodiments shown in the drawings, but it should be understood that these embodiments are not intended to limit the present invention, and those skilled in the art should understand that functional, methodological, or structural equivalents or substitutions made by these embodiments are within the scope of the present invention.
Before describing in detail the various embodiments of the present invention, the core inventive concepts of the present invention are summarized and described in detail by the following several embodiments.
Before describing in detail the various embodiments of the present invention, the core inventive concepts of the present invention are summarized and described in detail by the following several embodiments.
Referring to fig. 1, fig. 1 is a flowchart of a method for enhancing availability of a distributed file system. As shown in fig. 1, the method for enhancing the availability of the distributed file system of the present invention includes:
input step S1: writing any data into the fastDFS distributed file system through the client;
create cache layer step S2: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
processing step S3: processing the data through the cache layer;
output step S4: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
The method, wherein the step of creating the cache layer includes:
the cache tier is a pool of storage created on a cluster of SSDs.
Referring to fig. 2, fig. 2 is a flowchart illustrating a sub-step of step S3 in fig. 1. As shown in fig. 2, the processing step S3 includes:
write-back mode cache layer processing step S31: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent;
read-only mode cache layer processing step S32: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
Wherein the processing step S3 includes:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
Wherein the processing step S3 further includes:
and selecting one storage node according to the weight to complete the synchronization task.
Hereinafter, the usability enhancing method of the distributed file system according to the present invention will be described in detail with reference to the following embodiments.
The first embodiment is as follows:
the invention improves the usability of the system by solving the problem of data loss during synchronization in the fastDFS distributed file system. When data is uploaded, after a client uploads a file to a Storage server node, the client informs that the file is successfully stored, a subsequent Storage has a single thread to synchronize with other Storage services in the same group, and if the Storage data is damaged before synchronization, the data uploaded by a user is permanently lost.
In order to solve the problems that data are lost when data are uploaded in synchronization and the efficiency is low when a synchronization node is selected in a round-robin mode when data of a newly added node are synchronized in the prior art, the usability enhancing method of the distributed file system is provided, so that the usability of the distributed file system is enhanced, and the efficiency of data synchronization and the like are improved.
As shown in fig. 3, which illustrates the existing architecture and working principle of the fastDFS distributed file system, the present invention adds a cache layer between a tracking Server (Tracker Server) and a Storage Server (Storage Server) on the basis of the original architecture, where the cache layer is a Storage pool created on a small SSD cluster, and then all client I/O operations are processed by the cache pool first, and then data is written back to the existing data Storage pool, so that the client can enjoy better performance on the cache pool, and the final data is also written into the Storage Server; in addition, the invention also provides a method for carrying out file synchronization on the new storage node by adopting an improved optimization algorithm to replace the original round-robin algorithm aiming at the new storage node.
The principle of the invention is shown in figure 4:
under the working scheme shown in fig. 4, each storage is connected to the cache layer after being started, information such as a group to which the storage belongs is informed, a periodic heartbeat is kept, then the cache layer is connected with the tracking server, and the tracking server is informed of each update. When a client uploads a file to a fastDFS distributed file system, a tracking server stores an idle IP and a port number in a storage layer, then the client directly uploads the file to a cache layer, then the cache layer accesses a storage server storageserver according to the IP and the port number, and a successful uploading mark is returned to the client until the cache layer uploads the file to each storageserver.
The cache layer can be configured in the following two modes:
1) write-back mode: when the cache pool is configured in writeback mode, the client writes the data to the cache pool and then immediately receives a write acknowledgement. The cache pool then migrates the data from the cache pool to the storage server and synchronizes on other storage in the same group, and finally is deleted from the caching layer by the caching agent. When the read operation from the client is processed, the data is firstly migrated from the storage layer to the cache layer by the cache layering agent and then returned to the client. Data is deleted from the cache pool only when it becomes no longer active or becomes cold.
2) read-only mode: when the cache pool is configured in read-only mode, it is only adapted to handle read operations by clients. The write operation of the client does not involve the cache layer, and all client writes are completed on the storage layer. In processing a read operation from a client, the caching layering proxy copies the requested data from the storage tier to the caching tier. The cache pool deletes inactive objects from the cache pool based on the policy we have configured for it.
Aiming at the data synchronization problem of the newly added storage nodes, the storage server informs the cache layer of the existing basic information, and also informs the cache layer of the state information of all the current storage nodes and endows different weights to all the storage nodes.
Example two:
referring to fig. 5, fig. 5 is a schematic structural diagram of an availability enhancement system of a distributed file system according to the present invention. Fig. 5 shows an availability enhancement system of a distributed file system according to the present invention, which includes:
an input module: writing any data into the fastDFS distributed file system through the client;
a module for creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
a processing module: processing the data through the cache layer;
an output module: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
The above system, wherein the module for creating a cache layer includes:
the cache tier is a pool of storage created on a cluster of SSDs.
The system, wherein the processing module comprises:
the write-back mode cache layer processing unit: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent; and/or;
the read-only mode cache layer processing unit: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
Wherein the processing module comprises:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
Wherein the processing module comprises:
and selecting one storage node according to the weight to complete the synchronization task.
Example three:
referring to fig. 6, the present embodiment discloses an embodiment of a computer device. The computer device may comprise a processor 81 and a memory 82 in which computer program instructions are stored.
Specifically, the processor 81 may include a Central Processing Unit (CPU), or A Specific Integrated Circuit (ASIC), or may be configured to implement one or more Integrated circuits of the embodiments of the present Application.
Memory 82 may include, among other things, mass storage for data or instructions. By way of example, and not limitation, memory 82 may include a Hard Disk Drive (Hard Disk Drive, abbreviated to HDD), a floppy Disk Drive, a Solid State Drive (SSD), flash memory, an optical Disk, a magneto-optical Disk, tape, or a Universal Serial Bus (USB) Drive or a combination of two or more of these. Memory 82 may include removable or non-removable (or fixed) media, where appropriate. The memory 82 may be internal or external to the data processing apparatus, where appropriate. In a particular embodiment, the memory 82 is a Non-Volatile (Non-Volatile) memory. In particular embodiments, Memory 82 includes Read-Only Memory (ROM) and Random Access Memory (RAM). The ROM may be mask-programmed ROM, Programmable ROM (PROM), Erasable PROM (EPROM), Electrically Erasable PROM (EEPROM), Electrically rewritable ROM (EAROM), or FLASH Memory (FLASH), or a combination of two or more of these, where appropriate. The RAM may be a Static Random-Access Memory (SRAM) or a Dynamic Random-Access Memory (DRAM), where the DRAM may be a Fast Page Mode Dynamic Random-Access Memory (FPMDRAM), an Extended data output Dynamic Random-Access Memory (EDODRAM), a Synchronous Dynamic Random-Access Memory (SDRAM), and the like.
The memory 82 may be used to store or cache various data files for processing and/or communication use, as well as possible computer program instructions executed by the processor 81.
The processor 81 implements the usability enhancing method of any of the above embodiments by reading and executing computer program instructions stored in the memory 82.
In some of these embodiments, the computer device may also include a communication interface 83 and a bus 80. As shown in fig. 5, the processor 81, the memory 82, and the communication interface 83 are connected via the bus 80 to complete communication therebetween.
The communication interface 83 is used for implementing communication between modules, devices, units and/or equipment in the embodiment of the present application. The communication port 83 may also be implemented with other components such as: the data communication is carried out among external equipment, image/data acquisition equipment, a database, external storage, an image/data processing workstation and the like.
Bus 80 includes hardware, software, or both to couple the components of the computer device to each other. Bus 80 includes, but is not limited to, at least one of the following: data Bus (Data Bus), Address Bus (Address Bus), Control Bus (Control Bus), Expansion Bus (Expansion Bus), and Local Bus (Local Bus). By way of example, and not limitation, Bus 80 may include an Accelerated Graphics Port (AGP) or other Graphics Bus, an Enhanced Industry Standard Architecture (EISA) Bus, a Front-Side Bus (FSB), a Hyper Transport (HT) Interconnect, an ISA (ISA) Bus, an InfiniBand (InfiniBand) Interconnect, a Low Pin Count (LPC) Bus, a memory Bus, a microchannel Architecture (MCA) Bus, a PCI (Peripheral Component Interconnect) Bus, a PCI-Express (PCI-X) Bus, a Serial Advanced Technology Attachment (SATA) Bus, a Video Electronics Bus (audio Electronics Association), abbreviated VLB) bus or other suitable bus or a combination of two or more of these. Bus 80 may include one or more buses, where appropriate. Although specific buses are described and shown in the embodiments of the application, any suitable buses or interconnects are contemplated by the application.
The computer device may implement the methods described in connection with fig. 1-2 based on the enhancement of the availability of the distributed file system.
In addition, in combination with the usability enhancing method of the distributed file system in the foregoing embodiments, the embodiments of the present application may be implemented by providing a computer-readable storage medium. The computer readable storage medium having stored thereon computer program instructions; the computer program instructions, when executed by a processor, implement a method of usability enhancement for a distributed file system as in any of the above embodiments.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
In conclusion, the method has the advantages that the problem of data loss in the fastDFS distributed file system during data writing is solved; the invention also solves the synchronization problem of the newly added nodes and avoids the problems of low efficiency and poor stability possibly caused by the traditional mode.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. A method for enhancing availability of a distributed file system, comprising:
an input step: writing any data into the fastDFS distributed file system through the client;
creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
the processing steps are as follows: processing the data through the cache layer;
an output step: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
2. The method of enhancing availability of a distributed file system of claim 1, wherein the step of creating a cache layer comprises:
the cache tier is a pool of storage created on a cluster of SSDs.
3. The method of enhancing availability of a distributed file system of claim 1, wherein the processing step comprises:
the write-back mode cache layer processing steps are as follows: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent;
the read-only mode cache layer processing step comprises: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
4. The method of enhancing availability of a distributed file system of claim 1, wherein the processing step comprises:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
5. The method of enhancing availability of a distributed file system of claim 4, wherein the processing step comprises:
and selecting one storage node according to the weight to complete the synchronization task.
6. A system for enhancing the availability of a distributed file system, comprising:
an input module: writing any data into the fastDFS distributed file system through the client;
a module for creating a cache layer: creating a cache layer between a tracking server and a storage server of the fastDFS distributed file system;
a processing module: processing the data through the cache layer;
an output module: and after the data is processed by the cache layer, uploading the data to each storage server of the fastDFS distributed file system, and returning a mark of successful uploading to the client.
7. The system for enhancing availability of a distributed file system of claim 1, wherein said creating a cache layer module comprises:
the cache tier is a pool of storage created on a cluster of SSDs.
8. The system for enhancing availability of a distributed file system of claim 1, wherein the processing module comprises:
the write-back mode cache layer processing unit: writing data to the cache layer through a client, immediately receiving a write-in confirmation, migrating the data from the cache layer to the storage server by the cache layer, synchronizing the data on the rest of the storage servers in the same group, and finally deleting the data from the cache layer by a cache agent; and/or;
the read-only mode cache layer processing unit: when processing read operation from client, the caching layering proxy copies the requested data from the storage layer to the caching layer, and deletes inactive objects from the caching layer.
9. The system for enhancing availability of a distributed file system of claim 1, wherein the processing module comprises:
and the storage server informs the cache layer of the state information of all current storage nodes of the cache layer and endows different weights to the storage nodes.
10. The method for enhancing availability of a distributed file system of claim 9, wherein the processing module comprises:
and selecting one storage node according to the weight to complete the synchronization task.
CN202110054134.1A 2021-01-15 2021-01-15 Usability enhancing method and system for distributed file system Pending CN112732667A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110054134.1A CN112732667A (en) 2021-01-15 2021-01-15 Usability enhancing method and system for distributed file system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110054134.1A CN112732667A (en) 2021-01-15 2021-01-15 Usability enhancing method and system for distributed file system

Publications (1)

Publication Number Publication Date
CN112732667A true CN112732667A (en) 2021-04-30

Family

ID=75591567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110054134.1A Pending CN112732667A (en) 2021-01-15 2021-01-15 Usability enhancing method and system for distributed file system

Country Status (1)

Country Link
CN (1) CN112732667A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113472874A (en) * 2021-06-28 2021-10-01 北京明略软件系统有限公司 File concurrent transmission method, system, storage medium and electronic equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113472874A (en) * 2021-06-28 2021-10-01 北京明略软件系统有限公司 File concurrent transmission method, system, storage medium and electronic equipment

Similar Documents

Publication Publication Date Title
US11301154B2 (en) Distributed storage method and device
US10715622B2 (en) Systems and methods for accelerating object stores with distributed caching
WO2019127916A1 (en) Data read/write method and device implemented on the basis of distributed consensus protocol
US20080271130A1 (en) Minimizing client-side inconsistencies in a distributed virtual file system
EP2824893B1 (en) Data storage method and device
CN107832423B (en) File reading and writing method for distributed file system
CN105493474B (en) System and method for supporting partition level logging for synchronizing data in a distributed data grid
CN107153643B (en) Data table connection method and device
US10771358B2 (en) Data acquisition device, data acquisition method and storage medium
CN108540510B (en) Cloud host creation method and device and cloud service system
CN113419824A (en) Data processing method, device, system and computer storage medium
CN113806300B (en) Data storage method, system, device, equipment and storage medium
CN111984191A (en) Multi-client caching method and system supporting distributed storage
CN113792885A (en) Execution method and related device for deep learning training
WO2016101748A1 (en) Method and device for caching network connection
JP2020532803A (en) Asynchronous updates of metadata tracks in response to cache hits generated via synchronous ingress and out, systems, computer programs and storage controls
JP5893028B2 (en) System and method for efficient sequential logging on a storage device that supports caching
WO2016101662A1 (en) Data processing method and relevant server
CN112732667A (en) Usability enhancing method and system for distributed file system
US20200394077A1 (en) Map reduce using coordination namespace hardware acceleration
CN109254958A (en) Distributed data reading/writing method, equipment and system
CN116560562A (en) Method and device for reading and writing data
WO2019109209A1 (en) Data replacement method for memory, server node, and data storage system
Yuan et al. Towards efficient deployment of cloud applications through dynamic reverse proxy optimization
CN116594551A (en) Data storage method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination