WO2015184925A1

WO2015184925A1 - Data processing method for distributed file system and distributed file system

Info

Publication number: WO2015184925A1
Application number: PCT/CN2015/076473
Authority: WO
Inventors: 朱鹏; 林健; 胡剑华
Original assignee: 中兴通讯股份有限公司
Priority date: 2014-10-24
Filing date: 2015-04-13
Publication date: 2015-12-10
Also published as: CN105589887A; WO2016061956A1; CN105589887B

Abstract

A data processing method for a distributed file system and a distributed file system. The method comprises: an FAC acquires file data and pushes the file data to an FAS (101); the FAS records the file data pushed by the FAC, records modification of corresponding metadata in the FAS this time in a buffer area, writes the modification into a log file, and returns a file data pushing completion message to the FAC (102); the FAC sends a metadata modification change request to an FLR (103); the FLR modifies the corresponding metadata according to the metadata modification change request, and records the modification in a log file system (104); and when unexpected restart of the FAS is detected, the FLR implements rollback operation of the corresponding modified data according to the log records (105).

Description

Data processing method of distributed file system and distributed file system

Technical field

This document relates to the field of distributed file storage technology, and in particular to a data processing method for a distributed file system and to a distributed file system.

Background technique

With the rapid development of the multimedia industry, and for reasons of cost, reliability, and other considerations, more and more manufacturers choose to deploy self-developed distributed upper-layer storage systems in their products, and distributed file systems have developed quickly as a result. A distributed file system can provide a throughput several times that of a common local file system, and it can provide high reliability: multiple copies and redundant-copy technology improve the reliability of data when a single machine fails. Compared with devices such as disk arrays, it has the advantages of being inexpensive and versatile.

Currently, some distributed file systems focus on throughput performance at the cost of weaker guarantees of file system consistency. Others guarantee synchronous consistency but greatly reduce the performance of writes and modifications. In a distributed system with a large number of machines, downtime and restarts are a normal occurrence, so it is necessary to ensure that the data in multiple copies of a file remains consistent after a server restarts.

Summary of the invention

This document provides a data processing method for a distributed file system, and a distributed file system, to avoid data inconsistency between multiple copies caused by FAS downtime.

A data processing method for a distributed file system, comprising:

The FAC obtains the file data and pushes it to the FAS;

The FAS records the file data pushed by the FAC, records the corresponding metadata modification on the FAS in a buffer, writes the modification to a log file, and returns a file data push completion message to the FAC;

After receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR;

The FLR modifies the corresponding metadata according to the metadata modification change request, and records the modification in the log file system;

When an abnormal restart of the FAS is detected, the FLR performs a rollback operation on the corresponding modified data according to the log record, and completes the repair of the log file system.

Optionally, the step in which the FLR modifies the corresponding metadata according to the metadata modification change request and records the modification in the log file system further includes:

The FLR adds the related processed entries to the buffer of the corresponding FAS in chronological order.

Optionally, the step in which, when an abnormal restart of the FAS is detected, the FLR performs a rollback operation on the corresponding modified data according to the log record and completes the repair of the log file system includes:

When an abnormal restart of the FAS is detected, the FLR rolls back, according to the log record, the modified data of the log record by a set time length counted back from the current time point of the log record, where the modified data within the set time length corresponds to all modification records of the FAS;

When the FAS is powered on, the FAS sends a rollback request to the FLR to roll back the corresponding data;

The FLR rolls back the corresponding data to the buffer of the corresponding FAS according to the rollback request, and completes the repair of the log file system.

Optionally, the step of the FLR monitoring the FAS abnormality includes:

Receiving, by the FLR, a heartbeat message periodically sent by the FAS;

When it is detected that the heartbeat message is lost multiple times in succession, it is determined that the FAS is abnormal.

Optionally, the step in which the FAC, after receiving the file data push completion message returned by the FAS, sends a metadata modification change request to the FLR includes:

After receiving the file data push completion message returned by the FAS, the FAC fills the corresponding metadata modification change request into the to-be-notified modification buffer;

When the set time arrives, all metadata modification change requests in the to-be-notified modification buffer are sent to the FLR.

The embodiment of the invention further provides a distributed file system, including: FAC, FAS and FLR, wherein:

The FAC is set to: obtain file data, and push it to the FAS;

The FAS is configured to: record file data pushed by the FAC, record the modification of the corresponding metadata on the FAS in the buffer, write the log file, and return a file data push completion message to the FAC;

The FAC is further configured to: after receiving the file data push completion message returned by the FAS, send a metadata modification change request to the FLR;

The FLR is configured to: modify the corresponding metadata according to the metadata modification change request, and record the modification in the log file system;

The FLR is further configured to: when the abnormal restart of the FAS is detected, perform a rollback operation of the corresponding modified data according to the log record, and complete the repair of the log file system.

Optionally, the FLR is further configured to: add the related processed entries to the buffer of the corresponding FAS in time sequence.

Optionally, the FLR is configured to: when an abnormal restart of the FAS is detected, roll back the modified data of the log record by a set time length counted back from the current time point of the log record, where the modified data within the set time length corresponds to all modification records of the FAS;

The FAS is configured to: when the FAS is powered on, send a rollback request to the FLR to roll back the corresponding data;

The FLR is configured to: roll back the corresponding data to the buffer of the corresponding FAS according to the rollback request, and complete the repair of the log file system.

Optionally, the FLR is configured to: receive a heartbeat message periodically sent by the FAS, and determine that the FAS is abnormal when it detects that the heartbeat message has been lost multiple times in succession.

Optionally, the FAC is configured to: after receiving the file data push completion message returned by the FAS, fill the corresponding metadata modification change request into the to-be-notified modification buffer; and, when the set time arrives, send all metadata modification change requests in the to-be-notified modification buffer to the FLR.

A computer readable storage medium storing computer executable instructions, the computer executable instructions being used to perform the above method.

In the data processing method for a distributed file system and the distributed file system proposed by the embodiments of the present invention, the FAC obtains the file data and pushes it to the FAS; the FAS records the file data pushed by the FAC, records the corresponding metadata modification on the FAS in a buffer, writes the log file, and returns a file data push completion message to the FAC; after receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR; the FLR modifies the corresponding metadata according to the metadata modification change request and records the modification in the log file system; and when an abnormal restart of the FAS is detected, the FLR performs a rollback operation on the corresponding modified data according to the log record and completes the repair of the log file system. This ensures that files are ultimately highly consistent after the distributed file system resets and restarts, avoids data inconsistency between multiple copies caused by machine downtime, and minimizes the delay and performance loss introduced by adding the log system.

Brief description of the drawings

FIG. 1 is a schematic flow chart of an embodiment of a data processing method of a distributed file system according to the present invention;

FIG. 2 is a schematic diagram of an interaction process between a FAC, a FAS, and an FLR according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of the interaction between the FAC and the FAS, and of the FAS flush-write timing, in the embodiment of the present invention;

FIG. 4 is a schematic flowchart of a process for a FAC to send a metadata modification change request to an FLR according to an embodiment of the present invention;

FIG. 5 is a schematic diagram of a processing flow of an FLR according to an embodiment of the present invention;

FIG. 6 is a schematic structural diagram of an embodiment of a distributed file system according to the present invention.

The embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

Embodiments of the invention

The solution of the embodiment of the present invention includes: the FAC acquires the file data and pushes it to the FAS; the FAS records the file data pushed by the FAC, records the corresponding metadata modification on the FAS in a buffer, writes the log file, and returns a file data push completion message to the FAC; after receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR; the FLR modifies the corresponding metadata according to the metadata modification change request and records the modification in the log file system; when the FAS is abnormally restarted, the FLR performs a rollback operation on the corresponding modified data according to the log record and completes the repair of the log file system. This ensures that files are ultimately highly consistent after the distributed file system resets and restarts, avoids data inconsistency between multiple copies caused by a machine restart, and minimizes the delay and performance loss introduced by adding the log system.

The system operating environment involved in the method embodiment of the present invention includes: FAC, FAS, and FLR, wherein:

FAC (File Access Client, also known as file service client): configured to provide the connection between the user and the internal data of the distributed file system.

FAS (File Access Server, also known as file data server): configured to store the actual data of the file.

FLR (File Location Register): configured to store metadata-related information about files and data.

Some current distributed file systems focus on throughput performance; they weaken the guarantee of file system consistency and do not provide a guarantee similar to that of a local log file system. Others guarantee synchronous consistency but greatly reduce the performance of writes and modifications. The related solutions cannot guarantee the consistency of data in multiple copies of a file after a server restarts.

The solution of this embodiment provides a log file system mode for double-layer metadata. It can provide all the characteristics of a lagging log file system without reducing the response of the file system, and it ensures high consistency of files after the system resets and restarts.

About the role of the log file, take the local file system as an example. The ext2 file system (second extended file system) is a general-purpose file system that does not have the functions of a log file system. When the machine resets or loses power, some data that was being written or modified is very likely to be lost, resulting in inconsistency between metadata and data. To address this problem, the ext3 file system (third extended file system) adds the function of a log system and corrects the consistency of the file system by replaying part of the log at power-on.
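The idea of restoring consistency by replaying a log can be illustrated with a minimal sketch. It is not ext3's on-disk format or recovery code; the record layout (write and commit tuples) and the names replay, journal, and recovered are assumptions chosen for illustration. Only updates that belong to a transaction with a commit record are applied, so a transaction interrupted by a power loss leaves no trace.

```python
def replay(journal, state):
    """Apply only the writes whose transaction has a commit record."""
    committed = {rec[1] for rec in journal if rec[0] == "commit"}
    for rec in journal:
        if rec[0] == "write" and rec[1] in committed:
            _, _txn, key, value = rec
            state[key] = value
    return state


# Hypothetical journal contents: transaction 1 committed, transaction 2 did not.
journal = [
    ("write", 1, "inode7.size", 4096),
    ("commit", 1),
    ("write", 2, "inode7.size", 8192),  # power lost before the commit record
]
recovered = replay(journal, {"inode7.size": 0})
```

After replay, the metadata reflects transaction 1 only, which is the consistent state.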

The double-layer metadata involved in this embodiment means that both the FLR and the FAS hold components corresponding to metadata: the FLR holds the location and name information for file segment data, and the FAS stores the correspondence between slice names and actual disk blocks. In layman's terms, a distributed file system whose management metadata is built on top of a local file system falls into the category of such double-layer metadata distributed file systems.
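The two metadata layers can be pictured as two lookup tables. The sketch below is illustrative only: the class names FlrMetadata and FasMetadata, the function resolve, and the example file, slice, and block values are all assumptions, not structures defined in this document.

```python
from dataclasses import dataclass, field


@dataclass
class FlrMetadata:
    """Upper layer (hypothetical): file segment -> (FAS id, slice name)."""
    segments: dict = field(default_factory=dict)

    def locate(self, file_name, segment_index):
        return self.segments[(file_name, segment_index)]


@dataclass
class FasMetadata:
    """Lower layer (hypothetical): slice name -> disk block numbers."""
    slices: dict = field(default_factory=dict)

    def blocks_of(self, slice_name):
        return self.slices[slice_name]


def resolve(flr, fas_by_id, file_name, segment_index):
    """Resolve a file segment to disk blocks by consulting both layers."""
    fas_id, slice_name = flr.locate(file_name, segment_index)
    return fas_id, fas_by_id[fas_id].blocks_of(slice_name)


# Example values are invented for illustration.
flr = FlrMetadata({("x", 0): ("fas-1", "x.slice0")})
fas_by_id = {"fas-1": FasMetadata({"x.slice0": [17, 18, 19]})}
location = resolve(flr, fas_by_id, "x", 0)
```

Because each layer is modified separately, each layer needs its own log, which is why the scheme keeps a log on both the FAS and the FLR.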

In the solution of this embodiment, the role of the FAC is to send the related metadata modification requests; it can rely on the related functions of the original distributed file system.

The FAS function is built on the lower-layer metadata of the double-layer metadata. This part guarantees that a valid metadata modification log can be constructed on the FAS to ensure consistency on the FAS side.

The FLR function is built on the upper-layer metadata of the double-layer metadata, and mainly handles log replay and rollback after modifications to the upper metadata layer.

The interaction process between FAC, FAS and FLR in the system can be as shown in Figures 1 and 2.

As shown in FIG. 1, an embodiment of the present invention provides a data processing method for a distributed file system, including:

Step S101, the FAC obtains file data and pushes it to the FAS;

Step S102, the FAS records the file data pushed by the FAC, records the modification of the corresponding metadata on the FAS in the buffer, writes the log file, and returns a file data push completion message to the FAC;

In addition, the FAS periodically flushes the modification buffer to the log file, ahead of the data.

After the FAS flushes the data to disk, a notification that the flush of the metadata modification succeeded is placed in the buffer and is periodically flushed to the log file.

The interaction between the FAC and the FAS, and the timing of the FAS flush writes, can be as shown in FIG. 3.

Taking the case where the FAC sends data a and data b to the FAS as an example, the processing flow is as follows:

1. The FAC sends data a to the FAS.

2. The FAS inserts a notification of the modification for data a into the modification buffer.

3. The FAS writes data a to the data buffer.

4. The FAS returns to the FAC, notifying the FAC that data a has been written successfully. (From this point on, the metadata modification notification can be sent to the FLR.)

5. The FAC sends data b to the FAS.

6. The FAS inserts a notification of the modification for data b into the modification buffer.

7. The FAS writes data b to the data buffer.

8. The FAS returns to the FAC, notifying the FAC that data b has been written successfully. (Steps 5 to 8 repeat the flow for a different piece of data; the notification is asynchronous, which is what keeps it fast.)

9. The timed log-writing task runs, and the modification notifications for a and b are written to disk.

10. The data of a is written to disk.

11. The notification that the data of a has been written to disk is inserted into the modification buffer.

12. The data of b is written to disk.

13. The notification that the data of b has been written to disk is inserted into the modification buffer.

14. The timed log-writing task runs, and the write-completion notifications for a and b are written to disk.

At this point, the complete log flow has been written, and the FAS-side log is complete.
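The fourteen steps above can be traced with a small in-memory model. This is a sketch under stated assumptions, not the implementation described here: the class name Fas, its attribute and method names, the tuple layout of log entries, and the use of lists and dicts to stand in for the log file and the disk are all hypothetical.

```python
class Fas:
    """Toy model of the FAS-side buffers and log (names are hypothetical)."""

    def __init__(self):
        self.mod_buffer = []   # pending log entries, held in memory
        self.data_buffer = {}  # received data not yet on disk
        self.log = []          # stands in for the on-disk log file
        self.disk = {}         # stands in for on-disk file data

    def receive(self, name, payload):
        self.mod_buffer.append(("modify", name))  # steps 2 and 6
        self.data_buffer[name] = payload          # steps 3 and 7
        return "push-complete"                    # steps 4 and 8

    def flush_log(self):
        """Timed log task: write buffered notifications to the log."""
        self.log.extend(self.mod_buffer)          # steps 9 and 14
        self.mod_buffer.clear()

    def flush_data(self, name):
        self.disk[name] = self.data_buffer.pop(name)  # steps 10 and 12
        self.mod_buffer.append(("done", name))        # steps 11 and 13


fas = Fas()
acks = [fas.receive("a", b"AAA"), fas.receive("b", b"BBB")]
fas.flush_log()      # step 9: intent records reach the log before the data
fas.flush_data("a")
fas.flush_data("b")
fas.flush_log()      # step 14: completion records reach the log
```

The ordering matters: the "modify" records are logged before the data reaches disk, so after a crash any "modify" record without a matching "done" record identifies a write that may be incomplete.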

Step S103, after receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR;

After receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR, and the relevant data of the log file system is attached to the metadata modification change request.

As an optional implementation manner, when the FAC sends a metadata modification change request to the FLR, the following scheme may be adopted:

After receiving the file data push completion message returned by the FAS, the FAC fills the corresponding metadata modification change request into the to-be-notified modification buffer.

Taking as an example the case where the FAC sends to the FLR the metadata modification change requests for data a, data b, data c, and data d, the processing flow by which the FAC sends metadata modification change requests to the FLR can be as shown in FIG. 4.

1. After the FAC writes file x, the modification of a is filled into the to-be-notified modification buffer;

2. After the FAC writes file x, the modification of b is filled into the to-be-notified modification buffer;

3. After the FAC writes file x, the modification of c is filled into the to-be-notified modification buffer;

4. After the FAC writes file y, the modification of d is filled into the to-be-notified modification buffer.

At this point, if the check finds that the required time interval has been reached but the timer has not yet been triggered, a metadata synchronization message is sent to the FLR and the timer is reset.

After a period of time, the timer is triggered; the messages in the to-be-notified modification buffer are sent to the FLR and the timer is reset. In this way, the number of messages to the FLR master can be greatly reduced, while real-time performance is maintained as far as possible because the time interval is short.
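The batching behaviour described above can be sketched as follows. This is one possible reading, not the implementation described here: the class name NotifyBuffer, its methods, the one-second interval, and the explicit now argument (used instead of a real timer so the example is deterministic) are all assumptions.

```python
class NotifyBuffer:
    """Toy to-be-notified modification buffer on the FAC (hypothetical API)."""

    def __init__(self, send, interval, now=0.0):
        self.send = send          # callable that delivers a batch to the FLR
        self.interval = interval  # required time interval between batches
        self.pending = []
        self.last_flush = now

    def add(self, request, now):
        self.pending.append(request)
        # Interval already reached but timer not yet fired: send right away.
        if now - self.last_flush >= self.interval:
            self.flush(now)

    def on_timer(self, now):
        """Called when the periodic timer fires."""
        self.flush(now)

    def flush(self, now):
        if self.pending:
            self.send(list(self.pending))
            self.pending.clear()
        self.last_flush = now     # reset the timer


sent = []
buf = NotifyBuffer(sent.append, interval=1.0)
buf.add(("x", "a"), now=0.1)
buf.add(("x", "b"), now=0.2)
buf.add(("x", "c"), now=0.3)
buf.add(("y", "d"), now=1.2)  # interval reached: a, b, c, d go out together
buf.on_timer(now=2.2)         # nothing pending: only the timer is reset
```

Four modifications reach the FLR as a single message, which is the reduction in message count that the text describes.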

Step S104, the FLR modifies the corresponding metadata according to the metadata modification change request, and records the modification in the log file system;

After receiving the metadata modification change request, the FLR modifies the corresponding metadata and records the modification in the log system using the log-related data attached to the request. Meanwhile, the FAS flushes the data to disk and writes the log once it has determined that the write succeeded.

In addition, the FLR adds the relevant processed entries to the buffer of the corresponding FAS in chronological order.
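A minimal sketch of how the FLR might apply a change request, log it, and keep a per-FAS buffer in chronological order is given below. The class name Flr, the entry layout (timestamp, sequence number, key, old value, new value), and the example keys and slice names are assumptions; the document does not specify a record format. Storing the old value is a design choice made here so that an entry can later be undone.

```python
import bisect
import itertools
from collections import defaultdict


class Flr:
    """Toy FLR: metadata table, log, and per-FAS buffers (hypothetical)."""

    def __init__(self):
        self.metadata = {}                # upper-layer metadata
        self.log = []                     # stands in for the log file system
        self.per_fas = defaultdict(list)  # fas_id -> entries sorted by time
        self._seq = itertools.count()     # tie-breaker for equal timestamps

    def apply(self, fas_id, timestamp, key, new_value):
        old_value = self.metadata.get(key)
        self.metadata[key] = new_value
        entry = (timestamp, next(self._seq), key, old_value, new_value)
        self.log.append(entry)
        # insort keeps the per-FAS buffer ordered by timestamp even if
        # requests arrive out of order.
        bisect.insort(self.per_fas[fas_id], entry)
        return entry


flr = Flr()
flr.apply("fas-1", 12.0, "x/seg1", "slice-b")
flr.apply("fas-1", 10.0, "x/seg0", "slice-a")  # arrives late, sorts first
flr.apply("fas-2", 11.0, "y/seg0", "slice-d")
fas1_times = [entry[0] for entry in flr.per_fas["fas-1"]]
```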

Step S105: When it is detected that the FAS is abnormally restarted, the FLR performs a rollback operation of the corresponding modified data according to the log record, and completes the repair of the log file system.

The FLR monitors whether the FAS is abnormal by receiving a heartbeat message periodically sent by the FAS.

The FAS periodically sends a still alive message to indicate that the FAS is still working.

When a heartbeat message from the FAS is detected, the FAS is determined to be normal; when the heartbeat message is lost multiple times in succession, the FAS is determined to be abnormal.

The FLR does not otherwise process the heartbeat messages sent by the FAS. However, if heartbeat messages are lost several times in a row, the FLR must apply lagged processing to the FAS whose heartbeats were lost, so that if the FAS really has gone down and reset, the related operations are rolled back.
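The consecutive-loss rule can be sketched as a counter per FAS that is reset by every heartbeat. The class name HeartbeatMonitor, its method names, and the threshold of three misses are assumptions; the text says only that the heartbeat is lost multiple times in succession.

```python
class HeartbeatMonitor:
    """Toy FLR-side heartbeat tracker (names and threshold are hypothetical)."""

    def __init__(self, max_missed=3):
        self.max_missed = max_missed
        self.missed = {}  # fas_id -> consecutive missed heartbeats

    def on_heartbeat(self, fas_id):
        """A heartbeat arrived: the FAS is normal, so reset its counter."""
        self.missed[fas_id] = 0

    def on_interval_elapsed(self, fas_id):
        """An expected heartbeat did not arrive. Returns True if abnormal."""
        self.missed[fas_id] = self.missed.get(fas_id, 0) + 1
        return self.missed[fas_id] >= self.max_missed


monitor = HeartbeatMonitor(max_missed=3)
monitor.on_heartbeat("fas-1")
first = [monitor.on_interval_elapsed("fas-1") for _ in range(2)]
monitor.on_heartbeat("fas-1")  # a heartbeat arrives, so the count restarts
second = [monitor.on_interval_elapsed("fas-1") for _ in range(3)]
```

Requiring consecutive losses avoids treating a single dropped message as a restart.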

When an abnormal restart of the FAS is detected, the FLR performs a rollback operation according to the log record; that is, the modified data of the log record is rolled back by a set time length counted back from the current time point. The modified data within the set time length corresponds to all the modification records of the FAS, that is, the data modification changes reported by the FAC.
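The time-window rollback can be sketched as an undo pass over the log entries of one FAS. The function name roll_back, the entry layout (timestamp, key, old value), and the example values are assumptions; in particular, the document does not state that old values are stored in the log, so that is a design choice made here to make undo possible.

```python
def roll_back(metadata, fas_log, now, window):
    """Undo, newest first, every logged change made within `window` of `now`.

    `fas_log` is a chronological list of (timestamp, key, old_value) entries
    for one FAS. Returns the entries that are older than the window.
    """
    cutoff = now - window
    for timestamp, key, old_value in reversed(fas_log):
        if timestamp >= cutoff:
            if old_value is None:
                metadata.pop(key, None)    # the key did not exist before
            else:
                metadata[key] = old_value  # restore the previous value
    return [entry for entry in fas_log if entry[0] < cutoff]


# Invented history for one FAS: three changes fall inside the window.
metadata = {"x/seg0": "slice-c", "y/seg0": "slice-d"}
fas_log = [
    (10.0, "x/seg0", None),       # x/seg0 created as slice-a
    (95.0, "x/seg0", "slice-a"),  # changed to slice-b
    (98.0, "x/seg0", "slice-b"),  # changed to slice-c
    (99.0, "y/seg0", None),       # y/seg0 created as slice-d
]
remaining = roll_back(metadata, fas_log, now=100.0, window=10.0)
```

Undoing in reverse order matters when the same key was changed more than once inside the window, as with x/seg0 here.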

When the FAS is powered on, a rollback request is sent to the FLR to roll back the corresponding data; the FLR rolls back the corresponding data to the buffer of the corresponding FAS according to the rollback request, and completes the repair of the log file system.

The processing flow of the FLR in this embodiment can be as shown in FIG. 5.

When one of the FASs is abnormally restarted, the log system enters the repair process. The process is first triggered on the FLR. When the FLR confirms that an FAS is restarted, the log system will roll back all the modification records corresponding to the FAS for a certain length of time through the log records on the FLR. At the same time, when the FAS is powered on, the logs recorded by the FAS are used to roll back related data written to the FAS but not written to the disk, and a rollback request is sent to the FLR to roll back the corresponding data.
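The FAS-side half of the repair can be sketched as a scan of the local log at power-on. The function name recover, the ("modify", name) and ("done", name) record layout, and the choice to discard incomplete data are assumptions made for illustration; the document states only that the FAS uses its local log to roll back data that was written to the FAS but not written to disk, and then sends a rollback request to the FLR.

```python
def recover(log, disk, send_rollback_request):
    """Find writes that were logged as started but never logged as done."""
    started = [name for kind, name in log if kind == "modify"]
    done = {name for kind, name in log if kind == "done"}
    incomplete = [name for name in started if name not in done]
    for name in incomplete:
        disk.pop(name, None)  # discard data that may be only partly written
    if incomplete:
        send_rollback_request(incomplete)  # ask the FLR to roll back too
    return incomplete


# Invented state after a crash: b was started but its completion never logged.
log = [("modify", "a"), ("modify", "b"), ("done", "a")]
disk = {"a": b"AAA", "b": b"BB"}
requests = []
incomplete = recover(log, disk, requests.append)
```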

When both processes are complete, the repair has finished successfully. During the repair, the system continues to provide consistent data from the other replicas, so the repair is invisible to the user.
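How reads might be served while one FAS is under repair can be sketched in a few lines. The function name pick_replica and the FAS identifiers are assumptions; the document does not describe how a replica is chosen.

```python
def pick_replica(replica_fas_ids, repairing):
    """Return the first replica holder that is not currently being repaired."""
    for fas_id in replica_fas_ids:
        if fas_id not in repairing:
            return fas_id
    raise LookupError("no healthy replica available")


# Hypothetical layout: three replicas, the first one is being repaired.
chosen = pick_replica(["fas-1", "fas-2", "fas-3"], repairing={"fas-1"})
```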

The system of this embodiment can provide all the characteristics of the lagging log file system without reducing the response of the file system, and ensure high consistency of the file after the system resets and restarts.

Compared with the related technology, in the solution of this embodiment, the FAC obtains the file data and pushes it to the FAS; the FAS records the file data pushed by the FAC, records the corresponding metadata modification on the FAS in a buffer, writes the log file, and returns a file data push completion message to the FAC; after receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the FLR; the FLR modifies the corresponding metadata according to the metadata modification change request and records the modification in the log file system; and when an abnormal restart of the FAS is detected, the FLR performs a rollback operation on the corresponding modified data according to the log record and completes the repair of the log file system. This ensures that files are ultimately highly consistent after the distributed file system resets and restarts, avoids data inconsistency between multiple copies caused by machine downtime, and minimizes the delay and performance loss introduced by adding the log system.

The log system of this embodiment is not sensitive to, or correlated with, the scale of the distributed system: the system pressure is constant, and the pressure on the log system does not increase as the cluster expands. It converges well and adds no overhead on the network. The pressure on the disk where the log system resides is extremely small, making this a high-performance, low-latency log file system at the expense of high error rate.

As shown in FIG. 6, an embodiment of the present invention provides a distributed file system, including: FAC 201, FAS 202, and FLR 203, where:

The FAC 201 is configured to: obtain file data, and send it to the FAS 202;

The FAS 202 is configured to: record the file data pushed by the FAC 201, record the modification of the corresponding metadata on the FAS 202 in the buffer, write the log file, and return a file data push completion message to the FAC 201;

The FAC 201 is further configured to: after receiving the file data push completion message returned by the FAS 202, send a metadata modification change request to the FLR 203;

The FLR 203 is configured to: modify the corresponding metadata according to the metadata modification change request, and record the modification in the log file system;

The FLR 203 is further configured to: when it is detected that the FAS 202 is abnormally restarted, perform a rollback operation of the corresponding modified data according to the log record, and complete the repair of the log file system.

FAC 201: the file service client, configured to provide the connection between the user and the internal data of the distributed file system.

FAS 202: the file data server, configured to store the actual data of the file.

FLR 203: the file location register, configured to store metadata-related information about files and data.

About the role of the log file, take the local file system as an example. The ext2 file system is a general-purpose file system that does not have the functions of a log file system. When the machine resets or loses power, some data that was being written or modified is likely to be lost, resulting in inconsistency between metadata and data. To solve this problem, the ext3 file system adds the function of a log system; at power-on, the consistency of the file system is corrected by replaying part of the log.

The double-layer metadata involved in this embodiment means that both the FLR 203 and the FAS 202 hold components corresponding to metadata: the FLR 203 holds the location and name information for file segment data, and the FAS 202 stores the correspondence between slice names and actual disk blocks. In layman's terms, a distributed file system whose management metadata is built on top of a local file system falls into the category of such double-layer metadata distributed file systems.

In the solution of this embodiment, the role of the FAC 201 is to send the related metadata modification requests; it can rely on the related functions of the original distributed file system.

The FAS 202 function is built on the lower-layer metadata of the double-layer metadata. This part guarantees that a valid metadata modification log can be constructed on the FAS 202 to ensure consistency on the FAS 202 side.

The FLR 203 function is built on the upper-layer metadata of the double-layer metadata, and mainly handles log replay and rollback after modifications to the upper metadata layer.

The interaction process between the FAC 201, the FAS 202, and the FLR 203 in the system can be as shown in FIG. 2.

First, the FAC 201 acquires the file data and pushes it to the FAS 202 for storing the data.

The FAS 202 records the file data pushed by the FAC 201, records the modification of the metadata on the FAS 202 in the buffer, and returns a file data push completion message to the FAC 201.

In addition, the FAS 202 periodically flushes the modification buffer to the log file, ahead of the data.

After the FAS 202 flushes the data to disk, a notification that the flush of the metadata modification succeeded is placed in the buffer and is periodically flushed to the log file.

The interaction between the FAC 201 and the FAS 202 and the FAS 202 flash write timing can be as shown in FIG. 3.

Taking the case where the FAC 201 sends data a and data b to the FAS 202 as an example, the processing flow is as follows:

1. The FAC 201 sends data a to the FAS 202.

2. The FAS 202 inserts a notification of the modification for data a into the modification buffer.

3. The FAS 202 writes data a to the data buffer.

4. The FAS 202 returns to the FAC 201, notifying the FAC 201 that data a has been written successfully. (From this point on, the metadata modification notification can be sent to the FLR 203.)

5. The FAC 201 sends data b to the FAS 202.

6. The FAS 202 inserts a notification of the modification for data b into the modification buffer.

7. The FAS 202 writes data b to the data buffer.

8. The FAS 202 returns to the FAC 201, notifying the FAC 201 that data b has been written successfully. (Steps 5 to 8 repeat the flow for a different piece of data; the notification is asynchronous, which is what keeps it fast.)

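9. The timed log-writing task runs, and the modification notifications for a and b are written to disk.

10. The data of a is written to disk.

11. The notification that the data of a has been written to disk is inserted into the modification buffer.

12. The data of b is written to disk.

13. The notification that the data of b has been written to disk is inserted into the modification buffer.

14. The timed log-writing task runs, and the write-completion notifications for a and b are written to disk.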

At this point, the complete log flow is written, and the FAS 202 side log system is completely written.

Upon receiving the file data push completion message returned by the FAS 202, the FAC 201 transmits a metadata modification change request to the FLR 203, and the relevant data of the log file system is attached to the metadata modification change request.

As an optional implementation manner, when the FAC 201 sends a metadata modification change request to the FLR 203, the following scheme may be adopted:

After receiving the file data push completion message returned by the FAS 202, the FAC 201 fills the corresponding metadata modification change request into the to-be-notified modification buffer.

When the set time arrives, all metadata modification change requests in the to-be-notified modification buffer are sent to the FLR 203.

Taking as an example the case where the FAC 201 sends to the FLR 203 the metadata modification change requests for data a, data b, data c, and data d, the processing flow by which the FAC 201 sends metadata modification change requests to the FLR 203 can be as shown in FIG. 4.

1. After the FAC 201 writes file x, the modification of a is filled into the to-be-notified modification buffer;

2. After the FAC 201 writes file x, the modification of b is filled into the to-be-notified modification buffer;

3. After the FAC 201 writes file x, the modification of c is filled into the to-be-notified modification buffer;

4. After the FAC 201 writes file y, the modification of d is filled into the to-be-notified modification buffer.

At this point, if the check finds that the required time interval has been reached but the timer has not yet been triggered, a metadata synchronization message is sent to the FLR 203 and the timer is reset.

After a period of time, the timer is triggered; the messages in the to-be-notified modification buffer are sent to the FLR 203 and the timer is reset. This processing can greatly reduce the number of messages to the FLR 203 master, while real-time performance is maintained as far as possible because the time interval is short.

After receiving the metadata modification change request, the FLR 203 modifies the corresponding metadata and records the modification in the log system using the log-related data attached to the request. Meanwhile, the FAS 202 flushes the data to disk and writes the log once it has determined that the write succeeded.

In addition, the FLR 203 adds the relevant processed entries to the buffer of the corresponding FAS 202 in chronological order.

When it is detected that the FAS 202 is abnormally restarted, the FLR 203 performs a rollback operation of the corresponding modified data according to the log record, and completes the repair of the log file system.

The FLR 203 monitors whether the FAS 202 is abnormal by receiving a heartbeat message periodically sent by the FAS 202.

The FAS 202 periodically sends a still alive message to indicate that the FAS 202 is still working.

When the heartbeat message from the FAS 202 is detected, it is determined that the FAS 202 is normal, and when the consecutively lost heartbeat messages are detected, it is determined that the FAS 202 is abnormal.

For heartbeat messages sent by FAS 202, FLR 203 does not process, but if some In the case of continuous loss of heartbeat messages, the FLR 203 needs to lag the FAS 202 of the lost heartbeat message to ensure that if the real FAS 202 is down, it will do the scrolling of the related operations.

When it is detected that the FAS 202 is abnormally restarted, the FLR 203 performs a rollback operation according to the log record, that is, the modified data of the log record is retracted from the current time point of the log record for a set time length, the set time The modified data of the length corresponds to all the modified records of the FAS 202, that is, the data modification changes reported by the FAC.

When the FAS 202 is powered on, it sends a rollback request to the FLR 203 to roll back the corresponding data; the FLR 203 rolls back the corresponding data to the buffer of the corresponding FAS 202 according to the rollback request and completes the repair of the log file system.

The processing flow of the FLR 203 in this embodiment can be as shown in FIG. 5.

When one of the FAS 202 units restarts abnormally, the log system enters the repair process. The flow is first triggered on the FLR 203: when the FLR 203 confirms that an FAS 202 has restarted, the log system uses the log records on the FLR 203 to roll back all modification records corresponding to that FAS 202 within a specific length of time. At the same time, when the FAS 202 is powered on, it uses its locally recorded log to roll back the data that was written to the FAS 202 but not yet written to disk, and sends a rollback request to the FLR 203 to roll back the corresponding data.
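
The FAS-side half of the repair can be sketched as a scan of the local log at power-on. The function name recover_on_power_on, the flushed flag on each log entry, and the shape of the rollback request are hypothetical, because the source does not define the local log format or the request message.

```python
def recover_on_power_on(fas_id, local_log):
    """Split the local FAS log into durable entries and entries to roll back."""
    durable = [e for e in local_log if e["flushed"]]
    unflushed = [e for e in local_log if not e["flushed"]]
    # Assumed request format: the FAS names the files whose changes must be undone.
    rollback_request = {
        "fas_id": fas_id,
        "files": sorted({e["file"] for e in unflushed}),
    }
    return durable, rollback_request


local_log = [
    {"file": "a.dat", "flushed": True},
    {"file": "b.dat", "flushed": False},
    {"file": "c.dat", "flushed": False},
]
durable, request = recover_on_power_on("fas1", local_log)
```

Entries that reached the disk are kept, and the remaining ones form the rollback request that the FAS would send to the FLR.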

Compared with the related art, in the solution of this embodiment, the FAC 201 acquires the file data and pushes it to the FAS 202. The FAS 202 records the file data pushed by the FAC 201, records the modification of the corresponding metadata on the FAS 202 in the buffer, writes the modification to a log file, and returns a file data push completion message to the FAC 201. After receiving the file data push completion message returned by the FAS 202, the FAC 201 sends a metadata modification change request to the FLR 203. The FLR 203 modifies the corresponding metadata according to the metadata modification change request and records the modification in the log file system. When an abnormal restart of the FAS 202 is detected, the FLR 203 performs the rollback operation of the corresponding modified data according to the log records and completes the repair of the log file system. This ensures high eventual consistency of files after the file system is reset and restarted and avoids the data inconsistency between multiple copies caused by a machine restart, while minimizing the delay and performance loss introduced by adding the log system.

In the embodiment of the present invention, the load on the log system is independent of the scale of the distributed system: the system pressure is constant and does not increase as the cluster expands. The log system has good convergence and adds no overhead on the network. The pressure on the disk where the log system resides is extremely small. The result is a high-performance, low-latency log file system, obtained at the expense of a high error rate.

One of ordinary skill in the art will appreciate that all or a portion of the steps of the above-described embodiments can be implemented by a computer program, which can be stored in a computer readable storage medium and executed on a corresponding hardware platform (e.g., a system, apparatus, device, or component); when executed, the program performs one or a combination of the steps of the method embodiments.

Alternatively, all or part of the steps of the above embodiments may be implemented using integrated circuits. The steps may be fabricated as individual integrated circuit modules, or multiple modules or steps may be fabricated as a single integrated circuit module.

The devices/function modules/functional units in the above embodiments may be implemented by a general-purpose computing device, which may be centralized on a single computing device or distributed over a network of multiple computing devices.

When the device/function module/functional unit in the above embodiment is implemented in the form of a software function module and sold or used as a stand-alone product, it can be stored in a computer readable storage medium. The above mentioned computer readable storage medium may be a read only memory, a magnetic disk or an optical disk or the like.

Industrial applicability

The embodiment of the invention ensures high eventual consistency of files after the file system is reset and restarted, avoids the data inconsistency between multiple copies caused by a machine restart, and minimizes the delay and performance loss introduced by adding the log system.

Claims

A data processing method for a distributed file system, comprising:

The file service client FAC obtains the file data and sends it to the file data server FAS;

The FAS records the file data pushed by the FAC, records the modification of the corresponding metadata on the FAS in the buffer, writes the modification to a log file, and returns a file data push completion message to the FAC;

After receiving the file data push completion message returned by the FAS, the FAC sends a metadata modification change request to the file location register FLR;

The FLR modifies the corresponding metadata according to the metadata modification change request, and records the modification to the log file system;

When the abnormal restart of the FAS is detected, the FLR performs a rollback operation of the corresponding modified data according to the log record, and completes the repair of the log file system.
The method according to claim 1, wherein the step of the FLR modifying the corresponding metadata according to the metadata modification change request and recording the modification to the log file system further includes:

The FLR adds the relevant processed entries to the buffer of the corresponding FAS in chronological order.
The method according to claim 1, wherein the step of the FLR performing, when the abnormal restart of the FAS is detected, a rollback operation of the corresponding modified data according to the log record and completing the repair of the log file system includes:

When the abnormal restart of the FAS is detected, the FLR rolls back, according to the log record, the modified data of the log record by a set time length from the current time point of the log record, wherein the modified data of the set time length corresponds to all modification records of the FAS;

When the FAS is powered on, the FAS sends a rollback request to the FLR to roll back the corresponding data;

The FLR rolls back the corresponding data to the buffer of the corresponding FAS according to the rollback request, and completes the repair of the log file system.
The method of claim 1, 2 or 3, wherein the step of the FLR monitoring whether the FAS is abnormal includes:

Receiving, by the FLR, a heartbeat message periodically sent by the FAS;

When it is detected that the heartbeat message is lost multiple times in succession, it is determined that the FAS is abnormal.
The method according to claim 4, wherein the step of the FAC transmitting the metadata modification change request to the FLR after receiving the file data push completion message returned by the FAS comprises:

After receiving the file data push completion message returned by the FAS, the FAC fills the corresponding metadata modification change request into the modification to-be-notified buffer;

When the set timing time arrives, all metadata modification change requests in the modification to-be-notified buffer are sent to the FLR.
A distributed file system comprising: a file service client FAC, a file data server FAS, and a file location register FLR, wherein:

The FAC is set to: obtain file data, and push it to the FAS;

The FAS is configured to: record file data pushed by the FAC, record the modification of the corresponding metadata on the FAS in the buffer, write the modification to a log file, and return a file data push completion message to the FAC;

The FAC is further configured to: after receiving the file data push completion message returned by the FAS, send a metadata modification change request to the FLR;

The FLR is configured to: modify the corresponding metadata according to the metadata modification change request, and record the modification to the log file system;

The FLR is further configured to: when the abnormal restart of the FAS is detected, perform a rollback operation of the corresponding modified data according to the log record, and complete the repair of the log file system.
The system of claim 6, wherein:

The FLR is further configured to add the related processed entries to the buffer of the corresponding FAS in order of time.
The system of claim 6, wherein:

The FLR is configured to: when the abnormal restart of the FAS is detected, roll back, according to the log record, the modified data of the log record by a set time length from the current time point of the log record, wherein the modified data of the set time length corresponds to all modification records of the FAS;

The FAS is configured to: when the FAS is powered on, send a rollback request to the FLR to roll back the corresponding data;

The FLR is configured to: roll back the corresponding data to the buffer of the corresponding FAS according to the rollback request, and complete the repair of the log file system.
A system according to claim 6, 7 or 8, wherein:

The FLR is configured to: receive a heartbeat message periodically sent by the FAS; and determine that the FAS is abnormal when it is detected that the heartbeat message is lost multiple times in succession.
The system of claim 9, wherein:

The FAC is configured to: after receiving the file data push completion message returned by the FAS, fill the corresponding metadata modification change request into the modification to-be-notified buffer; and, when the set timing time arrives, send all metadata modification change requests in the modification to-be-notified buffer to the FLR.
A computer readable storage medium storing computer executable instructions for performing the method of any of claims 1-5.