CN102833273A - Data recovery method for temporary faults and distributed caching system


Info

Publication number
CN102833273A
CN102833273A, CN2011101576931A, CN201110157693A
Authority
CN
China
Prior art keywords
data
server
key
replica
replica server
Prior art date
Legal status
Granted
Application number
CN2011101576931A
Other languages
Chinese (zh)
Other versions
CN102833273B (en)
Inventor
郭斌
陈典强
韩银俊
宫微微
Current Assignee
ZTE Corp
Original Assignee
ZTE Corp
Priority date
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201110157693.1A
Priority to PCT/CN2012/070849
Publication of CN102833273A
Application granted
Publication of CN102833273B
Current legal status: Active

Classifications

    • G06F11/1658 — Error detection or correction of the data by redundancy in hardware: data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit
    • G06F11/2038 — Error detection or correction of the data by redundancy in hardware using active fault-masking, where processing functionality is redundant, with a single idle spare processing component
    • H04L67/1001 — Protocols in which an application is distributed across nodes in the network, for accessing one among a plurality of replicated servers
    • H04L67/142 — Managing session states for stateless protocols; signalling session states; state transitions; keeping-state mechanisms
    • H04L69/40 — Network arrangements, protocols or services for recovering from a failure of a protocol instance or entity, e.g. service redundancy protocols, protocol state redundancy or protocol service redirection


Abstract

The invention discloses a data recovery method for temporary faults. The method comprises the following steps: when a collaboration server initiates a data operation to replica servers and finds that a replica server has failed, it generates a Key change record containing the keys of the data operated on; after the replica server recovers from the fault, the collaboration server initiates a data repair operation to the replica server according to the Key change record; and the replica server performs local data repair according to the data repair operation initiated by the collaboration server. The invention further discloses a distributed caching system for data repair upon temporary faults. The method and system guarantee consistency among the multiple replicas of a data item after a temporary fault is resolved, improve the accuracy of the data stored in the distributed caching system, promote its quality attributes, and optimize the application experience.

Description

Data recovery method for temporary faults and distributed caching system
Technical field
The present invention relates to the technical field of cloud computing, and in particular to a data recovery method for temporary faults and a distributed caching system.
Background art
Cloud computing (Cloud Computing) is the product of the convergence of traditional computer and network technologies such as grid computing (Grid Computing), distributed computing (Distributed Computing), parallel computing (Parallel Computing), utility computing (Utility Computing), network storage (Network Storage Technologies), virtualization (Virtualization), and load balancing (Load Balance). It aims to integrate, through the network, multiple relatively low-cost computing entities into one system with powerful computing capability. Distributed caching is a field within the scope of cloud computing; its role is to provide distributed storage services for massive data and the capability of high-speed read/write access.
A distributed caching system is formed by a number of server nodes and clients connected to one another, where the server nodes are responsible for storing data, and the clients can perform operations such as writing, reading, updating, and deleting data on the server nodes. In general, the written data is not kept on a single server node only; instead, copies of the same data are kept on multiple server nodes, backing each other up. A data item consists of a key (Key) and a value (Value): the Key is the index of the data, the Value is the data content represented by the Key, and Key and Value are logically in one-to-one correspondence.
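As an illustration of this data model, the following minimal Python sketch (all names are hypothetical, and the integer version field merely stands in for whatever version information such a system attaches to each replica) shows how the same Key can map to diverged copies on different nodes:

```python
from dataclasses import dataclass

@dataclass
class Record:
    """One replica of a data item: the Key indexes the Value, and the
    version information lets replicas be compared after a fault."""
    key: str        # Key: the logical index of the data
    value: str      # Value: the data content the Key represents
    version: int    # hypothetical monotonically increasing version number

# Each server node keeps its own copy of the same logical data, so after
# a fault the same Key may map to different versions on different nodes.
node_a = {"user:42": Record("user:42", "alice-v3", version=3)}
node_b = {"user:42": Record("user:42", "alice-v2", version=2)}
```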
In a distributed caching system, guaranteeing data consistency is a key issue. After fault recovery, the replicas of a data item kept on the various server nodes of the distributed caching system may become inconsistent. For example, while the data corresponding to a Key is repeatedly written, updated, deleted, and so on, a network failure or any of various hardware or software failures may occur; after the fault is recovered, the Value corresponding to that Key as stored on different server nodes may differ.
In the prior art, if data is to be read by Key immediately after fault recovery, every replica is fetched and compared, the correct Value is selected according to a certain data-version comparison rule, and the stale data is repaired at the same time, so as to keep the replicas of the same data consistent. However, if between the fault recovery and the time the data needs to be read by Key, the server nodes holding the replicas fail repeatedly one after another, then when the data needs to be read by Key, situations may arise in which no data can be read, stale data is read, or there is no way to tell which of the replicas read is newer. This lowers the quality attributes of the distributed caching system and severely harms the experience of applications using it.
Summary of the invention
In view of this, the main purpose of the present invention is to provide a data recovery method for temporary faults and a distributed caching system, capable of keeping the replicas of the same data consistent after a server node in the distributed caching system recovers from a fault.
To achieve the above purpose, the technical scheme of the present invention is realized as follows:
The present invention provides a data recovery method for temporary faults, the method comprising:
When a collaboration server initiates a data operation to replica servers and finds that a replica server has failed, it generates a Key change record containing the key (Key) of each data item operated on;
After the replica server recovers from the fault, the collaboration server initiates a data repair operation to the replica server according to the Key change record;
The replica server performs local data repair according to the data repair operation initiated by the collaboration server.
In the above scheme, initiating a data operation to a replica server comprises: initiating a data write operation or an update operation to the replica server.
In the above scheme, generating the Key change record containing each data Key operated on during the fault further comprises:
The collaboration server establishes a save set for the replica server;
During the fault of the replica server, the collaboration server generates the Key change record containing each data Key operated on during the fault, and saves it into the save set of the replica server.
In the above scheme, the collaboration server initiating the data repair operation to the replica server according to the saved Key change record comprises:
The collaboration server obtains all data replicas corresponding to each Key in the Key change record, and identifies the most recently operated data replica of each Key in the Key change record;
The identified most recently operated data replicas are used to initiate the data repair operation to the replica server.
In the above scheme, identifying the most recently operated data replica of each Key in the Key change record is:
Comparing the versions of the data replicas of the same Key among all the obtained data replicas, to obtain the most recently operated data replica of each Key.
In the above scheme, the collaboration server obtaining all data replicas corresponding to each Key in the Key change record is:
The collaboration server reads the data replica of each Key from every replica server corresponding to that Key, and obtains the data replica of each Key from itself.
In the above scheme, the replica server performing local data repair according to the data repair operation initiated by the collaboration server comprises:
The replica server updates its locally stored data replicas according to the most recently operated data replica of each Key in the Key change record.
In the above scheme, after the replica server performs local data repair according to the data repair operation initiated by the collaboration server, the method further comprises:
The replica server returns a repair result to the collaboration server after updating the locally stored data replicas;
When the repair result is failure, the collaboration server continues to initiate the data update operation to the replica server.
The present invention also provides a distributed caching system for data repair upon temporary faults, the system comprising: a collaboration server and one or more replica servers, wherein
The collaboration server is used to, when initiating a data operation to the one or more replica servers and finding that a replica server has failed, generate a Key change record containing each data Key operated on; and, after the replica server recovers from the fault, to initiate a data repair operation to the replica server according to the Key change record;
The one or more replica servers are used to, after fault recovery, perform local data repair according to the data repair operation initiated by the collaboration server.
In the above scheme, the collaboration server is also used to establish a save set for each replica server; and, during the fault of each replica server, to generate the Key change record containing each data Key operated on during the fault and save it into the save set of that replica server.
In the above scheme, the collaboration server is also used to obtain all data replicas corresponding to each Key in the Key change record, identify the most recently operated data replica of each Key in the Key change record, and initiate the data repair operation to the replica server using the identified most recently operated data replicas.
In the above scheme, the replica server is also used to update the locally stored data replicas according to the most recently operated data replicas with which the collaboration server initiates the data repair operation.
In the above scheme, the replica server is also used to return a repair result to the collaboration server after updating the locally stored data replicas; the collaboration server is also used to, when the repair result fed back by the replica server is failure, continue to initiate the data update operation to the replica server.
With the data recovery method for temporary faults and the distributed caching system provided by the present invention, the collaboration server generates a Key change record when it finds that a replica server has failed; after the replica server recovers from the fault, the collaboration server initiates a data repair operation to the replica server according to the Key change record, so that the replica server can perform local data repair in time. This guarantees that the replicas of a data item remain consistent after recovery from a temporary fault, improves the accuracy of the data stored in the distributed caching system, promotes the quality attributes of the distributed caching system, and optimizes the application experience.
Brief description of the drawings
Fig. 1 is a flowchart of the implementation of a data recovery method for temporary faults according to the present invention;
Fig. 2 is a schematic diagram of the composition of the distributed caching system in a specific embodiment of the present invention;
Fig. 3 is a flowchart of the implementation of data repair upon a temporary fault of the distributed caching system in a specific embodiment of the present invention.
Detailed description of the embodiments
The basic idea of the present invention is as follows: when a data operation such as a write or an update is performed and the collaboration server in the distributed caching system finds that a replica server has failed, a change record of the data is generated and saved; after the replica server recovers from the fault, the collaboration server performs data repair on the replica server according to the change record of the data, so that the replica of the data on the replica server becomes consistent with the replicas of the data on the other replica servers. In this way, consistency among the replicas of the data after recovery from a temporary fault is guaranteed.
A data recovery method for temporary faults according to the present invention, applied to a distributed caching system, can quickly restore consistency among data replicas after recovery from a temporary fault. With reference to Fig. 1, the method mainly comprises the following steps:
Step 101: when the collaboration server initiates a data operation to the replica servers and finds that a replica server has failed, it generates a Key change record containing each data Key operated on;
Specifically, after receiving a Key-Value data write request or data update request initiated by a client, the collaboration server needs to initiate a data write operation or update operation to each replica server; upon finding that a replica server has failed, it generates the Key change record.
The collaboration server is a normally operating server node in the distributed caching system; it is used to receive the data operations initiated by clients and to initiate the corresponding data operations to each replica server.
A replica server is any server node other than the collaboration server, among all the server nodes of the distributed caching system, that stores a replica of the data currently being operated on.
In practical applications, the collaboration server can establish a save set for each replica server. During a replica server's fault, the collaboration server generates the Key change record containing each data Key operated on during the fault, that is, the Keys of the data written or updated during the fault, and saves it into the save set of that replica server. In this way, only the Keys of the data need to be kept in the change record, not the Values, so the cost is very small and resources are saved.
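A minimal sketch of this bookkeeping, assuming a hypothetical per-replica-server map of save sets; note that only Keys are recorded, never Values, which is what keeps the record small:

```python
from collections import defaultdict

class ChangeRecords:
    """Per-replica-server save sets holding only the Keys (never the
    Values) of the data operated on while that server was down."""
    def __init__(self):
        self.save_sets = defaultdict(set)

    def note_failed_operation(self, replica_id, key):
        # a write/update could not be delivered to `replica_id`:
        # remember the Key in that server's save set
        self.save_sets[replica_id].add(key)

    def change_record(self, replica_id):
        # the Key change record that drives repair after recovery
        return self.save_sets.pop(replica_id, set())

records = ChangeRecords()
records.note_failed_operation("node3", "user:42")
records.note_failed_operation("node3", "cart:7")
print(records.change_record("node3"))  # {'user:42', 'cart:7'}
```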
Step 102: after the replica server recovers from the fault, the collaboration server initiates a data repair operation to the replica server according to the Key change record;
Specifically, the collaboration server obtains all data replicas corresponding to each Key in the Key change record, identifies the most recently operated data replica of each Key in the Key change record, and uses the identified most recently operated data replicas to initiate the data repair operation to the replica server.
Here, the collaboration server compares the versions of the data replicas of the same Key among all the obtained data replicas, to obtain the most recently operated data replica of each Key.
Here, the collaboration server can read the data replica of each Key from every replica server corresponding to that Key and obtain the data replica of each Key from itself, thereby completing the acquisition of all data replicas corresponding to each Key.
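These steps might be combined as in the sketch below, which gathers every node's copy of a recorded Key and selects the most recently operated one by version comparison (the dict-based stores and the integer version ordering are illustrative assumptions):

```python
def newest_replica(key, stores):
    """Gather the copy of `key` held by every node (the collaboration
    server's own store plus each replica server's store) and keep the
    copy written by the last operation, judged by version comparison."""
    candidates = [store[key] for store in stores if key in store]
    # version comparison rule: the larger version number is more recent
    return max(candidates, key=lambda rec: rec["version"], default=None)

# toy stores: the same Key has diverged after a temporary fault
coord    = {"user:42": {"value": "v3", "version": 3}}
replica1 = {"user:42": {"value": "v3", "version": 3}}
replica2 = {"user:42": {"value": "v1", "version": 1}}  # node that was down
print(newest_replica("user:42", [coord, replica1, replica2]))
# -> {'value': 'v3', 'version': 3}
```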
Step 103: the replica server performs local data repair according to the data repair operation initiated by the collaboration server.
Specifically, the replica server updates its locally stored data replicas according to the most recently operated data replica of each Key in the Key change record.
Here, the replica server saves locally the data replicas used by the collaboration server when initiating the data repair operation, namely the Keys written or updated during the fault together with the corresponding Values and version number information, thereby completing the update of its local data replicas.
Here, after step 103, the method further comprises: the replica server returns a repair result to the collaboration server after updating the locally stored data replicas; when the repair result is failure, the collaboration server continues to initiate the data update operation to the replica server; when the repair result is success, the current data repair flow ends.
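The repair-and-retry behaviour described here might look like the following sketch, where `send_repair` stands in for the call that carries the newest replica to the recovered server and returns whether its local update succeeded (a real deployment would likely bound or pace the retries):

```python
def repair_until_success(key, newest, send_repair):
    """Initiate the data repair operation; while the replica reports a
    failure result, keep re-initiating the update until it succeeds."""
    while not send_repair(key, newest):
        pass  # repair result was failure: initiate the update again

# toy transport that fails twice (replica briefly busy), then succeeds
attempts = {"count": 0}
def flaky_send(key, record):
    attempts["count"] += 1
    return attempts["count"] >= 3

repair_until_success("user:42", {"value": "v3", "version": 3}, flaky_send)
print(attempts["count"])  # 3 -> succeeded on the third attempt
```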
Accordingly, the present invention also provides a distributed caching system for data repair upon temporary faults, the system comprising: a collaboration server and one or more replica servers, wherein the collaboration server is used to, when initiating a data operation to the one or more replica servers and finding that a replica server has failed, generate a Key change record containing each data Key operated on; and, after the replica server recovers from the fault, to initiate a data repair operation to the replica server according to the Key change record; and the one or more replica servers are used to, after fault recovery, perform local data repair according to the data repair operation initiated by the collaboration server.
Wherein, the collaboration server is also used to establish a save set for each replica server; and, during the fault of each replica server, to generate the Key change record containing each data Key operated on during the fault and save it into the save set of that replica server.
Specifically, the collaboration server is also used to obtain all data replicas corresponding to each Key in the Key change record, identify the most recently operated data replica of each Key in the Key change record, and initiate the data repair operation to the replica server using the identified most recently operated data replicas.
Wherein, the replica server is also used to update the locally stored data replicas according to the most recently operated data replicas with which the collaboration server initiates the data repair operation.
Wherein, the replica server can also be used to return a repair result to the collaboration server after updating the locally stored data replicas; the collaboration server can also be used to, when the repair result fed back by the replica server is failure, continue to initiate the data update operation to the replica server and perform the data repair again, until the repair result is success.
Embodiment one
In this embodiment, the distributed caching system composed of server nodes and clients is shown in Fig. 2. The distributed caching system comprises three server nodes (a first server node, a second server node, and a third server node) and two clients (a first client and a second client), where each client is connected to each server node, and the server nodes are connected to one another.
After a client initiates a data update operation, the concrete implementation process of data repair for a temporary fault during the data update is shown in Fig. 3; the concrete steps are as follows:
Step 301: the first client initiates a data update operation, selects a server node as the collaboration server according to the Key of the data, and sends a data update request for a Key-Value to the collaboration server;
Specifically, for the Key of a particular data item, the server cluster of the distributed caching system can be regarded, according to certain priorities, as a cluster of one collaboration server and several replica servers; different Keys may have different collaboration servers and replica servers. In addition, the choice of the collaboration server also needs to take the network conditions into account, where the network conditions include whether the operating state of each server node is normal, and so on.
In this embodiment, according to the Key of the data to be updated and the current network conditions, the first server node is selected as the collaboration server.
Step 302: the collaboration server receives the data update request, stores the Key and Value of the data sent by the first client along with the data update request, and updates its local data.
Here, if the collaboration server fails to update its local data, it returns an update-failure response to the first client; the flow may return to step 301 to be performed again, or the current flow may end.
Step 303: the collaboration server identifies the replica servers corresponding to the Key of the data according to a certain rule, and initiates a data update operation to each identified replica server;
Here, the collaboration server can identify the replica servers according to a consistent-hashing rule or a field-based hashing rule.
For example, the hash value corresponding to the Key of the data can be obtained through a hash algorithm, and the other server nodes that store the data replicas corresponding to the Key can be found from the resulting hash value; the server nodes found in this way are the replica servers corresponding to the Key of the data.
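As one concrete reading of such a rule, the sketch below hashes the Key onto a ring of server nodes and takes the next N nodes clockwise as the replica servers; the ring layout, the MD5 hash, and the replica count of two are illustrative assumptions, not a scheme mandated by the patent:

```python
import hashlib
from bisect import bisect

def key_hash(name):
    # stable hash of a Key or node name onto the ring
    return int(hashlib.md5(name.encode()).hexdigest(), 16)

class HashRing:
    def __init__(self, nodes):
        # place each node on the ring at the hash of its name
        self.ring = sorted((key_hash(n), n) for n in nodes)

    def replicas_for(self, key, n=2):
        """Walk clockwise from the Key's position and return the next
        `n` distinct nodes: the replica servers for this Key."""
        start = bisect(self.ring, (key_hash(key),))
        picked = []
        for i in range(len(self.ring)):
            node = self.ring[(start + i) % len(self.ring)][1]
            if node not in picked:
                picked.append(node)
            if len(picked) == n:
                break
        return picked

ring = HashRing(["node1", "node2", "node3"])
print(ring.replicas_for("user:42"))  # e.g. ['node2', 'node3']
```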
In this embodiment, the collaboration server identifies the second server node and the third server node as the replica servers corresponding to the Key, sends data update requests to the second and third server nodes, and initiates the data update operation.
Step 304: after initiating the data update operation, the collaboration server finds that a server node among the replica servers corresponding to the Key has failed, generates a change record of the Key, and stores it temporarily in its local storage;
Specifically, if a server node has failed, it can neither receive nor send information. When the collaboration server, while initiating the data update operation to each replica server, finds that it cannot initiate the operation to a replica server, i.e., that the data update request cannot be sent to that replica server, it considers that replica server to have failed.
In this embodiment, the collaboration server finds that the third server node, acting as a replica server, has failed; at this point, it generates the change record of the Key and stores it temporarily in its local storage.
Here, the change record of the Key contains all the Keys on which the current update operation has been performed.
Step 305: the collaboration server receives the responses returned by each normally operating replica server, and returns to the first client an update operation result that includes the responses returned by the replica servers and the collaboration server's own local update result;
Here, after each normally operating replica server receives the data update request initiated by the collaboration server, it stores the Key and Value of the data in the data update request and updates its local data; if the update succeeds, it returns an update-success response to the collaboration server, and if the update fails, it returns an update-failure response to the collaboration server.
In practical applications, an update-failure result can occur in situations such as insufficient storage capacity.
If all replica servers return update-failure responses, the collaboration server considers the current update operation to have failed; the flow may return to step 303 or step 301 to be performed again, or the current flow may end. Otherwise, the collaboration server considers the current update operation successful, and the flow can continue.
Here, if the collaboration server succeeds in updating its local data, it returns a local update result indicating success to the first client; if the collaboration server fails to update its local data, it returns a local update result indicating failure to the first client.
The local update result is the result of the data update performed by the collaboration server itself.
Step 306: the replica server recovers from the fault and begins to provide service externally;
Step 307: the collaboration server finds that the replica server has recovered, and loads the change record generated in step 304, preparing to perform data repair;
In practical applications, after recovering from the fault, the replica server re-establishes its connection with the collaboration server, and after connecting it can notify each server node in the distributed caching system (including the collaboration server) that it has begun providing service externally. In this way, upon receiving the notification from the recovered replica server, the collaboration server knows that the replica server has recovered.
Step 308: according to the change record generated in step 304, the collaboration server reads, from its local storage and from all replica servers, the Key of the updated data together with the Value and the corresponding version number information, obtaining multiple replicas of the data;
Specifically, the collaboration server initiates a data read operation to each replica server (including the replica server recovered from the fault) and performs a local data read; each replica server returns a read result containing its data replica to the collaboration server, which thereby obtains the replicas of the data kept on every server node (including the collaboration server and all replica servers).
Step 309: the collaboration server compares the versions of the replicas obtained in step 308, and identifies the most recently updated replica;
Specifically, the collaboration server compares the version number information of the replicas of the data and identifies the most recently updated replica.
Step 310: the collaboration server performs data repair on the replica server that recovered from the temporary fault in step 306, using the replica of the most recent update operation determined in step 309;
Specifically, the collaboration server uses the replica of the most recent update operation determined in step 309 to initiate data repair to the replica server recovered from the temporary fault (the third server node in this embodiment).
In practical applications, the collaboration server sends a data repair request to the replica server recovered from the temporary fault, and the data repair request contains the replica of the most recent update operation on the data.
Step 311: the replica server recovered from the temporary fault accepts the data repair, performs a local data update, and returns a repair result to the collaboration server; if the repair succeeds, the current flow ends; if the repair fails, the flow returns to step 307 and the data repair is repeated until it succeeds.
Specifically, the replica server recovered from the temporary fault receives the data repair request sent by the collaboration server, extracts from the data repair request the replica of the most recent update operation on the data, and saves the Key and Value of the data in that replica, completing the local data update.
Here, if the replica server recovered from the temporary fault succeeds in updating its local data, the repair succeeds: it returns a repair result indicating success to the collaboration server, and the current flow ends. If the replica server recovered from the temporary fault fails to update its local data, the repair fails: it returns a repair result indicating failure to the collaboration server, and the flow returns to step 307 to repeat the data repair until the data repair succeeds. In this way, after a client initiates a data update operation, a server node that suffered a temporary fault can have its data repaired in time after recovery, guaranteeing the consistency of the replicas of the data.
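On the recovered replica server's side, handling the repair request of step 311 reduces to extracting the carried replica and overwriting the local copy, along the lines of this sketch (method names and the toy capacity check are illustrative assumptions):

```python
class RecoveredReplica:
    """Replica-server side of step 311: apply the repair request by
    saving the carried newest replica over the local copy."""
    def __init__(self, capacity=100):
        self.store = {}           # local Key -> {"value", "version"}
        self.capacity = capacity  # toy stand-in for storage limits

    def handle_repair(self, key, record):
        # e.g. insufficient capacity would make the local update fail
        if key not in self.store and len(self.store) >= self.capacity:
            return False          # repair result: failure
        # save the Key, Value and version info carried by the request,
        # completing the local data update
        self.store[key] = record
        return True               # repair result: success

replica = RecoveredReplica()
ok = replica.handle_repair("user:42", {"value": "v3", "version": 3})
print(ok)  # True -> the collaboration server ends the repair flow
```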
In practical applications, the replica server recovered from the temporary fault failing again during the repair process, a network failure, or the server being busy and failing to respond for a long time can all cause the repair to fail.
The above are merely preferred embodiments of the present invention and are not intended to limit the protection scope of the present invention.

Claims (13)

1. A data recovery method for temporary faults, characterized in that the method comprises:
when a collaboration server initiates a data operation to replica servers and finds that a replica server has failed, generating a Key change record containing the key (Key) of each data item operated on;
after the replica server recovers from the fault, the collaboration server initiating a data repair operation to the replica server according to the Key change record;
the replica server performing local data repair according to the data repair operation initiated by the collaboration server.
2. The data recovery method for temporary faults according to claim 1, characterized in that initiating a data operation to a replica server comprises: initiating a data write operation or an update operation to the replica server.
3. The data recovery method for temporary faults according to claim 1 or 2, characterized in that generating the Key change record containing each data Key operated on during the fault further comprises:
the collaboration server establishing a save set for the replica server;
during the fault of the replica server, the collaboration server generating the Key change record containing each data Key operated on during the fault, and saving it into the save set of the replica server.
4. The data recovery method for temporary faults according to claim 1 or 3, characterized in that the collaboration server initiating the data repair operation to the replica server according to the saved Key change record comprises:
the collaboration server obtaining all data replicas corresponding to each Key in the Key change record, and identifying the most recently operated data replica of each Key in the Key change record;
using the identified most recently operated data replicas to initiate the data repair operation to the replica server.
5. The data recovery method for temporary faults according to claim 4, characterized in that identifying the most recently operated data replica of each Key in the Key change record is:
comparing the versions of the data replicas of the same Key among all the obtained data replicas, to obtain the most recently operated data replica of each Key.
6. The data recovery method for temporary faults according to claim 4, characterized in that the collaboration server obtaining all data replicas corresponding to each Key in the Key change record is:
the collaboration server reading the data replica of each Key from every replica server corresponding to that Key, and obtaining the data replica of each Key from itself.
7. The data recovery method for temporary faults according to claim 4, characterized in that the replica server performing local data repair according to the data repair operation initiated by the collaboration server comprises:
the replica server updating its locally stored data replicas according to the most recently operated data replica of each Key in the Key change record.
8. The data recovery method for temporary faults according to claim 7, characterized in that, after the replica server performs local data repair according to the data repair operation initiated by the collaboration server, the method further comprises:
the replica server returning a repair result to the collaboration server after updating the locally stored data replicas;
when the repair result is failure, the collaboration server continuing to initiate the data update operation to the replica server.
9. A distributed caching system for data repair upon temporary faults, characterized in that the system comprises: a collaboration server and one or more replica servers, wherein
the collaboration server is configured to, when initiating a data operation to the one or more replica servers and finding that a replica server has failed, generate a Key change record containing each data Key operated on; and, after the replica server recovers from the fault, to initiate a data repair operation to the replica server according to the Key change record;
the one or more replica servers are configured to, after fault recovery, perform local data repair according to the data repair operation initiated by the collaboration server.
10. The distributed caching system according to claim 9, characterized in that
the collaboration server is further configured to establish a save set for each replica server; and, during the fault of each replica server, to generate the Key change record containing each data Key operated on during the fault and save it into the save set of that replica server.
11. The distributed caching system according to claim 9, characterized in that the collaboration server is further configured to obtain all data replicas corresponding to each Key in the Key change record, identify the most recently operated data replica of each Key in the Key change record, and initiate the data repair operation to the replica server using the identified most recently operated data replicas.
12. The distributed caching system according to claim 11, characterized in that the replica server is further configured to update the locally stored data replicas according to the most recently operated data replicas with which the collaboration server initiates the data repair operation.
13. The distributed caching system according to claim 12, characterized in that
the replica server is further configured to return a repair result to the collaboration server after updating the locally stored data replicas;
the collaboration server is further configured to, when the repair result fed back by the replica server is failure, continue to initiate the data update operation to the replica server.
CN201110157693.1A 2011-06-13 2011-06-13 Data recovery method and distributed cache system for temporary faults Active CN102833273B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201110157693.1A CN102833273B (en) 2011-06-13 2011-06-13 Data recovery method and distributed cache system for temporary faults
PCT/CN2012/070849 WO2012171345A1 (en) 2011-06-13 2012-02-02 Method and distributed cache system for data recovery in temporary fault

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110157693.1A CN102833273B (en) 2011-06-13 2011-06-13 Data recovery method and distributed cache system for temporary faults

Publications (2)

Publication Number Publication Date
CN102833273A true CN102833273A (en) 2012-12-19
CN102833273B CN102833273B (en) 2017-11-03

Family

ID=47336243

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110157693.1A Active CN102833273B (en) 2011-06-13 2011-06-13 Data recovery method and distributed cache system for temporary faults

Country Status (2)

Country Link
CN (1) CN102833273B (en)
WO (1) WO2012171345A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104238963A (en) * 2014-09-30 2014-12-24 华为技术有限公司 Data storage method, device and system
CN104778179A (en) * 2014-01-14 2015-07-15 阿里巴巴集团控股有限公司 Data migration test method and system
CN105589887A (en) * 2014-10-24 2016-05-18 中兴通讯股份有限公司 Data processing method for distributed file system and distributed file system
WO2016206568A1 (en) * 2015-06-26 2016-12-29 阿里巴巴集团控股有限公司 Data update method, device, and related system
CN107153671A (en) * 2016-03-02 2017-09-12 阿里巴巴集团控股有限公司 A kind of method and apparatus for realizing the read-write of multifile copy in a distributed system
CN108055159A (en) * 2017-12-21 2018-05-18 郑州云海信息技术有限公司 A kind of clustered node operation synchronous method and device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105338026B (en) * 2014-07-24 2018-10-09 阿里巴巴集团控股有限公司 The acquisition methods of data resource, device and system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101567805B (en) * 2009-05-22 2011-12-28 清华大学 Method for recovering failed parallel file system
CN101697168B (en) * 2009-10-22 2011-10-19 中国科学技术大学 Method and system for dynamically managing metadata of distributed file system
CN101964820B (en) * 2010-10-08 2014-04-09 中兴通讯股份有限公司 Method and system for keeping data consistency
CN102024016B (en) * 2010-11-04 2013-03-13 曙光信息产业股份有限公司 Rapid data restoration method for distributed file system (DFS)

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Wu Hua et al., "Research on the Recovery Mechanism in Distributed File Systems", Microcomputer Information *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104778179A (en) * 2014-01-14 2015-07-15 阿里巴巴集团控股有限公司 Data migration test method and system
CN104778179B (en) * 2014-01-14 2019-05-28 阿里巴巴集团控股有限公司 A kind of Data Migration test method and system
CN104238963A (en) * 2014-09-30 2014-12-24 华为技术有限公司 Data storage method, device and system
CN104238963B (en) * 2014-09-30 2017-08-11 华为技术有限公司 A kind of date storage method, storage device and storage system
CN105589887A (en) * 2014-10-24 2016-05-18 中兴通讯股份有限公司 Data processing method for distributed file system and distributed file system
WO2016206568A1 (en) * 2015-06-26 2016-12-29 阿里巴巴集团控股有限公司 Data update method, device, and related system
CN107153671A (en) * 2016-03-02 2017-09-12 阿里巴巴集团控股有限公司 A kind of method and apparatus for realizing the read-write of multifile copy in a distributed system
CN107153671B (en) * 2016-03-02 2020-11-24 阿里巴巴集团控股有限公司 Method and equipment for realizing multi-file copy reading and writing in distributed system
CN108055159A (en) * 2017-12-21 2018-05-18 郑州云海信息技术有限公司 A kind of clustered node operation synchronous method and device

Also Published As

Publication number Publication date
CN102833273B (en) 2017-11-03
WO2012171345A1 (en) 2012-12-20


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant