CN106559263A - An improved distributed consensus algorithm

Info

Publication number: CN106559263A
Application number: CN201611010538.6A
Authority: CN
Inventors: 熊中哲
Original assignee: Hangzhou Wo Qu Polytron Technologies Inc
Current assignee: Hangzhou Wo Qu Polytron Technologies Inc
Priority date: 2016-11-17
Filing date: 2016-11-17
Publication date: 2017-04-05

Abstract

The present invention relates to the computer field and discloses an improved distributed consensus algorithm comprising the following steps: (1) initialization of the distributed cluster; (2) fault recovery of the distributed cluster; (3) data synchronization of the distributed cluster. By improving the distributed consensus algorithm, the invention allows the selection of the cluster's master node to be adjusted according to configured rules, which makes the selection more flexible and efficient. When the master node crashes, the distributed cluster can switch quickly to the most suitable new node and resume responding, which removes the instability of the distributed system and the possibility of long periods of unresponsiveness. The rules can also be adjusted at any time to reflect the condition of specific nodes, so the distributed cluster can be adjusted and migrated more flexibly.

Description

An improved distributed consensus algorithm

Technical field

The present invention relates to the field of computer technology, and in particular to an improved distributed consensus algorithm.

Background art

A distributed system comprises many physically separate servers that communicate over network connections and together form one logical cluster system. It is therefore inevitable that some servers will crash or enter an abnormal state for various unexpected reasons, so that they can no longer be guaranteed to agree with the state of the other servers. In that situation the cluster must still provide stable service to the outside, and a server that returns to normal must be able to rejoin the cluster and become consistent with the cluster state.

A distributed consensus algorithm is an algorithm that solves exactly this kind of problem. Raft is a widely used distributed consensus algorithm, and its main principle is as follows:

To keep its state consistent under normal conditions, a distributed cluster system first selects one of its server nodes (hereinafter: nodes) as the master node (Leader), while the others act as follower nodes (Follower). The master node continuously sends periodic messages to the follower nodes. Service requests to the distributed cluster all go through the master node, which forwards each request to the follower nodes. Once more than half of the nodes in the cluster have acknowledged receipt of the request and applied it, the master node immediately returns a completion message to notify the requester that the request is done. For the minority of abnormal nodes that did not receive the request or did not reply, the master node keeps retransmitting the unacknowledged requests until those nodes recover and catch up with the master node. This guarantees that the whole cluster can work normally as long as more than half of its nodes are healthy.
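The "more than half" rule described above can be illustrated with a minimal Python sketch. The function names `majority` and `is_committed` are hypothetical, and the sketch assumes that the master node counts itself as one of the acknowledging nodes; it is not taken from any Raft implementation.

```python
def majority(cluster_size: int) -> int:
    """Smallest node count that is more than half of the cluster."""
    return cluster_size // 2 + 1


def is_committed(follower_acks: int, cluster_size: int) -> bool:
    """Assumption: the master node counts itself plus every acknowledging follower."""
    return 1 + follower_acks >= majority(cluster_size)


if __name__ == "__main__":
    # In a 5-node cluster the master needs 2 follower acknowledgements.
    print(majority(5), is_committed(2, 5), is_committed(1, 5))
```

With 5 nodes, two follower acknowledgements are enough for the master node to report completion, which is why the cluster keeps working while a minority of nodes is abnormal.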

When the master node crashes unexpectedly or the network fails, the cluster loses its master node and enters the "election" state. Every follower node that stops receiving the master node's periodic messages becomes a candidate (Candidate) after a specified timeout and starts an election on its own behalf. Each candidate has one vote. After starting the election it first votes for itself and then requests votes from the other candidates. A candidate that obtains votes from more than half of the total number of nodes immediately becomes the new master node and sends periodic messages to the other candidates, and the distributed cluster enters its next stable period. A candidate that does not obtain enough votes sleeps for a period (for example 150-300 milliseconds) and then starts the next round of voting. If, while sleeping, it receives a vote request from another candidate whose data volume is larger than its own, it gives its vote to that candidate and sleeps again for a period (for example 150-300 milliseconds). If a new master node is elected during this sleep, the candidate becomes a follower node; if not, it starts the next round of voting.

After the new master node is elected, it redistributes all committed state of the cluster to every node in the cluster, so that the distributed cluster becomes available again.
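A minimal sketch of this redistribution, in Python, is given below. It assumes that the committed state is an ordered list of entries and that a lagging node's list is a prefix of the master node's list. Neither the function name `catch_up` nor this data model comes from the source.

```python
def catch_up(master_log: list, follower_log: list) -> int:
    """Append the entries the follower is missing and return how many were sent.

    Assumption: follower_log is a prefix of master_log.
    """
    missing = master_log[len(follower_log):]
    follower_log.extend(missing)
    return len(missing)


if __name__ == "__main__":
    master = ["set x=1", "set y=2", "set x=3"]
    follower = ["set x=1"]
    sent = catch_up(master, follower)
    print(sent, follower == master)
```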

While a new master node is being elected after the master node crashes, the whole distributed cluster is in a special state and cannot respond to requests. All follower nodes are equal: each sleeps for a random time (150-300 milliseconds) and then they canvass one another for votes. If no node obtains more than half of the votes in a round, every node sleeps for another random time and the next round of voting takes place, until a new master node is elected. The following situation can therefore arise: if the nodes' sleep times are very close, every node votes for itself as soon as its sleep ends, so no master node can be elected; the nodes sleep again, enter the next round, and again vote for themselves immediately on waking. This cycle can repeat for many rounds before a new master node is elected and the cluster recovers. As a result, the performance of the distributed cluster may be very unstable, and in extreme cases the cluster may be unavailable for a long time.
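The split-vote situation can be reproduced with a deliberately simplified Python model. It assumes a fixed message latency, and it assumes that a node votes for itself if it wakes before any vote request has reached it and otherwise votes for the earliest waker. The function `election_round` and the latency value are illustrative assumptions, not part of Raft or of the invention.

```python
def election_round(wake_times: dict, latency: float):
    """Return the node that wins one round, or None if the vote is split."""
    votes = {node: 0 for node in wake_times}
    for node, wake in wake_times.items():
        others = {n: t for n, t in wake_times.items() if n != node}
        earliest_other = min(others, key=others.get)
        if wake <= others[earliest_other] + latency:
            votes[node] += 1            # woke before any request arrived: votes for itself
        else:
            votes[earliest_other] += 1  # a request was already waiting: votes for its sender
    needed = len(wake_times) // 2 + 1
    winners = [node for node, count in votes.items() if count >= needed]
    return winners[0] if winners else None


if __name__ == "__main__":
    # Sleep times within the message latency of each other: nobody wins.
    print(election_round({"a": 150, "b": 152, "c": 151}, latency=10))
    # Well separated sleep times: the earliest waker collects every vote.
    print(election_round({"a": 150, "b": 220, "c": 290}, latency=10))
```

In this model the round fails whenever all wake times fall within one message latency of each other, which is the case the paragraph above describes.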

Summary of the invention

To address the inefficiency, instability and inflexibility of distributed cluster systems in the prior art, the present invention provides an improved distributed consensus algorithm.

To solve the above technical problem, the present invention adopts the following technical solution.

An improved distributed consensus algorithm comprises the following steps:

(1) Initialization of the distributed cluster: this includes starting the servers, loading the code libraries of the modules, and executing the initialization program of the distributed consensus algorithm. In addition, the election weight of each node is set in a configuration file, from high to low according to how highly the node's server is configured;

(2) Fault recovery of the distributed cluster: when a network disconnection or a crash causes the master node to leave the cluster, or when a cluster split-brain prevents the master node from obtaining responses from more than half of the nodes, the cluster enters the fault recovery stage. A candidate is re-elected from the whole cluster to become the master node. The election is conducted according to the election weights set for the node servers in step (1), and the node with the highest weight is selected as the new master node;

(3) Data synchronization of the distributed cluster: after an abnormal node in the cluster returns to normal, it synchronizes data from the master node until it is consistent with the data of the other nodes in the cluster.

Preferably, in step (1), the node server configuration includes memory, CPU and access bandwidth.
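Steps (1) and (2) can be sketched in Python as follows. The JSON layout, the key `election_weight`, the node names and the function names are all hypothetical, since the source does not specify a configuration file format. The check that the surviving nodes form a majority is an assumption carried over from the Raft background, not a stated part of step (2).

```python
import json

# Hypothetical configuration file contents: one election weight per node.
CONFIG_TEXT = """
{
  "node-a": {"election_weight": 5000},
  "node-b": {"election_weight": 4800},
  "node-c": {"election_weight": 4600}
}
"""


def load_weights(text: str) -> dict:
    """Step (1): read each node's election weight from the configuration."""
    return {name: entry["election_weight"] for name, entry in json.loads(text).items()}


def pick_new_master(weights: dict, reachable: list):
    """Step (2): choose the highest-weight node among those still reachable.

    Assumption: no master is chosen unless more than half of the nodes are reachable.
    """
    candidates = [node for node in reachable if node in weights]
    if len(candidates) <= len(weights) // 2:
        return None
    return max(candidates, key=weights.__getitem__)


if __name__ == "__main__":
    weights = load_weights(CONFIG_TEXT)
    # node-a (the old master) has crashed; node-b has the highest remaining weight.
    print(pick_new_master(weights, ["node-b", "node-c"]))
```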

By adopting the above technical solution, the present invention achieves a significant technical effect. By improving the distributed consensus algorithm, the invention allows the selection of the cluster's master node to be adjusted according to configured rules, which makes the selection more flexible and efficient. When the master node crashes, the distributed cluster can switch quickly to the most suitable new node and resume responding, which removes the instability of the distributed system and the possibility of long periods of unresponsiveness. The rules can also be adjusted at any time to reflect the condition of specific nodes, so the distributed cluster can be adjusted and migrated more flexibly.

Description of the drawings

Fig. 1 is a schematic diagram of the election process of the improved distributed consensus algorithm of the present invention;

Fig. 2 is a schematic flow diagram of the improved distributed consensus algorithm of the present invention.

Detailed description of the embodiments

The present invention is described in further detail below with reference to the accompanying drawings and embodiments.

As shown in Fig. 1 and Fig. 2, an improved distributed consensus algorithm comprises the following steps:

(1) Initialization of the distributed cluster: this includes starting the servers, loading the code libraries of the modules, and executing the initialization program of the distributed consensus algorithm. In addition, the election weight of each node is set in a configuration file, from high to low according to how highly the node's server is configured. The node server configuration includes memory, CPU and access bandwidth. Once initialization of the whole cluster is complete, the cluster enters its first master node election. Because different weights were set beforehand, each candidate sleeps for a different time according to its weight, so a master node is generally elected within a single election cycle. The other nodes then become follower nodes, and the distributed cluster enters its normal service state;

(2) Fault recovery of the distributed cluster: when a network disconnection or a crash causes the master node to leave the cluster, or when a cluster split-brain prevents the master node from obtaining responses from more than half of the nodes, the cluster enters the fault recovery stage. A candidate is re-elected from the whole cluster to become the master node. The election is conducted according to the election weights set for the node servers in step (1), and the node with the highest weight is selected as the new master node. The election process is the same as before: generally a single round of election selects the best of the remaining nodes as the new master node, after which the cluster resumes normal operation. The impact of this process on external services is therefore minimal;

(3) Data synchronization of the distributed cluster: after an abnormal node in the cluster returns to normal, it synchronizes data from the master node until it is consistent with the data of the other nodes in the cluster. Because the master node is always the best node, the recovery process is also faster.
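The steps above state that each candidate sleeps for a different time according to its weight, but they do not give the mapping. The Python sketch below assumes one possible mapping: a linear scale over the 150-300 millisecond range mentioned in the background, with the highest weight sleeping the shortest time. The functions `sleep_ms` and `first_to_wake` and their default values are hypothetical.

```python
def sleep_ms(weight: float, min_weight: float, max_weight: float,
             shortest: float = 150.0, longest: float = 300.0) -> float:
    """Map an election weight to a sleep time; higher weight means shorter sleep.

    Assumption: a linear mapping, which the source does not specify.
    """
    if max_weight == min_weight:
        return shortest
    fraction = (max_weight - weight) / (max_weight - min_weight)
    return shortest + fraction * (longest - shortest)


def first_to_wake(weights: dict) -> str:
    """Return the node whose sleep ends first, i.e. the first to request votes."""
    low, high = min(weights.values()), max(weights.values())
    return min(weights, key=lambda node: sleep_ms(weights[node], low, high))


if __name__ == "__main__":
    weights = {"node-a": 2600, "node-b": 5000, "node-c": 3800}
    for node, weight in weights.items():
        print(node, sleep_ms(weight, 2600, 5000))
    print(first_to_wake(weights))
```

Under this mapping the sleep times are spread out deterministically instead of randomly, so the highest-weight candidate wakes and requests votes before the others, which is what allows a master node to be elected within one election cycle.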

By improving the distributed consensus algorithm, the present invention allows the selection of the cluster's master node to be adjusted according to configured rules, which makes the selection more flexible and efficient. When the master node crashes, the distributed cluster can switch quickly to the most suitable new node and resume responding, which removes the instability of the distributed system and the possibility of long periods of unresponsiveness. The rules can also be adjusted at any time to reflect the condition of specific nodes, so the distributed cluster can be adjusted and migrated more flexibly.

Embodiment 1

(1) Initialization of the distributed cluster: this includes starting the servers, loading the code libraries of the necessary modules, and executing the initialization program of the distributed consensus algorithm. In addition, the election weight of each node is set in a configuration file, from high to low according to how highly the node's server is configured. For a distributed cluster with 3 higher-configured servers and 10 lower-configured servers, the weights decrease in steps of 200 from 5000 in order of configuration, so the lowest-configured server has a weight of 2600;

(2) Fault recovery of the distributed cluster: when a network disconnection, a crash or another cause makes the master node leave the cluster, or when a cluster split-brain prevents the master node from obtaining responses from more than half of the nodes, the cluster enters the fault recovery stage. A candidate is re-elected from the whole cluster to become the master node. The election is conducted according to the election weights set for the node servers in step (1), and the node with the highest weight is selected as the new master node.
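The weight figures in step (1) of this embodiment can be checked with a short Python sketch: 13 servers ranked by configuration, starting at 5000 and decreasing by 200 per rank, end at 5000 - 12 x 200 = 2600. The server names and the function `assign_weights` are hypothetical.

```python
def assign_weights(servers_by_config: list, top: int = 5000, step: int = 200) -> dict:
    """Give the best-configured server `top` and subtract `step` per rank below it."""
    return {name: top - step * rank for rank, name in enumerate(servers_by_config)}


if __name__ == "__main__":
    # 3 higher-configured servers followed by 10 lower-configured ones.
    servers = [f"high-{i}" for i in range(1, 4)] + [f"low-{i}" for i in range(1, 11)]
    weights = assign_weights(servers)
    print(weights["high-1"], weights["low-10"])
```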

In summary, the foregoing describes only preferred embodiments of the present invention. All equivalent changes and modifications made within the scope of the claims of the present invention shall fall within the scope covered by this patent.

Claims

1. An improved distributed consensus algorithm, characterized by comprising the following steps:

(1) Initialization of the distributed cluster: this includes starting the servers, loading the code libraries of the modules, and executing the initialization program of the distributed consensus algorithm; in addition, the election weight of each node is set in a configuration file, from high to low according to how highly the node's server is configured;

(2) Fault recovery of the distributed cluster: when a network disconnection or a crash causes the master node to leave the cluster, or when a cluster split-brain prevents the master node from obtaining responses from more than half of the nodes, the cluster enters the fault recovery stage; a candidate is re-elected from the whole cluster to become the master node; the election is conducted according to the election weights set for the node servers in step (1), and the node with the highest weight is selected as the new master node;

(3) Data synchronization of the distributed cluster: after an abnormal node in the cluster returns to normal, it synchronizes data from the master node until it is consistent with the data of the other nodes in the cluster.

2. The improved distributed consensus algorithm according to claim 1, characterized in that, in step (1), the node server configuration includes memory, CPU and access bandwidth.