CN107153680B

CN107153680B - Method and system for on-line node expansion of distributed memory database

Info

Publication number: CN107153680B
Application number: CN201710253625.2A
Authority: CN
Inventors: 王金山
Original assignee: Beijing Si Tech Information Technology Co Ltd
Current assignee: Beijing Si Tech Information Technology Co Ltd
Priority date: 2017-04-18
Filing date: 2017-04-18
Publication date: 2021-02-02
Anticipated expiration: 2037-04-18
Also published as: CN107153680A

Abstract

The invention relates to a method and a system for on-line node expansion of a distributed memory database, wherein the method comprises the following steps: newly adding nodes to generate a new routing rule; migrating the related data from the old node to the corresponding new node; migrating the relevant REDO log from the old node to the corresponding new node; sending a switching instruction to an application client, and after the REDO log is migrated, the application client accesses a distributed memory database under a new routing rule; the system comprises a routing rule generation module, a data migration module, an REDO log migration module and a switching access module. The invention expands the nodes on line under the condition of not stopping external service, so that the application system can smoothly transit and automatically route to a new node.

Description

Method and system for on-line node expansion of distributed memory database

Technical Field

The invention relates to the field of distributed memory databases, in particular to a method and a system for expanding nodes of a distributed memory database on line.

Background

The distributed memory database has wide application in systems such as telecommunication charging, real-time online transaction and the like by virtue of the ultrahigh memory access speed, and the memory data are uniformly distributed on each single-machine memory database node in a slicing mode to provide uniform data service for the outside.

As traffic changes, the distributed in-memory database may require dynamically adding nodes to meet the increasing demand for data volume.

In the conventional technology, the distributed memory database service is generally temporarily stopped, the routing rule is recalculated for the original data of each node according to a new HASH algorithm, and the routing rule is uniformly re-distributed on a new node, and then the distributed memory database service is restarted. Due to the fact that redistribution of mass data is involved, time consumption is long, and the distributed memory database cannot provide services for the outside for a long time.

The system using the memory database service is generally an online real-time system, has high requirements on real-time performance and transaction consistency, and cannot accept long-time interruption of the memory database service.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: when the distributed memory database dynamically expands nodes, the redistribution of mass data is involved, the distributed memory database service is temporarily stopped, the time consumption is long, and the distributed memory database service is interrupted for a long time.

The technical scheme for solving the technical problems is as follows: a method for on-line node expansion of a distributed memory database comprises the following steps:

step 1: newly adding nodes, and generating a new routing rule according to the newly added nodes to re-plan the routing corresponding relation;

step 2: reading data of the old node, and migrating the data belonging to the newly added node under the new routing rule from the old node to the corresponding newly added node;

and step 3: reading the REDO log of the old node, and migrating the REDO log belonging to the newly added node under the new routing rule from the old node to the corresponding newly added node;

and 4, step 4: and sending a switching instruction to the application client, and after the REDO log is migrated, the application client accesses the distributed memory database under the new routing rule.

The invention has the beneficial effects that: the invention adopts the routing comparison table rule to redistribute the data, so that the transferred data volume is reduced to the minimum degree; the invention adopts a routing comparison table mode, and only migrates partial data from the old node to the new node, so that the data can be uniformly distributed again, the data migration scale is reduced, and the migration speed is accelerated.

On the basis of the technical scheme, the invention can be further improved as follows.

Further, the step 1 comprises the following steps:

step 1.1: newly adding nodes;

step 1.2: creating a new routing rule comparison table;

step 1.3: and re-planning the routing corresponding relation according to the newly added node.

Further, the step 2 comprises the following steps:

step 2.1: recording a data migration starting time point T;

step 2.2: adding the newly added nodes into a distributed memory database cluster;

step 2.3: and reading all the data of the old nodes, and migrating the data of all the old nodes belonging to the newly added nodes under the new routing rule from the old nodes to the corresponding newly added nodes.

Further, the step 3 comprises the following steps:

step 3.1: reading the REDO log of the old node from the time point T;

step 3.2: and all the old nodes simultaneously migrate the REDO logs belonging to the newly added nodes under the new routing rule from the old nodes to the corresponding newly added nodes.

Further, the step 4 comprises the following steps:

step 4.1: judging whether backlog exists in the REDO log to be migrated, if yes, continuing to execute the step 3, and if not, executing the step 4.2;

step 4.2: setting the distributed memory database to be in an intermediate state, and hiding an REDO log which is not in accordance with the new routing rule on an old node by the distributed memory database service;

step 4.3: and sending a switching instruction to the application client, and after the REDO log is migrated, the application client accesses the distributed memory database under the new routing rule.

The beneficial effect of adopting the further technical scheme is that: after the data migration is finished, because redundant data also exists on the original old node, the invention hides the records on the node which are not in accordance with the routing rule according to the new routing rule, and can ensure the accuracy of the externally provided data.

Further, the method also comprises the step 5: and deleting the redundant data of the old node.

Further, the method further comprises the step 6: and changing the distributed memory database from the intermediate state to the normal state.

Further, the redundant data is data that does not comply with the new routing rule.

The other technical scheme provided by the invention is as follows: a distributed memory database online extension node system comprises a routing rule generation module, a data migration module, an REDO log migration module and a switching access module;

the routing rule generating module is used for adding nodes, replanning routing corresponding relation according to the added nodes and generating a new routing rule;

the data migration module is used for reading the data of the old node and migrating the data belonging to the newly added node under the new routing rule from the old node to the corresponding newly added node;

the REDO log migration module is used for reading the REDO log of the old node and migrating the REDO log belonging to the newly added node under the new routing rule from the old node to the corresponding newly added node;

and the switching access module is used for sending a switching instruction to the application client, and the application client accesses the distributed memory database under the new routing rule after the REDO log migration is completed.

Further, the system also comprises a deleting module and a state switching module;

the deleting module is used for deleting the redundant data of the old node;

the state switching module is used for changing the distributed memory database from the intermediate state to the normal state.

Drawings

Fig. 1 is a schematic flow chart of a method for online node expansion of a distributed memory database according to the present invention;

Detailed Description

The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.

As shown in fig. 1, a method for online node expansion of a distributed memory database according to an embodiment of the present invention includes the following steps:

Wherein, step 1 includes the following steps:

step 1.1: newly adding nodes;

step 1.2: creating a new routing rule comparison table: the _T _ HASH _ RULES _ NEW table replans the routing corresponding relation according to the newly added nodes, and the principle is as uniform as possible and the migration quantity is minimum.

Wherein, step 2 includes the following steps:

step 2.1: recording a data migration starting time point T;

Wherein, step 3 comprises the following steps:

step 3.1: reading the REDO log of the old node from the time point T;

Wherein, step 4 comprises the following steps:

step 4.2: updating a routing rule comparison table, and setting the distributed memory database to be in an intermediate state, wherein the intermediate state is that the distributed memory database service hides an REDO log which is inconsistent with the new routing rule on an old node;

step 4.3: and sending a switching instruction to the application client to prompt the application to reload the routing rule, wherein at the moment, if the application accesses the old node, the application is still normal, if the application accesses the new node, the application client only waits, and after the backlog migration of the application client is completed, the application client accesses the distributed memory database under the new routing rule.

In the process, data related to migration and REDO log records are all recorded on old and new nodes, so that when a multi-node concurrent query is carried out, the query result is incorrect (the data is redundant).

And deleting these records takes a long time, and the application cannot wait for a long time. Therefore, it is necessary to hide these redundant data in advance, and the method is as follows:

a new table is created-T ALTER NODE into which the newly added NODE is inserted before updating the routing rule lookup table.

When the distributed memory database provides data service to the outside, whether the table has data is checked, and if not, the distributed memory database is executed according to normal logic. If the data exist, checking whether the record to be checked accords with the new rule, and if not, hiding.

Then, the redundant data of the old node is deleted one by one. After the deletion is completed, the _ T _ ALTER _ NODE table is cleared.

And when the dynamic migration quantity of the REDO log is changed into 0, changing the intermediate state of the distributed memory database into a normal state.

At this time, when the distributed memory database provides data service to the outside, the distributed memory database naturally recovers to normal logic, and the expansion is finished.

The redundant data is data that does not comply with the new routing rule.

The embodiment of the invention provides a distributed memory database online node expansion system, which comprises a routing rule generation module, a data migration module, an REDO log migration module and a switching access module;

The system also comprises a deleting module and a state switching module;

the deleting module is used for deleting the redundant data of the old node;

and the state switching module is used for changing the distributed memory database from the intermediate state to the normal state.

Assuming that the HASH bucket value is 12 (actually configurable, for example, 1024, which is required to be much larger than the number of nodes), there are initially 3 (M) nodes, and 1 (N) node needs to be added during subsequent expansion:

table 1 example of route comparison

After the nodes are added, records with hash values of 3, 7 and 11 are required to be migrated to the newly added nodes. The total migration data volume is N/(M + N), and the method can minimize the migration volume, thereby shortening the migration time and reducing the influence on an application system.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A method for on-line node expansion of a distributed memory database is characterized by comprising the following steps:

and 4, step 4: sending a switching instruction to an application client, and after the REDO log is migrated, the application client accesses a distributed memory database under a new routing rule;

wherein, the step 4 specifically comprises: step 4.1: judging whether backlog exists in the REDO log to be migrated, if yes, continuing to execute the step 3, and if not, executing the step 4.2;

step 4.2: setting the distributed memory database to be in an intermediate state, wherein the intermediate state is an REDO log which is in service hiding of the distributed memory database and is not in accordance with the new routing rule on an old node;

step 4.3: and sending a switching instruction to the application client, and accessing the distributed memory database under the new routing rule after the backlog migration of the application client is completed.

2. The method for online expanding nodes of the distributed memory database according to claim 1, wherein the step 1 comprises the following steps:

step 1.1: newly adding nodes;

step 1.2: creating a new routing rule comparison table;

3. The method for online expanding nodes of the distributed memory database according to claim 1 or 2, wherein the step 2 comprises the following steps:

step 2.1: recording a data migration starting time point T;

4. The method for online expanding nodes of the distributed memory database according to claim 3, wherein the step 3 comprises the following steps:

step 3.1: reading the REDO log of the old node from the time point T;

5. The method for online expanding nodes of a distributed memory database according to claim 1, wherein the method further comprises the step 5: and deleting the redundant data of the old node.

6. The method for online expanding nodes of distributed memory databases according to claim 5, further comprising the step 6: and changing the distributed memory database from the intermediate state to the normal state.

7. The method according to claim 5, wherein the redundant data is data that does not comply with a new routing rule.

8. A distributed memory database online extension node system is characterized by comprising a routing rule generation module, a data migration module, an REDO log migration module and a switching access module;

the switching access module is used for sending a switching instruction to the application client, and the application client accesses the distributed memory database under a new routing rule after the backlog migration is completed;

the switching access module is specifically configured to determine whether backlog exists in the REDO log to be migrated, if yes, continue to migrate the REDO log to be migrated, and if not, set the distributed memory database in an intermediate state, where the intermediate state is that the REDO log on the old node is hidden by the distributed memory database service and does not conform to the new routing rule, send a switching instruction to the application client, and after migration of the backlog to be backlogged by the application client is completed, access the distributed memory database under the new routing rule.

9. The system according to claim 8, wherein the system further comprises a deletion module and a status switching module;

the deleting module is used for deleting the redundant data of the old node;