CN104504012A - Database data migration method based on inverse topology

Info

Publication number: CN104504012A
Application number: CN201410758140.5A
Authority: CN
Inventors: 赵淦森; 林巧英; 李立波; 高晓杰; 周尚勤; 王欣明
Original assignee: South China Normal University
Current assignee: South China Normal University
Priority date: 2014-12-10
Filing date: 2014-12-10
Publication date: 2015-04-08

Abstract

The invention discloses a database data migration method based on inverse topology. The method comprises the following steps: obtaining a topological relation graph from the database to be migrated; deriving the extended order sequence for the migration process from the topological relation graph and a schema (pattern) conversion model; and migrating the data and the schema into NoSQL according to the obtained extended order sequence. Because schema migration is carried out together with data migration, the dependent data in the data tables and the inverted data are correctly integrated into the migrated NoSQL database. Any join query in the SQL (Structured Query Language) database before migration can therefore find the correct result in a single table of the NoSQL database after migration, without accessing multiple tables, which effectively improves the query efficiency of the migrated NoSQL database. The method can be widely applied in the computer field.

Description

A database data migration method based on inverse topology

Technical field

The present invention relates to the field of computer technology, and in particular to a database data migration method based on inverse topology.

Background technology

Existing related research mainly covers three areas: denormalization, data migration, and schema (pattern) conversion. Denormalization relaxes the normal-form constraints of a traditional relational database and designs the database with appropriate redundant storage in order to improve query efficiency. Data migration, in this sense, moves only the data from a SQL database to a NoSQL database: for example, the data of a table in the SQL database is simply dumped into a Collection in MongoDB, or into a table in another NoSQL database, without schema conversion, so the dependencies between the tables of the SQL database are not migrated. Schema conversion not only dumps the data of the SQL database into the NoSQL database but also migrates the corresponding schema.

Among methods for migrating data from a traditional relational database to a NoSQL database, much current work migrates only the table contents (data migration) without schema conversion, so the associations between the SQL tables are lost during migration. The migrated NoSQL database therefore cannot return the result of an original SQL join query from a single table; several separate queries are needed, which makes queries inefficient.

Summary of the invention

To solve the above technical problem, the object of the present invention is to provide a database data migration method based on inverse topology that performs schema conversion and improves query efficiency after migration.

The technical solution adopted by the present invention is:

A database data migration method based on inverse topology, comprising the following steps:

A. Obtain the topological relation graph of the database to be migrated;

B. From the topological relation graph and the schema conversion model, derive the extended order sequence for the migration process;

C. Migrate the data and the schema into NoSQL according to the obtained extended order sequence.

As a further improvement of the database data migration method based on inverse topology, step B comprises:

B1. Take any node from the leaf node set of the topological relation graph as the start node;

B2. Process all edges that have this start node as their arc tail, obtaining the integration order of each corresponding edge;

B3. Move this start node out of the leaf node set and into the isolated node set;

B4. Append the integration orders obtained during processing to the extended order sequence;

B5. Return to step B1, until the leaf node set is empty.

As a further improvement of the database data migration method based on inverse topology, step B2 comprises:

B21. Take any edge that has this start node as its arc tail, and expand and integrate the arc head node of this edge, obtaining the integration order of this edge;

B22. Determine whether this arc head node is a leaf node once this edge is excluded; if so, put this arc head node into the leaf node set; otherwise, perform step B23;

B23. Return to step B21, until all edges that have this start node as their arc tail have been processed.
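Steps B1 to B5, with the sub-steps B21 to B23, can be sketched as a short routine. This is only an illustrative reading of the text, not the patented implementation. It assumes that each edge is given as a `(head, tail)` pair meaning "table `head` depends on table `tail`", that a leaf node is a node with no unprocessed edge pointing into it, and that "take any node" may be resolved by taking the first one. The function name, the table names and the cycle check are hypothetical additions.

```python
def extended_order_sequence(edges):
    """Return the edges in integration order (a sketch of steps B1-B5).

    edges: iterable of (head, tail) pairs; `head` depends on `tail`.
    """
    edges = list(edges)
    # Collect nodes in first-seen order so the result is deterministic.
    nodes = list(dict.fromkeys(n for head, tail in edges for n in (tail, head)))
    out_edges = {n: [] for n in nodes}   # tail -> heads that depend on it
    pending_in = {n: 0 for n in nodes}   # head -> unprocessed edges into it
    for head, tail in edges:
        out_edges[tail].append(head)
        pending_in[head] += 1

    leaf_set = [n for n in nodes if pending_in[n] == 0]
    isolated_set = []
    sequence = []

    while leaf_set:                           # B5: loop until no leaf is left
        start = leaf_set[0]                   # B1: any leaf; take the first
        for head in out_edges[start]:         # B2/B21: edges with start as arc tail
            sequence.append((head, start))    # B4: record the integration order
            pending_in[head] -= 1
            if pending_in[head] == 0:         # B22: head is a leaf without this edge
                leaf_set.append(head)
        leaf_set.remove(start)                # B3: leaf set -> isolated set
        isolated_set.append(start)

    # Assumption not stated in the source: reject cyclic dependencies.
    if len(isolated_set) != len(nodes):
        raise ValueError("dependency graph contains a cycle")
    return sequence


# Hypothetical tables: order_item depends on orders, orders depends on customer.
print(extended_order_sequence([("order_item", "orders"), ("orders", "customer")]))
# -> [('orders', 'customer'), ('order_item', 'orders')]
```

Because a node enters the leaf set only after every edge into it has been processed, each table is fully expanded before it is merged into any table that depends on it.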

As a further improvement of the database data migration method based on inverse topology, step C comprises:

C1. Migrate the data of the nodes that appear only as arc tails into the corresponding data tables in the NoSQL database;

C2. Following the extended order sequence, migrate the data of the remaining nodes, in their converted schema, into the corresponding data tables in the NoSQL database in turn.

As a further improvement of the database data migration method based on inverse topology, step C2 comprises:

C21. Take the integration orders out of the extended order sequence one by one;

C22. Merge the information of the arc tail node of the integration order taken out into the arc head node, thereby expanding the arc head node;

C23. Migrate the data of the expanded arc head node into the corresponding data table in the NoSQL database.
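The expansion in step C22 can be illustrated for a single integration order. The text does not say how the arc tail's information is stored inside the arc head, so the sketch below makes an assumption: each head row embeds the matching tail row as a nested record, matched through a foreign-key column. The function name `expand_head`, its parameters and the sample rows are all hypothetical.

```python
def expand_head(head_rows, tail_rows, fk, pk, embed_as):
    """Embed the matching tail row into every head row (a sketch of C22).

    head_rows, tail_rows: lists of dicts, one dict per table row.
    fk: foreign-key column in the head table.
    pk: primary-key column in the tail table.
    embed_as: key under which the tail row is stored in the head row.
    """
    tail_by_key = {row[pk]: row for row in tail_rows}
    expanded = []
    for row in head_rows:
        merged = dict(row)                          # leave the input rows untouched
        merged[embed_as] = tail_by_key.get(row[fk])  # None when nothing matches
        expanded.append(merged)
    return expanded


# Hypothetical data: orders (arc head) depends on customer (arc tail).
customers = [{"id": 1, "name": "Ann"}]
orders = [{"id": 10, "customer_id": 1}]
print(expand_head(orders, customers, fk="customer_id", pk="id", embed_as="customer"))
# -> [{'id': 10, 'customer_id': 1, 'customer': {'id': 1, 'name': 'Ann'}}]
```

With this layout, a query that previously joined `orders` to `customer` can read a single expanded record instead.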

The beneficial effects of the present invention are:

The database data migration method based on inverse topology of the present invention carries out schema migration together with data migration, so that the data of tables that have dependencies, and the inverted data, are correctly integrated into the NoSQL database after migration. This ensures that any join query in the SQL database before migration can find the correct result in a single table of the NoSQL database after migration, with no need to access multiple tables, which effectively improves the query efficiency of the NoSQL database after migration.

Brief description of the drawings

The specific embodiments of the present invention are further described below with reference to the accompanying drawings:

Fig. 1 is a flow chart of the steps of the database data migration method based on inverse topology of the present invention;

Fig. 2 is a schematic diagram of the schema conversion in embodiment one of the database data migration method based on inverse topology of the present invention;

Fig. 3 is a schematic diagram of the topological relations in embodiment two of the database data migration method based on inverse topology of the present invention.

Embodiment

With reference to Fig. 1, the database data migration method based on inverse topology of the present invention comprises the following steps:

A. Obtain the topological relation graph of the database to be migrated;

B5. Return to step B1, until the leaf node set is empty.

Specifically, step B can be summarized as follows:

(1) Take any node u from the leaf node set P of the topological relation graph;

(2) Take any edge <v, u> that has this node u as its arc tail and expand the arc head node v of this edge, that is, inject all the information of node u into node v; repeat until all edges that have u as their arc tail have been processed;

(3) When node v has been fully expanded, that is, when it is no longer the arc head of any remaining edge, put it into the leaf node set P. When leaf node u is no longer depended on by any node, that is, when no remaining edge has it as its arc tail, move it out of the node set P and into the isolated node set T;

(4) Process the leaf node set P in this loop until P is empty.

Specific embodiment one of step B of the present invention is as follows:

With reference to Fig. 2, suppose the relational database to be migrated has tables T3, T2 and T1, where T3 depends on T2 through a foreign key and T2 depends on T1 through a foreign key. Expressed as the corresponding directed acyclic graph, the tables and dependencies of the database give three nodes T3, T2, T1 and two edges <T3, T2>, <T2, T1>. The concrete steps for this case are as follows:

S1. Take node T1 from the leaf node set P={T1};

S2. Take any edge <T2, T1> that has node T1 as its arc tail, and expand and integrate the arc head node T2 of this edge, that is, inject all the information of node T1 into node T2. Node T2 is now fully expanded, that is, it is no longer the arc head of any remaining edge, so it is put into the leaf node set P, P={T1, T2}. Leaf node T1 is no longer depended on by any node, that is, no remaining edge has it as its arc tail, so it is moved out of the node set P, P={T2}, and put into the isolated node set T, T={T1};

S3. The integration order <T2, T1> is appended to the sequence S of integration order results for the data tables, S={<T2, T1>};

S4. Continue by taking node T2 from the leaf node set P={T2};

S5. Take any edge <T3, T2> that has node T2 as its arc tail, and expand and integrate the arc head node T3 of this edge, that is, inject all the information of node T2 into node T3. Node T3 is now fully expanded, that is, it is no longer the arc head of any remaining edge, so it is put into the leaf node set P, P={T2, T3}. Leaf node T2 is no longer depended on by any node, that is, no remaining edge has it as its arc tail, so it is moved out of the node set P, P={T3}, and put into the isolated node set T, T={T1, T2};

S6. The integration order <T3, T2> is appended to the sequence S of integration order results for the data tables, S={<T2, T1>, <T3, T2>};

S7. Continue by taking node T3 from the leaf node set P={T3}; no edge has this node T3 as its arc tail, so the node is moved out of the node set P, P={}, and put into the isolated node set T, T={T1, T2, T3};

S8. Now P={}, that is, P is the empty set;

S9. Output the sequence S of integration order results for the data tables, S={<T2, T1>, <T3, T2>}.
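The walk-through above can be reproduced with a small trace, shown here as a sketch under stated assumptions. The foreign-key dependencies are written as a hypothetical dictionary that maps each table to the tables it references, every table is assumed to appear as a key, and the helper `trace_integration` is an illustrative name. After each node is handled, the trace records the leaf node set P, the isolated node set T and the sequence S.

```python
def trace_integration(foreign_keys):
    """Record (P, T, S) after each processed node for a foreign-key map.

    foreign_keys: dict mapping each table to the list of tables it references.
    """
    # Derive the edges <head, tail> of the dependency graph from the map.
    edges = [(table, ref) for table, refs in foreign_keys.items() for ref in refs]
    remaining = {table: len(refs) for table, refs in foreign_keys.items()}

    P = [table for table, count in remaining.items() if count == 0]  # leaf set
    T = []   # isolated node set
    S = []   # integration order sequence
    snapshots = []

    while P:
        u = P[0]
        for head, tail in edges:
            if tail == u:                 # edge with u as its arc tail
                S.append((head, tail))
                remaining[head] -= 1
                if remaining[head] == 0:  # head is now fully expanded
                    P.append(head)
        P.remove(u)
        T.append(u)
        snapshots.append((list(P), list(T), list(S)))
    return snapshots


# Embodiment one: T3 references T2, T2 references T1.
for P, T, S in trace_integration({"T3": ["T2"], "T2": ["T1"], "T1": []}):
    print("P =", P, " T =", T, " S =", S)
```

The three recorded states correspond to the values of P, T and S reached at the end of steps S3, S6 and S7.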

C21. Take the integration orders out of the extended order sequence one by one;

Specific embodiment two of step C of the present invention is as follows:

With reference to Fig. 3, the extended order sequence is:

S=[<T2,T1>,<T3,T1>,<T6,T5>,<T3,T2>,<T6,T2>].

1. First, migrate the data of all leaf nodes T1 and T5 into tables T1' and T5' in the NoSQL database;

2. Take the first tuple <T2,T1> out of S and merge the full content of T1 into T2; the expanded T2 is denoted T2'', and the data of T2'' is migrated into T2' of the NoSQL database;

3. Repeat the expansion method of step 2 on the remaining sequence [<T3,T1>, <T6,T5>, <T3,T2>, <T6,T2>] in S until S is empty.
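The three steps of this embodiment can be sketched end to end. Several things below are assumptions rather than part of the source: the NoSQL database is stood in for by a plain dictionary, every table has a primary key column `id`, the foreign-key column of each edge is supplied in a hypothetical `fk_columns` map, the tail is embedded in its already expanded form, and the sample rows are invented for illustration.

```python
def migrate(tables, sequence, fk_columns, pk="id"):
    """Apply steps 1-3 to in-memory tables and return the stand-in NoSQL store.

    tables: dict of table name -> list of row dicts.
    sequence: list of (head, tail) integration orders.
    fk_columns: dict of (head, tail) -> foreign-key column in the head table.
    """
    working = {name: [dict(r) for r in rows] for name, rows in tables.items()}
    nosql = {}

    # Step 1: tables that are never an arc head are migrated unchanged.
    heads = {head for head, _ in sequence}
    for name, rows in working.items():
        if name not in heads:
            nosql[name] = rows

    # Steps 2-3: expand each head with its tail, then store the expanded head.
    for head, tail in sequence:
        tail_by_key = {row[pk]: row for row in working[tail]}
        for row in working[head]:
            row[tail] = tail_by_key.get(row[fk_columns[(head, tail)]])
        nosql[head] = working[head]
    return nosql


S = [("T2", "T1"), ("T3", "T1"), ("T6", "T5"), ("T3", "T2"), ("T6", "T2")]
tables = {
    "T1": [{"id": 1, "a": "x"}],
    "T5": [{"id": 5}],
    "T2": [{"id": 2, "t1_id": 1}],
    "T3": [{"id": 3, "t1_id": 1, "t2_id": 2}],
    "T6": [{"id": 6, "t5_id": 5, "t2_id": 2}],
}
fk_columns = {("T2", "T1"): "t1_id", ("T3", "T1"): "t1_id", ("T6", "T5"): "t5_id",
              ("T3", "T2"): "t2_id", ("T6", "T2"): "t2_id"}
store = migrate(tables, S, fk_columns)
print(store["T3"][0]["T2"]["T1"]["a"])  # -> x
```

In this sketch a lookup that would have joined T3, T2 and T1 in the SQL database reads one T3 record in the store, which is the single-table access the method aims for.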

The above describes preferred implementations of the present invention, but the invention is not limited to the described embodiments. Those of ordinary skill in the art can make various equivalent variations or substitutions without departing from the spirit of the present invention, and all such equivalent variations or substitutions fall within the scope defined by the claims of this application.

Claims

1. A database data migration method based on inverse topology, characterized by comprising the following steps:

A. Obtain the topological relation graph of the database to be migrated;

2. The database data migration method based on inverse topology according to claim 1, characterized in that step B comprises:

B5. Return to step B1, until the leaf node set is empty.

3. The database data migration method based on inverse topology according to claim 2, characterized in that step B2 comprises:

4. The database data migration method based on inverse topology according to claim 1, characterized in that step C comprises:

5. The database data migration method based on inverse topology according to claim 4, characterized in that step C2 comprises:

C21. Take the integration orders out of the extended order sequence one by one;