CN105049508A

CN105049508A - Cloud data migration method

Info

Publication number: CN105049508A
Application number: CN201510429337.9A
Authority: CN
Inventors: 姜雪松; 袁家恒; 孙涛
Original assignee: Qilu University of Technology
Current assignee: Qilu University of Technology
Priority date: 2015-07-21
Filing date: 2015-07-21
Publication date: 2015-11-11

Abstract

The invention discloses a cloud data migration method. The cloud data migration method comprises the following specific realization processes: data is partitioned in a partition form organization in each storage node, namely, data of one storage node can be partitioned into a plurality of partitions; all storage nodes are respectively classified in a migration-in node set In_set and a migration-out node set Out_set according to a relationship of a standardized load value and 1/n, wherein the In_set comprises all the nodes, the load values of which are less than 1/n, and the Out_set comprises all the nodes, the load values of which are more than 1/n; cloud data migration is carried out, namely, partial data in the nodes in the Out_set is migrated onto the nodes in the In_set. Compared with the prior art, the cloud data migration method disclosed by the invention has the beneficial effects that: the average shortest time for migrating each data partition between the nodes is calculated by utilizing a particle swarm optimization algorithm; therefore, global and local searches are controlled; the resource optimization of a system is finally realized; and the cloud data migration method is high in practicability and liable to popularize.

Description

A kind of cloud data migration method

Technical field

The present invention relates to field of data storage, specifically a kind of practical cloud data migration method.

Background technology

The mass data storage possessing high reliability and extensibility is concerning Internet firm being a huge challenge, traditional database is often difficult to meet this demand, and many times for the retrieval of the specific system overwhelming majority be all based on major key inquiry, use relevant database to make inefficiency in this case, and expansion also will become a following very large difficult problem.Under these circumstances, use storage will be one well to select.For the storage system being deployed in cloud environment, Data Migration realizes the node dynamic expansion key technology balanced with elastic load, mainly comprises that migration plan (migrationplan) is formulated, routing iinformation is synchronous, the core content such as user's request forward and data coherence management.A large amount of state synchronized adjoint in data migration process can bring certain influence to systematic function, and therefore, how effectively reducing migration overhead is the problem that cloud service provider need put forth effort to solve.But, storage system have strict low of state, new virtualized environment, user to postpone to require and the unpredictability of access load and time variation bring new challenge to Data Migration.

Existing many a lot of algorithms are in order to solve time and the system resources consumption problem of Data Migration, there is following several method: 1) for the QoS security problem in stores service data migration process, author proposes a kind of method based on FEEDBACK CONTROL, periodically solve the optimum migration rate meeting QoS constraint, the main control problem paying close attention to migration bandwidth.2) based on the Data Migrating Strategy of greedy method, mainly for the data migration problems of Key/Value storage system, basic thought is, adopts statistical method on-line monitoring focus subregion, preferentially the partial data of focus subregion is migrated to the lighter neighbor node of load.For simplifying the complexity of migration operation, hash algorithm can keep the sequencing between Key value.The main deficiency of this two parts work is not consider migration overhead.

For cloud storage system, Data Migration realizes the node dynamic expansion key technology balanced with elastic load.How reducing transit time, reducing Data Migration affects duration to system, and reducing overhead is the problem that cloud service provider need put forth effort to solve.Although existing research work is mostly for the data migration problems under non-virtualized environment, these methods for cloud storage system often and inapplicable.For tackling above-mentioned challenge, data migration problems being included in load-balancing scenario and solving.The invention provides a kind of cloud data migration method, the method works out Data Migrating Strategy based on equilibrium degree, chooses optimum data migration operation, to reach the load balancing of system.

Summary of the invention

Technical assignment of the present invention is for above weak point, provides a kind of practical, cloud data migration method.

A kind of cloud data migration method, its specific implementation process is:

In each memory node, data are split with the form tissue of subregion, and namely the data of a memory node can be divided into some subregions, and subregion is the base unit of Data Migration and load monitoring;

All memory nodes are included into according to the relation of the load value after standardization and 1/n the node set In_set and the node set Out_set that moves out that moves into respectively, In_set comprises the node of all load values lower than 1/n, and Out_set then comprises the node of all load values higher than 1/n;

Carry out cloud Data Migration, to move in In_set on each node by the partial data in node each in Out_set, and the average shortest time of moving between node by calculating each data partition controls the overall situation and Local Search, make the equilibrium being reached system load in cloud environment between each node by Data Migration.

The detailed process of described cloud Data Migration is:

1) first set following parameter, set up storage system model: population, dimension size, the detected value/entropy of the load balancing that maximum iteration time Tmax or needs reach, the size of inertia weight Wmax, Wmix and optimal solution set P;

2) subregion obtained using each node of moving out is as a population, then occur in system that each population of multiple population is relatively independent, information transmission between each subregion is only limitted between the subregion of same population gained, and each population of initialization, calculates the fitness value of each particle;

3) carry out the migration in systems in which of particle according to the fitness value of each particle, follow in transition process, particle is to the minimum node migrates of the minimum fitness value of expense; To each particle, the desired positions Pbest of its adaptive value and its process is made comparisons, if better, then it can be used as current desired positions Pbest;

4) to each particle, its adaptive value and all particle desired positions Gbest are made comparisons, if better, then it can be used as current desired positions Gbest;

5) new speed and the position of particle is obtained according to formula iteration; When occurring that the load value of certain node of moving into reaches threshold values system total load value/nodes, now this node removes moving in set of node automatically, finds optimum and move into node in particle set of node of moving at this moment;

6) end condition: iterations reaches the maximum iteration time of setting or reaches the load balancing degrees detected value/entropy of setting.

The system model set up in described step 1) is:

Whole storage system is N dimension, searches for, forms initial population X by m subregion;

；

The positional information of each particle i represents with N dimensional vector:

；

Each particle i migration velocity be:

；

In above-mentioned formula, m is Population Size, i=1,2,3 ..., m.

Described step 2) in, fitness value is calculated by fitness function, and this function is specially:

Fi(x1，x2，x3)=x1^2+x2^2+x3^2；

Wherein:

X1 represents the throughput of network, i.e. the amount of network transmitting data in the unit interval;

X2 represents the size of partition data amount;

X3 represents the distance of node of the moving out subregion obtained and node of moving into;

I represents node of moving out.

In described step 5), after finding Pbest, Gbest two optimal solutions, the iterative formula that particle i obtains new speed and position is:

Wherein,

the speed being subregion i in kth time iteration and position;

C1 and c2 is Studying factors or claims accelerator coefficient, is respectively regulated to the maximum step-length of Pbest and Gbest direction migration, and reflection subregion individual experience and colony's empirical log, according to the impact of traveling locus, reflect the information interchange between each subregion;

Rand1 and rand2 is the random number between ［ 0,1 ］, increases the randomness of particle flight;

the position of subregion i at individual extreme point;

it is the position of the global extremum point of whole population.

In described step 6), when reaching stopping criterion for iteration termination of iterations step, if the maximal rate restriction vmax of particle: for preventing subregion away from search volume, every one dimension speed of particle should meet [-vmax, + vmax] between, then search volume is defined as interval [-xmax ,+xmax];

As vi >=vmax, make vi=xmax;

As vi≤-vmax, make vi=-xmax.

A kind of cloud data migration method of the present invention, has the following advantages:

A kind of cloud data migration method that the present invention proposes, works out Data Migrating Strategy based on equilibrium degree, chooses optimum data migration operation, to reach the load balancing of system; Be reduced in duration in data migration process, reduce the impact on systematic function, improve load balancing degrees; Effective reduction load tilts, thus reduce the overhead in data migration process, practical, be easy to promote.

Accompanying drawing explanation

Accompanying drawing 1 is realization flow figure of the present invention.

Embodiment

Below in conjunction with the drawings and specific embodiments, the invention will be further described.

The invention provides a kind of cloud data migration method, as shown in Figure 1, its specific implementation process is:

For more effectively identifying hot spot data, reduce Data Migration amount as far as possible, in each memory node, data are split with the form tissue of subregion simultaneously, namely the data of a memory node can be divided into some subregions, and subregion is the base unit of Data Migration and load monitoring;

The detailed process of described cloud Data Migration is:

The system model set up in described step 1) is:

；

Each particle i migration velocity be:

；

In above-mentioned formula, m is Population Size, i=1,2,3 ..., m.

Fi(x1，x2，x3)=x1^2+x2^2+x3^2；

Wherein:

X2 represents the size of partition data amount;

I represents node of moving out.

Wherein,

the speed being subregion i in kth time iteration and position;

the position of subregion i at individual extreme point;

it is the position of the global extremum point of whole population.

In described step 6), when reaching stopping criterion for iteration termination of iterations step, be generally set to maximum iteration time Tmax, computational accuracy or the maximum stagnation step number △ t of optimal solution, in the model, with maximum iteration time Tmax for stopping criterion for iteration.If the maximal rate restriction vmax of particle: for preventing subregion away from search volume, every one dimension speed of particle should meet between [-vmax ,+vmax], then search volume is defined as interval [-xmax ,+xmax];

As vi >=vmax, make vi=xmax;

As vi≤-vmax, make vi=-xmax.

If vmax is too large, perhaps subregion can fly over outstanding region, if this value is too little, then particles possibly cannot detect the region beyond the optimal region of local fully, and are absorbed in local optimum.

Above-mentioned embodiment is only concrete case of the present invention; scope of patent protection of the present invention includes but not limited to above-mentioned embodiment; claims of any a kind of cloud data migration method according to the invention and the those of ordinary skill of any described technical field to its suitable change done or replacement, all should fall into scope of patent protection of the present invention.

Claims

1. a cloud data migration method, is characterized in that, its specific implementation process is,

2. a kind of cloud data migration method according to claim 1, is characterized in that, the detailed process of described cloud Data Migration is:

3. a kind of cloud data migration method according to claim 2, it is characterized in that, the system model set up in described step 1) is:

；

Each particle i migration velocity be:

；

In above-mentioned formula, m is Population Size, i=1,2,3 ..., m.

4. a kind of cloud data migration method according to claim 2, is characterized in that, described step 2) in, fitness value is calculated by fitness function, and this function is specially:

Fi(x1，x2，x3)=x1^2+x2^2+x3^2；

Wherein:

X2 represents the size of partition data amount;

I represents node of moving out.

5. a kind of cloud data migration method according to claim 3 or 4, is characterized in that, in described step 5), after finding Pbest, Gbest two optimal solutions, the iterative formula that particle i obtains new speed and position is:

Wherein,

the speed being subregion i in kth time iteration and position;

the position of subregion i at individual extreme point;

it is the position of the global extremum point of whole population.

6. a kind of cloud data migration method according to claim 2, it is characterized in that, in described step 6), when reaching stopping criterion for iteration termination of iterations step, if the maximal rate restriction vmax of particle: for preventing subregion away from search volume, every one dimension speed of particle should meet between [-vmax ,+vmax], then search volume is defined as interval [-xmax ,+xmax];

As vi >=vmax, make vi=xmax;

As vi≤-vmax, make vi=-xmax.

7. a kind of cloud data migration method according to claim 5, it is characterized in that, in described step 6), when reaching stopping criterion for iteration termination of iterations step, if the maximal rate restriction vmax of particle: for preventing subregion away from search volume, every one dimension speed of particle should meet between [-vmax ,+vmax], then search volume is defined as interval [-xmax ,+xmax];

As vi >=vmax, make vi=xmax;

As vi≤-vmax, make vi=-xmax.