CN101923496A - Method for achieving parallel reconstruction of data on RAID - Google Patents

Info

Publication number
CN101923496A
Authority
CN
China
Prior art keywords
block
data
reconstruction
disks
subarray
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010241282
Other languages
Chinese (zh)
Inventor
万继光
谢长生
谭志虎
王继彬
杨寅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology
Priority to CN 201010241282
Publication of CN101923496A

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for parallel reconstruction of data on a RAID (redundant array of independent disks), in which parallel reconstruction is achieved through a sub-array distribution scheme where each sub-array is composed of blocks located on different disks. The method comprises the following steps: for each block of a failed data disk, simultaneously locating the other blocks that form the same sub-array with it; and reconstructing the data of that block from the data in those other blocks. The method can shorten data reconstruction time by a multiple. In particular, all reconstruction operations, both the reads and the writes of the related data, proceed in parallel, and requests from foreground users can still be served in real time during reconstruction. This effectively reduces the average user response time and improves both the reliability and the performance of the system.

Description

A method for parallel reconstruction of data on a RAID
Technical field
The invention belongs to the technical field of disk array data reconstruction, and in particular relates to a method that supports parallel data reconstruction.
Background art
A disk failure forces a storage system to operate in degraded mode, and operating in this mode lowers overall system performance. To guarantee the performance and reliability of the system, the data of the failed disk must therefore be reconstructed. Data reconstruction has to consider two aspects: preserving system performance during reconstruction, and speeding up the reconstruction itself. Many data reconstruction techniques address these aspects. Current work targets two metrics, reconstruction time and average user response time, and improves system performance mainly through two kinds of optimization: the distribution of data or parity blocks, and data-flow analysis.
The first kind of approach uses the distribution of data or parity blocks to improve reconstruction performance. Existing research has proposed the distributed sparing method, which improves the space utilization of the hot spare disk and speeds up reconstruction to some extent. In this method, the normally idle spare disk is also brought into the data distribution of the RAID: every disk in the array reserves part of its space as a free area that serves as spare space, and when a failure occurs, the data of a stripe is recovered through the parity computation and written to the free area within the same stripe. This effectively avoids the write bottleneck of a traditional dedicated spare disk and shortens the reconstruction time, but because recovery requires a large number of read operations, reads on individual disks remain a bottleneck. The parity declustering method balances reconstruction time against average user response time. Its parity distribution differs from that of a traditional RAID (RAID 5 in particular) mainly in that the parity data is spread less widely: the data and its parity are distributed over fewer disks than in RAID 5. As a result, after a disk failure the system can carry out data reconstruction while the disks not involved in reconstruction continue to serve foreground user requests. The reconstruction performance of this distribution is not much worse than that of traditional reconstruction, and it is better on the average user response time metric.
The second kind of approach uses data-flow characteristics to optimize the storage distribution. The pipelined reconstruction method was proposed mainly for continuous-media servers. This application scenario places high reliability demands on the system: if a disk fails, the response to foreground user requests must be unaffected or only slightly affected. The method therefore exploits the sequential nature of media data: reconstruction reads are performed in units of tracks, and surplus data is kept in a cache for use by the next reconstruction operation. Other methods that exploit data-flow characteristics include the popularity-based multithreaded reconstruction optimization, which is based on hot-spot data. Although all of these methods improve reconstruction time and average response time to a greater or lesser degree, reconstruction time remains long, which still threatens system reliability.
Summary of the invention
The present invention proposes a method for parallel reconstruction of data on a RAID that reduces disk reconstruction time by a multiple, effectively balances reconstruction time against average user response time, and improves the reliability of the system.
A method for parallel reconstruction of data on a RAID involves two or more data disks and at least one spare disk. Each data disk is divided into the same number of blocks, and the blocks are used to build subarrays such that the blocks of any one data disk belong to different subarrays and each block belongs to exactly one subarray.
The data of a single data disk is reconstructed as follows: for each block of that data disk, the other blocks that form the same subarray with it are located, all blocks being processed simultaneously; the data of the block is reconstructed from the data of those other blocks; and the reconstruction result is saved to a spare disk or to a data disk that does not participate in this reconstruction.
As an optimization, for any two subarrays, at most one block of one subarray and one block of the other subarray reside on the same data disk.
When the data disks use RAID levels 1 to 6, the subarrays are constructed as follows:
(11) The data disks are divided into c groups, where c is greater than 2. The blocks of the 1st to c-th groups of data disks form the 1st, 2nd, ..., c-th block distribution matrices $m_0, m_1, \ldots, m_{c-1}$, respectively. If the data disks cannot be divided evenly into c groups, the even-division requirement is met either by adding virtual disks that are divided into blocks in the same way as the data disks or by reducing the number of data disks.
(12) The block numbers in the first block distribution matrix are initialized. Row j of the (i+1)-th block distribution matrix is obtained by cyclically shifting row j of the i-th matrix right or left by j-1 positions, where i = 1, ..., c-1, j = 1, ..., n, and n is the number of blocks into which each data disk is divided.
(13) The blocks that carry the same number on the individual data disks are selected to form one subarray.
When the data disks use RAID level 10, the number of data disks m is an even number not less than 4, and the subarrays are constructed as follows:
(21) The data disks are divided into two groups; the blocks of the first and second groups form the first and second block distribution matrices $m_0$ and $m_1$, respectively.
(22) Each block in row x of the first block distribution matrix is given an initial number of the form x.y, where x = 0, 1, ..., m/2-1 and y = 0, 1, ..., m/2-1. The block numbers in row x of the second block distribution matrix are obtained by cyclically shifting the block numbers in row x of the first matrix right or left by x positions.
(23) The blocks that carry the same number are selected from the data disks to form one subarray.
The technical effect of the present invention is as follows. The disks are partitioned according to the parallel reconstruction algorithm, and a suitable RAID level is then selected to organize the disk blocks into multiple sub-RAIDs according to the partition table. Unlike reconstruction in a traditional array, reconstruction after a disk failure under this distribution causes no read/write interference between disks, so reconstruction speed improves significantly. During reconstruction, the disks that do not participate can still serve foreground user requests. The invention thus effectively balances reconstruction time against average user response time and improves system performance.
Description of drawings
Fig. 1 is the matrix distribution diagram for 16 disks;
Fig. 2 is the matrix distribution diagram for 10 disks;
Fig. 3 is the optimized matrix distribution diagram for 10 disks;
Fig. 4 is the data reconstruction structure diagram of parallel reconstruction RAID 5;
Fig. 5 is the logical mapping diagram of parallel reconstruction RAID 5;
Fig. 6 is the reconstruction flowchart of parallel reconstruction RAID 5;
Fig. 7 is the spare disk optimization strategy diagram of parallel reconstruction RAID 5;
Fig. 8 is the data reconstruction flowchart of parallel reconstruction RAID 10;
Fig. 9 is the test system structure diagram;
Fig. 10 is the parallel reconstruction time test chart (unit: seconds);
Fig. 11 is the parallel reconstruction average user response time test chart (unit: milliseconds).
Embodiment
1. Theoretical foundation
The present invention mainly realizes parallel reconstruction of a RAID. The data distribution of a traditional RAID is not sufficient for truly parallel reconstruction. For example, a standard RAID 5 starts data reconstruction after the system enters degraded mode, but because reads and writes conflict during this process, the whole reconstruction takes a very long time. In online reconstruction in particular, the array must also respond to foreground user requests while reconstructing, which lengthens reconstruction further. The present invention targets this problem by adopting a parallel reconstruction block distribution strategy on top of standard RAID structures: each disk is divided into blocks, and RAIDs are formed with the block as the unit. The invention calls a RAID organized in this way a subarray (or subRAID). This organization allows data to be reconstructed in parallel without read/write conflicts, so reconstruction time can be shortened by a multiple.
Table 1 symbol definition
Figure BDA0000023910410000051
Figure BDA0000023910410000061
The parallel reconstruction distribution is introduced first for subarrays organized as standard RAID 5, using the symbol definitions of Table 1.
The parallel reconstruction block distribution is described below for the case D = 9, N = 3. The initial distribution table is first divided into 3 groups of sub-block tables with 3 disks per group. The first group is written as a 3 × 3 matrix denoted $m_0$ and is initialized with block numbers as follows:
$$m_0 = \begin{pmatrix} 0 & 1 & 2 \\ 3 & 4 & 5 \\ 6 & 7 & 8 \end{pmatrix}, \quad p_{0,1} = (3\ 4\ 5), \quad p_{0,2} = (6\ 7\ 8).$$
Each element number in the distribution matrix $m_0$ is the block number of a subarray. The distribution matrices $m_1$ and $m_2$ are constructed with the cyclic shift function $SH$: row j of the (i+1)-th distribution matrix is obtained by cyclically shifting row j of the i-th distribution matrix, $p_{i,j}$, and repeating this step yields the complete distribution table. The number of positions shifted depends on the row $p_{i,j}$, while the shift direction must be uniform, either cyclic left shift or cyclic right shift. In this example $m_1$ and $m_2$ are generated by cyclic right shifts:
$$m_1 = \begin{pmatrix} SH_r^{0}(p_{0,0}) \\ SH_r^{1}(p_{0,1}) \\ SH_r^{2}(p_{0,2}) \end{pmatrix} = \begin{pmatrix} 0 & 1 & 2 \\ 5 & 3 & 4 \\ 7 & 8 & 6 \end{pmatrix}$$
$$m_2 = \begin{pmatrix} SH_r^{0}(p_{1,0}) \\ SH_r^{1}(p_{1,1}) \\ SH_r^{2}(p_{1,2}) \end{pmatrix} = \begin{pmatrix} 0 & 1 & 2 \\ 4 & 5 & 3 \\ 8 & 6 & 7 \end{pmatrix} = \begin{pmatrix} SH_r^{0}(p_{0,0}) \\ SH_r^{2}(p_{0,1}) \\ SH_r^{4}(p_{0,2}) \end{pmatrix}$$
In general, if the initial distribution matrix $m_0$ is known, the matrix numbered i is
$$m_i = \begin{pmatrix} SH_r^{0}(p_{i-1,1}) \\ SH_r^{1}(p_{i-1,2}) \\ \vdots \\ SH_r^{N-1}(p_{i-1,N}) \end{pmatrix} \quad \text{or} \quad m_i = \begin{pmatrix} SH_l^{0}(p_{i-1,1}) \\ SH_l^{1}(p_{i-1,2}) \\ \vdots \\ SH_l^{N-1}(p_{i-1,N}) \end{pmatrix}, \quad i \in [1, N-1] \qquad \text{(Formula 1)}$$
In Formula 1, the subscripts r and l of the function $SH$ denote cyclic right shift and cyclic left shift, respectively. The final matrix distribution table is $M = (m_0, m_1, \ldots, m_{N-1})$.
The parallel reconstruction distribution for 9 disks is therefore
$$N = 3, \quad M = \begin{pmatrix} 0 & 1 & 2 & 0 & 1 & 2 & 0 & 1 & 2 \\ 3 & 4 & 5 & 5 & 3 & 4 & 4 & 5 & 3 \\ 6 & 7 & 8 & 7 & 8 & 6 & 8 & 6 & 7 \end{pmatrix} \qquad \text{(Formula 2)}$$
A subarray is formed by selecting the blocks with the same number from each distribution matrix m, that is, the blocks carrying the same number in the final matrix distribution table M. The table M generated in Formula 2 yields 9 subarrays, denoted subRAID 0, subRAID 1, ..., subRAID 8.
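The construction of Formula 1 and Formula 2 can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `rotate_right`, `build_table` and `subarray_disks` are not from the patent, and it assumes cyclic right shifts with 0-based row indices, so that row r of group g is row r of $m_0$ shifted right by g·r positions.

```python
def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def build_table(n):
    """Return the distribution table M for n*n disks as a list of n rows.

    Column d is disk d; the entry in row r is the subarray number of
    the r-th block on that disk.
    """
    m0 = [[r * n + c for c in range(n)] for r in range(n)]
    table = []
    for r in range(n):
        row = []
        for g in range(n):  # group g holds distribution matrix m_g
            # m_g is m_(g-1) with row r shifted by r, i.e. m_0 shifted by g*r
            row.extend(rotate_right(m0[r], g * r))
        table.append(row)
    return table


def subarray_disks(table):
    """Map each subarray number to the disks that hold its blocks."""
    members = {}
    for row in table:
        for disk, sub in enumerate(row):
            members.setdefault(sub, []).append(disk)
    return members


M = build_table(3)
for row in M:
    print(row)
print(subarray_disks(M)[6])  # disks holding the blocks of subRAID 6
```

For n = 3 the printed rows reproduce the table M of Formula 2, and the last line lists the three disks whose blocks make up subRAID 6.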
2. Theoretical proof
It can be proved that if the number N of data disks (equivalently, the number of blocks) forming a subarray is prime, an array distribution with the parallel reconstruction property can be constructed by cyclic shifting. Take an array of 9 disks in which every 3 disks form one group. To express the parallel distribution as sets, regard each disk as a column whose entries are the block numbers on that disk, for example
$$m_0 = \begin{pmatrix} 0 & 1 & 2 \\ 3 & 4 & 5 \\ 6 & 7 & 8 \end{pmatrix}$$
The matrix $m_0$ represents the initial distribution of the array. Let the sets $u_{0,0}$, $u_{0,1}$, $u_{0,2}$ denote the three columns of $m_0$, that is, $u_{0,0} = \{0, 3, 6\}$, $u_{0,1} = \{1, 4, 7\}$, $u_{0,2} = \{2, 5, 8\}$. Then $m_0$ is written as the collection $U_0 = (u_{0,0}, u_{0,1}, u_{0,2})$, and likewise $m_1$ and $m_2$ are written as $U_1 = (u_{1,0}, u_{1,1}, u_{1,2})$ and $U_2 = (u_{2,0}, u_{2,1}, u_{2,2})$, with the matrices $m_1$ and $m_2$ as given above. Parallel reconstruction requires the following conditions: between groups, $u_{i,j} \cap u_{j,i} = \{e\}$, and within a group, $u_{i,j} \cap u_{i,k} = \emptyset$ for $j \neq k$, where $i, j, k \in [0, N-1]$. In other words, columns from different groups share exactly one element, and columns within the same group are disjoint.
The proof below uses the set notation above; for convenience, the matrix subscripts in the proof start from 1 rather than 0. Let $m_{i,p}$ denote column p of the i-th submatrix and $m_{j,q}$ denote column q of the j-th submatrix. Write column p of the i-th submatrix as
$$m_{i,p} = \begin{pmatrix} a_{1,p} & a_{2,p} & a_{3,p} & \cdots & a_{R,p} \end{pmatrix}^{T}$$
and column q of the j-th submatrix as
$$m_{j,q} = \begin{pmatrix} b_{1,q} & b_{2,q} & b_{3,q} & \cdots & b_{R,q} \end{pmatrix}^{T}$$
where $a_{x,y}$ and $b_{x,y}$ denote the elements in row x, column y of the distribution matrices $m_i$ and $m_j$, respectively. Because corresponding rows are obtained from one another by cyclic shifting, $m_{j,q}$ can be expressed through the elements of $m_i$:
$b_{1,q} = a_{1,q}$, $b_{2,q} = a_{2,(q+j-i)\%N}$, $b_{3,q} = a_{3,[q+2\times(j-i)]\%N}$, ..., $b_{R,q} = a_{R,[q+(R-1)\times(j-i)]\%N}$, that is,
$$m_{j,q} = \begin{pmatrix} b_{1,q} \\ b_{2,q} \\ b_{3,q} \\ \vdots \\ b_{R,q} \end{pmatrix} = \begin{pmatrix} a_{1,q} \\ a_{2,(q+j-i)\%N} \\ a_{3,[q+2\times(j-i)]\%N} \\ \vdots \\ a_{R,[q+(R-1)\times(j-i)]\%N} \end{pmatrix}, \quad R \in [1, N],\ R \text{ an integer}$$
We know that for any
Figure BDA0000023910410000092
$m_{i,p}$ and $m_{j,q}$ then have an identical element in the same row:
$$\begin{pmatrix} a_{1,p} \\ a_{2,p} \\ a_{3,p} \\ \vdots \\ a_{R,p} \end{pmatrix} \Leftrightarrow \begin{pmatrix} a_{1,q} \\ a_{2,(q+j-i)\%N} \\ a_{3,[q+2\times(j-i)]\%N} \\ \vdots \\ a_{R,[q+(R-1)\times(j-i)]\%N} \end{pmatrix}$$
Suppose two rows, numbered m and n with m ≠ n, both contain identical elements. Then the condition
Figure BDA0000023910410000094
holds, i.e. $[(m-n)\times(j-i)]\%N = 0$. Because
Figure BDA0000023910410000095
it follows that N cannot be prime, which contradicts the assumption that N is prime, so the supposition is false. The intersection of $m_{i,p}$ and $m_{j,q}$ therefore contains exactly one common element, which completes the proof. This distribution can thus realize conflict-free parallel reconstruction.
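The property proved above can also be checked numerically for small N. The following is a minimal sketch under the same assumptions as the construction of Formula 1 (cyclic right shifts, row r of group g shifted by g·r positions); `disk_columns` and `shared_counts` are hypothetical helper names introduced only for this illustration.

```python
from itertools import combinations


def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def disk_columns(n):
    """Return (group, set of subarray numbers) for each of the n*n disks."""
    m0 = [[r * n + c for c in range(n)] for r in range(n)]
    columns = []
    for g in range(n):
        rows = [rotate_right(m0[r], g * r) for r in range(n)]
        for c in range(n):
            columns.append((g, {rows[r][c] for r in range(n)}))
    return columns


def shared_counts(n):
    """Return (intra, inter): the intersection sizes observed between
    two disks of the same group and of different groups."""
    intra, inter = set(), set()
    for (g1, a), (g2, b) in combinations(disk_columns(n), 2):
        (intra if g1 == g2 else inter).add(len(a & b))
    return intra, inter


for n in (3, 5, 7, 4):
    print(n, shared_counts(n))
```

For the primes 3, 5 and 7, disks of the same group share no subarray and disks of different groups share exactly one. For the non-prime value 4, some pairs of disks share two subarrays, which is the conflict the proof rules out for prime N.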
3. Other cases
The proof above shows that optimal parallel reconstruction performance requires the number of blocks N forming a subarray to be prime, and the construction of the parallel reconstruction distribution matrix shows that the number of data disks required is $N^2$. This raises the cost of the system. The invention therefore also provides a compromise that still achieves high-performance parallel reconstruction when there are not enough data disks. Two strategies are available for building the parallel reconstruction array: reducing the number of blocks that form a subarray, or reducing the number of blocks into which each disk is divided.
The strategy of reducing the number of blocks per subarray is illustrated as follows. Suppose the system has 16 data disks. Because 16 lies between $3^2$ and $5^2$, the distribution for 25 data disks can be used: 9 virtual disks are added to help build the distribution matrices, and only the first 16 columns of the 25-disk matrix distribution table are finally used as the distribution of the 16 data disks. As shown in Fig. 1, the 25 disks are divided into 5 groups, $m_0$ is initialized, and the distribution matrices $m_1$, $m_2$, $m_3$, $m_4$ are generated by cyclic shifting. There are 25 subRAIDs in the figure, but the number of blocks forming a subRAID no longer has to be prime. For example, subRAID 0 consists of the 4 blocks numbered 0, and subRAID 1 consists of the 3 blocks numbered 1. In essence, this approach realizes parallel reconstruction by reducing the number of blocks per subarray.
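The 16-disk case can be reproduced with a short sketch that builds the 25-disk table and keeps its first 16 columns. The helper names are illustrative only, and the table is generated under the assumption of cyclic right shifts with row r of group g shifted by g·r positions.

```python
from collections import Counter


def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def build_table(n):
    """Distribution table for n*n disks: n rows, one column per disk."""
    m0 = [[r * n + c for c in range(n)] for r in range(n)]
    return [
        [x for g in range(n) for x in rotate_right(m0[r], g * r)]
        for r in range(n)
    ]


def truncated_block_counts(n, disks):
    """Count the blocks of each subarray when only the first `disks`
    columns of the n*n-disk table are backed by real disks."""
    table = [row[:disks] for row in build_table(n)]
    return Counter(sub for row in table for sub in row)


counts = truncated_block_counts(5, 16)
print(len(counts))           # number of subRAIDs
print(counts[0], counts[1])  # blocks in subRAID 0 and subRAID 1
```

With these assumptions the sketch yields 25 subRAIDs, with subRAID 0 made of 4 blocks and subRAID 1 made of 3 blocks, in agreement with the description of Fig. 1.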
In general, let the number of data disks in the system be m with $m \in (a^2, b^2]$, where a and b are primes greater than 2, and let the parallel reconstruction distributions of $a^2$ and $b^2$ disks, obtained from Formula 1, be $M_a$ and $M_b$, respectively. The distribution of m disks is then formed from the first m columns of $M_b$, the distribution of $b^2$ disks. This does not suit every m, however. For m = 10, which satisfies $10 \in (3^2, 5^2]$, taking the first 10 columns of the $5^2$-disk distribution cannot form subarrays, as shown in Fig. 2, because every subarray requires a minimum number of blocks, just as a traditional RAID 5 needs at least 3 disks. To handle the special case m = 10, the invention provides a parallel reconstruction distribution of $4^2$ disks, still generated with Formula 1. To achieve conflict-free optimal parallel reconstruction, reducing the number of blocks per disk is combined here with reducing the number of blocks per subarray: the third row of the $4^2$ distribution is removed, so that each disk is divided into three blocks, and the first 10 columns of the result form the distribution for m = 10, as shown in Fig. 3. SubRAID 0, subRAID 1, subRAID 6, subRAID 7, subRAID 14 and subRAID 15 each consist of three blocks. The other subRAIDs cannot form a standard RAID structure, so their block numbers do not form subarrays. The 10 disks thus yield only 6 subarrays and space utilization is only about one half, so this construction is not recommended. The $3^2$ distribution can be used instead to solve this problem, but only for the case m = 10.
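The m = 10 case can be examined with the same kind of sketch. It assumes that the $4^2$ table is generated first and its third row is removed afterwards, so the remaining rows keep their original shift amounts; `usable_subarrays` and the `min_blocks` parameter are names chosen for this illustration.

```python
from collections import Counter


def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def build_table(n):
    """Distribution table for n*n disks: n rows, one column per disk."""
    m0 = [[r * n + c for c in range(n)] for r in range(n)]
    return [
        [x for g in range(n) for x in rotate_right(m0[r], g * r)]
        for r in range(n)
    ]


def usable_subarrays(table, disks, min_blocks=3):
    """Subarray numbers that keep at least min_blocks blocks when only
    the first `disks` columns of the table are used."""
    counts = Counter(sub for row in table for sub in row[:disks])
    return sorted(sub for sub, c in counts.items() if c >= min_blocks)


# First 10 columns of the 25-disk table: no subarray reaches 3 blocks.
print(usable_subarrays(build_table(5), 10))

# 16-disk table with its third row removed, first 10 columns kept.
table4 = build_table(4)
del table4[2]
print(usable_subarrays(table4, 10))
```

The first call returns an empty list, since every subarray is left with only two blocks. The second call returns the six subarray numbers named in the text for Fig. 3.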
Besides the commonly used RAID 2 to 6 structures, the subarrays of the invention can also be organized as RAID 10, for which the invention provides a parallel reconstruction distribution as well. This mode requires the number of data disks m to be an even number not less than 4. Unlike other common RAID levels such as RAID 5, RAID 10 imposes no strict requirement on the structure of the parallel reconstruction disk array, and its degree of parallelism depends only on the number of disks forming the array: it equals half that number. The construction of the block table for the RAID 10 distribution is described in detail below.
Take a subarray layout built from 8 disks as an example, with each subarray using RAID 10. To achieve the maximum degree of parallelism, each disk is divided into a number of blocks equal to half the total number of disks, that is, 4. All primary disks are logically placed in one group and the mirror disks in another, and the blocks are labeled.
The block distribution matrix of the primary disk group is
Figure BDA0000023910410000111
Each column of this matrix lists the blocks into which one disk is divided; the first disk, for example, is divided into the 4 blocks 0.0, 1.0, 2.0 and 3.0. In the form x.y, x is the number of the subarray and y is the number of the block within subarray x. Thus 0.0 denotes the first block of subarray 0, which can be thought of as the first disk of a traditional disk array; 0.1, 0.2 and 0.3 are analogous. The block distribution matrix of the mirror group is likewise obtained, in the manner of Formula 1, from the numbers of the blocks that form the subarrays in the primary disk group.
The distribution matrix of the primary disk group is called the original block table in this invention. If the mirror group distribution is built from the original block table matrix $m_1$ of this example by cyclic right shift, it is
Figure BDA0000023910410000121
Here x.y' denotes the mirror of block x.y. The parallel reconstruction RAID 10 distribution formed by the 8 disks is then
Figure BDA0000023910410000122
This table allows parallel data reconstruction at 4 times the speed. The RAID 10 structure consists of four subarrays, 0 to 3, with a stripe span of 4, and reconstruction accesses to the disks do not conflict. The disks can also be divided into fewer than 4 blocks; the block table is then obtained by selecting the corresponding rows of the 4x block table to form a new block distribution table, but the parallel reconstruction speed decreases accordingly. In general, for a parallel reconstruction array of N disks (N an even number not less than 4), the RAID 10 block distribution tables take the following form.
Initial table
Figure BDA0000023910410000123
Mirror table
Figure BDA0000023910410000131
$P_0$ denotes the first row of the initial table, and $P_1, \ldots, P_{(N/2)-1}$ are defined analogously;
Figure BDA0000023910410000132
denotes a shift function that cyclically shifts row (i+1) of the initial table right by n positions; r can be replaced with l to denote a cyclic left shift.
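As an illustration of the RAID 10 tables, the sketch below builds a primary table labeled x.y and a mirror table whose row x is the primary row shifted right by x positions. These layout details follow the textual description; the function names and the numbering of mirror disks after the primary disks (half, ..., N-1) are assumptions made only for this example.

```python
def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def raid10_tables(n_disks):
    """Return (primary, mirror) block tables for an n_disks RAID 10 layout."""
    if n_disks < 4 or n_disks % 2:
        raise ValueError("need an even number of disks, at least 4")
    half = n_disks // 2
    primary = [[f"{x}.{y}" for y in range(half)] for x in range(half)]
    mirror = [
        rotate_right([label + "'" for label in primary[x]], x)
        for x in range(half)
    ]
    return primary, mirror


def mirror_disks_for(n_disks, failed):
    """For a failed primary disk, map each lost block to the index of the
    mirror disk holding its copy (mirror disks numbered half..n_disks-1)."""
    half = n_disks // 2
    primary, mirror = raid10_tables(n_disks)
    return {
        primary[x][failed]: half + mirror[x].index(primary[x][failed] + "'")
        for x in range(half)
    }


primary, mirror = raid10_tables(8)
for p_row, m_row in zip(primary, mirror):
    print(p_row + m_row)
print(mirror_disks_for(8, 2))
```

For 8 disks, the four blocks of any failed primary disk have their mirrors on four different mirror disks, so the four copies can be read in parallel, which corresponds to the 4x reconstruction speed stated above.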
4. Implementation
(1) The system first selects a suitable parallel reconstruction mode according to the available data disks; RAID 10 or RAID 2 to 6 can be used. This implementation uses RAID 5 on 9 disks. As shown in Fig. 4, the block numbers of the 9 disks are distributed according to Formula 2. Each disk is divided into three blocks, every three blocks form a subarray, and there are 9 subarrays in total. A label of the form x.y denotes block y of subarray x. For example, subRAID 6 consists of blocks 6.0, 6.1 and 6.2 on disks D0, D5 and D7, and subRAID 7 and subRAID 8 likewise consist of three blocks each. The subarrays (subRAIDs) here use the data organization of standard RAID 5, but subarrays of the invention can also use other data organizations.
(2) The logical mapping table of all these RAIDs is shown in Fig. 5, which contains the mapping information of the 9 RAIDs. Assume the mapping table of each RAID has K data units. To balance the data distribution, the data units are laid out as shown in the figure, and each data unit stores the disk address information of its RAID. This mapping keeps the load on the disks balanced.
(3) Once the parallel reconstruction RAID is configured, its behavior in normal mode differs from that of a traditional RAID. As the mapping in the previous step shows, the stripe span of the parallel reconstruction layout is narrower than that of a traditional RAID. This structure improves read and write performance, particularly when the RAID is in degraded or reconstruction mode. The request handling itself is the same as in a traditional RAID and is not described in detail.
(4) when disk failure occurring among the RAID, system will enter degraded mode, to carry out the data reconstruction operation subsequently, system will read corresponding disk according to the logical mappings relation and be reconstructed, because the parallel reconstruction that adopts the present invention to propose distributes, make reconstruction progress not have read/write conflict to the data reconstruction operation of failed disk.RAID 5 structures that this process will be formed with 9 disks are that example describes, D0~D9 is the numbering of these 9 disks, SD0~SD2 is the required backup diskette of system, as shown in Figure 6, the numbering of similar a.b among the figure, a represents the label of subRAID, because 9 RAID are arranged, here we are referred to as subRAID a, and b represents (b+1) individual Logical Disk of sub-RAID.Disk D4 lost efficacy among the figure, and the inefficacy that this will cause subRAID 1, subRAID 3, subRAID 6 relates separately to block 1.1,3.1 and 8.1.Reconstruct to block 1.1, system is with 1.2 block data information on 1.0 blocks on the reading disk D1 and the disk D7, to the result be write back to standby dish SD1 by xor operation then, till this process lasts till that always the data of whole block 1.1 are all finished by reconstruct, meanwhile, two other reconstruction progress is the reconstruct of finishing respectively block 3.1 and 8.1, and reconstruct is write back to standby dish SD0 and SD2 respectively.
(5) After reconstruction completes, the three disks SD0 to SD2 hold the data of disk D4. During idle periods the system copies the data on the three spare disks back to the disk that replaces D4 and updates the related mapping information.
(6) The invention also addresses the problem that this parallel reconstruction structure needs many spare disks. Note that disks D3 and D5 in Fig. 6 do not participate in the reconstruction. An additional block can therefore be set aside on each parallel reconstruction disk to store reconstructed data, as shown in Fig. 7. The reconstructed data of block 1.1 can then be written to disk D3, that of block 3.1 to disk D5, and that of block 8.1 to spare disk SD0. This distribution still achieves parallel reconstruction while using two fewer spare disks.
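The choice of write targets in this optimization can be sketched as below. The sketch finds the data disks that no reconstruction process reads from and uses them before any spare disk. `assign_targets` is a hypothetical helper, and the policy of filling idle disks in ascending order is an assumption chosen to mirror the example of Fig. 7.

```python
def rotate_right(row, k):
    """Cyclically shift a list right by k positions."""
    k %= len(row)
    return row[-k:] + row[:-k] if k else list(row)


def build_table(n):
    """Distribution table for n*n disks: n rows, one column per disk."""
    m0 = [[r * n + c for c in range(n)] for r in range(n)]
    return [
        [x for g in range(n) for x in rotate_right(m0[r], g * r)]
        for r in range(n)
    ]


def rebuild_plan(table, failed):
    """Map each affected subarray to the surviving disks it must read."""
    plan = {}
    for row in table:
        sub = row[failed]
        plan[sub] = [d for d, s in enumerate(row) if s == sub and d != failed]
    return plan


def assign_targets(table, failed, spares):
    """Pick a write target per affected subarray: idle data disks first
    (each is assumed to reserve one free block), then spare disks."""
    plan = rebuild_plan(table, failed)
    busy = {d for sources in plan.values() for d in sources} | {failed}
    idle = [d for d in range(len(table[0])) if d not in busy]
    targets = [f"D{d}" for d in idle] + list(spares)
    # zip stops at the shorter sequence, so surplus targets stay unused
    return dict(zip(plan, targets))


print(assign_targets(build_table(3), failed=4, spares=["SD0", "SD1", "SD2"]))
```

For a failure of D4 the idle disks are D3 and D5, so the blocks of subRAID 1 and subRAID 3 are written there and only the block of subRAID 8 needs spare disk SD0.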
(7) The sub-RAIDs of this parallel reconstruction scheme can use common levels such as RAID 1, RAID 5 and RAID 6. For RAID 10, Fig. 8 shows the data distribution for parallel reconstruction, which is likewise produced by shift operations. The figure shows the reconstruction of disk D2; the procedure is similar to that of RAID 5.
(8) The performance of the parallel reconstruction of the invention is assessed with two metrics: reconstruction time and average user response time. Fig. 9 shows the structure of the test setup, which uses a client-server arrangement connected over iSCSI. The invention is mainly applied in the parallel RAID reconstruction module in the figure. Three standard test traces were used (Fin1, Fin2, Web), and the baseline for comparison is MD (Multiple Device), the software RAID commonly used on Linux.
(9) Fig. 10 shows the reconstruction time test. Compared with the RAID 5 built on standard MD, the reconstruction time of parallel reconstruction RAID 5 is several times shorter for every test trace. As for average user response time, the results in Fig. 11 show that under the same load the average user response time of parallel reconstruction RAID 5 is also reduced to varying degrees.
Cyclic shifting as used in the invention is only one preferred way of building the subarrays; a person skilled in the art may build them in other ways, provided the subarray construction requirements are met.

Claims (4)

1. A parallel data reconstruction method for RAID, involving two or more data disks and at least one spare disk, wherein each data disk is divided into the same number of blocks and the blocks are used to build subarrays, such that the blocks of any one data disk belong to different subarrays and each block belongs to only one subarray;
the data of a single data disk is reconstructed in the following manner: for each block of that data disk, the other blocks that form the same subarray with it are located, for all blocks simultaneously; the data of the block is reconstructed from the data of those other blocks; and the reconstruction result is saved to a spare disk or to a data disk that does not take part in this reconstruction.
2. The parallel data reconstruction method according to claim 1, characterized in that, for any two subarrays, at most one block of one subarray and one block of the other subarray belong to the same data disk.
3. The parallel data reconstruction method according to claim 1 or 2, characterized in that the data disks use a RAID 1 to RAID 6 data storage level, and the subarrays are constructed in the following manner:
(11) the data disks are divided into c groups, and the blocks of the data disks m_0, m_1, ..., m_{c-1} of the 1st to c-th groups respectively form the 1st, 2nd, ..., c-th group block distribution matrices; if the data disks cannot be divided evenly into c groups, the equal-division requirement is met by adding virtual disks divided into blocks in the same way as the data disks, or by removing data disks; c is greater than 2;
(12) the numbering of each block in the first block distribution matrix is initialized; the elements of row j of the (i+1)-th group distribution matrix are obtained by cyclically shifting the elements of row j of the i-th group matrix right or left by j-1 positions, where i=1, ..., c-1, j=1, ..., n, and n is the number of blocks into which each data disk is divided;
(13) the blocks having the same number on each data disk are selected to form a subarray.
4. The parallel data reconstruction method according to claim 1 or 2, characterized in that the data disks use the RAID 10 data storage level, the number of data disks is an even number not less than 4, and the subarrays are constructed in the following manner:
(21) the data disks are divided into two groups, and the blocks of the data disks m_0, m_1 of the first and second groups respectively form the first and second group block distribution matrices;
(22) each block in row x of the first group block distribution matrix is given an initial number of the form x.y, where x=0, 1, ..., m/2-1 and y=0, 1, ..., m/2-1; the block numbers in row x of the second group block distribution matrix are obtained by cyclically shifting the block numbers in row x of the first group distribution matrix right or left by x positions;
(23) the blocks having the same number are selected from the data disks to form a subarray.
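
The cyclic shift construction in claims 3 and 4 can be illustrated with a short Python sketch. It is one reading of the claims, not the patent's own implementation, and rests on assumptions: all groups hold the same number of disks, each block is labelled `(row, y)` in the spirit of the x.y numbering, shifts are to the right, and the names `build_layout`, `subarrays` and `satisfies_claims` are hypothetical. The check covers only the two properties stated in claims 1 and 2.

```python
from itertools import combinations

def build_layout(groups, disks_per_group, blocks_per_disk):
    """Return layout[disk][row] = subarray label (row, y), built by cyclic shifts."""
    layout = []
    for i in range(groups):
        for k in range(disks_per_group):
            # Row r of group i is row r of group 0 rotated right by i * r positions,
            # i.e. by r positions relative to the previous group (0-based rows).
            layout.append([(r, (k - i * r) % disks_per_group)
                           for r in range(blocks_per_disk)])
    return layout

def subarrays(layout):
    """Map each label to the list of disks holding a block of that subarray."""
    members = {}
    for disk, labels in enumerate(layout):
        for label in labels:
            members.setdefault(label, []).append(disk)
    return members

def satisfies_claims(layout):
    """Blocks of one disk lie in different subarrays (claim 1), and any two
    subarrays have blocks on at most one common disk (claim 2)."""
    if any(len(set(labels)) != len(labels) for labels in layout):
        return False
    disk_sets = subarrays(layout).values()
    return all(len(set(a) & set(b)) <= 1 for a, b in combinations(disk_sets, 2))

# Claim 3 style example: 3 groups of 3 disks, 3 blocks per disk.
layout = build_layout(groups=3, disks_per_group=3, blocks_per_disk=3)
print(satisfies_claims(layout))
```

Under this reading, the RAID 10 case of claim 4 corresponds to `build_layout(2, m // 2, m // 2)` for `m` data disks. The check is worth running for each parameter choice, because in this sketch not every combination passes: 3 groups of 4 disks with 3 blocks per disk, for instance, yields two subarrays that meet on two disks.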

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010241282 CN101923496A (en) 2010-07-30 2010-07-30 Method for achieving parallel reconstruction of data on RAID

Publications (1)

Publication Number Publication Date
CN101923496A (en) 2010-12-22

Family

ID=43338450

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010241282 Pending CN101923496A (en) 2010-07-30 2010-07-30 Method for achieving parallel reconstruction of data on RAID

Country Status (1)

Country Link
CN (1) CN101923496A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5872906A (en) * 1993-10-14 1999-02-16 Fujitsu Limited Method and apparatus for taking countermeasure for failure of disk array
US20050102470A1 (en) * 2003-11-11 2005-05-12 Nec Corporation Disk array device
CN101276302A (en) * 2007-03-29 2008-10-01 中国科学院计算技术研究所 Magnetic disc fault processing and data restructuring method in magnetic disc array system
CN101436149A (en) * 2008-12-19 2009-05-20 华中科技大学 Method for rebuilding data of magnetic disk array

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102053801A (en) * 2010-12-29 2011-05-11 成都市华为赛门铁克科技有限公司 Disk hot standby method and device as well as storage system
CN102419697A (en) * 2011-11-02 2012-04-18 华中科技大学 Method for reconstructing single disk in vertical redundant array of independent disks (RAID)-6 coding
CN102419697B (en) * 2011-11-02 2013-12-18 华中科技大学 Method for reconstructing single disk in vertical redundant array of independent disks (RAID)-6 coding
CN102609223B (en) * 2012-02-13 2015-06-24 浪潮(北京)电子信息产业有限公司 Redundant array of independent disk system and initializing method thereof
CN102609223A (en) * 2012-02-13 2012-07-25 浪潮(北京)电子信息产业有限公司 Redundant array of independent disk system and initializing method thereof
US9389790B2 (en) 2013-12-25 2016-07-12 Huawei Technologies Co., Ltd. Method and apparatus for determining redundant array of independent disks
CN103699343A (en) * 2013-12-25 2014-04-02 华为技术有限公司 RAID (redundant arrays of independent disks) determination method and device
CN104142872A (en) * 2014-08-13 2014-11-12 浪潮电子信息产业股份有限公司 Quick reestablishment method for RAID (redundan array of inexpensive disk)
CN105138468A (en) * 2015-08-19 2015-12-09 浙江宇视科技有限公司 Method and device for improving reconstruction performance of virtual disk group
CN105138468B (en) * 2015-08-19 2018-11-13 浙江宇视科技有限公司 A kind of method and apparatus improving virtual disk group reconstruction property
CN106919340A (en) * 2015-12-28 2017-07-04 南宁富桂精密工业有限公司 Improve the system and method for RAID reading performances
CN106919340B (en) * 2015-12-28 2020-03-17 南宁富桂精密工业有限公司 System and method for improving RAID reading performance
CN106201782A (en) * 2016-07-12 2016-12-07 乐视控股(北京)有限公司 Date storage method and system
CN107870973A (en) * 2017-09-20 2018-04-03 广东东为信息技术有限公司 A kind of document storage system accelerated multi-path monitoring and played back simultaneously
CN108519926A (en) * 2018-03-31 2018-09-11 北京联想核芯科技有限公司 A kind of adaptive RAID grouping computational methods and device
CN108519926B (en) * 2018-03-31 2020-12-29 深圳忆联信息系统有限公司 Self-adaptive RAID (redundant array of independent disks) group calculation method and device
CN109189338A (en) * 2018-08-27 2019-01-11 郑州云海信息技术有限公司 A kind of method, system and the equipment of HotSpare disk addition
CN109189338B (en) * 2018-08-27 2021-06-18 郑州云海信息技术有限公司 Method, system and equipment for adding hot spare disk

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20101222