WO2015058542A1 - Reconstruction method and device for redundant array of independent disks - Google Patents

Reconstruction method and device for redundant array of independent disks

Info

Publication number
WO2015058542A1
WO2015058542A1 PCT/CN2014/080499 CN2014080499W WO2015058542A1 WO 2015058542 A1 WO2015058542 A1 WO 2015058542A1 CN 2014080499 W CN2014080499 W CN 2014080499W WO 2015058542 A1 WO2015058542 A1 WO 2015058542A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
disk
raid
degraded
written
Prior art date
Application number
PCT/CN2014/080499
Other languages
French (fr)
Chinese (zh)
Inventor
梁永贵
彭书锋
Original Assignee
华为技术有限公司
Priority date
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2015058542A1

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08: Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1076: Parity data used in redundant arrays of independent storages, e.g. in RAID systems
    • G06F11/1088: Reconstruction on already foreseen single or plurality of spare disks
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08: Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1076: Parity data used in redundant arrays of independent storages, e.g. in RAID systems
    • G06F11/1084: Degraded mode, e.g. caused by single or multiple storage removals or disk failures

Definitions

  • TECHNICAL FIELD Embodiments of the present invention relate to communication technologies, and in particular, to a method and an apparatus for reconstructing an independent redundant disk array.
  • a storage system usually consists of storage devices such as storage servers and has its own interfaces and protocols. It is connected to the host computer through coaxial cable, network cable, or optical fiber, and serves as the data storage center that provides storage services to the host computer.
  • Redundant Array of Independent Disks (RAID) is a method of storing the same data in different places on multiple hard disks.
  • Figure 1 is a schematic diagram of the structure of RAID 5. As shown in Figure 1, four disks form a RAID 5. All the disks are divided into strips of the same logical block address (LBA) length; this LBA length is called the stripe depth. The strips at the same position on each disk in the disk group together form one stripe of the RAID group. As shown in Figure 1, D0, D1, D2, and P1 form one RAID stripe. One strip in each stripe of the RAID group is used as the check area, and the other strips are used as data areas.
  • LBA: logical block address
  • D0, D1, and D2 are the data areas
  • P1 is the check area.
  • the data sent by the upper-layer application is stored in the data areas, an XOR operation is performed across all the data-area strips in a stripe, and the result is recorded as the check-area data of that stripe, that is, P1 = D0 xor D1 xor D2.
  • the data in the failed disk can be obtained by XORing all the data of the other disks.
  • the embodiment of the invention provides a method and a device for reconstructing an independent redundant disk array, which are used to improve RAID reconstruction efficiency, reduce the I/O pressure on the member disks other than the failed disk in the RAID group, and reduce the impact on normal service I/O.
  • a first aspect of the present invention provides a method for reconfiguring an independent redundant disk array, including: performing a copy process of data on a failed disk in an independent redundant disk array RAID to a spare disk in the RAID;
  • the degraded data is written to the spare disk, and the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
  • before the copy processing is performed, power to the failed disk is stopped and then supplied again.
  • before the degraded data is written to the spare disk, the method further includes: recording the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
  • Writing the degraded data to the spare disk includes:
  • the degraded data in the memory is written to the spare disk in the order of the number of the stripe and the time the degraded data is written.
  • recording the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written, includes:
  • the number of the first strip and the first data are recorded in the memory.
  • in a fourth possible implementation manner of the first aspect, if it is determined during the copy processing that there is a bad track in a second strip of the failed disk, second data is read from the strips that, on the member disks other than the failed disk in the RAID, form a stripe with the second strip;
  • the third data is written to the spare disk.
  • a second aspect of the present invention provides a reconstruction apparatus for an independent redundant disk array, including: a processing module, configured to perform copy processing of the data on a failed disk in an independent redundant disk array RAID to a spare disk in the RAID;
  • a write module configured to write the degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
  • the processing module is further configured to: before performing the copy processing of the data on the failed disk in the independent redundant disk array RAID to the spare disk in the RAID, stop the power supply to the failed disk and then supply power again.
  • the processing module is further configured to: before the degraded data is written to the spare disk, record the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
  • the writing module is further configured to:
  • the degraded data in the memory is written to the spare disk in the order of the number of the stripe and the time the degraded data is written.
  • the processing module is further configured to:
  • the processing module is further configured to: in the process of performing the copy processing, if it is determined that there is a bad track in a second strip of the failed disk, read second data from the strips that, on the member disks other than the failed disk in the RAID, form a stripe with the second strip;
  • the third data is written to the spare disk.
  • with the method and device for reconstructing the independent redundant disk array provided by the embodiments of the present invention, copy processing of the data on the failed disk in the RAID to the spare disk in the RAID is performed, and the degraded data is then written to the spare disk, where the degraded data includes the data that needs to be written to the failed disk after the RAID is degraded. This can effectively improve the data acquisition rate and thus the data reconstruction efficiency. During data reconstruction, reading the full data area of the member disks other than the failed disk is avoided, which reduces the I/O pressure on the other member disks and the impact on normal service I/O, and thereby reduces the overhead of the RAID group system. It also avoids the case in which bad sectors on other member disks make data unreconstructable, which ensures the validity and integrity of the reconstructed data and reduces the impact on business applications.
  • Figure 1 is a schematic structural view of a RAID 5
  • FIG. 2 is a flowchart of Embodiment 1 of a method for reconstructing an independent redundant disk array according to the present invention
  • FIG. 3 is a flowchart of Embodiment 2 of a method for reconstructing an independent redundant disk array according to the present invention
  • FIG. 4B is a flowchart of recording degraded data in Embodiment 2 of a method for reconstructing an independent redundant disk array according to the present invention
  • FIG. 5 is a flowchart of Embodiment 3 of a method for reconstructing an independent redundant disk array according to the present invention
  • FIG. 6 is a schematic structural diagram of Embodiment 1 of a device for reconstructing an independent redundant disk array according to the present invention
  • FIG. 7 is a schematic structural diagram of Embodiment 2 of a device for reconstructing an independent redundant disk array according to the present invention.
  • DETAILED DESCRIPTION
  • FIG. 2 is a flowchart of Embodiment 1 of a method for reconstructing an independent redundant disk array according to the present invention. As shown in FIG. 2, the method for reconstructing an independent redundant disk array in this embodiment includes:
  • the execution body of this embodiment may be a reconstruction device of an independent redundant disk array, and the reconstruction device of the independent redundant disk array may be included in the controller of the RAID group.
  • the scenario in this embodiment may be that the disk in the RAID group fails, and the data in the fault disk is reconstructed.
  • the method may further include:
  • the faulty disk can be used again, but the health state of the disk is seriously degraded, and the probability of recurring the fault is extremely high.
  • copy processing of the data on the failed disk in the RAID to the spare disk in the RAID is performed.
  • the degraded data is written to the spare disk, and the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
  • the reconstruction device of the independent redundant disk array records the RAID group as being in a degraded state after a disk in the RAID group fails. After the RAID group is degraded, it can still receive data as it does in the normal state. New data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed is written to the spare disk, so that the data in the spare disk is brought to the latest state.
  • the reconstruction method of the independent redundant disk array of the embodiment can be applied to other levels of RAID, such as RAID6 and RAID50, in addition to the RAID5.
  • in this embodiment, the data on the failed disk in the RAID is copied to the spare disk in the RAID, and the degraded data is then written to the spare disk.
  • the degraded data includes the data that needs to be written to the failed disk after the RAID is degraded.
  • compared with reconstructing the data of the failed disk by XORing all the data of the disks other than the failed disk, this can effectively improve the data acquisition rate and the data reconstruction efficiency, and it avoids reading the full data area of the member disks other than the failed disk.
  • this reduces the I/O pressure on the other member disks and the impact on normal service I/O, thereby reducing the overhead of the RAID group system.
  • if bad sectors appear on a member disk other than the failed disk during XOR-based data reconstruction, valid data cannot be reconstructed and the bad sectors of the disk can only be marked. Even if the RAID group eventually completes reconstruction, the data in the marked bad sectors is invalid. If the marked bad-sector area holds key file system data, the operating system reports that the file system is damaged and the service is interrupted.
  • this embodiment directly copies the data on the failed disk, which avoids the case in which bad sectors on other member disks make data unreconstructable during reconstruction. This ensures the validity and integrity of the reconstructed data and reduces the impact on the service application.
  • in summary, the method of this embodiment performs copy processing of the data on the failed disk in the RAID to the spare disk in the RAID and then writes the degraded data to the spare disk, which improves the data acquisition rate and the data reconstruction efficiency, reduces the I/O pressure on the other member disks and the overhead of the RAID group system, and avoids data being unreconstructable because of bad sectors on other member disks.
  • FIG. 3 is a flowchart of Embodiment 2 of a method for reconstructing an independent redundant disk array according to the present invention. As shown in FIG. 3, the method of this embodiment, in which the degraded data is recorded in the memory of the RAID controller, may include:
  • S203: Record the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written.
  • the amount of data change on a single disk in the RAID group is limited, so the data that needs to be written to the failed disk can be recorded as a bitmap after the failure of the RAID group and before the copy processing is completed.
  • the degraded data recorded in the memory is written to the spare disk.
  • recording the degraded data in the memory of the controller in the RAID is one possible implementation. Other recording locations for the degraded data, such as dedicated memory, can also be used to implement this embodiment, which is not limited here.
  • FIG. 4A is a schematic diagram of degraded data recorded in Embodiment 2 of a method for reconfiguring an independent redundant disk array according to the present invention.
  • a record of a faulty disk is recorded in units of a stripe number of the failed disk.
  • the physical start address (P) and data length (LEN) of the data are recorded in chronological order in the form of a linked list for multiple data modifications on the same stripe.
  • FIG. 4B is a flowchart of recording the degradation data in the second embodiment of the method for reconfiguring the independent redundant disk array according to the present invention. As shown in FIG. 4B, the process of recording the degradation data in this embodiment may include:
  • the first data is recorded to a member disk other than the faulty disk. Specifically, if degraded data of the first strip is already recorded in the memory, the first data is recorded after that degraded data, in the chronological order of data writing. Otherwise, it is determined whether the copy processing is completed. If it is, the first data is written directly to the spare disk and does not need to be recorded in the memory. If it is not, there is no record for the first strip in the memory, and the number of the first strip and the first data are recorded in the memory.
  • the degraded data in the memory needs to be written to the spare disk in the order of the stripe number and the degraded data write time.
  • in the method for reconstructing the independent redundant disk array of this embodiment, copy processing of the data on the failed disk in the RAID to the spare disk in the RAID is performed; the degraded data is recorded in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which it is written; and the degraded data in the memory is written to the spare disk in order of strip number and write time. This can effectively improve the data acquisition rate and the data reconstruction efficiency. It avoids reading the full data area of the member disks other than the failed disk during data reconstruction, which reduces the I/O pressure on the other member disks and the impact on normal service I/O, and thereby the overhead of the RAID group system. It also avoids the case in which bad sectors on other member disks make data unreconstructable, which ensures the validity and integrity of the reconstructed data and reduces the impact on the business application.
  • FIG. 5 is a flowchart of Embodiment 3 of a method for reconstructing an independent redundant disk array according to the present invention. As shown in FIG. 5, this embodiment describes the scenario in which a strip in the failed disk is found to be faulty during the reconstruction process, and includes:
  • the copy processing is completed. Specifically, after the copy processing is completed, the state of the RAID group can be recorded as a state of completion of reconstruction.
  • the RAID group status can be adjusted to the reconfiguration completion state, and the alarm is reported to the customer to replace the failed disk.
  • in the method for reconstructing the independent redundant disk array of this embodiment, copy processing of the data on the failed disk in the RAID to the spare disk in the RAID is performed. If it is determined during the copy processing that there is a bad track in a second strip of the failed disk, second data is read from the strips that, on the member disks other than the failed disk in the RAID, form a stripe with the second strip; an XOR operation is performed between the second data to obtain third data; and the third data is written to the spare disk. This can effectively improve the data acquisition rate and the data reconstruction efficiency. During data reconstruction, reading the full data area of the member disks other than the failed disk is avoided, which reduces the I/O pressure on the other member disks and the impact on normal service I/O, and thereby the overhead of the RAID group system. It also avoids the case in which bad sectors on other member disks make data unreconstructable, which ensures the validity and integrity of the reconstructed data and reduces the impact on business applications.
  • FIG. 6 is a schematic structural diagram of Embodiment 1 of an apparatus for reconstructing an independent redundant disk array according to the present invention.
  • the apparatus for reconstructing an independent redundant disk array of the present embodiment may include a processing module 61 and a writing module 62. The processing module 61 is configured to perform the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID; the writing module 62 is configured to write the degraded data to the spare disk, where the degraded data includes the data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
  • the processing module 61 is further configured to stop the power supply to the failed disk and then supply power again before performing the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
  • the device in this embodiment can be used to implement the technical solution of the method embodiment shown in FIG. 2, and the principle and the technical effect are similar, and details are not described herein again.
  • the processing module 61 is further configured to: before the degraded data is written to the spare disk, record the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
  • the writing module 62 is further configured to write the degraded data in the memory to the spare disk in order of strip number and of the time at which the degraded data is written.
  • the processing module 61 is further configured to:
  • the number of the first strip and the first data are recorded in the memory.
  • the device in this embodiment can be used to implement the technical solution of the method embodiment shown in FIG. 3, and the principle and the technical effect are similar, and details are not described herein again.
  • the processing module 61 is further configured to: in the process of performing the copy processing, if it is determined that there is a bad track in a second strip of the failed disk, read second data from the strips that, on the member disks other than the failed disk in the RAID, form a stripe with the second strip;
  • the device in this embodiment can be used to implement the technical solution of the method embodiment shown in FIG. 5.
  • the principle and the technical effect are similar, and details are not described herein again.
  • FIG. 7 is a schematic structural diagram of Embodiment 2 of an apparatus for reconfiguring an independent redundant disk array according to the present invention.
  • the apparatus for reconstructing an independent redundant disk array of the present embodiment may include: at least one bus 71, at least one processor 72 connected to the bus 71, and at least one memory 73 connected to the bus 71, wherein the processor 72 calls the code stored in the memory 73 via the bus 71 for:
  • the degraded data is written to the spare disk, and the downgraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
  • the device of this embodiment may be used to perform any one of the technical solutions of the method embodiment shown in FIG. 2, FIG. 3 or FIG. 5, and the implementation principle and technical effects are similar, and details are not described herein again. It should be noted that the corresponding structural diagrams of the foregoing embodiments are merely illustrative, and the connection relationship of each part or module is not limited to the form shown in the figure, and may be subject to the actual application.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the units is only a logical function division.
  • in actual implementation there may be another division manner; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical, mechanical or otherwise.
  • the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium.
  • the software functional unit is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to perform the method of various embodiments of the present invention.
  • the foregoing storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
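The recording flow that the Embodiment 2 items above describe for FIG. 4B (append to an existing record, write directly once the copy has finished, or start a new record) can be sketched as follows. This is only an illustrative sketch: the function name `handle_write`, the dict-based `log` and `spare`, and the returned branch labels are assumptions made for the example and do not appear in the source.

```python
def handle_write(strip_no, data, log, copy_done, spare):
    """Route one write aimed at the failed disk; return the branch taken.

    log:   dict mapping strip number -> list of writes in arrival order
           (stands in for the controller memory; hypothetical layout)
    spare: dict mapping strip number -> latest data on the spare disk
    """
    if strip_no in log:
        # Degraded data for this strip is already recorded: append in time order.
        log[strip_no].append(data)
        return "appended"
    if copy_done:
        # Copy processing has completed: write straight to the spare disk.
        spare[strip_no] = data
        return "direct"
    # No record yet and the copy is still running: record strip number and data.
    log[strip_no] = [data]
    return "new-record"
```

Checking for an existing record before checking whether the copy has finished follows the order given in the text, so later writes to a strip always queue behind earlier ones that have not yet been replayed.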

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Techniques For Improving Reliability Of Storages (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

A reconstruction method and device for a redundant array of independent disks. The method comprises: copying data on a failed disk in a redundant array of independent disks (RAID) to a spare disk in the RAID; and writing degraded data to the spare disk, the degraded data comprising data that needs to be written to the failed disk after the RAID is degraded. The data acquisition rate can be effectively improved, thereby improving the data reconstruction efficiency. During data reconstruction, avoiding reading all the data of the member disks other than the failed disk reduces the I/O pressure on those member disks, so that the influence on normal service I/O is reduced, and thus the overhead of the RAID group system is reduced.
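The I/O saving claimed in the abstract can be made concrete with simple arithmetic. The sketch below is a rough model under stated assumptions: every disk has the same capacity, an XOR-based rebuild reads every surviving member disk in full, and a copy-based rebuild reads the surviving disks only for strips of the failed disk that turn out to be unreadable. The function name and parameters are hypothetical and the figures in the usage note are illustrative, not measurements.

```python
def surviving_disk_reads(num_disks, disk_bytes, unreadable_bytes=0):
    """Bytes read from surviving member disks for each rebuild style.

    Returns (xor_rebuild_bytes, copy_rebuild_bytes).
    unreadable_bytes is the amount of failed-disk data that cannot be
    copied and must instead be rebuilt from the surviving disks.
    """
    surviving = num_disks - 1
    xor_rebuild = surviving * disk_bytes          # read every survivor in full
    copy_rebuild = surviving * unreadable_bytes   # read survivors only for bad strips
    return xor_rebuild, copy_rebuild
```

For example, with four disks of 1000 units each and 10 units unreadable on the failed disk, the model gives 3000 units read from the surviving disks for an XOR rebuild and 30 units for a copy-based rebuild.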

Description

Method and device for reconstructing independent redundant disk array
This application claims priority to Chinese Patent Application No. 201310496672.1, filed with the Chinese Patent Office on October 21, 2013 and entitled "Method and device for reconstructing independent redundant disk array", which is incorporated herein by reference in its entirety.
TECHNICAL FIELD
Embodiments of the present invention relate to communication technologies, and in particular, to a method and an apparatus for reconstructing an independent redundant disk array.
BACKGROUND OF THE INVENTION
With the rapid development of computer and network technology, the memory built into a computer increasingly struggles to meet requirements such as fast access, large storage capacity, and low cost, so relatively independent storage systems have been developed. A storage system usually consists of storage devices such as storage servers and has its own interfaces and protocols. It is connected to the host computer through coaxial cable, network cable, or optical fiber, and serves as the data storage center that provides storage services to the host computer.
Redundant Array of Independent Disks (RAID) is a method of storing the same data in different places on multiple hard disks. FIG. 1 is a schematic structural diagram of RAID 5. As shown in FIG. 1, four disks form a RAID 5. All the disks are divided into strips of the same logical block address (LBA) length, and this LBA length is called the stripe depth. The strips at the same position on each disk in the disk group together form one stripe of the RAID group; in FIG. 1, D0, D1, D2, and P1 form one RAID stripe. One strip in each stripe is used as the check area and the other strips as data areas; in FIG. 1, D0, D1, and D2 are data areas and P1 is the check area. Data issued by upper-layer applications is stored in the data areas, an XOR operation is performed across all the data-area strips in a stripe, and the result is recorded as the check-area data of that stripe, that is, P1 = D0 xor D1 xor D2. When any one of the four disks fails, the data of the failed disk can be obtained by reading the data of the other disks and XORing all of it. For example, when the first disk fails, its data is D0 = D1 xor D2 xor P1 and D3 = D5 xor D4 xor P2. This process is called RAID reconstruction. The reconstructed data is then written to another empty disk, which ensures that no data is lost.
However, as storage technology develops, the capacity of a single disk keeps growing while single-disk performance improves little, so reconstruction takes longer and longer. During reconstruction, the other member disks in the RAID group must continue to carry normal service input/output (I/O) and also bear the additional reconstruction I/O, which puts the disks under heavy pressure and leads to a higher disk failure rate.
SUMMARY OF THE INVENTION
Embodiments of the present invention provide a method and a device for reconstructing an independent redundant disk array, which are used to improve RAID reconstruction efficiency, reduce the I/O pressure on the member disks other than the failed disk in the RAID group, and reduce the impact on normal service I/O.
A first aspect of the present invention provides a method for reconstructing an independent redundant disk array, including:
performing copy processing of the data on a failed disk in an independent redundant disk array RAID to a spare disk in the RAID;
writing degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
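The two steps of the first aspect can be sketched as a small in-memory simulation. This is a minimal sketch under simplifying assumptions that are not in the source: disks are modeled as Python lists of whole strips, degraded writes replace a whole strip, and the names `rebuild_by_copy` and `degraded_writes` are hypothetical.

```python
def rebuild_by_copy(failed_disk, degraded_writes):
    """Build the spare disk contents by copy, then apply degraded data.

    failed_disk:     list of strips (bytes), still readable after the failure
    degraded_writes: list of (strip_no, data) pairs, in arrival order, that
                     were destined for the failed disk while the RAID was degraded
    """
    # Step 1: copy processing, strip by strip, from the failed disk to the spare.
    spare = list(failed_disk)
    # Step 2: write the degraded data so the spare reflects the latest state.
    for strip_no, data in degraded_writes:
        spare[strip_no] = data
    return spare
```

Applying the degraded writes after the copy matters: a strip copied from the failed disk may be stale if the host wrote to it after the RAID degraded, and replaying in arrival order leaves the most recent write in place.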
In a first possible implementation manner of the first aspect, before performing the copy processing of the data on the failed disk in the independent redundant disk array RAID to the spare disk in the RAID, the method further includes:
stopping the power supply to the failed disk and then supplying power again.
With reference to the first aspect or the first possible implementation manner of the first aspect, in a second possible implementation manner of the first aspect, before the degraded data is written to the spare disk, the method further includes:
recording the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
and writing the degraded data to the spare disk includes:
writing the degraded data in the memory to the spare disk in order of the strip number and of the time at which the degraded data is written.
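One way to picture the second implementation manner is a per-strip log kept in controller memory and replayed in strip-number order, with the writes inside each strip applied in arrival order. The sketch below is an assumption-laden illustration: the class name `DegradedLog`, the `(offset, data)` record (a stand-in for the start address P and length LEN that the document mentions for FIG. 4A), and the use of `bytearray` strips are all hypothetical choices.

```python
from collections import defaultdict


class DegradedLog:
    """Per-strip record of writes that arrived while the RAID was degraded."""

    def __init__(self):
        # strip number -> list of (offset, data), kept in arrival order
        self._by_strip = defaultdict(list)

    def record(self, strip_no, offset, data):
        """Append one write to the record of its strip."""
        self._by_strip[strip_no].append((offset, data))

    def replay(self, spare):
        """Apply the log to spare, a list of bytearray strips."""
        for strip_no in sorted(self._by_strip):            # strip-number order
            strip = spare[strip_no]
            for offset, data in self._by_strip[strip_no]:  # chronological order
                strip[offset:offset + len(data)] = data
```

Replaying by strip number keeps the spare-disk writes roughly sequential, while the per-strip ordering guarantees that overlapping writes end with the newest data.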
According to the second possible implementation of the first aspect, in a third possible implementation of the first aspect, the recording the degraded data in the memory of the controller of the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written, includes:
if it is determined that first data needs to be written to a first strip of the failed disk, determining whether degraded data of the first strip is recorded in the memory;
if so, updating the degraded data of the first strip according to the first data;
if not, determining whether the copy processing is completed;
if the copy processing is completed, writing the first data to the spare disk;
if the copy processing is not completed, recording the number of the first strip and the first data in the memory.
With reference to the first aspect or any one of the first to third possible implementations of the first aspect, in a fourth possible implementation of the first aspect, during the copy processing, if it is determined that a bad sector exists in a second strip of the failed disk, second data is read from the strips that, on the member disks of the RAID other than the failed disk, form a stripe together with the second strip;
an exclusive-OR (XOR) operation is performed across the second data to obtain third data; and
the third data is written to the spare disk.
A second aspect of the present invention provides a RAID reconstruction apparatus, including:
a processing module, configured to perform copy processing of the data on a failed disk in a RAID to a spare disk in the RAID; and
a write module, configured to write degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
In a first possible implementation of the second aspect, the processing module is further configured to power off the failed disk and then power it on again before performing the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
With reference to the second aspect or the first possible implementation of the second aspect, in a second possible implementation of the second aspect, the processing module is further configured to, before the degraded data is written to the spare disk, record the degraded data in the memory of a controller of the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
and the write module is further configured to:
write the degraded data in the memory to the spare disk according to the strip numbers and the chronological order in which the degraded data was written.
According to the second possible implementation of the second aspect, in a third possible implementation of the second aspect, the processing module is further configured to:
if it is determined that first data needs to be written to a first strip of the failed disk, determine whether degraded data of the first strip is recorded in the memory;
if so, update the degraded data of the first strip according to the first data;
if not, determine whether the copy processing is completed;
if the copy processing is completed, write the first data to the spare disk;
if the copy processing is not completed, record the number of the first strip and the first data in the memory.
With reference to the second aspect or any one of the first to third possible implementations of the second aspect, in a fourth possible implementation of the second aspect, the processing module is further configured to, during the copy processing, if it is determined that a bad sector exists in a second strip of the failed disk, read second data from the strips that, on the member disks of the RAID other than the failed disk, form a stripe together with the second strip;
perform an exclusive-OR (XOR) operation across the second data to obtain third data; and
write the third data to the spare disk.
With the RAID reconstruction method and apparatus provided in the embodiments of the present invention, the data on the failed disk in the RAID is copied to the spare disk in the RAID, and the degraded data is then written to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded. This effectively increases the data acquisition rate and improves reconstruction efficiency. Because reconstruction no longer requires reading all the data on the member disks other than the failed disk, the I/O pressure on those member disks is reduced, the impact on normal service I/O is reduced, and the overhead of the RAID group system is lowered. It also avoids the situation in which a bad sector on another member disk makes data impossible to reconstruct, which ensures the validity and integrity of the reconstructed data and reduces the impact on service applications.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the following briefly introduces the accompanying drawings required for describing the embodiments or the prior art. Apparently, the drawings in the following description show some embodiments of the present invention, and a person of ordinary skill in the art may derive other drawings from them without creative effort.
FIG. 1 is a schematic structural diagram of RAID 5;
FIG. 2 is a flowchart of Embodiment 1 of the RAID reconstruction method provided in the present invention;
FIG. 3 is a flowchart of Embodiment 2 of the RAID reconstruction method provided in the present invention;
FIG. 4A is a schematic diagram of the degraded data recorded in Embodiment 2 of the RAID reconstruction method provided in the present invention;
FIG. 4B is a flowchart of recording degraded data in Embodiment 2 of the RAID reconstruction method provided in the present invention;
FIG. 5 is a flowchart of Embodiment 3 of the RAID reconstruction method provided in the present invention;
FIG. 6 is a schematic structural diagram of Embodiment 1 of the RAID reconstruction apparatus provided in the present invention;
FIG. 7 is a schematic structural diagram of Embodiment 2 of the RAID reconstruction apparatus provided in the present invention.
Detailed Description
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the following clearly and completely describes the technical solutions in the embodiments with reference to the accompanying drawings. Apparently, the described embodiments are some rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
FIG. 2 is a flowchart of Embodiment 1 of the RAID reconstruction method provided in the present invention. As shown in FIG. 2, the RAID reconstruction method of this embodiment includes:
S101. Perform copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
This embodiment may be executed by a RAID reconstruction apparatus, which may be included in the controller of the RAID group.
Specifically, this embodiment applies to a scenario in which a disk in the RAID group fails and the data on the failed disk is to be reconstructed.
In an embodiment of the present invention, the following may further be included before S101:
powering off the failed disk and then powering it on again.
Specifically, judging from disk failures observed at present, a failed disk can be used again after it is power-cycled, although its health has declined severely and the probability of another failure is very high. In this embodiment, after the failed disk is power-cycled, the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID is performed.
S102. Write degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
Specifically, in this embodiment, after a disk in the RAID group fails, the RAID reconstruction apparatus records the RAID group as being in a degraded state. After the RAID group is degraded, it can continue to receive data as it does when the RAID group is normal. The new data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed is written to the spare disk, so that the data on the spare disk is brought up to the latest state.
For example, in addition to RAID 5, the RAID reconstruction method of this embodiment may also be applied to other RAID levels such as RAID 6 and RAID 50.
In this embodiment, after a disk in the RAID group fails, the data on the failed disk is reconstructed by copying the data on the failed disk in the RAID to the spare disk in the RAID and then writing the degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded. In the prior art, by contrast, the data on the failed disk is obtained by reading the data from all the disks other than the failed disk and performing XOR operations on all of it. By copying data from the failed disk to the spare disk, this embodiment effectively increases the data acquisition rate and improves reconstruction efficiency. Because reconstruction no longer requires reading all the data on the member disks other than the failed disk, the I/O pressure on those member disks is reduced, the impact on normal service I/O is reduced, and the overhead of the RAID group system is lowered. Moreover, in prior-art reconstruction, if a bad sector is encountered on a member disk other than the failed disk, valid data cannot be reconstructed and the bad sector can only be marked. Even if reconstruction of the RAID group eventually completes, the data in the marked bad sectors is invalid. If a marked bad-sector region holds key file system data, the operating system reports file system corruption and service is interrupted. By copying the data directly from the failed disk, this embodiment avoids the situation in which a bad sector on another member disk makes data impossible to reconstruct, which ensures the validity and integrity of the reconstructed data and reduces the impact on service applications.
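As a rough illustration of the I/O argument above, the following sketch counts the strip reads issued to the healthy member disks under the two approaches. The function names, the parameters, and the simple counting model (one read per strip, no caching, XOR recovery only for strips that cannot be read from the failed disk) are assumptions made for illustration and do not appear in the original text.

```python
def healthy_reads_xor_rebuild(member_disks: int, strips_per_disk: int) -> int:
    """Strip reads on healthy members when every strip is rebuilt by XOR."""
    return (member_disks - 1) * strips_per_disk


def healthy_reads_copy_rebuild(member_disks: int, bad_strips: int) -> int:
    """Strip reads on healthy members when only unreadable strips fall back to XOR."""
    return (member_disks - 1) * bad_strips


# Hypothetical RAID 5 group: 5 member disks, 1000 strips each,
# 2 strips on the failed disk that cannot be read during the copy.
xor_reads = healthy_reads_xor_rebuild(5, 1000)
copy_reads = healthy_reads_copy_rebuild(5, 2)
```

Under these assumed numbers the healthy member disks serve 4000 strip reads in the XOR-only rebuild and 8 in the copy-based rebuild; real figures depend on the array geometry and on how many strips are actually unreadable.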
In the RAID reconstruction method of this embodiment, the data on the failed disk in the RAID is copied to the spare disk in the RAID, and the degraded data is then written to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed. This effectively increases the data acquisition rate and improves reconstruction efficiency. Because reconstruction no longer requires reading all the data on the member disks other than the failed disk, the I/O pressure on those member disks is reduced, the impact on normal service I/O is reduced, and the overhead of the RAID group system is lowered. It also avoids the situation in which a bad sector on another member disk makes data impossible to reconstruct, which ensures the validity and integrity of the reconstructed data and reduces the impact on service applications.
FIG. 3 is a flowchart of Embodiment 2 of the RAID reconstruction method provided in the present invention. As shown in FIG. 3, the RAID reconstruction method of this embodiment records the degraded data in the memory of the controller of the RAID, and may include:
S201. Power off the failed disk and then power it on again.
S202. Perform copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
S203. Record the degraded data in the memory of the controller of the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written.
Specifically, because the amount of data that changes on a single disk of the RAID group during reconstruction is limited, the data that needs to be written to the failed disk after the RAID group fails and before the copy processing is completed can be recorded, in bitmap form, in the memory of the controller of the RAID. After the copy processing is completed, the degraded data recorded in the memory is flushed to the spare disk.
It can be understood that recording the degraded data in the memory of the controller of the RAID is one possible implementation. Other locations for recording the degraded data, such as a dedicated storage device, may also be used to implement this embodiment, and the present invention imposes no limitation in this regard.
FIG. 4A is a schematic diagram of the degraded data recorded in Embodiment 2 of the RAID reconstruction method provided in the present invention. As shown in FIG. 4A, records are organized by the strip number of the failed disk. Each record holds the physical start address (P) and the data length (LEN) of the data that needs to be written to the failed disk. Multiple modifications to the same strip are recorded in chronological order in the form of a linked list.
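The record layout described for FIG. 4A can be sketched as a per-strip list of pending writes. This is a minimal illustration only: the class names `DegradedWrite` and `DegradedLog`, the `data` payload field, and the use of a Python list in place of a linked list are assumptions, since the text names only the strip number, P, and LEN.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class DegradedWrite:
    """One pending write destined for the failed disk (hypothetical layout)."""
    start: int    # physical start address P
    length: int   # data length LEN
    data: bytes   # payload; an assumed field, not named in the figure text


class DegradedLog:
    """Pending writes grouped by strip number, kept in arrival order."""

    def __init__(self):
        # A Python list stands in for the linked list described in the text.
        self.by_strip = defaultdict(list)

    def record(self, strip_no: int, start: int, data: bytes) -> None:
        self.by_strip[strip_no].append(DegradedWrite(start, len(data), data))


log = DegradedLog()
log.record(7, start=0x1000, data=b"abcd")
log.record(7, start=0x1002, data=b"XY")   # later modification of the same strip
log.record(3, start=0x0400, data=b"zz")
```

Appending to the per-strip list preserves the order in which modifications arrived, which is the property the later flush step relies on.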
FIG. 4B is a flowchart of recording degraded data in Embodiment 2 of the RAID reconstruction method provided in the present invention. As shown in FIG. 4B, the process of recording degraded data in this embodiment may include:
S301. Determine whether first data written to the RAID group needs to be written to a first strip of the failed disk. If so, perform S302; otherwise, perform S307.
S302. Determine whether degraded data of the first strip is recorded in the memory. If so, perform S303; otherwise, perform S304.
S303. Update the degraded data of the first strip according to the first data.
S304. Determine whether the copy processing is completed. If so, perform S305; otherwise, perform S306.
S305. Write the first data to the spare disk.
S306. Record the number of the first strip and the first data in the memory.
S307. Write the first data to a member disk other than the failed disk.
Specifically, if degraded data of the first strip is recorded in the memory, the first data is recorded after the existing degraded data of the first strip, following the chronological order of the writes. Otherwise, it is determined whether the copy processing is completed. If it is, the first data is written directly to the spare disk and does not need to be recorded in the memory. If it is not, no data to be written to the first strip has yet been recorded in the memory, so the number of the first strip and the first data are recorded in the memory.
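The decision flow of S301 to S307 can be sketched as a single routing function. This is an illustrative sketch, not the controller's actual code: the function name `route_write`, its parameters, the dictionary-based log, and the convention of returning the step label are all assumptions.

```python
def route_write(log, strip_no, data, targets_failed_disk, copy_done):
    """Return the label of the FIG. 4B step that handles one incoming write.

    `log` maps a strip number to a list of pending payloads (hypothetical layout).
    """
    if not targets_failed_disk:
        return "S307"                 # goes to a healthy member disk
    if strip_no in log:
        log[strip_no].append(data)    # S303: extend the existing record
        return "S303"
    if copy_done:
        return "S305"                 # caller writes straight to the spare disk
    log[strip_no] = [data]            # S306: first record for this strip
    return "S306"


log = {}
steps = [
    route_write(log, 4, b"a", targets_failed_disk=True, copy_done=False),
    route_write(log, 4, b"b", targets_failed_disk=True, copy_done=False),
    route_write(log, 9, b"c", targets_failed_disk=True, copy_done=True),
    route_write(log, 2, b"d", targets_failed_disk=False, copy_done=False),
]
```

Note that a strip that already has a record keeps accumulating entries even after the copy finishes, which keeps later writes ordered behind the earlier ones until the record is flushed.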
S204. Write the degraded data in the memory to the spare disk according to the strip numbers and the chronological order in which the degraded data was written.
Specifically, because the pieces of degraded data were recorded in the memory at different times, the degraded data in the memory needs to be written to the spare disk according to the strip numbers and the chronological order in which the degraded data was written.
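The ordering requirement of S204 can be sketched as follows. The spare disk is modeled as a `bytearray` and each record as an `(offset, data)` pair; both are simplifying assumptions, as are the function name `flush_degraded` and the choice to clear the log afterwards.

```python
def flush_degraded(log, spare):
    """Apply pending writes to `spare`, lowest strip number first, oldest first."""
    for strip_no in sorted(log):
        for offset, data in log[strip_no]:
            spare[offset:offset + len(data)] = data
    log.clear()


spare = bytearray(b"." * 12)
log = {
    2: [(8, b"cc")],
    0: [(0, b"aaaa"), (2, b"BB")],   # the second write overlaps the first
}
flush_degraded(log, spare)
```

Replaying each strip's records oldest first means that, where two writes overlap, the later one ends up on the spare disk, which matches what would have happened had the writes gone to a healthy disk in arrival order.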
In the RAID reconstruction method of this embodiment, the data on the failed disk in the RAID is copied to the spare disk in the RAID; the degraded data is recorded in the memory of the controller of the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written; and the degraded data in the memory is then written to the spare disk according to the strip numbers and the chronological order in which the degraded data was written. This effectively increases the data acquisition rate and improves reconstruction efficiency. Because reconstruction no longer requires reading all the data on the member disks other than the failed disk, the I/O pressure on those member disks is reduced, the impact on normal service I/O is reduced, and the overhead of the RAID group system is lowered. It also avoids the situation in which a bad sector on another member disk makes data impossible to reconstruct, which ensures the validity and integrity of the reconstructed data and reduces the impact on service applications.
FIG. 5 is a flowchart of Embodiment 3 of the RAID reconstruction method provided in the present invention. As shown in FIG. 5, this embodiment describes the scenario in which a strip of the failed disk turns out to be faulty during reconstruction, and includes:
S401. Power off the failed disk and then power it on again.
S402. Perform copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
S403. Determine whether, during the copy processing, a bad sector exists in a second strip of the failed disk. If so, perform S404; otherwise, perform S405.
S404. Read second data from the strips that, on the member disks of the RAID other than the failed disk, form a stripe together with the second strip; perform an XOR operation across the second data to obtain third data; and write the third data to the spare disk.
S405. The copy processing is completed.
Specifically, after the copy processing is completed, the state of the RAID group may be recorded as "reconstruction nearly complete".
S406. Flush the recorded degraded data to the spare disk.
Specifically, after the recorded degraded data is flushed to the spare disk, the state of the RAID group may be changed to "reconstruction complete", and an alarm is reported to prompt the customer to replace the failed disk.
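The XOR fallback in S404 can be illustrated for a single-parity layout such as RAID 5, where the parity strip is the XOR of the data strips in the same stripe. The helper name `xor_strips` and the two-byte sample strips are assumptions for illustration; layouts with a second, non-XOR parity (for example RAID 6) are outside the scope of this sketch.

```python
from functools import reduce
from operator import xor


def xor_strips(strips):
    """Byte-wise XOR of equally sized strips."""
    return bytes(reduce(xor, column) for column in zip(*strips))


# Hypothetical stripe with three data strips and one parity strip.
d0, d1, d2 = b"\x01\x02", b"\x10\x20", b"\x0f\xf0"
parity = xor_strips([d0, d1, d2])

# Suppose d1 sits on the failed disk and cannot be read during the copy:
# XOR of the peer strips (the "second data") yields the "third data".
rebuilt_d1 = xor_strips([d0, d2, parity])
```

Because XOR is its own inverse, combining the surviving strips of the stripe reproduces the unreadable strip exactly, so only the affected stripes require reads from the healthy member disks.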
In the RAID reconstruction method of this embodiment, during the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID, if it is determined that a bad sector exists in a second strip of the failed disk, second data is read from the strips that, on the member disks of the RAID other than the failed disk, form a stripe together with the second strip; an XOR operation is performed across the second data to obtain third data; and the third data is written to the spare disk. This effectively increases the data acquisition rate and improves reconstruction efficiency. Because reconstruction no longer requires reading all the data on the member disks other than the failed disk, the I/O pressure on those member disks is reduced, the impact on normal service I/O is reduced, and the overhead of the RAID group system is lowered. It also avoids the situation in which a bad sector on another member disk makes data impossible to reconstruct, which ensures the validity and integrity of the reconstructed data and reduces the impact on service applications.
FIG. 6 is a schematic structural diagram of Embodiment 1 of the RAID reconstruction apparatus provided in the present invention. As shown in FIG. 6, the RAID reconstruction apparatus of this embodiment may include a processing module 61 and a write module 62. The processing module 61 is configured to perform copy processing of the data on the failed disk in the RAID to the spare disk in the RAID. The write module 62 is configured to write degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
In an embodiment of the present invention, the processing module 61 is further configured to power off the failed disk and then power it on again before performing the copy processing of the data on the failed disk in the RAID to the spare disk in the RAID.
The apparatus of this embodiment may be used to execute the technical solution of the method embodiment shown in FIG. 2. Its implementation principle and technical effects are similar and are not described here again.
In an embodiment of the present invention, the processing module 61 is further configured to, before the degraded data is written to the spare disk, record the degraded data in the memory of the controller of the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
and the write module 62 is further configured to:
write the degraded data in the memory to the spare disk according to the strip numbers and the chronological order in which the degraded data was written.
In an embodiment of the present invention, the processing module 61 is further configured to:
if it is determined that first data needs to be written to a first strip of the failed disk, determine whether degraded data of the first strip is recorded in the memory;
if so, update the degraded data of the first strip according to the first data;
if not, determine whether the copy processing is completed;
if the copy processing is completed, write the first data to the spare disk;
if the copy processing is not completed, record the number of the first strip and the first data in the memory.
The apparatus of this embodiment may be used to execute the technical solution of the method embodiment shown in FIG. 3. Its implementation principle and technical effects are similar and are not described here again.
In an embodiment of the present invention, the processing module 61 is further configured to, during the copy processing, if it is determined that a bad sector exists in a second strip of the failed disk, read second data from the strips that, on the member disks of the RAID other than the failed disk, form a stripe together with the second strip;
perform an XOR operation across the second data to obtain third data; and
write the third data to the spare disk.
The apparatus of this embodiment may be used to execute the technical solution of the method embodiment shown in FIG. 5. Its implementation principle and technical effects are similar and are not described here again.
FIG. 7 is a schematic structural diagram of Embodiment 2 of the RAID reconstruction apparatus provided in the present invention. As shown in FIG. 7, the RAID reconstruction apparatus of this embodiment may include at least one bus 71, at least one processor 72 connected to the bus 71, and at least one memory 73 connected to the bus 71. The processor 72 invokes, through the bus 71, code stored in the memory 73 to:
perform copy processing of the data on the failed disk in the RAID to the spare disk in the RAID; and
write degraded data to the spare disk, where the degraded data includes data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
The apparatus of this embodiment may be used to execute any one of the technical solutions of the method embodiments shown in FIG. 2, FIG. 3, or FIG. 5. Its implementation principle and technical effects are similar and are not described here again.
It should be noted that the schematic structural diagrams corresponding to the foregoing embodiments are merely illustrative. The connection relationships between the parts or modules are not limited to the forms shown in the figures and are subject to the actual application.
在本发明所提供的几个实施例中, 应该理解到, 所揭露的装置和方法, 可以通过其它的方式实现。例如, 以上所描述的装置实施例仅仅是示意性的, 例如, 所述单元的划分, 仅仅为一种逻辑功能划分, 实际实现时可以有另外 的划分方式, 例如多个单元或组件可以结合或者可以集成到另一个系统, 或 一些特征可以忽略, 或不执行。 另一点, 所显示或讨论的相互之间的耦合或 直接耦合或通信连接可以是通过一些接口, 装置或单元的间接耦合或通信连 接, 可以是电性, 机械或其它的形式。  In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical, mechanical or otherwise.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist alone physically, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
The integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform some of the steps of the methods described in the embodiments of the present invention. The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
A person skilled in the art can clearly understand that, for convenience and brevity of description, only the division into the foregoing functional modules is used as an example. In practical applications, the foregoing functions may be allocated to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or some of the functions described above. For the specific working process of the apparatus described above, refer to the corresponding process in the foregoing method embodiments; details are not described here again.
Finally, it should be noted that the foregoing embodiments are merely intended to describe the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be equivalently replaced, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims

1. A method for reconstructing a redundant array of independent disks, comprising:
performing copy processing of data on a failed disk in a redundant array of independent disks (RAID) to a spare disk in the RAID; and
writing degraded data to the spare disk, wherein the degraded data comprises data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
2. The method according to claim 1, wherein before the performing copy processing of data on the failed disk in the RAID to the spare disk in the RAID, the method further comprises:
powering off the failed disk and then powering it on again.
3. The method according to claim 1 or 2, wherein before the writing degraded data to the spare disk, the method further comprises:
recording the degraded data in a memory of a controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
and wherein the writing degraded data to the spare disk comprises:
writing the degraded data in the memory to the spare disk in order of strip number and in the chronological order in which the degraded data was written.
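The recording and ordered write-back described in claim 3 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the class name `DegradedLog`, the dictionary standing in for the spare disk, and the assumption that each write replaces a whole strip are all hypothetical simplifications introduced for illustration.

```python
class DegradedLog:
    """Hypothetical in-memory log of writes aimed at the failed disk, grouped by strip."""

    def __init__(self):
        # strip number -> list of data blocks, kept in arrival (chronological) order
        self._by_strip = {}

    def record(self, strip_no, data):
        """Record one degraded write for the given strip."""
        self._by_strip.setdefault(strip_no, []).append(data)

    def flush(self, spare):
        """Write logged data to the spare in strip-number order, then arrival order.

        `spare` is a dict (strip number -> bytes) standing in for the spare disk.
        Returns the list of (strip number, data) writes in the order issued.
        """
        issued = []
        for strip_no in sorted(self._by_strip):
            for data in self._by_strip[strip_no]:
                spare[strip_no] = data  # assumption: each write covers the whole strip
                issued.append((strip_no, data))
        self._by_strip.clear()
        return issued


log = DegradedLog()
log.record(3, b"a")
log.record(1, b"b")
log.record(3, b"c")
spare_disk = {}
order = log.flush(spare_disk)
```

Under these assumptions the writes are issued strip by strip in ascending strip number, and within a strip the most recent write lands last, so the spare ends up holding the newest data for each strip.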
4. The method according to claim 3, wherein the recording the degraded data in the memory of the controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written, comprises:
if it is determined that first data needs to be written to a first strip in the failed disk, determining whether degraded data of the first strip is recorded in the memory;
if degraded data of the first strip is recorded, updating the degraded data of the first strip according to the first data;
if no degraded data of the first strip is recorded, determining whether the copy processing is completed;
if the copy processing is completed, writing the first data to the spare disk; and
if the copy processing is not completed, recording the number of the first strip and the first data in the memory.
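The decision flow of claim 4 can be sketched as a single function. This is only an illustration under stated assumptions: the function name `handle_write`, the dictionaries standing in for the controller memory and the spare disk, the returned status strings, and the reading of "update" as a whole-strip overwrite are hypothetical and not taken from the patent.

```python
def handle_write(strip_no, data, degraded, copy_done, spare):
    """Route one write that targets a strip of the failed disk.

    degraded:  dict (strip number -> data) standing in for controller memory
    copy_done: True once the failed-disk-to-spare copy has finished
    spare:     dict (strip number -> data) standing in for the spare disk
    """
    if strip_no in degraded:
        # Degraded data for this strip is already recorded: update it.
        # Assumption: the update is modelled as a whole-strip overwrite.
        degraded[strip_no] = data
        return "updated"
    if copy_done:
        # Nothing recorded for this strip and the copy is finished:
        # the write can go straight to the spare disk.
        spare[strip_no] = data
        return "written_to_spare"
    # Copy still in progress: record the strip number and the data in memory.
    degraded[strip_no] = data
    return "recorded"


memory, spare_disk = {}, {}
first = handle_write(7, b"v1", memory, copy_done=False, spare=spare_disk)
second = handle_write(7, b"v2", memory, copy_done=False, spare=spare_disk)
third = handle_write(9, b"x", memory, copy_done=True, spare=spare_disk)
```

The check against recorded data comes first, so a strip that already has logged data keeps being updated in memory even after the copy completes, until the log is written back.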
5. The method according to claims 1-4, wherein, during the copy processing, if it is determined that a bad sector exists in a second strip in the failed disk, second data is read from the strips that, on the member disks of the RAID other than the failed disk, form a stripe with the second strip;
an exclusive OR (XOR) operation is performed across the second data to obtain third data; and
the third data is written to the spare disk.
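The XOR step in claim 5 can be illustrated with a short Python sketch. It assumes a single-parity layout (for example RAID 5), in which the XOR of all surviving strips in a stripe, parity included, equals the missing strip; the function name `xor_strips` and the sample byte values are hypothetical.

```python
from functools import reduce


def xor_strips(strips):
    """Byte-wise XOR across equally sized strips read from the surviving disks."""
    if not strips:
        raise ValueError("at least one strip is required")
    length = len(strips[0])
    if any(len(s) != length for s in strips):
        raise ValueError("all strips must have the same length")
    # zip(*strips) yields one tuple of bytes per byte offset within the strip.
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*strips))


# Two data strips and their parity strip form one stripe.
d0 = bytes([1, 2, 3])
d1 = bytes([4, 5, 6])
parity = xor_strips([d0, d1])

# If d0 sits on the failed disk and cannot be read because of a bad sector,
# it can be rebuilt from the surviving strips of the same stripe.
rebuilt = xor_strips([d1, parity])
```

Here `rebuilt` plays the role of the third data that is written to the spare disk in place of the unreadable strip.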
6. An apparatus for reconstructing a redundant array of independent disks, comprising:
a processing module, configured to perform copy processing of data on a failed disk in a redundant array of independent disks (RAID) to a spare disk in the RAID; and
a writing module, configured to write degraded data to the spare disk, wherein the degraded data comprises data that needs to be written to the failed disk after the RAID is degraded and before the copy processing is completed.
7. The apparatus according to claim 6, wherein the processing module is further configured to, before the performing copy processing of data on the failed disk in the RAID to the spare disk in the RAID, power off the failed disk and then power it on again.
8. The apparatus according to claim 6 or 7, wherein the processing module is further configured to, before the degraded data is written to the spare disk, record the degraded data in a memory of a controller in the RAID, in units of strips of the failed disk and in the chronological order in which the degraded data is written;
and wherein the writing module is further configured to:
write the degraded data in the memory to the spare disk in order of strip number and in the chronological order in which the degraded data was written.
9. The apparatus according to claim 8, wherein the processing module is further configured to:
if it is determined that first data needs to be written to a first strip in the failed disk, determine whether degraded data of the first strip is recorded in the memory;
if degraded data of the first strip is recorded, update the degraded data of the first strip according to the first data;
if no degraded data of the first strip is recorded, determine whether the copy processing is completed;
if the copy processing is completed, write the first data to the spare disk; and
if the copy processing is not completed, record the number of the first strip and the first data in the memory.
10. The apparatus according to claims 6-9, wherein the processing module is further configured to, during the copy processing, if it is determined that a bad sector exists in a second strip in the failed disk, read second data from the strips that, on the member disks of the RAID other than the failed disk, form a stripe with the second strip;
perform an exclusive OR (XOR) operation across the second data to obtain third data; and
write the third data to the spare disk.
PCT/CN2014/080499 2013-10-21 2014-06-23 Reconstruction method and device for redundant array of independent disks WO2015058542A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310496672.1 2013-10-21
CN201310496672.1A CN103513942B (en) 2013-10-21 2013-10-21 The reconstructing method of raid-array and device

Publications (1)

Publication Number Publication Date
WO2015058542A1 (en) 2015-04-30

Family

ID=49896742

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/080499 WO2015058542A1 (en) 2013-10-21 2014-06-23 Reconstruction method and device for redundant array of independent disks

Country Status (2)

Country Link
CN (1) CN103513942B (en)
WO (1) WO2015058542A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103513942B (en) * 2013-10-21 2016-06-29 华为技术有限公司 The reconstructing method of raid-array and device
CN104461791B (en) * 2014-11-28 2017-02-01 华为技术有限公司 Information processing method and device
US20170063399A1 (en) 2015-08-28 2017-03-02 Qualcomm Incorporated Systems and methods for repair redundancy control for large erasure coded data storage
US10001944B2 (en) * 2015-08-28 2018-06-19 Qualcomm Incorporated Systems and methods for data organization in storage systems using large erasure codes
CN105117172B (en) * 2015-08-31 2019-04-02 深圳神州数码云科数据技术有限公司 A kind of disk array history falls the store method of disk record
US10007587B2 (en) 2015-09-18 2018-06-26 Qualcomm Incorporated Systems and methods for pre-generation and pre-storage of repair fragments in storage systems
CN109213428B (en) * 2017-06-30 2021-05-28 伊姆西Ip控股有限责任公司 Method and apparatus for managing storage system
CN111007992B (en) * 2020-03-04 2020-08-04 广东电网有限责任公司佛山供电局 Disk data storage representation method, system and storage medium
CN112181298B (en) * 2020-09-25 2022-05-17 杭州宏杉科技股份有限公司 Array access method, array access device, storage equipment and machine-readable storage medium
CN115344205A (en) * 2022-08-15 2022-11-15 超聚变数字技术有限公司 Data reading and writing method and computing device for disk firmware upgrading process
CN117472292B (en) * 2023-12-27 2024-03-12 苏州元脑智能科技有限公司 Data reconstruction method for redundant array of independent disk, storage system and controller

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101609420A (en) * 2009-07-17 2009-12-23 杭州华三通信技术有限公司 Realize method and the redundant arrays of inexpensive disks and the controller thereof of rebuilding of disc redundant array
CN102521058A (en) * 2011-12-01 2012-06-27 北京威视数据系统有限公司 Disk data pre-migration method of RAID (Redundant Array of Independent Disks) group
CN102981778A (en) * 2012-11-15 2013-03-20 浙江宇视科技有限公司 Redundant array of independent disks (RAID) array reconstruction method and device thereof
US20130080828A1 (en) * 2011-09-23 2013-03-28 Lsi Corporation Methods and apparatus for marking writes on a write-protected failed device to avoid reading stale data in a raid storage system
CN103019894A (en) * 2012-12-25 2013-04-03 创新科存储技术(深圳)有限公司 Reconstruction method for redundant array of independent disks
CN103513942A (en) * 2013-10-21 2014-01-15 华为技术有限公司 Method and device for reconstructing independent redundancy array of inexpensive disks

Also Published As

Publication number Publication date
CN103513942A (en) 2014-01-15
CN103513942B (en) 2016-06-29

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14855228

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14855228

Country of ref document: EP

Kind code of ref document: A1