CN103246478B - A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk - Google Patents

A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk Download PDF

Info

Publication number
CN103246478B
CN103246478B CN201210026590.6A CN201210026590A CN103246478B CN 103246478 B CN103246478 B CN 103246478B CN 201210026590 A CN201210026590 A CN 201210026590A CN 103246478 B CN103246478 B CN 103246478B
Authority
CN
China
Prior art keywords
disk
overall
raid
hotspare disk
overall hotspare
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210026590.6A
Other languages
Chinese (zh)
Other versions
CN103246478A (en
Inventor
王道邦
周泽湘
张伟涛
李艳国
章珉
潘兴旺
张恒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING TOYOU FEIJI ELECTRONICS Co Ltd
Original Assignee
BEIJING TOYOU FEIJI ELECTRONICS Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING TOYOU FEIJI ELECTRONICS Co Ltd filed Critical BEIJING TOYOU FEIJI ELECTRONICS Co Ltd
Priority to CN201210026590.6A priority Critical patent/CN103246478B/en
Publication of CN103246478A publication Critical patent/CN103246478A/en
Application granted granted Critical
Publication of CN103246478B publication Critical patent/CN103246478B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk, comprise one or more RAID, one or more free disk, more than 0 or 1 overall HotSpare disk, system disk and storage operating system.Described RAID is in (SuSE) Linux OS, uses "-C " or "-create " parameter of mdadm software to complete establishment, and is managed by mdadm.Described system disk is connected with each RAID, overall HotSpare disk and free disk; The content stored in system disk includes but not limited to: the storage operating system manage RAID and application program.Described storage operating system is stored in system disk, be one and expanded the (SuSE) Linux OS of kernel, comprising: human-computer interaction module, overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module.The overall HotSpare disk that the present invention is arranged, not by the restriction of RAID group, and without the need to configuration file, is convenient to Data Migration.

Description

A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk
Technical field
The present invention relates to a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk, belong to computer data field of storage.
Background technology
In mass data storage field, usually adopt various types of RAID to build architecture, carry out data storage, RAID is the abbreviation of RedundantArraysofIndependentDisks, is called Redundant Array of Independent Disks (RAID), is called for short disk array.Relatively more conventional RAID has RAID0, RAID1, RAID5 and RAID6 etc.Wherein RAID0 is mainly in order to improve readwrite performance, does not have redundant ability.Other RAID then has data redundancy in various degree: RAID1 has done mirror image to disk, and redundance reaches 50%, and another mirrored disk can be utilized during certain disk failures to rebuild; Each band of RAID5 contains 1 check block, supports to damage arbitrarily one of them disk, carry out data reconstruction by the parity block on other disk; Each band of RAID6 contains 2 check blocks, support to damage arbitrarily wherein two disks, carry out data reconstruction by the parity block on other disk.For these redundancy RAID, when the low-quality disk number occurred in allowed limits time, can not valid data be lost, but the decline of system performance can be caused, now can manual operation, replace out by the disk of damage, the disk changed into carries out data reconstruction; But if the disk damaged can not be replaced in time, or the low-quality disk number occurred has exceeded the low-quality disk number of this redundancy RAID permission simultaneously, just loss of data can be there is, now again low-quality disk being replaced to It dones't help the situation, for avoiding the generation of this situation, can HotSpare disk being set.
Disc array system comprises one or more RAID.HotSpare disk is idle in disc array system but is in the disk powering up holding state.HotSpare disk is divided into local HotSpare disk (LocalSpareDrive) and overall HotSpare disk (GlobalSpareDrive) two kinds, and local HotSpare disk is only responsible for single RAID, and overall HotSpare disk is then responsible for the group of multiple RAID composition.The HotSpare disk that overall situation HotSpare disk can realize between multiple RAID is shared, and cost-saving as far as possible.HotSpare disk will be not less than member's dish that in the RAID that this HotSpare disk is responsible for, capacity is minimum on capacity.After certain member's adjustment debit in RAID is bad, HotSpare disk can replace the disk damaged automatically, and the data reconstruction be originally stored on this damage disk on HotSpare disk, rebuilds successfully, HotSpare disk is just transformed into member's dish of this RAID, ensure that performance and the data integrity of RAID.
Software PLC is some disks, according to user to difference, be aggregating and become a large virtual RAID device, if each disk size is inconsistent, based on the disk of minimum capacity.Software PLC needs the support of operating system in realization, and in (SuSE) Linux OS, software PLC is created by " mdadm " and managed." mdadm " is one and is used for specially creating and the software of management software PLC, default installation in most of linux system.In the (SuSE) Linux OS installing mdadm, the method for the most frequently used establishment RAID uses mdadm "-C " or "-create " parameter to create.The method is when creating RAID, the spatial placement that ad-hoc location on RAID member's dish or HotSpare disk can be started 4KB size is superblock (superblocks), and the information of RAID is written in the superblock (superblocks) of each member, the rank that the information of RAID comprises RAID, the member comprised, and the UUID etc. of RAID.RAID creates successfully, and for convenience of management, usually the main configuration information record of RAID be called in the configuration file of " mdadm.conf " at "/etc " catalogue next one, this configuration file needs real-time update, to reflect the RAID situation that system is current, RAID situation comprises the information such as the rank of RAID, current state and member's dish of comprising, the current state of RAID, comprises init state, rebuilding, online, warning, critical, offline etc., and wherein, init state represents carries out initialization, namely applies the process that mdadm creates RAID, rebuilding represents and rebuilds, and namely utilizes HotSpare disk to replace the low-quality disk in RAID, online is that initialization completes, the state that can normally use, warning, critical, this RAID of offline state representation has problems, such as, the RAID5 that three pieces of disks are set up, after initialization completes, for online state, and wherein one piece of hard disk generation physical fault time, this RAID is critical state, but still can continue to use, if there are two pieces of disk failures, this RAID can become offline state, cannot continue to use, warning state only just has in RAID6, a RAID6 set up by four pieces of disks, one piece of disk failures is had to be warning state, two pieces of disk failures are critical, three pieces of faults are then offline states.
Existing based in the disc array system of software PLC, the physical disk that can normally use can take on two kinds of roles: or participate in the structure of certain RAID, and as member's dish or local HotSpare disk, be the member of RAID; Being in idle condition, is free disk, does not participate in the structure of any RAID.Judge what state certain block disk is in, comparatively directly and reliably determination methods is by judging whether store effective superblock structure example in superblock, can know the current state of disk.That is: if store effective superblock structure example in the superblock of certain disk, then this disk is member's dish of RAID or local HotSpare disk; If do not comprise effective superblock structure example in the superblock of certain disk, then this disk is free disk.This is because: a superblock structure is defined in mdadm inside, the first member of this structure is the data of a regular length and content, as the foundation judging superblock structure, once certain block disk take part in the establishment of software PLC, will be responsible for by mdadm, the 4KB space to be preserved started by ad-hoc location on this block disk is out as the superblock storage area of this disk, and utilize create RAID necessary configuration information in internal memory, safeguard the example of a superblock structure, then with the ad-hoc location of specifying for reference position, the example of this structure is saved in the superblock storage area of this disk, when disk is in idle condition, the 4KB space that this ad-hoc location starts, namely the superblock storage area of this block disk, generally can not deposit effective information, also would not occur the example of superblock structure.
Existing based in the disc array system of software PLC, overall situation HotSpare disk is arranged for certain RAID group, a group is formed by several RAID, for this group arranges overall HotSpare disk, like this, when member's dish of any one RAID goes wrong in this group, overall HotSpare disk can be automatically used to carry out data reconstruction.In software PLC environment, from realization means, the overall HotSpare disk arranged needs the group information belonging to RAID to be recorded in " mdadm.conf " configuration file, although this facilitates management, but by certain configuration file, overall HotSpare disk is managed, there is following problem: 1. " mdadm.conf " configuration file belongs to the configuration file of operating system, in the bulk migration process of disk, configuration file can not move thereupon, thus cause the application reliability of overall HotSpare disk that arranges lower, be also unfavorable for Data Migration.When 2. having multiple RAID group in based on the disc array system of software PLC, the RAID member in group that overall HotSpare disk only can be responsible for for it be coiled responsible, can not realize sharing of overall HotSpare disk between different RAID group.3. from bottom layer realization mechanism, proper overall HotSpare disk is there is not in software PLC, the group formed for multiple RAID and the overall HotSpare disk arranged, the local HotSpare disk being taken as certain RAID in group in fact when bottom layer realization, when member's adjustment debit bad time of certain RAID in group, first check whether this RAID has local HotSpare disk, if had, local HotSpare disk is then utilized to rebuild, if there is no local HotSpare disk, check belonging to this RAID, whether group has overall HotSpare disk again, if had, need first overall HotSpare disk to be removed from the RAID at its place, be set to the local HotSpare disk of the RAID producing low-quality disk again, and then carry out data reconstruction.
Summary of the invention
The object of the invention is, for the deficiency of existing disc array system existence in the setting and realization of overall HotSpare disk based on software PLC at present, to propose a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk.
Support, without a disc array system for packet type overall situation HotSpare disk, to comprise: one or more RAID, one or more free disk, more than 0 or 1 overall HotSpare disk, system disk and storage operating system based on software PLC.
Described RAID is in (SuSE) Linux OS, uses "-C " or "-create " parameter of mdadm software to complete establishment, and is managed by mdadm; Have superblock structure example in the superblock of described RAID member, the first member of superblock structure example is the data of a regular length and content, is called RAID member flag position.Described RAID is redundancy RAID.
Described system disk is connected with each RAID, overall HotSpare disk and free disk; The content stored in system disk includes but not limited to: the storage operating system manage RAID and application program.
Described storage operating system is stored in system disk, be one and expanded the (SuSE) Linux OS of kernel, comprising: human-computer interaction module, overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module.
The function of described human-computer interaction module sends for user provides the interface that overall HotSpare disk creates the overall HotSpare disk title that instruction, overall HotSpare disk search instruction, overall HotSpare disk delete instruction and input will be deleted; Human-computer interaction module also exports the operating result of overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module.
The function of described overall HotSpare disk creation module is receiving after overall HotSpare disk that human-computer interaction module sends creates instruction, for described disc array system creates an overall HotSpare disk, and establishment result sent to human-computer interaction module.
The function of described overall HotSpare disk retrieval module is receiving overall HotSpare disk search instruction that human-computer interaction module sends or after receiving the overall HotSpare disk search instruction that RAID low-quality disk check processing module sends, produce an overall HotSpare disk list, and result for retrieval is sent to human-computer interaction module.
The function of described overall HotSpare disk removing module is after receiving the human-computer interaction module overall HotSpare disk delete instruction sent and the overall HotSpare disk title that will delete, remove the content in superblock on this overall HotSpare disk, this dish is become free disk, and operating result is sent to human-computer interaction module.
The function of described RAID low-quality disk check processing module is: 1. timing detects the state of all RAID in described disc array system, and the state of RAID comprises: init state, rebuilding state, online state, warning state, critical state, offline state etc.; When certain RAID of discovery is in critical state, (this RAID still can work, but the member that the number of working disks is less than this RAID coils sum), and when this RAID does not have a local HotSpare disk, send overall HotSpare disk recall signal to overall HotSpare disk retrieval module; 2. when RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID comprising low-quality disk, capacity is minimum at capacity, low-quality disk is replaced, and completes information to the reparation of human-computer interaction module transmission low-quality disk.
What the present invention proposed a kind ofly based on software PLC support without the annexation of each functional module of the disc array system of packet type overall situation HotSpare disk is:
Storage operating system and application storage are on system disk, and system disk is arranged on the inner without the disc array system of packet type overall situation HotSpare disk based on software PLC support of the present invention's proposition, and system disk is connected with all RAID, overall HotSpare disk and free disk; Human-computer interaction module respectively with overall HotSpare disk creation module, overall HotSpare disk removing module, overall HotSpare disk retrieval module, RAID low-quality disk check processing model calling; Overall situation HotSpare disk creation module is connected with human-computer interaction module, all RAID, whole overall HotSpare disk and free disk respectively; Overall situation HotSpare disk removing module is connected with human-computer interaction module, whole overall HotSpare disk and free disk respectively; Overall situation HotSpare disk retrieval module is connected with RAID low-quality disk check processing module, human-computer interaction module, all RAID, overall HotSpare disk and free disk respectively; RAID low-quality disk check processing module is connected with overall HotSpare disk retrieval module, human-computer interaction module, all RAID, overall HotSpare disk and free disk respectively.
What the present invention proposed a kind ofly comprise overall HotSpare disk visioning procedure based on software PLC support without the workflow of disc array system of packet type overall situation HotSpare disk, overall HotSpare disk list checks that flow process, RAID low-quality disk detection procedure and overall HotSpare disk delete flow process.
Overall situation HotSpare disk visioning procedure comprises the 1.1st step to the 1.3rd step, is specially:
1.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk by human-computer interaction module to overall HotSpare disk creation module and creates instruction;
1.2nd step: the content deposited in the superblock of overall HotSpare disk creation module according to RAID member's dishes all in described disc array system, calculates the minimum capacity requirement of overall HotSpare disk;
Preferably, the step that the minimum capacity calculating overall HotSpare disk requires comprises 1.2.1 step to 1.2.2 step, is specially:
1.2.1 walks: the capability value counting member's dish that capacity is minimum in all RAID member's dishes in described disc array system;
1.2.2 walks: in the result obtained from 1.2.1 step, find out the minimum capacity requirement of maximal value as overall HotSpare disk.
1.3rd step: overall HotSpare disk creation module finds out the disk being in idle condition in described disc array system, and therefrom select a capacity to be not less than the disk of overall HotSpare disk minimum capacity requirement, this free disk arranges superblock, and on free disk, the start address of superblock is identical with the start address of the superblock of RAID member; Then from the start address of the superblock of this free disk, deposit overall HotSpare disk zone bit, be created as overall HotSpare disk, and operating result is sent to human-computer interaction module.Described overall HotSpare disk zone bit is the data of a regular length preset and content, and they are different from the content of the RAID member flag position in the superblock of RAID member in described disc array system.
The list of overall situation HotSpare disk checks that flow process comprises the 2.1st step to the 2.2nd step, is specially:
2.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk search instruction by human-computer interaction module to overall HotSpare disk retrieval module;
2.2nd step: after overall HotSpare disk retrieval module receives overall HotSpare disk search instruction, produces overall HotSpare disk list, and sends to human-computer interaction module;
The method of described generation overall HotSpare disk list is: from described disc array system the superblock of each disk start address, read the content of overall HotSpare disk zone bit length, if it is overall HotSpare disk zone bit, then the title of this block disk is joined in overall HotSpare disk list.
Preferably, the method of described generation overall HotSpare disk list is: during every secondary generation overall situation HotSpare disk list, regenerate overall HotSpare disk list of the same name, then from described disc array system the superblock of each disk start address read the content of overall HotSpare disk zone bit length, if it is overall HotSpare disk zone bit, then the title of this block disk is joined in overall HotSpare disk list.
RAID low-quality disk detection procedure comprises the 3.1st step to the 3.4th step, is specially:
3.1st step: in disc array system normal course of operation, the timing of RAID low-quality disk check processing module is monitored the state of RAID all in described disc array system;
3.2nd step: (this RAID's RAID low-quality disk check processing module still can work once find to be in critical state by certain RAID, but the member that the number of working disks is less than this RAID coils sum), and when this RAID does not have a local HotSpare disk, send overall HotSpare disk recall signal to overall HotSpare disk retrieval module;
3.3rd step: overall HotSpare disk retrieval module, after receiving the overall HotSpare disk recall signal that RAID low-quality disk check processing module sends, produces overall HotSpare disk list, and sends to RAID low-quality disk check processing module;
3.4th step: after RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID being in critical state, capacity is minimum at capacity, this overall HotSpare disk is used to replace the low-quality disk be in the RAID of critical state, the content of superblock storage area on overall HotSpare disk is rewritten by mdadm, then data reconstruction is performed, send to human-computer interaction module the RAID being in critical state after being replaced successfully and repaired information.
Overall situation HotSpare disk is deleted flow process and is comprised the 4.1st step to the 4.2nd step, is specially:
4.1st step: in disc array system normal course of operation, user sends overall HotSpare disk erasure signal with the overall HotSpare disk title that will delete to overall HotSpare disk removing module by human-computer interaction module;
4.2nd step: overall HotSpare disk removing module removes the content on this overall HotSpare disk in superblock, this dish is become free disk, and operating result is sent to human-computer interaction module.
Beneficial effect
The a kind of of the present invention's proposition has the following advantages compared with current existing disc array system based on the disc array system of software PLC support without packet type overall situation HotSpare disk:
(1) without the need to configuration file, Data Migration is convenient to.In the disk array that general software PLC mode realizes, the relevant informations such as the RAID group that overall situation HotSpare disk is responsible for need to be recorded in the configuration file of " mdadm.conf " by name, this file belongs to the configuration file of operating system, in the bulk migration process of disk, configuration file can not move thereupon, thus cause the application reliability of the overall HotSpare disk arranged lower, also Data Migration is unfavorable for, and directly the superblock storage area of overall HotSpare disk is operated in the present invention, as long as the storage operating system version before and after disk migration is identical, just can directly use.
(2) the shared use of overall HotSpare disk between all RAID can be realized.When having multiple RAID group in the disk array of general employing software PLC, the overall HotSpare disk of setting only can coil responsible for the RAID member in this group, and the overall HotSpare disk adopting the present invention to arrange, can be used for the current all RAID of disc array system to share and use.
(3) overall HotSpare disk truly under software PLC environment is achieved.From bottom layer realization mechanism, proper overall HotSpare disk is there is not in general software PLC environment, the group formed for multiple RAID and the overall HotSpare disk arranged, the local HotSpare disk being taken as certain RAID in group in fact when bottom layer realization, and the overall HotSpare disk that the present invention is arranged is totally independent of concrete RAID before not used, when only there is the low-quality disk needing to replace with overall HotSpare disk in certain RAID, just under capacity meets the prerequisite of rebuilding and requiring, can be used by concrete RAID.
Accompanying drawing explanation
Fig. 1 is a kind of architectural schematic supporting the disc array system without packet type overall situation HotSpare disk based on software PLC in the specific embodiment of the invention.
Embodiment
Below in conjunction with the drawings and specific embodiments, the present invention is described in further detail.
Of the present invention a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk, its structure as shown in Figure 1, comprising: each 1 of RAID1, RAID5, RAID6,2 free disk, 1 overall HotSpare disk and system disk, storage operating systems.
Described RAID1, RAID5, RAID6 are in (SuSE) Linux OS, use "-C " or "-create " parameter of the mdadm software of 3.1.2 version to complete establishment, and are managed by mdadm; The superblock of described RAID member is the 4KB space from the 0x1000 of its storage space, has superblock structure example in superblock, and the first member of superblock structure example is the data of a regular length and content, is called RAID member flag position.In this example, RAID member flag position is " 0xa92b4efc ".
Described system disk is connected with each RAID, overall HotSpare disk and free disk; The content stored in system disk comprises: the storage operating system manage RAID and application program.
Described storage operating system is stored in system disk, be one and expanded the (SuSE) Linux OS of kernel, comprising: human-computer interaction module, overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module.
The function of described human-computer interaction module sends for user provides the interface that overall HotSpare disk creates the overall HotSpare disk title that instruction, overall HotSpare disk search instruction, overall HotSpare disk delete instruction and input will be deleted; Human-computer interaction module also exports the operating result of overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module.
The function of described overall HotSpare disk creation module is receiving after overall HotSpare disk that human-computer interaction module sends creates instruction, for described disc array system creates an overall HotSpare disk, and establishment result sent to human-computer interaction module.
The function of described overall HotSpare disk retrieval module is receiving overall HotSpare disk search instruction that human-computer interaction module sends or after receiving the overall HotSpare disk search instruction that RAID low-quality disk check processing module sends, produce an overall HotSpare disk list, and result for retrieval is sent to human-computer interaction module.
The function of described overall HotSpare disk removing module is after receiving the human-computer interaction module overall HotSpare disk delete instruction sent and the overall HotSpare disk title that will delete, remove the content in superblock on this overall HotSpare disk, this dish is become free disk, and operating result is sent to human-computer interaction module.
The function of described RAID low-quality disk check processing module is: 1. timing detects the state of all RAID in described disc array system, and the state of RAID comprises: init state, rebuilding state, online state, warning state, critical state, offline state etc.; When certain RAID of discovery is in critical state, (this RAID still can work, but the member that the number of working disks is less than this RAID coils sum), and when this RAID does not have a local HotSpare disk, send overall HotSpare disk recall signal to overall HotSpare disk retrieval module; 2. when RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID comprising low-quality disk, capacity is minimum at capacity, low-quality disk is replaced, and completes information to the reparation of human-computer interaction module transmission low-quality disk.
What the present invention proposed a kind ofly based on software PLC support without the annexation of each functional module of the disc array system of packet type overall situation HotSpare disk is:
Storage operating system and application storage are on system disk, system disk is arranged on the inner without the disc array system of packet type overall situation HotSpare disk based on software PLC support of the present invention's proposition, and system disk is connected with RAID1, RAID5, RAID6, overall HotSpare disk 1, free disk 1 and free disk 2; Human-computer interaction module respectively with overall HotSpare disk creation module, overall HotSpare disk removing module, overall HotSpare disk retrieval module, RAID low-quality disk check processing model calling; Overall situation HotSpare disk creation module is connected with human-computer interaction module, RAID1, RAID5, RAID6, overall HotSpare disk 1, free disk 1 and free disk 2 respectively; Overall situation HotSpare disk removing module is connected with human-computer interaction module, overall HotSpare disk 1, free disk 1 and free disk 2 respectively; Overall situation HotSpare disk retrieval module is connected with RAID low-quality disk check processing module, human-computer interaction module, RAID1, RAID5, RAID6, overall HotSpare disk 1, free disk 1 and free disk 2 respectively; RAID low-quality disk check processing module is connected with overall HotSpare disk retrieval module, human-computer interaction module, RAID1, RAID5, RAID6, overall HotSpare disk 1, free disk 1 and free disk 2 respectively.
What the present invention proposed a kind ofly comprise overall HotSpare disk visioning procedure based on software PLC support without the workflow of disc array system of packet type overall situation HotSpare disk, overall HotSpare disk list checks that flow process, RAID low-quality disk detection procedure and overall HotSpare disk delete flow process.
Overall situation HotSpare disk visioning procedure comprises the 1.1st step to the 1.3rd step, is specially:
1.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk by human-computer interaction module to overall HotSpare disk creation module and creates instruction;
1.2nd step: the content deposited in the superblock of overall HotSpare disk creation module according to RAID member's dishes all in described disc array system, calculates the minimum capacity requirement of overall HotSpare disk;
The step that the minimum capacity of the overall HotSpare disk of described calculating requires comprises 1.2.1 step to 1.2.2 step, is specially:
1.2.1 walks: the capability value counting member's dish that capacity is minimum in all RAID member's dishes in described disc array system;
1.2.2 walks: in the result obtained from 1.2.1 step, find out the minimum capacity requirement of maximal value as overall HotSpare disk.
1.3rd step: overall HotSpare disk creation module finds out the disk being in idle condition in described disc array system, and therefrom select a capacity to be not less than the disk of overall HotSpare disk minimum capacity requirement, the 4KB space started by the 0x1000 of its storage space is as superblock storage area; From 0x1000, deposit overall HotSpare disk zone bit, be created as overall HotSpare disk, in this example, overall HotSpare disk zone bit is set as " 0x55554444 ", and operating result is sent to human-computer interaction module.
The list of overall situation HotSpare disk checks that flow process comprises the 2.1st step to the 2.2nd step, is specially:
2.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk search instruction by human-computer interaction module to overall HotSpare disk retrieval module;
2.2nd step: after overall HotSpare disk retrieval module receives overall HotSpare disk search instruction, produces overall HotSpare disk list, and sends to human-computer interaction module;
The method of described generation overall HotSpare disk list is: during every secondary generation overall situation HotSpare disk list, regenerate overall HotSpare disk list of the same name, then from described disc array system the superblock of each disk start address read the content of overall HotSpare disk zone bit length, if it is overall HotSpare disk zone bit, then the title of this block disk is joined in overall HotSpare disk list.
RAID low-quality disk detection procedure comprises the 3.1st step to the 3.4th step, is specially:
3.1st step: in disc array system normal course of operation, the timing of RAID low-quality disk check processing module is monitored the state of RAID all in described disc array system;
3.2nd step: (this RAID's RAID low-quality disk check processing module still can work once find to be in critical state by certain RAID, but the member that the number of working disks is less than this RAID coils sum), and when this RAID does not have a local HotSpare disk, send overall HotSpare disk recall signal to overall HotSpare disk retrieval module;
3.3rd step: overall HotSpare disk retrieval module, after receiving the overall HotSpare disk recall signal that RAID low-quality disk check processing module sends, produces overall HotSpare disk list, and sends to RAID low-quality disk check processing module;
3.4th step: after RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID being in critical state, capacity is minimum at capacity, this overall HotSpare disk is used to replace the low-quality disk be in the RAID of critical state, the content of superblock storage area on overall HotSpare disk is rewritten by mdadm, then data reconstruction is performed, send to human-computer interaction module the RAID being in critical state after being replaced successfully and repaired information.
Overall situation HotSpare disk is deleted flow process and is comprised the 4.1st step to the 4.2nd step, is specially:
4.1st step: in disc array system normal course of operation, user sends overall HotSpare disk erasure signal with the overall HotSpare disk title that will delete to overall HotSpare disk removing module by human-computer interaction module;
4.2nd step: overall HotSpare disk removing module removes the content on this overall HotSpare disk in superblock, this dish is become free disk, and operating result is sent to human-computer interaction module.
Below in conjunction with specific embodiments technical scheme of the present invention is described; but these explanations can not be understood to limit scope of the present invention; protection scope of the present invention is limited by the claims of enclosing, and any change on the claims in the present invention basis is all protection scope of the present invention.

Claims (4)

1. support, without a disc array system for packet type overall situation HotSpare disk, to it is characterized in that: comprising: one or more RAID, one or more free disk, more than 0 or 1 overall HotSpare disk, system disk and storage operating system based on software PLC;
Described RAID is in (SuSE) Linux OS, uses "-C " or "-create " parameter of mdadm software to complete establishment, and is managed by mdadm; Have superblock structure example in the superblock of described RAID member, the first member of superblock structure example is the data of a regular length and content, is called RAID member flag position; Described RAID is redundancy RAID;
Described system disk is connected with each RAID, overall HotSpare disk and free disk; The content stored in system disk includes but not limited to: the storage operating system manage RAID and application program;
Described storage operating system is stored in system disk, be one and expanded the (SuSE) Linux OS of kernel, comprising: human-computer interaction module, overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module;
The function of described human-computer interaction module sends for user provides the interface that overall HotSpare disk creates the overall HotSpare disk title that instruction, overall HotSpare disk search instruction, overall HotSpare disk delete instruction and input will be deleted; Human-computer interaction module also exports the operating result of overall HotSpare disk creation module, overall HotSpare disk retrieval module, overall HotSpare disk removing module and RAID low-quality disk check processing module;
The function of described overall HotSpare disk creation module is receiving after overall HotSpare disk that human-computer interaction module sends creates instruction, for described disc array system creates an overall HotSpare disk, and establishment result sent to human-computer interaction module;
The function of described overall HotSpare disk retrieval module is receiving overall HotSpare disk search instruction that human-computer interaction module sends or after receiving the overall HotSpare disk search instruction that RAID low-quality disk check processing module sends, produce an overall HotSpare disk list, and result for retrieval is sent to human-computer interaction module;
The function of described overall HotSpare disk removing module is after receiving the human-computer interaction module overall HotSpare disk delete instruction sent and the overall HotSpare disk title that will delete, remove the content in superblock on this overall HotSpare disk, this dish is become free disk, and operating result is sent to human-computer interaction module;
The function of described RAID low-quality disk check processing module is: 1. timing detects the state of all RAID in described disc array system, and the state of RAID comprises: init state, rebuilding state, online state, warning state, critical state, offline state; When certain RAID of discovery is in critical state, namely this RAID still can work, but the member that the number of working disks is less than this RAID coils sum, and when this RAID does not have a local HotSpare disk, sends overall HotSpare disk recall signal to overall HotSpare disk retrieval module; 2. when RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID comprising low-quality disk, capacity is minimum at capacity, low-quality disk is replaced, and completes information to the reparation of human-computer interaction module transmission low-quality disk;
In the state of RAID, init state represents carries out initialization, namely applies the process that mdadm creates RAID; Rebuilding represents and rebuilds, and namely utilizes HotSpare disk to replace the low-quality disk in RAID; Online is that initialization completes, the state that can normally use; This RAID of warning, critical, offline state representation has problems, for the RAID5 that three pieces of disks are set up, after initialization completes, for online state, and wherein one piece of hard disk generation physical fault time, this RAID is critical state, but still can continue to use, if there are two pieces of disk failures, this RAID can become offline state, cannot continue to use; Warning state only has in RAID6, and a RAID6 set up by four pieces of disks, and have one piece of disk failures to be warning state, two pieces of disk failures are critical, and three pieces of faults are then offline states;
Based on software PLC support without the annexation of each functional module of the disc array system of packet type overall situation HotSpare disk be:
Storage operating system and application storage are on system disk, and system disk is arranged on the inner without the disc array system of packet type overall situation HotSpare disk based on software PLC support of the present invention's proposition, and system disk is connected with all RAID, overall HotSpare disk and free disk; Human-computer interaction module respectively with overall HotSpare disk creation module, overall HotSpare disk removing module, overall HotSpare disk retrieval module, RAID low-quality disk check processing model calling; Overall situation HotSpare disk creation module is connected with human-computer interaction module, all RAID, whole overall HotSpare disk and free disk respectively; Overall situation HotSpare disk removing module is connected with human-computer interaction module, whole overall HotSpare disk and free disk respectively; Overall situation HotSpare disk retrieval module is connected with RAID low-quality disk check processing module, human-computer interaction module, all RAID, overall HotSpare disk and free disk respectively; RAID low-quality disk check processing module is connected with overall HotSpare disk retrieval module, human-computer interaction module, all RAID, overall HotSpare disk and free disk respectively.
2. a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk as claimed in claim 1, it is characterized in that: its workflow comprises overall HotSpare disk visioning procedure, overall HotSpare disk list checks that flow process, RAID low-quality disk detection procedure and overall HotSpare disk delete flow process;
Overall situation HotSpare disk visioning procedure comprises the 1.1st step to the 1.3rd step, is specially:
1.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk by human-computer interaction module to overall HotSpare disk creation module and creates instruction;
1.2nd step: the content deposited in the superblock of overall HotSpare disk creation module according to RAID member's dishes all in described disc array system, calculates the minimum capacity requirement of overall HotSpare disk;
1.3rd step: overall HotSpare disk creation module finds out the disk being in idle condition in described disc array system, and therefrom select a capacity to be not less than the disk of overall HotSpare disk minimum capacity requirement, this free disk arranges superblock, and on free disk, the start address of superblock is identical with the start address of the superblock of RAID member; Then from the start address of the superblock of this free disk, deposit overall HotSpare disk zone bit, be created as overall HotSpare disk, and operating result is sent to human-computer interaction module; Described overall HotSpare disk zone bit is the data of a regular length preset and content, and they are different from the content of the RAID member flag position in the superblock of RAID member in described disc array system;
The list of overall situation HotSpare disk checks that flow process comprises the 2.1st step to the 2.2nd step, is specially:
2.1st step: described disc array system is in normal course of operation, and user sends overall HotSpare disk search instruction by human-computer interaction module to overall HotSpare disk retrieval module;
2.2nd step: after overall HotSpare disk retrieval module receives overall HotSpare disk search instruction, produces overall HotSpare disk list, and sends to human-computer interaction module;
The method of described generation overall HotSpare disk list is: from described disc array system the superblock of each disk start address, read the content of overall HotSpare disk zone bit length, if it is overall HotSpare disk zone bit, then the title of this block disk is joined in overall HotSpare disk list;
RAID low-quality disk detection procedure comprises the 3.1st step to the 3.4th step, is specially:
3.1st step: in disc array system normal course of operation, the timing of RAID low-quality disk check processing module is monitored the state of RAID all in described disc array system;
3.2nd step: RAID low-quality disk check processing module is once find that certain RAID is in critical state, namely this RAID still can work, but the member that the number of working disks is less than this RAID coils sum, and when this RAID does not have a local HotSpare disk, send overall HotSpare disk recall signal to overall HotSpare disk retrieval module;
3.3rd step: overall HotSpare disk retrieval module, after receiving the overall HotSpare disk recall signal that RAID low-quality disk check processing module sends, produces overall HotSpare disk list, and sends to RAID low-quality disk check processing module;
3.4th step: after RAID low-quality disk check processing module receives overall HotSpare disk list from overall HotSpare disk retrieval module, from overall HotSpare disk list, selection one is not less than the overall HotSpare disk of member's dish that in the RAID being in critical state, capacity is minimum at capacity, this overall HotSpare disk is used to replace the low-quality disk be in the RAID of critical state, the content of superblock storage area on overall HotSpare disk is rewritten by mdadm, then data reconstruction is performed, send to human-computer interaction module the RAID being in critical state after being replaced successfully and repaired information,
Overall situation HotSpare disk is deleted flow process and is comprised the 4.1st step to the 4.2nd step, is specially:
4.1st step: in disc array system normal course of operation, user sends overall HotSpare disk erasure signal with the overall HotSpare disk title that will delete to overall HotSpare disk removing module by human-computer interaction module;
4.2nd step: overall HotSpare disk removing module removes the content on this overall HotSpare disk in superblock, this dish is become free disk, and operating result is sent to human-computer interaction module.
3. a kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk as claimed in claim 2, it is characterized in that: the step that the minimum capacity calculating overall HotSpare disk described in the 1.2nd step of overall HotSpare disk visioning procedure requires comprises 1.2.1 step to 1.2.2 step, is specially:
1.2.1 walks: the capability value counting member's dish that capacity is minimum in all RAID member's dishes in described disc array system;
1.2.2 walks: in the result obtained from 1.2.1 step, find out the minimum capacity requirement of maximal value as overall HotSpare disk.
4. a kind of disc array system supporting without the overall HotSpare disk of packet type based on software PLC as described in Claims 2 or 3, it is characterized in that: the method for optimizing producing overall HotSpare disk list described in the 2.2nd step that flow process is checked in overall HotSpare disk list is: during every secondary generation overall situation HotSpare disk list, regenerate overall HotSpare disk list of the same name, then from described disc array system the superblock of each disk start address read the content of overall HotSpare disk zone bit length, if it is overall HotSpare disk zone bit, then the title of this block disk is joined in overall HotSpare disk list.
CN201210026590.6A 2012-02-08 2012-02-08 A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk Active CN103246478B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210026590.6A CN103246478B (en) 2012-02-08 2012-02-08 A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210026590.6A CN103246478B (en) 2012-02-08 2012-02-08 A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk

Publications (2)

Publication Number Publication Date
CN103246478A CN103246478A (en) 2013-08-14
CN103246478B true CN103246478B (en) 2015-11-25

Family

ID=48926017

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210026590.6A Active CN103246478B (en) 2012-02-08 2012-02-08 A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk

Country Status (1)

Country Link
CN (1) CN103246478B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106020727B (en) * 2016-05-25 2019-03-15 浪潮电子信息产业股份有限公司 A kind of method of Smart Rack performance tuning
CN108153622B (en) * 2016-12-06 2021-08-31 华为技术有限公司 Fault processing method, device and equipment
CN108282347A (en) * 2016-12-30 2018-07-13 航天信息股份有限公司 A kind of server data online management method and system
CN107391042A (en) * 2017-07-28 2017-11-24 郑州云海信息技术有限公司 The design method and system of a kind of disk array
CN107515731B (en) * 2017-07-31 2019-12-24 华中科技大学 Evolution storage system based on solid-state disk and working method thereof
CN108334280B (en) * 2017-12-28 2021-01-08 深圳创新科技术有限公司 RAID5 disk group fast reconstruction method and device
CN109189338B (en) * 2018-08-27 2021-06-18 郑州云海信息技术有限公司 Method, system and equipment for adding hot spare disk
CN108984133B (en) * 2018-08-27 2022-01-28 杭州阿姆科技有限公司 Method for realizing RAID in SSD
CN110046065A (en) * 2019-04-19 2019-07-23 苏州浪潮智能科技有限公司 A kind of storage array method for reconstructing, device, equipment and storage medium
CN111913647B (en) * 2019-05-08 2022-10-11 华为技术有限公司 Wear leveling method and device for storage equipment and related equipment
CN110928724B (en) * 2019-11-29 2023-04-28 重庆紫光华山智安科技有限公司 Global hot standby disc management method and device, storage medium and electronic equipment

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101825994A (en) * 2010-04-16 2010-09-08 苏州壹世通科技有限公司 Firmware-based flash memory array management device and method independent of operating system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7320052B2 (en) * 2003-02-10 2008-01-15 Intel Corporation Methods and apparatus for providing seamless file system encryption and redundant array of independent disks from a pre-boot environment into a firmware interface aware operating system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101825994A (en) * 2010-04-16 2010-09-08 苏州壹世通科技有限公司 Firmware-based flash memory array management device and method independent of operating system

Also Published As

Publication number Publication date
CN103246478A (en) 2013-08-14

Similar Documents

Publication Publication Date Title
CN103246478B (en) A kind of based on the disc array system of software PLC support without packet type overall situation HotSpare disk
US8103825B2 (en) System and method for providing performance-enhanced rebuild of a solid-state drive (SSD) in a solid-state drive hard disk drive (SSD HDD) redundant array of inexpensive disks 1 (RAID 1) pair
CN101276302B (en) Magnetic disc fault processing and data restructuring method in magnetic disc array system
EP3617867B1 (en) Fragment management method and fragment management apparatus
CN102033786B (en) Method for repairing consistency of copies in object storage system
US20030014584A1 (en) Storage device with I/O counter for partial data reallocation
CN103534688B (en) Data reconstruction method, memory device and storage system
US8386837B2 (en) Storage control device, storage control method and storage control program
CN102682012A (en) Method and device for reading and writing data in file system
CN102799533B (en) Method and apparatus for shielding damaged sector of disk
CN101567211A (en) Method for improving usability of disk and disk array controller
CN101916173A (en) RAID (Redundant Array of Independent Disks) based data reading and writing method and system thereof
CN103019623B (en) Memory disc disposal route and device
US20090198942A1 (en) Storage system provided with a plurality of controller modules
WO2012089152A1 (en) Method and device for implementing redundant array of independent disk protection in file system
WO2015058542A1 (en) Reconstruction method and device for redundant array of independent disks
CN103544995B (en) A kind of bad track repairing method and bad track repairing device
CN102999399A (en) Method and device of automatically restoring storage of JBOD (just bundle of disks) array
US8433949B2 (en) Disk array apparatus and physical disk restoration method
CN103049407B (en) Date storage method, Apparatus and system
CN104484135A (en) Method and device for quickly reading data
CN111857540A (en) Data access method, device and computer program product
WO2021088367A1 (en) Data recovery method and related device
US7600151B2 (en) RAID capacity expansion interruption recovery handling method and system
CN102262657A (en) Method and system for storing multimedia data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant