JP2004213441A

JP2004213441A - Disk array apparatus

Info

Publication number: JP2004213441A
Application number: JP2003000849A
Authority: JP
Inventors: Kazutoshi Motoo; 和敏本尾
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2003-01-07
Filing date: 2003-01-07
Publication date: 2004-07-29
Anticipated expiration: 2023-01-07
Also published as: JP4158525B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a disk array apparatus in which the danger of the recurrence of a fault is reduced by performing fault restoration while considering the product classification of an HDD (hard disk drive) causing the fault, the contents of the fault and the constitution of an array rank. <P>SOLUTION: A fault history management part 108 manages fault history including the product classification of HDDs in which faults arise and the contents of the faults. When a faulty HDD occurs in an HDD group 104, an array rank changing part 106 determines whether the possibility of the recurrence of a fault having the same fault contents as the produced fault in an HDD of the same product classification of the HDD causing the fault is high or not. When determining the high possibility of the recurrence, the constitution of an array rank using the HDD of the same product classification as the HDD causing the fault as a constitutional element is changed to constitution whose reliability is higher than that of present constitution. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、複数の物理ディスクを有するディスクアレイ装置に関し、特に、ディスクアレイ装置の信頼性を向上させる技術に関する。
【０００２】
【従来の技術】
ディスクアレイ装置は、物理ディスクであるハードディスクドライブ（ＨＤＤ）の大容量化により、数十ＴＢ（テラバイト）を有する規模になってきている。現在、一台のハードディスクの記憶容量は１００ＧＢ（ギガバイト）を越えるレベルに達しており、今後もますます大容量化が図られると考えられる。これに伴い、ディスクアレイ装置も大容量になると思われる。
【０００３】
ところで、ディスクアレイ装置の特徴の１つとして、アレイランクを構成する複数のＨＤＤの内の１台に障害が発生した場合、アレイランク内の他の正常なＨＤＤを使用してデータアクセスを継続できる点が挙げられる。
【０００４】
この点について、図１１及び図１２を参照して説明する。今、アレイランクが、図１１に示すように４台のＨＤＤ０１〜０４によって構成されているとする。また、データＤが３つのデータＤ１〜Ｄ３に３分割されてＨＤＤ０１〜０３に格納され、データＤ１〜Ｄ３の冗長データであるパリティＰがＨＤＤ０４に格納されているとする。このような構成を有するアレイランクにおいて、図１２に示すように、ＨＤＤ０３に障害が発生したとする。
【０００５】
ＨＤＤ０３に障害が発生した後、その障害が復旧する前に、ホストコンピュータからＨＤＤ０３に格納されているデータＤ３に対するリード指令が出力された場合は、アレイランク内の正常なＨＤＤ０１，０２，０４からデータＤ１，Ｄ２及びパリティＰを読み出し、それに基づいてデータＤ３を復元し、ホストコンピュータに転送する。
【０００６】
また、障害復旧前にホストコンピュータからＨＤＤ０３に対してライト指令が出力された場合には、新たなパリティＰ’を生成し、それをＨＤＤ０４に書き込む。また、ホストコンピュータから正常なＨＤＤ（例えば、ＨＤＤ０１とする）に対するライト指令が出力された場合には、新たなデータＤ１’をＨＤＤ０１に書き込むと共に、新たなパリティＰ’を生成し、ＨＤＤ０４に書き込む。
【０００７】
以上のように、ディスクアレイ装置においては、アレイランクを構成する複数のＨＤＤの内の１つに障害が発生しても、他の正常なＨＤＤを使用してデータアクセスを継続することができる。
【０００８】
また、障害の発生したＨＤＤ０３が交換された場合、或いは予めスペアＨＤＤが用意されている場合には、障害復旧処理が行われる。障害復旧処理では、正常なＨＤＤ０１，０２，０４に格納されているデータＤ１，Ｄ２及びパリティＰに基づいてデータＤ３を復元し、復元したデータＤ３をスペアＨＤＤ或いは交換されたＨＤＤに書き戻す。
【０００９】
上記した障害復旧処理は、従来一般的に行われているものであるが、これ以外にも、複数のドライブに構成される記憶領域を複数のボリュームに分割し、障害発生時には、アクセス頻度の高いボリュームから優先的に、障害ドライブに格納されているデータをスペアドライブに復旧するようにした技術や（例えば、特許文献１参照）、ＨＤＤに障害が発生した時、複数の論理ディスクの内の優先論理ディスクについては、障害の発生したＨＤＤに格納されているデータを不揮発性の半導体メモリ上に復元するようにした技術（例えば、特許文献２参照）も知られている。
【００１０】
上述した従来の技術は、障害の発生したＨＤＤに格納されていたデータを、スペアＨＤＤ等の他のＨＤＤ上に復元する技術であり、アレイランクの構成を変更するものではない。なお、アレイランクの構成を変更する技術としては、ライトデータ量に応じてＲＡＩＤクラスを変更する技術（例えば、特許文献３参照）や、ユーザがデータ保持部に保持されているアレイ構成情報を変更すると、ディスクインタフェース変更部がディスクインタフェース接続部に対して、アレイ制御部の配線とディスク装置群の配線との接続状態を、変更後のアレイ構成情報に応じたものにすることを指示する技術（例えば、特許文献４参照）が知られているが、復旧処理に関連するものではない。
【００１１】
【特許文献１】
特開平１１−２４８５０号公報
【特許文献２】
特開２００１−１７５４２３号公報
【特許文献３】
特開平６−１８７２４９号公報
【特許文献４】
特開２０００−２９８５５７号公報
【００１２】
【発明が解決しようとする課題】
上述したように、従来、一般的に行われている障害復旧処理や、特許文献１，２に記載されている技術は、障害の発生したＨＤＤに格納されていたデータを、スペアＨＤＤ等の他の物理ディスク上に復元するだけであり、障害の発生した物理ディスクの製品種別や、障害内容や、アレイランクの構成を考慮していないため、障害復旧後に再び障害が発生する可能性が高いという問題がある。
【００１３】
そこで、本発明の目的は、障害の発生した物理ディスクの製品種別や、障害内容や、アレイランクの構成を考慮して障害復旧を行うことにより、障害が再発する危険性を少なくすることにある。
【００１４】
【課題を解決するための手段】
本発明にかかる第１のディスクアレイ装置は、上記目的を達成するため、
複数の物理ディスクを有するディスクアレイ装置において、
前記複数の物理ディスクの内の何れかに障害が発生したとき、前記発生した障害と同一障害内容の障害が、前記障害の発生した物理ディスクと同一製品種別の物理ディスクにおいて再発する可能性が高いか否かを判定し、再発する可能性が高いと判定した場合は、前記障害の発生した物理ディスクと同一製品種別の物理ディスクを構成要素としているアレイランクの構成を、現時点の構成よりも信頼性の高い構成に変更する手段を備えたことを特徴とする。
【００１５】
より具体的には、本発明にかかる第２のディスクアレイ装置は、
複数の物理ディスクを有するディスクアレイ装置において、
障害の発生した物理ディスクの製品種別と前記障害の障害内容とを含む障害履歴を管理する障害履歴管理部と、
前記複数の物理ディスクの内の何れかに障害が発生したとき、前記障害履歴管理部で管理されている障害履歴に基づいて、前記発生した障害と同一障害内容の障害が、前記障害の発生した物理ディスクと同一製品種別の物理ディスクにおいて再発する可能性が高いか否かを判定し、再発する可能性が高いと判定した場合は、前記障害の発生した物理ディスクと同一製品種別の物理ディスクを構成要素としているアレイランクの構成を、現時点の構成よりも信頼性の高い構成に変更するアレイランク変更部を備えたことを特徴とする。
【００１６】
また、本発明にかかる第３のディスクアレイ装置は、第２のディスクアレイ装置において、
現時点のアレイランク構成をどのように変更するのかを示す変更ポリシー情報が格納された変更ポリシー情報記憶部を備え、且つ、
前記アレイランク変更部が、構成変更対象にしているアレイランクの現時点の構成と、前記変更ポリシー情報記憶部に格納されている変更ポリシー情報とに基づいて、前記構成変更対象にしているアレイランクの構成を変更する構成を有することを特徴とする。
【００１７】
また、本発明にかかる第４のディスクアレイ装置は、第３のディスクアレイ装置において、
前記変更ポリシー情報記憶部には、信頼性の異なるアレイランクの構成が複数格納されており、
前記アレイランク変更部が、構成変更対象にしているアレイランクの構成を、前記変更ポリシー情報記憶部に格納されている各構成の内の、前記構成対象にしているアレイランクの現時点の構成よりも１ランク信頼性が高い構成に変更する構成を有することを特徴とする。
【００１８】
また、本発明にかかる第５のディスクアレイ装置は、第４のディスクアレイ装置において、
スペアの物理ディスクを複数備え、且つ、
前記アレイランク変更部が、前記スペアの物理ディスクを用いて、前記構成変更対象にしているアレイランクよりも１ランク信頼性の高いアレイランクを構成し、前記構成変更対象にしているアレイランクの構成要素中の正常な物理ディスクをスペアの物理ディスクに変更する構成を有することを特徴とする。
【００１９】
また、本発明にかかる第６のディスクアレイ装置は、第５のディスクアレイ装置において、
前記変更ポリシー情報記憶部には、或るアレイランクの構成よりも１ランク信頼性が高い構成として、ＲＡＩＤクラスが同一で、構成要素とする物理ディスクの台数が１台少ない構成が格納されていることを特徴とする。
【００２０】
また、本発明にかかる第７のディスクアレイ装置は、第６のディスクアレイ装置において、
アレイランク毎に、そのアレイランクを構成する物理ディスクを示す情報及びＲＡＩＤクラスが格納されたアレイランク構成情報記憶部を備え、
前記アレイランク変更部が、前記アレイランク構成情報記憶部の内容を書き換えることによりアレイランクの構成を変更する構成を有することを特徴とする。
【００２１】
また、本発明にかかる第８のディスクアレイ装置は、第２〜第７の何れか１つのディスクアレイ装置において、
ユーザによって入力される履歴参照要求に応答して前記障害履歴管理部で管理されている障害履歴情報を表示部に表示し、ユーザによって入力される履歴変更要求に応答して前記障害履歴管理部が管理している障害履歴情報を変更する履歴情報変更／参照部を備えたことを特徴とする。
【００２２】
また、本発明にかかる第９のディスクアレイ装置は、第３〜第７記載の何れか１つのディスクアレイ装置において、
ユーザによって入力される変更ポリシー参照要求に応答して前記変更ポリシー情報記憶部に格納されている変更ポリシー情報を表示部に表示し、ユーザによって入力される変更ポリシー変更要求に応答して前記変更ポリシー情報記憶部に格納されている変更ポリシー情報を変更する変更ポリシー変更／参照部を備えたことを特徴とする。
【００２３】
【作用】
ディスクアレイ装置を構成している複数の物理ディスクの内の何れかに障害が発生したとき、例えば、障害履歴などに基づいて、上記発生した障害と同一障害内容の障害が、上記障害の発生した物理ディスクと同一製品種別の物理ディスクにおいて再発する可能性が高いか否かを判定する。そして、再発する可能性が高いと判定した場合は、上記障害の発生した物理ディスクと同一製品種別の物理ディスクを構成要素としているアレイランクの構成を、現時点の構成よりも信頼性の高い構成に変更する。現時点の構成よりも信頼性の高い構成としては、例えば、ＲＡＩＤクラスが同一で、物理ディスク数が１台少ない構成を採用することができる。このように、障害の発生した物理ディスクの製品種別や、障害内容や、アレイランクの構成を考慮して障害を復旧するようにしているので、障害復旧後に障害が再発する危険性を少なくすることができる。
【００２４】
【発明の実施の形態】
次に本発明の実施の形態について図面を参照して詳細に説明する。
【００２５】
図１は本発明の一実施例のブロック図であり、ディスクアレイ装置１と、ホストコンピュータ２と、両者を接続する上位Ｉ／Ｆ（上位インタフェース）３とから構成されている。
【００２６】
ディスクアレイ装置１は、上位Ｉ／Ｆ制御部１０１と、アレイ制御部１０２と、ＨＤＤ（ハードディスクドライブ）制御部１０３と、ＨＤＤ群１０４と、変更ポリシー情報記憶部１０５と、アレイランク変更部１０６と、ＨＤＤ状態情報記憶部１０７と、障害履歴管理部１０８と、障害履歴情報記憶部１０９と、アレイランク構成情報記憶部１１０とから構成されている。
【００２７】
ＨＤＤ群１０４は、１５台のＨＤＤ０１〜１５から構成されており、各ＨＤＤ０１〜１５にはそれぞれＨＤＤ番号「０１」〜「１５」が付与されている。また、ＨＤＤ０１〜０５により、アレイランク番号「０１」のアレイランクが構成され、ＨＤＤ０６〜１０によりアレイ番号「０２」のアレイランクが構成されている。アレイランク番号「０１」，「０２」のアレイランクは共にＲＡＩＤクラスが「ＲＡＩＤ−５」のアレイランクであり、アレイランク番号「０１」のアレイランクには、アドレス「Ａ０」〜「Ａ１」が割り当てられ、アレイランク番号「０２」のアレイランクには、「アドレス（Ａ１＋１）」〜「Ａ２」が割り当てられている。また、ＨＤＤ１１〜１５は、スペアのＨＤＤとなっている。
【００２８】
アレイランク構成情報記憶部１１０には、ＨＤＤ群１０４内に存在する各アレイランクの構成を示す情報や、各アレイランクに割り当てられているアドレスや、スペアのＨＤＤを示す情報が格納されている。
【００２９】
図２にアレイランク構成情報記憶部１１０の内容例を示す。同図に示すように、アレイランク構成情報記憶部１１０には、アレイランク番号と、ＲＡＩＤクラスと、ＨＤＤ構成と、構成要素とするＨＤＤのＨＤＤ番号と、アドレスとが格納される。図２の例は、アレイランク番号「０１」，「０２」のアレイランクは、それぞれＨＤＤ番号「０１」〜「０５」，「０６」〜「１０」のＨＤＤ０１〜０５，０６〜１０によって構成され、ＲＡＩＤクラスが共に「ＲＡＩＤ−５」で、ＨＤＤ構成が共に「４Ｄ＋１Ｐ」で、割り当てられているアドレスがそれぞれ「Ａ０」〜「Ａ１」，「（Ａ１＋１）」〜「Ａ２」であることを示している。更に、この例は、ＨＤＤ番号「１１」〜「１５」のＨＤＤ１１〜１５は、スペアのＨＤＤであることを示している。
【００３０】
ＨＤＤ状態情報記憶部１０７には、各ＨＤＤ０１〜１５の製品種別や、各ＨＤＤ０１〜１５の現在の状態を示す情報（「ＮＯＲＭＡＬ」か「ＡＢＮＯＲＭＡＬ」）や、障害内容が格納される。
【００３１】
図３にＨＤＤ状態情報記憶部１０７の内容例を示す。同図に示すように、ＨＤＤ状態情報記憶部１０７には、ＨＤＤ番号と、ベンダーと、製品名と、バージョンと、現在の状態と、障害内容とが格納される。この例では、ベンダー，製品名及びバージョンによりＨＤＤの製品種別を表すようにしている。また、図３の例は、ＨＤＤ０１〜０５，１１〜１３がベンダー「ＮＮＮ」，製品名「ＡＡＡ」，バージョン「１．１０」の製品で、ＨＤＤ０６〜１０，１４，１５がベンダー「ＨＨＨ」，製品名「ＢＢＢ」，バージョン「２．００」の製品であることを示している。更に、図３の例は、ＨＤＤ０１〜１５の内、ＨＤＤ１０にのみ障害内容「タイムアウト」の障害（ＡＢＮＯＲＭＡＬ）が発生し、他のＨＤＤは正常（ＮＯＲＭＡＬ）であることを示している。
【００３２】
障害履歴情報記憶部１０９には、ＨＤＤ群１０４において発生した障害の履歴を示す情報が格納される。
【００３３】
図４に障害履歴情報記憶部１０９の内容例を示す。同図に示すように、障害履歴情報記憶部１０９には、履歴番号と、ベンダーと、製品名と、バージョンと、障害内容とが格納される。図４の例は、ベンダー「ＮＮＮ」，製品名「ＡＡＡ」，バージョン「１．１０」のＨＤＤにおいて、「エラー訂正不可能なデータエラー」が過去に１回発生し、ベンダー「ＨＨＨ」，製品名「ＢＢＢ」，バージョン「２．００」のＨＤＤにおいて「タイムアウト」が過去に２回発生していることを示している。
【００３４】
変更ポリシー情報記憶部１０５には、アレイランクの構成を変更する際のポリシーが格納されている。
【００３５】
図５に変更ポリシー情報記憶部１０５の内容例を示す。同図に示すように、変更ポリシー情報記憶部１０５には、フェーズ番号と、ＲＡＩＤクラスと、ＨＤＤ構成とが格納される。ＲＡＩＤクラスとＨＤＤ構成とによって、アレイランクの構成が特定されており、フェーズ番号が若いものほど信頼性の高い構成になっている。フェーズ番号「０１」のアレイランク構成は、２台のＨＤＤに同一データを格納したミラー構成である。フェーズ番号「０２」〜「０４」のアレイランク構成は共にＲＡＩＤクラス「ＲＡＩＤ−５」のアレイランク構成であるが、構成要素であるＨＤＤの台数はフェーズ番号が１つ増える毎に１台増えている。即ち、フェーズ番号が若いものほど、信頼性の高い構成となっている。これらの情報は、ホストコンピュータ２から上位Ｉ／Ｆ３，上位Ｉ／Ｆ制御部１０１を介して変更ポリシー情報記憶部１０５に格納される。
【００３６】
上位Ｉ／Ｆ制御部１０１は、上位Ｉ／Ｆ３を介してホストコンピュータ２との間でデータ等をやり取りする機能を有する。
【００３７】
アレイ制御部１０２は、上位Ｉ／Ｆ制御部１０１を介してホストコンピュータ２から送られてくるリード指令やライト指令に従ってＨＤＤ制御部１０３を制御することにより、ＨＤＤ群１０４中の該当するＨＤＤからデータを読み出したり、データを書き込む機能を有する。
【００３８】
ＨＤＤ制御部１０３は、アレイ制御部１０２の制御に従ってＨＤＤ群１０４中の該当するＨＤＤからデータを読み出したり、データを書き込む機能を有する。更に、ＨＤＤ制御部１０３は、障害の発生したＨＤＤを検出する機能や、障害の発生したＨＤＤを検出した時、ＨＤＤ状態情報記憶部１０７中の上記ＨＤＤの現在状態を「ＡＢＮＯＲＭＡＬ」に変更すると共に障害内容を書き込む機能を有する。
【００３９】
障害履歴情報管理部１０８は、ＨＤＤ状態情報記憶部１０７を監視しており、現在の状態がＡＢＮＯＲＭＡＬに変更されたＨＤＤが発生した時、そのＨＤＤのベンダー，製品名，バージョン及び障害内容を障害履歴情報記憶部１０９に追加登録する機能や、追加登録完了後にアレイランク変更部１０６に対して起動指示を出力する機能を有する。
【００４０】
アレイランク変更部１０６は、障害履歴管理部１０８から起動指示が入力されると、障害履歴情報記憶部１０９の記憶内容に基づいて今回発生した障害と同一内容の障害が、今回障害の発生したＨＤＤと同一製品種別のＨＤＤにおいて再発する可能性が高いか否かを判定し、再発する可能性が高いと判断した場合、変更ポリシー情報記憶部１０５の内容に基づいて上記製品種別のＨＤＤを構成要素としているアレイランクの構成を、現時点の構成よりも信頼性の高い構成に変更する機能を有する。なお、構成の変更方法については、後で詳細に説明する。
【００４１】
【実施例の動作の説明】
次に、本実施例の動作について詳細に説明する。
【００４２】
先ず、リード時の動作について説明する。
【００４３】
ホストコンピュータ２は、データのリード時、リード指令（アドレス，種別，サイズを含む）を出力する。このリード指令は、上位Ｉ／Ｆ３，上位Ｉ／Ｆ制御部１０１を介してアレイ制御部１０２に送られる。
【００４４】
アレイ制御部１０２は、リード指令を受け取ると、アレイランク構成情報記憶部１１０に格納されている各アレイランクに割り当てられているアドレスに基づいて、最初にリードするデータが格納されているアクセス対象ＨＤＤを求める（図６、ステップＳ６２）。
【００４５】
その後、アレイ制御部１０２は、ＨＤＤ状態情報記憶部１０７を参照し、アクセス対象ＨＤＤの現在の状態が「ＮＯＲＭＡＬ」であるか否かを調べる（ステップＳ６３）。
【００４６】
そして、「ＮＯＲＭＡＬ」である場合、即ちアクセス対象ＨＤＤが正常である場合（ステップＳ６３がＹＥＳ）は、ＨＤＤ制御部１０３を制御してアクセス対象ＨＤＤからデータをリードし、ホストコンピュータ２へ転送する（ステップＳ６４，Ｓ６５）。
【００４７】
これに対して、「ＡＢＮＯＲＭＡＬ」である場合、即ちアクセス対象ＨＤＤに障害が発生している場合（ステップＳ６３がＮＯ）は、ＨＤＤ制御部１０３を制御してアクセス対象ＨＤＤと同一アレイランク内の他の正常なＨＤＤに格納されているデータをリードし、それらに基づいてアクセス対象ＨＤＤに格納されているデータを復元する（ステップＳ６６，Ｓ６７）。その後、復元したデータをホストコンピュータ２へ転送する。以上の処理をリード指令によって指令された全てのデータをリードするまで（ステップＳ６１がＹＥＳとなるまで）繰り返し行う。
【００４８】
次に、ライト時の動作について説明する。
【００４９】
ホストコンピュータ２は、データライト時、ライト指令（アドレス，種別，サイズを含む）を出力する。このライト指令は、上位Ｉ／Ｆ３，上位Ｉ／Ｆ制御部１０１を介してアレイ制御部１０２へ送られる。
【００５０】
アレイ制御部１０２は、ライト指令を受け取ると、アレイランク構成情報記憶部１１０に格納されている情報に基づいて、最初にデータをライトするアクセス対象ＨＤＤを求める（図７，ステップＳ７２）。
【００５１】
その後、アレイ制御部１０２は、ＨＤＤ状態情報記憶部１０７を参照し、アクセス対象ＨＤＤの現在の状態が「ＡＢＮＯＲＭＡＬ」であるか否かを調べる（ステップＳ７３）。
【００５２】
そして、アクセス対象ＨＤＤの内の少なくとも１つの現在の状態が「ＡＢＮＯＲＭＡＬ」である場合（ステップＳ７３がＹＥＳ）は、ホストコンピュータ２から送られてくるライトデータと、ＨＤＤ制御部１０３を制御することにより読み込んだパリティＰとから新たなパリティＰ’を生成し、新たなパリティＰ’を該当するＨＤＤに書き込むと共に、他の正常なＨＤＤに対して該当するデータをライトする（ステップＳ７５）。
【００５３】
これに対して、アクセス対象ＨＤＤの現在の状態が全て「ＮＯＲＭＡＬ」である場合は（ステップＳ７３がＮＯ）、ＨＤＤ状態情報記憶部１０７を参照し、パリティが格納されているＨＤＤの現在の状態が「ＡＢＮＯＲＭＡＬ」であるか否かを調べる（ステップＳ７４）。
【００５４】
そして、パリティの格納されているＨＤＤの現在の状態が「ＡＢＮＯＲＭＡＬ」である場合（ステップＳ７４がＹＥＳ）は、ＨＤＤ制御部１０３を制御し、アクセス対象ＨＤＤにホストコンピュータ２から送られてくるデータを書き込む（ステップＳ７６）。
【００５５】
これに対して、パリティの格納されているＨＤＤの現在の状態が「ＮＯＲＭＡＬ」である場合（ステップＳ７４がＮＯ）は、ホストコンピュータから送られてくるライトデータと、ＨＤＤ制御部１０３を制御することにより読み込んだパリティＰとから新たなパリティＰ’を生成する（ステップＳ７７）。その後、ステップＳ７７で生成した新たなパリティＰ’を該当するＨＤＤに書き込むと共に、ホストコンピュータ２から送られてきたライトデータをアクセス対象ＨＤＤに書き込む（ステップＳ７８）。以上の処理を、ライト指令によって指令されたデータを全て書き込むまで（ステップＳ７１がＹＥＳとなるまで）、繰り返し行う。
【００５６】
次に、アレイランクの構成変更時の動作について説明する。
【００５７】
今、例えば、ＨＤＤ番号「１０」のＨＤＤ１０に障害が発生したとする。ＨＤＤ制御部１０３は、ＨＤＤ１０に障害が発生したことを検出すると、ＨＤＤ状態情報記憶部１０７に格納されているＨＤＤ１０の現在の状態を「ＮＯＲＭＡＬ」から「ＡＢＮＯＲＭＡＬ」に変更すると共に、障害内容（例えば、「タイムアウト」とする）を書き込む。この処理により、ＨＤＤ状態情報記憶部１０７の内容は、図３に示すものとなる。
【００５８】
障害履歴管理部１０８は、ＨＤＤ状態情報記憶部１０７を監視しており、ＨＤＤ番号「１０」に対応する現在の状態が「ＡＢＮＯＲＭＡＬ」になったことを検出すると、ＨＤＤ番号「１０」に対応するベンダー「ＨＨＨ」，製品名「ＢＢＢ」，バージョン「２．００」及び障害内容「タイムアウト」を障害履歴情報記憶部１０９に追加登録する。今、例えば、上記処理により、障害履歴情報記憶部１０９に履歴番号「００００３」の履歴情報が追加登録され、障害履歴情報記憶部１０９の内容が図４に示すものになったとする。その後、障害履歴管理部１０８は、アレイランク変更部１０６に対して起動指示を出力する。
【００５９】
アレイランク変更部１０６は、障害履歴管理部１０８から起動指示が加えられると、障害履歴情報記憶部１０９に格納されている障害履歴情報の内の、最も履歴番号が大きい障害履歴情報を参照し、障害の発生したＨＤＤ１０の製品種別及び障害内容を認識する（図８，ステップＳ８１）。この例の場合、アレイランク変更部１０６は、履歴番号「００００３」の障害履歴情報を参照し、障害の発生したＨＤＤ１０の製品種別が、ベンダー「ＨＨＨ」，製品名「ＢＢＢ」，バージョン「２．００」であることを認識すると共に、障害内容が「タイムアウト」であることを認識する。
【００６０】
その後、アレイランク変更部１０６は、今回発生した障害（タイムアウト）と同一障害内容の障害が、今回障害の発生したＨＤＤ１０と同一製品種別のＨＤＤにおいて再発する可能性が高いか否かを判定する（ステップＳ８２）。この判断は、例えば、次のようにして行う。先ず、障害履歴情報記憶部１０９を検索し、履歴番号が最も大きい障害履歴情報（この例では、履歴番号「００００３」の障害履歴情報）と同一の製品種別および障害内容を含んでいる障害履歴情報の数を求める。その後、上記求めた障害履歴情報の数と予め定められている所定数とを比較する。そして、予め定められている所定数の方が小さい場合には、再発する可能性が高いと判定し、そうでない場合は再発する可能性が低いと判定する。
【００６１】
そして、再発の可能性が低いと判定した場合（ステップＳ８２がＮＯ）は、アレイランク変更部１０６は、その処理を終了する。
【００６２】
これに対して、再発の可能性が高いと判定した場合（ステップＳ８２がＹＥＳ）は、ＨＤＤ状態情報記憶部１０７を参照し、障害の発生したＨＤＤ１０と同一の製品種別のＨＤＤのＨＤＤ番号を取得する（ステップＳ８３）。この例の場合、ＨＤＤ１０と同一製品種別のＨＤＤのＨＤＤ番号として「０６」〜「０９」，「１４」，「１５」が取得される。
【００６３】
その後、アレイランク変更部１０６は、アレイランク構成情報記憶部１１０を参照し、障害の発生したＨＤＤ１０及びＨＤＤ１０と同一製品種別のＨＤＤ０６〜０９，１４，１５の内の何れかを構成要素にしているアレイランクのＲＡＩＤクラス及びＨＤＤ構成を取得する（ステップＳ８４）。今、例えば、アレイランク構成情報記憶部１１０の内容が図２に示すものであるとすると、アレイランク番号「０２」に対応して登録されているＲＡＩＤクラス「ＲＡＩＤ−５」及びＨＤＤ構成「４Ｄ＋１Ｐ」を取得することになる。
【００６４】
更に、アレイランク変更部１０６は、アレイランク構成情報記憶部１１０を参照し、スペアのＨＤＤのＨＤＤ番号を取得する（ステップＳ８５）。この例の場合、スペアのＨＤＤ番号として「１１」〜「１５」を取得することになる。
【００６５】
その後、アレイランク変更部１０６は、変更ポリシー情報記憶部１０５を参照し、ステップＳ８４で取得したＲＡＩＤクラス及びＨＤＤ構成によって特定されるアレイランクの構成よりも、１ランク信頼性の高いアレイランクの構成を示すＲＡＩＤクラス及びＨＤＤ構成を取得する（ステップＳ８６）。この処理を詳しく説明すると、次のようになる。先ず、変更ポリシー情報記憶部１０５を検索し、ステップＳ８４で取得したＲＡＩＤクラス「ＲＡＩＤ−５」，ＨＤＤ構成「４Ｄ＋１Ｐ」によって特定されるアレイランク構成が格納されているエントリを求める。この例の場合、フェーズ番号「０４」のエントリが求められる。その後、上記フェーズ番号「０４」よりも１つフェーズ番号が少ないフェーズ番号「０３」のエントリに格納されているＲＡＩＤクラス「ＲＡＩＤ−５」，ＨＤＤ構成「３Ｄ＋１Ｐ」を取得する。以上がステップＳ８６で行う処理の詳細である。
【００６６】
その後、アレイランク変更部１０６は、ステップＳ８５でＨＤＤ番号を取得したスペアのＨＤＤ１１〜１５の内の、ステップＳ８６で取得したＨＤＤ構成「３Ｄ＋１Ｐ」によって示される台数分（４台）のＨＤＤを使用して、アレイランクの構成を変更する（ステップＳ８７）。
【００６７】
このステップＳ８７の処理を詳しく説明すると、次のようになる。先ず、スペアのＨＤＤ１１〜１５の中から、構成変更に使用するＨＤＤを選択する。その際、障害の発生したＨＤＤとは製品種別の異なるＨＤＤを優先的に選択するようにする。今、例えば、構成変更に使用するＨＤＤとしてＨＤＤ番号が「１１」〜「１４」のＨＤＤを選択したとする。その後、アレイランク変更部１０６は、障害の発生したＨＤＤ１０に格納されていたデータを、他の正常なＨＤＤ０６〜０９に格納されているデータに基づいて復元しながら、ＨＤＤ０６〜１０を構成要素としているアレイランクに格納されていたデータをＨＤＤ１１〜１４を構成要素とするアレイランクに書き込む。その際、パリティを生成し、書き込む処理も行う。以上がステップＳ８７で行う処理の詳細である。ここで、ＨＤＤ数の少ないアレイランク構成に変更すると、構成変更前のアレイランクに格納されているデータ量によっては、データが溢れてしまい、構成変更後のアレイランクに全データを格納できなくなってしまう場合がある。このようなことを防ぐためには、例えば、各アレイランクに格納するデータ量を、フェーズ番号「０１」のアレイランクに格納可能なデータ量に制限すれば良い。このようにするためには、例えば、各フェーズ番号のアレイランクに割り当てるアドレスのサイズを同一値とし、且つ、その値をフェーズ番号「０１」のアレイランクが格納可能なデータ量に応じたものにすれば良い。
【００６８】
その後、アレイランク変更部１０６は、アレイランク構成情報記憶部１１０に格納されている、今回構成変更を行ったアレイランク番号「０２」のアレイランクの構成情報を変更する（ステップＳ８８）。この例の場合、アレイランク番号が「０２」のアレイランクの構成情報は、図９に示すように、ＲＡＩＤクラス「ＲＡＩＤ−５」，ＨＤＤ構成「３Ｄ＋１Ｐ」，構成要素「１１」，「１２」，「１３」，「１４」に変更される。アドレスは、元のままである。
【００６９】
なお、上述した実施例においては、ＨＤＤに障害が発生した場合、アレイランク変更部１０６が直ちに図８の流れ図に示す処理を開始するようにしたが、通常のＩ／Ｏ動作に悪影響を与えないようにするために、比較的負荷の軽いときに、図８に示す処理を実行するようにしても良い。また、上述した実施例においては、ＨＤＤの製品種別をベンダー，製品名およびバージョンにより表すようにしたが、これに限られるものではない。
【００７０】
【発明の他の実施例】
次に、本発明の他の実施例について説明する。本実施例は、障害履歴情報記憶部に格納されている障害履歴情報をコンソールから参照可能にすると共に、コンソールから障害履歴情報記憶部の障害履歴情報を変更可能にしたことを特徴とする。
【００７１】
図１０は本発明の他の実施例のブロック図であり、図１に示した実施例との相違点は、ディスクアレイ装置１の代わりにディスクアレイ装置１ａを備えている点およびコンソール４が追加されている点である。
【００７２】
ディスクアレイ装置１ａは、変更ポリシー変更／参照部１１１および履歴情報変更／参照部１１２を備えている点が図１に示したディスクアレイ装置１と相違している。
【００７３】
変更ポリシー変更／参照部１１１は、ユーザがコンソール４を用いて入力する変更ポリシー変更要求に従って変更ポリシー情報記憶部１０５に格納されている変更ポリシー情報を変更する機能や、ユーザがコンソール４を用いて入力する変更ポリシー参照要求に従って変更ポリシー情報記憶部１０５から変更ポリシーを読み出し、それをコンソール４の表示部（図示せず）に表示する機能を有する。
【００７４】
履歴情報変更／参照部１１２は、ユーザがコンソール４を用いて入力する履歴情報変更要求に従って障害履歴情報記憶部１０９に格納されている障害履歴情報を変更する機能や、ユーザがコンソール４を用いて入力する履歴情報参照要求に従って障害履歴情報記憶部１０９から障害履歴情報を読み出し、コンソール４の表示部に表示する機能を有する。
【００７５】
次に本実施例の動作について説明する。尚、変更ポリシー変更／参照部１１１及び履歴情報変更／参照部１１２以外の動作は図１に示した実施例と同じであるので、ここでは変更ポリシー変更／参照部１１１及び履歴情報変更／参照部１１２の動作についてのみ説明する。
【００７６】
ユーザは、変更ポリシー情報記憶部１０５に格納されている変更ポリシー情報を参照したい場合、コンソール４を用いて変更ポリシー参照要求を入力する。変更ポリシー変更／参照部１１１は、変更ポリシー参照要求が入力されると、変更ポリシー情報記憶部１０５から変更ポリシー情報を読み出し、コンソール４の表示部に表示する。
【００７７】
また、ユーザは、変更ポリシー情報記憶部１０５に格納されている変更ポリシー情報を変更する場合には、変更後の変更ポリシー情報を含んだ変更ポリシー変更要求をコンソール４から入力する。変更ポリシー変更／参照手段１１１は、変更ポリシー変更要求が入力されると、それに含まれている変更ポリシー情報で変更ポリシー情報記憶部１０５の内容を置き換える。
【００７８】
また、ユーザは、障害履歴情報記憶部１０９に格納されている障害履歴情報を参照したい場合、履歴参照要求をコンソール４から入力する。履歴情報変更／参照部１１２は、履歴参照要求が入力されると、障害履歴情報記憶部１０９から障害履歴情報を読み出し、それをコンソール４の表示部に表示する。
【００７９】
また、ユーザは、障害履歴情報記憶部１０９に格納されている障害履歴情報を変更したい場合には、変更後の履歴情報を含んだ履歴設定要求をコンソール４から入力する。履歴情報変更／参照手段１１２は、履歴設定要求が入力されると、それに含まれている障害履歴情報で障害履歴情報記憶部１０９の内容を置き換える。
【００８０】
このように、本実施例では、障害履歴情報を外部にあるコンソールから参照したり、変更できるので、例えば、稼働中のディスクアレイ装置内の障害履歴情報記憶部１０９に格納されている障害履歴情報を収集し、収集した障害履歴情報を、例えば、使用を開始しようとしているディスクアレイ装置等の他のディスクアレイ装置の障害履歴情報記憶部１０９に格納するといったことが可能になる。このようにすることにより、使用を開始しようとしているディスクアレイ装置等の他のディスクアレイ装置において、より適切な時期にアレイランクの構成変更が行えるようになる。また、変更ポリシー情報の変更／参照も可能となっているので、変更ポリシー情報がディスクアレイ装置の利用環境に合っているか否かを容易にチェックすることができると共に、変更ポリシー情報をディスクアレイ装置の利用環境に合ったものに容易に変更することができる。
【００８１】
【発明の効果】
以上説明したように、本発明のディスクアレイ装置は、障害の発生した物理ディスクの製品種別や、障害内容や、アレイランクの構成を考慮して障害を復旧するアレイランク変更部を備えているので、障害復旧後に障害が再発する危険性を少なくすることができる。
【００８２】
また、アレイランク変更部により障害を復旧した後は、アレイランクの信頼性が障害復旧前よりも高くなるので、それ以後に１つの物理ディスクに障害が発生したとしても、更に別の物理ディスクに障害が発生し、アレイランク内のデータが失われてしまうといった最悪の事態が発生する危険性を極めて少なくすることができる。
【００８３】
また、本発明のディスクアレイ装置は、履歴情報変更／参照部を備えているので、或るディスクアレイ装置の障害履歴情報を他のディスクアレイ装置に設定することが可能になる。その結果、上記他のディスクアレイ装置において、より適切な時期にアレイランクの構成を変更することが可能になる。
【００８４】
また、本発明のディスクアレイ装置は、変更ポリシー変更／参照部を備えているので、変更ポリシー情報がディスクアレイ装置の利用環境に合っているか否かを容易にチェックすることができると共に、変更ポリシー情報をディスクアレイ装置の利用環境に合ったものに容易に変更することができる。。
【図面の簡単な説明】
【図１】本発明の一実施例のブロック図である。
【図２】アレイランク構成情報記憶部１１０の内容例を示す図である。
【図３】ＨＤＤ状態情報記憶部１０７の内容例を示す図である。
【図４】障害履歴情報記憶部１０９の内容例を示す図である。
【図５】変更ポリシー情報記憶部１０５の内容例を示す図である。
【図６】リード時の処理例を示す流れ図である。
【図７】ライト時の処理例を示す流れ図である。
【図８】アレイランク変更部１０６の処理例を示す流れ図である。
【図９】アレイランクの構成を変更した後のアレイランク構成情報記憶部１１０の内容例を示す図である。
【図１０】本発明の他の実施例のブロック図である。
【図１１】アレイランクの構成例を示した図である。
【図１２】ＨＤＤに障害が発生したときの、リードアクセス，ライトアクセスを説明するための図である。
【符号の説明】
１，１ａ…ディスクアレイ装置
１０１…上位Ｉ／Ｆ制御部
１０２…アレイ制御部
１０３…ＨＤＤ制御部
１０４…ＨＤＤ群
１０５…変更ポリシー情報記憶部
１０６…アレイランク変更部
１０７…ＨＤＤ状態情報記憶部
１０８…障害履歴管理部
１０９…障害履歴情報記憶部
１１０…アレイランク構成情報記憶部
１１１…変更ポリシー変更／参照部
１１２…履歴情報変更／参照部
２…ホストコンピュータ
３…上位Ｉ／Ｆ
４…コンソール[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a disk array device having a plurality of physical disks, and more particularly, to a technique for improving the reliability of a disk array device.
[0002]
[Prior art]
The disk array device has a scale of several tens of TB (terabyte) due to an increase in capacity of a hard disk drive (HDD) as a physical disk. At present, the storage capacity of one hard disk has reached a level exceeding 100 GB (gigabyte), and it is expected that the capacity will be further increased in the future. With this, the capacity of the disk array device is expected to increase.
[0003]
Incidentally, as one of the features of the disk array device, when one of the plurality of HDDs constituting the array rank fails, data access can be continued using another normal HDD in the array rank. Points.
[0004]
This point will be described with reference to FIGS. Now, it is assumed that the array rank is constituted by four HDDs 01 to 04 as shown in FIG. It is also assumed that data D is divided into three pieces of data D1 to D3 and stored in HDDs 01 to 03, and parity P which is redundant data of data D1 to D3 is stored in HDD 04. In the array rank having such a configuration, it is assumed that a failure has occurred in the HDD 03 as shown in FIG.
[0005]
If a read command for the data D3 stored in the HDD 03 is output from the host computer after the occurrence of the failure in the HDD 03 and before the recovery from the failure, the data is output from the normal HDDs 01, 02, and 04 in the array rank. D1 and D2 and the parity P are read, and the data D3 is restored based on the data and transferred to the host computer.
[0006]
If a write command is output from the host computer to the HDD 03 before recovery from the failure, a new parity P ′ is generated and written to the HDD 04. When a write command for a normal HDD (for example, HDD01) is output from the host computer, new data D1 'is written to HDD01, and a new parity P' is generated and written to HDD04.
[0007]
As described above, in the disk array device, even if a failure occurs in one of the plurality of HDDs constituting the array rank, data access can be continued using another normal HDD.
[0008]
Further, when the failed HDD 03 is replaced or a spare HDD is prepared in advance, a failure recovery process is performed. In the failure recovery processing, the data D3 is restored based on the data D1, D2 and the parity P stored in the normal HDDs 01, 02, 04, and the restored data D3 is written back to the spare HDD or the replaced HDD.
[0009]
Although the above-described failure recovery processing is generally performed in the past, in addition to this, the storage area configured by a plurality of drives is divided into a plurality of volumes, and when a failure occurs, the frequency of access is high. A technique that preferentially recovers data stored in a failed drive from a volume to a spare drive (for example, see Patent Document 1), and when a failure occurs in an HDD, priority is given to a plurality of logical disks. With respect to a logical disk, a technique for restoring data stored in a failed HDD in a nonvolatile semiconductor memory is also known (for example, see Patent Document 2).
[0010]
The conventional technique described above is a technique for restoring data stored in a failed HDD to another HDD such as a spare HDD, and does not change the configuration of the array rank. As a technique for changing the configuration of the array rank, a technique for changing the RAID class according to the write data amount (for example, see Patent Document 3), or a technique for changing the array configuration information held in the data holding unit by the user. Then, the disk interface changing unit instructs the disk interface connecting unit to change the connection state between the wiring of the array control unit and the wiring of the disk device group according to the changed array configuration information ( For example, Patent Literature 4) is known, but is not related to recovery processing.
[0011]
[Patent Document 1]
JP-A-11-24850
[Patent Document 2]
JP 2001-175423 A
[Patent Document 3]
JP-A-6-187249
[Patent Document 4]
JP 2000-298557 A
[0012]
[Problems to be solved by the invention]
As described above, the failure recovery processing generally performed conventionally and the techniques described in Patent Literatures 1 and 2 transfer data stored in a failed HDD to another HDD such as a spare HDD. It is said that there is a high possibility that a failure will occur again after recovery from a failure because the product type of the failed physical disk, the details of the failure, and the configuration of the array rank are not taken into account. There's a problem.
[0013]
Therefore, an object of the present invention is to reduce the risk of a reoccurrence of a failure by performing a failure recovery in consideration of the product type of the failed physical disk, the details of the failure, and the configuration of the array rank. .
[0014]
[Means for Solving the Problems]
A first disk array device according to the present invention achieves the above object,
In a disk array device having a plurality of physical disks,
When a failure occurs in any of the plurality of physical disks, there is a high possibility that a failure having the same failure content as the failed failure will recur on a physical disk of the same product type as the failed physical disk. If it is determined that there is a high possibility of reoccurrence, the configuration of the array rank that includes a physical disk of the same product type as the physical disk in which the failure has occurred is more reliable than the current configuration. A means for changing the configuration to a highly operable one.
[0015]
More specifically, the second disk array device according to the present invention comprises:
In a disk array device having a plurality of physical disks,
A failure history management unit that manages a failure history including the product type of the failed physical disk and the failure content of the failure,
When a failure occurs in any of the plurality of physical disks, based on the failure history managed by the failure history management unit, a failure having the same failure content as the failed failure has occurred. It is determined whether the physical disk of the same product type as the physical disk is likely to recur, and if it is determined that there is a high possibility of recurring, the physical disk of the same product type as the failed physical disk is determined. An array rank changing unit for changing the configuration of the array rank as a component to a configuration more reliable than the current configuration is provided.
[0016]
Further, a third disk array device according to the present invention, in the second disk array device,
A change policy information storage unit that stores change policy information indicating how to change the current array rank configuration; and
The array rank changing unit, based on the current configuration of the array rank being the configuration change target and the change policy information stored in the change policy information storage unit, the array rank change target of the array change It has a feature of changing the configuration.
[0017]
A fourth disk array device according to the present invention is the third disk array device,
The change policy information storage unit stores a plurality of array rank configurations having different reliability.
The array rank changing unit sets the configuration of the array rank that is the configuration change target to be more than the current configuration of the array rank that is the configuration target among the configurations stored in the change policy information storage unit. It is characterized by having a configuration for changing to a configuration having one rank reliability.
[0018]
A fifth disk array device according to the present invention is the fourth disk array device,
A plurality of spare physical disks, and
The array rank changing unit uses the spare physical disk to configure an array rank that is one rank higher in reliability than the array rank to be changed, and configures the array rank to be changed. The present invention is characterized in that a normal physical disk in an element is changed to a spare physical disk.
[0019]
A sixth disk array device according to the present invention is the fifth disk array device,
In the change policy information storage unit, a configuration in which the RAID class is the same and the number of physical disks as constituent elements is one less is stored as a configuration having one rank reliability higher than that of a certain array rank. It is characterized by the following.
[0020]
A seventh disk array device according to the present invention is the sixth disk array device,
An array rank configuration information storage unit in which information indicating a physical disk configuring the array rank and a RAID class is stored for each array rank,
The array rank changing unit may change the array rank configuration by rewriting the contents of the array rank configuration information storage unit.
[0021]
Further, an eighth disk array device according to the present invention, in any one of the second to seventh disk array devices,
The failure history information managed by the failure history management unit is displayed on a display unit in response to a history reference request input by a user, and the failure history management unit is displayed in response to a history change request input by the user. A history information change / reference unit for changing the managed failure history information is provided.
[0022]
A ninth disk array device according to the present invention is the disk array device according to any one of the third to seventh aspects,
A change policy information stored in the change policy information storage unit is displayed on a display unit in response to a change policy reference request input by a user, and the change policy is displayed in response to a change policy change request input by a user. A change policy change / reference unit for changing the change policy information stored in the information storage unit is provided.
[0023]
[Action]
When a failure occurs in any of a plurality of physical disks constituting the disk array device, for example, based on a failure history, a failure having the same failure content as the failure has occurred. It is determined whether the physical disk of the same product type as the physical disk has a high possibility of recurrence. If it is determined that the possibility of recurrence is high, the configuration of the array rank, which includes a physical disk of the same product type as that of the failed physical disk, is changed to a configuration more reliable than the current configuration. change. As a configuration more reliable than the current configuration, for example, a configuration in which the RAID class is the same and the number of physical disks is one less can be adopted. In this way, since the failure is recovered in consideration of the product type of the failed physical disk, the details of the failure, and the configuration of the array rank, the risk of the failure recurring after the failure recovery is reduced. Can be.
[0024]
BEST MODE FOR CARRYING OUT THE INVENTION
Next, embodiments of the present invention will be described in detail with reference to the drawings.
[0025]
FIG. 1 is a block diagram of one embodiment of the present invention, which comprises a disk array device 1, a host computer 2, and an upper I / F (upper interface) 3 for connecting the two.
[0026]
The disk array device 1 includes a higher-level I / F controller 101, an array controller 102, an HDD (hard disk drive) controller 103, an HDD group 104, a change policy information storage unit 105, an array rank change unit 106, , An HDD status information storage unit 107, a failure history management unit 108, a failure history information storage unit 109, and an array rank configuration information storage unit 110.
[0027]
The HDD group 104 is composed of 15 HDDs 01 to 15, and HDD numbers “01” to “15” are assigned to the HDDs 01 to 15, respectively. The HDDs 01 to 05 constitute an array rank of the array rank number “01”, and the HDDs 06 to 10 constitute an array rank of the array number “02”. The array ranks of the array rank numbers “01” and “02” are both array ranks of the RAID class “RAID-5”, and the addresses “A0” to “A1” are array ranks of the array rank number “01”. “Address (A1 + 1)” to “A2” are assigned to the array rank having the assigned array rank number “02”. The HDDs 11 to 15 are spare HDDs.
[0028]
The array rank configuration information storage unit 110 stores information indicating the configuration of each array rank existing in the HDD group 104, addresses assigned to each array rank, and information indicating a spare HDD.
[0029]
FIG. 2 shows an example of the contents of the array rank configuration information storage unit 110. As shown in the figure, the array rank configuration information storage unit 110 stores an array rank number, a RAID class, an HDD configuration, an HDD number of an HDD as a component, and an address. In the example of FIG. 2, the array ranks of array rank numbers “01” and “02” are configured by HDDs 01 to 05 and 06 to 10 of HDD numbers “01” to “05” and “06” to “10”, respectively. , The RAID class is “RAID-5”, the HDD configuration is “4D + 1P”, and the assigned addresses are “A0” to “A1” and “(A1 + 1)” to “A2”, respectively. ing. Further, this example indicates that the HDDs 11 to 15 with the HDD numbers “11” to “15” are spare HDDs.
[0030]
The HDD status information storage unit 107 stores the product type of each of the HDDs 01 to 15, information indicating the current status of each of the HDDs 01 to 15 (“NORMAL” or “ABNORMAL”), and details of the failure.
[0031]
FIG. 3 shows an example of the contents of the HDD status information storage unit 107. As shown in the figure, the HDD status information storage unit 107 stores an HDD number, a vendor, a product name, a version, a current status, and details of a failure. In this example, the product type of the HDD is represented by the vendor, product name, and version. In the example of FIG. 3, HDDs 01 to 05 and 11 to 13 are products of vendor “NNN”, product name “AAA”, and version “1.10”, and HDDs 06 to 10, 14, and 15 are vendors “HHH”, This indicates that the product has the product name “BBB” and the version “2.00”. Further, the example in FIG. 3 shows that a failure (ABNORMAL) having the failure content “timeout” occurs only in the HDD 10 among the HDDs 01 to 15, and the other HDDs are normal (NORMAL).
[0032]
The failure history information storage unit 109 stores information indicating the history of failures that have occurred in the HDD group 104.
[0033]
FIG. 4 shows an example of the contents of the failure history information storage unit 109. As shown in the figure, the failure history information storage unit 109 stores a history number, a vendor, a product name, a version, and details of a failure. In the example of FIG. 4, in the HDD of the vendor “NNN”, the product name “AAA”, and the version “1.10”, an “error-correctable data error” occurs once in the past, and the vendor “HHH” and the product This indicates that “timeout” has occurred twice in the past in the HDD with the name “BBB” and version “2.00”.
[0034]
The change policy information storage unit 105 stores a policy for changing the configuration of the array rank.
[0035]
FIG. 5 shows an example of the contents of the change policy information storage unit 105. As shown in the drawing, the change policy information storage unit 105 stores a phase number, a RAID class, and an HDD configuration. The configuration of the array rank is specified by the RAID class and the HDD configuration, and the smaller the phase number, the higher the reliability. The array rank configuration of the phase number “01” is a mirror configuration in which the same data is stored in two HDDs. The array rank configurations of the phase numbers “02” to “04” are all array rank configurations of the RAID class “RAID-5”, but the number of HDDs as constituent elements increases by one each time the phase number increases by one. I have. In other words, the smaller the phase number, the higher the reliability. These pieces of information are stored in the change policy information storage unit 105 from the host computer 2 via the upper I / F 3 and the upper I / F control unit 101.
[0036]
The upper I / F controller 101 has a function of exchanging data and the like with the host computer 2 via the upper I / F 3.
[0037]
The array control unit 102 controls the HDD control unit 103 according to a read command or a write command sent from the host computer 2 via the higher-level I / F control unit 101, so that data from the corresponding HDD in the HDD group 104 is And a function to write data.
[0038]
The HDD control unit 103 has a function of reading data from or writing data to a corresponding HDD in the HDD group 104 under the control of the array control unit 102. Further, the HDD control unit 103 changes the current state of the HDD in the HDD state information storage unit 107 to “ABNORMAL” while detecting the function of detecting the failed HDD and detecting the failed HDD. It has a function to write the details of the failure.
[0039]
The failure history information management unit 108 monitors the HDD status information storage unit 107, and when an HDD whose current state is changed to ABNORMAL occurs, the failure history information management unit 108 stores the HDD vendor, product name, version, and failure content in the failure history. It has a function of additionally registering information in the information storage unit 109 and a function of outputting a start instruction to the array rank changing unit 106 after the completion of additional registration.
[0040]
When a start instruction is input from the failure history management unit 108, the array rank change unit 106 replaces the failure with the same content as the failure that has occurred this time based on the storage content of the failure history information storage unit 109 with the HDD in which the failure has occurred this time. It is determined whether there is a high possibility of reoccurrence in an HDD of the same product type as above. Has a function of changing the configuration of the array rank to a configuration that is more reliable than the current configuration. The method of changing the configuration will be described later in detail.
[0041]
Description of operation of the embodiment
Next, the operation of this embodiment will be described in detail.
[0042]
First, the operation at the time of reading will be described.
[0043]
When reading data, the host computer 2 outputs a read command (including an address, a type, and a size). This read command is sent to the array controller 102 via the upper I / F 3 and the upper I / F controller 101.
[0044]
Upon receiving the read command, the array control unit 102, based on the address assigned to each array rank stored in the array rank configuration information storage unit 110, stores the HDD to be accessed in which the data to be read first is stored. (FIG. 6, step S62).
[0045]
Thereafter, the array control unit 102 refers to the HDD status information storage unit 107 and checks whether or not the current status of the access target HDD is “NORMAL” (Step S63).
[0046]
If it is "NORMAL", that is, if the HDD to be accessed is normal (YES in step S63), the HDD controller 103 is controlled to read data from the HDD to be accessed and transfer it to the host computer 2 ( Steps S64 and S65).
[0047]
On the other hand, if the status is “ABNORMAL”, that is, if a failure has occurred in the HDD to be accessed (NO in step S63), the HDD control unit 103 is controlled to control other HDDs within the same array rank as the HDD to be accessed. Then, the data stored in the normal HDD is read, and the data stored in the HDD to be accessed is restored based on the data (steps S66 and S67). After that, the restored data is transferred to the host computer 2. The above process is repeated until all data specified by the read command is read (until step S61 becomes YES).
[0048]
Next, the operation at the time of writing will be described.
[0049]
When writing data, the host computer 2 outputs a write command (including an address, a type, and a size). This write command is sent to the array controller 102 via the upper I / F 3 and the upper I / F controller 101.
[0050]
Upon receiving the write command, the array control unit 102 obtains an access target HDD to which data is first written based on the information stored in the array rank configuration information storage unit 110 (FIG. 7, step S72).
[0051]
Thereafter, the array control unit 102 refers to the HDD status information storage unit 107 and checks whether the current status of the access target HDD is “ABNORMAL” (step S73).
[0052]
If at least one of the current HDDs to be accessed is “ABNORMAL” (YES in step S 73), the write data sent from the host computer 2 and the HDD control unit 103 are controlled. A new parity P 'is generated from the read parity P, the new parity P' is written to the corresponding HDD, and the corresponding data is written to another normal HDD (step S75).
[0053]
On the other hand, if all the current states of the HDDs to be accessed are “NORMAL” (NO in step S73), the current state of the HDD storing the parity is referred to the HDD state information storage unit 107. It is checked whether it is "ABNORMAL" (step S74).
[0054]
If the current state of the HDD in which the parity is stored is “ABNORMAL” (YES in step S74), the HDD control unit 103 is controlled to transmit the data sent from the host computer 2 to the HDD to be accessed. Write (step S76).
[0055]
On the other hand, when the current state of the HDD in which the parity is stored is “NORMAL” (NO in step S74), the write data sent from the host computer and the HDD control unit 103 are controlled. A new parity P 'is generated from the parity P read by (1) (step S77). Thereafter, the new parity P ′ generated in step S77 is written to the corresponding HDD, and the write data sent from the host computer 2 is written to the access target HDD (step S78). The above processing is repeatedly performed until all the data instructed by the write command is written (until step S71 becomes YES).
[0056]
Next, the operation when the configuration of the array rank is changed will be described.
[0057]
Now, for example, it is assumed that a failure has occurred in the HDD 10 having the HDD number “10”. When detecting that a failure has occurred in the HDD 10, the HDD control unit 103 changes the current state of the HDD 10 stored in the HDD state information storage unit 107 from “NORMAL” to “ABNORMAL” and describes the failure (for example, , “Time out”). By this processing, the contents of the HDD status information storage unit 107 are as shown in FIG.
[0058]
The failure history management unit 108 monitors the HDD status information storage unit 107. When detecting that the current status corresponding to the HDD number “10” has become “ABNORMAL”, the failure history management unit 108 corresponds to the HDD number “10”. The vendor “HHH”, the product name “BBB”, the version “2.00”, and the failure content “timeout” are additionally registered in the failure history information storage unit 109. Now, for example, it is assumed that the history information of the history number “00003” is additionally registered in the failure history information storage unit 109 by the above processing, and the contents of the failure history information storage unit 109 are as shown in FIG. After that, the failure history management unit 108 outputs a start instruction to the array rank change unit 106.
[0059]
When a start instruction is added from the failure history management unit 108, the array rank changing unit 106 refers to the failure history information having the largest history number among the failure history information stored in the failure history information storage unit 109, The product type of the failed HDD 10 and the content of the failure are recognized (step S81 in FIG. 8). In this example, the array rank changing unit 106 refers to the failure history information of the history number “00003”, and the product type of the HDD 10 in which the failure has occurred is the vendor “HHH”, the product name “BBB”, and the version “2. 00 "as well as that the failure content is" timeout ".
[0060]
Thereafter, the array rank changing unit 106 determines whether or not a failure having the same failure content as the failure (timeout) that has occurred this time is highly likely to recur in an HDD of the same product type as the HDD 10 in which the failure has occurred this time ( Step S82). This determination is made, for example, as follows. First, the failure history information storage unit 109 is searched, and failure history information that includes the same product type and failure content as the failure history information with the largest history number (in this example, failure history information with the history number “00003”) Find the number of. Thereafter, the number of the obtained failure history information is compared with a predetermined number. If the predetermined number is smaller, it is determined that the possibility of recurrence is high, and if not, it is determined that the possibility of recurrence is low.
[0061]
If it is determined that the possibility of recurrence is low (NO in step S82), the array rank changing unit 106 ends the processing.
[0062]
On the other hand, when it is determined that the possibility of recurrence is high (YES in step S82), the HDD number of the HDD of the same product type as the failed HDD 10 is acquired by referring to the HDD status information storage unit 107. (Step S83). In this example, “06” to “09”, “14”, and “15” are acquired as the HDD numbers of the HDDs of the same product type as the HDD 10.
[0063]
Thereafter, the array rank changing unit 106 refers to the array rank configuration information storage unit 110 and configures any one of the failed HDD 10 and the HDDs 06 to 09, 14, and 15 of the same product type as the HDD 10 as constituent elements. The RAID class and the HDD configuration of the array rank are acquired (step S84). Now, for example, assuming that the contents of the array rank configuration information storage unit 110 are as shown in FIG. 2, the RAID class “RAID-5” and the HDD configuration “4D + 1P” registered corresponding to the array rank number “02”. "Will be obtained.
[0064]
Further, the array rank changing unit 106 acquires the HDD number of the spare HDD by referring to the array rank configuration information storage unit 110 (Step S85). In this example, “11” to “15” are acquired as spare HDD numbers.
[0065]
Thereafter, the array rank changing unit 106 refers to the change policy information storage unit 105, and configures the array rank configuration that is one rank more reliable than the configuration of the array rank specified by the RAID class and the HDD configuration acquired in step S84. Is acquired (step S86). This processing will be described in detail below. First, the change policy information storage unit 105 is searched to find an entry in which the array rank configuration specified by the RAID class “RAID-5” and the HDD configuration “4D + 1P” acquired in step S84 is stored. In the case of this example, the entry of the phase number “04” is obtained. Thereafter, the RAID class “RAID-5” and the HDD configuration “3D + 1P” stored in the entry of the phase number “03” having one less phase number than the phase number “04” are acquired. The above is the details of the processing performed in step S86.
[0066]
After that, the array rank changing unit 106 uses the HDDs (four) of the number indicated by the HDD configuration “3D + 1P” obtained in step S86 among the spare HDDs 11 to 15 whose HDD numbers have been obtained in step S85. Then, the configuration of the array rank is changed (step S87).
[0067]
The processing in step S87 will be described in detail below. First, an HDD used for the configuration change is selected from the spare HDDs 11 to 15. At this time, an HDD having a different product type from the failed HDD is preferentially selected. Now, for example, it is assumed that HDDs with HDD numbers “11” to “14” are selected as HDDs used for configuration change. Thereafter, the array rank changing unit 106 uses the HDDs 06 to 10 as components while restoring the data stored in the failed HDD 10 based on the data stored in the other normal HDDs 06 to 09. The data stored in the array rank is written to the array rank having the HDDs 11 to 14 as constituent elements. At this time, a process of generating and writing parity is also performed. The above is the details of the processing performed in step S87. Here, when the configuration is changed to an array rank configuration with a small number of HDDs, data overflows depending on the amount of data stored in the array rank before the configuration change, and all data cannot be stored in the array rank after the configuration change. May be lost. In order to prevent such a situation, for example, the amount of data stored in each array rank may be limited to the amount of data that can be stored in the array rank having the phase number “01”. In order to do this, for example, the size of the address assigned to the array rank of each phase number is set to the same value, and the value is set according to the amount of data that can be stored in the array rank of the phase number “01”. Just do it.
[0068]
Thereafter, the array rank changing unit 106 changes the configuration information of the array rank of the array rank number “02” whose configuration was changed this time, stored in the array rank configuration information storage unit 110 (step S88). In the case of this example, as shown in FIG. 9, the configuration information of the array rank having the array rank number “02” includes the RAID class “RAID-5”, the HDD configuration “3D + 1P”, and the components “11” and “12”. , “13”, and “14”. The address remains unchanged.
[0069]
In the above-described embodiment, when a failure occurs in the HDD, the array rank changing unit 106 immediately starts the processing shown in the flowchart of FIG. 8, but does not adversely affect the normal I / O operation. For this purpose, the processing shown in FIG. 8 may be executed when the load is relatively light. Further, in the above-described embodiment, the product type of the HDD is represented by the vendor, the product name, and the version. However, the present invention is not limited to this.
[0070]
Another Embodiment of the Invention
Next, another embodiment of the present invention will be described. This embodiment is characterized in that the failure history information stored in the failure history information storage unit can be referenced from the console, and the failure history information in the failure history information storage unit can be changed from the console.
[0071]
FIG. 10 is a block diagram of another embodiment of the present invention, which differs from the embodiment shown in FIG. 1 in that a disk array device 1a is provided instead of the disk array device 1 and a console 4 is added. That is the point.
[0072]
The disk array device 1a is different from the disk array device 1 shown in FIG. 1 in that a change policy change / reference unit 111 and a history information change / reference unit 112 are provided.
[0073]
The change policy change / reference unit 111 has a function of changing the change policy information stored in the change policy information storage unit 105 in accordance with a change policy change request input by the user using the console 4, or the user using the console 4. It has a function of reading a change policy from the change policy information storage unit 105 in accordance with the input change policy reference request and displaying the read change policy on a display unit (not shown) of the console 4.
[0074]
The history information change / reference unit 112 has a function of changing the failure history information stored in the failure history information storage unit 109 in accordance with a history information change request input by the user using the console 4, or the user using the console 4. It has a function of reading out fault history information from the fault history information storage unit 109 in accordance with the input history information reference request and displaying it on the display unit of the console 4.
[0075]
Next, the operation of this embodiment will be described. Since the operations other than the change policy change / reference unit 111 and the history information change / reference unit 112 are the same as those in the embodiment shown in FIG. 1, here, the change policy change / reference unit 111 and the history information change / reference unit are used. Only the operation of 112 will be described.
[0076]
When the user wants to refer to the change policy information stored in the change policy information storage unit 105, he inputs a change policy reference request using the console 4. When the change policy reference request is input, the change policy change / reference unit 111 reads the change policy information from the change policy information storage unit 105 and displays it on the display unit of the console 4.
[0077]
When changing the change policy information stored in the change policy information storage unit 105, the user inputs a change policy change request including the changed change policy information from the console 4. When a change policy change request is input, the change policy change / reference unit 111 replaces the contents of the change policy information storage unit 105 with the change policy information included therein.
[0078]
When the user wants to refer to the failure history information stored in the failure history information storage unit 109, the user inputs a history reference request from the console 4. When the history reference request is input, the history information change / reference unit 112 reads the failure history information from the failure history information storage unit 109 and displays it on the display unit of the console 4.
[0079]
When the user wants to change the failure history information stored in the failure history information storage unit 109, the user inputs a history setting request including the changed history information from the console 4. When a history setting request is input, the history information change / reference unit 112 replaces the contents of the failure history information storage unit 109 with the failure history information included therein.
[0080]
As described above, in the present embodiment, the failure history information can be referred to or changed from an external console. For example, the failure history information stored in the failure history information storage unit 109 in the operating disk array device can be used. , And store the collected fault history information in the fault history information storage unit 109 of another disk array device such as the disk array device whose use is to be started. By doing so, the configuration of the array rank can be changed at a more appropriate time in another disk array device such as the disk array device whose use is to be started. Further, since the change policy information can be changed / referenced, it can be easily checked whether or not the change policy information is suitable for the usage environment of the disk array device, and the change policy information can be read from the disk array device. It can be easily changed to one that matches the usage environment.
[0081]
【The invention's effect】
As described above, the disk array device of the present invention is provided with the array rank changing unit that recovers the failure in consideration of the product type of the failed physical disk, the content of the failure, and the array rank configuration. Thus, it is possible to reduce the risk that the failure will recur after the recovery from the failure.
[0082]
Also, after the failure is recovered by the array rank changing unit, the reliability of the array rank becomes higher than before the recovery from the failure. Therefore, even if a failure occurs on one physical disk after that, the failure is transferred to another physical disk. The risk of a worst-case situation in which a failure occurs and data in the array rank is lost can be extremely reduced.
[0083]
Further, since the disk array device of the present invention includes the history information change / reference unit, it is possible to set the failure history information of a certain disk array device in another disk array device. As a result, in the other disk array device, it is possible to change the configuration of the array rank at a more appropriate time.
[0084]
Further, since the disk array device of the present invention includes the change policy change / reference unit, it is possible to easily check whether the change policy information is suitable for the usage environment of the disk array device, and to change the change policy information. The information can be easily changed to one suitable for the use environment of the disk array device. .
[Brief description of the drawings]
FIG. 1 is a block diagram of one embodiment of the present invention.
FIG. 2 is a diagram showing an example of the contents of an array rank configuration information storage unit 110;
FIG. 3 is a diagram showing an example of the contents of an HDD status information storage unit 107;
FIG. 4 is a diagram illustrating an example of contents of a failure history information storage unit 109;
FIG. 5 is a diagram illustrating an example of contents of a change policy information storage unit 105;
FIG. 6 is a flowchart showing a processing example at the time of reading.
FIG. 7 is a flowchart illustrating an example of a write process;
FIG. 8 is a flowchart showing a processing example of an array rank changing unit 106;
FIG. 9 is a diagram showing an example of the contents of an array rank configuration information storage unit 110 after the configuration of the array rank has been changed.
FIG. 10 is a block diagram of another embodiment of the present invention.
FIG. 11 is a diagram showing a configuration example of an array rank.
FIG. 12 is a diagram for explaining read access and write access when a failure occurs in an HDD.
[Explanation of symbols]
1, 1a: Disk array device
101: Upper I / F control unit
102: Array control unit
103 ... HDD control unit
104: HDD group
105: Change policy information storage unit
106: Array rank changing unit
107: HDD status information storage unit
108: Failure history management unit
109: Failure history information storage unit
110: Array rank configuration information storage unit
111: Change policy change / reference section
112: History information change / reference section
2. Host computer
3 High-order I / F
4. Console

Claims

In a disk array device having a plurality of physical disks, when a failure occurs in any of the plurality of physical disks, a failure having the same failure content as the failed failure is the same product as the failed physical disk. It is determined whether there is a high possibility of reoccurrence in the physical disk of the type, and if it is determined that there is a high possibility of reoccurrence, an array including a physical disk of the same product type as the physical disk in which the failure has occurred is an element. A disk array device comprising means for changing a rank configuration to a configuration that is more reliable than the current configuration.

In a disk array device having a plurality of physical disks, a failure history management unit that manages a failure history including a product type of the failed physical disk and the failure content of the failure,
When a failure occurs in any of the plurality of physical disks, based on the failure history managed by the failure history management unit, a failure having the same failure content as the failed failure has occurred. It is determined whether the physical disk of the same product type as the physical disk is likely to recur, and if it is determined that there is a high possibility of recurring, the physical disk of the same product type as the failed physical disk is determined. What is claimed is: 1. A disk array device comprising: an array rank changing unit configured to change a configuration of an array rank as a constituent element to a configuration more reliable than a current configuration.

The disk array device according to claim 2,
A change policy information storage unit that stores change policy information indicating how to change the current array rank configuration; and
The array rank changing unit, based on the current configuration of the array rank being the configuration change target and the change policy information stored in the change policy information storage unit, the array rank change target of the array change A disk array device having a configuration for changing the configuration.

The disk array device according to claim 3,
The change policy information storage unit stores a plurality of array rank configurations having different reliability.
The array rank changing unit sets the configuration of the array rank that is the configuration change target to be more than the current configuration of the array rank that is the configuration target among the configurations stored in the change policy information storage unit. A disk array device having a configuration for changing to a configuration having one rank higher reliability.

The disk array device according to claim 4,
A plurality of spare physical disks, and
The array rank changing unit uses the spare physical disk to configure an array rank that is one rank higher in reliability than the array rank to be changed, and configures the array rank to be changed. A disk array device having a configuration in which a normal physical disk in an element is changed to a spare physical disk.

The disk array device according to claim 5,
In the change policy information storage unit, a configuration in which the RAID class is the same and the number of physical disks as constituent elements is one less is stored as a configuration having one rank reliability higher than that of a certain array rank. A disk array device characterized by the above-mentioned.

The disk array device according to claim 6,
An array rank configuration information storage unit in which information indicating a physical disk configuring the array rank and a RAID class is stored for each array rank,
A disk array device, wherein the array rank changing unit changes the array rank configuration by rewriting the contents of the array rank configuration information storage unit.

The disk array device according to any one of claims 2 to 7,
The failure history information managed by the failure history management unit is displayed on a display unit in response to a history reference request input by a user, and the failure history management unit is displayed in response to a history change request input by the user. A disk array device comprising a history information change / reference unit for changing managed failure history information.

The disk array device according to any one of claims 3 to 7,
A change policy information stored in the change policy information storage unit is displayed on a display unit in response to a change policy reference request input by a user, and the change policy is displayed in response to a change policy change request input by a user. A disk array device comprising a change policy change / reference unit for changing change policy information stored in an information storage unit.