JPWO2013108351A1

JPWO2013108351A1 - Computer system and logical storage area management method

Info

Publication number: JPWO2013108351A1
Application number: JP2013554106A
Authority: JP
Inventors: 浩也松葉; 鵜飼　敏之; 敏之鵜飼
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2012-01-16
Filing date: 2012-01-16
Publication date: 2015-05-11
Anticipated expiration: 2032-01-16
Also published as: WO2013108351A1; JP6005668B2

Abstract

一般的な計算機を用いて、耐故障機能を有する共有ストレージを構築可能な計算機システムを提供することを目的とする。複数の計算機を備える計算機システムであって、計算機は、複数の計算機の各々の記憶媒体が提供する記憶領域を用いて論理記憶領域を生成する冗長化処理部と、論理記憶領域を用いて前記サービスを提供するサービス提供部とを有し、冗長化処理部は、主系計算機及び副系計算機の記憶媒体が提供する記憶領域を用いて論理記憶領域を生成し、論理記憶領域と、論理記憶領域を構成する主系計算機及び副系計算機の記憶媒体との対応関係を含む冗長化情報を生成し、主系計算機及び副系計算機の記憶媒体に冗長化情報を書き込み、論理記憶領域をサービス提供部に提供して、サービスの開始を命令し、アクセス要求を受信した場合に、冗長化情報を参照して論理記憶領域にアクセスする。An object of the present invention is to provide a computer system capable of constructing a shared storage having a fault tolerance function using a general computer. A computer system including a plurality of computers, wherein the computer uses a storage area provided by each storage medium of the plurality of computers to generate a logical storage area, and the service using the logical storage area. The redundancy processing unit generates a logical storage area using the storage areas provided by the storage medium of the primary computer and the secondary computer, the logical storage area, and the logical storage area The redundant information including the correspondence with the storage medium of the primary computer and the secondary computer constituting the system is generated, the redundant information is written in the storage medium of the primary computer and the secondary computer, and the logical storage area is provided as a service providing unit. When the access request is received, the logical storage area is accessed with reference to the redundancy information.

Description

本発明は、複数の計算機を用いて構築された分散共有ファイルシステムに関する。特に、耐故障機能を有する共有ストレージを構築可能な計算機システム及び共有ストレージの管理方法に関する。 The present invention relates to a distributed shared file system constructed using a plurality of computers. In particular, the present invention relates to a computer system capable of constructing a shared storage having a fault tolerance function and a shared storage management method.

複数の計算機から構成される計算機システムでは、一台の計算機に故障が発生して場合であってもシステム全体として動作を継続できる耐故障機能が求められる。 A computer system composed of a plurality of computers is required to have a fault-tolerant function capable of continuing operation as a whole system even when a failure occurs in one computer.

耐故障機能としては、通常使用する主系計算機と、主系計算機の故障に備えて待機する副系計算機とを準備し、主系計算機に故障が発生した場合に副系計算機が動作を引き継ぐ方式が採られる。この場合、主系計算機と副系計算機とが共有ストレージを有し、主系計算機によって変更されたストレージの内容を副系計算機が引き継ぐことができるように構成される。 As a fault-tolerant function, there is a method that prepares a primary computer that is normally used and a secondary computer that stands by in preparation for a failure of the primary computer, and when the primary computer fails, the secondary computer takes over the operation. Taken. In this case, the main computer and the sub computer have a shared storage, and the sub computer can take over the contents of the storage changed by the main computer.

しかし、前述したような構成では、副系計算機が動作を引き継いだ後に、意図せず主系計算機が共有ストレージへアクセスすることによって、データを破壊する可能性がある。前述したようなデータ破壊の危険性を排除するため、共有ストレージに対するアクセスの排他制御が必要である。 However, in the configuration as described above, there is a possibility that the data is destroyed when the primary computer unintentionally accesses the shared storage after the secondary computer takes over the operation. In order to eliminate the risk of data destruction as described above, exclusive control of access to the shared storage is necessary.

例えば、特許文献１には、機能正否監視手段によって機能の停止が検出された場合に、複数のノードが共有ディスクに対して該共有ディスクの占有を指示するコマンドを発行し、占有権を取得したノードのみが共有ディスクの制御を可能にする方式が開示されている。 For example, in Patent Document 1, when a function stoppage is detected by the function correctness monitoring unit, a plurality of nodes issue commands to instruct the shared disk to occupy the shared disk, and acquire the occupation right A method is disclosed in which only a node can control a shared disk.

特開２００２−１８５４７８号公報JP 2002-185478 A

一般的に、前述したような共有ストレージは、専用のストレージシステムを用いて構成される。前述したような複数の計算機が共有して使用できる記憶領域を提供する機能（以下、共有機能と記載する）、及び、排他制御の機能は、当該ストレージシステムが備える。そのため、一般的な計算機だけでは共有ストレージを構成できない問題がある。 Generally, the shared storage as described above is configured using a dedicated storage system. The storage system includes a function for providing a storage area that can be shared and used by a plurality of computers as described above (hereinafter referred to as a shared function) and an exclusive control function. Therefore, there is a problem that the shared storage cannot be configured only with a general computer.

例えば、特許文献１は、計算機とは別に、外部に専用の共有ストレージ装置を用意しているが、計算機が有する記憶装置を用いて共有ストレージを構成するものではない。 For example, Patent Document 1 prepares a dedicated shared storage device outside the computer, but does not constitute a shared storage using a storage device included in the computer.

また、計算機が有する記憶装置を用いて共有ストレージを構成した場合に、１台の計算機に障害が発生した場合に、共有ストレージをどのように制御するかについては記載されていない。 Also, there is no description on how to control the shared storage when a failure occurs in one computer when the shared storage is configured using a storage device included in the computer.

例えば、複数の計算機が有する記憶装置を用いて構成された共有ストレージを利用する場合、１台の計算機に障害が発生すると、共有ストレージに障害が発生した状態となる。そのため、主系計算機又は副系計算機は、当該共有ストレージを用いてサービスを継続することできない。特許文献１には、共有ストレージの障害を認識されることなく、サービスを継続する構成については記載されていない。 For example, when a shared storage configured using storage devices included in a plurality of computers is used, if a failure occurs in one computer, a failure occurs in the shared storage. Therefore, the primary computer or the secondary computer cannot continue the service using the shared storage. Patent Document 1 does not describe a configuration for continuing the service without recognizing the failure of the shared storage.

本発明は、上記の問題点に鑑みてなされてものであり、共有機能及び排他制御の機能を備えていない計算機を用いて、安価、かつ、耐故障機能を有する計算機システムを提供することを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a computer system that is inexpensive and has a fault-tolerant function using a computer that does not have a sharing function and an exclusive control function. To do.

本願において開示される発明の代表的な一例を示せば以下の通りである。すなわち、複数の計算機を備える計算機システムであって、前記複数の計算機の各々は、プロセッサと、前記プロセッサに接続されるメモリと、前記プロセッサに接続される記憶媒体と、ネットワークを介して他の装置と接続するためのネットワークインタフェースとを有し、前記複数の計算機は、サービスを提供する１台の主系計算機と、前記主系計算機に障害が発生した場合に前記サービスを引き継ぐ１台以上の副系計算機とを含み、前記主系計算機は、前記複数の計算機の各々の前記記憶媒体が提供する記憶領域を用いて論理記憶領域を生成し、前記生成された論理記憶領域へのアクセスを管理する第１の冗長化処理部と、前記論理記憶領域を用いて前記サービスを提供する第１のサービス提供部と、を有し、前記副系計算機は、前記主系計算機を監視し、前記主系計算機の障害を検知した場合に、前記サービスを引き継ぐための処理を実行する障害制御部と、前記論理記憶領域を用いて前記サービスを提供する第２のサービス提供部と、を有し、前記第１の冗長化処理部は、前記主系計算機の記憶媒体及び前記副系計算機の記憶媒体が提供する記憶領域を用いて第１の論理記憶領域を生成し、前記第１の論理記憶領域と、当該第１の論理記憶領域を構成する前記主系計算機の記憶媒体及び前記副系計算機の記憶媒体との対応関係を含む第１の冗長化情報を生成し、前記第１の論理記憶領域を構成する前記主系計算機の記憶媒体及び前記副系計算機の記憶媒体に、前記第１の冗長化情報を書き込み、前記第１の論理記憶領域を前記第１のサービス提供部に提供して、当該第１のサービス提供部に前記サービスの開始を命令し、前記第１のサービス提供部からアクセス要求を受信した場合に、前記第１の冗長化情報を参照して、前記第１の論理記憶領域にアクセスし、前記障害制御部は、前記主系計算機の障害を検知した場合に、前記第２のサービス提供部を起動させ、前記第２のサービス提供部は、前記第１の論理記憶領域に格納された情報を用いて前記サービスを継続することを特徴とする。 A typical example of the invention disclosed in the present application is as follows. That is, a computer system including a plurality of computers, each of the plurality of computers including a processor, a memory connected to the processor, a storage medium connected to the processor, and another device via a network A plurality of computers, each of which includes one main computer that provides a service and one or more secondary computers that take over the service when a failure occurs in the main computer. The main computer generates a logical storage area using a storage area provided by the storage medium of each of the plurality of computers, and manages access to the generated logical storage area. A first redundancy processing unit, and a first service providing unit that provides the service using the logical storage area, and the subordinate computer includes: A fault control unit that monitors a system computer and detects a failure of the main computer, and executes a process for taking over the service; and a second service provision that provides the service using the logical storage area And the first redundancy processing unit generates a first logical storage area using a storage area provided by the storage medium of the primary computer and the storage medium of the secondary computer, Generating first redundancy information including a correspondence relationship between the first logical storage area and the storage medium of the primary computer and the storage medium of the secondary computer constituting the first logical storage area; The first redundancy information is written in the storage medium of the primary computer and the storage medium of the secondary computer constituting the first logical storage area, and the first logical storage area is stored in the first service. Providing to the provision department, the first When a service providing unit is instructed to start the service, and an access request is received from the first service providing unit, the first logical storage area is accessed with reference to the first redundancy information. The failure control unit activates the second service providing unit when detecting a failure of the primary computer, and the second service providing unit is stored in the first logical storage area. The service is continued using information.

本発明の一形態によれば、一般的な計算機を用いて耐故障機能を備えた論理記憶領域（共有ストレージ）を構築することが可能となる。また、主系計算機の障害発生時には、業務処理部は論理記憶領域の障害を意識させることなくサービスを継続できる。 According to one embodiment of the present invention, it is possible to construct a logical storage area (shared storage) having a fault tolerance function using a general computer. In addition, when a failure occurs in the main computer, the business processing unit can continue the service without being aware of the failure in the logical storage area.

本発明の第一の実施形態の計算機システムの構成を示すブロック図である。It is a block diagram which shows the structure of the computer system of 1st embodiment of this invention. 本発明の第一の実施形態における構成設定部（主系）が実行する処理を説明したフローチャートである。It is a flowchart explaining the process which the structure setting part (main system) in 1st embodiment of this invention performs. 本発明の第一の実施形態における副系選択部が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the sub system selection part in 1st embodiment of this invention performs. 本発明の第一の実施形態における冗長化処理部（主系）が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the redundancy process part (primary system) in 1st embodiment of this invention performs. 本発明の第一の実施形態における構成設定部（副系）が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the structure setting part (secondary system) in 1st embodiment of this invention performs. 本発明の第一の実施形態における障害制御部が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the failure control part in 1st embodiment of this invention performs. 本発明の第一の実施形態における構成設定部（副系）が、障害発生時に実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the structure setting part (secondary system) in 1st embodiment of this invention performs when a failure generate | occur | produces. 本発明の第一の実施形態における冗長化処理部（副系）が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the redundancy process part (subsystem) in 1st embodiment of this invention performs. 本発明の第一の実施形態における構成回復部（主系）が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the structure recovery part (primary system) in 1st embodiment of this invention performs. 本発明の第一の実施形態における構成回復部（副系）が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the structure recovery part (secondary system) in 1st embodiment of this invention performs. 本発明の第二の実施形態の計算機システムの構成を示したブロック図である。It is the block diagram which showed the structure of the computer system of 2nd embodiment of this invention.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

（第一の実施形態） (First embodiment)

図１は、本発明の第一の実施形態の計算機システムの構成を示すブロック図である。 FIG. 1 is a block diagram showing the configuration of the computer system according to the first embodiment of this invention.

本実施形態の計算機システムは、計算機１０１Ａ及び計算機１０１Ｂから構成される。計算機１０１Ａ及び計算機１０１Ｂは、ネットワーク１８０を介して互いに接続される。ネットワーク１８０は、例えば、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）が考えられる。ただし、本発明は、ネットワーク１８０の接続形式に限定されない。以下、計算機１０１Ａ及び計算機１０１Ｂを区別しない場合、計算機１０１と記載する。 The computer system according to this embodiment includes a computer 101A and a computer 101B. The computer 101A and the computer 101B are connected to each other via the network 180. For example, the network 180 may be a LAN (Local Area Network). However, the present invention is not limited to the connection form of the network 180. Hereinafter, when the computer 101A and the computer 101B are not distinguished, they are described as the computer 101.

本実施形態では、計算機１０１Ａ及び計算機１０１Ｂが備える記憶媒体を用いて共有ストレージを構成する。また、計算機１０１Ａは、共有ストレージを用いてサービスを提供する主系計算機として稼動し、計算機１０１Ｂは計算機１０１Ａに障害が発生した場合にサービスを継続する副系計算機として稼動するものとする。なお、図１では、計算機１０１が２台であるが、３台以上あってもよい。 In the present embodiment, the shared storage is configured using a storage medium included in the computer 101A and the computer 101B. Further, it is assumed that the computer 101A operates as a primary computer that provides a service using a shared storage, and the computer 101B operates as a secondary computer that continues the service when a failure occurs in the computer 101A. In FIG. 1, there are two computers 101, but there may be three or more computers.

まず、計算機１０１のハードウェア構成について説明する。 First, the hardware configuration of the computer 101 will be described.

計算機１０１Ａは、プロセッサ１０２Ａ、メモリ１０３Ａ、ストレージインタフェース１０４Ａ、ディスク装置１０５Ａ及びネットワークインタフェース１０６Ａを備える。 The computer 101A includes a processor 102A, a memory 103A, a storage interface 104A, a disk device 105A, and a network interface 106A.

プロセッサ１０２Ａは、メモリ１０３Ａに格納されるプログラムを実行する。プロセッサ１０２Ａがプログラムを実行することによって、計算機１０１Ａの機能を実現できる。以下、プログラムを主語に処理を説明する場合、プロセッサ１０２Ａによってプログラムが実行されていることを示す。 The processor 102A executes a program stored in the memory 103A. The function of the computer 101A can be realized by the processor 102A executing the program. Hereinafter, when processing is described with the program as the subject, it indicates that the program is being executed by the processor 102A.

メモリ１０３Ａは、プロセッサ１０２Ａが実行するプログラム及び当該プログラムを実行するために必要なデータを格納する。メモリ１０３Ａは、例えば、ＤＲＡＭのような半導体メモリが考えられ、ディスク装置１０５Ａに比べ高速にアクセスすることができる。メモリ１０３Ａに格納されるプログラム及びデータについては後述する。 The memory 103A stores a program executed by the processor 102A and data necessary for executing the program. As the memory 103A, for example, a semiconductor memory such as a DRAM can be considered, and the memory 103A can be accessed at a higher speed than the disk device 105A. The program and data stored in the memory 103A will be described later.

ストレージインタフェース１０４Ａは、大容量のデータを格納可能なディスク装置１０５Ａに接続するためのインタフェースである。 The storage interface 104A is an interface for connecting to a disk device 105A capable of storing a large amount of data.

ディスク装置１０５Ａは、所定のサービスに必要な情報（例えば、ファイルデータ）を格納する。ディスク装置１０５Ａは、例えば、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）が考えられる。 The disk device 105A stores information (for example, file data) necessary for a predetermined service. As the disk device 105A, for example, an HDD (Hard Disk Drive) can be considered.

なお、ディスク装置１０５以外のＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記憶媒体であってよい。また、ディスク装置１０５Ａは、複数あってもよい。また、ディスク装置１０５Ａは計算機１０１Ａに外付けされた形式でもよい。 In addition, it may be a storage medium such as an SSD (Solid State Drive) other than the disk device 105. Further, there may be a plurality of disk devices 105A. Further, the disk device 105A may be in the form of being externally attached to the computer 101A.

ネットワークインタフェース１０６Ａは、ネットワーク１８０を介して他の装置と接続するためのインタフェースである。 The network interface 106A is an interface for connecting to other devices via the network 180.

なお、計算機１０１Ａは、メモリ１０３Ａ及びディスク装置１０５Ａ以外に、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）等の情報を格納する記憶装置を備えていてもよい。 In addition to the memory 103A and the disk device 105A, the computer 101A may include a storage device that stores information such as an OS (Operating System).

計算機１０１Ｂのハードウェア構成は計算機１０１Ａと同一であるため説明を省略する。 Since the hardware configuration of the computer 101B is the same as that of the computer 101A, description thereof is omitted.

次に、計算機１０１のソフトウェア構成について説明する。 Next, the software configuration of the computer 101 will be described.

計算機１０１Ａのメモリ１０３Ａには、サービス提供部１５０、冗長化処理部１５１Ａ、ディスクドライバ部１５２Ａ、ネットワークディスクドライバ部１５３Ａ、副系選択部１５４、構成設定部１５５Ａ及び構成回復部１５６Ａを実現するプログラムが格納される。 In the memory 103A of the computer 101A, a program for realizing the service providing unit 150, the redundancy processing unit 151A, the disk driver unit 152A, the network disk driver unit 153A, the sub system selection unit 154, the configuration setting unit 155A, and the configuration recovery unit 156A. Stored.

サービス提供部１５０は、共有ストレージを用いて所定のサービスを提供する。サービス提供部１５０は、サービスの提供時に共有ストレージに対するアクセス要求を出力する。 The service providing unit 150 provides a predetermined service using the shared storage. The service providing unit 150 outputs an access request for the shared storage when providing the service.

冗長化処理部１５１Ａは、共有ストレージに対するアクセス要求を受信し、当該アクセス要求に対応するアクセス処理を実行する。具体的には、冗長化処理部１５１Ａは、ディスクドライバ部１５２Ａ及びネットワークディスクドライバ部１５３Ｂに対してアクセス要求を出力する。冗長化処理部１５１Ａが実行する処理の詳細は、図４を用いて後述する。冗長化処理部１５１Ａは、例えば、ＯＳが備えるソフトウェアＲＡＩＤ機能によって実現することができる。 The redundancy processing unit 151A receives an access request for the shared storage, and executes an access process corresponding to the access request. Specifically, the redundancy processing unit 151A outputs an access request to the disk driver unit 152A and the network disk driver unit 153B. Details of the processing executed by the redundancy processing unit 151A will be described later with reference to FIG. The redundancy processing unit 151A can be realized by a software RAID function provided in the OS, for example.

ディスクドライバ部１５２Ａは、冗長化処理部１５１Ａから出力されたアクセス要求に基づいて、共有ストレージを構成するディスク装置１０５Ａにアクセスする。 The disk driver unit 152A accesses the disk device 105A constituting the shared storage based on the access request output from the redundancy processing unit 151A.

ネットワークディスクドライバ部１５３Ａは、ネットワーク１８０を介して、共有ストレージを構成するディスク装置１０５Ｂにアクセスする。具体的には、ネットワークディスクドライバ部１５３Ａは、ネットワークディスクドライバ部１５３Ｂにアクセス要求を送信する。 The network disk driver unit 153A accesses the disk device 105B constituting the shared storage via the network 180. Specifically, the network disk driver unit 153A transmits an access request to the network disk driver unit 153B.

副系選択部１５４は、計算機システムに含まれる複数の計算機１０１から副系計算機となる計算機１０１を選択する。副系選択部１５４が実行する処理の詳細は、図３を用いて後述する。 The sub system selection unit 154 selects a computer 101 that is a sub system computer from a plurality of computers 101 included in the computer system. Details of processing executed by the sub system selection unit 154 will be described later with reference to FIG.

構成設定部１５５Ａは、主系計算機として稼動するために必要な情報を設定する。構成設定部１５５Ａが実行する処理の詳細は、図２を用いて後述する。 The configuration setting unit 155A sets information necessary for operating as a main computer. Details of the processing executed by the configuration setting unit 155A will be described later with reference to FIG.

構成回復部１５６Ａは、計算機１０１Ａに障害が発生した場合に、当該障害を回復するための処理を実行する。構成回復部１５６Ａが実行する処理の詳細は、図９を用いて後述する。 When a failure occurs in the computer 101A, the configuration recovery unit 156A executes a process for recovering the failure. Details of the processing executed by the configuration recovery unit 156A will be described later with reference to FIG.

なお、メモリ１０３Ａに格納されるプログラムは、ディスク装置１０５Ａ又は外部の装置（図示省略）に格納されていてもよい。この場合、ディスク装置１０５Ａから各プログラムが読み出され、又は、ネットワーク１８０を介して外部の装置から各プログラムが読み出され、メモリ１０３Ａに格納される。 The program stored in the memory 103A may be stored in the disk device 105A or an external device (not shown). In this case, each program is read from the disk device 105A, or each program is read from an external device via the network 180 and stored in the memory 103A.

計算機１０１Ｂのメモリ１０３Ｂには、障害制御部１７０、代替サービス提供部１７１、冗長化処理部１５１Ｂ、ディスクドライバ部１５２Ｂ、ネットワークディスクドライバ部１５３Ｂ、構成設定部１５５Ｂ、及び構成回復部１５６Ｂを実現するプログラムが格納される。 In the memory 103B of the computer 101B, a program for realizing the failure control unit 170, the alternative service providing unit 171, the redundancy processing unit 151B, the disk driver unit 152B, the network disk driver unit 153B, the configuration setting unit 155B, and the configuration recovery unit 156B Is stored.

障害制御部１７０は、主系計算機として稼動する計算機１０１Ａの動作を監視し、計算機１０１Ａの障害を検知した場合に、計算機１０１Ｂがサービスを継続するための処理を実行する。障害制御部１７０が実行する処理の詳細は、図６を用いて後述する。 The failure control unit 170 monitors the operation of the computer 101A that operates as the primary computer, and when the failure of the computer 101A is detected, the computer 101B executes processing for continuing the service. Details of processing executed by the failure control unit 170 will be described later with reference to FIG.

代替サービス提供部１７１は、障害が発生した計算機１０１Ａの代わりにサービスを提供する。 The alternative service providing unit 171 provides a service instead of the computer 101A in which a failure has occurred.

冗長化処理部１５１Ｂは、冗長化処理部１５１Ａと同一のものであり、代替サービス提供部１７１から出力されたアクセス要求を受信し、共有ストレージに対するアクセス処理を実行する。ディスクドライバ部１５２Ｂは、ディスクドライバ部１５２Ｂと同一のものである。冗長化処理部１５１Ｂが実行する処理の詳細は、図８を用いて後述する。 The redundancy processing unit 151B is the same as the redundancy processing unit 151A, receives the access request output from the alternative service providing unit 171 and executes access processing for the shared storage. The disk driver unit 152B is the same as the disk driver unit 152B. Details of the processing executed by the redundancy processing unit 151B will be described later with reference to FIG.

ネットワークディスクドライバ部１５３Ｂは、ネットワークディスクドライバ部１５３Ａから受信したアクセス要求に基づいて、ディスクドライバ部１５２Ｂに対してアクセス要求を出力する。これによって、共有ストレージを構成するディスク装置１０５Ｂへのアクセスを実現できる。また、ネットワークディスクドライバ部１５３Ｂは、ネットワーク１８０を介して、共有ストレージを構成するディスク装置１０５Ａにアクセスする。 The network disk driver unit 153B outputs an access request to the disk driver unit 152B based on the access request received from the network disk driver unit 153A. Thereby, access to the disk device 105B constituting the shared storage can be realized. Further, the network disk driver unit 153B accesses the disk device 105A constituting the shared storage via the network 180.

構成設定部１５５Ｂは、副系計算機として稼動するために必要な情報を設定する。構成設定部１５５Ｂが実行する処理の詳細は、図５及び図７を用いて後述する。 The configuration setting unit 155B sets information necessary for operating as a secondary computer. Details of processing executed by the configuration setting unit 155B will be described later with reference to FIGS.

構成回復部１５６Ｂは、主系計算機の障害が回復した後に、再び副系計算機として稼動するための処理を実行する。構成回復部１５６Ｂが実行する処理の詳細は、図１０を用いて後述する。 The configuration recovery unit 156B executes processing for operating as a secondary computer again after the failure of the primary computer is recovered. Details of the processing executed by the configuration recovery unit 156B will be described later with reference to FIG.

なお、メモリ１０３Ｂに格納されるプログラムは、ディスク装置１０５Ｂ又は外部の装置（図示省略）に格納されていてもよい。この場合、ディスク装置１０５Ｂから各プログラムが読み出され、又は、ネットワーク１８０を介して外部の装置から各プログラムが読み出され、メモリ１０３Ｂに格納される。 The program stored in the memory 103B may be stored in the disk device 105B or an external device (not shown). In this case, each program is read from the disk device 105B, or each program is read from an external device via the network 180 and stored in the memory 103B.

また、構成設定部１５５Ａ及び構成設定部１５５Ｂは、同一の機能を提供するプログラムであり、主系計算機の設定処理及び副系計算機の設定処理を実行することができる。構成回復部１５６Ａ及び構成回復部１５６Ｂは、同一の機能を提供するプログラムであり、主系計算機の回復処理及び副系計算機の回復処理を実行することができる。 The configuration setting unit 155A and the configuration setting unit 155B are programs that provide the same function, and can execute the setting process of the main computer and the setting process of the sub computer. The configuration recovery unit 156A and the configuration recovery unit 156B are programs that provide the same function, and can execute recovery processing for the main computer and recovery processing for the sub computer.

以下、ディスクドライバ部１５２Ａ及びディスクドライバ部１５２Ｂを区別しない場合、ディスクドライバ部１５２と記載し、ネットワークディスクドライバ部１５３Ａ及びネットワークディスクドライバ部１５３Ｂを区別しない場合、ネットワークディスクドライバ部１５３と記載する。また、構成設定部１５５Ａ及び構成設定部１５５Ｂを区別しない場合、構成設定部１５５と記載し、構成回復部１５６Ａ及び構成回復部１５６Ｂを区別しない場合、構成回復部１５６と記載する。 Hereinafter, when the disk driver unit 152A and the disk driver unit 152B are not distinguished from each other, the disk driver unit 152 is referred to as the disk driver unit 152. When the configuration setting unit 155A and the configuration setting unit 155B are not distinguished from each other, the configuration setting unit 155 is described. When the configuration recovery unit 156A and the configuration recovery unit 156B are not distinguished from each other, they are described as the configuration recovery unit 156.

以下、各構成の処理について説明する。まず、図２〜図５を用いて、主系計算機及び副系計算機の設定方法について説明する。 Hereinafter, processing of each configuration will be described. First, the setting method of the main computer and the sub computer will be described with reference to FIGS.

図２は、本発明の第一の実施形態における構成設定部１５５Ａ（主系）が実行する処理を説明したフローチャートである。 FIG. 2 is a flowchart illustrating processing executed by the configuration setting unit 155A (main system) in the first embodiment of the present invention.

構成設定部１５５Ａは、計算機システムの管理者からの指示を受信すると処理を開始する（ステップＳ２０１）。このとき、当該指示には、計算機システム内の計算機１０１の総数と構成設定部１５５Ａが実行される計算機１０１Ａの識別番号が含まれる。 The configuration setting unit 155A starts processing upon receiving an instruction from the administrator of the computer system (step S201). At this time, the instruction includes the total number of computers 101 in the computer system and the identification number of the computer 101A on which the configuration setting unit 155A is executed.

例えば、図１の例では、計算機１０１の総数は「２」、計算機１０１Ａの識別番号は「１」、計算機１０１Ｂの識別番号が「２」となる。 For example, in the example of FIG. 1, the total number of computers 101 is “2”, the identification number of the computer 101A is “1”, and the identification number of the computer 101B is “2”.

構成設定部１５５Ａは、副系選択部１５４を呼び出す（ステップＳ２０２）。呼び出された副系選択部１５４が後述する処理（図３参照）を実行することによって、副系計算機となる計算機１０１を決定することができる。このとき、構成設定部１５５Ａは、副系選択部１５４から副系計算機となる計算機１０１の識別情報を取得する。 The configuration setting unit 155A calls the sub system selection unit 154 (step S202). The called sub-system selection unit 154 executes a process described later (see FIG. 3), so that the computer 101 to be a sub-system computer can be determined. At this time, the configuration setting unit 155A acquires the identification information of the computer 101 that is the sub computer from the sub system selection unit 154.

計算機１０１の識別情報は、計算機システム内において計算機１０１を一意に識別できる情報であればよく、例えば、計算機１０１のコンピュータ名、ＭＡＣアドレス及びＩＰアドレス等が考えられる。 The identification information of the computer 101 only needs to be information that can uniquely identify the computer 101 in the computer system. For example, the computer name, MAC address, and IP address of the computer 101 can be considered.

構成設定部１５５Ａは、ネットワークディスクドライバ部１５３Ａに対して副系計算機が備えるネットワークディスクドライバ部１５３と接続するよう指示する（ステップＳ２０３）。図１に示す例では、ネットワークディスクドライバ部１５３Ａは、ネットワークディスクドライバ部１５３Ｂと接続するように指示される。これによって、ネットワークディスクドライバ部１５３Ａは、ネットワークディスクドライバ部１５３Ｂにアクセス要求を送信することができる。 The configuration setting unit 155A instructs the network disk driver unit 153A to connect to the network disk driver unit 153 included in the secondary computer (step S203). In the example shown in FIG. 1, the network disk driver unit 153A is instructed to connect to the network disk driver unit 153B. Accordingly, the network disk driver unit 153A can transmit an access request to the network disk driver unit 153B.

構成設定部１５５Ａは、冗長化処理部１５１Ａに論理デバイス（共有ストレージ）の生成を指示する（ステップＳ２０４）。ここで、論理デバイスとは、サービス提供部１５０が一つのディスク装置として認識可能な論理的なディスク装置である。本発明では、複数の計算機１０１が有するディスク装置１０５が有する記憶領域から生成される論理デバイスが、共有ストレージとして用いられる。 The configuration setting unit 155A instructs the redundancy processing unit 151A to generate a logical device (shared storage) (step S204). Here, the logical device is a logical disk device that the service providing unit 150 can recognize as one disk device. In the present invention, a logical device generated from the storage area of the disk device 105 of the plurality of computers 101 is used as the shared storage.

これによって、ディスク装置１０５Ａ及びディスク装置１０５Ｂが有する記憶領域から論理デバイスが生成される。また、ディスクドライバ部１５２Ａ及びネットワークディスクドライバ部１５３Ａのそれぞれに同一の書込要求を出力される。そのため、論理デバイスへの書込処理では、ディスク装置１０５Ａ及びディスク装置１０５Ｂのそれぞれにデータが書き込まれる。前述のようにデータが、異なるディスク装置１０５に格納されるため、主系計算機に障害が発生しても副系計算機のディスク装置１０５に格納されるデータを用いてサービスを継続することができる。 As a result, logical devices are generated from the storage areas of the disk device 105A and the disk device 105B. The same write request is output to each of the disk driver unit 152A and the network disk driver unit 153A. Therefore, in the writing process to the logical device, data is written to each of the disk device 105A and the disk device 105B. Since the data is stored in the different disk devices 105 as described above, even if a failure occurs in the main computer, the service can be continued using the data stored in the disk device 105 of the secondary computer.

なお、論理デバイス上には、分散共有ファイルシステムを構築することができる。共有ファイルシステム上に配置されたファイルには、すべての計算機１０１が同じようにアクセスできる。 A distributed shared file system can be constructed on the logical device. All the computers 101 can access the files arranged on the shared file system in the same way.

構成設定部１５５Ａは、サービス提供部１５０に論理デバイスにアクセスするように指示し、処理を終了する（ステップＳ２０５、ステップＳ２０６）。具体的には、構成設定部１５５Ａは、サービスに用いる記憶領域として論理デバイスの識別情報を通知する。これによってサービス提供部１５０は、論理デバイスに対してアクセス要求を出力する。 The configuration setting unit 155A instructs the service providing unit 150 to access the logical device, and ends the process (steps S205 and S206). Specifically, the configuration setting unit 155A notifies the identification information of the logical device as a storage area used for the service. As a result, the service providing unit 150 outputs an access request to the logical device.

図３は、本発明の第一の実施形態における副系選択部１５４が実行する処理を説明するフローチャートである。 FIG. 3 is a flowchart illustrating processing executed by the sub system selection unit 154 according to the first embodiment of this invention.

副系選択部１５４は、構成設定部１５５Ａから呼び出されると処理を開始する（ステップＳ３０１）。なお、副系選択部１５４は、呼び出されるときに計算機システムを構成する計算機１０１の総数と、構成設定部１５５Ａを実行する計算機１０１の識別番号とを構成設定部１５５Ａから受け取る。 When called from the configuration setting unit 155A, the sub system selection unit 154 starts processing (step S301). When called, the secondary system selection unit 154 receives from the configuration setting unit 155A the total number of computers 101 constituting the computer system and the identification number of the computer 101 that executes the configuration setting unit 155A.

副系選択部１５４は、自身の識別番号が計算機システムを構成する計算機１０１に割り当てられた識別番号のうち、最大の識別番号であるか否かを判定する（ステップＳ３０２）。例えば、副系選択部１５４は、自身の識別番号が計算機１０１の総数と同一であるか否かを判定する。自身の識別番号が計算機１０１の総数と同一である場合には、自身の識別番号が最大の識別番号であると判定される。 The sub system selection unit 154 determines whether or not its own identification number is the maximum identification number among the identification numbers assigned to the computers 101 constituting the computer system (step S302). For example, the sub system selection unit 154 determines whether or not its own identification number is the same as the total number of computers 101. If its own identification number is the same as the total number of computers 101, it is determined that its own identification number is the maximum identification number.

自身の識別番号が最大であると判定された場合、副系選択部１５４は、最小の識別番号（例えば、識別番号が「１」）の計算機１０１を副系計算機に選択し、ステップＳ３０５に進む（ステップＳ３０３）。 When it is determined that its own identification number is the largest, the sub system selection unit 154 selects the computer 101 having the smallest identification number (for example, the identification number is “1”) as the sub system computer, and the process proceeds to step S305. (Step S303).

自身の識別番号が最大でないと判定された場合、副系選択部１５４は、自身の識別番号に「１」を加算した識別番号の計算機１０１を副系計算機に選択し、ステップＳ３０５に進む（ステップＳ３０４）。 When it is determined that its own identification number is not the maximum, the sub system selection unit 154 selects the computer 101 having the identification number obtained by adding “1” to its own identification number as the sub system computer, and proceeds to step S305 (step S305). S304).

副系選択部１５４は、副系計算機として選択された計算機１０１の識別情報を取得し、処理を終了する（ステップＳ３０５、ステップＳ３０６）。 The sub system selection unit 154 acquires the identification information of the computer 101 selected as the sub system computer, and ends the process (steps S305 and S306).

例えば、予め、識別番号と識別情報とを対応づけたデータを準備しておき、副系選択部１５４が、副系計算機の識別番号に基づいて当該データを参照することによって識別情報を取得する方法が考えられる。 For example, a method in which data in which an identification number is associated with identification information is prepared in advance, and the subsystem selection unit 154 obtains the identification information by referring to the data based on the identification number of the subsystem computer Can be considered.

なお、副系計算機の選択方法は、図３に示すものに限定されず、計算機１０１のリソース量、使用率等に基づいて選択する方法であってもよい。また、２台以上の副系計算機を選択する場合、副系計算機として選択された計算機１０１の識別番号を新たな入力として図３に示す処理を繰り返し実行すればよい。 The selection method of the subordinate computer is not limited to that shown in FIG. 3, and may be a method of selecting based on the resource amount, usage rate, etc. of the computer 101. When two or more secondary computers are selected, the process shown in FIG. 3 may be repeatedly executed with the identification number of the computer 101 selected as the secondary computer as a new input.

図４は、本発明の第一の実施形態における冗長化処理部１５１Ａ（主系）が実行する処理を説明するフローチャートである。 FIG. 4 is a flowchart for explaining processing executed by the redundancy processing unit 151A (main system) in the first embodiment of the present invention.

冗長化処理部１５１Ａは、構成設定部１５５Ａから論理デバイスの生成指示を受信すると処理を開始する（ステップＳ４０１）。冗長化処理部１５１Ａは、副系計算機の識別情報を取得する（ステップＳ４０２）。 The redundancy processing unit 151A starts processing upon receiving a logical device generation instruction from the configuration setting unit 155A (step S401). The redundancy processing unit 151A acquires the identification information of the secondary computer (step S402).

冗長化処理部１５１Ａは、主系計算機のディスク装置１０５及び副系計算機のディスク装置１０５を統合して論理デバイスを生成する（ステップＳ４０３）。図１に示す例では、ディスク装置１０５Ａ及びディスク装置１０５Ｂが統合された論理デバイスが生成される。 The redundancy processing unit 151A generates a logical device by integrating the disk device 105 of the primary computer and the disk device 105 of the secondary computer (step S403). In the example shown in FIG. 1, a logical device in which the disk device 105A and the disk device 105B are integrated is generated.

冗長化処理部１５１Ａは、論理デバイスと各ディスク装置１０５とを対応づけた冗長化情報を生成する（ステップＳ４０４）。冗長化情報は、少なくとも、論理デバイスの識別情報及びディスク装置１０５の識別情報を含む。なお、冗長化情報は、計算機１０１の識別情報等その他の情報を含んでいてもよい。 The redundancy processing unit 151A generates redundancy information in which the logical device is associated with each disk device 105 (step S404). The redundancy information includes at least logical device identification information and disk device 105 identification information. The redundancy information may include other information such as identification information of the computer 101.

冗長化処理部１５１Ａは、冗長化情報を各ディスク装置１０５へ書き込み、処理を終了する（ステップＳ４０５、ステップＳ４０６）。図１に示す例では、冗長化処理部１５１Ａは、ディスクドライバ部１５２Ａ及びネットワークディスクドライバ部１５３Ａのそれぞれに冗長化情報の書込要求を出力する。これによって、共有ストレージ、すなわち、論理デバイスを構成するディスク装置１０５Ａ及びディスク装置１０５Ｂに冗長化情報が格納される。 The redundancy processing unit 151A writes the redundancy information to each disk device 105, and ends the processing (steps S405 and S406). In the example shown in FIG. 1, the redundancy processing unit 151A outputs a request for writing redundancy information to each of the disk driver unit 152A and the network disk driver unit 153A. As a result, the redundancy information is stored in the shared storage, that is, the disk device 105A and the disk device 105B constituting the logical device.

図５は、本発明の第一の実施形態における構成設定部１５５Ｂ（副系）が実行する処理を説明するフローチャートである。 FIG. 5 is a flowchart for explaining processing executed by the configuration setting unit 155B (secondary system) according to the first embodiment of the present invention.

構成設定部１５５Ｂは、計算機システムの管理者からの指示を受信すると処理を開始する（ステップＳ５０１）。なお、構成設定部１５５Ｂの処理は、構成設定部１５５Ａの処理よりも前に実行される。 The configuration setting unit 155B starts processing upon receiving an instruction from the administrator of the computer system (step S501). Note that the processing of the configuration setting unit 155B is executed before the processing of the configuration setting unit 155A.

構成設定部１５５Ｂは、ネットワークディスクドライバ部１５３Ｂに対して、他の計算機１０１から自身のディスク装置１０５へのアクセスを許可するよう設定し、処理を終了する（ステップＳ５０２、ステップＳ５０３）。図１に示す例では、計算機１０１Ａからディスク装置１０５Ｂへのアクセスが許可される。構成設定部１５５Ｂが実行する処理によって、計算機１０１Ａがディスク装置１０５Ｂにアクセス可能となり、論理デバイスを構成する記憶領域を提供することが可能となる。 The configuration setting unit 155B sets the network disk driver unit 153B to permit access from the other computer 101 to its own disk device 105, and ends the processing (steps S502 and S503). In the example shown in FIG. 1, access from the computer 101A to the disk device 105B is permitted. By the processing executed by the configuration setting unit 155B, the computer 101A can access the disk device 105B and can provide a storage area constituting the logical device.

以上が、共有ストレージの構成時に実行される処理である。図２〜図５の処理が終了した後、サービス提供部１５０は、論理デバイスを用いて所定のサービスを提供する。 The above is the processing executed when the shared storage is configured. After the processes in FIGS. 2 to 5 are completed, the service providing unit 150 provides a predetermined service using the logical device.

このとき、冗長化処理部１５１は、サービス提供部１５０から論理デバイスへの書込要求を受信すると、ディスク装置１０５Ａから冗長化情報を読み出し、当該冗長化情報に基づいて、ディスク装置１０５Ａ及びディスク装置１０５Ｂのそれぞれにデータを書き込む。なお、読み出された冗長化情報は、メモリ１０３Ａに一時的に格納される。これによって、論理デバイスへのアクセス時にディスク装置１０５へのＩ／Ｏ発生を低減することができる。 At this time, when receiving the write request to the logical device from the service providing unit 150, the redundancy processing unit 151 reads the redundancy information from the disk device 105A, and based on the redundancy information, the disk device 105A and the disk device Data is written to each of 105B. Note that the read redundancy information is temporarily stored in the memory 103A. This can reduce the occurrence of I / O to the disk device 105 when accessing the logical device.

また、冗長化処理部１５１は、サービス提供部１５０から論理デバイスへの読出要求を受信すると、冗長化情報を参照して、ディスク装置１０５Ａからデータを読み出す。 Further, upon receiving a read request from the service providing unit 150 to the logical device, the redundancy processing unit 151 reads data from the disk device 105A with reference to the redundancy information.

次に、図６〜図８を用いて、主系計算機として稼動する計算機１０１Ａに障害が発生した場合に実行される処理について説明する。 Next, processing executed when a failure occurs in the computer 101A operating as the main computer will be described with reference to FIGS.

図６は、本発明の第一の実施形態における障害制御部１７０が実行する処理を説明するフローチャートである。 FIG. 6 is a flowchart illustrating processing executed by the failure control unit 170 according to the first embodiment of this invention.

障害制御部１７０は、周期的に、主系計算機を監視しており、主系計算機の障害を検知すると処理を開始する（ステップＳ６０１）。図１に示す例では、障害制御部１７０は、主系計算機として稼動する計算機１０１Ａを監視し、計算機１０１Ａの障害を検知すると処理を開始する。 The failure control unit 170 periodically monitors the main computer, and starts processing when a failure of the main computer is detected (step S601). In the example illustrated in FIG. 1, the failure control unit 170 monitors the computer 101A that operates as the primary computer, and starts processing when it detects a failure in the computer 101A.

監視方法としては、ネットワーク１８０を介した通信を監視する方法などが考えられる。ただし、本発明は、主系計算機の障害検出方法に限定されない。 As a monitoring method, a method of monitoring communication via the network 180 can be considered. However, the present invention is not limited to the failure detection method of the main computer.

なお、副系計算機が複数ある場合には、予め、副系計算機に優先順位を与えておき、優先順位が高い副系計算機が主導的に処理を実行するように構成すればよい。 If there are a plurality of secondary computers, priorities may be given to the secondary computers in advance, and the secondary computer having a higher priority may be configured to execute the process.

障害制御部１７０は、ネットワークディスクドライバ部１５３Ｂに対して、論理デバイスを構成する他の計算機１０１からディスク装置１０５へのアクセスを禁止するように指示する（ステップＳ６０２）。副系計算機が２台以上ある場合には、主系計算機だけではなく他の副系計算機からのアクセスも禁止される。図１に示す例では、計算機１０１Ａからディスク装置１０５Ｂへのアクセスが禁止される。 The failure control unit 170 instructs the network disk driver unit 153B to prohibit access to the disk device 105 from another computer 101 configuring the logical device (step S602). When there are two or more secondary computers, access not only from the primary computer but also from other secondary computers is prohibited. In the example shown in FIG. 1, access from the computer 101A to the disk device 105B is prohibited.

障害制御部１７０は、構成設定部１５５Ｂを呼び出す（ステップＳ６０３）。障害制御部１７０は、構成設定部１５５Ｂからの処理完了の通知を待つ。構成設定部１５５Ｂが実行する処理の詳細は、図７を用いて後述する。 The failure control unit 170 calls the configuration setting unit 155B (step S603). The failure control unit 170 waits for notification of processing completion from the configuration setting unit 155B. Details of the processing executed by the configuration setting unit 155B will be described later with reference to FIG.

その後、障害制御部１７０は、代替サービス提供部１７１の処理の開始を指示して、処理を終了する（ステップＳ６０４、ステップＳ６０５）。 Thereafter, the failure control unit 170 instructs the alternative service providing unit 171 to start processing, and ends the processing (steps S604 and S605).

以上の処理によって、代替サービス提供部１７１が、サービス提供部１５０に代わってサービスを継続することができる。 Through the above processing, the alternative service providing unit 171 can continue the service in place of the service providing unit 150.

ステップＳ６０２では、計算機１０１Ａからディスク装置１０５Ｂへのアクセスを禁止している。これは、計算機１０１Ａが予期せず再び動作を始めた場合に、代替サービス提供部１７１及びサービス提供部１５０からディスク装置１０５Ｂへのアクセスが衝突してデータが失われる危険を回避するためである。 In step S602, access from the computer 101A to the disk device 105B is prohibited. This is to avoid the risk of data loss due to collision between accesses from the alternative service providing unit 171 and the service providing unit 150 to the disk device 105B when the computer 101A starts again unexpectedly.

図７は、本発明の第一の実施形態における構成設定部１５５Ｂ（副系）が、障害発生時に実行する処理を説明するフローチャートである。 FIG. 7 is a flowchart for explaining processing executed by the configuration setting unit 155B (secondary system) in the first embodiment of the present invention when a failure occurs.

構成設定部１５５Ｂは、障害制御部１７０から読み出されると処理を開始する（ステップＳ７０１）。構成設定部１５５Ｂは、ネットワークディスクドライバ部１５３Ｂに対して他の副系計算機が備えるネットワークディスクドライバ部１５３Ｂと接続するよう指示する（ステップＳ７０２）。これによって、ネットワークディスクドライバ部１５３Ｂは、他の副系計算機のネットワークディスクドライバ部１５３にアクセス要求を送信することができる。 The configuration setting unit 155B starts processing when read from the failure control unit 170 (step S701). The configuration setting unit 155B instructs the network disk driver unit 153B to connect to the network disk driver unit 153B included in another secondary computer (step S702). As a result, the network disk driver unit 153B can transmit an access request to the network disk driver unit 153 of the other secondary computer.

なお、副系計算機はすでに選択されているため、ステップＳ２０２に対応する処理は省略される。また、構成設定部１５５Ｂは、冗長化情報を参照することによって他の副系計算機を特定することができる。 Since the secondary computer has already been selected, the processing corresponding to step S202 is omitted. In addition, the configuration setting unit 155B can specify another sub computer by referring to the redundancy information.

構成設定部１５５Ｂは、冗長化処理部１５１Ｂに論理デバイス（共有ストレージ）の生成を指示する（ステップＳ７０３）。図１に示す例では、計算機１０１Ｂが備えるディスク装置１０５Ｂから論理デバイスが生成される。 The configuration setting unit 155B instructs the redundancy processing unit 151B to create a logical device (shared storage) (step S703). In the example illustrated in FIG. 1, a logical device is generated from the disk device 105B included in the computer 101B.

構成設定部１５５Ｂは、代替サービス提供部１７１に新たに生成された論理デバイスにアクセスするように指示して、処理を終了する（ステップＳ７０４、ステップＳ７０５）。 The configuration setting unit 155B instructs the alternative service providing unit 171 to access the newly created logical device, and ends the process (steps S704 and S705).

図８は、本発明の第一の実施形態における冗長化処理部１５１Ｂ（副系）が実行する処理を説明するフローチャートである。 FIG. 8 is a flowchart for explaining processing executed by the redundancy processing unit 151B (secondary system) according to the first embodiment of this invention.

冗長化処理部１５１Ｂは、構成設定部１５５Ｂから読み出されると処理を開始する（ステップＳ８０１）。冗長化処理部１５１Ｂは、副系計算機の識別情報を取得する（ステップＳ８０２）。ステップＳ８０２の処理は、ステップＳ４０２と同一の処理である。 The redundancy processing unit 151B starts processing when read from the configuration setting unit 155B (step S801). The redundancy processing unit 151B acquires the identification information of the secondary computer (step S802). The process of step S802 is the same process as step S402.

冗長化処理部１５１Ｂは、副系計算機のディスク装置１０５を統合して論理デバイスを生成する（ステップＳ８０３）。 The redundancy processing unit 151B generates a logical device by integrating the disk devices 105 of the secondary computers (step S803).

ステップＳ８０３の処理はステップＳ４０３と異なり、主系計算機のディスク装置１０５を除いたディスク装置１０５から論理デバイスが生成される。すなわち、副系計算機のディスク装置１０５のみから論理デバイスが生成される。 The processing in step S803 differs from step S403 in that a logical device is generated from the disk device 105 excluding the disk device 105 of the main computer. That is, a logical device is generated only from the disk device 105 of the secondary computer.

図１に示す例では、ディスク装置１０５Ｂのみから論理デバイスが生成される。 In the example shown in FIG. 1, a logical device is generated only from the disk device 105B.

冗長化処理部１５１Ｂは、論理デバイスと各ディスク装置１０５とを対応づけた冗長化情報を生成する（ステップＳ８０４）。ステップＳ８０４の処理は、ステップＳ４０４と同一の処理である。 The redundancy processing unit 151B generates redundancy information in which the logical device is associated with each disk device 105 (step S804). The process of step S804 is the same process as step S404.

冗長化処理部１５１Ｂは、冗長化情報を各ディスク装置１０５へ書き込み、処理を終了する（ステップＳ８０５、ステップＳ８０６）。図１に示す例では、冗長化処理部１５１Ｂは、ネットワークディスクドライバ部１５３Ｂに冗長化情報の書込要求を出力する。これによって、論理デバイスを構成するディスク装置１０５Ｂに冗長化情報が格納される。 The redundancy processing unit 151B writes the redundancy information to each disk device 105 and ends the processing (steps S805 and S806). In the example shown in FIG. 1, the redundancy processing unit 151B outputs a request for writing redundancy information to the network disk driver unit 153B. As a result, the redundancy information is stored in the disk device 105B constituting the logical device.

なお、新たに生成された論理デバイスの識別情報は、最初に生成された論理デバイスの識別情報と同一となるように設定する。これによって、障害発生前と同一の動作環境の下サービスを継続することができる。ただし、異なる論理デバイスの識別情報が設定されてもよい。 The newly generated logical device identification information is set to be the same as the first generated logical device identification information. As a result, the service can be continued under the same operating environment as before the occurrence of the failure. However, identification information of different logical devices may be set.

本実施形態では、前述した処理によって耐障害性を高める効果がある。通常、冗長化処理部１５１Ｂは、書込要求を受信した場合、ディスク装置１０５に格納される冗長化情報を読み出して、論理デバイスを構成するディスク装置１０５を特定する。さらに、冗長化処理部１５１Ｂは、特定された全てのディスク装置１０５に同一のデータを書き込む。 In the present embodiment, there is an effect of improving the fault tolerance by the processing described above. Normally, when the redundancy processing unit 151B receives a write request, the redundancy processing unit 151B reads the redundancy information stored in the disk device 105 and identifies the disk device 105 constituting the logical device. Further, the redundancy processing unit 151B writes the same data to all the specified disk devices 105.

しかし、主系計算機として稼働する計算機１０１に障害が発生すると、論理デバイスを構成する主系計算機のディスク装置１０５を利用できない状態となる。そのため、冗長化処理部１５１Ｂは、論理デバイスにエラーが発生しており、論理デバイスを利用できないと判定する。したがって、冗長化処理部１５１Ｂは、論理デバイスを構成する複数のディスク装置１０５にデータを書き込むことができない。すなわち、データの冗長化が実現できない。 However, when a failure occurs in the computer 101 that operates as the primary computer, the disk device 105 of the primary computer constituting the logical device cannot be used. Therefore, the redundancy processing unit 151B determines that an error has occurred in the logical device and the logical device cannot be used. Therefore, the redundancy processing unit 151B cannot write data to the plurality of disk devices 105 constituting the logical device. That is, data redundancy cannot be realized.

そこで、本実施形態では、障害制御部１７０から呼び出された冗長化処理部１５１Ｂが、主系計算機のディスク装置１０５を除く他のディスク装置１０５を用いて新たに論理デバイスを構築する。これによって、冗長化処理部１５１Ｂは、代替サービス提供部１７１から書込要求を受信した場合に、複数のディスク装置１０５にデータを書き込むことができる。 Therefore, in this embodiment, the redundancy processing unit 151B called from the failure control unit 170 constructs a new logical device using the other disk devices 105 other than the disk device 105 of the primary computer. As a result, the redundancy processing unit 151B can write data to the plurality of disk devices 105 when receiving a write request from the alternative service providing unit 171.

また、図１に示すように、論理デバイスを構成するディスク装置１０５が１台のみであっても、代替サービス提供部１７１からは論理デバイスに障害が発生しているとは認識されずにサービスを継続することができるという効果がある。 In addition, as shown in FIG. 1, even if there is only one disk device 105 constituting a logical device, the alternative service providing unit 171 does not recognize that a failure has occurred in the logical device, There is an effect that it can be continued.

図９は、本発明の第一の実施形態における構成回復部１５６Ａ（主系）が実行する処理を説明するフローチャートである。 FIG. 9 is a flowchart for explaining processing executed by the configuration recovery unit 156A (main system) in the first embodiment of the present invention.

構成回復部１５６Ａは、障害が回復した計算機１０１Ａが再起動した後、計算機システムの管理者から処理開始の指示を受信すると処理を開始する（ステップＳ９０１）。構成回復部１５６Ａは、副系計算機の構成回復部１５６Ｂを呼び出す（ステップＳ９０２）。これによって、代替サービス提供部１７１からサービス提供部１５０へサービスを引き継ぐための処理（図１０参照）が実行される。 The configuration recovery unit 156A starts processing upon receiving an instruction to start processing from the administrator of the computer system after the computer 101A in which the failure has been recovered restarts (step S901). The configuration recovery unit 156A calls the configuration recovery unit 156B of the secondary computer (step S902). As a result, processing for taking over the service from the alternative service providing unit 171 to the service providing unit 150 (see FIG. 10) is executed.

構成回復部１５６Ａは、ネットワークディスクドライバ部１５３Ａに対して副系計算機が備えるネットワークディスクドライバ部１５３Ｂと接続するよう指示する（ステップＳ９０３）。ステップＳ９０３の処理は、ステップＳ２０３と同一の処理である。ステップＳ９０３の処理によって、計算機１０１Ａは論理デバイスへアクセスが可能となる。 The configuration recovery unit 156A instructs the network disk driver unit 153A to connect to the network disk driver unit 153B included in the secondary computer (step S903). The process of step S903 is the same process as step S203. By the processing in step S903, the computer 101A can access the logical device.

構成回復部１５６Ａは、冗長化処理部１５１Ａに対して、副系計算機から冗長化情報の取得を指示する（ステップＳ９０４）。当該指示を受信した冗長化処理部１５１Ａは、ネットワークディスクドライバ部１５３Ａに、副系計算機から冗長化情報を取得するためのアクセス要求を出力する。 The configuration recovery unit 156A instructs the redundancy processing unit 151A to acquire redundancy information from the secondary computer (step S904). The redundancy processing unit 151A that has received the instruction outputs an access request for acquiring redundancy information from the secondary computer to the network disk driver unit 153A.

ステップＳ９０４では、ステップ８０３において生成された新たな論理デバイスの冗長化構成が読み出される。これによって、冗長化構成を維持したままサービスを継続することができる。 In step S904, the redundant configuration of the new logical device generated in step 803 is read. As a result, the service can be continued while maintaining the redundant configuration.

図１に示す例では、論理デバイスを構成するディスク装置１０５Ｂが一個であるためデータの二重化はできないが、論理デバイスに障害が発生しているとは認識されることなくサービスを継続することができるという効果がある。 In the example shown in FIG. 1, data cannot be duplicated because there is only one disk device 105B constituting the logical device, but the service can be continued without recognizing that a failure has occurred in the logical device. There is an effect.

構成回復部１５６Ａは、サービス提供部１５０に論理デバイスにアクセスするように指示し、さらに、サービスの開始を指示する（ステップＳ９０５）。ステップＳ９０５の処理は、ステップＳ２０５と同一の処理である。 The configuration recovery unit 156A instructs the service providing unit 150 to access the logical device, and further instructs the start of the service (step S905). The process of step S905 is the same process as step S205.

構成回復部１５６Ａは、冗長化処理部１５１Ａに対してディスク装置１０５Ａを管理下に置くよう指示する、すなわち、論理デバイスの再構成を指示する（ステップＳ９０６）。 The configuration recovery unit 156A instructs the redundancy processing unit 151A to place the disk device 105A under management, that is, instructs the reconfiguration of the logical device (step S906).

当該指示を受信した冗長化処理部１５１Ａは、図４に示す処理と同様の処理を実行する。具体的には、ステップＳ４０３では、冗長化処理部１５１Ａは、主系計算機のディスク装置１０５及び副系計算機のディスク装置１０５の全てを用いて論理ディスクを生成する。すなわち、障害が発生する前と同一の構成の論理デバイスが生成される。また、ステップＳ４０５では、冗長化処理部１５１Ａは、主系計算機のディスク装置１０５及び副系計算機のディスク装置１０５のそれぞれに冗長化情報を書き込む。なお、この時点では、論理デバイスを構成する主系計算機のディスク装置１０５にはデータが反映されていない。 The redundancy processing unit 151A that has received the instruction executes processing similar to the processing illustrated in FIG. Specifically, in step S403, the redundancy processing unit 151A generates a logical disk using all of the disk device 105 of the primary computer and the disk device 105 of the secondary computer. That is, a logical device having the same configuration as that before the failure occurs is generated. In step S405, the redundancy processing unit 151A writes the redundancy information to each of the disk device 105 of the primary computer and the disk device 105 of the secondary computer. At this point, data is not reflected on the disk device 105 of the main computer constituting the logical device.

以上の処理によって、障害発生前の論理デバイスの構成を回復することができる。図１に示す例では、ステップＳ７０６の処理によって、データの二重書き込みが可能となる。 With the above processing, the configuration of the logical device before the failure can be recovered. In the example shown in FIG. 1, the data can be double-written by the processing in step S706.

構成回復部１５６Ａは、論理デバイスが生成された後、冗長化処理部１５１Ａに、副系計算機のディスク装置１０５に格納されるデータを主系計算機のディスク装置１０５にコピーするように指示し、処理を終了する（ステップＳ９０７、ステップＳ９０８）。 After the logical device is generated, the configuration recovery unit 156A instructs the redundancy processing unit 151A to copy the data stored in the disk device 105 of the secondary computer to the disk device 105 of the primary computer. Is finished (step S907, step S908).

当該指示を受信した冗長化処理部１５１Ａは、ネットワークディスクドライバ部１５３Ａに対してアクセス要求（読出要求）を出力する。これによって、副系計算機のディスク装置１０５からデータを取得することができる。また、冗長化処理部１５１Ａは、ディスクドライバ部１５２Ａに対して、取得されたデータのアクセス要求（書込要求）を出力する。これによって、主系計算機のディスク装置１０５にデータが書き込まれる。 The redundancy processing unit 151A that has received the instruction outputs an access request (read request) to the network disk driver unit 153A. As a result, data can be acquired from the disk device 105 of the secondary computer. Also, the redundancy processing unit 151A outputs an access request (write request) for the acquired data to the disk driver unit 152A. As a result, data is written to the disk device 105 of the main computer.

ステップＳ９０７の処理によって、論理デバイスを構成する全てのディスク装置に同一のデータが反映される。すなわち、主系計算機が停止している間に代替サービス提供部１７１によって論理デバイスに書き込まれたデータが主系計算機のディスク装置１０５に書き込まれる。 By the processing in step S907, the same data is reflected in all the disk devices constituting the logical device. That is, the data written to the logical device by the alternative service providing unit 171 while the main computer is stopped is written to the disk device 105 of the main computer.

図１０は、本発明の第一の実施形態における構成回復部１５６Ｂ（副系）が実行する処理を説明するフローチャートである。 FIG. 10 is a flowchart illustrating processing executed by the configuration recovery unit 156B (secondary system) according to the first embodiment of this invention.

構成回復部１５６Ｂは、構成回復部１５６Ａから呼び出されると処理を開始する（ステップＳ１００１）。構成回復部１５６Ｂは、代替サービス提供部１７１を停止する（ステップＳ１００２）。 The configuration recovery unit 156B starts processing when called from the configuration recovery unit 156A (step S1001). The configuration recovery unit 156B stops the alternative service providing unit 171 (step S1002).

構成回復部１５６Ｂは、ネットワークディスクドライバ部１５３Ｂに対して、計算機１０１Ａからディスク装置１０５Ｂへのアクセスを許可するよう設定し、処理を終了する（ステップＳ１００３、ステップＳ１００４）。 The configuration recovery unit 156B sets the network disk driver unit 153B to permit access from the computer 101A to the disk device 105B, and ends the processing (steps S1003 and S1004).

以上の処理によって、他の計算機１０１からネットワークディスクドライバ部１５３Ｂへのアクセスを再開させ、計算機１０１Ａからのアクセスが可能となる。 With the above processing, access from the other computer 101 to the network disk driver unit 153B is resumed, and access from the computer 101A becomes possible.

なお、計算機１０１Ｂに障害が発生した場合、計算機１０１Ｂの再起動後に構成設定部１５５Ｂを起動させる。これによって、ネットワークディスクドライバ部１５３Ａからネットワークディスクドライバ部１５３Ｂへのアクセスを回復できる。その後、冗長化処理部１５１Ａが、自動的にディスク装置１０５Ａから１０５Ｂへデータをコピーする。 When a failure occurs in the computer 101B, the configuration setting unit 155B is started after the computer 101B is restarted. As a result, access from the network disk driver unit 153A to the network disk driver unit 153B can be recovered. Thereafter, the redundancy processing unit 151A automatically copies data from the disk devices 105A to 105B.

第一の実施形態では、計算機１０１Ａがサービスを提供する主系計算機、計算機１０１Ｂが副系計算機であるものとして説明したが、一例であって、それぞれの計算機１０１が同一の構成を備えてもよい。これによって、お互いが他方の副系計算機となれるような構成が可能となる。 In the first embodiment, it has been described that the computer 101A is a primary computer that provides a service, and the computer 101B is a secondary computer. However, this is an example, and each computer 101 may have the same configuration. . As a result, a configuration is possible in which each other can be the other subsystem computer.

第一の実施形態によれば、複数の計算機のディスク装置を用いて共有ストレージを構成した場合に、いずれかの計算機に障害が発生しても、副系計算機の代替サービス提供部１７１は、共有ストレージを用いてサービスを継続することができる。すなわち、副系計算機の代替サービス提供部１７１は、共有ストレージの冗長化構成が維持されているものと認識することができる。 According to the first embodiment, when a shared storage is configured using disk devices of a plurality of computers, even if a failure occurs in any computer, the alternative service providing unit 171 of the secondary computer Services can be continued using storage. That is, the alternative service providing unit 171 of the secondary computer can recognize that the redundant configuration of the shared storage is maintained.

これによって、専用のストレージシステムを用いることなく、一般的な計算機を用いて共有ストレージを構築することができ、また、障害への耐性も確保することができる。 As a result, it is possible to construct a shared storage using a general computer without using a dedicated storage system, and it is possible to ensure resistance to failures.

（第二の実施形態） (Second embodiment)

第二の実施形態について説明する。第二の実施形態では、具体的な装置構成を用いて実際のシステム構築例を説明する。 A second embodiment will be described. In the second embodiment, an actual system construction example will be described using a specific device configuration.

以下、第一の実施形態との差異を中心に説明する。 Hereinafter, the difference from the first embodiment will be mainly described.

図１１は、本発明の第二の実施形態の計算機システムの構成を示したブロック図である。 FIG. 11 is a block diagram showing a configuration of a computer system according to the second embodiment of this invention.

計算機システムの構成は、第一の実施形態と同一であるため説明を省略する。 Since the configuration of the computer system is the same as that of the first embodiment, description thereof is omitted.

第二の実施形態では、計算機１０１のハードウェア構成が一部異なる。具体的には、第二の実施形態では、ネットワークインタフェース１０６Ａ、１０６Ｂがイーサネットコントローラ１１０１Ａ、１１０１Ｂとなる（イーサネットは登録商標、以下同じ。）。その他のハードウェア構成は、第一の実施形態と同一であるため説明を省略する。 In the second embodiment, the hardware configuration of the computer 101 is partially different. Specifically, in the second embodiment, the network interfaces 106A and 106B are Ethernet controllers 1101A and 1101B (Ethernet is a registered trademark, the same applies hereinafter). Since other hardware configurations are the same as those of the first embodiment, the description thereof is omitted.

また第二の実施形態では、各計算機１０１が備えるソフトウェア構成が異なる。具体的には、第二の実施形態では、冗長化処理部１５１Ａ、１５１ＢはＯＳが備えるソフトウェアＲＡＩＤ機能部１１１１Ａ、１１１１Ｂとなり、ネットワークディスクドライバ部１５３ＡはｉＳＣＳＩイニシエータ１１１２となり、ネットワークディスクドライバ部１５３ＢはｉＳＣＳＩターゲット１１１３となる。また、計算機１０１Ｂは、ネットワーク１８０を介した計算機１０１からのアクセスを制御する構成としてネットワークフィルタ部１１１４を備える。 In the second embodiment, the software configuration of each computer 101 is different. Specifically, in the second embodiment, the redundancy processing units 151A and 151B are the software RAID function units 1111A and 1111B included in the OS, the network disk driver unit 153A is the iSCSI initiator 1112, and the network disk driver unit 153B is the iSCSI. It becomes the target 1113. Further, the computer 101B includes a network filter unit 1114 as a configuration for controlling access from the computer 101 via the network 180.

構成設定部１５５Ａの処理は、以下の点が異なる。ステップＳ２０２では、構成設定部１５５Ａは、副系計算機の名称を取得する。また、ステップＳ２０３では、構成設定部１５５Ａは、取得された副系計算機の名称をＩＰｖ４のアドレス又はＩＰｖ６のアドレスに変換し、変換されたアドレスを用いてｉＳＣＳＩターゲット１１１３に接続するように指示する。その他の処理は、第一の実施形態と同一である。 The processing of the configuration setting unit 155A is different in the following points. In step S202, the configuration setting unit 155A acquires the name of the secondary computer. In step S203, the configuration setting unit 155A converts the acquired name of the secondary computer into an IPv4 address or an IPv6 address, and instructs to connect to the iSCSI target 1113 using the converted address. Other processes are the same as those in the first embodiment.

障害制御部１７０の処理は、以下の点が異なる。 The processing of the failure control unit 170 is different in the following points.

ステップＳ６０２では、障害制御部１７０は、ｉＳＣＳＩターゲット１１１３を停止させ、又は、ネットワークフィルタ部１１１４に対してｉＳＣＳＩターゲット１１１３に送信されたＴＣＰ／ＩＰプロトコルのパケットを破棄するように指示する。その他の処理は、第一の実施形態と同一である。 In step S602, the failure control unit 170 stops the iSCSI target 1113 or instructs the network filter unit 1114 to discard the TCP / IP protocol packet transmitted to the iSCSI target 1113. Other processes are the same as those in the first embodiment.

構成回復部１５６Ｂの処理は、以下の点が異なる。 The processing of the configuration recovery unit 156B is different in the following points.

ステップＳ８０３では、構成回復部１５６Ｂが、ｉＳＣＳＩターゲット１１１３を再開させ、又は、ネットワークフィルタ部１１１４に対してｉＳＣＳＩターゲット１１１３に送信されるＴＣＰ／ＩＰプロトコルのパケットの破棄を中止するように指示する。その他の処理は、第一の実施形態と同一である。 In step S803, the configuration recovery unit 156B instructs the iSCSI target 1113 to restart, or instructs the network filter unit 1114 to cancel discarding of the TCP / IP protocol packet transmitted to the iSCSI target 1113. Other processes are the same as those in the first embodiment.

第二の実施形態によれば、ソフトウェアＲＡＩＤ、ｉＳＣＳＩターゲット、ｉＳＣＳＩイニシエータ及びイーサネットを用いて、安価に共有ストレージを構築することができる。 According to the second embodiment, a shared storage can be constructed at low cost by using software RAID, iSCSI target, iSCSI initiator, and Ethernet.

なお、本発明は前述した実施形態に限定されるものではなく、様々な変形例が含まれる。例えば、前述した実施形態は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施形態の構成の一部を他の実施形態の構成に置き換えることが可能であり、また、ある実施形態の構成に他の実施形態の構成を加えることも可能である。また、各実施形態の構成の一部について、他の構成の追加、削除、置換をすることが可能である。 In addition, this invention is not limited to embodiment mentioned above, Various modifications are included. For example, the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described. Further, a part of the configuration of an embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of an embodiment. Moreover, it is possible to add, delete, and replace other configurations for a part of the configuration of each embodiment.

また、前述の各構成、機能、処理部、処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によってハードウェアを用いて実現してもよい。また、前述の各構成、機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによってソフトウェアを用いて実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリ、ＨＤＤ及びＳＳＤ等の記録装置、又は、ＩＣカード、ＳＤカード及びＤＶＤ等の記録媒体に格納することができる。また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際にはほとんど全ての構成が相互に接続されていると考えてもよい。 Further, each of the above-described configurations, functions, processing units, processing means, and the like may be realized using hardware by designing a part or all of them with, for example, an integrated circuit. In addition, each of the above-described configurations, functions, and the like may be realized using software by the processor interpreting and executing a program that realizes each function. Information such as programs, tables, and files for realizing each function can be stored in a recording device such as a memory, HDD, or SSD, or a recording medium such as an IC card, SD card, or DVD. Further, the control lines and information lines indicate what is considered necessary for the explanation, and not all the control lines and information lines on the product are necessarily shown. Actually, it may be considered that almost all the components are connected to each other.

Claims

A computer system comprising a plurality of computers,
Each of the plurality of computers has a processor, a memory connected to the processor, a storage medium connected to the processor, and a network interface for connecting to another device via a network,
The plurality of computers includes one main computer that provides a service, and one or more sub computers that take over the service when a failure occurs in the main computer,
The main computer is
A first redundancy processing unit that generates a logical storage area using a storage area provided by the storage medium of each of the plurality of computers, and manages access to the generated logical storage area;
A first service providing unit that provides the service using the logical storage area;
The subsystem computer is
A failure control unit that monitors the main computer and executes a process for taking over the service when a failure of the main computer is detected;
A second service providing unit that provides the service using the logical storage area;
The first redundancy processing unit includes:
Generating a first logical storage area using a storage area provided by the storage medium of the main computer and the storage medium of the sub computer;
Generating first redundancy information including a correspondence relationship between the first logical storage area and the storage medium of the primary computer and the storage medium of the secondary computer constituting the first logical storage area;
Writing the first redundancy information to the storage medium of the primary computer and the storage medium of the secondary computer constituting the first logical storage area;
Providing the first logical storage area to the first service providing unit and instructing the first service providing unit to start the service;
When an access request is received from the first service providing unit, referring to the first redundancy information, accessing the first logical storage area,
The failure control unit activates the second service providing unit when detecting a failure of the primary computer,
The computer system according to claim 2, wherein the second service providing unit continues the service using data stored in the first logical storage area.

The computer system according to claim 1,
The secondary computer has a second redundancy processing unit that generates the logical storage area and manages access to the generated logical storage area,
When the failure control unit detects a failure of the main computer, the failure control unit calls the second redundancy processing unit,
The second redundancy processing unit includes:
A new second logical storage area is generated using only the storage medium of the secondary computer, excluding the storage medium of the primary computer,
Generating second redundancy information including a correspondence relationship between the second logical storage area and the storage medium of the secondary computer constituting the second logical storage area;
Writing the second redundancy information to a storage medium of the secondary computer constituting the second logical storage area;
The failure control unit provides the second logical storage area to the second service providing unit, and instructs the second service providing unit to start the service,
When the access request is received from the second service providing unit, the computer system is configured to access the second logical storage area with reference to the second redundancy information.

The computer system according to claim 2,
The failure control unit prohibits access from the main computer before calling the second redundancy processing unit.

The computer system according to claim 2,
The first redundancy processing unit includes:
When a write request for the first logical recording area is received, the same data is written to the storage medium of the primary computer and the storage medium of the secondary computer that constitute the first logical storage area,
When a read request for the first logical storage area is received, data is read from the storage medium of the main computer constituting the first logical storage area;
The second redundancy processing unit includes:
When a write request for the second logical storage area is received, the same data is written to the storage medium of the subsystem computer that constitutes the second logical storage area,
When a read request to the second logical storage area is received, data is stored from the storage medium of the subordinate computer that constitutes the second logical storage area and that executes the second service providing unit. A computer system characterized by reading.

The computer system according to claim 2,
The main computer has a recovery unit for recovering from a failure of the main computer,
The recovery unit is
Stopping the second service providing unit;
Obtaining the second redundancy information from a storage medium of the secondary computer constituting the second logical storage area;
Providing the second logical storage area to the first service providing unit and instructing the first service providing unit to start the service;
Calling the first redundancy processing unit;
The first redundancy processing unit includes:
A third logical storage area is generated using a storage area provided by the storage medium of the primary computer and the storage medium of the secondary computer,
Generating third redundancy information including a correspondence relationship between the third logical storage area and the storage medium of the primary computer and the storage medium of the secondary computer constituting the third logical storage area;
The third redundancy information is written to the storage medium of the primary computer and the storage medium of the secondary computer constituting the third logical storage area,
The recovery unit copies the data stored in the storage medium of the secondary computer constituting the third logical storage area to the storage medium of the primary computer constituting the third logical storage area. A computer system characterized by

The computer system according to claim 1,
The main computer has a selection unit for selecting the sub computer,
The selection unit includes:
Selecting one or more secondary computers from the plurality of computers included in the computer system;
A computer system characterized by establishing a connection with the selected sub computer.

A logical storage area management method in a computer system comprising a plurality of computers,
Each of the plurality of computers has a processor, a memory connected to the processor, a storage medium connected to the processor, and a network interface for connecting to another device via a network,
The plurality of computers includes one main computer that provides a service, and one or more sub computers that take over the service when a failure occurs in the main computer,
The method
A first step in which the main computer generates a first logical storage area using a storage area provided by the storage medium of the main computer and the storage medium of the sub computer;
The primary computer includes a first redundancy including a correspondence relationship between the first logical storage area and the storage medium of the primary computer and the storage medium of the secondary computer constituting the first logical storage area. A second step of generating the conversion information;
A third step in which the main computer writes the first redundancy information to a storage medium of the main computer and a storage medium of the sub computer constituting the first logical storage area;
A fourth step in which the main computer starts the service using the first logical storage area;
A fifth step of accessing the first logical storage area with reference to the first redundancy information when the main computer receives an access request to the first logical storage area; ,
And a sixth step of continuing the service using the data stored in the first logical storage area when the secondary computer detects a failure of the primary computer. Logical storage area management method.

The logical storage area management method according to claim 7,
The sixth step includes
A seventh step in which the secondary computer generates a new second logical storage area using only the storage medium of the secondary computer, excluding the storage medium of the primary computer;
The sub system computer generates second redundancy information including a correspondence relationship between the second logical storage area and the storage medium of the sub system composing the second logical storage area. Steps,
A ninth step in which the secondary computer writes the second redundancy information to a storage medium of the secondary computer constituting the second logical storage area;
A tenth step in which the secondary computer starts the service using the second logical storage area;
The method further comprises:
An eleventh step of accessing the second logical storage area with reference to the second redundancy information when the secondary computer receives an access request to the second logical storage area; A logical storage area management method comprising:

The logical storage area management method according to claim 8, comprising:
The seventh step includes a step of prohibiting access from the primary computer before the secondary computer generates the second logical storage region.

The logical storage area management method according to claim 8, comprising:
In the fifth step, when the main computer receives a write request for the first logical recording area, the storage medium of the main computer and the secondary computer constituting the first logical storage area When the same data is written to the storage medium of the system computer, and when a read request for the first logical storage area is received, the data is read from the storage medium of the main computer constituting the first logical storage area,
In the eleventh step, when the secondary computer receives a write request for the second logical storage area, it is the same as the storage medium of the secondary computer that constitutes the second logical storage area. When data is written and a read request for the second logical storage area is received, the second logical storage area is configured and the data is read from the storage medium of the secondary computer that provides the service A logical storage area management method.

The logical storage area management method according to claim 8, comprising:
The method further comprises:
The primary computer stopping the service executed in the secondary computer;
The primary computer obtaining the second redundancy information from a storage medium of the secondary computer constituting the second logical storage area;
The main computer starts the service using the second logical storage area; and
The main computer generates a third logical storage area using a storage area provided by the storage medium of the main computer and the storage medium of the sub computer; and
The main computer includes a third redundancy including a correspondence relationship between the third logical storage area and the storage medium of the main computer and the storage medium of the sub computer constituting the third logical storage area. Generating archiving information; and
The main computer writes the third redundancy information to a storage medium of the main computer and a storage medium of the sub computer constituting the third logical storage area;
The primary computer copies the data stored in the storage medium of the secondary computer constituting the third logical storage area to the storage medium of the primary computer constituting the third logical storage area And a logical storage area management method comprising the steps of:

The logical storage area management method according to claim 7,
The first step includes
The primary computer selecting one or more secondary computers from the plurality of computers included in the computer system;
And a step of establishing a connection between the main computer and the selected sub computer.