JP7164175B2

JP7164175B2 - DISTRIBUTED FILE DEVICE, FAILOVER METHOD, PROGRAM AND RECORDING MEDIUM

Info

Publication number: JP7164175B2
Application number: JP2018230768A
Authority: JP
Inventors: 平竹本
Original assignee: NEC Solutions Innovators Ltd
Current assignee: NEC Solutions Innovators Ltd
Priority date: 2018-12-10
Filing date: 2018-12-10
Publication date: 2022-11-01
Anticipated expiration: 2038-12-10
Also published as: JP2020095322A

Description

The present invention relates to a distributed file device, a failover method, a program, and a recording medium.

There are two methods of managing data in a distributed file device made up of an active device and a standby device (backup device): synchronous copy and asynchronous copy. Of the two, synchronous copy is preferable for backup purposes. However, when the two devices are placed far apart, for example in geographically separate cities so that a single disaster does not affect both the active device and the standby device, synchronous copy takes a long time to process. Asynchronous copy is therefore preferable for shortening the processing time (see, for example, Patent Document 1).

Patent Document 1: JP 2018-136596 A

However, asynchronous copy has the problem that, when execution of an application is handed over from the active device to the standby device because of a failure, it is unclear which point in time the data copied from the active device to the standby device represents.

Accordingly, an object of the present invention is to provide an asynchronous-copy distributed file device in which the point in time represented by the data copied from the active device to the standby device is made clear, and a failover method using that device.

To achieve the above object, the distributed file device of the present invention includes:
a first device that executes an application; and
a second device that takes over execution of the application when a failure occurs in the first device, wherein
the first device includes a first control unit, a first storage unit, and a reflection execution unit,
the second device includes a second control unit, an unreflected data storage unit, and a second storage unit,
the first control unit and the second control unit are connectable via a communication network,
the first control unit registers data relating to execution of the application in the first storage unit with a flag set indicating that the data has not been reflected in the second device, and transmits the data to the second control unit with a transmission time attached,
the second control unit registers the data with the transmission time attached in the unreflected data storage unit,
the reflection execution unit, when a preset trigger condition is satisfied, instructs the first control unit to instruct the second control unit to reflect, in the second device, the data not yet reflected in the second device,
upon receiving that instruction, the first control unit instructs the second control unit to reflect the unreflected data,
upon receiving that instruction, the second control unit changes the transmission time attached to the unreflected data registered in the unreflected data storage unit to a checkpoint time, then registers the data in the second storage unit, and transmits a signal indicating completion of the reflection of the data to the first control unit, and
upon receiving the completion signal, the first control unit clears the flag on the data that was registered in the first storage unit with the flag indicating that it had not been reflected in the second device.

The failover method of the present invention uses a distributed file device that includes:
a first device that executes an application; and
a second device that takes over execution of the application when a failure occurs in the first device, wherein
the first device includes a first control unit, a first storage unit, and a reflection execution unit,
the second device includes a second control unit, an unreflected data storage unit, and a second storage unit, and
the first control unit and the second control unit are connectable via a communication network,
the method including:
an unreflected data processing step in which the first control unit registers data relating to execution of the application in the first storage unit with a flag set indicating that the data has not been reflected in the second device, and transmits the data to the second control unit with a transmission time attached;
an unreflected data registration step in which the second control unit registers the data with the transmission time attached in the unreflected data storage unit;
a first reflection instruction step in which the reflection execution unit, when a preset trigger condition is satisfied, instructs the first control unit to instruct the second control unit to reflect, in the second device, the data not yet reflected in the second device;
a second reflection instruction step in which, upon receiving that instruction, the first control unit instructs the second control unit to reflect the unreflected data;
a reflection step in which, upon receiving that instruction, the second control unit changes the transmission time attached to the unreflected data registered in the unreflected data storage unit to a checkpoint time, then registers the data in the second storage unit, and transmits a signal indicating completion of the reflection of the data to the first control unit; and
a registration step in which, upon receiving the completion signal, the first control unit clears the flag on the data that was registered in the first storage unit with the flag indicating that it had not been reflected in the second device.

According to the present invention, it is possible to provide an asynchronous-copy distributed file device in which the point in time represented by the data copied from the active device to the standby device is made clear, and a failover method using that device.

FIG. 1 is a block diagram showing an example configuration of the distributed file device of Embodiment 1.
FIG. 2 is a block diagram showing an example of the data flow in the distributed file device of Embodiment 1 in the steady state, in which no instruction to reflect data from the first device to the second device has been issued.
FIG. 3 is a block diagram showing an example of the data flow in the distributed file device of Embodiment 1 in the state in which an instruction to reflect data from the first device to the second device has been issued.
FIG. 4 is a block diagram showing an example configuration of the distributed file device of Embodiment 2.
FIG. 5 is a block diagram showing an example configuration of the distributed file device of Embodiment 3.
FIG. 6 is a block diagram showing an example configuration of the distributed file device of Embodiment 4.
FIG. 7 is a block diagram showing an example configuration of the distributed file device of Embodiment 5.
FIG. 8 is a block diagram showing an example of the hardware configuration of the first device in the distributed file device of Embodiment 1.
FIG. 9 is a block diagram showing an example of the hardware configuration of the second device in the distributed file device of Embodiment 1.
FIG. 10 is a flowchart showing an example of processing in the distributed file devices of Embodiments 1 to 3.
FIG. 11 is a flowchart showing an example of processing in the distributed file device of Embodiment 4.
FIG. 12 is a flowchart showing an example of processing in the distributed file device of Embodiment 5.
FIG. 13 is a flowchart showing another example of processing in the distributed file device of Embodiment 5.

In the present invention, for example, the first device may be an active device and the second device may be a standby device (backup device).

In the present invention, "failover" means, for example, that the second device (standby device) takes over execution of an application from the first device (active device) in which a failure has occurred.

In one aspect of the distributed file device and the failover method of the present invention, the preset trigger condition is the elapse of a predetermined period or the arrival of a predetermined time.

In one aspect of the distributed file device and the failover method of the present invention, the unreflected data storage unit has a data amount threshold, and the preset trigger condition is that the amount of data registered in the unreflected data storage unit has exceeded the threshold.

In one aspect of the distributed file device and the failover method of the present invention, the preset trigger condition is that the application has issued a checkpoint acquisition request to the first control unit.

In one aspect of the distributed file device of the present invention, when part or all of the unreflected data with the transmission time attached that is registered in the unreflected data storage unit is lost, the first control unit transmits to the second control unit the difference between the data registered in the first storage unit with the flag indicating that it has not been reflected in the second device and the data registered in the unreflected data storage unit. According to this aspect, the state before the unreflected data was lost from the unreflected data storage unit can be restored in a short time.
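The difference transmission can be illustrated with a minimal sketch. The dictionary layouts, the function name `missing_unreflected`, and the comparison by key and value are assumptions made for illustration only; the description does not state how the difference is computed.

```python
def missing_unreflected(first_store: dict, standby_unreflected: dict) -> dict:
    """Return the flagged entries that the standby no longer holds.

    first_store:         key -> (value, unreflected_flag)  (assumed layout of the first storage unit)
    standby_unreflected: key -> (value, send_time)         (assumed layout of the unreflected data storage unit)
    """
    diff = {}
    for key, (value, flagged) in first_store.items():
        if not flagged:
            # Already reflected at a checkpoint; nothing to resend.
            continue
        held = standby_unreflected.get(key)
        if held is None or held[0] != value:
            # Lost (or stale) on the standby side, so it belongs to the difference.
            diff[key] = value
    return diff
```

Only the entries in the returned dictionary would need to be retransmitted, which is why recovery can be quicker than resending all flagged data.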

The failover method of the present invention may include a difference transmission step in which, when part or all of the unreflected data with the transmission time attached that is registered in the unreflected data storage unit is lost, the first control unit transmits to the second control unit the difference between the data registered in the first storage unit with the flag indicating that it has not been reflected in the second device and the data registered in the unreflected data storage unit.

In one aspect of the distributed file device of the present invention, when a failure occurs in the first device and the second device takes over execution of the application, the second control unit selects either Condition 1 or Condition 2 below.
(Condition 1)
Only the reflected data with the checkpoint time attached, registered in the second storage unit, is used for execution of the application.
(Condition 2)
Both the reflected data with the checkpoint time attached, registered in the second storage unit, and the unreflected data with the transmission time attached, registered in the unreflected data storage unit, are used for execution of the application.
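The two conditions can be sketched as follows. The function name `data_for_takeover`, the dictionary layouts, and the rule that an unreflected entry overrides a reflected entry with the same key are illustrative assumptions, not details given in the description.

```python
def data_for_takeover(reflected: dict, unreflected: dict, use_unreflected: bool) -> dict:
    """Build the data set the second device uses when it takes over.

    reflected:   key -> (value, checkpoint_time)  (assumed layout of the second storage unit)
    unreflected: key -> (value, send_time)        (assumed layout of the unreflected data storage unit)
    """
    # Condition 1: only the checkpointed (reflected) data.
    state = {key: value for key, (value, _checkpoint_time) in reflected.items()}
    if use_unreflected:
        # Condition 2: additionally use the data received after the last checkpoint.
        for key, (value, _send_time) in unreflected.items():
            state[key] = value
    return state
```

Under Condition 1 the resulting state corresponds exactly to the last checkpoint time; under Condition 2 it is newer but may include data sent after that checkpoint.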

The failover method of the present invention may include a selection step in which, when a failure occurs in the first device and the second device takes over execution of the application, the second control unit selects either Condition 1 or Condition 2 below.
(Condition 1)
Only the reflected data with the checkpoint time attached, registered in the second storage unit, is used for execution of the application.
(Condition 2)
Both the reflected data with the checkpoint time attached, registered in the second storage unit, and the unreflected data with the transmission time attached, registered in the unreflected data storage unit, are used for execution of the application.

In one aspect of the distributed file device of the present invention, the second control unit predicts the time required for the second device to start the application when Condition 1 is selected and the time required for the second device to start the application when Condition 2 is selected.
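The description does not say how the two start-up times are predicted. Purely as a hypothetical illustration, the sketch below assumes a linear cost model in which start-up time is proportional to the amount of data that must be loaded; the function name, parameters, and throughput figures are all invented for the example.

```python
def predict_startup_seconds(reflected_bytes: int,
                            unreflected_bytes: int,
                            load_bytes_per_second: float,
                            apply_bytes_per_second: float) -> tuple:
    """Return (Condition 1 estimate, Condition 2 estimate) in seconds.

    Assumed model: Condition 1 only loads the reflected data; Condition 2
    additionally applies the unreflected data at a possibly different rate.
    """
    condition1 = reflected_bytes / load_bytes_per_second
    condition2 = condition1 + unreflected_bytes / apply_bytes_per_second
    return condition1, condition2
```

With such estimates, the second control unit could, for example, fall back to Condition 1 when the Condition 2 estimate exceeds an acceptable recovery time.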

The failover method of the present invention may include a prediction step in which the second control unit predicts the time required for the second device to start the application when Condition 1 is selected and the time required for the second device to start the application when Condition 2 is selected.

The program of the present invention is a program that causes a computer to execute the failover method of the present invention.

The recording medium of the present invention is a computer-readable recording medium on which the program of the present invention is recorded.

Next, embodiments of the present invention will be described with reference to FIGS. 1 to 13. The present invention is in no way limited or restricted by the following embodiments. In FIGS. 1 to 13, identical parts are denoted by identical reference numerals. The description of each embodiment may be applied to the others.

[Embodiment 1]
FIG. 1 is a block diagram showing an example configuration of the distributed file device of this embodiment. As shown in FIG. 1, the distributed file device 10 includes a first device 11 that executes an application 20 and a second device 12 that takes over execution of the application 20 when a failure occurs in the first device 11. In the distributed file device 10, for example, the first device 11 may be an active device and the second device 12 may be a standby device (backup device). The first device 11 includes a first control unit 11a, a first storage unit 11b, and a reflection execution unit 11c. The second device 12 includes a second control unit 12a, an unreflected data storage unit 12b, and a second storage unit 12c. The first control unit 11a and the second control unit 12a are connectable via a communication network 30. The communication network 30 is not particularly limited; any known network can be used, and it may be wired or wireless. Examples of the communication network 30 include the Internet, WWW (World Wide Web), a telephone line, a LAN (Local Area Network), and WiFi (Wireless Fidelity). The first device 11 and the second device 12 may each be, for example, a personal computer (PC).

FIG. 8 shows an example block diagram of the hardware configuration of the first device 11. The first device 11 has, for example, a CPU (central processing unit) 111, a memory 112, a storage device 114, an input device 115, a display 116, and a communication device 117. The parts of the first device 11 are connected to one another via a bus 113.

FIG. 9 shows an example block diagram of the hardware configuration of the second device 12. The second device 12 has, for example, a CPU (central processing unit) 121, a memory 122, a storage device 124, an input device 125, a display 126, and a communication device 127. The parts of the second device 12 are connected to one another via a bus 123.

The CPU 111 is responsible for overall control of the first device 11. In the first device 11, the CPU 111 executes, for example, the program of the present invention and other programs, and reads and writes various kinds of information. Specifically, for example, the CPU 111 functions as the first control unit 11a and the reflection execution unit 11c.

The CPU 121 is responsible for overall control of the second device 12. In the second device 12, the CPU 121 executes, for example, the program of the present invention and other programs, and reads and writes various kinds of information. Specifically, for example, the CPU 121 functions as the second control unit 12a.

The bus 113 can also be connected to, for example, external devices. Examples of the external devices include an external storage device (such as an external database) and a printer. The first device 11 can be connected to a communication network (not shown) through, for example, the communication device 117 connected to the bus 113, and can also be connected to the external devices via the communication network. Similarly, the bus 123 can also be connected to, for example, external devices. Examples of the external devices include an external storage device (such as an external database) and a printer. The second device 12 can be connected to a communication network (not shown) through, for example, the communication device 127 connected to the bus 123, and can also be connected to the external devices via the communication network.

The memory 112 includes, for example, a main memory, which is also called a main storage device. When the CPU 111 performs processing, the memory 112 loads various operating programs, such as the program of the present invention stored in the storage device 114 described later, and the CPU 111 receives the data from the memory 112 and executes the programs. The main memory is, for example, a RAM (random access memory). The memory 112 further includes, for example, a ROM (read-only memory). Similarly, the memory 122 includes, for example, a main memory, which is also called a main storage device. When the CPU 121 performs processing, the memory 122 loads various operating programs, such as the program of the present invention stored in the storage device 124 described later, and the CPU 121 receives the data from the memory 122 and executes the programs. The main memory is, for example, a RAM (random access memory). The memory 122 further includes, for example, a ROM (read-only memory).

The storage device 114 is also called an auxiliary storage device, in contrast to the main memory (main storage device). As described above, the storage device 114 stores operating programs including the program of the present invention. The storage device 114 includes the first storage unit 11b. The storage device 114 includes, for example, a storage medium and a drive that reads from and writes to the storage medium. The storage medium is not particularly limited and may be, for example, built-in or external; examples include an HD (hard disk), an FD (floppy (registered trademark) disk), a CD-ROM, a CD-R, a CD-RW, an MO, a DVD, a flash memory, and a memory card. The drive is not particularly limited. The storage device 114 may be, for example, a hard disk drive (HDD) in which the storage medium and the drive are integrated.

The storage device 124 is also called an auxiliary storage device, in contrast to the main memory (main storage device). As described above, the storage device 124 stores operating programs including the program of the present invention. The storage device 124 includes the unreflected data storage unit 12b and the second storage unit 12c. The storage device 124 includes, for example, a storage medium and a drive that reads from and writes to the storage medium. The storage medium and the drive are the same as those described for the storage device 114. The storage device 124 may be, for example, a hard disk drive (HDD) in which the storage medium and the drive are integrated.

The first device 11 further has, for example, the input device 115 and the display 116. The input device 115 is, for example, a touch panel, a keyboard, or a mouse. Examples of the display 116 include an LED display and a liquid crystal display. Similarly, the second device 12 further has, for example, the input device 125 and the display 126. The input device 125 is, for example, a touch panel, a keyboard, or a mouse. Examples of the display 126 include an LED display and a liquid crystal display.

In the first device 11, the memory 112 and the storage device 114 can also store access information and log information from users, as well as information obtained from an external database (not shown). Similarly, in the second device 12, the memory 122 and the storage device 124 can also store access information and log information from users, as well as information obtained from an external database (not shown).

Next, an example of processing in the distributed file device 10 will be described.

First, an example of processing in the distributed file device 10 in the steady state, in which no instruction to reflect data from the first device 11 to the second device 12 has been issued, will be described with reference to the block diagram of FIG. 2 and the flowchart of FIG. 10.

First, the first control unit 11a registers data relating to execution of the application 20 in the first storage unit 11b with a flag set indicating that the data has not been reflected in the second device 12 (that is, as unreflected data), and transmits the data to the second control unit 12a with a transmission time attached (S1).

Next, the second control unit 12a registers the data with the transmission time attached in the unreflected data storage unit 12b (S2).

Next, an example of processing in the distributed file device 10 in the state in which an instruction to reflect data from the first device 11 to the second device 12 has been issued will be described with reference to the block diagram of FIG. 3 and the flowchart of FIG. 10.

First, when a preset trigger condition is satisfied, the reflection execution unit 11c instructs the first control unit 11a to instruct the second control unit 12a to reflect, in the second device 12, the data not yet reflected in the second device 12 (S3). Because this embodiment uses asynchronous copy, in which reflection of data in the second device 12 is instructed only when the preset trigger condition is satisfied, the processing time can be made shorter than with synchronous copy even when, for example, the first device 11 and the second device 12 are remote from each other. In this embodiment, the preset trigger condition is the elapse of a predetermined period (for example, 12 hours or 24 hours), so the instruction is issued periodically, for example every 12 hours or every 24 hours. Alternatively, in this embodiment, the preset trigger condition may be the arrival of a predetermined time (for example, 1:00 or 20:00), so that the instruction is issued regularly at, for example, 1:00 or 20:00.
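The two time-based trigger conditions can be sketched as simple predicates. The function names, the use of naive `datetime` values, and the idea of remembering the time of the last reflection are assumptions made for this illustration.

```python
from datetime import datetime, time, timedelta


def interval_elapsed(last_reflection: datetime, now: datetime, interval: timedelta) -> bool:
    """Trigger when the predetermined period (e.g. 12 hours) has passed."""
    return now - last_reflection >= interval


def scheduled_time_reached(last_reflection: datetime, now: datetime, schedule) -> bool:
    """Trigger when any scheduled clock time (e.g. 1:00, 20:00) falls in (last_reflection, now]."""
    day = last_reflection.date()
    while day <= now.date():
        for clock_time in schedule:
            candidate = datetime.combine(day, clock_time)
            if last_reflection < candidate <= now:
                return True
        day += timedelta(days=1)
    return False
```

For example, with a schedule of `[time(1, 0), time(20, 0)]`, a check made at 1:30 after a last reflection at 21:00 on the previous day reports that the 1:00 checkpoint is due.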

Next, upon receiving that instruction, the first control unit 11a instructs the second control unit 12a to reflect the unreflected data (S4).

Next, upon receiving that instruction, the second control unit 12a changes the transmission time attached to the unreflected data registered in the unreflected data storage unit 12b to a checkpoint time, then registers the data in the second storage unit 12c, and transmits a signal indicating completion of the reflection of the data to the first control unit 11a (S5).

Next, upon receiving the completion signal, the first control unit 11a clears the flag on the data that was registered in the first storage unit 11b with the flag indicating that it had not been reflected in the second device 12 (S6).

According to this embodiment, a transmission time is attached to data not yet reflected in the second device 12, and that transmission time is converted to a checkpoint time when the data is reflected in the second device 12 and registered in the second storage unit 12c. This makes clear which point in time the data copied from the first device 11 to the second device 12 represents.
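Steps S1 to S6 can be summarized in a minimal in-process sketch. The class names, the dictionary layouts, and the direct method calls that stand in for messages over the communication network 30 are assumptions made for illustration; a real system would exchange these as network messages.

```python
from dataclasses import dataclass, field


@dataclass
class SecondDevice:
    """Standby side: second control unit 12a with its two stores."""
    unreflected: dict = field(default_factory=dict)  # 12b: key -> (value, send_time)
    reflected: dict = field(default_factory=dict)    # 12c: key -> (value, checkpoint_time)

    def receive(self, key, value, send_time):
        # S2: register the data, still tagged with its transmission time.
        self.unreflected[key] = (value, send_time)

    def reflect(self, checkpoint_time):
        # S5: replace the transmission time with the checkpoint time,
        # move the data into the second storage unit, then report completion.
        done = list(self.unreflected)
        for key in done:
            value, _send_time = self.unreflected.pop(key)
            self.reflected[key] = (value, checkpoint_time)
        return done  # stands in for the completion signal


@dataclass
class FirstDevice:
    """Active side: first control unit 11a with the first storage unit 11b."""
    standby: SecondDevice
    store: dict = field(default_factory=dict)  # 11b: key -> [value, unreflected_flag]

    def write(self, key, value, send_time):
        # S1: register with the "unreflected" flag set, then send with a timestamp.
        self.store[key] = [value, True]
        self.standby.receive(key, value, send_time)

    def checkpoint(self, checkpoint_time):
        # S3/S4: called when the trigger fires; instruct the standby to reflect.
        # S6: clear the flags once the completion signal comes back.
        for key in self.standby.reflect(checkpoint_time):
            self.store[key][1] = False
```

After `checkpoint(t)` returns, every entry in the second storage unit carries the same checkpoint time `t`, which is what lets the standby state be identified with a single point in time.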

[Embodiment 2]
FIG. 4 is a block diagram showing an example configuration of the distributed file device of this embodiment. As shown in FIG. 4, the distributed file device 10 of this embodiment is the same as the distributed file device 10 of Embodiment 1, except that the unreflected data storage unit 12b has a data amount threshold and the preset trigger condition is that the amount of data registered in the unreflected data storage unit 12b has exceeded the threshold.

Next, an example of processing in the distributed file device 10 of this embodiment will be described with reference to the block diagrams of FIGS. 2 to 4 and the flowchart of FIG. 10.

First, as in Embodiment 1, the unreflected data is processed and registered (S1 and S2).

Next, when the preset trigger condition is satisfied, the reflection execution unit 11c instructs the first control unit 11a to instruct the second control unit 12a to reflect, in the second device 12, the data not yet reflected in the second device 12 (S3). In this embodiment, the reflection execution unit 11c issues this instruction when the amount of data with the transmission time registered in the unreflected data storage unit 12b exceeds the threshold.

Next, as in Embodiment 1, the instruction from the first control unit 11a to the second control unit 12a (S4), the reflection (S5), and the registration (S6) are performed.

As in Embodiment 1, this embodiment also shortens the processing time compared with synchronous copy and clarifies the point in time to which the data copied from the first device 11 to the second device 12 corresponds.
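The threshold-based trigger of this embodiment can be sketched as below. This is an illustrative assumption rather than the disclosed implementation: the specification does not state how the reflection execution unit 11c learns the amount of data held in the unreflected data storage unit 12b, so the hypothetical `ThresholdTrigger` class simply counts the bytes sent since the last reflection.

```python
class ThresholdTrigger:
    """Hypothetical helper for 11c: fires when pending data exceeds a threshold."""

    def __init__(self, threshold_bytes):
        self.threshold_bytes = threshold_bytes
        self.pending_bytes = 0  # assumed mirror of the amount held in 12b

    def on_registered(self, payload: bytes) -> bool:
        # Called after S2; returns True when S3 should be issued.
        self.pending_bytes += len(payload)
        return self.pending_bytes > self.threshold_bytes

    def on_reflected(self):
        # Called after S6; 12b is empty again, so the count restarts.
        self.pending_bytes = 0
```

The comparison is strict because the trigger condition is stated as the amount exceeding the threshold, not reaching it.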

[Embodiment 3]
FIG. 5 is a block diagram showing the configuration of an example of the distributed file device of this embodiment. As shown in FIG. 5, the distributed file device 10 of this embodiment is the same as the distributed file device 10 of Embodiment 1, except that the preset trigger condition is that the application 20 has issued a checkpoint acquisition request to the first control unit 11a.

Next, an example of processing in the distributed file device 10 of this embodiment will be described with reference to the block diagrams of FIGS. 2, 3, and 5 and the flowchart of FIG. 10.

Next, when the preset trigger condition is satisfied, the reflection execution unit 11c instructs the first control unit 11a to instruct the second control unit 12a to reflect, in the second device 12, the data not yet reflected in the second device 12 (S3). In this embodiment, the reflection execution unit 11c issues this instruction when the application 20 has issued a checkpoint acquisition request to the first control unit 11a.
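An application-driven trigger of this kind might look like the following sketch. The `ReflectionExecutor` class, its `on_checkpoint_request` method, and the callback passed to it are hypothetical names introduced for illustration; how the request travels from the application 20 to the reflection execution unit 11c is not specified and is assumed here to be a direct call.

```python
class ReflectionExecutor:
    """Hypothetical stand-in for the reflection execution unit 11c."""

    def __init__(self, instruct_reflection):
        # instruct_reflection: callable that asks 11a to start S4 (assumption).
        self._instruct_reflection = instruct_reflection
        self.checkpoints_taken = 0

    def on_checkpoint_request(self):
        # Trigger condition of this embodiment: the application asked for a checkpoint.
        self._instruct_reflection()
        self.checkpoints_taken += 1
        return self.checkpoints_taken
```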

[Embodiment 4]
FIG. 6 is a block diagram showing the configuration of an example of the distributed file device of this embodiment. As shown in FIG. 6, the distributed file device 10 of this embodiment is the same as the distributed file device 10 of Embodiment 1, except that, when part or all of the unreflected data with the transmission time registered in the unreflected data storage unit 12b is lost, the first control unit 11a transmits to the second control unit 12a the difference between the data registered in the first storage unit 11b with the flag raised indicating that it has not been reflected in the second device 12 and the data registered in the unreflected data storage unit 12b.

Next, an example of processing in the distributed file device 10 of this embodiment will be described with reference to the block diagrams of FIGS. 2, 3, and 6 and the flowchart of FIG. 11.

Next, when part or all of the unreflected data with the transmission time registered in the unreflected data storage unit 12b is lost, the first control unit 11a transmits to the second control unit 12a the difference between the data registered in the first storage unit 11b with the flag raised indicating that it has not been reflected in the second device 12 and the data registered in the unreflected data storage unit 12b (S7). FIG. 11 shows an example in which this difference transmission step (S7) is performed after the registration of the unreflected data (S2), but the difference may be transmitted at any time after part or all of the unreflected data with the transmission time registered in the unreflected data storage unit 12b has been lost.

Next, as in Embodiment 1, the instruction from the reflection execution unit 11c to the first control unit 11a (S3), the instruction from the first control unit 11a to the second control unit 12a (S4), the reflection (S5), and the registration (S6) are performed. In this embodiment, the instruction from the reflection execution unit 11c to the first control unit 11a (S3) may instead be performed as in Embodiment 2 or Embodiment 3.

According to this embodiment, the unreflected data storage unit 12b can be restored in a short time to the state it was in before the unreflected data was lost.
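The difference transmission step (S7) can be illustrated with the following sketch. The function names and the dictionary layouts are hypothetical. In particular, it is assumed that the first storage unit 11b keeps the transmission time next to each flagged entry, and that the first device can see the current contents of 12b, neither of which is stated in the specification.

```python
def compute_difference(first_store, unreflected_store):
    """Return flagged entries of 11b that are missing or stale in 12b.

    first_store:       key -> (value, transmission_time, unreflected_flag)
    unreflected_store: key -> (value, transmission_time)
    """
    diff = {}
    for key, (value, sent_at, flagged) in first_store.items():
        if flagged and unreflected_store.get(key) != (value, sent_at):
            diff[key] = (value, sent_at)
    return diff


def resend_difference(first_store, unreflected_store):
    # S7: send only the difference; 12a re-registers it in 12b.
    diff = compute_difference(first_store, unreflected_store)
    unreflected_store.update(diff)
    return diff
```

Only entries whose flag is still raised are candidates, so data already reflected in the second storage unit 12c is never resent, which is why the recovery is short.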

[Embodiment 5]
FIG. 7 is a block diagram showing the configuration of an example of the distributed file device of this embodiment. As shown in FIG. 7, the distributed file device 10 of this embodiment is the same as the distributed file device 10 of Embodiment 1, except that, when a failure occurs in the first device 11 and the second device 12 takes over execution of the application 20, the second control unit 12a selects either Condition 1 or Condition 2 below.
(Condition 1)
Only the reflected data with the checkpoint time registered in the second storage unit 12c is used for execution of the application 20.
(Condition 2)
Both the reflected data with the checkpoint time registered in the second storage unit 12c and the unreflected data with the transmission time registered in the unreflected data storage unit 12b are used for execution of the application 20.

Next, an example of processing in the distributed file device 10 of this embodiment will be described with reference to the block diagrams of FIGS. 2, 3, and 7 and the flowchart of FIG. 12.

First, as in Embodiment 1, the processing and registration of the unreflected data (S1 and S2), the instruction from the reflection execution unit 11c to the first control unit 11a (S3), the instruction from the first control unit 11a to the second control unit 12a (S4), the reflection (S5), and the registration (S6) are performed. In this embodiment, the instruction from the reflection execution unit 11c to the first control unit 11a (S3) may instead be performed as in Embodiment 2 or Embodiment 3. In addition, when part or all of the unreflected data with the transmission time registered in the unreflected data storage unit 12b is lost, the difference transmission step (S7) of Embodiment 4 may be performed at any time thereafter.

Next, when a failure occurs in the first device 11 and the second device 12 takes over execution of the application 20, the second control unit 12a selects either Condition 1 or Condition 2 (S8).

In this embodiment, as illustrated in FIG. 13, the second control unit 12a may, before making the selection, predict the time required for the second device 12 to start the application 20 if Condition 1 is selected and the time required if Condition 2 is selected (S9). The selection can then be made with the processing times of both Condition 1 and Condition 2 taken into account.
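One way the prediction (S9) and the selection (S8) could fit together is sketched below. Everything here is an assumption made for illustration: the linear cost model, its default coefficients, and the policy of preferring Condition 2 whenever its predicted start-up time fits within a time budget are not taken from the specification, which only states that the predicted times may be considered.

```python
def predict_startup_seconds(reflected_count, unreflected_count, use_unreflected,
                            base_seconds=5.0, seconds_per_item=0.01,
                            replay_seconds_per_item=0.05):
    """Hypothetical linear cost model for S9."""
    total = base_seconds + reflected_count * seconds_per_item
    if use_unreflected:
        # Condition 2 also has to process the data still held in 12b.
        total += unreflected_count * replay_seconds_per_item
    return total


def select_condition(reflected_count, unreflected_count, max_startup_seconds):
    # S8 (assumed policy): prefer Condition 2, which uses more recent data,
    # as long as its predicted start-up time fits the budget.
    t1 = predict_startup_seconds(reflected_count, unreflected_count, False)
    t2 = predict_startup_seconds(reflected_count, unreflected_count, True)
    chosen = 2 if t2 <= max_startup_seconds else 1
    return chosen, t1, t2
```

Under this policy Condition 1 acts as the fallback: it restarts from the last checkpoint only, giving up the newest data in exchange for a faster and more predictable start.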

Although the present invention has been described with reference to the embodiments, the present invention is not limited to them. Various changes that those skilled in the art can understand may be made to the configuration and details of the present invention within its scope.

According to the present invention, it is possible to provide, for example, an asynchronous-copy distributed file device in which the point in time of the data copied from the active device to the standby device is clarified, and a failover method using that device.

10 distributed file device
11 first device
11a first control unit
11b first storage unit
11c reflection execution unit
12 second device
12a second control unit
12b unreflected data storage unit
12c second storage unit
20 application
30 communication network
111, 121 CPU
112, 122 memory
113, 123 bus
114, 124 storage device
115, 125 input device
116, 126 display
117, 127 communication device

Claims

A distributed file device comprising:
a first device that executes an application; and
a second device that takes over execution of the application when a failure occurs in the first device, wherein
the first device includes a first control unit, a first storage unit, and a reflection execution unit,
the second device includes a second control unit, an unreflected data storage unit, and a second storage unit,
the first control unit and the second control unit are connectable via a communication network,
the first control unit registers data related to execution of the application in the first storage unit with a flag raised indicating that the data has not been reflected in the second device, and transmits the data, with a transmission time attached, to the second control unit,
the second control unit registers the data to which the transmission time is attached in the unreflected data storage unit,
the reflection execution unit, when a preset trigger condition is satisfied, instructs the first control unit to instruct the second control unit to reflect, in the second device, the data that has not been reflected in the second device,
the first control unit, on receiving the instruction, instructs the second control unit to reflect the unreflected data,
the second control unit, on receiving the instruction, changes the transmission time attached to the unreflected data registered in the unreflected data storage unit to a checkpoint time, then registers the data in the second storage unit, and transmits a completion signal for the reflection of the data to the first control unit, and
the first control unit, on receiving the completion signal, clears the flag of the data registered in the first storage unit with the flag raised indicating that the data has not been reflected in the second device.

The distributed file device according to claim 1, wherein the preset trigger condition is the elapse of a predetermined period or the arrival of a predetermined time.

The distributed file device according to claim 1, wherein
the unreflected data storage unit has a data amount threshold, and
the preset trigger condition is that the amount of data registered in the unreflected data storage unit exceeds the threshold.

The distributed file device according to claim 1, wherein the preset trigger condition is that the application has issued a checkpoint acquisition request to the first control unit.

The distributed file device according to any one of claims 1 to 4, wherein, when part or all of the unreflected data with the transmission time registered in the unreflected data storage unit is lost, the first control unit transmits to the second control unit the difference between the data registered in the first storage unit with the flag raised indicating that the data has not been reflected in the second device and the data registered in the unreflected data storage unit.

The distributed file device according to any one of claims 1 to 5, wherein, when a failure occurs in the first device and the second device takes over execution of the application, the second control unit selects either Condition 1 or Condition 2 below.
(Condition 1)
Only the reflected data with the checkpoint time registered in the second storage unit is used for execution of the application.
(Condition 2)
Both the reflected data with the checkpoint time registered in the second storage unit and the unreflected data with the transmission time registered in the unreflected data storage unit are used for execution of the application.

The distributed file device according to claim 6, wherein the second control unit predicts the time required for the second device to start the application when Condition 1 is selected and the time required for the second device to start the application when Condition 2 is selected.

A failover method using a distributed file device that includes
a first device that executes an application, and
a second device that takes over execution of the application when a failure occurs in the first device, wherein
the first device includes a first control unit, a first storage unit, and a reflection execution unit,
the second device includes a second control unit, an unreflected data storage unit, and a second storage unit, and
the first control unit and the second control unit are connectable via a communication network,
the failover method comprising:
an unreflected data processing step in which the first control unit registers data related to execution of the application in the first storage unit with a flag raised indicating that the data has not been reflected in the second device, and transmits the data, with a transmission time attached, to the second control unit;
an unreflected data registration step in which the second control unit registers the data to which the transmission time is attached in the unreflected data storage unit;
a first reflection instruction step in which the reflection execution unit, when a preset trigger condition is satisfied, instructs the first control unit to instruct the second control unit to reflect, in the second device, the data that has not been reflected in the second device;
a second reflection instruction step in which the first control unit, on receiving the instruction, instructs the second control unit to reflect the unreflected data;
a reflection step in which the second control unit, on receiving the instruction, changes the transmission time attached to the unreflected data registered in the unreflected data storage unit to a checkpoint time, then registers the data in the second storage unit, and transmits a completion signal for the reflection of the data to the first control unit; and
a registration step in which the first control unit, on receiving the completion signal, clears the flag of the data registered in the first storage unit with the flag raised indicating that the data has not been reflected in the second device.

The failover method according to claim 8, wherein the preset trigger condition is the elapse of a predetermined period or the arrival of a predetermined time.

The failover method according to claim 8, wherein
the unreflected data storage unit has a data amount threshold, and
the preset trigger condition is that the amount of data registered in the unreflected data storage unit exceeds the threshold.

The failover method according to claim 8, wherein the preset trigger condition is that the application has issued a checkpoint acquisition request to the first control unit.

The failover method according to any one of claims 8 to 11, wherein, when part or all of the unreflected data with the transmission time registered in the unreflected data storage unit is lost, the first control unit transmits to the second control unit the difference between the data registered in the first storage unit with the flag raised indicating that the data has not been reflected in the second device and the data registered in the unreflected data storage unit.

The failover method according to any one of claims 8 to 12, wherein, when a failure occurs in the first device and the second device takes over execution of the application, the second control unit selects either Condition 1 or Condition 2 below.
(Condition 1)
Only the reflected data with the checkpoint time registered in the second storage unit is used for execution of the application.
(Condition 2)
Both the reflected data with the checkpoint time registered in the second storage unit and the unreflected data with the transmission time registered in the unreflected data storage unit are used for execution of the application.

The failover method according to claim 13, further including a prediction step in which the second control unit predicts the time required for the second device to start the application when Condition 1 is selected and the time required for the second device to start the application when Condition 2 is selected.

A program that causes a computer to execute the failover method according to any one of claims 8 to 14.

A computer-readable recording medium recording the program according to claim 15.