JP2000010868A

JP2000010868A - Decentralized system and its backup method

Info

Publication number: JP2000010868A
Application number: JP10174705A
Authority: JP
Inventors: Katsufumi Fujimoto; 本克文藤
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1998-06-22
Filing date: 1998-06-22
Publication date: 2000-01-14

Abstract

PROBLEM TO BE SOLVED: To provide a decentralized system which places a small load on computers and a communication line and is reliable, and its backup method. SOLUTION: Master data 3 are held on an in-operation system computer 1A and backup data 4 corresponding to the contents of the master data 3 at a specific point of time and update history files 5 corresponding to an updating process on the in-operation system computer 1A are held on computers 1B and 1C. The in-operation system computer 1A holds an update history file 5 corresponding to an updating process for the master data 3 and also manages a position and order information file 6 for specifying the positions of update history files 5 divided and held between the backup computers 1B and 1C. Further, the backup computers 1B and 1C manage position and order information files 6 for specifying the positions of the update history files 5 divided and held between the backup computers 1B and 1C.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は通信路を介して複数
の計算機が接続された分散システムに係り、とりわけ分
散システムにおけるデータベース等のバックアップ方法
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a distributed system in which a plurality of computers are connected via a communication path, and more particularly to a backup method for a database or the like in the distributed system.

【０００２】[0002]

【従来の技術】計算機におけるデータベース等のバック
アップは一般に、マスタデータの所定時点での内容に対
応するバックアップデータと、このバックアップデータ
の記録後に行われた更新処理に対応する更新履歴とに基
づいて行われている。このような計算機において、マス
タデータが破壊された場合には、更新履歴に基づいてバ
ックアップデータが更新されて最新のデータが復元され
る。2. Description of the Related Art In general, backup of a database or the like in a computer is performed based on backup data corresponding to the contents of master data at a predetermined point in time and an update history corresponding to an update process performed after recording the backup data. Have been done. In such a computer, when the master data is destroyed, the backup data is updated based on the update history, and the latest data is restored.

【０００３】図１６は従来の分散システムにおけるデー
タベース等のバックアップ方法を説明するための図であ
る。図１６に示すように、分散システムは、通信路２を
介して接続された３台の計算機１Ａ，１Ｂ，１Ｃから構
成されており、計算機（稼働系計算機）１Ａにてデータ
ベースの更新処理を含むオンライン処理等の業務が行わ
れ、計算機（バックアップ先計算機）１Ｂにて計算機１
Ａの処理およびデータがバックアップされるようになっ
ている。ここで、稼働系計算機１Ａにはマスタデータ３
が保持され、バックアップ先計算機１Ｂにはバックアッ
プデータ４が保持されている。また、稼働系計算機１Ａ
で行われるデータベースの更新処理に対応する更新履歴
は稼働系計算機１Ａからバックアップ先計算機１Ｂへ転
送されている。このような分散システムにおいて、何ら
かの原因により稼働系計算機１Ａに障害が発生した場
合、または稼働系計算機１Ａが保持するマスタデータ３
が使用不能となった場合には、バックアップ先計算機１
Ｂが更新履歴ファイル５に基づいてバックアップデータ
４を更新して最新のデータを復元し、稼働系計算機１Ａ
の業務を引き継ぐようになっている。FIG. 16 is a diagram for explaining a backup method of a database or the like in a conventional distributed system. As shown in FIG. 16, the distributed system is composed of three computers 1A, 1B, and 1C connected via a communication path 2, and includes a computer (operating computer) 1A including a database update process. Business such as online processing is performed, and the computer (backup destination computer) 1B
The process A and data are backed up. Here, the master data 3 is stored in the active computer 1A.
And the backup destination computer 1B holds the backup data 4. The operating system computer 1A
The update history corresponding to the database update process performed in step 1 is transferred from the active computer 1A to the backup destination computer 1B. In such a distributed system, when a failure occurs in the active computer 1A for some reason, or when the master data 3 held by the active computer 1A is lost.
Becomes unavailable, the backup destination computer 1
B updates the backup data 4 based on the update history file 5, restores the latest data, and updates the active computer 1A.
Business is being taken over.

【０００４】ところで、このような分散システムにおい
て、バックアップ先計算機１Ｂに障害が発生した場合に
は、主として次のような２つの方法によりバックアップ
が継続される。[0004] In such a distributed system, when a failure occurs in the backup destination computer 1B, backup is continued mainly by the following two methods.

【０００５】すなわち、第１の従来方法は、バックアッ
プ元である稼働系計算機１Ａで更新履歴の記録を開始
し、バックアップ先計算機１Ｂが復旧したときに稼働系
計算機１Ａからバックアップ先計算機１Ｂへ更新履歴を
転送する方法であり、これによりバックアップ先計算機
１Ｂは稼働系計算機１Ａから転送された更新履歴に基づ
いてバックアップデータ４を更新してバックアップデー
タ４を最新の状態に保つことできる。That is, in the first conventional method, recording of the update history is started at the active computer 1A as the backup source, and when the backup destination computer 1B is restored, the update history is transferred from the active computer 1A to the backup destination computer 1B. This allows the backup destination computer 1B to update the backup data 4 based on the update history transferred from the active computer 1A and keep the backup data 4 in the latest state.

【０００６】また、第２の従来方法は、稼働系計算機１
Ａのバックアップ先をバックアップ先計算機１Ｂから他
の計算機（例えば図１６の計算機１Ｃ）へ変更する方法
であり、これにより計算機１Ｃにおいて稼働系計算機１
Ａのバックアップを継続することができる。[0006] The second conventional method uses the active computer 1.
In this method, the backup destination of A is changed from the backup destination computer 1B to another computer (for example, the computer 1C in FIG. 16).
The backup of A can be continued.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上述し
た第１および第２の従来方法ではいずれも、バックアッ
プ先計算機の復旧時またはバックアップ先計算機の変更
時に、稼働系計算機１Ａからバックアップ先計算機１Ｂ
またはバックアップ先計算機１Ｃへ更新履歴を転送しな
ければならないので、計算機および通信路に対して一時
的に高い負荷がかかり、データベースの更新処理以外の
他の業務に影響を与える可能性がある。However, in the first and second conventional methods described above, when the backup destination computer is restored or the backup destination computer is changed, the active computer 1A and the backup destination computer 1B are used.
Alternatively, since the update history has to be transferred to the backup destination computer 1C, a heavy load is temporarily applied to the computer and the communication path, which may affect other operations other than the database update processing.

【０００８】また、上述した第１および第２の従来方法
ではいずれも、計算機１Ｂ，１Ｃの障害時に稼働系計算
機１Ａのみが最新のデータを保持している状態となるの
で、この状態で稼働系計算機１Ａが保持するデータが使
用不能となった場合には、業務の停止のみならず、最新
のデータの消失により、ユーザに対して重大な損害を与
える可能性がある。In the first and second conventional methods described above, when the computers 1B and 1C fail, only the active computer 1A holds the latest data. If the data held by the computer 1A becomes unusable, there is a possibility that serious damage to the user may be caused not only by stopping the business but also by losing the latest data.

【０００９】本発明はこのような点を考慮してなされた
ものであり、計算機および通信路に対する負荷が低くか
つ信頼性が高い分散システムおよびそのバックアップ方
法を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in consideration of the above points, and an object of the present invention is to provide a highly reliable distributed system having a low load on a computer and a communication path, and a backup method thereof.

【００１０】[0010]

【課題を解決するための手段】本発明の第１の特徴は、
マスタデータに対して更新処理を行う稼働系計算機と、
前記マスタデータの所定時点での内容に対応するバック
アップデータ、および前記更新処理に対応する更新履歴
を保持する少なくとも２つのバックアップ用計算機とを
備え、前記各バックアップ用計算機は、それぞれのバッ
クアップ用計算機の間で分割して保持される前記更新履
歴の分割単位ごとの位置を特定するための位置情報を管
理することを特徴とする分散システムである。A first feature of the present invention is as follows.
An active computer that updates the master data,
Backup data corresponding to the content of the master data at a predetermined point in time, and at least two backup computers that hold an update history corresponding to the update process, wherein each of the backup computers is A distributed system for managing position information for specifying a position of each update unit of the update history which is divided and held.

【００１１】本発明の第１の特徴においては、前記稼働
系計算機は、前記更新処理に対応する更新履歴を保持す
るとともに、前記各バックアップ用計算機の間で分割し
て保持される前記更新履歴の分割単位ごとの位置を特定
するための位置情報を管理することが好ましく、また
前記各バックアップ用計算機は、前記位置情報に基づい
て他のバックアップ用計算機または前記稼働系計算機か
ら更新履歴を収集することが好ましい。In a first aspect of the present invention, the active computer holds an update history corresponding to the update processing, and stores the update history of the update history divided and held between the backup computers. It is preferable to manage position information for specifying a position for each division unit, and
It is preferable that each of the backup computers collects an update history from another backup computer or the active computer based on the location information.

【００１２】本発明の第２の特徴は、マスタデータに対
して更新処理を行う稼働系計算機と、前記マスタデータ
の所定時点での内容に対応するバックアップデータを保
持する少なくとも２つのバックアップ用計算機とを備え
た分散システムのバックアップ方法において、前記稼働
系計算機からバックアップ先となるバックアップ用計算
機に対して前記更新処理に対応する更新履歴を転送する
ステップと、前記各バックアップ用計算機の間で分割し
て保持される前記更新履歴の分割単位ごとの位置を特定
するための位置情報を管理するステップとを含むことを
特徴とする分散システムのバックアップ方法である。A second feature of the present invention is that an active computer for updating master data and at least two backup computers for holding backup data corresponding to the contents of the master data at a predetermined time. Transferring the update history corresponding to the update process from the active computer to the backup computer serving as a backup destination, and dividing the backup computer between the backup computers. Managing position information for specifying a position of each of the update histories to be held for each division unit.

【００１３】本発明の第２の特徴においては、いずれか
のバックアップ用計算機において前記位置情報に基づい
て他のバックアップ用計算機または前記稼働系計算機か
ら更新履歴を収集するステップをさらに含むことが好ま
しい。In the second aspect of the present invention, it is preferable that the method further includes a step of collecting an update history from another backup computer or the active computer based on the location information in any one of the backup computers.

【００１４】本発明の第３の特徴は、上述した第２の特
徴において、いずれかのバックアップ用計算機において
最新のデータを復元する必要が生じた場合に、前記位置
情報に基づいて他のバックアップ用計算機または前記稼
働系計算機から収集すべき更新履歴のデータ量を比較
し、このデータ量が最も小さいバックアップ用計算機に
おいて前記位置情報に基づいて他の計算機から更新履歴
を収集することにより前記バックアップデータから最新
のデータを復元するステップをさらに含むことを特徴と
する分散システムのバックアップ方法である。According to a third feature of the present invention, in the above-mentioned second feature, when it becomes necessary to restore the latest data in any one of the backup computers, another backup computer is restored based on the position information. Compare the data amount of the update history to be collected from the computer or the active computer, and collect the update history from another computer based on the location information in the backup computer having the smallest data amount, from the backup data. A backup method for a distributed system, further comprising the step of restoring the latest data.

【００１５】本発明の第４の特徴は、上述した第２の特
徴において、バックアップ先を選択する必要が生じた場
合に、前記位置情報に基づいて他のバックアップ用計算
機または前記稼働系計算機から収集すべき更新履歴のデ
ータ量を比較し、このデータ量が最も小さいバックアッ
プ用計算機を新たなバックアップ先として選択するステ
ップをさらに含むことを特徴とする請求項４記載の分散
システムのバックアップ方法である。According to a fourth feature of the present invention, in the above-described second feature, when it is necessary to select a backup destination, the data is collected from another backup computer or the active computer based on the location information. 5. The distributed system backup method according to claim 4, further comprising the step of comparing the data amounts of update history to be performed and selecting a backup computer having the smallest data amount as a new backup destination.

【００１６】本発明の第１および第２の特徴によれば、
稼働系計算機で行われる更新処理に対応する一連の更新
履歴を複数のバックアップ用計算機の間で分割して保持
し、位置情報に基づいて稼働系計算機および各バックア
ップ用計算機が保持する更新履歴を相互に収集できるよ
うになっているので、バックアップ先計算機の変更時ま
たはバックアップ先計算機の復旧時等にバックアップ先
計算機等へ過去の大量の更新履歴を転送するような必要
がなく、夜間等の負荷が軽い時間帯または業務を行って
いない時間帯等に更新履歴を転送すればよく、このため
計算機および通信路に対する負荷を軽減させて他の業務
に与える影響を最小限にすることができる。また、稼働
系計算機の障害時またはバックアップ先計算機の変更時
等においても、少なくとも２台の計算機が最新のデータ
を保持している状態、または最新のデータを復元可能な
状態となっているので、この状態で稼働系計算機が保持
するデータが使用不能となった場合でも、他のバックア
ップ用計算機等に基づいて最新のデータを復元すること
ができ、このため最新のデータが消失することを防止し
て信頼性を向上させることができる。さらに、稼働系計
算機と各バックアップ用計算機との間で同一内容の更新
履歴を保持しているので、必要な更新履歴を複数の計算
機から収集することができ、このため例えば負荷の軽い
計算機から更新履歴を収集することで計算機の負荷を軽
減することができる。また、特定の計算機または通信路
に負荷が集中しないよう更新履歴を収集することで更新
履歴の転送時間を短縮することができる。According to the first and second aspects of the present invention,
A series of update histories corresponding to the update processing performed by the active computer is divided and held between multiple backup computers, and the update histories held by the active computer and each backup computer are exchanged based on the location information. It is not necessary to transfer a large amount of past update history to the backup destination computer when the backup destination computer is changed or when the backup destination computer is restored, and the load at night is reduced. The update history may be transferred during a light time period or a time period during which no work is performed, so that the load on the computer and the communication path can be reduced and the influence on other work can be minimized. Also, at the time of failure of the active computer or change of the backup destination computer, at least two computers are in the state of holding the latest data or in the state of being able to restore the latest data, Even if the data held by the active computer becomes unusable in this state, the latest data can be restored based on other backup computers, etc., thereby preventing the latest data from being lost. Reliability can be improved. Further, since the same update history is maintained between the active computer and each backup computer, necessary update histories can be collected from a plurality of computers. By collecting the history, the load on the computer can be reduced. Further, by collecting update histories so that a load is not concentrated on a specific computer or a communication path, the transfer time of the update histories can be reduced.

【００１７】本発明の第３の特徴によれば、稼働系計算
機の障害等によりいずれかのバックアップ用計算機にて
最新のデータを復元する必要が生じた場合に、他のバッ
クアップ用計算機から収集すべき更新履歴のデータ量が
最も少ないバックアップ用計算機を復元先として選択す
るので、計算機または通信路への負荷を軽減することが
でき、このため短時間で最新のデータを復元することが
できる。According to the third feature of the present invention, when it becomes necessary to restore the latest data in one of the backup computers due to a failure of the active computer or the like, the latest data is collected from another backup computer. Since the backup computer having the smallest data volume of the update history to be selected is selected as the restoration destination, the load on the computer or the communication path can be reduced, and the latest data can be restored in a short time.

【００１８】本発明の第４の特徴によれば、バックアッ
プ用計算機の復旧等により新たなバックアップ先を選択
する必要が生じた場合に、他の計算機から収集すべき更
新履歴のデータ量が最も少ないバックアップ用計算機を
バックアップ先として選択するので、計算機または通信
路への負荷を軽減して短時間で最新のデータを復元する
ことができるとともに、業務の運用中に更新履歴ファイ
ルを転送する場合であっても、計算機または通信路への
負荷を軽減して業務への影響を軽減することができる。According to the fourth feature of the present invention, when it becomes necessary to select a new backup destination due to restoration of a backup computer or the like, the amount of update history data to be collected from another computer is the smallest. Since the backup computer is selected as the backup destination, it is possible to reduce the load on the computer or the communication path, restore the latest data in a short time, and transfer the update history file during business operation. However, it is possible to reduce the load on the computer or the communication path and reduce the influence on the business.

【００１９】[0019]

【発明の実施の形態】第１の実施の形態以下、図面を参照して本発明の実施の形態について説明
する。図１乃至図１３は本発明による分散システムおよ
びそのバックアップ方法の第１の実施の形態を説明する
ための図である。DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment An embodiment of the present invention will be described below with reference to the drawings. FIGS. 1 to 13 are diagrams for explaining a first embodiment of a distributed system and its backup method according to the present invention.

【００２０】まず、図１により、分散システムのシステ
ム構成について説明する。図１に示すように、分散シス
テムは、通信路２を介して接続された３台の計算機１
Ａ，１Ｂ，１Ｃから構成されており、計算機１Ａにてデ
ータベースの更新処理を含むオンライン処理等の業務が
行われ、計算機１Ｂ，１Ｃにて計算機１Ａの処理および
データがバックアップされるようになっている。すなわ
ち、計算機１Ａはマスタデータ３に対して更新処理を行
う稼働系計算機であり、計算機１Ｂ，１Ｃはマスタデー
タ３の所定時点での内容に対応するバックアップデータ
４、および稼働系計算機１Ａにおける更新処理に対応す
る更新履歴ファイル５を保持するバックアップ用計算機
である。なお、マスタデータ３、バックアップデータ４
および更新履歴ファイル５はいずれも各計算機１Ａ，１
Ｂ，１Ｃに設けられた所定の記憶装置内に保持される。First, a system configuration of a distributed system will be described with reference to FIG. As shown in FIG. 1, the distributed system includes three computers 1 connected via a communication path 2.
A, 1B, and 1C, the computer 1A performs operations such as online processing including database update processing, and the computers 1B and 1C back up the processing and data of the computer 1A. I have. That is, the computer 1A is an active computer that performs an update process on the master data 3, and the computers 1B and 1C are backup data 4 corresponding to the contents of the master data 3 at a predetermined time, and the update process in the active computer 1A. Is a backup computer that holds the update history file 5 corresponding to. Note that master data 3 and backup data 4
And the update history file 5 are stored in each computer 1A, 1
B and 1C are stored in a predetermined storage device.

【００２１】ここで稼働系計算機１Ａは、マスタデータ
３に対する更新処理に対応する更新履歴ファイル５を保
持するとともに、各バックアップ用計算機１Ｂ，１Ｃの
間で分割して保持される更新履歴ファイル５の位置を特
定するための位置・順序情報ファイル（位置情報）６を
管理している。また各バックアップ用計算機１Ｂ，１Ｃ
は、それぞれのバックアップ用計算機１Ｂ，１Ｃの間で
分割して保持される更新履歴ファイル５の位置を特定す
るための位置・順序情報ファイル６を管理している。各
計算機１Ａ，１Ｂ，１Ｃが保持する位置・順序情報ファ
イル６は全て同一内容であり、分割して保持される更新
履歴ファイル名、更新履歴ファイルを保持する計算機
名、および更新履歴ファイルの順序情報が含まれてい
る。Here, the active computer 1A holds the update history file 5 corresponding to the update processing for the master data 3, and stores the update history file 5 divided and held between the backup computers 1B and 1C. A position / order information file (position information) 6 for specifying the position is managed. Each backup computer 1B, 1C
Manages a position / order information file 6 for specifying the position of the update history file 5 that is divided and held between the backup computers 1B and 1C. The position / order information files 6 held by each of the computers 1A, 1B, 1C have the same contents, and the update history file name divided and held, the computer name holding the update history file, and the order information of the update history file It is included.

【００２２】次に、このような構成からなる分散システ
ムにおける各種の処理について説明する。Next, various processes in the distributed system having such a configuration will be described.

【００２３】（業務およびバックアップの開始時）ま
ず、図２により、図１に示す分散システムにおいて業務
およびバックアップを開始する場合の処理について説明
する。(At the Start of Business and Backup) First, referring to FIG. 2, a description will be given of a process for starting a business and backup in the distributed system shown in FIG.

【００２４】図１および図２に示すように、稼働系計算
機１Ａのオンライン処理の業務およびバックアップを開
始する場合には、まず、稼働系計算機１Ａにてバックア
ップ先となる計算機（図１ではバックアップ用計算機１
Ｂ）を選択し（ステップ１０１）、次いで、稼働系計算
機１Ａにて位置・順序情報ファイル６を更新する（ステ
ップ１０２）。ここで稼働系計算機１Ａにおいては、図
３に示すような位置・順序情報ファイル６が作成され
る。図３に示すように、位置・順序情報ファイル６に
は、更新履歴ファイルの順序番号（“１”）、更新履歴
ファイル名（“log1”）、更新履歴ファイルを保持する
計算機名（“計算機Ａ”，“計算機Ｂ”）、および更新
履歴ファイルの更新状態（“更新中”）が含まれてい
る。As shown in FIGS. 1 and 2, when starting the operation of online processing and backup of the active computer 1A, first, the backup computer (in FIG. 1, the backup computer) is used as the backup destination. Calculator 1
B) is selected (step 101), and then the position / order information file 6 is updated by the active computer 1A (step 102). Here, in the active computer 1A, a position / order information file 6 as shown in FIG. 3 is created. As shown in FIG. 3, the position / order information file 6 includes an update history file sequence number (“1”), an update history file name (“log1”), and a computer name (“computer A”) holding the update history file. , "Computer B"), and the update state of the update history file ("Updating").

【００２５】その後、稼働系計算機１Ａから、この位置
・順序情報ファイル６をシステムに組み込まれている全
ての計算機（図１ではバックアップ用計算機１Ｂ，１
Ｃ）へ通信路２を介して転送し（ステップ１０３）、稼
働系計算機１Ａの業務およびバックアップを開始する
（ステップ１０４）。ステップ１０４においては、稼働
系計算機１Ａにてデータベースであるマスタデータ３に
対して更新処理が行われると、この更新処理に対応する
更新履歴を更新履歴ファイル（log1）５に保持するとと
もに、稼働系計算機１Ａからバックアップ用計算機１Ｂ
へ転送する。なお、バックアップ用計算機１Ｂにおいて
は、稼働系計算機１Ａから転送された更新履歴を更新履
歴ファイル（log1）５に保持する。Thereafter, from the active computer 1A, the position / sequence information file 6 is transferred to all computers (in FIG. 1, backup computers 1B and 1 in FIG. 1) incorporated in the system.
C) via the communication path 2 (step 103), and the work and backup of the active computer 1A are started (step 104). In step 104, when the active computer 1A updates the master data 3 as a database, the update history corresponding to the update process is stored in the update history file (log1) 5, and the active computer 1A is updated. Computer 1A to backup computer 1B
Transfer to In the backup computer 1B, the update history transferred from the active computer 1A is stored in the update history file (log1) 5.

【００２６】（バックアップ先計算機の障害時）次に、
図４乃至図６により、このような分散システムにおいて
稼働系計算機１Ａのバックアップ先計算機に障害が発生
してバックアップ先が切り替えられる場合の処理につい
て説明する。(When a backup destination computer fails)
With reference to FIGS. 4 to 6, a description will be given of a process in a case where a failure occurs in the backup destination computer of the active computer 1A and the backup destination is switched in such a distributed system.

【００２７】図４および図５に示すように、バックアッ
プ先であるバックアップ用計算機１Ｂに障害が発生して
稼働系計算機１Ａのバックアップを行うことができなく
なった場合には、稼働系計算機１Ａにてバックアップ用
計算機１Ｂの障害を検出し（ステップ２０１）、稼働系
計算機１Ａの業務およびバックアップを一時中断すると
ともに（ステップ２０２）、障害が発生したバックアッ
プ用計算機１Ｂをシステムから切り離す（ステップ２０
３）。As shown in FIGS. 4 and 5, when a failure occurs in the backup computer 1B, which is the backup destination, and the backup of the active computer 1A cannot be performed, the active computer 1A is disabled. A failure of the backup computer 1B is detected (Step 201), and the operation and backup of the active computer 1A are temporarily suspended (Step 202), and the failed backup computer 1B is disconnected from the system (Step 20).
3).

【００２８】その後、稼働系計算機１Ａにて位置・順序
情報ファイル６を更新する（ステップ２０４）。ここで
稼働系計算機１Ａにおいては、図６（ａ）に示すような
位置・順序情報ファイル６が作成される。そして、稼働
系計算機１Ａから、この位置・順序情報ファイル６をシ
ステムに組み込まれている全ての計算機（図４ではバッ
クアップ用計算機１Ｃ）へ通信路２を介して転送し（ス
テップ２０５）、稼働系計算機１Ａの業務およびバック
アップを再開する（ステップ２０６）。なお、稼働系計
算機１Ａの業務およびバックアップを再開する場合に
は、図２に示す処理が再度行われる。すなわち、稼働系
計算機１Ａにてバックアップ先となる計算機（図４では
バックアップ用計算機１Ｃ）を選択し、図６（ｂ）に示
すような位置・順序情報ファイル６を作成してバックア
ップ用計算機１Ｃへ転送した後、稼働系計算機１Ａの業
務およびバックアップを開始する。Thereafter, the position / sequence information file 6 is updated by the active computer 1A (step 204). Here, in the active computer 1A, a position / order information file 6 as shown in FIG. 6A is created. Then, the position / order information file 6 is transferred from the active computer 1A to all the computers (the backup computer 1C in FIG. 4) incorporated in the system via the communication path 2 (step 205). The operation and backup of the computer 1A are restarted (step 206). When the operation and backup of the active computer 1A are restarted, the processing shown in FIG. 2 is performed again. That is, the backup computer (the backup computer 1C in FIG. 4) is selected in the active computer 1A, the position / order information file 6 as shown in FIG. 6B is created, and the backup computer 1C is sent to the backup computer 1C. After the transfer, the operation and backup of the active computer 1A are started.

【００２９】（バックアップ用計算機の復旧時）次に、
図７乃至図１０により、このような分散システムにおい
て障害が発生したバックアップ用計算機が復旧してバッ
クアップ先が切り替えられる場合の処理について説明す
る。(When the backup computer is restored)
With reference to FIGS. 7 to 10, a description will be given of a process in a case where a backup computer in which a failure has occurred in such a distributed system is restored and a backup destination is switched.

【００３０】図７および図８に示すように、障害により
切り離されたバックアップ用計算機１Ｂが復旧してシス
テムに再度組み込まれる場合には、組込み対象となるバ
ックアップ用計算機１Ｂから、システムに組み込まれて
いる他の計算機（図７では稼働系計算機１Ａおよびバッ
クアップ用計算機１Ｃ）へ組込みを要求する（ステップ
３０１）。ここで、要求を受けた稼働系計算機１Ａまた
はバックアップ用計算機１Ｃは、全ての計算機における
位置・順序情報ファイル６の内容を一致させるため、組
込み対象となるバックアップ用計算機１Ｂへ位置・順序
情報ファイル６を転送する（ステップ３０２）。これに
より、バックアップ用計算機１Ｂはシステムに組み込ま
れ、業務およびバックアップが可能な状態へ移行する
（ステップ３０３）。As shown in FIGS. 7 and 8, when the backup computer 1B disconnected due to a failure recovers and is re-integrated into the system, the backup computer 1B to be incorporated is incorporated into the system. The other computers (in FIG. 7, the active computer 1A and the backup computer 1C in FIG. 7) are requested to be incorporated (step 301). Here, the active computer 1A or the backup computer 1C, which has received the request, sends the position / order information file 6 to the backup computer 1B to be incorporated in order to match the contents of the position / order information files 6 in all the computers. Is transferred (step 302). As a result, the backup computer 1B is incorporated into the system, and shifts to a state where work and backup can be performed (step 303).

【００３１】なお、以上のようにしてバックアップ用計
算機１Ｂがシステムから切り離されたり復旧したりした
場合には、図７および図９に示すように、バックアップ
用計算機１Ｂが組み込まれた後（ステップ４０１）、稼
働系計算機１Ａの業務およびバックアップを一時中断す
るとともに（ステップ４０２）、稼働系計算機１Ａにて
バックアップ先となる計算機（図７ではバックアップ用
計算機１Ｂ）を選択する（ステップ４０３）。When the backup computer 1B is disconnected from the system or restored as described above, as shown in FIGS. 7 and 9, the backup computer 1B is installed (step 401). ), The operation and backup of the active computer 1A are temporarily suspended (step 402), and a computer (backup computer 1B in FIG. 7) to be a backup destination is selected by the active computer 1A (step 403).

【００３２】その後、稼働系計算機１Ａにて位置・順序
情報ファイル６を更新する（ステップ４０４）。ここで
稼働系計算機１Ａにおいては、図１０（ａ）に示すよう
な位置・順序情報ファイル６が作成される。そして、稼
働系計算機１Ａから、この位置・順序情報ファイル６を
システムに組み込まれている全ての計算機（図７ではバ
ックアップ用計算機１Ｂ，１Ｃ）へ通信路２を介して転
送し（ステップ４０５）、稼働系計算機１Ａの業務およ
びバックアップを再開する（ステップ４０６）。なお、
稼働系計算機１Ａの業務およびバックアップを再開する
場合には、図２に示す処理が再度行われる。すなわち、
稼働系計算機１Ａにてバックアップ先となる計算機（図
７ではバックアップ用計算機１Ｂ）を選択し、図１０
（ｂ）に示すような位置・順序情報ファイル６を作成し
てバックアップ用計算機１Ｂ，１Ｃへ転送した後、稼働
系計算機１Ａの業務およびバックアップを開始する。After that, the operating computer 1A updates the position / order information file 6 (step 404). Here, in the active computer 1A, a position / order information file 6 as shown in FIG. 10A is created. Then, the position / order information file 6 is transferred from the active computer 1A to all the computers (backup computers 1B and 1C in FIG. 7) incorporated in the system via the communication path 2 (step 405). The operation and backup of the active computer 1A are resumed (step 406). In addition,
When the operation and backup of the active computer 1A are restarted, the processing shown in FIG. 2 is performed again. That is,
The active computer 1A selects a computer to be a backup destination (the backup computer 1B in FIG. 7), and FIG.
After creating the position / sequence information file 6 as shown in (b) and transferring it to the backup computers 1B and 1C, the work and backup of the active computer 1A are started.

【００３３】なお、このようにしてバックアップ用計算
機１Ｂが復旧した場合には、図７に示すように、稼働系
計算機１Ａから転送された更新履歴は各バックアップ用
計算機１Ｂ，１Ｃに分割して保持されることとなる。こ
こで、バックアップ用計算機１Ｂ，１Ｃは、位置・順序
情報ファイル６に基づいて稼働系計算機１Ａまたは他の
バックアップ用計算機１Ｃ，１Ｂが保持する更新履歴フ
ァイル５の位置を特定して必要な更新履歴を収集するこ
とができる。例えば、図７に示す場合には、バックアッ
プ用計算機１Ｂがバックアップ用計算機１Ｃから更新履
歴ファイル（log2）を収集し、この収集された更新履歴
ファイルと、バックアップ用計算機１Ｂがもともと保持
していた更新履歴ファイル（log1，log3）とに基づいて
バックアップデータ４を更新することにより、バックア
ップデータ４から最新のデータを復元することができ
る。なお、このような更新履歴ファイルの転送および最
新のデータの復元は、バックアップ先計算機の変更時ま
たはバックアップ先計算機の復旧時等に行う必要はな
く、例えば夜間等の負荷が軽い時間帯または業務を行っ
ていない時間帯等にまとめて行うことができる。When the backup computer 1B is restored in this way, as shown in FIG. 7, the update history transferred from the active computer 1A is divided and held in the backup computers 1B and 1C. Will be done. Here, the backup computers 1B and 1C specify the position of the update history file 5 held by the active computer 1A or the other backup computers 1C and 1B based on the position / order information file 6 and update the necessary update history. Can be collected. For example, in the case shown in FIG. 7, the backup computer 1B collects an update history file (log2) from the backup computer 1C, and updates the collected update history file and the update originally held by the backup computer 1B. By updating the backup data 4 based on the history files (log1, log3), the latest data can be restored from the backup data 4. Such transfer of the update history file and restoration of the latest data need not be performed when the backup destination computer is changed or when the backup destination computer is restored. It can be performed at a time when it is not performed.

【００３４】（稼働系計算機の障害時）次に、図１１に
より、このような分散システムにおいて稼働系計算機に
障害が発生した場合の処理について説明する。(At the time of failure of the active computer) Next, referring to FIG. 11, the processing when a failure occurs in the active computer in such a distributed system will be described.

【００３５】例えば図７に示す分散システムにおいて、
バックアップ元である稼働系計算機１Ａに障害が発生し
た場合には、図１１に示すように、稼働系計算機１Ａの
障害を検出し（ステップ５０１）、稼働系計算機１Ａの
業務およびバックアップを一時中断するとともに（ステ
ップ５０２）、稼働系計算機１Ａをシステムから切り離
す（ステップ５０３）。For example, in the distributed system shown in FIG.
When a failure occurs in the active computer 1A as the backup source, as shown in FIG. 11, a failure in the active computer 1A is detected (step 501), and the work and backup of the active computer 1A are temporarily suspended. At the same time (step 502), the active computer 1A is disconnected from the system (step 503).

【００３６】そして、稼働系計算機１Ａの業務を引き継
ぐ新たな稼働系計算機（例えばバックアップ用計算機１
Ｂ）を選択した後（ステップ５０４）、この新たな稼働
系計算機１Ｂにて位置・順序情報ファイル６を更新する
とともに（ステップ５０５）、新たな稼働系計算機１Ｂ
から、この位置・順序情報ファイル６をシステムに組み
込まれている全ての計算機（バックアップ用計算機１
Ｃ）へ通信路２を介して転送する（ステップ５０６）。Then, a new active computer (for example, the backup computer 1) taking over the work of the active computer 1A
After selecting B) (step 504), the new active computer 1B updates the position / sequence information file 6 (step 505) and the new active computer 1B.
From this, the position / sequence information file 6 is used for all computers (backup computer 1) incorporated in the system.
C) via the communication path 2 (step 506).

【００３７】その後、新たな稼働系計算機１Ｂは、位置
・順序情報ファイル６に基づいて他のバックアップ用計
算機１Ｃが保持する更新履歴ファイル５の位置を特定し
て必要な更新履歴を収集するとともに、この収集された
更新履歴ファイルと、新たな稼働系計算機１Ｂがもとも
と保持していた更新履歴ファイルとに基づいてバックア
ップデータ４を更新して最新のデータを復元した後（ス
テップ５０７）、新たな稼働系計算機１Ｂにて業務およ
びバックアップを再開する（ステップ５０８）。Thereafter, the new active computer 1B specifies the position of the update history file 5 held by the other backup computer 1C based on the position / order information file 6, and collects necessary update histories. After updating the backup data 4 and restoring the latest data based on the collected update history file and the update history file originally held by the new operating computer 1B (step 507), the new operation The operation and backup are restarted in the system computer 1B (step 508).

【００３８】（最新のデータの復元時）次に、図１２お
よび図１３により、稼働系計算機の障害時等において最
新のデータを復元するために特定の計算機（転送先計算
機）へ他の計算機（転送元計算機）から更新履歴を転送
する場合の処理について説明する。なおここでは、図１
２に示すように、稼働系計算機１Ａおよびバックアップ
用計算機１Ｂ，１Ｃのいずれもがシステムに組み込まれ
ている状態で、バックアップ用計算機１Ｃにて最新のデ
ータを復元する場合を想定する。(At the time of restoring the latest data) Next, referring to FIGS. 12 and 13, in order to restore the latest data in the event of a failure of the active computer, a specific computer (transfer destination computer) is transferred to another computer (transfer destination computer). A process when the update history is transferred from the transfer source computer) will be described. Here, FIG.
As shown in FIG. 2, it is assumed that the backup computer 1C restores the latest data in a state where both the active computer 1A and the backup computers 1B and 1C are incorporated in the system.

【００３９】図１２および図１３に示すように、まず、
転送先計算機であるバックアップ用計算機１Ｃにて位置
・順序情報ファイル６を更新し（ステップ６０１）、次
いで、バックアップ用計算機１Ｃから、システムに組み
込まれている全計算機（図１２では稼働系計算機１Ａお
よびバックアップ用計算機１Ｂ）へ位置・順序情報ファ
イル６を転送する（ステップ６０２）。As shown in FIGS. 12 and 13, first,
The backup / computer 1C, which is the transfer destination computer, updates the position / order information file 6 (step 601), and then, from the backup computer 1C, all the computers incorporated in the system (the active computers 1A and 1A in FIG. 12). The position / order information file 6 is transferred to the backup computer 1B) (step 602).

【００４０】その後、転送元計算機である稼働系計算機
１Ａおよびバックアップ用計算機１Ｂからバックアップ
用計算機１Ｃへ最新のデータを復元するために必要な更
新履歴ファイル５を転送し（ステップ６０３）、バック
アップ用計算機１Ｃにて位置・順序情報ファイル６を更
新した後（ステップ６０４）、バックアップ用計算機１
Ｃからシステムに組み込まれている全計算機（図１２で
は稼働系計算機１Ａおよびバックアップ用計算機１Ｂ）
へ位置・順序情報ファイル６を転送する（ステップ６０
５）。Thereafter, the update history file 5 necessary for restoring the latest data from the active computer 1A and the backup computer 1B, which are transfer source computers, to the backup computer 1C is transferred (step 603). After updating the position / order information file 6 at 1C (step 604), the backup computer 1
All computers incorporated into the system from C (the active computer 1A and the backup computer 1B in FIG. 12)
To the position / order information file 6 (step 60).
5).

【００４１】なお、図１２に示す場合には、バックアッ
プ用計算機１Ｃへ稼働系計算機１Ａおよびバックアップ
用計算機１Ｂから更新履歴ファイル（Ｌ１，Ｌ２）５を
転送することとなるが、更新履歴ファイル（Ｌ１）５に
ついては稼働系計算機１Ａおよびバックアップ用計算機
１Ｂの両方で保持されているので、図１２に示すよう
に、稼働系計算機１Ａからは更新履歴ファイル（Ｌ２）
５を、バックアップ用計算機１Ｂからは更新履歴ファイ
ル（Ｌ１）５をそれぞれ並行に転送する。In the case shown in FIG. 12, the update history files (L1, L2) 5 are transferred from the active computer 1A and the backup computer 1B to the backup computer 1C. 12) is held in both the active computer 1A and the backup computer 1B, so that the update log file (L2) is transmitted from the active computer 1A as shown in FIG.
5, and the update history file (L1) 5 is transferred in parallel from the backup computer 1B.

【００４２】このように本発明の第１の実施の形態によ
れば、稼働系計算機１Ａで行われる更新処理に対応する
一連の更新履歴を複数のバックアップ用計算機１Ｂ，１
Ｃの間で分割して保持し、位置・順序情報ファイル６に
基づいて稼働系計算機１Ａおよび各バックアップ用計算
機１Ｂ，１Ｃが保持する更新履歴ファイル５を相互に収
集できるようになっている。従って、最新のデータの維
持が主たる目的である分散システム（例えば処理の引継
ぎは行わず、最新のデータのバックアップのみを行う分
散システム）においては、バックアップ先計算機の変更
時またはバックアップ先計算機の復旧時等にバックアッ
プ先計算機等へ過去の大量の更新履歴を転送するような
必要がなく、夜間等の負荷が軽い時間帯または業務を行
っていない時間帯等に更新履歴を転送すればよく、この
ため計算機および通信路に対する負荷を軽減させて他の
業務に与える影響を最小限にすることができる。As described above, according to the first embodiment of the present invention, a series of update histories corresponding to the update processing performed by the active computer 1A is stored in the backup computers 1B and 1B.
The update history files 5 held by the active computer 1A and each of the backup computers 1B and 1C can be mutually collected based on the position / order information file 6 while being divided and held. Therefore, in a distributed system whose main purpose is to maintain the latest data (for example, a distributed system that only backs up the latest data without taking over processing), when the backup destination computer is changed or when the backup destination computer is restored. For example, there is no need to transfer a large amount of past update history to the backup destination computer, etc., and it is sufficient to transfer the update history during a time when the load is light, such as at night, or during a time when business is not performed. The load on computers and communication paths can be reduced to minimize the impact on other tasks.

【００４３】また本発明の第１の実施の形態によれば、
稼働系計算機１Ａの障害時またはバックアップ先計算機
の変更時等においても、少なくとも２台の計算機が最新
のデータを保持している状態、または最新のデータを復
元可能な状態となっているので、この状態で稼働系計算
機１Ａが保持するデータが使用不能となった場合でも、
他のバックアップ用計算機等に基づいて最新のデータを
復元することができ、このため最新のデータが消失する
ことを防止して信頼性を向上させることができる。According to the first embodiment of the present invention,
Even when the active computer 1A fails or when the backup destination computer is changed, at least two computers are in a state where the latest data is held or the latest data can be restored. Even if the data held by the active computer 1A becomes unavailable in this state,
The latest data can be restored based on another backup computer or the like, so that the latest data can be prevented from being lost and reliability can be improved.

【００４４】さらに本発明の第１の実施の形態によれ
ば、稼働系計算機１Ａと各バックアップ用計算機１Ｂ，
１Ｃとの間で同一内容の更新履歴ファイル５を保持して
いるので、必要な更新履歴を複数の計算機から収集する
ことができ、このため例えば負荷の軽い計算機から更新
履歴を収集することで計算機の負荷を軽減することがで
きる。また、特定の計算機または通信路に負荷が集中し
ないよう更新履歴を収集することで更新履歴の転送時間
を短縮することができる。Further, according to the first embodiment of the present invention, the active computer 1A and each backup computer 1B,
Since the update history file 5 having the same contents is held between the computer 1C and the computer 1C, necessary update histories can be collected from a plurality of computers. Load can be reduced. Further, by collecting update histories so that a load is not concentrated on a specific computer or a communication path, the transfer time of the update histories can be reduced.

【００４５】第２の実施の形態次に、図１４により、本発明の第２の実施の形態につい
て説明する。本発明の第２の実施の形態は、稼働系計算
機の障害等によりバックアップ用計算機にて最新のデー
タを復元する場合における最適な復元先の選択方法につ
いてのものである。本発明の第２の実施の形態における
分散システムの基本的な構成および処理等については図
１乃至図１３に示す第１の実施の形態と略同一であるの
で、図１乃至図１３に示す第１の実施の形態と同一部分
には同一符号を付して詳細な説明は省略する。Second Embodiment Next, a second embodiment of the present invention will be described with reference to FIG. The second embodiment of the present invention relates to a method of selecting an optimal restoration destination when restoring the latest data on a backup computer due to a failure of an active computer or the like. The basic configuration, processing, and the like of the distributed system according to the second embodiment of the present invention are substantially the same as those of the first embodiment shown in FIGS. The same parts as those of the first embodiment are denoted by the same reference numerals, and detailed description is omitted.

【００４６】図１４（ａ）（ｂ）に示すように、例えば
図７に示す分散システムにおいて、バックアップ元であ
る稼働系計算機１Ａに障害が発生した場合には、バック
アップ用計算機１Ｃからバックアップ用計算機１Ｂへ更
新履歴ファイル（Ｌ２）５を転送してバックアップ用計
算機１Ｂで最新のデータを復元するか、バックアップ用
計算機１Ｂからバックアップ用計算機１Ｃへ更新履歴フ
ァイル（Ｌ１）５を転送してバックアップ用計算機１Ｃ
で最新のデータを復元する必要がある。As shown in FIGS. 14A and 14B, for example, in the distributed system shown in FIG. 7, when a failure occurs in the active computer 1A as the backup source, the backup computer 1C switches from the backup computer 1C to the backup computer. Transfer the update history file (L2) 5 to the backup computer 1B by transferring the update history file (L2) 5 to the backup computer 1B, or transfer the update history file (L1) 5 from the backup computer 1B to the backup computer 1C. 1C
Need to restore the latest data.

【００４７】この場合には、各バックアップ用計算機１
Ｂ，１Ｃにおいて、位置・順序情報ファイル６に基づい
て他のバックアップ用計算機１Ｃ，１Ｂから収集すべき
更新履歴のデータ量（更新履歴ファイル（Ｌ１，Ｌ２）
のサイズ）を比較する（図１４（ａ）参照）。そして、
このデータ量が最も小さいバックアップ用計算機を最新
のデータを復元する計算機として選択し、この選択され
たバックアップ用計算機において位置・順序情報ファイ
ル６に基づいて他のバックアップ用計算機から更新履歴
を収集することによりバックアップデータから最新のデ
ータを復元する（図１４（ｂ）参照）。なお、図１４に
示す場合には、更新履歴ファイル（Ｌ２）５のサイズが
更新履歴ファイル（Ｌ１）５のサイズよりも小さいの
で、更新履歴ファイル（Ｌ２）５がバックアップ用計算
機１Ｃからバックアップ用計算機１Ｂへ転送され、バッ
クアップ用計算機１Ｂにおいて最新のデータの復元およ
び処理の引継ぎ等が行われる。In this case, each backup computer 1
B, 1C, the data amount of the update history to be collected from the other backup computers 1C, 1B based on the position / order information file 6 (update history file (L1, L2)
Are compared (see FIG. 14A). And
The backup computer having the smallest data amount is selected as a computer for restoring the latest data, and the selected backup computer collects update history from another backup computer based on the position / order information file 6. To restore the latest data from the backup data (see FIG. 14B). In the case shown in FIG. 14, since the size of the update history file (L2) 5 is smaller than the size of the update history file (L1) 5, the update history file (L2) 5 is transmitted from the backup computer 1C to the backup computer 1C. 1B, and the backup computer 1B restores the latest data and takes over the processing.

【００４８】このように本発明の第２の実施の形態によ
れば、稼働系計算機１Ａの障害等によりいずれかのバッ
クアップ用計算機１Ｂ，１Ｃにて最新のデータを復元す
る必要が生じた場合に、他のバックアップ用計算機１
Ｃ，１Ｂから収集すべき更新履歴のデータ量が最も少な
いバックアップ用計算機を復元先として選択するので、
計算機または通信路への負荷を軽減することができ、こ
のため短時間で最新のデータを復元することができる。As described above, according to the second embodiment of the present invention, when it is necessary to restore the latest data in one of the backup computers 1B and 1C due to a failure of the active computer 1A or the like. , Another backup computer 1
Since the backup computer that has the least amount of update history data to be collected from C and 1B is selected as the restoration destination,
The load on the computer or the communication path can be reduced, so that the latest data can be restored in a short time.

【００４９】第３の実施の形態次に、図１５により、本発明の第３の実施の形態につい
て説明する。本発明の第３の実施の形態は、バックアッ
プ用計算機の復旧等によりバックアップ先計算機を変更
する場合における最適なバックアップ先の選択方法につ
いてのものである。本発明の第３の実施の形態における
分散システムの基本的な構成および処理等については図
１乃至図１３に示す第１の実施の形態と略同一であるの
で、図１乃至図１３に示す第１の実施の形態と同一部分
には同一符号を付して詳細な説明は省略する。 Third Embodiment Next, a third embodiment of the present invention will be described with reference to FIG. The third embodiment of the present invention relates to a method of selecting an optimal backup destination when changing a backup destination computer due to restoration of a backup computer or the like. The basic configuration, processing, and the like of the distributed system according to the third embodiment of the present invention are substantially the same as those of the first embodiment shown in FIGS. The same parts as those of the first embodiment are denoted by the same reference numerals, and detailed description is omitted.

【００５０】図１５に示すように、例えば図５に示す分
散システムにおいて、バックアップ用計算機１Ｂが復旧
した場合には、現在のバックアップ先であるバックアッ
プ用計算機１Ｃを引き続きバックアップ先とするか、復
旧したバックアップ用計算機１Ｂを新たなバックアップ
先とするかを選択する必要がある。なお、ここでは図１
５に示すように、稼働系計算機１Ａには更新履歴ファイ
ル（Ｌ１，Ｌ２，Ｌ３，Ｌ４，Ｌ５）５が保持され、バ
ックアップ用計算機１Ｂには更新履歴ファイル（Ｌ１，
Ｌ２，Ｌ３）５が保持され、バックアップ用計算機１Ｃ
には更新履歴ファイル（Ｌ４，Ｌ５）５が保持されてお
り、かつバックアップ用計算機１Ｂ，１Ｃにおいては更
新履歴ファイル（Ｌ１，Ｌ２，Ｌ４）５が既にバックア
ップデータに反映済みであるものとする。なお、各バッ
クアップ用計算機１Ｂ，１Ｃにおいては、位置・順序情
報ファイル６に加えて、どの更新履歴ファイル５が各バ
ックアップ用計算機１Ｂ，１Ｃのバックアップデータに
反映済みなのかを示す情報が保持されているものとす
る。As shown in FIG. 15, for example, in the distributed system shown in FIG. 5, when the backup computer 1B is restored, the backup computer 1C which is the current backup destination is continuously set as the backup destination or restored. It is necessary to select whether the backup computer 1B is a new backup destination. Here, FIG.
As shown in FIG. 5, the active computer 1A holds an update history file (L1, L2, L3, L4, L5) 5, and the backup computer 1B holds the update history file (L1, L1).
L2, L3) 5 are held and the backup computer 1C
Holds an update history file (L4, L5) 5, and in the backup computers 1B, 1C, the update history file (L1, L2, L4) 5 has already been reflected in the backup data. Each of the backup computers 1B and 1C holds, in addition to the position / order information file 6, information indicating which update history file 5 has been reflected in the backup data of each backup computer 1B and 1C. Shall be

【００５１】この場合には、バックアップ先として選択
される可能性があるバックアップ用計算機１Ｂ，１Ｃに
おいて、位置・順序情報ファイル６に基づいて他のバッ
クアップ用計算機１Ｃ，１Ｂまたは稼働系計算機１Ａか
ら収集すべき更新履歴のデータ量を比較する。このと
き、現在バックアップ先となっているバックアップ用計
算機１Ｃと復旧したバックアップ用計算機１Ｂとが保持
する更新履歴ファイル５の全体のサイズＳ１を位置・順
序情報ファイル６に基づいて計算するとともに、このサ
イズＳ１から既にバックアップデータへ反映済みの更新
履歴ファイル５の全体のサイズＳ２を計算し、サイズＳ
１からサイズＳ２を引いた値Ｓ３が最も小さいバックア
ップ用計算機を次の新たなバックアップ先として選択す
る。In this case, the backup computers 1B and 1C, which may be selected as backup destinations, collect from the other backup computers 1C and 1B or the active computer 1A based on the position / order information file 6. The data amount of the update history to be compared is compared. At this time, the entire size S1 of the update history file 5 held by the backup computer 1C as the current backup destination and the restored backup computer 1B is calculated based on the position / order information file 6, and this size is calculated. From S1, the entire size S2 of the update history file 5 already reflected in the backup data is calculated, and the size S
The backup computer having the smallest value S3 obtained by subtracting the size S2 from 1 is selected as the next new backup destination.

【００５２】すなわち、復旧したバックアップ用計算機
１Ｂの更新履歴ファイル５のサイズＳ３が現在バックア
ップ先となっているバックアップ用計算機１Ｃの更新履
歴ファイル５のサイズＳ３よりも小さい場合には、復旧
したバックアップ用計算機１Ｂを新たなバックアップ先
として選択する。一方、復旧したバックアップ用計算機
１Ｂの更新履歴ファイル５のサイズＳ３が現在バックア
ップ先となっているバックアップ用計算機１Ｃの更新履
歴ファイル５のサイズＳ３よりも大きい場合には現在バ
ックアップ先となっているバックアップ用計算機１Ｃを
引き続きバックアップ先として選択する。That is, if the size S3 of the update history file 5 of the restored backup computer 1B is smaller than the size S3 of the update history file 5 of the backup computer 1C that is currently the backup destination, the restored backup The computer 1B is selected as a new backup destination. On the other hand, when the size S3 of the update history file 5 of the restored backup computer 1B is larger than the size S3 of the update history file 5 of the backup computer 1C which is the current backup destination, the backup currently being the backup destination is performed. Computer 1C is continuously selected as the backup destination.

【００５３】なお、図１５に示す場合には、復旧したバ
ックアップ用計算機１Ｂの更新履歴ファイル５のサイズ
Ｓ３（更新履歴ファイル（Ｌ３）５のサイズ）が現在バ
ックアップ先となっているバックアップ用計算機１Ｃの
更新履歴ファイル５のサイズＳ３（更新履歴ファイル
（Ｌ５）５のサイズ）よりも大きいので、現在バックア
ップ先となっているバックアップ用計算機１Ｃを引き続
きバックアップ先として選択する。In the case shown in FIG. 15, the size S3 (the size of the update history file (L3) 5) of the update history file 5 of the restored backup computer 1B is the backup computer 1C which is the current backup destination. Is larger than the size S3 of the update history file 5 (the size of the update history file (L5) 5), the backup computer 1C which is currently the backup destination is continuously selected as the backup destination.

【００５４】このように本発明の第３の実施の形態によ
れば、バックアップ用計算機１Ｂの復旧等により新たな
バックアップ先を選択する必要が生じた場合に、他の計
算機から収集すべき更新履歴のデータ量が最も少ないバ
ックアップ用計算機をバックアップ先として選択するの
で、計算機または通信路への負荷を軽減して短時間で最
新のデータを復元することができるとともに、業務の運
用中に更新履歴ファイルを転送する場合であっても、計
算機または通信路への負荷を軽減して業務への影響を軽
減することができる。As described above, according to the third embodiment of the present invention, when it becomes necessary to select a new backup destination due to the recovery of the backup computer 1B, the update history to be collected from another computer. The backup computer with the least amount of data is selected as the backup destination, so the load on the computer or communication path can be reduced and the latest data can be restored in a short time. Even if the data is transferred, the load on the computer or the communication path can be reduced, and the effect on the business can be reduced.

【００５５】[0055]

【発明の効果】以上説明したように本発明によれば、計
算機および通信路に対する負荷が低くかつ信頼性が高い
分散システムおよびそのバックアップ方法を提供するこ
とができる。As described above, according to the present invention, it is possible to provide a distributed system with a low load on a computer and a communication path and a high reliability, and a backup method thereof.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明による分散システムの第１の実施の形態
のシステム構成を示す図。FIG. 1 is a diagram showing a system configuration of a first embodiment of a distributed system according to the present invention.

【図２】本発明による分散システムの第１の実施の形態
における業務およびバックアップの開始時の動作を説明
するためのフローチャート。FIG. 2 is a flowchart for explaining operations at the start of business and backup in the first embodiment of the distributed system according to the present invention.

【図３】図１に示す分散システムの各計算機が保持する
位置・順序情報ファイルの一例を示す図。FIG. 3 is a view showing an example of a position / order information file held by each computer of the distributed system shown in FIG. 1;

【図４】本発明による分散システムの第１の実施の形態
においてバックアップ先が切り替えられる場合の処理を
説明するための図。FIG. 4 is a view for explaining processing when a backup destination is switched in the first embodiment of the distributed system according to the present invention;

【図５】本発明による分散システムの第１の実施の形態
におけるバックアップ先計算機の障害時の動作を説明す
るためのフローチャート。FIG. 5 is a flowchart for explaining an operation when a backup destination computer fails in the first embodiment of the distributed system according to the present invention;

【図６】図４に示す分散システムの各計算機が保持する
位置・順序情報ファイルの一例を示す図。FIG. 6 is a view showing an example of a position / order information file held by each computer of the distributed system shown in FIG. 4;

【図７】本発明による分散システムの第１の実施の形態
において障害が発生した計算機が復旧してバックアップ
先が切り替えられる場合の処理を説明するための図。FIG. 7 is a diagram for explaining processing when a failed computer is restored and a backup destination is switched in the first embodiment of the distributed system according to the present invention;

【図８】本発明による分散システムの第１の実施の形態
における計算機の組込み時の動作を説明するためのフロ
ーチャート。FIG. 8 is a flowchart for explaining the operation of the distributed system according to the first embodiment of the present invention when a computer is incorporated.

【図９】本発明による分散システムの第１の実施の形態
におけるバックアップ用計算機の復旧時の動作を説明す
るためのフローチャート。FIG. 9 is a flowchart for explaining an operation at the time of restoration of the backup computer in the first embodiment of the distributed system according to the present invention.

【図１０】図７に示す分散システムの各計算機が保持す
る位置・順序情報ファイルの一例を示す図。FIG. 10 is a view showing an example of a position / order information file held by each computer of the distributed system shown in FIG. 7;

【図１１】本発明による分散システムの第１の実施の形
態における稼働系計算機の障害時の動作を説明するため
のフローチャート。FIG. 11 is a flowchart for explaining an operation at the time of failure of the active computer in the first embodiment of the distributed system according to the present invention;

【図１２】本発明による分散システムの第１の実施の形
態における更新履歴ファイルの収集時の動作を説明する
ための図。FIG. 12 is a diagram for explaining an operation at the time of collecting update history files in the distributed system according to the first embodiment of the present invention.

【図１３】本発明による分散システムの第１の実施の形
態における更新履歴ファイルの転送時の動作を説明する
ためのフローチャート。FIG. 13 is a flowchart for explaining an operation at the time of transferring an update history file in the distributed system according to the first embodiment of the present invention.

【図１４】本発明による分散システムの第２の実施の形
態における最新のデータの復元先の選択方法を説明する
ための図。FIG. 14 is a diagram for explaining a method of selecting a latest data restoration destination in the distributed system according to the second embodiment of the present invention.

【図１５】本発明による分散システムの第３の実施の形
態におけるバックアップ先の選択方法を説明するための
図。FIG. 15 is a view for explaining a method of selecting a backup destination in the third embodiment of the distributed system according to the present invention.

【図１６】従来の分散システムにおけるデータベース等
のバックアップ方法を説明するための図。FIG. 16 is a view for explaining a backup method of a database or the like in a conventional distributed system.

[Explanation of symbols]

１Ａ稼働系計算機１Ｂ，１Ｃバックアップ用計算機２通信路３マスタデータ４バックアップデータ５更新履歴ファイル６位置・順序情報ファイル（位置情報） 1A Active computer 1B, 1C Backup computer 2 Communication path 3 Master data 4 Backup data 5 Update history file 6 Position / sequence information file (position information)

Claims

[Claims]

1. An active computer for performing an update process on master data; backup data corresponding to the content of the master data at a predetermined time; and at least two backups holding an update history corresponding to the update process. And a backup computer, wherein each backup computer manages location information for specifying a location for each division unit of the update history that is divided and held between the respective backup computers. Distributed system.

2. The operating system computer holds an update history corresponding to the update processing and specifies a position of each update unit divided and held among the backup computers for each division unit. 2. The distributed system according to claim 1, wherein location information for managing the information is managed.

3. The distributed system according to claim 1, wherein each of the backup computers collects an update history from another backup computer or the active computer based on the location information.

4. A backup system for a distributed system, comprising: an active computer for updating master data; and at least two backup computers for holding backup data corresponding to the contents of the master data at a predetermined time. In the method, a step of transferring an update history corresponding to the update processing from the active computer to a backup computer serving as a backup destination; and Managing position information for specifying a position for each division unit.

5. The distributed system according to claim 4, further comprising a step of collecting an update history from another backup computer or the active computer based on the location information in any one of the backup computers. Backup method.

6. A data amount of an update history to be collected from another backup computer or the active computer based on the location information when it is necessary to restore the latest data in one of the backup computers. And recovering the latest data from the backup data by collecting update history from another computer based on the location information in the backup computer having the smallest data amount. The method for backing up a distributed system according to claim 4.

7. When it becomes necessary to select a backup destination, the data amount of update history to be collected from another backup computer or the active computer is compared based on the location information. 5. The distributed system backup method according to claim 4, further comprising the step of selecting the smallest backup computer as a new backup destination.