JPH04369735A - Backup system for computer system - Google Patents

Backup system for computer system

Info

Publication number
JPH04369735A
JPH04369735A JP3147050A JP14705091A JPH04369735A JP H04369735 A JPH04369735 A JP H04369735A JP 3147050 A JP3147050 A JP 3147050A JP 14705091 A JP14705091 A JP 14705091A JP H04369735 A JPH04369735 A JP H04369735A
Authority
JP
Japan
Prior art keywords
computer
storage device
active
auxiliary storage
computers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP3147050A
Other languages
Japanese (ja)
Inventor
Akira Yamada
明 山田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP3147050A priority Critical patent/JPH04369735A/en
Publication of JPH04369735A publication Critical patent/JPH04369735A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To automatically detect the occurrence of a fault in any operation system computer and to automatically raise a standby system computer. CONSTITUTION:A common storage device 6 where backup data areas for respective operation system computers 1A, 1B and 1C are secured is provided. The operation system computers 1A, 1B and 1C successively rewrite data on the self-auxiliary storage devices 4A, 4B and 4C during an operation and successively rewrite self backup data in the common storage device 6. The standby system computer 3 supervises the occurrence of the fault of the respective operation system computers 1A, 1B and 1C. When the occurrence of the fault in any operation system computer is detected, backup data concerned in the common storage device 6 is called, is copied in a self auxiliary storage device 5 and the backup processing of the computer where the fault occurs is started.

Description

【発明の詳細な説明】[Detailed description of the invention]

【0001】0001

【産業上の利用分野】本発明は、複数台の稼働系計算機
を1台の待機系計算機を備えた計算機システムのバック
アップ方式に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a backup method for a computer system having a plurality of active computers and one standby computer.

【0002】0002

【従来の技術】従来より、複数の稼働系計算機とこれら
稼働系計算機のバックアップを行う待機系計算機とをオ
ンライン接続し、かつ各稼働系計算機および待機系計算
機のそれぞれに補助記憶装置を備えた計算機システムが
知られている。
[Background Art] Conventionally, there has been a computer system in which a plurality of active computers and a standby computer that backs up these active computers are connected online, and each active computer and standby computer is provided with an auxiliary storage device. The system is known.

【0003】この従来システムでは、稼働系計算機のい
ずれかに障害が発生してダウンした場合、人間により、
ダウン発生時のデータを収集しこのデータを待機系計算
機の補助記憶装置に複写して待機系計算機を手動にて立
ち上げるようにしている。
[0003] In this conventional system, if a failure occurs in one of the operating computers and it goes down, humans can
The system collects data when a downtime occurs, copies this data to the auxiliary storage device of the standby computer, and then manually starts up the standby computer.

【0004】0004

【発明が解決しようとする課題】しかしながら、上記従
来システムによれば、稼働系計算機の故障発生の際、待
機系計算機の立ち上げを人間が手動にて行う方式である
ため、操作する者の単純ミスを完全に防止することが困
難であり、単純ミスに起因して計算機が立ち上がらない
場合がある。また、正常に立ち上がったとしても時間が
かかり効率が極めて悪いという問題点があった。
[Problems to be Solved by the Invention] However, according to the above-mentioned conventional system, when a failure occurs in the active computer, the standby computer is manually started up by a human, which makes it difficult for the operator to easily start up the standby computer. It is difficult to completely prevent mistakes, and a simple mistake may cause the computer to not start up. Furthermore, even if the system starts up normally, it takes a long time and is extremely inefficient.

【0005】本発明は上記従来の問題点に鑑みてなされ
たものであり、その目的は、いずれかの稼働系計算機の
故障発生を自動検出して待機系計算機を自動で立ち上げ
ることのできる計算機システムのバックアップ方式を提
供することにある。
The present invention has been made in view of the above conventional problems, and its object is to provide a computer that can automatically detect the occurrence of a failure in any active computer and automatically start up a standby computer. Its purpose is to provide a system backup method.

【0006】[0006]

【課題を解決するための手段】上記の目的を達成するた
めに本発明は、複数の稼働系計算機とこれら稼働系計算
機のバックアップを行う待機系計算機とをオンライン接
続し、かつ各稼働系計算機および待機系計算機のそれぞ
れに補助記憶装置を備えた計算機システムにおいて、前
記各稼働系計算機毎のバックアップ用データ領域が確保
された共有記憶装置を備えるとともに、前記稼働系計算
機は、稼働中に自身の補助記憶装置のデータを逐次書き
替え、かつ前記共有記憶装置内の自身のバックアップ用
データを逐次書き替える一方、前記待機系計算機は各稼
働系計算機の故障発生を監視し、いずれかの稼働系計算
機の故障発生を検知した場合には、前記共有記憶装置の
該当するバックアップ用データを呼び出して自身の補助
記憶装置に複写した後、当該故障発生計算機のバックア
ップ処理を開始することを特徴とする。
[Means for Solving the Problems] In order to achieve the above object, the present invention connects a plurality of active computers and a standby computer that backs up these active computers online, and connects each active computer and In a computer system in which each of the standby computers is provided with an auxiliary storage device, each of the active computers is provided with a shared storage device in which a backup data area is secured, and the active computer has its own auxiliary storage device during operation. While sequentially rewriting the data in the storage device and sequentially rewriting its own backup data in the shared storage device, the standby computer monitors the occurrence of a failure in each active computer, and if any of the active computers When the occurrence of a failure is detected, the corresponding backup data in the shared storage device is called up and copied to its own auxiliary storage device, and then backup processing for the computer in which the failure has occurred is started.

【0007】また、上記の目的を達成するために本発明
は、複数の稼働系計算機とこれら稼働系計算機のバック
アップを行う待機系計算機とをオンライン接続し、かつ
各稼働系計算機および待機系計算機のそれぞれに補助記
憶装置を備えた計算機システムにおいて、前記各補助記
憶装置を各稼働系計算機および待機系計算機からアクセ
ス可能に構成し、前記稼働系計算機は、稼働中に自身の
補助記憶装置のデータを逐次書き替える一方、前記待機
系計算機は各稼働系計算機の故障発生を監視し、いずれ
かの稼働系計算機の故障発生を検知した場合には、故障
発生した補助記憶装置内データに基づいて当該故障発生
計算機のバックアップ処理を開始することを特徴とする
Further, in order to achieve the above object, the present invention connects a plurality of active computers and a standby computer that backs up these active computers online, and connects each active computer and standby computer to each other. In a computer system each having an auxiliary storage device, each auxiliary storage device is configured to be accessible from each active computer and standby computer, and the active computer stores data in its own auxiliary storage device during operation. While rewriting sequentially, the standby computer monitors the occurrence of a failure in each active computer, and if it detects a failure in any active computer, it updates the failure based on the data in the auxiliary storage device where the failure occurred. It is characterized by starting backup processing of the generating computer.

【0008】[0008]

【作用】上記構成によれば、稼働系計算機は、稼働中に
おいてプロセスデータやシステムの状態データ等の各種
データを収集して自身の補助記憶装置に逐次書き込む、
とともに前記共有記憶装置内の自身のエリアにバックア
ップ用データを逐次書き込む。前記待機系計算機は各稼
働系計算機の故障発生を監視し、いずれかの稼働系計算
機の故障発生を検知した場合には、前記共有記憶装置の
該当するバックアップ用データを呼び出して自身の補助
記憶装置に複写した後、自動にて立ち上がり当該故障発
生計算機のバックアップ処理を開始する。
[Operation] According to the above configuration, the operating system computer collects various data such as process data and system status data during operation, and sequentially writes them to its own auxiliary storage device.
At the same time, backup data is sequentially written to its own area in the shared storage device. The standby computer monitors the occurrence of a failure in each active computer, and if it detects a failure in any of the active computers, it calls the corresponding backup data from the shared storage device and stores it in its own auxiliary storage device. After copying to the computer, it will automatically start up and start backup processing for the failed computer.

【0009】また、他の構成によれば、待機系計算機は
、いずれかの稼働系計算機の故障発生を検知した場合に
は、稼働系計算機が稼働中に収集して自身の補助記憶装
置に逐次書き込んだプロセスデータやシステムの状態デ
ータ等の各種データを読み出して自身の補助記憶装置に
複写した後、自動にて立ち上がり当該故障発生計算機の
バックアップ処理を開始する。
[0009] According to another configuration, when the standby computer detects the occurrence of a failure in any of the active computers, the active computer collects data while the active computer is in operation and sequentially stores the information in its own auxiliary storage device. After reading various data such as written process data and system status data and copying them to its own auxiliary storage device, it automatically boots up and starts backup processing for the failed computer.

【0010】0010

【実施例】図1は本発明方式が適用された計算機システ
ムの一実施例を示す構成図である。
DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a block diagram showing an embodiment of a computer system to which the method of the present invention is applied.

【0011】図示するように、複数台の稼働系計算機1
A,1B,1C,…は伝送路2を介して待機系計算機3
に接続されている。各稼働系計算機1A,1B,1C,
…は各別に補助記憶装置4A,4B,4C,…を備えて
おり、また、待機系計算機3も個別に補助記憶装置5を
備えている。稼働系計算機の各補助記憶装置4A,4B
,4C,…は、自身の計算機で収集された各種データを
更新しつつ保存するものであり、また、待機系計算機の
補助記憶装置5は、故障が発生した稼働系計算機のバッ
クアップ用データを収集して保存するとともに、バック
アップ時において稼働計算機として機能する場合に収集
された各種データを更新しつつ保存する。
As shown in the figure, a plurality of operating computers 1
A, 1B, 1C, ... are connected to the standby computer 3 via the transmission line 2.
It is connected to the. Each operating system computer 1A, 1B, 1C,
... are each equipped with an auxiliary storage device 4A, 4B, 4C, . . . , and the standby computer 3 is also individually equipped with an auxiliary storage device 5. Each auxiliary storage device 4A, 4B of the active computer
, 4C, ... are used to update and save various data collected by their own computers, and the auxiliary storage device 5 of the standby computer collects backup data of the active computer in which a failure has occurred. At the same time, various data collected when functioning as an operating computer at the time of backup are updated and saved.

【0012】本実施例では、特に、各稼働系計算機1A
,1B,1C,…および待機系計算機5が共通にアクセ
ス可能な共有補助記憶装置6が設けられている。この共
有補助記憶装置6には、図2に示すように各稼働系計算
機1A,1B,1C,…毎のバックアップ用データを格
納するエリアが設けられている。
In this embodiment, in particular, each active computer 1A
, 1B, 1C, . . . and a standby computer 5 are provided with a shared auxiliary storage device 6 that can be commonly accessed. As shown in FIG. 2, this shared auxiliary storage device 6 is provided with an area for storing backup data for each active computer 1A, 1B, 1C, . . . .

【0013】次に本実施例の作用を図3に示すフローチ
ャートを参照して説明する。
Next, the operation of this embodiment will be explained with reference to the flowchart shown in FIG.

【0014】稼働系計算機1A,1B,1C,…は、稼
働中においてプロセスデータやシステムの状態データ等
の各種データを収集して自身の補助記憶装置4A,4B
,4C…に所定の周期で書き込む。これと並行して、各
稼働系計算機1A,1B,1C,…は前記各種データの
全部または前記各種データ中からバックアップに必要な
データを共有補助記憶装置6の対応エリアに前記所定周
期で書き込む。
The active computers 1A, 1B, 1C, . . . collect various data such as process data and system status data during operation and store them in their own auxiliary storage devices 4A, 4B.
, 4C... at a predetermined cycle. In parallel, each active computer 1A, 1B, 1C, . . . writes all of the various data or data necessary for backup from among the various data to the corresponding area of the shared auxiliary storage device 6 at the predetermined period.

【0015】一方、稼働系計算機1A,1B,1C,…
の稼働中、待機系計算機3はいずれかの稼働系計算機1
A,1B,1C,…に故障発生したか否かを監視する(
ステップST1)。
On the other hand, the operating computers 1A, 1B, 1C,...
While the standby computer 3 is in operation, the standby computer 3 is connected to one of the active computers 1.
Monitor whether a failure has occurred in A, 1B, 1C, etc. (
Step ST1).

【0016】故障発生が検知される(ステップST1Y
ES)と、対応する稼働系計算機のバックアップ用デー
タを共有補助記憶装置6の対応エリアから読み出して自
身の補助記憶装置5に複写する(ステップST2,ST
3)。バックアップ用データの複写が完了する(ステッ
プST4YES)と、この待機系計算機3は自動立ち上
げを行い稼働系計算機として故障発生計算機に代わって
処理を続行する(ステップST5,ST6)。
[0016] Occurrence of failure is detected (step ST1Y
ES) and the backup data of the corresponding active computer are read from the corresponding area of the shared auxiliary storage device 6 and copied to its own auxiliary storage device 5 (steps ST2, ST
3). When the copying of the backup data is completed (step ST4 YES), the standby computer 3 automatically starts up and continues processing as an active computer in place of the failed computer (steps ST5, ST6).

【0017】このように本実施例によれば、稼働系計算
機1A,1B,1C,…のいずれかに故障が発生してか
ら待機系計算機3が立ち上がるまでをすべて人手を介す
ることなく自動にて行うことができるので、操作する者
の単純ミスに起因して計算機が立ち上がらないという事
態の発生を防止できる。
As described above, according to this embodiment, the entire process from when a failure occurs in any of the active computers 1A, 1B, 1C, . . . until the standby computer 3 starts up is automatically performed without any human intervention. Therefore, it is possible to prevent the occurrence of a situation where the computer does not start up due to a simple mistake by the operator.

【0018】なお、待機系計算機3は、前記ステップS
T6の処理続行中にも、稼働系計算機の状態を監視し、
故障した計算機が復旧した場合には、自身の補助記憶装
置5内のデータを故障回復した稼働系計算機の補助記憶
装置および共有補助記憶装置6に転送して故障回復した
稼働系計算機を立ち上がらせて再度処理を移行するよう
にしてもよい。
Note that the standby computer 3 performs the step S
Even while T6 processing continues, the status of the active computer is monitored,
When the failed computer is restored, the data in its own auxiliary storage device 5 is transferred to the auxiliary storage device and shared auxiliary storage device 6 of the working computer that has recovered from the failure, and the working computer that has recovered from the failure is started up. The process may be transferred again.

【0019】また、図4に示すように、共有補助記憶装
置5を持たずに各計算機の補助記憶装置を各計算機から
共通にアクセス可能に接続するようにしてもよい。この
実施例構成によれば、いずれかの稼働系計算機が故障し
た場合に、その補助記憶装置内のデータを待機系計算機
3が読み出して自身の補助記憶装置5に複写した後、前
記図3のフローチャートに示すと同様に自動立ち上げを
行い、故障が発生した稼働系計算機に代わって処理を続
行できる。なお、この実施例の場合、稼働していない計
算機を待機系計算機として位置づけ、他の稼働中の計算
機の故障発生を監視させるようにしても良い。
Furthermore, as shown in FIG. 4, the shared auxiliary storage device 5 may not be provided, and the auxiliary storage devices of each computer may be connected so as to be commonly accessible from each computer. According to the configuration of this embodiment, when one of the active computers fails, the standby computer 3 reads out the data in the auxiliary storage device and copies it to its own auxiliary storage device 5, and then the data shown in FIG. As shown in the flowchart, automatic startup is performed and processing can be continued in place of the failed active computer. In the case of this embodiment, a computer that is not in operation may be positioned as a standby computer to monitor the occurrence of a failure in other computers that are in operation.

【0020】[0020]

【発明の効果】以上説明したように本発明によれば、い
ずれかの稼働系計算機の故障発生を自動検出して待機系
計算機を自動で立ち上げることができる。その結果、操
作する者の単純ミスに起因して計算機が立ち上がらない
という事態の発生を防止でき、操作性、信頼性の高い計
算機システムを構築できる。
As described above, according to the present invention, it is possible to automatically detect the occurrence of a failure in any active computer and automatically start up a standby computer. As a result, it is possible to prevent the occurrence of a situation in which the computer does not start up due to a simple mistake by the operator, and it is possible to construct a computer system with high operability and reliability.

【図面の簡単な説明】[Brief explanation of the drawing]

【図1】本発明方式が適用された一実施例を示す構成図
である。
FIG. 1 is a configuration diagram showing an embodiment to which the system of the present invention is applied.

【図2】共有補助記憶装置の構成例を示す説明図である
FIG. 2 is an explanatory diagram showing a configuration example of a shared auxiliary storage device.

【図3】本発明の一実施例の作用を説明するフローチャ
ートである。
FIG. 3 is a flowchart illustrating the operation of an embodiment of the present invention.

【図4】本発明方式が適用された他の実施例を示す構成
図である。
FIG. 4 is a configuration diagram showing another embodiment to which the method of the present invention is applied.

【符号の説明】[Explanation of symbols]

1A,1B,1C  稼働系計算機 2  伝送路 3  待機系計算機 4A,4B,4C,5  補助記憶装置6  共有補助
記憶装置
1A, 1B, 1C Active computer 2 Transmission line 3 Standby computer 4A, 4B, 4C, 5 Auxiliary storage device 6 Shared auxiliary storage device

Claims (2)

【特許請求の範囲】[Claims] 【請求項1】  複数の稼働系計算機とこれら稼働系計
算機のバックアップを行う待機系計算機とをオンライン
接続し、かつ各稼働系計算機および待機系計算機のそれ
ぞれに補助記憶装置を備えた計算機システムにおいて、
前記各稼働系計算機毎のバックアップ用データ領域が確
保された共有記憶装置を備えるとともに、前記稼働系計
算機は、稼働中に自身の補助記憶装置のデータを逐次書
き替え、かつ前記共有記憶装置内の自身のバックアップ
用データを逐次書き替える一方、前記待機系計算機は各
稼働系計算機の故障発生を監視し、いずれかの稼働系計
算機の故障発生を検知した場合には、前記共有記憶装置
の該当するバックアップ用データを呼び出して自身の補
助記憶装置に複写した後、当該故障発生計算機のバック
アップ処理を開始することを特徴とする計算機システム
のバックアップ方式。
Claim 1. A computer system in which a plurality of active computers and a standby computer that backs up these active computers are connected online, and each of the active computers and standby computers is provided with an auxiliary storage device,
A shared storage device is provided in which a backup data area is secured for each active computer, and the active computer sequentially rewrites data in its own auxiliary storage device during operation, and writes data in the shared storage device. While sequentially rewriting its own backup data, the standby computer monitors the occurrence of a failure in each active computer, and if it detects a failure in any active computer, it updates the corresponding shared storage device. A computer system backup method characterized in that after calling backup data and copying it to its own auxiliary storage device, backup processing for the computer in which a failure has occurred is started.
【請求項2】  複数の稼働系計算機とこれら稼働系計
算機のバックアップを行う待機系計算機とをオンライン
接続し、かつ各稼働系計算機および待機系計算機のそれ
ぞれに補助記憶装置を備えた計算機システムにおいて、
前記各補助記憶装置を各稼働系計算機および待機系計算
機からアクセス可能に構成し、前記稼働系計算機は、稼
働中に自身の補助記憶装置のデータを逐次書き替える一
方、前記待機系計算機は各稼働系計算機の故障発生を監
視し、いずれかの稼働系計算機の故障発生を検知した場
合には、故障発生した補助記憶装置内データに基づいて
当該故障発生計算機のバックアップ処理を開始すること
を特徴とする計算機システムのバックアップ方式。
2. A computer system in which a plurality of active computers and a standby computer that backs up these active computers are connected online, and each active computer and standby computer is provided with an auxiliary storage device,
Each of the auxiliary storage devices is configured to be accessible from each active computer and a standby computer, and the active computer sequentially rewrites data in its own auxiliary storage device during operation, while the standby computer It is characterized by monitoring the occurrence of a failure in the system computers, and when detecting the occurrence of a failure in any of the operating computers, starting backup processing for the computer in which the failure has occurred based on the data in the auxiliary storage device where the failure has occurred. Backup method for computer systems.
JP3147050A 1991-06-19 1991-06-19 Backup system for computer system Pending JPH04369735A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3147050A JPH04369735A (en) 1991-06-19 1991-06-19 Backup system for computer system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3147050A JPH04369735A (en) 1991-06-19 1991-06-19 Backup system for computer system

Publications (1)

Publication Number Publication Date
JPH04369735A true JPH04369735A (en) 1992-12-22

Family

ID=15421369

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3147050A Pending JPH04369735A (en) 1991-06-19 1991-06-19 Backup system for computer system

Country Status (1)

Country Link
JP (1) JPH04369735A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10275090A (en) * 1997-03-28 1998-10-13 Nec Corp Duplexing system for basic processor
JP2000082040A (en) * 1998-09-07 2000-03-21 Sumitomo Bank Ltd Business store backup system, center server, business store server and business store backup method and recording medium
WO2004079573A1 (en) * 2003-03-04 2004-09-16 Fujitsu Limited Multi-processor system
JP2008226063A (en) * 2007-03-15 2008-09-25 Nec Corp Backup system, backup method, backup program, and program recording medium
JP2020091618A (en) * 2018-12-05 2020-06-11 アズビル株式会社 Facility monitoring system and communication method in facility monitoring system

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10275090A (en) * 1997-03-28 1998-10-13 Nec Corp Duplexing system for basic processor
JP2000082040A (en) * 1998-09-07 2000-03-21 Sumitomo Bank Ltd Business store backup system, center server, business store server and business store backup method and recording medium
WO2004079573A1 (en) * 2003-03-04 2004-09-16 Fujitsu Limited Multi-processor system
JPWO2004079573A1 (en) * 2003-03-04 2006-06-08 富士通株式会社 Multiprocessor system
JP2008226063A (en) * 2007-03-15 2008-09-25 Nec Corp Backup system, backup method, backup program, and program recording medium
JP2020091618A (en) * 2018-12-05 2020-06-11 アズビル株式会社 Facility monitoring system and communication method in facility monitoring system
CN111273577A (en) * 2018-12-05 2020-06-12 阿自倍尔株式会社 Facility monitoring system and communication method for facility monitoring system
CN111273577B (en) * 2018-12-05 2023-12-01 阿自倍尔株式会社 Facility monitoring system and communication method for facility monitoring system

Similar Documents

Publication Publication Date Title
JP3481737B2 (en) Dump collection device and dump collection method
KR20040047209A (en) Method for automatically recovering computer system in network and recovering system for realizing the same
JPH09168015A (en) Method and device for data backup of data communication terminal equipment
JP2006012004A (en) Hot standby system
JPH08320835A (en) Fault detecting method for external bus
JPH07234808A (en) System dump acquisition system
JPH04369735A (en) Backup system for computer system
JP3551079B2 (en) Recovery method and device after replacement of modified load module
JP2011053780A (en) Restoration system, restoration method and backup control system
JPH0736760A (en) Method for acquiring high reliability for external memory device provided with device multiplexing function also with inter-module sharing function
JPH06348535A (en) Abnormality generation history storage device
US20120089716A1 (en) Method for accelerating start up of a computerized system
JPH1040123A (en) System and method for job management
JPS59180897A (en) Double structure system of battery back-up memory
JP2699291B2 (en) Power failure processing device
JP3309198B2 (en) Database multiplexing method
JPH0730651A (en) Diagnostic system
JPH07261989A (en) Control program restoration system
JP2578908B2 (en) Restart method
JP2522610B2 (en) Production monitoring system restoration method
JPH03253945A (en) Abnormality recovery processing function confirming system for data processing system
JPS6242252A (en) Switching system for communication controller
JPS6398764A (en) File recovery system for multi-computer system
JPH0250232A (en) Data preserving method for computer system
JP2003167791A (en) Control device provided with backup function and production system using the control device