JPH01125631A

JPH01125631A - System fault recovering system

Info

Publication number: JPH01125631A
Application number: JP62283263A
Authority: JP
Inventors: Teizaburo Kanai; 金居　貞三郎; Takashi Sumiyoshi; 住吉　孝史
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1987-11-11
Filing date: 1987-11-11
Publication date: 1989-05-18

Abstract

PURPOSE:To quickly recover a system fault by setting the check points during the operation of a system and replacing and storing the back-up copies of the residence information for each check point. CONSTITUTION:The area where plural pieces of memory residence information 12 are added to one or more nonvolatile memory (CPU/memory) 10. The transaction processing is carried out while a system is working and the history information produced by said transaction processing is acquired for a recording history information file 50 and a system recovering history file 40. When the file 40 acquires a fixed number of items, the check points are added. Then the areas 34 and 38 of a check point dump file 30 are alternately selected for each check point and the contents of storage are replaced. Then, the residence information 12 are recovered at a system fault with use of the replacement history information stored in the file 40 and based on the final storage contents of both areas 34 and 38. Thus the previous histories can be omitted and the system fault can be recovered at a high speed.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はトランザクション処理を行う情報処理システム
に係り、特に障害によるシステム停止時に高速にシステ
ム回復を行う必要がある情報処理システムに好適なシス
テムの障害回復方式に関する。[Detailed Description of the Invention] [Field of Industrial Application] The present invention relates to an information processing system that performs transaction processing, and is particularly suitable for an information processing system that requires rapid system recovery when the system is stopped due to a failure. Concerning disaster recovery methods.

[Conventional technology]

オンラインシステム等のトランザクション処理を行う情
報処理システムでは、電源断やシステムダウン等の障害
が発生した場合にもシステムの論理的一貫性を保つ必要
がある０通常、システムの各種状態を管理している情報
は主記憶装置あるいは仮想記憶装置上に保持されており
、これらは上記障害時に消失する。このため、従来より
、シス、　　テム稼動中にシステムの各種履歴情報を取
得するとともに、回復処理の高速化を目的とし、システ
ム稼動中に一定間隔でチェックポイントを設け。In information processing systems that perform transaction processing such as online systems, it is necessary to maintain the logical consistency of the system even in the event of a failure such as a power outage or system down.Normally, various states of the system are managed. Information is held on the main memory or virtual memory, and these are lost in the event of the above failure. For this reason, various historical information about the system has been acquired while the system is running, and checkpoints have been set at regular intervals during system operation to speed up recovery processing.

チェックポイント時に主記憶装置や仮想記憶装置上の各
種情報を不揮発性記憶装置に記録することが広く行われ
ている。（Ｊ、Ｎ、グレイ著、ノーツ　オン　データベ
ース　オペレーティング　システムズ（１９７８）、オ
ペレーティング　システムダウン　アドバンスト　コー
ス、Ｒ，ベイヤー、Ｒ，Ｍ、グラハム、＆　Ｇ、シーブ
ミュラー編、シュプリンガー　フェア　ラーク（Ｊ、Ｎ
。2. Description of the Related Art It is widely practiced to record various types of information on a main storage device or a virtual storage device into a nonvolatile storage device at the time of a checkpoint. (J.N. Gray, Notes on Database Operating Systems (1978), Operating System Down Advanced Course, R. Beyer, R.M., Graham, & G. Schiebmuller, eds., Springer Verlag (J.N.)
.

Ｇｒａｙ、　Ｎｏｔｅｓ　ｏｎ　Ｄａｔａ　Ｂａ５ｅ　
０ｐｅｒａｔｉｎ（５ｙｓｔｅ＋＊５（１９７８）、Ｏ
ｐｅｒａｔｉｎｇ　Ｓｙｓｔｅｍｓ：Ａｎ　Ａｄｖａｎ
ｃｅｄ　Ｃｏｕｒｓｅ。Gray, Notes on Data Ba5e
0peratin(5yste+*5(1978), O
Perating Systems: An Advantage
ced Course.

Ｒ＋Ｒａｙｅｒ、　Ｒ，Ｍ、Ｇｒａｈａｍ、　ａｎｄ　
Ｇ、　Ｓｅｅｇｍｕｌｌｅｒ（ｅｄｓ、）Ｓｐｒｉｎｇ
ｅｒ−Ｖｅｒｌａｇ）　）。この方式によれば、障害に
よりシステムが停止した場合、上記の不揮発性記憶装置
に格納した内容と履歴情報によりシステムを障害発生前
の論理的一貫性を保った状態に回復することが可能であ
る。R+Rayer, R, M, Graham, and
G. Seegmuller (eds,) Spring
er-Verlag) ). According to this method, if the system stops due to a failure, it is possible to restore the system to a logically consistent state before the failure using the contents stored in the nonvolatile storage device and history information. .

[Problem that the invention seeks to solve]

しかして、上記従来技術は、以下に述べる問題点を有し
ていた。すなわち、（１）チェックポイント時に、メモリ（主記憶装置ある
いは仮想記憶装置）上の常駐情報の全内容を不揮発性記
憶装置に取得するため、その書出し処理がボトルネック
となり、システム性能を低下させる恐れがある。However, the above-mentioned conventional technology had the following problems. In other words, (1) At the time of a checkpoint, all the resident information in memory (main memory or virtual memory) is acquired to a non-volatile storage device, so the writing process may become a bottleneck and reduce system performance. There is.

（２）障害によるシステム停止時の回復処理では。(2) Recovery processing when the system stops due to a failure.

チェックポイント時に取得したメモリ常駐情報を不揮発
性記憶装置からメモリに読込んだ後、チェックポイント
時以降の常駐情報更新履歴情報を用いて常駐情報の内容
を回復する。したがって、チェックポイントの間隔を短
くすることにより、システム回復時間を短縮することが
できる。一方、銀行オンラインシステム等の大規模ＤＢ
／ＤＣニジステムの障害によるシステム停止の社会的影
響は近年盤々大きくなっており、システム障害時に短時
間でシステムを回復することが要求されている。しかし
、これらの大規模システムでは主記憶装置の記憶容量増
加につれメモリ常駐情報が大量化しつつあり、チェック
ポイント時のメモリ常駐情報取得時間は増加する傾向に
ある。上述のチェックポイントの間隔は常駐情報を不揮
発性記憶装置に取得するために要する時間よりも短縮す
ることはできないため、従来方式ではチェックポイント
を短くすることは難しく、障害時のシステム停止時間は
今後は長引く可能性がある。After reading the memory resident information acquired at the time of the checkpoint into the memory from the nonvolatile storage device, the contents of the resident information are recovered using the resident information update history information after the time of the checkpoint. Therefore, by shortening the checkpoint interval, system recovery time can be shortened. On the other hand, large-scale databases such as bank online systems
The social impact of a system outage due to a system failure has been increasing dramatically in recent years, and there is a demand for system recovery in a short period of time in the event of a system failure. However, in these large-scale systems, the amount of memory-resident information is increasing as the storage capacity of the main storage device increases, and the time required to acquire memory-resident information at checkpoints tends to increase. The above-mentioned checkpoint interval cannot be made shorter than the time required to acquire resident information to a nonvolatile storage device, so it is difficult to shorten checkpoints with conventional methods, and system downtime in the event of a failure will be reduced in the future. may be prolonged.

本発明は上記情報に鑑みてなされたものであり、その目
的とするところは、従来の大規模ＤＢ／ＤＣシステムの
障害回復方式における上述の問題点を解消し、チェック
ポイント時のメモリ常駐情報を不揮発性記憶装置に取得
するオーバヘッドを少なくしてシステム性能を大幅に向
上させるとともに、障害によるシステム停止時の回復時
間を大幅に短縮するシステムの障害回復方式を提供する
ことにある。The present invention has been made in view of the above information, and its purpose is to solve the above-mentioned problems in the failure recovery method of conventional large-scale DB/DC systems, and to save memory resident information at the time of checkpoint. An object of the present invention is to provide a system failure recovery method that greatly improves system performance by reducing the overhead acquired in a nonvolatile storage device, and also significantly shortens the recovery time when the system is stopped due to a failure.

[Means for solving problems]

本発明の上記目的は、以下の（１）〜（３）により達成
される。The above objects of the present invention are achieved by the following (1) to (3).

（１）不揮発性記憶装置上にメモリ常駐情報を格納する
領域を１個以上設け、各領域にメモリ常駐情報のバック
アップコピーを格納する。各領域は、同一の不揮発性記
憶装置上にあってもよいし、それぞれ別の不揮発性記憶
装置上にあってもよい。(1) One or more areas for storing memory-resident information are provided on a nonvolatile storage device, and a backup copy of the memory-resident information is stored in each area. Each area may be located on the same nonvolatile storage device, or may be located on different nonvolatile storage devices.

（２）チェックポイント時には、上記の複数個のメモリ
常駐情報格納領域から１つの領域を選択し。(2) At the time of a checkpoint, one area is selected from the plurality of memory resident information storage areas.

選択した領域に格納されたバックアップコピーを更新す
る。領域の選択方法は、複数個の領域をチェックポイン
ト毎に順に選ぶ。Update the backup copy stored in the selected area. The area selection method is to sequentially select multiple areas for each checkpoint.

（３）メモリ常駐情報格納領域毎に該領域に格納された
バックアップコピーの前回更新時以降に常駐情報のどの
部分が更新されたかを記憶する更新部分管理情報を備え
、常駐情報更新時に更新された部分を記憶する。上述の
チェックポイント時のバックアップコピー更新処理にお
いては、チェックポイント時に選択された領域に対応す
る更新部分管理情報に基づいて、該領域に格納されたバ
ックアップコピーの前回更新時以降の更新部分のみを更
新する。(3) Each memory resident information storage area is provided with updated part management information that stores which part of the resident information has been updated since the last update of the backup copy stored in the area, and is updated when the resident information is updated. Memorize parts. In the above-mentioned backup copy update processing at checkpoint, based on the updated part management information corresponding to the area selected at checkpoint, only the updated part of the backup copy stored in the area after the previous update is updated. do.

[Effect]

上記手段の作用を以下に述べる。 The operation of the above means will be described below.

（１）メモリ常駐情報のバックアップコピーを格納する
領域を複数面設け、チェックポイント時に該チェックポ
イントで更新対象とする領域を順に選択するので、更新
処理中に障害が発生して更新中のバックアップコピーの
内容が破壊されても前回のチェックポイント時に選択し
た領域に格納されたバックアップコピーを用いてシステ
ムを回復することができる。(1) Multiple areas are provided to store backup copies of memory-resident information, and at the time of a checkpoint, the areas to be updated at the checkpoint are selected in order, so if a failure occurs during the update process, the backup copy will be copied during the update. Even if the contents of the system are destroyed, the system can be recovered using the backup copy stored in the area selected during the previous checkpoint.

（２）チェックポイント時、メモリ常駐情報のバックア
ップコピーに対して、該バックアップコピーの前回更新
時以降に更新されたメモリ常駐情報を更新するので、チ
ェックポイント時の処理オーバヘッドが小さく、処理時
間も短い、したがって、システム性能を向上させるとと
もに、チェックポイントの間隔を十分短くすることによ
り障害によるシステム停止時に高速にシステムを回復す
ることが可能になる。(2) At the time of a checkpoint, the memory resident information that has been updated since the last update of the backup copy is updated for the backup copy of the memory resident information, so the processing overhead at the time of a checkpoint is small and the processing time is short. Therefore, in addition to improving system performance, by making the checkpoint interval sufficiently short, it is possible to quickly recover the system when the system is stopped due to a failure.

〔Example〕

以下、本発明の一実施例を図面を引用しながら説明する
。An embodiment of the present invention will be described below with reference to the drawings.

第１図は本実施例の動作の概要を示す図であり、本発明
の特徴を示す図である。この図については、本実施例の
動作の概要を述べる時に説明する。FIG. 1 is a diagram showing an overview of the operation of this embodiment, and is a diagram showing the features of the present invention. This diagram will be explained when describing the outline of the operation of this embodiment.

最初に、本実施例の構成を説明する。第２図は本実施例
における情報処理システムの全体構成を示す図であり、
ＣＰＵ／メモリ（主記憶装置あるいは仮想記憶装置）１
０に、磁気ディスク等に格納されたデータベース２０．
チェックポイントダンプファイル（以下ＣＤファイルと
略）３０．システム回復用履歴情報ファイル４０．記録
用履歴情報ファイル５０が接続されている。First, the configuration of this embodiment will be explained. FIG. 2 is a diagram showing the overall configuration of the information processing system in this embodiment,
CPU/memory (main storage or virtual storage) 1
0, a database 20.0 stored on a magnetic disk or the like.
Checkpoint dump file (hereinafter referred to as CD file) 30. System recovery history information file 40. A recording history information file 50 is connected.

第３図はメモリ１１の内容を示す図であり、情報処理シ
ステテムのトランザクシ至ン処理で用いる常駐テーブル
等の常駐情報１２．・チェックポイント時の制御情報１
３．および後述するメモリ常駐情報１２のバックアップ
コピー格納領域毎に設けた更新部分管理情報１４ならび
に１５がメモリ１１上に存在する。FIG. 3 is a diagram showing the contents of the memory 11, which includes resident information 12, such as resident tables used in transaction processing of the information processing system.・Control information 1 at checkpoint
3. Also, update portion management information 14 and 15 provided for each backup copy storage area of memory resident information 12, which will be described later, exists on the memory 11.

ＣＤファイル３０は、システム稼動中に一定間隔で設け
たチェックポイント時に、障害によるシステム停止時の
回復処理においてメモリ上の常駐情報１２を回復するた
めに必要な情報を取得するファイルである。ＣＤファイ
ル３０の構成を第４図に示す。The CD file 30 is a file that acquires information necessary to recover the resident information 12 on the memory in a recovery process when the system is stopped due to a failure, at checkpoints set at regular intervals during system operation. The structure of the CD file 30 is shown in FIG.

本実施例においては、メモリ常駐情報１２の不揮発性記
憶装置上のバックアップコピーを格納する領域を２面設
ける。バックアップコピー格納領域は最低１面あればよ
いが、バックアップコピー更新処理中に障害が発生した
場合、書込み処理中断により当該部分の内容が失われる
可能性があり。In this embodiment, two areas are provided for storing backup copies of the memory resident information 12 on the nonvolatile storage device. It is sufficient to have at least one backup copy storage area, but if a failure occurs during the backup copy update process, the contents of that part may be lost due to interruption of the write process.

その場合は当該部分の内容の回復に長時間を必要とする
。よって、本実施例においては、メモリ常駐情報１２の
バックアップコピー格納領域を２面とした。第４図にお
いて、３１および３５は上記の各格納領域および関連情
報格納領域から構成される領域であり、各々チェックポ
イント時の管理情報領域３２ならびに３６、他の格納領
域に対する更新部分管理情報格納領域３３ならびに３７
、メモリ常駐情報１２のバックアップコピー格納領域３
４ならびに３８から構成される。ＣＤファイル３０の媒
体としては磁気ディスク等のＤＡＳＤ（直接アクセス記
憶装置）を通常用いる。本実施例においては、銀行オン
ラインシステム等の高トラフィツクかつシステム障害時
の高速回復が要求されるシステムを対象とし、バッテリ
により半導体記憶装置を不揮発化した半導体ディスクを
用いる。In that case, it will take a long time to recover the contents of that part. Therefore, in this embodiment, there are two backup copy storage areas for the memory resident information 12. In FIG. 4, 31 and 35 are areas composed of the above-mentioned storage areas and related information storage areas, including management information areas 32 and 36 at checkpoint, and an update partial management information storage area for other storage areas, respectively. 33 and 37
, backup copy storage area 3 of memory resident information 12
4 and 38. A DASD (direct access storage device) such as a magnetic disk is normally used as the medium for the CD file 30. This embodiment is intended for a system such as a bank online system that requires high traffic and high speed recovery in the event of a system failure, and uses a semiconductor disk in which the semiconductor storage device is made non-volatile by a battery.

システム回復用履歴情報ファイル４０は、障害によるシ
ステム停止時のシステム回復処理に必要な履歴情報を格
納するファイルである。システム回復処理で必要となる
履歴情報には、データベースやメモリ上の常駐情報の更
新履歴情報、入出力メツセージに関する履歴情報、トラ
ンザクションの完了／キャンセルを示す履歴情報１等が
ある。The system recovery history information file 40 is a file that stores history information necessary for system recovery processing when the system is stopped due to a failure. History information required in the system recovery process includes update history information of resident information in a database or memory, history information regarding input/output messages, history information 1 indicating transaction completion/cancellation, and the like.

システム回復用履歴情報ファイルの媒体としては、通常
磁気ディスク等のＤＡＳＤを用いるが、本実施例ではＣ
Ｄファオル３０と同様半導体ディスクを用いる。システ
ム回復処理で必要な履歴情報は、基本的には最終の有効
なチェックポイント時以降の情報である。したがって、
これらの範囲の履歴情報を蓄積するのに十分なファイル
容量（できれば必要最小限度の数倍程度）を確保してお
けば、システム回復用履歴情報ファイル４０をラップア
ラウンド方式で古い情報を消去し繰り返し使用すること
ができる。Normally, a DASD such as a magnetic disk is used as the medium for the system recovery history information file, but in this embodiment, a CSD is used.
Like the D Faol 30, a semiconductor disk is used. The history information required for system recovery processing is basically information after the last valid checkpoint. therefore,
If you have secured enough file capacity (preferably several times the minimum required amount) to store history information in these ranges, you can use the wraparound method to delete old information from the system recovery history information file 40 and repeat it. can be used.

記録用履歴情報ファイル５０には、データベース媒体障
害回復用履歴情報、業務処理内容を記録するための履歴
情報、各種業務処理を行うユーザプログラムで用いる履
歴情報１等が含まれる。取得媒体としては、磁気ディス
ク等のＤＡＳＤあるいは磁気テープを用いる。記録用履
歴情報は保存が必要な期間が長いため、ＤＡＳＤに取得
した場合にも最終的には磁気テープにコピーして保管し
ておく必要がある。The recording history information file 50 includes history information for database medium failure recovery, history information for recording business process contents, history information 1 used in user programs that perform various business processes, and the like. As the acquisition medium, a DASD such as a magnetic disk or a magnetic tape is used. Recording history information needs to be stored for a long time, so even if it is acquired on a DASD, it must ultimately be copied onto a magnetic tape and stored.

次に、システムの動作を説明する。最初に、システム動
作の概要を第１図を用いて説明する０本システムでは、
稼動中の各種障害に備えてトランザクション処理に伴う
各種履歴情報をシステム回復用履歴情報ファイル５０お
よび記録用履歴情報ファイル６０に取得している。さら
に、障害によるシステム停止時の回復処理を短縮するた
め、メモリ１１上の常駐情報１２のバックアップコピー
をＣＤファイル３ｏ中の領域３４および３８に格納し、
システム稼動中に一定間隔で設けたチェックポイント時
に、これらのバックアップコピーを更新している。そし
て、障害によるシステム停止時には、ＣＤファイル３０
中の領域３４あるいは３８に格納されたバックアップコ
ピーのうち最後に更新されたバックアップコピーをもと
に、システム回復用履歴情報ファイル４０中の常駐情報
更新履歴情報を用いて常駐情報１２を回復する。Next, the operation of the system will be explained. First, we will explain the outline of system operation using Figure 1.
In preparation for various failures during operation, various types of history information associated with transaction processing are acquired in a system recovery history information file 50 and a recording history information file 60. Furthermore, in order to shorten the recovery process when the system stops due to a failure, a backup copy of the resident information 12 on the memory 11 is stored in areas 34 and 38 in the CD file 3o,
These backup copies are updated at checkpoints set at regular intervals during system operation. When the system stops due to a failure, the CD file 30
The resident information 12 is recovered using the resident information update history information in the system recovery history information file 40 based on the last updated backup copy of the backup copies stored in the area 34 or 38.

次に、システム動作をシステム開始時動作、通常時動作
、トランザクション障害時動作、チェックポイント時動
作、システム障害時動作に分けて述べる。Next, system operations will be described by dividing them into operations at system startup, operations at normal times, operations at transaction failures, operations at checkpoints, and operations at system failures.

（１）システム開始時動作システム開始時、メモリ１１上の常駐情報１２のバック
アップコピーをＣＤファイル３０中のバックアップコピ
ー格納領域３４および３８に取得する。さらに、メモリ
１１上の更新部分管理情報１４および１５を初期化、す
なわちメモリ常駐情報１２の全ての部分が更新なしの状
態に設定する。(1) Operation at System Startup When the system is started, a backup copy of the resident information 12 on the memory 11 is acquired into the backup copy storage areas 34 and 38 in the CD file 30. Further, the updated portion management information 14 and 15 on the memory 11 is initialized, that is, all portions of the memory resident information 12 are set to a non-updated state.

（２）通常時動作トランザクションの実行開始から終了までの本発明に係
わる処理は以下のとおりである。(2) Normal operation The processing according to the present invention from the start to the end of transaction execution is as follows.

（ａ）履歴情報取得トランザクション処理中に、データベースやメモリ１１
上の常駐情報１２の更新、メツセージの入出力、トラン
ザクションの完了等の事象が発生すると、それらに関す
る履歴情報をシステム回復用履歴情報ファイル４０およ
び記録用履歴情報ファイル５ｏ取得する。(a) During historical information acquisition transaction processing, database and memory 11
When events such as updating of the above resident information 12, message input/output, and transaction completion occur, history information related to these events is acquired from the system recovery history information file 40 and the recording history information file 5o.

各種履歴情報をこれらのファイルのいずれに、あるいは
両方に取得するかは、システム初期設定時に指定される
。なお、各履歴情報はただちに上記ファイルに書出され
るのではなく、メモリ上のバッファに蓄積される。そし
て、バッファ満杯時あるいはシステム回復用履歴情報の
場合はトランザクション完了時等の時点で上記ファイル
の媒体に書出される。Which or both of these files should be used to acquire various types of history information is specified at the time of system initialization. Note that each history information is not immediately written to the above file, but is accumulated in a buffer on the memory. Then, when the buffer is full or in the case of system recovery history information, it is written to the above-mentioned file medium at a time such as when a transaction is completed.

（ｂ）更新部分管理情報の設定メモリ常駐情報１２が更新された場合、上述の履歴情報
取得とともに、メモリ１１上の更新ページ管理情報１４
および１５の更新部分に対応する部分に更新されたこと
を設定する。(b) Setting of updated partial management information When the memory resident information 12 is updated, the updated page management information 14 on the memory 11 is
And it is set that the part corresponding to the updated part of 15 has been updated.

（ｃ）トランザク゛ジョン完了処理トランザクション終了後、以下の手順で処理を行う。(c) Transaction completion processing After the transaction is completed, perform the following steps.

■　該トランザクションの未取得履歴情報の取得。■ Obtaining unobtained history information for the transaction.

■　トランザクション完了を示す履歴情報の取得。■ Acquisition of history information indicating transaction completion.

■　データベース更新を媒体に反映させる。■ Reflect database updates on the media.

■　トランザクションの全処理が完了したことを示す履
歴情報を取得する。■ Obtain historical information that indicates that all processing of a transaction has been completed.

（３）トランザクション障害時動作部分的な障害によりトランザクションの処理結果を取り
消す場合、以下の手順で処理を行う。(3) Operation in the event of transaction failure When canceling the transaction processing result due to a partial failure, the following steps are performed.

（ａ）メモリ常駐情報１２の戻し処理該トランザクションが更新したメモリ常駐情報１２の内
容を、該トランザクションに関する履歴情報を用いて更
新前の状態に回復する。それとともに、常駐情報１２の
該データを更新前の内容に戻したことを示す更新履歴情
報をシステム回復用履歴情報ファイル４゜に取得する。(a) Restoration process of memory resident information 12 The contents of memory resident information 12 updated by the transaction are restored to the state before the update using history information regarding the transaction. At the same time, update history information indicating that the data in the resident information 12 has been restored to its pre-update content is acquired in the system recovery history information file 4°.

（ｂ）データベースの戻し処理データベースの該トランザクションが更新した部分を該
トランザクションに関する履歴情報を用いて更新前の内
容に回復する。(b) Restoring the database The portion of the database updated by the transaction is restored to its pre-update content using history information regarding the transaction.

（ｃ）キャンセル処理完了履歴情報取得該トランザクシ
ョンのキャンセル処理完了を示す履歴情報をシステム回
復用履歴情報ファイル４０に取得する。(c) Acquisition of cancellation processing completion history information History information indicating the completion of cancellation processing of the transaction is obtained in the system recovery history information file 40.

（４）チェックポイント時動作システム回復用履歴情報を一定件数（システムの初期設
定時に指定）取得する毎にチェックポイントを設ける。(4) Operation at checkpoint A checkpoint is set every time a certain number of pieces of system recovery history information (specified at the time of system initialization) are acquired.

チエツクポインと時の処理は以下のとおりである。The checkpoint and time processing are as follows.

（ａ）バックアップコピー更新対象領域の選択該チェッ
クポイント時に更新するバックアップコピーを格納して
いる領域を選択し、選択した領域を識別する情報をメモ
リ１１上のチェックポイント制御情報１３に設定する。(a) Selection of backup copy update target area Select the area storing the backup copy to be updated at the time of the checkpoint, and set information identifying the selected area in the checkpoint control information 13 on the memory 11.

本実施例では、バックアップコピー格納領域は３４およ
び３８の２面あり、チェックポイント毎にこれらの領域
を交互に選択する。以下、領域３４が選択された場合に
ついて説明する。In this embodiment, there are two backup copy storage areas 34 and 38, and these areas are alternately selected for each checkpoint. The case where area 34 is selected will be described below.

（ｂ）更新部分管理情報の取得上述の処理により選択された領域と異なる領域に対応す
るメモリ１１上の更新部分管理情報を、ＣＤファイル３
０中の選択された領域に対応する更新部分管理情報格納
領域に書込む。本実施例の場合には、バックアップコピ
ー格納領域３４に更新部分管理情報１４が対応し、領域
３８に更新部分管理情報１５が対応しているとする。し
たがって、領域３４が選択された場合、更新部分管理情
報１５の内容が更新部分管理情報格納領域３３に書込ま
れる。(b) Acquisition of updated partial management information Updated partial management information on the memory 11 corresponding to an area different from the area selected by the above process is transferred to the CD file 3.
0 to the updated partial management information storage area corresponding to the selected area. In the case of this embodiment, it is assumed that the updated portion management information 14 corresponds to the backup copy storage area 34 and the updated portion management information 15 corresponds to the area 38. Therefore, when the area 34 is selected, the contents of the updated partial management information 15 are written to the updated partial management information storage area 33.

（ｃ）バックアップコピーの更新選択された領域３４に対応する更新部分管理情報１４の
チェックポイント時の内容に基づき、領域３４に格納さ
れている常駐情報１２のバックアップコピーを更新する
。すなわち、メモリ常駐情報１２中、更新部分管理情報
１４に更新されたことが設定されている部分の内容を該
バックアップコピーに反映させる。その後、領域３４に
対応する更新部分管理情報１４を初期化する。(c) Updating the backup copy The backup copy of the resident information 12 stored in the area 34 is updated based on the contents of the updated partial management information 14 corresponding to the selected area 34 at the time of the checkpoint. That is, the contents of the portion of the memory resident information 12 that is set to have been updated in the updated portion management information 14 are reflected in the backup copy. Thereafter, the updated portion management information 14 corresponding to the area 34 is initialized.

（ｄ）チェックポイント処理の有効化メモリ１１上の履歴情報バッファ中のシステム回復用履
歴情報をシステム回復用履歴情報ファイル４０に書出す
。そして、チェックポイント開始時点におけるシステム
回復用履歴情報ファイル４０中の最後に取得された履歴
情報の番号（履歴情報には、各履歴情報を識別するため
の一貫番号を付与されている）を、ＣＤファイル３０中
の選択された領域３４に対応する管理情報領域３２に設定する。(d) Enabling checkpoint processing The system recovery history information in the history information buffer on the memory 11 is written to the system recovery history information file 40. Then, the number of the last acquired history information in the system recovery history information file 40 at the start of the checkpoint (the history information is given a consistent number to identify each piece of history information) is transferred to the CD. The management information area 32 corresponding to the selected area 34 in the file 30 is set.

これらの２つの処理により、当該チェックポイント時の
処理は有効になる。These two processes enable the process at the checkpoint.

（５）システム障害時動作障害によるシステム停止時には、ＣＤファイル３０中の
メモリ常駐情報１２のバックアップコピーおよびシステ
ム回復用履歴情報ファイル４０中の各種履歴情報を用い
てシステム回復処理を行う。システム回復処理の概略フ
ローを第５図に示す。以下、第５図に従って説明する。(5) At the time of system failure When the system is stopped due to an operational failure, system recovery processing is performed using a backup copy of the memory resident information 12 in the CD file 30 and various types of history information in the system recovery history information file 40. FIG. 5 shows a schematic flow of system recovery processing. This will be explained below with reference to FIG.

（ａ）読出し対象領域選択（ステップ１０１）ＣＤファ
イル３０から管理情報領域３２および３６に設定されて
いる履歴情報番号を読出し、新しい方の番号が設定され
ている領域に対応するバックアップコピー格納領域を読
出し対象領域として選択する。以下、管理情報領域３２
に設定されている番号の方があたらしいものとして説明
する。したがって、バックアップコピー読出し対象領域
として領域３４が選択される。(a) Selection of read target area (step 101) Read the history information numbers set in the management information areas 32 and 36 from the CD file 30, and select the backup copy storage area corresponding to the area to which the new number is set. Select as the read target area. Below, management information area 32
The explanation will be based on the assumption that the number set in . Therefore, area 34 is selected as the backup copy readout target area.

（ｂ）バックアップコピー読出しくステップ１０２）上
述の処理により選択した領域３４から、メモリ常駐情報
１２のバックアップコピーを読出し、メモリｌｌ上に常
駐情報１２として展開する。さらに、ＣＤファイル３ｏ
中の更新部分管理情報格納領域３７から更新部分管理情
報を読出し、メモリ１１上に領域３８に対する更新部分
管理情報１５として展開する。(b) Backup copy reading step 102) The backup copy of the memory resident information 12 is read from the area 34 selected by the above process and developed as the resident information 12 on the memory II. Furthermore, CD file 3o
The updated portion management information is read from the updated portion management information storage area 37 in the storage area 37, and is developed as updated portion management information 15 for the area 38 on the memory 11.

そして、領域３４に対する更新部分管理情報１４の方は
初期化しておく。Then, the updated portion management information 14 for the area 34 is initialized.

（ｃ）トランザクションの状態解析および未完了トラン
ザクションの戻し処理（ステップシステム回復用履歴情
報ファイル４０中の履歴情報を最新取得情報から逆方向
に′チェックポイント開始時点の情報まで（正確には、
チェックポイント開始時点に実行中かつシステム停止時
に完了していなかったトランザクションに関する最初の
履歴情報まで）読み、各トランザクションの処理状態（
完了／未完了）を解析し、未完了トランザクションにつ
いては更新データの戻し処理を行う。トランザクション
の処理状態は、トランザクション完了を示す履歴情報、
トランザクションの全処理の完了を示す履歴情報および
トランザクションのキャンセル処理完了を示す履歴情報
の取得有無により解析する。回復処理の内容により、ト
ランザクションの処理状態は第６図に示すように７つに
分かれる。以下、第６図を用いて説明する。(c) Transaction status analysis and uncompleted transaction return processing (step) The history information in the system recovery history information file 40 is moved backwards from the latest acquisition information to the information at the start of the checkpoint (more precisely,
reads the processing status of each transaction (up to the first historical information about transactions that were running at the time the checkpoint started and had not completed when the system stopped).
Completed/uncompleted transactions are analyzed, and updated data is returned for uncompleted transactions. The transaction processing status includes historical information indicating transaction completion,
Analysis is performed based on whether history information indicating the completion of all transaction processing and history information indicating completion of transaction cancellation processing is obtained. Depending on the content of the recovery process, the transaction processing status is divided into seven states as shown in FIG. This will be explained below using FIG. 6.

■　タイプ■のトランザクション第６図中の１１１のトランザクションのように、最終の
有効チェックポイント時以前に完了（キャンセルの場合
を含もりしたトランザクションである。タイプＩのトラ
ンザクションは１回復処理は不要であり。■ Transactions of type ■ Transactions that complete (including the case of cancellation) before the last valid checkpoint, such as transaction 111 in Figure 6. Transactions of type I do not require 1 recovery processing. .

処理状態解析の対象外である。Not subject to processing status analysis.

■　タイプＩＩのトランザクション第６図中の１１２のトランザクションのように、最終の
有効チェックポイント開始時点以降に正常に完了したト
ランザクションである（実行開始時点は無関係）、タイ
プ■のトランザクションの回復処理はステップ１０３で
行う。■ Type II transactions Like the transaction 112 in Figure 6, the recovery process for type ■ transactions, which are transactions that completed normally after the start of the last valid checkpoint (the execution start point is irrelevant), is as follows: Do it in 103.

■　タイプ■のトランザクション第６図中の１１３のトランザクションのように、最終の
有効チェックポイント終了前に始まり、トランザクショ
ン障害発生後、キャンセル処理を完了したトランザクシ
ョンである。タイプ■のトランザクションの回復処理は
ステップ１０３で行う。■Transaction of type ■Like the transaction 113 in FIG. 6, this is a transaction that started before the end of the last valid checkpoint and completed the cancellation process after a transaction failure occurred. Recovery processing for the type ■ transaction is performed in step 103 .

■　タイプ■のトランザクション第６図中の１１４のトランザクションのように、最終の
有効チェックポイント終了後に始まり、トランザクショ
ン障害発生後、キャンセル処理を完了したトランザクシ
ョンである。タイプ■のトランザクションの場合、デー
タベース２０の該トランザクションによる更新部分を更
新前の内容に戻す処理は完了している。また、該１−ラ
ンザクジョンによる常駐情報１２の更新結果はＣＤファ
イル３０中のバックアップコピーには反映されておらず
、戻し処理は不要である。よって、タイプ■のトランザ
クションの回復処理は不要である。■Transaction of type ■Like the transaction 114 in FIG. 6, this is a transaction that starts after the end of the last valid checkpoint and completes the cancellation process after a transaction failure occurs. In the case of a type ■ transaction, the process of returning the updated portion of the database 20 by the transaction to its pre-update content has been completed. Further, the update result of the resident information 12 by the 1-ranzaktion is not reflected in the backup copy in the CD file 30, and no return processing is necessary. Therefore, recovery processing for type ■ transactions is unnecessary.

■　タイプ■のトランザクション第６図中の１１５のトランザクションのように、トラン
ザクション完了を示すｆｆ＆ｆｉ情報取得後、トランザ
クションの全処理完了を示す履歴情報を取得する前に障
害によるシステム停止が発生し、処理が中断したトラン
ザクションである。タイプ■のトランザクションの回復
処理はステップ１０３で行う。■ Transaction of type ■ As in transaction 115 in Figure 6, after acquiring ff&fi information indicating transaction completion, but before acquiring history information indicating completion of all processing of the transaction, a system stoppage occurs due to a failure, and processing is interrupted. This is an aborted transaction. Recovery processing for the type ■ transaction is performed in step 103 .

■　タイプ■のトランザクション第６図中の１１６のトランザクションのように、最終の
有効チェックポイント終了前に始まり、トランザクショ
ン完了を示す履歴情報を取得する前に障害によるシステ
ム停止が発生し、処理が中断したトランザクションであ
る。タイプ■のトランザクションは、本ステップにおい
てデータベース２０および常駐情報１２の戻し処理を行
う。■ Transactions of type ■ Like the transaction 116 in Figure 6, it started before the end of the last valid checkpoint, and the system stopped due to a failure before acquiring the history information indicating transaction completion, and the processing was interrupted. It is a transaction. In the type ■ transaction, the database 20 and resident information 12 are returned in this step.

すなわち、システム回復用履歴情報ファイル４０中の該
トランザクションの更新前情報を用いて、該トランザク
ションによるデータベース２０および常駐情報１２の更
新部分を更新前の内容に回復する。そして。That is, using the pre-update information of the transaction in the system recovery history information file 40, the updated portions of the database 20 and resident information 12 by the transaction are restored to their pre-update contents. and.

更新部分管理情報１４および１５に該更新部分が更新さ
れたことを登録する。The fact that the updated portion has been updated is registered in the updated portion management information 14 and 15.

■　タイプ■のトランザクション第６図中の１１７のトランザクションのように、最終の
有効チェックポイント終了後に始まり、トランザクショ
ン完了を示す履歴情報を取得する前に障害によるシステ
ム停止が発生し、処理が中断したトランザクションであ
る。タイプ■のトランザクションは、該トランザクショ
ンによる常駐情報１２の更新結果はＣＤファイル３ｏ中
のバックアップコピーには反映されておらず。■ Transactions of type ■ Transactions, such as transaction 117 in Figure 6, which started after the end of the last valid checkpoint, and whose processing was interrupted due to a system stoppage due to a failure before acquiring history information indicating transaction completion. It is. For transactions of type ■, the result of updating the resident information 12 by the transaction is not reflected in the backup copy in the CD file 3o.

常駐情報１２の戻し処理は不要である。しかし、データ
ベース２０の実更新を行っている可能性があり、本ステ
ップにおいてシステム回復用Ｈ歴情報ファイル４０中の
該トランザクションのデータベース更新前情報を用いて
更新前の内容に戻す。Processing to return the resident information 12 is not necessary. However, there is a possibility that the database 20 has been actually updated, and in this step, the pre-update database information of the transaction in the system recovery H history information file 40 is used to restore the content before the update.

（ｄ）完了トランザクションの更新結果反映およびキャ
ンセルトランザクションの戻し処理（ステップ１０４）システム回復用履歴情報ファイル４０中の履歴情報を、
最終の有効チェックポイント開始時点の情報から（正確
には、上記のタイプ■のトランザクションに関する最初
の履歴情報と最終の有効チェックポイント開始時点の情
報の古い方から）順方向に最終取得情報まで読み、完了
トランザクションの更新結果反映およびキャンセルトラ
ンザクションの戻し処理を行う。本ステップで回復処理
が必要なトランザクションは、タイプ■、■および■の
トランザクションである。(d) Reflection of updated results of completed transactions and return processing of canceled transactions (step 104) The history information in the system recovery history information file 40 is
Read forward from the information at the start of the last valid checkpoint (more precisely, from the oldest of the first history information regarding transactions of type ■ above and the information at the start of the last valid checkpoint) to the last acquired information, Reflects update results of completed transactions and returns canceled transactions. Transactions that require recovery processing in this step are transactions of types ■, ■, and ■.

■　タイプ■のトランザクションタイプ■のトランザクションの場合、データベース２０
の実更新は完了している。■ Transaction of type ■ For transaction of type ■, database 20
The actual update has been completed.

よって、システム回復用履歴情報ファイル４０から読出
した該トランザクションの最終の有効チェックポイント
開始以後の常駐情報１２の更新履歴情報を用いて、常駐
情報１２に該トランザクションの更新結果を反映させる
。そして、更新部分管理情報１４および１５に該更新部
分が更新されたことを登録する。Therefore, the updated result of the transaction is reflected in the resident information 12 using the update history information of the resident information 12 after the start of the last valid checkpoint of the transaction read from the system recovery history information file 40. Then, it is registered in the updated portion management information 14 and 15 that the updated portion has been updated.

■　タイプ■のトランザクションタイプＩｎのトランザクションの場合、データベースの
該トランザクションによる更新部分を更新前の内容に戻
す処理は完了している。したがって、システム回復用履
歴情報ファイル４０から読出した該トランザクションの
最終の有効チェックポイント開始以後の常駐情報１２の
更新履歴情報（更新データを更新前の内容に戻したこと
を示す更新履歴情報）を用いて、常駐情報１２の戻し処
理を行う。そして、更新部分管理情報１４および１５に
該更新部分が更新されたことを登録する。(2) Transaction of type (2) In the case of a transaction of type In, the process of restoring the updated portion of the database by the transaction to its pre-updated content has been completed. Therefore, the update history information of the resident information 12 after the start of the last valid checkpoint of the transaction read from the system recovery history information file 40 (update history information indicating that the update data has been returned to the content before the update) is used. Then, the resident information 12 is returned. Then, it is registered in the updated portion management information 14 and 15 that the updated portion has been updated.

■　タイプ■のトランザクションタイプ■のトランザクションの場合、外部からは完了ト
ランザクションであるが、システムの内部ではデータベ
ース２０の実更新が終了していない可能性がある。した
がって、タイプ■のトランザクションで行った常駐情報
１２への更新結果の反映および更新部分管理情報１４お
よび１５への更新事実の登録の他に、システム回復用履
歴情報ファイル４０中の該トランザクションのデータベ
ース更新履歴情報を用いて、該トランザクションによる
データベース更新を媒体に反映させる。なお、本処理を
行うために該トランザクションに関する最初の履歴情報
までは戻る必要があり、最終の有効チェックポイント開
始以前に遡る可能性がある。これを避けるためには、複
数のチエツクポイントにまたがる長大トランザクション
の場合には、最終の有効チエツクポイント開始までのデ
ータペニス更新をチエツクポイント処理終了までに媒体
に反映させるようにすればよい。■ Transaction of type ■ In the case of a transaction of type ■, it is a completed transaction from the outside, but there is a possibility that the actual update of the database 20 has not been completed inside the system. Therefore, in addition to reflecting the update results performed in the transaction of type ■ to the resident information 12 and registering the update fact to the updated partial management information 14 and 15, the database of the transaction in the system recovery history information file 40 is updated. The database update by the transaction is reflected on the medium using the history information. Note that in order to perform this process, it is necessary to go back to the first history information regarding the transaction, and there is a possibility that it goes back to before the start of the last valid checkpoint. To avoid this, in the case of a long transaction that spans multiple checkpoints, data updates up to the start of the last valid checkpoint may be reflected in the medium by the end of checkpoint processing.

以上、本実施例によれば、チェックポイント時のオーバ
ヘッドを減少することによりシステムの全体性能を向上
させること、および障害によるシステム停止時に高速に
システムを回復すること、が可能になる。また１本実施
例特有の効果として。As described above, according to this embodiment, it is possible to improve the overall performance of the system by reducing the overhead during checkpointing, and to quickly recover the system when the system is stopped due to a failure. In addition, there is one effect unique to this embodiment.

ＣＤファイル３０およびシステム回復用履歴情報ファイ
ル４０の媒体として半導体ディスクを用いることにより
、システムの全体性能向上およびシステム回復高速化の
効果をさらに高めることができる。さらに、チェックポ
イント時にバックアップコピー更新対象の領域以外の領
域に対するメモリ常駐情報更新部分管理情報をＣＤファ
イル３０に取得することにより、システム回復時にメモ
リ１１上の更新部分管理情報１４および１５を回復する
ことが容易にできる。これにより、システム回復処理終
了時にシステム回復処理で用いなかったバックアップコ
ピーを更新することなく通常処理を開始することができ
るため、システム回復時間を短縮することができる。な
お１本実施例に示した本発明の方式は、メモリ常駐情報
があまり多くない（数ＭＢ程度以下）７場合やトランザ
クション１件当たりの更新量が大きくしかも更新部分に
偏りがあまりない場合には、効果は期待できない。By using a semiconductor disk as a medium for the CD file 30 and the system recovery history information file 40, it is possible to further improve the overall performance of the system and speed up system recovery. Furthermore, by acquiring memory-resident information updated partial management information for areas other than the backup copy update target area at the time of a checkpoint into the CD file 30, the updated partial management information 14 and 15 on the memory 11 can be recovered at the time of system recovery. can be easily done. Thereby, when the system recovery process ends, normal processing can be started without updating the backup copy that was not used in the system recovery process, so the system recovery time can be shortened. Note that the method of the present invention shown in this embodiment is suitable for cases where the amount of memory-resident information is not very large (about several MB or less)7 or when the amount of updates per transaction is large and the updated portion is not biased. , the effect cannot be expected.

これらの場合を考慮し、メモリ常駐情報を各部分の更新
特性により分類して本発明の方式と従来方式を組み合わ
せたり１時間的な変動により両方式を使い分けることに
より、本発明の適用効果を向上させることができる。Taking these cases into consideration, the effect of applying the present invention can be improved by classifying memory-resident information according to the update characteristics of each part, combining the method of the present invention with the conventional method, or using both methods depending on hourly fluctuations. can be done.

〔Effect of the invention〕

本発明によれば、チェックポイント時のオーバヘッドを
減少することによりシステムの全体性能を向上させると
ともに、障害によるシステム停止時に高速にシステムを
回復することができる。以下１代表的な銀行オンライン
システムを例に効果を見積もる。According to the present invention, the overall performance of the system can be improved by reducing the overhead during checkpointing, and the system can be quickly recovered when the system is stopped due to a failure. The effects will be estimated below using one typical bank online system as an example.

（ａ）前提条件（ｉ）システム回復用履歴情報の取得量・１トランザク
シヨン当たり　　１２ＫＢ（ｉｉ）システム回復用履歴
情報ファイル・媒体は、従来方式および本発明の方式と
も半導体ディスクとする。(a) Preconditions (i) Amount of system recovery history information to be acquired per transaction: 12 KB (ii) System recovery history information file/medium is a semiconductor disk in both the conventional method and the method of the present invention.

（ｎｉ）メモリ常駐情報領域・４０ＭＢと１００ＭＢの両方を想定する。(ni) Memory resident information area - Assuming both 40MB and 100MB.

・上記領域中、システム領域を４ＭＢ、残りをユーザ領
域とする。- Among the above areas, the system area is 4MB, and the rest is the user area.

（ｔｖ）ＣＤファイル・媒体は、従来方式および本発明の方式とも半導体ディ
スクとする。(tv) The CD file/medium is a semiconductor disk in both the conventional method and the present invention method.

（ｖ）半導体ディスクの性能 ■アクセス時間・１回の入出力処理で前のブロックと連続するブロック
を処理する場合０．５０　ミリ秒・上記以外の場合の平均アクセス時間０．７３　ミリ秒 ■データ転送速度　　６ＭＢ／秒（ｖｉ　）各種情報の媒体格納形式％式％・システム回復用履歴情報およびＣＤ情報ともに、チェ
ックポイント時およびシステム回復処理時には２０個の
ブロックを−括入出力する。(v) Semiconductor disk performance ■Access time - 0.50 milliseconds when processing a block consecutive to the previous block in one input/output process - Average access time for other cases 0.73 milliseconds ■Data Transfer rate: 6 MB/sec (vi) Media storage format for various information: % Formula % - For both system recovery history information and CD information, 20 blocks are input/output in batches during checkpoints and system recovery processing.

・不連続ブロック入出力時のオーバヘッド環は、無視す
る。- Overhead rings during discontinuous block input/output are ignored.

（ｉ）トラフィック・１００件／秒とする。(i) Traffic - 100 items/second.

（ｍ）更新回数・１トランザクシヨン当たりのメモリ常駐情報更新回数
を４回とする。内訳は、システム領域およびユーザ領域
各２回とする。(m) Number of updates - The number of times memory resident information is updated per transaction is 4 times. The breakdown is twice for the system area and twice for the user area.

・更新部分の分布はランダムとする。・The distribution of updated parts is random.

（ｘ）システム回復時間の見積り・ＣＤファイルからメモリ常駐情報を読む時間およびシ
ステム回復用履歴情報を順、逆の両方向に読む時間の合
計とする。(x) Estimation of system recovery time - The total time of reading memory resident information from the CD file and reading system recovery history information in both forward and reverse directions.

（ｂ）効果見積り従来方式および本発明の方式の各々について、システム
回復時間を求める。(b) Effect estimation The system recovery time is determined for each of the conventional method and the method of the present invention.

（ｉ）従来方式 ■メモリ常駐情報領域が４０ＭＢの場合・ＣＤ情報入出
力時間　　１１．８秒・チェックポイント間隔ＣＤ情報入出力時間とトラフィックより、最小で１２０
０件。(i) Conventional method ■When the memory resident information area is 40MB ・CD information input/output time 11.8 seconds ・Checkpoint interval From the CD information input/output time and traffic, the minimum is 120 MB
0 items.

・システム回復用履歴情報を読む時間・１２ＫＢＸ１２００Ｘ２−２８．８ＭＢの情報を読む
必要があり、８．５秒かかる。- Time to read system recovery history information - It is necessary to read 12 KB x 1200 x 2 - 28.8 MB of information, which takes 8.5 seconds.

・システム回復時間　　計２０．３秒・チェックポイント時の処理オーβへッドトランザク２
３２１件当たりの２０ブロック−柄入出力処理処理回数
にして、０．４２回 ■メモリ常駐情報領域が１００ＭＢの場合・ＣＤ情報入
出力時間　　２９．５秒・チェックポイント間隔最小で、３０００件６・システム回復用履歴情報を読む時間２１．２秒・システム回復時間　　計５０．７秒・チェックポイント時の処理オーβへッドトランザク２
３２１件当たりの２０ブロック−柄入出力処理処理回数
にして、０．４２回（ｉｉ）本発明の方式 ■メモリ常駐情報領域が４０ＭＢの場合・更新ブロック
数チェックポイント時に更新が必要なブロック数は第′１
表のとおりとなる（ＣＰはチェックポイントの略）。・System recovery time: 20.3 seconds in total ・Processing at checkpoint Oβ head transaction 2
20 blocks per 321 items - Number of pattern input/output processing times: 0.42 times - When memory resident information area is 100MB - CD information input/output time: 29.5 seconds - Minimum checkpoint interval: 3000 items 6 - Time to read system recovery history information: 21.2 seconds ・Total system recovery time: 50.7 seconds ・Checkpoint processing Oβ head transaction 2
20 blocks per 321 items - 0.42 times in pattern input/output processing (ii) Method of the present invention - When the memory resident information area is 40MB - Number of updated blocks The number of blocks that need to be updated at checkpoint is '1st
As shown in the table (CP is an abbreviation for checkpoint).

第　　１　　表・システム回復時間および処理オーバヘッドは第２表の
とおりとなる。Table 1: System recovery time and processing overhead are shown in Table 2.

■と同様にして、見積もる（回復時間の単位は秒、処理
オーバヘッドの単位はトランザクション１件当たりの２
０ブロック−柄入出力処理処理回数）。Estimate in the same way as above (recovery time is in seconds, processing overhead is in units of 2 seconds per transaction).
0 block - number of pattern input/output processing processes).

第　　２　　表 ■メモリ常駐情報領域が１００ＭＢの場合・更新ブロッ
ク数チェックポイント時に更新が必要なブロック数は第３表
のとおりとなる。Table 2 - When the memory resident information area is 100 MB - Number of updated blocks The number of blocks that need to be updated at checkpoint is as shown in Table 3.

第　　３　　表・システム回復時間および処理オーバヘッドは第４表の
とおりとなる。Table 3 - System recovery time and processing overhead are as shown in Table 4.

第　　４　　表以上のとおり１本発明の方式では従来方式と比較して障
害時のシステム回復時間をほぼ半減させることができ、
この効果はメモリ常駐情報の量が増えるほど大きい。ま
た、処理オーバヘッドについては、１５〜３０％程度減
少する。As shown in Table 4, the method of the present invention can reduce the system recovery time in the event of a failure by almost half compared to the conventional method.
This effect becomes larger as the amount of memory-resident information increases. Furthermore, processing overhead is reduced by about 15 to 30%.

[Brief explanation of the drawing]

第１図は本発明の１実施例の概要の説明図、第２図は本
実施例における情報処理システムの全体構成図、第３図
はメモリの内容を示す図、第４図はＣＤファイルの内容
を示す図、第５図はシステム回復処理の概略フロー図、
第６図はトランザクションの処理状態の分類の説明図で
ある。１０・・・ＣＰＵ／メモリ、１２・・・メモリ常駐情報
、１４および１５・・・メモリ常駐情報更新部分管理情
報、２０・・・データベース、３０・・・チェックポイ
ントダンプファイル、３３および３７・・・メモリ常駐
情報更新部分管理情報格納領域、３４および３８・・・
メモリ常駐情報バックアップコピー格納領域。４０・・・システム回復用履歴情報ファイル、５０・・
・記録用履歴情報ファイル。Fig. 1 is an explanatory diagram of the outline of one embodiment of the present invention, Fig. 2 is an overall configuration diagram of the information processing system in this embodiment, Fig. 3 is a diagram showing the contents of the memory, and Fig. 4 is a diagram showing the contents of the CD file. A diagram showing the contents, Figure 5 is a schematic flow diagram of system recovery processing,
FIG. 6 is an explanatory diagram of classification of transaction processing states. 10...CPU/memory, 12...Memory resident information, 14 and 15...Memory resident information update partial management information, 20...Database, 30...Checkpoint dump file, 33 and 37...・Memory resident information update partial management information storage area, 34 and 38...
Memory resident information backup copy storage area. 40... System recovery history information file, 50...
・Historical information file for recording.

Claims

[Claims]

1. In an information processing system that performs transaction processing using information resident on the main storage device or virtual storage device, multiple areas for storing the above-mentioned resident information are provided on one or more non-volatile storage devices, and the transaction processing A history information storage means for acquiring processing history information in a history information file, an updated part storage means for storing an updated part of the above-mentioned resident information, and an update part storage means for storing the above-mentioned information used for recovery processing in the event of a system failure at checkpoints set during system operation. It is equipped with a checkpoint processing means for acquiring a checkpoint in a non-volatile storage device, and a system recovery means for restoring the system to the correct state when the system is stopped, and the update partial storage means stores the previous update to each resident information storage area for each of the above-mentioned resident information storage areas. When the resident information is updated, the fact of the partial update is set in the updated part management information that stores the updated part after the time of partial writing, and the checkpoint processing means reads the data from the plurality of resident information storage areas at the time of the checkpoint. Selecting an area to be used and writing an updated part of the resident information to the area based on the updated part management information corresponding to the area, and the above system recovery means is a final effective check during recovery processing when the system is stopped due to a failure. A failure recovery method for a system characterized by reading resident information from a resident information storage area updated at the time of a point, and recovering the resident information using resident information update history information in a history information file.