JPS61139847A

JPS61139847A - Trouble range localizing method of program

Info

Publication number: JPS61139847A
Application number: JP59262357A
Authority: JP
Inventors: Takashi Yamamoto; 隆山本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1984-12-11
Filing date: 1984-12-11
Publication date: 1986-06-27

Abstract

PURPOSE: To facilitate identifying the cause of a fault by managing the execution of an application program transaction by transaction and preserving the state at the start of each transaction. CONSTITUTION: When an application program 2 is executed, a transaction management part 3 identifies the first transaction 6 of the application program 2 and starts it, and register values, etc. at the start of the transaction 6 are stored in a buffer 10 to preserve the state at the start of the transaction 6. If the transaction 6 terminates normally, the normal end is reported to the management part 3. Data to be delivered to the transaction 7 to be started next, updated data, etc. are stored in the buffer 10. If a transaction terminates abnormally partway through, a transaction recovery part 4 is started, the register values, etc. at the start time which are stored in the buffer 10 are transferred, required data are transferred from an auxiliary storage device 12, and the program is restored to the state at the start time.

Description

[Detailed Description of the Invention]

[Industrial Application Field]

The present invention relates to a method for localizing the fault range of a program, and in particular to an improvement in a management method for an information processing system that makes the process leading to the abnormal termination of an application program easy to reproduce, so that data for fault investigation can be collected effectively and quickly.

[Conventional technology]

When an application program running in an information processing system terminates abnormally, the memory contents at the time of the abnormal termination are captured, but the fault range cannot be localized.

Consequently, the only way to trace the process that led to the abnormal termination is to re-run the application program from the beginning, which takes a great deal of effort. In practice, reproducing the process that led to the abnormal termination is therefore extremely difficult.

Furthermore, because there is currently no retry function, the opportunity to recover is lost even in cases where a job that terminated abnormally might run normally if it were retried.

[Problem that the invention seeks to solve]

As described above, conventional information processing systems have had no effective means of dealing with abnormal termination.

[Means for solving problems]

The present invention aims to solve the above problem. To that end, each application program is first structured as a plurality of transactions, and an instruction marking the boundary is added at the beginning and end of each transaction. The operating system is provided with transaction management means, transaction recovery means, and transaction retry means. When the application program is executed, the transaction management means identifies each transaction and saves the data to be handed over to the transaction about to be executed, together with the register values, etc. at the start of that transaction. If an abnormal termination occurs, the transaction recovery means uses the saved data, register values, etc. to restore the abnormally terminated transaction to its state at the start.
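The save-and-restore behaviour described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `AbnormalTermination`, `run_program`, `load`, and `faulty_update` are hypothetical, a dictionary stands in for the register values and hand-over data, and a Python exception stands in for an abnormal termination.

```python
import copy


class AbnormalTermination(Exception):
    """Hypothetical signal that a transaction ended abnormally."""


def run_program(transactions, state):
    """Run the transactions in order and stop at the first abnormal end.

    Returns the index of the transaction that ended abnormally, or None
    if all of them ended normally. On an abnormal end, `state` is put back
    to what it was when the failing transaction started.
    """
    for index, transaction in enumerate(transactions):
        # Transaction management: save the state at the start.
        saved = copy.deepcopy(state)
        try:
            transaction(state)
        except AbnormalTermination:
            # Transaction recovery: restore the saved starting state.
            state.clear()
            state.update(saved)
            return index
    return None


def load(state):
    state["total"] = 10


def faulty_update(state):
    state["total"] = -1  # partial update made before the failure
    raise AbnormalTermination("bad input")


state = {"total": 0}
failed_at = run_program([load, faulty_update], state)
```

In this sketch the failure is localized to the second transaction (`failed_at` is 1), and `state` again holds the value left by `load`, so only `faulty_update` would need to be re-run.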

The transaction retry means then retries the transaction from its beginning and stores trace data, such as the data updated within the transaction, in a predetermined buffer.
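As a rough illustration of retrying a transaction while collecting trace data, the following Python sketch records every update made during the retry. The names `TracingStore`, `retry_with_trace`, and `failing_transaction` are assumptions made for this example, and the trace format (key, old value, new value) is likewise an assumption, since the text only says that updated data and similar trace data are stored.

```python
class TracingStore(dict):
    """Dictionary that records each item assignment in a trace buffer."""

    def __init__(self, initial, trace):
        super().__init__(initial)  # initial contents are not traced
        self.trace = trace

    def __setitem__(self, key, value):
        # Record (key, old value or None, new value) before updating.
        self.trace.append((key, self.get(key), value))
        super().__setitem__(key, value)


def retry_with_trace(transaction, start_state):
    """Re-run one transaction from its starting state and collect a trace."""
    trace = []  # plays the role of the predetermined buffer
    store = TracingStore(start_state, trace)
    try:
        transaction(store)
        outcome = "normal"
    except Exception as exc:
        outcome = f"abnormal: {exc}"
    return outcome, trace


def failing_transaction(store):
    store["x"] = 1
    store["y"] = store["x"] + 1
    raise ValueError("boom")


outcome, trace = retry_with_trace(failing_transaction, {"x": 0})
```

Because only the one transaction is re-run, the trace stays short: here it contains just the two updates made before the failure was reproduced.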

[Effect]

By managing the execution of the application program transaction by transaction and saving the state at the start of each transaction, the fault range can be localized when an abnormal termination occurs, and the abnormally terminated transaction can be restored to its state at the start. Consequently, only the transaction in which the abnormal termination occurred can be retried, without re-executing the transactions that terminated normally. By identifying and localizing the fault range to be retried in this way, trace data can be acquired quickly and effectively, and the cause of the fault becomes easier to determine.

[Example]

The figure is a block diagram of the main parts of an information processing system used in an embodiment of the present invention. In the figure, 1 is the operating system and 2 is the application program. Items 3 to 5 are newly provided in this embodiment: 3 is the transaction management unit, 4 is the transaction recovery unit, and 5 is the transaction retry unit. Items 6 to 9 are the transactions that make up application program 2, 10 and 11 are buffer memories, and 12 is an auxiliary storage device (DASD).

When application program 2 is executed in the system configured as described above, the transaction management unit 3 identifies the first transaction 6 of application program 2 and starts it [arrow in the figure]. It first has the register values, etc. at the start of transaction 6 stored in buffer 10 [arrow in the figure], thereby preserving the state at the start of transaction 6.

If data is updated during the execution of transaction 6, the updated data is also stored sequentially in buffer 10 [arrow in the figure].

If transaction 6 terminates normally, the transaction management unit 3 is notified of this [arrow]. At this point, the data to be handed over to transaction 7, which is started next, and the updated data, etc. are already stored in buffer 10.

The transaction management unit 3 has the updated data stored in buffer 10 transferred to DASD 12 [arrow], then starts the next transaction 7 [arrow], has the register values stored in buffer 10 [arrow], and has the data to be handed over from transaction 6 transferred from buffer 10 [arrow].

Suppose that transaction 7 terminates abnormally partway through (abnormal termination 13). In this case the transaction recovery unit 4 is started, and on its command [arrow] the previously stored register values, etc. from the start of the transaction are transferred to transaction 7 from buffer 10, and the necessary data are transferred from DASD 12 [arrows], so that transaction 7 is restored to its state at the start.

Meanwhile, the data that was updated and stored in buffer 10 before transaction 7 terminated abnormally has not yet been transferred to DASD 12. DASD 12 therefore still holds the state as of the start of transaction 7.
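The reasoning above, that DASD 12 still holds the start-of-transaction state because buffered updates are transferred only after a normal end, can be sketched in Python as a deferred-write buffer. The class `UpdateBuffer` and its method names are hypothetical, and plain dictionaries stand in for buffer 10 and DASD 12.

```python
class UpdateBuffer:
    """Holds a transaction's updates until the transaction ends normally."""

    def __init__(self, dasd):
        self.dasd = dasd    # stands in for the auxiliary storage (DASD 12)
        self.pending = {}   # stands in for the updates held in buffer 10

    def write(self, key, value):
        # Updates go to the buffer only; the DASD copy is untouched.
        self.pending[key] = value

    def read(self, key):
        # The running transaction sees its own pending updates first.
        if key in self.pending:
            return self.pending[key]
        return self.dasd[key]

    def commit(self):
        # Normal end: transfer the buffered updates to the DASD copy.
        self.dasd.update(self.pending)
        self.pending.clear()

    def discard(self):
        # Abnormal end: drop the buffered updates.
        self.pending.clear()


dasd = {"balance": 100}
buffer10 = UpdateBuffer(dasd)

buffer10.write("balance", 150)        # update made by a transaction
seen_during = buffer10.read("balance")
dasd_during = dasd["balance"]
buffer10.discard()                    # the transaction ended abnormally
dasd_after_abort = dasd["balance"]

buffer10.write("balance", 120)        # a later transaction ends normally
buffer10.commit()
dasd_after_commit = dasd["balance"]
```

Since nothing reaches `dasd` before `commit`, an abnormal end needs no undo step on the storage side; discarding the buffer is enough to leave the starting state in place.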

The transaction retry unit 5 then starts transaction 7 again [arrow], has it executed from the beginning, and has the necessary data, such as trace data, collected. The collected data is stored in buffer 11 [arrow].

By reproducing the abnormal termination of transaction 7 in this way and examining the data collected in the meantime, the cause of the abnormality can easily be determined.

If transaction 7 terminates normally as a result of the retry, it may be regarded as having been re-executed automatically.
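The overall flow of this embodiment, including the case where the retry ends normally, might look like the following Python sketch. It makes several assumptions that the text does not state: the function name `run_with_retry` is hypothetical, a failing transaction is retried exactly once, any Python exception counts as an abnormal termination, and the trace entries are simple tuples.

```python
import copy


def run_with_retry(transactions, state):
    """Run transactions in order, retrying a failed one once.

    Returns (completed, state, trace). If the retry also fails, the
    returned state is the one saved at the start of that transaction.
    """
    trace = []  # stands in for buffer 11
    for index, transaction in enumerate(transactions):
        saved = copy.deepcopy(state)  # state at the start of the transaction
        try:
            transaction(state)
            continue                  # normal end: go on to the next one
        except Exception as first_error:
            trace.append((index, "abnormal end", repr(first_error)))
        state = copy.deepcopy(saved)  # recovery: back to the starting state
        try:
            transaction(state)        # retry from the beginning
            trace.append((index, "retry ended normally", None))
        except Exception as second_error:
            trace.append((index, "retry reproduced failure", repr(second_error)))
            return False, saved, trace
    return True, state, trace


calls = {"n": 0}


def step(s):
    s["v"] = s.get("v", 0) + 10


def flaky(s):
    # Fails on its first call only, like a transient fault.
    calls["n"] += 1
    s["v"] = s.get("v", 0) + 1
    if calls["n"] == 1:
        raise RuntimeError("transient")


def always_bad(s):
    s["v"] = 99
    raise ValueError("persistent")


ok, final_state, trace = run_with_retry([step, flaky], {})
ok2, final_state2, trace2 = run_with_retry([step, always_bad], {})
```

In the first run the retry ends normally and the program simply continues, which corresponds to treating the retry as an automatic re-execution. In the second run the failure is reproduced, and the trace together with the restored starting state is what would be examined.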

As described above, according to this embodiment the range to be retried is narrowed to a small region. Trace data and the like are easy to acquire within such a localized range, so data can be collected in detail and the cause of the fault becomes easier to determine.

[Effect of the invention]

As explained above, according to the present invention, only the transaction in which an abnormal termination occurred can be retried, without re-executing the transactions that terminated normally. Data for determining the cause of a fault can therefore be collected quickly, effectively, and easily.

[Brief explanation of drawings]

The figure is a block diagram of the main parts of an information processing system used in an embodiment of the present invention. In the figure, 1 is the operating system; 2 is the application program; 3, 4, and 5 are the transaction management unit, the transaction recovery unit, and the transaction retry unit, respectively; 6 to 9 are the transactions that make up application program 2; 10 and 11 are buffer memories; and 12 is the DASD.

Claims

[Claims]

A method for localizing a fault range of a program, characterized in that an operating system that manages the execution of an application program consisting of a plurality of sequential transactions is provided with a transaction management means, a transaction recovery means, and a transaction retry means; when the application program is executed, the transaction management means detects the end of each transaction and stores the state at the start of the detected transaction; and when the application program terminates abnormally, the transaction that was being executed at that time is restored to the saved starting state by the transaction recovery means, after which the transaction retry means re-executes the transaction to obtain desired trace data.