JPH07295862A

JPH07295862A - System control method applying error tracing

Info

Publication number: JPH07295862A
Application number: JP6092715A
Authority: JP
Inventors: Masato Ushijima; 正人牛島
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1994-04-28
Filing date: 1994-04-28
Publication date: 1995-11-10
Anticipated expiration: 2014-01-13
Also published as: JP2846238B2

Abstract

PURPOSE:To accurately and easily decide an error occurrence place based on the stored trace information, i.e., the attribute information on the state change of a program. CONSTITUTION:A trace processing hardware 22 functions to collect the attribute information on the state change of programs and stores these collected information in a tracing buffer 24 when the trace processing is started by a user program 21a or a system program 21b. In a system control method which applies the error tracing, the process attribute information is stored in a tracing buffer memory or a tracing buffer area of the memory when the state change of the program occurs. When an error is detected, the attribute information stored in the buffer memory or the buffer area is traced. Then an error deciding program decides the program of the error occurrence place.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、エラー発生源の認定
を容易にすると共に、認定したエラー発生源に応じてシ
ステムを制御し管理するエラートレースによるシステム
制御管理方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system control management method by error tracing for facilitating the recognition of an error source and controlling and managing a system according to the recognized error source.

【０００２】[0002]

【従来の技術】低速で動作するメモリシステムと少量の
高速で動作するハードウェアにより構成された従来のコ
ンピュータシステムは、エラー発生源の認定は、エラー
発生とエラー検出が同時であること、またエラー発生元
がプログラムの状態変化を生ずる前にエラー検出が行わ
れることを想定している。すなわち、エラー原因となる
動作を起動したプログラムとエラー検出時に実行されて
いたプログラムとは同じプログラムの状態（ユーザプロ
グラムまたはシステムプログラム）であり、エラーの原
因となるプログラムとエラーを検出したプログラムとは
同じプロセスであるとの前提で、エラー原因とエラー発
生元の認定を行い、エラー処理を行っていた。2. Description of the Related Art A conventional computer system composed of a memory system operating at a low speed and a small amount of hardware operating at a high speed determines the source of an error by simultaneously detecting the error and detecting the error. It is assumed that the error detection is performed before the source causes the state change of the program. That is, the program that started the operation that causes the error and the program that was being executed when the error was detected are in the same program state (user program or system program), and the program that causes the error and the program that detected the error are Assuming that the processes are the same, the cause of the error and the source of the error were certified, and the error was processed.

【０００３】図２３は、コンピュータシステムにおける
エラー発生状況を模式的に時間の経過に従って示した説
明図であり、図において１と２は異なる２つのプロセス
ＡとプロセスＢを示している。また、３はシステムプロ
グラムの実行状態、４はユーザプログラムの実行状態を
示している。５〜９はエラーを発生させる動作が行われ
るタイミングを示し、１０〜１３は前記タイミング５〜
９のエラーを発生させる動作によるエラーの発生に対応
した夫々のエラーが検出されるタイミングを示してい
る。タイミング５とタイミング１０、タイミング６とタ
イミング１１、タイミング７とタイミング１２、タイミ
ング８とタイミング１３、タイミング９とタイミング１
４が夫々のエラー発生とエラー検出の組である。FIG. 23 is an explanatory diagram schematically showing an error occurrence situation in a computer system with the passage of time, and in the figure, two processes A and B different from 1 and 2 are shown. Further, 3 indicates the execution state of the system program, and 4 indicates the execution state of the user program. Reference numerals 5 to 9 indicate timings at which an operation for generating an error is performed, and reference numerals 10 to 13 indicate the timings 5 to 5.
9 shows the timings at which the respective errors corresponding to the occurrence of the error due to the operation of causing the error of No. 9 are detected. Timing 5 and Timing 10, Timing 6 and Timing 11, Timing 7 and Timing 12, Timing 8 and Timing 13, Timing 9 and Timing 1
Reference numeral 4 is a set of error occurrence and error detection.

【０００４】エラー発生元がプログラムの状態変化を生
ずる前にエラー検出が行われることを前提としてなされ
る従来のエラー発生元の認定では、従来のコンピュータ
システムが低速で動作するメモリシステムと少容量の高
速で動作するハードウェア（キャッシュメモリや書き込
み高速化のための書き込みバッファなどをいう）により
構成されていることから、エラーを発生させる動作が行
われるタイミングと発生したエラーが検出されるタイミ
ングとの間隔は図２３に示すほど大きくはならず、前記
エラー発生とエラー検出が同時であること、またエラー
発生元がプログラムの状態変化を生ずる前にエラー検出
が行われるという前提の基にエラーの認定が行われてい
た。In the conventional error source qualification, which is performed on the assumption that the error source is detected before the error source causes a change in the state of the program, the conventional computer system has a memory system operating at a low speed and a small capacity. Since it is composed of hardware that operates at high speed (refers to cache memory, write buffer for speeding up writing, etc.), the timing of the operation that causes an error and the timing of detecting the error that occurs The interval is not so large as shown in FIG. 23, the error is detected and the error is detected at the same time, and the error is detected based on the premise that the error is detected before the error source causes the program state change. Was being done.

【０００５】[0005]

【発明が解決しようとする課題】従来のコンピュータシ
ステムのエラー発生源の認定は以上のような環境下で行
われていたのに対し、近年のハードウェア技術の進展に
よりさらに高速のＣＰＵと高速のキャッシュメモリを大
量に使用したコンピュータシステムが構成可能となり、
このためにエラーを発生する動作をＣＰＵが実行しても
この結果発生するエラーはすぐには検出されず、またそ
の一方でＣＰＵはキャッシュメモリ内のプログラムを高
速で実行できるためエラー検出時にはプログラムがエラ
ー発生元からかなり先まで実行されている状況が生じ、
実行中のプログラム状態とエラー発生元のプログラム状
態とが一致しない場合が生ずることになる。While the conventional method of recognizing the error source of a computer system has been performed under the above-mentioned environment, the progress of hardware technology in recent years has made it possible to further improve the CPU and speed. A computer system that uses a large amount of cache memory can be configured,
For this reason, even if the CPU executes an operation that causes an error, the resulting error is not immediately detected, and on the other hand, the CPU can execute the program in the cache memory at high speed, so the program is not detected when the error is detected. There is a situation where it is running far from the error source,
The state of the program being executed and the state of the program where the error occurred may not match.

【０００６】すなわち、図２３に示すタイミング５とタ
イミング１０、タイミング６とタイミング１１、タイミ
ング７とタイミング１２、タイミング８とタイミング１
３、タイミング９とタイミング１４など夫々のエラー発
生とエラー検出の組の内で、タイミング６とタイミング
１１、タイミング８とタイミング１３、タイミング９と
タイミング１４のエラー発生元とエラー検出の組では、
エラーが検出されるタイミングでのプログラム状態とエ
ラー発生元のプログラム状態とが異なることになるため
エラー発生元が正確に認定できない。タイミング６とタ
イミング１１の組ではユーザプログラムがエラー発生元
であるのに対しシステムプログラムがエラー発生元であ
ると誤って認定され、またタイミング８とタイミング１
３の組ではシステムプログラムがエラー発生元であるの
に対しユーザプログラムがエラー発生元であると誤って
認定され、さらにタイミング９とタイミング１４の組で
は無関係のプロセスＢがエラー発生元と認定されること
になる。That is, timing 5 and timing 10, timing 6 and timing 11, timing 7 and timing 12, timing 8 and timing 1 shown in FIG.
3, among the pairs of error occurrence and error detection such as timing 9 and timing 14, among the pairs of error occurrence source and error detection of timing 6 and timing 11, timing 8 and timing 13, timing 9 and timing 14,
Since the program state at the time when an error is detected is different from the program state of the error source, the error source cannot be accurately identified. In the set of timing 6 and timing 11, the user program is the error source, but the system program is erroneously recognized as the error source, and the timing 8 and the timing 1 are set.
In the set of 3, the system program is the error source, but the user program is mistakenly recognized as the error source, and in the set of timing 9 and timing 14, the unrelated process B is recognized as the error source. It will be.

【０００７】つまり、従来のコンピュータシステムでは
容易に成立したエラーの原因となった実行プログラムと
エラーを検出した実行プログラムとは同じプロセスであ
るとの前提条件は成立しなくなり、正確なエラー検出が
困難になる問題点が生ずる。That is, in the conventional computer system, the precondition that the execution program that causes the error and the execution program that has detected the error are the same process is not satisfied, and it is difficult to accurately detect the error. The problem arises.

【０００８】本発明は上記のような問題点を解消するた
めになされたもので、請求項１の発明はエラー発生元の
認定を正確かつ容易に行うことのできるエラートレース
によるシステム制御管理方法を得ることを目的とする。The present invention has been made to solve the above-mentioned problems, and the invention of claim 1 provides a system control management method by an error trace capable of accurately and easily recognizing an error occurrence source. The purpose is to get.

【０００９】請求項２の発明は、エラー発生時のプログ
ラム状態とエラー検出時のプログラム状態が異なる場合
でもエラー発生元の認定を容易にするエラートレースに
よるシステム制御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by an error trace that facilitates recognition of an error occurrence source even when the program state at the time of error occurrence and the program state at the time of error detection are different. .

【００１０】請求項３の発明は、エラー発生原因となっ
た資源を使用しているプログラムの認定を正確かつ容易
に行うことのできるエラートレースによるシステム制御
管理方法を得ることを目的とする。It is an object of the invention of claim 3 to provide a system control management method by error tracing which can accurately and easily identify a program using a resource that has caused an error.

【００１１】請求項４の発明は、エラー発生時のプログ
ラム状態とエラー検出時のプログラム状態が異なる場合
でもエラー発生元の認定を正確かつ容易に行うことので
きるエラートレースによるシステム制御管理方法を得る
ことを目的とする。According to a fourth aspect of the present invention, there is obtained a system control management method by an error trace capable of accurately and easily recognizing an error source even when a program state at the time of error occurrence and a program state at the time of error detection are different. The purpose is to

【００１２】請求項５の発明は、エラー発生時のプログ
ラム状態とエラー検出時のプログラム状態が異なる場合
でもエラー発生元の認定を正確かつ容易に行うことので
きるエラートレースによるシステム制御管理方法を得る
ことを目的とする。According to a fifth aspect of the present invention, there is provided a system control management method based on an error trace capable of accurately and easily recognizing an error source even when a program state at the time of error occurrence and a program state at the time of error detection are different. The purpose is to

【００１３】請求項６の発明は、トレース用バッファの
有効利用を可能にし、エラー発生元の認定を正確かつ容
易に行うことのできるエラートレースによるシステム制
御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by error trace which enables effective use of a trace buffer and can accurately and easily identify an error occurrence source.

【００１４】請求項７の発明は、エラー発生元の認定を
正確かつ容易に行うことのできるエラートレースによる
システム制御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by error tracing which can accurately and easily identify an error source.

【００１５】請求項８の発明は、エラー発生元の認定を
正確かつ容易に行うことのできるエラートレースによる
システム制御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by error tracing, which can accurately and easily identify an error source.

【００１６】請求項９の発明は、エラー発生元の認定を
正確かつ容易に行うことのできるエラートレースによる
システム制御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by error tracing, which can accurately and easily identify an error source.

【００１７】請求項１０の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 10 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error source.

【００１８】請求項１１の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 11 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error source.

【００１９】請求項１２の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 12 to obtain a system control management method by an error trace capable of accurately and easily identifying an error occurrence source.

【００２０】請求項１３の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 13 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error source.

【００２１】請求項１４の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the present invention to provide a system control management method by error tracing, which can accurately and easily identify an error source.

【００２２】請求項１５の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 15 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error occurrence source.

【００２３】請求項１６の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。An object of the 16th aspect of the present invention is to obtain a system control management method by an error trace capable of accurately and easily recognizing an error occurrence source.

【００２４】請求項１７の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。An object of the 17th aspect of the invention is to obtain a system control management method based on an error trace capable of accurately and easily recognizing an error occurrence source.

【００２５】請求項１８の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 18 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error occurrence source.

【００２６】請求項１９の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 19 to obtain a system control management method by error tracing capable of accurately and easily recognizing an error occurrence source.

【００２７】請求項２０の発明は、エラー発生元の認定
に必要なデータを効率良く収集でき、エラー発生元の認
定を正確かつ容易に行うことのできるエラートレースに
よるシステム制御管理方法を得ることを目的とする。According to the twentieth aspect of the present invention, it is possible to obtain a system control management method by an error trace that can efficiently collect the data necessary for certifying the error source and can accurately and easily certify the error source. To aim.

【００２８】請求項２１の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 21 to obtain a system control management method by an error trace capable of accurately and easily recognizing an error occurrence source.

【００２９】請求項２２の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 22 to obtain a system control management method by error tracing capable of accurately and easily recognizing an error occurrence source.

【００３０】請求項２３の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。An object of the 23rd aspect of the invention is to obtain a system control management method by error tracing, which can accurately and easily identify the error source.

【００３１】請求項２４の発明は、エラー発生元の認定
を正確かつ容易に行うことのできるエラートレースによ
るシステム制御管理方法を得ることを目的とする。It is an object of the invention of claim 24 to obtain a system control management method by error tracing capable of accurately and easily recognizing an error occurrence source.

【００３２】請求項２５の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。It is an object of the invention of claim 25 to obtain a system control management method by an error trace for accurately and easily recognizing an error occurrence source and improving system reliability.

【００３３】請求項２６の発明は、エラー発生元の認定
を正確かつ容易に行うと共に有効資源の利用率を向上さ
せ、システムの信頼性を向上させるエラートレースによ
るシステム制御管理方法を得ることを目的とする。The object of the 26th aspect of the invention is to obtain a system control management method by an error trace for accurately and easily recognizing an error source, improving the utilization rate of effective resources, and improving system reliability. And

【００３４】請求項２７の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。It is an object of the invention of claim 27 to obtain a system control management method based on an error trace for accurately and easily recognizing an error occurrence source and improving system reliability.

【００３５】請求項２８の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。It is an object of the invention of claim 28 to obtain a system control management method by an error trace for accurately and easily recognizing an error occurrence source and improving system reliability.

【００３６】請求項２９の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。It is an object of the invention of claim 29 to obtain a system control management method by error tracing that accurately and easily identifies an error source and improves system reliability.

【００３７】請求項３０の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。It is an object of the invention of claim 30 to obtain a system control management method by an error trace for accurately and easily recognizing an error occurrence source and improving system reliability.

【００３８】請求項３１の発明は、エラー発生元の認定
を正確かつ容易に行うと共にシステムの信頼性を向上さ
せるエラートレースによるシステム制御管理方法を得る
ことを目的とする。An object of the invention of claim 31 is to obtain a system control management method by error tracing which accurately and easily identifies an error source and improves system reliability.

【００３９】[0039]

【課題を解決するための手段】請求項１の発明に係るエ
ラートレースによるシステム制御管理方法は、トレース
用バッファにはプログラムの状態変化を示す属性情報を
記憶し、記憶した属性情報を基にエラー発生源を認定す
る構成を備えたものである。According to another aspect of the present invention, there is provided an error trace system control management method, wherein attribute information indicating a change in a program state is stored in a trace buffer, and an error is recorded based on the stored attribute information. It is equipped with a structure that certifies the source.

【００４０】請求項２の発明に係るエラートレースによ
るシステム制御管理方法は、トレース用バッファにはプ
ログラムの状態変化を示す属性情報を発生時間順に記憶
し、この記憶した属性情報を前記発生時間順に進みある
いは戻り検索することでエラー発生原因に関係するプロ
グラムを認定する構成を備えたものである。In the system control management method by error tracing according to the second aspect of the present invention, the attribute information indicating the program state change is stored in the trace buffer in the order of occurrence time, and the stored attribute information is advanced in the order of occurrence time. Alternatively, it is configured to certify the program related to the cause of the error by performing a return search.

【００４１】請求項３の発明に係るエラートレースによ
るシステム制御管理方法は、プロセスがどの資源を使用
しているかを知ることのできるトレース用バッファに記
憶した属性情報を検索し、前記プロセスが使用している
資源を知ることでエラー発生原因となった資源を使用し
ているプログラムを認定する構成を備えたものである。According to the third aspect of the present invention, there is provided an error trace system control management method, which retrieves attribute information stored in a trace buffer, which allows a process to know which resource the process uses, and uses the attribute information. It is provided with a configuration for recognizing the program that uses the resource that has caused the error by knowing the existing resource.

【００４２】請求項４の発明に係るエラートレースによ
るシステム制御管理方法は、プログラムの状態が変化す
る前のプログラム状態あるいは変化した後のプログラム
状態を属性情報としてトレース用バッファに格納する構
成を備えたものである。According to a fourth aspect of the present invention, there is provided a system control management method using error tracing, which has a configuration in which a program state before a program state change or a program state after a program state change is stored as attribute information in a trace buffer. It is a thing.

【００４３】請求項５の発明に係るエラートレースによ
るシステム制御管理方法は、書き込みバッファやキャッ
シュなどの記憶装置を使用しているときには、エラーが
発生すると前記記憶装置内のデータを固定し、プログラ
ムの状態が変化する前のプログラム状態あるいは変化し
た後のプログラム状態などのトレース用バッファに格納
された属性情報を検索し、エラー発生源を認定する構成
を備えたものである。According to a fifth aspect of the present invention, there is provided a system control management method using error trace, wherein when a storage device such as a write buffer or a cache is used, when an error occurs, the data in the storage device is fixed and the program is stored. The attribute information stored in the trace buffer such as the program state before the state change or the program state after the state change is searched for, and the error source is identified.

【００４４】請求項６の発明に係るエラートレースによ
るシステム制御管理方法は、トレース用バッファをサイ
クリックに使用してプログラムの状態変化を示す属性情
報を記憶する構成を備えたものである。An error tracing system control management method according to a sixth aspect of the present invention comprises a structure for cyclically using a trace buffer to store attribute information indicating a change in the state of a program.

【００４５】請求項７の発明に係るエラートレースによ
るシステム制御管理方法は、オペレーティングシステム
によりプログラムの使用状況に応じて作成管理されたあ
るいは予め定義されてなるプログラムの管理テーブルへ
のポインタ、または前記管理テーブルのコピーまたはプ
ログラムの識別子の属性情報をトレース用バッファに記
憶する構成を備えたものである。According to a seventh aspect of the present invention, there is provided a system control management method using error tracing, which is a pointer to a management table of a program which is created and managed by the operating system according to the usage status of the program or is defined in advance, or the management. It is provided with a structure for storing a copy of a table or attribute information of a program identifier in a trace buffer.

【００４６】請求項８の発明に係るエラートレースによ
るシステム制御管理方法は、プログラムの状態変化が発
生した時刻またはプログラムの状態が変化してからの経
過時間などに関しての属性情報をトレース用バッファに
記憶する構成を備えたものである。In the system control management method by error tracing according to an eighth aspect of the present invention, the attribute information about the time when the program state change occurs or the elapsed time after the program state change is stored in the trace buffer. It is equipped with a configuration.

【００４７】請求項９の発明に係るエラートレースによ
るシステム制御管理方法は、エラーの検出時刻と発生し
たエラーが検出されるのに要したエラー検出時間から実
際にエラーの発生したエラー発生時刻を求め、さらにト
レース用バッファに記憶したプログラムの状態変化が発
生した時刻またはプログラムの状態が変化してからの経
過時間などの属性情報により、前記求めたエラー発生時
刻に実行されていたプログラムを検索して求め、エラー
発生源を認定する構成を備えたものである。In the system control management method based on the error trace according to the ninth aspect of the present invention, the error occurrence time at which the error actually occurs is calculated from the error detection time and the error detection time required to detect the error. Further, by searching attribute programs such as the time when the program state change stored in the trace buffer or the elapsed time after the program state change, the program executed at the obtained error occurrence time is searched. It has a configuration for asking for and certifying the error source.

【００４８】請求項１０の発明に係るエラートレースに
よるシステム制御管理方法は、発生したエラーの種類に
応じたエラーを検出するのに要するエラー検出時間のリ
ストをあらかじめ求めておき、発生したエラーの種類に
応じたエラー検出時間を前記リストから検索して求め、
実際のエラーの検出時刻と前記リストより求めたエラー
検出時間からエラーの種類に応じた実際のエラー発生時
刻を求め、さらに属性情報としてトレース用バッファに
記憶したプログラムの状態変化が発生した時刻またはプ
ログラムの状態が変化してからの経過時間の情報によ
り、前記求めたエラー発生時刻に実行されていたプログ
ラムを検索して求め、エラー発生源を認定する構成を備
えたものである。In the system control management method based on the error trace according to the tenth aspect of the invention, a list of error detection times required to detect an error corresponding to the type of error that has occurred is obtained in advance, and the type of error that has occurred. The error detection time according to
The actual error occurrence time according to the type of error is obtained from the actual error detection time and the error detection time obtained from the list, and the time or program in which the state change of the program stored in the trace buffer as attribute information occurs According to the information of the elapsed time from the change of the state of (1), the program executed at the obtained error occurrence time is searched for and obtained, and the error generation source is identified.

【００４９】請求項１１の発明に係るエラートレースに
よるシステム制御管理方法は、プログラムが使用してい
る資源のリストを属性情報としてトレース用バッファに
格納する構成を備えたものである。The system control management method by error tracing according to the eleventh aspect of the present invention has a configuration for storing a list of resources used by a program as attribute information in a trace buffer.

【００５０】請求項１２の発明に係るエラートレースに
よるシステム制御管理方法は、トレース用バッファへの
プログラムの状態変化を示す属性情報の格納をプロセス
間のコンテクストスイッチの発生を基に行う構成を備え
たものである。According to the twelfth aspect of the present invention, there is provided a system control management method using error tracing, which comprises a structure in which attribute information indicating a change in the state of a program is stored in a trace buffer based on the occurrence of a context switch between processes. It is a thing.

【００５１】請求項１３の発明に係るエラートレースに
よるシステム制御管理方法は、トレース用バッファへの
プログラムの状態変化に関する属性情報の格納をＣＰＵ
の実行状態の遷移を基に行う構成を備えたものである。According to a thirteenth aspect of the present invention, there is provided a system control management method based on error tracing, wherein attribute information relating to a change in the state of a program is stored in a trace buffer by a CPU
The configuration is performed based on the transition of the execution state of.

【００５２】請求項１４の発明に係るエラートレースに
よるシステム制御管理方法は、トレース用バッファへの
プログラムの状態変化に関しての属性情報の格納を、Ｃ
ＰＵの実行状態がユーザプログラムからシステムプログ
ラムへ、あるいはシステムプログラムからユーザプログ
ラムへの遷移を基に行う構成を備えたものである。In the system control management method by error tracing according to the fourteenth aspect of the present invention, the attribute information regarding the change of the state of the program in the trace buffer is stored in C
The configuration is such that the execution state of the PU is based on the transition from the user program to the system program or from the system program to the user program.

【００５３】請求項１５の発明に係るエラートレースに
よるシステム制御管理方法は、ＣＰＵの実行状態が非特
権レベルから特権レベルへ、あるいは特権レベルから非
特権レベルへの遷移を基に、トレース用バッファへプロ
グラムの状態変化に関しての属性情報の格納を行う構成
を備えたものである。According to the fifteenth aspect of the present invention, there is provided a system control management method using error tracing, wherein the CPU execution state is changed to a trace buffer based on a transition from a non-privilege level to a privilege level or from a privilege level to a non-privilege level. It is provided with a configuration for storing attribute information regarding a change in the state of a program.

【００５４】請求項１６の発明に係るエラートレースに
よるシステム制御管理方法は、割り込み処理の起動ある
いは終了したタイミング、または割り込み処理の起動お
よび終了したタイミング、または割り込み処理中の任意
の時間タイミングでトレース用バッファへプログラムの
状態変化に関しての属性情報の格納を行う構成を備えた
ものである。According to a sixteenth aspect of the present invention, there is provided a system control management method based on error tracing, which is used for tracing at a timing at which interrupt processing is started or ended, at a timing at which interrupt processing is started and ended, or at an arbitrary timing during interrupt processing. The buffer is provided with a configuration for storing attribute information relating to a change in the state of the program.

【００５５】請求項１７の発明に係るエラートレースに
よるシステム制御管理方法は、入出力装置における入出
力動作終了時または入出力動作中のエラー発生またはハ
ードウェアエラーまたは例外を割り込み要因とする割り
込み処理の起動あるいは終了したタイミング、または前
記割り込み処理の起動および終了したタイミング、また
は前記割り込み処理中の任意の時間タイミングでトレー
ス用バッファへプログラムの状態変化に関しての属性情
報の格納を行う構成を備えたものである。According to a seventeenth aspect of the present invention, there is provided a system control management method based on error tracing, which is an interrupt process which uses an error occurrence at the end of an input / output operation in an input / output device or during an input / output operation, a hardware error or an exception as an interrupt factor. It is provided with a configuration for storing attribute information regarding a change in the state of a program in a trace buffer at the timing of activation or termination, the timing of activation and termination of the interrupt processing, or an arbitrary time timing during the interrupt processing. is there.

【００５６】請求項１８の発明に係るエラートレースに
よるシステム制御管理方法は、ユーザプログラムからシ
ステムプログラムへの遷移が行われるシステムコールが
起動されるタイミングあるいはシステムコールからの終
了処理が実行されるタイミング、または前記システムコ
ールが起動されるタイミングおよびシステムコールから
の終了処理が実行されるタイミング、またはシステムコ
ールの処理中の任意の時間タイミングを基にトレース用
バッファへプログラムの状態変化に関しての属性情報の
格納を行う構成を備えたものである。According to the eighteenth aspect of the present invention, there is provided a system control management method using error tracing, in which a system call for transition from a user program to a system program is activated or a termination process is executed from the system call. Alternatively, the attribute information relating to the change of the program state is stored in the trace buffer based on the timing at which the system call is started and the timing at which the termination processing from the system call is executed, or the arbitrary timing during the processing of the system call. It is equipped with a configuration for performing.

【００５７】請求項１９の発明に係るエラートレースに
よるシステム制御管理方法は、アプリケーションプログ
ラムに提供されたインターフェースを用いてプログラム
の状態変化に関しての属性情報をトレース情報として格
納する構成を備えたものである。According to a nineteenth aspect of the present invention, there is provided a system control management method using error tracing, which has a structure for storing attribute information regarding a change in the state of a program as trace information by using an interface provided to an application program. .

【００５８】請求項２０の発明に係るエラートレースに
よるシステム制御管理方法は、プログラムカウンタ値，
ＣＰＵ状態値，時間要素，トレース事象番号，各種レジ
スタの内容，プログラム名，プロセス名，メッセージ，
アドレス空間識別子，プロセス識別子，プログラム識別
子，プロセス属性情報，プログラム属性情報，メッセー
ジキュー識別子，メッセージキュー属性情報のいずれか
あるいはそれらの組み合わせをインターフェースのパラ
メータとし、アプリケーションプログラムに提供された
インターフェースを用いてプログラムの状態変化に関し
ての属性情報をトレース情報として格納する構成を備え
たものである。According to a twentieth aspect of the present invention, there is provided a system control management method using an error trace, wherein a program counter value,
CPU status value, time element, trace event number, contents of various registers, program name, process name, message,
A program using the interface provided to the application program, using any one of address space identifier, process identifier, program identifier, process attribute information, program attribute information, message queue identifier, message queue attribute information or a combination thereof as an interface parameter. The attribute information about the state change of is stored as trace information.

【００５９】請求項２１の発明に係るエラートレースに
よるシステム制御管理方法は、予め指定された要因が発
生したときにのみ、トレース用バッファへプログラムの
状態変化に関しての属性情報の格納を行う構成を備えた
ものである。The system control management method by error tracing according to the twenty-first aspect of the present invention is provided with a structure for storing the attribute information relating to the change of the program state in the trace buffer only when a prespecified factor occurs. It is a thing.

【００６０】請求項２２の発明に係るエラートレースに
よるシステム制御管理方法は、割り込みの種類、システ
ムコールの種類、プログラムまたはプロセスの識別子、
プログラム名またはプロセス名または関数名、特定のア
ドレス領域に対する書き込みや読み出しおよび実行など
の予め指定された要因が発生したときにのみ、トレース
用バッファへプログラムの状態変化に関しての属性情報
の格納を行う構成を備えたものである。According to a twenty-second aspect of the present invention, there is provided an error trace system control management method, including an interrupt type, a system call type, a program or process identifier,
A configuration that stores the attribute information regarding the program state change in the trace buffer only when a preset factor such as program name, process name, function name, writing to, reading from, or executing a specific address area occurs. It is equipped with.

【００６１】請求項２３の発明に係るエラートレースに
よるシステム制御管理方法は、プログラムの状態変化に
関しての属性情報をハードウェアの制御回路内に設けら
れたトレース用バッファに記憶する構成を備えたもので
ある。The system control management method by error tracing according to the twenty-third aspect of the present invention is provided with a structure for storing the attribute information regarding the change of the state of the program in the trace buffer provided in the hardware control circuit. is there.

【００６２】請求項２４の発明に係るエラートレースに
よるシステム制御管理方法は、ＣＰＵと主メモリとのイ
ンターフェース回路、外部バス制御回路、ローカルバス
制御回路、入出力制御回路、出力メモリなどのハードウ
ェアに設けられたトレース用バッファにプログラムの状
態変化に関しての属性情報を記憶する構成を備えたもの
である。According to a twenty-fourth aspect of the present invention, there is provided a system control management method using error tracing, which is applied to hardware such as an interface circuit between a CPU and a main memory, an external bus control circuit, a local bus control circuit, an input / output control circuit and an output memory. The trace buffer provided is provided with a structure for storing attribute information regarding a change in the state of the program.

【００６３】請求項２５の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムの種類に応じて、前記認定されたプログラ
ムの実行を停止させたりシステムを停止させ、あるいは
前記プログラムの実行を延期させてエラー原因が回復し
た時点で処理を再開させ継続させることでシステムを制
御し管理する構成を備えたものである。According to a twenty-fifth aspect of the present invention, there is provided a system control management method using error tracing, which stops the execution of the certified program or stops the system, depending on the type of the program certified as the error source. The system is configured to control and manage the system by suspending the execution of the program and resuming and continuing the process when the error cause is recovered.

【００６４】請求項２６の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムがユーザプログラムまたはユーザプロセス
の場合に、前記ユーザプログラムまたはユーザプロセス
が使用している資源をそのシステムの管理を実行してい
るシステム管理手段に返却することでシステムを制御し
管理する構成を備えたものである。According to a twenty-sixth aspect of the system control management method by error tracing, when the program identified as the error source is a user program or a user process, the resources used by the user program or the user process are The system is controlled and managed by returning it to the system management means executing the management of the system.

【００６５】請求項２７の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムが使用していたメモリ領域または各種の管
理用テーブルまたは入出力装置などの資源を夫々の空き
資源管理プールに返却することでシステムを制御し管理
する構成を備えたものである。In the system control management method by error tracing according to the twenty-seventh aspect of the present invention, resources such as a memory area or various management tables or input / output devices used by a program recognized as an error generation source are allocated. It has a configuration that controls and manages the system by returning it to the free resource management pool.

【００６６】請求項２８の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムでエラーを発生させたメモリページまたは
その他のエラーを発生させた資源を、夫々の空き資源管
理プールに返却しないことでシステムを制御し管理する
構成を備えたものである。According to the 28th aspect of the present invention, there is provided a system control management method using error tracing, wherein a memory page in which an error has occurred in a program recognized as an error source or other resource in which an error has occurred is set as an empty resource. It has a configuration that controls and manages the system by not returning it to the management pool.

【００６７】請求項２９の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムの種類がオペレーティングシステムなどの
システムプログラムであるときには、システム全体の停
止処理を行うことでシステムを制御し管理する構成を備
えたものである。According to a twenty-ninth aspect of the present invention, there is provided a system control management method by error tracing, wherein when the type of the program recognized as the error source is a system program such as an operating system, the system is stopped to perform the system stop processing. It has a configuration for controlling and managing the.

【００６８】請求項３０の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムがユーザプログラムまたはユーザプロセス
の場合に、前記ユーザプログラムまたはユーザプロセス
とそれらに関連する関連プログラムまたはプロセス群の
処理を停止させると共に他の処理については継続実行さ
せることでシステムを制御し管理する構成を備えたもの
である。According to a thirtieth aspect of the present invention, there is provided a system control management method using error tracing, wherein when the program recognized as an error source is a user program or a user process, the user program or the user process and related programs related thereto. Alternatively, the system is controlled and managed by stopping the processing of the process group and continuously executing other processing.

【００６９】請求項３１の発明に係るエラートレースに
よるシステム制御管理方法は、エラー発生源と認定され
たプログラムでエラーを発生した資源を代替資源と入れ
替えることでエラーからの回復を行い、エラー原因が回
復した時点で処理を再開させ継続させることでシステム
を制御し管理する構成を備えたものである。In the system control management method by error tracing according to the thirty-first aspect of the present invention, the resource which has an error in the program recognized as the error source is replaced with the alternative resource to recover from the error, and the cause of the error is The system is configured to control and manage the system by restarting and continuing the process at the time of recovery.

【００７０】[0070]

【作用】請求項１の発明におけるエラートレースによる
システム制御管理方法は、トレース用バッファにプログ
ラムの状態変化を示す属性情報をトレース情報として記
憶しておき、記憶したプログラムの状態変化を示す属性
情報によりエラー発生元の認定を正確かつ容易にする。According to another aspect of the present invention, there is provided a system control management method based on error tracing, wherein attribute information indicating a program state change is stored in a trace buffer as trace information, and the stored attribute information indicating a program state change is used. Accurate and easy certification of error sources.

【００７１】請求項２の発明におけるエラートレースに
よるシステム制御管理方法は、トレース用バッファにプ
ログラムの状態変化を示す属性情報を発生時間順に記憶
し、この記憶した属性情報を発生時間順に進みあるいは
戻り検索することでエラー発生元の認定を正確かつ容易
にする。In the system control management method based on the error trace according to the second aspect of the present invention, the attribute information indicating the state change of the program is stored in the trace buffer in the order of occurrence time, and the stored attribute information is advanced or returned in the order of occurrence time. By doing so, it is possible to accurately and easily identify the error source.

【００７２】請求項３の発明におけるエラートレースに
よるシステム制御管理方法は、トレース用バッファにプ
ロセスがどの資源を使用しているかを知ることのできる
属性情報を記憶しておき、この属性情報を検索し、検索
した属性情報を基に前記プロセスがどの資源を使用して
いるかを知ることで、エラーの発生源が前記プロセスの
所有している資源であるときには当該資源を使用してい
るプログラムをエラー発生源として認定する。In the system control management method based on the error trace according to the third aspect of the present invention, attribute information capable of knowing which resource a process is using is stored in the trace buffer, and this attribute information is searched. By knowing which resource the process uses based on the retrieved attribute information, if the source of the error is the resource owned by the process, an error occurs in the program using the resource. Certified as a source.

【００７３】請求項４の発明におけるエラートレースに
よるシステム制御管理方法は、プログラムの状態が変化
する前のプログラム状態あるいは変化した後のプログラ
ム状態を属性情報としてトレース用バッファに記憶し、
記憶した属性情報によりエラー発生源の認定を正確かつ
容易にする。In the system control management method by error tracing according to the invention of claim 4, the program state before the change of the program state or the program state after the change is stored in the trace buffer as attribute information,
Accurate and easy recognition of error source by stored attribute information.

【００７４】請求項５の発明におけるエラートレースに
よるシステム制御管理方法は、エラーが発生すると書き
込みバッファやキャッシュなどの記憶装置内のデータを
固定し、実行されているプログラムの状態が変化する前
のプログラム状態あるいは変化した後のプログラム状態
などのトレース用バッファに格納された属性情報を検索
し、エラー発生源を正確かつ容易に認定する。In the system control management method based on the error trace of the fifth aspect of the invention, when an error occurs, the data in the storage device such as the write buffer and the cache is fixed, and the program before execution changes the state of the program. The attribute information stored in the trace buffer, such as the state or the program state after the change, is searched to accurately and easily identify the error source.

【００７５】請求項６の発明におけるエラートレースに
よるシステム制御管理方法は、エラー発生時のプログラ
ムの状態変化を示すト属性情報をトレース用バッファを
サイクリックに使用して記憶することでトレース用バッ
ファの有効利用を実現すると共にエラー発生源を正確か
つ容易に認定する。According to a sixth aspect of the present invention, there is provided a system control management method based on an error trace, in which trace attribute information indicating a change in the state of a program when an error occurs is stored by cyclically using the trace buffer. Accurately and easily certify the error source while achieving effective utilization.

【００７６】請求項７の発明におけるエラートレースに
よるシステム制御管理方法は、オペレーティングシステ
ムによりプログラムの使用状況に応じて作成管理された
り予め定義されてなるプログラムの管理テーブルへのポ
インタ、または前記管理テーブルのコピーまたはプログ
ラムの識別子などを属性情報としてトレース用バッファ
に記憶し、この記憶した属性情報を検索してエラー発生
源を正確かつ容易に認定する。According to a seventh aspect of the present invention, there is provided a system control management method based on error tracing, which is a pointer to a management table of a program which is created and managed by the operating system according to the usage status of the program or which is defined in advance, or the management table. A copy or program identifier is stored as attribute information in the trace buffer, and the stored attribute information is searched to accurately and easily identify the error source.

【００７７】請求項８の発明におけるエラートレースに
よるシステム制御管理方法は、プログラムの状態変化が
発生した時刻またはプログラムの状態が変化してからの
経過時間を属性情報としてトレース用バッファに記憶し
て、この記憶した属性情報を検索してエラー発生源を正
確かつ容易に認定する。In the system control management method based on the error trace according to the invention of claim 8, the time when the program state change occurs or the elapsed time after the program state change is stored in the trace buffer as attribute information, The stored attribute information is searched to accurately and easily identify the error source.

【００７８】請求項９の発明におけるエラートレースに
よるシステム制御管理方法は、エラーの検出時刻とエラ
ー検出時間から求められた実際にエラーの発生したエラ
ー発生時刻と、トレース用バッファに記憶したプログラ
ムの状態変化が発生した時刻またはプログラムの状態が
変化してからの経過時間などの属性情報とにより、前記
エラー発生時刻に実行されていたプログラムを検索して
求め、エラー発生源を正確かつ容易に認定することを可
能にする。According to the ninth aspect of the present invention, there is provided a system control management method based on error tracing, wherein an error detection time, an error occurrence time at which an error actually occurs, which is obtained from the error detection time, and a state of the program stored in the trace buffer. Using the attribute information such as the time when the change occurred or the time elapsed since the state of the program changed, the program that was being executed at the time when the error occurred is searched for and obtained, and the error source is accurately and easily identified. To enable that.

【００７９】請求項１０の発明におけるエラートレース
によるシステム制御管理方法は、発生したエラーの種類
に応じてリストから求められたエラーを検出するのに要
するエラー検出時間と、エラーの検出時刻とからエラー
の種類に応じた実際のエラー発生時刻を求め、さらにト
レース用バッファに記憶したプログラムの状態変化が発
生した時刻またはプログラムの状態が変化してからの経
過時間などの属性情報により、前記求めたエラー発生時
刻に実行されていたプログラムを検索して求め、エラー
発生源であるプログラムを正確かつ容易に認定すること
を可能にする。According to a tenth aspect of the present invention, there is provided a system control management method based on error tracing, wherein an error is detected from an error detection time required to detect an error obtained from a list according to the type of error that has occurred and an error detection time. The actual error occurrence time according to the type of the error is obtained, and the error obtained by the attribute information such as the time when the program state change stored in the trace buffer occurs or the elapsed time after the program state changes The program that was being executed at the time of occurrence can be searched and obtained, and the program that is the source of the error can be accurately and easily certified.

【００８０】請求項１１の発明におけるエラートレース
によるシステム制御管理方法は、プログラムが使用して
いる資源のリストを属性情報としてトレース用バッファ
に格納し、この属性情報によりエラー発生源を正確かつ
容易に認定することを可能にする。In the system control management method by error tracing according to the invention of claim 11, the list of resources used by the program is stored in the trace buffer as attribute information, and the attribute information makes it possible to accurately and easily identify the error source. Allows you to certify.

【００８１】請求項１２の発明におけるエラートレース
によるシステム制御管理方法は、プロセス間のコンテク
ストスイッチの発生を基にトレース用バッファへプログ
ラムの状態変化を示す属性情報を格納し、この属性情報
を基にエラー発生源を正確かつ容易に認定することを可
能にする。According to the twelfth aspect of the present invention, there is provided a system control management method based on error tracing, in which attribute information indicating a program state change is stored in a trace buffer based on the occurrence of a context switch between processes, and based on this attribute information. Enables accurate and easy identification of error sources.

【００８２】請求項１３の発明におけるエラートレース
によるシステム制御管理方法は、ＣＰＵの実行状態の遷
移を基にトレース用バッファへのプログラムの状態変化
を示す属性情報の格納を行うことで、エラー発生源を正
確かつ容易に認定することを可能にする。According to a thirteenth aspect of the present invention, there is provided a system control management method based on an error trace, in which attribute information indicating a change in the state of a program is stored in a trace buffer on the basis of a transition of an execution state of a CPU. Allows for accurate and easy certification.

【００８３】請求項１４の発明におけるエラートレース
によるシステム制御管理方法は、ＣＰＵの実行状態がプ
ログラムの状態変化としてユーザプログラムからシステ
ムプログラムへ遷移し、あるいはシステムプログラムか
らユーザプログラムへ遷移することでトレース用バッフ
ァへのプログラムの状態変化を示す属性情報の格納を行
い、エラー発生源を正確かつ容易に認定することを可能
にする。According to the system control management method by error tracing in the fourteenth aspect of the present invention, the execution state of the CPU is traced by changing the state of the program from the user program to the system program or from the system program to the user program. By storing the attribute information indicating the program state change in the buffer, it is possible to identify the error source accurately and easily.

【００８４】請求項１５の発明におけるエラートレース
によるシステム制御管理方法は、プログラムの状態変化
としてＣＰＵの実行状態が非特権レベルから特権レベル
へ、あるいは特権レベルから非特権レベルへ遷移するこ
とでトレース用バッファへのプログラムの状態変化を示
す属性情報の格納を行い、エラー発生源を正確かつ容易
に認定することを可能にする。In the system control management method by error tracing according to the fifteenth aspect of the present invention, for the tracing, the CPU execution state transitions from the non-privileged level to the privileged level or from the privileged level to the non-privileged level as a change in the state of the program. By storing the attribute information indicating the program state change in the buffer, it is possible to identify the error source accurately and easily.

【００８５】請求項１６の発明におけるエラートレース
によるシステム制御管理方法は、割り込み処理の起動あ
るいは終了したタイミング、またはその両方、または割
り込み処理中の任意の時間タイミングでトレース用バッ
ファへのプログラムの状態変化を示す属性情報の格納を
行い、エラー発生源を正確かつ容易に認定することを可
能にする。According to the sixteenth aspect of the present invention, there is provided a system control management method based on error tracing, in which the state of the program is changed to the trace buffer at the timing at which the interrupt processing is activated or terminated, or both, or at an arbitrary time timing during the interrupt processing. By storing the attribute information indicating the, it is possible to identify the error source accurately and easily.

【００８６】請求項１７の発明におけるエラートレース
によるシステム制御管理方法は、入出力装置における入
出力動作終了時または入出力動作中のエラー発生または
ハードウェアエラーまたは例外を割り込み要因とする割
り込み処理の起動あるいは終了したタイミング、または
前記割り込み処理の起動および終了したタイミング、ま
たは前記割り込み処理中の任意の時間タイミングでトレ
ース用バッファへのプログラムの状態変化を示す属性情
報の格納を行い、エラー発生源を正確かつ容易に認定す
ることを可能にする。According to a seventeenth aspect of the present invention, there is provided a system control management method based on error tracing, wherein an interrupt process is started when an input / output operation of an input / output device ends or an error occurs during the input / output operation or a hardware error or an exception is an interrupt factor. Alternatively, the attribute information indicating the program state change is stored in the trace buffer at the timing of completion, at the timing of starting and ending the interrupt processing, or at any timing during the interrupt processing, and the error source is accurately identified. And allow easy certification.

【００８７】請求項１８の発明におけるエラートレース
によるシステム制御管理方法は、ユーザプログラムから
システムプログラムへの遷移が行われるシステムコール
が起動されるタイミングあるいはシステムコールからの
終了処理が実行されるタイミング、または前記システム
コールが起動されるタイミングおよびシステムコールか
らの終了処理が実行されるタイミング、またはシステム
コールの処理中の任意の時間タイミングでプログラムの
状態変化を示す属性情報の格納を行い、エラー発生源を
正確かつ容易に認定することを可能にする。According to the eighteenth aspect of the present invention, there is provided a system control management method using error tracing, in which a system call for transition from a user program to a system program is activated or a termination process is executed from the system call, or The attribute information indicating the state change of the program is stored at the timing when the system call is started and the timing when the termination processing from the system call is executed, or at an arbitrary time during the processing of the system call, and the error source is stored. Allows for accurate and easy certification.

【００８８】請求項１９の発明におけるエラートレース
によるシステム制御管理方法は、アプリケーションプロ
グラムに提供されたインターフェースを用いてプログラ
ムの属性情報をトレース用バッファに格納し、エラー発
生源を正確かつ容易に認定することを可能にする。In the system control management method by error tracing according to the nineteenth aspect of the present invention, the attribute information of the program is stored in the trace buffer by using the interface provided to the application program, and the error source is accurately and easily identified. To enable that.

【００８９】請求項２０の発明におけるエラートレース
によるシステム制御管理方法は、プログラムカウンタ
値，ＣＰＵ状態値，時間要素，トレース事象番号，各種
レジスタの内容，プログラム名，プロセス名，メッセー
ジ，アドレス空間識別子，プロセス識別子，プログラム
識別子，プロセス属性情報，プログラム属性情報，メッ
セージキュー識別子，メッセージキュー属性情報のいず
れかあるいはそれらの組み合わせをインターフェースの
パラメータとし、アプリケーションプログラムに提供さ
れた前記インターフェースを用いてプログラムの属性情
報をトレース用バッファに格納し、エラー発生元の認定
に必要なデータを効率良く収集してエラー発生源を正確
かつ容易に認定することを可能にする。A system control management method by error tracing according to the invention of claim 20 is a program counter value, CPU state value, time element, trace event number, contents of various registers, program name, process name, message, address space identifier, Any one or a combination of a process identifier, a program identifier, a process attribute information, a program attribute information, a message queue identifier, a message queue attribute information is used as an interface parameter, and the attribute information of the program is provided by using the interface provided to the application program. Is stored in the trace buffer, and the data necessary for certifying the error source can be collected efficiently to certify the error source accurately and easily.

【００９０】請求項２１の発明におけるエラートレース
によるシステム制御管理方法は、予め指定された要因が
発生したときにのみ、トレース用バッファへプログラム
の状態変化を示す属性情報を格納し、エラー発生源を正
確かつ容易に認定することを可能にする。In the system control management method by error tracing according to the twenty-first aspect of the present invention, the attribute information indicating the program state change is stored in the trace buffer only when a prespecified factor occurs, and the error source is set. Allows for accurate and easy certification.

【００９１】請求項２２の発明におけるエラートレース
によるシステム制御管理方法は、割り込みの種類、シス
テムコールの種類、プログラムまたはプロセスの識別
子、プログラム名またはプロセス名または関数名、特定
のアドレス領域に対する書き込みや読み出し、実行など
の予め指定された要因が発生したときのプロセス間のコ
ンテクストスイッチの発生あるいはＣＰＵの実行状態の
遷移を基に、トレース用バッファへプログラムの状態変
化を示す属性情報を格納し、エラー発生源を正確かつ容
易に認定することを可能にする。According to the 22nd aspect of the present invention, there is provided a system control management method based on an error trace, which includes an interrupt type, a system call type, a program or process identifier, a program name or a process name or a function name, and a write or read to a specific address area. , Attribute information indicating program state change is stored in the trace buffer based on occurrence of context switch between processes or transition of CPU execution state when a predetermined factor such as execution occurs, and error occurs Allows the source to be accurately and easily certified.

【００９２】請求項２３の発明におけるエラートレース
によるシステム制御管理方法は、ハードウェアの制御回
路内に設けられたトレース用バッファにプログラムの状
態変化を示す属性情報を記憶し、記憶した属性情報によ
りエラー発生源の認定を正確かつ容易にする。In the system control management method by error trace according to the twenty-third aspect of the present invention, attribute information indicating a change in the state of the program is stored in the trace buffer provided in the hardware control circuit, and the stored attribute information causes an error. Accurate and easy source identification.

【００９３】請求項２４の発明におけるエラートレース
によるシステム制御管理方法は、ＣＰＵと主メモリとの
インターフェース回路、外部バス制御回路、ローカルバ
ス制御回路、入出力制御回路、出力メモリなどのハード
ウェアに設けられたトレース用バッファにプログラムの
状態変化を示す属性情報であるトレース情報を記憶し、
記憶したトレース情報によりエラー発生源の認定を正確
かつ容易にする。A system control management method by error tracing according to a twenty-fourth aspect of the present invention is provided in hardware such as an interface circuit between a CPU and a main memory, an external bus control circuit, a local bus control circuit, an input / output control circuit, and an output memory. The trace information, which is the attribute information indicating the state change of the program, is stored in the trace buffer,
Accurate and easy identification of error source by the stored trace information.

【００９４】請求項２５の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムの種類に応じて、前記認定されたプログ
ラムの実行を停止させ、あるいはシステムを停止させ、
あるいは前記プログラムの実行を延期させ、エラー原因
が回復した時点で前記プログラムの実行や前記システム
の動作を再開させ継続させ、発生したエラーに対するシ
ステムの信頼性を向上させる。According to the system control management method by error tracing in the twenty-fifth aspect of the invention, the execution of the certified program is stopped or the system is stopped according to the type of the program certified as the error source.
Alternatively, the execution of the program is postponed, and when the cause of the error is recovered, the execution of the program or the operation of the system is resumed and continued, thereby improving the reliability of the system with respect to the generated error.

【００９５】請求項２６の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムがユーザプログラムまたはユーザプロセ
スの場合に、前記ユーザプログラムまたはユーザプロセ
スが使用している資源をそのシステムの管理を実行して
いるシステム管理手段に返却し、有効資源の利用率を向
上させ、発生したエラーに対するシステムの信頼性を向
上させる。According to a twenty-sixth aspect of the present invention, in the system control management method by error tracing, when the program recognized as the error source is a user program or user process, the resources used by the user program or user process are The system is returned to the system management means executing the system management, the utilization rate of effective resources is improved, and the reliability of the system with respect to the generated error is improved.

【００９６】請求項２７の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムがユーザプログラムまたはユーザプロセ
スの場合に、前記ユーザプログラムまたはユーザプロセ
スが使用しているメモリ領域または各種の管理用テーブ
ルまたは入出力装置などの資源を夫々の空き資源管理プ
ールに返却し、有効資源の利用率を向上させ、発生した
エラーに対するシステムの信頼性を向上させる。According to a 27th aspect of the present invention, there is provided a system control management method based on an error trace, wherein when a program identified as an error source is a user program or a user process, a memory area used by the user program or the user process or Resources such as various management tables or input / output devices are returned to the respective free resource management pools, the utilization rate of effective resources is improved, and the reliability of the system against an error that occurs is improved.

【００９７】請求項２８の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムでエラーを発生させたメモリページまた
はその他のエラーを発生させた資源を、夫々の空き資源
管理プールに返却しないことでエラーを発生させた資源
の再使用によるエラー発生を防止し、発生したエラーに
対するシステムの信頼性を向上させる。In the system control management method based on the error trace according to the twenty-eighth aspect of the present invention, a program identified as an error generation source manages a memory page in which an error has occurred, or other resources in which an error has occurred, by managing each free resource. By not returning the error to the pool, it is possible to prevent the error from occurring due to the reuse of the resource that caused the error and improve the reliability of the system against the error.

【００９８】請求項２９の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムの種類がオペレーティングシステムなど
のシステムプログラムであるときには、システム全体の
停止処理を行い発生したエラーに対するシステムの信頼
性を向上させる。According to the 29th aspect of the present invention, in the system control management method based on the error trace, when the type of the program recognized as the error generation source is a system program such as an operating system, the entire system is stopped and an error occurs. Improve system reliability.

【００９９】請求項３０の発明におけるエラートレース
によるシステム制御管理方法は、ユーザプログラムまた
はユーザプロセスがエラー発生源と認定された場合に
は、前記ユーザプログラムまたはユーザプロセスとそれ
らに関連する関連プログラムまたはプロセス群の処理を
停止させると共に他の処理については継続実行させ、発
生したエラーに対するシステムの信頼性を向上させる。According to a thirtieth aspect of the present invention, there is provided a system control management method by error tracing, wherein when the user program or user process is recognized as an error source, the user program or user process and the related program or process related thereto. The processing of the group is stopped and the other processing is continuously executed to improve the reliability of the system with respect to the generated error.

【０１００】請求項３１の発明におけるエラートレース
によるシステム制御管理方法は、エラー発生源と認定さ
れたプログラムに応じて、前記認定されたプログラムの
実行を停止させ、エラー発生源と認定されたプログラム
でのエラーを発生した資源を代替資源と入れ替えること
でエラーからの回復を行い、エラー原因が回復した時点
で処理を再開させ継続させて発生したエラーに対するシ
ステムの信頼性を向上させる。According to a thirty-first aspect of the present invention, there is provided a system control management method based on error tracing, which stops the execution of the certified program according to the program certified as the error generation source, and executes the program certified as the error generation source. By replacing the resource in which the error has occurred with the alternative resource, recovery from the error is performed, and when the cause of the error is recovered, the process is restarted and continued to improve the reliability of the system with respect to the error.

【０１０１】[0101]

【Example】

実施例１．以下、請求項１の発明の一実施例を図につい
て説明する。図１は、本実施例のエラートレースによる
システム制御管理方法の構成を示すブロック図であり、
同図（イ）はシステムプログラムによりエラートレース
によるシステム制御管理方法を実現する場合のブロック
図、同図（ロ）はトレース処理用ハードウェアを用いた
エラートレースによるシステム制御管理方法の構成を示
す。図１の（イ）において２１はシステムプログラムま
たはユーザプログラムの実行処理を示し、２２はプログ
ラムの属性情報であるトレース情報が書き込まれるトレ
ース用バッファ、２３はエラーの発生元を判定するエラ
ー判定プログラムを示している。また、同図（ロ）にお
いて２２はユーザプログラム２１ａやシステムプログラ
ム２１ｂによりトレース処理の起動が行われることでプ
ログラムの状態変化に関しての属性情報を収集してトレ
ース用バッファ２４に格納する処理を実行するトレース
処理用ハードウェアである。Example 1. An embodiment of the invention of claim 1 will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a system control management method by error tracing according to this embodiment.
FIG. 9A is a block diagram when the system control management method by the error trace is realized by the system program, and FIG. 11B shows the configuration of the system control management method by the error trace using the trace processing hardware. In FIG. 1A, reference numeral 21 indicates an execution process of a system program or a user program, 22 is a trace buffer in which trace information that is attribute information of the program is written, and 23 is an error determination program for determining an error occurrence source. Shows. Further, in FIG. 9B, 22 executes a process of collecting the attribute information regarding the state change of the program and storing it in the trace buffer 24 by starting the trace process by the user program 21a and the system program 21b. This is the hardware for trace processing.

【０１０２】このエラートレースによるシステム制御管
理方法では、プログラムの状態変化が発生するときにプ
ロセスの属性情報をトレース用バッファメモリあるいは
メモリにおけるトレース用のバッファ領域に格納する。
そして、エラーが発生したときには、前記トレース用バ
ッファメモリあるいはトレース用のバッファ領域に格納
された属性情報を辿り、エラー判定プログラムがエラー
の発生元のプログラムを認定する。In the system control management method based on this error trace, the attribute information of the process is stored in the trace buffer memory or the trace buffer area in the memory when the program state change occurs.
When an error occurs, the error determination program recognizes the program that is the source of the error by tracing the attribute information stored in the trace buffer memory or the trace buffer area.

【０１０３】図２は、これら一連の動作を示すフローチ
ャートである。このフローチャートによれば、プログラ
ムの状態変化が発生すると（ステップＳＴ１）、プログ
ラムが状態変化を示したときのプログラムの属性情報を
トレース用バッファ２４に格納する（ステップＳＴ
２）。そして、起動されているエラー発生の検出処理機
能および検出したエラー発生に対するエラー処理機能に
より、エラーの発生が検出されると（ステップＳＴ
３）、次にエラーの原因調査が行われる（ステップＳＴ
４）。このエラーの原因調査は、エラー情報からエラー
原因を知り、トレース用バッファ２４からプログラムの
属性情報を読み出し（ステップＳＴ５）、この属性情報
からエラー発生元を調査・認定する（ステップＳＴ
６）。そして、前記エラー原因に応じて前記認定したエ
ラー発生元に対してエラー処理を行う（ステップＳＴ
７）。FIG. 2 is a flow chart showing a series of these operations. According to this flowchart, when a program state change occurs (step ST1), the attribute information of the program when the program shows the state change is stored in the trace buffer 24 (step ST).
2). Then, when an error occurrence is detected by the activated error occurrence detection processing function and the error processing function for the detected error occurrence (step ST
3) Next, the cause of the error is investigated (step ST
4). In investigating the cause of this error, the cause of the error is known from the error information, the attribute information of the program is read from the trace buffer 24 (step ST5), and the error source is investigated and certified from this attribute information (step ST).
6). Then, error processing is performed on the recognized error source according to the cause of the error (step ST
7).

【０１０４】この結果、本実施例によれば、トレース用
バッファあるいはトレース用バッファ領域として使用さ
れるメモリ領域がないときには全てのプロセスのテーブ
ルまたはその他の明示的あるいは暗示的なシステム内管
理情報を網羅的に検索してエラー発生元を認定するのに
対し、トレース用バッファあるいはトレース用バッファ
領域として使用されるメモリ領域にプログラムの状態変
化に関しての属性情報が格納されているのでこの属性情
報を基に効率的にエラー発生元のプログラムの認定を行
うことが可能となる。As a result, according to the present embodiment, when there is no trace buffer or memory area used as a trace buffer area, a table of all processes or other explicit or implicit system management information is covered. Whereas the error source is identified by performing a manual search, the attribute information related to the program status change is stored in the trace buffer or the memory area used as the trace buffer area. It is possible to efficiently certify the program of the error source.

【０１０５】実施例２．以下、請求項２の発明の一実施
例を図について説明する。図３は、トレース用バッファ
２４に格納されたプログラムの属性情報を示しており、
プログラムの属性情報１〜ｎはプログラムの状態変化の
発生時間順にトレース用バッファ２４に格納される。本
実施例のトレース用バッファ２４に格納されているプロ
グラムの属性情報は、プログラムの発生時間の流れに従
って昇順（順次時間の流れと逆に過去のプログラムの属
性情報を辿る）あるいは降順（順次時間の流れと同一の
方向にプログラムの属性情報を辿る）で検索する。Example 2. An embodiment of the invention of claim 2 will be described below with reference to the drawings. FIG. 3 shows the attribute information of the program stored in the trace buffer 24,
The program attribute information 1 to n are stored in the trace buffer 24 in the order of the time when the program state change occurs. The attribute information of the programs stored in the trace buffer 24 of the present embodiment is in ascending order (following the sequence of time, the attribute information of the past programs is traced in reverse order) or in descending order (sequential time of the program). Follow the attribute information of the program in the same direction as the flow).

【０１０６】従って、属性情報を発生時間順に格納する
ことで、時間順に新しいものまたは新しいものから順に
経時的変化に従って検索することが容易になってエラー
発生元の認定が効率的に行えるようになり、プログラム
の属性情報を基に効率的なエラー発生元のプログラムの
認定が可能となる。Therefore, by storing the attribute information in the order of time of occurrence, it becomes easy to retrieve new ones in time order or newest to oldest ones according to changes over time, and the error source can be efficiently identified. , It is possible to efficiently certify the program of the error source based on the attribute information of the program.

【０１０７】実施例３．以下、請求項３の発明の一実施
例を図について説明する。図４は、前記実施例１におい
て説明した図２のフローチャートのステップＳＴ６の詳
細なフローチャートである。本実施例では、エラーを発
生した資源を検出し、エラー発生源である資源からエラ
ー発生元であるプログラムを認定する。Example 3. An embodiment of the invention of claim 3 will be described below with reference to the drawings. FIG. 4 is a detailed flowchart of step ST6 of the flowchart of FIG. 2 described in the first embodiment. In the present embodiment, the resource in which an error has occurred is detected, and the program that is the error source is identified from the resource that is the error source.

【０１０８】これらのフローチャートによれば、ステッ
プＳＴ５においてトレース用バッファ２４から読み出さ
れたプログラムの属性情報よりプロセスが使用している
資源を確認する（ステップＳＴ６ａ）。すなわち、プロ
グラムの属性情報にはそのプロセスが使用している資源
などの情報が含まれている。また、この場合の資源と
は、そのプログラムが使用しているメモリ領域あるいは
入出力装置などである。次に、ステップＳＴ６ａにおい
て確認した資源情報を検出して（ステップＳＴ６ｂ）、
発生したエラー原因が前記検出した資源から発生したも
のであるか否かを判定する（ステップＳＴ６ｃ）。発生
したエラー原因が前記検出した資源から発生したもので
ないと判定されたときには、ステップＳＴ６ａに戻り、
ステップＳＴ６ｂからステップＳＴ６ｃまでの処理を、
発生したエラーの原因が前記検出した資源から発生した
ものであると判定されるまで繰り返す。ステップＳＴ６
ｃにおいて発生したエラー原因が前記検出した資源から
発生したものであると判定されたときには、そのプロセ
スをエラーの発生元であると認定する（ステップＳＴ６
ｄ）。According to these flowcharts, the resource used by the process is confirmed from the attribute information of the program read from the trace buffer 24 in step ST5 (step ST6a). That is, the attribute information of the program includes information such as resources used by the process. Further, the resource in this case is a memory area or an input / output device used by the program. Next, the resource information confirmed in step ST6a is detected (step ST6b),
It is determined whether or not the error cause that has occurred is caused by the detected resource (step ST6c). When it is determined that the cause of the error that has occurred is not caused by the detected resource, the process returns to step ST6a,
The processing from step ST6b to step ST6c
The process is repeated until it is determined that the cause of the error that has occurred is from the detected resource. Step ST6
When it is determined that the cause of the error that occurred in c is caused by the detected resource, the process is identified as the source of the error (step ST6).
d).

【０１０９】従って、本実施例でもプログラムの属性情
報を基に効率的にエラー発生元のプログラムの認定を行
うことが可能となる。Therefore, also in this embodiment, it is possible to efficiently certify the program of the error source based on the attribute information of the program.

【０１１０】実施例４．以下、請求項４の発明の一実施
例を図について説明する。図５は、本実施例のトレース
用バッファ２４に格納されたプログラムの属性情報を示
しており、プログラムの属性情報１〜ｎはプログラムの
状態変化の例えば発生時間順にトレース用バッファに格
納され、また格納されている属性情報は状態変化が発生
したときの一つ前のプログラムの属性情報である。この
場合、トレース用バッファ２４に格納されているプログ
ラムの属性情報は、プログラムの発生時間の流れに従っ
て昇順あるいは降順で検索してもよい。Example 4. An embodiment of the invention of claim 4 will be described below with reference to the drawings. FIG. 5 shows the attribute information of the program stored in the trace buffer 24 of this embodiment. The attribute information 1 to n of the program are stored in the trace buffer in the order of, for example, the time of occurrence of a change in the state of the program. The stored attribute information is the attribute information of the immediately preceding program when the state change occurs. In this case, the program attribute information stored in the trace buffer 24 may be searched in ascending or descending order according to the flow of the program generation time.

【０１１１】また、図６もトレース用バッファ２４に格
納されたプログラムの属性情報を示しており、プログラ
ムの属性情報１〜ｎはプログラムの状態変化の例えば発
生時間順にトレース用バッファ２４に格納され、また格
納されている属性情報は状態変化が発生したときのその
状態変化後のプログラムの属性情報である。この場合、
トレース用バッファに格納されているプログラムの属性
情報は、プログラムの発生時間の流れに従って昇順ある
いは降順で検索してもよい。FIG. 6 also shows the attribute information of the program stored in the trace buffer 24. The attribute information 1 to n of the program is stored in the trace buffer 24 in the order of, for example, the time of occurrence of a change in the state of the program. The stored attribute information is the attribute information of the program after the state change when the state change occurs. in this case,
The program attribute information stored in the trace buffer may be searched in ascending or descending order according to the flow of program generation time.

【０１１２】従って、本実施例では、プログラムの状態
変化によりトレース用バッファ２４に格納されたそのプ
ログラムの状態変化前あるいは状態変化後の属性情報を
基に効率的にエラー発生元のプログラムの認定を行うこ
とが可能となる。Therefore, in this embodiment, the program of the error occurrence source is efficiently identified based on the attribute information before or after the state change of the program stored in the trace buffer 24 due to the state change of the program. It becomes possible to do.

【０１１３】実施例５．以下、請求項５の発明の一実施
例について説明する。本実施例のエラートレースによる
システム制御管理方法では、ハードウェア構成が書き込
みバッファあるいはキャッシュを有している場合には、
エラー発生時に書き込みバッファを凍結してトレース用
バッファの検索を行う。ただし、ここではキャッシュま
たは書き込みバッファはＦＩＦＯ方式で動作するものと
している。すなわち、トレース用バッファに状態が変化
した後の新しいプログラムの状態を格納するものとした
場合、エラーを発生させる動作が行われた直後にプログ
ラムの状態変化があり、ＣＰＵは新しい状態に遷移した
後にエラーを検出すると、ＣＰＵは状態変化をトレース
用バッファに書き込む動作を行うが、このとき書き込み
バッファには状態変化をトレース用バッファに書き込む
動作が残っているので、図７のフローチャートに示すよ
うにこの時点で書き込みバッファを一時的に凍結させて
状態変化をトレース用バッファに書き込む動作を停止さ
せる。そして、トレース用バッファに格納されている属
性情報を検索して調べることでエラー発生元の認定を容
易にする。この場合のトレース用バッファに格納されて
いる属性情報の検索は、属性情報が発生時間順に格納さ
れている場合には、前記実施例２において説明した昇順
あるいは降順で検索するようにしてもよい。Example 5. An embodiment of the invention of claim 5 will be described below. In the system control management method by error trace of the present embodiment, when the hardware configuration has a write buffer or cache,
When an error occurs, the write buffer is frozen and the trace buffer is searched. However, here, the cache or the write buffer is assumed to operate in the FIFO method. In other words, if the state of a new program after the state change is stored in the trace buffer, the state of the program changes immediately after the operation that causes the error occurs, and the CPU changes to the new state. When an error is detected, the CPU writes the state change to the trace buffer. At this time, the write buffer still has the operation to write the state change to the trace buffer. Therefore, as shown in the flowchart of FIG. At this point, the write buffer is temporarily frozen to stop the operation of writing the state change to the trace buffer. Then, the attribute information stored in the trace buffer is searched and checked to facilitate the recognition of the error occurrence source. In this case, the attribute information stored in the trace buffer may be searched in ascending order or descending order described in the second embodiment when the attribute information is stored in the order of occurrence time.

【０１１４】実施例６．以下、請求項６の発明の一実施
例を図について説明する。図８は、本実施例のエラート
レースによるシステム制御管理方法におけるトレース用
バッファの構成を示す説明図である。図において３１は
リング状に構成されたトレース用バッファに最も新しく
格納される属性情報、３２〜３３はリング状に構成され
たトレース用バッファの夫々に古い順に格納された属性
情報である。Example 6. An embodiment of the invention of claim 6 will be described below with reference to the drawings. FIG. 8 is an explanatory diagram showing the configuration of the trace buffer in the system control management method by error tracing according to this embodiment. In the figure, 31 is the attribute information stored most recently in the ring-shaped trace buffer, and 32 to 33 are attribute information stored in the ring-shaped trace buffers in the order of oldness.

【０１１５】本実施例では、トレース用バッファあるい
はトレース用バッファ領域をリング状のリングバッファ
構成にして、トレース用バッファあるいはトレース用バ
ッファ領域をサイクリックに用いる。この結果、トレー
ス用バッファあるいはトレース用バッファ領域を広く設
定することなく、リングバッファが満杯になったときに
は最も古い情報を格納したバッファから繰り返し使用し
てトレース用バッファの使用効率を向上させることが可
能となる。In the present embodiment, the trace buffer or trace buffer area is formed into a ring ring structure, and the trace buffer or trace buffer area is used cyclically. As a result, it is possible to improve the usage efficiency of the trace buffer by repeatedly using the buffer that stores the oldest information when the ring buffer is full without setting the trace buffer or trace buffer area to a large value. Becomes

【０１１６】また、ポインタによる接続情報によりリン
グバッファを構成する以外に、配列内インデックスを使
用することで配列構成をリング状にしてリングバッファ
を構成することも可能である。また、リングバファを構
成している各バッファに格納される情報が有効であるか
否かを示すフラグや格納時間を示す情報を格納すること
も可能である。Further, instead of configuring the ring buffer by the connection information by the pointer, it is also possible to configure the ring buffer by using the index in the array to make the array configuration into a ring shape. It is also possible to store a flag indicating whether or not the information stored in each buffer constituting the ring buffer is valid and information indicating the storage time.

【０１１７】実施例７．なお、以上の実施例１〜実施例
６で説明した属性情報として、図９に示すようにプログ
ラムの管理テーブルへのポインタあるいは管理テーブル
のコピーの一部または全て、あるいはプロセスを識別す
るためのプロセスＩＤやタスクを識別するためのタスク
ＩＤなどのプログラムの識別子を他の情報と共にトレー
ス用バッファに格納するようにしてもよい。Example 7. As the attribute information described in the first to sixth embodiments, as shown in FIG. 9, a pointer to the management table of the program, a part or all of the copy of the management table, or a process for identifying the process The program identifier such as the ID or the task ID for identifying the task may be stored in the trace buffer together with other information.

【０１１８】さらにこの情報を資源毎に管理変数領域の
ビット位置で管理することも可能である。Further, it is possible to manage this information for each resource at the bit position of the management variable area.

【０１１９】実施例８．また、前記実施例１〜実施例６
で説明した属性情報として、プログラムの状態変化が発
生した時刻あるいはプログラムの状態が変化してからの
経過時間をトレース用バッファに格納するようにしても
よく、図１０は請求項８および請求項９記載のエラート
レースによるシステム制御管理方法の一実施例の特徴を
説明するための説明図である。同図（イ）は、エラーの
発生時刻Ｔ１とエラー発生からその発生したエラーがＣ
ＰＵにより認識されるまでの時間Ｔ２などを時間軸上に
示した説明図、同図（ロ）は同時に進行しているプロセ
ス実行の遷移を示す説明図である。Example 8. Moreover, the said Example 1-Example 6
As the attribute information described above, the time at which the program state change occurs or the elapsed time after the program state change may be stored in the trace buffer. It is explanatory drawing for demonstrating the characteristic of one Example of the system control management method by the described error trace. In the figure (a), the error occurrence time T1 and the error that has occurred since the error occurred are C
An explanatory diagram showing a time T2 and the like until it is recognized by the PU on a time axis, and FIG. 8B is an explanatory diagram showing transitions of process executions that are simultaneously proceeding.

【０１２０】図１０に基づいて本実施例の動作を説明す
る。トレース用バッファには、プログラムの状態変化が
発生した時刻あるいはプログラムの状態が変化してから
の経過時間が属性情報として格納されている。上述した
ように、エラーの発生時刻をＴ１とし、またエラー発生
からその発生したエラーがＣＰＵにより認識されるまで
に要する時間をＴ２とした場合、図１０の（イ）に示す
ように時刻Ｔｅすなわち（Ｔ１−Ｔ２）により示される
時刻に実行されているプログラムがエラーを発生させた
と認定できることになる。従って、図１０の（ロ）に示
すプロセス実行の遷移を示す説明図から、時刻Ｔｅでは
ユーザプログラムが実行されているのでエラーの発生元
はユーザプログラムにあると認定できる。The operation of this embodiment will be described with reference to FIG. The trace buffer stores, as attribute information, the time at which a program state change occurs or the elapsed time after the program state change. As described above, when the error occurrence time is T1 and the time required from the occurrence of the error until the error is recognized by the CPU is T2, as shown in (a) of FIG. It can be recognized that the program executed at the time indicated by (T1-T2) has caused an error. Therefore, from the explanatory diagram showing the transition of the process execution shown in (B) of FIG. 10, since the user program is being executed at time Te, it can be determined that the error source is the user program.

【０１２１】実施例９．また、前記エラー発生からその
発生したエラーがＣＰＵにより認識されるまでに要する
時間Ｔ２が、発生したエラーの種類に依存する場合に
は、あらかじめエラーの種類毎にそのエラーがＣＰＵに
より認識されるまでに要する時間Ｔ２を求め図１１に示
すようにテーブル化しておき、発生したエラー種類に応
じた適切な発生エラー認識時間Ｔ２を読み出してエラー
発生元を精度良く認定することもできる。Example 9. In addition, when the time T2 required from the occurrence of the error until the CPU recognizes the generated error depends on the type of the generated error, until the CPU recognizes the error for each type of error in advance. It is also possible to determine the time T2 required for the calculation and make a table as shown in FIG. 11, and read an appropriate occurrence error recognition time T2 according to the type of error that has occurred to accurately identify the error occurrence source.

【０１２２】実施例１０．また、図１２に示すようにト
レース用バッファにプログラムとそのプログラムが使用
している資源のリストを格納することで、エラー原因と
なった資源を使用しているプログラムをトレース用バッ
ファを検索することで知り、エラー発生元のプログラム
の設定を正確かつ容易に行うことが可能である。Example 10. Further, as shown in FIG. 12, by storing a list of programs and resources used by the programs in the trace buffer, it is possible to search the trace buffer for a program using the resource that has caused the error. Then, it is possible to accurately and easily set the error source program.

【０１２３】実施例１１．また、プログラムの属性情報
をトレース用バッファに格納するタイミングとしてプロ
セスのコンテクストスイッチを採用することも可能であ
り、本実施例では図２のステップＳＴ２においてあるプ
ロセスから別のプロセスへ切り替えが行われるタイミン
グでトレース用バッファへ属性情報を格納する。この結
果、トレース用バッファにはプロセスの実行順序に関し
ての情報が属性情報として格納されることになり、エラ
ーが発生したときにはトレース用バッファに格納された
属性情報を検索することで、エラー発生前に実行されて
いたプロセスの実行順序を容易に知ることができ、エラ
ー発生時に実行されていたプロセスを容易に認定するこ
とが可能になる。Example 11. It is also possible to employ a process context switch as the timing for storing the program attribute information in the trace buffer. In this embodiment, the timing at which one process is switched to another process in step ST2 of FIG. To store the attribute information in the trace buffer. As a result, information about the execution order of processes will be stored as attribute information in the trace buffer, and when an error occurs, the attribute information stored in the trace buffer will be searched before the error occurs. It is possible to easily know the execution order of the processes that have been executed, and it is possible to easily identify the process that was being executed when the error occurred.

【０１２４】以上の説明を図１０を用いてさらに具体的
に説明すると、プロセスの実行遷移（ここではプロセス
Ａ→プロセスＢ→プロセスＣ）が行われる毎にプロセス
の属性情報を時間情報あるいは時刻情報と共に格納する
ことにより、実際のエラー発生時刻がＴｅであることが
判明したときにはこの時刻Ｔｅにより、前記時刻情報と
共に格納されている属性情報から時刻Ｔｅに実行されて
いたプロセスはプロセスＢであることが容易に認定され
る。The above description will be more specifically described with reference to FIG. 10. Every time a process execution transition (process A.fwdarw.process B.fwdarw.process C) is performed, the process attribute information is changed to time information or time information. When it is determined that the actual error occurrence time is Te, the process executed at time Te based on the attribute information stored together with the time information is process B when the actual error occurrence time is found to be Te. Is easily certified.

【０１２５】実施例１２．以下、請求項１３〜請求項１
８の発明の一実施例について説明する。本実施例では、
図２のステップＳＴ２においてＣＰＵの実行レベルが変
化する毎にプログラムの状態の変化をトレース用バッフ
ァに格納する。すなわち、システムコールや特権レベル
の変更や割り込みなどが発生する毎にプログラムの属性
情報をトレース用バッファに格納することでエラー発生
元がユーザプログラムかシステムプログラムか割り込み
プログラムかのエラー発生元の認定が容易になる。Example 12. Hereinafter, claim 13 to claim 1
An embodiment of the invention of No. 8 will be described. In this embodiment,
In step ST2 of FIG. 2, each time the execution level of the CPU changes, the change in the state of the program is stored in the trace buffer. That is, by storing the program attribute information in the trace buffer each time a system call, privilege level change, or interrupt occurs, it is possible to identify the error source as to whether the error source is a user program, system program, or interrupt program. It will be easier.

【０１２６】図１０において、プロセスの実行遷移中の
（Ｓ）はシステムプログラムによるＣＰＵの動作状態、
（Ｕ）はユーザプログラムによるＣＰＵの動作状態を示
している。このＣＰＵの実行動作状態を時間情報と共に
トレース用バッファに格納することにより時刻Ｔｅにお
いてはＣＰＵはユーザプログラムによる実行状態にあ
り、時刻Ｔｅにおけるエラーの発生元はユーザプログラ
ムにあることが容易に認定できる。In FIG. 10, (S) during the execution transition of the process is the operating state of the CPU by the system program,
(U) shows the operating state of the CPU according to the user program. By storing the execution operation state of the CPU in the trace buffer together with the time information, it can be easily determined that the CPU is in the execution state by the user program at time Te and the source of the error at time Te is in the user program. .

【０１２７】またさらに、二つのタイミングでのプログ
ラムの状態の変化をトレース用バッファに格納すること
により、時間軸上でのプログラムの実行履歴（すなわ
ち、プロセスＡ「ユーザ状態」→プロセスＡ「システム
状態」→プロセスＢ「システム状態」→プロセスＢ「ユ
ーザ状態」のような実行履歴）が得られるようになり、
より確実なエラー発生元の認定が可能となる。Furthermore, by storing the change in the state of the program at two timings in the trace buffer, the execution history of the program on the time axis (that is, process A "user state"-> process A "system state" → process B “system state” → process B “user state” execution history),
It is possible to certify the error source more reliably.

【０１２８】また、割り込みなどが発生する毎にプログ
ラムの属性情報をトレース用バッファに格納する場合
に、入出力割り込みや例外、ハードウェアエラーなどを
割り込み要因とする割り込み発生毎にプログラムの属性
情報をトレース用バッファに格納し、さらにこれら割り
込み処理の起動あるいは終了したタイミング、またはそ
の両方、あるいは割り込み処理中の任意のタイミングで
プログラムの属性情報をトレース用バッファに格納する
ようにしてもよい。When the program attribute information is stored in the trace buffer each time an interrupt or the like occurs, the program attribute information is displayed each time an interrupt occurs, which causes an input / output interrupt, an exception, a hardware error, or the like. The attribute information of the program may be stored in the trace buffer, and further, the attribute information of the program may be stored in the trace buffer at the timing at which these interrupt processes are activated or ended, or both, or at any timing during the interrupt process.

【０１２９】実施例１３．以下、請求項１９および請求
項２０の発明の一実施例を図について説明する。図１３
は、本実施例のエラートレースによるシステム制御管理
方法の構成を示す概念図である。本実施例では、ユーザ
プログラムから明示的に属性情報をトレース用バッファ
に格納するためのアプリケーションプログラムによるソ
フトウェアインターフェースを提供するものである。図
において４１はユーザプログラム、４２はシステムプロ
グラム、４３はトレース用バッファに格納された属性情
報を元にエラー発生元を認定するエラー判定プログラ
ム、４４はユーザプログラムから明示的に属性情報をト
レース用バッファに格納するためのアプリケーションプ
ログラムによるソフトウェアインターフェースである。Example 13 An embodiment of the inventions of claims 19 and 20 will be described below with reference to the drawings. FIG.
FIG. 3 is a conceptual diagram showing a configuration of a system control management method by error tracing according to this embodiment. In the present embodiment, a software interface by an application program for explicitly storing attribute information in a trace buffer from a user program is provided. In the figure, 41 is a user program, 42 is a system program, 43 is an error judgment program for recognizing the error source based on the attribute information stored in the trace buffer, and 44 is a trace buffer for explicitly displaying attribute information from the user program. Is a software interface with an application program for storing in.

【０１３０】本実施例では、ユーザプログラムから明示
的に属性情報をトレース用バッファに格納するためのソ
フトウェアインターフェースが提供され、このソフトウ
ェアインターフェースによりユーザプログラム４１から
明示的に属性情報がトレース用バッファ２４に格納され
る。この場合、ユーザプログラムはソフトウェアインタ
ーフェース４４を介しての属性情報のトレース用バッフ
ァ２４への格納を明確に意識したものとなる。In this embodiment, a software interface for explicitly storing the attribute information in the trace buffer is provided from the user program, and this software interface allows the user program 41 to explicitly store the attribute information in the trace buffer 24. Is stored. In this case, the user program becomes clearly aware of storing the attribute information in the trace buffer 24 via the software interface 44.

【０１３１】また、ソフトウェアインターフェースのパ
ラメータとしては、プログラムカウンタの値，ＣＰＵの
状態値，時間要素，トレース事象番号，各種レジスタの
内容，プログラム名またはプロセス名，メッセージ，ア
ドレス空間識別子，プロセス識別子，プログラム識別
子，プロセス属性情報，プログラム属性情報，メッセー
ジキュー識別子，メッセージキュー属性情報などがあ
り、属性情報についてのこれらの情報がソフトウェアイ
ンターフェース４４を介してトレース用バッファ２４に
格納される。Further, as the parameters of the software interface, the value of the program counter, the state value of the CPU, the time element, the trace event number, the contents of various registers, the program name or the process name, the message, the address space identifier, the process identifier, the program There are identifiers, process attribute information, program attribute information, message queue identifiers, message queue attribute information, and the like, and these pieces of attribute information are stored in the trace buffer 24 via the software interface 44.

【０１３２】この結果、前記プログラムカウンタの値，
ＣＰＵの状態値，時間要素，トレース事象番号，各種レ
ジスタの内容，プログラム名またはプロセス名，メッセ
ージ，アドレス空間識別子，プロセス識別子，プログラ
ム識別子，プロセス属性情報，プログラム属性情報，メ
ッセージキュー識別子，メッセージキュー属性情報など
の属性情報のトレース用バッファ２４への格納をソフト
ウェアにより制御することが可能となり、エラー発生元
の認定に有効なデータを効率良くトレース用バッファ２
４に収集できることになる。As a result, the value of the program counter,
CPU status value, time element, trace event number, contents of various registers, program name or process name, message, address space identifier, process identifier, program identifier, process attribute information, program attribute information, message queue identifier, message queue attribute The storage of the attribute information such as information in the trace buffer 24 can be controlled by software, and the data effective for certifying the error occurrence source can be efficiently stored in the trace buffer 2.
It will be possible to collect in 4.

【０１３３】実施例１４．以下、請求項２１および請求
項２２の発明の一実施例について説明する。前記実施例
１１および実施例１２では、プログラムの属性情報をト
レース用バッファに格納するタイミングとしてプロセス
のコンテクストスイッチやＣＰＵの実行レベルが変化す
る毎に行うものとして説明したが、あらかじめ指定した
要因が発生したときのみ属性情報をトレース用バッファ
に格納するようにしてもよく、この場合のあらかじめ指
定する要因としては割り込みの種類、システムコールの
種類、プロセスまたはプログラムの識別子、プロセス名
またはプログラム名または関数名、特定のアドレス領域
の読み書きおよび実行などを前記要因とすることが可能
である。Example 14 An embodiment of the inventions of claims 21 and 22 will be described below. Although the eleventh and twelfth embodiments have been described as the timing of storing the attribute information of the program in the trace buffer each time the context switch of the process or the execution level of the CPU changes, a prespecified factor occurs. The attribute information may be stored in the trace buffer only when this is done. In this case, the factors to be specified in advance are interrupt type, system call type, process or program identifier, process name or program name or function name. The read / write and execution of a specific address area can be the factors.

【０１３４】実施例１５．以下、請求項２３および請求
項２４の発明の一実施例について説明する。本実施例で
は、トレース用バッファをハードウェアの制御回路内に
設定し、またこの制御回路としてはＣＰＵと主メモリの
インターフェース回路や外部バス制御回路やローカルバ
ス制御回路や入出力制御回路であり、また主メモリの所
定のエリアにトレース用バッファ領域を設定する。Example 15. An embodiment of the inventions of claims 23 and 24 will be described below. In this embodiment, the trace buffer is set in the hardware control circuit, and the control circuit is an interface circuit between the CPU and the main memory, an external bus control circuit, a local bus control circuit, or an input / output control circuit. Also, a trace buffer area is set in a predetermined area of the main memory.

【０１３５】図１４は、本実施例のエラートレースによ
るシステム制御管理方法の構成を示す概念図である。図
において５１はＣＰＵ、５２はＣＰＵ５１と主メモリ５
３との間のインターフェース、５４は外部バス制御回
路、５５はローカルバス制御回路、５６は入出力制御回
路である。FIG. 14 is a conceptual diagram showing the structure of a system control management method by error tracing according to this embodiment. In the figure, 51 is a CPU, 52 is a CPU 51 and a main memory 5.
3, 54 is an external bus control circuit, 55 is a local bus control circuit, and 56 is an input / output control circuit.

【０１３６】図１５は、主メモリ５３とＣＰＵ５１間の
インターフェース５２（以下、ＰＭＩ，Ｐｒｏｃｅｓｓ
ｏｒＭｅｍｏｒｙＩｎｔｅｒｆａｃｅという）にト
レース用バッファ（領域）を設けた場合の概念図であ
る。図１５において図１４と同一または相当の部分につ
いては同一の符号を付し説明を省略するが６０はキャッ
シュメモリ、６１は書き込みバッファ、６２はトレース
用バッファ（領域）である。FIG. 15 shows an interface 52 (hereinafter, PMI, Process) between the main memory 53 and the CPU 51.
or Memory Interface) is a conceptual diagram when a trace buffer (region) is provided. In FIG. 15, the same or corresponding portions as those in FIG. 14 are designated by the same reference numerals and the description thereof is omitted, but 60 is a cache memory, 61 is a write buffer, and 62 is a trace buffer (area).

【０１３７】ＣＰＵ５１は、ＰＭＩ５２を介してデータ
を外部に書き込む場合にキャッシュメモリ６０と書き込
みバッファ６１を使用して書き込みを行うことで書き込
み処理の高速化を実現している。すなわち、ＣＰＵ５１
の外部への書き込み動作はその書き込み対象がキャッシ
ュされる領域の場合には一度キャッシュメモリ６０へ格
納し、その後、書き込みバッファ６１へキューイングす
る。ＣＰＵ５１は、キャッシュメモリ６０への書き込み
を終了した段階で別の動作に制御を移行し、その後に前
記書き込み動作が実行されることになり、このためＣＰ
Ｕ５１は書き込みによる処理の遅れの影響を受けること
はない。キャッシュメモリ６０の内容は、書き込みバッ
ファ６１を経由して主メモリ５３などに書き込まれる。
書き込みバッファ６１とＣＰＵ５１のキャッシュメモリ
６０はＦＩＦＯ方式により書き込み処理を実行している
ものとすれば、書き込みバッファ６１とＣＰＵ５１のキ
ャッシュメモリ６０のために、実際のＣＰＵ５１の書き
込み動作と実際の物理的書き込みは同期的に行われるこ
とにはならず、処理の高速化が実現される一方でエラー
発生と発生したエラーの検出との間にはタイムラグが生
じエラーの検出が困難となる。従って、この書き込み処
理時のコンピュータシステムのエラートレース処理の非
同期性から生ずる問題点を、トレース用バッファを特定
のハードウェアの制御回路内に設定することで解決す
る。When the CPU 51 writes data to the outside via the PMI 52, the CPU 51 performs writing by using the cache memory 60 and the write buffer 61, thereby realizing high-speed writing processing. That is, the CPU 51
In the external write operation, if the write target is an area to be cached, it is once stored in the cache memory 60, and then queued in the write buffer 61. The CPU 51 shifts the control to another operation at the stage when the writing to the cache memory 60 is completed, and then the write operation is executed. Therefore, the CP
U51 is not affected by the processing delay due to writing. The contents of the cache memory 60 are written to the main memory 53 or the like via the write buffer 61.
Assuming that the write buffer 61 and the cache memory 60 of the CPU 51 are executing the write processing by the FIFO method, the write operation of the actual CPU 51 and the actual physical write are performed because of the write buffer 61 and the cache memory 60 of the CPU 51. Are not performed synchronously, and while speeding up the processing is realized, there is a time lag between the occurrence of an error and the detection of the error that has occurred, making it difficult to detect the error. Therefore, the problem caused by the asynchronous nature of the error trace processing of the computer system during the write processing is solved by setting the trace buffer in the control circuit of the specific hardware.

【０１３８】本実施例では、プログラムの状態が変化す
るときにトレース用バッファ６２に状態変化後の新しい
プログラム状態を格納する。例えば、エラー発生動作が
行われた直後にプログラムの状態変化があり、ＣＰＵ５
１が新しい状態に遷移した後にエラーを検出したとす
る。In the present embodiment, when the program state changes, the new program state after the state change is stored in the trace buffer 62. For example, there is a change in the state of the program immediately after the error occurrence operation is performed, and the CPU 5
Assume that 1 has detected an error after transitioning to a new state.

【０１３９】つまり、（１）エラーを発生させる動作が
ＣＰＵ５１により行われるが、状態変化のトレース用バ
ッファ６２への実際の書き込み動作は書き込みバッファ
６１に格納される。（２）さらにプログラムが状態遷移
する。状態遷移をトレース用バッファ６２に書き込む動
作を行うが、状態変化のトレース用バッファ６２への実
際の書き込み動作は書き込みバッファ６１に蓄積され
る。（３）書き込みバッファ６１はＦＩＦＯなのでエラ
ーを発生させる動作が先に実行されエラーが検出され
る。以上の（１），（２），（３）の処理が順番に進ん
だ場合に状態変化をトレース用バッファ６２に書き込む
動作は、（３）によりエラー検出時に書き込みバッファ
６２内に残っていることになる。従ってこの場合には、
書き込みバッファ６１を一時的に凍結しプログラムの状
態変化の動作を書き込みバッファ６１内で固定してトレ
ース用バッファ６２に格納されているプログラムの状態
変化などについての属性情報を調べることで検出したエ
ラーの発生元の認定が容易となる。That is, (1) the CPU 51 performs the operation of causing an error, but the actual write operation of the state change trace buffer 62 is stored in the write buffer 61. (2) The state of the program further transits. Although the operation of writing the state transition to the trace buffer 62 is performed, the actual write operation of the state change to the trace buffer 62 is accumulated in the write buffer 61. (3) Since the write buffer 61 is a FIFO, the operation that causes an error is executed first and the error is detected. The operation of writing the state change to the trace buffer 62 when the above processes (1), (2), and (3) proceed in order must be left in the write buffer 62 when an error is detected by (3). become. So in this case,
An error detected by temporarily freezing the write buffer 61, fixing the behavior of the program state change in the write buffer 61, and checking the attribute information about the program state change stored in the trace buffer 62 The source can be easily certified.

【０１４０】図１６は、エラーが発生する前の書き込み
バッファ６１の状態を示す説明図であり、２５はエラー
を発生させる書き込み動作のエラー書き込み動作情報、
２６はプログラムの状態変化についてトレース用バッフ
ァ６２に書き込み動作を行う状態変化書き込み動作情報
である。この場合、エラー書き込み動作情報２５により
エラーを発生させる書き込み動作が行われエラーが発生
したときには書き込みバッファ６１を一時的に凍結し、
それ以降のプログラムの状態変化についてトレース用バ
ッファ６２への書き込み動作が実行されないようにす
る。従ってトレース用バッファ６２には、エラー発生時
にそれ以前の状態が残っており、エラー発生時のトレー
ス用バッファ６２に残っている属性情報によりエラー発
生元の認定を容易に行うことが可能となる。FIG. 16 is an explanatory diagram showing a state of the write buffer 61 before an error occurs, and 25 is error write operation information of a write operation that causes an error.
Reference numeral 26 is state change write operation information for performing a write operation in the trace buffer 62 regarding a change in the state of the program. In this case, a write operation that causes an error is performed according to the error write operation information 25, and when an error occurs, the write buffer 61 is temporarily frozen,
The write operation to the trace buffer 62 is prevented from being executed with respect to the subsequent change in the state of the program. Therefore, the state before the error remains in the trace buffer 62 when the error occurs, and it is possible to easily identify the error source by the attribute information remaining in the trace buffer 62 when the error occurs.

【０１４１】次に、この場合のエラー発生元の認定方法
を図１７および図１８および図２３を用いて具体的に説
明する。図２３においてＳはシステムプログラムの実行
中を示し、またＵはユーザプログラムの実行中を示して
いる。従来の技術で説明したようにエラー発生とエラー
検出の組（６，１１）、（８，１３）、（９，１４）に
関してはエラー発生元の認定を正しく行うことができな
かった。これに対し本実施例では、エラー発生とエラー
検出の組（９，１４）についてはエラー検出時のプロセ
スが異なっているので、トレース用バッファ６２の属性
情報にプロセスの識別子を書き込むことでエラー発生元
の認定が可能となる。Next, the method of recognizing the error occurrence source in this case will be specifically described with reference to FIGS. 17, 18 and 23. In FIG. 23, S indicates that the system program is being executed, and U indicates that the user program is being executed. As described in the related art, it is not possible to correctly certify the error source for the pairs (6, 11), (8, 13) and (9, 14) of error occurrence and error detection. On the other hand, in the present embodiment, since the process at the time of error detection is different for the set of error occurrence and error detection (9, 14), an error occurs by writing the process identifier in the attribute information of the trace buffer 62. The original certification is possible.

【０１４２】図１７は、それ以外の場合のエラー発生元
の認定についての対応表を示す説明図である。すなわ
ち、図２３において、エラー検出時１１とエラー検出時
１３では６と８のエラー発生についてのプログラムの属
性情報を更新する要求は書き込みバッファ６１内に残っ
ているので、トレース用バッファ６２を調べることでエ
ラー検出時１１において検出したエラーについてのプロ
グラム状態がユーザ状態であったこと、またエラー検出
時１３において検出したエラーについてのプログラム状
態がシステム状態であったことを知ることが出来る。FIG. 17 is an explanatory diagram showing a correspondence table for recognizing the error source in other cases. That is, in FIG. 23, at the time of error detection 11 and at the time of error detection 13, the request to update the attribute information of the program regarding the occurrence of errors 6 and 8 remains in the write buffer 61, so check the trace buffer 62. It is possible to know that the program state of the error detected at the time of error detection 11 is the user state and the program state of the error detected at the time of error detection 13 is the system state.

【０１４３】また、エラー検出１０とエラー検出１２の
場合には、トレース用バッファ６２の属性情報とエラー
検出時のＣＰＵの状態とは一致しているのでトレース用
バッファ６２の属性情報によりエラー発生元のＣＰＵの
状態を知ることができる。従って、図１７に示すように
エラー検出時１０とエラー検出時１１のエラー発生元は
ユーザプログラムにあり、またエラー検出時１２とエラ
ー検出時１３のエラー発生元はシステムプログラムにあ
ることが明らかとなる。すなわち、従来では正しく認定
できなかったエラー発生とエラー検出の組（６，１１）
と（８，１３）についてもエラー発生元の認定が正しく
行われる。In the case of error detection 10 and error detection 12, since the attribute information of the trace buffer 62 and the state of the CPU at the time of error detection match, the error occurrence source is determined by the attribute information of the trace buffer 62. It is possible to know the state of the CPU. Therefore, as shown in FIG. 17, it is clear that the error source at error detection 10 and error detection 11 is in the user program, and the error source at error detection 12 and error detection 13 is in the system program. Become. In other words, the combination of error occurrence and error detection that could not be correctly recognized in the past (6, 11)
With respect to (8, 13), the error source is correctly identified.

【０１４４】また、プログラムの状態が変化するときに
トレース用バッファ６２にプログラムの状態が変化する
前のプログラム状態を格納するようにしても同様の効果
が得られる。図１８は、プログラムの状態が変化すると
きにトレース用バッファ６２にプログラムの状態が変化
する前のプログラム状態を格納したときの対応表であ
り、図１７の対応表に示した場合と同様にエラー検出時
１０とエラー検出時１１のエラー発生元はシステムプロ
グラム状態の変化後のユーザプログラム状態にあり、ま
たエラー検出時１２とエラー検出時１３のエラー発生元
はユーザプログラム状態の変化後のシステムプログラム
状態にあることが明らかとなる。すなわち、従来では正
しく認定できなかったエラー発生とエラー検出の組
（６，１１）と（８，１３）についてもエラー発生元の
認定が正しく行われる。The same effect can be obtained by storing the program state before the program state is changed in the trace buffer 62 when the program state is changed. FIG. 18 is a correspondence table when the program state before the change of the program state is stored in the trace buffer 62 when the state of the program changes, and an error similar to the case shown in the correspondence table of FIG. The error occurrence source at the time of detection 10 and the error detection 11 is in the user program state after the change of the system program state, and the error source at the time of error detection 12 and the error detection 13 is the system program after the change of the user program state It becomes clear that it is in a state. That is, with respect to the error occurrence and error detection pairs (6, 11) and (8, 13), which could not be correctly recognized in the past, the error source is correctly recognized.

【０１４５】実施例１６．以下、請求項２５〜請求項３
１記載の発明の一実施例を説明する。本実施例は、以上
説明してきた実施例１〜実施例１５によりエラー発生元
の認定が行われた場合に実行される処理について述べ
る。このエラー発生元の認定による処理については、エ
ラーを発生させたプログラムだけをシステムから消去し
てそれ以外の処理を継続実行することも可能であり、ま
たシステム全体を停止させることも可能である。たとえ
ば、ユーザプロセスがエラーの発生元であった場合に
は、そのプログラムが使用していた資源を全て空き資源
プールに開放し、またシステムプログラムが発生元であ
った場合にはシステム全体を停止させるか、別のシステ
ムに現在実行中の全プログラムまたは一部のプログラム
を移動させることが可能である。Example 16. Hereinafter, claim 25 to claim 3
An embodiment of the invention described in 1 will be described. The present embodiment describes a process executed when the error source is identified by the first to fifteenth embodiments described above. Regarding the processing based on the recognition of the error occurrence source, it is possible to erase only the program in which the error has occurred from the system and continue execution of other processing, or it is possible to stop the entire system. For example, if a user process is the source of an error, all resources used by that program are released to the free resource pool, and if a system program is the source, the entire system is stopped. Alternatively, it is possible to move all or some of the currently running programs to another system.

【０１４６】また、エラーを発生させたプログラムの実
行を停止させ、それ以外のプログラムの実行を継続した
り、エラー発生プロセスを実行延期にしてエラー原因が
回復したときに実行継続させるなどの方法が可能であ
る。すなわち、エラー発生原因がＩ／Ｏエラーであり、
その入出力デバイスなしではプログラムの実行継続が不
可能である場合にはそのプログラムを停止させ、代替入
出力デバイスが後に利用可能である場合には、その代替
入出力デバイスが使用可能になるまでそのエラーを発生
させたプログラムの実行を一時中断し延期させる。Further, there is a method such as stopping the execution of the program which caused the error and continuing the execution of the other programs, or postponing the execution process of the error and continuing the execution when the cause of the error is recovered. It is possible. That is, the cause of the error is an I / O error,
If the program cannot continue running without the I / O device, stop the program, and if an alternate I / O device is available later, wait until the alternate I / O device becomes available. Suspend and suspend the execution of the program that caused the error.

【０１４７】また、エラーを発生させたプログラムが使
用していた資源を元に戻すことで有効資源の利用率を向
上させることが可能である。Further, it is possible to improve the utilization rate of effective resources by restoring the resources used by the program that has generated the error.

【０１４８】さらに、エラーの発生原因であった資源は
空き資源管理プールに返却しないことでエラーの再発を
防ぐことが可能である。すなわち、エラー発生原因がメ
モリの故障であり、プログラムが使用していた主メモリ
の内で故障していないものを空きメモリとしてシステム
に返却し、エラーを発生したメモリは使用不能として空
き資源管理プールに返却せず、エラー原因が回復するま
で使用を禁止する。エラーを発生させた資源が代替資源
と入れ替え可能である場合には、使用不能として空き資
源管理プールに返却せず、エラー原因が回復するまで使
用を禁止することでシステムの信頼性を向上させること
が可能である。すなわち、メモリデバイスの故障が原因
でエラーを発生し、エラー発生時の動作が前記メモリデ
バイスへの書き込み処理であるような場合、別のメモリ
を代替メモリとして用意しそのメモリを使用して書き込
み処理を行うことでエラー回復が可能となる。Furthermore, it is possible to prevent the recurrence of the error by not returning the resource that caused the error to the free resource management pool. In other words, the cause of the error is a memory failure, and the main memory used by the program that has not failed is returned to the system as free memory. Do not return to, and prohibit use until the cause of the error is recovered. If the resource that caused the error can be replaced with a substitute resource, do not return it to the free resource management pool as an unusable resource, and improve system reliability by prohibiting use until the cause of the error is recovered. Is possible. In other words, if an error occurs due to a memory device failure and the operation at the time of error occurrence is a write process to the memory device, another memory is prepared as an alternative memory and the write process is performed using that memory. By doing this, error recovery becomes possible.

【０１４９】図１９は、請求項２５または請求項２９ま
たは請求項３０記載の発明の一実施例におけるエラー発
生元の認定後の動作を示すフローチャート、図２０は、
請求項２６または請求項２７記載の発明の一実施例にお
けるエラー発生元の認定後の動作を示すフローチャー
ト、図２１は、請求項２８記載の発明の一実施例におけ
るエラー発生元の認定後の動作を示すフローチャート、
図２２は請求項３１記載の発明の一実施例におけるエラ
ー発生元の認定後の動作を示すフローチャートである。FIG. 19 is a flow chart showing the operation after recognizing the error source in the embodiment of the invention of claim 25, claim 29 or claim 30, and FIG.
FIG. 21 is a flowchart showing the operation after recognizing the error source in the embodiment of the invention described in claim 26 or claim 27. FIG. 21 is the operation after recognizing the error source in the embodiment of the invention described in claim 28. A flow chart showing
FIG. 22 is a flow chart showing the operation after recognizing the error source in the embodiment of the invention of claim 31.

【０１５０】すなわち、図１９のフローチャートに示す
動作によればエラー発生元であると認定されたプログラ
ムの種類（システムプログラムあるいはユーザプログラ
ムなどの種類）と同一種類のプログラムの特定を行い
（ステップＳＴ１１）、この特定したプログラムの種類
はシステムプログラムであるか否かを判定し（ステップ
ＳＴ１２）、システムプログラムであると判定したとき
には、システム全体を停止させる（ステップＳＴ１
３）。一方、ステップＳＴ１２においてシステムプログ
ラムでない、すなわちユーザプログラムであると判定さ
れたときには、ステップＳＴ１１において特定したプロ
グラムに対応するユーザプログラムの実行を停止させる
（ステップＳＴ１４）。この状態では、前記特定したプ
ログラムに対応するユーザプログラムは停止状態にある
が他のエラー発生元でないプログラムは停止することな
く実行が継続され、エラー原因を回復するための操作が
行われる。That is, according to the operation shown in the flowchart of FIG. 19, a program of the same type as the type of program (type of system program or user program) recognized as the error source is identified (step ST11). Then, it is determined whether or not the identified program type is a system program (step ST12), and when it is determined to be a system program, the entire system is stopped (step ST1).
3). On the other hand, when it is determined in step ST12 that the program is not the system program, that is, the user program, the execution of the user program corresponding to the program specified in step ST11 is stopped (step ST14). In this state, the user program corresponding to the specified program is in the stopped state, but the other programs that are not the source of the error continue to be executed without stopping, and the operation for recovering the cause of the error is performed.

【０１５１】ユーザプログラムが停止している状態で
は、次にエラーの原因が回復したか否かを判定し（ステ
ップＳＴ１５）、エラーの原因が回復したと判定したと
きには、停止しているユーザプログラムの実行を再開し
継続する（ステップＳＴ１６）。In the state where the user program is stopped, it is next determined whether or not the cause of the error is recovered (step ST15). When it is determined that the cause of the error is recovered, the stopped user program The execution is resumed and continued (step ST16).

【０１５２】また、図２０に示すフローチャートに示す
動作によれば、エラー発生元であると認定されたプログ
ラムの種類（システムプログラムあるいはユーザプログ
ラムなどの種類）と同一種類のプログラムの特定を行い
（ステップＳＴ２１）、さらに特定したプログラムが使
用している資源を確認する（ステップＳＴ２２）。この
資源としては、発生元であると認定されたプログラムが
使用しているメモリ領域，各種の管理用テーブル，入出
力装置などである。そして、ステップＳＴ２２において
確認した資源の全部または一部を前記資源を管理してい
る資源管理システムに返却する（ステップＳＴ２３）。Further, according to the operation shown in the flow chart of FIG. 20, a program of the same type as the type of the program (system program or user program) recognized as the error source is identified (step In ST21), the resources used by the further specified program are confirmed (step ST22). The resources include a memory area used by a program that is identified as the source, various management tables, an input / output device, and the like. Then, all or part of the resources confirmed in step ST22 is returned to the resource management system managing the resources (step ST23).

【０１５３】また、図２１に示すフローチャートでは、
図２０のフローチャートに示したステップＳＴ２３にお
ける資源を管理している資源管理システムに資源を返却
する動作を行わないようにしたものであり（ステップＳ
Ｔ３３）、エラー発生原因となっている資源が資源管理
システムに返却され、使用されることでエラーの再発生
原因となるのを回避する。Further, in the flow chart shown in FIG.
The operation of returning the resource to the resource management system managing the resource in step ST23 shown in the flowchart of FIG. 20 is not performed (step S
T33), the resource causing the error is prevented from being returned to the resource management system and used again to cause the error again.

【０１５４】また、図２２に示すフローチャートに示す
動作によれば、エラー発生元であるプログラムが認定さ
れるとエラー処理が起動され、認定されたプログラムの
実行を停止する（ステップＳＴ４１）。そして、前記エ
ラーの発生元であると認定されたプログラムが使用して
いた資源の確認を行い（ステップＳＴ４２）、この資源
に代る代替資源の有無を判定する（ステップＳＴ４
３）。代替資源がないと判定し、また代替資源を用いな
ければ以後の処理を継続できないと判定したときには、
処理を中断する中断処理を行う（ステップＳＴ４４）。
一方、ステップＳＴ４３において代替資源があると判定
したときには、エラー発生原因である資源を代替資源と
入れ替える（ステップＳＴ４５）。そして、新しく入れ
替えた代替資源の状態が入れ替える前の資源の状態と同
一の状態になるように必要な設定データなどを再設定し
（ステップＳＴ４６）、前記ステップＳＴ４１において
中断したプログラムの実行を開始し継続する（ステップ
ＳＴ４７）。Further, according to the operation shown in the flow chart of FIG. 22, when the program which is the source of the error is certified, the error processing is started and the execution of the certified program is stopped (step ST41). Then, the resource used by the program recognized as the source of the error is confirmed (step ST42), and it is determined whether or not there is an alternative resource replacing this resource (step ST4).
3). When it is determined that there is no alternative resource, and it is determined that the subsequent processing cannot be continued unless the alternative resource is used,
An interruption process for interrupting the process is performed (step ST44).
On the other hand, when it is determined in step ST43 that there is an alternative resource, the resource that causes the error is replaced with the alternative resource (step ST45). Then, necessary setting data and the like are reset so that the state of the newly replaced alternative resource becomes the same as the state of the resource before the replacement (step ST46), and the execution of the program interrupted in step ST41 is started. Continue (step ST47).

【０１５５】[0155]

【発明の効果】以上のように、請求項１の発明によれば
トレース用バッファにはプログラムの状態変化に関する
属性情報であるトレース情報を記憶し、記憶した属性情
報を基にエラー発生源を認定するように構成したので、
エラー発生元の認定が正確かつ容易になる効果がある。As described above, according to the first aspect of the invention, the trace buffer stores the trace information which is the attribute information relating to the change in the state of the program, and the error source is identified based on the stored attribute information. I configured it to
Accurate and easy recognition of the error source is effective.

【０１５６】請求項２の発明によれば、トレース用バッ
ファに実行されているプログラムの状態変化に関する属
性情報をトレース情報として発生時間順に記憶し、この
記憶した属性情報を前記発生時間順に進みあるいは戻り
検索することでエラー発生原因に関係するプログラムを
認定するように構成したので、エラー発生元の認定が正
確かつ容易になる効果がある。According to the second aspect of the present invention, the attribute information relating to the state change of the program being executed in the trace buffer is stored as trace information in the order of occurrence time, and the stored attribute information is advanced or returned in the order of occurrence time. Since the program related to the cause of the error occurrence is certified by searching, it is possible to accurately and easily identify the error source.

【０１５７】請求項３の発明によれば、トレース用バッ
ファに記憶した当該プロセスがどの資源を使用している
かを知ることのできるプロセスの属性情報を検索し、こ
の属性情報を基に前記プロセスが使用している資源を知
ることでエラー発生原因となった資源を使用しているプ
ログラムを認定するように構成したので、エラー発生元
の認定が正確かつ容易になる効果がある。According to the third aspect of the present invention, the attribute information of the process, which is stored in the trace buffer and which can be used to know which resource the process is using, is searched, and based on this attribute information, the process is searched. Since the program that uses the resource that has caused the error has been configured to be recognized by knowing the resource that is being used, the error source can be accurately and easily identified.

【０１５８】請求項４の発明によれば、トレース用バッ
ファには実行されているプログラムの状態が変化する前
のプログラム状態あるいは変化した後のプログラム状態
を属性情報として格納し、この記憶した属性情報を基に
エラー発生源を認定するように構成したので、エラー発
生元の認定が正確かつ容易になる効果がある。According to the fourth aspect of the present invention, the trace buffer stores the program state before the state of the program being executed or the program state after the change as attribute information, and the stored attribute information. Since the error source is identified based on the above, there is an effect that the error source is identified accurately and easily.

【０１５９】請求項５の発明によれば、書き込みバッフ
ァやキャッシュなどの記憶装置を使用しているときにエ
ラーが発生すると、前記記憶装置内のデータを固定し実
行されているプログラムの状態が変化する前のプログラ
ム状態あるいは変化した後のプログラム状態などのトレ
ース用バッファに格納された属性情報を検索し、エラー
発生源を認定するように構成したので、エラー発生元の
認定が正確かつ容易になる効果がある。According to the fifth aspect of the invention, when an error occurs while using a storage device such as a write buffer or a cache, the state of the program being executed while fixing the data in the storage device changes. The attribute information stored in the trace buffer, such as the program state before the change or the program state after the change, is searched and the source of the error is configured to be certified, so that the source of the error is accurately and easily certified. effective.

【０１６０】請求項６の発明によれば、エラー発生時の
プログラムの状態に関しての属性情報をトレース用バッ
ファをサイクリックに使用して記憶し、この記憶した属
性情報を基にエラー発生元を認定するように構成したの
で、トレース用バッファの有効利用が可能になり、エラ
ー発生元の認定が正確かつ容易になる効果がある。According to the sixth aspect of the present invention, the attribute information regarding the state of the program at the time of error is stored by using the trace buffer cyclically, and the error source is identified based on the stored attribute information. With this configuration, the trace buffer can be effectively used, and the error source can be accurately and easily identified.

【０１６１】請求項７の発明によれば、オペレーティン
グシステムによりプログラムの使用状況に応じて作成管
理されあるいは予め定義されてなるプログラムの管理テ
ーブルへのポインタ、または前記管理テーブルのコピー
またはプログラムの識別子などを属性情報としてトレー
ス用バッファに記憶し、この記憶した属性情報を基にエ
ラー発生元を認定するように構成したので、エラー発生
元の認定が正確かつ容易になる効果がある。According to the invention of claim 7, a pointer to a management table of a program which is created and managed by the operating system according to the usage status of the program or is defined in advance, or a copy of the management table or an identifier of the program, etc. Is stored as the attribute information in the trace buffer and the error source is identified based on the stored attribute information. Therefore, the error source can be identified accurately and easily.

【０１６２】請求項８の発明によれば、プログラムの状
態変化が発生した時刻またはプログラムの状態が変化し
てからの経過時間を属性情報としてトレース用バッファ
に記憶し、この記憶した属性情報を基にエラー発生元を
認定するように構成したので、エラー発生元の認定が正
確かつ容易になる効果がある。According to the eighth aspect of the present invention, the time when the program state change occurs or the elapsed time after the program state change is stored in the trace buffer as attribute information, and the stored attribute information is used as a basis. Since it is configured to certify the error source, the error source can be accurately and easily certified.

【０１６３】請求項９の発明によれば、エラーの検出時
刻と発生したエラーが検出されるのに要したエラー検出
時間から実際にエラーの発生したエラー発生時刻を求
め、さらにトレース用バッファに記憶したプログラムの
状態変化が発生した時刻またはプログラムの状態が変化
してからの経過時間などの属性情報により、前記求めた
エラー発生時刻に実行されていたプログラムを検索して
求め、エラー発生源を認定するように構成したので、エ
ラー発生元の認定が正確かつ容易になる効果がある。According to the ninth aspect of the invention, the error occurrence time at which an error actually occurs is obtained from the error detection time and the error detection time required to detect the error that has occurred, and is further stored in the trace buffer. Based on the attribute information such as the time when the status change of the program occurred or the elapsed time since the status of the program changed, the program that was executed at the time when the error occurred was searched for and found, and the error source was identified. Since it is configured to do so, it is effective in certifying the error source accurately and easily.

【０１６４】請求項１０の発明によれば、発生したエラ
ーの種類に応じたエラーを検出するのに要するエラー検
出時間のリストをあらかじめ求めておき、発生したエラ
ーの種類に応じた前記エラー検出時間を前記リストから
検索して求め、エラーの検出時刻と前記リストより求め
たエラー検出時間からエラーの種類に応じた実際のエラ
ー発生時刻を求め、さらに属性情報としてトレース用バ
ッファに記憶したプログラムの状態変化が発生した時刻
またはプログラムの状態が変化してからの経過時間の情
報により、前記求めたエラー発生時刻に実行されていた
プログラムを検索して求め、エラー発生源を認定するよ
うに構成したので、エラー発生時のプログラム状態とエ
ラー検出時のプログラム状態が異なる場合でもエラー発
生元の認定が正確かつ容易になる効果がある。According to the tenth aspect of the invention, a list of error detection times required to detect an error corresponding to the type of error that has occurred is obtained in advance, and the error detection time corresponding to the type of error that has occurred is obtained. From the list, the actual error occurrence time corresponding to the type of error is found from the error detection time and the error detection time found from the list, and the state of the program stored in the trace buffer as attribute information Since it is configured to search for the program that was being executed at the obtained error occurrence time by the information of the time when the change occurred or the time elapsed since the state of the program changed, and to determine the error source. Accurate certification of the error source even when the program state when the error occurs and the program state when the error occurs are different One is easy to become effective.

【０１６５】請求項１１の発明によれば、トレース用バ
ッファには、プログラムが使用している資源のリストを
属性情報として格納し、この属性情報を基にエラー発生
元を認定するように構成したので、エラー発生元の認定
が正確かつ容易になる効果がある。According to the invention of claim 11, the list of resources used by the program is stored in the trace buffer as attribute information, and the error source is identified based on this attribute information. Therefore, it is effective to accurately and easily identify the error source.

【０１６６】請求項１２の発明によれば、トレース用バ
ッファへの属性情報の格納をプロセス間のコンテクスト
スイッチの発生を基に行い、トレース用バッファに格納
された属性情報によりエラー発生元を認定するように構
成したので、プロセスの実行順序を容易に知ることがで
き、エラー発生元の認定が正確かつ容易になる効果があ
る。According to the twelfth aspect of the present invention, the attribute information is stored in the trace buffer based on the occurrence of the context switch between processes, and the error source is identified by the attribute information stored in the trace buffer. Since it is configured as described above, it is possible to easily know the execution order of the processes, and it is possible to accurately and easily identify the error source.

【０１６７】請求項１３の発明によれば、トレース用バ
ッファへの属性情報の格納をＣＰＵの実行状態の遷移を
基に行い、この結果トレース用バッファに格納された属
性情報を基にエラー発生元を認定するように構成したの
で、エラー発生元の認定が正確かつ容易になる効果があ
る。According to the thirteenth aspect of the present invention, the attribute information is stored in the trace buffer based on the transition of the execution state of the CPU, and as a result, the error occurrence source is determined based on the attribute information stored in the trace buffer. Since it is configured to certify, the effect of certifying the error source is accurate and easy.

【０１６８】請求項１４の発明によれば、ＣＰＵの実行
状態がユーザプログラムからシステムプログラムへ、あ
るいはシステムプログラムからユーザプログラムへ遷移
したことを基にトレース用バッファへ属性情報を格納
し、トレース用バッファに格納されたプログラムの状態
変化に関しての属性情報を基にエラー発生元を認定する
ように構成したので、エラー発生時のプログラム状態と
エラー検出時のプログラム状態が異なる場合でもエラー
発生元の認定が正確かつ容易になる効果がある。According to the fourteenth aspect of the present invention, the attribute information is stored in the trace buffer based on the transition of the CPU execution state from the user program to the system program or from the system program to the user program. Since the error source is configured to be certified based on the attribute information related to the program state change stored in, the error source can be certified even when the program state when an error occurs and the program state when an error is detected are different. It has the effect of being accurate and easy.

【０１６９】請求項１５の発明によれば、プログラムの
状態変化としてＣＰＵの実行状態が非特権レベルから特
権レベルへ、あるいは特権レベルから非特権レベルへ遷
移することでトレース用バッファへ属性情報を格納し、
このトレース用バッファに格納された属性情報を基にエ
ラー発生元を認定するように構成したので、エラー発生
元の認定が正確かつ容易になる効果がある。According to the fifteenth aspect of the present invention, the attribute information is stored in the trace buffer when the execution state of the CPU transits from the non-privileged level to the privileged level or from the privileged level to the non-privileged level as the state change of the program. Then
Since the error source is identified based on the attribute information stored in the trace buffer, the error source can be identified accurately and easily.

【０１７０】請求項１６の発明によれば、割り込み処理
の起動あるいは終了したタイミング、または割り込み処
理の起動および終了したタイミング、または割り込み処
理中の任意の時間タイミングでトレース用バッファへ属
性情報を格納し、このトレース用バッファに格納された
属性情報を基にエラー発生元を認定するように構成した
ので、エラー発生元の認定が正確かつ容易になる効果が
ある。According to the sixteenth aspect of the present invention, the attribute information is stored in the trace buffer at the timing when the interrupt processing is started or ended, the timing when the interrupt processing is started and ended, or an arbitrary time during the interrupt processing. Since the error source is identified based on the attribute information stored in the trace buffer, the error source can be identified accurately and easily.

【０１７１】請求項１７の発明によれば、入出力装置に
おける入出力動作終了時や入出力動作中のエラー発生や
ハードウェアエラーや例外を割り込み要因とし、これら
割り込み要因が発生したときにトレース用バッファへ属
性情報を格納し、このトレース用バッファに格納された
属性情報を基にエラー発生元を認定するように構成した
ので、エラー発生時のプログラム状態とエラー検出時の
プログラム状態が異なる場合でもエラー発生元の認定が
正確かつ容易になる効果がある。According to the seventeenth aspect of the present invention, when an input / output operation in the input / output device is ended or during an input / output operation, an error occurrence, a hardware error or an exception is used as an interrupt factor, and when these interrupt factors occur, tracing is performed. Since the attribute information is stored in the buffer and the error source is identified based on the attribute information stored in the trace buffer, even if the program state when an error occurs and the program state when an error is detected are different. Accurate and easy recognition of the error source is effective.

【０１７２】請求項１８の発明によれば、ユーザプログ
ラムからシステムプログラムへの遷移が行われるシステ
ムコールが起動されるタイミングあるいはシステムコー
ルからの終了処理が実行されるタイミング、または前記
システムコールが起動されるタイミングおよびシステム
コールからの終了処理が実行されるタイミング、または
システムコールの処理中の任意の時間タイミングでトレ
ース用バッファへ属性情報を格納し、このトレース用バ
ッファに格納された属性情報を基にエラー発生元を認定
するように構成したので、エラー発生時のプログラム状
態とエラー検出時のプログラム状態が異なる場合でもエ
ラー発生元の認定が正確かつ容易になる効果がある。According to the eighteenth aspect of the present invention, the timing at which the system call for transitioning from the user program to the system program is activated, the timing for performing the termination processing from the system call, or the system call is activated. The attribute information is stored in the trace buffer at the timing when the system call is terminated and when the system call is terminated, or at any time during the processing of the system call, and based on the attribute information stored in the trace buffer. Since the error source is configured to be certified, even if the program state at the time of error occurrence and the program state at the time of error detection are different, the error source can be certified accurately and easily.

【０１７３】請求項１９の発明によれば、アプリケーシ
ョンプログラムに提供されたインターフェースを用いて
属性情報を格納し、この格納された属性情報を基にエラ
ー発生元を認定するように構成したので、エラー発生元
の認定が正確かつ容易になる効果がある。According to the nineteenth aspect of the invention, since the attribute information is stored using the interface provided to the application program and the error occurrence source is identified based on the stored attribute information, an error occurs. This has the effect of making the source identification accurate and easy.

【０１７４】請求項２０の発明によれば、プログラムカ
ウンタ値，ＣＰＵ状態値，時間要素，トレース事象番
号，各種レジスタの内容，プログラム名，プロセス名，
メッセージ，アドレス空間識別子，プロセス識別子，プ
ログラム識別子，プロセス属性情報，プログラム属性情
報，メッセージキュー識別子，メッセージキュー属性情
報のいずれかあるいはそれらの組み合わせをインターフ
ェースのパラメータとして、アプリケーションプログラ
ムに提供されたインターフェースを用いて属性情報を格
納し、格納したトレース情報を基にエラー発生元を認定
するように構成したので、エラー発生元の認定に有効な
データを効率良く収集でき、エラー発生元の認定が正確
かつ容易になる効果がある。According to the twentieth aspect of the invention, the program counter value, the CPU state value, the time element, the trace event number, the contents of various registers, the program name, the process name,
The interface provided to the application program is used with the message, the address space identifier, the process identifier, the program identifier, the process attribute information, the program attribute information, the message queue identifier, the message queue attribute information, or a combination thereof as the interface parameter. The attribute information is stored by using the attribute information, and the error source is certified based on the stored trace information. Effective data for certifying the error source can be collected efficiently, and the error source can be certified accurately and easily. Is effective.

【０１７５】請求項２１の発明によれば、あらかじめ指
定された要因が発生したときにのみトレース用バッファ
への属性情報の格納を行い、格納した属性情報を基にエ
ラー発生元を認定するように構成したので、エラー発生
元の認定が正確かつ容易になる効果がある。According to the twenty-first aspect of the present invention, the attribute information is stored in the trace buffer only when a prespecified factor occurs, and the error occurrence source is identified based on the stored attribute information. Since it is configured, it has the effect of certifying the error source accurately and easily.

【０１７６】請求項２２の発明によれば、割り込みの種
類、システムコールの種類、プログラムまたはプロセス
の識別子、プログラム名またはプロセス名または関数
名、特定のアドレス領域に対する書き込みや読み出しお
よび実行などの予め指定された要因が発生したときにの
み、トレース用バッファへの属性情報の格納を行い、記
憶した属性情報を基にエラー発生元を認定するように構
成したので、エラー発生元の認定が正確かつ容易になる
効果がある。According to the twenty-second aspect of the invention, the type of interrupt, the type of system call, the identifier of the program or process, the program name or the process name or the function name, and the writing, reading, and execution in a specific address area are designated in advance. The attribute information is stored in the trace buffer only when the specified factor occurs, and the error source is identified based on the stored attribute information, so the error source can be identified accurately and easily. Is effective.

【０１７７】請求項２３の発明によれば、ハードウェア
の制御回路内に設けられたトレース用バッファにプログ
ラムの状態変化を示す属性情報を記憶し、記憶した属性
情報を基にエラー発生元を認定するように構成したの
で、エラー発生時のプログラム状態とエラー検出時のプ
ログラム状態が異なる場合でもエラー発生元の認定が正
確かつ容易になる効果がある。According to the twenty-third aspect of the present invention, the attribute information indicating the state change of the program is stored in the trace buffer provided in the hardware control circuit, and the error source is identified based on the stored attribute information. With this configuration, even if the program state at the time of error occurrence and the program state at the time of error detection are different, it is possible to accurately and easily identify the error source.

【０１７８】請求項２４の発明によれば、ＣＰＵと主メ
モリとのインターフェース回路、外部バス制御回路、ロ
ーカルバス制御回路、入出力制御回路、出力メモリをハ
ードウェアの制御回路として、これらハードウェアに設
けられたトレース用バッファに属性情報を記憶し、記憶
した属性情報を基にエラー発生元を認定するように構成
したので、エラー発生元の認定が正確かつ容易になる効
果がある。According to the twenty-fourth aspect of the present invention, the interface circuit between the CPU and the main memory, the external bus control circuit, the local bus control circuit, the input / output control circuit, and the output memory are used as hardware control circuits, and these hardware are used. Since the attribute information is stored in the trace buffer provided and the error source is identified based on the stored attribute information, the error source can be identified accurately and easily.

【０１７９】請求項２５の発明によれば、エラー発生源
と認定されたプログラムの種類に応じて、前記認定され
たプログラムの実行を停止させ、あるいはシステムを停
止させ、あるいは前記プログラムの実行を延期させエラ
ー原因が回復した時点で前記プログラムの実行や前記シ
ステムの動作を再開させ継続させる構成を備えたので、
システムの信頼性を向上させるエラートレースによるシ
ステム制御管理方法が得られる効果がある。According to the twenty-fifth aspect of the invention, the execution of the certified program is stopped, the system is stopped, or the execution of the program is postponed according to the type of the program recognized as the error source. When the cause of the error is recovered, the program is restarted and the operation of the system is resumed at the time of recovery, so that
There is an effect that a system control management method by error tracing that improves system reliability can be obtained.

【０１８０】請求項２６の発明によればエラー発生源と
認定されたプログラムがユーザプログラムまたはユーザ
プロセスの場合に、前記ユーザプログラムまたはユーザ
プロセスが使用している資源をそのシステムの管理を実
行しているシステム管理手段に返却することでシステム
を制御し管理するように構成したので、有効資源の利用
率を向上させ、システムの信頼性を向上させるエラート
レースによるシステム制御管理方法が得られる効果があ
る。According to the twenty-sixth aspect of the present invention, when the program recognized as the error source is a user program or a user process, the resources used by the user program or the user process are managed by the system. Since it is configured to control and manage the system by returning it to the existing system management means, there is an effect that a system control management method by error tracing that improves the utilization rate of effective resources and improves system reliability can be obtained. .

【０１８１】請求項２７の発明によればエラー発生源と
認定されたプログラムが使用していたメモリ領域や各種
の管理用テーブルや入出力装置などの資源を夫々の空き
資源管理プールに返却することでシステムを制御し管理
するように構成したので、有効資源の利用率を向上さ
せ、システムの信頼性を向上させるエラートレースによ
るシステム制御管理方法が得られる効果がある。According to the twenty-seventh aspect of the present invention, the resources such as the memory area used by the program identified as the error source, the various management tables, and the input / output devices are returned to the respective free resource management pools. Since the system is configured to be controlled and managed by the system, there is an effect that a system control management method by error tracing that improves the utilization rate of effective resources and improves system reliability can be obtained.

【０１８２】請求項２８の発明によれば、エラー発生源
と認定されたプログラムでエラーを発生させたメモリペ
ージまたはその他のエラーを発生させた資源を、夫々の
空き資源管理プールに返却しないことでシステムを制御
し管理するように構成したので、エラーの発生が繰り返
されることがなくなり、システムの信頼性を向上させる
エラートレースによるシステム制御管理方法が得られる
効果がある。According to the twenty-eighth aspect of the invention, the memory page in which an error has occurred in the program identified as the error source or the resource in which another error has occurred are not returned to the respective free resource management pools. Since the system is configured to be controlled and managed, it is possible to obtain a system control management method by error tracing that improves the reliability of the system by preventing the occurrence of errors repeatedly.

【０１８３】請求項２９の発明によれば、エラー発生源
と認定されたプログラムの種類がオペレーティングシス
テムなどのシステムプログラムであるときには、システ
ム全体の停止処理を行うように構成したので、認定した
エラー発生源に応じてシステムを制御しシステムの信頼
性を向上させるエラートレースによるシステム制御管理
方法が得られる効果がある。According to the twenty-ninth aspect of the invention, when the type of the program recognized as the error generation source is a system program such as an operating system, the system is configured to execute the stop processing of the entire system. There is an effect that a system control management method by error tracing that controls the system according to the source and improves the reliability of the system can be obtained.

【０１８４】請求項３０の発明によれば、エラー発生源
と認定されたプログラムがユーザプログラムまたはユー
ザプロセスの場合に、前記ユーザプログラムまたはユー
ザプロセスとそれらに関連する関連プログラムまたはプ
ロセス群の処理を停止させると共に他の処理については
継続実行させるように構成したので、認定したエラー発
生源に応じてシステムを制御しシステムの信頼性を向上
させるエラートレースによるシステム制御管理方法が得
られる効果がある。According to the thirtieth aspect of the invention, when the program recognized as the error source is the user program or the user process, the processing of the user program or the user process and the related program or process group related thereto is stopped. Since the other processing is configured to be continuously executed, the system control management method by the error trace that controls the system according to the recognized error source and improves the system reliability can be obtained.

【０１８５】請求項３１の発明によれば、エラー発生源
と認定されたプログラムでエラーを発生した資源を代替
資源と入れ替えることでエラーからの回復を行い、エラ
ー原因が回復した時点で処理を再開させ継続させるよう
に構成したので、システムの信頼性を向上させるエラー
トレースによるシステム制御管理方法が得られる効果が
ある。According to the thirty-first aspect of the invention, the error is recovered by replacing the resource in which an error has occurred in the program recognized as the error source with the alternative resource, and the process is restarted when the cause of the error is recovered. Since it is configured to continue the operation, there is an effect that a system control management method by error tracing that improves system reliability can be obtained.

[Brief description of drawings]

【図１】請求項１記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法の構成を示すブロ
ック図である。FIG. 1 is a block diagram showing a configuration of a system control management method by error tracing according to an embodiment of the invention described in claim 1.

【図２】請求項１記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法の動作を示すフロ
ーチャートである。FIG. 2 is a flowchart showing an operation of a system control management method by error tracing according to an embodiment of the invention described in claim 1.

【図３】請求項２記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法における発生時間
順にトレース用バッファに格納されたプログラムの属性
情報を示す説明図である。FIG. 3 is an explanatory diagram showing attribute information of programs stored in a trace buffer in order of occurrence time in a system control management method by error tracing according to an embodiment of the invention as set forth in claim 2;

【図４】請求項３記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法におけるエラー発
生元の調査・認定処理の動作を示すフローチャートであ
る。FIG. 4 is a flow chart showing an operation of error source investigation / authorization processing in a system control management method by error tracing according to an embodiment of the invention as set forth in claim 3;

【図５】請求項４記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法におけるトレース
用バッファに格納されたプログラムの属性情報を示す説
明図である。FIG. 5 is an explanatory diagram showing attribute information of a program stored in a trace buffer in a system control management method by error tracing according to an embodiment of the invention described in claim 4;

【図６】請求項４記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法におけるトレース
用バッファに格納されたプログラムの属性情報を示す説
明図である。FIG. 6 is an explanatory diagram showing attribute information of a program stored in a trace buffer in a system control management method by error tracing according to an embodiment of the invention described in claim 4;

【図７】請求項５記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法の動作を示すフロ
ーチャートである。FIG. 7 is a flow chart showing an operation of a system control management method by error tracing according to an embodiment of the invention described in claim 5;

【図８】請求項６記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法におけるリング状
のトレース用バッファの構成を示す説明図である。FIG. 8 is an explanatory diagram showing a configuration of a ring-shaped trace buffer in a system control management method by error tracing according to an embodiment of the present invention.

【図９】請求項６記載の発明の一実施例によるエラー
トレースによるシステム制御管理方法のトレース用バッ
ファに格納された属性情報を示す説明図である。FIG. 9 is an explanatory diagram showing attribute information stored in a trace buffer of a system control management method by error tracing according to an embodiment of the invention described in claim 6;

【図１０】請求項８および請求項９記載の発明の一実
施例のエラートレースによるシステム制御管理方法の特
徴を説明するための説明図である。FIG. 10 is an explanatory diagram for explaining the features of the system control management method based on error tracing according to an embodiment of the invention described in claims 8 and 9;

【図１１】請求項１０記載の発明の一実施例のエラー
トレースによるシステム制御管理方法におけるエラー検
出時間のリストを示す説明図である。FIG. 11 is an explanatory diagram showing a list of error detection times in a system control management method by error tracing according to an embodiment of the invention as set forth in claim 10;

【図１２】請求項１１記載の発明の一実施例のエラー
トレースによるシステム制御管理方法における使用資源
のリストを示す説明図である。FIG. 12 is an explanatory diagram showing a list of resources used in a system control management method by error tracing according to an embodiment of the invention described in claim 11;

【図１３】請求項１９および請求項２０記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成を示す概念図である。FIG. 13 is a conceptual diagram showing a configuration of a system control management method using error tracing according to an embodiment of the invention described in claims 19 and 20.

【図１４】請求項２３および請求項２４記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成を示す概念図である。FIG. 14 is a conceptual diagram showing the configuration of a system control management method by error tracing according to an embodiment of the invention described in claims 23 and 24.

【図１５】請求項２３および請求項２４記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成のＰＭＩ５２にトレース用バッファを設けた場合
の概念図である。FIG. 15 is a conceptual diagram in the case where a trace buffer is provided in the PMI 52 of the configuration of the system control management method by error trace according to the embodiment of the invention described in claims 23 and 24.

【図１６】請求項２３および請求項２４記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成におけるエラーが発生する前の書き込みバッファ
の状態を示す説明図である。FIG. 16 is an explanatory diagram showing a state of a write buffer before an error occurs in the configuration of the system control management method by error tracing according to the embodiment of the invention described in claims 23 and 24;

【図１７】請求項２３および請求項２４記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成におけるトレース用バッファに記憶したプログラ
ムの状態によりエラー発生元を知るための対応表を示す
説明図である。FIG. 17 shows a correspondence table for knowing an error occurrence source from the state of the program stored in the trace buffer in the configuration of the system control management method by error trace according to the embodiment of the invention described in claims 23 and 24. FIG.

【図１８】請求項２３および請求項２４記載の発明の
一実施例のエラートレースによるシステム制御管理方法
の構成におけるトレース用バッファに記憶したプログラ
ムの状態によりエラー発生元を知るための対応表を示す
説明図である。FIG. 18 shows a correspondence table for knowing an error occurrence source from the state of the program stored in the trace buffer in the configuration of the system control management method by error tracing according to the embodiment of the invention described in claims 23 and 24. FIG.

【図１９】請求項２５または請求項２９または請求項
３０記載の発明の一実施例におけるエラー発生元の認定
後の動作を示すフローチャートである。FIG. 19 is a flow chart showing an operation after recognizing an error source in one embodiment of the invention of claim 25, claim 29 or claim 30.

【図２０】請求項２６または請求項２７記載の発明の
一実施例におけるエラー発生元の認定後の動作を示すフ
ローチャートである。FIG. 20 is a flowchart showing an operation after recognizing an error source in one embodiment of the invention of claim 26 or claim 27.

【図２１】請求項２８記載の発明の一実施例における
エラー発生元の認定後の動作を示すフローチャートであ
る。FIG. 21 is a flowchart showing an operation after recognizing an error occurrence source in one embodiment of the invention of claim 28.

【図２２】請求項３１記載の発明の一実施例における
エラー発生元の認定後の動作を示すフローチャートであ
る。FIG. 22 is a flow chart showing an operation after recognition of an error source in one embodiment of the invention of claim 31;

【図２３】従来のコンピュータシステムにおけるエラ
ー発生状況を模式的に時間の経過に従って示した説明図
である。FIG. 23 is an explanatory diagram schematically showing an error occurrence situation in a conventional computer system over time.

[Explanation of symbols]

２４，６２トレース用バッファ、４４ソフトウェア
インターフェース、５１ＣＰＵ、５３主メモリ、５
４外部バス制御回路、５５ローカルバス制御回路、
５６入出力制御回路、６０キャッシュメモリ、６１
書き込みバッファ。24, 62 trace buffer, 44 software interface, 51 CPU, 53 main memory, 5
4 external bus control circuit, 55 local bus control circuit,
56 input / output control circuit, 60 cache memory, 61
Write buffer.

Claims

[Claims]

1. A system control management method using an error trace, wherein trace information at the time of error occurrence is stored in a trace buffer, an error generation source is identified by the stored trace information, and a system is controlled and managed. A system control management method by error tracing, wherein attribute information relating to a change in the state of a program is stored as trace information in a buffer for use, and an error generation source is identified based on the stored attribute information.

2. The trace buffer stores attribute information relating to a change in the state of a program being executed as trace information in the order of occurrence time, and the stored trace information is searched for forward or backward in the order of occurrence time, resulting in an error. 2. The system control management method by error trace according to claim 1, wherein a program related to the cause of occurrence is certified.

3. Retrieval of attribute information that allows the process stored in the trace buffer to know which resource the process is using, and based on this attribute information, retrieves the resource used by the process. The system control management method by error trace according to claim 1, wherein the program that uses the resource that caused the error is recognized by knowing.

4. The error trace according to claim 1, wherein the trace buffer stores the program state before the state of the program being executed changes or the program state after the change as attribute information. System control management method.

5. When a storage device such as a write buffer or a cache is used, when an error occurs, the data in the storage device is fixed and the state of the program being executed or change before the state of the program being executed changes. 2. The system control management method by error trace according to claim 1, wherein the attribute information stored in the trace buffer such as the program state after the search is searched to identify the error source.

6. The system control management method by error trace according to claim 1, wherein the trace buffer is cyclically used to store the attribute information when an error occurs.

7. A trace buffer as a pointer to a management table of a program which is created and managed by the operating system according to the usage status of the program or is defined in advance or a copy of the management table or an identifier of the program as attribute information. The system control management method by error trace according to claim 1, wherein

8. The error tracing system according to claim 1, wherein the trace buffer stores attribute information relating to a time when the program state is changed or an elapsed time after the state of the program is changed. Control management method.

9. The error occurrence time at which an error actually occurs is determined from the error detection time and the error detection time required to detect the error, and the status of the program stored in the trace buffer changes. The error source is identified by searching for the program that was being executed at the obtained error occurrence time, based on the attribute information such as the time when the error occurred or the time elapsed since the state of the program changed. A system control management method based on the error trace according to item 1.

10. A list of error detection times required to detect an error corresponding to the type of error that has occurred is obtained in advance, and the error detection time corresponding to the type of error that has occurred is searched from the list. The error detection time and the error detection time obtained from the above list are used to obtain the actual error occurrence time according to the type of error, and the time when the state change of the program stored in the trace buffer occurs or the state of the program is 2. The error trace source according to claim 1, wherein the program that was executed at the obtained error occurrence time is searched and obtained by attribute information such as the elapsed time since the change, and the error occurrence source is identified. System control management method.

11. The system control management method according to claim 1, wherein a list of resources used by the program is stored as attribute information in the trace buffer.

12. The system control management method according to claim 1, wherein the attribute information is stored in the trace buffer based on the occurrence of a context switch between processes.

13. The system control management method according to claim 1, wherein the attribute information is stored in the trace buffer based on the transition of the execution state of the CPU.

14. The transition of the execution state of the CPU is a transition of the execution state of the CPU from a user program to a system program or from a system program to a user program as a state change of the program. System control management method by error trace of the.

15. The transition of the execution state of the CPU is a transition of the execution state of the CPU from a non-privileged level to a privileged level or from a privileged level to a non-privileged level as a state change of a program. 13. A system control management method based on the error trace described in 13.

16. The transition of the execution state of the CPU is a timing at which the interrupt processing is activated or terminated, a timing at which the interrupt processing is activated and terminated, or an arbitrary time timing during the interrupt processing. 13. A system control management method based on the error trace described in 13.

17. The error trace according to claim 16, wherein the interrupt processing uses an error occurrence at the end of the input / output operation in the input / output device or a hardware error or an exception during the input / output operation as an interrupt factor. System control management method.

18. The transition of the execution state of the CPU is performed at the timing of activating a system call for transition from the user program to the system program or at the timing of executing termination processing from the system call, or at the activation of the system call. 14. The method for managing system control by error trace according to claim 13, wherein the timing is a timing at which the system call is terminated and a timing at which a termination process from the system call is executed, or an arbitrary time timing during the processing of the system call.

19. The system control management method according to claim 1, wherein the attribute information is stored as trace information by using an interface provided to the application program.

20. When storing attribute information as trace information by using an interface provided to an application program, a program counter value, a CPU state value, a time element, a trace event number, contents of various registers, a program name, a process A name, a message, an address space identifier, a process identifier, a program identifier, a process attribute information, a program attribute information, a message queue identifier, a message queue attribute information, or a combination thereof is used as an interface parameter. 21. A system control management method based on the error trace described in 19.

21. The system control management method by error tracing according to claim 1, wherein the attribute information regarding the change in the state of the program is stored in the trace buffer only when a predetermined factor occurs.

22. When a predetermined factor such as an interrupt type, a system call type, a program or process identifier, a program name or a process name or a function name, writing to, reading from, and executing a specific address area occurs. 22. The system control management method by error tracing according to claim 21, wherein the attribute information is stored in the trace buffer only in the case.

23. The error trace system control management method according to claim 1, wherein attribute information indicating a program state change is stored in a trace buffer provided in a hardware control circuit.

24. The error according to claim 23, wherein the interface circuit between the CPU and the main memory, the external bus control circuit, the local bus control circuit, the input / output control circuit, and the main memory are hardware control circuits. System control management method by trace.

25. When the execution of the certified program is stopped, the system is stopped, or the execution of the program is postponed and the cause of the error is recovered according to the type of the program recognized as the error source. 2. The system control management method by error trace according to claim 1, wherein the system is controlled and managed by executing the program and restarting and continuing the operation of the system.

26. When the program recognized as the error source is a user program or a user process, the resource is returned to the system management means that manages the resource used by the user program or the user process. The system is controlled and managed by performing the following.
A system control management method based on the error trace described in 5.

27. The system is controlled and managed by returning resources such as a memory area used by a program recognized as an error source or various management tables or input / output devices to each free resource management pool. Claim 2
6. A system control management method based on the error trace described in 6.

28. A system for controlling and managing the system by not returning the memory page in which an error has occurred or other resource in which an error has occurred in a program identified as an error source to each free resource management pool. Item 27. A system control management method by error tracing according to Item 26.

29. A system based on an error trace according to claim 25, wherein when the type of the program recognized as the error source is a system program such as an operating system, the system is controlled and managed by performing a stop processing of the entire system. Control management method.

30. When the program recognized as the error source is a user program or a user process, the processing of the user program or the user process and related programs or processes related to them is stopped and other processing is performed. 26. The system control management method by error trace according to claim 25, wherein the system is controlled and managed by continuously executing it.

31. Recovery from an error is performed by replacing a resource in which an error has occurred with a program identified as an error source with an alternative resource, and the system is restarted by continuing the process when the cause of the error is recovered. 26. The system control management method according to claim 25, wherein the system control is performed by controlling.