JPH05342058A

JPH05342058A - Process abnormality detection system

Info

Publication number: JPH05342058A
Application number: JP4150228A
Authority: JP
Inventors: Yoshimi Kagaya; 芳美加賀屋
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1992-06-10
Filing date: 1992-06-10
Publication date: 1993-12-24

Abstract

PURPOSE:To provide the high reliability of a system by performing processing while detecting the abnormal end of processes. CONSTITUTION:An application program (process) group 3 is constituted so as to be activated from a managing process 1 and further, a monitor process 2 activated from the managing process 1 is constituted so as to monitor the managing process 1 and to easily detect the abnormal end of the managing process 1 itself as well. Thus, since the abnormal end of processes can be detected and processing can be performed even after the abnormal end, the high-reliability system can be provided.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、例えばＵＮＩＸシス
テム（ＵＮＩＸはＡＴ＆Ｔ社の登録商標である）におけ
るプロセスの異常終了を検知するシステムに関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system for detecting abnormal termination of a process in, for example, UNIX system (UNIX is a registered trademark of AT & T Corporation).

【０００２】[0002]

【従来の技術】図３は、ＵＮＩＸシステムにおける、ア
プリケーションプログラムを作成した場合のプロセス構
成の図である。アプリケーションプロセス群４は、それ
ぞれ、システムにより起動されたプロセスの集合であ
る。次に動作について説明する。各アプリケーションプ
ロセス群４に対して、システムが異常を検知した場合、
システムは、プロセス群４に対して強制終了の割り込み
（ＵＮＩＸシステムでは、ＳＩＧＫＩＬＬというｓｉｇ
ｎａｌ）を発生させる。強制終了の割り込みを受けたプ
ロセスは、その割り込みをキャッチすることができない
ため、異常終了という形で終了させられてしまう。この
プロセスの異常終了を、アプリケーション側で検知する
ことができないことにより、そのプロセスが終了した場
合の処理が行えないため、このまま、システムを動作さ
せた場合、アプリケーションにより構築したシステムに
対して誤動作を引き起す結果となるおそれがでてくる。2. Description of the Related Art FIG. 3 is a diagram showing a process configuration when an application program is created in a UNIX system. The application process group 4 is a set of processes activated by the system. Next, the operation will be described. When the system detects an error for each application process group 4,
The system interrupts the process group 4 by a forced termination (in UNIX systems, a sig called SIGKILL).
nal) is generated. The process that received the interrupt for forced termination cannot catch the interrupt, so it is terminated in the form of abnormal termination. Abnormal termination of this process cannot be detected by the application, so processing when the process is terminated cannot be performed.Therefore, if the system is operated as it is, malfunction will occur in the system constructed by the application. There is a risk of causing this.

【０００３】[0003]

【発明が解決しようとする課題】従来のシステムにおい
ては、プロセスが強制終了させられた場合、そのプロセ
スの異常終了を検知することができないため、そのプロ
セス終了時の異常処理が行えないことによりその後のシ
ステムの正常動作が保障できないという問題点があっ
た。本発明は、上記のような問題点を解消するためにな
されたもので、プロセスの異常終了の検知を行えるプロ
セス管理ができるとともに、プロセス異常終了に対する
信頼性を向上させることのできるプロセス異常検出方式
を提供することを目的としている。In the conventional system, when a process is forcibly terminated, the abnormal termination of the process cannot be detected. Therefore, the abnormal processing at the end of the process cannot be performed. There was a problem that normal operation of the system could not be guaranteed. The present invention has been made in order to solve the above problems, and is a process abnormality detection method capable of performing process management capable of detecting abnormal termination of a process and improving reliability with respect to abnormal process termination. Is intended to provide.

【０００４】[0004]

【課題を解決するための手段】この発明に係るプロセス
異常検出方式は、例えば管理プロセスが各アプリケーシ
ョンプログラム（プロセス）を起動させ異常終了を検知
し、異常処理を行なうとともに、その管理プロセスの異
常終了に対しても検知できるような監視プロセスを備え
たものであり、以下の要素を有するものである。（ａ）以下の要素を有する管理プロセス、（ａ１）プロ
セスを起動させる起動手段、（ａ２）上記起動手段によ
り起動されたプロセスの異常終了を検知する検知手段、
（ａ３）上記検知手段により検知されたプロセスの異常
発生後の処理を行なう事後処理手段、（ｂ）上記起動手
段により起動され、上記管理プロセスの監視を行なう監
視プロセス。In the process abnormality detection method according to the present invention, for example, a management process activates each application program (process) to detect an abnormal termination, performs abnormal processing, and abnormally terminates the management process. It is equipped with a monitoring process that can detect even the following, and has the following elements. (A) a management process having the following elements, (a1) a starting means for starting the process, (a2) a detecting means for detecting an abnormal end of the process started by the starting means,
(A3) Post-processing means for performing processing after the occurrence of an abnormality in the process detected by the detection means, and (b) a monitoring process which is started by the starting means and monitors the management process.

【０００５】[0005]

【作用】この発明によるプロセス異常検出方式は、プロ
セスを管理プロセスにより起動させることにより、各プ
ロセスの異常終了に対する検知が可能となり、異常終了
発生後の処理が行えることになる。また、管理プロセス
の異常終了に対しても監視プロセスにより検知可能とな
る。このことにより、プロセスの異常終了を即時に検知
し異常処理を行なえるため信頼性の高いシステムを得る
事が出来る。In the process abnormality detecting method according to the present invention, by starting the process by the management process, the abnormal end of each process can be detected, and the processing after the abnormal end can be performed. Further, the abnormal termination of the management process can be detected by the monitoring process. As a result, abnormal termination of a process can be immediately detected and abnormal processing can be performed, so that a highly reliable system can be obtained.

【０００６】[0006]

【Example】

実施例１．図１は、本発明に係るプロセス構成図の一実
施例を示す図であり、１は管理プロセス、２は管理プロ
セス１の異常終了を監視するための監視プロセス、３は
各アプリケーションにより作成されたプロセス群であ
る。Example 1. FIG. 1 is a diagram showing an embodiment of a process configuration diagram according to the present invention. 1 is a management process, 2 is a monitoring process for monitoring abnormal termination of the management process 1, and 3 is created by each application. It is a group of processes.

【０００７】管理プロセス１は、システムからの起動時
に、監視プロセス２を起動させるようにする。一般に、
ＵＮＩＸシステムにおいて、あるプロセスより起動され
たプロセスは、親子というプロセス関係を結ぶことにな
る。この場合、起動したプロセスが親プロセスとなり、
起動されたプロセスが子プロセスとなる。従って、図１
において、管理プロセス１より起動されたプロセスはす
べて管理プロセスが親プロセスとなる。つまり、管理プ
ロセス１より起動された、プロセス群３の中のプロセス
Ａ、プロセスＢは、管理プロセス１を親とする子プロセ
スということになる。また、システム起動時に管理プロ
セス１により起動された監視プロセス２も同様に管理プ
ロセス１を親とする子プロセスである。The management process 1 activates the monitoring process 2 when the system is activated. In general,
In the UNIX system, a process activated by a certain process has a parent-child process relationship. In this case, the started process becomes the parent process,
The started process becomes a child process. Therefore, FIG.
In, all the processes started by the management process 1 are parent processes. That is, the processes A and B in the process group 3 started by the management process 1 are child processes having the management process 1 as a parent. The monitoring process 2 started by the management process 1 when the system is started is also a child process having the management process 1 as a parent.

【０００８】次に動作について、図２を用いて説明す
る。図２は、図１で示された各プロセスの処理を時間の
経過にともなって表わしたものである。点線は、親プロ
セスから子プロセス、または子プロセスから親プロセス
への動作をあらわす。まず、管理プロセス１はシステム
を起動させ、初めに監視プロセス３を起動させる。以
後、監視プロセス３は、ｇｅｔｐｐｉｄという親プロセ
スの存在を確認できるシステムコールを周期的に使用す
ることで親プロセスである管理プロセス１が異常終了し
ていないことを確認する。Next, the operation will be described with reference to FIG. FIG. 2 shows the processing of each process shown in FIG. 1 over time. The dotted line represents the operation from the parent process to the child process or from the child process to the parent process. First, the management process 1 activates the system, and first activates the monitoring process 3. After that, the monitoring process 3 periodically uses a system call called getppid, which can confirm the existence of the parent process, and confirms that the management process 1, which is the parent process, has not terminated abnormally.

【０００９】次に管理プロセスは、プロセスＡを起動さ
せ、続いてプロセスＢを起動させる。ここで、プロセス
Ａが異常終了した例を想定し、プロセスＢは、プロセス
Ａが正常終了した時にのみ更新処理を行なうものとす
る。プロセスＡは異常終了した事を管理プロセス１に知
らせる。異常終了を検知した管理プロセス１は、事後処
理を行なうよう、プロセスＢに通知する。通知を受けた
プロセスＢは、事後処理（たとえば更新結果を元にもど
す等の処理）を行ない、正常終了しその旨管理プロセス
１に通知する。その間監視プロセス２は、管理プロセス
１が動作している事を確認している。Next, the management process activates process A and subsequently process B. Here, assuming an example in which the process A abnormally ends, the process B performs the update processing only when the process A ends normally. The process A notifies the management process 1 of the abnormal termination. The management process 1 that has detected the abnormal termination notifies the process B to perform post-processing. The process B that has received the notification performs post-processing (for example, processing such as returning the update result to the original), completes normally, and notifies the management process 1 to that effect. Meanwhile, the monitoring process 2 confirms that the management process 1 is operating.

【００１０】この様にして管理プロセス１によって起動
されたアプリケーションプロセス群３の中のプロセスが
強制終了させられた場合、子プロセスは終了時に必ず親
プロセスに対して、その終了を通知することから、アプ
リケーションプロセス群３の中の各プロセスの異常終了
が必ず管理プロセス１に通知されることになり、異常終
了が検知可能となる。そして、異常終了が検知可能とな
るため、異常終了発生後の処理がタイミングよく行なえ
ることになる。さらに管理プロセス１が異常終了した場
合は、監視プロセス２が管理プロセス１の異常終了を検
知することができる。When a process in the application process group 3 activated by the management process 1 is forcibly terminated in this way, the child process always notifies the parent process of its termination at the time of termination. Abnormal termination of each process in the application process group 3 is always notified to the management process 1, and the abnormal termination can be detected. Since the abnormal end can be detected, the processing after the abnormal end occurs can be performed at a proper timing. Further, when the management process 1 ends abnormally, the monitoring process 2 can detect the abnormal end of the management process 1.

【００１１】実施例２．上記実施例１においては、ＵＮ
ＩＸシステムの場合を例にして説明したが、この発明
は、ＵＮＩＸシステムに限らず、あるプロセスが他のプ
ロセスの異常終了を検出できるシステムであれば適用す
ることができる。Embodiment 2. In the first embodiment, the UN
Although the case of the IX system has been described as an example, the present invention is not limited to the UNIX system, but can be applied to any system in which a process can detect abnormal termination of another process.

【００１２】[0012]

【発明の効果】以上のように、この発明によれば管理プ
ロセスによりプロセスの励動を行うように構成したので
アプリケーションプロセスの異常終了を検知できるよう
になり、又、管理プロセス自体の異常終了に対しても容
易に検知できるように構成したため、異常終了に対する
システムの高信頼性を容易に構築できる効果がある。As described above, according to the present invention, since the process is excited by the management process, the abnormal termination of the application process can be detected, and the abnormal termination of the management process itself can be detected. Since it is configured so that it can be easily detected, it is possible to easily establish high reliability of the system against abnormal termination.

[Brief description of drawings]

【図１】この発明の実施例１のプロセス構成を示した図
である。FIG. 1 is a diagram showing a process configuration according to a first embodiment of the present invention.

【図２】この発明の実施例１の動作を示した図である。FIG. 2 is a diagram showing the operation of the first embodiment of the present invention.

【図３】従来のプロセス構成を示した図である。FIG. 3 is a diagram showing a conventional process configuration.

[Explanation of symbols]

１管理プロセス２監視プロセス３アプリケーションプロセス群 1 Management process 2 Monitoring process 3 Application process group

Claims

[Claims]

1. A process abnormality detection method having the following elements: (a) a management process having the following elements, (a1) an activation means for activating a process, (a2) an abnormal termination of a process activated by the activation means. Detection means to detect,
(A3) Post-processing means for performing processing after occurrence of an abnormality in the process detected by the detection means, (b) Monitoring process started by the starting means and monitoring the management process.