JPH01266637A - System monitor controller - Google Patents

System monitor controller

Info

Publication number
JPH01266637A
JPH01266637A JP63096118A JP9611888A JPH01266637A JP H01266637 A JPH01266637 A JP H01266637A JP 63096118 A JP63096118 A JP 63096118A JP 9611888 A JP9611888 A JP 9611888A JP H01266637 A JPH01266637 A JP H01266637A
Authority
JP
Japan
Prior art keywords
monitoring
program
monitor
arithmetic processing
processing unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63096118A
Other languages
Japanese (ja)
Inventor
Yukihisa Murata
幸久 村田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP63096118A priority Critical patent/JPH01266637A/en
Publication of JPH01266637A publication Critical patent/JPH01266637A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To monitor an operating state of a system, while operating its system and to execute a cause analysis and a recovery processing by on-line at the time when abnormality has been generated by providing an exclusive operation processor for executing only a monitor control of the system and storing a monitor program in an area of a main storage device. CONSTITUTION:A monitor program is started at a time interval which has been determined in advance at every monitor program by using a monitor 20, and an executing state of an execution program corresponding to the started program is monitored by a monitor operation processor 2 which has been connected to a main storage device 3. When abnormality has been detected, the cause is analyzed, and if it can be recovered, a recovery processing is performed, and when no abnormality is detected, regular monitoring is executed continuously. In such a way, when abnormality has been generated, the system is recovered by operating only the monitor operation processor 2, and other device than the device concerned of plural operation processors 11, 12 which have been connected to the main storage device 3 is allowed to continue a processing of a reference and updating.

Description

【発明の詳細な説明】 [産業上の利用分野] この発明は複数の演算処理装置が主記憶装置を共有して
動作するシステムの動作を監視し異常状態の回復処理を
制御するシステム監視制御装置に関するものである。
[Detailed Description of the Invention] [Industrial Application Field] The present invention provides a system monitoring and control device for monitoring the operation of a system in which a plurality of arithmetic processing units share a main memory and controlling recovery processing from an abnormal state. It is related to.

[従来の技術] 第4図は従来のこの種の装置を示すブロック図であって
、図において(11)は演算処理袋fI、(12)は演
算処理装置■、(3)は主記憶装置である。各演算処理
装置(11)、(12)は主記憶装置(3)からプログ
ラムとデータとを読み出し、プログラムに従ってデータ
処理を行い、処理結果のデータを主記憶装置(3)に書
き込む。
[Prior Art] Fig. 4 is a block diagram showing a conventional device of this type, in which (11) is an arithmetic processing bag fI, (12) is an arithmetic processing unit ■, and (3) is a main storage device. It is. Each arithmetic processing unit (11), (12) reads a program and data from the main storage device (3), performs data processing according to the program, and writes the data resulting from the processing to the main storage device (3).

このような動作を主記憶装置(3)の参照・更新という
が、主記憶装置(3)の参照・更新は各演算処理装置(
11)、(12)において行われ、異常状態発生に対処
するために予め記録に残しておくことが必要な記録はこ
の参照・更新の際に行われる。
This kind of operation is called referencing/updating the main memory (3), but referencing/updating the main memory (3) is performed by each arithmetic processing unit (
11) and (12), and records that need to be kept in advance in order to deal with the occurrence of an abnormal state are performed during this reference/update.

第5図は異常状態発生に対応して行われる処理を示すフ
ローチャートであり、図において(sll)〜(s 1
4)で示す各ブロックは各ステップを表す。異常状態に
対処するため予め残しておく記録も、異常状態の検出も
、これに対応する情報の採取も、システムの回復も、原
因の分析も、すべて各演算処理装置により実行される。
FIG. 5 is a flowchart showing the processing performed in response to the occurrence of an abnormal state, and in the figure, (sll) to (s1
Each block indicated by 4) represents each step. Records kept in advance to deal with abnormal conditions, detection of abnormal conditions, collection of corresponding information, system recovery, and cause analysis are all performed by each processing unit.

すなわちステップ(sll)で異常状態が発生したこと
を検出すると、その検出した演算処理装置が、ステップ
(s12)で情報採取を行いステップ(slB)でシス
テムを回復させ、ステップ(s14)で原因分析を行う
In other words, when it is detected that an abnormal state has occurred in step (sll), the arithmetic processing unit that detected it collects information in step (s12), recovers the system in step (slB), and analyzes the cause in step (s14). I do.

[発明か解決しようとする課題] 従来のシステムは以上のように構成されているので、異
常状態を検出した演算処理装置は実行中のプログラムを
中断して事故処理プログラムの実行に移らねばならず、
必要な場合にはシステムを一旦停止させないと、情報の
採取および原因の分析ができないという問題があった。
[Problem to be solved by the invention] Since the conventional system is configured as described above, the arithmetic processing unit that detects an abnormal condition must interrupt the program being executed and move on to executing the accident handling program. ,
There is a problem in that it is impossible to collect information or analyze the cause unless the system is temporarily stopped if necessary.

この発明は従来のものにおける上述の課題を解決するた
めになされたもので、異常状態の発生に当たっても、シ
ステムを停止することなく、原因の分析及び回復処理が
できるシステム監視制御装置を得ることを目的としてい
る。
This invention was made in order to solve the above-mentioned problems in the conventional system, and aims to provide a system monitoring and control device that can analyze the cause and perform recovery processing without stopping the system even when an abnormal state occurs. The purpose is

[課題を解決するための手段] この発明ではシステムの監視制御だけを行う専用の演算
処理装置を設け、この専用の演算処理装置が用いる監視
プログラムを主記憶装置の領域に格納することとした。
[Means for Solving the Problems] In the present invention, a dedicated arithmetic processing unit that performs only system monitoring control is provided, and a monitoring program used by this dedicated arithmetic processing unit is stored in an area of the main storage device.

[作用] 異常状態発生時には監視用演算処理装置が動作してシス
テムを回復するので、実行用演算処理装置の動作を停止
する必要がなくなる。
[Operation] When an abnormal state occurs, the monitoring processing unit operates to recover the system, so there is no need to stop the execution processing unit.

[実施例コ 以下、この発明の実施例を図面を用いて説明する。第1
図はこの発明の一実施例を示すブロック図で、図におい
て第4図と同一符号は同一または相当部分を示し、(2
)は監視用演算処理装置(これに対し演算処理装置(1
1)、 (12)を実行用演算処理装置という)、(4
)は主記憶装置内に設けられた監視プログラム格納領域
である。 第2図は監視プログラム格納領域(4)に格
納されているプログラムの内容を示す説明図で、(20
)は監視モニタ、(21)、(22)・・・(2n)は
各監視プログラムである。各監視プログラムは実行用演
算処理装置(11)、(12)で実行される各プログラ
ムにそれぞれ対応して設けられ、監視対象プログラムに
対応してきめ細かな監視が可能なように構成されている
[Embodiments] Hereinafter, embodiments of the present invention will be described with reference to the drawings. 1st
The figure is a block diagram showing an embodiment of the present invention. In the figure, the same reference numerals as in FIG. 4 indicate the same or corresponding parts.
) is the monitoring arithmetic processing unit (in contrast, the arithmetic processing unit (1
1), (12) are called execution arithmetic processing units), (4
) is a monitoring program storage area provided in the main memory. FIG. 2 is an explanatory diagram showing the contents of the program stored in the monitoring program storage area (4).
) is a monitoring monitor, and (21), (22), . . . (2n) are respective monitoring programs. Each monitoring program is provided corresponding to each program executed by the execution processing units (11) and (12), and is configured to enable detailed monitoring corresponding to the program to be monitored.

現時点でこのシステムにより実行されているプログラム
の種類は監視モニタ〈20)に知らされ、監視モニタ(
20)はこれに対応する監視プログラムを順次循環的に
起動してそれぞれ対応する実行プログラムを監視させる
The type of program currently being executed by this system is informed to the monitoring monitor (20), and the monitoring monitor (20)
20) sequentially and cyclically starts the corresponding monitoring programs to monitor the respective corresponding execution programs.

第3図はこの発明の監視制御の流れを示すフローチャー
トで、(Sl)〜(S6)の各ブロックは各ステップを
示す。ステップ(Sl)では、監視モニタ(20)が各
監視プログラム毎に予め定められている時間間隔で監視
プログラムを起動する。従って(s2)〜(s6)まで
のステップは起動された監視プログラムに就いての実行
ステップであり、所定の時間の経過後はステップ(Sl
)に戻り次の監視プログラムが起動される。 ステップ
(s2)では起動された監視プログラムに対応する実行
プログラムの実行状況を、主記憶装置内の当該実行プロ
グラムの進行状況について監視用演算処理装置(2)で
監視する。ステップ(S3)で異常が検出された時は、
監視用演算処理装置(2)がその監視プログラムに従っ
て(s4)。
FIG. 3 is a flowchart showing the flow of supervisory control according to the present invention, and each block (Sl) to (S6) represents each step. In step (Sl), the monitoring monitor (20) starts the monitoring programs at predetermined time intervals for each monitoring program. Therefore, steps (s2) to (s6) are execution steps for the activated monitoring program, and after a predetermined time has elapsed, step (Sl
) and the next monitoring program is started. In step (s2), the execution status of the execution program corresponding to the activated monitoring program is monitored by the monitoring arithmetic processing unit (2) with respect to the progress status of the execution program in the main storage device. When an abnormality is detected in step (S3),
The monitoring processing unit (2) follows the monitoring program (s4).

(s5)、(s6)の処理を行うが、ステップ〈s5)
における判定がNoの場合には、従来通りの通常の異常
処理に入る。(S4)による原因分析の結果(S5)に
おいて回復可能と判定されれば監視用演算処理装置(2
)がステップ(S6)の回復処理を実施するので、実行
用演算処理装置(11)、(12)は実行プログラムの
実行を停止する必要がない。すなわち、オンライン状態
で異常状態からの回復が可能になる。
(s5) and (s6) are performed, but step <s5)
If the determination in step 3 is No, normal abnormality processing as before is entered. If it is determined that recovery is possible as a result of the cause analysis (S5) in (S4), the monitoring arithmetic processing unit (2)
) carries out the recovery process in step (S6), so the execution arithmetic processing units (11) and (12) do not need to stop the execution of the execution program. In other words, it is possible to recover from an abnormal state while online.

なお上記実施例では、実行用演算処理装置の数が2個の
場合について説明したが、2個以上の場合においても監
視用演算処理装置1個で監視制御を行うことができる。
In the above embodiment, the case where the number of executing processing units is two has been described, but even in the case where there are two or more processing units, the monitoring control can be performed with one monitoring processing unit.

[発明の効果] 以上のようにこの発明によれば、システムを稼働させな
がらそのシステムの動作状態を監視することができ、異
常発生時の原因分析及び回復処理をオンラインで実行す
ることができるという効果がある。
[Effects of the Invention] As described above, according to the present invention, the operational status of the system can be monitored while the system is running, and cause analysis and recovery processing can be performed online when an abnormality occurs. effective.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図はこの発明の一実施例を示すブロック図、第2図
は第1図の監視プログラム格納領域の内容を示す説明図
、第3図はこの発明の監視制御の流れを示すフローチャ
ート、第4図は従来の装置を示すブロック図、第5図は
第4図のシステムで実行される監視制御の流れを示すフ
ローチャート。 (11)、(12)はそれぞれ実行用演算処理装置、(
2)は監視用演算処理装置、(3)は主記憶装置、(4
)は監視プログラム格納領域、(20)は監視モニタ、
(21)、(22>・・・(2n)はそれぞれ監視プロ
グラム。 なお、図中同一符号は同一または相当部分を示すものと
する。
FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is an explanatory diagram showing the contents of the monitoring program storage area of FIG. 1, and FIG. 3 is a flow chart showing the flow of monitoring control of the invention. FIG. 4 is a block diagram showing a conventional device, and FIG. 5 is a flowchart showing the flow of monitoring control executed in the system of FIG. 4. (11) and (12) are the execution arithmetic processing unit, (
2) is a monitoring arithmetic processing unit, (3) is a main storage device, and (4)
) is the monitoring program storage area, (20) is the monitoring monitor,
(21), (22>...(2n) are monitoring programs, respectively. Note that the same reference numerals in the figures indicate the same or corresponding parts.

Claims (1)

【特許請求の範囲】 複数個の演算処理装置とこの複数個の演算処理装置間で
共用される主記憶装置とを有するシステムの動作を監視
し異常状態の発生時に当該システムの動作を制御するシ
ステム監視制御装置おいて、当該システムの動作監視の
ため専用に設けられる監視用演算処理装置、当該システ
ムで実行される各プログラムにそれぞれ対応して設けら
れ、対応プログラムの実行を監視する各監視プログラム
、 この各監視プログラムを格納する主記憶装置の監視プロ
グラム格納領域、 当該システムで実行中のプログラムに対応する監視プロ
グラムを上記監視用演算処理装置に順次循環的に実行さ
せる監視モニタ、 上記監視用演算処理装置では実行中の監視プログラムの
制御に従い対応するプログラムの実行状況を監視し、異
常を検出したときは異常状態の原因分析および回復処理
を上記実行中のプログラムを停止することなく行う手段
、 を備えたことを特徴とするシステム監視制御装置。
[Scope of Claims] A system that monitors the operation of a system having a plurality of arithmetic processing units and a main storage device shared among the plurality of arithmetic processing units, and controls the operation of the system when an abnormal state occurs. In the monitoring control device, a monitoring arithmetic processing unit is provided exclusively for monitoring the operation of the system, each monitoring program is provided corresponding to each program executed in the system and monitors the execution of the corresponding program, A monitoring program storage area of a main storage device that stores each of the monitoring programs, a monitoring monitor that causes the monitoring arithmetic processing unit to sequentially and cyclically execute the monitoring program corresponding to the program being executed in the system, and the monitoring arithmetic processing unit. The device monitors the execution status of the corresponding program under the control of the running monitoring program, and when an abnormality is detected, the device is equipped with a means for analyzing the cause of the abnormal state and performing recovery processing without stopping the running program. A system monitoring and control device characterized by:
JP63096118A 1988-04-19 1988-04-19 System monitor controller Pending JPH01266637A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63096118A JPH01266637A (en) 1988-04-19 1988-04-19 System monitor controller

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63096118A JPH01266637A (en) 1988-04-19 1988-04-19 System monitor controller

Publications (1)

Publication Number Publication Date
JPH01266637A true JPH01266637A (en) 1989-10-24

Family

ID=14156468

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63096118A Pending JPH01266637A (en) 1988-04-19 1988-04-19 System monitor controller

Country Status (1)

Country Link
JP (1) JPH01266637A (en)

Similar Documents

Publication Publication Date Title
JPS62154037A (en) Virtual computer supervising and controlling system
JPH01266637A (en) System monitor controller
JPS6115239A (en) Processor diagnosis system
JP2712833B2 (en) Failure support equipment
JPH05241906A (en) Programmable controller
JP2842748B2 (en) Monitoring and control method for information processing equipment
JPH0196752A (en) Multi-processor device
JPS61813A (en) Deciding system for faulty area of sequence controller
JPH01183701A (en) Plant supervisory unit
JPH02207347A (en) Method for detecting fault of software
JPS63273901A (en) Plant controller
JPS61259302A (en) Abnormal time disposing deciding device for plant
JPH01134637A (en) Supervising system for information in stall processing system
JPH02129755A (en) Degeneration system for multiprocessor system
JPH03246638A (en) Program runaway detecting system
JPH0469744A (en) Runaway detector for microcomputer
JPS63237147A (en) Supervisory system for input/output actions in computer system
JPH04105197A (en) Process monitoring device
JPH04124735A (en) System starting system
JPS62274439A (en) Monitoring system for program under execution
JPH02310634A (en) System for supervising runaway of program
JPH01233503A (en) Fault diagnostic method for numerical controller
JPH05257745A (en) Runaway-monitoring system for microcomputer
JPS60171544A (en) Self-diagnosis device for abnormality of computer system
JPH02101546A (en) System for executing and supervising virtual computer