JPS6376009A

JPS6376009A - Operation control system for computer system

Info

Publication number: JPS6376009A
Application number: JP61221370A
Authority: JP
Inventors: Yoshimasa Yamamoto; 芳正山本; Koji Miyajima; 宮嶋　宏二; Yutaka Kumazawa; 熊沢　裕; Shoji Goto; 後藤　昌治
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1986-09-19
Filing date: 1986-09-19
Publication date: 1988-04-06

Abstract

PURPOSE:To attain automation and unmanned operation at dispersed portions and a host portion by controlling and monitoring equipment in plural dispersed portions constituting a computer system by dispersed operation control devices arranged in respective portions. CONSTITUTION:Respective dispersed operation control devices 2 always monitor the state of the system except the period of control execution. A processing control part 34 receives the state change of the system from each system control part 33 and reports the state change to an overall operation control part 1 in accordance with a procedure set up in a storage part 32. When a fixed procedure against the generation of a fault is set up, the control part 34 outputs an operation control command to a system control part 33. Even if control is failed, the control part 34 reports the state to its master device in accordance with the procedure set up in the storage part 32. Consequently, the dipersed portions and the host portion can be automated and unmanned.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、コンピュータシステムの運転制御方式に関し
、特にネットワークを介して展開される複数の分散拠点
とホスト拠点から構成される大規模コンピュータシステ
ムの運転制御方式に関する。[Detailed Description of the Invention] [Field of Industrial Application] The present invention relates to an operation control method for a computer system, and particularly to a large-scale computer system consisting of a plurality of distributed bases and host bases deployed via a network. Concerning operation control methods.

[Conventional technology]

従来の運転制御方式では、複数のホストと複数のデバイ
スからなるセンター機器とシステムの電源設備・空調設
備とを、一元的に制御・監視していた。すなわち、コン
ピュータシステムがネットワークを介して接続されたホ
スト拠点と複数個の分散拠点から構成されている場合に
は、それぞれの拠点に監視装置が設けられ、各設備の起
動・停止・障害の監視等を各拠点内で独立して行ってい
たので、分＠拠点とホスト拠点とではその検知時刻にバ
ラツキが生じていた。なお、従来、この種のシステムの
運転制御方式に関連するものとしては、例えば特開昭５
９−１２８６５９号公報、特開昭５９−３０１６０号公
報にそれぞれ記載された監視方式が挙げられる。Conventional operation control methods centrally control and monitor the center equipment, which consists of multiple hosts and multiple devices, as well as the system's power supply equipment and air conditioning equipment. In other words, when a computer system consists of a host base and multiple distributed bases connected via a network, a monitoring device is installed at each base to monitor the startup, shutdown, failure, etc. of each piece of equipment. Since this was done independently within each base, there were variations in the detection time between the minute @ base and the host base. Incidentally, conventional methods related to the operation control method of this type of system include, for example, Japanese Patent Application Laid-open No. 5
The monitoring methods described in Japanese Patent Application Laid-open No. 9-128659 and Japanese Patent Application Laid-Open No. 59-30160 are examples.

[The problem that the invention is trying to solve]

上記従来の監視方式では、運転制御装置がホスト拠点ま
たは分散拠点内の制御・監視を行っているだけであった
。従って、コンピュータシステムがネットワークを介し
て接続されたホスト拠点と複数個の分散拠点とで構成さ
れている場合に、従来の監視方式では、コンピュータシ
ステム全体の制御・監視を行うという点ｔ：、、ついて
は配慮していなかった。すなわち、前述のように、ホス
１−拠点および分散拠点のシステム起動、システム停止
の時間にバラツキがあり、サービス時間に地域差が生じ
るため、分散拠点の障害発生がホスト拠点に伝オ）るの
に時間がかかり、障害回復が遅れる等の問題があった。In the conventional monitoring system described above, the operation control device only controls and monitors the host site or the distributed site. Therefore, when a computer system is composed of a host base and multiple distributed bases connected via a network, the conventional monitoring method controls and monitors the entire computer system. No consideration was given to this. In other words, as mentioned above, there are variations in the system start-up and system stop times at the host base and distributed bases, and regional differences in service times occur, so it is difficult for a failure at a distributed base to be transmitted to the host base. There were problems such as it took a long time to complete and recovery from failures was delayed.

例えば、銀行のオンラインシステム等では、各支店の設
備障害を本店に報告する際にも、報告の時間が連載てし
まうため、回復処置が遅くなり、結局は顧客１；対する
サービスの低下となる。また、近時、分散拠点の無人化
が要求されており、さらにホスト拠点の無人化に進む傾
向にある。システムの無人化が達成さ九た場合には、ど
うしてもコンピータシステム全体の一元的な監視方式が
必要となる。For example, in a bank's online system, etc., when reporting an equipment failure at each branch to the head office, the reporting times are serialized, which delays recovery measures and ultimately results in a decline in service to customer 1. Additionally, there has recently been a demand for unmanned distributed locations, and there is a trend toward even more unmanned host locations. If unmanned systems are to be achieved, a unified monitoring method for the entire computer system will be necessary.

本発明の目的は、このような従来の問題を改善し、１箇
所以上のホス１−拠点ど複数個の分散拠点で構成される
コンピュータシステム全体の制御・監視を一元的に行い
、分散拠点およびホスト拠点の自動化・無人化を実現で
きるコンピュータシステムの運転制御方式を提供するこ
とにある。The purpose of the present invention is to improve such conventional problems, to centrally control and monitor the entire computer system consisting of multiple distributed bases such as one or more host bases, and to The objective is to provide a computer system operation control method that can realize automation and unmanned host bases.

[Means for solving problems]

上記目的を達成するため、本発明によるコンピュータシ
ステムの運転制御方式は、各分散拠点には、該分散拠点
内のシステムの起動・停止・監視を行う制御手段および
上位装置との連絡手段を備えた分散運転制御装置を、ホ
スト拠点には、各分散運転制御装置との連絡手段、該ホ
スト拠点内のシステムの起動・停止・監視を行う制御手
段および該ホスト拠点と１箇所以上の分散拠点の起動手
順・停止手順・障害発生時の対応手順を記憶する記憶手
段を備えた統括運転制御装置を、それぞれ設けることに
特徴がある。In order to achieve the above object, the computer system operation control method according to the present invention is such that each distributed base is equipped with a control means for starting, stopping, and monitoring the system within the distributed base and a means for communicating with a host device. The distributed operation control device is installed at the host site, including means for communicating with each distributed operation control device, control means for starting, stopping, and monitoring systems within the host site, and activation of the host site and one or more distributed sites. A feature of each of these systems is that they are provided with integrated operation control devices each having a storage means for storing procedures, stopping procedures, and response procedures in the event of a failure.

[For production]

本発明においては、コンピュータシステムを構成する複
数の分散拠点内の設備の制御と監視を各拠点内の分散運
転制御装置により行い、それらの分ｉ！２運転制御装置
をホスト拠点内の統括運転制御装置がまとめて制御・監
視し、この統括運転制御装置から、あるいはその上位装
置から一元的に全体の制御と監視を行う。分散運転制御
装置は、従来の人手入力やタイマー人力に加えて、統括
運転制御装置および上位装置からの指示によってもシス
テムの起動、システムの停止等の制御を行い、また障害
検出時には統括運転制御装置および上位装置に報告をす
るので、分散拠点の自動化・無人化が可能となる。統括
運転制御装置は、記憶袋口に記憶したコンピュータシス
テム全体の起動手順、停止手順等に従って、各分散運転
制御装置との連絡手段を介して各分散運転制御装置に起
動・停止等の制御を指示するとともに、自身でホスト拠
点内のシステムの起動・停止等の制御を行うので、例え
ば、全ての分散拠点のシステムを立ち上げてからホスト
拠点のシステムを立ち上げるように制御することによっ
て、エンドユーザへのサービス開始時間を揃えることが
できる。また、統括運転制御装置の記憶装置に記憶され
た手順の作成・変更・選択実行およびコンピュータシス
テムの状態監視を、上位装置との連絡手段を介して上位
装置からも行うことができるので、ホスト拠点の自動化
・無人化が可能となる。In the present invention, the equipment in a plurality of distributed locations constituting a computer system is controlled and monitored by a distributed operation control device in each location, and the i! The two operation control devices are collectively controlled and monitored by the general operation control device in the host base, and the overall control and monitoring is performed centrally from this general operation control device or from its higher-level device. In addition to conventional manual input and timer input, the distributed operation control device performs system start-up, system stop, etc. control based on instructions from the central operation control device and higher-level devices, and when a failure is detected, the system is activated by the central operation control device. and reports to higher-level equipment, making it possible to automate and unmanned distributed bases. The central operation control device instructs each distributed operation control device to start, stop, etc. via the communication means with each distributed operation control device according to the startup procedure, stop procedure, etc. of the entire computer system stored in the memory bag opening. At the same time, since the end user controls the startup and stopping of the systems within the host base, for example, by controlling the systems to start up the systems at all distributed bases before starting the system at the host base, the end user service start times can be aligned. In addition, the creation, modification, and selection of procedures stored in the storage device of the central operation control device and the status monitoring of the computer system can also be performed from the host device via the means of communication with the host device. Automation and unmanned operation will become possible.

〔Example〕

以下１本発明の実施例を、図面により詳細に説明する。 EMBODIMENT OF THE INVENTION Below, one embodiment of the present invention will be described in detail with reference to the drawings.

第１図は１本発明の運転制御方式を適用したコンピュー
タシステムの一実施例を示す構成図である。ここでは、
１つのホスト拠点と複数の分散拠点１〜ｎ、さらに上位
装置３からなるコンピュータシステムを示しているが、
勿論複数のホスト拠点を鍔えたシステムにも適用可能で
ある。FIG. 1 is a block diagram showing an embodiment of a computer system to which the operation control method of the present invention is applied. here,
A computer system consisting of one host base, multiple distributed bases 1 to n, and a higher-level device 3 is shown.
Of course, it is also applicable to a system with multiple host locations.

第１図において、ホスト拠点には１，５〜７゜１０．１
１の各装置が、分散拠点１〜ｎにはそれぞれ２，８，９
．１２〜１４の各装置が、リモー１−センタ（上位装置
）には、３の装置が設けられる。■はホスト拠点の監視
・制御と各分散運転制御装置を制御する統括運転制御装
置、２は分散拠点を監視・制御する分散運転制御装置、
３は端末袋口あるいはパーソナルコンピュータ等の上位
装置、４は交換機、回線、加入者線等の回線網、５は５
，１４はホスト拠点および分散拠点に設けられる電源・
空調設備、６，７はホスト拠点に設けられた現用と待機
用のプロセッサ、８．９は分散拠点に設けられ５通信制
御回路を内蔵した現用と待機用の小型プロセッサ、１０
は大型プロセッサに接続さ九、公衆網または専用線を介
して通信を制御する通信制御装置、１１．１２は磁気デ
ィスク装置、光デイスク装置および半導体記憶装置等の
ＤＡＳＤ（Ｄｉｒｅｃｔ　　Ａｃｃｅｓｓ　　Ｄｅｖｉ
ｃｅ）　、　　１３は現用と待機用のプロセッサ８，９
を公衆網に切替え接続する回線切替え装置である。In Figure 1, the host base is 1.5~7°10.1
Each device in 1 is connected to 2, 8, and 9 devices in distributed bases 1 to n, respectively.
．． Each of the devices 12 to 14 is provided, and the remote 1-center (superior device) is provided with 3 devices. ■ is a central operation control device that monitors and controls the host base and controls each distributed operation control device; 2 is a distributed operation control device that monitors and controls the distributed bases;
3 is a terminal bag or a host device such as a personal computer, 4 is a line network such as an exchange, a line, a subscriber line, etc., and 5 is a 5
, 14 is the power supply installed at the host base and distributed bases.
Air conditioning equipment, 6 and 7 are active and standby processors installed at the host base, 8.9 are installed at distributed bases, and 5 are small processors for active and standby with built-in communication control circuits, 10
11.12 is a communication control device that is connected to a large processor and controls communication via a public network or private line; 11.12 is a DASD (Direct Access Device) such as a magnetic disk device, an optical disk device, and a semiconductor storage device;
ce), 13 are active and standby processors 8 and 9
This is a line switching device that switches and connects the network to the public network.

分散拠点１〜ｎにはそれぞれ分散運転制御装は２が配性
され、ホスト拠点に設けられた統括運転制御装置１と回
線網４を介して接続し５統括運転制御装置８！１からコ
ンピュータシステム全体の起動／停止等の定形的な制御
を自動的に行う。また、統括運転制御装置１を回線窮４
を介してリモートセンターの上位装置３に接続し、リモ
ートセンターの上位装置３から集中監視と障害発生時等
の非定型的制御を含めた集中制御を行う。第１図におい
て、各装置を接続する実線は情報伝送路であり、矢印の
付加された破線は運転制御装ｒａｌ、２からの制御線で
あり、一点鎖線は運転制御装置間の情報伝送路である。A distributed operation control device 2 is arranged in each of the distributed bases 1 to n, and is connected to the central operation control device 1 provided at the host base via a line network 4, and is connected to the computer system from the central operation control device 8!1. Automatically performs routine controls such as starting/stopping the entire system. In addition, the overall operation control device 1 is connected to the line
The remote center is connected to the host device 3 of the remote center via the host device 3, and the host device 3 of the remote center performs centralized control including centralized monitoring and atypical control when a failure occurs. In Figure 1, the solid lines connecting each device are information transmission paths, the broken lines with arrows are the control lines from the operation control devices ral and 2, and the dash-dotted lines are the information transmission paths between the operation control devices. be.

第２図は、統括運転制御装置および分散運転制御装置の
機能ブロック図であり、第３図は第２図の動作フローチ
ャートである。FIG. 2 is a functional block diagram of the integrated operation control device and the distributed operation control device, and FIG. 3 is an operation flowchart of FIG. 2.

統括運転制御装置１は、上位装ことのデータの送受信を
行う上位通信制御部２１、コンピュータシステム全体の
起動手順、停止手順、障害発生時の対応手順等を記憶す
る記憶部２２、定時起動あるいは時間監視のためのタイ
マー制御部２３、分散運転制御装に２とのデータの送受
信を行う下位通信制御部２４、ホス１へシステム内の電
源・空調設備の投入／切断、ＣＰＵおよびデバイスの電
源投入〆切断、デバイスのＣＰＵへの接続／切芝し、Ｃ
ＰＵのＩＰＬ／リセット、プログラムのスタート／スト
ップ等を制御するシステム制御部２５、システムの状態
表示とオペレータからの手動指示を制御する操作表示部
２６、および前記各部を統括制御する処理制御部２７か
ら構成される。The general operation control device 1 includes a higher-level communication control unit 21 that sends and receives data to and from higher-level equipment, a storage unit 22 that stores startup procedures, shutdown procedures, response procedures in the event of a failure, etc. of the entire computer system, and a regular startup or timer system. A timer control unit 23 for monitoring, a lower-level communication control unit 24 that sends and receives data to and from the distributed operation control unit 2, turns on/off the power supply and air conditioning equipment in the system to the host 1, and turns on the power of the CPU and devices. Disconnect, connect/cut device to CPU, C
From a system control unit 25 that controls PU IPL/reset, program start/stop, etc., an operation display unit 26 that controls system status display and manual instructions from the operator, and a processing control unit 27 that centrally controls each of the above units. configured.

また１分散運転制御装［２は、Ｒ括運転制御装＝１との
データの送受信を行う上位通信制御部３１、分散拠点内
システムの起動手順、停止手順、障害発生時の対応手順
等を記憶する記憶部３２゜分散システム内の電源・空調
設備の投入／切断、ＣＰＵおよびデバイスの電源投入／
切断、デバイスのＣＰＵへの接続〆切離し、ＣＰＵのＩ
ＰＬ／リセット、プログラムのスタート／ストップ等を
制御するシステム制御部３３および前記各部を統括制御
する処理制御部３４から構成される。分散拠点内のシス
テム運転に自由度を持たせる場合には、統括運転制御装
置１における操作表示部２Ｇ、タイマー制御部２３を付
加してもよい。また、複合ネットワークに対しては、統
括運転制御装置１の上位装置を統括運転制御装置１とし
、多諧Ｒり化してこれに対応する。In addition, the 1 distributed operation control unit [2 stores the upper communication control unit 31 that sends and receives data with the R integrated operation control unit 1, the startup procedure of the distributed base system, the stop procedure, the response procedure when a failure occurs, etc. Storage unit 32゜Turns on/off the power supply and air conditioning equipment in the distributed system, turns on/off the power of the CPU and devices
Disconnect, disconnect the device from the CPU, disconnect the device from the CPU
It is comprised of a system control section 33 that controls PL/reset, program start/stop, etc., and a processing control section 34 that centrally controls each of the above sections. If a degree of freedom is to be given to system operation within a distributed base, an operation display section 2G and a timer control section 23 may be added to the integrated operation control device 1. Moreover, for a complex network, the higher-level device of the integrated operation control device 1 is set as the integrated operation control device 1, and the network is multimodal to cope with this.

コンピュータシステム全体の起動手順、停止手順、障害
発生時の対応手順等の設定・変更は、通常、上位装置３
から統括運転制御装ｒｒＬ１の上位通信制御部２１を介
して処理制御部２７にデータとして送られ、記憶部２２
に収納される（処理１０１゜１０２）、各分散拠点内シ
ステムの起動手順、停止手順、障害発生時の対応手順等
は、上位装置３から統括運転制御装置１の上位通信制御
部２１を介して処理制御部２７にデータとして送られた
後、処理制御部２７により下位通信制御部２４を介して
分散運転制御装置２に転送され、分散運転制御装置２の
上位通信制御部３１を介して処理制御部３４に送られ、
記憶部３２に収納される（処理１０３．１０４，１０５
）。制御手順の設定・変更は、統括運転制御装置ｉの操
作表示部からも行うことができる（処理１０６）。Settings and changes such as startup procedures, shutdown procedures, and response procedures in the event of a failure for the entire computer system are normally performed by the host device 3.
is sent as data to the processing control unit 27 via the upper communication control unit 21 of the integrated operation control unit rrL1, and is stored in the storage unit 22.
(Processes 101 and 102), the startup procedure, stop procedure, response procedure when a failure occurs, etc. of the systems within each distributed base are transmitted from the host device 3 via the host communication control unit 21 of the integrated operation control device 1. After being sent as data to the processing control unit 27, it is transferred by the processing control unit 27 to the distributed operation control unit 2 via the lower-level communication control unit 24, and is processed and controlled via the upper-level communication control unit 31 of the distributed operation control unit 2. Sent to Department 34,
Stored in the storage unit 32 (processes 103, 104, 105
). Setting and changing of the control procedure can also be performed from the operation display section of the integrated operation control device i (process 106).

次に、定型的な制御の一例として、システム起動の動作
フローを述べる。Ｒ括運転制御装口１は、通常タイマー
制御部２３からの定時起動によりコンピュータシステム
の起動を開始する（処理１０７）、なお、上位装置３ま
たは操作表示部２Ｇからのオペレータの指示により、開
始することもできる（処理１０８，１０９）、統括運転
制御装置１の処理制御部２７は、開始指示を受けると、
記憶部２２に設定されたシステム起動の手順に従って５
システム制御部２５および下位通信制御部２４を介して
各分散運転制御装置２に運転制御指示を送出する（処理
１１０，１１１，１．１２）。Next, the operational flow of system startup will be described as an example of routine control. The R integrated operation control device 1 normally starts the startup of the computer system by a scheduled startup from the timer control section 23 (process 107), and is started by an operator's instruction from the host device 3 or the operation display section 2G. When the processing control unit 27 of the integrated operation control device 1 receives the start instruction (processes 108 and 109),
5 according to the system startup procedure set in the storage unit 22.
Operation control instructions are sent to each distributed operation control device 2 via the system control unit 25 and lower-level communication control unit 24 (processes 110, 111, 1.12).

システム起動の手順の一例を次に示す。An example of the system startup procedure is shown below.

（イ）ホストシステム内の電源・空調設備５の投入（処
理１１３）、（ロ）ホストシステム内の温度・湿度のチ
ェック（処理１１４）、（ハ）ホストシステム内プロセ
ッサ６．７および通信制御装置１０、ＤＡＳＤＩｌ、そ
の他の電源投入・チェック（処理１１５）。（ニ）各分
散運転制御装置２へのシステム起動指示（処理１１６）
、（ホ）ホストシステム内の通信制御装置１０およびＤ
ＡＳＤＩｌ、他のオンライン運転中のプロセッサ（例え
ば６）への接続・チェック（処理１１７）、（へ）ホス
トシステム内のオンライン中プロセッサのＩ　Ｐ　Ｌ　
（Ｉ　ｎｉｔｉａｌＰｒｏｇｒａｍ　　Ｌｏａｄｉｎｇ
）　・チェック（処理１１８）。(b) Turn on the power supply and air conditioning equipment 5 in the host system (process 113), (b) check the temperature and humidity in the host system (process 114), (c) processor 6.7 and communication control device in the host system 10. Power on/check DASDIl and others (processing 115). (d) System startup instruction to each distributed operation control device 2 (process 116)
, (e) Communication control device 10 and D in the host system
ASDIl, connection/check to other online processors (e.g. 6) (process 117), (to) I P L of the online processor in the host system
(I initialProgram Loading
) - Check (processing 118).

（ト）各分散運転制御装置２からのシステム起動結果の
チェック（処１！ｌＮ１９）、（チ）ホストシステム内
のオンライン用プロセッサのオンラインプログラムのス
タート指示・チェック（処理１２０）である。(G) Checking the system startup results from each distributed operation control device 2 (Process 1!1N19); (H) Instructing and checking the start of the online program of the online processor in the host system (Process 120).

次に、各分散運転制御装置２の処理制御部３４は、前記
（ニ）のシステム起動指示を統括運転制御装置１から上
位通信制御部３１を介して受は取ると（処理１２１）、
記憶部３２に設定されたシステム起動の手順に従って、
システム制御部３３に運転制御指示を出す（処理１２２
）。分子１１拠点内システムのシステム起動の手順の一
例を次に述べる。Next, the processing control unit 34 of each distributed operation control device 2 receives the system startup instruction (d) from the integrated operation control device 1 via the upper communication control unit 31 (process 121).
According to the system startup procedure set in the storage unit 32,
Issues an operation control instruction to the system control unit 33 (processing 122
). An example of the procedure for starting up the system within the Molecule 11 site will be described below.

（イ）電源・空調設備１４の投入（処理１２３）、（ロ
）プロセッサ８，９、ＤＡＳＤ１２、および回線切替え
装置１３の電源投入・チェック（処理１２４）、（ハ）
回線切替え装置１３のプロセッサ８への接続・チェック
（処理１２５）、（ニ）プロセッサ８，９のｒＰＬ・プ
ログラムスタートとチェック（処理１２６）である。(b) Turn on the power supply/air conditioning equipment 14 (process 123), (b) turn on/check the power to the processors 8, 9, DASD 12, and line switching device 13 (process 124), (c)
These are the connection and check of the line switching device 13 to the processor 8 (process 125), and (d) rPL/program start and check of the processors 8 and 9 (process 126).

制御が不成功に終った場合にも、記憶部３２に設定され
た手順に従って、統括運転制御装置１に報告する（処理
１３１，１３２）。Even if the control is unsuccessful, it is reported to the integrated operation control device 1 according to the procedure set in the storage section 32 (processes 131, 132).

各分散運転制御装置２は、制御実行中の場合を除いて、
常にシステムの状態監視を行っている。Each distributed operation control device 2, except when control is being executed,
System status is constantly monitored.

処理制御部３４は、システムの状態変化をシステム制御
部３３から受は取ると、記憶部３２に設定された手順に
従って、統括運転制御装に１に報告する（処理１３３，
１３４）。障害発生に対して定型的な手順が設定されて
いる場合には、手順に従ってシステム制御部３３に運転
制御指示を出す（処理１３５，１３６）。次に、分散拠
点内のプロセッサ８の障害発生に対する手順の一例を挙
げる。When the processing control unit 34 receives a change in the state of the system from the system control unit 33, it reports it to the general operation control unit 1 according to the procedure set in the storage unit 32 (processing 133,
134). If a routine procedure has been set for the occurrence of a failure, an operation control instruction is issued to the system control unit 33 according to the procedure (processes 135, 136). Next, an example of a procedure for the occurrence of a failure in the processor 8 in a distributed base will be described.

（イ）プロセッサ８のプログラムをストップする（処理
１３７）、回線切替え装置１３に対してプロセッサ８か
らの切難しとプロセッサ９への接続を行う（処理１３８
）。(b) Stop the program of the processor 8 (process 137), send a request from the processor 8 to the line switching device 13, and connect to the processor 9 (process 138).
).

制御が不成功に終った場合にも、記憶部３２に設定され
ている手順に従って、上位装置３に報告する（処理１４
１，１４２）。Even if the control is unsuccessful, it is reported to the host device 3 according to the procedure set in the storage unit 32 (processing 14).
1,142).

また、統括運転制御装置１も、ホストシステムの状１ｒ
５監視を行っており、処理制御部３７はホストシステム
内の状態変化をシステム制御部２５がら受は取ると、記
憶部２２に設定された手順に従って上位袋に３への報告
を行い（処理１５１，１５２，１５３）、操作表示部２
６に表示を行う（処理１５４）。障害発生に対して定型
的な手順が設定されている場合には、その手順に従って
システム制御部２５に運転制御指示を出す（処理１６１
゜１６２）。統括運転制御装置１は、タイマー制御部２
３からの指示により分散運転制御装置２の状態チェック
を行う（処理１７１）。分散運転制御装置の障害を検出
すると、処理制御部２７は、記憶部２２に設定された手
順に従って、上位袋に３への報告を行い、操作表示部２
６に表示を指示する（処理１８１，１８２，１８３）。In addition, the overall operation control device 1 also controls the state of the host system 1r.
5 monitoring, and when the processing control unit 37 receives the status change in the host system from the system control unit 25, it reports it to the upper level bag 3 according to the procedure set in the storage unit 22 (processing 151). , 152, 153), operation display section 2
6 (processing 154). If a routine procedure is set for the occurrence of a failure, an operation control instruction is issued to the system control unit 25 according to the procedure (process 161).
゜162). The overall operation control device 1 includes a timer control section 2
The status of the distributed operation control device 2 is checked according to instructions from the controller 3 (process 171). When detecting a failure in the distributed operation control device, the processing control unit 27 reports it to the upper level bag 3 according to the procedure set in the storage unit 22, and displays the operation display unit 2.
6 to display (processes 181, 182, 183).

例えば、銀行オンラインシステムに適用する場合には、
上位装置３は、定型的な電源・空調設備の運転時間を、
平日、土曜日、休日のいずれかにより、本店（ホストシ
ステム）には９時〜５時、９時〜３時、１０時〜１２時
を指示し、支店（分散システム）には、９時〜３時、９
時〜１２時、非動作等を指示する等、任意に指令を出す
ことができる。For example, when applied to a bank online system,
The host device 3 calculates the operation time of standard power supply and air conditioning equipment,
Depending on weekdays, Saturdays, or holidays, the head office (host system) is instructed to operate from 9:00 to 5:00, 9:00 to 3:00, and 10:00 to 12:00, and branches (distributed system) are instructed to operate from 9:00 to 3:00. hour, 9
From 12:00 to 12:00, any command can be issued, such as instructing non-operation.

また、第１図において１回Ｌｉｃ網４のうち、ホスト拠
点と分散拠点とを結ぶ情報伝送路には専用線を用い、統
括運転制御表５１１と上位袋Ｅ１３の間、統括運転制御
装置１と分散運転制御装置２間の各情報伝送路には公衆
回線を用いることができる。In addition, in FIG. 1, in the LIC network 4, a dedicated line is used for the information transmission path connecting the host base and the distributed base, and between the central operation control table 511 and the upper bag E13, the central operation control device 1 and A public line can be used for each information transmission path between the distributed operation control devices 2.

なお、障害監視時に、ある分散拠点のプロセッサが全て
ダウンしたときには、隣の分散拠点からサーチするよう
にホストｍ魚から指示することができる。Note that during failure monitoring, if all the processors at a certain distributed base go down, the host can instruct the host to search from the neighboring distributed base.

このように、本実施例においては、コンピュータシステ
ム全体の運転制御・監視を一元的に行うことができると
ともに、次のような利点がある。In this way, in this embodiment, the operation control and monitoring of the entire computer system can be performed in a unified manner, and there are the following advantages.

（ａ）ｇ括運転制御装置（分散運転制御袋ｒｌｉ）は、
上位装置からの指示だけでなく、タイマー、オペレータ
の指示により、制御を開始できるので、自由度の高い運
転制御方式を実現できる。(a) The g collective operation control device (distributed operation control bag rli) is
Since control can be started not only by instructions from the host device but also by instructions from a timer or operator, a highly flexible operation control method can be realized.

（ｂ）定型的な手順は、全て各サブシステムの中で閉じ
た形で対処できるので、各サブシステムでは早いサービ
ス（制御・障害対処）が受けられる。(b) Since all routine procedures can be handled in a closed manner within each subsystem, each subsystem can receive quick service (control/failure handling).

（ｃ）統括運転制御装置１の上位装置として、別の統括
運転制御装置を用いれば、システムの多段階店化に簡単
に対処できる。(c) If another integrated operation control device is used as a higher-level device of the integrated operation control device 1, it is possible to easily deal with multi-level system development.

（ｄ）非定型的な手順は、上位階層に上げることにより
、高い判断と対処が期待できる。(d) By elevating non-routine procedures to a higher level, better judgment and handling can be expected.

（ｅ）システムの変化を上位層に集めることにより、最
上位層では、システム全体の状態を見ながら適確な判断
ができる。(e) By collecting changes in the system in the upper layer, the highest layer can make appropriate decisions while looking at the overall system status.

〔Effect of the invention〕

以上説明したように、本発明によれば、ネットワークを
介して接続された１つ以上のホスト拠点と復数個の分散
拠点から構成されるコンピュータシステム全体の運転制
御および障害監視を、一元的に行うことができるので、
（、）分散拠点およびホスト拠点の自動化、無人化運転
を実現でき、（ｂ）サービス時間の分散拠点によるバラ
ツキがなくなり、（ｃ）障害回復時間が短縮できる。ま
た、（ｄ）制御手順を１ｆＩ！Ｊ所から設定・変更でき
るので。As explained above, according to the present invention, the operation control and fault monitoring of the entire computer system consisting of one or more host bases and several distributed bases connected via a network can be centrally performed. Because it can be done
(,) Automation and unmanned operation of distributed bases and host bases can be realized, (b) variations in service time between distributed bases can be eliminated, and (c) fault recovery time can be shortened. Also, (d) the control procedure is 1fI! You can set and change it from J place.

システム変更・運用変更に対して、簡単に、かつ誤りな
く対処できる。Able to deal with system changes and operational changes easily and without errors.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すコンピュータシステム
のブロック図、第２図は第１図の統括運転制御装置と分
散運転制御装置の機能ブロック図５第３図は第２図の動
作フローチャートである。１：Ｒ括運転制御装置、２：分散運転ｆＩＪ御装置、３
：上位装置、４：回線網、５，１４：電源・空調設備、
６〜９：プロセッサ、１ｏ：通信制御装置、１１．Ｉ　
２　：ＤＡＳＤ、１３：回線切替え装置、２１，３ｔ：
上位通信制御部、２２，３２：記憶部、２７，３１処理
制御部、２５，３３ニジステム制御部、２４：下位通信
制御部、２３：タイマー制御部、２６：Ｓ作表承部。第　　　　　１　　　　　図 □：情報伝送路一−−−：運転制御装置からの制御線 −・−一一二運転制御装置間の情報伝送路筒　　　　　
２　　　　　図上位装詮Fig. 1 is a block diagram of a computer system showing an embodiment of the present invention, Fig. 2 is a functional block diagram of the integrated operation control device and distributed operation control device shown in Fig. 1, and Fig. 3 is an operation flowchart of Fig. 2. It is. 1: R integrated operation control device, 2: Distributed operation fIJ control device, 3
: Upper device, 4: Line network, 5, 14: Power supply/air conditioning equipment,
6-9: Processor, 1o: Communication control device, 11. I
2: DASD, 13: Line switching device, 21,3t:
Upper communication control section, 22, 32: Storage section, 27, 31 Processing control section, 25, 33 System control section, 24: Lower communication control section, 23: Timer control section, 26: S tabulation reception section. Figure 1 □: Information transmission line 1 ---: Control line from the operation control device ---112 Information transmission line between the operation control devices
2 Upper figure layout

Claims

[Scope of Claims] 1. In a computer system in which one or more host locations including one or more processing devices are connected to one or more distributed locations including one or more processing devices, each distributed location has a , a distributed operation control device equipped with a control means for starting, stopping, and monitoring the system within the distributed base and a means of communication with a host device, and a host base with a means of communication with each distributed operation control device and the host An integrated operation control device equipped with a control means for starting, stopping, and monitoring systems within a base, and a storage means for storing startup procedures, shutdown procedures, and response procedures in the event of a failure for the host base and one or more distributed bases. An operation control method for a computer system characterized by providing each of the following. 2. The distributed operation control device stores the startup procedure, shutdown procedure, and response procedure when a failure occurs for the system within the distributed base,
2. An operation control method for a computer system according to claim 1, wherein the settings, changes, and selective execution of the procedures are performed in accordance with instructions from an integrated operation control device. 3. The above-mentioned integrated operation control device is characterized by setting, changing, and selectively executing procedures stored in the storage means according to instructions from the host device, and reporting the status of the computer system to the host device. An operation control method for a computer system according to claim 1 or 2.