JPS63238652A

JPS63238652A - Operation managing system for distributed processing system

Info

Publication number: JPS63238652A
Application number: JP7323487A
Authority: JP
Inventors: Kingo Tsugawa; 津川　金吾
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1987-03-26
Filing date: 1987-03-26
Publication date: 1988-10-04

Abstract

PURPOSE: To prevent a drop in the throughput of the whole system by monitoring the usage state of each shared resource at the node that owns it. CONSTITUTION: A control table 2, which records for each node the users 4, 5 of a shared resource 1 and their usage time, and a resource monitoring mechanism 3, which monitors the contents of the control table 2, are prepared in advance. When someone requests use of the shared resource 1, that user's name is recorded in the control table 2. When a user is already recorded and another person requests use of the shared resource 1, that person's name and waiting time are recorded in the control table 2 as a waiting user. The monitoring mechanism 3 checks the control table 2 at a prescribed time interval and, when the waiting time exceeds a prescribed time, issues a warning to the waiting user. In this way, a drop in the throughput of the whole system can be prevented.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to an operation management method for a distributed processing system having a plurality of commonly used shared resources.

[Conventional technology]

In a conventional distributed processing system having a plurality of commonly used shared resources, detecting that the system has entered a deadlock state relies on providing a single management mechanism for all of the shared resources and having that mechanism monitor every user's usage of the shared resources.

[Problem that the invention seeks to solve]

Because the conventional operation management method described above monitors the resources of the entire distributed processing system centrally with a single management mechanism, it suits a vertical distributed processing system consisting of a central processing unit (host) and terminal devices. In a horizontal distributed processing system having multiple hosts, however, a failure of the node to which the management mechanism is assigned is enough by itself to stop the entire system. In addition, the information needed to operate the management mechanism, on top of the user information carried over the system network, concentrates at that node, which lowers the throughput of the entire distributed processing system.

An object of the present invention is to eliminate the above drawbacks of the conventional operation management method by providing an operation management method for a distributed processing system that distributes the resource management load, prevents the entire system from stopping, and prevents a drop in the throughput of the entire system.

[Means for solving problems]

In the operation management method for a distributed processing system of the present invention, each node that holds a resource shared within the system has a control table that records the usage status of that shared resource, and a resource monitoring mechanism.

More specifically, the operation management method of the present invention is an operation management method for a distributed processing system having a plurality of nodes that hold shared resources. For each such node, a control table that records the names of the users of the shared resource and their usage times, and a resource monitoring mechanism that monitors the contents of the control table, are provided. When someone requests use of the shared resource, that user's name is recorded in the control table. When a user is already recorded and another person requests use of the shared resource, that person's name and waiting time are recorded in the control table as a waiting user. The monitoring mechanism checks the control table at fixed time intervals and, when the waiting time exceeds a predetermined value, issues a warning to the waiting user.
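The recording rules above can be illustrated with a minimal Python sketch. The patent does not specify a data layout, so the class name `ControlTable`, its fields, and the status strings are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ControlTable:
    """Per-node table for one shared resource (hypothetical layout)."""
    holder: Optional[str] = None   # name of the user currently using the resource
    usage_time: int = 0            # how long the holder has used the resource
    waiting: Dict[str, int] = field(default_factory=dict)  # waiting user -> waiting time

    def request(self, user: str) -> str:
        """Record a usage request and report the user's resulting status."""
        if self.holder is None or self.holder == user:
            # No user is recorded yet (or the holder asks again): record as the user.
            self.holder = user
            return "in use"
        # A user is already recorded: register the requester as a waiting user.
        # setdefault keeps an existing waiting time if the user asks twice.
        self.waiting.setdefault(user, 0)
        return "waiting"


table = ControlTable()
print(table.request("user4"))  # in use
print(table.request("user5"))  # waiting
print(table.waiting)           # {'user5': 0}
```

Keeping the holder and the waiting users in one table per owning node mirrors the text: the table is local to the node that owns the resource, so no central mechanism needs to see these entries.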

[Example]

Next, an embodiment of the present invention will be described with reference to the drawings.

FIG. 1 is a configuration diagram showing the configuration of a node that holds a shared resource in a distributed processing system to which an embodiment of the present invention is applied. FIG. 2 is an explanatory diagram showing a state in which the distributed processing system having the node of FIG. 1 is operating and the shared resource is being used.

In FIG. 1, 1 is a shared resource, 2 is a control table that records the usage status of the shared resource, and 3 is a resource monitoring mechanism. In FIG. 2, 2 is the control table, 4 is a user who is using the shared resource, and 5 is a user who is waiting for the shared resource to become available.

In FIGS. 1 and 2, while no one is using the shared resource 1, nothing has been written to the control table 2, so it is completely blank, and the resource monitoring mechanism 3 issues no warning signal even though it checks the table's contents at fixed intervals. When user 4 requests use of the shared resource 1 in this state, the name of user 4 is recorded in the control table 2 and the shared resource 1 becomes in use. When another user 5 subsequently requests use of the shared resource 1, the name of user 5 and a waiting time of "0" are recorded in the control table 2, and user 5 becomes a waiting user. From then on, the resource monitoring mechanism 3 checks the control table 2 at fixed time intervals and, as long as the waiting user 5 remains in the control table 2, adds a fixed value to that waiting time. When the time spent waiting for the shared resource 1 exceeds the amount the distributed processing system allows, the resource monitoring mechanism 3 issues a warning to the waiting user 5.
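The periodic check performed by the resource monitoring mechanism 3 might look like the following Python sketch. The function name `check_table`, the dictionary representation of the waiting entries, and the values `interval=5` and `limit=12` are assumptions for illustration; the patent states only that a fixed value is added on each check and that a warning is issued once the allowed waiting time is exceeded.

```python
from typing import Dict, List


def check_table(waiting: Dict[str, int], interval: int, limit: int) -> List[str]:
    """Run one periodic check and return the waiting users who should be warned."""
    warned = []
    for user in waiting:
        waiting[user] += interval      # add a fixed value to the waiting time
        if waiting[user] > limit:      # waiting time exceeds the allowed amount
            warned.append(user)
    return warned


waiting = {"user5": 0}                 # user 5 is waiting behind user 4
for tick in range(1, 4):
    warned = check_table(waiting, interval=5, limit=12)
    print(tick, waiting["user5"], warned)
# 1 5 []
# 2 10 []
# 3 15 ['user5']
```

With an empty table the function returns an empty list, which corresponds to the case in which no one uses the shared resource and no warning signal is issued.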

As described above, because the distributed processing system monitors the usage of each shared resource at the node that owns it, the resource management load is distributed. Even if one of the nodes owning a shared resource fails on the system network, only the part related to that shared resource stops operating; the remaining shared resources stay available as before and the system keeps running. Moreover, the system does not fall into a deadlock that would make operation of the whole system impossible, and since the only information travelling through the distributed processing system is the user information that is inherently required, a drop in throughput can be suppressed.
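The failure-isolation argument can be illustrated with a small Python sketch in which each owning node keeps its own state. The class `OwnerNode`, the `alive` flag, and the node and resource names are hypothetical; the patent does not describe how a node failure is detected or reported.

```python
from typing import Dict, Optional


class OwnerNode:
    """A node that owns one shared resource and tracks its usage locally."""

    def __init__(self, resource: str) -> None:
        self.resource = resource
        self.alive = True
        self.holder: Optional[str] = None  # stands in for the node-local control table

    def request(self, user: str) -> str:
        if not self.alive:
            return "unavailable"           # only this node's resource is lost
        if self.holder is None:
            self.holder = user
            return "in use"
        return "waiting"


nodes: Dict[str, OwnerNode] = {
    "node_a": OwnerNode("resource_a"),
    "node_b": OwnerNode("resource_b"),
}
nodes["node_a"].alive = False              # simulate a failure of one owning node

print(nodes["node_a"].request("user4"))    # unavailable
print(nodes["node_b"].request("user4"))    # in use
```

Because no node holds monitoring state for another node's resource, a failure of `node_a` leaves requests to `node_b` unaffected, which is the property the paragraph above relies on.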

[Effect of the invention]

As explained above, using the operation management method for a distributed processing system of the present invention has the effect, particularly in a horizontal distributed processing system, of preventing the entire system from stopping even if some nodes fail, and also has the effect of preventing a drop in the throughput of the entire system.

[Brief explanation of the drawing]

FIG. 1 is a configuration diagram showing the configuration of a node holding a shared resource according to an embodiment of the present invention, and FIG. 2 is an explanatory diagram showing the state of the control table while the distributed processing system of FIG. 1 is operating and using the shared resource.

1: shared resource; 2: control table; 3: resource monitoring mechanism; 4, 5: users.

Claims

[Claims]

An operation management method for a distributed processing system having a plurality of nodes that hold shared resources, comprising: providing, for each such node, a control table that records the names of the users of the shared resource and their usage times, and a resource monitoring mechanism that monitors the contents of the control table; when someone requests use of the shared resource, recording that user's name in the control table; when a user is already recorded and another person requests use of the shared resource, recording that person's name and waiting time in the control table as a waiting user; checking the control table at fixed time intervals by means of the monitoring mechanism; and issuing a warning to the waiting user when the waiting time exceeds a predetermined value.