JPS63238652A - Operation managing system for distributed processing system - Google Patents

Operation managing system for distributed processing system

Info

Publication number
JPS63238652A
Authority
JP
Japan
Prior art keywords
control table
user
distributed processing
processing system
shared resources
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP7323487A
Other languages
Japanese (ja)
Inventor
Kingo Tsugawa
津川 金吾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1987-03-26
Filing date
1987-03-26
Publication date
1988-10-04
Application filed by NEC Corp
Priority to JP7323487A
Publication of JPS63238652A
Legal status: Pending

Abstract

PURPOSE: To prevent a drop in the throughput of the whole system by monitoring the usage of each shared resource at the node that owns it. CONSTITUTION: A control table 2, which records for each node the users 4, 5 of shared resource 1 and their usage times, and a resource monitoring mechanism 3, which monitors the contents of control table 2, are prepared in advance. When a user requests shared resource 1, that user's name is recorded in control table 2. When a user is already recorded and another user requests shared resource 1, the new requester's name and waiting time are recorded in control table 2 as a waiting user. Monitoring mechanism 3 checks control table 2 at a prescribed interval and warns the waiting user when the waiting time exceeds a prescribed limit. In this way, a drop in the throughput of the whole system can be prevented.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to an operation management method for a distributed processing system that has a plurality of shared resources used in common.

[Prior Art]

In a conventional distributed processing system that has a plurality of shared resources used in common, detecting that the system has fallen into a deadlock state relies on a single management mechanism provided for all of the shared resources. That management mechanism monitors how every user is using the shared resources.

[Problems to Be Solved by the Invention]

The conventional operation management method described above supervises the resources of the entire distributed processing system from a single management mechanism. It is therefore suited to a vertical distributed processing system consisting of a central processing unit (host) and terminal devices. In a horizontal distributed processing system with multiple hosts, however, a failure at the node to which the management mechanism is assigned is enough to stop the entire system. In addition, besides the user information transmitted over the system network, the information needed to operate the management mechanism concentrates on that node, which lowers the throughput of the entire distributed processing system.

An object of the present invention is to eliminate these drawbacks of the conventional operation management method by providing an operation management method for a distributed processing system that distributes the resource management load, prevents the entire system from stopping, and prevents a drop in the throughput of the entire system.

[Means for Solving the Problems]

In the operation management method of the present invention, each node that holds a resource shared within the system has a control table that records the usage status of that shared resource, and a resource monitoring mechanism.

That is, the operation management method of the present invention is an operation management method for a distributed processing system having a plurality of nodes that hold shared resources. For each node, a control table that records the names of the users of the shared resource and their usage times, and a resource monitoring mechanism that monitors the contents of the control table, are provided. When a user requests the shared resource, that user's name is recorded in the control table. When a user is already recorded and another user requests the shared resource, the name and waiting time of the new requester are recorded in the control table as a waiting user. The monitoring mechanism checks the control table at fixed time intervals and, when the waiting time exceeds a predetermined value, issues a warning to the waiting user.
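The recording rule described above can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `ControlTable`, `Waiter`, `request`, `user4` and `user5` are hypothetical, and the usage-time field mentioned in the text is omitted for brevity.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Waiter:
    """A user waiting for the shared resource (hypothetical structure)."""
    name: str
    wait_time: int = 0  # recorded as 0 when the request arrives


@dataclass
class ControlTable:
    """Per-node record of who holds the shared resource and who is waiting."""
    holder: Optional[str] = None
    waiters: List[Waiter] = field(default_factory=list)

    def request(self, name: str) -> bool:
        """Record a request; return True if the resource was granted at once."""
        if self.holder is None:
            self.holder = name          # table was blank: record the user
            return True
        self.waiters.append(Waiter(name))  # resource busy: record as waiting
        return False


table = ControlTable()
granted_4 = table.request("user4")  # first requester becomes the holder
granted_5 = table.request("user5")  # second requester becomes a waiting user
```

In this sketch the table starts blank, the first requester is recorded as the current user, and any later requester is appended as a waiting user with a waiting time of zero.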

[Embodiment]

Next, an embodiment of the present invention will be described with reference to the drawings.

FIG. 1 is a configuration diagram showing a node that holds a shared resource in a distributed processing system to which an embodiment of the present invention is applied. FIG. 2 is an explanatory diagram showing the distributed processing system having the node of FIG. 1 while it is operating and the shared resource is in use.

In FIG. 1, 1 is a shared resource, 2 is a control table that records the usage status of the shared resource, and 3 is a resource monitoring mechanism. In FIG. 2, 2 is the control table, 4 is a user currently using the shared resource, and 5 is a user waiting for the shared resource to become available.

In FIGS. 1 and 2, while no one is using shared resource 1, control table 2 is completely blank because no information has been written to it, and resource monitoring mechanism 3 issues no warning signal even though it checks the table's contents at fixed intervals. When user 4 requests shared resource 1 in this state, the name of user 4 is recorded in control table 2 and shared resource 1 becomes in use. If another user 5 then requests shared resource 1, the name of user 5 and a waiting time of "0" are recorded in control table 2, and user 5 becomes a waiting user. From this point on, resource monitoring mechanism 3 checks control table 2 at fixed time intervals, and each time waiting user 5 is still present in control table 2, a fixed value is added to that user's waiting time. When the waiting time for shared resource 1 exceeds the limit allowed by the distributed processing system, resource monitoring mechanism 3 issues a warning to waiting user 5.
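The periodic check performed by the resource monitoring mechanism can be sketched as follows. This is only an illustration under stated assumptions: `CHECK_INTERVAL` and `WAIT_LIMIT` are hypothetical values (the patent gives no numbers), the data structures are the same hypothetical ones as before, and a real monitor would run the check on a timer rather than in a loop.

```python
from dataclasses import dataclass, field
from typing import List, Optional

CHECK_INTERVAL = 5  # hypothetical: value added to the waiting time per check
WAIT_LIMIT = 10     # hypothetical: waiting time allowed by the system


@dataclass
class Waiter:
    name: str
    wait_time: int = 0


@dataclass
class ControlTable:
    holder: Optional[str] = None
    waiters: List[Waiter] = field(default_factory=list)


def check(table: ControlTable) -> List[str]:
    """Run one periodic check and return the names of users to warn."""
    warned = []
    for waiter in table.waiters:
        waiter.wait_time += CHECK_INTERVAL  # user is still waiting: add fixed value
        if waiter.wait_time > WAIT_LIMIT:   # limit exceeded: warn this user
            warned.append(waiter.name)
    return warned


# user4 holds the resource, user5 is waiting; simulate three checks.
table = ControlTable(holder="user4", waiters=[Waiter("user5")])
results = [check(table) for _ in range(3)]
```

With these assumed values the first two checks raise no warning, and the third check, where the accumulated waiting time first exceeds the limit, warns the waiting user. A blank table produces no warning at all, matching the idle case described in the text.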

As described above, the distributed processing system monitors the usage of each shared resource at the node that owns it, which distributes the resource management load. Even if one of the nodes owning a shared resource fails on the system network, only the part related to that shared resource stops operating. The remaining shared resources stay available as before, so the system keeps running and does not fall into a deadlock in which the whole system becomes inoperable. Moreover, because the only information that travels through the distributed processing system is the user information that is inherently necessary, a drop in throughput is suppressed.
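The failure isolation argued here can be illustrated with a small Python sketch in which every owning node keeps its own record. The `Node` class, the `up` flag, and the resource names `disk` and `printer` are hypothetical and serve only to show that the failure of one owning node leaves the other shared resources usable.

```python
from typing import Dict, Optional


class Node:
    """A node that owns one shared resource and keeps its own usage record."""

    def __init__(self, resource: str) -> None:
        self.resource = resource
        self.up = True                      # False simulates a failed node
        self.holder: Optional[str] = None   # current user, if any

    def request(self, user: str) -> str:
        if not self.up:
            return "unavailable"  # only this node's resource is affected
        if self.holder is None:
            self.holder = user
            return "granted"
        return "waiting"


nodes: Dict[str, Node] = {name: Node(name) for name in ("disk", "printer")}
nodes["disk"].up = False  # one owning node fails

# The same user asks for every resource; only the failed node refuses.
outcome = {name: node.request("user4") for name, node in nodes.items()}
```

Because no node depends on a central manager, the request to the surviving node is still granted while the failed node alone reports its resource as unavailable.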

[Effects of the Invention]

As explained above, using the operation management method for a distributed processing system of the present invention prevents the entire system from stopping even when some nodes fail, particularly in a horizontal distributed processing system, and also prevents a drop in the throughput of the entire system.

[Brief Description of the Drawings]

FIG. 1 is a configuration diagram showing a node that holds a shared resource according to an embodiment of the present invention. FIG. 2 is an explanatory diagram showing the state of the control table while the distributed processing system of FIG. 1 is operating and the shared resource is in use.

1: shared resource, 2: control table, 3: resource monitoring mechanism, 4, 5: users.

Claims (1)

[Claims]

An operation management method for a distributed processing system having a plurality of nodes that hold shared resources, characterized in that: for each node, a control table that records the names of the users of the shared resource and their usage times, and a resource monitoring mechanism that monitors the contents of the control table, are provided; when a user requests the shared resource, that user's name is recorded in the control table; when a user is already recorded and another user requests the shared resource, the name and waiting time of the new requester are recorded in the control table as a waiting user; and the monitoring mechanism checks the control table at fixed time intervals and issues a warning to the waiting user when the waiting time exceeds a predetermined value.

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP7323487A JPS63238652A (en) 1987-03-26 1987-03-26 Operation managing system for distributed processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP7323487A JPS63238652A (en) 1987-03-26 1987-03-26 Operation managing system for distributed processing system

Publications (1)

Publication Number Publication Date
JPS63238652A true JPS63238652A (en) 1988-10-04

Family

ID=13512288

Family Applications (1)

Application Number Title Priority Date Filing Date
JP7323487A Pending JPS63238652A (en) 1987-03-26 1987-03-26 Operation managing system for distributed processing system

Country Status (1)

Country Link
JP (1) JPS63238652A (en)
