JPS6385859A

JPS6385859A - Intersystem exclusive control system in case of generating abnormality

Info

Publication number: JPS6385859A
Application number: JP61230943A
Authority: JP
Inventors: Toru Nihei; 仁平　亨
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1986-09-29
Filing date: 1986-09-29
Publication date: 1988-04-16

Abstract

PURPOSE: To allow a normal system to maintain its access right to a shared database, and thus to keep the database in continuous operation, by recording the status of each system in a shared system file and analyzing the cause of an intersystem communication interruption when a system abnormality occurs. CONSTITUTION: When a communication interruption is detected, a status setting part 15 of each system writes a healthy mark, indicating that the system is normal, into a system status storing part 18. After a prescribed time has elapsed, each system reads the status of the other system and checks whether it is still the initial mark or has changed to a healthy mark, which reveals whether the interruption was caused by the other system going down or by a fault in the communication line between the systems. Consequently, the control network can be reconfigured (access rights reassigned) in a way that does not leave the shared database inaccessible.

Description

[Detailed Description of the Invention]

[Summary]

In a loosely coupled computer system having a shared database, the status of each system is recorded in a shared system file. When a system abnormality occurs, the cause of the communication interruption between systems is analyzed automatically, so that a normal system maintains its access rights to the shared database and the database can remain in operation.

[Industrial application field]

The present invention relates to an intersystem exclusive control method used when a system abnormality occurs in a loosely coupled computer system.

In particular, it relates to an intersystem exclusive control method that improves the availability of a shared database when an abnormality occurs in a system.

In a loosely coupled computer system that has a shared database, when an abnormal situation such as a failure of the intersystem communication path or a system down occurs and communication between systems is interrupted, it becomes essential to limit which systems hold access rights to the shared database. In this case, the systems that may be used must be chosen according to the cause of the communication interruption, so that the database can continue to be operated as efficiently as possible.

[Conventional technology]

FIG. 5 is a diagram for explaining the conventional method.

In FIG. 5, 10-1 to 10-4 are systems each having an independent processor, 11 is a communication path, and 30 is a shared database.

In a loosely coupled computer system, exclusive control information is exchanged between systems through intersystem communication in order to perform exclusive control of the shared database 30. In the group of computers connected by the intersystem communication path 11 for this purpose (hereinafter referred to as the control network), when a communication interruption lasting a predetermined time is detected, caused for example by a failure of the intersystem communication path or by some computer systems going down, the system with which communication was lost is excluded from the control network (hereinafter referred to as disconnection).

At the time of disconnection, each computer system recognizes that the control network has split into two control networks: one formed by the group of computers it can still communicate with, including itself, and one formed by the group of computers with which communication has been lost.

To guarantee the integrity of the shared database after the control network splits, the control network that holds access rights must be limited to exactly one. In the conventional method, for example, the control network that holds access rights is determined, regardless of the cause of the split, from the number of systems in each control network after the split and from the access priority for the shared database 30 that is predetermined for each system (hereinafter referred to as system priority).

For example, suppose that, as shown in FIG. 5(a), communication is lost between system #1 and system #2 and between system #2 and system #4. The control network splits into control network A and control network B. Because control network A contains more systems, the systems belonging to control network A maintain access rights to the shared database 30.

As another example, suppose that, as shown in FIG. 5(b), communication is lost between system #1 and system #2 and between system #3 and system #4. In this case the network splits into control network A and control network B, each with the same number of systems. If the system priorities for the shared database 30 are #1 > #2 > #3 > #4, control network A, which contains system #1 with the highest priority, maintains access rights to the shared database 30, and the systems of control network B lose their access rights to it.
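The conventional selection rule described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name, the representation of a control network as a set of system names, and the exact membership of networks A and B (the text only states which links fail, that A is larger in FIG. 5(a), and that A contains system #1 in FIG. 5(b)) are assumptions.

```python
def conventional_holder(partitions, priority):
    """Pick the control network that keeps access under the conventional rule.

    partitions: list of sets of system names (assumed representation).
    priority:   list of system names, highest system priority first.
    """
    def key(partition):
        # Rank 0 is the highest priority, so negate it for use with max().
        best_rank = min(priority.index(s) for s in partition)
        return (len(partition), -best_rank)

    # More systems wins; a tie goes to the network holding the
    # highest-priority system. The cause of the split is never consulted.
    return max(partitions, key=key)


priority = ["#1", "#2", "#3", "#4"]
fig5a = [{"#1", "#3", "#4"}, {"#2"}]      # assumed membership for FIG. 5(a)
fig5b = [{"#1", "#3"}, {"#2", "#4"}]      # assumed membership for FIG. 5(b)

print(sorted(conventional_holder(fig5a, priority)))
print(sorted(conventional_holder(fig5b, priority)))
```

Note that the rule takes no input describing why the network split, which is the weakness discussed in the next section.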

[Problem that the invention seeks to solve]

When intersystem communication over the communication path is interrupted, there are two possible causes:

(1) Failure of the intersystem communication path
(2) System down of the communication partner system

When a system abnormality occurs and the network splits into two control networks A and B as shown in FIG. 5(b), whether the shared database 30 can be accessed depends on the cause of the communication interruption, as follows.

i) If the cause of the communication interruption is a failure of the intersystem communication path, control network A can access the shared database 30. Control network B loses its access rights.

ii) If the cause of the communication interruption is a system down of control network A, control network A holds the access rights but cannot continue operation because it is down. Control network B has no access rights.

iii) If the cause of the communication interruption is a system down of control network B, control network A can access the shared database 30. Control network B cannot access it.

As described above, under the conventional method, in case ii) no system holds access rights to the shared database 30 even though control network B could in fact use it, so operation of the shared database 30 cannot continue.
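The three cases can be tabulated with a short sketch. It assumes, as in FIG. 5(b), that the conventional rule always awards access to control network A; the cause labels and the function name are hypothetical.

```python
def conventional_operation_continues(cause):
    """Return True if the shared database stays usable under the conventional rule."""
    holder = "A"  # chosen by system count and priority, independent of the cause
    # Which control networks are actually still running for each cause.
    running = {
        "path_failure": {"A", "B"},  # case i)
        "A_down": {"B"},             # case ii)
        "B_down": {"A"},             # case iii)
    }[cause]
    # Operation continues only if the network holding access is still running.
    return holder in running


for cause in ("path_failure", "A_down", "B_down"):
    print(cause, conventional_operation_continues(cause))
```

Only the case where control network A is down leaves the database without a usable holder, which is the situation the invention addresses.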

The present invention aims to solve the above problem. Its object is to provide a method in which, when a system abnormality in the form of a communication interruption occurs between computer systems in a control network, each computer system automatically analyzes the abnormal situation and identifies the normal computer systems, so that continued operation of the shared database is guaranteed even after the abnormality occurs.

[Means for solving problems]

FIG. 1 shows an example of the basic configuration of the present invention.

In FIG. 1, 10-1 and 10-2 are loosely coupled systems, each having a CPU and memory and sharing a database; 11 is an intersystem communication path; 12 is a system monitoring message transmitting/receiving unit that exchanges system monitoring messages with other systems; 13 is a timer that manages time in the system; 14 is a communication interruption detection unit that detects a communication interruption with the partner system; 15 is a status setting unit that sets the status information of its own system in the shared system file; 16 is a network reconfiguration unit that reconfigures the control network (including determining access rights) according to the cause of the abnormality; 17 is a shared system file connected in common to every system; 18 is a system status storage unit in which the status of each system is recorded; and 19 is a system diagnosis block.

In the present invention, an area for the system status storage unit 18 is provided in the shared system file 17, and the status information of each system is set in a system diagnosis block 19 within that area. The information indicating system status includes, for example, an initial mark, which a system sets when it joins the control network to indicate that it is in the initial state; a healthy mark, which indicates that the system itself is in a normal state when an abnormality occurs; and a maintenance mark, which indicates that the system itself maintains its access rights to the shared database.
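One way to model the three marks and the per-system diagnosis blocks is sketched below. The class and method names are hypothetical, and an in-memory dictionary stands in for the shared system file 17, whose on-disk layout the text does not specify.

```python
from enum import Enum


class Mark(Enum):
    INITIAL = "initial"    # set when a system joins the control network
    HEALTHY = "healthy"    # set by a live system after it detects an interruption
    MAINTAIN = "maintain"  # set by the system that keeps access to the database


class SystemStatusStore:
    """Stand-in for the system status storage unit 18 (one block per system)."""

    def __init__(self):
        self._blocks = {}

    def join(self, system_id):
        # Joining the control network records the initial mark.
        self._blocks[system_id] = Mark.INITIAL

    def write(self, system_id, mark):
        if system_id not in self._blocks:
            raise KeyError(f"{system_id} has not joined the control network")
        self._blocks[system_id] = mark

    def read(self, system_id):
        return self._blocks[system_id]


store = SystemStatusStore()
store.join("#1")
store.join("#2")
store.write("#2", Mark.HEALTHY)
print(store.read("#1").value, store.read("#2").value)
```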

Each system in the control network uses its system monitoring message transmitting/receiving unit 12 to exchange system monitoring messages with the other systems over the communication path 11 at predetermined time intervals.

In this way the systems confirm that each other is operating normally. When the communication interruption detection unit 14 detects a communication interruption lasting a predetermined time, the status setting unit 15 is called. The status setting unit 15 writes a healthy mark, indicating that its own system is in a normal state, into the system status storage unit 18 of the shared system file 17.
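The detection step can be sketched as a simple timeout on the last message received from each partner. The class name, the explicit `now` argument (used here instead of a real timer so the example is deterministic), and the time values are assumptions for illustration.

```python
class InterruptionDetector:
    """Sketch of a communication interruption detection unit (hypothetical API)."""

    def __init__(self, timeout):
        self.timeout = timeout   # the "predetermined time" from the text
        self.last_seen = {}      # partner system -> time of last monitoring message

    def on_message(self, peer, now):
        # Called whenever a system monitoring message or response arrives.
        self.last_seen[peer] = now

    def interrupted_peers(self, now):
        # A partner is considered interrupted once nothing has arrived
        # from it for at least `timeout`.
        return sorted(p for p, t in self.last_seen.items()
                      if now - t >= self.timeout)


detector = InterruptionDetector(timeout=3.0)
detector.on_message("#1", now=0.0)
detector.on_message("#3", now=2.0)
print(detector.interrupted_peers(now=3.5))
```

When `interrupted_peers` returns a non-empty list, the caller would invoke the status setting step, that is, write its own healthy mark into the shared system file.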

The network reconfiguration unit 16 then reads the status of the partner system from the system status storage unit 18 and uses it to determine whether the communication interruption was caused by a failure of the communication path 11 or by the partner system going down.

If the network reconfiguration unit 16 recognizes that the communication interruption was caused by the partner system going down, it maintains its own system's access rights to the shared database. If it recognizes that the partner system is operating normally and that the interruption was caused by a communication path failure, it reconfigures the control network so that the system with the higher system priority maintains its access rights to the shared database and the system with the lower priority gives up its access rights on its own.
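The decision made by the network reconfiguration unit can be sketched as two small functions. The string labels and function names are hypothetical, and a smaller rank number is assumed to mean a higher system priority.

```python
def diagnose(peer_mark):
    """Infer the cause of the interruption from the partner's recorded mark."""
    # A partner that is still at the initial mark never reacted to the
    # interruption, so it is treated as down; a healthy mark means the
    # partner is alive and the communication path itself has failed.
    return "peer_down" if peer_mark == "initial" else "path_failure"


def keeps_access(own_rank, peer_rank, peer_mark):
    """Return True if this system should keep access to the shared database."""
    if diagnose(peer_mark) == "peer_down":
        return True              # the only live system keeps access
    return own_rank < peer_rank  # path failure: higher priority wins


print(keeps_access(own_rank=2, peer_rank=1, peer_mark="initial"))
print(keeps_access(own_rank=2, peer_rank=1, peer_mark="healthy"))
print(keeps_access(own_rank=1, peer_rank=2, peer_mark="healthy"))
```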

[Effect]

In the conventional technique, if half of the systems, including the system with the highest system priority, go down at the same time, the other half, which are operating normally, give up their own access rights to the shared database, and the shared database can no longer be operated.

In the present invention, when a communication interruption is detected, the status setting unit 15 of each system writes a healthy mark, indicating that the system is normal, into the system status storage unit 18. After a predetermined time, each system reads the status of the other system and checks whether it is still the initial mark or has become a healthy mark. This reveals whether the communication interruption was caused by the partner system going down or by a failure of the intersystem communication path.

As a result, the control network can be reconfigured (access rights reassigned) in a way that does not make the shared database inaccessible.

As for communication path failures, exclusive control between systems can be performed on the basis of the system status even when the intersystem communication path consists of unidirectional buses and intersystem communication is interrupted by a failure on only the transmitting side or only the receiving side.

[Example]

FIG. 2 shows an example of processing when a system down occurs in an embodiment of the present invention, FIG. 3 shows an example of processing when a communication path failure occurs in an embodiment of the present invention, and FIG. 4 shows an example of processing when a unidirectional path failure occurs in an embodiment of the present invention.

To simplify the explanation, the following describes a control network made up of two computer systems (system #1 and system #2). The same applies to three or more systems. System #1 is assumed to have a higher system priority than system #2.

FIG. 2 shows an example in which system #1 goes down. When they join at time T0, systems #1 and #2 record "initial mark (#1)" and "initial mark (#2)" respectively in the shared system file 17 to indicate the initial state. Here, "joining" means becoming one of the systems of the control network, that is, becoming subject to exclusive control of the shared database.

From time T0 to T1, the control network consists of the two systems #1 and #2, each of which has access rights to the shared database.

When system #1 goes down at time T1, system #2 no longer receives system monitoring messages or response messages, and so detects a communication interruption a predetermined time after T1 (at T2).

On detecting the communication interruption, system #2 records "healthy mark (#2)" in the shared system file 17 to indicate that it is normal.

After a further predetermined time has elapsed (at T3), system #2 reads the status of system #1. Because the status is still "initial mark (#1)", system #2 recognizes that the abnormality was caused by system #1 going down and performs the disconnection process for system #1. System #2 maintains its access rights to the shared database and records "maintenance mark (#2)" in the shared system file 17 to indicate this. Even though system #1, which went down, has the higher system priority, the normal system #2 can continue to operate the shared database.

Next, an example in which a communication path failure occurs is described with reference to FIG. 3.

As in the case of FIG. 2, when they join at time T0, systems #1 and #2 record "initial mark (#1)" and "initial mark (#2)" respectively in the shared system file 17 to indicate the initial state.

When a communication path failure occurs at time T1, system #1 and system #2 each detect a communication interruption after a predetermined time has elapsed from T1 (at T2).

At this point, systems #1 and #2 record "healthy mark (#1)" and "healthy mark (#2)" respectively to indicate that they are normal.

After a further predetermined time has elapsed (at T3), systems #1 and #2 each read the status of the other system. From the other system's healthy mark, each recognizes that both systems are operating normally. In this case system #1, which has a higher system priority than partner system #2, maintains its access rights to the shared database and records "maintenance mark (#1)" to indicate this.

System #2, on the other hand, has the lower system priority and therefore gives up its access rights to the shared database on its own. Each system performs the disconnection process and reconfigures the control network.

FIG. 4 shows an example in which unidirectional paths are used for the intersystem communication path and a failure occurs on only one path, either the transmitting side or the receiving side.

When they join (at T0), systems #1 and #2 record "initial mark (#1)" and "initial mark (#2)" respectively.

Suppose that a communication path failure then occurs (at T1) on the transmit-side path of system #1. System #1 still receives system monitoring messages from system #2 and believes that its response messages to system #2 are being sent, so it cannot detect any abnormality at this point. System #2 does not receive the response messages from system #1, and so detects a communication interruption after a predetermined time has elapsed (at T2). System #2 then records "healthy mark (#2)" in the shared system file 17 to indicate that it is normal.

After a predetermined time has elapsed (at T3), system #2 reads the status of partner system #1 and finds that it is still "initial mark (#1)". System #2 thereby detects an abnormality on the partner system's side and performs the disconnection process.

System #2 maintains its access rights to the shared database, records "maintenance mark (#2)", and continues operation.

In system #1, because system #2 has performed the disconnection process, a communication interruption is detected a predetermined time later (at T4). System #1 therefore records "healthy mark (#1)" to indicate that it is normal. When it then (at T5) checks the status of partner system #2, it finds that "maintenance mark (#2)" has been recorded. System #1 concludes that system #2 reconfigured the network first and is maintaining its access rights, and gives up its own access rights to the shared database.

In this way, exclusive control of the shared database between systems can be performed even when a unidirectional path failure causes the systems to detect the communication interruption at different times.
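The three two-system scenarios of FIGS. 2 to 4 can be replayed with the sketch below. It is an illustration under assumptions, not the patent's implementation: a plain dictionary stands in for the shared system file, the function names are hypothetical, and the timing (T2 to T5) is reduced to the order in which the steps are called.

```python
INITIAL, HEALTHY, MAINTAIN = "initial", "healthy", "maintain"
PRIORITY = ["#1", "#2"]  # system #1 has the higher system priority


def on_interruption(store, me):
    """On detecting an interruption: record that this system is alive."""
    store[me] = HEALTHY


def after_wait(store, me, peer):
    """After the waiting period: read the partner's mark and decide.

    Returns True if `me` keeps access to the shared database.
    """
    peer_mark = store[peer]
    if peer_mark == INITIAL:       # partner never reacted: treated as down
        keep = True
    elif peer_mark == MAINTAIN:    # partner already reconfigured and kept access
        keep = False
    else:                          # both alive: path failure, priority decides
        keep = PRIORITY.index(me) < PRIORITY.index(peer)
    if keep:
        store[me] = MAINTAIN
    return keep


def fig2_system_down():
    store = {"#1": INITIAL, "#2": INITIAL}
    on_interruption(store, "#2")          # system #1 is down and does nothing
    return after_wait(store, "#2", "#1")


def fig3_path_failure():
    store = {"#1": INITIAL, "#2": INITIAL}
    on_interruption(store, "#1")
    on_interruption(store, "#2")
    return after_wait(store, "#1", "#2"), after_wait(store, "#2", "#1")


def fig4_one_way_failure():
    store = {"#1": INITIAL, "#2": INITIAL}
    on_interruption(store, "#2")          # only #2 notices at first
    keep2 = after_wait(store, "#2", "#1")
    on_interruption(store, "#1")          # #1 notices later
    keep1 = after_wait(store, "#1", "#2")
    return keep1, keep2


print(fig2_system_down(), fig3_path_failure(), fig4_one_way_failure())
```

In every scenario exactly one live system ends up holding access, including the FIG. 4 case where the lower-priority system #2 keeps access because it reconfigured first.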

The above describes a control network made up of two computer systems. The same kind of exclusive control is also possible when, for example, four computer systems split evenly into two groups of two: for instance, the system with the highest system priority in each control network can check the status of the partner systems on behalf of its group.

[Effect of the invention]

As described above, according to the present invention, in a loosely coupled computer system having a shared database, the normal systems can continue to operate the shared database even when an abnormal situation occurs, such as a failure of the intersystem communication path or some of the computer systems going down. This makes it possible to achieve high reliability for the loosely coupled computer system as a whole.

[Brief explanation of the drawing]

FIG. 1 shows an example of the basic configuration of the present invention, FIG. 2 shows an example of processing when a system down occurs in an embodiment of the present invention, FIG. 3 shows an example of processing when a communication path failure occurs in an embodiment of the present invention, FIG. 4 shows an example of processing when a unidirectional path failure occurs in an embodiment of the present invention, and FIG. 5 is a diagram for explaining the conventional method.

In the figures, 10-1 and 10-2 are systems, 11 is a communication path, 12 is a system monitoring message transmitting/receiving unit, 13 is a timer, 14 is a communication interruption detection unit, 15 is a status setting unit, 16 is a network reconfiguration unit, 17 is a shared system file, 18 is a system status storage unit, and 19 is a system diagnosis block.

Claims

[Claims] An intersystem exclusive control method for use when an abnormality occurs, in a loosely coupled computer system that has a shared database and performs exclusive control of access to the shared database through intersystem communication, comprising: system status storage means (18), provided on a shared system file (17), for recording the system status of each system; system monitoring message transmitting/receiving means (12) for transmitting and receiving system monitoring messages to and from other systems via the communication path (11); communication interruption detection means (14) for detecting a communication interruption when no system monitoring message has been received for a predetermined period of time or longer; status setting means (15); and network reconfiguration means (16) for checking the status of the other systems in the system status storage means (18) after a predetermined period of time has elapsed since the communication interruption was detected, and reconfiguring the control network based on that status.