JPS63191240A

JPS63191240A - Multi-processor system

Info

Publication number: JPS63191240A
Application number: JP62024306A
Authority: JP
Inventors: Kazuo Nakanishi; 中西　和男
Original assignee: Nissin Electric Co Ltd
Current assignee: Nissin Electric Co Ltd
Priority date: 1987-02-03
Filing date: 1987-02-03
Publication date: 1988-08-08

Abstract

PURPOSE: To prevent a total shutdown of a multiprocessor system by logically separating only the faulty processor from the system. CONSTITUTION: When one processor P among a plurality of processors develops a fault, that processor P delivers a fault occurrence signal to a system state monitor signal line. An arbitrary processor monitoring the state of the signal line judges that a fault has occurred on detecting the fault occurrence signal. That processor then calls the other processors in turn to detect and identify the faulty one, and gives the faulty processor control commands such as stop, reset, and interrupt inhibit, based on the address signal of an address bus (a) and the data of a data bus (d). In accordance with the given control commands, the faulty processor stops the operation of its CPU and the like, resets each part, and inhibits interrupts supplied from an input/output control part and the like. The faulty processor is thus logically separated from the multiprocessor system.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to a multiprocessor system configured by connecting a plurality of processors via a common bus.

[Prior Art]

A multiprocessor system is configured, for example, as shown in FIG. 4. A plurality of processors (P1), (P2), ..., (Pn) (hereinafter collectively referred to as (P)) are organically coupled by a common bus (B) consisting of an address bus, a data bus, a control bus, and the like. A plurality of input/output control units (IOC1), (IOC2), ..., (IOCm) (hereinafter collectively referred to as (IOC)) and a common memory (M) are connected to this common bus (B), so that the system can provide processing capability exceeding that of any individual processor (P).

In such a multiprocessor system, when a fault occurs in one processor (P) among the plurality of processors (P), there is a risk that the entire system will malfunction because of, for example, runaway of that processor (P). Conventionally, therefore, another normal processor detects this situation and stops the system, and the faulty processor is identified so that the fault can be repaired in a short time.

For example, in the former case, as shown in FIG. 5, the common bus (B) is provided with a system state monitoring signal line (s) and a reset signal line (r), to which each processor (P) is connected. Both signal lines (s) and (r) are connected to a positive power supply terminal through respective pull-up resistors, so that both signal lines (s) and (r) are held at a high level (hereinafter referred to as "high").

When a fault occurs in any processor (P) and is detected by the fault detection means provided for each processor (P), that means outputs a "high" fault occurrence signal, and an open-collector driver (1) drives the signal line (s) to a low level (hereinafter referred to as "low").
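The pulled-up line with open-collector drivers behaves as a wired function: the line is "high" only while no driver is on. The following is a minimal sketch of that behavior, not circuitry taken from the patent; the function names `status_line_level` and `fault_detected` are hypothetical.

```python
from typing import Sequence


def status_line_level(driver_on: Sequence[bool]) -> str:
    """Level of a pulled-up line shared by open-collector drivers.

    Each entry says whether one processor's driver is on (assumed model).
    The pull-up resistor holds the line "high"; any driver that is on
    pulls it "low".
    """
    return "low" if any(driver_on) else "high"


def fault_detected(driver_on: Sequence[bool]) -> bool:
    """A monitoring processor treats a "low" line as a fault indication."""
    return status_line_level(driver_on) == "low"
```

Because every processor shares the one line, the line alone says that some processor has failed, not which one; identifying the processor needs a separate step.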

Accordingly, in each processor (P) monitoring the state of the signal line (s), the signal line (s) going "low" causes a first NOT circuit (2) to output a "high" fault detection signal. This signal drives a display or notification means such as a lamp, an LED, or a buzzer to inform an operator or other person that a fault has occurred.

Furthermore, in that processor (P), a first AND circuit (3) takes the AND of the fault occurrence signal and the fault detection signal and outputs a "high" reset command signal, and the reset signal line (r) is driven from "high" to "low" via a NOR circuit (4) with an open-collector output.

As a result, in each processor (P), the signal line (r) going "low" causes a second NOT circuit (5) to output a "high" signal. At this time the reset circuit is not outputting the "high" power-on reset signal, so a third NOT circuit (6) continues to output a "high" signal. A second AND circuit (7) therefore outputs a "high" reset signal to the respective reset circuit, and each processor (P) stops operating.

In the latter case, the system is configured, for example, as shown in FIG. 6.

When a fault occurs in one processor (P) among the plurality of processors (P), that processor (P) outputs a fault occurrence signal in the same manner as described above, turns on the open-collector driver (1), and drives the system state monitoring signal line (s) "low".

Then one of the processors (P) that has not failed detects, via an inverter, that the signal line (s) has been driven "low". This interrupts its CPU, which executes a program for identifying the faulty processor (P) and operates as follows.

First, the normal processor (P) sequentially sends out the address data of the gate circuit (32) of each processor (P) other than itself, thereby calling each processor (P).

In each processor (P), an address decoder takes in the address data on the address bus (a) forming part of the common bus (B). When it takes in the address data assigned to its own gate circuit (32), it outputs a call signal to the gate circuit (32), which opens the gate circuit (32).

Consequently, when the faulty processor (P) is called, its fault occurrence signal is sent through its gate circuit (32) onto the data bus (d) forming part of the common bus (B). The normal processor (P) detects the faulty processor (P) from the address data it sent out and the fault occurrence signal on the data bus (d).

In this way, the processor (P) in which the fault occurred is identified.
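The polling procedure described above can be sketched in software as follows. This is an illustrative model only: the class `Processor`, the method `respond`, the function `poll_for_faulty`, and the numeric addresses are all hypothetical, and the real mechanism is hardware (address decoder, gate circuit, data bus).

```python
from typing import List, Optional


class Processor:
    """Hypothetical model of one processor on the common bus."""

    def __init__(self, gate_address: int, faulty: bool = False) -> None:
        self.gate_address = gate_address  # address assigned to its gate circuit
        self.faulty = faulty              # state of its fault occurrence signal

    def respond(self, address: int) -> Optional[bool]:
        """Model the address decoder and gate circuit.

        The gate opens only for this processor's own address; otherwise
        the processor puts nothing on the data bus (None).
        """
        if address != self.gate_address:
            return None
        return self.faulty


def poll_for_faulty(processors: List[Processor], own_address: int) -> List[int]:
    """Call every other processor in turn and collect faulty addresses."""
    faulty_addresses = []
    for target in processors:
        address = target.gate_address
        if address == own_address:
            continue  # the polling processor does not call itself
        # The address goes out on the address bus; every decoder sees it.
        responses = [p.respond(address) for p in processors]
        if any(r is True for r in responses):
            faulty_addresses.append(address)
    return faulty_addresses
```

The polling processor pairs the address it sent with the value seen on the data bus, which is how the text says the faulty processor is detected.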

[Problems to be Solved by the Invention]

In the conventional configuration, however, a fault in only part of the system, namely a fault in any one processor (P), stops the entire system. The mean time between failures (MTBF) therefore decreases, and the stability of the system's functions over time, that is, its reliability, is reduced.

On the other hand, the mean time to repair (MTTR) can be made sufficiently small by the above-described means for identifying the faulty processor. However, to raise the availability, which is expressed in terms of MTBF and MTTR as MTBF / (MTBF + MTTR), the MTBF itself must be increased.
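The availability expression MTBF / (MTBF + MTTR) can be evaluated directly. The sketch below uses a hypothetical function name, `availability`, and the hour values in the usage note are made-up illustrations, not figures from the patent.

```python
def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Availability = MTBF / (MTBF + MTTR), both in the same time unit."""
    if mtbf_hours < 0 or mttr_hours < 0 or mtbf_hours + mttr_hours == 0:
        raise ValueError("MTBF and MTTR must be non-negative and not both zero")
    return mtbf_hours / (mtbf_hours + mttr_hours)
```

For a fixed MTTR, `availability(2000.0, 10.0)` is larger than `availability(1000.0, 10.0)`, which is the point the text makes: once MTTR is already small, further gains must come from a longer MTBF.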

Existing multiprocessor systems are intended to raise processing speed for specific applications (for example, numerical computation such as fluid analysis), which makes it difficult to build a general-purpose operating system for them. To prevent system malfunction when a fault occurs in any one of the processors, there has been no choice but to stop the entire system, accepting the reduction in MTBF, and hence in availability, described above.

Recently, however, improvements in microprocessor functionality and falling prices have made multiprocessor systems easier to build. As a result, multiprocessor systems are increasingly expected to be applied to general-purpose applications as well, and higher availability of multiprocessor systems is accordingly demanded.

The present invention has been made with the above points in mind. It aims to provide a means of logically disconnecting a faulty processor from the system without stopping the entire system, so that the fault does not spread to the whole system.

[Means for Solving the Problems]

The present invention provides a multiprocessor system in which a plurality of processors are coupled by a common bus, an input/output control unit, a common memory, and the like are connected to the common bus, and the system is configured so that, when a fault occurs in any of the processors, the faulty processor can be identified. The system is characterized by comprising: means for giving control commands such as stop, reset, and interrupt inhibit to the faulty processor, based on an address signal on the address bus forming part of the common bus and data on the data bus forming part of the common bus; and means, provided in each processor, for inhibiting interrupts input from the input/output control unit and the like via the common bus, in response to an interrupt-inhibit control command from an arbitrary processor.

[Operation]

According to the present invention, when a fault occurs in one of the plurality of processors, the faulty processor is identified by known means. For example, the faulty processor outputs a fault occurrence signal to the system state monitoring signal line; an arbitrary processor monitoring the state of this line judges that a fault has occurred on detecting the fault occurrence signal; and that processor then calls each of the other processors in turn to detect the faulty one. Control commands such as stop, reset, and interrupt inhibit are then given to the faulty processor, based on the address signal on the address bus and the data on the data bus.

As a result, in accordance with the given control commands, the faulty processor stops its CPU and the like, resets each part, and inhibits interrupts from the input/output control unit and the like. The processor is thereby logically disconnected from the system.

Interrupts from the input/output control unit and the like that would have been input to the faulty processor are handled by another processor, for example under a command from the arbitrary processor mentioned above.

[Embodiment]

Next, the present invention will be described in detail with reference to FIGS. 1 to 3, which show one embodiment.

FIG. 1 shows the main part of one processor (P) among a plurality of processors (P) coupled by a common bus (B) consisting of an address bus (a), a data bus (d), a control bus (c), an interrupt bus (i), and a system state monitoring signal line (s).

In the figure, (8) is a CPU whose data terminal, address terminal, and control terminals such as read/write are connected to the common bus (B) through bus buffers (9), (10), and (11), respectively. (12) is an AND circuit; one of its input terminals receives the "high" fault occurrence signal from the fault detection means provided in each processor (P), and its output terminal is connected to the signal line (s) through an open-collector driver (1). (13) is a first control circuit whose output terminal is connected to the other input terminal of the AND circuit (12). It normally outputs a "high" signal, and outputs a "low" signal only when it receives the power-on reset signal at power-up or the fault output stop signal from the third control circuit described later.

(14) is a detection circuit that monitors the state of the signal line (s), detects the fault occurrence signal, that is, the "low" state of the signal line (s), and outputs a fault detection signal. It is always in the detecting state, except that detection of the fault occurrence signal is inhibited by a "low" fault detection control signal output during self-diagnosis immediately after power-on. (15) is a second control circuit, which receives the fault detection signal from the detection circuit (14) and interrupt request signals from the interrupt gate circuits described later, and supplies an interrupt signal to the interrupt terminal of the CPU (8) in response to each of these inputs.

The second control circuit (15) consists, for example, of an OR circuit and a latch circuit, to each of which the signal lines for the plural interrupts are connected, such as the fault detection signal from the detection circuit (14) and the interrupt request signals from the interrupt gate circuits. The output of the OR circuit is input to the interrupt terminal of the CPU (8), and the data latched by the latch circuit is placed on the data bus. When a signal for any interrupt is input to the CPU (8) via the OR circuit, the latch circuit latches the state of the interrupt signal lines in response to the interrupt acknowledge signal from the CPU (8). The interrupt handling program that the CPU (8) executes in response to the interrupt signal reads the latched data and determines the interrupt cause, that is, whether it is a fault detection signal, an interrupt request signal, or the like.
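The OR-plus-latch arrangement can be modeled roughly as below. The bit assignments `FAULT_DETECTED` and `IO_REQUEST`, the class `SecondControlCircuit`, and the function `interrupt_causes` are assumptions made for illustration; the patent does not specify a bit layout.

```python
from typing import List

# Hypothetical bit positions for the interrupt signal lines.
FAULT_DETECTED = 0b01  # fault detection signal from the detection circuit
IO_REQUEST = 0b10      # interrupt request signal from an interrupt gate circuit


class SecondControlCircuit:
    """Assumed software model of the OR circuit and latch circuit."""

    def __init__(self) -> None:
        self.lines = 0    # current state of the interrupt signal lines
        self.latched = 0  # state captured on interrupt acknowledge

    def set_lines(self, lines: int) -> None:
        self.lines = lines

    def interrupt_pin(self) -> bool:
        """OR of all interrupt lines, fed to the CPU interrupt terminal."""
        return self.lines != 0

    def acknowledge(self) -> None:
        """The CPU's interrupt acknowledge latches the line state."""
        self.latched = self.lines

    def read_latch(self) -> int:
        """Value the interrupt handler reads back over the data bus."""
        return self.latched


def interrupt_causes(circuit: SecondControlCircuit) -> List[str]:
    """Decode the latched data into the causes named in the text."""
    value = circuit.read_latch()
    causes = []
    if value & FAULT_DETECTED:
        causes.append("fault_detected")
    if value & IO_REQUEST:
        causes.append("io_request")
    return causes
```

Latching on acknowledge means the handler still sees the cause even if the signal line changes before the handler runs.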

Alternatively, the second control circuit (15) consists of an OR circuit and a priority encoder, to each of which the signal lines for the plural interrupts are connected, and a gate circuit that outputs the encoder output onto the data bus. When a signal for any one interrupt, or for any number of interrupts, is input to the CPU (8) via the OR circuit, the gate circuit is opened by the interrupt acknowledge signal from the CPU (8). The CPU (8) reads the encoder output from the data bus and selects and executes the corresponding interrupt handling program, so that the interrupt request with the highest priority is accepted.
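A priority encoder reduces several simultaneous requests to the index of the most urgent one. The sketch below assumes that line 0 has the highest priority; the patent does not state the ordering, and the name `priority_encode` is hypothetical.

```python
from typing import Optional, Sequence


def priority_encode(request_lines: Sequence[bool]) -> Optional[int]:
    """Return the index of the highest-priority active request line.

    Assumption: lower index means higher priority. Returns None when no
    line is active (the OR circuit would not interrupt the CPU then).
    """
    for index, active in enumerate(request_lines):
        if active:
            return index
    return None
```

The CPU would use the returned index to pick an interrupt handling program, so lower-priority requests simply wait until a later acknowledge cycle.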

Some CPUs have a plurality of interrupt request input terminals. When such a CPU is used and the number of interrupt requests does not exceed the number of terminals, each interrupt signal line is connected directly to its own input terminal, and the second control circuit (15) described above becomes unnecessary.

As will be described in detail later, when the fault detection interrupt is input, the CPU (8) of each processor (P) has the function of sending address data onto the address bus (a) to call the other processors (P) in turn, detecting the faulty processor (P) from the data on the data bus (d), and sending control command data such as stop, reset, and interrupt inhibit onto the data bus (d) for that faulty processor (P).

(16) is an interrupt gate circuit that receives an input/output interrupt signal on the interrupt bus (i) and outputs an interrupt request signal to the second control circuit (15). If, as shown in FIG. 2, the multiprocessor system is provided with five input/output control units (IOC1) to (IOC5), the interrupt bus (i) consists of five interrupt request lines, one for each of the input/output control units (IOC1) to (IOC5), and each processor (P) is provided with five interrupt gate circuits (16_1) to (16_5), one for each request line.

Normally, an interrupt from one input/output control unit (IOC) is set to be accepted by one processor (P) and not by the other processors (P). In each processor (P), therefore, the opening and closing of the interrupt gate circuits (16_1) to (16_5) is controlled by interrupt control signals output from an interrupt control circuit (17), so that each processor (P) accepts only the interrupts assigned to it in advance.

(18) is an address decoder. The decoder (18) of each processor (P) takes in the address data output from the CPU (8) of whichever processor (P) has accepted the fault detection interrupt and, for address data directed to its own processor (P), outputs a call signal to the gate circuit or the third control circuit described later. That is, when the system is built, addresses are assigned in advance to the gate circuit and the third control circuit of each processor (P). When the CPU (8) of the above processor (P) outputs the address data of the gate circuit or third control circuit of a given processor (P), the decoder (18) outputs a call signal to the corresponding gate circuit or third control circuit.

(19) is a gate circuit that is opened by the call signal from the decoder (18) and sends the fault occurrence signal onto the data bus (d). (20) is a third control circuit that takes in, holds, and analyzes the control command data on the data bus (d) at the timing of the call signal from the decoder (18). According to its analysis of the control command data, it outputs a stop signal to the CPU (8) and the like, a reset signal to the other circuits, a fault output stop signal to the first control circuit (13), and an interrupt inhibit signal, which inhibits the input of input/output interrupts, to the interrupt gate circuits (16).
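The role of the third control circuit, capturing command data only when addressed and turning it into individual control signals, might be sketched as follows. The `Command` bit values, the class `ThirdControlCircuit`, and its method names are hypothetical; the patent does not define an encoding for the control command data.

```python
from enum import IntFlag
from typing import Dict


class Command(IntFlag):
    """Hypothetical encoding of the control command data."""
    STOP = 1
    RESET = 2
    FAULT_OUTPUT_STOP = 4
    INTERRUPT_INHIBIT = 8


class ThirdControlCircuit:
    """Assumed model: latch command data when called, then decode it."""

    def __init__(self, address: int) -> None:
        self.address = address   # address assigned when the system is built
        self.held = Command(0)   # no outputs active after power-on reset

    def on_bus_cycle(self, address: int, data: int) -> None:
        """The decoder calls this circuit only for its own address."""
        if address != self.address:
            return
        self.held = Command(data)  # take in and hold the command data

    def outputs(self) -> Dict[str, bool]:
        """Control signals derived from the held command data."""
        return {
            "cpu_stop": bool(self.held & Command.STOP),
            "reset": bool(self.held & Command.RESET),
            "fault_output_stop": bool(self.held & Command.FAULT_OUTPUT_STOP),
            "interrupt_inhibit": bool(self.held & Command.INTERRUPT_INHIBIT),
        }
```

Holding the command, rather than acting only during the bus cycle, is what keeps the faulty processor stopped and its interrupts inhibited after the commanding processor has moved on.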

Next, the operation of the embodiment will be described.

First, after power is turned on, each processor (P) initializes itself and outputs a power-on reset signal to its first control circuit (13) and third control circuit (20). The "low" signal from the first control circuit (13) releases the signal line (s) via the AND circuit (12) and the driver (1), and all signal outputs from the third control circuit (20) are stopped.

Next, the initial program is loaded, and the system starts operating.

At this time, the initial program determines which processor (P) handles the interrupts of each of the input/output control units (IOC1) to (IOC5). In each processor (P), the interrupt gate circuits (16_1) to (16_5) are opened or closed by the interrupt control signals from the interrupt control circuit (17), so that interrupts from any one input/output control unit (IOC) are accepted by only one processor (P).

In the system thus started, when a fault occurs in one processor (P) among the plurality of processors (P), the processor in which the fault occurred (hereinafter referred to as the faulty processor) (P) outputs a fault occurrence signal. Since the first control circuit (13) has been outputting a "high" signal since the initialization, the fault occurrence signal is input to the driver (1) via the AND circuit (12), and the driver (1) drives the signal line (s) "low".

Then one of the processors (P) that has not failed (hereinafter referred to as the detecting processor (P)) detects at its CPU (8), via the detection circuit (14) and the second control circuit (15), that the signal line (s) has been driven "low". The CPU (8) suspends the program it has been executing, and calls and executes the program of FIG. 3.

The detecting processor (P) first sends out, in turn, the address data of the gate circuit (19) of each processor (P) other than itself, thereby calling each processor (P).

In each processor (P), the address decoder (18) takes in the address data on the address bus (a). When it takes in the address data assigned to its own gate circuit (19), it outputs a call signal to the gate circuit (19), which opens the gate circuit (19).

Consequently, when the faulty processor (P) is called, its fault occurrence signal is sent through its gate circuit (19) onto the data bus (d). The detecting processor (P) detects the faulty processor (P) from the address data it sent out and the fault occurrence signal on the data bus (d). This is the detection step of the flowchart.

The operation of detecting the faulty processor (P) is basically the same as that described for the prior art.

Next, the detecting processor (P) sends control command data such as stop and reset onto the data bus (d) for the detected faulty processor (P), together with the address data of that processor's third control circuit (20). This is the stop and reset command step of the flowchart.

In the faulty processor (P), the address decoder (18) outputs a call signal to the third control circuit (20) in response to the address data it has taken in. On this call signal, the third control circuit (20) takes in the data on the data bus (d), that is, the control command data, holds and analyzes it, outputs a stop signal to the CPU (8) and the like, outputs a reset signal to the other circuits and devices, and outputs a fault output stop signal to the first control circuit (13).

The first control circuit (13) then outputs a "low" signal in response to the fault output stop signal, so the drive of the driver (1) is stopped via the AND circuit (12), and the signal line (s) returns to the "high" state. Since this state is detected by the detecting processor (P), that processor (P) can confirm that the faulty processor (P) has been brought under control correctly.

Next, the detecting processor (P) judges whether the faulty processor (P) had been handling interrupts from an input/output control unit (IOC). If not, the faulty processor (P) is already logically disconnected from the system, so the procedure moves to the notification step: some display or notification means, such as an LED display, a printer, or a console terminal, is driven to show which processor (P) has failed and to inform an operator or other person of the fault.

If the result of that judgment is YES, the detecting processor (P) next outputs interrupt-inhibit control command data to the faulty processor (P), together with the address data of its third control circuit (20).

As a result, in the faulty processor (P), the third control circuit (20) holds and analyzes the control command data and outputs an interrupt inhibit signal to the interrupt gate circuits (16). Handling of input/output interrupts is thereby inhibited, and the processor is logically disconnected from the system.

After that, the detecting processor (P) determines which processor will newly handle the interrupts from the input/output control unit (IOC) that the faulty processor (P) had been handling. Normally a processor (P) held in reserve for input/output interrupts is chosen; if there is no reserve processor, the choice is arbitrary.

The detecting processor (P) then judges whether it has itself been chosen to handle the interrupts. If YES, its CPU (8) outputs a setting signal to its own interrupt control circuit (17), which opens the interrupt gate circuit (16) corresponding to the input/output control unit (IOC) whose interrupts it is newly to handle, and the procedure moves to the notification step.

If the result of that judgment is NO, the detecting processor (P) outputs setting command data to the CPU (8) of the chosen processor (P), causing that CPU (8) to open the relevant interrupt gate circuit (16) via its interrupt control circuit (17), and the procedure moves to the notification step.

Having thus completed the program shown in FIG. 3, the CPU (8) of the detecting processor (P) resumes its own program, and the system continues operating without stopping, with the faulty processor (P) disconnected.
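The overall procedure (locate the faulty processor, stop it, inhibit its interrupts, hand its input/output interrupts to another processor) can be summarized in a short simulation. Everything here is a hypothetical software model: the class `Cpu`, its fields, and the function `isolate_and_reassign` are illustrative names, and the real system does this with bus commands and gate circuits rather than shared objects.

```python
from typing import List, Optional, Set


class Cpu:
    """Hypothetical model of one processor and its open interrupt gates."""

    def __init__(self, name: str, spare: bool = False) -> None:
        self.name = name
        self.spare = spare                # held in reserve for I/O interrupts
        self.faulty = False               # fault occurrence signal
        self.isolated = False             # stopped and disconnected
        self.open_gates: Set[str] = set() # IOCs whose interrupts it accepts


def isolate_and_reassign(detector: Cpu, processors: List[Cpu]) -> Optional[str]:
    """Run one pass of the isolation procedure; return the isolated name."""
    # Locate the faulty processor by polling every processor but ourselves.
    faulty = next(
        (p for p in processors
         if p is not detector and p.faulty and not p.isolated),
        None,
    )
    if faulty is None:
        return None
    # Stop/reset command; the fault output is also withdrawn from the line.
    faulty.isolated = True
    # Was the faulty processor handling any I/O interrupts?
    orphaned = sorted(faulty.open_gates)
    if orphaned:
        faulty.open_gates.clear()  # interrupt-inhibit command
        healthy = [p for p in processors if not p.isolated and not p.faulty]
        spares = [p for p in healthy if p.spare]
        # Prefer a reserve processor; otherwise the choice is arbitrary
        # (here simply the first healthy one, which may be the detector).
        new_owner = spares[0] if spares else healthy[0]
        new_owner.open_gates.update(orphaned)
    return faulty.name  # the caller would then notify the operator
```

Calling the function again after a second processor fails repeats the same steps, which matches the text's point that the system keeps running as further processors are disconnected.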

If, in this operating state, a fault occurs in another processor (P) other than the disconnected processor (P), that faulty processor (P) is likewise logically disconnected from the system by the operation described above, and the system continues to operate normally.

That is, in a multiprocessor system in which n processors (P) are coupled via the common bus (B), even if up to (n-1) processors (P) fail and become inoperable, the system as a whole does not stop and continues operating.

In the embodiment, each of the plurality of processors (P) is provided with a detection circuit (14) that detects that the signal line (s) has been driven "low", and each CPU (8) has the function of executing the program of FIG. 3. However, the detection circuit (14) and the like may instead be provided only in selected processors (P). In that case, if only one processor (P) were so equipped, a fault in that processor (P) would cause the system to malfunction through runaway or the like, so the detection circuit (14) and the like should be provided in two or more processors (P).

[Effects of the Invention]

As described above, according to the multiprocessor system of the present invention, even if a fault occurs in any one of the plurality of processors, only the processor in which the fault occurred can be logically disconnected from the system. The fault is thus prevented from spreading to the entire system, the entire system does not stop, and reliability is dramatically improved.

[Brief Description of the Drawings]

FIGS. 1 to 3 show one embodiment of the multiprocessor system of the present invention: FIG. 1 is a block diagram of the main part of one processor, FIG. 2 is a detailed block diagram of part of FIG. 1, and FIG. 3 is a flowchart explaining the operation. FIG. 4 is a block diagram of a general multiprocessor system, FIG. 5 is a block diagram of part of a conventional multiprocessor system, and FIG. 6 is a block diagram of part of another conventional multiprocessor system.

(P), (P1) to (Pn) ... processors; (IOC), (IOC1) to (IOCm) ... input/output control units; (M) ... common memory; (B) ... common bus; (s) ... system state monitoring signal line; (1) ... driver; (8) ... CPU; (14) ... detection circuit; (16), (16_1) to (16_5) ... interrupt gate circuits.

Agent: Patent Attorney Ryutaro Fujita

Claims

(1) A multiprocessor system in which a plurality of processors are coupled by a common bus, an input/output control unit, a common memory, and the like are connected to the common bus, and the system is configured so that, when a fault occurs in any of the processors, the faulty processor can be identified, the multiprocessor system being characterized by comprising: means for giving control commands such as stop, reset, and interrupt inhibit to the faulty processor based on an address signal on an address bus forming part of the common bus and data on a data bus forming part of the common bus; and means, provided in each of the processors, for inhibiting interrupts input from the input/output control unit and the like via the common bus in response to an interrupt-inhibit control command from an arbitrary processor.