JPH04367061A - Faulty processor detection system - Google Patents

Faulty processor detection system

Info

Publication number
JPH04367061A
JPH04367061A JP14283191A JP14283191A JPH04367061A JP H04367061 A JPH04367061 A JP H04367061A JP 14283191 A JP14283191 A JP 14283191A JP 14283191 A JP14283191 A JP 14283191A JP H04367061 A JPH04367061 A JP H04367061A
Authority
JP
Japan
Prior art keywords
transmission
processor
data
processors
flag
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP14283191A
Other languages
Japanese (ja)
Inventor
Kaoru Yamamoto
薫 山本
Masato Konuki
小貫 理人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
NEC Communication Systems Ltd
Original Assignee
NEC Corp
NEC Communication Systems Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, NEC Communication Systems Ltd filed Critical NEC Corp
Priority to JP14283191A priority Critical patent/JPH04367061A/en
Publication of JPH04367061A publication Critical patent/JPH04367061A/en
Pending legal-status Critical Current

Links

Landscapes

  • Multi Processors (AREA)

Abstract

PURPOSE:To accurately detect the fault of a processor by setting the receiving operation of a transmission destination processor at a transmittable flag and utilizing the state of this flag in a queue length check processing state. CONSTITUTION:A processor 21 connected via a bus 10 is used as the transmission destination processor, and the processors 22-2n are used as the transmission destination processors respectively. Then the processor 21 is provided with the transmission queues 72-7n corresponding to other processors, the transmission queue length counters 82-8n, and the transmittable flags 92-9n. A communication control processing part 51 of a CPU 50 registers the transmission data to the queues 72-7n and counts down the counters 82-8n. When the data sent from a transmission part 40 are received by the processors 22-2n, the corresponding flags 92-9n are set and those data are not registered into the queues 72-7n. Then the fault of a processor is decided only when the count values of the counters 82-8n exceed the threshold value and at the same time the flags 92-9n are cleared.

Description

【発明の詳細な説明】[Detailed description of the invention]

【0001】0001

【産業上の利用分野】本発明は障害プロセッサ検出方式
に関し、特に複数のプロセッサをバスにより接続して構
成されるマルチプロセッサシステムのプロセッサ間通信
における送信先プロセッサの障害を検出する障害プロセ
ッサ検出方式に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for detecting a faulty processor, and more particularly to a method for detecting a faulty processor for detecting a fault in a destination processor in communication between processors in a multiprocessor system configured by connecting a plurality of processors via a bus. .

【0002】0002

【従来の技術】従来、複数のプロセッサを含んで構成さ
れるマルチプロセッサシステムにおいては、送信先プロ
セッサ対応の送信待ちキュー(queue)を持ち送信
待ちキューの数が所定のしきい値を超えていれば送信先
プロセッサを障害と見なしていた。
2. Description of the Related Art Conventionally, in a multiprocessor system including a plurality of processors, there is a transmission waiting queue corresponding to a destination processor, and there is a system in which the number of transmission waiting queues exceeds a predetermined threshold. In other cases, the destination processor was considered to be a failure.

【0003】0003

【発明が解決しようとする課題】上述した従来の障害プ
ロセッサ検出方式は、送信待ちキューの数しか考慮して
いないため、単に送信先プロセッサが輻輳して受信処理
が追いつかず、送信は正常に行われているにもかかわら
ず結果的に送信待ちキューにたまってしまた場合でも送
信先プロセッサを障害としてしまい、正常動作している
プロセッサをシステムから切り離してしまうという欠点
がある。
[Problems to be Solved by the Invention] The conventional faulty processor detection method described above takes into account only the number of queues waiting for transmission, so the destination processor is simply congested and reception processing cannot catch up, and transmission is not performed normally. However, even if the destination processor ends up being accumulated in the transmission waiting queue, the destination processor becomes a failure, and a normally operating processor is disconnected from the system.

【0004】本発明の目的は上述した欠点を除去し、送
信先プロセッサが単なる輻輳状態において障害と誤認識
されることを大幅に抑圧した障害プロセッサ検出方式を
提供することにある。
SUMMARY OF THE INVENTION An object of the present invention is to provide a faulty processor detection method that eliminates the above-mentioned drawbacks and greatly suppresses the possibility of a destination processor being mistakenly recognized as faulty when it is simply in a state of congestion.

【0005】[0005]

【課題を解決するための手段】本発明の方式は、複数の
プロセッサをバスに接続して成るマルチプロセッサシス
テムにおける障害プロセッサを検出する障害プロセッサ
検出方式において、データを送信する発信元プロセッサ
に、送信すべき各送信先プロセッサごとに設定した送信
待ちキューと送信動作可能を意味する送信可能フラグと
を備え、送信先プロセッサが送信データを受領しない場
合には送信データを前記送信待ちキューに登録し、送信
先プロセッサが送信データを受領した場合には前記送信
可能フラグを立てて、第一の周期で前記送信待ちキュー
に蓄積した送信データを送信して掃き出すとともに前記
第一の周期よりも長い第二の周期で前記送信待ちキュー
の数をチェックしてチェック値が所定のしきい値を超え
てかつ前記送信可能フラグがクリアされている場合のみ
送信先プロセッサを障害と見なし、それ以外の場合は障
害なしと判定して前記送信可能フラグをクリアする障害
判定手段を有して成る。
[Means for Solving the Problems] The method of the present invention is a faulty processor detection method for detecting a faulty processor in a multiprocessor system in which a plurality of processors are connected to a bus. a transmission waiting queue set for each transmission destination processor and a transmission enable flag indicating that the transmission operation is possible, and registering the transmission data in the transmission waiting queue when the transmission destination processor does not receive the transmission data; When the transmission destination processor receives the transmission data, it sets the transmission ready flag, transmits and purges the transmission data accumulated in the transmission waiting queue in the first cycle, and also flushes the transmission data accumulated in the transmission waiting queue in the first cycle. The number of the queues waiting for transmission is checked at intervals of , and only when the check value exceeds a predetermined threshold and the transmittable flag is cleared, the destination processor is considered to be faulty; otherwise, the destination processor is faulty. The apparatus further comprises a failure determining means for determining that there is no such thing and clearing the transmittable flag.

【0006】[0006]

【実施例】次に、本発明について図面を参照して説明す
る。
DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be explained with reference to the drawings.

【0007】図1は、本発明の一実施例の構成を示すブ
ロック図である図1に示す実施例は、n個のプロセッサ
21〜2nをバス10を介して接続し、プロセッサ21
〜2nの間で通信を行いながら処理を進めるマルチプロ
セッサシステムを示す。
FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. In the embodiment shown in FIG. 1, n processors 21 to 2n are connected via a bus 10.
A multiprocessor system is shown in which processing is performed while communicating between ~2n.

【0008】プロセッサ21は、中央処理装置50と、
バス10上のデータを取り込んで中央処理装置50に伝
える受信部30と、中央処理装置50からのデータをバ
ス10上に送出する送信部40と、中央処理装置50に
接続された記憶装置60とを備えており、他のプロセッ
サ22〜2nも同様な構成を有する。
[0008] The processor 21 includes a central processing unit 50;
A receiving unit 30 that takes in data on the bus 10 and transmits it to the central processing unit 50, a transmitting unit 40 that sends data from the central processing unit 50 onto the bus 10, and a storage device 60 connected to the central processing unit 50. The other processors 22 to 2n also have similar configurations.

【0009】中央処理装置50は、送信制御処理部51
と、待ちキュー掃き出し処理部52と、キュー長チェッ
ク処理部53とを有し、記憶装置60は自プロセッサ以
外の送信相手先プロセッサに対応する送信待ちキュー7
2〜7nと、送信待ちキュー72〜7nにたまっている
データの数をカウントする送信待ちキュー長カウンタ8
2〜8nと、送信可能フラグ92〜9nとを有する。
The central processing unit 50 includes a transmission control processing section 51
, a waiting queue purging processing section 52 , and a queue length checking processing section 53 , and the storage device 60 stores the transmission waiting queue 7 corresponding to the transmission destination processor other than the own processor.
2 to 7n, and a transmission waiting queue length counter 8 that counts the number of data accumulated in the transmission waiting queues 72 to 7n.
2 to 8n, and transmittable flags 92 to 9n.

【0010】以下に、プロセッサ21がデータを送信す
る送信元プロセッサ、プロセッサ22及びプロセッサ2
3がデータ受信する送信先プロセッサであるとして本実
施例の動作を説明する。
[0010] In the following, processor 21 transmits data to a source processor, processor 22, and processor 2.
The operation of this embodiment will be explained assuming that 3 is a destination processor that receives data.

【0011】中央処理装置50の送信制御処理部51が
、たとえばプロセッサ22とプロセッサ23に対して送
信したが、何らかの要因によりプロセッサ22,プロセ
ッサ23がいずれも送信データを受信できなかった場合
には、送信制御処理部51はプロセッサ22に対する送
信データをプロセッサ22に対応する送信待ちキュー7
2に登録し、送信待ちキュー長カウンタ82のカウント
ダウンを行なわせる。同様に、プロセッサ23に対応す
る送信待ちキュー73に登録し、送信待ちキュー長カウ
ンタ83もカウントダウンさせる。
[0011] If the transmission control processing section 51 of the central processing unit 50 transmits data to, for example, the processors 22 and 23, but for some reason neither the processors 22 nor the processors 23 are able to receive the transmitted data, The transmission control processing unit 51 sends transmission data to the processor 22 to a transmission waiting queue 7 corresponding to the processor 22.
2 and causes the transmission waiting queue length counter 82 to count down. Similarly, it is registered in the transmission waiting queue 73 corresponding to the processor 23, and the transmission waiting queue length counter 83 is also counted down.

【0012】逆に、たとえばプロセッサ23が送信デー
タを受信した場合には、プロセッサ21のプロセッサ2
3に対する送信可能フラグ93を立て、送信待ちキュー
83へは登録しない。
Conversely, when the processor 23 receives the transmission data, the processor 2 of the processor 21
The transmittable flag 93 for No. 3 is set, and it is not registered in the transmission waiting queue 83.

【0013】中央処理装置50の待ちキュー掃き出し処
理部52は、第一の周期で各プロセッサへの送信待ちキ
ュー長カウンタ82〜8nのカウント値を確認して、送
信待ちキュー72〜7nにたまっていれば該当するプロ
セッサへ送信するように送信制御部51へ要求を出す。 1つのプロセッサへ対し、送信待ちキュー長カウンタが
0になるか、送信失敗になるまで送信待ちキューの掃き
出しを行い、次のプロセッサに対する送信待ちキューの
掃き出しへと移行する。
The waiting queue purging unit 52 of the central processing unit 50 checks the count values of the transmission waiting queue length counters 82 to 8n for each processor in the first cycle, and removes any data accumulated in the transmission waiting queues 72 to 7n. If so, a request is issued to the transmission control unit 51 to transmit it to the corresponding processor. The transmission queue for one processor is flushed out until the transmission queue length counter becomes 0 or transmission fails, and then the process moves to flushing the transmission queue for the next processor.

【0014】中央処理装置50のキュー長チェック処理
部53は、第一の周期より長い第二の周期ごとに各プロ
セッサに対応する送信待ちキュー長カウンタ82〜8n
の値と所定のしきい値と比較し、各プロセッサに対応す
る送信可能フラグ92〜9nをクリアする。この場合、
しき値を超えているプロセッサは、送信先プロセッサが
輻輳状態または障害中と思われる。そこで、そのどちら
かの状態かを決定するため、該当プロセッサに対する送
信可能フラグをチェックする。該当プロセッサの送信可
能フラグがクリアされていれば、前回のキュー長チェッ
ク処理から1度も送信データを受けとっていないので障
害が発生しているものと判断する。
The queue length check processing section 53 of the central processing unit 50 checks the transmission waiting queue length counters 82 to 8n corresponding to each processor every second period which is longer than the first period.
is compared with a predetermined threshold, and the transmittable flags 92 to 9n corresponding to each processor are cleared. in this case,
A processor that exceeds the threshold indicates that the destination processor is congested or failing. Therefore, in order to determine which state it is in, the transmittable flag for the processor in question is checked. If the transmittable flag of the relevant processor is cleared, it is determined that a failure has occurred since no transmission data has been received since the previous queue length check process.

【0015】たとえば、プロセッサ22とプロセッサ2
3の両方に対する送信待ちキュー長カウンタ82,83
が両方ともに所定のしきい値を超えていたとする。この
時プロセッサ22に対する送信可能フラグ92はクリア
状態、プロセッサ23に対する送信可能フラグ93は立
った状態であるならばプロセッサ23は何らかの要因で
送信データを引き取りにくくしている輻輳状態にあると
判断し、プロセッサ22を障害と判断する。
For example, processor 22 and processor 2
Transmission queue length counters 82 and 83 for both
Suppose that both exceed a predetermined threshold. At this time, if the transmittable flag 92 for the processor 22 is cleared and the transmittable flag 93 for the processor 23 is set, it is determined that the processor 23 is in a congested state that is making it difficult to receive the transmitted data for some reason. The processor 22 is determined to be at fault.

【0016】[0016]

【発明の効果】以上説明したように本発明は、マルチプ
ロセッサシステムにおける送信元プロセッサに対し、従
来から設けられている送信先プロセッサ対応の送信待ち
キュー長カウンタに加え新たに送信可能フラグを設け、
送信先プロセッサが受信動作をしているかどうかを送信
可能フラグに設定し、キュー長チェック処理時に送信可
能フラグの状態を併用することによって、送信先プロセ
ッサが単なる輻輳状態で受信動作が追いつかない場合に
、誤って送信先プロセッサを障害として判断する可能性
を著しく抑圧する効果を有する。
As described above, the present invention provides a new transmission enable flag for a transmission source processor in a multiprocessor system in addition to the conventional transmission waiting queue length counter corresponding to a destination processor.
By setting the transmission ready flag to indicate whether the destination processor is performing reception operations, and using the status of the transmission availability flag in conjunction with the queue length check process, you can use This has the effect of significantly suppressing the possibility of erroneously determining the destination processor as faulty.

【図面の簡単な説明】[Brief explanation of the drawing]

【図1】本発明の一実施例の構成を示すブロック図であ
る。
FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention.

【符号の説明】[Explanation of symbols]

10    バス 21〜2n    プロセッサ 30    受信部 40    送信部 50    中央処理装置 51    送信制御処理部 52    待ちキュー掃き出し処理部53    キ
ュー長チェック処理部 60    記憶装置 71〜7n    送信待ちキュー 82〜8n    送信待ちキュー長カウンタ92〜9
n    送信可能フラグ
10 Buses 21 to 2n Processor 30 Receiving unit 40 Transmitting unit 50 Central processing unit 51 Transmission control processing unit 52 Waiting queue purging processing unit 53 Queue length check processing unit 60 Storage devices 71 to 7n Waiting for transmission queue 82 to 8n Waiting for transmission queue length counter 92-9
n Sendable flag

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】  複数のプロセッサをバスに接続して成
るマルチプロセッサシステムにおける障害プロセッサを
検出する障害プロセッサ検出方式において、データを送
信する発信元プロセッサに、送信すべき各送信先プロセ
ッサごとに設定した送信待ちキューと送信動作可能を意
味する送信可能フラグとを備え、送信先プロセッサが送
信データを受領しない場合には送信データを前記送信待
ちキューに登録し、送信先プロセッサが送信データを受
領した場合には前記送信可能フラグを立てて、第一の周
期で前記送信待ちキューに蓄積した送信データを送信し
て掃き出すとともに前記第一の周期よりも長い第二の周
期で前記送信待ちキューの数をチェックしてチェック値
が所定のしきい値を超えてかつ前記送信可能フラグがク
リアされている場合のみ送信先プロセッサを障害と見な
し、それ以外の場合は障害なしと判定して前記送信可能
フラグをクリアする障害判定手段を有することを特徴と
する障害プロセッサ検出方式。
Claim 1: In a faulty processor detection method for detecting a faulty processor in a multiprocessor system in which a plurality of processors are connected to a bus, a faulty processor is set for each destination processor in a source processor that sends data. It is equipped with a transmission waiting queue and a transmission enable flag indicating that transmission operation is possible, and when the transmission destination processor does not receive the transmission data, the transmission data is registered in the transmission waiting queue, and when the transmission destination processor receives the transmission data, the transmission data is registered in the transmission waiting queue. sets the transmit ready flag, transmits and flushes out the transmission data accumulated in the transmission queue in a first cycle, and increases the number of transmission data in the transmission queue in a second cycle, which is longer than the first cycle. Only when the check value exceeds a predetermined threshold and the transmittable flag is cleared, the destination processor is considered to be faulty; otherwise, it is determined that there is no failure and the transmittable flag is cleared. A failure processor detection method characterized by having a failure determination means for clearing the failure.
JP14283191A 1991-06-14 1991-06-14 Faulty processor detection system Pending JPH04367061A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP14283191A JPH04367061A (en) 1991-06-14 1991-06-14 Faulty processor detection system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP14283191A JPH04367061A (en) 1991-06-14 1991-06-14 Faulty processor detection system

Publications (1)

Publication Number Publication Date
JPH04367061A true JPH04367061A (en) 1992-12-18

Family

ID=15324635

Family Applications (1)

Application Number Title Priority Date Filing Date
JP14283191A Pending JPH04367061A (en) 1991-06-14 1991-06-14 Faulty processor detection system

Country Status (1)

Country Link
JP (1) JPH04367061A (en)

Similar Documents

Publication Publication Date Title
US5383178A (en) Network commentator
US5363493A (en) Token ring network test device using finite state machine
EP0580938A2 (en) Duplex communication control device
JPH04367061A (en) Faulty processor detection system
JPS6072351A (en) Method for supervising operating condition of packet communication system
CN100367262C (en) Method and device for testing a monitoring function of a bus system and a corresponding bus system
JPH06175944A (en) Network monitoring method
JP3486182B2 (en) Communication device and communication control method and program therefor
JP4024941B2 (en) LAN device monitoring device for computers
JP3111543B2 (en) MAC bridge buffer release method
JPH02277336A (en) Faulty processor detection system
KR100250888B1 (en) Network detection device of distributed control system
JP2010028185A (en) Network node, network system and congestion detection method
JP2005071007A (en) Sensor system and its program
JP2701315B2 (en) Congestion identification method
JP2001297016A (en) Instruction execution system
JPH1023098A (en) Communication controller
JPH06112993A (en) Serial data communication equipment
JP2002330191A (en) Method and system for detecting abnormality
JP2713271B2 (en) Transport communication control method
JPH05153144A (en) Line deterioration diagnostic system for network
JP2007221619A (en) Broadcast communication equipment
JPS6383857A (en) Controlling system for processor congestion of multi-processor system
JP2003169041A (en) Asynchronous message communication method
JPH05268293A (en) Method and device for detecting fault of data communication processor