JPH08263456A - Diagnostic controller - Google Patents

Diagnostic controller

Info

Publication number
JPH08263456A
JPH08263456A JP7062851A JP6285195A JPH08263456A JP H08263456 A JPH08263456 A JP H08263456A JP 7062851 A JP7062851 A JP 7062851A JP 6285195 A JP6285195 A JP 6285195A JP H08263456 A JPH08263456 A JP H08263456A
Authority
JP
Japan
Prior art keywords
diagnostic
control
processing
card
cpu
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP7062851A
Other languages
Japanese (ja)
Inventor
Atsushi Hiraiwa
敦司 平岩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Computertechno Ltd
Original Assignee
NEC Computertechno Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Computertechno Ltd filed Critical NEC Computertechno Ltd
Priority to JP7062851A priority Critical patent/JPH08263456A/en
Publication of JPH08263456A publication Critical patent/JPH08263456A/en
Pending legal-status Critical Current

Links

Landscapes

  • Multi Processors (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

PURPOSE: To accelerate fault processing by providing specified connection memories in respective parallelly connected cards, reporting fault generation to a diagnostic processor(DGP) at such a time and parallelly performing diagnostic control inside the cards corresponding to an instruction from the DGP for each card. CONSTITUTION: Respective CPU cards are composed of CPUs 2-1 to 2-4, system control parts 2-5 and diagnostic control parts 2-6 equipped with control memories 2-7 and the control memory 2-7 stores the respective routines of reset processing, initializing processing, constitution control processing, fault processing and diagnostic processing. When any fault is generated at the CPU inside the CPU card during ordinary operation, the fault processing routine in the control memory 2-7 is started and the generation of the fault is reported to a DGP 1-6. The DGP 1-6 discriminates the degree of that fault and outputs the instruction corresponding to this degree, and the control memory 2-7 parallelly performs diagnostic control inside the card for each card based on the instruction from the DGP 1-6.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、診断制御装置、特にそ
れぞれが複数のCPUを搭載する複数のカードと診断プ
ロセッサ(以下DGPと記す)をバス接続して構成され
た情報処理装置における診断制御装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a diagnostic control device, and more particularly to a diagnostic control in an information processing device constituted by connecting a plurality of cards each having a plurality of CPUs and a diagnostic processor (hereinafter referred to as "DGP") via a bus. Regarding the device.

【0002】[0002]

【従来の技術】従来、情報処理装置に分野では、シング
ルプロセッサによる性能が重視されていた為、密結合単
位にDGPが存在し、またCPUが少なかった為、密結
合を構成する各CPUとDGP間には個別に診断バスが
あり、診断制御はこの診断バスを使うことによりDGP
から直接可能であった。
2. Description of the Related Art Conventionally, in the field of information processing apparatuses, since performance by a single processor was emphasized, there was a DGP in a tightly coupled unit, and since there were few CPUs, each CPU and DGP forming a tightly coupled unit. There is an individual diagnostic bus between them, and diagnostic control is performed by using this diagnostic bus.
It was possible directly from.

【0003】しかし、CPUを複数個搭載したカードを
バス接続などにより並列接続させた情報処理装置では、
DGPと各CPU間に診断バスを1対1に接続させるに
は、物理的に限度ある。そこで、従来、各カード内にハ
ードウエアによる診断制御部を設け、DGPからの指示
を受けそれに応じた制御をハードウエアのみで行ってい
る。
However, in an information processing device in which cards having a plurality of CPUs are connected in parallel by bus connection or the like,
There is a physical limit to the one-to-one connection of the diagnostic bus between the DGP and each CPU. Therefore, conventionally, a diagnostic control unit by hardware is provided in each card, and an instruction from the DGP is received and the corresponding control is performed only by the hardware.

【0004】[0004]

【発明が解決しようとする課題】上述した従来の診断制
御装置では、DGPからの代表的な処理としてリセッ
ト、処理設定、構成制御、障害処理だけを考えても診断
制御部をオールハードウエアで構成した場合、DGP−
CPU間の中継点に過ぎず、制御はDGPからこまめに
指示を送り、処理内容によってはその都度正常に完了し
たことを確認するリプライを受けなければならない為、
同時に複数のカードに対し行える処理は限られ、2つ以
上のカードに違った処理を出す場合は、DGPで先の処
理が完了したことを確認してからだすケースが大半でC
PUの数が多ければ多いほど処理が遅れるという問題点
がある。
In the above-described conventional diagnostic control device, the diagnostic control unit is constructed by all hardware even if only reset, process setting, configuration control, and fault processing are considered as typical processes from the DGP. If you do, DGP-
It is just a relay point between CPUs, and control frequently sends instructions from DGP, and depending on the processing content, it must receive a reply to confirm that it has completed normally each time.
The processing that can be performed on multiple cards at the same time is limited, and when different processing is to be performed on two or more cards, in most cases C will be issued after confirming that the previous processing is completed with DGP.
There is a problem that the processing is delayed as the number of PUs increases.

【0005】[0005]

【課題を解決するための手段】本発明の装置は、演算部
とシステム制御部により構成される中央処理部を1カー
ド上に複数個搭載させ、該カードを並列接続させた情報
処理装置における診断制御装置において、各カード内に
障害処理と診断制御を行うルーチンを格納する制御記憶
を設け、障害発生時に前記ルーチンを起動させることに
より診断プロセッサへの報告を行うと共に、DGPから
の指示によりカード内の診断制御をカード毎に並行して
行う。
The apparatus of the present invention is provided with a plurality of central processing units each comprising a computing unit and a system control unit mounted on one card, and the diagnostics in an information processing unit in which the cards are connected in parallel. In the control device, each card is provided with a control memory for storing a routine for performing fault processing and diagnostic control, and when the fault occurs, the routine is activated to report to the diagnostic processor, and at the same time the DGP instructs the card The diagnostic control of is performed in parallel for each card.

【0006】[0006]

【作用】本発明は、制御記憶を設けることによりDGP
とCPUカード間でリセット、初期設定、構成制御等の
処理を特定のCPUに対し継続的に行ったり、1つの処
理のみを行えるコマンドを設け、DGPからはケースに
応じて必要なコマンドを発行することにより診断制御部
内の制御記憶を起動させることにより行い、DGPは送
ったコマンドが正常に処理されたことを示す正常リプラ
イが一定時間内に戻ったことにより処理が完了したこと
を認識できる。
The present invention provides the DGP by providing the control memory.
Between CPU card and CPU card, commands such as reset, initial setting, configuration control, etc. can be continuously performed for a specific CPU, or only one process can be provided, and DGP issues necessary commands depending on the case. This is done by activating the control memory in the diagnostic control unit, and the DGP can recognize that the processing has been completed by returning the normal reply indicating that the command sent was processed normally within a fixed time.

【0007】又、通常運用中、CPUカード内で何らか
の障害が発生した場合も診断制御部内の制御記憶を起動
させ、その障害が直ちにシステムを停止すべく重度な障
害であるか、一部のCPUのみを論理的にシステムから
切り離すことによりシステムの運用は継続的に行える軽
度の障害であるかを判別させ、重度な障害時にはDGP
にシステム停止を指示し、軽度な障害時には切り離した
CPUを報告し、CPUの診断制御についてはDGPか
らのコマンド処理で行う。
Also, during normal operation, if some failure occurs in the CPU card, the control memory in the diagnostic control unit is activated, and the failure is a serious failure to immediately stop the system. Only by logically disconnecting it from the system, it is possible to continuously operate the system to determine whether it is a minor fault, and when it is a serious fault, the DGP
The CPU is instructed to stop, the CPU disconnected is reported in the case of a slight failure, and CPU diagnostic control is performed by command processing from the DGP.

【0008】[0008]

【実施例】次に本発明の実施例について図面を参照して
説明する。
Next, an embodiment of the present invention will be described with reference to the drawings.

【0009】図2は本発明の一実施例を示す図であり、
それぞれが4つのCPUを搭載させたCPUカード1−
1〜1−4と、主記憶1−5を構成する1枚のカードを
バス接続させ、各カード1−1〜1−5と診断プロセッ
サ1−6間に専用パスを設けている。
FIG. 2 shows an embodiment of the present invention.
CPU card 1 with 4 CPUs each
1 to 1-4 and one card constituting the main memory 1-5 are connected by a bus, and a dedicated path is provided between each card 1-1 to 1-5 and the diagnostic processor 1-6.

【0010】各CPUカードの構成は同一であり、図1
に示すように、4つのCPU2−1〜2−4と、システ
ム制御部2−5と、制御記憶2−7を備えた診断制御部
2−6とにより構成される。診断制御部2−6内の制御
記憶2−7は図3に示すように、リセット処理3−1、
初期設定処理3−2、構成制御処理3−3、障害処理3
−4、診断処理3−5の各ルーチンを記憶する。更に、
CPUで障害が発生した場合の処理フローを図4及び図
5に示す。
The configuration of each CPU card is the same.
As shown in FIG. 4, it is composed of four CPUs 2-1 to 2-4, a system control unit 2-5, and a diagnostic control unit 2-6 having a control memory 2-7. As shown in FIG. 3, the control memory 2-7 in the diagnostic control unit 2-6 has a reset process 3-1,
Initialization processing 3-2, configuration control processing 3-3, failure processing 3
-4, memorize | store each routine of the diagnostic process 3-5. Furthermore,
The processing flow when a failure occurs in the CPU is shown in FIGS.

【0011】通常稼働中、CPUカード1−1内のCP
U2−1で障害が発生した場合、同一カード1−1内の
障害処理ルーチン3−4が起動し、診断制御部2−6へ
障害発生が報告され、診断制御部内の制御記憶2−7に
起動がかかり、発生した障害がシステムを停止すべく重
度な障害であった場合、制御記憶2−7からはDGP1
−6に対しシステム停止指示が出され、DGP1−6が
認知すると共にシステムダウンとなる。
During normal operation, the CP in the CPU card 1-1
When a failure occurs in U2-1, the failure processing routine 3-4 in the same card 1-1 is activated, the failure occurrence is reported to the diagnostic control unit 2-6, and the control memory 2-7 in the diagnostic control unit is notified. If the system is started up and the fault that has occurred is a serious fault to stop the system, the DGP1 is stored in the control memory 2-7.
A system stop instruction is issued to -6, DGP1-6 recognizes it, and the system goes down.

【0012】一方、発生した障害がCPU2−1のみを
論理的にシステムから切り離すことによりシステムの継
続的な運用に支障を来さない軽度な障害である場合、制
御記憶2−7からはDGP1−6に対し切り離すユニッ
ト(この場合、CPU2−1)を報告する。DGP1−
6は、各カード内の診断制御部に対しCPU2−1がシ
ステムから切り離されたことを報告する。各カード内の
診断制御部では、制御記憶部内の構成制御ルーチン3−
3が起動しCPU2−1とのインタフェースを論理的に
切り離す。更に、CPUカード1−1の制御記憶2−7
はCPU2−1の障害情報を主記憶1−5の障害情報格
納領域に送出する。
On the other hand, if the fault that has occurred is a minor fault that does not hinder the continuous operation of the system by logically disconnecting only the CPU 2-1 from the system, then the control memory 2-7 outputs DGP1- The unit to be disconnected (CPU 2-1 in this case) is reported to 6. DGP1-
6 reports to the diagnostic control unit in each card that the CPU 2-1 has been disconnected from the system. In the diagnostic control unit in each card, the configuration control routine in the control storage unit 3-
3 starts and logically disconnects the interface with the CPU 2-1. Furthermore, the control memory 2-7 of the CPU card 1-1
Sends the failure information of the CPU 2-1 to the failure information storage area of the main memory 1-5.

【0013】DGP1−6は、全カードからのリプライ
を受けるとCPUカード1−1内の診断制御部2−6に
対しCPU2−1のリセットを指示する。CPUカード
1−1の診断制御部2−6より制御記憶2−7内のリセ
ット処理ルーチン3−1が起動しCPU2−1がリセッ
トされる。
Upon receiving the replies from all the cards, the DGP 1-6 instructs the diagnostic control unit 2-6 in the CPU card 1-1 to reset the CPU 2-1. The reset control routine 3-1 in the control memory 2-7 is activated by the diagnostic control unit 2-6 of the CPU card 1-1 to reset the CPU 2-1.

【0014】DGP1−6はCPUカード1−1の診断
制御部2−6よりリプライを受けると、CPUカード1
−1内の診断制御部2−6に対しCPU2−1へのFW
ロードを指示する。CPUカード1−1の診断制御部2
−6より制御記憶2−7内の初期設定処理ルーチン3−
2が起動し、CPU2−1へFWがロードされる。
When the DGP 1-6 receives a reply from the diagnostic controller 2-6 of the CPU card 1-1, the CPU card 1
FW to CPU 2-1 for diagnostic control unit 2-6 in -1
Instruct to load. Diagnostic control unit 2 of CPU card 1-1
Initial setting processing routine in control memory 2-7 from 6
2 starts, and FW is loaded into the CPU 2-1.

【0015】DGP1−6はCPUカード1−1の診断
制御部2−6よりリプライを受けると、CPUカード1
−1内の診断制御部2−6に対しCPU2−1へのテス
トプログラム起動を指示する。CPUカード1−1の診
断制御部2−6より制御記憶内2−7の診断処理ルーチ
ン3−5が起動し、CPU2−1ではテストプログラム
が実行される。テストプログラム実行において障害の再
発の有無をDGP1−6へ報告する。
When the DGP 1-6 receives a reply from the diagnostic control unit 2-6 of the CPU card 1-1, the CPU card 1
The diagnostic control unit 2-6 in -1 is instructed to start the test program to the CPU 2-1. The diagnostic control unit 2-6 of the CPU card 1-1 activates the diagnostic processing routine 3-5 in the control memory 2-7, and the CPU 2-1 executes the test program. Report the recurrence of the failure to the DGP1-6 during the execution of the test program.

【0016】DGP1−6は障害再発によりCPU2−
1に固定障害有りと見なし以降CPU12−1切り離し
状態で運用を継続する。又、障害再発無であればCPU
2−1の再度組み込み可能と見なし全カードの診断制御
部2−6に対し、CPU2−1のシステムへの組み込み
を報告する。各カード内の診断制御部2−6より制御記
憶部2−7内の構成制御ルーチン3−3が起動しCPU
2−1とのインタフェースが論理的に接続される。これ
以降、システムは障害発生以前と同状態で稼働する。
DGP1-6 is CPU2-
1 is regarded as having a fixed failure, and thereafter, the operation is continued in the CPU 12-1 disconnected state. If the failure does not recur, the CPU
It is considered that 2-1 can be installed again, and the diagnosis control units 2-6 of all cards are informed of the installation of the CPU 2-1 into the system. The configuration control routine 3-3 in the control storage unit 2-7 is activated by the diagnostic control unit 2-6 in each card, and the CPU
The interface with 2-1 is logically connected. After that, the system operates in the same state as before the failure.

【0017】以上、図4に示すフローに沿って1つの障
害発生時の処理過程を説明したが、2つの障害が同時に
起きた場合、図4に示すリセット処理ルーチン、処理設
定処理ルーチン、診断処理ルーチンは同時に起動させる
ことも可能である。
The processing procedure when one failure occurs has been described above with reference to the flow shown in FIG. 4, but when two failures occur simultaneously, the reset processing routine, the processing setting processing routine, and the diagnostic processing shown in FIG. The routines can be started at the same time.

【0018】[0018]

【発明の効果】以上説明したように本発明は、グループ
(カード)単位のCPUに診断制御専用の制御記憶を設
けたことにより、所望のCPUに対しDGPが必要とす
る処理をコマンドを発行するだけで可能である。又、本
発明はCPUを複数個並列接続させたシステムにおいて
は、異なるグループで発生した障害においては、その処
理を個別に実行できる為、従来に比べて障害処理が高速
化される。
As described above, according to the present invention, the CPU for each group (card) is provided with the control memory dedicated to the diagnostic control, so that the command for the process required by the DGP is issued to the desired CPU. Only possible. Further, according to the present invention, in a system in which a plurality of CPUs are connected in parallel, when a failure occurs in a different group, the processing can be executed individually, so that the failure processing is speeded up as compared with the conventional case.

【図面の簡単な説明】[Brief description of drawings]

【図1】図2に示した各CPUカードの詳細図である。FIG. 1 is a detailed view of each CPU card shown in FIG.

【図2】本発明の一実施例のブロック図である。FIG. 2 is a block diagram of an embodiment of the present invention.

【図3】図2に示した制御記憶部内に保持する制御ルー
チンの一例を示す図である。
FIG. 3 is a diagram showing an example of a control routine held in the control storage unit shown in FIG.

【図4】本発明における障害発生時の処理フローチャー
トである。
FIG. 4 is a processing flowchart when a failure occurs in the present invention.

【図5】本発明における障害発生時の図4に続く処理フ
ローチャートである。
FIG. 5 is a processing flowchart following FIG. 4 when a failure occurs in the present invention.

【符号の説明】[Explanation of symbols]

1−1〜1−4 CPUカード 1−5 主記憶 1−6 診断プロセッサ(DGP) 2−1〜2−4 CPU 2−5 システム制御部 2−6 診断制御部 2−7 制御記憶。 1-1 to 1-4 CPU card 1-5 Main memory 1-6 Diagnostic processor (DGP) 2-1 to 2-4 CPU 2-5 System controller 2-6 Diagnostic controller 2-7 Control memory.

Claims (2)

【特許請求の範囲】[Claims] 【請求項1】 演算部とシステム制御部により構成され
る中央処理部を1カード上に複数個搭載させ、該カード
を並列接続させた情報処理装置における診断制御装置に
おいて、各カード内に障害処理と診断制御を行うルーチ
ンを格納する制御記憶を設け、障害発生時に前記ルーチ
ンを起動させることにより診断プロセッサへの報告を行
うと共に、前記診断プロセッサからの指示によりカード
内の診断制御をカード毎に並行して行う診断制御装置。
1. A diagnostic control device in an information processing device, wherein a plurality of central processing units each comprising a computing unit and a system control unit are mounted on one card, and the cards are connected in parallel. And a control memory for storing a routine for performing diagnostic control, and when a failure occurs, the routine is activated to report to the diagnostic processor, and the instruction from the diagnostic processor causes the diagnostic control in the card to be performed in parallel for each card. Diagnostic control device.
【請求項2】 前記障害が重障害か否かを前記カード内
で判断して前記診断プロセッサに報告し、該診断プロセ
ッサは重障害ならシステムを停止し、重障害でなければ
障害プロセッサへは障害情報の出力、他のプロセッサへ
は障害プロセッサの切り離しを指示することを特徴とす
る請求項1記載の診断処理装置。
2. The card determines whether or not the failure is a serious failure and reports it to the diagnostic processor. If the diagnostic processor is a serious failure, the system is stopped. 2. The diagnostic processing device according to claim 1, wherein the output of information and the instruction to disconnect the faulty processor are given to other processors.
JP7062851A 1995-03-22 1995-03-22 Diagnostic controller Pending JPH08263456A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP7062851A JPH08263456A (en) 1995-03-22 1995-03-22 Diagnostic controller

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP7062851A JPH08263456A (en) 1995-03-22 1995-03-22 Diagnostic controller

Publications (1)

Publication Number Publication Date
JPH08263456A true JPH08263456A (en) 1996-10-11

Family

ID=13212233

Family Applications (1)

Application Number Title Priority Date Filing Date
JP7062851A Pending JPH08263456A (en) 1995-03-22 1995-03-22 Diagnostic controller

Country Status (1)

Country Link
JP (1) JPH08263456A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008003652A (en) * 2006-06-20 2008-01-10 Hitachi Ltd Diagnosis method for circuit board, circuit board and cpu unit
JP2008293276A (en) * 2007-05-24 2008-12-04 Sony Corp Digital cinema reproduction system, showing suspension corresponding method, and program
US8230260B2 (en) 2010-05-11 2012-07-24 Hewlett-Packard Development Company, L.P. Method and system for performing parallel computer tasks

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02244252A (en) * 1989-03-17 1990-09-28 Hitachi Ltd One-chip multiprocessor containing bus arbiter and comparator
JPH04149660A (en) * 1990-10-09 1992-05-22 Oki Electric Ind Co Ltd Multiprocessor system
JPH0535706A (en) * 1991-07-31 1993-02-12 Nec Corp Multiprocessor system
JPH05120129A (en) * 1991-05-15 1993-05-18 Internatl Business Mach Corp <Ibm> Multiplex-bank large-area memory card

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02244252A (en) * 1989-03-17 1990-09-28 Hitachi Ltd One-chip multiprocessor containing bus arbiter and comparator
JPH04149660A (en) * 1990-10-09 1992-05-22 Oki Electric Ind Co Ltd Multiprocessor system
JPH05120129A (en) * 1991-05-15 1993-05-18 Internatl Business Mach Corp <Ibm> Multiplex-bank large-area memory card
JPH0535706A (en) * 1991-07-31 1993-02-12 Nec Corp Multiprocessor system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008003652A (en) * 2006-06-20 2008-01-10 Hitachi Ltd Diagnosis method for circuit board, circuit board and cpu unit
US7870428B2 (en) 2006-06-20 2011-01-11 Hitachi, Ltd. Method of diagnosing circuit board, circuit board, and CPU unit
JP2008293276A (en) * 2007-05-24 2008-12-04 Sony Corp Digital cinema reproduction system, showing suspension corresponding method, and program
US8230260B2 (en) 2010-05-11 2012-07-24 Hewlett-Packard Development Company, L.P. Method and system for performing parallel computer tasks

Similar Documents

Publication Publication Date Title
US5652833A (en) Method and apparatus for performing change-over control to processor groups by using rate of failed processors in a parallel computer
JPH08263456A (en) Diagnostic controller
JPS634209B2 (en)
JPH03179538A (en) Data processing system
JPH07121395A (en) Method for preferentially selecting auxiliary device
JPH047645A (en) Fault tolerant computer
JP3042034B2 (en) Failure handling method
JP2000137623A (en) Diagnostic method and its system
JPH1139032A (en) Multi cpu type centralized monitoring device
JPH11265321A (en) Fault restoring method central processing unit and central processing system
JPH05108394A (en) Initializing diagnostic system for computer system
JPS59123946A (en) System control system
JPS6318229B2 (en)
JPS60195649A (en) Error reporting system of microprogram-controlled type data processor
JPH07183889A (en) On-line testing method for each device in atm exchange
JPH0418743B2 (en)
JPS59180758A (en) Loop detecting system of ncp
JPH0588944A (en) Input/output control and monitor system for information processing system
JPS6152749A (en) System supervisory system
JPH02278436A (en) Microprocessor diagnostic circuit
JPH04243341A (en) Diagnostic system for data transmitter in loop data transmission system
JPS59180776A (en) System for making forced ipl to stanby system controlling device
JPH0325628A (en) Automatic diagnostic system
JPH03156646A (en) Output system for fault information
JPS5911456A (en) Automatic switching system of electronic computer system

Legal Events

Date Code Title Description
A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 19980428