JP3279004B2 - Redundant resource management method and distributed fault tolerant computer system using the same

Info

Publication number: JP3279004B2
Application number: JP25801393A
Authority: JP
Inventors: 信康金川
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1993-10-15
Filing date: 1993-10-15
Publication date: 2002-04-30
Anticipated expiration: 2017-04-30
Also published as: JPH07114520A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a method for managing redundant resources, and more particularly to the efficient use of redundant resources in a fault-tolerant computer system.

[0002]

[Prior Art] Computers now play roles central to society, such as traffic control and finance, and roles on which human lives depend, such as the control of spacecraft and aircraft, so a halt or malfunction caused by a computer failure has far-reaching effects. Against this background, ever higher computer reliability is demanded.

[0003] To make computers highly reliable, redundancy is widely used: extra computers, or extra units making up a computer, are provided in advance in preparation for faults.

[0004] On the other hand, giving a computer redundant hardware for the sake of reliability leads to large increases in cost, size, weight, and power consumption. To raise the return on investment in a fault-tolerant computer system, that is, its cost performance, the redundant hardware resources must be used effectively to improve both reliability and processing performance.

[0005] As a redundant resource management method for making effective use of redundant hardware resources, the method described in Jean-Charles Fabre, et al., "Saturation: reduced idleness for improved fault-tolerance," Proc. FTCS-18 (The 18th Int'l Symp. on Fault-Tolerant Computing), pp. 200-205 (1988) has been proposed.

[0006] According to this prior art, the minimum number of redundant copies to be executed simultaneously, the MNC (Minimum Number of Copies), is determined in advance for each task. When a task execution request message arrives, if the number of idle nodes (redundant computer modules) is larger than the MNC, execution of the task is started on those idle nodes. If the number of idle nodes is smaller than the MNC, the request waits until currently executing tasks finish and the required number of idle nodes becomes available.
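
The admission rule described above can be illustrated with a minimal Python sketch. The names `admit_tasks`, `idle_nodes`, `pending`, and `mnc` are hypothetical. The sketch also assumes that an idle-node count equal to the MNC is sufficient and that a started task receives exactly MNC nodes; the text states only the "larger than" and "smaller than" cases.

```python
from collections import deque


def admit_tasks(idle_nodes, pending, mnc):
    """Start queued tasks while enough idle nodes are available.

    idle_nodes: list of idle node ids (hypothetical representation)
    pending:    deque of task names waiting to start, in arrival order
    mnc:        dict mapping task name -> minimum number of copies (MNC)
    Returns a dict mapping each started task to the nodes assigned to it.
    """
    started = {}
    # Assumption: equality with the MNC is treated as "enough" nodes.
    while pending and len(idle_nodes) >= mnc[pending[0]]:
        task = pending.popleft()
        # Assumption: the task receives exactly MNC nodes at start time.
        started[task] = [idle_nodes.pop(0) for _ in range(mnc[task])]
    # Tasks left in `pending` wait until running tasks free more nodes.
    return started
```

Note that the node count is fixed at start time only; nothing in this rule reacts to a node failing while the task runs.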

[0007]

[Problems to Be Solved by the Invention] The prior art described in the above document is a redundant resource management method well suited to OLTP (On-line Transaction Processor) systems, in which task start requests occur frequently.

[0008] In making real-time control computers highly reliable, the prior art does not give sufficient consideration to faults that occur during task execution, or to multiple faults. This is because the prior art was proposed on the assumption, derived from the OLTP property that transactions complete in a short time, that task execution time is sufficiently short compared with the mean time between failures (MTBF). In a real-time control computer, however, tasks often run continuously for a long time. In aircraft and spacecraft, for example, the computer must not only keep operating correctly throughout the mission time; even aborting the mission requires computer support. Task execution time is therefore not negligible compared with the MTBF, and faults occurring during task execution, including multiple faults, must be taken into account.

[0009] In the prior art, the number of computer modules allocated to a task is managed only when task execution starts. Consequently, even if a computer module executing the task loses its function because of a fault during execution, no new computer module is added. In other words, when a fault occurs during task execution, execution continues with reduced redundancy (the number of computer modules redundantly executing the task), and the reliability of the task is impaired. For example, if one of two computer modules redundantly executing a task fails and a second fault then follows, execution of the task is interrupted. An object of the present invention is to provide a redundant resource management method that can cope with faults occurring during task execution, including multiple faults, and a distributed fault-tolerant computer system using the method.

[0010]

[Means for Solving the Problems] The present invention is a distributed fault-tolerant computer system in which a plurality of tasks are assigned to and executed by a plurality of computer modules, characterized by selective execution means which, when a failure occurs in any computer module in the system, selects at least one computer module from among the other computer modules that had previously been assigned a task different from the task the failed computer module was executing, and assigns the task the failed computer module was executing to the selected computer module for execution.

[0011] Specifically, each computer module in the present invention is configured as follows.

[0012] (1) During task processing, each computer module broadcasts fault occurrence information (fault detection results) about itself, together with its processing results, to the other computer modules at appropriate timings (checkpoints).

[0013] (2) Each computer module estimates an evaluation function Fij (i: computer module number, j: task number) representing the reliability margin of each task, based on the fault occurrence information (fault detection results) broadcast by the other computer modules and on whether the processing results match.

[0014] (3) Each computer module selects the task j that minimizes the evaluation function Fij as the process to be executed, and switches from the process it is currently executing to that process.
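
Steps (1) to (3) can be sketched as one checkpoint cycle in Python. This is only an illustration under stated assumptions: the `Report` record, the function names, and the use of "number of healthy modules with the most common result" as a stand-in for the task's reliability level are all hypothetical and are not taken from the source. The margin is formed as reliability level minus a per-module threshold, one of the forms the description gives for Fij.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    """One broadcast checkpoint message (hypothetical layout)."""
    module: int   # sender's computer module number
    task: str     # task the sender is executing
    faulty: bool  # sender's own fault detection result
    result: int   # sender's processing result at this checkpoint


def agreeing_modules(reports, task):
    """Count healthy modules on `task` that share the most common result.

    Assumption: this count stands in for the task's reliability level.
    """
    results = [r.result for r in reports if r.task == task and not r.faulty]
    if not results:
        return 0
    return max(results.count(v) for v in set(results))


def next_task(reports, thresholds, current):
    """Return the task with the smallest margin for this module.

    thresholds: dict task -> this module's reliability threshold for it.
    The current task is kept when it ties for the smallest margin.
    """
    margins = {t: agreeing_modules(reports, t) - th
               for t, th in thresholds.items()}
    best = min(margins, key=margins.get)
    return current if margins[current] == margins[best] else best
```

Each module would run `next_task` on the reports it has collected, so the decision needs no central coordinator.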

[0015] Because the evaluation function Fij represents the reliability margin of each task, it is defined so that Fij is lower the more important the task is, lower the more responsible the computer module is for the task, and higher the more reliable the task is.

[0016] These conditions can be satisfied, for example, by the following definitions.

[0017]
Fij = Lrj - Lthij
or
Fij = Lrj / Lthij
where
Lthij: threshold of the reliability level of task j in computer module i
Lrj: reliability level of task j
i: number of the own computer module
j: task number
or
Fij = log{(1 - Lthij) / Pej}
where
Pej: probability that the result of task j is erroneous

The threshold Lthij of the reliability level of task j differs from task to task: the more important the task, that is, the higher the reliability it requires, the larger the value of Lthij that is set.

[0018] Lthij also differs from one computer module i to another: the higher a computer module's responsibility for the task, the higher the value of Lthij that is set.
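
The three candidate forms of Fij given above can be written down directly. The following Python sketch uses hypothetical function names and assumes that Lthij lies strictly between 0 and 1 and that Pej is positive, so that the ratio and the logarithm are defined; the source does not state these ranges.

```python
import math


def f_difference(l_r, l_th):
    """Fij = Lrj - Lthij."""
    return l_r - l_th


def f_ratio(l_r, l_th):
    """Fij = Lrj / Lthij (assumes Lthij > 0)."""
    return l_r / l_th


def f_log(l_th, p_e):
    """Fij = log((1 - Lthij) / Pej) (assumes Lthij < 1 and Pej > 0)."""
    return math.log((1.0 - l_th) / p_e)


def pick_task(margins):
    """Return the task whose margin Fij is smallest.

    margins: dict task -> Fij as seen by one computer module.
    """
    return min(margins, key=margins.get)
```

In all three forms a larger threshold Lthij (a more important task, or a more responsible module) lowers Fij, and a more reliable task (larger Lrj, or smaller Pej) raises it, which is the behaviour the conditions above require.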

[0019]

[Operation] According to the present invention, computer modules are always assigned to tasks so that the evaluation functions Fij stay balanced, and the Fij of a particular task therefore never becomes conspicuously larger or smaller than the others. That is, when the reliability level of a particular task has dropped because of a fault during operation (such a task is hereinafter called an at-risk task), a computer module that is executing another task with reliability to spare executes the at-risk task instead, which prevents the reliability level of that one task alone from dropping. The system can thus cope with faults occurring during task execution and fulfil its assigned duties while maintaining reliability.

[0020] Moreover, because Lthij is set higher for more important tasks, the Fij of such a task balances against those of other tasks at a higher Lrj. A more important task is thus assigned more computer modules and can maintain a higher reliability level Lrj.

[0021] Furthermore, because each computer module can autonomously decide which task to execute, no central mechanism for allocating task execution is needed, and there is no single point of failure. A single fault therefore cannot disable the entire system, and system reliability is improved.

[0022]

[Embodiments] Embodiments of the present invention are described below with reference to the drawings.

[0023] <Operating concept> FIG. 4 schematically shows an embodiment of the present invention. As an example, FIG. 4 assumes that computer modules 101 to 10(i-1) redundantly execute task 1, that computer modules 10i to 10m redundantly execute task 2, and that a fault occurs in computer module 10(i-1), making normal operation impossible. According to this embodiment, when such a fault occurs in computer module 10(i-1), computer module 10i stops executing task 2 and starts executing task 1. This mitigates the drop in the number of computer modules executing task 1 caused by the failure of computer module 10(i-1) and prevents a large fall in the reliability of task 1.

[0024] FIG. 5 shows an embodiment in which evaluation functions F1 and F2 are introduced into the task-switching decision of computer module 10i of FIG. 4. The evaluation functions F1 and F2 reflect the reliability of task 1 and task 2, respectively; how they are defined is described later. First, on the left side of FIG. 5, a fault has occurred in computer module 10(i-1), which was executing task 1, so the evaluation function F1 (reliability) falls below F2. Then, as shown on the left side of FIG. 5, computer module 10i, one of the computer modules executing task 2, joins the execution of task 1, and F1 and F2 become approximately equal. When a fault causes a large difference between evaluation function values, which computer module changes the task it is executing is determined by the degree of responsibility defined in advance for each computer module with respect to each task. In this embodiment, among computer modules 10i to 10m executing task 2, computer module 10i has the highest responsibility for task 1.

[0025] If the hardware that carries the redundant resource management function described above, that is, the task switching and decision functions, is a single non-redundant unit, a failure of that hardware could disrupt not only the redundant resource management function but the normal operation of the entire system. The hardware carrying the redundant resource management function must therefore itself be made redundant. There are three ways to do this.

[0026] (1) Assign the redundant resource management function to dedicated hardware and make that hardware redundant.

[0027] (2) Assign the redundant resource management function to several of computer modules 101 to 10(i-1), and let the redundant resource management function itself decide and control which computer modules carry it.

[0028] (3) Give each of computer modules 101 to 10(i-1) a redundant resource management function with which it autonomously decides which task it should execute and executes it.

[0029] Method (1) can be realized by providing multiple sets of the hardware, or hardware and software, that implement the redundant resource management function shown in FIGS. 4 and 5. Method (2) can be realized by assigning the task that performs the redundant resource management function shown in FIGS. 4 and 5 to multiple computer modules and making that task subject to redundant resource management like any other task. An embodiment of method (3) is described below.

[0030] FIG. 6 shows an embodiment in which, when a fault causes a large difference between evaluation function values, each computer module decides independently, on its own, whether to join the execution of the task whose evaluation function has fallen. Computer modules 101 to 10m each calculate the evaluation functions Fij (i: processor number, j: task number). Here Fij is defined to be lower for a computer module with higher responsibility for task j. In other words, Fij can be regarded as the margin in the responsibility that each computer module must fulfil for each task. In FIG. 6, for example, responsibility for task 1 is highest and responsibility for task 2 is lowest at computer module 101, and the order runs through to computer module 10m, so that even when all computer modules are normal, as on the left side of FIG. 6, F11 < F21 < ... < Fm1 and F12 > F22 > ... > Fm2. In computer modules 101 to 10(i-1), Fi1 < Fi2 holds, and in computer modules 10i to 10m, Fi1 > Fi2 holds, so they execute task 1 and task 2, respectively.

[0031] When a fault occurs in computer module 10(i-1), as in the center of FIG. 6, the value of Fi1 falls in every computer module, and in computer module 10i the relation between Fi1 and Fi2 is reversed, giving Fi1 < Fi2. As shown on the right of FIG. 6, computer module 10i therefore decides on its own to stop executing task 2 and start task 1. In this embodiment, each computer module autonomously switches the task it executes by its own decision, so there is no so-called manager in which the redundant resource management function of the whole system is concentrated. Consequently there is no single point of failure to become a bottleneck for reliability, and the reliability of the redundant resource management function itself is increased.
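
The FIG. 6 scenario can be reproduced numerically with a small Python sketch. All numbers and names here are illustrative assumptions: four modules, Fij = Lrj - Lthij, and the reliability level Lrj approximated by the number of healthy modules executing task j. The sketch evaluates a single checkpoint and does not model later re-evaluation.

```python
def evaluate(l_r, l_th):
    """Return {module: {task: Fij}} with Fij = Lrj - Lthij."""
    return {i: {j: l_r[j] - th[j] for j in th} for i, th in l_th.items()}


def choices(l_r, l_th):
    """Each module independently picks the task with its smallest Fij."""
    return {i: min(fi, key=fi.get) for i, fi in evaluate(l_r, l_th).items()}


# Hypothetical thresholds: responsibility for task1 falls, and for task2
# rises, from module 1 to module 4.
L_TH = {
    1: {"task1": 2.0, "task2": 0.5},
    2: {"task1": 1.5, "task2": 1.0},
    3: {"task1": 1.0, "task2": 1.5},
    4: {"task1": 0.5, "task2": 2.0},
}

# All modules healthy: two modules run each task.
before = choices({"task1": 2, "task2": 2}, L_TH)

# Module 2 faults, so only one healthy module still runs task1.
healthy = {i: th for i, th in L_TH.items() if i != 2}
after = choices({"task1": 1, "task2": 2}, healthy)

print(before)
print(after)
```

With these numbers, modules 1 and 2 start on task1 and modules 3 and 4 on task2. After the fault, only module 3 sees its two margins swap order, so it alone moves to task1, which mirrors the role of computer module 10i in the description.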

[0032] For simplicity, the embodiments of FIGS. 4 to 6 take as an example a system executing only two tasks, task 1 and task 2, but redundant resources can of course be managed in the same way for any number n of tasks.

[0033] Methods for selecting among the results computed for each task by the redundant computer modules include majority voting and the methods of Japanese Patent Application No. 63-118603 and JP-A 1-288928, already filed by the present inventors.

[0034] <System configuration> FIG. 1 shows a system configuration for realizing the present invention. The system consists of m computer modules 101 to 10m with identical functions. For high reliability, multiple computer modules are assigned to each of tasks 111 to 11n and execute the task redundantly. In the example of FIG. 1, the i1 computer modules 101 to 10i1 are assigned to task 1 (111), the (i2 - i1) computer modules 10(i1+1) to 10i2 are assigned to task 2 (112), and the (m - i(n-1)) computer modules 10(i(n-1)+1) to 10m are assigned to task n (11n).

[0035] Each of computer modules 101 to 10m can send an output to output selection circuits 51 to 5l (the final suffix is the letter l, the index of the last circuit). Outputs 31-1 to 31-l, ..., 3m-1 to 3m-l are the outputs of computer modules 10-1, ..., 10-m, respectively, to output selection circuits 51 to 5l. Together with these outputs, computer modules 10-1, ..., 10-m send selection control signals 41-1 to 41-l, ..., 4m-1 to 4m-l to output selection circuits 51 to 5l. The selection control signals indicate whether the corresponding outputs 31-1 to 31-l, ..., 3m-1 to 3m-l should be selected by output selection circuits 51 to 5l. For example, when computer module 101 is normal and is producing output 31-3 to be sent to output selection circuit 51, selection control signal 41-1 is on.

[0036] The figure shows only outputs 31-1 to 31-l and selection control signals 41-1 to 41-l; outputs 32-1 to 32-l, ..., 3m-1 to 3m-l and selection control signals 42-1 to 42-l, ..., 4m-1 to 4m-l are omitted.

[0037] Based on selection control signals 41-1 to 41-l, ..., 4m-1 to 4m-l, output selection circuits 51 to 5l determine which outputs to select and deliver them as outputs 61 to 6l. Outputs 61 to 6l are connected to output devices 71 to 7l. In many control systems, output devices 71 to 7l are electric, hydraulic, or similar actuators that control the controlled object.

[0038] One circuit usable as output selection circuits 51 to 5l is the MV (Modified Voter) circuit shown in FIG. 2 of Japanese Patent Application No. 63-118603 and JP-A 1-288928, already filed by the present inventors.

[0039] FIG. 2 shows the configuration of computer module 10i for implementing the present invention as a functional diagram. Computer module 10i has task execution means 12i, a fault information exchange function 13i, a decision function 14i that determines the task to be executed, and a task switching function 15i. Based on the result from decision function 14i, it selects the task to execute from task 1 (111) to task n (11n) and executes it. In the embodiment of FIG. 2, computer module 101 is executing task 1 (111).

[0040] Fault information exchange function 13i broadcasts the fault status of its own computer module and the processing results of the task it is executing to the other computer modules via communication path 1, and at the same time collects the fault status and task processing results broadcast by the other computer modules.

[0041] Methods proposed for communicating with other computer modules via communication path 1 include message passing, shared memory, and memory bank switching. Forms proposed for communication path 1 include bus, net, and ring types.

[0042] FIG. 3 shows the configuration of computer module 10i for implementing the present invention. An MPU (Micro-Processing Unit) 21i, a communication interface 22i, an output interface 23i, a selection control signal interface 24i, and a storage device 25i are connected to bus 20i. Communication interface 22i is for communication with other computer modules and is connected to them via communication path 1. Fault information exchange function 13i of FIG. 2 is realized through selection control signal interface 24i.

[0043] Output interface 23i is a circuit for sending outputs 3i-1 to 3i-l to output selection circuits 51 to 5l. Either parallel or serial transfer may be used, depending on the application. If output interface 23i can produce outputs 3i-1 to 3i-l independently of one another, the system can be used in applications that drive multiple output devices simultaneously.

[0044] Selection control signal interface 24i is a circuit for sending selection control signals 4i-1 to 4i-l to output selection circuits 51 to 5l. By writing to a register of selection control signal interface 24i, MPU 21i can turn the desired selection control signals 4i-1 to 4i-l on (selected). The conditions for turning selection control signal 4i-l' (l': an integer from 1 to l) on (selected) are that (a) computer module 10i is executing a task that sends output 3i-l' to output selection circuit 5l', and (b) the task being executed by computer module 10i is judged to be normal. One way to make the normal/abnormal judgment in (b) is the method of Japanese Patent Application No. 63-118603 and JP-A 1-288928, already filed by the present inventors.

[0045] Suppose computer module 101 is normal and executing task 1, which sends its output to output selection circuit 51, and a fault occurs in another computer module 10-i that is executing task 2, which sends its output to output selection circuit 52. If computer module 101 has the highest responsibility for task 2, it ends execution of task 1 and starts execution of task 2. At that point selection control signal 41-1 from computer module 101 to output selection circuit 51, which was on while task 1 was running, turns off as task 1 ends, and selection control signal 42-1 to output selection circuit 52 turns on as task 2 starts. In addition, selection control signal 4i-2 from computer module 10-i to output selection circuit 52, which had been on, turns off when the fault occurs. As a result, output selection circuit 52, which before the fault selected output 32-i from computer module 10-i (then executing task 2 normally) as output 62 and sent it to actuator 72, can after the fault select output 32-1 from computer module 101 as output 62 and send it to actuator 72.
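
The on/off conditions for a selection control signal and the resulting handover at an output selection circuit can be sketched in Python as follows. The function names, the `state` dictionary, and the rule "take the output of the lowest-numbered module whose signal is on" are hypothetical simplifications; the source points to an MV (Modified Voter) circuit for the actual selection, which this sketch does not reproduce.

```python
def selection_signal(running_task, task_normal, task_to_selector, selector):
    """Signal is on only if (a) the running task drives `selector`
    and (b) the task is judged normal."""
    return task_normal and task_to_selector.get(running_task) == selector


def signals_for(selector, state, task_to_selector):
    """state: {module: (running_task, task_normal)} -> {module: signal}."""
    return {m: selection_signal(t, ok, task_to_selector, selector)
            for m, (t, ok) in state.items()}


def select_output(outputs, signals):
    """Simplified stand-in for an output selection circuit: return the
    output of the lowest-numbered module whose signal is on, else None."""
    for module in sorted(outputs):
        if signals.get(module, False):
            return outputs[module]
    return None


# Hypothetical wiring: task1 drives selector 51, task2 drives selector 52.
TASK_TO_SELECTOR = {"task1": 51, "task2": 52}
OUTPUTS = {1: "from-module-1", 2: "from-module-2"}

before = {1: ("task1", True), 2: ("task2", True)}
after = {1: ("task2", True), 2: ("task2", False)}  # module 2 has faulted

print(select_output(OUTPUTS, signals_for(52, before, TASK_TO_SELECTOR)))
print(select_output(OUTPUTS, signals_for(52, after, TASK_TO_SELECTOR)))
```

In this sketch selector 51 is left without a source after the handover, because only two modules are modelled; in the described system other modules would still be executing task 1.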

[0046] According to the embodiment described above, multiple tasks can be executed in parallel and redundantly using multiple computer modules.

[0047] The description above assumes that one task sends its output to one actuator, but one task may also send output to multiple actuators, and a task may send no output to any actuator at all.

[0048] <Evaluation function calculation and decision algorithm> FIG. 7 is a flowchart of the decision made by decision functions 14-1 to 14-m when determining the task to be executed according to the present invention.

[0049] Evaluation function calculation process 300 calculates the evaluation function Fij (j: task number) for each task.

[0050] Because the evaluation function Fij represents the reliability margin of each task, it is defined so that Fij is lower the more important the task is, lower the more responsible the computer module is for the task, and higher the more reliable the task is. That is,
∂Fij/∂I < 0
∂Fij/∂Resp < 0
∂Fij/∂Rel > 0
where
I: importance
Resp: responsibility
Rel: reliability

【００５１】上記の条件を満たす評価関数Ｆijは例え
ば、The evaluation function Fij satisfying the above condition is, for example,

【００５２】[0052]

【数１】Ｆij＝Ｌrj−Ｌthij …（１）ただし、Ｌthij：コンピュータモジュールｉにおけるタ
スクｊの信頼度レベルのしきい値Ｌrj：タスクｊの信頼度レベルｉ：自コンピュータモジュールの番号ｊ：タスクの番号となる。Fij = Lrj − Lthij … (1), where Lthij is the threshold of the reliability level of task j in computer module i, Lrj is the reliability level of task j, i is the number of the own computer module, and j is the task number.
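As an illustration of equation (1), the margin can be computed per computer module as in the following minimal sketch. The function name, the threshold values, and the reliability level are hypothetical and are chosen only to show that, for the same task, a module with a higher threshold Lthij sees a smaller margin Fij.

```python
def evaluation_margin(reliability_level, threshold):
    """Equation (1): Fij = Lrj - Lthij."""
    return reliability_level - threshold


# Hypothetical thresholds Lthij of one task j on modules i = 1, 2, 3.
# Module 1 is assumed to carry the highest responsibility for the task.
thresholds = {1: 0.999, 2: 0.99, 3: 0.9}

# Hypothetical current reliability level Lrj of task j.
lrj = 0.995

margins = {i: evaluation_margin(lrj, lth) for i, lth in thresholds.items()}
```

With these assumed numbers only module 1 sees a negative margin, so it would be the first module to regard task j as under-protected.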

【００５３】なおここで、タスクｊの信頼度レベルのし
きい値Ｌthijはタスクの重要度によって異なり、重要な
タスクすなわち高い信頼度が要求されるタスクほど大き
な値が設定される。さらに全てのコンピュータモジュー
ルについて同じ値のＬthijを設定すると、フォールト発
生時に全てコンピュータモジュールが同一のタスクを実
行してしまうためにシステムの動作が不安定になる。従
ってＬthijはコンピュータモジュールｉによって異な
り、タスクについて責任度の高いコンピュータモジュー
ルほど高い値が設定される。Here, the threshold value Lthij of the reliability level of the task j differs depending on the importance of the task, and a larger value is set for an important task, that is, a task requiring a higher reliability. Further, if the same value Lthij is set for all the computer modules, the operation of the system becomes unstable because all the computer modules execute the same task when a fault occurs. Therefore, Lthij differs depending on the computer module i, and a higher value is set for a computer module having a higher responsibility for a task.

【００５４】すなわち、 ∂Ｌthij／∂Ｉ＞０ ∂Ｌthij／∂Ｒesp＞０となる。That is, ∂Lthij / ∂I> 0 and ∂Lthij / ∂Resp> 0.

【００５５】ここで、タスクｊの信頼度レベルＬrjの決
定方法について説明する。信頼度レベルＬrjなる評価関
数は、タスクｊを実行しているコンピュータモジュール
の数、処理結果の一致不一致および処理結果が一致して
いるプロセッサの数などのフォールト検出結果すなわち
フォールト情報から算出されるものとする。Here, a method of determining the reliability level Lrj of task j will be described. The evaluation function Lrj is calculated from fault detection results, that is, fault information, such as the number of computer modules executing task j, whether their processing results match, and the number of processors whose processing results match.

【００５６】まずここで、誤った結果をシステムの出力
として採用してしまう確率に着目すると、チェック（検
査）の合格の度合いから信頼度レベルＬrjが算出でき
る。Ｎ１個のコンピュータモジュールがタスクｊが実行
していて、その内Ｎ２個のコンピュータモジュールがチ
ェックの結果正常と判断され、Ｎ３個のコンピュータモ
ジュールの計算結果が一致した場合、タスクｊの計算結
果が誤っている確率Ｐejは、First, focusing on the probability that an erroneous result is adopted as the system output, the reliability level Lrj can be calculated from the degree to which the checks (inspections) are passed. When N1 computer modules are executing task j, N2 of them are judged normal by the check, and the calculation results of N3 computer modules match, the probability Pej that the calculation result of task j is incorrect is

【００５７】[0057]

【数２】 Pej = Pε^N1 · Pεd^N2 · Pεa^(N3−1) … (2)

【００５８】ただし、Ｐε：誤りが発生する確率Ｐεd：チェックが誤りを見逃す確率Ｐεa：誤った計算結果が偶然一致する確率となる。なおここで、Ｐε，Ｐεd，Ｐεaはシステムの
動作環境、誤り検出方式により求めることができ、既知
の定数であるのでＰejはＮ１，Ｎ２，Ｎ３−１の関数で
ある。Here, Pε is the probability that an error occurs, Pεd is the probability that the check overlooks an error, and Pεa is the probability that erroneous calculation results coincide by chance. Since Pε, Pεd, and Pεa can be determined from the operating environment of the system and the error detection method and are therefore known constants, Pej is a function of N1, N2, and N3−1.

【００５９】タスクｊの信頼度レベルＬrjすなわち計算
結果が誤っていない確率は、The reliability level Lrj of task j, that is, the probability that the calculation result is not incorrect, is

【００６０】[0060]

【数３】Ｌrj＝１−Ｐej …（３）で与えられる。Lrj = 1−Pej (3)
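The relationship between the counts N1, N2, N3 and the reliability level can be sketched as follows. This assumes that Pej has the product form implied by the logarithmic expression in equation (4); the function names and the probability values used in the example are hypothetical.

```python
def error_probability(n1, n2, n3, p_err, p_miss, p_agree):
    """Pej, assuming the product form implied by equation (4).

    n1: modules executing task j
    n2: modules judged normal by the check
    n3: modules whose calculation results match
    p_err, p_miss, p_agree: stand-ins for P-epsilon, P-epsilon-d, P-epsilon-a
    """
    return (p_err ** n1) * (p_miss ** n2) * (p_agree ** (n3 - 1))


def reliability_level(n1, n2, n3, p_err, p_miss, p_agree):
    """Equation (3): Lrj = 1 - Pej."""
    return 1.0 - error_probability(n1, n2, n3, p_err, p_miss, p_agree)
```

Under this form, every additional module that executes the task, passes the check, or agrees with the others multiplies Pej by a factor smaller than one, so Lrj rises with redundancy.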

【００６１】なおここで、簡単のために（３）式よりＰ
ejの大小でＬrjの大小を評価することにすれば、（２）
式の両辺の対数をとれば、Here, for simplicity, the magnitude of Lrj is evaluated through the magnitude of Pej in view of equation (3). Taking the logarithm of both sides of equation (2) gives

【００６２】[0062]

【数４】 log(Ｐej)＝Ｎ１・log(Ｐε)＋Ｎ２・log(Ｐεd)＋(Ｎ３−１)・log(Ｐεa) …（４）となり、Ｐε，Ｐεd，Ｐεaの値はフィールドデータや
シミュレーションにより算出が可能であるから、これら
の値の対数をそれぞれＫ１，Ｋ２，Ｋ３とすると（４）
式は、log(Pej) = N1·log(Pε) + N2·log(Pεd) + (N3−1)·log(Pεa) … (4). Since the values of Pε, Pεd, and Pεa can be obtained from field data or simulation, letting K1, K2, and K3 denote their respective logarithms, equation (4) simplifies to

【００６３】[0063]

【数５】 log(Ｐe）＝Ｎ１・Ｋ１＋Ｎ２・Ｋ２＋(Ｎ３−１)・Ｋ３ …（４′）と簡略化される。log(Pe) = N1·K1 + N2·K2 + (N3−1)·K3 … (4′)

【００６４】さらにここで、（１）式の評価関数のかわ
りに計算結果が誤っている確率Ｐeに着目して、Further, paying attention to the probability Pe that the calculation result is incorrect instead of the evaluation function of the equation (1),

【００６５】[0065]

【数６】Ｆij＝log{(１−Ｌthij)／Ｐe｝ …（１′）なる評価関数を定義すれば、if the evaluation function Fij = log{(1 − Lthij)/Pe} … (1′) is defined, then

【００６６】[0066]

【数７】Ｆij＝Ｋ４−Ｎ１・Ｋ１＋Ｎ２・Ｋ２＋(Ｎ３−１）・Ｋ３ …（１″）ただし、Ｋ４＝log（１−Ｌthij) と評価関数Ｆijの算出が加算，減算，乗算だけで可能と
なり、容易化（高速化）できる。Fij = K4 − {N1·K1 + N2·K2 + (N3−1)·K3} … (1″), where K4 = log(1 − Lthij). The evaluation function Fij can thus be calculated with only addition, subtraction, and multiplication, which simplifies and speeds up the calculation.
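The logarithmic form can be checked numerically with a short sketch. It follows equation (1′), Fij = log{(1 − Lthij)/Pe} = K4 − log(Pe), with log(Pe) taken from equation (4′). The constants below are hypothetical; in practice K1 to K4 would be precomputed once so that only additions, subtractions, and multiplications remain at run time.

```python
import math


def log_error_probability(n1, n2, n3, k1, k2, k3):
    """Equation (4'): log(Pe) = N1*K1 + N2*K2 + (N3-1)*K3."""
    return n1 * k1 + n2 * k2 + (n3 - 1) * k3


def evaluation_function(n1, n2, n3, k1, k2, k3, k4):
    """Equation (1'): Fij = log((1 - Lthij)/Pe) = K4 - log(Pe)."""
    return k4 - log_error_probability(n1, n2, n3, k1, k2, k3)


# Hypothetical constants, precomputed offline.
P_ERR, P_MISS, P_AGREE = 1e-3, 1e-2, 1e-1
LTH = 0.999999
K1, K2, K3 = math.log(P_ERR), math.log(P_MISS), math.log(P_AGREE)
K4 = math.log(1.0 - LTH)
```

Because K1, K2, and K3 are logarithms of probabilities and therefore negative, Fij grows as N1, N2, or N3 grows, which matches the requirement that the margin increases with the reliability of the task.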

【００６７】同様に、タスクｊを実行しているコンピュ
ータモジュールで誤りが発生する確率に着目してもタス
クｊの信頼度レベルＬrjを算出することができる。Similarly, the reliability level Lrj of the task j can be calculated by paying attention to the probability of occurrence of an error in the computer module executing the task j.

【００６８】Ｎ１個のコンピュータモジュールがタスク
ｊを実行していると仮定すると、これら全てのコンピュ
ータモジュールで誤りが発生し、タスクｊについての計
算結果が誤る確率は、Assuming that N1 computer modules are executing task j, the probability that an error will occur in all of these computer modules and the calculation result for task j will be incorrect will be:

【００６９】[0069]

【数８】 Pej = Pε^N1

【００７０】となり、両辺の対数をとると（２）式と同
様にして、Then, taking the logarithm of both sides, as in the equation (2),

【００７１】[0071]

【数９】Ｆij＝Ｋ４−Ｎ１・Ｋ１ …（１′′′）と評価関数Ｆijの算出を簡略化することができる。Fij = K4 − N1·K1 … (1‴), and the calculation of the evaluation function Fij can thus be simplified.

【００７２】条件判定処理３０１では、各タスクについ
ての評価関数（Ｆij（ｊ：１，…，ｎ，ｎ：タスクの
数）と現在実行しているタスクｋについての評価関数Ｆ
ikとを比較する。その結果もしＦij＜Ｆikを満たすタス
クｊがあれば、現在実行しているタスクｋを終了し、タ
スクｊを開始する。In the condition determination processing 301, the evaluation function Fij (j: 1, …, n; n: number of tasks) for each task is compared with the evaluation function Fik for the currently executing task k. If there is a task j that satisfies Fij < Fik, the currently executing task k is terminated and task j is started.
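The condition determination processing 301 can be sketched as below. The text does not say which task is chosen when several tasks satisfy Fij < Fik; picking the task with the smallest Fij is an assumption of this sketch, as are the function and parameter names.

```python
def select_task(f_values, current_task):
    """Return the task this module should execute next.

    f_values maps task number j to the evaluation function Fij seen by this
    module; current_task is the task k being executed now.
    """
    # Assumption: among tasks with Fij < Fik, prefer the smallest margin.
    candidate = min(f_values, key=f_values.get)
    if f_values[candidate] < f_values[current_task]:
        return candidate
    return current_task
```

For example, `select_task({1: 0.2, 2: 0.5}, current_task=2)` switches to task 1, while equal margins leave the module on its current task.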

【００７３】タスクｋの終了、タスクｊ開始のタスミン
グを図８に示す。フィードバック制御用のコンピュータ
の場合には図８のように制御フレームごとに周期的に入
力データを読み込み、タスクを実行し、結果を出力す
る。いま、コンピュータモジュールｉでタスクｋを実行
しており、制御フレーム１においてタスクｊを実行して
いるコンピュータモジュールでの故障発生によりＦij＜
Ｆikとなったとする。コンピュータモジュールｉでは、
直ちにタスクｋを終了し、タスクｊの実行準備を開始す
る。タスクｊを開始するために前回の制御フレームまで
のデータ（履歴データ）が不要な場合には、制御フレー
ム２からタスクｊを開始することができる。また、タス
クｊを開始するために履歴データが必要な場合には、図
８のように制御フレーム２で履歴データを収集し、制御
フレーム３からタスクｊを開始する。この際、履歴デー
タは既にタスクｊを実行しているコンピュータモジュー
ルに通信路１を介して要求を出し、収集すればよい。FIG. 8 shows the timing of ending task k and starting task j. In a computer for feedback control, input data is read periodically in each control frame, the task is executed, and the result is output, as shown in FIG. 8. Suppose that computer module i is executing task k and that, in control frame 1, Fij < Fik comes to hold because of a failure in a computer module executing task j. Computer module i immediately ends task k and starts preparing to execute task j. If the data up to the previous control frame (history data) is not needed to start task j, task j can be started from control frame 2. If history data is needed to start task j, the history data is collected in control frame 2 and task j is started from control frame 3, as shown in FIG. 8. The history data can be collected by issuing a request via communication path 1 to a computer module that is already executing task j.
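The frame timing described for FIG. 8 reduces to a one-line rule, sketched here with hypothetical names: the new task starts one frame after the fault is detected, or two frames after it when one frame is spent collecting history data.

```python
def start_frame(fault_frame, needs_history):
    """Control frame in which task j can start after a switch decision.

    fault_frame: frame in which Fij < Fik was detected
    needs_history: True if task j needs data from earlier frames
    """
    frames_of_preparation = 2 if needs_history else 1
    return fault_frame + frames_of_preparation
```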

【００７４】〈ハンチング防止−不感帯の設定〉図９は
条件判定処理３０１での判断の際に不感帯δを設け、Ｆ
ij＜Ｆik−δを満たすタスクｊがあれば、現在実行して
いるタスクｋを終了し、タスクｊを開始するようにした
実施例である。本実施例は図７の動作を更に改善するも
のである。<Hunting Prevention - Setting of Dead Zone> FIG. 9 shows an embodiment in which a dead zone δ is provided in the judgment of the condition determination processing 301, so that the currently executing task k is terminated and task j is started only if there is a task j that satisfies Fij < Fik − δ. This embodiment further improves the operation of FIG. 7.

【００７５】図７の実施例において図１０に示すよう
に、 (1）フォールト発生によりＦij＜Ｆikとなり、タスクｋ
を実行しているコンピュータモジュールがタスクｊの実
行を時刻ｔ１に始めると、評価関数Ｆijが大きくなり、
評価関数Ｆikが小さくなる。In the embodiment of FIG. 7, as shown in FIG. 10: (1) When Fij < Fik comes to hold because of a fault and a computer module that was executing task k starts executing task j at time t1, the evaluation function Fij increases and the evaluation function Fik decreases.

【００７６】(2）ここでもし、Ｆij，Ｆikの大小が逆転
しＦij＞Ｆikとなるとタスクｊの実行を開始したコンピ
ュータモジュールが再びタスクｋを時刻ｔ２に開始す
る。(2) Here, if the magnitudes of Fij and Fik are reversed and Fij> Fik, the computer module which has started execution of task j starts task k again at time t2.

【００７７】上記(1），(2）を繰り返す結果、履歴デー
タ収集などのためシステムの動作効率が低下してしまう
可能性がある。As a result of repeating the above (1) and (2), there is a possibility that the operation efficiency of the system is reduced due to the collection of history data.

【００７８】そこで図９のように条件判断処理３０１で
の判断の際にタスク切り換え時のＦik，Ｆijの変化分よ
りも大きな不感帯δを設けヒステリシス特性を持たせれ
ば、実行タスク切り替えのハンチングが発生せずにシス
テムを図１１に示すように安定に動作させることができ
る。Therefore, as shown in FIG. 9, if a dead zone δ larger than the change in Fik and Fij caused by task switching is provided in the judgment of the condition determination processing 301 to give it a hysteresis characteristic, hunting in the switching of the executed task does not occur and the system can be operated stably, as shown in FIG. 11.

【００７９】なおＰε，Ｐεd，Ｐεaが既知であるの
で、Ｎ１，Ｎ２，Ｎ３が変化したときのＦijの変化分∂
Ｆij／∂Ｎ１，∂Ｆij／∂Ｎ２，∂Ｆij／∂Ｎ３を予め
知ることができる。そこで不感帯δの幅は、 max（∂Ｆij／∂Ｎ１，∂Ｆij／∂Ｎ２，∂Ｆij／∂Ｎ
３）よりも大きい値を設定すればよい。Since Pε, Pεd, and Pεa are known, the changes in Fij when N1, N2, and N3 change, namely ∂Fij/∂N1, ∂Fij/∂N2, and ∂Fij/∂N3, can be known in advance. The width of the dead zone δ may therefore be set to a value larger than max(∂Fij/∂N1, ∂Fij/∂N2, ∂Fij/∂N3).
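The dead zone can be sketched as a small change to the switching rule. The sketch assumes the logarithmic form of the evaluation function, in which changing N1, N2, or N3 by one changes Fij by the magnitude of K1, K2, or K3 respectively; the safety factor `margin` and all names are hypothetical.

```python
def dead_zone_width(k1, k2, k3, margin=1.1):
    """Choose delta larger than the largest single-step change of Fij.

    Assumption: in the log form, one step in N1, N2 or N3 changes Fij by
    |K1|, |K2| or |K3|. margin > 1 keeps delta strictly above that maximum.
    """
    return margin * max(abs(k1), abs(k2), abs(k3))


def select_task_with_dead_zone(f_values, current_task, delta):
    """Switch only when some task j satisfies Fij < Fik - delta."""
    candidate = min(f_values, key=f_values.get)
    if f_values[candidate] < f_values[current_task] - delta:
        return candidate
    return current_task
```

With `delta = 0.1`, margins of 0.45 and 0.5 no longer trigger a switch, whereas the rule without a dead zone would switch and could switch back in the next frame.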

【００８０】以上述べた図１から図１１に示す実施例に
より図１２に示すように、故障発生により、冗長システ
ムを構成するコンピュータモジュールの時間と共に失わ
れていっても、タスク１，…，タスクｎにコンピュータ
モジュールを割り当てて行き、各タスクに求められてい
る信頼度レベルに従ってタスク間の冗長度のバランスを
保つことができる。また、高い信頼度が要求される重要
度の高いタスクほど多くの冗長コンピュータモジュール
を割り当てるのでフォールト検出のカバレッジを高める
ことができる。According to the embodiments shown in FIGS. 1 to 11 described above, even as the computer modules constituting the redundant system are lost over time because of failures, computer modules continue to be assigned to task 1, …, task n, as shown in FIG. 12, and the balance of redundancy among the tasks can be maintained according to the reliability level required for each task. In addition, since more redundant computer modules are assigned to more important tasks requiring higher reliability, the coverage of fault detection can be increased.

【００８１】〈安定度の向上−時間平均化〉さらに図１
から図１２に示す実施例を図１３に示す実施例を加えれ
ば、システムの安定度を高めることが可能である。<Improvement of Stability - Time Averaging> Furthermore, adding the embodiment shown in FIG. 13 to the embodiments shown in FIGS. 1 to 12 makes it possible to increase the stability of the system.

【００８２】図１３は評価関数Ｆijを算出する際にＬrj
またはＰe を時間領域で平均化する実施例である。FIG. 13 shows an embodiment in which Lrj or Pe is averaged in the time domain when the evaluation function Fij is calculated.

【００８３】図１から図１２に示す実施例によれば、タ
スクｊを実行しているコンピュータモジュールの故障が
発生した場合、タスクｋを実行しているコンピュータモ
ジュールの中で最もＬthijが高いすなわちタスクｊに対
する責任度が最も高いコンピュータモジュールｉにおい
てＦij＜Ｆikが成り立つためにコンピュータモジュール
ｉはタスクｊの実行を始め、図１４のａに示すようにタ
スクｊの信頼度レベルを保つことができる。もしこの時
コンピュータモジュールｉでも故障が発生していた場合
にはタスクｊの実行を新たに始めるコンピュータモジュ
ールは存在しなくなり図１４のｂに示すようにタスクｊ
の信頼度レベルが低下したままとなる。つまり、コンピ
ュータモジュールｉ故障の発生によって冗長資源管理の
結果が影響を受けシステムの安定度が低下することにな
る。According to the embodiments shown in FIGS. 1 to 12, when a computer module executing task j fails, Fij < Fik holds in computer module i, which among the computer modules executing task k has the highest Lthij, that is, the highest responsibility for task j. Computer module i therefore starts executing task j, and the reliability level of task j can be maintained as shown at a in FIG. 14. If computer module i has also failed at that time, no computer module newly starts executing task j, and the reliability level of task j remains lowered as shown at b in FIG. 14. In other words, the failure of computer module i affects the result of the redundant resource management and lowers the stability of the system.

【００８４】そこで図１３に示すように評価関数Ｆijを
算出する際にＬrjまたはＰejを時間領域で平均化すれ
ば、図１５の実線で示すようにＦijは時間経過と共に徐
々に低下して行く。もしタスクｊに対する責任度が最も
高いコンピュータモジュールｉが存在しているのなら
ば、図１５のａで示すように時刻ｔ１においてコンピュ
ータモジュールｉがタスクｊの実行を開始しＦijの値が
回復する。もしコンピュータモジュールｉが生存してお
らず、次にタスクｊに対する責任度が高いコンピュータ
モジュールｉ′が生存しているのならば、図１５のｂに
示すように時刻ｔ２においてコンピュータモジュール
ｉ′がタスクｊの実行を開始しＦijの値が回復する。も
しコンピュータモジュールｉもコンピュータモジュール
ｉ′も生存しておらず、コンピュータモジュールｉ′に
続いてタスクｊに対する責任度が高いコンピュータモジ
ュールｉ″が生存しているのならば、図１５のｃで示す
ように時刻ｔ３においてコンピュータモジュールｉ″が
タスクｊの実行を開始しＦijの値が回復する。Therefore, if Lrj or Pej is averaged in the time domain when the evaluation function Fij is calculated, as shown in FIG. 13, Fij decreases gradually with time, as shown by the solid line in FIG. 15. If computer module i, which has the highest responsibility for task j, is alive, it starts executing task j at time t1 and the value of Fij recovers, as shown at a in FIG. 15. If computer module i is not alive but computer module i′, which has the next highest responsibility for task j, is alive, computer module i′ starts executing task j at time t2 and the value of Fij recovers, as shown at b in FIG. 15. If neither computer module i nor computer module i′ is alive but computer module i″, which has the next highest responsibility for task j after computer module i′, is alive, computer module i″ starts executing task j at time t3 and the value of Fij recovers, as shown at c in FIG. 15.

【００８５】ＬrjまたはＰe を時間領域で平均化する方
法としては(1）移動平均をとる方法、(2）Ｋ次遅れ系
（伝達関数Ｇ(ｓ)＝１／（１＋Ｔｓ）∧Ｋ）を用いる方
法などがある。Methods of averaging Lrj or Pe in the time domain include (1) taking a moving average and (2) using a K-th order lag system (transfer function G(s) = 1/(1 + Ts)^K).
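Both averaging methods can be sketched in discrete time. The moving average uses a fixed window; the lag filter is a backward-Euler discretization of the first-order case G(s) = 1/(1 + Ts), that is, K = 1. The class names, the sampling period `dt`, and the discretization method are assumptions of this sketch; a K-th order lag could be approximated by cascading K such stages.

```python
from collections import deque


class MovingAverage:
    """Method (1): average of the most recent `window` samples."""

    def __init__(self, window):
        self.samples = deque(maxlen=window)

    def update(self, value):
        self.samples.append(value)
        return sum(self.samples) / len(self.samples)


class FirstOrderLag:
    """Method (2) with K = 1, discretized by backward Euler.

    T*dy/dt + y = x  becomes  y[k] = y[k-1] + dt/(T + dt) * (x[k] - y[k-1]).
    """

    def __init__(self, time_constant, dt, initial):
        self.alpha = dt / (time_constant + dt)
        self.y = initial

    def update(self, value):
        self.y += self.alpha * (value - self.y)
        return self.y
```

When the input drops abruptly after a failure, either filter makes the averaged value fall gradually, which is what staggers the moments at which modules with different thresholds decide to take over the task.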

【００８６】本実施例によれば、タスクに対する責任度
の大きな特定のコンピュータモジュールの故障によって
冗長資源管理の結果が受ける影響を軽減することができ
るので、冗長資源管理方式自体のフォールトトレランス
（耐故障性）を高めることができる。According to this embodiment, the influence that the failure of a specific computer module with a high responsibility for a task has on the result of the redundant resource management can be reduced, so the fault tolerance of the redundant resource management method itself can be increased.

【００８７】〈通信量，計算量の削減〉図１６は本発明
実施によりコンピュータモジュール１０１−ｍ相互間の
通信量及び評価関数算出のための計算量の増加を緩和す
る実施例である。図１から図１５に示す実施例によれ
ば、それぞれのコンピュータモジュールは自コンピュー
タモジュールでのフォールト検出状況を他の全てのコン
ピュータモジュールに通知する（ブロードキャストす
る）ためにＮcom ＝ｍ（ｍ−１）回の通信が必要とな
り、通信量が著しく増加する。そこで、図１６では、通
常は同一タスクを実行しているコンピュータモジュール
のみ評価関数フォールト検出状況を通知し、評価関数Ｆ
ijが変化したときのみ、他の全てのコンピュータモジュ
ールに通知する。コンピュータモジュール１〜３がタス
ク１を実行しており、コンピュータモジュールｉがタス
ク２を実行している場合を考える。制御フレーム１で
は、コンピュータモジュール１〜３には異常が見られな
いため、コンピュータモジュール１〜３相互間でのみ６
回の通信が発生する。続いて制御フレーム２でコンピュ
ータモジュール３に異常が発生した場合を考える。第１
回目の通信はコンピュータモジュール１〜３相互間での
み行われ、通信により交換したフォールト検出情報に基
づき（この場合はコンピュータモジュール３がダウンし
て沈黙している）算出された評価関数Ｆijはコンピュー
タモジュール３の異常のために前回（制御フレーム１）
よりも低下している。従って制御フレーム２では引き続
き第２回目の通信が実施され、評価関数Ｆijが低下した
旨コンピュータモジュールｉに通知される。コンピュー
タモジュールｉでは、通知された情報をもとに自コンピ
ュータモジュールがタスク１の実行に参加すべきかどう
かを判断し、参加すべき時にはタスク２の実行を中止し
てタスク１の実行を開始する。<Reduction of Communication Volume and Calculation Volume> FIG. 16 shows an embodiment that mitigates the increase, caused by implementing the present invention, in the communication volume among the computer modules 101 to 10m and in the amount of calculation needed to compute the evaluation function. According to the embodiments shown in FIGS. 1 to 15, each computer module notifies (broadcasts) the fault detection status of its own computer module to all other computer modules, so Ncom = m(m−1) communications are required and the communication volume increases significantly. In FIG. 16, therefore, the fault detection status is normally notified only among computer modules executing the same task, and all other computer modules are notified only when the evaluation function Fij changes. Consider a case where computer modules 1 to 3 are executing task 1 and computer module i is executing task 2. In control frame 1, no abnormality is found in computer modules 1 to 3, so only six communications occur, all among computer modules 1 to 3. Next, consider a case where an abnormality occurs in computer module 3 in control frame 2. The first round of communication takes place only among computer modules 1 to 3, and the evaluation function Fij calculated from the fault detection information exchanged in that round (in this case, computer module 3 is down and silent) is lower than the previous value (control frame 1) because of the abnormality in computer module 3. In control frame 2, therefore, a second round of communication follows, and computer module i is notified that the evaluation function Fij has decreased. Based on the notified information, computer module i determines whether it should take part in executing task 1, and if so, it stops executing task 2 and starts executing task 1.

【００８８】本実施例によれば、コンピュータモジュー
ル相互間の通信は、According to this embodiment, the communication between the computer modules is

【００８９】[0089]

【数１０】 (Equation 10)

【００９０】ただし、Ｎ1j：タスクｊを実行しているコ
ンピュータモジュールの数となる。ここで、Here, N1j is the number of computer modules executing the task j. here,

【００９１】[0091]

【数１１】 [Equation 11]

【００９２】であるから、本実施例により通信の回数は
Ｎcom′≒Ｎcom／ｎとほぼ１／ｎとなる。Therefore, according to this embodiment, the number of communications becomes Ncom′ ≈ Ncom/n, that is, approximately 1/n of the original.
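The reduction can be illustrated by counting messages. The full-broadcast count m(m−1) is taken from the text. The per-group count, a sum of N1j(N1j−1) over tasks, is an assumption of this sketch chosen to agree with the example of six communications among three modules; the function names are hypothetical.

```python
def full_broadcast_count(m):
    """Ncom = m(m-1): every module notifies every other module."""
    return m * (m - 1)


def group_only_count(group_sizes):
    """Assumed Ncom': each module notifies only the modules on its own task.

    group_sizes lists N1j, the number of modules executing each task j.
    """
    return sum(size * (size - 1) for size in group_sizes)
```

For 12 modules split evenly over 4 tasks this gives 24 messages instead of 132, which is below Ncom/n = 33 and approaches Ncom/n as the groups grow.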

【００９３】図１７は、図１６の実施例のための全ての
コンピュータモジュールにブロードキャストするかどう
かの判断を示すフローチャートである。まず、同一タス
クを実行しているコンピュータモジュール同士でフォー
ルト検出情報を交換(３０２)し、それに基づいた評価関
数Ｆijを算出（３００′）する。なお、評価関数Ｆijの
算出処理３００′は同一タスクを実行しているコンピュ
ータモジュールのみの評価関数Ｆijを算出している点
で、図７，図９，図１８に示す（全てのコンピュータモ
ジュールの）評価関数Ｆijの算出処理３００と異なる。
従って、評価関数Ｆijの算出処理３００ではｍ回のＦij
の計算が必要であるのにたいして、評価関数Ｆijの算出
処理３００′では同一タスクを実行しているコンピュー
タモジュールの個数だけ（Ｏ（ｍ／ｎ））のＦijを計算
すればよいので、計算量もほぼ１／ｎとすることができ
る。評価関数Ｆijの算出３００′の後、現在のＦijと前
回の値Ｆijold とを比較し（３０３）、不一致の場合に
はフォールト情報を全てのコンピュータモジュールにブ
ロードキャスト（３０４）する。最後に現在の評価関数
の値Ｆijを次回に備えて変数Ｆijoldに格納（３０５）
する。FIG. 17 is a flowchart showing how the embodiment of FIG. 16 decides whether to broadcast to all computer modules. First, fault detection information is exchanged among the computer modules executing the same task (302), and the evaluation function Fij is calculated from it (300′). The calculation processing 300′ differs from the calculation processing 300 of the evaluation function Fij shown in FIGS. 7, 9, and 18 (which covers all computer modules) in that it calculates Fij only for the computer modules executing the same task. Whereas the calculation processing 300 requires m calculations of Fij, the calculation processing 300′ needs to calculate Fij only for the number of computer modules executing the same task (O(m/n)), so the amount of calculation can also be reduced to approximately 1/n. After the calculation 300′ of the evaluation function Fij, the current Fij is compared with the previous value Fijold (303), and if they differ, the fault information is broadcast to all computer modules (304). Finally, the current value Fij of the evaluation function is stored in the variable Fijold for the next cycle (305).
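Steps 303 to 305 of FIG. 17 amount to a change detector on Fij, sketched below. The class and method names are hypothetical; the exchange of fault information (302) and the calculation of Fij (300′) are assumed to happen before `step` is called, and the actual broadcast (304) is left to the caller.

```python
class ChangeNotifier:
    """Decide per control frame whether a system-wide broadcast is needed."""

    def __init__(self, initial_f):
        # Fijold: the value of Fij remembered from the previous frame.
        self.f_old = initial_f

    def step(self, f_current):
        """Return True when the fault information should be broadcast."""
        changed = f_current != self.f_old  # comparison 303
        self.f_old = f_current             # store for the next frame, 305
        return changed                     # True corresponds to broadcast 304
```

A module that receives no system-wide broadcast in a frame can skip the full judgment of FIG. 7 or FIG. 9 for that frame, which is where the saving in calculation comes from.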

【００９４】一方、ブロードキャストを受ける側のコン
ピュータモジュールでは、図１８に示すように全域ブロ
ードキャストがあったかどうか判定（３０６）し、全域
ブロードキャストがあった場合に限り図７，図９に示す
判断に進む。On the other hand, the computer module on the broadcast receiving side determines whether or not there has been an entire area broadcast as shown in FIG. 18 (306), and proceeds to the determination shown in FIGS. 7 and 9 only when there is an all area broadcast.

【００９５】〈適応制御システムへの応用〉図１９は本
発明を適応制御システムに適用した実施例である。制御
対象２０の物理量をセンサ２で測定し状態観測器１６で
制御対象の状態を観測（推定）し、観測された状態をも
とに制御に適切な特性を持つレギュレータ１７，アクチ
ュエータ７を介して制御対象２０へフィードバックを加
える。以上は現代制御理論に基づき状態フィードバック
を行う制御システムの典型的な構成である。<Application to Adaptive Control System> FIG. 19 shows an embodiment in which the present invention is applied to an adaptive control system. The physical quantities of the control target 20 are measured by the sensor 2, the state of the control target is observed (estimated) by the state observer 16, and, based on the observed state, feedback is applied to the control target 20 via the regulator 17, which has characteristics suitable for control, and the actuator 7. This is a typical configuration of a control system that performs state feedback based on modern control theory.

【００９６】さらにセンサ２の出力信号とアクチュエー
タ７への入力信号とから、制御対象特性同定部１８でセ
ンサ２，アクチュエータ７を含めた制御対象２０の特性
を同定し、最適レギュレータ設計部１９では、制御対象
特性の同定結果から制御に最適なレギュレータのパラメ
ータを算出し、レギュレータ１７のパラメータを最適値
に設定する。以上のような適応制御システムにより制御
特性が向上し、特に航空機，宇宙往還機（いわゆるスペ
ースシャトル）などの空気力学特性の非線形性により、
線形近似した制御システムにおいて高度，速度により制
御対象の特性が見かけ上変化するような制御対象の制御
に特に最適であることが知られている。さらに制御対象
２０，センサ２，アクチュエータ７に故障が発生した場
合でも制御システムは制御対象の特性変化として捉え、
その都度最適なパラメータをレギュレータ１７に設定す
るので、制御対象の故障による特性劣化を補うことがで
きる。通常信頼性の要求されている制御システムではア
クチュエータも冗長化していることが多い。例えば航空
機などでは、昇降舵，方向舵などの制御舵面（Control
Surface)や推力発生装置は冗長化し、一部が故障した場
合でも飛行には支障のないように設計されている。しか
しこれらの冗長化したアクチュエータの一部が故障した
場合には、等価的にアクチュエータのゲインが低下した
ことになり、システム全体の制御特性が悪化する。また
場合によっては、制御の対象となる値の間に干渉が起こ
り特に人間の運転操作を介しての制御が著しく困難とな
る。そこで、本実施例による適応制御装置はアクチュエ
ータのゲイン低下を特性同定部１８で検出し、最適レギ
ュレータ設計部１９ではそれに最適なレギュレータ１７
のパラメータを決定すれば、制御特性の悪化を補うこと
ができる。Further, the control target characteristic identification unit 18 identifies the characteristics of the control target 20, including the sensor 2 and the actuator 7, from the output signal of the sensor 2 and the input signal to the actuator 7. The optimal regulator design unit 19 calculates the regulator parameters best suited to control from the identification result and sets the parameters of the regulator 17 to the optimal values. Such an adaptive control system improves the control characteristics. It is known to be particularly well suited to control targets such as aircraft and space planes (so-called space shuttles), where the nonlinearity of the aerodynamic characteristics makes the characteristics of the control target appear to change with altitude and speed in a linearly approximated control system. Furthermore, even if a failure occurs in the control target 20, the sensor 2, or the actuator 7, the control system treats it as a change in the characteristics of the control target and sets the optimal parameters in the regulator 17 each time, so characteristic degradation caused by a failure of the control target can be compensated for. In control systems that require reliability, the actuators are also often made redundant. In aircraft, for example, control surfaces such as elevators and rudders and the thrust generators are made redundant and designed so that flight is not hindered even if some of them fail. However, when some of these redundant actuators fail, the actuator gain is equivalently reduced and the control characteristics of the whole system deteriorate. In some cases, interference arises between the controlled values, and control, particularly through human operation, becomes extremely difficult. In the adaptive control device according to this embodiment, the characteristic identification unit 18 detects the decrease in actuator gain and the optimal regulator design unit 19 determines the parameters of the regulator 17 best suited to it, so the deterioration of the control characteristics can be compensated for.

【００９７】本実施例では、適応制御への応用にあた
り、状態観測器１６とレギュレータ１７をタスク１（ま
たはタスクグループ１）で実現し、制御対象特性同定部
１８と最適レギュレータ設計部１９をタスク２（または
タスクグループ２）で実現し、Ｌth11＞Ｌth21＞Ｌth31＞Ｌth41＞Ｌth51かつ、Ｌth12
＜Ｌth22＜Ｌth32＜Ｌth42＜Ｌth52かつ、Ｌth11＞Ｌth
52かつ、Ｌth21＞Ｌth42かつ、Ｌth31＞Ｌth32かつ、Ｌ
th41＞Ｌth22かつ、Ｌth51＞Ｌth11 と設定し、タスク２（またはタスクグループ２）を実行
するコンピュータモジュールが存在しない場合には予め
用意した数表によりレギュレータ１７のパラメータを設
定するようにしたものである。本実施例による冗長資源
管理の様子を図２０に示す。まず、正常なコンピュータ
モジュールの数が５個の場合には３個のコンピュータモ
ジュールがタスク１（またはタスクグループ１）に、２
個のコンピュータモジュールがタスク２（またはタスク
グループ２）に割り当てられる。１個のコンピュータモ
ジュールで故障が発生し、正常なコンピュータモジュー
ルの数が４個となった場合には２個のコンピュータモジ
ュールがタスク１（またはタスクグループ１）に、２個
のコンピュータモジュールがタスク２（またはタスクグ
ループ２）に割り当てられる。２個のコンピュータモジ
ュールで故障が発生し、正常なコンピュータモジュール
の数が３個となった場合には２個のコンピュータモジュ
ールがタスク１（またはタスクグループ１）に、１個の
コンピュータモジュールがタスク２（またはタスクグル
ープ２）に割り当てられる。３個のコンピュータモジュ
ールで故障が発生し、正常なコンピュータモジュールの
数が４個となった場合には２個のコンピュータモジュー
ルがタスク１（またはタスクグループ１）に割り当てら
れ、タスク２（またはタスクグループ２）にはコンピュ
ータモジュールは割り当てられない、代わりに予め用意
された数表によりレギュレータ１７のパラメータを設定
して制御を続行する。In this embodiment, in applying the invention to adaptive control, the state observer 16 and the regulator 17 are implemented as task 1 (or task group 1), and the control target characteristic identification unit 18 and the optimal regulator design unit 19 are implemented as task 2 (or task group 2). The thresholds are set so that Lth11 > Lth21 > Lth31 > Lth41 > Lth51, Lth12 < Lth22 < Lth32 < Lth42 < Lth52, Lth11 > Lth52, Lth21 > Lth42, Lth31 > Lth32, Lth41 > Lth22, and Lth51 > Lth12, and when no computer module executes task 2 (or task group 2), the parameters of the regulator 17 are set from a numerical table prepared in advance. FIG. 20 shows the redundant resource management according to this embodiment. When five computer modules are normal, three are assigned to task 1 (or task group 1) and two to task 2 (or task group 2). When one computer module has failed and four remain normal, two are assigned to task 1 (or task group 1) and two to task 2 (or task group 2). When two computer modules have failed and three remain normal, two are assigned to task 1 (or task group 1) and one to task 2 (or task group 2). When three computer modules have failed and two remain normal, both are assigned to task 1 (or task group 1) and none to task 2 (or task group 2); instead, the parameters of the regulator 17 are set from the numerical table prepared in advance and control continues.

【００９８】以上本実施例によれば、コンピュータモジ
ュールでの故障だけでなく制御対象の故障をも許容する
制御システムを構成することができ、制御システム全体
の信頼度を向上させることができる。As described above, according to the present embodiment, a control system that allows not only a failure in the computer module but also a failure in the control target can be configured, and the reliability of the entire control system can be improved.

【００９９】図２１，図２２，図２３は出力選択，多数
決機能を持つサーボモータ系の実施例である。このサー
ボモータ系は図１の出力選択回路５１〜５ｌと出力装置
７１〜７ｌの機能を合わせ持っている。本実施例のサー
ボモータは図２１に示すように単一のシャフト７０１に
複数の電機子巻線７０４１〜７０４ｌを設け、電機子巻
線に対応した界磁巻線７０３１〜７０３ｌを電機子巻線
に対向させてハウジング７０２内に設けたものである。
なお、図２１中のＡ−Ａ′面の断面を図２２に示す。こ
のサーボモータの出力トルクは次式で与えられる。FIGS. 21, 22, and 23 show an embodiment of a servo motor system having output selection and majority decision functions. This servo motor system combines the functions of the output selection circuits 51 to 5l and the output devices 71 to 7l in FIG. 1. As shown in FIG. 21, the servo motor of this embodiment has a plurality of armature windings 7041 to 704l on a single shaft 701, and field windings 7031 to 703l corresponding to the armature windings are provided in the housing 702 facing the armature windings. FIG. 22 shows the cross section taken along plane A-A′ in FIG. 21. The output torque of this servo motor is given by the following equation.

【０１００】[0100]

【数１２】 (Equation 12)

【０１０１】ただしＩfi：界磁巻線７０３ｉの電流Ｉai：電機子巻線７０４ｉの電流Ｋ：比例係数ここで、全てのＩfiを一定とすれば、where Ifi is the current of the field winding 703i, Iai is the current of the armature winding 704i, and K is a proportionality coefficient. Here, if all Ifi are constant,

【０１０２】[0102]

【数１３】 (Equation 13)

【０１０３】ただしＫ′：比例係数（Ｋ・Ｉfi）となりＩriを入力すれば多数決に準じた動作（以下疑似
多数決と呼ぶ）をさせることができる。また、Ｉfiを各
入力Ｉaiの信頼度に比例した値とすれば（８）式に示す
ように重み付きの疑似多数決を実施できる。図２３は図
２１，図２２の疑似多数決機能を持つサーボモータを用
いて重み付きの疑似多数決を実施するための回路であ
る。この図では図１の出力選択回路５１と出力装置７１
の機能を担う回路を示しているが、他の出力選択回路５
２〜５ｌと出力装置７２〜７ｌについても同様である。
コンピュータモジュール１０１〜１０ｍからの出力３１
−１〜３ｍ−１，選択制御信号４１−１〜４ｍ−１に比
例した電流をサーボアンプを介して電機子巻線７０４１
〜７０４ｍ，界磁巻線７０３１〜７０３ｍにそれぞれ供
給する。以上により、選択制御信号４１−１〜４ｍ−１
により正常とみなされたコンピュータモジュール１０１
〜１０ｍからの出力３１−１〜３ｍ−１の多数決を実現
できる。さらに、サーボアンプ、電機子巻線７０４１〜
７０４ｍ、界磁巻線７０３１〜７０３ｍを多重化すれば
サーボアンプの故障や巻線の短絡，断線による障害を防
げ、サーボモータ系の信頼度を高めることが可能であ
る。where K′ is a proportionality coefficient (K·Ifi). By inputting Iri, an operation equivalent to a majority decision (hereinafter called a pseudo majority decision) can be performed. If Ifi is set to a value proportional to the reliability of each input Iai, a weighted pseudo majority decision can be performed as shown in equation (8). FIG. 23 shows a circuit for performing a weighted pseudo majority decision using the servo motor with the pseudo majority decision function of FIGS. 21 and 22. The figure shows the circuit that serves as the output selection circuit 51 and the output device 71 of FIG. 1, but the same applies to the other output selection circuits 52 to 5l and output devices 72 to 7l. Currents proportional to the outputs 31-1 to 3m-1 from the computer modules 101 to 10m and to the selection control signals 41-1 to 4m-1 are supplied via servo amplifiers to the armature windings 7041 to 704m and the field windings 7031 to 703m, respectively. This realizes a majority decision over the outputs 31-1 to 3m-1 of the computer modules 101 to 10m that the selection control signals 41-1 to 4m-1 regard as normal. Furthermore, multiplexing the servo amplifiers, the armature windings 7041 to 704m, and the field windings 7031 to 703m prevents failures caused by servo amplifier faults or by short circuits or breaks in the windings, and increases the reliability of the servo motor system.
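The pseudo majority decision can be illustrated numerically. The sketch assumes that the output torque is proportional to the sum over winding pairs of field current times armature current, which is consistent with the definitions of Ifi, Iai, and K given above; the function name and the sample currents are hypothetical.

```python
def pseudo_majority_torque(armature_currents, field_currents, k=1.0):
    """Assumed torque model: K times the sum of Ifi * Iai over winding pairs.

    armature_currents: currents proportional to the module outputs
    field_currents: currents proportional to the selection control signals
    """
    return k * sum(f * a for f, a in zip(field_currents, armature_currents))
```

With three modules commanding +1, +1, and −1 and all selection signals ON, the net torque follows the two agreeing modules. Setting the field current of the dissenting module to zero removes its contribution entirely, and intermediate field currents weight each module by its reliability.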

【０１０４】またここで、選択制御信号４１−１〜４ｍ
−１はＯＮ／ＯＦＦの二値だけでなく、出力を出してい
る各コンピュータモジュールの信頼度に対応した多値と
すれば、重み付きの疑似多数決を実現できる。上記のサ
ーボモータ系を適用したシステム構成を図２４に示す。
先に述べたように図１のシステムで出力選択回路５２〜
５ｌと出力装置７２〜７ｌをそれぞれサーボモータ系７
００に置き換えればよい。以上述べた本実施例によれ
ば、サーボモータ系により図１の出力選択回路５１〜５
ｌと出力装置７１〜７ｌの機能を実現できるので、シス
テム全体の構成を簡略化でき、小型化，部品点数の削減
による高信頼化が可能である。なお、（８）式から明ら
かなようにＩfi，Ｉaiの間では交換法則が成り立つの
で、コンピュータモジュール１０１〜１０ｍからの出力
３１−１〜３ｍ−１に対応した電流を界磁巻線７０３１
〜７０３ｍに、選択制御信号４１−１〜４ｍ−１に対応
した電流を電機子巻線７０４１〜７０４ｍにそれぞれ流
しても同じ効果が得られる。Here, if the selection control signals 41-1 to 4m-1 take not only the two values ON and OFF but multiple values corresponding to the reliability of each computer module producing an output, a weighted pseudo majority decision can be realized. FIG. 24 shows a system configuration to which the above servo motor system is applied. As described above, the output selection circuits 52 to 5l and the output devices 72 to 7l in the system of FIG. 1 are each replaced with a servo motor system 700. According to this embodiment, the servo motor system realizes the functions of the output selection circuits 51 to 5l and the output devices 71 to 7l of FIG. 1, so the configuration of the whole system can be simplified, and higher reliability can be achieved through smaller size and fewer parts. As is clear from equation (8), the commutative law holds between Ifi and Iai, so the same effect is obtained by supplying currents corresponding to the outputs 31-1 to 3m-1 from the computer modules 101 to 10m to the field windings 7031 to 703m and currents corresponding to the selection control signals 41-1 to 4m-1 to the armature windings 7041 to 704m.

【０１０５】[0105]

【発明の効果】本発明によれば、各タスクに要求される
信頼度レベルに応じて適切な数の冗長資源を割り当てる
ことができるので、冗長資源の処理性能向上，信頼度向
上が可能となる。According to the present invention, an appropriate number of redundant resources can be allocated according to the reliability level required for each task, so the processing performance and reliability obtained from the redundant resources can be improved.

【０１０６】さらに本発明を適応制御システムに応用す
ることによりコンピュータモジュールでの故障だけでな
く制御対象の故障をも許容する制御システムを構成で
き、制御システム全体の信頼度を向上させることができ
る。Further, by applying the present invention to an adaptive control system, a control system that allows not only a failure in a computer module but also a failure in a control target can be configured, and the reliability of the entire control system can be improved.

[Brief description of the drawings]

FIG. 1 is a configuration diagram of the fault-tolerant system of the present invention.

FIG. 2 is a configuration diagram of a computer module.

FIG. 3 is another configuration diagram of a computer module.

FIG. 4 is a conceptual diagram of an embodiment of the present invention.

FIG. 5 is a conceptual diagram of an embodiment of the present invention.

FIG. 6 is a conceptual diagram of an embodiment of the present invention.

FIG. 7 is a flowchart of condition determination.

FIG. 8 is a diagram showing task switching timing.

FIG. 9 is a flowchart of condition determination (with a dead zone).

FIG. 10 is a diagram showing the change in Fij (without a dead zone).

FIG. 11 is a diagram showing the change in Fij (with a dead zone).

FIG. 12 is a diagram showing the computer module allocation status.

FIG. 13 is a diagram showing the averaging of Lrj.

FIG. 14 is a diagram showing the change in Fij (without averaging of Lrj).

FIG. 15 is a diagram showing the change in Fij (with averaging of Lrj).

FIG. 16 is a diagram explaining the reduction of the inter-module communication volume.

FIG. 17 is a flowchart of condition determination for wide-area broadcast.

FIG. 18 is a flowchart of condition determination for wide-area broadcast.

FIG. 19 is a system configuration diagram for the application to an adaptive control system.

FIG. 20 is a diagram showing the computer module allocation status.

FIG. 21 is a sectional view of the servo motor.

FIG. 22 is a sectional view of the servo motor (taken along plane A-A′ of FIG. 21).

FIG. 23 is a block diagram of the servo motor system.

FIG. 24 is a configuration diagram of a system to which the servo motor system is applied.

[Explanation of symbols]

101 to 10m: computer modules
111 to 11n: tasks
21 to 2h: input devices
31-1 to 3m-1, ..., 31-l to 3m-l: outputs
41-1 to 4m-1, ..., 41-l to 4m-l: selection control signals
51 to 5l: output selection circuits
71 to 7l: output devices
16: state observer
17: regulator
18: controlled-object characteristic identification unit
19: optimal regulator design unit
20: controlled object

Claims

(57) [Claims]

1. A method of managing redundant resources in a fault-tolerant computer system that comprises a plurality of computer modules, executes a plurality of tasks on the computer modules, and executes each task redundantly on a plurality of the computer modules, wherein the number of computer modules that redundantly execute each task is changed according to the number of normal computer modules and the importance of each task; an evaluation function is calculated for each task based on the fault detection status in the computer modules redundantly executing that task; and, if there is a first task whose evaluation function value has decreased, a computer module executing a second task whose evaluation function value is larger is caused to execute the first task.

2. The method of managing redundant resources according to claim 1, wherein every computer module calculates the evaluation function for each task, and, if there is a first task whose evaluation function value has decreased and the evaluation function value of the second task being executed on the own computer module is larger than that of the first task, the computer module stops executing the second task on its own judgment alone and executes the first task.

3. A method of managing redundant resources in a fault-tolerant computer system that comprises a plurality of computer modules, executes a plurality of tasks on the computer modules, and executes each task redundantly on a plurality of the computer modules, wherein the number of computer modules that redundantly execute each task is changed according to the number of normal computer modules and the importance of each task; each computer module notifies the other computer modules of the number of the task it is executing and of its fault occurrence information; each computer module estimates the reliability of each task based on the fault occurrence information notified from the other computer modules; each computer module determines which task's redundant configuration it should join; and, when the task to be joined differs from the task currently being executed, the computer module switches the task it executes to the task to be joined.

4. The method of managing redundant resources according to claim 1 or 2, wherein the evaluation function for computer module i (i = 1, ..., N; N: number of computer modules) is Fij defined by the following equation, and the task j that minimizes the evaluation function Fij is determined as the task to be executed:
Fij = Lrj − Lthij
where Lthij: threshold value of the reliability level of task j in computer module i; Lrj: reliability level of task j; i: number of the own computer module; j: task number.

5. The method of managing redundant resources according to claim 1 or 2, wherein the evaluation function for computer module i (i = 1, ..., N; N: number of computer modules) is Fij defined by the following equation, and the task j that minimizes the evaluation function Fij is determined as the task to be executed:
Fij = Lrj / Lthij
where Lthij: threshold value of the reliability level of task j in computer module i; Lrj: reliability level of task j; i: number of the own computer module; j: task number.

6. The method of managing redundant resources according to claim 1 or 2, wherein the evaluation function for computer module i (i = 1, ..., N; N: number of computer modules) is Fij defined by the following equation, and the task j that minimizes the evaluation function Fij is determined as the task to be executed:
Fij = log {(1 − Lthij) / Pej}
where Lthij: threshold value of the reliability level of task j in computer module i; Pej: probability that the result of task j is incorrect; i: number of the own computer module; j: task number.
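
As an illustration of the selection rule in claims 4 to 6, the following Python sketch computes the three forms of Fij and picks the task with the smallest value. The function names, task labels, and numeric levels are hypothetical and chosen only for this example; the claims do not specify how Lrj or Pej are obtained.

```python
import math

def f_difference(l_r, l_th):
    """Claim 4 form: Fij = Lrj - Lthij."""
    return l_r - l_th

def f_ratio(l_r, l_th):
    """Claim 5 form: Fij = Lrj / Lthij."""
    return l_r / l_th

def f_log(p_e, l_th):
    """Claim 6 form: Fij = log((1 - Lthij) / Pej)."""
    return math.log((1.0 - l_th) / p_e)

def select_task(scores):
    """Return the task label j whose evaluation value Fij is smallest."""
    return min(scores, key=scores.get)

# Hypothetical reliability levels Lrj and thresholds Lthij seen by one module i.
l_r = {"A": 0.99, "B": 0.90, "C": 0.95}
l_th = {"A": 0.95, "B": 0.95, "C": 0.90}

diff_scores = {j: f_difference(l_r[j], l_th[j]) for j in l_r}
ratio_scores = {j: f_ratio(l_r[j], l_th[j]) for j in l_r}

chosen_by_difference = select_task(diff_scores)
chosen_by_ratio = select_task(ratio_scores)
```

With these example numbers, task B is the only one whose reliability level lies below its threshold, so both the difference form and the ratio form select it.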

7. The method of managing redundant resources according to any one of claims 4 to 6, wherein Lthij is predetermined for each computer module and each task, and Lrj is determined based on the fault occurrence information.

8. The method of managing redundant resources according to any one of claims 4 to 6, wherein, in determining the task j that minimizes the evaluation function Fij, task j is determined as the task to be joined if the following relation is satisfied:
Fij < Fik − δ
where k: number of the task currently being executed; δ: width of the dead zone.
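
The dead-zone condition of claim 8 can be sketched as follows. This is a minimal Python illustration under stated assumptions: the function name `choose_with_dead_zone`, the dictionary of scores, and the example values are hypothetical and not taken from the patent.

```python
def choose_with_dead_zone(scores, current, delta):
    """Return the task the module should run next.

    scores  : mapping from task label j to the evaluation value Fij
    current : label k of the task currently being executed
    delta   : width of the dead zone (delta >= 0)

    The module switches only when the best candidate satisfies
    Fij < Fik - delta; otherwise it keeps the current task, which
    suppresses switching back and forth on small fluctuations.
    """
    best = min(scores, key=scores.get)
    if best != current and scores[best] < scores[current] - delta:
        return best
    return current

# Hypothetical evaluation values for two tasks.
example_scores = {"A": 0.04, "B": 0.01}
stay = choose_with_dead_zone(example_scores, current="A", delta=0.05)
switch = choose_with_dead_zone(example_scores, current="A", delta=0.02)
```

In this example the gap between the two tasks is 0.03, so a dead zone of 0.05 keeps the module on task A while a dead zone of 0.02 lets it move to task B.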

9. The method of managing redundant resources according to any one of claims 4 to 6, wherein, when a fault occurs, Lrj decreases gradually over time.

10. The method of managing redundant resources according to any one of claims 4 to 6, wherein the value of Lrj is formed as a moving average of the reliability level of task j per unit time.

11. The method of managing redundant resources according to claim 9, wherein the value of Lrj is averaged by a K-th order lag system (K: an integer of 1 or more; transfer function G(s) = 1 / (1 + Ts)^K; T: time constant).
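
One way to picture the averaging in claim 11 is a discrete-time approximation of the K-th order lag as a cascade of K first-order lags. The Python sketch below makes several assumptions that are not in the patent: the backward-Euler discretization, the sampling interval `dt`, the function name `make_lag_filter`, and the example values.

```python
def make_lag_filter(order, time_constant, dt, initial):
    """Build a step function approximating G(s) = 1 / (1 + T s)^K.

    order         : K, number of cascaded first-order stages (>= 1)
    time_constant : T of each stage
    dt            : sampling interval (assumed; one control frame)
    initial       : initial value of every stage, e.g. the fault-free Lrj
    """
    alpha = dt / (time_constant + dt)  # backward-Euler gain of one stage
    states = [initial] * order

    def step(x):
        # Feed the raw level through each first-order stage in turn.
        for k in range(order):
            states[k] += alpha * (x - states[k])
            x = states[k]
        return x

    return step

# Hypothetical scenario: the raw reliability level drops from 1.0 to 0.0
# when a fault is detected and stays there.
smooth = make_lag_filter(order=2, time_constant=1.0, dt=0.1, initial=1.0)
outputs = [smooth(0.0) for _ in range(2000)]
```

The smoothed value leaves 1.0 gradually instead of jumping, which matches the behavior described in claim 9 of Lrj decreasing over time after a fault.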

12. The method of managing redundant resources according to claim 9, wherein, when the evaluation function Fij has not changed from the previous control frame, the fault occurrence information is notified only to the computer modules executing the same task as the own computer module, and, when the evaluation function Fij has changed from the previous control frame, the fault occurrence information is notified to all computer modules.
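
The notification rule of claim 12 can be sketched in Python as below. The function name `notification_targets`, the module identifiers, and the use of exact equality to detect "no change" in Fij are assumptions made for this illustration; a real system might compare against a tolerance instead.

```python
def notification_targets(own_id, own_task, task_of, f_now, f_prev):
    """Return the module ids that should receive fault occurrence information.

    own_id   : identifier of the own computer module
    own_task : task currently executed by the own module
    task_of  : mapping from module id to the task that module executes
    f_now    : evaluation value Fij in the current control frame
    f_prev   : evaluation value Fij in the previous control frame
    """
    others = [m for m in task_of if m != own_id]
    if f_now == f_prev:
        # Unchanged: notify only modules running the same task.
        return [m for m in others if task_of[m] == own_task]
    # Changed: broadcast to every other module.
    return others

# Hypothetical allocation: modules 1 and 2 run task A, module 3 runs task B.
allocation = {1: "A", 2: "A", 3: "B"}
local_only = notification_targets(1, "A", allocation, f_now=0.04, f_prev=0.04)
broadcast = notification_targets(1, "A", allocation, f_now=0.01, f_prev=0.04)
```

Restricting notifications to the modules sharing a task while Fij is stable is what reduces the inter-module communication volume referred to in FIG. 16.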