JP2001175627A

JP2001175627A - Decentralized processing computer system

Info

Publication number: JP2001175627A
Application number: JP36266399A
Authority: JP
Inventors: Toshiaki Miyamoto; 本敏章宮
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1999-12-21
Filing date: 1999-12-21
Publication date: 2001-06-29

Abstract

PROBLEM TO BE SOLVED: To provide a decentralized processing computer system which can properly and easily cope with the stagnation of one computer in such a case and bring out the maximum processing capability of the whole system. SOLUTION: If, for example, a decentralized process computer PC4 has stagnation while the system is in operation, a decentralized process execution means 3 of a centralized management computer PC1 selects a decentralized process computer PC3 as the best computer which processes a task HHH of the decentralized process computer PC3 instead by referring pieces of information in a management information storage part 2. Then the decentralized processing execution means 3 transmits task program data 8 of the task HHH from the management information storage part 2 to the decentralized processing computer PC3.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、所謂集中管理型の
分散処理計算機システムに関するものである。The present invention relates to a so-called centralized management type distributed processing computer system.

【０００２】[0002]

【従来の技術】ネットワーク上に構築された計算機シス
テムでは、システムのトータルスループットを向上させ
るために、複数のタスクを複数台の計算機で分散処理す
る形態がしばしば採用される。このような分散処理計算
機システムでは、ある計算機の負荷が大きくなり、その
計算機の処理に停滞が発生した場合には、その計算機の
タスクの一部を停滞が発生していない他の計算機に振り
替えることにより、システム全体の円滑な稼働を確保す
るようにしている。この場合、停滞が発生した計算機の
タスクをどの計算機が肩代わりするかについては予め設
定されていたり、所定プログラムに基づいて決定される
ようになっているが、オペレータの介入によって決定す
ることもできる。2. Description of the Related Art In a computer system constructed on a network, a form in which a plurality of tasks are distributed and processed by a plurality of computers is often adopted in order to improve the total throughput of the system. In such a distributed processing computer system, when a load on a computer increases and stagnation occurs in the processing of the computer, a part of the task of the computer is transferred to another computer without stagnation. This ensures smooth operation of the entire system. In this case, although which computer takes over the task of the computer in which the stagnation has occurred is set in advance or determined based on a predetermined program, it can be determined by operator intervention.

【０００３】[0003]

【発明が解決しようとする課題】しかし、上記した従来
の技術では、肩代わりする計算機の決定が固定した情報
に基づいて行われたり、あるいはオペレータの経験又は
勘を頼りに行われていたために、システム全体の処理能
力を最大限に引き出すことが困難であった。そのため、
停滞が発生した計算機のタスクの一部を他の計算機が肩
代わりしたとしても、肩代わりした計算機の能力が不足
している場合や負荷が大きすぎる場合には、この肩代わ
りした計算機にも停滞が発生する事態となりシステム全
体の円滑な稼働を確保することができないことがあっ
た。However, in the above-mentioned prior art, the computer to be replaced is determined based on fixed information or based on the experience or intuition of the operator. It was difficult to maximize the overall processing capacity. for that reason,
Even if another computer takes over some of the tasks of the computer in which the stagnation has occurred, if the capacity of the computer that took over is insufficient or the load is too large, stagnation will also occur in the computer that takes over. In some cases, smooth operation of the entire system could not be ensured.

【０００４】また、ある計算機のタスクを他の計算機に
肩代わりさせる場合、タスク処理のプログラムを双方の
計算機に同一内容で定義しておく必要があるが、タスク
によってはこの作業が繁雑となり多くの労力を費やさな
ければならないことがあった。[0004] Further, when a task of one computer is taken over by another computer, it is necessary to define a task processing program in both computers with the same contents. However, depending on the task, this work becomes complicated and a lot of labor is required. Had to spend some time.

【０００５】本発明は上記事情に鑑みてなされたもので
あり、ある計算機に停滞が発生した場合の対処を最適且
つ容易に行うことができ、システム全体の処理能力を最
大限に引き出すことが可能な分散処理計算機システムを
提供することを目的としている。The present invention has been made in view of the above circumstances, and it is possible to optimally and easily cope with a case where a stagnation occurs in a certain computer, thereby maximizing the processing capability of the entire system. It is intended to provide a simple distributed processing computer system.

【０００６】[0006]

【課題を解決するための手段】上記課題を解決するため
の手段として請求項１記載の発明は、複数台の計算機を
ネットワーク上で相互に接続し、これら複数台の計算機
により複数のタスクを分散処理する分散処理計算機シス
テムにおいて、前記複数台の計算機のうちの少なくとも
１台は、各タスク及び各計算機についての管理情報が記
憶された管理情報記憶部と、各計算機の作動状況を監視
し、いずれかの計算機の処理に停滞が発生した場合に
は、前記管理情報記憶部の各情報を参照して、その計算
機が処理していたタスクを他の計算機に割り当てる分散
処理実行手段と、を備えたことを特徴とする。According to a first aspect of the present invention, a plurality of computers are interconnected on a network, and a plurality of tasks are distributed by the plurality of computers. In the distributed processing computer system for processing, at least one of the plurality of computers monitors a management information storage unit in which management information on each task and each computer is stored, and monitors the operation status of each computer. Distributed processing executing means for referring to each information of the management information storage unit and allocating a task processed by the computer to another computer when stagnation occurs in the processing of the computer. It is characterized by the following.

【０００７】請求項２記載の発明は、請求項１記載の発
明において、前記管理情報記憶部に記憶される情報は、
各計算機が処理すべきタスクについての計算機・タスク
割当情報、各計算機の能力を示す計算機資源管理情報、
各タスクの処理に必要な資源及び処理タイミングについ
てのデータを含むタスク資源管理情報、及び各タスクの
処理内容を示す各タスクプログラムデータであり、前記
分散処理実行手段は、前記停滞が発生した計算機のタス
クを前記他の計算機に割り当てる際に、そのタスクに係
るタスクプログラムデータを前記管理情報記憶部から読
み出して当該他の計算機に伝送するものである、ことを
特徴とする。According to a second aspect of the present invention, in the first aspect, the information stored in the management information storage section is:
Computer / task allocation information on tasks to be processed by each computer, computer resource management information indicating the capabilities of each computer,
Task resource management information including data on resources and processing timing required for processing of each task, and each task program data indicating the processing content of each task; the distributed processing execution means includes a computer for which the stagnation has occurred; When assigning a task to the other computer, the task program data related to the task is read from the management information storage unit and transmitted to the other computer.

【０００８】請求項３記載の発明は、請求項１又は２記
載の発明において、前記管理情報記憶部には、各タスク
の優先度についてのタスク優先度情報が含まれており、
前記分散処理実行手段は、いずれかの計算機に停滞が発
生し且つ他のどの計算機もその停滞が発生した計算機の
タスクを処理するだけの余裕を有していない場合に、前
記タスク優先度情報を参照して、比較的優先度の低いタ
スクを処理している一の計算機に対してその処理の一時
停止若しくは遅延、又は他の計算機への移行を指令する
ことにより処理能力に余裕を生じさせ、前記停滞が発生
した計算機のタスクをこの一の計算機に割り当てるもの
である、ことを特徴とする。According to a third aspect of the present invention, in the first or second aspect of the present invention, the management information storage section includes task priority information on the priority of each task,
The distributed processing execution means, when a stagnation occurs in any of the computers and no other computers have a margin to process the task of the computer in which the stagnation occurred, the task priority information By referring to one computer processing a relatively low-priority task, suspending or delaying the processing, or instructing the transfer to another computer, to allow processing capacity, The task of the computer in which the stagnation has occurred is assigned to the one computer.

【０００９】請求項４記載の発明は、請求項１乃至３の
いずれかに記載の発明において、前記分散処理実行手段
は、前記停滞が発生した計算機のタスクを前記他の計算
機又は前記一の計算機に割り当ててそのタスク処理を実
行させると共に、前記管理情報記憶部の計算機・タスク
割当情報をこの割り当て後の内容に更新するものであ
る、ことを特徴とする。According to a fourth aspect of the present invention, in the invention of any one of the first to third aspects, the distributed processing execution means assigns the task of the computer in which the stagnation has occurred to the other computer or the one computer. And the task processing is executed, and the computer / task assignment information in the management information storage unit is updated to the contents after the assignment.

【００１０】[0010]

【発明の実施の形態】以下、本発明の実施形態を図に基
づき説明する。図１は、この実施形態に係る分散処理計
算機システムの構成を示すブロック図である。この図に
おいて、スイッチングハブ１を介して、集中管理計算機
ＰＣ１と分散処理計算機ＰＣ２〜ＰＣ４とが接続されて
いる。このスイッチングハブ１を用いることにより、集
中管理計算機ＰＣ１は複数台の分散処理計算機システム
との間で同時にデータの送受信を行うことができる。こ
の実施形態では、各計算機が実行するタスクは、図示さ
れるように、集中管理計算機ＰＣ１がタスクＡＡＡ、タ
スクＢＢＢであり、分散処理計算機ＰＣ２がタスクＣＣ
Ｃ、タスクＤＤＤであり、分散処理計算機ＰＣ３がタス
クＥＥＥ、タスクＦＦＦ、タスクＧＧＧであり、分散処
理計算機ＰＣ４がタスクＨＨＨ、タスクＩＩＩ、タスク
ＪＪＪとなっている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of the distributed processing computer system according to this embodiment. In this figure, a central management computer PC1 and distributed processing computers PC2 to PC4 are connected via a switching hub 1. By using this switching hub 1, the central management computer PC1 can simultaneously transmit and receive data to and from a plurality of distributed processing computer systems. In this embodiment, the tasks executed by each computer are, as shown, the central management computer PC1 is the task AAA and the task BBB, and the distributed processing computer PC2 is the task CC
C, task DDD, distributed processing computer PC3 is task EEE, task FFF, task GGG, and distributed processing computer PC4 is task HHH, task III, and task JJJ.

【００１１】各計算機のうち集中管理計算機ＰＣ１のみ
が管理情報記憶部２及び分散処理実行手段３を備えてい
る。そして、管理情報記憶部２には、計算機・タスク割
当情報４、計算機資源管理情報５、タスク資源管理情報
６、タスク優先度情報７、及びタスクプログラムデータ
８がファイル形式で格納されている。分散処理実行手段
３は、各計算機ＰＣ１〜ＰＣ４の作動状況を監視し、い
ずれかの計算機に停滞が発生した場合には、管理情報記
憶部２の各情報を参照して、その計算機が処理していた
タスクを処理能力に余裕がある他の計算機に割り当てる
処理を実行するものである。Among the computers, only the central management computer PC1 has the management information storage unit 2 and the distributed processing execution unit 3. The management information storage unit 2 stores computer / task allocation information 4, computer resource management information 5, task resource management information 6, task priority information 7, and task program data 8 in a file format. The distributed processing execution means 3 monitors the operation status of each of the computers PC1 to PC4, and when any one of the computers stagnates, the computer refers to the information in the management information storage unit 2 to perform processing. This process executes a process of allocating the task to another computer having sufficient processing capacity.

【００１２】計算機・タスク割当情報４は、計算機とそ
の計算機が実行するタスクとの対応関係を示したもので
あり、図２に示すようなテーブルにその対応関係につい
ての情報が記載されている。計算機資源管理情報５は、
各計算機の主な資源（リソース）についてのデータを示
したものであり、例えば図３に示すようなテーブルに、
「相対ＣＰＵ性能（ＣＰＵ速度）」及び「メモリ容量」
等のデータが記載されている。タスク資源管理情報６
は、タスクを処理するのに必要な資源及びその他必要な
項目についてのデータを示したものであり、例えば図４
に示すようなテーブルに、「所要メモリ容量」、「所要
ＣＰＵ時間」、「実行タイミング」、及び「許容処理時
間」等が記載されている。また、タスク優先度情報７
は、タスクを優先して処理すべき度合いについての数値
であり、図４に示すように、タスク資源管理情報が記載
されているテーブルに一緒に記載されている（図４の例
では、数値が小さなものほど高い優先度となってい
る。）。タスクプログラムデータ８は、タスクの処理手
順を示すソフトウエアについてのデータであり、計算機
ＰＣ１〜ＰＣ４が本来有しているタスクプログラムデー
タと同一内容で定義されたものである。The computer / task assignment information 4 indicates the correspondence between computers and tasks executed by the computers, and information on the correspondence is described in a table as shown in FIG. The computer resource management information 5 is
It shows data about the main resources of each computer. For example, a table as shown in FIG.
"Relative CPU performance (CPU speed)" and "Memory capacity"
Etc. are described. Task resource management information 6
4 shows data on resources necessary for processing a task and other necessary items. For example, FIG.
In the table as shown in the table, "required memory capacity", "required CPU time", "execution timing", "permissible processing time" and the like are described. Also, the task priority information 7
Is a numerical value of the degree to which the task should be processed with priority, and as shown in FIG. 4, is described together with the table in which the task resource management information is described (in the example of FIG. 4, the numerical value is Smaller things have higher priority.) The task program data 8 is data on software indicating a task processing procedure, and is defined with the same contents as the task program data originally possessed by the computers PC1 to PC4.

【００１３】次に、上記のように構成される図１の分散
処理計算機システムの動作につき、分散処理計算機ＰＣ
４に停滞が発生したのでタスクＨＨＨを他の計算機に肩
代わりさせて処理しようとする場合を例に取り説明す
る。各計算機の作動状況を監視していた分散処理実行手
段３は、分散処理計算機ＰＣ４に停滞が発生したことを
検知すると、管理情報記憶部２の計算機・タスク割当情
報４、計算機資源管理情報５、タスク資源管理情報６、
及びタスク優先度情報７を参照する。Next, the operation of the distributed processing computer system of FIG.
In the following, an example will be described in which the task HHH is to be processed by another computer because the stagnation has occurred in the task HHH. When the distributed processing execution means 3 that has monitored the operation status of each computer detects that a stagnation has occurred in the distributed processing computer PC4, the distributed processing computer 3 has computer / task allocation information 4, computer resource management information 5, Task resource management information 6,
And the task priority information 7.

【００１４】図３に示した計算機資源管理情報によれ
ば、ＰＣ４以外の計算機のうちで相対ＣＰＵ性能が一番
高いのは集中管理計算機ＰＣ１であり、次に、分散処理
計算機ＰＣ３、分散処理計算機ＰＣ２の順となる。した
がって、相対ＣＰＵ性能を考慮した観点からは集中管理
計算機ＰＣ１にタスクＨＨＨの処理を割り当てればよい
ことになる。しかし、タスク優先度情報７を参照する
と、図４に示したように、集中管理計算機ＰＣ１が処理
するタスクＡＡＡ及びタスクＢＢＢの優先度はそれぞれ
「２０」、「１０」となっており、いずれもタスクＨＨ
Ｈの優先度「４０」よりも高いものであるため、集中管
理計算機ＰＣ１にタスクＨＨＨを割り当てることは不可
能である。According to the computer resource management information shown in FIG. 3, among the computers other than PC4, the centralized management computer PC1 has the highest relative CPU performance, followed by the distributed processing computer PC3 and the distributed processing computer PC3. The order is PC2. Therefore, from the viewpoint of considering the relative CPU performance, the task HHH may be assigned to the central management computer PC1. However, referring to the task priority information 7, as shown in FIG. 4, the priorities of the task AAA and the task BBB processed by the central management computer PC1 are "20" and "10", respectively. Task HH
Since the priority of H is higher than “40”, it is impossible to assign the task HHH to the central management computer PC1.

【００１５】そこで、ＰＣ２又はＰＣ３のいずれかの計
算機にタスクＨＨＨの処理を割り当てることになるが、
図３に示した計算機資源管理情報によれば、計算機ＰＣ
２のメモリ容量は５１２ＭＢであり、所要メモリ容量１
２８ＭＢのタスクＨＨＨを処理する場合には殆ど余裕が
なくなることになる。一方、計算機ＰＣ３のメモリ容量
は１０２４ＭＢであり、タスクＨＨＨが割り当てられた
としてもメモリ容量は依然として余裕を有している。よ
って、分散処理実行手段３は、現在、分散処理計算機Ｐ
Ｃ４が処理しているタスクＨＨＨを分散処理計算機ＰＣ
３に割り当てることを決定する。そして、分散処理実行
手段３は、分散処理計算機ＰＣ４に対してはタスクＨＨ
Ｈの処理を停止することを指令し、分散処理計算機ＰＣ
３に対してはタスクＨＨＨの処理を行うべきことを指令
すると共に、管理情報記憶部２からタスクＨＨＨに係る
タスクプログラムデータ８を読み出し、これを分散処理
計算機ＰＣ３に伝送するようにする。Therefore, the task HHH processing is assigned to either the PC2 or the PC3.
According to the computer resource management information shown in FIG.
2 has a memory capacity of 512 MB and a required memory capacity of 1 MB.
When processing the 28 MB task HHH, there is almost no room. On the other hand, the memory capacity of the computer PC3 is 1024 MB, and even if the task HHH is assigned, the memory capacity still has room. Therefore, the distributed processing execution means 3 is currently executing the distributed processing computer P
Distributed processing computer PC for task HHH being processed by C4
3 is assigned. Then, the distributed processing execution means 3 sends the task HH to the distributed processing computer PC4.
H, and instruct the distributed processing computer PC to stop processing.
Instruct the task 3 to execute the task HHH, read the task program data 8 relating to the task HHH from the management information storage unit 2, and transmit this to the distributed processing computer PC3.

【００１６】これにより、分散処理計算機ＰＣ４はタス
クＩＩＩ及びタスクＪＪＪのみの処理を行えばよいこと
になるので、分散処理計算機ＰＣ４の停滞発生が解消さ
れ、タスクＨＨＨは資源余裕のある分散処理計算機ＰＣ
３によって処理される。この後、分散処理実行手段３
は、管理情報記憶部２に記憶される計算機・タスク割当
情報４の内容を、図５に示すように、タスクＨＨＨの割
り当てが分散処理計算機ＰＣ４から分散処理計算機ＰＣ
３に変更された後の内容に更新する（図４のテーブルの
内容もこれに基づき更新される。）。そして、分散処理
実行手段３は、以後、タスクの割り当ての変更を行う場
合は、この更新後の情報を参照する。As a result, the distributed processing computer PC4 only needs to execute the processing of the task III and the task JJJ. Therefore, the occurrence of the stagnation of the distributed processing computer PC4 is eliminated, and the task HHH is allocated to the distributed processing computer PC having a sufficient resource.
3 is processed. Thereafter, the distributed processing execution means 3
5, the contents of the computer / task allocation information 4 stored in the management information storage unit 2 are changed from the distributed processing computer PC4 to the task HHH, as shown in FIG.
3 (the contents of the table in FIG. 4 are also updated based on this). Then, when the allocation of tasks is changed thereafter, the distributed processing execution unit 3 refers to the updated information.

【００１７】上記した分散処理実行手段３によるタスク
ＨＨＨの割り当て処理は、各計算機の計算機資源管理情
報とタスクＨＨＨの処理に必要なタスク資源管理情報と
を対比し、更に各タスクの優先度を考慮した上で、最適
の計算機である分散処理計算機ＰＣ３に肩代わり処理さ
せるものである。したがって、分散処理計算機ＰＣ４の
停滞状況を解消させて、システム全体の円滑な稼働を確
保することができるようになる。また、あるタスクの処
理を肩代わりすることになった計算機に対しては、その
都度そのタスクに係るタスクプログラムデータを分散処
理実行手段３が伝送すればよいので、予め各計算機に対
して他の計算機が処理すべきタスクプログラムデータを
同一内容で定義しておく必要はなくなる。The task HHH allocation processing by the distributed processing execution means 3 compares the computer resource management information of each computer with the task resource management information required for the processing of the task HHH, and further considers the priority of each task. After that, the distributed processing computer PC3, which is the most suitable computer, performs the shoulder processing. Therefore, the stagnation state of the distributed processing computer PC4 can be eliminated, and the smooth operation of the entire system can be ensured. Further, the distributed processing execution means 3 only needs to transmit the task program data relating to the task to the computer that has been assigned to the processing of a certain task each time. It is no longer necessary to define task program data to be processed with the same contents.

【００１８】次に、上記のように停滞が発生した分散処
理計算機ＰＣ４のタスクＨＨＨを分散処理計算機ＰＣ３
に肩代わりさせ処理させているうちに、更に、分散処理
計算機ＰＣ２にも停滞が発生した場合につき説明する。Next, the task HHH of the distributed processing computer PC4 where the stagnation has occurred as described above is transferred to the distributed processing computer PC3.
Next, a case will be described in which the stagnation occurs in the distributed processing computer PC2 while the processing is performed in the same manner as described above.

【００１９】分散処理実行手段３は、停滞が発生した分
散処理計算機ＰＣ２のタスクＣＣＣを集中管理計算機Ｐ
Ｃ１又は分散処理計算機ＰＣ３に肩代わりさせようとす
る。しかし、集中管理計算機ＰＣ１及び分散処理計算機
ＰＣ３のいずれの演算処理能力もタスクＣＣＣの処理を
実行するためには不足している。そこで、分散処理実行
手段３は、集中管理計算機ＰＣ１に対してタスクＢＢＢ
よりも優先度の低いタスクＡＡＡをディスパッチ（ＣＰ
ＵのＲＡＭを占有しているタスクＡＡＡについてのデー
タを一旦ハードディスクへ退避させること）してタスク
ＣＣＣの処理を実行することを指令する。このタスクＡ
ＡＡのディスパッチにより、集中管理計算機ＰＣ１は、
タスクＣＣＣの処理を実行するために必要な演算処理能
力を確保することができる。The distributed processing execution means 3 transmits the task CCC of the distributed processing computer PC2 in which the stagnation has occurred to the central management computer P2.
Attempt to replace C1 or the distributed processing computer PC3. However, the arithmetic processing capabilities of both the central management computer PC1 and the distributed processing computer PC3 are insufficient for executing the processing of the task CCC. Therefore, the distributed processing execution means 3 sends the task BBB to the central management computer PC1.
Dispatch task AAA with lower priority than (CP
The data of the task AAA occupying the RAM of U is temporarily saved to the hard disk), and the process of the task CCC is executed. This task A
By the dispatch of the AA, the central management computer PC1
Arithmetic processing capacity necessary for executing the processing of the task CCC can be secured.

【００２０】次いで、分散処理実行手段３は、管理情報
記憶部２からタスクＣＣＣに係るタスクプログラムデー
タを読み出し、これを集中管理計算機ＰＣ１内のタスク
処理実行部（図示せず）に出力する。これにより、タス
クＢＢＢ及びタスクＣＣＣの処理が集中管理計算機ＰＣ
１で実行されることになる。なお、この場合、ディスパ
ッチしたタスクＡＡＡについては、これを分散処理計算
機ＰＣ３に移行させて処理することができ、更に、集中
管理計算機ＰＣ１でタスクＡＡＡ又はタスクＢＢＢのい
ずれかの処理が終了した時点で、再度集中管理計算機Ｐ
Ｃ１で処理するようにすることもできる。Next, the distributed processing execution means 3 reads the task program data relating to the task CCC from the management information storage unit 2 and outputs this to a task processing execution unit (not shown) in the central management computer PC1. As a result, the processing of the task BBB and the task CCC can be performed by the central management computer PC.
1 will be executed. In this case, the dispatched task AAA can be transferred to the distributed processing computer PC3 for processing. Further, when the processing of either the task AAA or the task BBB is completed in the central management computer PC1, , Again centralized management computer P
The processing may be performed in C1.

【００２１】この後、分散処理実行手段３は、管理情報
記憶部２に記憶される計算機・タスク割当情報４の内容
を、図６に示すように、タスクＣＣＣの割り当てが分散
処理計算機ＰＣ２から集中管理計算機ＰＣ１に変更され
た後の内容に更新し、以後、タスクの割り当ての変更を
行う場合は、この更新後の情報を参照する。Thereafter, the distributed processing execution means 3 stores the contents of the computer / task allocation information 4 stored in the management information storage unit 2 as shown in FIG. When the content is updated to the content after the change to the management computer PC1 and the task assignment is changed thereafter, the updated information is referred to.

【００２２】上述したように、本発明に係る分散処理計
算機システムでは、停滞が発生した計算機ＰＣ４のタス
クＨＨＨを計算機ＰＣ３に肩代わりさせ処理させている
うちに、更に、計算機ＰＣ２にも停滞が発生し、この計
算機ＰＣ３のタスクＣＣＣを実行するための演算処理能
力については計算機ＰＣ１，ＰＣ３のいずれも不足して
いる場合に、計算機ＰＣ１のタスクのうち優先度の低い
方のタスクＡＡＡをディスパッチするか、あるいは計算
機ＰＣ３に処理させるようにしている。すなわち、本発
明に係る分散処理計算機システムによれば、システムの
実際の稼働状況に対応したフレキシブルな処理が行われ
ることになり、限られた資源の下でシステム全体の処理
能力を最大限に引き出すことができる。As described above, in the distributed processing computer system according to the present invention, while the task HHH of the computer PC4 in which the stagnation has occurred is taken over by the computer PC3, the stagnation occurs in the computer PC2. Regarding the arithmetic processing capability of the computer PC3 for executing the task CCC, if none of the computers PC1 and PC3 is insufficient, the task AAA of the lower priority of the tasks of the computer PC1 is dispatched. Alternatively, the processing is performed by the computer PC3. That is, according to the distributed processing computer system according to the present invention, flexible processing corresponding to the actual operation status of the system is performed, and the processing capacity of the entire system is maximized under limited resources. be able to.

【００２３】なお、上記実施形態では、管理情報記憶部
及び分散処理実行手段を備えた集中管理計算機が１台で
ある場合を例に取り説明したが、システムの計算機台数
が多い場合には、集中管理計算機を２台以上設置するよ
うにしてもよい。In the above-described embodiment, an example is described in which the number of centralized management computers provided with the management information storage unit and the distributed processing execution means is one. However, when the number of computers in the system is large, Two or more management computers may be installed.

【００２４】[0024]

【発明の効果】以上のように、本発明によれば、複数台
の計算機のうちの少なくとも１台は、管理情報記憶部及
び分散処理実行手段を備えており、この分散処理実行手
段が、各計算機の作動状況を監視し、いずれかの計算機
に停滞が発生した場合には、管理情報記憶部の各情報を
参照して、その計算機が処理していたタスクを処理能力
に余裕が有る他の計算機に割り当てるようにしているの
で、ある計算機に停滞が発生した場合の対処を最適且つ
容易に行うことができ、システム全体の処理能力を最大
限に引き出すことが可能な分散処理計算機システムを実
現することができる。As described above, according to the present invention, at least one of a plurality of computers includes a management information storage unit and distributed processing execution means, and the distributed processing execution means The operation status of the computer is monitored, and if a stagnation occurs in one of the computers, a task that the computer has processed is referred to by referring to each information in the management information storage unit. Since the assignment is made to the computers, it is possible to optimally and easily cope with a stagnation of a certain computer, and to realize a distributed processing computer system capable of maximizing the processing capacity of the entire system. be able to.

[Brief description of the drawings]

【図１】本発明の実施形態に係る分散処理計算機システ
ムの構成を示すブロック図。FIG. 1 is a block diagram showing a configuration of a distributed processing computer system according to an embodiment of the present invention.

【図２】図１における計算機・タスク割当情報４のデー
タ構成例を示す説明図。FIG. 2 is an explanatory diagram showing a data configuration example of computer / task assignment information 4 in FIG. 1;

【図３】図１における計算機資源管理情報５のデータ構
成例を示す説明図。FIG. 3 is an explanatory diagram showing a data configuration example of computer resource management information 5 in FIG. 1;

【図４】図１におけるタスク資源管理情報６及びタスク
優先度情報７のデータ構成例を示す説明図。FIG. 4 is an explanatory diagram showing a data configuration example of task resource management information 6 and task priority information 7 in FIG. 1;

【図５】停滞が発生した計算機のタスクを他の計算機に
割り当てた場合における計算機・タスク割当情報４のデ
ータ構成例を示す説明図。FIG. 5 is an explanatory diagram showing a data configuration example of computer / task assignment information 4 when a task of a computer in which stagnation has occurred is assigned to another computer.

【図６】停滞が発生した計算機のタスクを他の計算機に
割り当てた場合における計算機・タスク割当情報４のデ
ータ構成例を示す説明図。FIG. 6 is an explanatory diagram showing a data configuration example of computer / task assignment information 4 when a task of a computer in which stagnation has occurred is assigned to another computer.

[Explanation of symbols]

ＰＣ１集中管理計算機ＰＣ２〜ＰＣ４分散処理計算機１スイッチングハブ２管理情報記憶部３分散処理実行手段４計算機・タスク割当情報５計算機資源管理情報６タスク資源管理情報７タスク優先度情報８タスクプログラムデータ PC1 Centralized management computer PC2 to PC4 Distributed processing computer 1 Switching hub 2 Management information storage unit 3 Distributed processing execution means 4 Computer / task allocation information 5 Computer resource management information 6 Task resource management information 7 Task priority information 8 Task program data

Claims

[Claims]

1. A distributed processing computer system in which a plurality of computers are mutually connected on a network and a plurality of tasks are distributed and processed by the plurality of computers, wherein at least one of the plurality of computers includes: A management information storage unit in which management information on each task and each computer is stored; and an operation status of each computer is monitored. If a stagnation occurs in the processing of any of the computers, each of the management information storage units Distributed processing execution means for referring to information and allocating a task being processed by the computer to another computer.

2. The information stored in the management information storage unit includes:
Computer / task allocation information on tasks to be processed by each computer, computer resource management information indicating the capabilities of each computer,
Task resource management information including data on resources and processing timing required for processing of each task, and each task program data indicating the processing content of each task; the distributed processing execution means includes a computer for which the stagnation has occurred; 2. The distributed processing according to claim 1, wherein when assigning a task to the other computer, the task program data related to the task is read from the management information storage unit and transmitted to the other computer. 3. Computer system.

3. The management information storage section includes task priority information on the priority of each task, and the distributed processing execution means determines that a stagnation occurs in one of the computers and the other If the computer also does not have enough time to process the task of the computer in which the stagnation has occurred, the computer refers to the task priority information, and the computer that is processing the task with a relatively low priority is sent to one computer. On the other hand, by giving a command to suspend or delay the processing, or to shift to another computer, a margin is provided in the processing capacity, and the task of the computer in which the stagnation has occurred is assigned to this one computer. 3. The distributed processing computer system according to claim 1, wherein:

4. The distributed processing execution means allocates the task of the computer in which the stagnation has occurred to the other computer or the one computer to execute the task processing, and executes the computer / task of the management information storage unit. 4. The distributed processing computer system according to claim 1, wherein the allocation information is updated to the content after the allocation.