JP3910982B2

JP3910982B2 - Computer system, service load balancing method and program

Info

Publication number: JP3910982B2
Application number: JP2004248949A
Authority: JP
Inventors: 研一溝口
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2003-08-29
Filing date: 2004-08-27
Publication date: 2007-04-25
Anticipated expiration: 2024-08-27
Also published as: JP2005100381A

Description

本発明は、複数の計算機を備え、複数種類のサービスを実行可能な計算機システムに係り、特に非対称なリソース環境におけるサービスの負荷分散に好適な計算機システム及びプログラムに関する。 The present invention relates to a computer system including a plurality of computers and capable of executing a plurality of types of services, and more particularly to a computer system and a program suitable for load distribution of services in an asymmetric resource environment.

従来から、多数のクライアント端末からのサービス実行要求を効率的に処理するために、これらの実行要求を複数のサーバ計算機に分配するサーバ負荷分散システムが知られている（例えば非特許文献１または２参照）。このサーバ負荷分散システムは一般に、対称（均一）なリソース環境を持つ複数のサーバ計算機と、負荷分散装置とから構成される。負荷分散装置はクライアント端末からネットワーク（外部ネットワーク）を介してサービスの実行要求（リクエスト）を受け取ると、当該要求で指定されたサービスを実行すべきサーバ計算機を選択する。このサーバ計算機の選択は、特定のサーバ計算機だけに多大な負荷がかからないように考慮して行われる。つまり負荷分散装置は、同種のサービスの実行を複数の計算機に振り分ける。 Conventionally, in order to efficiently process service execution requests from a large number of client terminals, a server load distribution system that distributes these execution requests to a plurality of server computers is known (for example, Non-Patent Document 1 or 2). reference). This server load distribution system is generally composed of a plurality of server computers having a symmetric (uniform) resource environment and a load distribution apparatus. When the load distribution apparatus receives a service execution request (request) from a client terminal via a network (external network), the load distribution apparatus selects a server computer to execute the service specified by the request. The selection of the server computer is performed in consideration of not applying a great load only to the specific server computer. That is, the load balancer distributes the execution of the same type of service to a plurality of computers.

サーバ負荷分散システムでは、サービスを実行する計算機を選択するのに、つまりサービスのスケジュールに、（１）ラウンドロビン法、（２）重み付けラウンドロビン法、（３）最小接続法、或いは（４）最速法が適用されるのが一般的である。ラウンドロビン法は、各サーバ計算機を一定の順番で均等に選択する方法である。重み付けラウンドロビン法は、各計算機に能力に見合った重み（選択され易さ）を付けることで、ラウンドロビン法を基本としながら、各サーバ計算機の能力に応じて当該計算機が選択される頻度を変える方法である。最小接続法は、その時点でコネクション（セッション）数が最小の計算機を選択する方法である。最速法は、その時点で最も速く応答することが可能な計算機を選択する方法である。 In the server load balancing system, in order to select a computer that executes a service, that is, in a service schedule, (1) round robin method, (2) weighted round robin method, (3) minimum connection method, or (4) fastest The law is generally applied. The round robin method is a method of selecting each server computer equally in a certain order. The weighted round robin method changes the frequency with which each computer is selected according to the capabilities of each server computer, by giving each computer a weight (ease of being selected) according to the capability (based on the round robin method). Is the method. The minimum connection method is a method of selecting a computer having the smallest number of connections (sessions) at that time. The fastest method is a method of selecting a computer that can respond the fastest at that time.

負荷分散装置は、上記（１）乃至（４）のうちの何れかの方法で、サービスを実行すべきサーバ計算機を選択すると、クライアント端末からの実行要求を、選択したサーバ計算機にネットワーク（内部ネットワーク）を経由して送る。サーバ計算機は、この実行要求に基づいてサービスを実行し、応答を負荷分散装置に送る。負荷分散装置はサーバ計算機から送られた応答を要求元のクライアント端末に返す。 When the load distribution device selects a server computer to execute a service by any one of the methods (1) to (4), the load distribution device sends an execution request from the client terminal to the selected server computer via a network (internal network). ) To send via. The server computer executes a service based on this execution request and sends a response to the load balancer. The load balancer returns the response sent from the server computer to the requesting client terminal.

負荷分散装置は、サーバ計算機からの応答を監視する。そして負荷分散装置は、サーバ計算機からの応答が一定時間内に返ってこないタイムアウトをもとに、サーバ計算機の障害を検出する。このサーバ計算機の障害には、サーバ計算機自体の障害と、サーバ計算機によるサービス実行に関する障害とがある。負荷分散装置は、サーバ計算機の障害を検出すると、その障害計算機へのサービスの割り当てを行わないことで、システムを縮退運転させる。 The load balancer monitors a response from the server computer. Then, the load balancer detects a failure of the server computer based on a timeout in which a response from the server computer does not return within a certain time. The failure of the server computer includes a failure of the server computer itself and a failure related to service execution by the server computer. When the load balancer detects a failure in the server computer, the load balancing device causes the system to perform a degenerate operation by not allocating the service to the failure computer.

一方、近年は、クラスタシステムと呼ばれる計算機システムが出現している（例えば、非特許文献３参照）。クラスタシステムは、非対称なリソース環境を持つ複数の計算機で構成される。クラスタシステムでは、非対称なリソース環境を持つ複数の計算機に対して、予めユーザによって綿密に計画された形で機能的に異なるサービス（異種のサービス）が割り当てられる。クラスタシステム内の各計算機は互いにネットワークを介して通信を行うことで、サービスを実行中の計算機の障害を検出する。この場合、クラスタシステムでは、障害が検出された計算機で実行されていたサービスを他の計算機に割り当て直す（フェイルオーバする）再スケジュールが行われる。これにより、サービス（業務）停止時間を短くでき、ＨＡ（High Availability）と呼ばれる高い可用性（サーバ稼動率、業務稼動率）が実現できる。そこで、このようなクラスタシステムは。ＨＡクラスタシステムと呼ばれる。 On the other hand, in recent years, a computer system called a cluster system has appeared (for example, see Non-Patent Document 3). The cluster system is composed of a plurality of computers having an asymmetric resource environment. In a cluster system, functionally different services (heterogeneous services) are assigned to a plurality of computers having an asymmetric resource environment in a form carefully planned by a user in advance. Each computer in the cluster system communicates with each other via a network to detect a failure of the computer executing the service. In this case, in the cluster system, rescheduling is performed to reassign (fail over) the service that was being executed on the computer in which the failure was detected. As a result, service (business) downtime can be shortened, and high availability (HA availability, business availability) called HA (High Availability) can be realized. So such a cluster system. This is called an HA cluster system.

クラスタシステムにおけるサービスの再スケジュールは、待機系の計算機に対して行われるのが一般的である。このようなクラスタシステムでは、サービスのスケジューリングで計算機の負荷状況が考慮されることはない。また、ユーザの操作で、システム内の計算機毎に処理能力を設定すると共に、システムが提供可能なサービス毎に当該サービスの実行に必要な処理能力（チケット）を設定することで、特定の計算機に処理能力以上のサービスが割り当てられないようにする静的チケット方式のクラスタシステムが知られている。
Rajkumar Buyya（ラジクマル・ブイヤ），“High Performance Cluster Computing: Architecture and Systems（Volume 1）”，１９９９年，Prentice-Hall, Inc.，p.340-363 Tony Bourke（トニー・ブルーク），“Server Load Balancing”，O'Relly & Associates, Inc., p.3-31, December 2001 金子哲夫、森良哉、「クラスタソフトウェア」、東芝レビュー、Vol.54 No.12(1999)、p.18-21 Generally, rescheduling of services in a cluster system is performed for a standby computer. In such a cluster system, the load status of the computer is not taken into consideration in service scheduling. In addition, by setting the processing capability for each computer in the system by the user's operation, and setting the processing capability (ticket) necessary for execution of the service for each service that can be provided by the system, a specific computer can be set. 2. Description of the Related Art A static ticket type cluster system that prevents a service exceeding its processing capacity from being assigned is known.
Rajkumar Buyya, “High Performance Cluster Computing: Architecture and Systems (Volume 1)”, 1999, Prentice-Hall, Inc., p.340-363 Tony Bourke, “Server Load Balancing”, O'Relly & Associates, Inc., p.3-31, December 2001 Tetsuo Kaneko, Yoshiya Mori, "Cluster Software", Toshiba Review, Vol.54 No.12 (1999), p.18-21

上記したように、従来のサーバ負荷分散システムは、対称なリソース環境を持つ複数のサーバ計算機を対象として動的に負荷分散することが可能である。しかしながら従来のサーバ負荷分散システムは、複雑な非対称なリソース環境を持つ複数のサーバ計算機を対象として動的に負荷分散すること、つまり複雑な非対称なリソース環境で動作するサービスの確実な実行制御は行えない。また、従来のサーバ負荷分散システムでは、計算機からの応答のタイムアウトにより当該計算機の障害検出が行われるため、計算機障害を迅速に検出できない。 As described above, the conventional server load balancing system can dynamically load balance a plurality of server computers having a symmetric resource environment. However, the conventional server load distribution system can dynamically distribute loads to multiple server computers with complex asymmetric resource environments, that is, it can perform reliable execution control of services that operate in complex asymmetric resource environments. Absent. Further, in the conventional server load distribution system, since the failure of the computer is detected due to a response timeout from the computer, the computer failure cannot be detected quickly.

一方、非対称なリソース環境を持つ従来のクラスタシステムでの負荷分散は、機能的な負荷分散をユーザが綿密に計画することで実現されるか、或いはサービス毎に固定的なチケットを割り当てる静的チケット方式で実現されている。このため従来のクラスタシステムでは動的に負荷分散することができず、しかも静的チケット方式では、実際の負荷状況と異なるサービスの配置が行われる虞がある。 On the other hand, load balancing in a conventional cluster system with an asymmetric resource environment is realized by careful planning of functional load balancing by the user, or a static ticket that assigns a fixed ticket for each service. It is realized by the method. For this reason, the conventional cluster system cannot dynamically distribute the load, and in the static ticket method, there is a possibility that services different from the actual load situation may be arranged.

本発明は上記事情を考慮してなされたものでその目的は、非対称なリソース環境におけるサービスの負荷分散が動的に行える計算機システム、サービス負荷分散方法及びプログラムを提供することにある。 The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a computer system, a service load distribution method, and a program capable of dynamically distributing service loads in an asymmetric resource environment.

本発明の１つの観点によれば、複数の計算機を備え、複数種類のサービスを実行可能な計算機システムが提供される。この計算機システムは、上記各計算機で実行されるサービスの負荷をサービス負荷として計測するサービス負荷計測手段と、上記各計算機の負荷をノード負荷として計測するノード負荷計測手段と、上記サービス負荷計測手段及び上記ノード負荷計測手段の計測結果をもとに、上記複数の計算機のうちサービス実行に最適な計算機及び当該最適な計算機に再配置するサービスを決定して、当該最適な計算機に当該サービスを再配置するサービス最適配置手段とを備えることを特徴とする。 According to one aspect of the present invention, a computer system including a plurality of computers and capable of executing a plurality of types of services is provided. The computer system includes a service load measuring unit that measures a load of a service executed on each computer as a service load, a node load measuring unit that measures a load on each of the computers as a node load, the service load measuring unit, Based on the measurement result of the node load measuring means, the most suitable computer for service execution among the plurality of computers and the service to be relocated to the optimum computer are determined, and the service is relocated to the optimum computer. And an optimum service arrangement means.

このような構成においては、システム内の各計算機で実行されるサービスの負荷（サービス負荷）と、当該各計算機の負荷（ノード負荷）とをもとに、サービス実行に最適な計算機及び当該最適な計算機に再配置するサービスが決定される。これにより、上記各計算機が有するリソース環境が非対称であったとしても、サービスの負荷分散が動的に行える。 In such a configuration, based on the service load (service load) executed on each computer in the system and the load (node load) on each computer, the optimal computer for service execution and the optimal The service to be relocated to the computer is determined. As a result, even if the resource environment of each of the computers is asymmetric, service load distribution can be performed dynamically.

ここで、上記サービス負荷計測手段に、上記各計算機でのサービスの実行に消費されるリソース量をもとに、当該サービスの動的負荷を表す動的サービスチケット値を上記サービス負荷として算出する動的サービスチケット値算出手段を持たせると共に、上記ノード負荷計測手段に、上記サービス負荷計測手段によって算出された上記各計算機で実行されているサービスの動的サービスチケット値をもとに、上記各計算機のノード負荷を表す総サービスチケット値を算出する総サービスチケット値算出手段と、この総サービスチケット値算出手段によって算出された上記各計算機の総サービスチケット値、及び予め設定されている上記各計算機の処理能力を表す静的ノードチケット値をもとに、上記各計算機でそれぞれ新たに使用可能なチケット値を動的ノードチケット値として算出する動的ノードチケット値算出手段とを持たせ、更に上記サービス最適配置手段には選択手段を持たせ、上記動的ノードチケット値が一定値以下になる可能性のある計算機で実行中のサービスの中から上記最適計算機に再配置されるサービスが上記選択手段によって選択される構成とすると良い。この場合、最適な負荷分散が可能となる。 Here, the service load measuring means calculates the dynamic service ticket value representing the dynamic load of the service as the service load based on the amount of resources consumed for executing the service on each computer. And each node load measuring means based on the dynamic service ticket value of the service being executed by each computer calculated by the service load measuring means. A total service ticket value calculating means for calculating a total service ticket value representing the node load of the computer, a total service ticket value of each of the computers calculated by the total service ticket value calculating means, and a preset value of each of the computers Based on the static node ticket value that represents the processing capacity, each of the above computers can use a new ticket. A dynamic node ticket value calculating means for calculating a dynamic node ticket value as a dynamic node ticket value, and a selection means for the service optimal arrangement means so that the dynamic node ticket value can be equal to or less than a predetermined value. It is preferable that a service to be relocated to the optimal computer is selected by the selection means from among the services being executed on a compatible computer. In this case, optimal load distribution is possible.

また、最適計算機が、上記各計算機の動的ノードチケット値をもとに上記サービス最適配置手段によって検索される構成とするならば、より最適な負荷分散が可能となる。 Further, if the optimum computer is configured to be searched by the service optimum arrangement unit based on the dynamic node ticket value of each computer, more optimal load distribution is possible.

更に、上記各計算機の動的ノードチケット値と、上記選択されたサービスを上記各計算機で実行するのに必要であると予め予測されているリソース量を表す静的サービスチケット値及び当該計算機で実行されている当該選択されたサービスの上記動的サービスチケット値の大きい方の値とをもとに、動的ノードチケット値が当該大きい方の値を超える計算機が上記最適計算機として検索される構成とするならば、より一層最適な負荷分散が可能となる。 Furthermore, the dynamic node ticket value of each computer, the static service ticket value that represents the resource amount that is predicted in advance to be executed by each computer, and the computer that executes the selected service. A computer whose dynamic node ticket value exceeds the larger value is searched as the optimum computer based on the larger value of the selected dynamic service ticket value of the selected service. If so, even more optimal load distribution becomes possible.

また、予め定められた並列実行型サービスを、上記複数の計算機のうちの少なくとも２つの計算機で並行して実行させる並列実行型サービス実行手段を追加すると共に、上記サービス最適配置手段に最適サービス数算出手段及びサービス数調整手段を持たせ、上記最適サービス数算出手段では、上記システム内で並行して実行される並列実行型サービスの最適な実行数を表す最適サービス数が、上記各計算機における当該並列実行型サービスの動的サービスチケット値、及び当該並列実行型サービスを上記各計算機上で実行するのに必要であると予め予測されているリソース量を表す静的サービスチケット値をもとに算出され、上記サービス数調整手段では、上記最適サービス数算出手段によって算出された最適サービス数とその時点において上記システム内で実際に実行されている上記並列実行型サービスの実行数である現サービス数との大小関係に応じて、上記並列実行型サービス実行手段によって実際に実行させられる並列実行型サービスの実行数が調整される構成とすると良い。このようにすると、上記計算機システムがクラスタシステムであって、上記各計算機が有するリソース環境が非対称であったとしても、当該システムで並行して実行される並列実行型サービスの実行数を最適な数に調整できる。 In addition, parallel execution type service execution means for executing a predetermined parallel execution type service in parallel on at least two of the plurality of computers is added, and the optimum number of services is calculated in the optimum service arrangement means. Means for adjusting the number of services, and in the optimum service number calculation means, the optimum service number representing the optimum execution number of the parallel execution type service executed in parallel in the system It is calculated based on the dynamic service ticket value of the execution type service and the static service ticket value that represents the resource amount that is predicted in advance to be executed on each computer. In the service number adjusting means, the optimum service number calculated by the optimum service number calculating means and the current time Execution of a parallel execution type service that is actually executed by the parallel execution type service execution means in accordance with the magnitude relationship with the current service number that is the number of executions of the parallel execution type service that is actually executed in the system. It is preferable that the number is adjusted. In this way, even if the computer system is a cluster system and the resource environment of each computer is asymmetric, the optimal number of parallel execution services executed in parallel on the system is determined. Can be adjusted.

本発明によれば、システム内の各計算機で実行されるサービスの負荷と、当該各計算機の負荷とをもとに、サービス実行に最適な計算機及び当該最適な計算機に再配置するサービスが決定されることから、当該各計算機が有するリソース環境が非対称であったとしても、サービスの負荷分散が動的に行える According to the present invention, on the basis of the load of services executed on each computer in the system and the load on each computer, the optimal computer for service execution and the service to be relocated to the optimal computer are determined. Therefore, even if the resource environment of each computer is asymmetric, service load distribution can be performed dynamically.

以下、本発明の一実施形態につき図面を参照して説明する。図１は本発明の一実施形態に係る高可用性クラスタシステム（ＨＡクラスタシステム）の構成を示すブロック図である。図１のクラスタシステムは、４台の計算機（サーバ計算機）１０-1〜１０-4から構成されている。計算機１０-1〜１０-4は、当該計算機１０-1〜１０-4間の通信に用いられるネットワーク（内部ネットワーク）２０によって相互接続されている。図１では、計算機１０-1〜１０-4とクライアント端末との間の通信に用いられるネットワーク（外部ネットワーク）は省略されている。クライアント端末からのサービスの実行要求は、この外部ネットワークを介して図１のクラスタシステムに送信される。クラスタシステム内の計算機１０-i（ｉ＝１〜４）は、クライアント端末からの要求で指定されたサービスを実行し、その実行結果を示す応答を外部ネットワークを介して要求元のクライアント端末に返す。なお、計算機１０-1〜１０-4間の通信と、計算機１０-1〜１０-4とクライアント端末との間の通信とが、同一のネットワークを介して行われる構成であっても構わない。但し、ネットワークにおける通信トラフィックの増大を招く。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration of a high availability cluster system (HA cluster system) according to an embodiment of the present invention. The cluster system shown in FIG. 1 includes four computers (server computers) 10-1 to 10-4. The computers 10-1 to 10-4 are interconnected by a network (internal network) 20 used for communication between the computers 10-1 to 10-4. In FIG. 1, a network (external network) used for communication between the computers 10-1 to 10-4 and the client terminal is omitted. A service execution request from the client terminal is transmitted to the cluster system of FIG. 1 via this external network. The computers 10-i (i = 1 to 4) in the cluster system execute the service specified by the request from the client terminal, and return a response indicating the execution result to the requesting client terminal via the external network. . The communication between the computers 10-1 to 10-4 and the communication between the computers 10-1 to 10-4 and the client terminal may be performed via the same network. However, it increases communication traffic in the network.

計算機１０-1〜１０-4では、オペレーティングシステム（以下、ＯＳと称する）１１-1〜１１-4が動作する。また、計算機１０-1〜１０-4によって構成されるクラスタシステムでは、クラスタ制御機構１２が動作する。クラスタ制御機構１２は、計算機１０-1〜１０-4にそれぞれ設けられたクラスタ制御部（図示せず）が相互に通信を行いながら同期して一体となって動作することにより実現されるバーチャルマシンである。このため、クラスタ制御機構１２は、計算機１０-1〜１０-4にまたがって存在していると考えることができる。クラスタ制御部は、サービスの負荷分散を含むクラスタ制御用のプログラム（クラスタソフトウェア）を計算機１０-iが読み取り実行することにより実現される。このクラスタ制御用のプログラムは、計算機１０-iで読み取り可能な記憶媒体（フロッピー（登録商標）ディスクに代表される磁気ディスク、ＣＤ−ＲＯＭ、ＤＶＤに代表される光ディスク、フラッシュメモリに代表される半導体メモリ等）に予め格納して頒布可能である。また、このプログラムが、ネットワークを介してダウンロード（頒布）されても構わない。クラスタ制御機構１２は、各計算機１０-1〜１０-4上のクラスタ制御部が相互に通信を行いながら同期して動作することにより、計算機の障害を迅速に検出することができる。 In the computers 10-1 to 10-4, operating systems (hereinafter referred to as OSs) 11-1 to 11-4 operate. In the cluster system constituted by the computers 10-1 to 10-4, the cluster control mechanism 12 operates. The cluster control mechanism 12 is a virtual machine realized by a cluster control unit (not shown) provided in each of the computers 10-1 to 10-4 operating in synchronism with each other while communicating with each other. It is. Therefore, it can be considered that the cluster control mechanism 12 exists across the computers 10-1 to 10-4. The cluster control unit is realized by the computer 10-i reading and executing a cluster control program (cluster software) including service load balancing. The cluster control program is a storage medium (a magnetic disk represented by a floppy (registered trademark) disk, an optical disk represented by a DVD, a semiconductor represented by a flash memory) that can be read by the computer 10-i. It can be stored in advance in a memory or the like and distributed. Further, this program may be downloaded (distributed) via a network. The cluster control mechanism 12 can quickly detect a failure of a computer when the cluster control units on the computers 10-1 to 10-4 operate synchronously while communicating with each other.

クラスタ制御機構１２は、サービス最適配置機構１２１とサービス制御機構１２２とを含む。サービス最適配置機構１２１は、計算機１０-1〜１０-4にそれぞれ設けられたサービス最適配置部（図示せず）が相互に通信を行いながら同期して一体となって動作することにより実現される。サービス最適配置機構１２１は、サービスを実行中の計算機の障害発生時、或いはサービスの負荷状況の変化時に、当該サービスを実行するのに最適な計算機を決定する機能を有する。サービス最適配置機構１２１はまた、決定された最適な計算機にサービスを再配置する機能を有する。サービス最適配置機構１２１は更に、後述する並列実行型サービス実行部１３によって並行して実行される同種のサービス（並列実行型サービスＰＳＶＣ）の実行数を最適な数に調整する機能を有する。 The cluster control mechanism 12 includes a service optimal arrangement mechanism 121 and a service control mechanism 122. The service optimum arrangement mechanism 121 is realized by a service optimum arrangement unit (not shown) provided in each of the computers 10-1 to 10-4 operating synchronously and integrally while communicating with each other. . The service optimal arrangement mechanism 121 has a function of determining an optimal computer for executing the service when a failure of the computer executing the service occurs or when the load status of the service changes. The service optimal arrangement mechanism 121 also has a function of rearranging services to the determined optimal computer. The service optimum arrangement mechanism 121 further has a function of adjusting the number of executions of the same type of service (parallel execution type service PSVC) executed in parallel by the parallel execution type service execution unit 13 described later to an optimal number.

サービス制御機構１２２は、計算機１０-1〜１０-4にそれぞれ設けられたサービス制御部（図示せず）が相互に通信を行いながら同期して一体となって動作することにより実現される。サービス制御機構１２２は、サービス最適配置機構１２１の制御のもとで、当該サービス最適配置機構１２１によって決定された計算機にサービスをスイッチオーバする機能を有する。 The service control mechanism 122 is realized by a service control unit (not shown) provided in each of the computers 10-1 to 10-4 operating synchronously and integrally while communicating with each other. The service control mechanism 122 has a function of switching over the service to the computer determined by the service optimum arrangement mechanism 121 under the control of the service optimum arrangement mechanism 121.

図１のクラスタシステムではまた、クラスタ制御機構１２によって管理される並列実行型サービス実行部１３が動作する。並列実行型サービス実行部１３は、クラスタ制御機構１２と同様に計算機１０-1〜１０-4によって実現され、当該計算機１０-1〜１０-4にまたがって存在する。並列実行型サービス実行部１３は、サービスＰＳＶＣを計算機１０-1〜１０-4のうちの複数の計算機（ノード）上で並行して実行する機能を有する。この並列実行型サービス実行部１３によって並行して実行可能なタイプのサービスＰＳＶＣを並列実行型サービスと呼ぶ。ここで、並列実行型サービス実行部１３によって並行して実行される並列実行型サービスＰＳＶＣの実行数、つまりサービス数（＝ノード数）は、後述するサービスチケット値をもとにクラスタ制御機構１２内のサービス最適配置機構１２１によって決定される。図１では、並列実行型サービスＰＳＶＣの実行数（サービス数）が２の場合が示されている。ここではサービスＰＳＶＣの実行に関し、計算機１０-3及び１０-4が稼動状態にあり、計算機１０-1及び１０-2が待機状態にある。つまり、図１では、並列実行型サービスＰＳＶＣが、並列実行型サービス実行部１３によって２つの計算機１０-3及び１０-4上で並行して実行されている。 In the cluster system of FIG. 1, a parallel execution type service execution unit 13 managed by the cluster control mechanism 12 operates. The parallel execution type service execution unit 13 is realized by the computers 10-1 to 10-4 similarly to the cluster control mechanism 12, and exists across the computers 10-1 to 10-4. The parallel execution type service execution unit 13 has a function of executing the service PSVC in parallel on a plurality of computers (nodes) among the computers 10-1 to 10-4. The type of service PSVC that can be executed in parallel by the parallel execution type service execution unit 13 is referred to as a parallel execution type service. Here, the number of executions of the parallel execution type service PSVC executed in parallel by the parallel execution type service execution unit 13, that is, the number of services (= number of nodes) is determined in the cluster control mechanism 12 based on the service ticket value described later. This is determined by the service optimum arrangement mechanism 121. FIG. 1 shows a case where the number of executions (number of services) of the parallel execution service PSVC is two. Here, regarding the execution of the service PSVC, the computers 10-3 and 10-4 are in the operating state, and the computers 10-1 and 10-2 are in the standby state. That is, in FIG. 1, the parallel execution type service PSVC is executed in parallel on the two computers 10-3 and 10-4 by the parallel execution type service execution unit 13.

並列実行型サービスＰＳＶＣ（を実現するアプリケーション）には、静的サービスチケット値ＳＳＴ_PSVCと呼ぶパラメータ値が予め設定されている。この静的サービスチケット値ＳＳＴ_PSVCは、並列実行型サービスＰＳＶＣを計算機１０-i上で実行するのに必要であると予め予測されているリソース量（つまり、サービスＰＳＶＣの静的な負荷）を表す。一方、並列実行型サービス実行部１３には、システム内で最低限並列実行させておくべき並列実行型サービスＰＳＶＣの実行数（以下、最小サービス数Ｎ_minと称する）がユーザの入力操作によって予め設定される。最小サービス数Ｎ_minは、システム内で並列実行型サービスＰＳＶＣを最低限並行して実行する計算機の数（ノード数）をも表す。 A parameter value called a static service ticket value SST _PSVC is set in advance in the parallel execution service PSVC (an application that realizes the service). This static service ticket value SST _PSVC represents a resource amount that is predicted in advance to be necessary for executing the parallel execution type service PSVC on the computer 10-i (that is, a static load of the service PSVC). . On the other hand, the parallel execution type service execution unit 13, the number of execution of the parallel execution type service PSVC should by minimally parallel execution in the system (hereinafter, referred to as the minimum service number N _min) in advance by the input operation of the user settings Is done. The minimum service number N _min also represents the number of computers (the number of nodes) that execute the parallel execution type service PSVC in parallel at the minimum in the system.

並列実行型サービス実行部１３は、計算機１０-1〜１０-4上で動作可能なサービス負荷モニタ１３１-1〜１３１-4を含む。サービス負荷モニタ１３１-i（ｉ＝１〜４）は、計算機１０-iが並列実行型サービスＰＳＶＣの実行状態にある場合だけ動作して、当該計算機１０-iでのサービスＰＳＶＣの実行状態におけるリソースの消費量を計測する。サービス負荷モニタ１３１-iは、計測された現在のリソースの消費量に基づいて計算機１０-iでの並列実行型サービスＰＳＶＣの実行に必要なリソース量を予測して、当該サービスＰＳＶＣの実行に必要なリソース量（つまり、サービスＰＳＶＣの動的な負荷）を表す動的サービスチケット値ＤＳＴ_PSVCiを求める。サービス負荷モニタ１３１-iは、動的サービスチケット値ＤＳＴ_PSVCiをクラスタ制御機構１２に通知する。 The parallel execution type service execution unit 13 includes service load monitors 131-1 to 131-4 operable on the computers 10-1 to 10-4. The service load monitor 131-i (i = 1 to 4) operates only when the computer 10-i is in the execution state of the parallel execution type service PSVC, and the resource in the execution state of the service PSVC in the computer 10-i. Measure consumption. The service load monitor 131-i predicts the resource amount necessary for executing the parallel execution type service PSVC on the computer 10-i based on the measured current resource consumption, and is necessary for executing the service PSVC. A dynamic service ticket value DST _PSVCi representing a large resource amount (that is, a dynamic load of the service PSVC) is _obtained . The service load monitor 131-i notifies the cluster control mechanism 12 of the dynamic service ticket value DST _PSVCi .

計算機１０-1〜１０-4では、ＨＡ型サービスＳＶＣ１を実行するＨＡ型サービス実行部１４１-1〜１４１-4と、ＨＡ型サービスＳＶＣ２を実行するＨＡ型サービス実行部１４２-1〜１４２-4とが動作可能である。ＨＡ型サービス実行部１４１-1〜１４１-4，１４２-1〜１４２-4は、クラスタ制御機構１２によって管理される。 In the computers 10-1 to 10-4, the HA type service execution units 141-1 to 141-4 that execute the HA type service SVC1 and the HA type service execution units 142-1 to 142-4 that execute the HA type service SVC2 are used. Are operable. The HA type service execution units 141-1 to 141-4 and 142-1 to 142-4 are managed by the cluster control mechanism 12.

ＨＡ型サービスは、クラスタ制御機構１２の制御によってフェイルオーバされるサービス（アプリケーション）であり、同一時間帯では、計算機１０-1〜１０-4のうちのいずれか１つの計算機のみで実行が可能なタイプのサービスである。図１では、ＨＡ型サービスＳＶＣ１の実行に関し、計算機１０-1（内のＨＡ型サービス実行部１４１-1）のみが稼動状態にあり、他の計算機１０-2〜１０-4（内のＨＡ型サービス実行部１４１-2〜１４１-4）は待機状態にある。また、図１では、ＨＡ型サービスＳＶＣ２の実行に関し、計算機１０-2（内のＨＡ型サービス実行部１４２-2）のみが稼動状態にあり、他の計算機１０-1，１０-3，１０-4（内のＨＡ型サービス実行部１４２-1，１４２-3，１４２-4）は待機状態にある。 The HA type service is a service (application) that is failed over by the control of the cluster control mechanism 12, and can be executed only by any one of the computers 10-1 to 10-4 in the same time zone. Service. In FIG. 1, regarding execution of the HA type service SVC1, only the computer 10-1 (inside the HA type service execution unit 141-1) is in operation, and the other computers 10-2 to 10-4 (inside HA type) The service execution units 141-2 to 141-4) are in a standby state. In FIG. 1, regarding execution of the HA type service SVC2, only the computer 10-2 (inside the HA type service execution unit 142-2) is in operation, and the other computers 10-1, 10-3, 10- 4 (of which the HA type service execution units 142-1, 142-3, 142-4) are in a standby state.

ＨＡ型サービスＳＶＣ１，ＳＶＣ２（を実現する各アプリケーション）には、静的サービスチケット値ＳＳＴ_SVC1，ＳＳＴ_SVC2が予め設定されている。静的サービスチケット値ＳＳＴ_SVC1，ＳＳＴ_SVC2は、ＨＡ型サービスＳＶＣ１，ＳＶＣ２を計算機１０-i内のＨＡ型サービス実行部１４１-iが実行するのに必要なリソース量を表すパラメータ値である。 Static service ticket values SST _SVC1 and SST _SVC2 are set in advance in the HA type services SVC1 and SVC2 (each application that realizes them). The static service ticket values SST _SVC1 and SST _SVC2 are parameter values representing the amount of resources necessary for the HA type service execution unit 141-i in the computer 10-i to execute the HA type services SVC1 and SVC2.

ＨＡ型サービス実行部１４１-1〜１４１-4，１４２-1〜１４２-4は、それぞれサービス負荷モニタ１５１-1〜１５１-4，１５２-1〜１５２-4を含む。サービス負荷モニタ１５１-i，１５２-i（ｉ＝１〜４）は、計算機１０-i内のＨＡ型サービス実行部１４１-i，１４２-iがＨＡ型サービスＳＶＣ１，ＳＶＣ２の実行状態にある場合だけ動作して、当該計算機１０-iでのサービスＳＶＣ１，ＳＶＣ２の実行状態におけるリソースの消費量を計測する。サービス負荷モニタ１５１-i，１５２-iは、計測された現在のリソースの消費量に基づいて計算機１０-iでのサービスＳＶＣ１，ＳＶＣ２の実行に必要なリソース量を予測して、当該サービスＳＶＣ１，ＳＶＣ２の実行に必要なリソース量（つまり、サービスＳＶＣ１，ＳＶＣ２の動的な負荷）を表す動的サービスチケット値ＤＳＴ_SVC1i，ＤＳＴ_SVC2iを求める。サービス負荷モニタ１５１-i，１５２-iは、動的サービスチケット値ＤＳＴ_SVC1i，ＤＳＴ_SVC2iをクラスタ制御機構１２に通知する。 The HA type service execution units 141-1 to 141-4 and 142-1 to 142-4 include service load monitors 151-1 to 151-4 and 152-1 to 152-4, respectively. The service load monitors 151-i and 152-i (i = 1 to 4) indicate that the HA type service execution units 141-i and 142-i in the computer 10-i are in the execution state of the HA type services SVC1 and SVC2. Only the resource consumption in the execution state of the services SVC1 and SVC2 in the computer 10-i is measured. The service load monitors 151-i and 152-i predict the resource amount necessary for the execution of the services SVC1 and SVC2 in the computer 10-i based on the measured current resource consumption, and the service SVC1, The dynamic service ticket values DST _SVC1i and DST _SVC2i representing the amount of resources necessary for executing the SVC 2 (that is, the dynamic load of the services _{SVC 1} and _{SVC 2)} are obtained. The service load monitors 151-i and 152-i notify the cluster control mechanism 12 of the dynamic service ticket values DST _SVC1i and DST _SVC2i .

計算機１０-1〜１０-4では、それぞれノード負荷モニタ１６-1〜１６-4が動作する。計算機１０-1〜１０-4には、当該計算機（ノード）の処理能力（リソース量）を表す静的ノードチケット値ＳＮＴ₁〜ＳＮＴ₄が予め設定されている。本実施形態では、計算機１０-1〜１０-4が非対称なリソース環境を持っている場合を想定しており、したがって静的ノードチケット値ＳＮＴ₁〜ＳＮＴ₄はそれぞれ異なる。ノード負荷モニタ１６-1〜１６-4は、計算機１０-1〜１０-4上で実行されている全てのサービスのチケット値の総和（以下、総サービスチケット値と称する）ＴＳＴ₁〜ＴＳＴ₄と静的ノードチケット値ＳＮＴ₁〜ＳＮＴ₄とから動的ノードチケット値ＤＮＴ₁〜ＤＮＴ₄を予め定められた検査時期が到来する毎に計算する。動的ノードチケット値ＤＮＴ₁〜ＤＮＴ₄は、計算機１０-1〜１０-4で新たに使用可能なリソース量を表すチケット値である。ノード負荷モニタ１６-1〜１６-4は、動的ノードチケット値ＤＮＴ₁〜ＤＮＴ₄をクラスタ制御機構１２に通知する。 In the computers 10-1 to 10-4, node load monitors 16-1 to 16-4 operate. The computer 10-1 to 10-4, a static node ticket value SNT ₁ ~SNT ₄ representing a processing capability of the computer (node) (resource amount) is set in advance. In the present embodiment, it is assumed that the computer 10-1 to 10-4 has an asymmetric resource environment, thus static nodes ticket value SNT ₁ ~SNT ₄ are different. Node load monitor 16-1 to 16-4, the sum of the ticket value of all services that are running on the computer 10-1 to 10-4 (hereinafter, referred to as total service ticket value) TST ₁ ~TST ₄ The dynamic node ticket values DNT _{1 to} DNT ₄ are calculated from the static node ticket values SNT _{1 to} SNT ₄ every time a predetermined inspection time comes. The dynamic node ticket values DNT _{1 to} DNT ₄ are ticket values representing resource amounts that can be newly used in the computers 10-1 to 10-4. Node load monitor 16-1 to 16-4 notifies the dynamic node ticket value DNT ₁ ~DNT ₄ to the cluster control mechanism 12.

次に、図１のクラスタシステムの動作について説明する。
サービス負荷モニタ１３１-i（ｉ＝１〜４）は、並列実行型サービス実行部１３によって計算機１０-i上で並列実行型サービスＰＳＶＣが実行されている場合に、予め定められた検査時期が到来する毎に、例えば定期的に動作する。そしてサービス負荷モニタ１３１-iは、計算機１０-iでのサービスＰＳＶＣの実行状態におけるリソースの消費量を計測する。サービス負荷モニタ１３１-iは、計測された現在のリソースの消費量に基づいて計算機１０-iでの並列実行型サービスＰＳＶＣの実行に必要なリソース量の予測値を表す動的サービスチケット値ＤＳＴ_PSVCiを計算する。この動的サービスチケット値ＤＳＴ_PSVCiの計算には、３つの予測関数ｆ（ｘ），ｇ（ｙ），ｈ（ｚ）が用いられる。予測関数ｆ（ｘ），ｇ（ｙ），ｈ（ｚ）は、それぞれ計算機１０-iにおける３種類のリソースの消費量、例えばＣＰＵ使用量ｘ、メモリ使用量ｙ、応答時間ｚ（クライアント端末からのサービスｓの実行要求を受け取ってから実行結果の応答を返すまでの時間）の関数である。ここでは、動的サービスチケット値ＤＳＴ_PSVCiは、次式
ＤＳＴ_si＝ｆ（ｘ）＋ｇ（ｙ）＋ｈ（ｚ）（１）
に従って算出される。但し、ｓ＝ＰＳＶＣである。サービス負荷モニタ１３１-iによって算出された動的サービスチケット値ＤＳＴ_PSVCiはクラスタ制御機構１２に通知される。 Next, the operation of the cluster system of FIG. 1 will be described.
The service load monitor 131-i (i = 1 to 4) receives a predetermined inspection time when the parallel execution service PSVC is executed on the computer 10-i by the parallel execution service execution unit 13. For example, it operates periodically, for example. The service load monitor 131-i measures the resource consumption in the execution state of the service PSVC in the computer 10-i. The service load monitor 131-i is a dynamic service ticket value DST _PSVCi that represents a predicted value of the resource amount necessary for executing the parallel execution service PSVC in the computer 10-i based on the measured current resource consumption. Calculate Three prediction functions f (x), g (y), and h (z) are used for the calculation of the dynamic service ticket value DST _PSVCi . The prediction functions f (x), g (y), and h (z) are consumption amounts of three types of resources in the computer 10-i, for example, CPU usage x, memory usage y, response time z (from the client terminal). The time from receiving the execution request of the service s until the response of the execution result is returned). Here, the dynamic service ticket value DST _PSVCi is _expressed by the following equation: DST _si = f (x) + g (y) + h (z) (1)
Is calculated according to However, s = PSVC. The dynamic service ticket value DST _PSVCi calculated by the service load monitor 131-i is notified to the cluster control mechanism 12.

同様に、サービス負荷モニタ１５１-i，１５２-iは、計算機１０-i（内のＨＡ型サービス実行部１４１-i，１４２-i）がＨＡ型サービスＳＶＣ１，ＳＶＣ２の実行状態にある場合、予め定められた検査時期が到来する毎に、例えば定期的に動作する。そしてサービス負荷モニタ１５１-i，１５２-iは、計算機１０-iでのサービスＳＶＣ１，ＳＶＣ２の実行に必要なリソース量の予測値を表す動的サービスチケット値ＤＳＴ_SVC1i，ＤＳＴ_SVC2iを、現在のリソースの消費量に基づいて上記（１）式に従って算出する。ここでは、ｓ＝ＳＶＣ１，ｓ＝ＳＶＣ２である。サービス負荷モニタ１５１-i，１５２-iによって算出された動的サービスチケット値ＤＳＴ_SVC1i，ＤＳＴ_SVC2iはクラスタ制御機構１２に通知される。 Similarly, the service load monitors 151-i and 152-i are preliminarily stored when the computer 10-i (within the HA-type service execution units 141-i and 142-i) is in the execution state of the HA-type services SVC1 and SVC2. Each time a predetermined inspection time arrives, for example, it operates periodically. The service load monitors 151-i and 152-i use the dynamic service ticket values DST _SVC1i and DST _SVC2i representing the predicted values of the resource amounts necessary for executing the services SVC1 and SVC2 in the computer 10-i as current resources. It is calculated according to the above formula (1) based on the consumption amount. Here, s = SVC1 and s = SVC2. The dynamic service ticket values DST _SVC1i and DST _SVC2i calculated by the service load monitors 151-i and 152-i are notified to the cluster control mechanism 12.

次に、ノード負荷モニタ１６-i（ｉ＝１〜４）による動的ノードチケット値ＤＮＴ_iの算出動作について、図２のフローチャートを参照して説明する。
ノード負荷モニタ１６-iは、計算機１０-i上で実行中の全てのサービスｓについて、サービスチケット値ＳＴ_siを、次式
サービスチケット値ＳＴ_si
＝ＭＡＸ（静的サービスチケット値ＳＳＴ_s，動的サービスチケット値ＤＳＴ_si）
（２）
に従って求める（ステップＳ１，Ｓ２）。サービスチケット値ＳＴ_siは、計算機１０-i上で実行中のサービスｓで消費されているリソース量または消費されると予測されるリソース量のうちの最大値、つまりサービスｓで消費される可能性のある最大のリソース量を表す。 Next, the calculation operation of the dynamic node ticket value DNT _i by the node load monitor 16-i (i = 1 to 4) will be described with reference to the flowchart of FIG.
The node load monitor 16-i obtains the service ticket value ST _si for all the services s being executed on the computer 10-i by the following formula: service ticket value ST _si
= MAX (static service ticket value SST _s , dynamic service ticket value DST _si )
(2)
(Steps S1, S2). The service ticket value ST _si is the maximum value of the amount of resources consumed or predicted to be consumed by the service s being executed on the computer 10-i, that is, the possibility of being consumed by the service s. Represents the maximum resource amount with

ノード負荷モニタ１６-iは、計算機１０-i上で実行中の全てのサービスｓのサービスチケット値ＳＴ_siを求めると、当該サービスチケット値ＳＴ_siの総和、つまり総サービスチケット値ＴＳＴ_iを、次式
総サービスチケット値ＴＳＴ_i
＝Σ（計算機１０-iで実行中のサービスのサービスチケット値ＳＴ_si）（３）
に従って算出する（ステップＳ３）。総サービスチケット値ＴＳＴ_iは、計算機１０-i上で実行中のサービス全体で消費される可能性のある最大のリソース量、つまり計算機１０-i全体の負荷（ノード負荷）を表す。 Node load monitor 16-i, when obtaining the service ticket value ST _si for all services s running on the computer 10-i, the sum of the service ticket value ST _si, that is, the total service ticket value TST _i, following Formula Total service ticket value TST _i
= Σ (service ticket value ST _si of the service being executed on the computer 10-i) (3)
(Step S3). The total service ticket value TST _i represents the maximum resource amount that can be consumed by the entire service being executed on the computer 10-i, that is, the load (node load) of the entire computer 10-i.

ノード負荷モニタ１６-iは、計算機１０-i上で実行中の全てのサービスｓの総サービスチケット値ＴＳＴ_iを求めると、その時点において計算機１０-iで新たに使用可能なリソース量を表すチケット値、つまり動的ノードチケット値ＤＮＴ_iを、次式
動的ノードチケット値ＤＮＴ_i
＝（静的ノードチケット値ＳＮＴ_i）−（総サービスチケット値ＴＳＴ_i）（４）
に従って算出する（ステップＳ４）。
このように、動的ノードチケット値ＤＮＴ_iは、静的ノードチケット値ＳＮＴ_iから総サービスチケット値ＴＳＴ_iを差し引くことにより算出される。ノード負荷モニタ１６-iは、以上の処理（ステップＳ１〜Ｓ４）を、定期的に（つまり一定時間間隔で）繰り返す。 When the node load monitor 16-i obtains the total service ticket value TST _i of all services s being executed on the computer 10-i, the ticket representing the amount of resources that can be newly used by the computer 10-i at that time. Value, that is, the dynamic node ticket value DNT _i is expressed by the following formula: dynamic node ticket value DNT _i
= (Static node ticket value SNT _i ) − (Total service ticket value TST _i ) (4)
(Step S4).
Thus, the dynamic node ticket value DNT _i is calculated by subtracting the total service ticket value TST _i from the static node ticket value SNT _i . The node load monitor 16-i repeats the above processing (steps S1 to S4) periodically (that is, at regular time intervals).

次に、並列実行型サービス実行部１３により並行して実行される並列実行型サービスＰＳＶＣの実行数（サービス数）を、クラスタ制御機構１２内のサービス最適配置機構１２１が最適な数に調整する動作について、図３のフローチャートを参照して説明する。 Next, the operation for adjusting the number of executions (number of services) of the parallel execution type service PSVC executed in parallel by the parallel execution type service execution unit 13 to the optimum number by the service optimal arrangement mechanism 121 in the cluster control mechanism 12 Will be described with reference to the flowchart of FIG.

サービス最適配置機構１２１は、システム内の各計算機（ノード）１０-iにおける並列実行型サービスＰＳＶＣの動的サービスチケット値ＤＳＴ_PSVCi、当該サービスＰＳＶＣの静的サービスチケット値ＳＳＴ_PSVC及び当該サービスＰＳＶＣの最小サービス数Ｎ_minをもとに、現在システムで必要な並列実行型サービスＰＳＶＣの実行数（以下、最適サービス数と称する）ＯＳＮを次のように算出する（ステップＳ１１）。まずサービス最適配置機構１２１は、各計算機１０-1〜１０-4での動的サービスチケット値ＤＳＴ_PSVC1〜ＤＳＴ_PSVC4の総和、つまり総動的サービスチケット値ＴＤＳＴを、次式
総動的サービスチケット値ＴＤＳＴ
＝Σ（計算機１０-iの動的サービスチケット値ＤＳＴ_PSVCi）（５）
に従って算出する（ステップＳ１１ａ）。 The service optimum arrangement mechanism 121 includes the dynamic service ticket value DST _PSVCi of the parallel execution type service PSVC, the static service ticket value SST _PSVC of the service PSVC, and the minimum of the service PSVC in each computer (node) 10-i in the system. Based on the service number N _min , the execution number (hereinafter referred to as the optimum service number) OSN of the parallel execution type service PSVC necessary for the current system is calculated as follows (step S11). First International optimal deployment mechanism 121, the sum of the dynamic service ticket value DST _PSVC1 ~DST _PSVC4 at each computer 10-1 to 10-4, that is, the total dynamic service ticket value TDST, the following equation total dynamic service ticket value TDST
= Σ (Dynamic service ticket value DST _PSVCi of computer 10-i) (5)
(Step S11a).

次にサービス最適配置機構１２１は、総動的サービスチケット値ＴＤＳＴと並列実行型サービスＰＳＶＣの静的サービスチケット値ＳＳＴ_PSVCとをもとに、現在必要な並列実行型サービスＰＳＶＣの実行数を、仮サービス数ＴＳＮとして次式
仮サービス数ＴＳＮ
＝（総動的サービスチケット値ＴＤＳＴ／静的サービスチケット値ＳＳＴ_PSVC）
の整数部（余りが０の場合）
または
仮サービス数ＴＳＮ
＝（総動的サービスチケット値ＴＤＳＴ／静的サービスチケット値ＳＳＴ_PSVC）
の整数部＋１（余りが０でない場合）（６）
に従って算出する（ステップＳ１１ｂ）。 Next, the service optimum arrangement mechanism 121 determines the number of executions of the currently required parallel execution service PSVC based on the total dynamic service ticket value TDST and the static service ticket value SST _PSVC of the parallel execution service PSVC. The following formula is used as the number of services TSN.
= (Total dynamic service ticket value TDST / Static service ticket value SST _PSVC )
Integer part (when remainder is 0)
Or provisional service number TSN
= (Total dynamic service ticket value TDST / Static service ticket value SST _PSVC )
Integer part of +1 (when remainder is not 0) (6)
(Step S11b).

次にサービス最適配置機構１２１は、仮サービス数ＴＳＮと予め設定されている最小サービス数Ｎ_minとをもとに、最適サービス数ＯＳＮを次式
最適サービス数ＯＳＮ
＝ＭＡＸ（仮サービス数ＴＳＮ，最小サービス数Ｎ_min）（７）
に従って算出する（ステップＳ１１ｃ）。 Next, the service optimum arrangement mechanism 121 calculates the optimum service number OSN based on the provisional service number TSN and the preset minimum service number N _min as follows:
= MAX (temporary service number TSN, minimum service number N _min ) (7)
(Step S11c).

次にサービス最適配置機構１２１は、最適サービス数ＯＳＮを、現在並列実行型サービス実行部１３によって実行されている並列実行型サービスＰＳＶＣの実行数（以下、現サービス数と称する）ＣＳＮと比較し、最適サービス数ＯＳＮが現サービス数ＣＳＮより多いならば（ステップＳ１２）、システム内の計算機１０-1〜１０-4の中に新たに並列実行型サービスＰＳＶＣを実行可能な計算機１０-j（ｊは１〜４のいずれか）があるか否かを判定する（ステップＳ１３）。もし、サービスＰＳＶＣを実行可能な計算機１０-jがあるならば、サービス最適配置機構１２１は、当該計算機１０-jのうちの、静的ノードチケット値ＳＮＴ_jと動的ノードチケット値ＤＮＴ_jとの差が最も大きい計算機を選択し、その選択された計算機で当該サービスＰＳＶＣを実行させる（ステップＳ１４）。その後、サービス最適配置機構１２１はステップＳ１１に戻る。つまりサービス最適配置機構１２１は、サービスＰＳＶＣを実行可能な計算機１０-jの中から、当該サービスＰＳＶＣを実行させる計算機を静的ノードチケット値ＳＮＴ_jと動的ノードチケット値ＤＮＴ_jとの差が大きい順に選択して、その選択された計算機で当該サービスＰＳＶＣを開始させる動作を、最適サービス数ＯＳＮが現サービス数ＣＳＮに達するようになるまで繰り返す。一方、サービスＰＳＶＣを実行可能な計算機１０-jがないならば（ステップＳ１３）、サービス最適配置機構１２１は一定時間待った（スリープした）後に（ステップＳ１５）、ステップＳ１１に戻る。 Next, the service optimal arrangement mechanism 121 compares the optimal service number OSN with the execution number (hereinafter referred to as the current service number) CSN of the parallel execution type service PSVC currently being executed by the parallel execution type service execution unit 13. If the optimum service number OSN is larger than the current service number CSN (step S12), the computer 10-j (j is a computer that can newly execute the parallel execution type service PSVC among the computers 10-1 to 10-4 in the system). It is determined whether there is any one of 1-4) (step S13). If there is a computer 10-j capable of executing the service PSVC, the service optimal arrangement mechanism 121 determines whether the static node ticket value SNT _j and the dynamic node ticket value DNT _j out of the computer 10-j. The computer having the largest difference is selected, and the service PSVC is executed by the selected computer (step S14). Thereafter, the service optimum arrangement mechanism 121 returns to step S11. That is, the service optimal arrangement mechanism 121 has a large difference between the static node ticket value SNT _j and the dynamic node ticket value DNT _j among the computers 10-j that can execute the service PSVC. The operation of sequentially selecting and starting the service PSVC by the selected computer is repeated until the optimum service number OSN reaches the current service number CSN. On the other hand, if there is no computer 10-j that can execute the service PSVC (step S13), the service optimum arrangement mechanism 121 waits for a certain period of time (sleeps) (step S15), and then returns to step S11.

また、サービス最適配置機構１２１は、最適サービス数ＯＳＮが現サービス数ＣＳＮより少ないならば（ステップＳ１６）、システム内の計算機１０-1〜１０-4の中に並列実行型サービスＰＳＶＣを停止可能な計算機１０-j（ｊは１〜４のいずれか）があるか否かを判定する（ステップＳ１７）。もし、サービスＰＳＶＣを停止可能な計算機１０-jがあるならば、サービス最適配置機構１２１は、当該計算機１０-jのうちの、静的ノードチケット値ＳＮＴ_jと動的ノードチケット値ＤＮＴ_jとの差が最も小さい計算機を選択し、その選択された計算機における当該サービスＰＳＶＣの実行を停止させる（ステップＳ１８）。その後、サービス最適配置機構１２１はステップＳ１１に戻る。つまりサービス最適配置機構１２１は、サービスＰＳＶＣを停止可能な計算機１０-jの中から、当該サービスＰＳＶＣを停止させる計算機を静的ノードチケット値ＳＮＴ_jと動的ノードチケット値ＤＮＴ_jとの差が小さい順に選択して、その選択された計算機における当該サービスＰＳＶＣの実行を停止させる動作を、最適サービス数ＯＳＮが現サービス数ＣＳＮに達するようになるまで繰り返す。一方、サービスＰＳＶＣを停止可能な計算機１０-jがないならば（ステップＳ１７）、サービス最適配置機構１２１は一定時間待った後に（ステップＳ１５）、ステップＳ１１に戻る。また、最適サービス数ＯＳＮが現サービス数ＣＳＮに一致しているならば、サービス最適配置機構１２１は一定時間待った後に（ステップＳ１５）、ステップＳ１１に戻る。 Further, if the optimum service number OSN is smaller than the current service number CSN (step S16), the service optimum arrangement mechanism 121 can stop the parallel execution type service PSVC in the computers 10-1 to 10-4 in the system. It is determined whether or not there is a computer 10-j (j is any one of 1 to 4) (step S17). If there is a computer 10-j capable of stopping the service PSVC, the service optimum arrangement mechanism 121 determines whether the static node ticket value SNT _j and the dynamic node ticket value DNT _j out of the computer 10-j. The computer with the smallest difference is selected, and the execution of the service PSVC in the selected computer is stopped (step S18). Thereafter, the service optimum arrangement mechanism 121 returns to step S11. That is, the service optimal arrangement mechanism 121 has a small difference between the static node ticket value SNT _j and the dynamic node ticket value DNT _j among the computers 10-j that can stop the service PSVC. The operation of sequentially selecting and stopping the execution of the service PSVC in the selected computer is repeated until the optimum service number OSN reaches the current service number CSN. On the other hand, if there is no computer 10-j that can stop the service PSVC (step S17), the service optimum arrangement mechanism 121 waits for a certain time (step S15), and then returns to step S11. If the optimum service number OSN matches the current service number CSN, the service optimum arrangement mechanism 121 waits for a predetermined time (step S15) and then returns to step S11.

このように本実施形態においては、図１のクラスタシステム（計算機システム）内で並行して実行される並列実行型サービスＰＳＶの最適な実行数を表す最適サービス数ＯＳＮが、当該システム内の各計算機１０-iにおける並列実行型サービスＰＳＶＣの動的サービスチケット値ＤＳＴ_PSVCi、当該サービスＰＳＶＣの静的サービスチケット値ＳＳＴ_PSVC及び当該サービスＰＳＶＣの最小サービス数Ｎ_minをもとに算出される。そして、算出された最適サービス数ＯＳＮと現サービス数ＣＳＮ（その時点においてシステム内で実際に実行されている並列実行型サービスＰＳＶの実行数）との大小関係に応じて、並列実行型サービスの実行数が調整される。これにより、本実施形態のように、図１の計算機システムがクラスタシステムであり、且つ当該システム内の計算機１０-1乃至１０-4が有するリソース環境が非対称であったとしても、当該システムで並行して実行される並列実行型サービスの実行数を最適な数に調整できる。 As described above, in this embodiment, the optimum service number OSN representing the optimum number of executions of the parallel execution service PSV executed in parallel in the cluster system (computer system) of FIG. It is calculated based on the dynamic service ticket value DST _PSVCi of the parallel execution type service PSVC in 10-i, the static service ticket value SST _PSVC of the service PSVC, and the minimum service number N _min of the service PSVC. Then, the execution of the parallel execution service is performed according to the magnitude relationship between the calculated optimum service number OSN and the current service number CSN (the number of executions of the parallel execution service PSV actually executed in the system at that time). The number is adjusted. Thus, as in the present embodiment, even if the computer system of FIG. 1 is a cluster system and the resource environments of the computers 10-1 to 10-4 in the system are asymmetric, It is possible to adjust the number of parallel-execution services that are executed in an optimal number.

なお、図１のシステムでは、実行可能な並列実行型サービスの種類がＰＳＶＣの１種類である場合を想定している。しかし、実行可能な並列実行型サービスを複数種類とすることも可能である。この場合、並列実行型サービスの種類毎に、最適サービス数ＯＳＮを求めれば良い。 In the system of FIG. 1, it is assumed that the type of executable parallel execution service is one type of PSVC. However, a plurality of types of parallel execution services that can be executed are also possible. In this case, the optimum service number OSN may be obtained for each type of parallel execution service.

次に、サービス最適配置機構１２１によるＨＡ型サービスまたは並列実行型サービスの最適配置について、図４のフローチャートを参照して説明する。
サービス最適配置機構１２１は、計算機１０-1〜１０-4の中から、（動的ノードチケット値ＤＮＴ_j−Δ）の値が一定値以下となる計算機１０-j、つまり動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jを検索する（ステップＳ２１）。ここで、Δは、動的ノードチケット値ＤＮＴ_jが一定値以下でなくても、当該ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jを検索するためのマージンである。なお、本実施形態では、上記一定値は零である。また、動的ノードチケット値ＤＮＴ_jが一定値未満となる可能性のある計算機１０-jを検索する構成としても構わない。 Next, the optimum arrangement of the HA type service or the parallel execution type service by the service optimum arrangement mechanism 121 will be described with reference to the flowchart of FIG.
The service optimal arrangement mechanism 121 is a computer 10-j in which the value of (dynamic node ticket value DNT _j -Δ) is equal to or less than a predetermined value among the computers 10-1 to 10-4, that is, the dynamic node ticket value DNT. _The computer 10-j in which j may be a certain value or less is searched (step S21). Here, delta is also not less than a predetermined value is the dynamic node ticket value DNT _j, a margin for the DNT _j retrieves the computer 10-j that can be a fixed value or less. In the present embodiment, the constant value is zero. In addition, a configuration may be adopted in which the computer 10-j in which the dynamic node ticket value DNT _j may be less than a certain value is searched.

もし、動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jがないならば（ステップＳ２１）、サービス最適配置機構１２１は、一定時間を待った（スリープした）後（ステップＳ２２）、ステップＳ２１に戻る。なお、計算機障害等のイベントが発生した場合には、サービス最適配置機構１２１は、一定時間を待つことなくステップＳ２１に戻る。 If there is no computer 10-j that may cause the dynamic node ticket value DNT _{j to} become a certain value or less (step S21), the service optimum arrangement mechanism 121 waits for a certain time (sleeps) (step S21). S22), the process returns to step S21. When an event such as a computer failure occurs, the service optimal arrangement mechanism 121 returns to step S21 without waiting for a certain time.

これに対し、動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jがあるならば（ステップＳ２１）、サービス最適配置機構１２１は、当該計算機１０-jの中から、最も優先度の低いサービスｓを実行中の計算機と当該サービスｓとを選択する（ステップＳ２３）。次にサービス最適配置機構１２１は、選択されたサービスｓをシステム内の他の計算機にスイッチオーバ（引き継ぎ）可能であるか否かを判定する（ステップＳ２４）。本実施形態では、スイッチオーバ可能なサービスは予め定められている。つまり本実施形態では、サービス毎に、スイッチオーバ可能か否かが予め設定されている。この場合、ステップＳ２４の判定は、選択されたサービスｓが、スイッチオーバ可能なサービスとして予め定められているかを判定することによって実現される。なお、サービスｓの実行状況、例えばサービスｓがクリティカル領域の処理中であるかによって、スイッチオーバ可能であるかが判定される構成とすることも可能である。クリティカル領域の処理とは、例えば、応答性能が要求される処理、或いは一貫性（原始性）が要求される処理で、後戻り処理にコストがかかる処理である。具体例としては、トランザクション処理及びデータベース更新処理が挙げられる。 On the other hand, if there is a computer 10-j in which the dynamic node ticket value DNT _j may be equal to or less than a certain value (step S21), the service optimum arrangement mechanism 121 may select from among the computers 10-j. The computer executing the service s with the lowest priority and the service s are selected (step S23). Next, the service optimal arrangement mechanism 121 determines whether or not the selected service s can be switched over to another computer in the system (step S24). In the present embodiment, services that can be switched over are determined in advance. That is, in this embodiment, whether or not switchover is possible is set in advance for each service. In this case, the determination in step S24 is realized by determining whether the selected service s is predetermined as a service that can be switched over. Note that it is possible to determine whether the switchover is possible depending on the execution status of the service s, for example, whether the service s is in the critical area. The critical area process is, for example, a process that requires response performance or a process that requires consistency (primitiveness), and is a process that requires a cost for backtracking. Specific examples include transaction processing and database update processing.

まず、選択されたサービスｓが他の計算機にスイッチオーバ可能であるものとする。また、サービスｓを実行可能な計算機が計算機１０-k（ｋは１〜４のいずれか）であるものとする。この場合、サービス最適配置機構１２１は、選択されたサービスｓを実行可能な計算機１１-kの中から、当該サービスｓをスイッチオーバするのに最適な計算機を次のように検索する（ステップＳ２５）。即ちサービス最適配置機構１２１は、サービスｓを実行可能な各計算機１０-kの動的ノードチケット値ＤＮＴ_kと、静的サービスチケット値ＳＳＴ_s及び動的サービスチケット値ＤＳＴ_skのうちの大きい方の値、つまりＭＡＸ（ＳＳＴ_s，ＤＳＴ_sk）とをもとに、ＤＮＴ_kがＭＡＸ（ＳＳＴ_s，ＤＳＴ_sk）を超える計算機１０-kを検索する。もし、複数の計算機１０-kが検索された場合、サービス最適配置マシン１２１は、当該複数の計算機１０-kの１つをサービスｓをスイッチオーバするのに最適な計算機として選択する。ここでは、動的ノードチケット値ＤＮＴ_kが最も大きい計算機１０-kを最適な計算機として選択すると良い。また、ＭＡＸ（ＳＳＴ_s，ＤＳＴ_sk）を超えるＤＮＴ_kのうち、ＭＡＸ（ＳＳＴ_s，ＤＳＴ_sk）に最も近いＤＮＴ_kを持つ計算機１０-kが選択される構成としても良い。 First, it is assumed that the selected service s can be switched over to another computer. Further, it is assumed that the computer capable of executing the service s is the computer 10-k (k is any one of 1 to 4). In this case, the service optimum arrangement mechanism 121 searches the computer 11-k capable of executing the selected service s for the optimum computer for switching over the service s as follows (step S25). . That is, the service optimal arrangement mechanism 121 has a larger one of the dynamic node ticket value DNT _k , the static service ticket value SST _s, and the dynamic service ticket value DST _sk of each computer 10-k that can execute the service s. value, i.e. MAX (SST _s, DST _sk) on the basis of the, DNT _k searches for computer 10-k of greater than _{_{MAX (SST s, DST sk)}} . If a plurality of computers 10-k are searched, the service optimum arrangement machine 121 selects one of the plurality of computers 10-k as an optimum computer for switching over the service s. Here, the computer 10-k having the largest dynamic node ticket value DNT _k may be selected as the optimum computer. _{_{Furthermore, MAX (SST s, DST sk}} ) of the DNT _k _{_{exceeding, MAX (SST s, DST sk}} ) computer 10-k having the closest DNT _k in may be configured to be selected.

サービス最適配置機構１２１は、選択されたサービスｓをスイッチオーバするのに最適な計算機が検索できた場合（ステップＳ２６）、当該最適な計算機で当該サービスｓの実行を開始させ（ステップＳ２７）、しかる後にステップＳ２１に戻る。これに対し、最適な計算機が検索できなかった場合（ステップＳ２６）、サービス最適配置機構１２１は、動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jの中から、次に優先度の低いサービスｓを実行中の計算機と当該サービスｓとを選択する（ステップＳ２８）。その後、サービス最適配置機構１２１はステップＳ２４に戻る。 When the optimal computer for switching over the selected service s can be searched (step S26), the service optimal arrangement mechanism 121 starts execution of the service s with the optimal computer (step S27). Later, the process returns to step S21. On the other hand, when the optimal computer cannot be searched (step S26), the service optimal arrangement mechanism 121 determines that the dynamic node ticket value DNT _j may be a certain value or less from among the computers 10-j. Next, the computer that is executing the service s with the lower priority and the service s are selected (step S28). Thereafter, the service optimum arrangement mechanism 121 returns to step S24.

一方、選択されたサービスｓが他の計算機にスイッチオーバ可能でないならば（ステップＳ２４）、サービス最適配置機構１２１は、当該選択されたサービスｓが停止可能であるか否かを判定する（ステップＳ２９）。本実施形態では、停止可能なサービスは予め定められている。つまり本実施形態では、サービス毎に、停止可能か否かが予め設定されている。なお、サービスｓの実行状況によって、停止可能であるかが判定される構成とすることも可能である。 On the other hand, if the selected service s cannot be switched over to another computer (step S24), the service optimum arrangement mechanism 121 determines whether or not the selected service s can be stopped (step S29). ). In the present embodiment, services that can be stopped are determined in advance. That is, in this embodiment, whether or not the service can be stopped is set in advance for each service. Note that it is possible to determine whether the service can be stopped depending on the execution status of the service s.

もし、選択されたサービスｓが停止可能であるならば（ステップＳ２９）、サービス最適配置機構１２１は、当該選択されたサービスｓを停止させ（ステップＳ３０）、しかる後にステップＳ２１に戻る。これに対し、選択されたサービスｓが停止可能でないならば（ステップＳ２９）、サービス最適配置機構１２１は、動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jの中から、次に優先度の低いサービスｓを実行中の計算機と当該サービスｓとを選択する（ステップＳ３１）。その後、サービス最適配置機構１２１はステップＳ２４に戻る。 If the selected service s can be stopped (step S29), the service optimal arrangement mechanism 121 stops the selected service s (step S30), and then returns to step S21. On the other hand, if the selected service s cannot be stopped (step S29), the service optimal arrangement mechanism 121 determines that the dynamic node ticket value DNT _j is in the computer 10-j that may be a certain value or less. Then, the computer executing the service s having the next lowest priority and the service s are selected (step S31). Thereafter, the service optimum arrangement mechanism 121 returns to step S24.

このように本実施形態においては、動的ノードチケット値ＤＮＴ_jが一定値以下となる可能性のある計算機１０-jで実行中のサービスｓを、動的ノードチケット値ＤＮＴ_kが、静的サービスチケット値ＳＳＴ_s及び動的サービスチケット値ＤＳＴ_skのうちの大きい方の値を超えている計算機１０-kで実行させることができる。これにより最適な負荷分散が実現される。即ち本実施形態においては、計算機の障害、或いは大幅なサービス負荷またはノード負荷の変化が発生した場合、サービス最適配置マシン１２１によりサービスの再配置が自動的に行われる。 As described above, in the present embodiment, the service s being executed by the computer 10-j in which the dynamic node ticket value DNT _j may be equal to or less than a certain value is represented by the dynamic node ticket value DNT _k being a static service. It can be executed by the computer 10-k that exceeds the larger one of the ticket value SST _s and the dynamic service ticket value DST _sk . Thereby, optimal load distribution is realized. That is, in the present embodiment, when a computer failure or a significant service load or node load change occurs, the service relocation is automatically performed by the service optimum arrangement machine 121.

なお、図４のフローチャートには、ステップＳ２４，Ｓ２５，Ｓ２６及びＳ２８のループを繰り返しても、選択されたサービスをスイッチオーバするのに最適な計算機がなかった場合については示されていない。同様に、図４には、ステップＳ２４，Ｓ２９及びＳ３１のループを繰り返しても、停止可能なサービスがなかった場合については示されていない。このような場合、予め定められたユーザの設定に応じて、例えば他のサービスのスイッチオーバ／停止を行うようにしても良い。また、最適な計算機がなかった場合には、最適な計算機が出現するまで、選択されたサービスを停止するか、或いは何もしないようにしても良い。 Note that the flowchart of FIG. 4 does not show a case where there is no optimal computer for switching over the selected service even if the loop of steps S24, S25, S26 and S28 is repeated. Similarly, FIG. 4 does not show a case where there is no service that can be stopped even if the loop of steps S24, S29, and S31 is repeated. In such a case, for example, other services may be switched over / stopped according to a predetermined user setting. When there is no optimal computer, the selected service may be stopped or nothing may be performed until the optimal computer appears.

上記実施形態においては、ＨＡ型サービスだけでなく並列実行型サービスが実行可能なクラスタシステムを想定している。しかし本発明はクラスタシステムに限らず、ＨＡ型サービスは実行できないものの並列実行型サービスが実行可能な計算機システム（負荷分散システム）にも適用可能である。 The above embodiment assumes a cluster system that can execute not only HA type services but also parallel execution type services. However, the present invention is not limited to a cluster system, and can be applied to a computer system (load distribution system) capable of executing a parallel execution type service although it cannot execute an HA type service.

なお、本発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment.

本発明の一実施形態に係るクラスタシステムの構成を示すブロック図。1 is a block diagram showing a configuration of a cluster system according to an embodiment of the present invention. 同実施形態におけるノード負荷モニタ１６-i（ｉ＝１〜４）による動的ノードチケット値ＤＮＴ_iの算出手順を示すフローチャート。9 is a flowchart showing a procedure for calculating a dynamic node ticket value DNT _i by the node load monitor 16-i (i = 1 to 4) in the embodiment. 同実施形態で適用される並列実行型サービスＰＳＶＣの実行数（サービス数）を調整する動作の手順を示すフローチャート。6 is an exemplary flowchart showing an operation procedure for adjusting the number of executions (number of services) of a parallel execution service PSVC applied in the embodiment; 同実施形態で適用されるサービス（ＨＡ型サービスまたは並列実行型サービス）の最適配置のための動作の手順を示すフローチャート。6 is a flowchart showing an operation procedure for optimal arrangement of services (HA type service or parallel execution type service) applied in the embodiment.

Explanation of symbols

１０-1〜１０-4…計算機、１１-1〜１１-4…ＯＳ（オペレーティングシステム）、１２…クラスタ制御機構、１３…並列実行型サービス実行部、１６-1〜１６-4…ノード負荷モニタ（ノード負荷計測手段）、２０…ネットワーク、１２１…サービス最適配置機構、１２２…サービス制御機構、１３１-1〜１３１-4，１５１-1〜１５１-4，１５２-1〜１５２-4…サービス負荷モニタ（サービス負荷計測手段）。 10-1 to 10-4 ... computers, 11-1 to 11-4 ... OS (operating system), 12 ... cluster control mechanism, 13 ... parallel execution type service execution unit, 16-1 to 16-4 ... node load monitor (Node load measuring means), 20 ... network, 121 ... service optimum arrangement mechanism, 122 ... service control mechanism, 131-1 to 131-4, 151-1 to 151-4, 152-1 to 152-4 ... service load Monitor (service load measuring means).

Claims

In a computer system equipped with multiple computers and capable of executing multiple types of services,
Parallel execution service execution means for executing a predetermined parallel execution service in parallel on at least two of the plurality of computers;
Service load measuring means for measuring a service load executed on each of the plurality of computers as a service load, based on the amount of resources consumed for executing the service on each of the plurality of computers, Service load measuring means including dynamic service ticket value calculating means for calculating a dynamic service ticket value representing a dynamic load of a service as the service load;
Node load measuring means for measuring the load of each of the plurality of computers as a node load, the dynamic service of the service being executed by each of the plurality of computers calculated by the dynamic service ticket value calculating means Based on the ticket value, total service ticket value calculating means for calculating the total service ticket value representing the node load of each of the plurality of computers, and the plurality of computers calculated by the total service ticket value calculating means Based on each total service ticket value and a static node ticket value representing the processing capability of each of the plurality of computers set in advance, a ticket value newly usable in each of the plurality of computers is moved. A node load measuring means including a dynamic node ticket value calculating means for calculating a dynamic node ticket value;
Based on the measurement results of the service load measuring means and the node load measuring means, the optimum computer for service execution and the service to be relocated to the optimum computer are determined from the plurality of computers, and the optimum computer is determined. Service relocation means for relocating the service to a service that is relocated to the optimal computer from among the services being executed on a computer whose dynamic node ticket value may be a predetermined value or less A selection means for selecting the system , a search means for searching for the optimum computer to be relocated of the service selected by the selection means based on a dynamic node ticket value of each of the plurality of computers, and the system The optimal number of services representing the optimal number of executions of the parallel execution type service executed in parallel in each of the plurality of computers. Column execution type total dynamic service ticket value is the sum of the dynamic service ticket value of the service, and the amount of resources is pre predicted to be required to perform the parallel execution type service in each of the plurality of computers Based on a static service ticket value that represents the optimal dynamic service ticket value calculated by dividing the total dynamic service ticket value by the static service ticket value, and an optimal service calculated by the optimal service number calculation unit The parallel execution type service execution unit actually executes the service according to the magnitude relation between the number and the current service number that is the number of executions of the parallel execution type service actually executed in the system at that time. Service optimal arrangement means including service number adjustment means for adjusting the number of executions of parallel execution type services ;
A computer system comprising:

The search means is a resource that is predicted in advance to be necessary for each of the plurality of computers to execute the dynamic node ticket value of each of the plurality of computers and the service selected by the selection means. Based on the static service ticket value representing the quantity and the larger value of the dynamic service ticket value of the selected service being executed on the computer, the larger of the dynamic node ticket values 2. The computer system according to claim 1 , wherein a computer exceeding the value is searched as the optimum computer.

A load balancing method for dynamically performing service load balancing in a computer system having a plurality of computers and capable of executing a plurality of types of services,
Executing a predetermined parallel execution type service in parallel on at least two of the plurality of computers;
Calculating a dynamic service ticket value representing a dynamic load of the service based on an amount of resources consumed for execution of the service in each of the plurality of computers;
Calculating a total service ticket value representing a load of each of the plurality of computers based on the dynamic service ticket value of a service being executed on each of the plurality of computers;
Newly used in each of the plurality of computers based on the total service ticket value of each of the plurality of computers and a static node ticket value representing the processing capability of each of the plurality of computers set in advance. Calculating a dynamic node ticket value, which is a possible ticket value;
Selecting a service to be relocated to a computer that is most suitable for service execution among the plurality of computers from among the services being executed by a computer that may have a dynamic node ticket value that is less than a certain value; ,
Searching for the optimal computer to which the selected service is to be relocated based on the dynamic node ticket value of each of the plurality of computers;
Relocating the selected service to the retrieved optimal computer;
A total dynamic service ticket value that is the sum of dynamic service ticket values of the parallel execution service in each of the plurality of computers, and an optimal number of services representing the optimal execution number of the parallel execution service, and the parallel execution Based on a static service ticket value that represents a resource amount that is predicted in advance to be executed on each of the plurality of computers , the total dynamic service ticket value is represented by the static service ticket value. A step of calculating by dividing ,
The parallel execution actually executed according to the magnitude relationship between the calculated optimum service number and the current service number that is the number of executions of the parallel execution type service actually executed in the system at that time Adjusting the number of executions of type services
A service load distribution method comprising :

In the searching step, the dynamic node ticket value of each of the plurality of computers and a resource amount that is predicted in advance to be necessary to execute the selected service on each of the plurality of computers are calculated. Based on the static service ticket value that represents and the larger value of the dynamic service ticket value of the selected service being executed by the computer, the larger value of the dynamic node ticket value 4. The service load distribution method according to claim 3, wherein more than one computer is searched as the optimum computer.

A parallel execution type service execution means for executing a predetermined parallel execution type service in parallel on at least two of the plurality of computers, and a service load executed on each of the plurality of computers Service load measuring means for measuring as a load, wherein a dynamic service ticket value representing a dynamic load of the service is obtained based on an amount of resources consumed for executing the service in each of the plurality of computers. Service load measuring means including dynamic service ticket value calculating means for calculating as a load, and node load measuring means for measuring each load of the plurality of computers as a node load, wherein the dynamic service ticket value calculating means Based on the calculated dynamic service ticket value of the service being executed on each of the plurality of computers, A total service ticket value calculating means for calculating a total service ticket value representing the node load of each of the plurality of computers; a total service ticket value of each of the plurality of computers calculated by the total service ticket value calculating means; And based on the static node ticket value representing the processing capability of each of the plurality of computers set in advance, a ticket value that can be newly used in each of the plurality of computers is calculated as a dynamic node ticket value. A plurality of computers including a node load measuring unit including a dynamic node ticket value calculating unit , applied to a computer system capable of executing a plurality of types of services, and being executed by each of the plurality of computers A program that enables dynamic load balancing of
In the calculator,
Obtaining the dynamic service ticket value of the service being executed in each of the plurality of computers from the service load measuring means;
Obtaining the dynamic node ticket value of each of the plurality of computers from the service load measuring means;
Select a service to be relocated to a computer that is most suitable for service execution among the plurality of computers from among the services that are being executed by the computers that may have the acquired dynamic node ticket value below a certain value. And steps to
Searching for the optimal computer on which the selected service is to be relocated based on the obtained dynamic node ticket value of each of the plurality of computers;
Relocating the selected service to the retrieved optimal computer ;
A total dynamic service ticket value that is a sum of the acquired dynamic service ticket values of the parallel execution service in each of the plurality of computers, and an optimal service number representing an optimal execution number of the parallel execution service; And based on a static service ticket value representing a resource amount that is predicted in advance to be executed by each of the plurality of computers, the total dynamic service ticket value is set to the static service ticket value. Calculating by dividing by the service ticket value;
The parallel execution actually executed according to the magnitude relation between the calculated optimum service number and the current service number that is the number of executions of the parallel execution type service actually executed in the system at that time For executing the step of adjusting the number of executions of the type service .