JP2011243162A

JP2011243162A - Quantity control device, quantity control method and quantity control program

Info

Publication number: JP2011243162A
Application number: JP2010117335A
Authority: JP
Inventors: Takeshi Higuchi; 毅樋口
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2010-05-21
Filing date: 2010-05-21
Publication date: 2011-12-01

Abstract

PROBLEM TO BE SOLVED: To increase the number of operating servers when needed to address an increase in load while suppressing resource usage of standby servers in a distributed processing system where the number of operating servers is varied in accordance with the resource usage of the servers.SOLUTION: The present invention calculates expected time until resource usage reaches a threshold which is an index to change a standby server to an operating server, selects a standby method to enable the standby server to be generated within the calculated expected time, and generates the standby server with the standby method.

Description

この発明は、複数のサーバに処理を分散させる分散処理システムであって、サーバのリソース使用量に応じて稼動するサーバの台数が変更される分散処理システムに関する。 The present invention relates to a distributed processing system that distributes processing to a plurality of servers, in which the number of servers that are operated is changed according to the resource usage of the server.

分散処理システムでは、負荷分散装置が、複数のサーバへ処理を分散させ、各サーバへの負荷を分散させる。特に、分散処理システムには、負荷分散装置が、全体の負荷状況に応じて、サーバの稼働台数を制御することで、負荷の上昇や減少に対応するものもある。 In a distributed processing system, a load distribution apparatus distributes processing to a plurality of servers and distributes the load to each server. In particular, in some distributed processing systems, a load distribution apparatus responds to an increase or decrease in load by controlling the number of operating servers according to the overall load status.

また、仮想化技術を用いることにより、各サーバを仮想サーバとすることで、サーバの生成や削除を短時間で行うことができる。また、各サーバを仮想サーバとすることで、新たにハードウェアを用意することなく、新たなサーバを生成することができる。そのため、複数のシステムが混在した環境において、一時的に負荷が高いシステムに、より多くのリソースを割り当てる等の制御が可能となっている。 In addition, by using a virtualization technique, each server can be a virtual server, and server generation and deletion can be performed in a short time. Also, by making each server a virtual server, a new server can be generated without preparing new hardware. Therefore, in an environment in which a plurality of systems are mixed, control such as allocating more resources to a system with a temporarily high load is possible.

分散処理システムにおいて、負荷の急激な上昇に対応するには、待機サーバを予め用意しておき、必要になった時点で待機サーバを稼動サーバに切り替える方法がある（特許文献１参照）。しかし、待機サーバは通常は使われないサーバであるため、待機サーバを用意しておくことは、負荷の上昇が発生しない場合には、無駄なリソースを使用することになる。 In a distributed processing system, there is a method of preparing a standby server in advance and switching the standby server to an active server when it becomes necessary (see Patent Document 1) in order to cope with a sudden increase in load. However, since the standby server is a server that is not normally used, preparing the standby server uses useless resources when no increase in load occurs.

国際公開ＷＯ２００６／０４０８１１号International Publication WO2006 / 040811

特許文献１では、待機サーバを用意する方式が一意に決められており、その方式の待機サーバを起動した状態にしておくか、起動していない状態にしておくかの選択のみが可能である。待機サーバを予め起動した状態で用意しておけば、負荷の上昇への対応はすぐにできるが、負荷の上昇が発生しない場合には無駄なリソースを使用することになる。一方、待機サーバを起動しない状態で用意しておけば、負荷の上昇の発生しない場合におけるリソースの使用を抑えることができるものの、負荷の上昇への対応が遅れる可能性がある。
この発明は、待機サーバによるリソース使用量を抑えつつ、必要なタイミングで稼動サーバの台数を増やし、負荷上昇への対応を可能とすることを目的とする。 In Patent Document 1, a method for preparing a standby server is uniquely determined, and it is only possible to select whether the standby server of the method is activated or not activated. If the standby server is prepared in a state where it is activated in advance, it is possible to quickly cope with an increase in load, but if no increase in load occurs, useless resources are used. On the other hand, if the standby server is prepared without being activated, it is possible to suppress the use of resources when no increase in load occurs, but the response to the increase in load may be delayed.
An object of the present invention is to increase the number of active servers at a necessary timing while suppressing resource usage by a standby server, and to cope with an increase in load.

この発明に係る台数制御装置は、
複数のサーバに処理を分散させる分散処理システムであって、サーバのリソースが使用されている量であるリソース使用量に応じて稼動する稼動サーバの台数が変更される分散処理システムにおける稼動サーバの台数を制御する台数制御装置であり、
前記稼動サーバへの切り替えを待機している待機サーバを所定の設定状態のサーバから生成するまでにかかる生成時間を、前記待機サーバの待機方式毎に記憶装置に記憶する時間記憶部と、
処理装置により、所定のリソース使用量が予め設定された第１閾値に前記リソース使用量が達するまでの第１予測時間を計算する予測時間計算部と、
処理装置により、前記時間記憶部が記憶した前記生成時間が、前記予測時間計算部が計算した第１予測時間よりも短い待機方式を選択する待機方式選択部と、
処理装置により、前記所定の設定状態のサーバから、前記待機方式選択部が選択した待機方式の待機サーバを生成する待機サーバ生成部と、
前記リソース使用量が前記第１閾値に達した場合、前記待機サーバ生成部が生成した待機サーバを稼動させて稼動サーバに切り替える切替部と
を備えることを特徴とする。 The number control device according to the present invention,
A distributed processing system that distributes processing to multiple servers, and the number of active servers in the distributed processing system in which the number of active servers is changed according to the amount of resource used, which is the amount of server resources used Is a unit control device that controls
A time storage unit that stores a generation time required to generate a standby server waiting for switching to the active server from a server in a predetermined setting state in a storage device for each standby method of the standby server;
A predicted time calculation unit that calculates a first predicted time until the resource usage reaches a first threshold that is set in advance by a processing device;
A standby method selection unit that selects a standby method that is shorter than the first predicted time calculated by the predicted time calculation unit by the generation time stored by the time storage unit by a processing device;
A standby server generation unit that generates a standby server of the standby method selected by the standby method selection unit from the server in the predetermined setting state by a processing device;
And a switching unit that operates the standby server generated by the standby server generation unit and switches to the active server when the resource usage amount reaches the first threshold.

この発明に係る台数制御装置は、待機サーバを稼動サーバへ切り替える指標となる閾値にリソース使用量が達するまでの予測時間を計算して、計算した予測時間内に生成可能な待機サーバの待機方式を選択し、その待機方式の待機サーバを生成する。そのため、稼動サーバへの切り替えが必要となる時点には、待機サーバを用意することができており、必要なタイミングで稼動サーバの台数を増やし、負荷上昇への対応が可能である。 The number control device according to the present invention calculates a standby time of a standby server that can be generated within the calculated predicted time by calculating a predicted time until the resource usage reaches a threshold value that serves as an index for switching the standby server to an active server. Select and create a standby server for that standby method. Therefore, a standby server can be prepared at the time when switching to an active server is required, and the number of active servers can be increased at a necessary timing to cope with an increase in load.

実施の形態１に係るサーバリソース制御システムの構成図。1 is a configuration diagram of a server resource control system according to Embodiment 1. FIG. 図１に示す各仮想サーバ３の構成を論理的に示した図。The figure which showed the structure of each virtual server 3 shown in FIG. 1 logically. 実施の形態１に係るリソース提供時間管理テーブル２６の例を示す図。The figure which shows the example of the resource provision time management table 26 which concerns on Embodiment 1. FIG. 各待機方式２６−１で示した方式で待機サーバの生成する処理（生成手順）と、待機サーバから稼動サーバに変更する際の処理（切替手順）との例を示した図。The figure which showed the example of the process (generation procedure) which a standby server produces | generates by the system shown in each standby system 26-1, and the process (switching procedure) at the time of changing from a standby server to an active server. 実施の形態１に係るリソース管理テーブル２７の例を示す図。FIG. 6 is a diagram showing an example of a resource management table 27 according to the first embodiment. 実施の形態１に係るリソース管理・制御サーバ６の負荷上昇時における動作の流れを示すフローチャート。5 is a flowchart showing a flow of operations when the load of the resource management / control server 6 according to the first embodiment is increased. 実施の形態１に係る将来の負荷の上昇の予測計算の説明図。Explanatory drawing of the prediction calculation of the future load increase which concerns on Embodiment 1. FIG. 実施の形態２に係るリソース管理テーブル２７の例を示す図。The figure which shows the example of the resource management table 27 which concerns on Embodiment 2. FIG. 実施の形態２に係るリソース管理・制御サーバ６の負荷上昇時における動作の流れを示すフローチャート。9 is a flowchart showing an operation flow when the load of the resource management / control server 6 according to the second embodiment increases. 実施の形態２に係るリソース管理・制御サーバ６の負荷減少時における動作の流れを示すフローチャート。9 is a flowchart showing an operation flow when the load of the resource management / control server 6 according to the second embodiment is reduced. 実施の形態３に係るリソース管理テーブル２７の例を示す図。FIG. 10 shows an example of a resource management table 27 according to the third embodiment. 実施の形態３に係るリソース管理・制御サーバ６の各サーバグループに対する処理開始時における動作の流れを示すフローチャート。10 is a flowchart showing a flow of operations at the start of processing for each server group of the resource management / control server 6 according to the third embodiment. 実施の形態４に係るサーバリソース制御システムの構成図。FIG. 6 is a configuration diagram of a server resource control system according to a fourth embodiment. 実施の形態４に係るリソース管理・制御サーバ６の稼動サーバ停止時の動作の流れを示すフローチャート。10 is a flowchart showing a flow of operations when the active server of the resource management / control server 6 according to the fourth embodiment is stopped. 実施の形態４に係る将来の負荷の上昇の予測計算の説明図。Explanatory drawing of the prediction calculation of the future load rise which concerns on Embodiment 4. FIG. 物理計算機１、リソース管理・制御サーバ６、負荷分散装置５、端末８のハードウェア構成の一例を示す図。The figure which shows an example of the hardware constitutions of the physical computer 1, the resource management and control server 6, the load distribution apparatus 5, and the terminal 8. FIG.

以下、図に基づき、発明の実施の形態を説明する。
以下の説明において、処理装置は後述するＣＰＵ９１１等である。記憶装置は後述するＲＯＭ９１３、ＲＡＭ９１４、磁気ディスク９２０等の記憶装置である。つまり、処理装置、記憶装置はハードウェアである。 Hereinafter, embodiments of the invention will be described with reference to the drawings.
In the following description, the processing device is a CPU 911 or the like which will be described later. The storage device is a storage device such as a ROM 913, a RAM 914, or a magnetic disk 920, which will be described later. That is, the processing device and the storage device are hardware.

実施の形態１．
この実施の形態では、負荷上昇時に用意する待機サーバの待機方式を決定する方法について説明する。 Embodiment 1 FIG.
In this embodiment, a method for determining a standby method of a standby server prepared when the load increases will be described.

図１は、実施の形態１に係るサーバリソース制御システムの構成図である。
図１に示すように、サーバリソース制御システムは、複数の物理計算機１（１−１，１−２，・・・）と、負荷分散装置５と、リソース管理・制御サーバ６（台数制御装置）と、複数の端末８と、ストレージ１０とを備える。各物理計算機１と負荷分散装置５やリソース管理・制御サーバ６とは、通信回線４を介して接続される。各物理計算機１とストレージ１０とは通信回線９を介して接続される。また、負荷分散装置５と各端末８とは、インターネット／イントラネット７を介して接続される。 FIG. 1 is a configuration diagram of a server resource control system according to the first embodiment.
As shown in FIG. 1, the server resource control system includes a plurality of physical computers 1 (1-1, 1-2,...), A load distribution device 5, and a resource management / control server 6 (number control device). A plurality of terminals 8 and a storage 10. Each physical computer 1 is connected to the load balancer 5 and the resource management / control server 6 via a communication line 4. Each physical computer 1 and the storage 10 are connected via a communication line 9. The load balancer 5 and each terminal 8 are connected via the Internet / intranet 7.

各物理計算機１は、仮想化ソフトウェア２（２−１，２−２，・・・）を備える。仮想化ソフトウェア２は、サーバ仮想化を実現するソフトウェアである。各物理計算機１では、仮想化ソフトウェア２により、システムを構成するサーバ群が仮想サーバ３（３−１〜３−１０，・・・）として構築されている。
物理計算機１−１では、Ｗｅｂ１−１仮想サーバ、Ｗｅｂ２−３仮想サーバ、Ａｐｐ１−１仮想サーバ、ＤＢ−１仮想サーバ、Ｗｅｂ２−２仮想サーバが、仮想化ソフトウェア２−１により仮想サーバ３として構築されている。また、物理計算機１−２では、Ｗｅｂ１−２仮想サーバ、Ｗｅｂ２−１仮想サーバ、Ａｐｐ２−１仮想サーバ、Ｗｅｂ１−３仮想サーバ、Ａｐｐ１−２仮想サーバが、仮想化ソフトウェア２−２により仮想サーバ３として構築されている。
各仮想サーバ３は、各種サービスを提供する。ここでは、Ｗｅｂ１の機能を提供する仮想サーバ３として、Ｗｅｂ１−１仮想サーバ、Ｗｅｂ１−２仮想サーバ、Ｗｅｂ１−３仮想サーバがある。また、Ｗｅｂ２の機能を提供する仮想サーバ３として、Ｗｅｂ２−１仮想サーバ、Ｗｅｂ２−２仮想サーバ、Ｗｅｂ２−３仮想サーバがある。また、Ａｐｐ１の機能を提供する仮想サーバ３として、Ａｐｐ１−１仮想サーバ、Ａｐｐ１−２仮想サーバがある。また、Ａｐｐ２の機能を提供する仮想サーバ３として、Ａｐｐ２−１仮想サーバがある。また、ＤＢの機能を有した仮想サーバ３として、ＤＢ−１仮想サーバがある。
なお、これらの仮想サーバのうち、Ｗｅｂ１−３仮想サーバと、Ｗｅｂ２−３仮想サーバとが待機サーバであり、他は稼動サーバである。各稼動サーバには、使用可能なリソースが割り当てられる。
以下の説明では、同一の機能を提供する仮想サーバ３をサーバグループと呼ぶ。例えば、Ｗｅｂ１−１仮想サーバ、Ｗｅｂ１−２仮想サーバ、Ｗｅｂ１−３仮想サーバが１つのサーバグループであり、Ａｐｐ１−１仮想サーバ、Ａｐｐ１−２仮想サーバが１つのサーバグループである。 Each physical computer 1 includes virtualization software 2 (2-1, 2-2,...). The virtualization software 2 is software that realizes server virtualization. In each physical computer 1, a server group constituting the system is constructed as virtual servers 3 (3-1 to 3-10,...) By the virtualization software 2.
In the physical computer 1-1, the Web 1-1 virtual server, the Web 2-3 virtual server, the App 1-1 virtual server, the DB-1 virtual server, and the Web 2-2 virtual server are constructed as the virtual server 3 by the virtualization software 2-1. Has been. In the physical computer 1-2, the Web 1-2 virtual server, the Web 2-1 virtual server, the App 2-1 virtual server, the Web 1-3 virtual server, and the App 1-2 virtual server are converted into the virtual server 3 by the virtualization software 2-2. Is built as.
Each virtual server 3 provides various services. Here, as the virtual server 3 that provides the function of Web1, there are a Web1-1 virtual server, a Web1-2 virtual server, and a Web1-3 virtual server. Further, as the virtual server 3 that provides the Web2 function, there are a Web2-1 virtual server, a Web2-2 virtual server, and a Web2-3 virtual server. Moreover, as the virtual server 3 that provides the function of App1, there are an App1-1 virtual server and an App1-2 virtual server. Further, as the virtual server 3 that provides the function of App2, there is an App2-1 virtual server. As a virtual server 3 having a DB function, there is a DB-1 virtual server.
Of these virtual servers, the Web 1-3 virtual server and the Web 2-3 virtual server are standby servers, and the others are active servers. Each active server is assigned available resources.
In the following description, the virtual server 3 that provides the same function is referred to as a server group. For example, the Web1-1 virtual server, the Web1-2 virtual server, and the Web1-3 virtual server are one server group, and the App1-1 virtual server and the App1-2 virtual server are one server group.

通信回線４は、例えばイントラネット、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）等のネットワークである。ここでは１本のネットワークで記載しているが、管理用のネットワークや業務用のネットワーク等で複数用意することも可能である。 The communication line 4 is, for example, a network such as an intranet or a LAN (Local Area Network). Although a single network is described here, a plurality of networks can be prepared using a management network, a business network, or the like.

負荷分散装置５は、ある機能に対するリクエストを、その機能を有した複数の仮想サーバ３に振り分けてリクエストを送信し、同一機能の処理を分散させることで、処理能力を向上させる機能を実現するための装置である。
負荷分散装置５としては、アプライアンスサーバを導入する場合、ソフトウェアで実現する場合、ＤＮＳ（ＤｏｍａｉｎＮａｍｅＳｙｓｔｅｍ）サーバの機能を利用する場合等あるが、その実現方法はどのようなものであっても構わない。 The load distribution device 5 distributes requests for a certain function to a plurality of virtual servers 3 having the function, and transmits the request to distribute the processing of the same function, thereby realizing a function of improving the processing capability. It is a device.
As the load balancer 5, when an appliance server is introduced, when it is realized by software, a function of a DNS (Domain Name System) server is used, etc., any implementation method may be used. Absent.

リソース管理・制御サーバ６は、仮想サーバ３のリソース使用量に応じて、稼動サーバの台数制御を行うサーバである。リソース管理・制御サーバ６は、リソース情報収集・蓄積部２１、負荷傾向判断部２２、役割変更部２３、リソース提供方式決定部２４、待機生成・削除部２５、リソース提供時間管理テーブル２６、リソース管理テーブル２７を備える。
リソース情報収集・蓄積部２１は、各仮想サーバ３のリソース使用情報と、各サーバグループに割り当てているリソース量情報とを、定期的に収集し、リソース管理・制御サーバ６の記憶装置に蓄積する。また、リソース情報収集・蓄積部２１は、処理装置により、サーバグループ毎に定義されている第１閾値を、そのサーバグループのリソース使用量が超えたか否かを判断する。
負荷傾向判断部２２は、リソース情報収集・蓄積部２１が収集したリソース使用情報を基に、回帰分析等を用いて処理装置により負荷の将来の上昇の予測計算を行う。
役割変更部２３は、処理装置により、待機サーバを稼動サーバに変更する、あるいは稼動サーバを待機サーバに変更する。
リソース提供方式決定部２４は、負荷傾向判断部２２が予測計算することにより導出した負荷上昇傾向と、待機サーバの生成が必要なサーバグループに割り当てられているリソース量の情報、待機サーバを生成するための時間、待機サーバを稼動サーバに変更するために要する時間、待機サーバ用に必要とするリソース量情報を基に、処理装置により最適な待機サーバの待機方式を判定する。
待機生成・削除部２５は、処理装置により、リソース提供方式決定部２４が導出した待機方式に基づき、待機サーバを生成する、あるいは待機サーバを削除する。
リソース提供時間管理テーブル２６は、各待機方式の生成時間、待機サーバを稼動サーバに変更させるために要する時間、リソース利用の情報が格納されたテーブル（記憶装置）である。
リソース管理テーブル２７は、各サーバグループの構成情報や閾値等の情報が格納されたテーブル（記憶装置）である。 The resource management / control server 6 is a server that controls the number of active servers in accordance with the resource usage of the virtual server 3. The resource management / control server 6 includes a resource information collection / accumulation unit 21, a load trend determination unit 22, a role change unit 23, a resource provision method determination unit 24, a standby generation / deletion unit 25, a resource provision time management table 26, resource management A table 27 is provided.
The resource information collection / accumulation unit 21 periodically collects resource usage information of each virtual server 3 and resource amount information allocated to each server group, and accumulates them in a storage device of the resource management / control server 6. . Further, the resource information collection / accumulation unit 21 determines whether or not the resource usage amount of the server group exceeds the first threshold defined for each server group by the processing device.
Based on the resource usage information collected by the resource information collection / accumulation unit 21, the load trend determination unit 22 performs a prediction calculation of a future increase in load by the processing device using regression analysis or the like.
The role changing unit 23 changes the standby server to the active server or changes the active server to the standby server by the processing device.
The resource provision method determination unit 24 generates a load increase tendency derived by the load trend determination unit 22 performing prediction calculation, information on the amount of resources allocated to a server group that needs to generate a standby server, and a standby server. The processing device determines an optimal standby server standby method based on the time required for the standby server to change to the active server and the resource amount information required for the standby server.
The standby generation / deletion unit 25 generates a standby server or deletes the standby server based on the standby method derived by the resource provision method determination unit 24 by the processing device.
The resource provision time management table 26 is a table (storage device) in which the generation time of each standby method, the time required to change a standby server to an active server, and resource usage information are stored.
The resource management table 27 is a table (storage device) that stores information such as configuration information and threshold values of each server group.

インターネット／イントラネット７は、各サーバ群に対してリクエストを行う際に利用されるネットワークであり、インターネットやイントラネット、ＬＡＮ、専用回線等である。 The Internet / intranet 7 is a network used when making a request to each server group, and is the Internet, an intranet, a LAN, a dedicated line, or the like.

端末８は、各サーバ群に対して、インターネット／イントラネット７を介してリクエストを送出する。 The terminal 8 sends a request to each server group via the Internet / intranet 7.

通信回線９は、各物理計算機がストレージにアクセスするネットワークＳＡＮ（ＳｔｏｒａｇｅＡｒｅａＮｅｔｗｏｒｋ）である。 The communication line 9 is a network SAN (Storage Area Network) through which each physical computer accesses the storage.

ストレージ１０は、各物理計算機１で稼動する仮想サーバ３のブートディスクイメージや待機サーバを生成する基となるテンプレートが格納されたストレージである。特に、仮想サーバディスクイメージ１１は、各仮想サーバ３のブートディスクイメージである。また、仮想サーバテンプレート１２は、各サーバグループの待機サーバを生成する基となるテンプレートである。 The storage 10 is a storage that stores a boot disk image of the virtual server 3 running on each physical computer 1 and a template that is a basis for generating a standby server. In particular, the virtual server disk image 11 is a boot disk image of each virtual server 3. The virtual server template 12 is a template that is a basis for generating a standby server for each server group.

図２は、図１に示す各仮想サーバ３の構成を論理的に示した図である。図２に示すように、各仮想サーバ３によって実現されるシステムは、Ｗｅｂ１、Ｗｅｂ２をフロントエンドとする２つのシステムが混在した環境となっている。
端末８からのリクエストは、インターネット／イントラネット７を介して負荷分散装置５に届けられる。そのリクエストがＷｅｂ１に対するものであれば、負荷分散装置５からＷｅｂ１−１あるいはＷｅｂ１−２のいずれかにリクエストが送出される。また、必要に応じ、負荷分散装置５を介してＡｐｐ１−１あるいはＡｐｐ１−２にリクエストが送出される。また、必要に応じＤＢ−１にリクエストが送出される。そして、処理結果が、リクエストが送られた順の逆順で戻っていく構成となっている。
また、同様に、リクエストがＷｅｂ２に対するものであれば、負荷分散装置５からＷｅｂ２−１あるいはＷｅｂ２−２のいずれかにリクエストが送出される。また、必要に応じ、負荷分散装置５を介してＡｐｐ２−１にリクエストが送出される。そして、処理結果が、リクエストが送られた順の逆順で戻っていく構成となっている。 FIG. 2 is a diagram logically showing the configuration of each virtual server 3 shown in FIG. As shown in FIG. 2, the system realized by each virtual server 3 is an environment in which two systems having Web1 and Web2 as front ends are mixed.
A request from the terminal 8 is delivered to the load balancer 5 via the Internet / intranet 7. If the request is for Web1, the request is sent from the load balancer 5 to either Web1-1 or Web1-2. Further, a request is sent to App1-1 or App1-2 via the load balancer 5 as necessary. Requests are sent to DB-1 as necessary. Then, the processing result returns in the reverse order of the order in which the requests are sent.
Similarly, if the request is for Web 2, the request is sent from the load balancer 5 to either Web 2-1 or Web 2-2. Further, a request is sent to App 2-1 through the load balancer 5 as necessary. Then, the processing result returns in the reverse order of the order in which the requests are sent.

図３は、実施の形態１に係るリソース提供時間管理テーブル２６の例を示す図である。
リソース提供時間管理テーブル２６は、待機方式２６−１、切替時間２６−２（待機から稼動に要する時間）、生成時間２６−３（生成のための時間）、リソース使用量の順位２６−４の項目を有するテーブルである。
待機方式２６−１は、待機サーバを用意する方式を識別するための方式名が格納される。本例では、待機サーバを一時停止状態にしたサスペンド、待機サーバの状態を保存したスナップショット、待機サーバのブートイメージを用意したブート、待機サーバの基となるテンプレートの状態のままのテンプレートを待機方式として用意した例を示している。
切替時間２６−２は、各待機方式で用意された待機サーバを稼動サーバに変更するために要する時間が格納される。
生成時間２６−３は、各待機方式の待機サーバを生成するために要する時間が格納される。
リソース使用量の順位２６−４は、各待機方式にて待機サーバを用意した場合に、その待機サーバによって使用されるリソースの量（リソース使用量）の順位が格納される。この例では、リソース使用量が多いものほど少ない数字で表している。 FIG. 3 is a diagram illustrating an example of the resource provision time management table 26 according to the first embodiment.
The resource provision time management table 26 includes a standby method 26-1, a switching time 26-2 (time required from standby to operation), a generation time 26-3 (time for generation), and a resource usage rank 26-4. It is a table having items.
The standby method 26-1 stores a method name for identifying a method for preparing a standby server. In this example, suspending the standby server in a suspended state, a snapshot that saves the standby server state, a boot with a standby server boot image, and a template that remains in the state of the template on which the standby server is based An example prepared as is shown.
The switching time 26-2 stores the time required to change the standby server prepared in each standby method to an active server.
The generation time 26-3 stores the time required to generate the standby server of each standby method.
The resource usage ranking 26-4 stores the ranking of the amount of resources (resource usage) used by the standby server when a standby server is prepared in each standby method. In this example, the larger the resource usage, the smaller the number.

図４は、各待機方式２６−１で示した方式で待機サーバを生成する処理（生成手順）と、待機サーバから稼動サーバに変更する際の処理（切替手順）との例を示した図である。
ここでは、各サーバグループの待機サーバを生成するためのＯＳやアプリケーション等のインストールが完了したテンプレートが予め用意されている前提としている。 FIG. 4 is a diagram showing an example of processing (generation procedure) for generating a standby server by the method shown in each standby method 26-1 and processing (switching procedure) when changing from a standby server to an active server. is there.
Here, it is assumed that a template in which installation of an OS, an application, and the like for generating a standby server of each server group is completed is prepared in advance.

（サスペンド）
生成手順は次の（１）から（３）までである。（１）テンプレートからブートイメージをコピーして、ブートディスクの生成を行う。（２）ブートディスクのブートイメージで起動する。（３）一時停止状態にする。
切替手順は次の（１）（２）である。（１）一時停止状態を解除して稼動させる。（２）稼動状態にした後、負荷分散装置５へ登録する。 (suspend)
The generation procedure is the following (1) to (3). (1) Copy a boot image from a template and generate a boot disk. (2) Start with the boot image of the boot disk. (3) Set to a temporary stop state.
The switching procedure is the following (1) and (2). (1) Release the pause state and operate. (2) After entering the operating state, register in the load balancer 5.

（スナップショット）
生成手順は次の（１）から（４）までである。（１）テンプレートからブートイメージをコピーして、ブートディスクの生成を行う。（２）ブートディスクのブートイメージで起動する。（３）スナップショット情報の生成を行う。（４）スナップショット情報の生成完了後、生成した待機サーバをシャットダウンする。
切替手順は次の（１）（２）である。（１）スナップショットから起動を行う。（２）稼動状態にした後、負荷分散装置５へ登録する。 (snap shot)
The generation procedure is the following (1) to (4). (1) Copy a boot image from a template and generate a boot disk. (2) Start with the boot image of the boot disk. (3) Generate snapshot information. (4) After the generation of the snapshot information is completed, the generated standby server is shut down.
The switching procedure is the following (1) and (2). (1) Start from a snapshot. (2) After entering the operating state, register in the load balancer 5.

（ブート）
生成手順は次の（１）である。（１）テンプレートからブートイメージをコピーして、ブートディスクの生成を行う。
切替手順は次の（１）（２）である。（１）ブートディスクのブートイメージで起動する。（２）稼動状態にした後、負荷分散装置５へ登録する。 (boot)
The generation procedure is the following (1). (1) Copy a boot image from a template and generate a boot disk.
The switching procedure is the following (1) and (2). (1) Start with the boot image of the boot disk. (2) After entering the operating state, register in the load balancer 5.

（テンプレート）
テンプレートは用意されていることが前提のため、生成手順は存在しない。
切替手順は次の（１）から（３）までである。（１）テンプレートからブートイメージをコピーして、ブートディスクの生成を行う。（２）ブートディスクのブートイメージで起動する。（３）稼動状態にした後、負荷分散装置５へ登録する。 (template)
Since a template is assumed to be prepared, there is no generation procedure.
The switching procedure is from the following (1) to (3). (1) Copy a boot image from a template and generate a boot disk. (2) Start with the boot image of the boot disk. (3) After entering the operating state, register in the load balancer 5.

このように、待機方式毎に生成手順や切替手順が異なるため、待機方式毎に待機サーバの生成にかかる時間や、待機サーバから稼動サーバへの切り替えにかかる時間や、待機サーバによって使用されるリソース使用量が異なる。そこで、これらの情報をリソース提供時間管理テーブル２６に記憶しておき、最適な待機方式を導出する。 As described above, since the generation procedure and switching procedure differ for each standby method, the time required for generating the standby server for each standby method, the time required for switching from the standby server to the active server, and the resources used by the standby server The amount used is different. Therefore, these pieces of information are stored in the resource provision time management table 26, and an optimum standby method is derived.

なお、切替手順では、最後に必ず負荷分散装置５へ登録するという処理が行われる。待機サーバを稼動サーバに切り替える処理にかかる時間を短縮するため、予めサーバグループ毎に割り当て可能なＩＰアドレスを確保しておき、待機サーバが生成された時点で確保されたＩＰアドレスをその待機サーバに割り当てておいてもよい。この場合、待機サーバから稼動サーバへ切り替えされた場合、待機サーバに割り当てられていたＩＰアドレスを負荷分散装置５へ登録すればよい。つまり、待機サーバから稼動サーバへ切り替えた場合、単にＩＰアドレスを負荷分散装置５へ登録するだけで、負荷分散装置５はどのサーバグループの稼動サーバが、どのＩＰアドレスで追加されたかを識別することが可能になる。 In the switching procedure, a process of always registering in the load balancer 5 is performed at the end. In order to shorten the time required for the process of switching the standby server to the active server, an IP address that can be assigned to each server group is secured in advance, and the IP address secured when the standby server is generated is assigned to the standby server. It may be assigned. In this case, when the standby server is switched to the active server, the IP address assigned to the standby server may be registered in the load balancer 5. That is, when switching from the standby server to the active server, the load balancer 5 identifies which server group's active server is added with which IP address simply by registering the IP address in the load balancer 5. Is possible.

図５は、実施の形態１に係るリソース管理テーブル２７の例を示す図である。
リソース管理テーブル２７は、サーバグループ名２７−１、稼動サーバ２７−２、待機サーバ２７−３、待機方式２７−４、テンプレート名２７−５、最大数２７−６、第１閾値２７−７、第２閾値２７−８、第３閾値２７−９の項目を有するテーブルである。
サーバグループ名２７−１は、各サーバグループを識別するための名前が格納される。
稼動サーバ２７−２は、各サーバグループで稼動サーバとして動作しているサーバを識別するための名前が格納される。
待機サーバ２７−３は、現時点で用意されている待機サーバを識別するための名前が格納される。
待機方式２７−４は、用意されている待機サーバの待機方式の方式名が格納される。なお、待機方式２７−４は、リソース提供時間管理テーブル２６の待機方式２６−１に記録されているいずれかの方式名が格納される。
テンプレート名２７−５は、待機サーバを生成するための基となるテンプレートを識別する名前が格納される。
最大数２７−６は、各サーバグループで稼動可能な最大サーバ数が格納される。
第１閾値２７−７、第２閾値２７−８、第３閾値２７−９は、リソースの閾値が格納される。第１閾値２７−７は、待機サーバから稼動サーバへの変更を開始するトリガーとして利用される値が格納される。第２閾値２７−８は、各サーバグループのリソース使用量の上限値として利用される値であり、この値を超えた状態が長く続かないようにすべき値が格納される。第３閾値２７−９は、リソース使用量が少なくなり、稼動サーバ数を減らすトリガーとして利用される値が格納される。なお、各閾値には、そのサーバグループに割り当てられたリソース量に対するリソース使用量の割合（パーセンテージ）が閾値として格納される。 FIG. 5 is a diagram illustrating an example of the resource management table 27 according to the first embodiment.
The resource management table 27 includes a server group name 27-1, an active server 27-2, a standby server 27-3, a standby method 27-4, a template name 27-5, a maximum number 27-6, a first threshold 27-7, It is a table which has the item of the 2nd threshold value 27-8 and the 3rd threshold value 27-9.
The server group name 27-1 stores a name for identifying each server group.
The active server 27-2 stores a name for identifying a server operating as an active server in each server group.
The standby server 27-3 stores a name for identifying a standby server prepared at the present time.
In the standby method 27-4, the name of the standby method of the prepared standby server is stored. The standby method 27-4 stores one of the method names recorded in the standby method 26-1 of the resource provision time management table 26.
The template name 27-5 stores a name for identifying a template that is a basis for generating the standby server.
The maximum number 27-6 stores the maximum number of servers that can operate in each server group.
As the first threshold 27-7, the second threshold 27-8, and the third threshold 27-9, resource thresholds are stored. The first threshold value 27-7 stores a value used as a trigger for starting the change from the standby server to the active server. The second threshold value 27-8 is a value used as the upper limit value of the resource usage amount of each server group, and stores a value that should prevent a state exceeding this value from continuing for a long time. The third threshold 27-9 stores a value used as a trigger for reducing the number of active servers by reducing the resource usage. In each threshold value, the ratio of the resource usage amount to the resource amount allocated to the server group is stored as a threshold value.

図６は、実施の形態１に係るリソース管理・制御サーバ６の負荷上昇時における動作の流れを示すフローチャートである。なお、リソース管理・制御サーバ６は、図６に示す処理をサーバグループ毎に実行する。
（ＳＴ１０１）では、リソース情報収集・蓄積部２１が、各仮想サーバ３のリソース使用量を示すリソース使用情報を定期的に収集して、リソース管理テーブル２７に格納されている第１閾値２７−７を超えたか否かを処理装置により判定する。なお、ここでリソース情報収集・蓄積部２１が収集したリソース使用情報は、リソース管理・制御サーバ６の記憶装置に蓄積される。
第１閾値２７−７を超えたと判定した場合、処理を（ＳＴ１０２）へ進める。一方、第１閾値２７−７を超えていないと判定した場合、所定の時間が経過した後、再度（ＳＴ１０１）を実行する。 FIG. 6 is a flowchart showing an operation flow when the load of the resource management / control server 6 according to the first embodiment increases. The resource management / control server 6 executes the process shown in FIG. 6 for each server group.
In (ST101), the resource information collection / accumulation unit 21 periodically collects resource usage information indicating the resource usage of each virtual server 3, and stores the first threshold value 27-7 stored in the resource management table 27. It is determined by the processing device whether or not the threshold is exceeded. Here, the resource usage information collected by the resource information collection / accumulation unit 21 is accumulated in the storage device of the resource management / control server 6.
If it is determined that the first threshold value 27-7 has been exceeded, the process proceeds to (ST102). On the other hand, if it is determined that the first threshold value 27-7 is not exceeded, (ST101) is executed again after a predetermined time has elapsed.

（ＳＴ１０２）では、役割変更部２３が、リソース管理テーブル２７の待機サーバ２７−３に格納された情報から、処理装置により、待機サーバが用意されているか否かを判定する。
待機サーバが用意されていれる場合、処理を（ＳＴ１０５）へ進める。一方、待機サーバが用意されていない場合、処理を（ＳＴ１０３）へ進める。 In (ST102), the role changing unit 23 determines from the information stored in the standby server 27-3 of the resource management table 27 whether the standby server is prepared by the processing device.
If a standby server is prepared, the process proceeds to (ST105). On the other hand, if no standby server is prepared, the process proceeds to (ST103).

（ＳＴ１０３）では、役割変更部２３が、リソース管理テーブル２７の稼動サーバ２７−２と最大数２７−６とに格納された情報から、処理装置により、稼動サーバ数が最大数に達しているか否かを判定する。
最大数に達していると判定した場合、稼動サーバを追加できないため処理を（ＳＴ１０１）へ戻す。最大数に達していないと判定した場合、処理を（ＳＴ１０４）へ進める。 In (ST103), whether or not the role changing unit 23 has reached the maximum number of operating servers by the processing device from the information stored in the operating servers 27-2 and the maximum number 27-6 of the resource management table 27. Determine whether.
If it is determined that the maximum number has been reached, the operation server cannot be added, and the process returns to (ST101). If it is determined that the maximum number has not been reached, the process proceeds to (ST104).

（ＳＴ１０４）では、最大数に達していないにも関わらず待機サーバが用意されていない場合、待機サーバが生成中であるため、役割変更部２３が、処理装置により、待機サーバの生成が完了したか否かを判定する。
生成が完了していないと判定した場合、所定の時間が経過した後、再度（ＳＴ１０４）を実行する。一方、生成が完了したと判定した場合、処理を（ＳＴ１０５）へ進める。 In (ST104), when the standby server is not prepared even though the maximum number has not been reached, since the standby server is being generated, the role changing unit 23 has completed the generation of the standby server by the processing device. It is determined whether or not.
If it is determined that the generation has not been completed, (ST104) is executed again after a predetermined time has elapsed. On the other hand, if it is determined that the generation has been completed, the process proceeds to (ST105).

（ＳＴ１０５）では、役割変更部２３が、リソース管理テーブル２７の待機方式２７−４に格納された情報に基づき、待機サーバを稼動サーバに切り替える。また、役割変更部２３が、リソース管理テーブル２７の稼動サーバ２７−２、待機サーバ２７−３、待機方式２７−４に格納された情報を更新する。
そして、処理を（ＳＴ１０６）へ進める。 In (ST105), the role changing unit 23 switches the standby server to the active server based on the information stored in the standby method 27-4 of the resource management table 27. Further, the role changing unit 23 updates the information stored in the active server 27-2, standby server 27-3, and standby method 27-4 in the resource management table 27.
Then, the process proceeds to (ST106).

（ＳＴ１０６）では、負荷傾向判断部２２が、処理装置により、将来の負荷の上昇の予測計算を行う。
図７は、実施の形態１に係る将来の負荷の上昇の予測計算の説明図である。図７において、各プロットは、過去所定の時間（時間ｔ０）に、（ＳＴ１０１）で収集し、記憶装置に蓄積されたリソース使用情報が示すリソース使用量である。このリソース使用量（リソースの使用状況の分布）から回帰分析等の手法を用い、将来の負荷の上昇の予測計算を行う。例えば、ｙ＝ａｘ＋ｂ（ｙ：リソース使用量、ｘ：時間、ａ，ｂ：はリソース使用量から計算される値）等の近似式を導出して、導出した近似式から予測計算を行う。
そして、処理を（ＳＴ１０７）へ進める。 In (ST106), the load tendency determination unit 22 performs a prediction calculation of a future load increase by the processing device.
FIG. 7 is an explanatory diagram of a prediction calculation of a future load increase according to the first embodiment. In FIG. 7, each plot is a resource usage amount indicated by the resource usage information collected in (ST101) and accumulated in the storage device at a past predetermined time (time t0). From this resource usage amount (distribution of resource usage status), a method such as regression analysis is used to predict the future load increase. For example, an approximate expression such as y = ax + b (y: resource use amount, x: time, a, b: values calculated from the resource use amount) is derived, and prediction calculation is performed from the derived approximate expression.
Then, the process proceeds to (ST107).

（ＳＴ１０７）では、リソース提供方式決定部２４が、負荷傾向判断部２２によって導出された負荷上昇の傾向と、現時点で対象のサーバグループに割り当てられているリソース量とから、処理装置により、リソース使用量が第１閾値２７−７に達するまでの時間（図７のｔ１、第１予測時間）を計算する。また、リソース提供方式決定部２４は、処理装置により、リソース使用量が第１閾値２７−７に達してから第２閾値２７−８に達するまでの時間（図７のｔ２、第２予測時間）を計算する。
なお、現時点で対象のサーバグループに割り当てられているリソース量とは、（ＳＴ１０５）で稼動サーバが追加されるまでに、各稼動サーバに割り当てられていたリソース量の合計に、（ＳＴ１０５）で追加された稼動サーバに割り当てられたリソース量を加えたリソース量である。 In (ST107), the resource providing method determination unit 24 uses the processing device to determine the resource usage based on the load increase trend derived by the load trend determination unit 22 and the resource amount currently assigned to the target server group. The time until the amount reaches the first threshold value 27-7 (t1 in FIG. 7, the first predicted time) is calculated. Also, the resource provision method determination unit 24 uses the processing device to wait for the resource usage amount to reach the second threshold value 27-8 after reaching the first threshold value 27-7 (t2, second predicted time in FIG. 7). Calculate
Note that the resource amount currently assigned to the target server group is added in (ST105) to the total amount of resources assigned to each active server until the active server is added in (ST105). This is the amount of resources added to the amount of resources allocated to the active server.

（ＳＴ１０８）では、リソース提供方式決定部２４が、処理装置により、リソース提供時間管理テーブル２６の生成時間２６−３に格納された時間と、（ＳＴ１０７）で計算した第１予測時間とを比較する。また、リソース提供方式決定部２４が、処理装置により、リソース提供時間管理テーブル２６の切替時間２６−２に格納された時間と、（ＳＴ１０７）で計算した第２予測時間とを比較する。そして、リソース管理テーブル２７が、第１予測時間よりも生成時間２６−３の時間が短く、かつ、第２予測時間よりも切替時間２６−２の時間が短い待機方式が存在するか否かを判定する。
両方の時間を満たす待機方式が存在すると判定した場合、処理を（ＳＴ１０９）へ進める。一方、両方の時間を満たす待機方式が存在しないと判定した場合、処理を（ＳＴ１１０）へ進める。 In (ST108), the resource provision method determination unit 24 compares the time stored in the generation time 26-3 of the resource provision time management table 26 with the first predicted time calculated in (ST107) by the processing device. . Further, the resource provision method determination unit 24 compares the time stored in the switching time 26-2 of the resource provision time management table 26 with the second predicted time calculated in (ST107) by the processing device. Whether the resource management table 27 has a standby method in which the generation time 26-3 is shorter than the first prediction time and the switching time 26-2 is shorter than the second prediction time. judge.
If it is determined that there is a standby method that satisfies both times, the process proceeds to (ST109). On the other hand, if it is determined that there is no standby method that satisfies both times, the process proceeds to (ST110).

（ＳＴ１０９）では、リソース提供方式決定部２４が、処理装置により、両方の時間を満たす待機方式が１方式のみか複数存在するかを判定する。
１方式のみと判定した場合、処理を（ＳＴ１１２）へ進める。一方、複数存在すると判定した場合、処理を（ＳＴ１１１）へ進める。 In (ST109), the resource provision method determination unit 24 determines whether there is only one method or a plurality of standby methods satisfying both times by the processing device.
If it is determined that there is only one method, the process proceeds to (ST112). On the other hand, if it is determined that there are a plurality, the process proceeds to (ST111).

（ＳＴ１１０）では、リソース提供方式決定部２４が、処理装置により、リソース提供時間管理テーブル２６の切替時間２６−２に格納された時間と生成時間２６−３に格納された時間との和が最も小さい待機方式を選択する。
そして、処理を（ＳＴ１１２）へ進める。 In (ST110), the resource provision method determination unit 24 determines that the sum of the time stored in the switching time 26-2 and the time stored in the generation time 26-3 of the resource provision time management table 26 is the largest by the processing device. Choose a smaller standby method.
Then, the process proceeds to (ST112).

（ＳＴ１１１）では、リソース提供方式決定部２４が、処理装置により、リソース提供時間管理テーブル２６のリソース使用量の順位２６−４に格納された値が低い（リソース使用量が少ない）待機方式を選択する。 In (ST111), the resource provision method determination unit 24 selects a standby method with a low value (the resource use amount is small) stored in the resource use amount ranking 26-4 of the resource provision time management table 26 by the processing device. To do.

（ＳＴ１１２）では、待機生成・削除部２５が、処理装置により、リソース提供方式決定部２４が決定した待機方式の待機サーバを生成する。そして、待機生成・削除部２５は、リソース管理テーブル２７の待機サーバ２７−３、待機方式２７−４の情報を更新する。
そして、処理を（ＳＴ１０１）へ戻す。 In (ST112), the standby generation / deletion unit 25 generates a standby server of the standby method determined by the resource provision method determination unit 24 by the processing device. Then, the standby generation / deletion unit 25 updates the information of the standby server 27-3 and the standby method 27-4 in the resource management table 27.
Then, the process returns to (ST101).

以上のように、リソース管理・制御サーバ６は、負荷傾向に基づき、必要とされる時間内に稼動サーバの追加（リソース追加）が可能な待機方式を選択する。また、必要とされる時間内に稼動サーバの追加が可能な待機方式が複数存在する場合には、待機サーバが使用するリソースが少ない待機方式を選択する。
そのため、待機サーバのリソース使用量を抑えつつ、必要なタイミングで待機サーバを稼動サーバに変更して、負荷上昇へ対応することが可能である。
特に、同じサーバグループで同じ負荷傾向であっても、稼動サーバ数が少ない場合には、サーバグループに割り当てられている総リソース量が少ないため、即時に追加が必要となり、稼動サーバ数が多い場合には、サーバグループに割り当てられている総リソース量が多いため、即時追加は不要となるといった傾向がある。リソース管理・制御サーバ６によれば、このような傾向に合わせて、稼動サーバの追加を行うことができる。 As described above, the resource management / control server 6 selects a standby method capable of adding an active server (adding resources) within a required time based on the load tendency. In addition, when there are a plurality of standby methods in which an active server can be added within the required time, a standby method that uses fewer resources is selected.
Therefore, it is possible to cope with an increase in load by changing the standby server to an active server at a necessary timing while suppressing the resource usage of the standby server.
In particular, even if the same server group has the same load trend, if the number of active servers is small, the total amount of resources allocated to the server group is small, so it must be added immediately and the number of active servers is large. However, since the total amount of resources allocated to the server group is large, immediate addition is not necessary. According to the resource management / control server 6, an active server can be added in accordance with such a tendency.

なお、上記説明では、仮想サーバの構成で説明したが、物理サーバの構成に適用することも可能である。
また、上記説明では、負荷傾向を判断する手法として回帰分析による一次式の例を示したが、負荷傾向を予測できる手法であれば、別の手法としても構わない。
また、上記説明では、待機方式の例としてサスペンド、スナップショット、ブート、テンプレートを示したが、これらの方式に限定されるものではない。 In the above description, the configuration of the virtual server is described. However, the configuration can be applied to the configuration of the physical server.
In the above description, an example of a linear expression based on regression analysis is shown as a method for determining the load tendency. However, another method may be used as long as the method can predict the load tendency.
In the above description, suspend, snapshot, boot, and template are shown as examples of the standby method. However, the present invention is not limited to these methods.

つまり、実施の形態１に係るサーバリソース制御方式は、グループ化された複数のサーバに処理を分散させ、サーバの追加により処理量の増加を実現するシステムにおいて、
待機サーバを生成する時間情報と、待機サーバから稼動サーバに役割を変更するための時間情報と、稼動サーバのリソース使用状況の履歴情報から導出する負荷変動傾向と、その時の稼動サーバ群に割り当てられているリソース量により変化する閾値に達するまでの時間予測に基づき、稼動サーバのリソース量変更に必要な時間内に対応でき、かつ、最も待機サーバをリソース使用量の少ない形態を選択し、待機サーバの生成を実施することを特徴とする。 In other words, the server resource control method according to the first embodiment distributes processing to a plurality of grouped servers, and in a system that realizes an increase in processing amount by adding servers,
Assigned to the time information for generating the standby server, the time information for changing the role from the standby server to the active server, the load fluctuation trend derived from the resource usage history information of the active server, and the active server group at that time Based on the time estimate until reaching the threshold that changes depending on the amount of resources that are available, the standby server can be used within the time required to change the resource amount of the active server and the standby server is selected with the least amount of resource usage. Is generated.

実施の形態２．
この実施の形態では、負荷上昇時に選択した待機サーバの待機方式を記録し、記録した待機方式に基づき、負荷減少時の待機サーバの待機方式を決定する方法について説明する。 Embodiment 2. FIG.
In this embodiment, a method of recording a standby method of a standby server selected at the time of load increase and determining a standby method of a standby server at the time of load reduction based on the recorded standby method will be described.

図８は、実施の形態２に係るリソース管理テーブル２７の例を示す図である。
図８に示すリソース管理テーブル２７には、図５に示すリソース管理テーブル２７に初期台数２７−１０と選択待機方式２７−１１とが追加されている。他は図５に示すリソース管理テーブル２７と同様である。
初期台数２７−１０は、初期の稼動サーバ台数が格納される。なお、ここでは、初期の稼動サーバ台数よりも少ない台数に稼動サーバを減らせないものとする。
選択待機方式２７−１１は、各稼動サーバ数の時に、待機方式としてどの方式が選択されたかが格納される。例えば、図８では、Ｗｅｂ１のサーバグループであれば、稼動サーバが２台の場合における待機サーバの待機方式がサスペンドであり、３台の場合における待機サーバの待機方式がブートであり、４台の場合における待機サーバの待機方式がブートであることを示す。 FIG. 8 is a diagram illustrating an example of the resource management table 27 according to the second embodiment.
In the resource management table 27 shown in FIG. 8, an initial number 27-10 and a selection standby method 27-11 are added to the resource management table 27 shown in FIG. The rest is the same as the resource management table 27 shown in FIG.
The initial number 27-10 stores the initial number of active servers. Here, it is assumed that the number of operating servers cannot be reduced to a smaller number than the initial number of operating servers.
The selection standby method 27-11 stores which method has been selected as the standby method for each number of active servers. For example, in FIG. 8, in the case of the Web1 server group, the standby server standby method in the case of two active servers is suspended, and the standby server standby method in the case of three is boot, and four Indicates that the standby method of the standby server in this case is boot.

図９は、実施の形態２に係るリソース管理・制御サーバ６の負荷上昇時における動作の流れを示すフローチャートである。
図９の（ＳＴ１０１）から（ＳＴ１１２）までの動作は、実施の形態１で説明した図６に示す動作と同様である。
（ＳＴ２０１）では、リソース提供方式決定部２４が、選択した待機方式を、リソース管理テーブル２７の選択待機方式２７−１１における稼動サーバ数に応じた項目に格納する。 FIG. 9 is a flowchart showing an operation flow when the load of the resource management / control server 6 according to the second embodiment increases.
The operations from (ST101) to (ST112) in FIG. 9 are the same as the operations shown in FIG. 6 described in the first embodiment.
In (ST201), the resource providing method determination unit 24 stores the selected standby method in an item according to the number of active servers in the selected standby method 27-11 of the resource management table 27.

図１０は、実施の形態２に係るリソース管理・制御サーバ６の負荷減少時における動作の流れを示すフローチャートである。なお、リソース管理・制御サーバ６は、サーバグループ毎に図１０に示す処理を実行する。
（ＳＴ２０２）では、リソース情報収集・蓄積部２１が、各仮想サーバ３のリソース使用情報を定期的に収集して、リソース管理テーブル２７に格納されている第３閾値２７−９を下回ったか否かを処理装置により判定する。なお、ここでリソース情報収集・蓄積部２１が収集したリソース使用情報は、リソース管理・制御サーバ６の記憶装置に蓄積される。
第３閾値２７−９を下回ったと判定した場合、処理を（ＳＴ２０３）へ進める。一方、第３閾値２７−９を下回っていないと判定した場合、所定の時間が経過した後、再度（ＳＴ２０２）を実行する。 FIG. 10 is a flowchart showing an operation flow when the load of the resource management / control server 6 according to the second embodiment is reduced. The resource management / control server 6 executes the process shown in FIG. 10 for each server group.
In (ST202), whether or not the resource information collection / accumulation unit 21 periodically collects the resource usage information of each virtual server 3 and falls below the third threshold value 27-9 stored in the resource management table 27. Is determined by the processing device. Here, the resource usage information collected by the resource information collection / accumulation unit 21 is accumulated in the storage device of the resource management / control server 6.
If it is determined that the value falls below the third threshold 27-9, the process proceeds to (ST203). On the other hand, if it is determined that it is not lower than the third threshold 27-9, (ST202) is executed again after a predetermined time has elapsed.

（ＳＴ２０３）では、リソース提供方式決定部２４が、リソース管理テーブル２７の稼動サーバ２７−２と初期台数２７−１０とに格納された情報から、処理装置により、稼動サーバの台数が初期台数か否かを判断する。
初期台数ではないと判定した場合、処理を（ＳＴ２０４）へ進める。一方、初期台数であると判定した場合、稼動サーバを減らせないため、処理を（ＳＴ２０２）へ戻す。 In (ST203), the resource provision method determination unit 24 determines whether the number of active servers is the initial number by the processing device from the information stored in the active servers 27-2 and the initial number 27-10 of the resource management table 27. Determine whether.
If it is determined that the number is not the initial number, the process proceeds to (ST204). On the other hand, if it is determined that the number is the initial number, the number of active servers cannot be reduced, so the process returns to (ST202).

（ＳＴ２０４）では、リソース提供方式決定部２４が、リソース管理テーブル２７の稼動サーバ２７−２に格納された情報から、処理装置により、現在の稼動サーバ数を計算する。そして、リソース提供方式決定部２４が、リソース管理テーブル２７の選択待機方式２７−１１における計算した稼働サーバ数から１台減少させた台数の項目に格納された待機方式を読み出す。
そして、処理を（ＳＴ２０５）へ進める。 In (ST204), the resource provision method determination unit 24 calculates the current number of active servers from the information stored in the active server 27-2 of the resource management table 27 by the processing device. Then, the resource provision method determination unit 24 reads the standby method stored in the item of the number reduced by one from the number of active servers calculated in the selection standby method 27-11 of the resource management table 27.
Then, the process proceeds to (ST205).

（ＳＴ２０５）では、役割変更部２３が、処理装置により、現在の稼動サーバのうちの１台を、（ＳＴ２０４）で読み出された待機方式の待機サーバに切り替える。そして、役割変更部２３が、リソース管理テーブル２７の稼動サーバ２７−２、待機サーバ２７−３、待機方式２７−４の情報を更新する。 In (ST205), the role changing unit 23 switches one of the currently active servers to the standby server of the standby method read in (ST204) by the processing device. Then, the role changing unit 23 updates information on the active server 27-2, the standby server 27-3, and the standby method 27-4 in the resource management table 27.

（ＳＴ２０６）では、待機生成・削除部２５が、処理装置により、（ＳＴ２０５）で待機サーバに切り替える前から用意されていた待機サーバを削除する。 In (ST206), the standby generation / deletion unit 25 deletes the standby server prepared before switching to the standby server in (ST205) by the processing device.

以上のように、負荷上昇時であれば、時間やリソース使用状況で待機方式を選択可能であるが、負荷減少時では、次の上昇傾向が予測できないため、待機方式の選択が困難となる。そこで、負荷上昇時に選択した待機方式を記録しておき、その負荷減少時に記録した待機方式を選択することにより、そのシステムに最適な待機方式を選択できる。 As described above, when the load is increasing, the standby method can be selected depending on the time and the resource usage state. However, when the load is decreasing, it is difficult to select the standby method because the next upward trend cannot be predicted. Therefore, by recording the standby method selected when the load increases, and selecting the standby method recorded when the load decreases, the optimal standby method for the system can be selected.

なお、図９に示す流れでは、同一の稼動サーバ数の場合における待機方式が最新の情報に常に更新される。しかし、過去の情報の中で最もリソースを使わない待機方式、あるいは最も高速に待機サーバを稼動サーバに切り替えられる待機方式等を記録しておくとしてもよい。 In the flow shown in FIG. 9, the standby method in the case of the same number of active servers is always updated to the latest information. However, a standby method that uses the least resources among the past information, or a standby method that switches the standby server to the active server at the highest speed may be recorded.

つまり、実施の形態２に係るサーバリソース制御方式は、負荷上昇に伴い、稼動サーバを追加した後に用意した待機サーバの待機方式を稼動サーバの台数と対応させて記録し、負荷減少時の稼動サーバ台数削減時に、稼動サーバ台数に対応した待機方式を記録された方式にて用意することにより、負荷減少時の待機サーバの待機方式を負荷上昇時に選択した方式をそのまま利用可能とすることで、特別な計算をすることなく、そのシステムに適した待機方式を選択可能とすることを特徴とする。 In other words, the server resource control method according to the second embodiment records the standby server standby method prepared after adding the active server in association with the number of active servers as the load increases, and the active server when the load decreases By preparing a standby method corresponding to the number of active servers in the recorded method when the number of servers is reduced, the standby server standby method at the time of load reduction can be used as it is and the method selected at the time of load increase can be used as it is. This makes it possible to select a standby method suitable for the system without performing a complicated calculation.

実施の形態３．
この実施の形態では、サーバグループの役割に応じて、待機サーバを用意する動作を変更する方法について説明する。 Embodiment 3 FIG.
In this embodiment, a method for changing the operation of preparing a standby server according to the role of the server group will be described.

図１１は、実施の形態３に係るリソース管理テーブル２７の例を示す図である。
図１１に示すリソース管理テーブル２７には、図８に示すリソース管理テーブル２７にフロントグループ２７−１２が追加されている。他は図８に示すリソース管理テーブル２７と同様である。
フロントグループ２７−１２は、サーバグループ名２７−１の名前が示すサーバグループのフロントエンドとして動作する他のサーバグループが存在する場合、そのサーバグループの名前が格納される。フロントエンドとして動作する他のサーバグループが存在しない場合には、“なし”との情報が格納される。
なお、フロントエンドで動作するサーバグループとは、端末８よりで動作するサーバグループである。例えば、図２に示すように、Ａｐｐ１のサーバグループよりもＷｅｂ１のサーバグループは端末８よりで動作するため、Ｗｅｂ１のサーバグループはＡｐｐ１のサーバグループのフロントエンドとして動作するサーバグループである。なお、この場合、Ａｐｐ１のサーバグループをＷｅｂ１のサーバグループのバックエンドとして動作するサーバグループと呼ぶ。 FIG. 11 is a diagram illustrating an example of the resource management table 27 according to the third embodiment.
In the resource management table 27 shown in FIG. 11, a front group 27-12 is added to the resource management table 27 shown in FIG. The rest is the same as the resource management table 27 shown in FIG.
When there is another server group that operates as a front end of the server group indicated by the name of the server group name 27-1, the front group 27-12 stores the name of the server group. If there is no other server group operating as a front end, information of “none” is stored.
The server group operating on the front end is a server group operating on the terminal 8. For example, as shown in FIG. 2, since the server group of Web1 operates from the terminal 8 rather than the server group of App1, the server group of Web1 is a server group that operates as a front end of the server group of App1. In this case, the App1 server group is referred to as a server group that operates as a back end of the Web1 server group.

図１２は、リソース管理・制御サーバ６の各サーバグループに対する処理開始時における動作の流れを示すフローチャートである。なお、前提として、バックエンドサーバグループには、待機サーバを用意しておかず、リソース管理テーブル２７の待機方式２７−４に用意すべき待機方式が設定されている状態とする。
（ＳＴ３０１）では、リソース提供方式決定部２４が、リソース管理テーブル２７のフロントグループ２７−１２に格納された情報に基づき、処理装置により、フロントエンドで動作するサーバグループが存在するサーバグループか否かを判断する。
フロントエンドで動作するサーバグループが存在しないと判定した場合、処理を図６や図９に示す（ＳＴ１０１）へ進め、以降は図６や図９に示した処理が実行される。一方、フロントエンドサーバグループが存在すると判定した場合、処理を（ＳＴ３０２）へ進める。 FIG. 12 is a flowchart showing an operation flow at the start of processing for each server group of the resource management / control server 6. As a premise, a standby server is not prepared in the back-end server group, and a standby method to be prepared in the standby method 27-4 of the resource management table 27 is set.
In (ST301), based on the information stored in the front group 27-12 of the resource management table 27, the resource provision method determining unit 24 determines whether the server group includes a server group that operates on the front end based on the information. Judging.
If it is determined that there is no server group operating on the front end, the process proceeds to (ST101) shown in FIGS. 6 and 9, and thereafter, the processes shown in FIGS. 6 and 9 are executed. On the other hand, if it is determined that the front-end server group exists, the process proceeds to (ST302).

（ＳＴ３０２）では、リソース提供方式決定部２４が、リソース管理テーブル２７の待機サーバ２７−３に格納された情報から、処理装置により、フロントエンドサーバグループに待機サーバが存在するか否かを判定する。
待機サーバが存在すると判定した場合、処理を図６や図９に示す（ＳＴ１０１）へ進め、以降は図６や図９に示した処理が実行される。一方、待機サーバが存在しないと判定した場合、処理を（ＳＴ３０３）へ進める。 In (ST302), the resource provision method determination unit 24 determines whether or not there is a standby server in the front-end server group by the processing device from the information stored in the standby server 27-3 of the resource management table 27. .
If it is determined that there is a standby server, the process proceeds to (ST101) shown in FIGS. 6 and 9, and thereafter, the processes shown in FIGS. 6 and 9 are executed. On the other hand, if it is determined that there is no standby server, the process proceeds to (ST303).

（ＳＴ３０３）では、リソース提供方式決定部２４が、処理装置により、フロントエンドサーバグループに、待機サーバが追加されたか否かを判定する。なお、待機サーバが追加されたとは、初期状態で用意されていた待機サーバとは別に待機サーバが生成されたという意味である。例えば、リソース管理テーブル２７の初期台数２７−１０よりも稼動サーバ数が多く、かつ、待機サーバが用意されていれば、待機サーバが追加されたと判定できる。
追加されていないと判定した場合、所定の時間が経過した後、再度（ＳＴ３０３）を実行する。一方、追加されたと判定した場合、処理を（ＳＴ３０４）へ進める。 In (ST303), the resource provision method determination unit 24 determines whether a standby server has been added to the front-end server group by the processing device. Note that the addition of the standby server means that a standby server has been generated separately from the standby server prepared in the initial state. For example, if the number of active servers is larger than the initial number 27-10 of the resource management table 27 and a standby server is prepared, it can be determined that a standby server has been added.
If it is determined that it has not been added, (ST303) is executed again after a predetermined time has elapsed. On the other hand, if it is determined that it has been added, the process proceeds to (ST304).

（ＳＴ３０４）では、待機生成・削除部２５が、処理装置により、リソース管理テーブル２７の待機方式２７−４に記録されている待機方式の待機サーバを生成する。 In (ST304), the standby generation / deletion unit 25 generates the standby server of the standby method recorded in the standby method 27-4 of the resource management table 27 by the processing device.

（ＳＴ３０５）では、待機生成・削除部２５が、生成した待機方式をリソース管理テーブル２７の選択待機方式２７−１１における稼動サーバ数に応じた項目に記録する。 In (ST305), the standby generation / deletion unit 25 records the generated standby method in an item corresponding to the number of active servers in the selection standby method 27-11 of the resource management table 27.

フロントエンドのサーバグループにリソースが追加されていなければ、バックエンドのサーバグループにリソースを追加しても、フロントエンドのサーバグループがネックとなって処理量を増やせない。
そこで、以上のように、バックエンドとなるサーバグループは、フロントエンドのサーバグループの負荷が上昇し、初期状態からリソースが追加された（稼動サーバが追加された）と判断されてから、待機サーバの用意を実施する。これにより、待機サーバが使用するリソース使用量を不用に多くすることを防ぐことができる。 If resources are not added to the front-end server group, even if resources are added to the back-end server group, the amount of processing cannot be increased due to the front-end server group becoming a bottleneck.
Therefore, as described above, the server group serving as the back end has a standby server after it is determined that the load on the front end server group has increased and resources have been added from the initial state (an active server has been added). Prepare for. Thereby, it is possible to prevent an unnecessary increase in the amount of resource used by the standby server.

つまり、実施の形態３に係るサーバリソース制御方式は、グループ間の階層構造情報に基づき、負荷上昇時に、フロントとなるグループの稼動サーバ台数が追加されたことを契機に、そのグループのバックエンドとなるグループの待機サーバの生成を開始することでバックエンド側の待機サーバを予め用意しないようにすることで、待機サーバのためのリソース確保を抑えることを特徴とする。 In other words, the server resource control method according to the third embodiment is based on the hierarchical structure information between groups, and when the load increases, the number of active servers in the front group is added and By starting generation of a standby server of a certain group, a standby server on the back-end side is not prepared in advance, thereby suppressing resource reservation for the standby server.

実施の形態４．
この実施の形態では、稼動サーバが障害等で停止した場合の動作について説明する。 Embodiment 4 FIG.
In this embodiment, an operation when an active server is stopped due to a failure or the like will be described.

図１３は、実施の形態４に係るサーバリソース制御システムの構成図である。図１に示す構成図に対し、障害監視部３１が追加されている。追加された障害監視部３１以外の構成は図１と同様である。
障害監視部３１は、処理装置により、各サーバグループにおける稼動サーバの稼動状態を監視する。 FIG. 13 is a configuration diagram of a server resource control system according to the fourth embodiment. A failure monitoring unit 31 is added to the configuration diagram shown in FIG. The configuration other than the added failure monitoring unit 31 is the same as that in FIG.
The failure monitoring unit 31 monitors the operating state of the active server in each server group by the processing device.

図１４は、実施の形態４に係るリソース管理・制御サーバ６の稼動サーバ停止時の動作の流れを示すフローチャートである。
（ＳＴ４０１）では、障害監視部３１が、処理装置により、サーバグループ内の稼動サーバの稼動状態を監視し、障害等により稼動サーバが停止したか否かを判定する。
停止していないと判断した場合、所定の時間が経過した後、再度（ＳＴ４０１）を実行する。一方、停止したと判断した場合、処理を（ＳＴ４０２）へ進める。 FIG. 14 is a flowchart showing an operation flow when the active server of the resource management / control server 6 according to the fourth embodiment is stopped.
In (ST401), the failure monitoring unit 31 monitors the operating state of the active server in the server group by the processing device, and determines whether the active server has stopped due to a failure or the like.
If it is determined that it has not stopped, (ST401) is executed again after a predetermined time has elapsed. On the other hand, if it is determined that the process has been stopped, the process proceeds to (ST402).

（ＳＴ４０２）では、障害監視部３１が、（ＳＴ４０１）で稼動サーバが停止したと判断されたサーバグループについてのリソース管理テーブル２７の待機サーバ２７−３に格納された情報に基づき、処理装置により、待機サーバが用意されているか否かを判定する。
用意されていると判定した場合、処理を（ＳＴ４０８）へ進める。一方、用意されていないと判定した場合、処理を（ＳＴ４０３）へ進める。 In (ST402), based on the information stored in the standby server 27-3 of the resource management table 27 for the server group in which the active server is determined to have been stopped in (ST401), the failure monitoring unit 31 uses the processing device to It is determined whether or not a standby server is prepared.
If it is determined that it is prepared, the process proceeds to (ST408). On the other hand, if it is determined that it is not prepared, the process proceeds to (ST403).

（ＳＴ４０３）では、役割変更部２３が、リソース管理テーブル２７の稼動サーバ２７−２と最大数２７−６とに格納された情報から、処理装置により、稼動サーバ数が最大数に達しているか否かを判定する。
最大数に達していると判定した場合、処理を（ＳＴ４０４）へ進める。一方、最大数に達していないと判定した場合、処理を（ＳＴ４０７）へ進める。なお、ここでは、稼動サーバが停止してしまったため、最大数に達していても新たに稼動サーバを追加することが可能である。 In (ST403), whether or not the role changing unit 23 has reached the maximum number of active servers by the processing device from the information stored in the active servers 27-2 and the maximum number 27-6 of the resource management table 27. Determine whether.
If it is determined that the maximum number has been reached, the process proceeds to (ST404). On the other hand, if it is determined that the maximum number has not been reached, the process proceeds to (ST407). Here, since the active server has stopped, it is possible to add a new active server even if the maximum number is reached.

（ＳＴ４０４）では、リソース提供方式決定部２４が、処理装置により、リソース提供時間管理テーブル２６の切替時間２６−２と生成時間２６−３との和が最も小さい待機方式を選択する。そして、処理を（ＳＴ４０５）へ進める。
（ＳＴ４０５）では、待機生成・削除部２５が、処理装置により、リソース提供方式決定部２４が決定した待機方式の待機サーバを生成する。
（ＳＴ４０６）では、役割変更部２３が、処理装置により、生成された待機サーバを稼動サーバに変更する。これにより、稼動サーバ数が稼動サーバが停止する前の状態（最大数）に戻る。 In (ST404), the resource provision method determination unit 24 selects the standby method with the smallest sum of the switching time 26-2 and the generation time 26-3 of the resource provision time management table 26 by the processing device. Then, the process proceeds to (ST405).
In (ST405), the standby generation / deletion unit 25 generates a standby server of the standby method determined by the resource provision method determination unit 24 by the processing device.
In (ST406), the role changing unit 23 changes the generated standby server to an active server by the processing device. Thereby, the number of active servers returns to the state (maximum number) before the active servers are stopped.

（ＳＴ４０７）では、最大数に達していない場合、待機サーバが生成中であるため、役割変更部２３が、処理装置により、待機サーバの生成が完了したか否かを判定する。
生成が完了していないと判定した場合、所定の時間が経過した後、再度（ＳＴ４０７）を実行する。一方、生成が完了したと判定した場合、処理を（ＳＴ４０８）へ進める。 In (ST407), since the standby server is being generated when the maximum number has not been reached, the role changing unit 23 determines whether or not the standby server has been generated by the processing device.
If it is determined that the generation has not been completed, (ST407) is executed again after a predetermined time has elapsed. On the other hand, if it is determined that the generation has been completed, the process proceeds to (ST408).

（ＳＴ４０８）では、役割変更部２３が、リソース管理テーブル２７の待機方式２７−４に格納された情報に基づき、待機サーバを稼動サーバに切り替える。これにより、稼動サーバ数が稼動サーバが停止する前の状態に戻る。また、役割変更部２３が、リソース管理テーブル２７の稼動サーバ２７−２、待機サーバ２７−３、待機方式２７−４に格納された情報を更新する。
そして、処理を（ＳＴ４０９）へ進める。 In (ST408), the role changing unit 23 switches the standby server to the active server based on the information stored in the standby method 27-4 of the resource management table 27. Thereby, the number of active servers returns to the state before the active servers are stopped. Further, the role changing unit 23 updates the information stored in the active server 27-2, standby server 27-3, and standby method 27-4 in the resource management table 27.
Then, the process proceeds to (ST409).

（ＳＴ４０９）では、負荷傾向判断部２２が、処理装置により、将来の負荷の上昇の予測計算を行う。
図１５は、実施の形態４に係る将来の負荷の上昇の予測計算の説明図である。なお、図１５では、障害発生時に待機サーバが生成されていた場合（ＳＴ４０２でＹｅｓとなった場合）と、待機サーバが生成中であった場合（ＳＴ４０７を経由した場合）との２つの場合を示す。
稼動サーバ数が元の構成に戻った時点（図１５において、障害発生時に待機サーバの生成が完了している場合には障害発生時点、障害発生時に待機サーバの生成が完了していない場合には、待機サーバの生成が完了した時点）で、負荷傾向判断部２２が過去所定の時間（時間ｔ０）のリソースの使用状況の分布から回帰分析等の手法を用い、将来の負荷の上昇の予測計算を行う。例えば、ｙ＝ａｘ＋ｂ（ｙ：リソース使用量、ｘ：時間、ａ，ｂ：はリソース使用量から計算される値）等の近似式を導出して、導出した近似式から予測計算を行う。 In (ST409), the load tendency determination unit 22 performs prediction calculation of a future load increase by the processing device.
FIG. 15 is an explanatory diagram of a prediction calculation of a future load increase according to the fourth embodiment. In FIG. 15, there are two cases: a case where a standby server is generated when a failure occurs (when the answer is YES in ST402) and a case where a standby server is being generated (via ST407). Show.
When the number of active servers returns to the original configuration (in FIG. 15, when the generation of a standby server is completed when a failure occurs, when the failure occurs, and when the generation of a standby server is not completed when a failure occurs) , When the generation of the standby server is completed), the load trend determination unit 22 uses a method such as regression analysis from the distribution of resource usage in the past predetermined time (time t0) to predict the future load increase. I do. For example, an approximate expression such as y = ax + b (y: resource use amount, x: time, a, b: values calculated from the resource use amount) is derived, and prediction calculation is performed from the derived approximate expression.

（ＳＴ４１０）から（ＳＴ４１５）までの処理は、図６の（ＳＴ１０７）から（ＳＴ１１２）までの処理と同様である。 The processing from (ST410) to (ST415) is the same as the processing from (ST107) to (ST112) in FIG.

以上のように、稼動サーバに障害が発生し、現在割り当てているリソース量の減少が発生した場合には、即座に待機サーバを稼動サーバに変更し、今後のリソース増加に備えた待機サーバを用意する。これにより、障害発生時のリソース量の減少時間を最小限に食い止めることができるとともに、その後に発生する可能性がある負荷増加にも対応することができる。
また、障害発生時に生成した待機方式は記録しない。これにより、負荷減少時の待機方式は、障害の影響を受けることなく、通常時の待機方式を提供することが可能となる。 As described above, when a failure occurs in the active server and the amount of resources currently allocated decreases, the standby server is immediately changed to the active server, and a standby server is prepared for future resource increase. To do. As a result, it is possible to minimize the decrease time of the resource amount when a failure occurs, and to cope with an increase in load that may occur thereafter.
Also, the standby method generated when a failure occurs is not recorded. As a result, the standby method when the load is reduced can be provided with the normal standby method without being affected by the failure.

つまり、実施の形態４に係るサーバリソース制御方式は、サーバが障害等により切り離された場合に、待機サーバを稼動サーバに即座に変更すると共に、障害発生時点で稼動サーバのリソース使用状況の履歴情報から導出する負荷変動傾向を再計算し、その時の稼動サーバ群に割り当てられているリソース量により変化する閾値に達するまでの時間予測に基づき、稼動サーバのリソース量変更に必要な時間内に対応でき、かつ、最も待機サーバをリソース使用量の少ない形態を選択し、待機サーバの生成を実施することを特徴とする。 In other words, the server resource control method according to the fourth embodiment immediately changes the standby server to the active server when the server is disconnected due to a failure or the like, and the resource usage history information of the active server at the time of the failure. Recalculate the load fluctuation trend derived from, and based on the time estimate for reaching the threshold that changes according to the amount of resources allocated to the active server group at that time, it can respond within the time required to change the resource amount of the active server In addition, the standby server is generated by selecting the standby server having the least amount of resource usage.

次に、実施の形態における物理計算機１、負荷分散装置５、リソース管理・制御サーバ６、端末８のハードウェア構成について説明する。
図１６は、物理計算機１、負荷分散装置５、リソース管理・制御サーバ６、端末８のハードウェア構成の一例を示す図である。
図１６に示すように、物理計算機１、負荷分散装置５、リソース管理・制御サーバ６、端末８は、プログラムを実行するＣＰＵ９１１（Ｃｅｎｔｒａｌ・Ｐｒｏｃｅｓｓｉｎｇ・Ｕｎｉｔ、中央処理装置、処理装置、演算装置、マイクロプロセッサ、マイクロコンピュータ、プロセッサともいう）を備えている。ＣＰＵ９１１は、バス９１２を介してＲＯＭ９１３、ＲＡＭ９１４、ＬＣＤ９０１（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）、キーボード９０２（Ｋ／Ｂ）、通信ボード９１５、磁気ディスク装置９２０と接続され、これらのハードウェアデバイスを制御する。磁気ディスク装置９２０（固定ディスク装置）の代わりに、光ディスク装置、メモリカード読み書き装置などの記憶装置でもよい。磁気ディスク装置９２０は、所定の固定ディスクインタフェースを介して接続される。 Next, the hardware configuration of the physical computer 1, the load balancer 5, the resource management / control server 6, and the terminal 8 in the embodiment will be described.
FIG. 16 is a diagram illustrating an example of a hardware configuration of the physical computer 1, the load balancer 5, the resource management / control server 6, and the terminal 8.
As shown in FIG. 16, the physical computer 1, the load distribution device 5, the resource management / control server 6, and the terminal 8 are a CPU 911 (Central Processing Unit, central processing unit, processing unit, arithmetic unit, micro unit that executes a program. A processor, a microcomputer, and a processor). The CPU 911 is connected to the ROM 913, the RAM 914, the LCD 901 (Liquid Crystal Display), the keyboard 902 (K / B), the communication board 915, and the magnetic disk device 920 via the bus 912, and controls these hardware devices. Instead of the magnetic disk device 920 (fixed disk device), a storage device such as an optical disk device or a memory card read / write device may be used. The magnetic disk device 920 is connected via a predetermined fixed disk interface.

ＲＯＭ９１３、磁気ディスク装置９２０は、不揮発性メモリの一例である。ＲＡＭ９１４は、揮発性メモリの一例である。ＲＯＭ９１３とＲＡＭ９１４と磁気ディスク装置９２０とは、記憶装置（メモリ）の一例である。また、キーボード９０２、通信ボード９１５は、入力装置の一例である。また、通信ボード９１５は、通信装置（ネットワークインタフェース）の一例である。さらに、ＬＣＤ９０１は、表示装置の一例である。 The ROM 913 and the magnetic disk device 920 are examples of a nonvolatile memory. The RAM 914 is an example of a volatile memory. The ROM 913, the RAM 914, and the magnetic disk device 920 are examples of a storage device (memory). The keyboard 902 and the communication board 915 are examples of input devices. The communication board 915 is an example of a communication device (network interface). Furthermore, the LCD 901 is an example of a display device.

磁気ディスク装置９２０又はＲＯＭ９１３などには、オペレーティングシステム９２１（ＯＳ）、ウィンドウシステム９２２、プログラム群９２３、ファイル群９２４が記憶されている。プログラム群９２３のプログラムは、ＣＰＵ９１１、オペレーティングシステム９２１、ウィンドウシステム９２２により実行される。 An operating system 921 (OS), a window system 922, a program group 923, and a file group 924 are stored in the magnetic disk device 920 or the ROM 913. The programs in the program group 923 are executed by the CPU 911, the operating system 921, and the window system 922.

プログラム群９２３には、上記の説明において「仮想化ソフトウェア２」、「リソース情報収集・蓄積部２１」、「負荷傾向判断部２２」、「役割変更部２３」、「リソース提供方式決定部２４」、「待機生成・削除部２５」、「障害監視部３１」等として説明した機能を実行するソフトウェアやプログラムやその他のプログラムが記憶されている。プログラムは、ＣＰＵ９１１により読み出され実行される。
ファイル群９２４には、上記の説明において「リソース提供時間管理テーブル２６」、「リソース管理テーブル２７」に格納される情報やデータや信号値や変数値やパラメータが、「ファイル」や「データベース」の各項目として記憶される。「ファイル」や「データベース」は、ディスクやメモリなどの記録媒体に記憶される。ディスクやメモリなどの記憶媒体に記憶された情報やデータや信号値や変数値やパラメータは、読み書き回路を介してＣＰＵ９１１によりメインメモリやキャッシュメモリに読み出され、抽出・検索・参照・比較・演算・計算・処理・出力・印刷・表示などのＣＰＵ９１１の動作に用いられる。抽出・検索・参照・比較・演算・計算・処理・出力・印刷・表示のＣＰＵ９１１の動作の間、情報やデータや信号値や変数値やパラメータは、メインメモリやキャッシュメモリやバッファメモリに一時的に記憶される。 The program group 923 includes “virtualized software 2”, “resource information collection / accumulation unit 21”, “load tendency determination unit 22”, “role change unit 23”, “resource provision method determination unit 24” in the above description. , Software, programs and other programs for executing the functions described as “standby generation / deletion unit 25”, “failure monitoring unit 31”, and the like are stored. The program is read and executed by the CPU 911.
In the file group 924, information, data, signal values, variable values, and parameters stored in the “resource provision time management table 26” and “resource management table 27” in the above description are stored in the “file” and “database”. Stored as each item. The “file” and “database” are stored in a recording medium such as a disk or a memory. Information, data, signal values, variable values, and parameters stored in a storage medium such as a disk or memory are read out to the main memory or cache memory by the CPU 911 via a read / write circuit, and extracted, searched, referenced, compared, and calculated. Used for the operation of the CPU 911 such as calculation / processing / output / printing / display. Information, data, signal values, variable values, and parameters are temporarily stored in the main memory, cache memory, and buffer memory during the operation of the CPU 911 for extraction, search, reference, comparison, calculation, calculation, processing, output, printing, and display. Is remembered.

また、上記の説明におけるフローチャートの矢印の部分は主としてデータや信号の入出力を示し、データや信号値は、ＲＡＭ９１４のメモリ、その他光ディスク等の記録媒体やＩＣチップに記録される。また、データや信号は、バス９１２や信号線やケーブルその他の伝送媒体や電波によりオンライン伝送される。
また、上記の説明において「〜部」として説明するものは、「〜回路」、「〜装置」、「〜機器」、「〜手段」、「〜機能」であってもよく、また、「〜ステップ」、「〜手順」、「〜処理」であってもよい。また、「〜装置」として説明するものは、「〜回路」、「〜機器」、「〜手段」、「〜機能」であってもよく、また、「〜ステップ」、「〜手順」、「〜処理」であってもよい。さらに、「〜処理」として説明するものは「〜ステップ」であっても構わない。すなわち、「〜部」として説明するものは、ＲＯＭ９１３に記憶されたファームウェアで実現されていても構わない。或いは、ソフトウェアのみ、或いは、素子・デバイス・基板・配線などのハードウェアのみ、或いは、ソフトウェアとハードウェアとの組み合わせ、さらには、ファームウェアとの組み合わせで実施されても構わない。ファームウェアとソフトウェアは、プログラムとして、ＲＯＭ９１３等の記録媒体に記憶される。プログラムはＣＰＵ９１１により読み出され、ＣＰＵ９１１により実行される。すなわち、プログラムは、上記で述べた「〜部」としてコンピュータ等を機能させるものである。あるいは、上記で述べた「〜部」の手順や方法をコンピュータ等に実行させるものである。 In the above description, the arrows in the flowchart mainly indicate input / output of data and signals, and the data and signal values are recorded in a memory of the RAM 914, other recording media such as an optical disk, and an IC chip. Data and signals are transmitted online by a bus 912, signal lines, cables, other transmission media, and radio waves.
In addition, what is described as “to part” in the above description may be “to circuit”, “to device”, “to device”, “to means”, and “to function”. It may be “step”, “˜procedure”, “˜processing”. In addition, what is described as “˜device” may be “˜circuit”, “˜device”, “˜means”, “˜function”, and “˜step”, “˜procedure”, “ ~ Process ". Furthermore, what is described as “to process” may be “to step”. That is, what is described as “˜unit” may be realized by firmware stored in the ROM 913. Alternatively, it may be implemented only by software, or only by hardware such as elements, devices, substrates, and wirings, by a combination of software and hardware, or by a combination of firmware. Firmware and software are stored in a recording medium such as ROM 913 as a program. The program is read by the CPU 911 and executed by the CPU 911. That is, the program causes a computer or the like to function as the “˜unit” described above. Alternatively, the computer or the like is caused to execute the procedures and methods of “to part” described above.

１物理計算機、２仮想化ソフトウェア、３仮想サーバ、４，９通信回線、５負荷分散装置、６リソース管理・制御サーバ、７インターネット／イントラネット、８端末、１０ストレージ、１１仮想サーバディスクイメージ、１２仮想サーバテンプレート、２１リソース情報収集・蓄積部、２２負荷傾向判断部、２３役割変更部、２４リソース提供方式決定部、２５待機生成・削除部、２６リソース提供時間管理テーブル、２７リソース管理テーブル、３１障害監視部。 1 physical computer, 2 virtualization software, 3 virtual server, 4,9 communication line, 5 load balancer, 6 resource management / control server, 7 Internet / Intranet, 8 terminal, 10 storage, 11 virtual server disk image, 12 virtual Server template, 21 Resource information collection / accumulation unit, 22 Load trend determination unit, 23 Role change unit, 24 Resource provision method determination unit, 25 Standby generation / deletion unit, 26 Resource provision time management table, 27 Resource management table, 31 Failure Monitoring department.

Claims

A distributed processing system that distributes processing to multiple servers, and the number of active servers in the distributed processing system in which the number of active servers is changed according to the amount of resource used, which is the amount of server resources used Is a unit control device that controls
A time storage unit that stores a generation time required to generate a standby server waiting for switching to the active server from a server in a predetermined setting state in a storage device for each standby method of the standby server;
A predicted time calculation unit that calculates a first predicted time until the resource usage reaches a first threshold that is set in advance by a processing device;
A standby method selection unit that selects a standby method that is shorter than the first predicted time calculated by the predicted time calculation unit by the generation time stored by the time storage unit by a processing device;
A standby server generation unit that generates a standby server of the standby method selected by the standby method selection unit from the server in the predetermined setting state by a processing device;
A number control device comprising: a switching unit that operates the standby server generated by the standby server generation unit and switches to the active server when the resource usage amount reaches the first threshold.

The time storage unit further stores, for each standby method of the standby server, a switching time required to operate the standby server and switch to the active server,
The predicted time calculation unit further calculates a second predicted time until the resource usage reaches a second threshold in which a resource usage larger than the first threshold is set,
The standby mode selection unit selects the standby mode in which the generation time is shorter than the first prediction time and the switching time is shorter than the second prediction time. Number control device.

The standby method selection unit, when there is no standby method in which the generation time is shorter than the first prediction time and the switching time is shorter than the second prediction time, the generation time and the switching time, The number control device according to claim 2, wherein the standby method with the shortest sum is selected.

The standby method selection unit, when there are a plurality of standby methods that satisfy the condition that the generation time is shorter than the first prediction time and the switching time is shorter than the second prediction time, The number control device according to claim 2 or 3, wherein a standby method that uses a small amount of resources used by generating a standby server of the standby method is selected from among the standby methods that satisfy the condition.

The number control device further includes:
A standby method storage unit for storing the standby method selected by the standby method selection unit;
The switching unit further waits for at least one active server when the resource usage is less than a third threshold for which the resource usage is less than the first threshold. The number control device according to claim 1, wherein the number control device is switched to a standby server of a standby method stored in a method storage unit.

The plurality of servers are divided into a plurality of groups according to roles,
The number control device controls the number of active servers belonging to each group of the plurality of groups,
Whether the standby server generation unit generates a standby server belonging to the predetermined group according to whether there exists a standby server belonging to a front end group closer to the client than the predetermined group among the plurality of groups. 6. The number control device according to claim 1, wherein it is determined whether or not.

When the active server is stopped, the switching unit operates the standby server to switch to the active server,
The predicted time calculation unit recalculates a first predicted time until the resource usage reaches the first threshold after the switching unit operates the standby server and switches to the active server,
The standby method selection unit selects the standby method in which the generation time stored in the time storage unit is shorter than the first prediction time recalculated by the prediction time calculation unit,
The number of standby servers according to any one of claims 1 to 6, wherein the standby server generation unit generates a standby server of the standby method selected by the standby method selection unit from the servers in the predetermined setting state. Control device.

A distributed processing system that distributes processing to multiple servers, and the number of active servers in the distributed processing system in which the number of active servers is changed according to the amount of resource used, which is the amount of server resources used Is a unit control method for controlling
A prediction time calculation step in which the processing device calculates a first prediction time until the resource usage reaches a first threshold that is set in advance with a predetermined resource usage;
A standby time shorter than the first predicted time calculated in the predicted time calculation step is a time required for the processing device to generate a standby server waiting for switching to the active server from a server in a predetermined setting state. A standby method selection step for selecting a method;
A standby server generation step of generating a standby server of the standby method selected in the standby method selection step from the server in the predetermined setting state, the processing device;
A number control method comprising: a switching step of operating the standby server generated in the standby server generation step and switching to the active server when the resource usage amount reaches the first threshold value.

A distributed processing system that distributes processing to multiple servers, and the number of active servers in the distributed processing system in which the number of active servers is changed according to the amount of resource used, which is the amount of server resources used Is a unit control program for controlling
A predicted time calculation process for calculating a first predicted time until the resource usage reaches a first threshold that is set in advance with a predetermined resource usage;
Select a standby method in which the generation time required to generate a standby server waiting for switching to the active server from a server in a predetermined setting state is shorter than the first predicted time calculated in the predicted time calculation process Standby method selection processing,
A standby server generation process for generating a standby server of the standby system selected in the standby system selection process from the server in the predetermined setting state;
A number control program for causing a computer to execute a switching process of operating a standby server generated in the standby server generation process and switching to an active server when the resource usage amount reaches the first threshold.