JP2018041190A

JP2018041190A - Information processing device, control method and program

Info

Publication number: JP2018041190A
Application number: JP2016173461A
Authority: JP
Inventors: 幸雄山川; Yukio Yamakawa
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2016-09-06
Filing date: 2016-09-06
Publication date: 2018-03-15
Anticipated expiration: 2036-09-06
Also published as: JP6891425B2

Abstract

PROBLEM TO BE SOLVED: To suppress an occurrence of a failure based on a time required by the time an increased virtual machine operates when the virtual machine is increased in a system for providing a service with a virtual machine as a server.SOLUTION: The system includes: a request holding part 130 for holding a request for causing an execution server 200 to execute processing: a transmission and reception part 110 for transmitting the request held by the request holding part 130 to the execution server 200; and an execution server management part 160 for managing information of the execution server 200, and increasing a resource of the execution server 200 in the case that a state of the execution server 200 based on a load of processing to a request during execution of the execution server 200 and a predicted load of processing in the execution server 200 when processing to the request held by the request holding part 130 is executed satisfy a predetermined specific condition.SELECTED DRAWING: Figure 2

Description

本発明は、情報処理装置、制御方法およびプログラムに関する。 The present invention relates to an information processing apparatus, a control method, and a program.

ネットワーク上に設定された仮想マシン（仮想サーバ）をサーバとして用い、サービスを提供するシステムが実現されている。この種のシステムにおいて、処理の量に応じて仮想マシンを増加（スケールアウト）させたり、減少（スケールイン）させたりする制御が行われる場合がある。特許文献１には、システムの負荷に応じて仮想マシンを増減させる従来技術が開示されている。同文献に記載された情報処理装置は、監視部、制御部および記憶部を備え、監視部は、管理対象のシステムの負荷をモニタし、制御部は、システムに対して過去に起動指示を行い、かつ、起動処理中である仮想マシンの状況、および、負荷のモニタ結果に応じて、システムにおける仮想マシンの起動指示と起動処理中である仮想マシンの停止指示を行う。 A system that provides a service by using a virtual machine (virtual server) set on a network as a server has been realized. In this type of system, there are cases where control is performed to increase (scale out) or decrease (scale in) a virtual machine according to the amount of processing. Patent Document 1 discloses a conventional technique for increasing or decreasing a virtual machine according to a system load. The information processing apparatus described in the document includes a monitoring unit, a control unit, and a storage unit. The monitoring unit monitors the load of the system to be managed, and the control unit gives a start instruction to the system in the past. In addition, a virtual machine start instruction and a virtual machine stop instruction in the system are issued according to the status of the virtual machine in the start process and the load monitoring result.

特開２０１５−１９４９５８号公報JP-A-2015-194958

仮想マシンをサーバとしてサービスを提供するシステムでは、システムの負荷が増大すると、仮想マシンを増加させて対応することが行われる。しかし、仮想マシンを増加させる操作を完了して追加された仮想マシンによる処理の実行が可能となる（追加された仮想マシンが稼働する）までには、時間を要する。そのため、負荷が増大した際、増加した仮想マシンが稼働するまでの間に、エラーや遅延等の不具合が発生する可能性があった。 In a system that provides a service using a virtual machine as a server, when the load on the system increases, the number of virtual machines is increased. However, it takes time until the added virtual machine can execute processing after the operation for increasing the number of virtual machines is completed (the added virtual machine operates). For this reason, when the load increases, there is a possibility that problems such as errors and delays may occur before the increased virtual machine is operated.

本発明の目的は、仮想マシンをサーバとしてサービスを提供するシステムにおいて、仮想マシンを増加させた際に、増加した仮想マシンが稼働するまでに要する時間に基づく不具合の発生を抑制することにある。 An object of the present invention is to suppress the occurrence of problems based on the time required for an increased virtual machine to operate when the number of virtual machines is increased in a system that provides services using virtual machines as servers.

本発明の請求項１に係る情報処理装置は、
サーバに処理を実行させる要求を保持する要求保持部と、
前記要求保持部に保持された前記要求を前記サーバへ送信する送信部と、
前記サーバの情報を管理し、当該サーバが実行中の前記要求に対する処理の負荷と前記要求保持部に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷とに基づく当該サーバの状態が、予め定められた特定の条件を満足する場合に、前記サーバの資源を増加させるサーバ管理部と、
を備えることを特徴とする、情報処理装置である。
請求項２に係る情報処理装置は、
前記サーバが、仮想サーバであり、前記サーバ管理部は、前記仮想サーバのインスタンスを増加させることを特徴とする、請求項１に記載の情報処理装置である。
請求項３に係る情報処理装置は、
前記サーバ管理部が、前記サーバが実行中の前記要求に対する処理の負荷と前記要求保持部に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷の合計が予め定められた値よりも大きい場合に、前記サーバの資源を増加させることを特徴とする、請求項１または請求項２に記載の情報処理装置である。
請求項４に係る情報処理装置は、
前記サーバ管理部が、前記サーバが実行中の前記要求に対する処理の負荷と前記要求保持部に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷に基づく当該サーバの状態が、前記特定の条件とは異なる予め定められた準備依頼条件を満足する場合に、前記サーバの資源を増加するための準備を依頼することを特徴とする、請求項１乃至請求項３に記載の情報処理装置である。
請求項５に係る情報処理装置は、
前記サーバ管理部が、前記サーバが実行中の前記要求に対する処理の負荷と前記要求保持部に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷に基づく当該サーバの状態が、前記準備依頼条件を満足した後、前記特定の条件および当該準備依頼条件とは異なる予め定められた破棄条件を満足する場合に、前記資源を増加するための準備の依頼の破棄を依頼することを特徴とする、請求項４に記載の情報処理装置である。
請求項６に係る方法は、
要求をサーバへ送信し、当該要求に対する処理をサーバに実行させる制御方法であって、
サーバに処理を実行させる要求を記憶手段に保持するステップと、
当該サーバが実行中の前記要求に対する処理の負荷と前記記憶手段に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷とに基づく当該サーバの状態が、予め定められた特定の条件を満足する場合に、前記サーバの資源を増加させるステップと、
前記記憶手段に保持された前記要求を前記サーバへ送信するステップと、
を含むことを特徴とする、制御方法である。
請求項７に係るプログラムは、
コンピュータを、
サーバに処理を実行させる要求を保持する要求保持手段と、
前記要求保持手段に保持された前記要求を前記サーバへ送信する送信手段と、
前記サーバの情報を管理し、当該サーバが実行中の前記要求に対する処理の負荷と前記要求保持手段に保持されている要求に対する処理が実行される場合に予想される当該サーバにおける処理の負荷とに基づく当該サーバの状態が、予め定められた特定の条件を満足する場合に、前記サーバの資源を増加させるサーバ管理手段として、
機能させることを特徴とする、プログラムである。 An information processing apparatus according to claim 1 of the present invention provides:
A request holding unit for holding a request to cause the server to execute processing;
A transmission unit that transmits the request held in the request holding unit to the server;
The server information is managed, and the processing load on the request being executed by the server and the processing load expected on the server when processing for the request held in the request holding unit is executed. A server management unit that increases the resource of the server when the state of the server based on satisfies a predetermined specific condition;
An information processing apparatus comprising:
An information processing apparatus according to claim 2
The information processing apparatus according to claim 1, wherein the server is a virtual server, and the server management unit increases instances of the virtual server.
An information processing apparatus according to claim 3 is:
The server management unit preliminarily calculates the total of the processing load for the request being executed by the server and the expected processing load for the server when processing for the request held in the request holding unit is executed. 3. The information processing apparatus according to claim 1, wherein when the value is larger than a predetermined value, the resource of the server is increased.
An information processing apparatus according to claim 4 is:
The server based on the processing load on the server that is expected when the server management unit executes processing load on the request being executed by the server and processing on the request held in the request holding unit When the state of the server satisfies a predetermined preparation request condition different from the specific condition, preparation for increasing the resource of the server is requested. It is an information processing apparatus as described in.
An information processing apparatus according to claim 5 is:
The server based on the processing load on the server that is expected when the server management unit executes processing load on the request being executed by the server and processing on the request held in the request holding unit If the state satisfies the preparation request condition and then satisfies the specific condition and a predetermined discard condition different from the preparation request condition, the preparation request for increasing the resource is discarded. The information processing apparatus according to claim 4, wherein the information processing apparatus requests the information processing apparatus.
The method according to claim 6 comprises:
A control method for transmitting a request to a server and causing the server to execute processing for the request,
Holding in the storage means a request to cause the server to execute processing;
The state of the server based on the processing load on the request being executed by the server and the processing load on the server that is expected when the processing on the request held in the storage unit is executed is determined in advance. Increasing the resource of the server if the specified specific condition is satisfied;
Transmitting the request held in the storage means to the server;
It is a control method characterized by including these.
The program according to claim 7 is:
Computer
Request holding means for holding a request to cause the server to execute processing;
Transmitting means for transmitting the request held in the request holding means to the server;
The server information is managed, and the processing load for the request being executed by the server and the processing load expected at the time when the processing for the request held in the request holding unit is executed. As a server management means for increasing the resource of the server when the state of the server based on satisfies a predetermined specific condition,
It is a program characterized by functioning.

請求項１の発明によれば、サーバの負荷が増大して資源の増加が必要になってから資源を増加する構成と比較して、エラーや遅延が生じることなくサーバの処理を実行することができる。
請求項２の発明によれば、仮想サーバの負荷が増大してインスタンスの増加が必要になってからインスタンスを増加する構成と比較して、エラーや遅延が生じることなくサーバの処理を実行することができる。
請求項３の発明によれば、仮想サーバの負荷が増大して資源の増加が必要になってから資源を増加する構成と比較して、保持している要求を実行する際に対応するように資源の増加を依頼することにより、エラーや遅延が生じることなくサーバの処理を実行することができる。
請求項４の発明によれば、仮想サーバの負荷が増大して資源の増加が必要になってから資源を増加する構成と比較して、資源の増加が必要になる前に予め準備を依頼することにより、速やかに資源を増加させることを可能とし、エラーや遅延が生じることなくサーバの処理を実行することができる。
請求項５の発明によれば、仮想サーバの負荷が減少して資源を増加する必要がなくなった場合に、資源の増加の準備を破棄することで、無駄な資源の準備を抑制することができる。
請求項６の発明によれば、要求をサーバへ送信し、当該要求に対する処理をサーバに実行させる制御方法において、サーバの負荷が増大して資源の増加が必要になってから資源を増加する構成と比較して、エラーや遅延が生じることなくサーバの処理を実行することができる。
請求項７の発明によれば、コンピュータを制御するプログラムにおいて、サーバの負荷が増大して資源の増加が必要になってから資源を増加する構成と比較して、エラーや遅延が生じることなくサーバの処理を実行することができる。 According to the first aspect of the present invention, it is possible to execute server processing without causing errors and delays as compared to a configuration in which resources are increased after the load on the server increases and resources need to be increased. it can.
According to the second aspect of the present invention, server processing is executed without causing an error or delay as compared with a configuration in which instances are increased after the load on the virtual server increases and an increase in instances is required. Can do.
According to the third aspect of the present invention, compared to the configuration in which the resources are increased after the load of the virtual server is increased and the resources need to be increased, it is possible to cope with the execution of the held request. By requesting an increase in resources, it is possible to execute server processing without causing errors or delays.
According to the invention of claim 4, preparation is requested in advance before an increase in resources is required, compared to a configuration in which the resources are increased after the load of the virtual server increases and an increase in resources is required. As a result, it is possible to quickly increase resources, and it is possible to execute server processing without causing errors or delays.
According to the invention of claim 5, when the load on the virtual server is reduced and it is no longer necessary to increase the resources, it is possible to suppress the preparation of useless resources by discarding the preparation for increasing the resources. .
According to the invention of claim 6, in the control method for transmitting a request to a server and causing the server to execute a process for the request, a configuration in which the resource is increased after the load on the server increases and the resource needs to be increased. Compared with, server processing can be executed without causing errors or delays.
According to the seventh aspect of the present invention, in the program for controlling the computer, there is no error or delay as compared with the configuration in which the resource is increased after the load of the server increases and the resource needs to be increased. Can be executed.

本実施形態が適用される情報処理システムの構成例を示す図である。It is a figure which shows the structural example of the information processing system to which this embodiment is applied. 管理サーバの機能構成を示す図である。It is a figure which shows the function structure of a management server. 管理サーバの動作を示すフローチャートである。It is a flowchart which shows operation | movement of a management server. 管理サーバの動作を示すフローチャートである。It is a flowchart which shows operation | movement of a management server. 管理サーバを実現するコンピュータ（情報処理装置）のハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of the computer (information processing apparatus) which implement | achieves a management server.

＜本実施形態が適用される情報処理システムの構成＞
図１は、本実施形態が適用される情報処理システムの構成例を示す図である。図１に示すように、本実施形態による情報処理システム１０は、管理サーバ１００と、実行サーバ２００と、クライアント３００とを備える。管理サーバ１００と実行サーバ２００とは、ネットワーク２０を介して接続されている。また、図示の構成において、クライアント３００は、ネットワーク２０を介さずに管理サーバ１００と接続されているが、ネットワーク２０を介して接続してもよい。また、図示の構成において、サービスＡ、サービスＢ、サービスＣをそれぞれ提供する３種類の実行サーバ２００が示されているが、実行サーバ２００の数は利用するサービスに応じて設定され、図示の３種類に限定されない。また、図１には、クライアント３００が１台のみ記載されているが、実際のシステムにおいては複数台のクライアント３００が管理サーバ１００に接続されてもよい。 <Configuration of information processing system to which this embodiment is applied>
FIG. 1 is a diagram illustrating a configuration example of an information processing system to which the present embodiment is applied. As illustrated in FIG. 1, the information processing system 10 according to the present embodiment includes a management server 100, an execution server 200, and a client 300. The management server 100 and the execution server 200 are connected via the network 20. In the illustrated configuration, the client 300 is connected to the management server 100 without going through the network 20, but may be connected through the network 20. In the illustrated configuration, three types of execution servers 200 that provide service A, service B, and service C, respectively, are shown. However, the number of execution servers 200 is set according to the service to be used. It is not limited to the type. Although only one client 300 is illustrated in FIG. 1, a plurality of clients 300 may be connected to the management server 100 in an actual system.

ネットワーク２０は、管理サーバ１００と実行サーバ２００との間で情報通信を行えるものであれば特に限定されず、例えばインターネットやＬＡＮ（Local Area Network）等としてよい。情報通信に用いられる通信回線は、有線であっても無線であっても良い。また、複数のネットワークや通信回線を介して管理サーバ１００と実行サーバ２００とを接続しても良い。特に図示しないが、ネットワーク２０には、ネットワークや通信回線を接続するためのゲートウェイやハブ、無線ＬＡＮにおけるアクセスポイント等の中継装置が適宜設けられる。なお、詳しくは後述するが、本実施形態において、実行サーバ２００は、仮想マシンにより実現される。したがって、実際には、実行サーバ２００が物理的にネットワーク２０に接続されるのではなく、実行サーバ２００が構築されている物理的なサーバマシンがネットワーク２０に接続されている。 The network 20 is not particularly limited as long as information communication can be performed between the management server 100 and the execution server 200. For example, the network 20 may be the Internet or a LAN (Local Area Network). A communication line used for information communication may be wired or wireless. Further, the management server 100 and the execution server 200 may be connected via a plurality of networks or communication lines. Although not particularly illustrated, the network 20 is appropriately provided with a relay device such as a gateway or hub for connecting a network or a communication line, or an access point in a wireless LAN. Although described in detail later, in the present embodiment, the execution server 200 is realized by a virtual machine. Therefore, actually, the execution server 200 is not physically connected to the network 20, but a physical server machine in which the execution server 200 is constructed is connected to the network 20.

管理サーバ１００とクライアント３００とは、図１に示すように、ネットワークを介さずに直接接続してもよいし、ネットワーク２０を介して接続してもよい。また、管理サーバ１００と実行サーバ２００とが接続されたネットワーク２０とは異なるネットワークを介して接続してもよい。具体的な構成の一例としては、管理サーバ１００とクライアント３００とがＬＡＮにより接続され、管理サーバ１００と実行サーバ２００とがインターネットにより接続される構成が考えられる。 As shown in FIG. 1, the management server 100 and the client 300 may be directly connected without using a network, or may be connected through the network 20. The management server 100 and the execution server 200 may be connected via a network different from the network 20 to which the management server 100 and the execution server 200 are connected. As an example of a specific configuration, a configuration in which the management server 100 and the client 300 are connected by a LAN, and the management server 100 and the execution server 200 are connected by the Internet can be considered.

管理サーバ１００は、コンピュータにより実現される。管理サーバ１００は、クライアント３００から要求（リクエスト）を受け付けて、要求に応じた処理を実行する実行サーバ２００へ送信する。そして、実行サーバ２００から返送された応答（レスポンス）を受け付けて、クライアント３００へ送信する。本実施形態の管理サーバ１００は、クライアント３００から受け付けた要求に応じて、複数の実行サーバ２００にアクセスし、処理を実行させることがある。すなわち、受け付けた要求が、複数の処理の実行を要する要求であって、要求された複数の処理が個別の実行サーバ２００により実行される処理である場合、管理サーバ１００は、各処理を実行する実行サーバ２００にアクセスして処理を実行させる。複数の実行サーバ２００に処理を実行させる場合、各処理を順次実行させてもよいし、並列実行が可能な複数の処理に関しては、並列に実行させてもよい。 The management server 100 is realized by a computer. The management server 100 receives a request (request) from the client 300 and transmits the request to the execution server 200 that executes processing according to the request. Then, the response returned from the execution server 200 is received and transmitted to the client 300. The management server 100 of this embodiment may access a plurality of execution servers 200 and execute processing in response to a request received from the client 300. That is, when the received request is a request that requires execution of a plurality of processes, and the requested plurality of processes are processes executed by the individual execution servers 200, the management server 100 executes each process. The execution server 200 is accessed to execute processing. When causing a plurality of execution servers 200 to execute processes, the respective processes may be executed sequentially, or a plurality of processes that can be executed in parallel may be executed in parallel.

具体例として、文字が書かれた画像データに対して、光学的文字認識処理（ＯＣＲ；Optical Character Recognition/Reader）を行い、画像変換処理を行い、ネットワーク上の記憶装置に格納する処理（格納処理）を要求する場合を考える。ここで、光学的文字認識処理、画像変換処理、格納処理の各々は、別個の実行サーバ２００により実行される処理であるものとする。この場合、この要求を受け付けた管理サーバ１００は、まず、光学的文字認識処理を実行する実行サーバ２００にアクセスして画像データを送り、光学的文字認識処理を実行させる。そして、実行サーバ２００から光学的文字認識処理の処理結果を受け取ると、次に管理サーバ１００は、画像変換処理を実行する実行サーバ２００にアクセスして画像データを送り、画像変換処理を実行させる。そして、実行サーバ２００から画像変換処理の処理結果を受け取ると、次に管理サーバ１００は、格納処理を実行する実行サーバ２００にアクセスし、光学的文字認識処理および画像変換処理の処理結果を記憶装置に格納させる。なお、上記の具体例において、例えば、光学的文字認識処理と画像変換処理とが互いに独立した処理である場合、画像変換処理を先に実行させてもよいし、これらの処理を並列に実行してもよい。 As a specific example, optical character recognition processing (OCR: Optical Character Recognition / Reader) is performed on image data in which characters are written, image conversion processing is performed, and the processing is stored in a storage device on the network (storage processing) ). Here, it is assumed that each of the optical character recognition process, the image conversion process, and the storage process is executed by a separate execution server 200. In this case, the management server 100 that has received this request first accesses the execution server 200 that executes the optical character recognition process, sends image data, and executes the optical character recognition process. When receiving the processing result of the optical character recognition process from the execution server 200, the management server 100 next accesses the execution server 200 that executes the image conversion process and sends the image data to execute the image conversion process. When receiving the processing result of the image conversion processing from the execution server 200, the management server 100 next accesses the execution server 200 that executes the storage processing, and stores the processing results of the optical character recognition processing and the image conversion processing in the storage device. To store. In the above specific example, for example, when the optical character recognition process and the image conversion process are independent processes, the image conversion process may be executed first, or these processes may be executed in parallel. May be.

実行サーバ２００は、ネットワーク２０に接続されたコンピュータ上に構築された仮想マシンにより実現される。一つの実行サーバ２００は、１台のコンピュータ上に構築した仮想マシンであってもよいし、複数のコンピュータに機能を分散させて構築した仮想マシンであってもよい。また、１台のコンピュータ上に複数の実行サーバ２００を構築し、複数のサービスを提供する構成としてもよい。サービスの提供における実行サーバ２００の性能（処理能力や記憶容量など）は、サービスの提供者や仮想マシンの提供者により予め設定される。そして、処理を実行する資源としての実行サーバ２００として動作する仮想マシンのインスタンスを増やしたり減らしたりすることにより、サービスの実行能力が変更される。管理サーバ１００から利用可能な実行サーバ２００のインスタンスの数および増減は、実行サーバ２００の管理サイトにより管理される。図１に示す例では、サービスＡ、サービスＢ、サービスＣをそれぞれ提供する３種類の実行サーバ２００が示されており、以下、これらを区別する場合は、提供されるサービスの符号を添え字として用い、実行サーバ２００Ａ、実行サーバ２００Ｂ、実行サーバ２００Ｃ等と記載する。 The execution server 200 is realized by a virtual machine constructed on a computer connected to the network 20. One execution server 200 may be a virtual machine constructed on one computer, or may be a virtual machine constructed by distributing functions to a plurality of computers. A plurality of execution servers 200 may be constructed on one computer to provide a plurality of services. The performance (processing capacity, storage capacity, etc.) of the execution server 200 in providing the service is preset by the service provider or the virtual machine provider. The service execution capability is changed by increasing or decreasing the number of virtual machine instances that operate as the execution server 200 as a resource for executing processing. The number and increase / decrease of execution server 200 instances available from the management server 100 are managed by the management site of the execution server 200. In the example shown in FIG. 1, three types of execution servers 200 that provide service A, service B, and service C, respectively, are shown. In the following, in order to distinguish these, the code of the provided service is used as a subscript. Used, described as execution server 200A, execution server 200B, execution server 200C, and the like.

クライアント３００は、実行サーバ２００により提供されるサービスの利用者が使用する端末装置であり、パーソナルコンピュータ、携帯情報端末、プリンタ複合機、その他の種々の情報処理装置により実現される。 The client 300 is a terminal device used by a user of a service provided by the execution server 200, and is realized by a personal computer, a portable information terminal, a printer multifunction device, and other various information processing devices.

＜管理サーバの機能構成＞
図２は、管理サーバ１００の機能構成を示す図である。図２に示すように、管理サーバ１００は、送受信部１１０と、要求管理部１２０と、要求保持部１３０と、応答保持部１４０と、応答管理部１５０と、実行サーバ管理部１６０とを備える。送受信部１１０は、ネットワーク・インターフェイス（図示せず）を介して実行サーバ２００およびクライアント３００とデータ交換を行う。 <Functional configuration of the management server>
FIG. 2 is a diagram illustrating a functional configuration of the management server 100. As illustrated in FIG. 2, the management server 100 includes a transmission / reception unit 110, a request management unit 120, a request holding unit 130, a response holding unit 140, a response management unit 150, and an execution server management unit 160. The transmission / reception unit 110 exchanges data with the execution server 200 and the client 300 via a network interface (not shown).

要求管理部１２０は、送受信部１１０がクライアント３００から受信した要求を受け付け、要求保持部１３０に保持させる。要求保持部１３０は、送信先の実行サーバ２００ごとに設けられたキューである。図２に示す例では、図１に示した実行サーバ２００Ａ、２００Ｂ、２００Ｃに対応して、サービスＡ用のキュー１３１Ａ、サービスＢ用のキュー１３１Ｂ、サービスＣ用のキュー１３１Ｃが記載されている（以下、各キュー１３１Ａ、１３１Ｂ、１３１Ｃを区別しないときは、単にキュー１３１と記載する）。したがって、要求管理部１２０は、例えば、実行サーバ２００Ａにより提供されるサービスの要求を受け付けると、順次、要求保持部１３０のキュー１３１Ａに保持させる。要求保持部１３０のキュー１３１Ａ、１３１Ｂ、１３１Ｃに保持された要求は、各々保持された順に、送受信部１１０により、対応する実行サーバ２００Ａ、２００Ｂ、２００Ｃへ送信される。 The request management unit 120 receives a request received from the client 300 by the transmission / reception unit 110 and causes the request holding unit 130 to hold the request. The request holding unit 130 is a queue provided for each execution server 200 as a transmission destination. In the example shown in FIG. 2, a service A queue 131A, a service B queue 131B, and a service C queue 131C are described corresponding to the execution servers 200A, 200B, and 200C shown in FIG. Hereinafter, when the queues 131A, 131B, and 131C are not distinguished, they are simply referred to as queues 131). Therefore, for example, when receiving a request for a service provided by the execution server 200A, the request management unit 120 sequentially holds the request in the queue 131A of the request holding unit 130. The requests held in the queues 131A, 131B, and 131C of the request holding unit 130 are transmitted to the corresponding execution servers 200A, 200B, and 200C by the transmission / reception unit 110 in the order in which they are held.

また、要求管理部１２０が受け付けた要求が、一つの実行サーバ２００（例えば、実行サーバ２００Ａ）による処理結果に対して他の実行サーバ２００（例えば、実行サーバ２００Ｂ）による処理の実行を要求するものである場合を考える。この場合、要求管理部１２０は、実行サーバ２００Ａから返送された処理結果を新たな要求として、キュー１３１Ｂに保持させる。このようにして、一つの要求に対する複数の実行サーバ２００による処理が順次実行される。 Further, the request received by the request management unit 120 requests execution of processing by another execution server 200 (for example, execution server 200B) with respect to the processing result by one execution server 200 (for example, execution server 200A). Consider the case. In this case, the request management unit 120 holds the processing result returned from the execution server 200A as a new request in the queue 131B. In this way, processing by a plurality of execution servers 200 for one request is sequentially executed.

要求保持部１３０は、上述したように、要求の送信先である実行サーバ２００ごとに設けられたキュー１３１である。本実施形態では、クライアント３００から受け付けた要求を要求保持部１３０に一時的に保持させた後、送信先の実行サーバ２００ごとに順次送信する。このように、実行サーバ２００へ送る要求を一時的に保持することにより、本実施形態は、受け付けた要求の量が増加したときに、直ちに実行サーバ２００の処理能力を超えてしまう事態が発生することを回避する。また、本実施形態は、要求保持部１３０に保持された要求の量を監視することにより、現在の実行サーバ２００の負荷だけでなく、これから実行サーバ２００へ依頼する要求の量に基づいて予想される実行サーバ２００の負荷に基づき、実行サーバ２００のインスタンスの増加を依頼する。本実施形態における実行サーバ２００のインスタンスの増加を依頼する機能の詳細については後述する。 As described above, the request holding unit 130 is a queue 131 provided for each execution server 200 that is a transmission destination of a request. In the present embodiment, the request received from the client 300 is temporarily stored in the request storage unit 130 and then sequentially transmitted to each execution server 200 as a transmission destination. As described above, by temporarily holding a request to be sent to the execution server 200, in the present embodiment, when the amount of received requests increases, a situation in which the processing capacity of the execution server 200 is immediately exceeded occurs. Avoid that. In addition, the present embodiment is predicted based on not only the current load on the execution server 200 but also the amount of requests to be requested from the execution server 200 by monitoring the amount of requests held in the request holding unit 130. The execution server 200 is requested to increase the number of instances based on the load of the execution server 200. Details of the function for requesting an increase in the number of instances of the execution server 200 in this embodiment will be described later.

応答保持部１４０は、送受信部１１０が実行サーバ２００から受信した応答（処理結果）を保持するキューである。応答保持部１４０のキューは、上述した要求保持部１３０のキューと異なり、実行サーバ２００ごとに設けられるのではなく単一である。すなわち、送受信部１１０により受信した応答は、いずれの実行サーバ２００から送信されたかに関わらず、受信した順番に応答保持部１４０に保持される。 The response holding unit 140 is a queue that holds the response (processing result) received from the execution server 200 by the transmission / reception unit 110. Unlike the queue of the request holding unit 130 described above, the queue of the response holding unit 140 is not provided for each execution server 200 but is single. That is, the responses received by the transmission / reception unit 110 are held in the response holding unit 140 in the order received, regardless of which execution server 200 has transmitted the response.

応答管理部１５０は、応答保持部１４０に保持された応答を順次読み出し、処理する。ここでは、応答が要求を発行したクライアント３００に返送されるべきものであるならば、応答管理部１５０は、送受信部１１０により、その応答をクライアント３００へ返送させる。また、応答が次の実行サーバ２００による処理の対象であるならば、応答管理部１５０は、その応答を要求管理部１２０に渡し、要求管理部１２０が、その要求を、次の処理を行う実行サーバ２００に対応する要求保持部１３０のキュー１３１Ａ、１３１Ｂ、１３１Ｃに保持させる。 The response management unit 150 sequentially reads and processes the responses held in the response holding unit 140. Here, if the response is to be returned to the client 300 that issued the request, the response management unit 150 causes the transmission / reception unit 110 to return the response to the client 300. If the response is a target of processing by the next execution server 200, the response management unit 150 passes the response to the request management unit 120, and the request management unit 120 executes the request for the next processing. The queues 131A, 131B, and 131C of the request holding unit 130 corresponding to the server 200 are held.

本実施形態は、応答管理部１５０が応答保持部１４０を管理することにより、実行サーバ２００の動作状況を把握する。すなわち、応答管理部１５０は、要求管理部１２０から実行サーバ２００へ送った要求の情報を取得し、実行サーバ２００へ送った要求と実行サーバ２００から受け取った応答との対応関係に基づいて、実行サーバ２００における要求の実行状況を把握する。また、実行サーバ２００のインスタンスの増減を行った場合は、実行サーバ２００の動作状況を把握することにより、インスタンスの増減に基づく処理能力の変化（インスタンスを増加または減少した後の処理能力）を認識する。 In the present embodiment, the response management unit 150 manages the response holding unit 140 to grasp the operation status of the execution server 200. That is, the response management unit 150 acquires information on the request sent from the request management unit 120 to the execution server 200, and executes the execution based on the correspondence between the request sent to the execution server 200 and the response received from the execution server 200. The execution status of the request in the server 200 is grasped. Further, when the number of instances of the execution server 200 is increased / decreased, the change in the processing capacity based on the increase / decrease of the instance (processing capacity after increasing or decreasing the instance) is recognized by grasping the operation status of the execution server 200 To do.

実行サーバ管理部１６０は、各実行サーバ２００のサービスの提供における性能の情報を管理する。実行サーバ２００は、仮想マシンのインスタンスとして構成されるため、インスタンスを増減することにより性能が変更する。そこで、実行サーバ管理部１６０は、各サービスの実行サーバ２００ごとに、インスタンス単位の性能の情報と、稼働しているインスタンスの数とを管理する。なお、インスタンス単位の性能の情報は、例えば予め、各実行サーバ２００のサービスの提供者や仮想マシンの提供者から取得し、管理サーバ１００の記憶装置（図示せず）に保持しておく。 The execution server management unit 160 manages information on the performance of each execution server 200 in providing the service. Since the execution server 200 is configured as an instance of a virtual machine, the performance is changed by increasing or decreasing the number of instances. Therefore, the execution server management unit 160 manages performance information for each instance and the number of running instances for each execution server 200 of each service. Note that the performance information for each instance is acquired in advance from a service provider or a virtual machine provider of each execution server 200 and stored in a storage device (not shown) of the management server 100, for example.

また、実行サーバ管理部１６０は、各実行サーバ２００の負荷に応じて、実行サーバ２００のインスタンスを増加（スケールアウト）、減少（スケールイン）させる。このインスタンスの増減は、実行サーバ２００のオートスケール機能を利用して行ってもよいし、実行サーバ２００の管理サイトに依頼して行ってもよい。実行サーバ２００のオートスケール機能を利用する場合、実行サーバ管理部１６０は、オートスケール機能によりインスタンスの増加または減少を実行させるコマンドを、実行サーバ２００に送信する。以下、具体的な手法に関わらず、実行サーバ２００のインスタンスの増加（または減少）を実行させるための操作を行うことを、実行サーバ２００のインスタンスの増加（または減少）を依頼する等と記載する。 Further, the execution server management unit 160 increases (scales out) and decreases (scales in) instances of the execution server 200 according to the load of each execution server 200. The increase / decrease of the instances may be performed using the autoscale function of the execution server 200 or may be performed by requesting the management site of the execution server 200. When using the auto-scaling function of the execution server 200, the execution server management unit 160 transmits a command for executing an increase or decrease in the instance to the execution server 200 by the auto-scaling function. Hereinafter, regardless of a specific method, performing an operation for executing an increase (or decrease) in the instance of the execution server 200 is referred to as requesting an increase (or decrease) in the instance of the execution server 200. .

さらに、実行サーバ管理部１６０は、実行サーバ２００のインスタンスの負荷が増大する可能性が高いとして予め定められた条件を満たす場合に、実行サーバ２００のインスタンスを増加させる操作の準備を行うように、実行サーバ２００の管理サイトに依頼（準備依頼）する。準備依頼は、実際に増加することが必要になったときに、追加されたインスタンスが直ちに稼働できるように予め準備するように、実行サーバ２００の管理サイトに依頼するものである。具体的にどのような準備をするかは、実行サーバ２００の管理サイトに任されており、サービス内容等に応じて実行サーバ２００ごとに定めてよい。また、実際にインスタンスを増加させる依頼が行われたときに、直ちに稼働できるサーバであれば、事前の準備のための操作は特に行う必要はない。 Further, the execution server management unit 160 prepares an operation for increasing the number of instances of the execution server 200 when a predetermined condition is satisfied that the load of the instance of the execution server 200 is likely to increase. A request (preparation request) is made to the management site of the execution server 200. The preparation request is for requesting the management site of the execution server 200 to prepare in advance that the added instance can be immediately operated when it is necessary to actually increase the number. The specific preparation is left to the management site of the execution server 200, and may be determined for each execution server 200 according to the service content and the like. Further, if the server can be operated immediately when a request to increase the number of instances is actually made, there is no need to perform an operation for preparing in advance.

＜実行サーバのインスタンスの増減の依頼＞
上述したように、実行サーバ管理部１６０は、現在の実行サーバ２００の負荷およびこれから実行サーバ２００へ依頼する要求の量に基づいて予想される実行サーバ２００の負荷に基づいて、実行サーバ２００に対し、インスタンスの増加を依頼する。現在の実行サーバ２００の負荷とは、実行サーバ２００が処理を実行中の要求を処理するために要する負荷（以下、現在負荷と呼ぶ）である。これから実行サーバ２００へ依頼する要求の量に基づいて予想される実行サーバ２００の負荷とは、要求保持部１３０において該当する実行サーバ２００用のキュー１３１に保持されている要求を処理するために要する負荷（以下、予想負荷と呼ぶ）である。本実施形態において、実行サーバ管理部１６０は、現在負荷と予想負荷を加算した負荷（以下、合計負荷と呼ぶ）の、実行サーバ２００において同時に実行可能な処理の量（以下、同時処理能力と呼ぶ）に対する割合に基づいて、インスタンスの増減を依頼する。 <Request for increase / decrease of execution server instance>
As described above, the execution server management unit 160 applies the execution server 200 to the execution server 200 based on the current load on the execution server 200 and the expected load on the execution server 200 based on the amount of requests to be requested from the execution server 200. , Request an increase in instances. The current load of the execution server 200 is a load required for the execution server 200 to process a request being executed (hereinafter referred to as a current load). The expected load of the execution server 200 based on the amount of requests to be requested to the execution server 200 from now on is required for processing the requests held in the queue 131 for the corresponding execution server 200 in the request holding unit 130. Load (hereinafter referred to as expected load). In the present embodiment, the execution server management unit 160 is an amount of processing that can be simultaneously executed in the execution server 200 (hereinafter, referred to as simultaneous processing capability) of a load obtained by adding the current load and the expected load (hereinafter referred to as total load). ) Request increase / decrease of instances based on the ratio to.

具体的な例を挙げて説明する。ある実行サーバ２００において、「要求ａ」、「要求ｂ」、「要求ｃ」の３種類の要求に対する処理を実行可能であるとする。そして、実行サーバ２００の同時処理能力を１００とすると、「要求ａ」に対する処理を実行するのに要する実行サーバ２００の同時処理能力は２０であるものとする。これは、「要求ａ」に対する処理を実行する際に、実行サーバ２００の同時処理能力の２０％が使用されることを意味する。同様に、「要求ｂ」に対する処理を実行するのに要する実行サーバ２００の同時処理能力は１０、「要求ｃ」に対する処理を実行するのに要する実行サーバ２００の同時処理能力は５であるものとする。 A specific example will be described. Assume that a certain execution server 200 can execute processing for three types of requests “request a”, “request b”, and “request c”. Then, assuming that the simultaneous processing capability of the execution server 200 is 100, the simultaneous processing capability of the execution server 200 required to execute the processing for “request a” is assumed to be 20. This means that 20% of the simultaneous processing capability of the execution server 200 is used when executing the processing for “request a”. Similarly, the simultaneous processing capability of the execution server 200 required for executing the processing for “request b” is 10, and the simultaneous processing capability of the execution server 200 required for executing the processing for “request c” is 5. To do.

また、一例として、実行サーバ２００のインスタンスの増減に関する規則を、次のように設定する。
（１）稼働中の実行サーバ２００のインスタンスの全てにおいて、合計負荷が、１００％を超えたときは、インスタンスを１つ増加させる。
（２）稼働中の実行サーバ２００のインスタンスの数をＮ（Ｎは２以上）としたとき、合計負荷が、（Ｎ−２）台のインスタンスで１００％であり、１台のインスタンスで０％であり、残りの１台のインスタンスで８０％を下回ったときは、インスタンスを１つ減少させる。
（３）稼働中の実行サーバ２００のインスタンスの数をＭ（Ｍは１以上）としたとき、合計負荷が、（Ｍ−１）台のインスタンスで１００％であり、１台のインスタンスで８０％以上となったときは、インスタンスを増加するための準備依頼を行う。 Also, as an example, the rules regarding increase / decrease of instances of the execution server 200 are set as follows.
(1) When the total load exceeds 100% in all the instances of the active execution server 200, the number of instances is increased by one.
(2) When the number of running execution server 200 instances is N (N is 2 or more), the total load is 100% for (N-2) instances and 0% for one instance. When the remaining one instance falls below 80%, the instance is decreased by one.
(3) When the number of running execution server 200 instances is M (M is 1 or more), the total load is 100% for (M-1) instances and 80% for one instance. When this happens, make a preparation request to increase the number of instances.

簡単のために稼働中のインスタンスを１台として、負荷の計算例を示す。第１の計算例として、実行サーバ２００において、１個の「要求ａ」に対する処理、２個の「要求ｂ」に対する処理、３個の「要求ｃ」に対する処理が実行されているものとする。そして、要求保持部１３０のキュー１３１に、１個の「要求ｂ」が保持されているものとする。この場合、上記の各要求における負荷（要求に対する処理を実行するのに要する同時処理能力）に基づき、合計負荷は、
２０×１＋１０×２＋５×３＋１０×１＝６５
である。合計負荷が６５なので、上記の規則に基づき、インスタンスの増減に関する操作は行われない。 For the sake of simplicity, an example of load calculation will be shown with one running instance. As a first calculation example, it is assumed that the execution server 200 is executing processing for one “request a”, processing for two “request b”, and processing for three “request c”. It is assumed that one “request b” is held in the queue 131 of the request holding unit 130. In this case, based on the load in each request (simultaneous processing capability required to execute processing for the request), the total load is
20 × 1 + 10 × 2 + 5 × 3 + 10 × 1 = 65
It is. Since the total load is 65, no operation relating to increase / decrease of instances is performed based on the above rules.

第２の計算例として、実行サーバ２００において、１個の「要求ａ」に対する処理、２個の「要求ｂ」に対する処理、６個の「要求ｃ」に対する処理が実行されているものとする。そして、要求保持部１３０のキュー１３１に、１個の「要求ｂ」が保持されているものとする。この場合、合計負荷は、
２０×１＋１０×２＋５×６＋１０×１＝８０
である。合計負荷が８０なので、上記の規則に基づき、実行サーバ管理部１６０は、実行サーバ２００の管理サイトにインスタンスを増加させるための準備依頼を行う。 As a second calculation example, it is assumed that the execution server 200 is executing processing for one “request a”, processing for two “request b”, and processing for six “request c”. It is assumed that one “request b” is held in the queue 131 of the request holding unit 130. In this case, the total load is
20 × 1 + 10 × 2 + 5 × 6 + 10 × 1 = 80
It is. Since the total load is 80, the execution server management unit 160 makes a preparation request for increasing the number of instances to the management site of the execution server 200 based on the above rules.

第３の計算例として、実行サーバ２００において、２個の「要求ａ」に対する処理、３個の「要求ｂ」に対する処理、５個の「要求ｃ」に対する処理が実行されているものとする。そして、要求保持部１３０のキュー１３１に、１個の「要求ｂ」が保持されているものとする。この場合、合計負荷は、
２０×２＋１０×３＋５×５＋１０×１＝１０５
である。合計負荷が１０５で、１００を上回ったので、上記の規則に基づき、実行サーバ管理部１６０は、実行サーバ２００の管理サイトにインスタンスの増加を依頼する。 As a third calculation example, it is assumed that the execution server 200 is executing processing for two “requests a”, processing for three “requests b”, and processing for five “requests c”. It is assumed that one “request b” is held in the queue 131 of the request holding unit 130. In this case, the total load is
20 × 2 + 10 × 3 + 5 × 5 + 10 × 1 = 105
It is. Since the total load is 105 and exceeds 100, the execution server management unit 160 requests the management site of the execution server 200 to increase the number of instances based on the above rules.

第４の計算例では、既にインスタンスの増加が１回行われており、２台のインスタンスが稼働中である場合について例示する。２台のインスタンスのうち、１台のインスタンスでは処理が実行されておらず、負荷が０％であるものとする。また、残りの１台のインスタンスにおいて、２個の「要求ａ」に対する処理、３個の「要求ｂ」に対する処理、３個の「要求ｃ」に対する処理が実行されており、要求保持部１３０のキュー１３１には新規の要求が保持されていないものとする。そして、実行サーバ２００において実行されていた処理のうち、１個の「要求ｂ」に対する処理が完了し、処理結果の応答が返送されて応答保持部１４０に保持されたものとする。この場合、合計負荷は、
２０×２＋１０×３＋５×３−１０×１＝７５
である。合計負荷が７５で、８０を下回ったので、上記の規則に基づき、実行サーバ管理部１６０は、実行サーバ２００の管理サイトにインスタンスの減少を依頼する。 The fourth calculation example illustrates a case where the number of instances has already been increased once and two instances are operating. It is assumed that processing is not executed in one instance of the two instances, and the load is 0%. In the remaining one instance, processing for two “request a”, processing for three “request b”, and processing for three “request c” are executed. It is assumed that no new request is held in the queue 131. Then, it is assumed that the processing for one “request b” among the processing executed in the execution server 200 is completed, and a response as a processing result is returned and held in the response holding unit 140. In this case, the total load is
20 × 2 + 10 × 3 + 5 × 3-10 × 1 = 75
It is. Since the total load is 75, which is below 80, the execution server management unit 160 requests the management site of the execution server 200 to reduce the number of instances based on the above rules.

ここで、上記の第３の計算例を参照すると、現在負荷は、９５（＝２０×２＋１０×３＋５×５）であり、まだ、実行サーバ２００の同時処理能力の１００％に達していない。しかし、これから要求保持部１３０に保持されている要求が実行されると実行サーバ２００の同時処理能力の１００％を超えるため、実行サーバ管理部１６０は、この時点で実行サーバ２００のインスタンスの増加を依頼する。これにより、実際に要求保持部１３０に保持されている要求が実行サーバ２００に送られて処理が実行される時点ではインスタンスが増加されており、エラーや遅延が生じることなく処理が実行される。 Here, referring to the third calculation example, the current load is 95 (= 20 × 2 + 10 × 3 + 5 × 5), and has not yet reached 100% of the simultaneous processing capability of the execution server 200. However, when a request held in the request holding unit 130 is executed from now on, it exceeds 100% of the simultaneous processing capability of the execution server 200. Therefore, the execution server management unit 160 increases the number of instances of the execution server 200 at this time. Ask. As a result, the instances are increased at the time when the request actually held in the request holding unit 130 is sent to the execution server 200 and the process is executed, and the process is executed without causing an error or a delay.

また、上記の第２の計算例を参照すると、実行サーバ管理部１６０は、合計負荷が８０に達した時点で、実行サーバ２００のインスタンスを増加するための準備を依頼する。これにより、インスタンスの増加に時間を要するサービスが行われる実行サーバ２００においては、予めインスタンスの増加の準備を行っておき、この後に要求が増えてインスタンスの増加が必要となった際に、直ちに追加のインスタンスを稼働して対応することが可能となる。 Referring to the second calculation example, the execution server management unit 160 requests preparation for increasing the number of instances of the execution server 200 when the total load reaches 80. As a result, in the execution server 200 in which a service that takes time to increase the instance is performed, preparation for increasing the instance is performed in advance, and when an increase in the number of requests becomes necessary after that, the instance is immediately added. It is possible to respond by running an instance of.

なお、上記のインスタンスの増減に関する各規則、各要求の負荷や計算例に示す数値などは、全て例示であり、上記の具体的な内容や値に限定されるものではない。実行サーバ２００が要求に対する処理を実行する際の負荷は、実際のシステムにおいて具体的に特定される。また、インスタンスの増減に関する規則は、少なくともインスタンスを増加する依頼を行うための基準として現在負荷と予想負荷とを考慮するものであればよく、具体的には、実行サーバ２００の仕様や情報処理システム１０の運用の態様等に応じて設定される。 It should be noted that each rule relating to the increase / decrease of the instance, the load of each request, and the numerical values shown in the calculation examples are all examples, and are not limited to the specific contents and values described above. The load when the execution server 200 executes the processing for the request is specifically specified in the actual system. Further, the rules regarding the increase / decrease of the instance only need to consider the current load and the expected load as a reference for making a request to increase the instance, and specifically, the specification of the execution server 200 and the information processing system It is set according to 10 operation modes and the like.

＜管理サーバの動作＞
図３、４は、管理サーバ１００の動作を示すフローチャートである。図３に示すように、管理サーバ１００において、送受信部１１０を介して要求管理部１２０が、クライアント３００から要求を受け付けると（ステップ３０１）、受け付けた要求の内容に基づき、その要求を処理するサービス（実行サーバ２００）を決定する（ステップ３０２）。 <Operation of management server>
3 and 4 are flowcharts showing the operation of the management server 100. As shown in FIG. 3, in the management server 100, when the request management unit 120 receives a request from the client 300 via the transmission / reception unit 110 (step 301), a service that processes the request based on the content of the received request (Execution server 200) is determined (step 302).

次に、実行サーバ管理部１６０が、ステップ３０２で決定された実行サーバ２００の性能の情報を記憶装置から読み出して取得し（ステップ３０３）、インスタンスの増加の準備依頼を行う条件（準備依頼条件）を満足するか否かを判断する（ステップ３０４）。準備依頼条件としては、例えば、上述した合計負荷に基づく規則（規則（３）参照）等が設定される。準備依頼条件を満足しない場合（ステップ３０４でＮｏ）、要求管理部１２０による要求の実行依頼が行われる（ステップ３１１）。 Next, the execution server management unit 160 reads out and acquires the performance information of the execution server 200 determined in step 302 from the storage device (step 303), and a condition for making a preparation request for an instance increase (preparation request condition) Is judged (step 304). As the preparation request condition, for example, a rule based on the total load described above (see rule (3)) is set. When the preparation request condition is not satisfied (No in step 304), a request execution request is made by the request management unit 120 (step 311).

一方、準備依頼条件を満足する場合（ステップ３０４でＹｅｓ）、次に実行サーバ管理部１６０は、既に準備依頼を行っているか否かを判断する（ステップ３０５）。まだ準備依頼が行われていない場合（ステップ３０５でＮｏ）、実行サーバ管理部１６０は、実行サーバ２００の管理サイトに対して準備依頼を実行する（ステップ３０６）。そして、要求管理部１２０による要求の実行依頼が行われる（ステップ３１１）。 On the other hand, when the preparation request condition is satisfied (Yes in step 304), the execution server management unit 160 determines whether or not a preparation request has already been made (step 305). If a preparation request has not been made (No in step 305), the execution server management unit 160 executes a preparation request for the management site of the execution server 200 (step 306). Then, a request execution request is made by the request management unit 120 (step 311).

これに対し、既に準備依頼が行われている場合（ステップ３０５でＹｅｓ）、次に実行サーバ管理部１６０は、準備依頼を破棄する条件（破棄条件）を満足するか否かを判断する（図４、ステップ３０７）。例えば、準備依頼条件を満足しなくなったときは、破棄条件を満足すると設定することができる。この場合、上述した合計負荷に基づく規則（３）を準備依頼条件とすると、合計負荷が８０％を下回ったとき、破棄条件を満足することになる。破棄条件を満足した場合（ステップ３０７でＹｅｓ）、実行サーバ管理部１６０は、実行サーバ２００の管理サイトに対して準備依頼の破棄を依頼する（ステップ３１０）。そして、要求管理部１２０による要求の実行依頼が行われる（ステップ３１１）。なお、ここでは、準備依頼条件を満足しなくなったときに破棄条件を満足することとしたが、準備依頼条件とは別に破棄条件を設定しても良い。 On the other hand, if a preparation request has already been made (Yes in step 305), the execution server management unit 160 next determines whether or not a condition for discarding the preparation request (discard condition) is satisfied (see FIG. 4, Step 307). For example, when the preparation request condition is not satisfied, it can be set that the discard condition is satisfied. In this case, if the rule (3) based on the total load described above is set as the preparation request condition, the discard condition is satisfied when the total load falls below 80%. If the discard condition is satisfied (Yes in Step 307), the execution server management unit 160 requests the management site of the execution server 200 to discard the preparation request (Step 310). Then, a request execution request is made by the request management unit 120 (step 311). Here, the discard condition is satisfied when the preparation request condition is no longer satisfied, but a discard condition may be set separately from the preparation request condition.

一方、破棄条件を満足しない場合（ステップ３０７でＮｏ）、上記の破棄条件の例では、準備依頼条件を満足した状態が継続している場合、次に実行サーバ管理部１６０は、実行サーバ２００のインスタンスを増加させるための条件（増加条件）を満足するか否かを判定する（ステップ３０８）。増加条件としては、例えば、上述した合計負荷に基づく規則（規則（１）参照）等が設定される。増加条件を満足しない場合（ステップ３０８でＮｏ）、要求管理部１２０による要求の実行依頼が行われる（ステップ３１１）。 On the other hand, if the discard condition is not satisfied (No in step 307), in the example of the discard condition, if the state where the preparation request condition is satisfied continues, then the execution server management unit 160 It is determined whether or not a condition for increasing the instance (increase condition) is satisfied (step 308). As the increase condition, for example, a rule based on the total load described above (see rule (1)) is set. If the increase condition is not satisfied (No in Step 308), the request management unit 120 requests execution of the request (Step 311).

一方、増加条件を満足する場合（ステップ３０８でＹｅｓ）、実行サーバ管理部１６０は、実行サーバ２００のインスタンスの増加を依頼する（ステップ３０９）。そして、要求管理部１２０による要求の実行依頼が行われる（ステップ３１１）。 On the other hand, when the increase condition is satisfied (Yes in Step 308), the execution server management unit 160 requests an increase in the number of instances of the execution server 200 (Step 309). Then, a request execution request is made by the request management unit 120 (step 311).

ステップ３１１において、要求管理部１２０は、要求に対する処理を実行する実行サーバ２００に対応する要求保持部１３０のキュー１３１に要求を保持させる。そして、キュー１３１に保持されている要求を、順次、実行サーバ２００へ送る。要求が実行サーバ２００へ送られると、ステップ３１１におけるその要求に対する実行依頼が完了する。 In step 311, the request management unit 120 holds the request in the queue 131 of the request holding unit 130 corresponding to the execution server 200 that executes processing for the request. The requests held in the queue 131 are sequentially sent to the execution server 200. When the request is sent to the execution server 200, the execution request for the request in step 311 is completed.

ステップ３１１における要求の実行依頼が完了すると、次に、実行サーバ管理部１６０は、実行サーバ２００のインスタンスを減少させるための条件（減少条件）を満足するか否かを判定する（ステップ３１２）。減少条件としては、例えば、上述した合計負荷に基づく規則（規則（２）参照）等が設定される。減少条件を満足する場合（ステップ３１２でＹｅｓ）、実行サーバ管理部１６０が実行サーバ２００のインスタンスの減少を依頼する（ステップ３１３）。そして、この要求に関する処理が終了する。一方、減少条件を満足しない場合（ステップ３１２でＮｏ）、インスタンスを減少させる必要がないので、そのままこの要求に関する処理が終了する。 When the request execution request in step 311 is completed, the execution server management unit 160 next determines whether or not a condition (decrease condition) for reducing the instances of the execution server 200 is satisfied (step 312). As the reduction condition, for example, a rule based on the total load described above (see rule (2)) is set. When the reduction condition is satisfied (Yes in Step 312), the execution server management unit 160 requests reduction of the instance of the execution server 200 (Step 313). Then, the processing related to this request ends. On the other hand, if the decrease condition is not satisfied (No in step 312), there is no need to decrease the instance, and the process related to this request is terminated as it is.

以上、本実施形態の管理サーバ１００による実行サーバ２００の制御方法について説明したが、上記の手順は例示に過ぎない。本実施形態は、管理サーバ１００が実行サーバ２００において実行中の要求および要求保持部１３０に保持されている要求に基づいて実行サーバ２００のインスタンスの増加や増加の準備を依頼するものであればよく、具体的な動作は、図３、４に示す手順に限定されるものではない。 The method for controlling the execution server 200 by the management server 100 of the present embodiment has been described above, but the above procedure is merely an example. In the present embodiment, the management server 100 may request increase of the execution server 200 or preparation for increase based on the request being executed in the execution server 200 and the request held in the request holding unit 130. The specific operation is not limited to the procedure shown in FIGS.

＜管理サーバ１００のハードウェア構成例＞
図５は、管理サーバ１００を実現するコンピュータ（情報処理装置）のハードウェア構成例を示す図である。図５に示すように、管理サーバ１００は、制御手段および演算手段であるＣＰＵ（Central Processing Unit）１０１と、ＲＡＭ１０２およびＲＯＭ１０３と、外部記憶装置１０４と、ネットワーク・インターフェイス部１０５とを備える。ＣＰＵ１０１は、ＲＯＭ１０３に格納されているプログラムを実行することにより、要求管理部１２０、応答管理部１５０および実行サーバ管理部１６０の機能を実現する。ＲＡＭ１０２は、ＣＰＵ２０１による制御や演算処理において作業メモリとして用いられるとともに、要求保持部１３０（キュー１３１）および応答保持部１４０として機能する。ＲＯＭ１０３は、ＣＰＵ１０１が実行するプログラムや制御において用いられる各種のデータを格納している。外部記憶装置１０４は、例えば磁気ディスク装置や、データの読み書きが可能で不揮発性の半導体メモリで実現され、ＲＡＭ１０２に展開されてＣＰＵ１０１により実行されるプログラムや、ＣＰＵ１０１による演算処理の結果を格納する。ネットワーク・インターフェイス部１０５は、ネットワーク２０に接続して、実行サーバ２００やクライアント３００との間でデータの送受信を行う。なお、図５に示す構成例は、管理サーバ１００をコンピュータで実現するハードウェア構成の一例に過ぎない。管理サーバ１００の具体的構成は、本実施形態の機能を実現し得るものであれば、図５に示す構成例に限定されない。 <Example of Hardware Configuration of Management Server 100>
FIG. 5 is a diagram illustrating a hardware configuration example of a computer (information processing apparatus) that implements the management server 100. As shown in FIG. 5, the management server 100 includes a central processing unit (CPU) 101 that is a control unit and a calculation unit, a RAM 102 and a ROM 103, an external storage device 104, and a network interface unit 105. The CPU 101 implements the functions of the request management unit 120, the response management unit 150, and the execution server management unit 160 by executing a program stored in the ROM 103. The RAM 102 is used as a work memory in the control and arithmetic processing by the CPU 201 and functions as the request holding unit 130 (queue 131) and the response holding unit 140. The ROM 103 stores programs executed by the CPU 101 and various data used in control. The external storage device 104 is realized by, for example, a magnetic disk device or a non-volatile semiconductor memory that can read and write data, and stores a program that is expanded in the RAM 102 and executed by the CPU 101, and a result of arithmetic processing by the CPU 101. The network interface unit 105 is connected to the network 20 and transmits / receives data to / from the execution server 200 and the client 300. The configuration example illustrated in FIG. 5 is merely an example of a hardware configuration that implements the management server 100 with a computer. The specific configuration of the management server 100 is not limited to the configuration example illustrated in FIG. 5 as long as the functions of the present embodiment can be realized.

１０…情報処理システム、２０…ネットワーク、１００…管理サーバ、１１０…送受信部、１２０…要求管理部、１３０…要求保持部、１４０…応答保持部、１５０…応答管理部、１６０…実行サーバ管理部 DESCRIPTION OF SYMBOLS 10 ... Information processing system 20 ... Network 100 ... Management server 110 ... Transmission / reception part 120 ... Request management part 130 ... Request holding part 140 ... Response holding part 150 ... Response management part 160 ... Execution server management part

Claims

A request holding unit for holding a request to cause the server to execute processing;
A transmission unit that transmits the request held in the request holding unit to the server;
The server information is managed, and the processing load on the request being executed by the server and the processing load expected on the server when processing for the request held in the request holding unit is executed. A server management unit that increases the resource of the server when the state of the server based on satisfies a predetermined specific condition;
An information processing apparatus comprising:

The information processing apparatus according to claim 1, wherein the server is a virtual server, and the server management unit increases the number of instances of the virtual server.

The server management unit preliminarily calculates a total of a processing load for the request being executed by the server and a processing load in the server that is expected when a process for the request held in the request holding unit is executed. The information processing apparatus according to claim 1, wherein when the value is larger than a predetermined value, the resource of the server is increased.

The server management unit is based on the processing load on the request being executed by the server and the processing load on the server that is expected when the processing on the request held in the request holding unit is executed. When the state of the server satisfies a predetermined preparation request condition different from the specific condition, preparation for increasing the resource of the server is requested. The information processing apparatus described in 1.

The server management unit is based on the processing load on the request being executed by the server and the processing load on the server that is expected when the processing on the request held in the request holding unit is executed. If the state satisfies the preparation request condition and then satisfies the specific condition and a predetermined discard condition different from the preparation request condition, the preparation request for increasing the resource is discarded. The information processing apparatus according to claim 4, wherein the information processing apparatus requests the information processing apparatus.

A control method for transmitting a request to a server and causing the server to execute processing for the request,
Holding in the storage means a request to cause the server to execute processing;
The state of the server based on the processing load on the request being executed by the server and the processing load on the server that is expected when the processing on the request held in the storage unit is executed is determined in advance. Increasing the resource of the server if the specified specific condition is satisfied;
Transmitting the request held in the storage means to the server;
The control method characterized by including.

Computer
Request holding means for holding a request to cause the server to execute processing;
Transmitting means for transmitting the request held in the request holding means to the server;
The server information is managed, and the processing load for the request being executed by the server and the processing load expected at the time when the processing for the request held in the request holding unit is executed. As a server management means for increasing the resource of the server when the state of the server based on satisfies a predetermined specific condition,
A program characterized by functioning.