JP3964896B2

JP3964896B2 - Resource allocation apparatus and resource allocation method

Info

Publication number: JP3964896B2
Application number: JP2004288913A
Authority: JP
Inventors: 健士門之園
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2004-09-30
Filing date: 2004-09-30
Publication date: 2007-08-22
Anticipated expiration: 2024-09-30
Also published as: JP2006106859A

Description

本発明は、サーバ負荷分散装置と複数のサーバとが組込まれた情報処理システムに係わり、特に、この情報処理システムにおける負荷状態に応じて、サーバ（資源）の動作（割当数）を制御する資源割当装置及び資源割当方法に関する。 The present invention relates to an information processing system in which a server load balancer and a plurality of servers are incorporated, and in particular, a resource for controlling the operation (allocation number) of a server (resource) according to a load state in the information processing system. The present invention relates to an allocation device and a resource allocation method .

例えば、図５に示すように、企業のネットワーク（インターネット）１上に公開されたＷｅｂサイトに採用される情報処理システム２においては、多数のクライアント（顧客消費者）３からの同一ＩＰアドレス（ＵＲＬ）を指定したアクセス要求（処理要求）に短時間で応答するために、１台のサーバ負荷分散装置（ロードバランサー）４と複数台のサーバ５とが組込まれている。各サーバ５には、クライアント（顧客消費者）３からの例えばホームページ閲覧要求等の処理要求を実現するためのＷｅｂ処理部６が組込まれている。 For example, as shown in FIG. 5, in an information processing system 2 adopted for a website published on a corporate network (Internet) 1, the same IP address (URL) from a number of clients (customer consumers) 3 is used. In order to respond in a short time to an access request (processing request) that designates (), one server load balancer (load balancer) 4 and a plurality of servers 5 are incorporated. Each server 5 incorporates a Web processing unit 6 for realizing a processing request such as a homepage browsing request from the client (customer consumer) 3.

サーバ負荷分散装置４は、各クライアント３からネットワーク（インターネット）１を介して入力された同一ＩＰアドレス（ＵＲＬ）を指定した複数のアクセス要求（処理要求）を各サーバ５に割り振る。処理要求が割り振られたサーバ５は、自己のＷｅｂ処理部６で処理要求が指定する例えばホームページ閲覧に応じるホームページ送信等の処理を実行する。 The server load balancer 4 allocates a plurality of access requests (processing requests) specifying the same IP address (URL) input from each client 3 via the network (Internet) 1 to each server 5. The server 5 to which the processing request is assigned executes processing such as homepage transmission corresponding to homepage browsing specified by the processing request in its own Web processing unit 6.

なお、各サーバ５は、同時に複数の処理を実施可能である。同時にＭ個の処理が可能であれば、ｎ個のサーバ５が情報処理システム２に組込まれていた場合は、この情報処理システム２で同時に（Ｍ×ｎ）個のクライアント３からの処理要求に応じることが可能である。 Each server 5 can simultaneously perform a plurality of processes. If M servers can be processed at the same time, if n servers 5 are incorporated in the information processing system 2, processing requests from (M × n) clients 3 at the same time in the information processing system 2 It is possible to respond.

このような情報処理システム２においては、サーバ負荷分散装置４へ入力される処理要求数は、一定でなく、時間、曜日に応じて大きく変動する。したがって、常時、全部のサーバ５を動作状態にすることはなく、その時点の処理要求数Ａに応じて運転するサーバ数Ｅを１台分余裕を見越して次のように決めておく。 In such an information processing system 2, the number of processing requests input to the server load distribution device 4 is not constant and varies greatly according to time and day of the week. Accordingly, all the servers 5 are not always in an operating state, and the number of servers E to be operated is determined as follows in consideration of the margin for one unit according to the processing request number A at that time.

Ｅ＝（Ａ／Ｍ）＋１
このサーバ数Ｅ分のサーバ５のみを運転（動作）状態にする。そして、サーバ負荷分散装置４は、各クライアント３からネットワーク１を介して入力された各処理要求を動作中の各サーバ５に割り振る。 E = (A / M) +1
Only the servers 5 corresponding to the number of servers E are set in the operation (operation) state. Then, the server load distribution device 4 allocates each processing request input from each client 3 via the network 1 to each operating server 5.

なお、特許文献１に、各負荷ユニット（サーバ）の負荷率を算出して、負荷率の小さい負荷ユニット（サーバ）に新規の負荷要求を送信する技術が開示されている。
特開２００１―７５９３４号公報 Patent Document 1 discloses a technique for calculating a load factor of each load unit (server) and transmitting a new load request to a load unit (server) having a small load factor.
Japanese Patent Laid-Open No. 2001-75934

しかしながら、上述した手法でその時点の処理要求数Ａに応じて運転するサーバ数Ｅを求める情報処理システム２においても、まだ解消すべき次のような課題があった。 However, the information processing system 2 that obtains the number E of servers to be operated according to the number A of processing requests at that time by the above-described method still has the following problems to be solved.

すなわち、図６に示すように、各サーバ５内に、上述した例えばクライアント３からのホームページ閲覧要求等の処理要求を実現するためのＷｅｂ処理部６の他に、クライアント３からのメール応答要求等の処理要求を実現するためメール処理部７が設けられている場合を想定する。 That is, as shown in FIG. 6, in each server 5, in addition to the Web processing unit 6 for realizing the processing request such as the homepage browsing request from the client 3 described above, a mail response request from the client 3, etc. Assume that the mail processing unit 7 is provided to realize the processing request.

この場合、各サーバ５にはＷｅｂ処理要求とメール処理要求との２種別の処理要求が入力される。各サーバ５は、同時にＭ個の処理を実施可能である。しかし、一つのメール処理を実行する場合のＣＰＵの処理負担は、一つのＷｅｂ処理を実行する場合のＣＰＵの処理負担に比較して格段に大きく、その結果、処理時間も長くなる。 In this case, each server 5 receives two types of processing requests, a Web processing request and a mail processing request. Each server 5 can perform M processes simultaneously. However, the processing burden on the CPU when executing one mail process is much larger than the processing burden on the CPU when executing one Web process, and as a result, the processing time becomes longer.

したがって、サーバ５に入力されたＭ個の処理要求が全部メール処理要求である場合のサーバ５の処理負担は、サーバ５に入力されたＭ個の処理要求が全部Ｗｅｂ処理要求である場合のサーバ５の処理負担に比較して格段に大きくなる。その結果、該当サーバ５の全体の処理時間が他のサーバ５に比較して長くなる。この場合、メール処理要求が多いサーバ５に入力されたＷｅｂ処理要求の処理も遅れることになる。 Therefore, the processing load of the server 5 when the M processing requests input to the server 5 are all mail processing requests is the server load when the M processing requests input to the server 5 are all Web processing requests. Compared with the processing burden of 5, it becomes much larger. As a result, the entire processing time of the corresponding server 5 becomes longer than that of the other servers 5. In this case, the processing of the Web processing request input to the server 5 with many mail processing requests is also delayed.

このことは、各クライアント３からネットワーク１を介して情報処理システム２に入力された各処理要求の処理所要時間に大きな差が生じる問題がある。また、サーバ５間で処理負担の大きな不均一が生じる。 This has a problem that a large difference occurs in the processing time required for each processing request input from each client 3 to the information processing system 2 via the network 1. In addition, a large non-uniform processing load occurs between the servers 5.

本発明はこのような事情に鑑みてなされたものであり、たとえ外部から入力される処理要求が処理負担の異なる複数種別の処理要求であったとしても、かつ外部から入力される処理要求数が時間的に変動したとしても、各サーバの処理負担をほぼ均一に維持でき、外部から入力される処理要求が指定する処理を時間遅れなく正常に実行できる資源割当装置及び資源割当方法を提供することを目的とする。 The present invention has been made in view of such circumstances, and even if the processing requests input from the outside are a plurality of types of processing requests having different processing loads, the number of processing requests input from the outside is not limited. To provide a resource allocation device and a resource allocation method that can maintain the processing load of each server almost evenly even if it fluctuates in time, and that can normally execute processing specified by a processing request input from the outside without time delay With the goal.

本発明は、複数のサーバと、クライアントから入力された複数かつ複数種別の処理要求を動作状態のサーバに割り振るサーバ負荷分散装置とを持つ情報処理システムにおける資源割当装置である。 The present invention is a resource allocation apparatus in the information processing system having a plurality of servers, a server load balancer that allocate processing requests of a plurality and plurality of types of input from a client to a server operating state.

そして、上記課題を解消するために、本発明の資源割当装置は、処理要求の種別毎に当該種別の処理を実行するに要する時間に対応した補正値を記憶する補正値記憶手段と、クライアントから入力される種別毎の単位時間当たりの処理要求数を算出する処理要求数算出手段と、処理要求数算出手段で算出した種別毎の単位時間当たりの処理要求数に前記補正値記憶手段に記憶された該当種別毎の補正値を乗算して種別毎の単位時間当たりの処理要求数を補正する処理要求数補正手段と、この処理要求数補正手段で補正された種別毎の単位時間当たりの補正済処理要求数を加算して、クライアントから入力された処理要求に対する補正済合計処理要求数を求める処理要求加算手段と、加算した補正済合計処理要求数をサーバの同時処理可能な処理要求数を示す最大処理可能数で除算して、この除算値に余裕台数を加算した数をクライアントから入力された処理要求を処理するために必要なサーバ数として求めるサーバ数算出手段と、このサーバ数算出手段で算出された必要サーバ数に基づいて動作状態のサーバ数を増減するサーバ制御手段とを備えている。 In order to solve the above-described problem, the resource allocation device of the present invention includes a correction value storage unit that stores a correction value corresponding to the time required to execute the type of processing for each type of processing request, and a client. The processing request number calculating means for calculating the number of processing requests per unit time for each type inputted, and the processing request number per unit time for each type calculated by the processing request number calculating means are stored in the correction value storage means. It was appropriate as the processing request count correcting means for classification for each of multiplying the correction value for correcting the number of processing requests per unit of each type time, corrected per unit time per corrected type in this processing request number correction means by adding the number of processing requests, a processing request adding means for obtaining the number of corrected total processing required for the input processing request from the client, that can be processed simultaneously in the addition number was corrected total processing requesting server punished Is divided by the maximum processable number indicating the number of requests, a server number calculating means for determining a number of servers necessary to process the input processing request number obtained by adding a margin number to the quotient from the client, the server Server control means for increasing or decreasing the number of servers in the operating state based on the required number of servers calculated by the number calculation means.

このように構成された情報処理システムにおける資源割当装置においては、外部から入力される種別毎の処理要求数が算出され、算出された種別毎の処理要求数は、種別毎の処理時間に対応する補正値で補正される。したがって、補正後の種別毎の処理要求数は処理に必要な時間に対応する（比例する）処理要求数となる。よって、補正後の種別毎の処理要求数の加算値から必要なサーバ数を正確に求めることが可能となる。 In the resource allocation device in the information processing system configured as described above, the number of processing requests for each type input from the outside is calculated, and the calculated number of processing requests for each type corresponds to the processing time for each type. It is corrected with the correction value. Therefore, the number of processing requests for each type after correction is the number of processing requests corresponding to (proportional to) the time required for processing. Therefore, the required number of servers can be accurately obtained from the added value of the number of processing requests for each type after correction.

その結果、たとえ外部からに入力される処理要求が処理負担の異なる複数種別の処理要求であったとしても、かつ外部から入力される処理要求数が変動したとしても、各サーバの処理負担をほぼ均一に維持できる。 As a result, even if the processing request input from the outside is a plurality of types of processing requests with different processing loads, and the number of processing requests input from the outside fluctuates, the processing load of each server is almost the same. It can be maintained uniformly.

また、別の発明は、複数のサーバと、クライアントから入力された複数かつ複数種別の処理要求を動作状態のサーバに割り振るサーバ負荷分散装置とを持つ情報処理システムにおける資源割当方法である。 Another invention includes a plurality of servers, a resource allocation method in an information processing system having a server load balancer that allocate the server process requests the operating status of the plurality and plurality of types input from the client.

そして、上記課題を解消するために、本発明の資源割当方法は、クライアントから入力される種別毎の単位時間当たりの処理要求数を算出する処理要求数算出ステップと、処理要求数算出ステップで算出した種別毎の単位時間当たりの処理要求数に、該当種別の処理要求の処理時間に対応した補正値を乗算して種別毎の単位時間当たりの処理要求数を補正する処理要求数補正ステップと、この処理要求数補正ステップで補正された種別毎の単位時間当たりの補正済処理要求数を加算して、クライアントから入力された処理要求に対する補正済合計処理要求を求める処理要求加算ステップと、加算した補正済合計処理要求数をサーバの同時処理可能な処理要求数を示す最大処理可能数で除算して、この除算値に余裕台数を加算した数をクライアントから入力された処理要求を処理するために必要なサーバ数として求めるサーバ数算出ステップと、このサーバ数算出ステップで算出された必要サーバ数に基づいて動作状態のサーバ数を増減するサーバ制御ステップとを備えている。 In order to solve the above problem, the resource allocation method according to the present invention is calculated by a processing request number calculating step for calculating a processing request number per unit time for each type input from a client, and a processing request number calculating step. A processing request number correction step of correcting the processing request number per unit time for each type by multiplying the processing request number per unit time for each type by a correction value corresponding to the processing time of the processing request of the corresponding type ; A processing request addition step for obtaining a corrected total processing request for the processing request input from the client by adding the corrected processing request number per unit time for each type corrected in the processing request number correction step, and adding Divide the corrected total processing request count by the maximum processing count that indicates the number of processing requests that can be processed simultaneously by the server, and add the surplus number to the division value. Server control step of increasing or decreasing the number of servers calculating step of obtaining a number of servers required, the number of servers operating state based on the required number of servers that are calculated by the server number calculating step for processing the processing request inputted from bets And.

このように構成された情報処理システムにおける資源割当方法においては、上述した情報処理システムにおける資源割当装置とほぼ同じ作用効果を奏することが可能である。 In the resource allocation method in the information processing system configured as described above, it is possible to achieve substantially the same operational effects as the resource allocation device in the information processing system described above.

本発明は、外部から入力される種別毎の処理要求数を算出し、算出した種別毎の処理要求数を種別毎の処理時間に対応する補正値で補正し、補正後の種別毎の処理要求数の加算値から必要なサーバ数を求めている。 The present invention calculates the number of processing requests for each type input from the outside, corrects the calculated number of processing requests for each type with a correction value corresponding to the processing time for each type, and processes the processing request for each type after correction. The required number of servers is obtained from the sum of the numbers.

したがって、たとえ外部から入力される処理要求が処理負担の異なる複数種別の処理要求であったとしても、かつ外部から入力される処理要求数が時間的に変動したとしても、各サーバの処理負担をほぼ均一に維持でき、外部から入力される処理要求が指定する処理を時間遅れなく正常に実行できる。 Therefore, even if processing requests input from the outside are multiple types of processing requests with different processing loads, and even if the number of processing requests input from outside fluctuates over time, the processing load on each server is reduced. The process can be maintained almost uniformly, and the process specified by the process request input from the outside can be executed normally without time delay.

以下、本発明の一実施形態を図面を用いて説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

図１は、本発明の一実施形態の資源割当方法が採用された資源割当装置が組込まれた情報処理システムの概略構成を示す模式図である。図５、図６に示す従来の情報処理システム２と同一部分には同一符号を付して重複する部分の重複説明を省略する。 FIG. 1 is a schematic diagram showing a schematic configuration of an information processing system in which a resource allocation device employing a resource allocation method according to an embodiment of the present invention is incorporated. The same parts as those of the conventional information processing system 2 shown in FIG. 5 and FIG.

例えば、企業のネットワーク（インターネット）１上に公開されたＷｅｂサイトに採用される情報処理システム１１には、多数のクライアント（顧客消費者）３からの各アクセス要求（処理要求）に短時間で応答するために、１台のサーバ負荷分散装置（ロードバランサー）１２と、ｎ台のサーバ５と、１台の資源割当装置１３とが組込まれている。 For example, the information processing system 11 employed in a Web site published on the corporate network (Internet) 1 responds to each access request (processing request) from a large number of clients (customer consumers) 3 in a short time. For this purpose, one server load balancer (load balancer) 12, n servers 5, and one resource allocation device 13 are incorporated.

各サーバ５には、クライアント（顧客消費者）３からの例えばホームページ閲覧要求等の処理要求を実現するためのＷｅｂ処理部６と、クライアント（顧客消費者）３からのメール応答要求等の処理要求を実現するためメール処理部７との２種別の処理部が組込まれている。メール処理部７で一つのメール処理を実行する場合のＣＰＵの処理負担、すなわち処理時間は、例えば、Ｗｅｂ処理部６で一つのＷｅｂ処理を実行する場合の処理時間に対して２倍を必要とする。各サーバ５は同時にＭ個の処理を実施可能である。なお、各サーバ５は、資源割当装置１３でＯＮ／ＯＦＦ制御される。 Each server 5 includes a Web processing unit 6 for realizing a processing request such as a homepage browsing request from the client (customer consumer) 3 and a processing request such as a mail response request from the client (customer consumer) 3. In order to realize the above, two types of processing units with the mail processing unit 7 are incorporated. The CPU processing load, that is, the processing time when the mail processing unit 7 executes one mail process, for example, needs to be twice as long as the processing time when the Web processing unit 6 executes one Web process. To do. Each server 5 can perform M processes simultaneously. Each server 5 is ON / OFF controlled by the resource allocation device 13.

したがって、各クライアント３から情報処理システム１１へ入力される処理要求には、Ｗｅｂ処理要求（種別Ｗ₁）とメール処理要求（種別Ｗ₂）との２種別ある。 Accordingly, there are two types of processing requests input from each client 3 to the information processing system 11, that is, a web processing request (type W ₁ ) and a mail processing request (type W ₂ ).

サーバ負荷分散装置（ロードバランサー）１２には、サーバ処理容量メモリ１４、サーバ状態メモリ１５、サーバ負荷状況メモリ１６、処理要求受信部１７、処理種別判定部１８、処理要求分配部１９が設けられている。 The server load balancer (load balancer) 12 includes a server processing capacity memory 14, a server status memory 15, a server load status memory 16, a processing request reception unit 17, a processing type determination unit 18, and a processing request distribution unit 19. .

サーバ処理容量メモリ１４内には、図２（ａ）に示すように、各サーバ５における同時に処理可能な処理要求数を示す最大処理可能数Ｍ（この実施形態においては、Ｍ＝５０）と、各処理種別Ｗ₁、Ｗ₂の単独処理可能数Ｎ₁、Ｎ₂（この実施形態においては、Ｎ₁＝５０、Ｎ₂＝２５）とが記憶されている。この各単独処理可能数Ｎ₁、Ｎ₂は、サーバ５において、対応する種別Ｗ₁、Ｗ₂の処理のみを実行した場合において、その処理時間が基準時間内に終了する処理要求数である。 In the server processing capacity memory 14, as shown in FIG. 2A, a maximum processable number M (in this embodiment, M = 50) indicating the number of processing requests that can be processed simultaneously in each server 5, The number of independently processable N ₁ and N ₂ (in this embodiment, N ₁ = 50, N ₂ = 25) for each processing type W ₁ and W ₂ is stored. Each of these independently processable numbers N ₁ and N ₂ is the number of processing requests whose processing time ends within the reference time when only the processing of the corresponding types W ₁ and W ₂ is executed in the server 5.

サーバ状態メモリ１５内には、図２（ｂ）に示すように、Ｎｏ．１〜Ｎｏ．ｎのｎ台の各サーバ５の現在時点における動作中（ＯＮ）又は停止中（ＯＦＦ）であるかの状態が記憶されている。 In the server status memory 15, as shown in FIG. 1-No. The state of whether each of the n servers 5 is operating (ON) or stopped (OFF) at the current time is stored.

サーバ負荷状況メモリ１６内には、ｎ台の各サーバ５の現在時点における処理中の処理要求の各種別Ｗ₁、Ｗ₂毎の処理要求数Ｘ₁、Ｘ₂が記憶されている。 The server load status memory 16 stores the number of processing requests X ₁ and X ₂ for each type W ₁ and W ₂ of processing requests being processed at the current time of each of the n servers 5.

処理要求受信部１７は、各クライアント３からネットワーク１を介して入力された複数のアクセス要求（処理要求）を受信し、次の処理種別判定部１８へ送出する。処理種別判定部１８は、入力された各処理要求の種別Ｗ₁、Ｗ₂を判定して、各処理要求に種別Ｗ₁、Ｗ₂を付して、次の処理要求分配部１９へ送出する。 The processing request reception unit 17 receives a plurality of access requests (processing requests) input from each client 3 via the network 1 and sends them to the next processing type determination unit 18. The process type determination unit 18 determines the types W ₁ and W ₂ of each input process request, attaches the types W ₁ and W ₂ to each process request, and sends them to the next process request distribution unit 19. .

処理要求分配部１９は、順次入力される各種別Ｗ₁、Ｗ₂の処理要求を、サーバ状態メモリ１５内に設定された動作中の各サーバ５に対して例えばラウンドロビン方式で順番に割り振って（送信して）いく。そして、割り振り（送信）結果をサーバ負荷状況メモリ１６内に登録（書込む）する。 The processing request distribution unit 19 assigns the processing requests of various types W ₁ and W ₂ that are sequentially input to the active servers 5 set in the server state memory 15 in order, for example, in a round robin manner. (Send it). Then, the allocation (transmission) result is registered (written) in the server load status memory 16.

なお、サーバ負荷状況メモリ１６内の割り振り先のサーバ５の割り振り済みの、処理時間で補正した処理要求数Ｓ＝Ｘ₁＋２Ｘ₂が最大処理可能数Ｍを超える場合は、このサーバ５に対する新規処理要求の割り振り（送信）を実施せずに、次のサーバ５に新規処理要求の割り振り（送信）を行う。さらに、処理要求分配部１９は、サーバ５にて情報処理が終了した処理要求をサーバ負荷状況メモリ１６内から削除する。 If the processing request number S = X ₁ + 2X ₂ already allocated by the allocation destination server 5 in the server load status memory 16 exceeds the maximum processable number M, new processing for this server 5 is performed. A new processing request is allocated (transmitted) to the next server 5 without allocating (transmitting) the request. Further, the processing request distribution unit 19 deletes the processing request whose information processing has been completed by the server 5 from the server load status memory 16.

図３は、資源割当装置１３の概略構成を示すブロック図である。
処理要求数算出部２０は、サーバ負荷分散装置１２の処理種別判定部１８から出力されたＷｅｂ処理要求（種別Ｗ₁）の単位時間当たりの処理要求数ｃ₁を算出して、次の処理要求数補正部２１へ送出する。一方、処理要求数算出部２２は、サーバ負荷分散装置１２の処理種別判定部１８から出力されたメール処理要求（種別Ｗ₂）の単位時間当たりの処理要求数ｃ₂を算出して、次の処理要求数補正部２３へ送出する。 FIG. 3 is a block diagram showing a schematic configuration of the resource allocation device 13.
The processing request number calculation unit 20 calculates the number of processing requests c ₁ per unit time of the Web processing request (type W ₁ ) output from the processing type determination unit 18 of the server load distribution device 12 and determines the next processing request. It is sent to the number correction unit 21. On the other hand, the processing request number calculation unit 22 calculates the processing request number c ₂ per unit time of the mail processing request (type W ₂ ) output from the processing type determination unit 18 of the server load balancer 12, and This is sent to the processing request number correction unit 23.

補正値メモリ２４内には、処理要求の前述した種別Ｗ₁、Ｗ₂毎に各処理要求の処理を実行するに要する時間に対応した補正値Ｂ₁、Ｂ₂が記憶されている。この実施形態においては、メール処理要求（種別Ｗ₂）の処理時間はＷｅｂ処理要求（種別Ｗ₁）の処理時間の２倍であるので、各補正値は、Ｂ₁＝１、Ｂ₂＝２に設定されている。 In the correction value memory 24, correction values B ₁ and B ₂ corresponding to the time required to execute the processing of each processing request are stored for each of the above-described types W ₁ and W ₂ of processing requests. In this embodiment, since the processing time of the mail processing request (type W ₂ ) is _twice the processing time of the Web processing request (type W ₁ ), the correction values are B ₁ = 1 and B ₂ = 2. Is set to

処理要求数補正部２１は、処理要求数ｃ₁に補正値Ｂ₁＝１を乗算して、Ｗｅｂ処理要求（種別Ｗ₁）の処理要求数を補正して、補正後の処理要求数ｃ_1Aとして、有効サーバ数算出部２５へ送出する。同様に、処理要求数補正部２３は、処理要求数ｃ₂に補正値Ｂ₂＝２を乗算して、メール処理要求（種別Ｗ₂）の処理要求数を補正して、補正後の処理要求数ｃ_2Aとして、有効サーバ数算出部２５へ送出する。 The processing request number correction unit 21 corrects the processing request number of the Web processing request (type W ₁ ) by multiplying the processing request number c ₁ by the correction value B ₁ = 1, and the corrected processing request number c _1A. Is sent to the effective server number calculation unit 25. Similarly, the processing request number correction unit 23 multiplies the processing request number c ₂ by the correction value B ₂ = 2 to correct the processing request number of the mail processing request (type W ₂ ), and the corrected processing request. The number c _2A is sent to the effective server number calculation unit 25.

ｃ_1A＝ｃ₁×Ｂ₁＝ｃ₁×１＝ｃ₁
ｃ_2A＝ｃ₂×Ｂ₂＝ｃ₂×２＝２ｃ₂
有効サーバ数算出式メモリ２６内には、前述した最大処理可能数Ｍ、及び補正後の処理要求数ｃ_1A、ｃ_2Aを用いて、１台分余裕を見越した必要なサーバ数Ｄ₁を算出する算出式が記憶されている。 c _1A = c ₁ × B ₁ = c ₁ × 1 = c ₁
c _2A = c ₂ × B ₂ = c ₂ × 2 = 2c ₂
Valid server number calculation formula memory 26, calculates the maximum processable number described above M, and the processing request number c _1A after correction, using c _2A, the server number D ₁ required in anticipation of one minute margin The calculation formula to be stored is stored.

Ｄ₁＝［（ｃ_1A＋ｃ_2A）／Ｍ］＋１
有効サーバ数算出部２５は、補正後の処理要求数ｃ_1A、ｃ_2A分の処理要求を実現させるために必要なサーバ数Ｄ₁を前述した算出式を用いて算出して、比較部２７へ送出する。 D ₁ = [(c _1A + c _2A ) / M] +1
The effective server number calculation unit 25 calculates the server number D ₁ necessary for realizing the corrected processing request numbers c _1A and c _2A for the processing request using the above-described calculation formula, and sends the calculation request to the comparison unit 27. Send it out.

サーバ状態読取部２８は、サーバ負荷分散装置１２のサーバ状態メモリ１５内の動作中のサーバ数Ｄ₀を読取って比較部２７へ送出する。さらに、サーバ状態読取部２８は、各サーバ５の現在時点における動作中（ＯＮ）又は停止中（ＯＦＦ）の状態を読取って状態変更サーバ決定部３０へ送出する。 The server state reading unit 28 reads the number of active servers D ₀ in the server state memory 15 of the server load distribution device 12 and sends it to the comparison unit 27. Furthermore, the server status reading unit 28 reads the operating (ON) or stopped (OFF) status of each server 5 at the current time point and sends it to the status change server determining unit 30.

比較部２７は、算出した必要なサーバ数Ｄ₁と現在時点で動作中のサーバ数Ｄ₀とを比較して、不一致の場合のみ、サーバ変更数算出部２９を起動する。サーバ変更数算出部２９は、必要なサーバ数Ｄ₁と動作中のサーバ数Ｄ₀との差で示されるサーバ変更数（Ｄ₁ーＤ₀）を算出して、状態変更サーバ決定部３０へ送出する。 The comparison unit 27 compares the calculated necessary number of servers D ₁ with the number of servers D ₀ currently operating, and activates the server change number calculation unit 29 only when there is a mismatch. The server change number calculation unit 29 calculates the server change number (D ₁ -D ₀ ) indicated by the difference between the required server number D ₁ and the active server number D _0, and sends it to the state change server determination unit 30. Send it out.

状態変更サーバ決定部３０は、各サーバ５の現在時点における動作中（ＯＮ）又は停止中（ＯＦＦ）の状態に基づいて、新たに起動又は停止するサーバ５を例えばサーバ番号の若い順に決定して、サーバ状態変更部３１を介してサーバ負荷分散装置１２のサーバ状態メモリ１５内の該当サーバ５の動作中（ＯＮ）又は停止中（ＯＦＦ）の状態を変更する。 The state change server determination unit 30 determines, for example, the servers 5 to be newly started or stopped in ascending order of the server numbers based on the operating (ON) or stopped (OFF) state of each server 5 at the current time point. The server status change unit 31 changes the operating (ON) or stopped (OFF) status of the server 5 in the server status memory 15 of the server load distribution device 12.

さらに、状態変更サーバ決定部３０は、決定した起動又は停止するサーバ５を、サーバ起動停止制御部３２を介して実際に起動又は停止する。 Further, the state change server determination unit 30 actually starts or stops the determined server 5 to be started or stopped via the server start / stop control unit 32.

図４は、このように構成された資源割当装置１３の全体動作を示す流れ図である。制御周期で定まる規定時間Ｔ_Sが経過すると（ステップＳ１）、各クライアント３から情報処理システム１１へ入力されるＷｅｂ処理要求（種別Ｗ₁）とメール処理要求（種別Ｗ₂）との単位時間当たりの処理要求数ｃ₁、ｃ₂を読取る（Ｓ２）。読取った各処理要求数ｃ₁、ｃ₂を、一律条件とするために、補正値Ｂ₁、Ｂ₂を乗算して補正する（Ｓ３）。 FIG. 4 is a flowchart showing the overall operation of the resource allocation device 13 configured as described above. When the specified time T _S determined by the control cycle elapses (step S1), per unit time of the Web processing request (type W ₁ ) and the mail processing request (type W ₂ ) input from each client 3 to the information processing system 11 The processing request numbers c ₁ and c ₂ are read (S2). In order to make the read processing request counts c ₁ and c ₂ uniform conditions, the correction values B ₁ and B ₂ are multiplied and corrected (S3).

補正済みの処理要求数ｃ_1A、ｃ_2Aから、１台分余裕を見越した必要なサーバ数Ｄ₁を算出式Ｄ₁＝［（ｃ_1A＋ｃ_2A）／Ｍ］＋１を用いて算出する（Ｓ４）。現在時点で動作中のサーバ数Ｄ₀を読取る（Ｓ５）。算出した必要なサーバ数Ｄ₁と現在時点で動作中のサーバ数Ｄ₀とを比較して（Ｓ６）、一致する場合は（Ｓ７）、動作中のサーバ数を変更する必要ないので、何もせずに、そのままＳ１へ戻り、次の制御周期まで待つ。 From the corrected number of processing requests c _1A and c _2A , the necessary number of servers D ₁ in anticipation of a margin for _one is calculated using the calculation formula D ₁ = [(c _1A + c _2A ) / M] +1 (S4 ). The number of servers D ₀ currently operating is read (S5). Compared calculated and server number D ₁ required a number of servers D ₀ in operation at the present time (S6), if there is a match (S7), since it is not necessary to change the number of servers during operation, doing anything Instead, the process returns to S1 and waits until the next control cycle.

算出した必要なサーバ数Ｄ₁と現在時点で動作中のサーバ数Ｄ₀とが不一致の場合のみ（Ｓ７）、サーバ変更数（Ｄ₁ーＤ₀）を算出する（Ｓ８）。そして、起動又は停止すべきサーバ５を決定する（Ｓ９）。サーバ負荷分散装置１２のサーバ状態メモリ１５内の該当サーバ５の動作中（ＯＮ）又は停止中（ＯＦＦ）の状態を変更する（Ｓ１０）。最後に、決定した起動又は停止すべきサーバ５を、サーバ起動停止制御部３２を介して実際に起動又は停止する（Ｓ１１）。以上の処理が終了すると、Ｓ１へ戻り、次の制御周期まで待つ。 Calculated with the server number D ₁ and required number of servers D ₀ in operation at the present time is not matched only (S7), and calculates the server change number (D ₁ over D ₀₎ (S8). Then, the server 5 to be started or stopped is determined (S9). The state of operation (ON) or stop (OFF) of the server 5 in the server state memory 15 of the server load balancer 12 is changed (S10). Finally, the determined server 5 to be started or stopped is actually started or stopped via the server start / stop control unit 32 (S11). When the above process ends, the process returns to S1 and waits for the next control cycle.

このように構成された情報処理システムにおける資源割当装置及び資源割当方法においては、各クライアント３からネットワーク１を介して情報処理システム１１へ入力されるＷｅｂ処理要求（種別Ｗ₁）とメール処理要求（種別Ｗ₂）との単位時間当たりの処理要求数ｃ₁、ｃ₂が算出され、算出された種別Ｗ₁、Ｗ₂毎の処理要求数ｃ₁、ｃ₂は、種別毎の処理時間に対応する補正値Ｂ₁、Ｂ₂で補正される。したがって、補正後の種別毎の処理要求数ｃ_A1、ｃ_A2は処理に必要な時間に対応する（比例する）処理要求数となる。よって、補正後の種別毎の処理要求数の加算値（ｃ_A1＋ｃ_A2）から必要なサーバ数Ｄ₁＝［（ｃ_1A＋ｃ_2A）／Ｍ］＋１を正確に求めることが可能となる。 In the resource allocation device and the resource allocation method in the information processing system configured as described above, a Web processing request (type W ₁ ) and a mail processing request (type W ₁ ) input from each client 3 to the information processing system 11 via the network 1 ( The number of processing requests c ₁ and c ₂ per unit time with the type W ₂ ) is calculated, and the calculated number of processing requests c ₁ and c ₂ for each type W ₁ and W ₂ corresponds to the processing time for each type. The correction values B ₁ and B ₂ are corrected. Accordingly, the number of processing requests c _A1 and c _{A2 for} each type after correction is the number of processing requests corresponding to (proportional to) the time required for processing. Therefore, the required number of servers D ₁ = [(c _1A + c _2A ) / M] +1 can be accurately obtained from the added value (c _A1 + c _A2 ) of the number of processing requests for each type after correction.

その結果、たとえ各クライアント３からネットワーク１を介して情報処理システム１１へ入力される処理負担の異なるＷｅｂ処理要求（種別Ｗ₁）とメール処理要求（種別Ｗ₂）との単位時間当たりの処理要求数ｃ₁、ｃ₂の比が時間的に変動したとしても、各サーバ５の処理負担をほぼ均一に維持できる。よって、各クライアント３からネットワーク１を介して情報処理システム１１へ入力される処理要求が指定する処理を時間遅れなく正常に実行できる。 As a result, processing requests per unit time of Web processing requests (type W ₁ ) and mail processing requests (type W ₂ ) with different processing loads input from each client 3 to the information processing system 11 via the network 1. Even if the ratio between the numbers c ₁ and c ₂ fluctuates with time, the processing load on each server 5 can be maintained substantially uniformly. Therefore, the processing specified by the processing request input from each client 3 to the information processing system 11 via the network 1 can be normally executed without a time delay.

なお、本発明は上述した実施形態に限定されるものではない。実施形態においては、各サーバ５には、Ｗｅｂ処理部６とメール処理部７との２種別Ｗ₁、Ｗ₂の処理要求に対応する２種類の処理部が設けられている場合を示したが、処理要求の種別数、各サーバ５の処理部数は、２に限定されるものではない。 In addition, this invention is not limited to embodiment mentioned above. In the embodiment, the case where each server 5 is provided with two types of processing units corresponding to the processing requests of the two types W ₁ and W ₂ of the Web processing unit 6 and the mail processing unit 7 is shown. The number of types of processing requests and the number of processing copies of each server 5 are not limited to two.

また、各サーバ５に設けられた各処理部もＷｅｂ処理部やメール処理部に限定されるものではない。 Also, each processing unit provided in each server 5 is not limited to a Web processing unit or a mail processing unit.

本発明の一実施形態の資源割当方法が採用された資源割当装置が組込まれた情報処理システムの概略構成を示す模式図The schematic diagram which shows schematic structure of the information processing system incorporating the resource allocation apparatus as which the resource allocation method of one Embodiment of this invention was employ | adopted. 同情報処理システム内に設けられたサーバ処理容量メモリ及びサーバ状態メモリの記憶内容を示す図The figure which shows the memory content of the server processing capacity memory and the server status memory provided in the information processing system 同実施形態の資源割当装置の概略構成を示すブロック図The block diagram which shows schematic structure of the resource allocation apparatus of the embodiment 同実施形態の資源割当装置の動作を示す流れ図A flowchart showing the operation of the resource allocation device of the same embodiment 従来の情報処理システムの概略構成を示す模式図Schematic diagram showing the schematic configuration of a conventional information processing system 同じく従来の情報処理システムの概略構成を示す模式図Similarly, a schematic diagram showing a schematic configuration of a conventional information processing system

Explanation of symbols

１…ネットワーク、３…クライアント、５…サーバ、６…Ｗｅｂ処理部、７…メール処理部、１１…情報処理システム、１２…サーバ負荷分散装置、１３…資源割当装置、１４…サーバ処理容量メモリ、１５…サーバ状態メモリ、１６…サーバ負荷状況メモリ、１７…処理要求受信部、１８…処理種別判定部、１９…処理要求分配部、２０，２２…処理要求数算出部、２１．２３…処理要求数補正部、２４…補正値メモリ、２５…有効サーバ数算出部、２６…有効サーバ数算出式メモリ、２７…比較部、２８…サーバ状態読取部、２９…サーバ変更数算出部、３０…状態変更サーバ決定部、３１…サーバ状態変更部、３２…サーバ起動停止制御部 DESCRIPTION OF SYMBOLS 1 ... Network, 3 ... Client, 5 ... Server, 6 ... Web processing part, 7 ... Mail processing part, 11 ... Information processing system, 12 ... Server load distribution apparatus, 13 ... Resource allocation apparatus, 14 ... Server processing capacity memory, DESCRIPTION OF SYMBOLS 15 ... Server state memory, 16 ... Server load condition memory, 17 ... Processing request receiving part, 18 ... Processing type determination part, 19 ... Processing request distribution part, 20, 22 ... Processing request number calculation part, 21.23 ... Processing request Number correction unit, 24 ... Correction value memory, 25 ... Effective server number calculation unit, 26 ... Effective server number calculation formula memory, 27 ... Comparison unit, 28 ... Server state reading unit, 29 ... Server change number calculation unit, 30 ... State Change server determination unit, 31 ... server state change unit, 32 ... server start / stop control unit

Claims

A plurality of servers, keep the information processing system having a server load balancer that allocate processing requests of a plurality and plurality of types of input from a client to a server operating state,
Correction value storage means for storing a correction value corresponding to the time required to execute the type of processing for each type of processing request;
Processing request number calculating means for calculating the number of processing requests per unit time for each type input from the client ;
The number of processing requests per unit time calculated by the processing request number calculating means is multiplied by the correction value for each corresponding type stored in the correction value storage means by the number of processing requests per unit time calculated by the processing request number calculating means. A processing request number correcting means for correcting;
Processing request addition means for adding the corrected number of processing requests per unit time for each type corrected by the processing request number correction means to obtain a corrected total processing request number for the processing requests input from the client ;
The number of corrected total processing requests added is divided by the maximum number of processing requests indicating the number of processing requests that can be processed simultaneously by the server, and the number obtained by adding the surplus number to the divided value is the processing request input from the client. Server number calculating means for obtaining the number of servers necessary for processing
A resource allocation device comprising server control means for increasing or decreasing the number of servers in an operating state based on the required number of servers calculated by the server number calculation means.

A plurality of servers, in the information processing system having a server load balancer that allocate processing requests of a plurality and plurality of types of input from a client to a server operating state,
A processing request number calculating step of calculating the number of processing requests per unit time for each type input from the client ;
The number of processing requests per unit time for each type calculated in the processing request number calculation step is multiplied by a correction value corresponding to the processing time of the processing request for the corresponding type to correct the number of processing requests per unit time for each type. A processing request number correction step to be performed;
A processing request addition step for adding the corrected processing request count per unit time for each type corrected in the processing request count correction step to obtain a corrected total processing request count for the processing request input from the client ;
The number of corrected total processing requests added is divided by the maximum number of processing requests indicating the number of processing requests that can be processed simultaneously by the server, and the number obtained by adding the surplus number to the divided value is the processing request input from the client. Calculating the number of servers as the number of servers required to process
A resource allocation method comprising: a server control step for increasing / decreasing the number of servers in an operating state based on the required number of servers calculated in the server number calculation step.