JP2018128956A

JP2018128956A - Load distribution device and load distribution method

Info

Publication number: JP2018128956A
Application number: JP2017023031A
Authority: JP
Inventors: 雄大北野; Yudai Kitano
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-02-10
Filing date: 2017-02-10
Publication date: 2018-08-16
Anticipated expiration: 2037-02-10
Also published as: JP6660322B2

Abstract

PROBLEM TO BE SOLVED: To provide a load distribution device and a load distribution method capable of reducing processing TAT of a request signal while keeping a processing load of a processing server constant.SOLUTION: A load distribution device 100 includes: an information storage section 130 storing a load limit value as a limit value for processing of a low delay server; a server information acquisition section 120 acquiring load information of the low delay server; a take-over signal number calculation section 111 calculating a take-over signal number S which is the number of signals which should be assigned to a high delay server but are actually assigned to the low delay server, in a range not exceeding the load limit value, on the basis of the number of high delay servers, the number of low delay servers, a maximum value of the acquired load information, and a processing server load fluctuation rate A indicating load fluctuation of the low delay server; and an assigning function section 110 performing assignment of signals of the calculated take-over signal number S from the high delay server to the low delay server.SELECTED DRAWING: Figure 1

Description

本発明は、データセンタの複数の処理サーバに対して、クライアントからリクエストされた情報処理を実行させるサーバを決定する負荷分散装置および負荷分散方法に関する。 The present invention relates to a load distribution apparatus and a load distribution method for determining a server that executes information processing requested by a client for a plurality of processing servers in a data center.

サーバ構成技術や仮想化技術の発達、激甚災害対策等により、地域や国をまたいでデータセンタが接続され、そのデータセンタ内に複数の処理サーバが存在するようになった。点在する処理サーバに対して負荷を分散するため、ロードバランサのような負荷分散装置を設置し、当該負荷分散装置を用いて要求信号（リクエスト）を処理サーバに振り分け、各処理サーバの負荷を分散させることが一般的である。
例えば、非特許文献１では、各処理サーバの負荷状態（ＣＰＵ使用率など）や応答時間を考慮して振分先の処理サーバを決定し、負荷分散を実現している。 Due to the development of server configuration technology, virtualization technology, and severe disaster countermeasures, data centers are connected across regions and countries, and there are multiple processing servers in the data center. In order to distribute the load to the scattered processing servers, a load distribution device such as a load balancer is installed, and the load distribution device is used to distribute request signals (requests) to the processing servers. It is common to disperse.
For example, in Non-Patent Document 1, load distribution is realized by determining a processing server as a distribution destination in consideration of the load state (CPU usage rate, etc.) and response time of each processing server.

負荷分散入門(ロードバランサ入門),［online］,［平成２９年１月２４日検索］,インターネット〈 URL : http://fenics.fujitsu.com/products/ipcom/catalog/data/1/3.html〉Introduction to Load Balancing (Introduction to Load Balancer), [online], [Search January 24, 2017], Internet <URL: http://fenics.fujitsu.com/products/ipcom/catalog/data/1/3. html> 北野雄大、岡本光浩 ”拠点間通信遅延を考慮した負荷分散方法” 2016年電子情報通信ソサイエティ大会、B-6-13Yuta Kitano, Mitsuhiro Okamoto “Load Balancing Method Considering Inter-Communication Communication Delay” 2016 Electronic Information Communication Society Conference, B-6-13

しかし、地域や国をまたいで信号を処理サーバに振り分けることによって、当該信号において、地域や国をまたぐ分の処理遅延が生じることになる。処理遅延が生じると、例えば通信制御サーバにおいては、後続信号が先行信号を追い抜いたりすることも考えられ問題となる。
一方で、処理遅延を防止するため、拠点内の処理サーバへ優先して信号を割り振ると、拠点内の処理サーバのみ負荷が重くなってしまい、ハードウェアリソースの非効率化や不要なスケーリングにつながってしまう。
非特許文献１に記載の負荷分散方法は、応答時間や処理サーバの負荷を考慮して処理サーバを決定している。しかし、全ての処理サーバの負荷状態や応答時間を収集する必要があるため、処理サーバが地域や国をまたいで存在していた場合は、情報を収集するための信号が増大しネットワーク帯域を圧迫する可能性がある。 However, when a signal is distributed to processing servers across regions and countries, a processing delay corresponding to the region and country is caused in the signal. When the processing delay occurs, for example, in the communication control server, it is considered that the subsequent signal may overtake the preceding signal.
On the other hand, in order to prevent processing delay, if a signal is assigned with priority to the processing server in the base, only the processing server in the base becomes heavy, leading to inefficient hardware resources and unnecessary scaling. End up.
The load distribution method described in Non-Patent Document 1 determines a processing server in consideration of response time and processing server load. However, since it is necessary to collect the load status and response time of all processing servers, if processing servers exist across regions and countries, the signal for collecting information increases and compresses the network bandwidth. there's a possibility that.

このような背景に鑑みて本発明がなされたのであり、本発明は、処理サーバの処理負荷を一定以下に保ちつつ、要求信号の処理ＴＡＴ（Turn Around Time）を低減することができる負荷分散装置および負荷分散方法を提供することを課題とする。 The present invention has been made in view of such a background, and the present invention provides a load distribution apparatus capable of reducing a processing signal TAT (Turn Around Time) of a request signal while keeping the processing load of the processing server below a certain level. It is another object of the present invention to provide a load balancing method.

前記した課題を解決するため、請求項１に記載の発明は、拠点内に複数の処理サーバが備えられ、クライアントからリクエストされた情報処理を実行させる前記処理サーバを決定する負荷分散装置であって、各前記処理サーバの処理ＴＡＴを測定する処理時間測定手段と、測定した前記処理ＴＡＴに基づいて、前記処理サーバを高遅延サーバと低遅延サーバとに分類する分類手段と、前記低遅延サーバの処理の限界値である負荷制限値を記憶する記憶手段と、前記低遅延サーバの負荷情報を取得する負荷情報取得手段と、前記高遅延サーバの台数と、前記低遅延サーバの台数と、取得した前記負荷情報の最大値と前記低遅延サーバの負荷変動を示す処理サーバの負荷変動率とに基づいて、前記負荷制限値を超えない範囲で、前記高遅延サーバに振り分けるべき信号を前記低遅延サーバに振り分ける肩代わり信号数を算出する信号数算出手段と、算出した前記肩代わり信号数分を、前記高遅延サーバから前記低遅延サーバに振り分ける振分手段と、を備える負荷分散装置とした。 In order to solve the above-described problem, the invention according to claim 1 is a load distribution apparatus that includes a plurality of processing servers in a base and determines the processing server that executes information processing requested by a client. A processing time measuring means for measuring the processing TAT of each of the processing servers, a classifying means for classifying the processing servers into a high delay server and a low delay server based on the measured processing TAT, A storage unit that stores a load limit value that is a processing limit value, a load information acquisition unit that acquires load information of the low-delay server, the number of high-delay servers, and the number of low-delay servers are acquired. Based on the maximum value of the load information and the load fluctuation rate of the processing server indicating the load fluctuation of the low-latency server, the high-delay server within a range not exceeding the load limit value A load comprising: a signal number calculating means for calculating the number of substitution signals to distribute signals to be distributed to the low delay server; and a distribution means for distributing the calculated number of substitution signals from the high delay server to the low delay server. A dispersion apparatus was used.

また、請求項７に記載の発明は、拠点内に複数の処理サーバが備えられ、クライアントからリクエストされた情報処理を実行させる前記処理サーバを決定する負荷分散装置の負荷分散方法であって、前記負荷分散装置は、各前記処理サーバの処理ＴＡＴを測定する処理時間測定ステップと、測定した前記処理ＴＡＴに基づいて、前記処理サーバを高遅延サーバと低遅延サーバとに分類する分類ステップと、前記低遅延サーバの処理の限界値である負荷制限値を記憶する記憶ステップと、前記低遅延サーバの負荷情報を取得する負荷情報取得ステップと、前記高遅延サーバの台数と、前記低遅延サーバの台数と、取得した前記負荷情報の最大値と前記低遅延サーバの負荷変動を示す処理サーバの負荷変動率とに基づいて、前記負荷制限値を超えない範囲で、前記高遅延サーバに振り分けるべき信号を前記低遅延サーバに振り分ける肩代わり信号数を算出する信号数算出ステップと、算出した前記肩代わり信号数分を、前記高遅延サーバから前記低遅延サーバに振り分ける振分ステップと、を実行する負荷分散方法とした。 The invention according to claim 7 is a load distribution method of a load distribution apparatus that includes a plurality of processing servers in a base and determines the processing server that executes information processing requested by a client. The load distribution apparatus includes a processing time measuring step for measuring a processing TAT of each of the processing servers, a classification step for classifying the processing servers into a high delay server and a low delay server based on the measured processing TAT, A storage step for storing a load limit value that is a limit value of processing of the low delay server, a load information acquisition step for acquiring load information of the low delay server, the number of the high delay servers, and the number of the low delay servers And exceeding the load limit value based on the acquired maximum value of the load information and the load fluctuation rate of the processing server indicating the load fluctuation of the low-latency server. A signal number calculating step for calculating the number of substitution signals to be distributed to the low delay server within a range, and the calculated number of substitution signals from the high delay server to the low delay server. The load distribution method executes the distribution step.

このようにすることで、同じ拠点内に低遅延サーバと高遅延サーバが混在する場合や、他拠点の処理サーバのＴＡＴが動的に変動する場合であっても、振り分ける信号数を適切に算出することができる。その結果、処理サーバの処理負荷を一定以下に保ちつつ、要求信号の処理ＴＡＴを低減することができる。 By doing this, even when low-latency servers and high-latency servers coexist in the same base, or when the TAT of processing servers at other bases changes dynamically, the number of signals to be distributed is calculated appropriately. can do. As a result, the request signal processing TAT can be reduced while keeping the processing load of the processing server below a certain level.

また、請求項２に記載の発明は、前記信号数算出手段は、前記低遅延サーバで処理した場合の処理ＴＡＴと、前記高遅延サーバで処理した場合の処理ＴＡＴとに基づいて、前記リクエストをあらかじめ決めた順序で各前記処理サーバに振り分ける場合の処理応答時間の変動を示す平均信号処理ＴＡＴ変動率を算出するとともに、前記低遅延サーバの負荷変動率と前記平均信号処理ＴＡＴ変動率とを基に、前記肩代わり信号数を評価するスコアを算出し、前記振分手段は、取得した前記負荷情報の最大値と前記低遅延サーバの負荷変動率とに基づいて、前記負荷制限値を超えない範囲の中から、最大の前記スコアとなる前記肩代わり信号数分を、前記高遅延サーバから前記低遅延サーバに振り分けることを特徴とする。 Further, the invention according to claim 2 is characterized in that the signal number calculation means receives the request based on a processing TAT when processed by the low-latency server and a processing TAT when processed by the high-latency server. An average signal processing TAT variation rate indicating a variation in processing response time when allocating to each processing server in a predetermined order is calculated, and a load variation rate of the low delay server and the average signal processing TAT variation rate are In addition, a score for evaluating the number of substitution signals is calculated, and the allocating means is a range that does not exceed the load limit value based on the acquired maximum value of the load information and the load fluctuation rate of the low-delay server. The number of substitution signals corresponding to the maximum score is distributed from the high delay server to the low delay server.

このようにすることで、処理サーバの負荷と応答時間の両面を考慮して、処理負荷を制限しつつ、応答時間をできるだけ早めることができる。換言すれば、低遅延サーバの処理負荷をギリギリまで上げてＴＡＴを減らすことができる。 In this way, it is possible to make the response time as fast as possible while limiting the processing load in consideration of both the load of the processing server and the response time. In other words, TAT can be reduced by increasing the processing load of the low-latency server to the limit.

また、請求項３に記載の発明は、前記負荷情報取得手段が、前記低遅延サーバのＣＰＵ使用率を負荷情報として取得することを特徴とする。 The invention according to claim 3 is characterized in that the load information acquisition means acquires the CPU usage rate of the low-latency server as load information.

このようにすることで、各処理サーバのＣＰＵ使用率は比較的取得が容易であるため、各処理サーバの処理負荷をタイムリーで取得でき、負荷分散動作に直ちに反映させることができる。 In this way, since the CPU usage rate of each processing server is relatively easy to acquire, the processing load of each processing server can be acquired in a timely manner and can be immediately reflected in the load distribution operation.

また、請求項４に記載の発明は、前記低遅延サーバで処理した場合の処理ＴＡＴを１としたとき、前記高遅延サーバで処理した場合の処理ＴＡＴは、前記高遅延サーバの処理ＴＡＴの平均を前記低遅延サーバの処理ＴＡＴの平均で割った値で近似されることを特徴とする。 In the invention according to claim 4, when the processing TAT when processed by the low delay server is 1, the processing TAT when processed by the high delay server is an average of the processing TAT of the high delay server. Is divided by the average of the processing TAT of the low-latency server.

このようにすることで、低遅延サーバで処理した場合の処理ＴＡＴと高遅延サーバで処理した場合の処理ＴＡＴとの通信遅延倍率から、１ラウンドにおける信号処理ＴＡＴ変動率を求めることができる。これにより、低遅延サーバの負荷変動率と、平均信号処理ＴＡＴ変動率とを用いて肩代わり信号を算出することができる。 In this way, the signal processing TAT fluctuation rate in one round can be obtained from the communication delay magnification between the processing TAT when processed by the low-delay server and the processing TAT when processed by the high-delay server. Thereby, the substitution signal can be calculated using the load fluctuation rate of the low-delay server and the average signal processing TAT fluctuation rate.

また、請求項５に記載の発明は、前記振分手段が、取得した前記負荷情報の最大値と前記低遅延サーバの負荷変動率とを乗算し、ＣＰＵ閾値を超えない前記肩代わり信号数の範囲を算出し、算出した前記肩代わり信号数の範囲の中で最大のスコアとなる前記肩代わり信号数の値で振り分けを行うことを特徴とする。 Further, the invention according to claim 5 is the range of the number of substitution signals that does not exceed a CPU threshold value by the distribution means multiplying the acquired maximum value of the load information by the load fluctuation rate of the low delay server. And the distribution is performed according to the value of the number of substitution signals that gives the maximum score in the calculated range of the number of substitution signals.

このようにすることで、算出した肩代わり信号数の範囲の中で最大のスコアとなる肩代わり信号数の値で振り分けを行うので、より信頼度の高い肩代わり信号数Ｓを得ることができる。その結果、より最適な肩代わり信号数Ｓ分だけ高遅延サーバから低遅延サーバに振り分けることができる。 In this way, since the distribution is performed based on the value of the shoulder signal number that is the maximum score in the calculated range of the shoulder signal number, the shoulder signal number S with higher reliability can be obtained. As a result, it is possible to distribute from the high-delay server to the low-delay server by the optimal number of substitution signals S.

また、請求項６に記載の発明は、前記分類手段が、各処理サーバを処理ＴＡＴの小さい順に低遅延サーバとして１台ずつ、全台数−１まで増加させていき、それぞれの場合の前記低遅延サーバの台数と前記高遅延サーバの台数とを決定し、前記信号数算出手段は、決定された前記低遅延サーバの台数と当該低遅延サーバの台数と、前記低遅延サーバで処理した場合の処理ＴＡＴと、前記高遅延サーバで処理した場合の処理ＴＡＴとに基づいて、前記リクエストをあらかじめ決めた順序で各前記処理サーバに振り分ける場合の処理応答時間の変動を示す平均信号処理ＴＡＴ変動率を算出するとともに、前記低遅延サーバの負荷変動率と前記平均信号処理ＴＡＴ変動率とを基に、前記肩代わり信号数を評価するスコアを算出し、前記振分手段は、取得した負荷情報の最大値と低遅延サーバの負荷変動率Ａを乗算し、ＣＰＵ閾値を超えない前記肩代わり信号数の範囲を算出し、算出した前記肩代わり信号数の範囲の中で最大の前記スコアとなる前記肩代わり信号数の値で振り分けを行うことを特徴とする。 Further, in the invention according to claim 6, the classification means increases each processing server as a low-delay server one by one in ascending order of processing TAT to the total number −1, and the low delay in each case The number of servers and the number of high-delay servers are determined, and the signal number calculation means determines the number of the low-delay servers, the number of the low-delay servers, and processing when processing is performed by the low-delay servers Based on the TAT and the processing TAT when processed by the high-delay server, an average signal processing TAT variation rate indicating a variation in processing response time when the request is distributed to each processing server in a predetermined order is calculated. And calculating a score for evaluating the number of substitute signals based on the load variation rate of the low-delay server and the average signal processing TAT variation rate, The maximum value of the obtained load information is multiplied by the load fluctuation rate A of the low delay server, the range of the number of substitution signals not exceeding the CPU threshold value is calculated, and the maximum score in the calculated range of the number of substitution signals The distribution is performed according to the value of the number of substitution signals.

このようにすることで、一定値である応答時間閾値を用いずに、処理サーバの全体的な処理遅延や処理負荷を見て動的に、その時点で最適な低遅延サーバおよび高遅延サーバを分類するので、より適切な処理サーバの分類を行うことができる。その結果、負荷分散のさらなる最適化を図ることができ、より一層処理負荷を制限しつつ、応答時間をできるだけ早めることができる。 In this way, the optimal low-latency server and high-delay server can be dynamically determined by looking at the overall processing delay and processing load of the processing server without using a constant response time threshold value. Since classification is performed, more appropriate processing server classification can be performed. As a result, load distribution can be further optimized, and the response time can be shortened as much as possible while further limiting the processing load.

本発明によれば、処理サーバの情報を収集するための信号量を減らしつつ、応答時間と処理サーバの処理負荷を減らすことができる負荷分散装置および負荷分散方法を提供することができる。 ADVANTAGE OF THE INVENTION According to this invention, the load distribution apparatus and load distribution method which can reduce a response time and the processing load of a processing server can be provided, reducing the signal amount for collecting the information of a processing server.

本発明の第１の実施形態に係る負荷分散装置が適用される分散システムを示す構成図である。1 is a configuration diagram illustrating a distribution system to which a load distribution apparatus according to a first embodiment of the present invention is applied. 上記第１の実施形態に係る負荷分散装置の情報格納部が格納する拠点情報を示す図である。It is a figure which shows the base information which the information storage part of the load distribution apparatus which concerns on the said 1st Embodiment stores. 上記第１の実施形態に係る負荷分散装置の負荷分散動作を説明する図である。It is a figure explaining the load distribution operation | movement of the load distribution apparatus which concerns on the said 1st Embodiment. 上記第１の実施形態に係る負荷分散装置の高遅延サーバに振り分ける信号を低遅延サーバで処理する負荷分散動作を表にして説明する図である。It is a figure explaining the load distribution operation | movement which processes the signal distributed to the high delay server of the load distribution apparatus which concerns on the said 1st Embodiment by a low delay server as a table | surface. 上記第１の実施形態に係る負荷分散装置の負荷分散のシーケンス例を示すフローチャートである。It is a flowchart which shows the example of a load distribution sequence of the load distribution apparatus which concerns on the said 1st Embodiment. 図５のシーケンス例を説明する図である。It is a figure explaining the example of a sequence of FIG. 上記第１の実施形態に係る負荷分散装置の肩代わり信号数算出部により算出された肩代わり信号数、低遅延サーバの負荷変動率、平均信号処理ＴＡＴ変動率、およびスコアの算出例を示す図である。It is a figure which shows the example of calculation of the shoulder signal number calculated by the shoulder signal number calculation part of the load distribution apparatus which concerns on the said 1st Embodiment, the load fluctuation rate of a low-delay server, an average signal processing TAT fluctuation rate, and a score. . 本発明の第２の実施形態に係る負荷分散装置が適用される分散システムを示す構成図である。It is a block diagram which shows the distribution system with which the load distribution apparatus which concerns on the 2nd Embodiment of this invention is applied. 上記第２の実施形態に係る負荷分散装置の負荷分散方法のシーケンス例を説明するフローチャートである。It is a flowchart explaining the example of a sequence of the load distribution method of the load distribution apparatus which concerns on the said 2nd Embodiment. 図９のシーケンス例を説明する図である。It is a figure explaining the example of a sequence of FIG. 上記第２の実施形態に係る負荷分散装置の肩代わり信号数算出部により算出された肩代わり信号数、低遅延サーバの負荷変動率、平均信号処理ＴＡＴ変動率、およびスコアの算出例を示す図である。It is a figure which shows the example of calculation of the number of substitution signals calculated by the substitution signal number calculation part of the load distribution apparatus which concerns on the said 2nd Embodiment, the load fluctuation rate of a low-delay server, an average signal processing TAT fluctuation rate, and a score. . 上記第２の実施形態に係る負荷分散装置の肩代わり信号数算出部により算出された肩代わり信号数、低遅延サーバの負荷変動率、平均信号処理ＴＡＴ変動率およびスコアの算出、ならびにスコアＣを用いた肩代わり信号数による振り分け例を示す図である。The number of substitution signals calculated by the substitution signal number calculation unit of the load distribution apparatus according to the second embodiment, the load fluctuation rate of the low-delay server, the average signal processing TAT fluctuation rate and score calculation, and the score C were used. It is a figure which shows the example of distribution by the number of shoulder substitute signals. 非特許文献２の負荷分散装置が適用される分散システムを示す構成図である。It is a block diagram which shows the distribution system with which the load distribution apparatus of a nonpatent literature 2 is applied.

以下、図面を参照して本発明を実施するための形態（以下、「本実施形態」という）における負荷分散装置等について説明する。
（背景説明）
まず、非特許文献２を用いて背景技術を説明する。
図１３は、非特許文献２の負荷分散装置が適用される分散システムを示す構成図である。
図１３に示すように、２つのデータセンタ（ＤＣ：Data Center）１，２が存在している。データセンタ１（自拠点）とデータセンタ２（他拠点）とは、地域や国をまたいで存在可能である。例えば、データセンタ１は東京（日本）を拠点とし、データセンタ２はワシントンＤ.Ｃ.（米国）を拠点とする。データセンタ１とデータセンタ２間は、広帯域と低遅延のネットワーク３で接続される。ただし、データセンタ１とデータセンタ２には、信号の拠点間遅延がある。 Hereinafter, a load balancer and the like in a mode for carrying out the present invention (hereinafter referred to as “the present embodiment”) will be described with reference to the drawings.
(Background explanation)
First, the background art will be described using Non-Patent Document 2.
FIG. 13 is a configuration diagram illustrating a distributed system to which the load distribution apparatus of Non-Patent Document 2 is applied.
As shown in FIG. 13, two data centers (DC: Data Center) 1 and 2 exist. The data center 1 (own site) and the data center 2 (other sites) can exist across regions and countries. For example, the data center 1 is based in Tokyo (Japan), and the data center 2 is based in Washington, DC (USA). The data center 1 and the data center 2 are connected by a network 3 having a wide band and a low delay. However, the data center 1 and the data center 2 have a signal-to-base delay.

本明細書において、自拠点とは、負荷分散装置１０が置かれている拠点をいう。ここで、自拠点は、データセンタ１である。他拠点とは、負荷分散装置１０（自身）が置かれている拠点とは別の離れた拠点をいう。ここで、他拠点は、データセンタ２である。
データセンタ１内には、複数（複数台）の処理サーバ１１〜１３が収容されており、各処理サーバ１１〜１３は、負荷分散装置１０にデータセンタ内ネットワークで接続されている。また、データセンタ２は、複数の処理サーバ２１〜２４がデータセンタ内ネットワークで接続されている。
データセンタ１は、クライアント（図示省略）がネットワークで接続されて構成される。 In the present specification, the own site refers to a site where the load balancer 10 is placed. Here, the local base is the data center 1. The other base refers to a remote base other than the base where the load balancer 10 (self) is placed. Here, the other base is the data center 2.
In the data center 1, a plurality of (plural) processing servers 11 to 13 are accommodated, and each of the processing servers 11 to 13 is connected to the load distribution apparatus 10 via a data center network. In the data center 2, a plurality of processing servers 21 to 24 are connected by a data center network.
The data center 1 is configured by connecting clients (not shown) via a network.

負荷分散装置１０は、データセンタ１（自拠点）に存在している。負荷分散装置１０は、データセンタ１またはデータセンタ２の処理サーバ１１〜１３，２１〜２４に対してクライアント（図示省略）からの要求信号（リクエスト）の振分を行っている。負荷分散装置１０は、クライアントからの要求を受け、データセンタ１またはデータセンタ２の処理サーバ１１〜１３，２１〜２４のうちの１台を選択し、その選択した処理サーバに処理を依頼する。選択された処理サーバは、クライアントからのアプリケーションの要求を処理する。 The load balancer 10 exists in the data center 1 (own site). The load balancer 10 distributes request signals (requests) from clients (not shown) to the processing servers 11 to 13 and 21 to 24 of the data center 1 or 2. Upon receiving a request from a client, the load distribution apparatus 10 selects one of the processing servers 11 to 13 and 21 to 24 of the data center 1 or the data center 2 and requests the selected processing server to perform processing. The selected processing server processes the application request from the client.

なお、負荷分散装置１０は、分散システムで動作するプログラムの一例としての分散処理ミドルウェアであってもよく、分散システム内の複数台の処理サーバ１１〜１３，２１〜２４に処理を振り分けるロードバランサのいずれでもよい。
また、図１３の分散システムを構成する各装置の台数は、図１３に例示した台数に限定されず、任意の台数としてもよい。 The load balancer 10 may be distributed processing middleware as an example of a program that operates in the distributed system, and is a load balancer that distributes processing to a plurality of processing servers 11 to 13 and 21 to 24 in the distributed system. Either is acceptable.
Moreover, the number of each apparatus which comprises the distributed system of FIG. 13 is not limited to the number illustrated in FIG. 13, It is good also as arbitrary numbers.

非特許文献２には、要求信号の処理ＴＡＴと処理サーバの処理負荷の両面を考慮した負荷分散方法が記載されている。非特許文献２に記載の負荷分散方法は、処理サーバの処理負荷や台数、２拠点間の遅延から低遅延サーバへ振り分ける信号数を算出する。
非特許文献２に記載の負荷分散方法では、サーバを、通信遅延が小さい低通信遅延サーバ（以下、低遅延サーバという）と通信遅延が大きい高通信遅延サーバ（以下、高遅延サーバという）の２種類に分け、あらかじめ与えられた遅延倍率（高遅延サーバの遅延時間を低遅延サーバの遅延時間で割った値）から振り分ける信号数の計算を行っている。図１３の場合、データセンタ１またはデータセンタ２の処理サーバ１１〜１３，２１〜２４のうち、データセンタ１の処理サーバ１１，１２が低遅延サーバ、データセンタ１の処理サーバ１３とデータセンタ２の処理サーバ２１〜２４が高遅延サーバである。 Non-Patent Document 2 describes a load distribution method that considers both the processing TAT of a request signal and the processing load of a processing server. The load distribution method described in Non-Patent Document 2 calculates the number of signals distributed to the low-delay server from the processing load and number of processing servers and the delay between the two sites.
In the load distribution method described in Non-Patent Document 2, two servers, a low communication delay server with a small communication delay (hereinafter referred to as a low delay server) and a high communication delay server with a large communication delay (hereinafter referred to as a high delay server), are used. It is divided into types and the number of signals to be distributed is calculated from a predetermined delay factor (the value obtained by dividing the delay time of the high delay server by the delay time of the low delay server). In the case of FIG. 13, among the processing servers 11 to 13 and 21 to 24 of the data center 1 or 2, the processing servers 11 and 12 of the data center 1 are low-latency servers, the processing server 13 of the data center 1 and the data center 2. The processing servers 21 to 24 are high-latency servers.

ところで、図１３の符号ｎに示すように、同じ拠点（データセンタ１（自拠点））内に低遅延サーバ１１，１２と高遅延サーバ１３が混在している場合がある。また、図１３の符号ｏに示すように、拠点（データセンタ２）では、処理サーバのＴＡＴが動的に変動する場合がある。処理サーバのＴＡＴが動的に変動するのは、具体的には以下のような状態のときに発生しうる。（１）処理サーバのバックグラウンド処理によってＴＡＴが変動する。（２）災害等によって特定のルートが遮断されることによってＴＡＴが変動する。 By the way, as indicated by a symbol n in FIG. 13, the low-delay servers 11 and 12 and the high-delay server 13 may be mixed in the same base (data center 1 (own base)). Further, as indicated by a symbol o in FIG. 13, the TAT of the processing server may dynamically fluctuate at the base (data center 2). The dynamic change of the TAT of the processing server can specifically occur in the following state. (1) The TAT varies depending on the background processing of the processing server. (2) TAT fluctuates due to a specific route being blocked by a disaster or the like.

非特許文献２に記載の負荷分散方法では、処理サーバのＴＡＴが動的に変動する場合や、同じ拠点内に低遅延サーバと高遅延サーバが混在している場合には、ＴＡＴが変動するため各処理サーバの処理負荷を精確に測定できず、振り分ける信号数を適切に算出することができない。 In the load distribution method described in Non-Patent Document 2, when the TAT of the processing server changes dynamically, or when the low delay server and the high delay server coexist in the same base, the TAT changes. The processing load of each processing server cannot be accurately measured, and the number of signals to be distributed cannot be calculated appropriately.

同じ拠点内に低遅延サーバと高遅延サーバが混在する場合や、他拠点の処理サーバのＴＡＴが動的に変動する場合であっても、振り分ける信号数を適切に算出することができる方法が求められている。 There is a need for a method that can appropriately calculate the number of signals to be distributed even when low-latency servers and high-latency servers coexist in the same base, or even when the TAT of a processing server at another base changes dynamically. It has been.

（第１の実施形態）
図１は、本発明の第１の実施形態に係る負荷分散装置が適用される分散システムを示す構成図である。図１３と同一構成部には同一符号を付している。
図１に示すように、２つのデータセンタ（ＤＣ）１，２が存在している。ここでは、データセンタ１は東京（日本）を拠点とし、データセンタ２はワシントンＤ.Ｃ.（米国）を拠点とする例で説明する。データセンタ１とデータセンタ２間は、広帯域で低遅延のネットワーク３で接続される。
データセンタ１内には、複数（複数台）の処理サーバ１１〜１７が収容されており、各処理サーバ１１〜１７は、負荷分散装置１００にデータセンタ内ネットワークで接続されている。また、データセンタ２は、複数の処理サーバ２１〜２４がデータセンタ内ネットワークで接続されている。
データセンタ１は、クライアント３１〜３４がネットワークで接続されて構成される。
クライアント３１〜３４は、ユーザの端末や対向システムであり、データセンタ１，２の処理サーバ１１〜１７，２１〜２４に対して、アプリケーションの処理を要求する。 (First embodiment)
FIG. 1 is a configuration diagram showing a distributed system to which a load distribution apparatus according to a first embodiment of the present invention is applied. The same components as those in FIG. 13 are denoted by the same reference numerals.
As shown in FIG. 1, two data centers (DC) 1 and 2 exist. Here, an example will be described in which the data center 1 is based in Tokyo (Japan) and the data center 2 is based in Washington, DC (USA). The data center 1 and the data center 2 are connected by a broadband and low delay network 3.
In the data center 1, a plurality (multiple) of processing servers 11 to 17 are accommodated, and each of the processing servers 11 to 17 is connected to the load distribution apparatus 100 via a data center network. In the data center 2, a plurality of processing servers 21 to 24 are connected by a data center network.
The data center 1 is configured by connecting clients 31 to 34 via a network.
The clients 31 to 34 are user terminals or opposing systems, and request application processing to the processing servers 11 to 17 and 21 to 24 of the data centers 1 and 2.

負荷分散装置１００は、データセンタ１（自拠点）に存在している。負荷分散装置１００は、データセンタ１またはデータセンタ２の処理サーバ１１〜１７，２１〜２４に対してクライアント３１〜３４からの要求信号（リクエスト）の振分を行っている。負荷分散装置１００は、クライアント３１〜３４からの要求を受け、データセンタ１またはデータセンタ２の処理サーバ１１〜１７，２１〜２４のうちの１台を選択し、その選択した処理サーバに処理を依頼する。選択された処理サーバは、クライアント３１〜３４からのアプリケーションの要求を処理する。 The load distribution apparatus 100 exists in the data center 1 (own site). The load distribution apparatus 100 distributes request signals (requests) from the clients 31 to 34 to the processing servers 11 to 17 and 21 to 24 of the data center 1 or the data center 2. In response to a request from the clients 31 to 34, the load distribution apparatus 100 selects one of the processing servers 11 to 17 and 21 to 24 of the data center 1 or the data center 2, and performs processing on the selected processing server. Ask. The selected processing server processes application requests from the clients 31-34.

このように、負荷分散装置１００は、自身が設置された処理サーバ１１〜１７、または自拠点から離隔した処理サーバ２１〜２４に対して、クライアント３１〜３４からリクエストされた情報処理を実行させる処理サーバを決定する。負荷分散装置１００の詳細構成については後記する。 As described above, the load distribution apparatus 100 causes the processing servers 11 to 17 in which the load balancing apparatus 100 is installed or the processing servers 21 to 24 separated from the base to execute the information processing requested by the clients 31 to 34. Determine the server. The detailed configuration of the load distribution apparatus 100 will be described later.

なお、負荷分散装置１００は、分散システムで動作するプログラムの一例としての分散処理ミドルウェアであってもよく、分散システム内の複数台の処理サーバ２１〜２７に処理を振り分けるロードバランサのいずれでもよい。
また、図１の分散システムを構成する各装置の台数は、図１に例示した台数に限定されず、任意の台数としてもよい。 The load distribution apparatus 100 may be distributed processing middleware as an example of a program that operates in the distributed system, or may be any load balancer that distributes processing to a plurality of processing servers 21 to 27 in the distributed system.
Further, the number of devices constituting the distributed system in FIG. 1 is not limited to the number illustrated in FIG. 1 and may be an arbitrary number.

負荷分散装置１００の詳細について説明する。
負荷分散装置１００は、処理サーバの処理負荷や台数、通信遅延を考慮しながら、できるだけ通信遅延が小さい処理サーバへ信号を振り分ける。負荷分散装置１００は、各処理サーバの処理ＴＡＴ（Turn Around Time）を測定する処理時間測定手段と、測定した前記処理ＴＡＴに基づいて、各処理サーバを低遅延サーバと高遅延サーバとに分類する分類手段としての機能を有する。なお、前記処理ＴＡＴには、負荷分散装置１００から処理サーバに信号を送信するのに要する時間（通信遅延時間）と処理サーバによる信号処理時間が含まれる。
図１に示すように、負荷分散装置１００は、肩代わり信号数算出部１１１（信号数算出手段）を有する振分機能部１１０（振分手段）と、サーバ情報取得部１２０（負荷情報取得手段）と、情報格納部１３０（記憶手段）と、を備える。 Details of the load balancer 100 will be described.
The load distribution apparatus 100 distributes a signal to a processing server having as small a communication delay as possible while considering the processing load, the number of processing servers, and the communication delay. The load distribution apparatus 100 classifies each processing server into a low-delay server and a high-delay server based on a processing time measuring unit that measures a processing TAT (Turn Around Time) of each processing server and the measured processing TAT. It has a function as a classification means. The processing TAT includes a time required for transmitting a signal from the load balancer 100 to the processing server (communication delay time) and a signal processing time by the processing server.
As shown in FIG. 1, the load distribution apparatus 100 includes a distribution function unit 110 (distribution unit) having a substitution signal number calculation unit 111 (signal number calculation unit) and a server information acquisition unit 120 (load information acquisition unit). And an information storage unit 130 (storage means).

<肩代わり信号数算出部１１１>
肩代わり信号数算出部１１１は、肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣ（いずれも後記）の値を算出し、情報格納部１３０へ記憶する。
肩代わり信号数算出部１１１は、後記（方法１）を実行する場合、低遅延サーバの台数と高遅延サーバの台数と、高遅延サーバに振り分けるべき信号を低遅延サーバに振り分ける肩代わり信号数Ｓ（後記）とを用いて低遅延サーバの負荷変動を示す処理サーバの負荷変動率Ａを算出する。ここで肩代わり信号数Ｓは、ｎ（ｎは自然数）個あり、Ｓ＝０からＳ＝ｎまで、後記する式（１）に順次当てはめて、低遅延サーバの負荷変動率Ａを求める。なお、肩代わり信号数Ｓ決定の詳細については実施例１：（方法１）（後記図３，図４参照）で述べる。
そして、肩代わり信号数算出部１１１は、取得したＣＰＵ使用率（負荷情報）の最大値と低遅延サーバの負荷変動率Ａとに基づいて、ＣＰＵ限界値（負荷制限値）を超えない範囲で、高遅延サーバが処理すべき信号を低遅延サーバで処理する肩代わり信号数Ｓを決定する。 <Shoulder substitution signal number calculation unit 111>
The shoulder signal number calculation unit 111 calculates the values of the load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C (all described later) of the low-delay server for each shoulder signal number S, and stores them in the information storage unit 130. To do.
When executing the postscript (method 1), the shoulder substitution signal number calculation unit 111 assigns the number of low delay servers, the number of high delay servers, and the number of shoulder substitution signals S (postscript) to which signals to be distributed to the high delay servers are distributed to the low delay servers. ) To calculate the load fluctuation rate A of the processing server indicating the load fluctuation of the low-latency server. Here, the number of substitution signals S is n (n is a natural number), and the load fluctuation rate A of the low-delay server is obtained by sequentially applying from S = 0 to S = n to the formula (1) described later. The details of determining the substitution signal number S will be described in Example 1: (Method 1) (see FIGS. 3 and 4 to be described later).
Then, the substitution signal number calculation unit 111 is based on the acquired maximum value of the CPU usage rate (load information) and the load fluctuation rate A of the low-delay server in a range not exceeding the CPU limit value (load limit value). The number S of substitution signals to be processed by the low delay server is determined for the signal to be processed by the high delay server.

また、肩代わり信号数算出部１１１は、後記（方法２）を実行する場合、低遅延サーバの台数と高遅延サーバの台数と、高遅延サーバに振り分けるべき信号を低遅延サーバに振り分ける肩代わり信号数Ｓと、低遅延サーバで処理した場合の処理ＴＡＴ（応答時間）と、高遅延サーバで処理した場合の処理ＴＡＴとを用いて、リクエストをあらかじめ決めた順序で各処理サーバに振り分ける場合の応答時間の変動を示す平均信号処理ＴＡＴ変動率Ｂを算出する。ここで肩代わり信号数Ｓは、ｎ（ｎは自然数）個あり、Ｓ＝０からＳ＝ｎまで、後記する式（２）に順次当てはめて、平均信号処理ＴＡＴ変動率Ｂを求める。 In addition, when executing the postscript (method 2), the substitution signal number calculation unit 111 counts the number of low delay servers, the number of high delay servers, and the substitution signal number S that distributes signals to be distributed to the high delay servers to the low delay servers. Response time when the requests are distributed to each processing server in a predetermined order using the processing TAT (response time) when processed by the low delay server and the processing TAT when processed by the high delay server. An average signal processing TAT fluctuation rate B indicating fluctuation is calculated. Here, the number of substitution signals S is n (n is a natural number), and from S = 0 to S = n, the average signal processing TAT fluctuation rate B is obtained by sequentially applying to equation (2) described later.

そして、肩代わり信号数算出部１１１は、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂとを用いて、高遅延サーバが処理すべき信号を低遅延サーバで処理する肩代わり信号数Ｓの範囲を算出する。具体的には、肩代わり信号数算出部１１１は、情報格納部１３０から拠点情報１５０（後記図２参照）を参照し、取得した負荷情報の中の最大値と低遅延サーバの負荷変動率Ａを乗算し、あらかじめ設定されている低遅延サーバのＣＰＵ限界値（負荷制限値）を超えない肩代わり信号数Ｓの範囲を算出し、算出した肩代わり信号数Ｓの範囲の中で最大の肩代わり信号数Ｓを評価するスコアとなる肩代わり信号数Ｓを決定する。すなわち、肩代わり信号数算出部１１１は、低遅延サーバのＣＰＵ使用率の最大値をＡ倍しても低遅延サーバのＣＰＵ限界値（負荷制限値）を超えないような最大肩代わり信号数Ｓの範囲を算出し、この最大肩代わり信号数Ｓを超えない範囲で最も高いスコアとなる肩代わり信号数Ｓを決定する。なお、スコア算出による肩代わり信号数Ｓ決定の詳細については実施例２：（方法２）（図６，図７参照）で後記する。 Then, the substitution signal number calculation unit 111 uses the load variation rate A and the average signal processing TAT variation rate B of the low delay server, and the substitution signal number S for processing the signal to be processed by the high delay server by the low delay server. The range of is calculated. Specifically, the substitution signal number calculation unit 111 refers to the base information 150 (see FIG. 2 to be described later) from the information storage unit 130, and determines the maximum value in the acquired load information and the load variation rate A of the low-delay server. Multiplication is performed to calculate the range of the substitution signal number S that does not exceed the CPU limit value (load limit value) of the low delay server set in advance, and the largest substitution signal number S in the calculated substitution signal number S range. The number S of shoulder substitute signals serving as a score for evaluating is determined. That is, the substitution signal number calculation unit 111 is a range of the maximum substitution signal number S that does not exceed the CPU limit value (load limit value) of the low-delay server even if the maximum value of the CPU usage rate of the low-delay server is multiplied by A. Is calculated, and the shoulder signal number S that gives the highest score in a range not exceeding the maximum shoulder signal number S is determined. The details of determining the substitution signal number S by calculating the score will be described later in Example 2: (Method 2) (see FIGS. 6 and 7).

ここで、負荷分散装置１００は、各処理サーバに均等にリクエストを振り分けるラウンドロビン（静的分散方式の１つ）を採る場合、上記平均信号処理ＴＡＴ変動率Ｂは、例えばラウンドロビンの１ラウンドにおける平均信号処理ＴＡＴ変動率Ｂである。なお、処理サーバの処理能力に応じてリクエストの分散比率を変えて各処理サーバの負荷を均等にする重み付きラウンドロビンに適用し、この重み付きラウンドロビンにおいて、１ラウンドにおける平均信号処理ＴＡＴ変動率Ｂを算出する態様でもよい。 Here, when the load distribution apparatus 100 adopts round robin (one of static distribution methods) that distributes requests equally to each processing server, the average signal processing TAT fluctuation rate B is, for example, in one round of round robin. Average signal processing TAT variation rate B. In addition, it applies to the weighted round robin which changes the distribution ratio of requests according to the processing capacity of the processing server and equalizes the load of each processing server, and in this weighted round robin, the average signal processing TAT variation rate in one round A mode of calculating B may be used.

<振分機能部１１０>
振分機能部１１０は、肩代わり信号数算出部１１１によって算出された肩代わり信号数Ｓの値に従って、処理サーバへ信号の振り分けを行う。具体的には、振分機能部１１０は、振分先の処理サーバが高遅延サーバだった場合、肩代わり信号数Ｓの分だけ、本来高遅延サーバに振り分けていた信号を低遅延サーバに振り分ける（実施例１：（方法１）で後記）。 <Distribution function unit 110>
The distribution function unit 110 distributes the signal to the processing server according to the value of the shoulder signal number S calculated by the shoulder signal number calculation unit 111. Specifically, when the processing server that is the distribution destination is a high-delay server, the distribution function unit 110 distributes the signal originally distributed to the high-delay server by the number of substitution signals S to the low-delay server ( Example 1 (described later in (Method 1)).

また、振分機能部１１０は、低遅延サーバのＣＰＵ限界値（負荷制限値）を超えない肩代わり信号数Ｓの範囲の中で最大のスコアとなる最大肩代わり信号数Ｓ分を、高遅延サーバから低遅延サーバに振り分ける（実施例２：（方法２）で後記）。具体的には、振分機能部１１０は、取得した負荷情報の最大値と低遅延サーバの負荷変動率Ａとを乗算し、ＣＰＵ閾値を超えない肩代わり信号数Ｓの範囲を算出し、算出した肩代わり信号数Ｓの範囲の中で最大のスコアＣとなる肩代わり信号数Ｓの値で振り分けを行う。
なお、振分機能部１１０は、クライアント３１〜３４からの要求信号（リクエスト）を受け取ると、あらかじめ決められた方法（例えばラウンドロビン方式）によって振分先の処理サーバを決める。 In addition, the distribution function unit 110 obtains the maximum substitution signal number S corresponding to the maximum score in the range of the substitution signal number S that does not exceed the CPU limit value (load limit value) of the low delay server from the high delay server. Distribution to low-latency servers (Example 2: (described later in (Method 2)). Specifically, the distribution function unit 110 multiplies the maximum value of the acquired load information by the load fluctuation rate A of the low-delay server, calculates the range of the substitution signal number S that does not exceed the CPU threshold, and calculates Sorting is performed according to the value of the shoulder signal number S that gives the maximum score C within the range of the shoulder signal number S.
When the distribution function unit 110 receives request signals (requests) from the clients 31 to 34, the distribution function unit 110 determines a distribution destination processing server by a predetermined method (for example, a round robin method).

<サーバ情報取得部１２０>
サーバ情報取得部１２０は、各処理サーバの応答時間（ＴＡＴ）を取得する。また、サーバ情報取得部１２０は、各処理サーバの負荷情報（例えば、ＣＰＵ使用率、メモリ使用率等がある。本実施形態では、ＣＰＵ使用率を例に採る）を定期的に収集（取得）する。負荷情報取得部１２０は、収集したＣＰＵ使用率（負荷情報）を情報格納部１３０に格納するとともに、振分機能部１１０に通知する。 <Server information acquisition unit 120>
The server information acquisition unit 120 acquires the response time (TAT) of each processing server. In addition, the server information acquisition unit 120 periodically collects (acquires) load information (for example, CPU usage rate, memory usage rate, etc., taking CPU usage rate as an example in this embodiment) of each processing server. To do. The load information acquisition unit 120 stores the collected CPU usage rate (load information) in the information storage unit 130 and notifies the distribution function unit 110 thereof.

<情報格納部１３０>
情報格納部１３０は、ＣＰＵ閾値、各処理サーバの応答時間（ＴＡＴ）、および各処理サーバのＣＰＵ使用率を格納する。情報格納部１３０は、肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣ（図７参照）の値を保持する。
また、情報格納部１３０は、自拠点の処理サーバのＣＰＵ限界値（負荷制限値）、データセンタ１とデータセンタ２間の処理遅延時間値（拠点間遅延）、自拠点の処理サーバ数（処理サーバの台数）、自拠点の処理サーバ数のうち低遅延サーバ数（低遅延サーバの台数）および高遅延サーバ数（高遅延サーバの台数）、処理サーバのＣＰＵ使用率（負荷情報）などを拠点情報１５０（図２参照）として格納する。情報格納部１３０が保持する拠点情報１５０については、図２により後記する。 <Information storage unit 130>
The information storage unit 130 stores the CPU threshold, the response time (TAT) of each processing server, and the CPU usage rate of each processing server. The information storage unit 130 holds values of the load fluctuation rate A, the average signal processing TAT fluctuation rate B, and the score C (see FIG. 7) of the low-delay server for each substitution signal number S.
In addition, the information storage unit 130 includes a CPU limit value (load limit value) of the processing server at the local site, a processing delay time value between the data center 1 and the data center 2 (inter-base delay), and the number of processing servers at the local site (processing Number of servers), number of low-delay servers (number of low-delay servers) and number of high-delay servers (number of high-delay servers), processing server CPU usage (load information), etc. Stored as information 150 (see FIG. 2). The base information 150 held by the information storage unit 130 will be described later with reference to FIG.

図２は、図１の負荷分散装置１００の情報格納部１３０が格納する拠点情報１５０を示す図である。
図２に示すように、情報格納部１３０は、拠点情報１５０として、低遅延サーバのＣＰＵ限界値（負荷制限値）、拠点間遅延、低遅延サーバ数（低遅延サーバの台数）、高遅延サーバ数（高遅延サーバの台数）、低遅延サーバ２１〜２３の使用率（処理サーバ２１のＣＰＵ使用率、処理サーバ２２のＣＰＵ使用率、および処理サーバ２３のＣＰＵ使用率）を格納する。なお、上記ＣＰＵ限界値を格納しておくことで、上記ＣＰＵ使用率の最大値をＡ（低遅延サーバの負荷変動率）倍した場合の比較から、肩代わり信号数Ｓを決定することができる。 FIG. 2 is a diagram illustrating the base information 150 stored in the information storage unit 130 of the load distribution apparatus 100 in FIG.
As shown in FIG. 2, the information storage unit 130 includes, as the base information 150, a CPU limit value (load limit value) of a low-delay server, an inter-base delay, a low-delay server number (the number of low-delay servers), and a high-delay server. Number (the number of high-delay servers) and usage rates of the low-delay servers 21 to 23 (CPU usage rate of the processing server 21, CPU usage rate of the processing server 22, and CPU usage rate of the processing server 23) are stored. By storing the CPU limit value, it is possible to determine the substitution signal number S from a comparison when the maximum value of the CPU usage rate is multiplied by A (load variation rate of the low delay server).

図２の例では、低遅延サーバのＣＰＵ限界値は、「８０」(ＣＰＵ限界値の最大値を１００％とした場合、その８０％)である。
拠点間遅延は、「１０００」(１０００[ｍｓ])である。
自拠点の処理サーバ数は、自拠点（データセンタ１）内に７つの処理サーバ１１〜１７（図１参照）を有するので「７」である。
また、自拠点の処理サーバ１１〜１７のうち、測定した処理時間に基づいて分類された結果、低遅延サーバ数は「３」、高遅延サーバ数は「４」である（図３参照）。
他拠点の処理サーバ数は、他拠点（データセンタ２）内に４つの処理サーバ２１〜２４（図１参照）を有するので「４」である。
低遅延サーバ２１のＣＰＵ使用率は、「３５」（ＣＰＵ使用率の最大値を１００％とした場合、その３５％。以下同様。)、処理サーバ２２のＣＰＵ使用率は、「３４」、処理サーバ２３のＣＰＵ使用率は、「３４」である。 In the example of FIG. 2, the CPU limit value of the low-latency server is “80” (80% when the maximum CPU limit value is 100%).
The inter-base delay is “1000” (1000 [ms]).
The number of processing servers at the local site is “7” because there are seven processing servers 11 to 17 (see FIG. 1) in the local site (data center 1).
In addition, as a result of classification based on the measured processing time among the processing servers 11 to 17 at the local site, the number of low delay servers is “3” and the number of high delay servers is “4” (see FIG. 3).
The number of processing servers at other bases is “4” because there are four processing servers 21 to 24 (see FIG. 1) in the other bases (data center 2).
The CPU usage rate of the low-latency server 21 is “35” (35% when the maximum value of the CPU usage rate is 100%. The same applies hereinafter), and the CPU usage rate of the processing server 22 is “34”. The CPU usage rate of the server 23 is “34”.

以下、上述のように構成された負荷分散装置１００の負荷分散方法について説明する。
［原理説明］
まず、本発明の基本的な考え方について述べる。
負荷分散装置１００は、各処理サーバの処理時間（ＴＡＴ）を測定し、測定した各処理サーバの処理時間（ＴＡＴ）があらかじめ設定された閾値より大きいか小さいかを判定して、各処理サーバを高遅延サーバと低遅延サーバとに分類する。
ここで、低遅延サーバの台数をＮｍ、高遅延サーバの台数をＮｎ、ラウンドロビン方式に従うと本来高遅延サーバで処理される信号を低遅延サーバで処理する１ラウンドあたりの肩代わり信号数をＳ（０≦Ｓ≦Ｎｎ）とすると、低遅延サーバの負荷変動率Ａは、次式（１）で示される。 Hereinafter, a load distribution method of the load distribution apparatus 100 configured as described above will be described.
[Principle explanation]
First, the basic concept of the present invention will be described.
The load balancer 100 measures the processing time (TAT) of each processing server, determines whether the measured processing time (TAT) of each processing server is larger or smaller than a preset threshold, and sets each processing server. Classify into high-latency servers and low-latency servers.
Here, the number of low-delay servers is Nm, the number of high-delay servers is Nn, and according to the round robin method, the number of signals per round for processing signals that are originally processed by the high-delay server by the low-delay server is S ( If 0 ≦ S ≦ Nn), the load fluctuation rate A of the low-latency server is expressed by the following equation (1).

低遅延サーバの負荷変動率Ａ＝（Ｎｍ＋Ｓ）／Ｎｍ …（１） Load fluctuation rate A = (Nm + S) / Nm of the low delay server (1)

また、低遅延サーバで処理した場合の処理ＴＡＴ（Turn Around Time）（応答時間）を１としたとき、高遅延サーバで処理した場合の処理ＴＡＴを通信遅延倍率Ｄとすると、１ラウンドにおける平均信号処理ＴＡＴ変動率Ｂは、次式（２）で示される。 Further, when the processing TAT (Turn Around Time) (response time) when processed by the low delay server is 1, and the processing TAT when processed by the high delay server is the communication delay factor D, the average signal in one round The processing TAT fluctuation rate B is expressed by the following equation (2).

平均信号処理ＴＡＴ変動率Ｂ＝（（Ｎｍ＋Ｓ）＋Ｄ×（Ｎｎ−Ｓ））／（Ｎｍ＋Ｄ×Ｎｎ） …（２） Average signal processing TAT fluctuation rate B = ((Nm + S) + D × (Nn−S)) / (Nm + D × Nn) (2)

すなわち、１ラウンドにおける平均信号処理ＴＡＴ変動率Ｂは（（Ｎｍ＋Ｓ）＋Ｄ×（Ｎｎ−Ｓ））／（Ｎｍ＋Ｄ×Ｎｎ）倍となる。Ｄは、高遅延サーバの処理ＴＡＴの平均を低遅延サーバの処理ＴＡＴの平均で割った値で近似する遅延倍率である。 That is, the average signal processing TAT fluctuation rate B in one round is ((Nm + S) + D × (Nn−S)) / (Nm + D × Nn) times. D is a delay magnification that approximates a value obtained by dividing the average of the processing TAT of the high delay server by the average of the processing TAT of the low delay server.

負荷分散装置１００は、肩代わり信号数Ｓを決定する方法として（方法１）と、（方法２）と、を備える。 The load distribution apparatus 100 includes (Method 1) and (Method 2) as methods for determining the number S of substitution signals.

（方法１）：自拠点の処理サーバの負荷変動率Ａを用いて肩代わり信号数Ｓを決定する。
負荷分散装置１００は、自拠点の各処理サーバのＣＰＵ使用率（負荷情報）を取得し、取得した負荷情報の中の最大値と低遅延サーバの負荷変動率Ａを乗算する。あらかじめ設定されている処理サーバのＣＰＵ限界値（負荷制限値）を超えない範囲で最大の肩代わり信号数Ｓを算出し、算出した肩代わり信号数Ｓまで高遅延サーバに振り分けられる信号を低遅延サーバで処理するようにする。 (Method 1): The substitution signal number S is determined using the load fluctuation rate A of the processing server at the local site.
The load distribution apparatus 100 acquires the CPU usage rate (load information) of each processing server at its own base, and multiplies the maximum value in the acquired load information by the load fluctuation rate A of the low delay server. The maximum substitution signal number S is calculated within a range not exceeding the CPU limit value (load limit value) of the processing server set in advance, and the signals distributed to the high delay server up to the calculated substitution signal number S are transmitted to the low delay server. To be processed.

（方法２）：低遅延サーバの負荷変動率Ａを用いて最大肩代わり信号数の範囲を算出し、最大肩代わり信号数の範囲中で最大のスコアとなる肩代わり信号数Ｓを決定する。
負荷分散装置１００は、自拠点の各処理サーバのＣＰＵ使用率（負荷情報）を取得し、取得した負荷情報の中の最大値と低遅延サーバの負荷変動率Ａを乗算する。あらかじめ設定されている処理サーバの負荷制限値を超えない肩代わり信号数Ｓの範囲を算出する。算出した肩代わり信号数Ｓの範囲の中で最大のスコアとなる肩代わり信号数Ｓの値を用いて高遅延サーバで処理される信号を低遅延サーバで処理するようにする。上記スコアは、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂを用いて算出される評価関数で表される値である。例えば、スコアの一例として、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂから算出される減少関数を評価関数として用いる。この減少関数は、低遅延サーバの負荷変動率Ａまたは平均信号処理ＴＡＴ変動率Ｂが減少すると、スコアが上昇する値（例えば（１／Ａ＋１／Ｂ））である。 (Method 2): The range of the maximum substitution signal number is calculated using the load fluctuation rate A of the low-delay server, and the substitution signal number S having the maximum score in the range of the maximum substitution signal number is determined.
The load distribution apparatus 100 acquires the CPU usage rate (load information) of each processing server at its own base, and multiplies the maximum value in the acquired load information by the load fluctuation rate A of the low delay server. A range of the substitution signal number S that does not exceed a preset load limit value of the processing server is calculated. The signal processed by the high delay server is processed by the low delay server using the value of the shoulder signal number S that is the maximum score in the calculated range of the shoulder signal number S. The score is a value represented by an evaluation function calculated using the load fluctuation rate A and the average signal processing TAT fluctuation rate B of the low-delay server. For example, as an example of the score, a decreasing function calculated from the load fluctuation rate A and the average signal processing TAT fluctuation rate B of the low-delay server is used as the evaluation function. This decrease function is a value (for example, (1 / A + 1 / B)) in which the score increases when the load fluctuation rate A or the average signal processing TAT fluctuation rate B of the low delay server decreases.

上記（方法１）または（方法２）により、肩代わり信号数Ｓが決定した後、負荷分散装置１００は、本来高遅延サーバに振り分けられる信号（本来高遅延サーバが処理するはずだった信号）を肩代わり信号数Ｓ分だけ、低遅延サーバで処理するように低遅延サーバへ振り分ける。その際、負荷分散装置１００は、自拠点のどの処理サーバを選択するかは、負荷分散装置１００がこれまで用いていた振分アルゴリズムに従えばよい。 After the shoulder signal number S is determined by the above (Method 1) or (Method 2), the load distribution apparatus 100 replaces the signal that was originally distributed to the high delay server (the signal that the high delay server was supposed to process). The number of signals is allocated to the low-delay server so that the low-delay server processes the signal. At this time, the load balancer 100 may select which processing server at its own site according to the distribution algorithm used by the load balancer 100 so far.

まず、上記（方法１）の具体的動作例について説明する。
［実施例１：（方法１）の動作例］
上記（方法１）は、低遅延サーバの処理負荷をギリギリまで上げてＴＡＴを減らす手法を提供する。
図３および図４を参照して負荷分散装置１００の負荷分散方法の（方法１）動作について説明する。
図３は、負荷分散装置１００の負荷分散動作を説明する図である。図１と同一構成部には同一符号を付している。
図３に示すように、自拠点（データセンタ１）内の処理サーバ１１〜１７には、低遅延サーバ１１〜１３と高遅延サーバ１４〜１７とが混在している。負荷分散装置１００から各処理サーバ１１〜１７に伸びる矢印（細実線矢印、破線矢印、太実線矢印）は、負荷分散装置１００が各処理サーバ１１〜１７に信号を振り分けることを示している。例えば、負荷分散装置１００は、図３の細実線矢印（→印）に示すように、処理サーバ１１〜１３，１５〜１７に対してクライアントからの要求信号の振分を行っている。 First, a specific operation example of the above (Method 1) will be described.
[Example 1: Operation example of (Method 1)]
The above (Method 1) provides a method for reducing the TAT by increasing the processing load of the low-latency server to the limit.
With reference to FIG. 3 and FIG. 4, the (method 1) operation of the load distribution method of the load distribution apparatus 100 will be described.
FIG. 3 is a diagram for explaining the load distribution operation of the load distribution apparatus 100. The same components as those in FIG. 1 are denoted by the same reference numerals.
As shown in FIG. 3, low delay servers 11 to 13 and high delay servers 14 to 17 are mixed in the processing servers 11 to 17 in the local base (data center 1). Arrows (thin solid line arrows, broken line arrows, thick solid arrows) extending from the load balancer 100 to the respective processing servers 11 to 17 indicate that the load balancer 100 distributes signals to the respective processing servers 11 to 17. For example, the load distribution apparatus 100 distributes request signals from the clients to the processing servers 11 to 13 and 15 to 17 as indicated by thin solid arrows (→ marks) in FIG.

上記（方法１）で述べたように、負荷分散装置１００は、低遅延サーバの負荷変動率Ａを用いて肩代わり信号数Ｓを算出し、算出した肩代わり信号数Ｓまで本来高遅延サーバに振り分けられる信号を肩代わり信号数Ｓ分だけ、低遅延サーバに振り分ける。図３の符号ａおよび白抜き矢印（⇒印）に示すように、本来であれば高遅延サーバ１４に振り分けられる信号（破線矢印参照）を低遅延サーバ１１へ振り分ける（太実線矢印参照）。 As described in the above (Method 1), the load distribution apparatus 100 calculates the shoulder signal number S using the load fluctuation rate A of the low delay server, and is distributed to the high delay server up to the calculated shoulder signal number S. The signals are distributed to the low-delay server by the number of signals S on the shoulder. As shown by the symbol a and the white arrow (⇒ mark) in FIG. 3, the signal (see the broken line arrow) that is originally distributed to the high delay server 14 is distributed to the low delay server 11 (see the thick solid arrow).

図４は、図３の負荷分散装置１００によって、高遅延サーバ１４〜１７に振り分ける信号を低遅延サーバ１１〜１３で処理する負荷分散動作を表にして説明する図である。図４は、１ラウンドあたりの肩代わり信号数Ｓが１の場合の振分例を示す。図４の縦軸は処理サーバによってシーケンシャル処理される信号シーケンス番号１〜１４を示し、横軸は処理サーバ１１〜１７（低遅延サーバ１１〜１３と高遅延サーバ１４〜１７）を示す。図４において、実線の白丸（○印）は信号が該当処理サーバに振り分けられることを示している。例えば、図４の信号１と低遅延サーバ１１とが交差する位置の実線の白丸（○印）は、信号シーケンス番号の信号１が低遅延サーバ１１に振り分けられることを示している。また、図４において、破線の白丸（破線○印）は、本来高遅延サーバに振り分けられる信号が低遅延サーバに振り分けられたことを示している。例えば、図４の信号４と低遅延サーバ１１とが交差する位置の実線の白丸（○印）は、本来高遅延サーバ１４に振り分けられる信号シーケンス番号の信号４が低遅延サーバ１１に振り分けられることを示している。 FIG. 4 is a diagram illustrating the load distribution operation in which the low delay servers 11 to 13 process signals distributed to the high delay servers 14 to 17 by the load distribution apparatus 100 of FIG. FIG. 4 shows an example of distribution when the shoulder signal number S per round is 1. The vertical axis in FIG. 4 indicates signal sequence numbers 1 to 14 that are sequentially processed by the processing server, and the horizontal axis indicates processing servers 11 to 17 (low delay servers 11 to 13 and high delay servers 14 to 17). In FIG. 4, a solid white circle (◯ mark) indicates that the signal is distributed to the corresponding processing server. For example, a solid white circle (o mark) at a position where signal 1 and low-delay server 11 in FIG. 4 intersect indicates that signal 1 having a signal sequence number is distributed to low-delay server 11. Further, in FIG. 4, broken white circles (broken line circles) indicate that the signal originally distributed to the high delay server is distributed to the low delay server. For example, a solid white circle (o mark) at a position where the signal 4 and the low delay server 11 in FIG. 4 intersect each other indicates that the signal 4 having the signal sequence number originally assigned to the high delay server 14 is assigned to the low delay server 11. Is shown.

図４に示すように、１ラウンドあたりの肩代わり信号数Ｓが１の場合、肩代わり信号数Ｓが１であるので、１ラウンドあたり高遅延サーバに振り分けられる信号の１つを低遅延サーバへ振り分ける。すなわち、図４の符号ｂおよび破線の白丸（破線○印）に示すように、本来であれば高遅延サーバ１４（図３参照）に振り分けられる信号を低遅延サーバ１１（図３参照）へ振り分ける。また、図４の符号ｃおよび破線の白丸（破線○印）に示すように、本来であれば高遅延サーバ１５（図３参照）に振り分けられる信号を低遅延サーバ１２（図３参照）へ振り分ける。 As shown in FIG. 4, when the shoulder signal number S per round is 1, the shoulder signal number S is 1, so that one of the signals distributed to the high delay server per round is distributed to the low delay server. That is, as indicated by the symbol b in FIG. 4 and the white circle of the broken line (dashed circle mark), the signal that is originally distributed to the high delay server 14 (see FIG. 3) is distributed to the low delay server 11 (see FIG. 3). . In addition, as indicated by the symbol c in FIG. 4 and the white circle (dashed line circle), the signal that is originally distributed to the high delay server 15 (see FIG. 3) is distributed to the low delay server 12 (see FIG. 3). .

図４の符号ｄおよび破線の枠囲みに示すように、１ラウンドあたりの肩代わり信号数Ｓが１の場合の振分例の効果は下記の通りである。
すなわち、高遅延サーバ１４〜１７に振り分けられる信号の１つを低遅延サーバ１１〜１３へ振り分けることで、低遅延サーバ１１〜１３の負荷は４／３倍になった。また、１ラウンドにおける平均信号処理ＴＡＴ変動率Ｂは、前記式（２）にＮｍ「３」とＮｎ「４」を代入することで、（４＋Ｄ×３）／（３＋Ｄ×４）倍となることが分かる。高遅延サーバ１４〜１７に振り分けられていた信号が低遅延サーバ１１〜１３により多く振り分けられるので、本負荷分散動作を行わない場合と比べて、迅速に取得が可能であり、よりタイムリーな情報で負荷分散処理が可能となる。 As shown in the symbol d in FIG. 4 and the dashed box, the effect of the distribution example when the shoulder substitution signal number S per round is 1 is as follows.
That is, by distributing one of the signals distributed to the high delay servers 14 to 17 to the low delay servers 11 to 13, the load on the low delay servers 11 to 13 is increased by 4/3. Further, the average signal processing TAT fluctuation rate B in one round is (4 + D × 3) / (3 + D × 4) times by substituting Nm “3” and Nn “4” into the equation (2). I understand. Since signals distributed to the high-delay servers 14 to 17 are distributed more to the low-delay servers 11 to 13, the signals can be acquired more quickly than when the load distribution operation is not performed, and more timely information can be obtained. With load distribution processing becomes possible.

次に、上記（方法２）の具体的動作例について説明する。
［実施例２：（方法２）の動作例］
上記（方法２）は、処理負荷とＴＡＴから算出されるスコアを最大にする手法を提供する。
図５は、負荷分散装置の（方法２）の負荷分散のシーケンス例を説明する図である。図１と同一構成部には同一符号を付している。図６は、図５のシーケンス例を説明するフローチャートである。
図５のステップＳ１０１において、負荷分散装置１００は、各処理サーバ１１〜１７のＣＰＵ閾値、応答時間閾値（高遅延サーバか低遅延サーバかを判定するための閾値）を設定する。例えば、負荷分散装置１００は、各種設定値（ＣＰＵ閾値＝８０、応答時間閾値＝１０ｍｓ）を設定する。 Next, a specific operation example of the above (Method 2) will be described.
[Example 2: Operation example of (Method 2)]
The above (Method 2) provides a method for maximizing the score calculated from the processing load and TAT.
FIG. 5 is a diagram for explaining an example of a load distribution sequence in the (method 2) of the load distribution apparatus. The same components as those in FIG. 1 are denoted by the same reference numerals. FIG. 6 is a flowchart illustrating the sequence example of FIG.
5, the load distribution apparatus 100 sets the CPU threshold value and response time threshold value (threshold value for determining whether the server is a high delay server or a low delay server) for each of the processing servers 11 to 17. For example, the load distribution apparatus 100 sets various setting values (CPU threshold = 80, response time threshold = 10 ms).

ステップＳ１０２において、負荷分散装置１００は、処理時間測定手段として、定期的に全処理サーバ１１〜１７へテスト信号（ｐｉｎｇなど）を送信し、各処理サーバ１１〜１７の応答時間（ＴＡＴ）を取得（測定）する。図６の符号ｅに示すように、処理サーバ１１の応答時間（ＴＡＴ）「１．５ｍｓ」、処理サーバ１２の応答時間（ＴＡＴ）「２．１ｍｓ」、処理サーバ１３の応答時間（ＴＡＴ）「１．１ｍｓ」、処理サーバ１４の応答時間（ＴＡＴ）「１０．１ｍｓ」、処理サーバ１５の応答時間（ＴＡＴ）「１１．５ｍｓ」、処理サーバ１６の応答時間（ＴＡＴ）「２３．５ｍｓ」、処理サーバ１７の応答時間（ＴＡＴ）「１０．７ｍｓ」を取得する。 In step S102, the load distribution apparatus 100 periodically transmits test signals (such as ping) to all the processing servers 11 to 17 as processing time measuring means, and obtains response times (TAT) of the processing servers 11 to 17. (taking measurement. 6, the response time (TAT) “1.5 ms” of the processing server 11, the response time (TAT) “2.1 ms” of the processing server 12, and the response time (TAT) “of the processing server 13. 1.1 ms ", processing server 14 response time (TAT)" 10.1 ms ", processing server 15 response time (TAT)" 11.5 ms ", processing server 16 response time (TAT)" 23.5 ms ", The response time (TAT) “10.7 ms” of the processing server 17 is acquired.

ステップＳ１０３において、負荷分散装置１００は、分類手段として、取得した各処理サーバ１１〜１７の応答時間（ＴＡＴ）を応答時間閾値と比較して、各処理サーバ１１〜１７を高遅延サーバ１４〜１７と低遅延サーバ１１〜１３とに分類する。例えば、図６に示すように、応答時間閾値を１０ｍｓに設定した場合、自拠点の処理サーバ１１〜１７のうち、処理サーバ１４〜１７を高遅延サーバ、処理サーバ１１〜１３を低遅延サーバに分類する。 In step S103, the load distribution apparatus 100 compares the acquired response times (TAT) of the processing servers 11 to 17 with response time thresholds as classification means, and sets the processing servers 11 to 17 as high-delay servers 14 to 17. And low-latency servers 11-13. For example, as shown in FIG. 6, when the response time threshold is set to 10 ms, among the processing servers 11 to 17 at the local site, the processing servers 14 to 17 are high delay servers, and the processing servers 11 to 13 are low delay servers. Classify.

ステップＳ１０４において、負荷分散装置１００のサーバ情報取得部１２０は、定期的に低遅延サーバ１１〜１３（図６参照）の負荷情報（ＣＰＵ使用率）を取得する。図６の例では、サーバ情報取得部１２０は、低遅延サーバである処理サーバ１１のＣＰＵ使用率「３５」、処理サーバ１２のＣＰＵ使用率「３４」、処理サーバ１３のＣＰＵ使用率「３４」を取得する。 In step S104, the server information acquisition unit 120 of the load distribution apparatus 100 periodically acquires load information (CPU usage rate) of the low-latency servers 11 to 13 (see FIG. 6). In the example of FIG. 6, the server information acquisition unit 120 has a CPU usage rate “35” of the processing server 11 that is a low-latency server, a CPU usage rate “34” of the processing server 12, and a CPU usage rate “34” of the processing server 13. To get.

ステップＳ１０５において、負荷分散装置１００は、取得した情報を情報格納部１３０に格納し、振分機能部１１０に通知する。 In step S <b> 105, the load distribution apparatus 100 stores the acquired information in the information storage unit 130 and notifies the distribution function unit 110.

ステップＳ１０６において、振分機能部１１０の肩代わり信号数算出部１１１は、情報格納部１３０から情報を取り出し、低遅延サーバ数と高遅延サーバ数と肩代わり信号数Ｓとを用いて処理サーバ負荷変動数Ａと平均信号処理ＴＡＴ変動率ＢとスコアＣを算出し、スコアＣが最大となるときの肩代わり信号数Ｓの値を決定する。具体的には、肩代わり信号数算出部１１１は、高遅延サーバ台数、低遅延サーバ台数、高遅延サーバの応答時間の平均、低遅延サーバの応答時間の平均に基づいて、肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する（図７参照）。上記スコアＣは、肩代わり信号数Ｓを評価する評価値であり、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂから算出される減少関数（ＡまたはＢが減少すると上昇する）で表される。 In step S106, the substitution signal number calculation unit 111 of the distribution function unit 110 extracts information from the information storage unit 130, and uses the number of low delay servers, the number of high delay servers, and the number of substitution signals S to calculate the number of processing server load fluctuations. A, average signal processing TAT fluctuation rate B, and score C are calculated, and the value of the substitution signal number S when the score C is maximized is determined. Specifically, the substitution signal number calculation unit 111 calculates the number of substitution signals for each substitution signal S based on the number of high delay servers, the number of low delay servers, the average response time of high delay servers, and the average response time of low delay servers. The load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C of the low-delay server are calculated (see FIG. 7). The score C is an evaluation value for evaluating the number S of substitute signals, and is a decreasing function calculated from the load fluctuation rate A and the average signal processing TAT fluctuation rate B of the low delay server (increases when A or B decreases). expressed.

ステップＳ１０７において、負荷分散装置１００の振分機能部１１０は、取得した負荷情報の最大値と低遅延サーバの負荷変動率Ａとを乗算し、ＣＰＵ閾値を超えない肩代わり信号数Ｓの範囲を算出し、算出した肩代わり信号数Ｓの範囲の中で最大のスコアＣとなる肩代わり信号数Ｓの値で、本来高遅延サーバに振り分けられる信号を低遅延サーバに振り分ける。すなわち、振分機能部１１０は、最大肩代わり信号数Ｓを超えない範囲で最も高いスコアＣとなる最大肩代わり信号数Ｓの分だけ、本来高遅延サーバ１１〜１４に振り分けられる信号を低遅延サーバ１１〜１３に振り分ける。 In step S107, the distribution function unit 110 of the load distribution apparatus 100 multiplies the acquired maximum value of the load information and the load fluctuation rate A of the low-delay server, and calculates the range of the substitution signal number S that does not exceed the CPU threshold value. Then, with the value of the substitution signal number S that gives the maximum score C in the calculated substitution signal number S range, the signal that is originally distributed to the high delay server is distributed to the low delay server. That is, the distribution function unit 110 transmits signals that are originally distributed to the high delay servers 11 to 14 by the amount corresponding to the maximum shoulder signal number S that has the highest score C within a range not exceeding the maximum shoulder signal number S. Sort to ~ 13.

次に、図７を参照して低遅延サーバへの信号の振り分け例について説明する。
図７は、肩代わり信号数算出部１１１により算出された肩代わり信号数Ｓ、低遅延サーバの負荷変動率Ａ、平均信号処理ＴＡＴ変動率Ｂ、およびスコアＣの算出例を示す図である。なお、スコアＣは、減少関数（１／Ａ＋１／Ｂ）を用いている。
低遅延サーバの負荷変動率Ａは、前記式（１）にＮｍ「３」を代入することによって算出され、また平均信号処理ＴＡＴ変動率Ｂは、前記式（２）にＮｍ「３」とＮｎ「４」を代入することで算出される。例えば、肩代わり信号数Ｓ「１」の場合には、Ｎｍ「３」とＳ「１」を前記式（１）に導入して低遅延サーバの負荷変動率Ａ「１．３３」が算出され、またＮｍ「３」とＮｎ「４」とＳ「１」を前記式（２）に導入して平均信号処理ＴＡＴ変動率Ｂ「０．８０」が算出される。また、Ｓ「１」の場合のスコアは、減少関数（１／Ａ＋１／Ｂ）に、Ａ「１．３３」とＢ「０．８０」を代入することで、スコア「２．０１」が算出される。同様にして、Ｓ「２」の場合には、低遅延サーバの負荷変動率Ａ「１．６７」、平均信号処理ＴＡＴ変動率Ｂ「０．５９」、スコア「２．２９」が算出され、Ｓ「３」の場合には、低遅延サーバの負荷変動率Ａ「２」、平均信号処理ＴＡＴ変動率Ｂ「０．３９」、スコア「３．０９」が算出され、Ｓ「４」の場合には、低遅延サーバの負荷変動率Ａ「２．３３」、平均信号処理ＴＡＴ変動率Ｂ「０．１８」、スコア「５．９５」が算出される。 Next, an example of signal distribution to the low-latency server will be described with reference to FIG.
FIG. 7 is a diagram illustrating a calculation example of the shoulder signal number S calculated by the shoulder signal number calculation unit 111, the load variation rate A, the average signal processing TAT variation rate B, and the score C of the low-delay server. The score C uses a decreasing function (1 / A + 1 / B).
The load fluctuation rate A of the low delay server is calculated by substituting Nm “3” into the equation (1), and the average signal processing TAT fluctuation rate B is calculated as Nm “3” and Nn in the equation (2). Calculated by substituting “4”. For example, when the number of substitution signals S is “1”, Nm “3” and S “1” are introduced into the equation (1) to calculate the load fluctuation rate A “1.33” of the low-delay server, Also, Nm “3”, Nn “4”, and S “1” are introduced into the equation (2) to calculate the average signal processing TAT fluctuation rate B “0.80”. The score for S “1” is calculated as “2.01” by substituting A “1.33” and B “0.80” into the decreasing function (1 / A + 1 / B). Is done. Similarly, in the case of S “2”, the load fluctuation rate A “1.67”, the average signal processing TAT fluctuation rate B “0.59”, and the score “2.29” of the low delay server are calculated. In the case of S “3”, the load fluctuation rate A “2”, the average signal processing TAT fluctuation rate B “0.39”, and the score “3.09” of the low delay server are calculated, and in the case of S “4” , The load fluctuation rate A “2.33”, the average signal processing TAT fluctuation rate B “0.18”, and the score “5.95” of the low delay server are calculated.

本例の場合、前記図６に示すように、取得された低遅延サーバ１１〜１３のＣＰＵ使用率（負荷情報）は、処理サーバ１１のＣＰＵ使用率「３５」、処理サーバ１２のＣＰＵ使用率「３４」、処理サーバ１３のＣＰＵ使用率「３４」であり、ＣＰＵ使用率の最大値は、処理サーバ１１の「３５」である。また、前記図２に示すように、情報格納部１３０には、拠点情報１５０として、低遅延サーバのＣＰＵ限界値「８０」、拠点間遅延「１０００」、低遅延サーバ数「３」、高遅延サーバ数「４」が格納されている。さらに、肩代わり信号数算出部１１１は、図７に示す肩代わり信号数Ｓ、低遅延サーバの負荷変動率Ａ、平均信号処理ＴＡＴ変動率Ｂ、およびスコアＣを算出している。 In the case of this example, as shown in FIG. 6, the CPU usage rate (load information) of the acquired low-latency servers 11 to 13 is the CPU usage rate “35” of the processing server 11 and the CPU usage rate of the processing server 12. “34” is the CPU usage rate “34” of the processing server 13, and the maximum value of the CPU usage rate is “35” of the processing server 11. As shown in FIG. 2, the information storage unit 130 stores, as the base information 150, the CPU limit value “80” of the low delay server, the inter-base delay “1000”, the number of low delay servers “3”, and the high delay. The number of servers “4” is stored. Furthermore, the shoulder signal number calculation unit 111 calculates the shoulder signal number S, the load fluctuation rate A of the low delay server, the average signal processing TAT fluctuation rate B, and the score C shown in FIG.

この場合、図７の符号ｇに示すように、取得したＣＰＵ使用率の最大値「３５」を２倍してもＣＰＵ限界値「８０」を超えないため、肩代わり信号数Ｓ「３」が最大肩代わり信号数の範囲となる。振分機能部１１０は、この最大肩代わり信号数の範囲の３信号を低遅延サーバ１１〜１３へ振り分けることが可能である。ちなみに、この最大肩代わり信号数Ｓをそのまま肩代わり信号数Ｓとして決定する方法が前記（方法１）に対応する。 In this case, as shown by the symbol g in FIG. 7, the CPU limit value “80” is not exceeded even if the obtained maximum value “35” of the CPU usage rate is doubled. This is the range of the number of substitution signals. The distribution function unit 110 can distribute the three signals in the range of the maximum number of substitution signals to the low delay servers 11 to 13. Incidentally, a method of determining the maximum shoulder signal number S as it is as the shoulder signal number S corresponds to the above (Method 1).

また、図７の符号ｈに示すように、最大肩代わり信号数Ｓを超えない範囲で最も高いスコアＣは肩代わり信号数Ｓが「３」のため、肩代わり信号数Ｓ「３」の範囲の中で最も高いスコアであるスコア「３．０９」が選ばれ、そして該スコア「３．０９」が示すＳ「３」が決定される。この例では、振分機能部１１０は、スコア「３．０９」から肩代わり信号数Ｓ「３」を決定し、３信号を低遅延サーバ１１〜１３に振り分ける。 Further, as shown by the symbol h in FIG. 7, the highest score C in the range not exceeding the maximum shoulder signal number S is “3” because the shoulder signal number S is within the range of the shoulder signal number S “3”. The highest score “3.09” is selected, and S “3” indicated by the score “3.09” is determined. In this example, the distribution function unit 110 determines the number of substitution signals S “3” from the score “3.09”, and distributes the three signals to the low-delay servers 11 to 13.

図７の例では、低遅延サーバの負荷変動率Ａのみを用いて、肩代わり信号数Ｓを決定する方法（方法１）によるＳ「３」と（図７の符号ｇ参照）、最大肩代わり信号数の範囲の中で最大のスコアとなる肩代わり信号数Ｓを決定する方法（方法２）によるＳ「３」と（図７の符号ｈ参照）、が同じ結果となっているが、決定される肩代わり信号数Ｓが両者で異なる場合もあり得る。すなわち、スコアの値によっては最大肩代わり信号数の範囲の中から、スコアの値がより高い別の肩代わり信号数Ｓが決定されることは有り得る。両者による肩代わり信号数Ｓが異なった場合、（方法２）による肩代わり信号数Ｓの決定を優先させる。
このように、（方法２）では、（方法１）で得られた最大肩代わり信号数の範囲から、最も評価の高い肩代わり信号数Ｓを決定するので、より信頼度の高い肩代わり信号数Ｓを得ることができる。その結果、より最適な肩代わり信号数Ｓ分だけ高遅延サーバから低遅延サーバに振り分けることができるので、負荷分散のさらなる最適化を図ることができ、より一層処理負荷を制限しつつ、応答時間をできるだけ早めることができる。 In the example of FIG. 7, S “3” according to the method (method 1) of determining the shoulder signal number S using only the load fluctuation rate A of the low-delay server (see symbol g in FIG. 7), the maximum shoulder signal number. S "3" by the method (method 2) for determining the number S of substitution signals that gives the maximum score in the range of (2) (see symbol h in FIG. 7) has the same result, but the substitution to be decided The number S of signals may be different between the two. That is, depending on the score value, another shoulder signal number S having a higher score value may be determined from the range of the maximum shoulder signal number. When the shoulder signal number S differs between the two, priority is given to the determination of the shoulder signal number S by (Method 2).
In this way, in (Method 2), since the highest substitution signal number S is determined from the range of the maximum substitution signal number obtained in (Method 1), a more reliable substitution signal number S is obtained. be able to. As a result, since it is possible to distribute from the high-delay server to the low-delay server by the optimal number of substitution signals S, it is possible to further optimize the load distribution and further reduce the response time while limiting the processing load. It can be accelerated as much as possible.

以上説明したように、本実施形態の負荷分散装置１００は、各処理サーバの処理ＴＡＴを測定する処理時間測定手段と、測定した処理ＴＡＴに基づいて、処理サーバを高遅延サーバと低遅延サーバとに分類する分類手段と、を備える。さらに、負荷分散装置１００は、低遅延サーバの処理の限界値である負荷制限値を記憶する情報格納部１３０と、低遅延サーバの負荷情報を取得するサーバ情報取得部１２０と、高遅延サーバの台数と、低遅延サーバの台数と、取得した負荷情報の最大値と低遅延サーバの負荷変動を示す処理サーバの負荷変動率Ａとに基づいて、負荷制限値を超えない範囲で、高遅延サーバに振り分けるべき信号を低遅延サーバに振り分ける肩代わり信号数Ｓを算出する肩代わり信号数算出部１１１と、算出した肩代わり信号数Ｓ分を、高遅延サーバから低遅延サーバに振り分ける振分機能部１１０と、を備える。 As described above, the load distribution apparatus 100 according to this embodiment includes the processing time measuring unit that measures the processing TAT of each processing server, and the processing servers are classified into the high delay server and the low delay server based on the measured processing TAT. And classifying means for classifying the data. Furthermore, the load distribution apparatus 100 includes an information storage unit 130 that stores a load limit value that is a limit value of processing of the low-delay server, a server information acquisition unit 120 that acquires load information of the low-delay server, Based on the number of servers, the number of low-latency servers, the maximum value of the acquired load information, and the load fluctuation rate A of the processing server that indicates the load fluctuations of the low-latency servers, A substitution signal number calculation unit 111 that calculates a substitution signal number S that distributes a signal to be distributed to a low delay server, a distribution function unit 110 that distributes the calculated substitution signal number S from a high delay server to a low delay server, Is provided.

そして、負荷分散装置１００の負荷分散方法では、各処理サーバの処理ＴＡＴを測定する処理時間測定ステップと、測定した処理ＴＡＴに基づいて、処理サーバを高遅延サーバと低遅延サーバとに分類する分類ステップと、低遅延サーバの処理の限界値である負荷制限値を記憶する記憶ステップと、低遅延サーバの負荷情報を取得する負荷情報取得ステップと、高遅延サーバの台数と、低遅延サーバの台数と、取得した負荷情報の最大値と低遅延サーバの負荷変動を示す処理サーバの負荷変動率Ａとに基づいて、負荷制限値を超えない範囲で、高遅延サーバに振り分けるべき信号を低遅延サーバに振り分ける肩代わり信号数Ｓを算出する信号数算出ステップと、算出した肩代わり信号数Ｓ分を、高遅延サーバから低遅延サーバに振り分ける振分ステップと、を実行する。 In the load distribution method of the load distribution apparatus 100, the processing time measurement step for measuring the processing TAT of each processing server, and the classification for classifying the processing servers into the high delay server and the low delay server based on the measured processing TAT. A step, a storage step for storing a load limit value that is a limit value of processing of the low delay server, a load information acquisition step for acquiring load information of the low delay server, the number of high delay servers, and the number of low delay servers And a signal to be distributed to the high-delay server based on the maximum value of the acquired load information and the load variation rate A of the processing server indicating the load variation of the low-delay server within a range not exceeding the load limit value A signal number calculating step for calculating the number of shoulder substitute signals S to be distributed to and a portion for distributing the calculated number of shoulder substitute signals S from the high delay server to the low delay server. And step, to run.

既存技術のロードバランサの負荷分散方法のデメリットについて述べる。
既存技術のロードバランサの負荷分散方法には、負荷分散性に着目した「低負荷サーバ優先」と、平均処理時間に着目した「低遅延サーバ優先」とがある。
（１）低負荷サーバ優先
各処理サーバの負荷には、ばらつき（あるサーバは１００％だが、他のサーバは０％などという状況）がある。負荷分散性は、各処理サーバの負荷を均等に分散する。低負荷サーバ優先は、ＣＰＵ負荷が一番小さい処理サーバへ優先して振り分ける。低負荷サーバ優先では、ＴＡＴが短い（低遅延）サーバの負荷が上がってくると、ＴＡＴが長い（高遅延）サーバへ振り分けてしまう。このため、低遅延サーバの処理能力を使い切ることができない。結果として、平均ＴＡＴが高くなる可能性がある。 The disadvantages of the load balancing method of the existing technology load balancer are described.
The load balancing method of the load balancer of the existing technology includes “low load server priority” focusing on load distribution and “low delay server priority” focusing on average processing time.
(1) Low-load server priority The load on each processing server varies (the situation is that some servers are 100% but other servers are 0%, etc.). Load distribution distributes the load of each processing server evenly. The low-load server priority is assigned with priority to the processing server with the smallest CPU load. In the low load server priority, when the load of a server with a short TAT (low delay) increases, the server is distributed to a server with a long TAT (high delay). For this reason, the processing capability of the low-latency server cannot be used up. As a result, the average TAT can be high.

（２）低遅延サーバ優先
低負荷サーバ優先では、ＴＡＴが一番短いサーバへ優先して振り分ける。低負荷サーバ優先は、サーバの負荷を考慮しないため、ＴＡＴが短いサーバの負荷が一時的に高くなる可能性がある。結果として、ＣＰＵ規制などの対象となってしまう可能性がある。 (2) Low-latency server priority In low-load server priority, priority is given to the server with the shortest TAT. Since the low load server priority does not consider the load on the server, the load on the server with a short TAT may temporarily increase. As a result, there is a possibility of being subject to CPU regulation.

本実施形態の負荷分散装置１００は、上記既存技術と比較して下記の特徴を有する。
本実施形態の負荷分散装置１００は、ＣＰＵ限界値を超えない範囲で、本来高遅延サーバ１４〜１７（図３参照）が処理すべき信号を低遅延サーバ１１〜１３に戻す振分を行っている。換言すれば、負荷分散装置１００は、低遅延サーバ１４〜１７への信号の振り分けを最適化するようにしているので、高遅延サーバ１４〜１７への信号の振分を、より減らすことで、処理遅延を防止しつつ、低遅延サーバ１１〜１３に過度の処理負荷をかけないようにしてハードウェアリソースの非効率化や不要なスケーリングを抑制することができる。その結果、処理サーバの処理負荷を一定以下に保ちつつ、要求信号の処理ＴＡＴを低減することができる。要求信号の処理ＴＡＴと処理サーバの処理負荷の両面を満足させることができる。 The load distribution apparatus 100 according to the present embodiment has the following characteristics as compared with the existing technology.
The load distribution apparatus 100 according to the present embodiment performs distribution to return signals to be processed by the high-delay servers 14 to 17 (see FIG. 3) to the low-delay servers 11 to 13 within a range not exceeding the CPU limit value. Yes. In other words, since the load distribution apparatus 100 optimizes the signal distribution to the low-delay servers 14 to 17, the signal distribution to the high-delay servers 14 to 17 is further reduced, While preventing processing delay, it is possible to suppress inefficiency of hardware resources and unnecessary scaling by not applying an excessive processing load to the low-latency servers 11 to 13. As a result, the request signal processing TAT can be reduced while keeping the processing load of the processing server below a certain level. Both the request signal processing TAT and the processing load of the processing server can be satisfied.

（第２の実施形態）
図８は、本発明の第２の実施形態に係る負荷分散装置が適用される分散システムを示す構成図である。図１と同一構成部には同一符号を付して重複箇所の説明を省略する。
図８に示すように、負荷分散装置２００は、肩代わり信号数算出部２１１を有する振分機能部２１０と、サーバ情報取得部１２０と、情報格納部１３０と、を備える。 (Second Embodiment)
FIG. 8 is a configuration diagram showing a distributed system to which the load distribution apparatus according to the second embodiment of the present invention is applied. The same components as those in FIG. 1 are denoted by the same reference numerals, and description of overlapping portions is omitted.
As illustrated in FIG. 8, the load distribution apparatus 200 includes a distribution function unit 210 having a substitution signal number calculation unit 211, a server information acquisition unit 120, and an information storage unit 130.

負荷分散装置２００は、処理時間測定手段として、各処理サーバの処理時間を測定する。また、分類手段として、測定した処理時間に基づいて、処理サーバを高遅延サーバと低遅延サーバとに分類する。本実施形態では、負荷分散装置２００は、各処理サーバを処理ＴＡＴの小さい順に低遅延サーバとして１台ずつ、全台数−１まで増加させていき、それぞれの場合の低遅延サーバの台数と高遅延サーバの台数とを決定する。 The load distribution apparatus 200 measures the processing time of each processing server as processing time measurement means. Further, as the classifying means, the processing server is classified into a high delay server and a low delay server based on the measured processing time. In the present embodiment, the load distribution apparatus 200 increases each processing server as a low delay server in order of increasing processing TAT to the total number −1, and the number of low delay servers and the high delay in each case. Determine the number of servers.

肩代わり信号数算出部２１１は、決定された低遅延サーバの台数と当該低遅延サーバの台数と、低遅延サーバで処理した場合の処理ＴＡＴと、高遅延サーバで処理した場合の処理ＴＡＴとに基づいて、リクエストをあらかじめ決めた順序で各処理サーバに振り分ける場合の処理応答時間の変動を示す平均信号処理ＴＡＴ変動率Ｂを算出するとともに、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂとを基に、肩代わり信号数Ｓを評価するスコアＣを算出する。
振分機能部２１０は、取得した負荷情報の最大値と低遅延サーバの負荷変動率Ａを乗算し、ＣＰＵ閾値を超えない肩代わり信号数Ｓの範囲を算出し、算出した肩代わり信号数Ｓの範囲の中で最大のスコアＣとなる肩代わり信号数Ｓの値で振り分けを行う。 The substitution signal number calculation unit 211 is based on the determined number of low-delay servers, the number of the low-delay servers, a process TAT when processed by the low-delay server, and a process TAT when processed by the high-delay server. The average signal processing TAT fluctuation rate B indicating the fluctuation of the processing response time when the request is distributed to each processing server in a predetermined order, and the load fluctuation rate A of the low delay server and the average signal processing TAT fluctuation rate Based on B, a score C for evaluating the shoulder signal number S is calculated.
The distribution function unit 210 multiplies the maximum value of the acquired load information by the load variation rate A of the low-delay server, calculates the range of the substitution signal number S that does not exceed the CPU threshold, and calculates the calculated substitution signal number S range The distribution is performed by the value of the shoulder signal number S that gives the maximum score C.

以下、上述のように構成された負荷分散装置２００の負荷分散方法について説明する。
本実施形態は、処理サーバの処理遅延や処理負荷、台数を考慮しながら、できるだけ通信遅延が小さい処理サーバへ信号を振り分ける。
負荷分散装置２００は、各処理サーバの処理時間（ＴＡＴ）と負荷を測定し、処理サーバをＴＡＴの小さい順にソートする。
そして、負荷分散装置２００は、ＴＡＴの小さい順に低遅延サーバ数をまず１つ決め、他を高遅延サーバ数（全台数Ｎ−１）として、肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣを算出する。次いで、ＴＡＴの小さい順に低遅延サーバ数を２つ決め、他を高遅延サーバ数（Ｎ−２）として、肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣを算出する。以下同様に、低遅延サーバ数をＴＡＴの小さい順に１，２，３，…と増やしていき、全ての低遅延サーバ数と高遅延サーバ数との組合せについて肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣを算出する。 Hereinafter, a load distribution method of the load distribution apparatus 200 configured as described above will be described.
In the present embodiment, a signal is distributed to a processing server having a communication delay as small as possible while considering the processing delay, processing load, and number of processing servers.
The load distribution apparatus 200 measures the processing time (TAT) and load of each processing server, and sorts the processing servers in ascending order of TAT.
Then, the load distribution apparatus 200 first determines the number of low-delay servers in order of increasing TAT, and sets the other as the number of high-delay servers (total number N-1), and the load fluctuation rate of the low-delay server for each signal number S A, average signal processing TAT fluctuation rate B, and score C are calculated. Next, the number of low-delay servers is determined in order of increasing TAT, the other is the number of high-delay servers (N-2), and the load variation rate A and average signal processing TAT variation rate B of the low-delay server for each substitution signal number S , Score C is calculated. Similarly, the number of low-delay servers is increased to 1, 2, 3,... In order of increasing TAT, and the combinations of low-delay servers for every combination of low-delay servers and high-delay servers are assumed for each low-delay server number S. A load fluctuation rate A, an average signal processing TAT fluctuation rate B, and a score C are calculated.

そして、負荷分散装置２００は、全ての低遅延サーバ数と高遅延サーバ数との組合せの中から、低遅延サーバの負荷変動率Ａと負荷情報（ＣＰＵ使用率）の乗算値がＣＰＵ閾値を超えない範囲で最大のスコアＣとなる肩代わり信号数Ｓの値で振り分けを行う。
振分機能部１１０は、取得した負荷情報の最大値と低遅延サーバの負荷変動率Ａとを乗算し、ＣＰＵ閾値を超えない肩代わり信号数Ｓの範囲を算出し、算出した肩代わり信号数Ｓの範囲の中で最大のスコアＣとなる肩代わり信号数Ｓの値で振り分けを行う。 Then, the load balancer 200 determines that the multiplication value of the load variation rate A and the load information (CPU usage rate) of the low-latency server exceeds the CPU threshold value among all combinations of the low-latency server count and the high-latency server count. The distribution is performed by the value of the shoulder signal number S that gives the maximum score C in a non-existing range.
The allocating function unit 110 multiplies the acquired maximum value of the load information by the load variation rate A of the low-delay server, calculates the range of the substitution signal number S that does not exceed the CPU threshold, and calculates the calculated substitution signal number S. The distribution is performed by the value of the shoulder signal number S that gives the maximum score C in the range.

図９は、負荷分散装置２００の負荷分散方法のシーケンス例を説明するフローチャートである。図１０は、図９のシーケンス例を説明する図である。
図９のステップＳ２０１において、負荷分散装置２００は、各処理サーバ１１〜１７のＣＰＵ閾値を設定する。例えば、負荷分散装置２００は、ＣＰＵ閾値＝８０を設定する。 FIG. 9 is a flowchart for explaining a sequence example of the load distribution method of the load distribution apparatus 200. FIG. 10 is a diagram illustrating the sequence example of FIG.
In step S201 of FIG. 9, the load distribution apparatus 200 sets the CPU threshold value of each of the processing servers 11-17. For example, the load balancer 200 sets CPU threshold = 80.

図９のステップＳ２０２において、負荷分散装置２００は、定期的に全処理サーバ１１〜１７へテスト信号（ｐｉｎｇなど）を送信し、各処理サーバ１１〜１７の応答時間（ＴＡＴ）を取得（測定）する。図１０の符号ｉに示すように、処理サーバ１１の応答時間（ＴＡＴ）「１．１ｍｓ」、処理サーバ１２の応答時間（ＴＡＴ）「１．５ｍｓ」、処理サーバ１３の応答時間（ＴＡＴ）「２．１ｍｓ」、処理サーバ１４の応答時間（ＴＡＴ）「１０．１ｍｓ」、処理サーバ１５の応答時間（ＴＡＴ）「１１．５ｍｓ」、処理サーバ１６の応答時間（ＴＡＴ）「１３．５ｍｓ」、処理サーバ１７の応答時間（ＴＡＴ）「１１８．７ｍｓ」を取得する。 In step S202 of FIG. 9, the load distribution apparatus 200 periodically transmits test signals (such as ping) to all the processing servers 11 to 17, and acquires (measures) the response times (TAT) of the processing servers 11 to 17. To do. As shown by the symbol i in FIG. 10, the response time (TAT) “1.1 ms” of the processing server 11, the response time (TAT) “1.5 ms” of the processing server 12, and the response time (TAT) “of the processing server 13. 2.1 ms ”, response time (TAT)“ 10.1 ms ”of the processing server 14, response time (TAT)“ 11.5 ms ”of the processing server 15, response time (TAT)“ 13.5 ms ”of the processing server 16, The response time (TAT) “118.7 ms” of the processing server 17 is acquired.

ステップＳ２０３において、負荷分散装置２００のサーバ情報取得部１２０は、定期的に全処理サーバ１１〜１７の負荷情報（ＣＰＵ使用率）を取得する。図１０の符号ｊに示すように、サーバ情報取得部１２０は、処理サーバ１１のＣＰＵ使用率「３５」、処理サーバ１２のＣＰＵ使用率「３４」、処理サーバ１３のＣＰＵ使用率「３８」、処理サーバ１４のＣＰＵ使用率「４５」、処理サーバ１５のＣＰＵ使用率「４４」、処理サーバ１６のＣＰＵ使用率「３１」、処理サーバ１７のＣＰＵ使用率「２４」を取得する。 In step S <b> 203, the server information acquisition unit 120 of the load distribution apparatus 200 periodically acquires load information (CPU usage rate) of all the processing servers 11 to 17. As shown by a symbol j in FIG. 10, the server information acquisition unit 120 includes a CPU usage rate “35” of the processing server 11, a CPU usage rate “34” of the processing server 12, a CPU usage rate “38” of the processing server 13, The CPU usage rate “45” of the processing server 14, the CPU usage rate “44” of the processing server 15, the CPU usage rate “31” of the processing server 16, and the CPU usage rate “24” of the processing server 17 are acquired.

ステップＳ２０４において、振分機能部２１０の肩代わり信号数算出部２１１は、応答時間（ＴＡＴ）が少ない順に低遅延サーバを１台，２台，…，(全台数−１)台と増加させていき、それぞれの場合の肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する（後記図１１参照）。
図１０の例では、各処理サーバ１１〜１７の応答時間（ＴＡＴ）の測定結果を見ると、応答時間（ＴＡＴ）が少ない順に、処理サーバ１１の応答時間（ＴＡＴ）「１．１ｍｓ」、処理サーバ１２の応答時間（ＴＡＴ）「１．５ｍｓ」、処理サーバ１３の応答時間（ＴＡＴ）「２．１ｍｓ」、…となっている。したがって、まず、処理サーバ１１〜１７のうち、処理サーバ１１の１台を低遅延サーバとし、他の処理サーバ１２〜１７の６台を高遅延サーバとして、上記肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する。次に、処理サーバ１１と処理サーバ１２の２台を低遅延サーバとし、他の処理サーバ１３〜１７の５台を高遅延サーバとして、上記肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する。 In step S204, the substitution signal number calculation unit 211 of the distribution function unit 210 increases the number of low-delay servers to one, two,..., (Total number-1) in ascending order of response time (TAT). Then, the load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C of the low-delay server are calculated for each substitution signal number S in each case (see FIG. 11 described later).
In the example of FIG. 10, when the measurement results of the response times (TAT) of the processing servers 11 to 17 are viewed, the response time (TAT) of the processing server 11 is “1.1 ms” in ascending order of the response time (TAT). The response time (TAT) of the server 12 is “1.5 ms”, the response time (TAT) of the processing server 13 is “2.1 ms”, and so on. Therefore, first, among the processing servers 11 to 17, one of the processing servers 11 is a low-latency server, and six of the other processing servers 12 to 17 are high-latency servers. Load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C are calculated. Next, two processing servers 11 and 12 are low-delay servers, and five other processing servers 13 to 17 are high-delay servers. , Average signal processing TAT fluctuation rate B and score C are calculated.

以下、同様に、応答時間（ＴＡＴ）が少ない順に低遅延サーバを増加させていき、(全台数−１)すなわち処理サーバ１１〜１６の６台を低遅延サーバとし、他の処理サーバ１７の１台を高遅延サーバとして、上記肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する。以上により、応答時間（ＴＡＴ）が少ない順に、低遅延サーバの台数を変えた場合のすべての組み合わせについて、上記肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する。 In the same manner, the low-delay servers are increased in ascending order of response time (TAT), (total number-1), that is, six processing servers 11 to 16 are set as low-delay servers, and one of the other processing servers 17 The value of the load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C of the low delay server for each substitution signal number S is calculated using the platform as a high delay server. As described above, the load variation rate A and the average signal processing TAT variation rate B of the low-delay server for each substitution signal number S for all combinations when the number of low-delay servers is changed in the order of decreasing response time (TAT). , The value of score C is calculated.

ステップＳ２０５において、負荷分散装置２００の振分機能部２１０は、上記スコアＣが最大となるときの肩代わり信号数Ｓの値で振り分けを行う。すなわち、振分機能部２１０は、上記スコアＣが最大となるときの肩代わり信号数Ｓの分だけ、本来高遅延サーバに振り分けられる信号を低遅延サーバに振り分ける。 In step S205, the distribution function unit 210 of the load distribution apparatus 200 performs distribution according to the value of the substitution signal number S when the score C is maximized. That is, the distribution function unit 210 distributes the signal that is originally distributed to the high-delay server to the low-delay server by the number of substitution signals S when the score C is maximized.

次に、図１１を参照して低遅延サーバへの信号の振り分け例について説明する。
図１１は、肩代わり信号数算出部２１１により算出された肩代わり信号数Ｓ、低遅延サーバの負荷変動率Ａ、平均信号処理ＴＡＴ変動率Ｂ、およびスコアＣの算出例を示す図である。なお、スコアＣは、減少関数（１／Ａ＋１／Ｂ）を用いている。
図１１の符号ｋに示すように、低遅延サーバを増加させていき、それぞれの場合で、肩代わり信号数Ｓごとに低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値を算出する。図１１の例では、低遅延サーバの台数「１」かつ高遅延サーバの台数「６」の場合において、肩代わり信号数Ｓ「１」のとき、Ｎｍ「１」とＳ「０」を前記式（１）に導入して低遅延サーバの負荷変動率Ａ「１」が算出され、またＮｍ「１」とＮｎ「６」とＳ「０」を前記式（２）に導入して平均信号処理ＴＡＴ変動率Ｂ「１」が算出される。また、Ｓ「０」の場合のスコアは、減少関数（１／Ａ＋１／Ｂ）に、Ａ「１」とＢ「１」を代入することで、スコア「２」が算出される。同様にして、Ｓ「１」の場合には、低遅延サーバの負荷変動率Ａ「２」、平均信号処理ＴＡＴ変動率Ｂ「０．８６」、スコア「１．６７」が算出され、Ｓ「２」の場合には、低遅延サーバの負荷変動率Ａ「３」、平均信号処理ＴＡＴ変動率Ｂ「０．７１」、スコア「１．７４」が算出される。
そして、図１１の符号ｌに示すように、ＣＰＵ閾値を超えない肩代わり信号数Ｓの範囲を算出し、その中でスコアＣが最大となる肩代わり信号数Ｓで振り分けを行う。 Next, an example of signal distribution to the low-latency server will be described with reference to FIG.
FIG. 11 is a diagram illustrating a calculation example of the shoulder signal number S calculated by the shoulder signal number calculation unit 211, the load variation rate A, the average signal processing TAT variation rate B, and the score C of the low-delay server. The score C uses a decreasing function (1 / A + 1 / B).
As shown by the symbol k in FIG. 11, the number of low-delay servers is increased, and in each case, the load variation rate A, average signal processing TAT variation rate B, and score C of the low-delay server for each substitution signal number S. Calculate the value. In the example of FIG. 11, when the number of low-delay servers is “1” and the number of high-delay servers is “6”, Nm “1” and S “0” are expressed by the above formula ( 1), the load fluctuation rate A “1” of the low-delay server is calculated, and Nm “1”, Nn “6”, and S “0” are introduced into the equation (2) to calculate the average signal processing TAT. The fluctuation rate B “1” is calculated. Also, the score “2” is calculated by substituting A “1” and B “1” into the decreasing function (1 / A + 1 / B) in the case of S “0”. Similarly, in the case of S “1”, the load fluctuation rate A “2”, the average signal processing TAT fluctuation rate B “0.86”, and the score “1.67” of the low delay server are calculated. In the case of “2”, the load fluctuation rate A “3”, average signal processing TAT fluctuation rate B “0.71”, and score “1.74” of the low delay server are calculated.
Then, as indicated by reference numeral 1 in FIG. 11, the range of the substitution signal number S that does not exceed the CPU threshold is calculated, and distribution is performed based on the substitution signal number S in which the score C is maximum.

次に、図１２を参照して低遅延サーバへの信号の振り分け例についてより詳細に説明する。
図１２は、肩代わり信号数算出部２１１により算出された肩代わり信号数Ｓ、低遅延サーバの負荷変動率Ａ、平均信号処理ＴＡＴ変動率ＢおよびスコアＣの算出、ならびにスコアＣを用いた肩代わり信号数Ｓによる振り分け例を示す図である。なお、スコアＣは、減少関数（１／Ａ＋１／Ｂ）を用いている。肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値の算出方法については、図１１により説明した。 Next, an example of signal distribution to the low-latency server will be described in more detail with reference to FIG.
FIG. 12 illustrates the calculation of the shoulder signal number S calculated by the shoulder signal number calculation unit 211, the load fluctuation rate A of the low delay server, the average signal processing TAT fluctuation rate B and the score C, and the number of shoulder signals using the score C. It is a figure which shows the example of distribution by S. The score C uses a decreasing function (1 / A + 1 / B). The method of calculating the load fluctuation rate A, average signal processing TAT fluctuation rate B, and score C of the low-delay server for each substitution signal number S has been described with reference to FIG.

図１２に示すように、低遅延サーバの台数と高遅延サーバの台数との組み合わせにおいて、それぞれの組み合わせにおける肩代わり信号数Ｓごとの低遅延サーバの負荷変動率Ａ，平均信号処理ＴＡＴ変動率Ｂ，スコアＣの値が算出されている。そして、図１２に示すように、負荷分散装置２００の振分機能部２１０（図８参照）は、低遅延サーバの中で最大のＣＰＵ値をＡ倍してもＣＰＵ閾値を超えないか否かを判定する。図１２の例では、例えば低遅延サーバの台数「１」かつ高遅延サーバの台数「６」の場合において、肩代わり信号数Ｓ「０」および「１」のとき、前記ＣＰＵ閾値を超え、肩代わり信号数Ｓ「２」〜「６」のときには、前記ＣＰＵ閾値を超えない。また、低遅延サーバの台数「２」かつ高遅延サーバの台数「５」の場合において、肩代わり信号数Ｓ「０」と「１」と「２」のとき、前記ＣＰＵ閾値を超え、肩代わり信号数Ｓ「３」〜「５」のときには、前記ＣＰＵ閾値を超えない。また、低遅延サーバの台数「３」かつ高遅延サーバの台数「４」の場合において、肩代わり信号数Ｓ「０」と「１」と「２」と「３」のとき、前記ＣＰＵ閾値を超え、肩代わり信号数Ｓ「４」のときのみ前記ＣＰＵ閾値を超えない。以下、低遅延サーバの台数「４」かつ高遅延サーバの台数「３」の場合、低遅延サーバの台数「５」かつ高遅延サーバの台数「２」の場合、低遅延サーバの台数「６」かつ高遅延サーバの台数「１」の場合は、肩代わり信号数Ｓの値にかかわらず、前記ＣＰＵ閾値を超える。 As shown in FIG. 12, in the combination of the number of low-delay servers and the number of high-delay servers, the load variation rate A, average signal processing TAT variation rate B of the low-delay server for each substitution signal number S in each combination, A value of score C is calculated. Then, as shown in FIG. 12, the distribution function unit 210 (see FIG. 8) of the load balancer 200 determines whether or not the CPU threshold is exceeded even if the maximum CPU value is multiplied by A among the low-latency servers. Determine. In the example of FIG. 12, for example, when the number of low-delay servers is “1” and the number of high-delay servers is “6”, when the number of substitution signals S is “0” and “1”, the CPU threshold value is exceeded and the substitution signal When the number S is “2” to “6”, the CPU threshold value is not exceeded. In the case of the number of low-delay servers “2” and the number of high-delay servers “5”, when the number of substitution signals S is “0”, “1”, and “2”, the CPU threshold is exceeded and the number of substitution signals When S “3” to “5”, the CPU threshold value is not exceeded. Further, when the number of low-delay servers is “3” and the number of high-delay servers is “4”, the CPU threshold is exceeded when the number of substitution signals S is “0”, “1”, “2”, and “3”. The CPU threshold value is not exceeded only when the number of substitution signals S is “4”. Hereinafter, when the number of low delay servers is “4” and the number of high delay servers is “3”, the number of low delay servers is “5” and the number of high delay servers is “2”, the number of low delay servers is “6”. When the number of high-delay servers is “1”, the CPU threshold value is exceeded regardless of the value of the substitution signal number S.

そして、図１２の符号ｍに示すように、低遅延サーバの中で最大のＣＰＵ値をＡ倍してもＣＰＵ閾値を超えないもののうち、最大のスコアＣとなる低遅延サーバの台数と高遅延サーバの台数と肩代わり信号数Ｓとの組合せで振り分けを行う。なお、単に最大のスコアＣとなるものは、低遅延サーバの台数「２」かつ高遅延サーバの台数「５」の場合において、肩代わり信号数Ｓ「５」のときのスコアＣ「６．７１」がある。しかし、この組合せは、ＣＰＵ閾値＝８０を超えた場合の肩代わり信号数Ｓ「５」による振り分け例である。つまり、処理サーバのＣＰＵ値を超えた状態においての振り分けとなるためシステム全体の処理時間は増大する。図１２の例では、低遅延サーバの台数「３」と高遅延サーバの台数「４」と肩代わり信号数Ｓ「３」の分だけ、本来高遅延サーバに振り分けられる信号を低遅延サーバに振り分ける。 Then, as shown by the symbol m in FIG. 12, among the low-delay servers, the maximum CPU value multiplied by A does not exceed the CPU threshold value, and the number of low-delay servers with the maximum score C and the high delay The distribution is performed by the combination of the number of servers and the number of signals S on the shoulder. The maximum score C is simply the score C “6.71” when the number of low-delay servers is “2” and the number of high-delay servers is “5” and the number of substitution signals S is “5”. There is. However, this combination is an example of distribution based on the number of substitution signals S “5” when the CPU threshold value exceeds 80. In other words, the processing time of the entire system increases because the distribution is performed when the CPU value of the processing server is exceeded. In the example of FIG. 12, signals that are originally distributed to the high-delay server are allocated to the low-delay servers by the number of low-delay servers “3”, the number of high-delay servers “4”, and the number of substitution signals S “3”.

ここで、前記第１の実施形態では、各処理サーバの応答時間（ＴＡＴ）を応答時間閾値と比較して、各処理サーバを低遅延サーバと高遅延サーバとに分類していた。これに対して、本実施形態では、第１の実施形態のような応答時間閾値を設けることなく、各処理サーバを処理ＴＡＴの小さい順に低遅延サーバとして１台ずつ、全台数−１まで増加させていき、それぞれの場合の低遅延サーバの台数と高遅延サーバの台数とを決定することで、各処理サーバを低遅延サーバと高遅延サーバとに分類する。すなわち、本実施形態では、一定値である応答時間閾値を用いずに、処理サーバの全体的な処理遅延や処理負荷を見て動的に、その時点で最適な低遅延サーバおよび高遅延サーバを分類するので、より適切な処理サーバの分類を行うことができる。その結果、負荷分散のさらなる最適化を図ることができ、より一層処理負荷を制限しつつ、応答時間をできるだけ早めることができる。 Here, in the first embodiment, the response time (TAT) of each processing server is compared with a response time threshold value, and each processing server is classified into a low delay server and a high delay server. On the other hand, in this embodiment, without setting a response time threshold as in the first embodiment, each processing server is increased as a low-delay server one by one in ascending order of processing TAT to the total number -1. Then, by determining the number of low latency servers and the number of high latency servers in each case, each processing server is classified into a low latency server and a high latency server. That is, in this embodiment, without using a response time threshold value that is a constant value, the low delay server and the high delay server that are optimal at that time are dynamically determined by looking at the overall processing delay and processing load of the processing server. Since classification is performed, more appropriate processing server classification can be performed. As a result, load distribution can be further optimized, and the response time can be shortened as much as possible while further limiting the processing load.

なお、上記各実施形態では、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂから減少関数（評価関数）を用いてスコアを算出しているが、平均信号処理ＴＡＴ変動率Ｂを用いるものであればよく、評価関数に基づくスコアを算出しないものでもよい。すなわち、低遅延サーバの負荷変動率Ａと平均信号処理ＴＡＴ変動率Ｂを用いて肩代わり信号数Ｓを決定するものであれば、どのようなものでもよい。
また、低遅延サーバの負荷変動率Ａを用いることなく、平均信号処理ＴＡＴ変動率Ｂのみを用いるものでもよい。この場合には、あらかじめＴＡＴ変動率Ｂ目標値を設定し、実測値との差値等から肩代わり信号数Ｓを求めることができる。 In each of the above embodiments, the score is calculated from the load fluctuation rate A and the average signal processing TAT fluctuation rate B of the low-delay server using a decreasing function (evaluation function). What is necessary is just to use, and the thing which does not calculate the score based on an evaluation function may be used. In other words, any method may be used as long as the shouldering signal number S is determined using the load fluctuation rate A and the average signal processing TAT fluctuation rate B of the low-delay server.
Further, only the average signal processing TAT fluctuation rate B may be used without using the load fluctuation rate A of the low delay server. In this case, the TAT fluctuation rate B target value is set in advance, and the shoulder signal number S can be obtained from the difference value from the actual measurement value or the like.

また、高遅延サーバに振り分けられる信号を低遅延サーバへ振り分ける（戻す）動作は、信号を他拠点から自拠点へ巻き取るイメージとなっている。この点で、低遅延サーバへの「振り分け（戻す）」は、「巻き取り」と呼称してもよい。 In addition, the operation of distributing (returning) the signal distributed to the high-delay server to the low-delay server is an image of winding the signal from another site to its own site. In this respect, “sort (return)” to the low-latency server may be referred to as “winding”.

また、上記各実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上述文書中や図面中に示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。
また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 In addition, among the processes described in the above embodiments, all or a part of the processes described as being automatically performed can be manually performed, or the processes described as being manually performed All or a part of the above can be automatically performed by a known method. In addition, the processing procedures, control procedures, specific names, and information including various data and parameters shown in the above-described document and drawings can be arbitrarily changed unless otherwise specified.
Further, each component of each illustrated apparatus is functionally conceptual, and does not necessarily need to be physically configured as illustrated. In other words, the specific form of distribution / integration of each device is not limited to that shown in the figure, and all or a part thereof may be functionally or physically distributed or arbitrarily distributed in arbitrary units according to various loads or usage conditions. Can be integrated and configured.

また、上記の各構成、機能、処理部、処理手段等は、それらの一部または全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行するためのソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、ＳＳＤ（Solid State Drive）等の記録装置、または、ＩＣ（Integrated Circuit）カード、ＳＤ（Secure Digital）カード、光ディスク等の記録媒体に保持することができる。また、本明細書において、時系列的な処理を記述する処理ステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理（例えば、並列処理あるいはオブジェクトによる処理）をも含むものである。 Each of the above-described configurations, functions, processing units, processing means, and the like may be realized by hardware by designing a part or all of them with, for example, an integrated circuit. Further, each of the above-described configurations, functions, and the like may be realized by software for interpreting and executing a program that realizes each function by the processor. Information such as programs, tables, and files for realizing each function is stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), an IC (Integrated Circuit) card, an SD (Secure Digital) card, an optical disk, etc. It can be held on a recording medium. Further, in this specification, the processing steps describing time-series processing are not limited to processing performed in time series according to the described order, but are not necessarily performed in time series, either in parallel or individually. The processing (for example, parallel processing or object processing) is also included.

１データセンタ（自拠点）
２データセンタ（他拠点）
１１〜１７，２１〜２４処理サーバ
３１〜３４クライアント
１００，２００負荷分散装置（処理時間測定手段，分類手段）
１１１，２１１肩代わり信号数算出部（信号数算出手段）
１１０，２１０振分機能部（振分手段）
１２０サーバ情報取得部（負荷情報取得手段）
１３０情報格納部（記憶手段）
１５０拠点情報
Ａ低遅延サーバの負荷変動率
Ｂ平均信号処理ＴＡＴ変動率
Ｓ肩代わり信号数
Ｃスコア 1 Data center (own site)
2 Data center (other bases)
11-17, 21-24 Processing server 31-34 Client 100, 200 Load balancer (processing time measuring means, classification means)
111, 211 Shoulder signal number calculation unit (signal number calculation means)
110, 210 Distribution function part (distribution means)
120 Server information acquisition unit (load information acquisition means)
130 Information storage unit (storage means)
150 Base information A Load variation rate of low-latency server B Average signal processing TAT variation rate S Number of shoulder signals C Score

Claims

A load balancer that includes a plurality of processing servers in a base and determines the processing server that executes information processing requested by a client,
A processing time measuring means for measuring a processing TAT (Turn Around Time) of each of the processing servers;
Classification means for classifying the processing server into a high-latency server and a low-latency server based on the measured processing TAT;
Storage means for storing a load limit value which is a limit value of processing of the low-latency server;
Load information acquisition means for acquiring load information of the low-latency server;
Based on the number of the high-delay servers, the number of the low-delay servers, the maximum value of the acquired load information, and the load variation rate of the processing server indicating the load variation of the low-delay server, the load limit value is calculated. A signal number calculating means for calculating the number of substitution signals to be distributed to the low-delay server within a range not exceeding the signal to be distributed to the low-delay server;
Distribution means for distributing the calculated number of substitution signals from the high delay server to the low delay server;
A load balancer comprising:

The signal number calculating means distributes the requests to the processing servers in a predetermined order based on a processing TAT when processed by the low-latency server and a processing TAT when processed by the high-latency server. Calculating the average signal processing TAT variation rate indicating the variation of the processing response time in the case,
Based on the load variation rate of the low-latency server and the average signal processing TAT variation rate, a score for evaluating the number of substitute signals is calculated,
The allocating means, based on the acquired maximum value of the load information and the load fluctuation rate of the low-latency server, from the range not exceeding the load limit value, the number of substitution signals serving as the maximum score The load distribution apparatus according to claim 1, wherein minutes are distributed from the high delay server to the low delay server.

The load distribution apparatus according to claim 1, wherein the load information acquisition unit acquires a CPU usage rate of the low-latency server as load information.

When the processing TAT when processed by the low delay server is 1, the processing TAT when processed by the high delay server is the average of the processing TAT of the high delay server by the average of the processing TAT of the low delay server. The load distribution apparatus according to claim 1, wherein the load distribution apparatus is approximated by a divided value.

The allocating unit multiplies the acquired maximum value of the load information by the load variation rate of the low-latency server, calculates a range of the shoulder signal number that does not exceed the CPU threshold, and calculates the calculated shoulder signal number 2. The load distribution apparatus according to claim 1, wherein the distribution is performed based on the value of the number of substitution signals having the maximum score in the range.

The classification means increases each processing server as a low-delay server in order from the smallest processing TAT to the total number -1, and the number of low-delay servers and the number of high-delay servers in each case And decide
The signal number calculating means includes the determined number of low-delay servers, the number of low-delay servers, a process TAT when processed by the low-delay server, and a process TAT when processed by the high-delay server; And calculating an average signal processing TAT variation rate indicating variation in processing response time when the requests are distributed to the processing servers in a predetermined order, and the load variation rate of the low delay server and the average signal Based on the processing TAT fluctuation rate, a score for evaluating the number of substitution signals is calculated,
The allocating means multiplies the maximum value of the acquired load information by the load fluctuation rate of the low-delay server, calculates the range of the substitution signal number not exceeding the CPU threshold value, and calculates the range of the calculated substitution signal number. The load distribution apparatus according to claim 1, wherein the distribution is performed based on the value of the number of substitution signals having the largest score.

A load balancing method for a load balancer that includes a plurality of processing servers in a base and determines the processing server that executes information processing requested by a client,
The load balancer is:
A processing time measuring step of measuring a processing TAT (Turn Around Time) of each processing server;
A classification step of classifying the processing server into a high-latency server and a low-latency server based on the measured processing TAT;
A storage step of storing a load limit value that is a limit value of processing of the low-latency server;
A load information acquisition step of acquiring load information of the low-latency server;
Based on the number of the high-delay servers, the number of the low-delay servers, the maximum value of the acquired load information, and the load variation rate of the processing server indicating the load variation of the low-delay server, the load limit value is calculated. A signal number calculating step for calculating the number of substitution signals to be distributed to the low delay server within a range not exceeding the signal to be distributed to the low delay server;
A distribution step of distributing the calculated number of substitution signals from the high delay server to the low delay server;
Load balancing method to execute.