JP4548168B2

JP4548168B2 - Load balance device, server system and load balance method thereof

Info

Publication number: JP4548168B2
Application number: JP2005082120A
Authority: JP
Inventors: 潤鈴木
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2005-03-22
Filing date: 2005-03-22
Publication date: 2010-09-22
Anticipated expiration: 2025-03-22
Also published as: JP2006268155A

Description

本発明は、複数の階層からなるサーバシステムに組み込まれるロードバランス装置、このロードバランス装置を備えた多階層のサーバシステム、及びそのロードバランス方法に関する。 The present invention relates to a load balance device incorporated in a server system composed of a plurality of hierarchies, a multi-tier server system including the load balance device, and a load balance method thereof.

従来より、複数の端末装置がサーバに接続して通信を行うウェブサーバシステムが使用されている。そして、複数のサーバを並列に動作させるサーバシステムにおいて、クライアントの端末装置が接続されるサーバ、即ち、フロントエンドサーバの負荷を管理するロードバランス装置を設ける技術が開示されている（例えば、特許文献１及び２参照。）。ロードバランス装置は、サーバシステム内に設けられた複数のフロントエンドサーバ間で負荷が均一になるように、処理を分散させるものである。例えば、応答時間が増大した場合に、予め設定されているポリシーに従って、処理能力の大小等のサーバの機差を考慮した最適サーバの選択、リクエストの投入抑止、予備サーバの追加等の方策を自動的に実施する。これにより、サーバシステム全体の処理速度を向上させ、例えばＷＷＷトランザクションの応答時間を短縮することができる。 Conventionally, a web server system in which a plurality of terminal devices connect to a server and perform communication has been used. In a server system in which a plurality of servers are operated in parallel, a technique is disclosed in which a server to which a client terminal device is connected, that is, a load balancing device that manages the load on a front-end server is provided (for example, Patent Documents). 1 and 2). The load balancer distributes processing so that the load is uniform among a plurality of front-end servers provided in the server system. For example, when response time increases, automatic measures such as selection of the optimal server, suppression of request input, and addition of spare servers are taken into account according to preset policies, taking into account differences in server capacity such as processing capacity To implement. Thereby, the processing speed of the entire server system can be improved, and for example, the response time of a WWW transaction can be shortened.

一方、近時、サーバに接続する端末装置の増加及びサーバと端末装置との間の通信処理量の増大による通信速度の低下を防止するために、ウェブサーバシステムを３階層化する技術が開発されている（例えば、特許文献３参照。）。このような３階層のウェブサーバシステムにおいては、フロントエンド層、中間層及びバックエンド層が３層構造を構成している。フロントエンド層には、ネットワークを介して各ユーザの端末装置が接続される複数のフロントエンドサーバが設けられており、バックエンド層には、データベースが記憶されたストレージ（外部記憶装置）が設けられており、中間層には、端末装置からフロントエンドサーバを介して入力されたリクエストをストレージに伝達すると共に、ストレージから出力されたデータを、フロントエンドサーバを介して端末装置に伝達するデータベースサーバが設けられている。 On the other hand, recently, in order to prevent a decrease in communication speed due to an increase in the number of terminal devices connected to the server and an increase in the amount of communication processing between the server and the terminal device, a technology for developing a three-level web server system has been developed. (For example, refer to Patent Document 3). In such a three-layer web server system, the front end layer, the intermediate layer, and the back end layer form a three-layer structure. The front-end layer is provided with a plurality of front-end servers to which each user's terminal device is connected via a network, and the back-end layer is provided with a storage (external storage device) in which a database is stored. In the intermediate layer, there is a database server that transmits a request input from the terminal device via the front-end server to the storage and transmits data output from the storage to the terminal device via the front-end server. Is provided.

特開２００２−３４２２９８号公報JP 2002-342298 A 特開２００３−１９６１７８号公報JP 2003-196178 A 特開２００１−５７１３号公報JP 2001-5713 A

しかしながら、上述の従来の技術には以下に示すような問題点がある。即ち、特許文献３に記載されているような３階層のサーバシステムにおいては、ロードバランス装置を使用してフロントエンドサーバの負荷を管理しても、応答時間を短縮する効果が不十分である。 However, the conventional techniques described above have the following problems. That is, in the three-tier server system described in Patent Document 3, even if the load on the front-end server is managed using the load balancer, the effect of shortening the response time is insufficient.

本発明はかかる問題点に鑑みてなされたものであって、多階層のサーバシステムに設けられ、このサーバシステムの応答時間を短縮するロードバランス装置、このロードバランス装置を備えたサーバシステム及びそのロードバランス方法を提供することを目的とする。 The present invention has been made in view of such a problem, and is provided in a multi-tier server system and reduces the response time of the server system, a server system including the load balance apparatus, and a load thereof It aims to provide a balance method.

本発明に係るロードバランス装置は、複数の第１サーバを含む第１層及び第２乃至第ｎ（ｎは２以上の整数）サーバを夫々含む第２乃至第ｎ層によりｎ階層構造をなすサーバシステムに組み込まれ、前記複数の第１サーバに対して処理を分散するロードバランス装置において、前記サーバシステムの応答時間が基準範囲内にあるか否かを判定する判定手段と、前記判定手段が基準範囲内にないと判定した場合に、前記第２乃至第ｎサーバの負荷を評価する負荷評価手段と、前記サーバシステムに予備の前記第１サーバが存在し、且つ、前記負荷評価手段が、前記第２乃至第ｎサーバがいずれも過負荷状態にないと評価した場合に、前記予備の第１サーバを前記第１層に組み込んで駆動させ、前記第２乃至第ｎサーバのいずれかが過負荷状態であると評価した場合に、前記予備の第１サーバを前記第１層に組み込まない過負荷抑止手段と、を有することを特徴とする。 A load balancing device according to the present invention is a server having an n-layer structure with a first layer including a plurality of first servers and a second to nth layers including second to nth (n is an integer of 2 or more) servers. In a load balance apparatus that is incorporated in a system and distributes processing to the plurality of first servers, a determination unit that determines whether a response time of the server system is within a reference range, and the determination unit is a reference When it is determined that it is not within the range, there is a load evaluation unit that evaluates the load of the second to nth servers, a spare first server exists in the server system, and the load evaluation unit includes When it is evaluated that none of the second to n-th servers are in an overload state, the spare first server is driven by being incorporated in the first layer, and any of the second to n-th servers is overloaded. In state When assessed with that, and having a overload inhibiting means not incorporated the first server of the preliminary to the first layer.

本発明においては、過負荷抑止手段が、第２乃至第ｎサーバのいずれかが過負荷状態である場合に、予備の第１サーバを第１層に組み込まないことにより、過負荷状態にあるサーバの負荷を増大させることを防止し、サーバシステム全体の応答時間を短縮することができる。 In the present invention, when any of the second to n-th servers is in an overload state, the overload prevention means does not incorporate the spare first server in the first layer, thereby causing the server in an overload state. Can be prevented, and the response time of the entire server system can be shortened.

また、前記過負荷抑止手段は、前記サーバシステムに前記予備の第１サーバが存在しない場合、または前記負荷評価手段が前記第２乃至第ｎサーバのいずれかが過負荷状態であると評価した場合に、前記第１サーバに対するリクエストを抑止するものであることが好ましい。これにより、第２乃至第ｎサーバの負荷を軽減することができる。 Further, the overload inhibition means, when the first server of the preliminary to the server system does not exist, or the case where the load evaluation unit evaluates to any one of the second to n server is overloaded In addition, it is preferable to suppress requests to the first server. Thereby, the load of the 2nd thru | or nth server can be reduced.

更に、前記ｎが３であってもよい。このとき、第３サーバは、データベースが記憶されたストレージであり、第２サーバは、前記第１サーバと前記第３サーバとの間に接続され、前記第１サーバから入力されたリクエストを前記第３サーバに伝達すると共に、前記第３サーバから出力されたデータを前記第１サーバに伝達するデータベースサーバであってもよい。 Further, n may be 3. At this time, the third server is a storage in which a database is stored, the second server is connected between the first server and the third server, and a request input from the first server is sent to the first server. The database server may transmit the data output from the third server to the first server while transmitting the data to the three servers.

本発明に係るサーバシステムは、複数の第１サーバを有する第１層と、第２乃至第ｎ（ｎは２以上の整数）サーバを夫々有し前記第１層と共にｎ階層構造をなす第２乃至第ｎ層と、前記複数の第１サーバに対して処理を分散するロードバランス装置と、を有するサーバシステムであって、前記ロードバランス装置は、前記サーバシステムの応答時間が基準範囲内にあるか否かを判定する判定手段と、前記判定手段が基準範囲内にないと判定した場合に、前記第２乃至第ｎサーバの負荷を評価する負荷評価手段と、予備の前記第１サーバが存在し、且つ、前記負荷評価手段が、前記第２乃至第ｎサーバがいずれも過負荷状態にないと評価した場合に、前記予備の第１サーバを前記第１層に組み込んで駆動させ、前記第２乃至第ｎサーバのいずれかが過負荷状態であると評価した場合に、前記予備の第１サーバを前記第１層に組み込まない過負荷抑止手段と、を有することを特徴とする。 A server system according to the present invention includes a first layer having a plurality of first servers, and second to n-th (n is an integer of 2 or more) servers, and a second layer having an n-layer structure together with the first layer. Thru | or nth layer, The load balance apparatus which distributes a process with respect to these 1st servers, Comprising: The said load balance apparatus has the response time of the said server system in the reference | standard range A determination means for determining whether or not there is a load evaluation means for evaluating the load of the second to n-th servers when it is determined that the determination means is not within a reference range; and the spare first server And when the load evaluation unit evaluates that the second to n-th servers are not overloaded, the spare first server is incorporated into the first layer and driven, Any of 2nd to nth servers There when assessed to be overloaded, and having a overload inhibiting means not incorporated the first server of the preliminary to the first layer.

本発明に係るサーバシステムのロードバランス方法は、複数の第１サーバを含む第１層及び第２乃至第ｎ（ｎは２以上の整数）サーバを夫々含む第２乃至第ｎ層によりｎ階層構造をなすサーバシステムのロードバランス方法において、前記複数の第１サーバに対して処理を分散する負荷分散工程と、この負荷分散工程の結果、前記サーバシステムの応答時間が基準範囲内にあるか否かを判断する工程と、前記第２乃至第ｎサーバの負荷を評価する負荷評価工程と、前記応答時間が前記基準範囲内になく、予備の前記第１サーバが存在し、且つ前記負荷評価工程で前記第２乃至第ｎサーバがいずれも過負荷状態にないと評価された場合に、予備の前記第１サーバを前記第１層に組み込んで駆動させ、前記第２乃至第ｎサーバのいずれかが過負荷状態であると評価された場合に、前記予備の第１サーバを前記第１層に組み込まない過負荷抑止工程と、を有することを特徴とする。 The server system load balancing method according to the present invention includes a first layer including a plurality of first servers and a second layer to an nth layer including second to nth (n is an integer of 2 or more) servers. In the load balancing method for the server system, the load balancing step for distributing the processing to the plurality of first servers, and whether the response time of the server system is within a reference range as a result of the load balancing step A load evaluation step for evaluating the load of the second to n-th servers, a response time not within the reference range, a spare first server exists, and the load evaluation step When it is evaluated that none of the second to n-th servers are in an overload state, the spare first server is incorporated into the first tier and driven, and any of the second to n-th servers is Overload When it is evaluated to be state, and having a overload suppression process that does not incorporate a first server of the preliminary to the first layer.

本発明によれば、第２乃至第ｎサーバのいずれかが過負荷状態である場合に、予備の第１サーバを第１層に組み込まないことにより、過負荷状態にあるサーバの負荷を増大させることを防止し、サーバシステム全体の応答時間を短縮することができる。 According to the present invention, when any of the second to nth servers is in an overload state, the load of the server in the overload state is increased by not incorporating the spare first server in the first layer. And the response time of the entire server system can be shortened.

以下、本発明の実施形態について添付の図面を参照して具体的に説明する。図１は、本実施形態に係るサーバシステムを示すブロック図であり、図２は、図１に示すロードバランス装置を示すブロック図である。図１に示すように、本実施形態に係るサーバシステム１は、３階層のサーバシステムである。 Hereinafter, embodiments of the present invention will be specifically described with reference to the accompanying drawings. FIG. 1 is a block diagram showing a server system according to the present embodiment, and FIG. 2 is a block diagram showing the load balance apparatus shown in FIG. As shown in FIG. 1, the server system 1 according to the present embodiment is a three-tier server system.

サーバシステム１のバックエンド層には、データベースが記憶されたストレージ（外部記憶装置）２が設けられている。ストレージ２は、例えばハードディスク等の大容量記憶装置を備えた装置である。また、データサーバ１のフロントエンド層には、複数のフロントエンドサーバ４が設けられている。フロントエンドサーバ４は、ネットワークを介して各ユーザの端末装置（図示せず）が接続されるものである。端末装置とは、例えばパーソナルコンピュータ等である。更に、サーバシステム１の中間層には、データベースサーバ３が設けられている。データベースサーバ３はストレージ２とフロントエンドサーバ４との間に接続されており、端末装置からフロントエンドサーバ４を介して入力されたリクエストをストレージ２に伝達すると共に、ストレージ２から出力されたデータをフロントエンドサーバ４を介して端末装置に伝達するものである。 In the back-end layer of the server system 1, a storage (external storage device) 2 in which a database is stored is provided. The storage 2 is a device provided with a mass storage device such as a hard disk. A plurality of front end servers 4 are provided in the front end layer of the data server 1. The front-end server 4 is connected to a terminal device (not shown) of each user via a network. The terminal device is, for example, a personal computer. Further, a database server 3 is provided in an intermediate layer of the server system 1. The database server 3 is connected between the storage 2 and the front-end server 4, transmits a request input from the terminal device via the front-end server 4 to the storage 2, and transmits the data output from the storage 2. This is transmitted to the terminal device via the front-end server 4.

更にまた、サーバシステム１には、ロードバランス装置５が設けられている。ロードバランス装置５は、ストレージ２、データベースサーバ３及び各フロントエンドサーバ４に接続されている。ロードバランス装置５は、ストレージ２、データベースサーバ３及びフロントエンドサーバ４の負荷を管理し、複数のフロントエンドサーバ４間で負荷が均一になるように処理を分散させると共に、ストレージ２、データベースサーバ３及びフロントエンドサーバ４の負荷が過剰となることを防止するために、リクエストの投入抑止及び予備サーバの追加等を行うものである。ロードバランス装置５は外部のネットワークにも接続可能となっている。 Furthermore, the server system 1 is provided with a load balance device 5. The load balance device 5 is connected to the storage 2, the database server 3, and each front end server 4. The load balancer 5 manages the load on the storage 2, the database server 3, and the front end server 4, distributes the processing so that the load is uniform among the plurality of front end servers 4, and stores the storage 2, the database server 3. In addition, in order to prevent the load on the front-end server 4 from becoming excessive, request input is suppressed and a spare server is added. The load balance device 5 can be connected to an external network.

図２に示すように、ロードバランス装置５においては、ストレージ２及びデータベースサーバ３の負荷状況を評価する負荷評価手段１１が設けられている。負荷評価手段１１には、データベースサーバ３（以下、ＤＢサーバ３ともいう）のＣＰＵ（Central Processing Unit：中央処理装置）の使用時間の測定を行うＤＢサーバＣＰＵ使用率測定手段１２が設けられている。また、負荷評価手段１１には、ＤＢサーバ３がリクエストを処理し、その処理結果をフロントエンドサーバ４に返すまでの応答時間（レスポンスタイム）を測定するＤＢサーバレスポンスタイム測定手段１３が設けられている。更に、負荷評価手段１１には、ストレージ２がリクエストを処理し、その処理結果をＤＢサーバ３に返すまでの遅延時間（レイテンシ）を測定するストレージＩ／Ｏレイテンシ測定手段１４が設けられている。更にまた、負荷評価手段１１には、ストレージ２で処理待ちとなっているトランザクションの数を測定するストレージＩ／Ｏ待ち行列測定手段１５が設けられている。 As shown in FIG. 2, the load balancer 5 is provided with a load evaluation unit 11 that evaluates the load status of the storage 2 and the database server 3. The load evaluation unit 11 is provided with a DB server CPU usage rate measurement unit 12 that measures the usage time of a CPU (Central Processing Unit) of the database server 3 (hereinafter also referred to as the DB server 3). . Further, the load evaluation unit 11 is provided with a DB server response time measurement unit 13 that measures a response time (response time) until the DB server 3 processes a request and returns the processing result to the front-end server 4. Yes. Furthermore, the load evaluation means 11 is provided with storage I / O latency measurement means 14 for measuring a delay time (latency) until the storage 2 processes a request and returns the processing result to the DB server 3. Furthermore, the load evaluation unit 11 is provided with a storage I / O queue measurement unit 15 that measures the number of transactions waiting for processing in the storage 2.

更にまた、負荷評価手段１１には、ＤＢサーバＣＰＵ使用率測定手段１２、ＤＢサーバレスポンスタイム測定手段１３、ストレージＩ／Ｏレイテンシ測定手段１４及びストレージＩ／Ｏ待ち行列測定手段１５に対して、決まった時間間隔で測定を指示し、その測定結果を集計する負荷状況集計手段１６が設けられている。 Furthermore, the load evaluation unit 11 is determined for the DB server CPU usage rate measurement unit 12, the DB server response time measurement unit 13, the storage I / O latency measurement unit 14, and the storage I / O queue measurement unit 15. Load status totaling means 16 is provided for instructing measurement at a predetermined time interval and totaling the measurement results.

また、ロードバランス装置５には、管理ポリシー記憶手段１７が設けられている。管理ポリシー記憶手段１７には、ストレージ２及びＤＢサーバ３の負荷が過剰であるか否かを判断する際の判断基準である管理ポリシー１８が記憶されている。 The load balance device 5 is provided with a management policy storage unit 17. The management policy storage unit 17 stores a management policy 18 that is a criterion for determining whether or not the loads on the storage 2 and the DB server 3 are excessive.

管理ポリシー１８において、ＤＢサーバ３の負荷が過剰であるとする基準は、例えば以下のとおりである。
（１）ＤＢサーバＣＰＵ使用率測定手段１２による測定の結果、ＤＢサーバ３のＣＰＵ使用率が９０％以上である場合。
（２）ＤＢサーバレスポンスタイム測定手段１３による測定の結果、ＤＢサーバ３のレスポンスタイムが１秒以上である場合。 In the management policy 18, the criterion that the load on the DB server 3 is excessive is, for example, as follows.
(1) When the CPU usage rate of the DB server 3 is 90% or more as a result of the measurement by the DB server CPU usage rate measuring means 12.
(2) When the response time of the DB server 3 is 1 second or longer as a result of the measurement by the DB server response time measuring means 13.

また、管理ポリシー１８において、ストレージ２の負荷が過剰であるとする基準は、例えば以下のとおりである。
（３）ストレージＩ／Ｏレイテンシ測定手段１４による測定の結果、ストレージ２のレイテンシが１５ｍｓ（ミリ秒）以上である場合。
（４）ストレージＩ／Ｏ待ち行列測定手段１５による測定の結果、ストレージ２の待ち行列数が５以上である場合。 In the management policy 18, for example, the standard that the load on the storage 2 is excessive is as follows.
(3) The case where the storage I / O latency measurement means 14 has measured the latency of the storage 2 to be 15 ms (milliseconds) or more.
(4) When the number of queues in the storage 2 is 5 or more as a result of the measurement by the storage I / O queue measurement means 15.

更にまた、ロードバランス装置５には、負荷状況集計手段１６の集計結果が入力され、管理ポリシー１８に従ってストレージ２及びＤＢサーバ３の負荷が過剰であるか否かを判断する負荷状況判断手段１９が設けられている。 Furthermore, the load balance device 5 is input with the total result of the load status totaling unit 16, and a load status determination unit 19 that determines whether the load on the storage 2 and the DB server 3 is excessive according to the management policy 18. Is provided.

一方、ロードバランス装置５においては、フロントエンドサーバ４の負荷状況を監視すると共に、複数のフロントエンドサーバ４間で負荷が均一になるように処理を分散させるフロントエンドサーバ負荷分散手段２０が設けられている。フロントエンドサーバ負荷分散手段２０は、各フロントエンドサーバ４に接続されており、また、ネットワークを介して端末装置（図示せず）に接続されるようになっている。 On the other hand, the load balancer 5 is provided with a front-end server load distribution unit 20 that monitors the load state of the front-end server 4 and distributes the processing so that the load is uniform among the plurality of front-end servers 4. ing. The front-end server load distribution means 20 is connected to each front-end server 4 and is connected to a terminal device (not shown) via a network.

また、ロードバランス装置５には、負荷状況判断手段１９からストレージ２及びＤＢサーバ３の負荷が過剰であるか否かの判断結果が入力されると共に、フロントエンドサーバ負荷分散手段２０からフロントエンドサーバ４の監視結果が入力され、これらの結果に基づいて、ストレージ２、データベースサーバ３及びフロントエンドサーバ４の負荷が過剰となることを防止するために、リクエストの投入抑止及び予備サーバの追加等を行う過負荷抑止手段２１が設けられている。 Further, the load balance device 5 receives a determination result as to whether or not the load on the storage 2 and the DB server 3 is excessive from the load status determination means 19, and from the front end server load distribution means 20 to the front end server. 4 is input, and on the basis of these results, in order to prevent the load on the storage 2, the database server 3 and the front-end server 4 from becoming excessive, it is possible to suppress request input and add a spare server. An overload suppression means 21 is provided.

なお、サーバシステム１におけるストレージ２、データベースサーバ３、フロントエンドサーバ４及びロードバランス装置５は、固定配線によって相互に接続されていてもよく、ネットワークを介して相互に接続されていてもよい。また、ロードバランス装置５において、管理ポリシー記憶手段１７以外の各手段は、ハードウエアによって構成されていてもよく、ソフトウエアによって構成されていてもよい。 Note that the storage 2, the database server 3, the front-end server 4, and the load balance device 5 in the server system 1 may be connected to each other by a fixed wiring, or may be connected to each other via a network. In the load balance device 5, each unit other than the management policy storage unit 17 may be configured by hardware or may be configured by software.

次に、上述の如く構成された本実施形態に係るサーバシステムの動作、即ち、本実施形態に係るサーバシステムのロードバランス方法について、図１乃至図３を参照して説明する。図３は、本実施形態に係るサーバシステムの動作を示すフローチャート図である。サーバシステム１が動作しているときには、各ユーザの端末装置がネットワークを介してサーバシステム１のロードバランス装置５に接続される。そして、各端末装置から種々のリクエストがロードバランス装置５に入力され、ロードバランス装置５はフロントエンドサーバ４に対してこのリクエストを振り分ける。フロントエンドサーバ４は、必要に応じてＤＢサーバ３を介してストレージ２からデータベースの一部を読出し、リクエストに対応する処理を行う。そして、その処理結果を、ロードバランス装置５を介して端末装置に返答する。 Next, the operation of the server system according to the present embodiment configured as described above, that is, the load balancing method of the server system according to the present embodiment will be described with reference to FIGS. FIG. 3 is a flowchart showing the operation of the server system according to this embodiment. When the server system 1 is operating, each user's terminal device is connected to the load balance device 5 of the server system 1 via a network. Various requests are input from the terminal devices to the load balance device 5, and the load balance device 5 distributes the requests to the front end server 4. The front-end server 4 reads a part of the database from the storage 2 via the DB server 3 as necessary, and performs processing corresponding to the request. Then, the processing result is returned to the terminal device via the load balance device 5.

このとき、図３のステップＳ１に示すように、ロードバランス装置５のフロントエンドサーバ負荷分散手段２０は、複数のフロントエンドサーバ４間で負荷が分散されるように、リクエストを送るフロントエンドサーバ４を選択する。即ち、フロントエンドサーバ４に対するロードバランス施策を実施する。具体的には、下記（ａ）乃至（ｄ）のいずれかの方式を実施して、リクエストの送付先を選択する。
（ａ）リクエストが来た順番に、各フロントエンドサーバ４を順々に選択するラウンドロビン方式。
（ｂ）各フロントエンドサーバ４に対する端末装置の接続数を監視して、最も接続数が少ないフロントエンドサーバ４を選択する方式。
（ｃ）測定用パケットを各フロントエンドサーバ４に送り、その応答時間を測定することによって各フロントエンドサーバ４の負荷の大きさを評価して、応答が速いフロントエンドサーバを選択する方式。
（ｄ）フロントエンドサーバ４に優先順位を設定し、アクセス量に閾値（スレッショルド）を設け、優先順位がより上位のフロントエンドサーバのアクセス量がスレッショルドを超えたら、優先順位がそれより下位のフロントエンドサーバを選択する方式。 At this time, as shown in step S <b> 1 of FIG. 3, the front-end server load distribution unit 20 of the load balancer 5 sends the request so that the load is distributed among the plurality of front-end servers 4. Select. That is, a load balance measure for the front end server 4 is implemented. Specifically, one of the following methods (a) to (d) is performed to select a request destination.
(A) A round robin method in which the front-end servers 4 are selected in order in the order in which requests are received.
(B) A method of monitoring the number of terminal devices connected to each front-end server 4 and selecting the front-end server 4 having the smallest number of connections.
(C) A method of selecting a front-end server that responds quickly by evaluating the magnitude of the load on each front-end server 4 by sending a measurement packet to each front-end server 4 and measuring its response time.
(D) A priority order is set for the front-end server 4 and a threshold value (threshold) is set for the access amount. A method for selecting an end server.

そして、ステップＳ２に示すように、フロントエンドサーバ負荷分散手段２０が、ステップＳ１の施策によりサーバシステムの応答時間が基準範囲内にあるか否かを判定する。応答時間が基準範囲内にある場合には、ステップＳ１に戻り、ロードバランス施策を継続する。一方、応答時間が基準範囲よりも長くなった場合には、ステップＳ３に進む。 Then, as shown in step S2, the front-end server load distribution means 20 determines whether or not the response time of the server system is within the reference range by the measure of step S1. If the response time is within the reference range, the process returns to step S1 to continue the load balance measure. On the other hand, if the response time is longer than the reference range, the process proceeds to step S3.

ステップＳ３においては、追加投入する予備のフロントエンドサーバ（以下、予備サーバともいう）の有無を確認する。予備サーバが無い場合は、ステップＳ４に進み、過負荷抑止手段２１が新たなリクエストの投入を抑止する。そして、ステップＳ１に戻り、ロードバランス施策を継続する。 In step S3, it is confirmed whether there is a spare front-end server (hereinafter also referred to as a spare server) to be additionally input. If there is no spare server, the process proceeds to step S4, where the overload suppression means 21 suppresses the input of a new request. And it returns to step S1 and continues a load balance measure.

一方、予備サーバが存在する場合は、ステップＳ５に進み、予備サーバを投入することによってサーバシステム全体の応答時間が短縮化されるか否かを調べる。図２に示すように、負荷状況集計手段１６は、一定の時間間隔で、ＤＢサーバＣＰＵ使用率測定手段１２、ＤＢサーバレスポンスタイム測定手段１３、ストレージＩ／Ｏレイテンシ測定手段１４、ストレージＩ／Ｏ待ち行列測定手段１５に対して測定を指示する。これにより、負荷評価手段１１におけるＤＢサーバＣＰＵ使用率測定手段１２が、ＤＢサーバ３のＣＰＵの使用時間を測定する。また、ＤＢサーバレスポンスタイム測定手段１３が、ＤＢサーバ３がリクエストを処理し、その処理結果をフロントエンドサーバ４に返すまでの応答時間（レスポンスタイム）を測定する。更に、ストレージＩ／Ｏレイテンシ測定手段１４が、ストレージ２がリクエストを処理し、その処理結果をＤＢサーバ３に返すまでの遅延時間（レイテンシ）を測定する。更にまた、ストレージＩ／Ｏ待ち行列測定手段１５が、ストレージ２で処理待ちとなっているトランザクションの数を測定する。そして、これらの測定結果を負荷状況集計手段１６に対して出力する。 On the other hand, if there is a spare server, the process proceeds to step S5, and it is checked whether or not the response time of the entire server system is shortened by inserting the spare server. As shown in FIG. 2, the load status totaling unit 16 includes a DB server CPU usage rate measuring unit 12, a DB server response time measuring unit 13, a storage I / O latency measuring unit 14, and a storage I / O at regular time intervals. Measurement is instructed to the queue measuring means 15. Thereby, the DB server CPU usage rate measuring means 12 in the load evaluating means 11 measures the usage time of the CPU of the DB server 3. Further, the DB server response time measuring unit 13 measures a response time (response time) until the DB server 3 processes a request and returns the processing result to the front end server 4. Further, the storage I / O latency measuring unit 14 measures a delay time (latency) until the storage 2 processes the request and returns the processing result to the DB server 3. Furthermore, the storage I / O queue measuring means 15 measures the number of transactions waiting for processing in the storage 2. Then, these measurement results are output to the load status totaling means 16.

負荷状況集計手段１６は、これらの測定結果を集計し、負荷状況判断手段１９に対して出力する。負荷状況集計手段１６は一定時間毎に前記測定結果を取得して集計しているため、負荷状況判断手段１９は、一番最近に集計された結果に基づき、管理ポリシー記憶手段１７から読み出した管理ポリシー１８に従って、ストレージ２及びＤＢサーバ３が過負荷であるか否かを判断する。 The load status totaling unit 16 totals these measurement results and outputs them to the load status determination unit 19. Since the load status totalization unit 16 acquires and aggregates the measurement results at regular intervals, the load status determination unit 19 performs the management read from the management policy storage unit 17 based on the most recent totalization result. In accordance with the policy 18, it is determined whether or not the storage 2 and the DB server 3 are overloaded.

具体的には、以下のような場合に、ＤＢサーバ３が過負荷であると判断する。
（１）ＤＢサーバ３のＣＰＵ使用率が９０％以上である場合。
（２）ＤＢサーバ３のレスポンスタイムが１秒以上である場合。
また、以下のような場合に、ストレージ２が過負荷であると判断する。
（３）ストレージ２のレイテンシが１５ｍｓ（ミリ秒）以上である場合。
（４）ストレージ２の待ち行列数が５以上である場合。 Specifically, it is determined that the DB server 3 is overloaded in the following cases.
(1) The CPU usage rate of the DB server 3 is 90% or more.
(2) When the response time of the DB server 3 is 1 second or longer.
Further, it is determined that the storage 2 is overloaded in the following cases.
(3) When the latency of the storage 2 is 15 ms (milliseconds) or more.
(4) The number of queues in the storage 2 is 5 or more.

そして、図３のステップＳ６に示すように、ストレージ２及びＤＢサーバ３の双方が過負荷状況でない場合は、ステップＳ７に進み、予備サーバをフロントエンド層に組み込んで駆動させる。これにより、サーバシステム１におけるフロントエンドサーバ４の数が増加し、その分、フロントエンドサーバ４における１台当たりの負荷が減少し、各フロントエンドサーバ４の応答時間が短縮する。この結果、サーバシステム１全体の処理速度が向上し、応答時間が短縮される。その後、ステップＳ１に戻り、ロードバランス施策を継続する。一方、ストレージ２及びＤＢサーバ３の少なくとも一方が過負荷状況である場合は、ステップＳ７に進み、予備サーバをフロントエンド層に組み込むことなく、新たなリクエストの投入を抑止する。その後、ステップＳ１に戻り、ロードバランス施策を継続する。 Then, as shown in step S6 of FIG. 3, when both the storage 2 and the DB server 3 are not overloaded, the process proceeds to step S7, and the spare server is incorporated into the front end layer and driven. As a result, the number of front-end servers 4 in the server system 1 increases, and the load per unit in the front-end server 4 decreases accordingly, and the response time of each front-end server 4 is shortened. As a result, the processing speed of the entire server system 1 is improved and the response time is shortened. Then, it returns to step S1 and continues a load balance measure. On the other hand, if at least one of the storage 2 and the DB server 3 is in an overload state, the process proceeds to step S7, and the introduction of a new request is suppressed without incorporating the spare server into the front end layer. Then, it returns to step S1 and continues a load balance measure.

次に、本実施形態における効果について説明する。例えば特許文献１及び２に記載されているような従来のロードバランス装置を、例えば特許文献３に記載されているような３階層のサーバシステムに適用した場合、ロードバランス装置は、フロントエンドサーバ間のみで負荷の分散を行う。即ち、ロードバランス装置は、フロントエンド層のみで閉じて負荷分散を実施する。そして、フロントエンドサーバ間における負荷分散によっても応答時間を短縮化できない場合は、予備サーバが存在する場合にはそれをサーバシステムに組み込み、予備サーバがない場合には新たなリクエストの投入を抑止する。 Next, the effect in this embodiment is demonstrated. For example, when a conventional load balance device as described in Patent Documents 1 and 2 is applied to a three-tier server system as described in Patent Document 3, for example, the load balance device is connected between front-end servers. Just distribute the load. In other words, the load balancer is closed only by the front end layer and performs load distribution. If the response time cannot be shortened even by load balancing between front-end servers, if there is a spare server, it is built into the server system, and if there is no spare server, new requests are suppressed. .

しかしながら、サーバシステム全体の応答時間は、フロントエンドサーバの負荷管理だけでは改善できない場合がある。また、予備のフロントエンドサーバを組み込むことにより、かえってサーバシステム全体の応答時間が延びる場合もある。即ち、サーバシステム全体を見た場合、フロントエンドサーバの負荷よりも、ＤＢサーバ又はストレージの負荷の方が高くなっていて、ＤＢサーバ又はストレージが時間的なボトルネックになっている場合がある。このような場合は、予備サーバを投入してフロントエンドサーバの数を増やしても、サーバシステム全体の応答時間は改善されず、逆に、フロントエンドサーバの数を増やすことにより、ＤＢサーバ及びストレージの負荷が更に高くなり、サーバシステム全体の応答時間が更に延びてしまう可能性が高い。 However, the response time of the entire server system may not be improved by load management of the front-end server alone. In addition, by incorporating a spare front-end server, the response time of the entire server system may be increased. That is, when looking at the entire server system, the load on the DB server or storage is higher than the load on the front-end server, and the DB server or storage may become a time bottleneck. In such a case, even if the number of front-end servers is increased by introducing spare servers, the response time of the entire server system is not improved. Conversely, by increasing the number of front-end servers, the DB server and storage There is a high possibility that the response time of the entire server system will further increase.

例えば、ＤＢサーバのＣＰＵの使用率が１００％に近ければ、フロントエンドサーバを追加しても、ＤＢサーバが処理しきれず、結果的にシステム応答時間を短縮できない。また、ＤＢサーバのレスポンスタイムが長い場合は、ＤＢサーバの負荷が高いことが示されており、フロントエンドサーバを追加しても、ＤＢサーバが処理しきれず、結果的にシステム応答時間を短縮できない。更に、ストレージのレイテンシが長い場合は、ストレージの負荷が高いことが示されており、フロントエンドサーバを追加しても、ストレージが処理しきれず、結果的にシステム応答時間を短縮できない。更にまた、ストレージで処理待ちとなっているトランザクションの数が多い場合は、ストレージの負荷が高くなっており、フロントエンドサーバを追加しても、ストレージが処理しきれず、結果的にシステム応答時間を短縮できない。このように、従来のロードバランス装置では、３階層サーバシステムの応答時間を改善できないことがある。 For example, if the CPU usage rate of the DB server is close to 100%, even if a front-end server is added, the DB server cannot be processed, and as a result, the system response time cannot be shortened. In addition, when the response time of the DB server is long, it is indicated that the load on the DB server is high, and even if a front-end server is added, the DB server cannot be processed and consequently the system response time cannot be shortened. . Further, when the storage latency is long, it is indicated that the load on the storage is high, and even if a front-end server is added, the storage cannot be processed, and as a result, the system response time cannot be shortened. Furthermore, if there are a large number of transactions waiting to be processed in the storage, the storage load is high, and even if a front-end server is added, the storage cannot be processed, resulting in an increase in system response time. Cannot be shortened. As described above, the conventional load balancing apparatus may not be able to improve the response time of the three-tier server system.

これに対して、本実施形態においては、負荷評価手段１１がストレージ２及びＤＢサーバ３の負荷を表す各種のパラメータを測定し、負荷状況集計手段１６がこの測定結果を集計し、負荷状況判断手段１９がこの集計結果に基づいてストレージ２及びＤＢサーバ３が過負荷状態であるか否かを判断し、過負荷抑止手段２１が、ストレージ２及びＤＢサーバ３のうち少なくとも一方が過負荷状態にある場合は、予備サーバが存在する場合であっても、予備サーバをサーバシステム１に組み込まず、新たなリクエストの投入を抑止する。これにより、予備サーバを組み込むことにより、サーバシステム１全体の応答性がかえって低下することを防止できる。また、新たなリクエストの投入を抑止することにより、ストレージ２及びＤＢサーバ３における負荷を軽減することができる。 On the other hand, in the present embodiment, the load evaluation unit 11 measures various parameters representing the loads of the storage 2 and the DB server 3, and the load status totaling unit 16 totals the measurement results, and the load status determination unit 19 determines whether the storage 2 and the DB server 3 are in an overload state based on the counting result, and the overload suppression means 21 has at least one of the storage 2 and the DB server 3 in an overload state. In this case, even if a spare server exists, the spare server is not incorporated into the server system 1 and the input of a new request is suppressed. Thereby, by incorporating a spare server, it is possible to prevent the overall response of the server system 1 from being lowered. Further, by suppressing the input of a new request, the load on the storage 2 and the DB server 3 can be reduced.

なお、本実施形態においては、管理ポリシー１８としてストレージ２及びＤＢサーバ３に関する基準を設け、負荷状況判断手段１９がこの管理ポリシー１８を参照してストレージ２及びＤＢサーバ３の負荷状況を判断する例を示したが、本発明はこれに限定されない。負荷状況判断手段は、ストレージ２、ＤＢサーバ３及びフロントエンドサーバ４の３者の負荷状況を加味して、予備サーバを追加するか否かを決定してもよい。 In the present embodiment, an example in which a standard regarding the storage 2 and the DB server 3 is set as the management policy 18 and the load status determination unit 19 determines the load status of the storage 2 and the DB server 3 with reference to the management policy 18. However, the present invention is not limited to this. The load status determination unit may determine whether or not to add a spare server in consideration of the load status of the three parties of the storage 2, the DB server 3, and the front-end server 4.

また、本実施形態においては、サーバシステムとして３階層のサーバシステムを示したが、本発明はこれに限定されない。例えば、フロントエンドサーバ及びバックエンドサーバからなる２階層のサーバシステムにおいても、バックエンドサーバの負荷状態によって予備のフロントエンドサーバを投入するか否かを判断することで、サーバシステム全体の応答時間を短縮することができる。また、４階層以上のサーバシステムにおいても、フロントエンド層以外の階層の負荷状態を評価し、この評価結果に応じて、予備のフロントエンドサーバをフロントエンド層に組み込むか否かを判断することで、サーバシステム全体の応答時間を短縮することができる。 In the present embodiment, a three-tier server system is shown as the server system, but the present invention is not limited to this. For example, even in a two-tier server system composed of a front-end server and a back-end server, the response time of the entire server system can be reduced by determining whether or not a spare front-end server is to be inserted depending on the load state of the back-end server. It can be shortened. Also, in a server system with four or more layers, it is possible to evaluate the load state of the layers other than the front end layer and determine whether or not to install a spare front end server in the front end layer according to the evaluation result. The response time of the entire server system can be shortened.

更に、本実施形態においては、サーバシステムが複数のフロントエンドサーバを備える例を示したが、本発明はこれに限定されず、本発明は、セルベースマシン等、１台のマシンが複数のＣＰＵを備え、これらのＣＰＵを分割して動的にパーテション構成を変更することが可能な装置にも適用することができる。これにより、このようなマシンにおいて、動的に構成が変更されるパーテション間のロードバランスを的確に調整することができる。 Furthermore, in the present embodiment, an example in which the server system includes a plurality of front-end servers has been described. However, the present invention is not limited to this, and the present invention includes a cell-based machine, such as one machine having a plurality of CPUs. It can also be applied to an apparatus that can divide these CPUs and dynamically change the partition configuration. Thereby, in such a machine, it is possible to accurately adjust the load balance between partitions whose configuration is dynamically changed.

本発明は、多階層サーバシステムに好適に利用することができる。 The present invention can be suitably used for a multi-tier server system.

本発明の第１の実施形態に係るサーバシステムを示すブロック図である。It is a block diagram which shows the server system which concerns on the 1st Embodiment of this invention. 図１に示すロードバランス装置を示すブロック図である。It is a block diagram which shows the load balance apparatus shown in FIG. 本実施形態に係るサーバシステムの動作を示すフローチャート図である。It is a flowchart figure which shows operation | movement of the server system which concerns on this embodiment.

Explanation of symbols

１；サーバシステム
２；ストレージ
３；データベースサーバ
４；フロントエンドサーバ
５；ロードバランス装置
１１；負荷評価手段
１２；ＤＢサーバＣＰＵ使用率測定手段
１３；ＤＢサーバレスポンスタイム測定手段
１４；ストレージＩ／Ｏレイテンシ測定手段
１５；ストレージＩ／Ｏ待ち行列測定手段
１６；負荷状況集計手段
１７；管理ポリシー記憶手段
１８；管理ポリシー
１９；負荷状況判断手段
２０；フロントエンドサーバ負荷分散手段
２１；過負荷抑止手段 DESCRIPTION OF SYMBOLS 1; Server system 2; Storage 3; Database server 4; Front-end server 5; Load balance apparatus 11; Load evaluation means 12; DB server CPU utilization measurement means 13; DB server response time measurement means 14; Storage I / O latency Measurement means 15; Storage I / O queue measurement means 16; Load status totaling means 17; Management policy storage means 18; Management policy 19; Load status judgment means 20; Front-end server load distribution means 21;

Claims

The first layer including a plurality of first servers and the second to nth layers including second to n-th (n is an integer of 2 or more) servers are incorporated in a server system having an n-layer structure, In a load balancer that distributes processing to one server,
Determining means for determining whether or not the response time of the server system is within a reference range;
A load evaluation unit that evaluates a load of the second to n-th servers when the determination unit determines that it is not within a reference range;
When the spare first server exists in the server system, and the load evaluating means evaluates that the second to nth servers are not overloaded, the spare first server is Overload inhibiting means that does not incorporate the spare first server into the first layer when it is incorporated and driven in the first layer and it is evaluated that any of the second to nth servers is overloaded. When,
A load balance device comprising:

Based on the evaluation result of the load evaluation means, it is determined whether or not the second to n-th servers are in an overload state, and load status determination means for outputting the result to the overload suppression means. The load balance apparatus according to claim 1.

The overload inhibition means, when if the absence of the first server of the spare server system, or if any of the load evaluation means and the second to n-th server was evaluated as overloaded, The load balancing apparatus according to claim 1, wherein the load balancing apparatus suppresses requests to the first server.

The load balance device according to any one of claims 1 to 3, wherein the n is 3.

The third server is a storage in which a database is stored, the second server is connected between the first server and the third server, and a request input from the first server is transmitted to the third server. 5. The load balancer according to claim 4, wherein the load balancer is a database server that transmits data output from the third server to the first server while transmitting the data to the server.

6. The load balancing apparatus according to claim 5, wherein the load evaluation unit includes a unit that measures a usage time of the CPU of the database server.

6. The load balancing apparatus according to claim 5, wherein the load evaluation unit includes a unit that measures a response time until the database server processes a request and returns the processing result to the first server.

6. The load balancing apparatus according to claim 5, wherein the load evaluation unit includes a unit that measures a delay time until the storage processes a request and returns the processing result to the database server.

6. The load balancing apparatus according to claim 5, wherein the load evaluation unit includes a unit that measures the number of processes waiting for processing in the storage.

Means for measuring the CPU usage time of the database server; means for measuring a response time until the database server processes a request and returns a processing result to the first server; At least two or more means among a means for processing a request and measuring a delay time until the processing result is returned to the database server and a means for measuring the number of processes waiting for processing in the storage; 6. The load balance device according to claim 5, further comprising a load status totaling unit that totalizes the measurement results of the two or more units.

A first layer having a plurality of first servers, a second layer to an nth layer each having second to nth (n is an integer of 2 or more) servers and having an n-layer structure with the first layer, and the plurality of layers A load balancer that distributes processing to the first server of the server system,
The load balance device is:
Determining means for determining whether or not the response time of the server system is within a reference range;
A load evaluation unit that evaluates a load of the second to n-th servers when the determination unit determines that it is not within a reference range;
When the spare first server exists and the load evaluation unit evaluates that the second to nth servers are not overloaded, the spare first server is designated as the first layer. An overload inhibiting means that does not incorporate the spare first server into the first layer when any of the second to n-th servers is evaluated as being overloaded. A server system characterized by that.

The load balancer determines whether the second to n-th servers are in an overload state based on the evaluation result of the load evaluation unit, and outputs the result to the overload suppression unit. The server system according to claim 11, further comprising a load status determination unit.

The overload inhibition means, when if the absence of the first server of the spare server system, or if any of the load evaluation means and the second to n-th server was evaluated as overloaded, The server system according to claim 11 or 12, wherein a request for the first server is suppressed.

The server system according to any one of claims 11 to 13, wherein the n is 3.

The third server is a storage in which a database is stored, the second server is connected between the first server and the third server, and a request input from the first server is transmitted to the third server. 15. The server system according to claim 14, wherein the server system is a database server that transmits data output from the third server to the first server while transmitting the data to the server.

The server system according to claim 15, wherein the load evaluation unit includes a unit that measures a usage time of the CPU of the database server.

The server system according to claim 15, wherein the load evaluation unit includes a unit that measures a response time until the database server processes a request and returns a processing result to the first server.

The server system according to claim 15, wherein the load evaluation unit includes a unit that measures a delay time until the storage processes a request and returns a processing result to the database server.

The server system according to claim 15, wherein the load evaluation unit includes a unit that measures the number of processes waiting for processing in the storage.

Means for measuring the CPU usage time of the database server; means for measuring a response time until the database server processes a request and returns a processing result to the first server; At least two or more means among a means for processing a request and measuring a delay time until the processing result is returned to the database server and a means for measuring the number of processes waiting for processing in the storage; The server system according to claim 15, further comprising a load status totaling unit that totalizes the measurement results of the two or more units.

In a load balancing method for a server system having an n-layer structure with a first layer including a plurality of first servers and a second to n-th layer including second to n-th (n is an integer of 2 or more) servers,
A load distribution step for distributing processing to the plurality of first servers;
A step of determining whether or not the response time of the server system is within a reference range as a result of the load distribution step;
A load evaluation step for evaluating a load of the second to n-th servers;
Not in said response time is the reference range, when the preliminary first server is present, and the at the load evaluation step second to n server evaluates not any overload, preliminary When the first server is incorporated into the first layer and driven, and any of the second to n-th servers is evaluated to be overloaded, the spare first server is replaced with the first layer. Overload suppression process not incorporated in
A load balancing method for a server system, comprising:

Between the load evaluation step and the overload suppression step, there is a load status determination step for determining whether the second to nth servers are in an overload state based on an evaluation result in the load evaluation step. The server system load balancing method according to claim 21, wherein the server system is load balanced.

In the overload suppression step, when the response time is not within the reference range and the spare first server does not exist in the server system, or the response time is not within the reference range and the load evaluation step The load balance of the server system according to claim 21 or 22, wherein when any of the second to n-th servers is evaluated as being overloaded, a request to the first server is suppressed. Method.

The server system load balancing method according to any one of claims 21 to 23, wherein n is 3.

The third server is a storage in which a database is stored, the second server is connected between the first server and the third server, and a request input from the first server is transmitted to the third server. 25. The load balancing method for a server system according to claim 24, wherein the server is a database server that transmits data output from the third server to the first server while transmitting the data to the server.

26. The load balancing method for a server system according to claim 25, wherein the load evaluation step includes a step of measuring a usage time of the CPU of the database server.

The load of the server system according to claim 25, wherein the load evaluation step includes a step of measuring a response time until the database server processes a request and returns the processing result to the first server. Balance method.

26. The load balancing method for a server system according to claim 25, wherein the load evaluation step includes a step of measuring a delay time until the storage processes a request and returns the processing result to the database server. .

26. The load balancing method for a server system according to claim 25, wherein the load evaluation step includes a step of measuring the number of processes waiting for processing in the storage.

The load evaluating step measuring the usage time of the CPU of the database server; measuring the response time until the database server processes a request and returns the processing result to the first server; At least two or more of a process of processing a request and measuring a delay time until the process result is returned to the database server and a process of measuring the number of processes waiting for processing in the storage; 26. The load balance method for a server system according to claim 25, further comprising a load status totaling step of totaling measurement results in the two or more steps.