JP2012053899A

JP2012053899A - Operation management device and information processing system

Info

Publication number: JP2012053899A
Application number: JP2011234898A
Authority: JP
Inventors: Tomoo Misaki; 友雄三崎; Kazuhiko Matsusei; 和彦松政
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2011-10-26
Filing date: 2011-10-26
Publication date: 2012-03-15
Anticipated expiration: 2030-03-26
Also published as: JP5309200B2

Abstract

PROBLEM TO BE SOLVED: To reduce power consumption of servers performing parallel processing.SOLUTION: An operation management device 1010 comprises: a load history holding unit 1112 for holding a history of the total access number for an information processing device composed of a plurality of servers connected via a network; a load estimation unit 1134 for estimating the total access number occurring in the future by referring to the load history holding unit 1112; a state setting unit 1140 for setting at least one of the servers in a power saving state when the total access number estimated by the load estimation unit 1134 is less than an access threshold value; and an index calculation unit 100 for calculating indexes indicating operation conditions of the plurality of servers. The state setting unit 1140 selects servers to set in the power saving state based on the indexes calculated by the index calculation unit 100.

Description

本発明は、運用管理装置および情報処理システムに関する。 The present invention relates to an operation management apparatus and an information processing system.

地球温暖化と言う問題が、昨今聞かれる。地球温暖化対策の一つとしては電力需要を減らすことがある。電力は一般的に水力発電、太陽光発電、風力発電、地熱発電などの自然由来の発電や、原子力を使う原子力発電や、ガス、石油、石炭などを燃料として燃やし発電を行う火力発電によって供給されている。この中でも特に火力発電は地球温暖化を進める大きな要因となっている。電力需要を減らせば火力発電の発電量も減らすことができるので、その分地球温暖化も抑止されうる。 The problem of global warming has been heard recently. One measure against global warming is to reduce demand for electricity. Electricity is generally supplied by hydropower, solar power, wind power, geothermal power, and other natural power generation, nuclear power generation using nuclear power, and thermal power generation that uses gas, oil, coal, etc. as fuel to generate power. ing. Of these, thermal power generation is a major factor in promoting global warming. Decreasing power demand can reduce the amount of thermal power generation, so global warming can be suppressed accordingly.

ＩＴ（Information Technology）やＩＣＴ（Information and Communication Technology）の分野でも、コンピュータやネットワーク機器、データセンタ機器が消費する電力を低減する必要性が指摘され始めている。 In the fields of IT (Information Technology) and ICT (Information and Communication Technology), the necessity of reducing the power consumed by computers, network devices, and data center devices has begun to be pointed out.

近年のインターネットの普及により、多くのデータセンタは、ネットワーク接続型の形態を有しており、インターネットなどの外部のネットワークからの仕事要求を受けて処理を行う。これらのデータセンタのなかには、一度に多くのアクセスを処理するために複数の並列に配置されたサーバを備える構成を採用したものがある（特許文献１参照）。 With the spread of the Internet in recent years, many data centers have a network connection type, and perform processing in response to a job request from an external network such as the Internet. Some of these data centers employ a configuration including a plurality of servers arranged in parallel in order to process many accesses at once (see Patent Document 1).

このようなデータセンタを管理する運用管理システムとしては、株式会社野村総合研究所が提供する千手（登録商標）や株式会社日立製作所が提供するＯｐｅｎＴＰ１００１（登録商標）や富士通株式会社が提供するシステムウォーカーがある。
また、ネットワーク接続装置の消費電力低減装置が知られている（特許文献２参照）。 As an operation management system for managing such a data center, Sente (registered trademark) provided by Nomura Research Institute, Ltd., OpenTP1001 (registered trademark) provided by Hitachi, Ltd., and a system provided by Fujitsu Ltd. There is a walker.
Further, a power consumption reduction device for a network connection device is known (see Patent Document 2).

特開２００８−２２５７９３号公報JP 2008-225793 A 特開２００７−９７１２６号公報JP 2007-97126 A

The Green Grid Association、［online］、インターネット＜URL：http://www.thegreengrid.org＞The Green Grid Association, [online], Internet <URL: http://www.thegreengrid.org>

現行の運用管理システムはロードバランサを備え、アクセス数が少ない場合でも個々のサーバに要求を分散させる。そのようなロードバランサには、それに接続されているサーバに均等にアクセスを分配するものが多い。この場合、各サーバの処理能力や省エネの度合いは無視された形でアクセスが分配されうる。 The current operation management system has a load balancer and distributes requests to individual servers even when the number of accesses is small. Many such load balancers distribute access evenly to the servers connected to it. In this case, access can be distributed in a manner that ignores the processing capacity and the degree of energy saving of each server.

サーバの特性について、製造年月日や機器の種類やメーカや装備（メモリ、ディスク）や電源ユニットなどの違いにより、処理能力や省エネの度合いはサーバごとに異なる場合が多い。例えば、同じＷＥＢ（World Wide Web）サーバで同じプロセスを走らせた場合でも、メーカが異なればそのプロセスで消費される電力は異なりうる。したがって、現行の運用管理システムにおけるアクセスの分配の方法は、省エネ化の観点からは必ずしも最適な方法とは言えない。 Regarding server characteristics, the processing capacity and the degree of energy saving often differ from server to server depending on the date of manufacture, type of equipment, manufacturer, equipment (memory, disk), power supply unit, and the like. For example, even when the same process is run on the same WEB (World Wide Web) server, the power consumed by the process may differ depending on the manufacturer. Therefore, the access distribution method in the current operation management system is not necessarily the optimum method from the viewpoint of energy saving.

本発明はこうした課題に鑑みてなされたものであり、その目的は、複数の要求処理ユニットで並列的に要求を処理する際の消費電力を低減できる運用管理技術の提供にある。 The present invention has been made in view of such problems, and an object thereof is to provide an operation management technique capable of reducing power consumption when a request is processed in parallel by a plurality of request processing units.

本発明のある態様は運用管理装置に関する。この運用管理装置は、複数の要求処理ユニットがネットワークで接続されることによって構成された情報処理装置に対する要求の負荷の履歴を保持する負荷履歴保持部と、負荷履歴保持部を参照して今後発生する負荷を予測する負荷予測部と、負荷予測部によって予測された負荷が所定の第１しきい値より少ない場合、少なくともひとつの要求処理ユニットを、要求を受付可能な第１状態よりも省電力の第２状態に設定する状態設定部と、ネットワークから得られる情報から、複数の要求処理ユニットの稼動状況が示された指標を算出する指標算出部と、を備える。状態設定部は、指標算出部によって算出された指標に基づいて、第２状態に設定する要求処理ユニットを選択する。 One embodiment of the present invention relates to an operation management apparatus. This operation management device is generated in the future with reference to a load history holding unit for holding a load history of a request for an information processing device configured by connecting a plurality of request processing units via a network, and a load history holding unit. When the load predicted by the load predictor and the load predicted by the load predictor are smaller than the predetermined first threshold, at least one request processing unit is saved more power than the first state in which a request can be accepted. A state setting unit that sets the second state, and an index calculation unit that calculates an index indicating the operating status of the plurality of request processing units from information obtained from the network. The state setting unit selects a request processing unit to be set to the second state based on the index calculated by the index calculation unit.

「要求」とは、例えば情報処理装置に対する処理の要求であってもよい。
「負荷」とは、例えば要求の量を示す値であってもよい。 The “request” may be a request for processing to the information processing apparatus, for example.
The “load” may be a value indicating the amount of request, for example.

この態様によると、指標算出部によって算出された指標に基づいて、第２状態に設定する要求処理ユニットを選択できる。 According to this aspect, the request processing unit to be set to the second state can be selected based on the index calculated by the index calculation unit.

なお、以上の構成要素の任意の組み合わせや、本発明の構成要素や表現を装置、方法、システム、コンピュータプログラム、コンピュータプログラムを格納した記録媒体などの間で相互に置換したものもまた、本発明の態様として有効である。 It should be noted that any combination of the above-described constituent elements, or those obtained by replacing the constituent elements and expressions of the present invention with each other between apparatuses, methods, systems, computer programs, recording media storing computer programs, and the like are also included in the present invention. It is effective as an embodiment of

本発明によれば、複数の要求処理ユニットで並列的に要求を処理する際の消費電力を低減できる。 According to the present invention, it is possible to reduce power consumption when processing requests in parallel by a plurality of request processing units.

実施の形態に係る運用管理装置を備える情報処理システムを示す概略図である。It is the schematic which shows an information processing system provided with the operation management apparatus which concerns on embodiment. 図１におけるアクセスの流れを説明するための説明図である。It is explanatory drawing for demonstrating the flow of access in FIG. 図１のロードバランサの機能および構成を示すブロック図である。It is a block diagram which shows the function and structure of the load balancer of FIG. 図３の接続保持部を示すデータ構造図である。It is a data structure figure which shows the connection holding | maintenance part of FIG. 図３の第１サーバ群状態保持部を示すデータ構造図である。It is a data structure figure which shows the 1st server group state holding | maintenance part of FIG. 図１の運用管理装置の機能および構成を示すブロック図である。It is a block diagram which shows the function and structure of the operation management apparatus of FIG. 図６の通信情報保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the communication information holding part of FIG. 図６のサーバ情報保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the server information holding | maintenance part of FIG. 図６の第１指標保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the 1st parameter | index holding | maintenance part of FIG. 図６の使用状況保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the use condition holding | maintenance part of FIG. 図６の係数保持部に保持されるデータの一例を示すデータ構造図である。FIG. 7 is a data structure diagram illustrating an example of data held in a coefficient holding unit in FIG. 6. 図６の第２指標保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the 2nd parameter | index holding | maintenance part of FIG. 図６の第３指標保持部に保持されるデータの一例を示すデータ構造図である。It is a data structure figure which shows an example of the data hold | maintained at the 3rd parameter | index holding | maintenance part of FIG. 図６の負荷履歴保持部を示すデータ構造図である。It is a data structure figure which shows the load log | history holding part of FIG. 図１５（ａ）〜（ｄ）は、ステータス画面の代表画面図である。15A to 15D are representative screen diagrams of the status screen. 警告画面の代表画面図である。It is a typical screen figure of a warning screen. 状態設定画面の代表画面図である。It is a typical screen figure of a state setting screen. 図６の指標算出部における一連の処理を示すフローチャートである。It is a flowchart which shows a series of processes in the parameter | index calculation part of FIG. 図６の学習部および状態設定部における一連の処理を時系列に沿って示すチャートである。It is a chart which shows a series of processes in the learning part and state setting part of FIG. 6 along a time series. 図１の情報処理システムが証券会社のインターネット株取引システムを提供するデータセンタとして使用される場合の、負荷履歴保持部の一例を示すデータ構造図である。It is a data structure figure which shows an example of a load history holding | maintenance part in case the information processing system of FIG. 1 is used as a data center which provides the internet stock trading system of a securities company. 図１の情報処理システムが検索サービスを提供するデータセンタとして使用される場合の、負荷履歴保持部の一例を示すデータ構造図である。FIG. 2 is a data structure diagram illustrating an example of a load history holding unit when the information processing system of FIG. 1 is used as a data center that provides a search service. ある実施の形態を大学の共用データセンタに適用した場合の負荷履歴保持部を示すデータ構造図である。It is a data structure figure which shows the load log | history holding part at the time of applying a certain embodiment to the shared data center of a university. 変形例に係る指標算出部における一連の処理を示すフローチャートである。It is a flowchart which shows a series of processes in the parameter | index calculation part which concerns on a modification.

以下、本発明を好適な実施の形態をもとに図面を参照しながら説明する。各図面に示される同一または同等の構成要素、部材、処理には、同一の符号を付するものとし、適宜重複した説明は省略する。 The present invention will be described below based on preferred embodiments with reference to the drawings. The same or equivalent components, members, and processes shown in the drawings are denoted by the same reference numerals, and repeated descriptions are omitted as appropriate.

実施の形態に係る運用管理装置は、複数の並列に配置されたサーバ群でユーザからのアクセスを処理している情報処理システムの管理サーバとして好適に利用される。運用管理装置は、ネットワークから情報処理システム宛に到来する要求の負荷を過去の運用実績から予測し、その予測値が少ない場合は不必要なサーバ群のＯＳを休眠させたり、サーバ群の電源自体をオフにする。また、サーバ群の電力特性は使用する機種等により異なる場合があるが、運用管理装置はネットワークを傍受することにより各サーバ群の電力特性を計測する。これにより例えば電力効率が悪いサーバ群からＯＳを休眠させたり電源をオフにしたりすることができ、情報処理システムが総合的に省エネ化される。 The operation management apparatus according to the embodiment is suitably used as a management server of an information processing system that processes access from a user with a plurality of servers arranged in parallel. The operation management apparatus predicts a load of a request coming from the network to the information processing system from past operation results, and when the predicted value is small, sleeps an unnecessary OS of the server group, or power supply itself of the server group Turn off. The power characteristics of the server group may vary depending on the model used, but the operation management apparatus measures the power characteristics of each server group by intercepting the network. As a result, for example, the OS can be put to sleep or the power can be turned off from a group of servers with low power efficiency, and the information processing system is comprehensively energy-saving.

図１は、実施の形態に係る運用管理装置１０１０を備える情報処理システム２を示す概略図である。情報処理システム２は、ネットワーク接続型のデータセンタであり、例えば証券会社のインターネット株取引システムを提供するデータセンタである。情報処理システム２は、運用管理装置１０１０と、情報処理装置１０２０と、ロードバランサ１０３０と、を備える。情報処理装置１０２０は、第１サーバ群１０２２ａと、第２サーバ群１０２２ｂと、第３サーバ群１０２２ｃと、第４サーバ群１０２２ｄと、第５サーバ群１０２２ｅと、を含む。第１サーバ群１０２２ａは、第１フロントエンドサーバ１０２４ａと、第１アプリケーションサーバ１０２６ａと、第１データベースサーバ１０２８ａと、を含む。第２サーバ群１０２２ｂ、第３サーバ群１０２２ｃ、第４サーバ群１０２２ｄ、および第５サーバ群１０２２ｅは、それぞれ第１サーバ群１０２２ａと同等の構成を有する。 FIG. 1 is a schematic diagram showing an information processing system 2 including an operation management apparatus 1010 according to the embodiment. The information processing system 2 is a network connection type data center, for example, a data center that provides an Internet stock trading system of a securities company. The information processing system 2 includes an operation management device 1010, an information processing device 1020, and a load balancer 1030. The information processing apparatus 1020 includes a first server group 1022a, a second server group 1022b, a third server group 1022c, a fourth server group 1022d, and a fifth server group 1022e. The first server group 1022a includes a first front-end server 1024a, a first application server 1026a, and a first database server 1028a. The second server group 1022b, the third server group 1022c, the fourth server group 1022d, and the fifth server group 1022e each have a configuration equivalent to that of the first server group 1022a.

以下、第１サーバ群１０２２ａ、第２サーバ群１０２２ｂ、第３サーバ群１０２２ｃ、第４サーバ群１０２２ｄ、第５サーバ群１０２２ｅ、はサーバ群１０２２と総称される場合がある。また、第１フロントエンドサーバ１０２４ａ、第２フロントエンドサーバ１０２４ｂ、第３フロントエンドサーバ１０２４ｃ、第４フロントエンドサーバ１０２４ｄ、第５フロントエンドサーバ１０２４ｅ、はフロントエンドサーバ１０２４と総称される場合がある。また、第１アプリケーションサーバ１０２６ａ、第２アプリケーションサーバ１０２６ｂ、第３アプリケーションサーバ１０２６ｃ、第４アプリケーションサーバ１０２６ｄ、第５アプリケーションサーバ１０２６ｅ、はアプリケーションサーバ１０２６と総称される場合がある。また、第１データベースサーバ１０２８ａ、第２データベースサーバ１０２８ｂ、第３データベースサーバ１０２８ｃ、第４データベースサーバ１０２８ｄ、第５データベースサーバ１０２８ｅ、はデータベースサーバと総称される場合がある。 Hereinafter, the first server group 1022a, the second server group 1022b, the third server group 1022c, the fourth server group 1022d, and the fifth server group 1022e may be collectively referred to as a server group 1022. The first front-end server 1024a, the second front-end server 1024b, the third front-end server 1024c, the fourth front-end server 1024d, and the fifth front-end server 1024e may be collectively referred to as a front-end server 1024. Further, the first application server 1026a, the second application server 1026b, the third application server 1026c, the fourth application server 1026d, and the fifth application server 1026e may be collectively referred to as an application server 1026. The first database server 1028a, the second database server 1028b, the third database server 1028c, the fourth database server 1028d, and the fifth database server 1028e may be collectively referred to as a database server.

情報処理システム２は、外部ネットワーク１００４と接続され同じく外部ネットワーク１００４に接続されている少なくともひとつのユーザ端末１００６から要求を受ける。ここで要求とは、例えばユーザ端末１００６からのアクセスである。アクセスとは、ユーザ端末１００６と情報処理システム２のひとつのサーバ群１０２２とが接続を確立して一連の情報をやりとりすることである。異なるユーザ端末からのアクセスは異なるアクセスであり、同じユーザ端末１００６からでも異なるアクセスがなされうる。
外部ネットワーク１００４は、例えばＬＡＮ（Local Area Network）・ＷＡＮ（Wide Area Network）・インターネットである。ユーザ端末１００６は、ユーザが使用するコンピュータであり、例えば有線で外部ネットワーク１００４に接続された家庭用デスクトップコンピュータや、無線で外部ネットワーク１００４に接続されたラップトップコンピュータである。 The information processing system 2 is connected to the external network 1004 and receives a request from at least one user terminal 1006 that is also connected to the external network 1004. Here, the request is access from the user terminal 1006, for example. Access means that the user terminal 1006 and one server group 1022 of the information processing system 2 establish a connection and exchange a series of information. Accesses from different user terminals are different accesses, and different accesses can be made from the same user terminal 1006.
The external network 1004 is, for example, a local area network (LAN), a wide area network (WAN), or the Internet. The user terminal 1006 is a computer used by the user, such as a home desktop computer connected to the external network 1004 by wire or a laptop computer connected to the external network 1004 by wireless.

ロードバランサ１０３０は、外部ネットワーク１００４と接続され、また情報処理装置１０２０に含まれる個々のサーバ群１０２２と内部ネットワーク３を介して接続される。ロードバランサ１０３０は、データの流れの観点からは情報処理装置１０２０と外部ネットワーク１００４との間に位置し、外部ネットワーク１００４からの情報処理装置１０２０に対するアクセスを仲介する。
ロードバランサ１０３０はさらに、情報処理装置１０２０に含まれる複数のサーバ群１０２２のうちアクセスを受付可能な状態（以下、稼動状態と称する）に設定されたサーバ群に、ユーザからのアクセスを割り当てる。なお、サーバ群１０２２の状態として、稼動状態と省電力状態とが基本的に規定される。 The load balancer 1030 is connected to the external network 1004 and is connected to each server group 1022 included in the information processing apparatus 1020 via the internal network 3. The load balancer 1030 is located between the information processing apparatus 1020 and the external network 1004 from the viewpoint of data flow, and mediates access to the information processing apparatus 1020 from the external network 1004.
The load balancer 1030 further assigns access from the user to a server group set in a state where access can be accepted (hereinafter referred to as an operating state) among the plurality of server groups 1022 included in the information processing apparatus 1020. Note that the operating state and the power saving state are basically defined as the state of the server group 1022.

運用管理装置１０１０は、情報処理装置１０２０およびロードバランサ１０３０を管理する管理サーバである。運用管理装置１０１０は以下で説明する負荷予測機能などの他に、例えば株式会社野村総合研究所が提供する先手（登録商標）と同様のサーバ運用管理のための機能を搭載する。 The operation management apparatus 1010 is a management server that manages the information processing apparatus 1020 and the load balancer 1030. In addition to the load prediction function described below, the operation management apparatus 1010 is equipped with a function for server operation management similar to the first (registered trademark) provided by Nomura Research Institute, Ltd., for example.

運用管理装置１０１０は、外部ネットワーク１００４、ロードバランサ１０３０、内部ネットワーク３、のそれぞれを通過するパケットを傍受する。また運用管理装置１０１０は、各サーバ群１０２２の内部でやりとりされるパケットも傍受する。これは例えば、フロントエンドサーバ１０２４とアプリケーションサーバ１０２６とを接続するサブネットや、アプリケーションサーバ１０２６とデータベースサーバ１０２８とを接続するサブネットに解析装置１００が接続されることによって実現される。内部ネットワーク３は、各サーバ群１０２２のサーバを相互接続するこれらのサブネットを含む。
「傍受する」ことは、傍受対象のパケットを消滅させることなくその内容を取得することであってもよく、例えばパケットをそれに含まれる終点ＩＰアドレスによらずに取得することであってもよい。
運用管理装置１０１０は、傍受したパケットの内容を蓄積し、ＩＰアドレスでソートすることで各ＩＰアドレスに対応するサーバやロードバランサ１０３０における仕事の内訳を導出する。 The operation management apparatus 1010 intercepts packets passing through the external network 1004, the load balancer 1030, and the internal network 3. The operation management apparatus 1010 also intercepts packets exchanged inside each server group 1022. This is realized, for example, by connecting the analysis apparatus 100 to a subnet that connects the front-end server 1024 and the application server 1026 or a subnet that connects the application server 1026 and the database server 1028. The internal network 3 includes these subnets that interconnect the servers of each server group 1022.
“Intercepting” may be acquiring the contents of the packet to be intercepted without annihilating it, for example, acquiring the packet regardless of the destination IP address included in the packet.
The operation management apparatus 1010 accumulates the contents of intercepted packets and sorts them by IP address, thereby deriving a work breakdown in the server or load balancer 1030 corresponding to each IP address.

運用管理装置１０１０は、例えば各サーバに電力を供給するバスバー方式のテーブルタップ型のＰＤＵから、各サーバに供給された電力の計測値を取得し蓄積する。運用管理装置１０１０は、蓄積された電力の計測値および上記の仕事の内訳から、サーバ群１０２２の稼働状況が示された指標、例えばサーバ群１０２２における単位有効仕事量あたりの消費電力を算出する。 The operation management apparatus 1010 acquires and accumulates measured values of power supplied to each server from, for example, a bus bar type table tap PDU that supplies power to each server. The operation management apparatus 1010 calculates an index indicating the operating status of the server group 1022, for example, the power consumption per unit effective work amount in the server group 1022, from the accumulated measured power value and the breakdown of the work.

運用管理装置１０１０は、外部ネットワーク１００４から情報処理装置１０２０へのアクセスによる負荷を算出し記録する一方、過去に要求された負荷をもとに今後発生する負荷を予測する。運用管理装置１０１０は、その予測された負荷が所定の値より少なくなると、算出されたサーバ群１０２２の稼働状況が示された指標に基づいて、稼動状態にあるサーバ群のうちアクセスの処理に不要なサーバ群を選択し、選択されたサーバ群を稼動状態よりも省電力の省電力状態に設定する。 The operation management apparatus 1010 calculates and records a load due to access from the external network 1004 to the information processing apparatus 1020, while predicting a load that will occur in the future based on a load requested in the past. When the predicted load becomes smaller than a predetermined value, the operation management apparatus 1010 is unnecessary for access processing among the server groups in the operating state based on the index indicating the calculated operating state of the server group 1022. A server group is selected, and the selected server group is set to a power saving state that is more power saving than the operating state.

ここで負荷とは、例えばサーバ群１０２２が行う仕事の量を表す値であり、単位時間当たりのアクセスの数を基に定められる。例えば負荷は、単位時間当たりのアクセスの数と比例関係などの数学的関係を有する値であってもよい。また、負荷は単位時間当たりのアクセスの数に上限値または下限値若しくはその両方を課した値であってもよい。以下では、負荷が単に単位時間当たりのアクセスの数（以下、アクセス数と称す）である場合について説明する。
なお、負荷が所定の値以上の場合は運用管理装置１０１０は全てのサーバ群１０２２を稼動状態に設定する。 Here, the load is a value representing the amount of work performed by the server group 1022, for example, and is determined based on the number of accesses per unit time. For example, the load may be a value having a mathematical relationship such as a proportional relationship with the number of accesses per unit time. The load may be a value obtained by imposing an upper limit value, a lower limit value, or both on the number of accesses per unit time. Hereinafter, a case where the load is simply the number of accesses per unit time (hereinafter referred to as the number of accesses) will be described.
If the load is equal to or greater than a predetermined value, the operation management apparatus 1010 sets all the server groups 1022 to the operating state.

情報処理装置１０２０は、外部ネットワーク１００４からのアクセスを処理する。情報処理装置１０２０は、複数のサーバ群１０２２を含み、そのそれぞれのサーバ群１０２２はアクセスの処理単位である要求処理ユニットとして機能する。
本実施の形態では、ＴＣＰ（Transmission Control Protocol）／ＩＰ（Internet Protocol）プロトコルにしたがった通信が行われる場合を考える。 The information processing apparatus 1020 processes access from the external network 1004. The information processing apparatus 1020 includes a plurality of server groups 1022, and each of the server groups 1022 functions as a request processing unit which is an access processing unit.
In the present embodiment, a case is considered in which communication is performed according to the TCP (Transmission Control Protocol) / IP (Internet Protocol) protocol.

サーバ群１０２２はいわゆる３階層のサーバ群であり、これら１セットでアクセスの処理単位を構成する。フロントエンドサーバ１０２４は、ＷＥＢサーバとも呼ばれ、ＨＴＴＰ（HyperText Transfer Protocol）に則り、ユーザ端末１００６のＷＥＢブラウザに対して、ＨＴＭＬ（HyperText Markup Language）や画像などのオブジェクトの表示を提供するサービスが動作するサーバコンピュータである。アプリケーションサーバ１０２６は、フロントエンドサーバ１０２４からジャバサーブレット（Java Servlet、ジャバは登録商標）の処理などのアプリケーションに関する機能を切り出して実現するサーバコンピュータである。データベースサーバ１０２８は、アプリケーションサーバ１０２６のアプリケーションが使用するデータが格納されるサーバコンピュータである。フロントエンドサーバ１０２４、アプリケーションサーバ１０２６、およびデータベースサーバ１０２８は、公知の情報処理技術を使用して実現される。本実施の形態では、フロントエンドサーバ１０２４とアプリケーションサーバ１０２６とデータベースサーバ１０２８とは別個のサーバであり、この順に直列に接続されている。 The server group 1022 is a so-called three-level server group, and one set of these constitutes an access processing unit. The front-end server 1024 is also called a WEB server, and a service that provides display of objects such as HTML (HyperText Markup Language) and images to the WEB browser of the user terminal 1006 operates in accordance with HTTP (HyperText Transfer Protocol). Server computer. The application server 1026 is a server computer that implements functions related to applications such as Java Servlet (Java Servlet, Java is a registered trademark) processing from the front-end server 1024. The database server 1028 is a server computer that stores data used by applications of the application server 1026. The front-end server 1024, the application server 1026, and the database server 1028 are realized using a known information processing technique. In the present embodiment, the front-end server 1024, the application server 1026, and the database server 1028 are separate servers, and are connected in series in this order.

サーバ群１０２２のフロントエンドサーバ１０２４は内部ネットワーク３を介して運用管理装置１０１０およびロードバランサ１０３０と接続される。各サーバ群１０２２は運用管理装置１０１０によって管理されている。 The front end server 1024 of the server group 1022 is connected to the operation management apparatus 1010 and the load balancer 1030 via the internal network 3. Each server group 1022 is managed by the operation management apparatus 1010.

なお、個々のサーバ群１０２２に含まれるサーバは個々にもしくは全体としてＯＳに管理されている。個々のサーバ群の稼動状態は、ユーザからのアクセスを即時処理可能な状態である。この状態は、サーバ群１０２２がユーザからのアクセスを処理しつつ新たなアクセスを処理可能である状態と、ユーザからのアクセスがなく仕事が発生していないアイドル状態と、を含む。稼動状態では、たとえアイドル状態であっても少なくとも基本ＯＳ機能とネットワークモニタリングのタスクは稼動しており、その分電力を消費する。本発明者の当業者としての経験から、このアイドル状態での消費電力は、サーバ機器の種類によってピークアクセス数での稼動時のおよそ３０％から７０％の範囲にある。特に、最新のブレードサーバでは、その高密度な回路構成と狭いスペースに多くの半導体素子が設置されている都合上、アイドル状態でも消費電力は大きくなる。一部のブレードサーバではアイドル状態においてピークアクセス数での稼動時の７０％もの電力を消費しているものも存在する。また、標準的なサーバ機器を用いる場合は６０％程度である。ここでピークアクセス数とは、サーバ群１０２２がその処理速度を落とさずに稼動できるアクセス数の範囲の上限値であり、サーバ群１０２２ごとにその仕様を基に予め定められている。ここで処理速度とは、サーバ群１０２２におけるユーザからのアクセスの処理速度である。 The servers included in each server group 1022 are managed by the OS individually or as a whole. The operating state of each server group is a state in which access from a user can be processed immediately. This state includes a state in which the server group 1022 can process a new access while processing an access from the user, and an idle state in which there is no access from the user and no work is generated. In the operating state, even in the idle state, at least the basic OS function and the network monitoring task are operating, and power is consumed accordingly. From the experience of the present inventor as a person skilled in the art, the power consumption in the idle state is in the range of approximately 30% to 70% during operation at the peak access number depending on the type of server device. In particular, the latest blade server consumes a large amount of power even in an idle state because of its high-density circuit configuration and many semiconductor elements installed in a narrow space. Some blade servers consume 70% of electric power during operation at the peak number of accesses in an idle state. When standard server equipment is used, it is about 60%. Here, the peak number of accesses is an upper limit value of the range of the number of accesses that the server group 1022 can operate without reducing its processing speed, and is predetermined for each server group 1022 based on its specifications. Here, the processing speed is a processing speed of access from the user in the server group 1022.

個々のサーバ群１０２２の省電力状態は、ユーザからのアクセスを即時処理できない状態である。この状態は、サーバ群１０２２へ電源は供給されているがそのサーバ群１０２２はユーザからのアクセスを処理できないＯＳ休眠状態を含む。サーバ群１０２２は運用管理装置１０１０から休眠導入信号を受信するとＯＳ休眠状態となる。このＯＳ休眠状態では、サーバ群１０２２のＯＳは休眠（ハイボネート）しており、ユーザからのアクセスがあってもそれを受け付けない。ＯＳ休眠状態にあるサーバ群１０２２は運用管理装置１０１０から休眠解除信号を受信すると、稼動状態に復帰する。ＯＳ休眠状態から稼動状態に復帰するためには通常数秒から数十秒かかる。また本発明者の当業者としての経験から、ＯＳ休眠状態におけるサーバ群１０２２の消費電力は、ピークアクセス数での稼働時のおよそ５％から１０％である。 The power saving state of each server group 1022 is a state where access from a user cannot be processed immediately. This state includes an OS sleep state in which power is supplied to the server group 1022 but the server group 1022 cannot process access from the user. When the server group 1022 receives the sleep introduction signal from the operation management apparatus 1010, the server group 1022 enters the OS sleep state. In this OS dormant state, the OS of the server group 1022 is hibernated and does not accept any access from the user. When the server group 1022 in the OS sleep state receives the sleep release signal from the operation management apparatus 1010, it returns to the operating state. It usually takes several seconds to several tens of seconds to return from the OS sleep state to the operating state. Further, from the experience of the present inventor as a person skilled in the art, the power consumption of the server group 1022 in the OS sleep state is approximately 5% to 10% during operation at the peak access number.

サーバ群１０２２の省電力状態はさらに、サーバ群１０２２への電源が遮断されている電源オフ状態を含む。サーバ群１０２２は運用管理装置１０１０から電源オフ信号を受信すると電源オフ状態となる。サーバ群１０２２は運用管理装置１０１０からＷＯＬ（Wake Up on LAN）信号などの電源オン信号を受信すると、稼動状態に復帰する。電源オフ状態から稼動状態に復帰するためには通常数分かかる。
本実施の形態ではサーバ群１０２２が省電力状態から稼動状態に復帰するためにかかる時間（以下、オーバヘッドと称す）と、省電力状態で低減される消費電力とのかねあいで、省電力状態とされるサーバ群の数が決定される。つまり、現在のアクセス数を処理するのに必要なぎりぎりの数のサーバ群だけを稼動状態としていると、突然のアクセス数の増大に対して対処できなくなる可能性がある。一方で稼動状態のサーバ群の数が多いほど待機電力由来の消費電力も大きくなる。そこでそれらの影響が拮抗するように、省電力状態とされるサーバ群の数が決定される。 The power saving state of the server group 1022 further includes a power off state in which power to the server group 1022 is cut off. When the server group 1022 receives a power-off signal from the operation management apparatus 1010, the server group 1022 enters a power-off state. When the server group 1022 receives a power-on signal such as a WOL (Wake Up on LAN) signal from the operation management apparatus 1010, the server group 1022 returns to an operating state. It usually takes several minutes to return from the power-off state to the operating state.
In the present embodiment, the server group 1022 is set in the power saving state in consideration of the time taken for the server group 1022 to return from the power saving state to the operating state (hereinafter referred to as overhead) and the power consumption reduced in the power saving state. The number of server groups to be determined is determined. In other words, if only the server group as many as necessary to process the current number of accesses is operating, there is a possibility that a sudden increase in the number of accesses cannot be handled. On the other hand, the power consumption derived from standby power increases as the number of active server groups increases. Therefore, the number of server groups to be placed in the power saving state is determined so that their influences are antagonized.

図２は、図１におけるアクセスの流れを説明するための説明図である。以下、１回のアクセスにおいて、少なくともひとつのパケットがユーザ端末１００６とアクセスが割り当てられたサーバ群との間でやりとりされる場合について説明する。このパケットは、送信元のＩＰアドレスである始点ＩＰアドレスと、受信先のＩＰアドレスである終点ＩＰアドレスと、シーケンス番号と、ポート番号と、タイムスタンプと、を含む。多くの場合において一回のアクセスにつき複数個のパケットがユーザ端末１００６とサーバ群との間を行き来する。図２では第３サーバ群１０２２ｃがアクセスに割り当てられたとする。ユーザ端末１００６のＩＰアドレスを「１７５．３４．１１．２１」、ロードバランサ１０３０のＩＰアドレスを「１００．１０．１０．１０」、第３サーバ群１０２２ｃに含まれる第３フロントエンドサーバ１０２４ｃのＩＰアドレスを「１２１．２１．１５．３」とする。 FIG. 2 is an explanatory diagram for explaining the flow of access in FIG. Hereinafter, a case where at least one packet is exchanged between the user terminal 1006 and a server group to which access is assigned in one access will be described. This packet includes a start point IP address that is a source IP address, an end point IP address that is a destination IP address, a sequence number, a port number, and a time stamp. In many cases, a plurality of packets are transferred between the user terminal 1006 and the server group for one access. In FIG. 2, it is assumed that the third server group 1022c is assigned to access. The IP address of the user terminal 1006 is “175.34.111.21”, the IP address of the load balancer 1030 is “100.10.10.10”, and the IP of the third front-end server 1024c included in the third server group 1022c. The address is “121.21.15.3”.

ロードバランサ１０３０は、ユーザ端末１００６に対して仮想サーバとして働く。つまり、外部ネットワーク１００４では、ユーザが情報処理装置１０２０が有する情報資源にアクセスしようとする場合、かかる情報資源のＵＲＬ（Uniform Resource Locator）がロードバランサ１０３０のＩＰアドレス「１００．１０．１０．１０」に名前解決されるよう設定されている。 The load balancer 1030 functions as a virtual server for the user terminal 1006. That is, in the external network 1004, when a user tries to access an information resource included in the information processing apparatus 1020, the URL (Uniform Resource Locator) of the information resource is the IP address “100.10.10.10” of the load balancer 1030. Is set to resolve names.

まずユーザは、ユーザ端末１００６のウェブブラウザに対して情報処理装置１０２０が有する情報資源のＵＲＬを指定する。ユーザ端末１００６のウェブブラウザによって始点ＩＰアドレスを「１７５．３４．１１．２１」、終点ＩＰアドレスを「１００．１０．１０．１０」とした第１パケットＰ１が生成され、外部ネットワーク１００４に送られる。ロードバランサ１０３０は第１パケットＰ１を受信し、稼動状態にあるサーバ群のなかから第３サーバ群１０２２ｃを選択してこのアクセスを割り当てる。ロードバランサ１０３０は、第１パケットＰ１の始点ＩＰアドレスはそのままにして終点ＩＰアドレスを「１２１．２１．１５．３」とした第２パケットＰ２を内部ネットワーク３に送出する。第３サーバ群１０２２ｃの第３フロントエンドサーバ１０２４ｃは自己宛の第２パケットＰ２を受信し、第２パケットＰ２のポート番号やそのパケットに搭載されるデータに基づき処理を行う。その処理の結果ユーザ端末１００６へ戻すべき情報は、始点ＩＰアドレスを「１２１．２１．１５．３」、終点ＩＰアドレスを「１７５．３４．１１．２１」とした第３パケットＰ３に含められ、第３フロントエンドサーバ１０２４ｃから内部ネットワーク３に送出される。ロードバランサ１０３０は第３パケットＰ３を受信する。ロードバランサ１０３０は、第３パケットＰ３の終点ＩＰアドレスはそのままにして始点ＩＰアドレスを「１００．１０．１０．１０」とした第４パケットＰ４を外部ネットワーク１００４に送る。ユーザ端末１００６は自己宛の第４パケットＰ４を外部ネットワーク１００４から受信する。
以下、ユーザ端末１００６からロードバランサ１０３０に送られるパケット（第１パケットＰ１）を総称して行きパケット、ロードバランサ１０３０から情報処理装置１０２０へ送られるパケット（第２パケットＰ２）を総称して割当パケットという。 First, the user specifies the URL of the information resource that the information processing apparatus 1020 has to the web browser of the user terminal 1006. A first packet P1 having a start point IP address “175.34.11.21” and an end point IP address “100.10.10.10” is generated by the web browser of the user terminal 1006 and sent to the external network 1004. . The load balancer 1030 receives the first packet P1, selects the third server group 1022c from the server groups in the operating state, and allocates this access. The load balancer 1030 sends the second packet P2 with the end point IP address “121.21.15.3” to the internal network 3 while keeping the start point IP address of the first packet P1. The third front-end server 1024c of the third server group 1022c receives the second packet P2 addressed to itself, and performs processing based on the port number of the second packet P2 and data loaded in the packet. Information to be returned to the user terminal 1006 as a result of the processing is included in the third packet P3 with the start point IP address “121.21.15.3” and the end point IP address “1755.34.111.21”. The data is transmitted from the third front-end server 1024c to the internal network 3. The load balancer 1030 receives the third packet P3. The load balancer 1030 sends the fourth packet P4 having the start point IP address “100.10.10.10” to the external network 1004 while keeping the end point IP address of the third packet P3. The user terminal 1006 receives the fourth packet P4 addressed to itself from the external network 1004.
Hereinafter, the packet (first packet P1) sent from the user terminal 1006 to the load balancer 1030 is generically referred to as a packet, and the packet (second packet P2) sent from the load balancer 1030 to the information processing apparatus 1020 is generically assigned packet. That's it.

図３は、ロードバランサ１０３０の機能および構成を示すブロック図である。ここに示す各ブロックは、ハードウエア的には、コンピュータのＣＰＵ（central processing unit）をはじめとする素子や機械装置で実現でき、ソフトウエア的にはコンピュータプログラム等によって実現されるが、ここでは、それらの連携によって実現される機能ブロックを描いている。したがって、これらの機能ブロックはハードウエア、ソフトウエアの組合せによっていろいろなかたちで実現できることは、本明細書に触れた当業者には理解されるところである。 FIG. 3 is a block diagram showing the function and configuration of the load balancer 1030. Each block shown here can be realized by hardware such as a computer (CPU) (central processing unit) and other elements and mechanical devices, and software can be realized by a computer program or the like. Here, The functional block realized by those cooperation is drawn. Therefore, it is understood by those skilled in the art who have touched this specification that these functional blocks can be realized in various forms by a combination of hardware and software.

ロードバランサ１０３０は、第１サーバ群状態保持部１３２２と、接続保持部１３２４と、要求取得部１０３４と、要求割当部１０３６と、を備える。
第１サーバ群状態保持部１３２２は、現在設定されているであろうサーバ群１０２２の状態を記憶するテーブルである。第１サーバ群状態保持部１３２２の詳細は後述する。 The load balancer 1030 includes a first server group state holding unit 1322, a connection holding unit 1324, a request acquisition unit 1034, and a request allocation unit 1036.
The first server group state holding unit 1322 is a table that stores the state of the server group 1022 that would be currently set. Details of the first server group state holding unit 1322 will be described later.

接続保持部１３２４は、同一アクセス内のパケットは同じサーバ群へ送られること（以下、アクセスの同一性と称す）を保証するためのテーブルである。図４は、接続保持部１３２４を示すデータ構造図である。接続保持部１３２４には、後述する接続更新部１３６８によってアクセスに対応するエントリ１１１８が生成される。一回のアクセスに対してひとつのエントリが対応する。接続保持部１３２４のエントリ１１１８は、アクセスしてきたユーザ端末１００６のＩＰアドレスであるユーザ端末ＩＰアドレス１２１０と、ロードバランサ１０３０のＩＰアドレスであるロードバランサＩＰアドレス１２１２と、ロードバランサ１０３０によってそのアクセスに割り当てられたサーバ群のフロントエンドサーバのＩＰアドレスである割当サーバ群ＩＰアドレス１２１４と、ユーザ端末１００６が同一の場合にアクセスの異同を判別するためのシーケンス番号１２１６と、を有する。同一ユーザ端末において、異なるアクセスには異なるシーケンス番号が割り振られる。以下、サーバ群のフロントエンドサーバのＩＰアドレスを単にサーバ群のＩＰアドレスと称す。 The connection holding unit 1324 is a table for guaranteeing that packets within the same access are sent to the same server group (hereinafter referred to as access identity). FIG. 4 is a data structure diagram showing the connection holding unit 1324. In the connection holding unit 1324, an entry 1118 corresponding to access is generated by a connection update unit 1368 described later. One entry corresponds to one access. The entry 1118 of the connection holding unit 1324 is assigned to the access by the user terminal IP address 1210 that is the IP address of the user terminal 1006 that has accessed, the load balancer IP address 1212 that is the IP address of the load balancer 1030, and the load balancer 1030. The assigned server group IP address 1214, which is the IP address of the front end server of the server group, and the sequence number 1216 for determining the difference in access when the user terminal 1006 is the same. Different sequence numbers are assigned to different accesses in the same user terminal. Hereinafter, the IP address of the front end server of the server group is simply referred to as the IP address of the server group.

図３に戻る。要求取得部１０３４は、外部ネットワーク１００４と接続される。要求取得部１０３４は、外部ネットワーク１００４から到来するユーザ端末１００６からのアクセスの行きパケットを取得する。この際、要求取得部１０３４は行きパケットに含まれる終点ＩＰアドレスを基に自己宛のパケットであるか否かを判別する。要求取得部１０３４は、取得した行きパケットを要求割当部１０３６に渡す。 Returning to FIG. The request acquisition unit 1034 is connected to the external network 1004. The request acquisition unit 1034 acquires an access outbound packet from the user terminal 1006 that arrives from the external network 1004. At this time, the request acquisition unit 1034 determines whether the packet is addressed to itself based on the end point IP address included in the outgoing packet. The request acquisition unit 1034 passes the acquired outbound packet to the request allocation unit 1036.

なお、本明細書において「渡す」とは、ある機能ブロックからある機能ブロックに情報要素に対する処理が移ることを意味する。要求取得部１０３４と要求割当部１０３６との間で言うと、渡すとは、例えば要求取得部１０３４が図示しない一時メモリを有し、取得した行きパケットをそこに蓄えた上で、要求割当部１０３６からの要請に応じて適宜行きパケットを一時メモリから要求割当部１０３６に伝達することである。また渡すとは、第１記憶装置１０３２が図示しない記憶領域を有し、要求取得部１０３４は取得した行きパケットをその記憶領域に書き込み、要求割当部１０３６は適宜その記憶領域から必要な行きパケットを読み出して処理することであってもよい。 In this specification, “pass” means that processing for an information element is transferred from a certain functional block to a certain functional block. Speaking between the request acquisition unit 1034 and the request allocation unit 1036, for example, the request acquisition unit 1034 has a temporary memory (not shown) and stores the acquired outgoing packets in the request allocation unit 1036. The packet is appropriately transmitted in response to the request from the temporary memory and transmitted to the request allocation unit 1036 from the temporary memory. In addition, to pass, the first storage device 1032 has a storage area (not shown), the request acquisition unit 1034 writes the acquired outbound packet to the storage area, and the request allocation unit 1036 appropriately transmits the required outbound packet from the storage area. It may be read and processed.

要求割当部１０３６は、ユーザ端末１００６からのアクセスの行きパケットを稼動状態にあるサーバ群のうちのひとつに割り当てる。特に傾斜配分モードでは、ユーザ端末１００６からのアクセスが新規のアクセスである場合、要求割当部１０３６は運用管理装置１０１０から取得する指標に基づいて、その新規のアクセスをサーバ群のうちのひとつに割り当てる。要求割当部１０３６は、同一接続判断部１３６２と、サーバ群選択部１３６４と、アドレス変換部１３６６と、接続更新部１３６８と、を含む。 The request allocation unit 1036 allocates an outgoing packet from the user terminal 1006 to one of the server groups in the operating state. Particularly in the slope distribution mode, when the access from the user terminal 1006 is a new access, the request allocation unit 1036 allocates the new access to one of the server groups based on the index acquired from the operation management apparatus 1010. . The request allocation unit 1036 includes an identical connection determination unit 1362, a server group selection unit 1364, an address conversion unit 1366, and a connection update unit 1368.

同一接続判断部１３６２は、要求取得部１０３４から行きパケットを取得し、その行きパケットが新規のアクセスによるものか否かを判別する。同一接続判断部１３６２は、取得した行きパケットの始点ＩＰアドレスとシーケンス番号とを読み取る。同一接続判断部１３６２は、読み取られた始点ＩＰアドレスとシーケンス番号とをキーとして接続保持部１３２４のエントリを検索し、それらと一致するエントリが存在する場合、そのエントリに含まれる割り当てられたサーバ群の割当サーバ群ＩＰアドレスを取得する。同一接続判断部１３６２は、この取得された割当サーバ群ＩＰアドレスと行きパケットとをアドレス変換部１３６６に渡す。 The same connection determination unit 1362 acquires the outgoing packet from the request acquisition unit 1034 and determines whether the outgoing packet is due to a new access. The same connection determination unit 1362 reads the start point IP address and sequence number of the acquired outbound packet. The same connection determination unit 1362 searches the entry of the connection holding unit 1324 using the read start point IP address and sequence number as a key, and if there is an entry that matches these, the assigned server group included in the entry The assigned server group IP address is acquired. The same connection determination unit 1362 passes the acquired allocation server group IP address and the outgoing packet to the address conversion unit 1366.

なお、このように一致するエントリが存在する場合は、当該行きパケットは既にあるサーバ群１０２２（エントリの割当サーバ群ＩＰアドレスで指定されるサーバ群１０２２）に割り当てられたアクセスのなかのひとつのパケットである。アドレス変換部１３６６は、渡された行きパケットの終点ＩＰアドレスを、同一接続判断部１３６２によって接続保持部１３２４から取得された割当サーバ群ＩＰアドレスに変換する。このように終点ＩＰアドレスが変換された行きパケットはアドレス変換部１３６６から割当パケットとして内部ネットワーク３に送出される。 If there is such a matching entry, the outgoing packet is one packet in the access assigned to the existing server group 1022 (the server group 1022 specified by the assigned server group IP address of the entry). It is. The address conversion unit 1366 converts the end point IP address of the passed packet to the assigned server group IP address acquired from the connection holding unit 1324 by the same connection determination unit 1362. The outgoing packet with the end point IP address converted in this way is sent from the address conversion unit 1366 to the internal network 3 as an assigned packet.

一致するエントリが存在しない場合は、同一接続判断部１３６２は当該行きパケットをサーバ群選択部１３６４に渡す。この場合は同一接続判断部１３６２は新規のアクセスを検知したと言うことができる。 If there is no matching entry, the identical connection determination unit 1362 passes the outgoing packet to the server group selection unit 1364. In this case, it can be said that the same connection determination unit 1362 has detected a new access.

サーバ群選択部１３６４は、同一接続判断部１３６２から新規のアクセスに対応する行きパケットを受け取ると、第１サーバ群状態保持部１３２２を参照してそのアクセスを処理させるサーバ群１０２２を選択する。第１サーバ群状態保持部１３２２は、運用管理装置１０１０におけるアクセス数の予測および算出される指標に基づき運用管理装置１０１０によって更新される。 When the server group selection unit 1364 receives the outgoing packet corresponding to the new access from the same connection determination unit 1362, the server group selection unit 1364 refers to the first server group state holding unit 1322 and selects the server group 1022 to process the access. The first server group state holding unit 1322 is updated by the operation management apparatus 1010 based on the prediction of the number of accesses in the operation management apparatus 1010 and the calculated index.

図５は、第１サーバ群状態保持部１３２２を示すデータ構造図である。第１サーバ群状態保持部１３２２は、サーバ群１０２２のＩＰアドレス１２０２と、サーバ群１０２２の状態１２０４と、サーバ群１０２２の稼働率１２０６と、サーバ群１０２２の電力コスト１２０７と、を対応付けて保持する。 FIG. 5 is a data structure diagram showing the first server group state holding unit 1322. The first server group state holding unit 1322 holds the IP address 1202 of the server group 1022, the state 1204 of the server group 1022, the operation rate 1206 of the server group 1022, and the power cost 1207 of the server group 1022 in association with each other. To do.

サーバ群１０２２の稼働率１２０６は、サーバ群１０２２のピークアクセス数に対する現在そのサーバ群１０２２が処理しているアクセス数の割合を％単位で示す。この稼働率は、図示されない稼働率更新部によって、予め定められているサーバ群１０２２のピークアクセス数と、接続保持部１３２４から分かるサーバ群１０２２に現在割り当てられているアクセス数とから演算され更新されてもよい。あるいは、図示されない稼働率更新部が稼動状態にあるサーバ群１０２２から稼働率を取得し、第１サーバ群状態保持部１３２２の稼働率を更新してもよい。あるいはまた、第１サーバ群状態保持部１３２２の稼働率は運用管理装置１０１０によって更新されてもよい。 The operation rate 1206 of the server group 1022 indicates the ratio of the number of accesses that the server group 1022 is currently processing to the peak access number of the server group 1022 in% units. This operation rate is calculated and updated by an operation rate update unit (not shown) from a predetermined peak access number of the server group 1022 and the access number currently assigned to the server group 1022 known from the connection holding unit 1324. May be. Alternatively, an operating rate update unit (not shown) may acquire the operating rate from the server group 1022 in the operating state and update the operating rate of the first server group state holding unit 1322. Alternatively, the operation rate of the first server group state holding unit 1322 may be updated by the operation management apparatus 1010.

サーバ群１０２２の電力コスト１２０７は、ＷＥＢ通信用の単位仕事量に対する消費電力を表す。単位仕事量は、例えば単位データ量であってもよく、１つのアクセスであってもよい。サーバ群１０２２の電力コストがＷＥＢ通信用の単位データ量に対する消費電力を表す場合、この電力コストが１（Ｗ／ＧＢ）であることは、ＷＥＢサーバにおいて１ＧＢ分のＷＥＢ通信用のデータを処理するのに１Ｗ消費することを示す。ＷＥＢ通信用のデータ量をＷＥＢサーバに対する負荷と考えると、電力コストは、各ＷＥＢサーバにおいて同じ負荷（単位データ量）がかけられた場合にそのＷＥＢサーバで消費される電力を示す。
第１サーバ群状態保持部１３２２の電力コストの値は、運用管理装置１０１０によって更新される。 A power cost 1207 of the server group 1022 represents power consumption with respect to a unit work amount for WEB communication. The unit work amount may be, for example, a unit data amount or one access. When the power cost of the server group 1022 represents the power consumption with respect to the unit data amount for WEB communication, this power cost of 1 (W / GB) means that 1 GB of WEB communication data is processed in the WEB server. 1W is consumed. Considering the amount of data for WEB communication as a load on the WEB server, the power cost indicates the power consumed by the WEB server when the same load (unit data amount) is applied to each WEB server.
The value of the power cost of the first server group state holding unit 1322 is updated by the operation management apparatus 1010.

図３に戻る。サーバ群選択部１３６４は、第１サーバ群状態保持部１３２２に登録されたサーバ群のなかからサーバ群の状態を参照して稼動状態にあるサーバ群を抽出する。要求割当部１０３６が省電力モードに設定されているか通常モードに設定されているか傾斜配分モードに設定されているかによって、サーバ群選択部１３６４が稼動状態にあるサーバ群から新規のアクセスを処理させるサーバ群を選択するアルゴリズムは異なる。以下それぞれの場合について説明する。 Returning to FIG. The server group selection unit 1364 extracts a server group in an operating state with reference to the state of the server group from the server groups registered in the first server group state holding unit 1322. A server that causes the server group selection unit 1364 to process a new access depending on whether the request allocation unit 1036 is set to the power saving mode, the normal mode, or the slope distribution mode. The algorithm for selecting a group is different. Each case will be described below.

１．省電力モード
省電力モードでは、サーバ群選択部１３６４は、稼動状態にあるサーバ群の稼働率が１００％となるように、稼動状態にあるサーバ群から新規のアクセスを処理させるサーバ群を選択する。例えば図５の例では、サーバ群選択部１３６４は新規のアクセスを処理させるサーバ群として第３サーバ群１０２２ｃを選択する。また、例えば第１サーバ群１０２２ａ、第２サーバ群１０２２ｂ、第３サーバ群１０２２ｃが稼動状態に設定されており、第１サーバ群１０２２ａの稼働率が１００％、第２サーバ群１０２２ｂの稼働率が８０％、第３サーバ群１０２２ｃの稼働率が０％の場合、サーバ群選択部１３６４は新規のアクセスを処理させるサーバ群として第２サーバ群１０２２ｂを選択する。 1. Power Saving Mode In the power saving mode, the server group selection unit 1364 selects a server group for processing a new access from the server group in the operating state so that the operating rate of the server group in the operating state is 100%. . For example, in the example of FIG. 5, the server group selection unit 1364 selects the third server group 1022 c as a server group for processing a new access. Further, for example, the first server group 1022a, the second server group 1022b, and the third server group 1022c are set in an operating state, the operating rate of the first server group 1022a is 100%, and the operating rate of the second server group 1022b is When 80% and the availability factor of the third server group 1022c is 0%, the server group selection unit 1364 selects the second server group 1022b as a server group for processing a new access.

２．通常モード
通常モードでは、サーバ群選択部１３６４は、予め情報処理システム２の管理者によって設定されている負荷分散アルゴリズムにしたがって、新規のアクセスを処理させるのに最適なサーバ群を選択する。ここで使用される負荷分散アルゴリズムは、順番にサーバ群１０２２が選択されるラウンドロビン方式や、処理しているアクセス数が最小のサーバ群を選択する最小接続方式や、１番早く応答しているサーバ群を選択する最速方式などの公知のアルゴリズムである。 2. Normal Mode In the normal mode, the server group selection unit 1364 selects an optimal server group for processing a new access according to a load distribution algorithm set in advance by the administrator of the information processing system 2. The load distribution algorithm used here is the round robin method in which the server group 1022 is selected in order, the minimum connection method in which the server group with the smallest number of accesses being processed is selected, or the earliest response. This is a known algorithm such as a fastest method for selecting a server group.

３．傾斜配分モード
傾斜配分モードでは、サーバ群選択部１３６４は、第１サーバ群状態保持部１３２２を参照し、稼動状態にあるサーバ群から、新規のアクセスを処理させるサーバ群として電力コストが小さいサーバ群を優先的に選択する。 3. Inclined distribution mode In the inclined distribution mode, the server group selection unit 1364 refers to the first server group state holding unit 1322, and a server group with a low power cost as a server group for processing a new access from a server group in an operating state. Select with priority.

サーバ群選択部１３６４は、は、選択されたサーバ群のＩＰアドレスと新規のアクセスに対応する行きパケットとをアドレス変換部１３６６に渡す。アドレス変換部１３６６は、渡された行きパケットの終点ＩＰアドレスを、サーバ群選択部１３６４によって選択されたサーバ群のＩＰアドレスに変換する。このように終点ＩＰアドレスが変換された行きパケットはアドレス変換部１３６６から割当パケットとして内部ネットワーク３に送出される。 The server group selection unit 1364 passes the IP address of the selected server group and the outgoing packet corresponding to the new access to the address conversion unit 1366. The address conversion unit 1366 converts the end point IP address of the received outgoing packet into the IP address of the server group selected by the server group selection unit 1364. The outgoing packet with the end point IP address converted in this way is sent from the address conversion unit 1366 to the internal network 3 as an assigned packet.

接続更新部１３６８は、サーバ群選択部１３６４で新規のアクセスに対してサーバ群１０２２の選択が行われる毎に、その選択に関する情報をサーバ群選択部１３６４から取得し、接続保持部１３２４に対応するエントリを追加する。この選択に関する情報は、新規のアクセスを行ったユーザのユーザ端末１００６のユーザ端末ＩＰアドレスと、ロードバランサＩＰアドレスと、新規のアクセスに対して選択されたサーバ群の割当サーバ群ＩＰアドレスと、シーケンス番号と、を含む。
また、接続更新部１３６８は、適宜不要となったエントリを削除する。 Each time the server group selection unit 1364 selects a server group 1022 for a new access, the connection update unit 1368 acquires information related to the selection from the server group selection unit 1364 and corresponds to the connection holding unit 1324. Add an entry. Information regarding this selection includes the user terminal IP address of the user terminal 1006 of the user who has made a new access, the load balancer IP address, the assigned server group IP address of the server group selected for the new access, and the sequence And a number.
Also, the connection update unit 1368 deletes entries that are no longer necessary.

図６は、運用管理装置１０１０の機能および構成を示すブロック図である。ここに示す各ブロックは、ハードウエア的には、コンピュータのＣＰＵ（central processing unit）をはじめとする素子や機械装置で実現でき、ソフトウエア的にはコンピュータプログラム等によって実現されるが、ここでは、それらの連携によって実現される機能ブロックを描いている。したがって、これらの機能ブロックはハードウエア、ソフトウエアの組合せによっていろいろなかたちで実現できることは、本明細書に触れた当業者には理解されるところである。 FIG. 6 is a block diagram showing the function and configuration of the operation management apparatus 1010. Each block shown here can be realized by hardware such as a computer (CPU) (central processing unit) and other elements and mechanical devices, and software can be realized by a computer program or the like. Here, The functional block realized by those cooperation is drawn. Therefore, it is understood by those skilled in the art who have touched this specification that these functional blocks can be realized in various forms by a combination of hardware and software.

運用管理装置１０１０は、指標算出部１００と、指標送信部１５４と、表示制御部１２０と、負荷履歴保持部１１１２と、第２サーバ群状態保持部１１１４と、学習部１１３０と、状態設定部１１４０と、乖離判定部１１５０と、オーバライド部１１６０と、を備える。 The operation management apparatus 1010 includes an index calculation unit 100, an index transmission unit 154, a display control unit 120, a load history holding unit 1112, a second server group state holding unit 1114, a learning unit 1130, and a state setting unit 1140. And a divergence determination unit 1150 and an override unit 1160.

指標算出部１００は、外部ネットワーク１００４、ロードバランサ１０３０および内部ネットワーク３から得られる情報から、複数のサーバ群１０２２の稼動状況が示された指標を算出する。
指標算出部１００は、ネットワークインタフェース部１０２と、通信情報保持部１０４と、分類部１３２と、集計部１３０と、第１指標保持部１０８と、係数保持部１１４と、第１解析部１１６と、使用状況取得部１１０と、使用状況保持部１１２と、第２指標保持部１１８と、第２解析部１５２と、第３指標保持部１５８と、を含む。 The index calculation unit 100 calculates an index indicating the operating status of the plurality of server groups 1022 from information obtained from the external network 1004, the load balancer 1030, and the internal network 3.
The index calculation unit 100 includes a network interface unit 102, a communication information holding unit 104, a classification unit 132, a totaling unit 130, a first index holding unit 108, a coefficient holding unit 114, a first analysis unit 116, The usage status acquisition unit 110, the usage status holding unit 112, the second index holding unit 118, the second analysis unit 152, and the third index holding unit 158 are included.

ネットワークインタフェース部１０２は、傍受部１２２と、抽出部１２４と、フィルタ部１２６と、登録部１２８と、を含む。
傍受部１２２は、外部ネットワーク１００４とロードバランサ１０３０と内部ネットワーク３とからパケットを傍受する。抽出部１２４は、傍受部１２２によって傍受されたパケットから始点ＩＰアドレスと終点ＩＰアドレスとポート番号とシーケンス番号とを抽出する。 The network interface unit 102 includes an intercepting unit 122, an extracting unit 124, a filter unit 126, and a registration unit 128.
The intercepting unit 122 intercepts packets from the external network 1004, the load balancer 1030, and the internal network 3. The extraction unit 124 extracts the start point IP address, the end point IP address, the port number, and the sequence number from the packet intercepted by the interception unit 122.

以下、ロードバランサ１０３０またはサーバ群１０２２またはサーバ群１０２２のサーバをノードと呼ぶ場合がある。
フィルタ部１２６は、フィルタ機能がオンとされるフィルタリングモードでは、抽出部１２４によって抽出された始点ＩＰアドレスおよび終点ＩＰアドレスを基に、計測対象のノードのＩＰアドレスが含まれたパケットのみを選択して、登録部１２８に出力する。より具体的にはフィルタ部１２６は、抽出部１２４によって抽出された始点ＩＰアドレスおよび終点ＩＰアドレスのうちの少なくとも一方が計測対象のノードのＩＰアドレスと一致する場合にのみ、抽出元のパケットを登録部１２８に出力する。 Hereinafter, the load balancer 1030, the server group 1022, or the server of the server group 1022 may be referred to as a node.
In the filtering mode in which the filter function is turned on, the filter unit 126 selects only packets including the IP address of the measurement target node based on the start point IP address and the end point IP address extracted by the extraction unit 124. To the registration unit 128. More specifically, the filter unit 126 registers the extraction source packet only when at least one of the start point IP address and the end point IP address extracted by the extraction unit 124 matches the IP address of the measurement target node. To the unit 128.

特にフィルタ部１２６は、計測対象のノードを、情報処理システム２に含まれる複数のノードから順番に選択する。すなわち、フィルタ部１２６は、計測対象のノードのＩＰアドレスを、情報処理システム２に含まれる複数のノードのＩＰアドレスのなかで順番に切り替える。
フィルタ部１２６は、フィルタ機能がオフとされるフィルタリングオフモードでは、抽出部１２４によって始点ＩＰアドレスと終点ＩＰアドレスとポート番号とシーケンス番号とが抽出されたパケットを登録部１２８に出力する。 In particular, the filter unit 126 sequentially selects a measurement target node from a plurality of nodes included in the information processing system 2. That is, the filter unit 126 switches the IP address of the measurement target node in order among the IP addresses of a plurality of nodes included in the information processing system 2.
In the filtering off mode in which the filter function is turned off, the filter unit 126 outputs the packet from which the extraction point 124 has extracted the start point IP address, the end point IP address, the port number, and the sequence number to the registration unit 128.

登録部１２８は、フィルタ部１２６によって出力されたパケットについて、そのパケットに関する時刻と、抽出部１２４によってそのパケットから抽出された始点ＩＰアドレスと、抽出部１２４によってそのパケットから抽出された終点ＩＰアドレスと、抽出部１２４によってそのパケットから抽出されたポート番号に対応する通信用途と、そのパケットのデータ量と、抽出部１２４によってそのパケットから抽出されたシーケンス番号と、を対応付けて通信情報保持部１０４に登録する。 For the packet output by the filter unit 126, the registration unit 128 includes a time related to the packet, a start point IP address extracted from the packet by the extraction unit 124, and an end point IP address extracted from the packet by the extraction unit 124. The communication information holding unit 104 associates the communication usage corresponding to the port number extracted from the packet by the extraction unit 124 with the data amount of the packet and the sequence number extracted from the packet by the extraction unit 124. Register with.

図７は、通信情報保持部１０４に保持されるデータの一例を示すデータ構造図である。通信情報保持部１０４は、時刻２０２と、始点ＩＰアドレス２０４と、終点ＩＰアドレス２０６と、通信用途２０８と、データ量２１０と、シーケンス番号２１１と、を対応付けて保持する。
時刻２０２は、パケットに関する時刻であり、例えばパケットにタイムスタンプが含まれているのであればそのタイムスタンプで示される時刻である。あるいはまた、傍受部１２２によってパケットが傍受された時刻であってもよい。
始点ＩＰアドレス２０４および終点ＩＰアドレス２０６およびシーケンス番号２１１はそれぞれ、抽出部１２４によって抽出された始点ＩＰアドレスおよび終点ＩＰアドレスおよびシーケンス番号である。
通信用途２０８は、抽出部１２４によって抽出されたポート番号に対応する通信用途である。
データ量２１０は、パケットに搭載されているデータの量をバイト（Ｂ）単位で示す。 FIG. 7 is a data structure diagram showing an example of data held in the communication information holding unit 104. The communication information holding unit 104 holds the time 202, the start point IP address 204, the end point IP address 206, the communication application 208, the data amount 210, and the sequence number 211 in association with each other.
The time 202 is a time related to the packet. For example, if the packet includes a time stamp, the time 202 is a time indicated by the time stamp. Alternatively, the time when the packet was intercepted by the intercepting unit 122 may be used.
The start point IP address 204, the end point IP address 206, and the sequence number 211 are the start point IP address, the end point IP address, and the sequence number extracted by the extraction unit 124, respectively.
The communication application 208 is a communication application corresponding to the port number extracted by the extraction unit 124.
The data amount 210 indicates the amount of data loaded in the packet in units of bytes (B).

図６に戻る。分類部１３２は、抽出部１２４によって抽出された始点ＩＰアドレスおよび終点ＩＰアドレスのうちの少なくともひとつとポート番号とによってパケットを分類する。以下では分類部１３２が始点ＩＰアドレスとポート番号とによってパケットを分類する場合について説明する。しかしながら、分類部が終点ＩＰアドレスとポート番号とによってパケットを分類する場合、および分類部が始点および終点ＩＰアドレスとポート番号とによってパケットを分類する場合でも以下と同様の説明が成り立つことは、本明細書に触れた当業者には明らかである。 Returning to FIG. The classification unit 132 classifies the packet based on at least one of the start point IP address and the end point IP address extracted by the extraction unit 124 and the port number. Hereinafter, a case will be described in which the classification unit 132 classifies packets based on the start point IP address and the port number. However, even when the classification unit classifies a packet based on the end point IP address and the port number, and when the classification unit classifies the packet based on the start point, the end point IP address, and the port number, the following explanation is valid. It will be apparent to those skilled in the art who have touched the specification.

分類部１３２は、通信情報保持部１０４に保持されるデータを始点ＩＰアドレスについてソーティングする。以下、通信情報保持部１０４に登録されている始点ＩＰアドレスを登録ＩＰアドレスと称す。分類部１３２は、各登録ＩＰアドレスについて通信用途によってパケットを分類する。特に分類部１３２は、サーバ情報保持部１３８を参照し、登録ＩＰアドレスに対応するノード（以下、登録ノードと称す）に対して予め定められた通信用途である本来用途に関する情報を取得する。分類部１３２は、その本来用途に対応したパケットと、本来用途以外の通信用途である非本来用途に対応したパケットとに分類する。本来用途は、例えば登録ノードがＷＥＢサーバであれば、ポート番号「８０」に対応するＷＥＢ通信である。非本来用途は、例えば本来用途以外の通信用途であり、登録ノードがＷＥＢサーバであればポート番号「２０」に対応するＦＴＰ転送などである。 The classification unit 132 sorts the data held in the communication information holding unit 104 with respect to the start point IP address. Hereinafter, the start point IP address registered in the communication information holding unit 104 is referred to as a registered IP address. The classification unit 132 classifies the packet for each registered IP address according to the communication use. In particular, the classification unit 132 refers to the server information holding unit 138, and acquires information on the original use that is a predetermined communication use for a node corresponding to the registered IP address (hereinafter referred to as a registered node). The classification unit 132 classifies the packet into a packet corresponding to the original use and a packet corresponding to a non-original use which is a communication use other than the original use. For example, if the registered node is a WEB server, the original use is WEB communication corresponding to the port number “80”. The non-original use is, for example, a communication use other than the original use, such as FTP transfer corresponding to the port number “20” if the registered node is a WEB server.

図８は、サーバ情報保持部１３８に保持されるデータの一例を示すデータ構造図である。サーバ情報保持部１３８は、ノードのノード名２１４と、ノードのＩＰアドレス２１８と、ノードの用途２２４と、を対応付けて保持する。
分類部１３２は、サーバ情報保持部１３８を参照してあるノードの本来用途に関する情報を取得する際、そのノードの用途を参照して本来用途を取得してもよい。 FIG. 8 is a data structure diagram showing an example of data held in the server information holding unit 138. The server information holding unit 138 holds the node name 214 of the node, the IP address 218 of the node, and the usage 224 of the node in association with each other.
When the classification unit 132 acquires information on the original use of a node by referring to the server information holding unit 138, the classification unit 132 may acquire the original use by referring to the use of the node.

図６に戻る。集計部１３０は、少なくともひとつの登録ノードについて、本来用途に対応したパケットを集計する。特に集計部１３０は、所定の長さの期間すなわち時間帯ごとに、登録ノードの本来用途に対応したパケットのデータ量を集計し、また非本来用途に対応したパケットのデータ量を集計する。
例えば、登録ＩＰアドレス「121.21.15.1」に対応する登録ノードについて、時間帯「02/15/10 11:10:00~11:20:00」で集計する場合について考える。図８のサーバ情報保持部１３８から登録ノードは第１フロントエンドサーバ１０２４ａでありＷＥＢサーバである。集計部１３０は、分類部１３２における分類結果から、始点ＩＰアドレスが「121.21.15.1」であり、通信用途が「ＷＥＢ通信」であり、時刻が時間帯「02/15/10 11:10:00~11:20:00」に含まれるパケットのデータ量を足し合わせて、本来用途に対する集計結果を得る。また、集計部１３０は、始点ＩＰアドレスが「121.21.15.1」であり、通信用途が「ＷＥＢ通信」ではなく、時刻が時間帯「02/15/10 11:10:00~11:20:00」に含まれるパケットのデータ量を足し合わせて、非本来用途に対する集計結果を得る。 Returning to FIG. The totaling unit 130 totals packets corresponding to the intended use for at least one registered node. In particular, the totaling unit 130 totals the data amount of the packet corresponding to the original use of the registered node for each period of a predetermined length, that is, the time zone, and totals the data amount of the packet corresponding to the non-original use.
For example, consider a case where the registered nodes corresponding to the registered IP address “121.21.15.1” are counted in the time zone “02/15/10 11:10:00 to 11:20:00”. The registered node from the server information holding unit 138 in FIG. 8 is the first front-end server 1024a, which is a WEB server. From the classification result in the classification unit 132, the totaling unit 130 has the start point IP address “121.21.15.1”, the communication application “WEB communication”, and the time “02/15/10 11:10:00”. Add up the data amount of packets included in "11:20:00" to get the total result for the intended use. In addition, the totaling unit 130 has a start IP address “121.21.15.1”, a communication usage not “WEB communication”, and a time zone “02/15/10 11:10:00 to 11:20:00 Are added together to obtain the total result for the non-original use.

集計部１３０は、少なくともひとつの登録ノードについて、始点ＩＰアドレスと終点ＩＰアドレスとシーケンス番号とを参照して時間帯ごとのアクセスの数を算出する。集計部１３０は算出されたアクセスの数を時間帯の長さで除すことで、その登録ノードにおけるその時間帯でのアクセス数を算出する。
なお、集計部１３０は、ロードバランサ１０３０については、情報処理システム２の外部から情報処理装置１０２０全体へのアクセス数（以下、総アクセス数と称す）を算出する。つまり、集計部１３０は、情報処理システム２の外部の端末のＩＰアドレスを始点ＩＰアドレスとし、ロードバランサ１０３０のＩＰアドレスを終点ＩＰアドレスとするパケットを集計することで、ロードバランサ１０３０についての総アクセス数を算出する。
集計部１３０は、集計結果および算出結果を第１指標保持部１０８に登録する。 The totaling unit 130 calculates the number of accesses for each time period with reference to the start point IP address, the end point IP address, and the sequence number for at least one registered node. The totaling unit 130 divides the calculated number of accesses by the length of the time zone, thereby calculating the number of accesses in the registered node at the time zone.
For the load balancer 1030, the aggregation unit 130 calculates the number of accesses from the outside of the information processing system 2 to the entire information processing apparatus 1020 (hereinafter referred to as the total number of accesses). That is, the totaling unit 130 totals the access to the load balancer 1030 by totaling packets having the IP address of the terminal external to the information processing system 2 as the start IP address and the IP address of the load balancer 1030 as the end point IP address. Calculate the number.
The aggregation unit 130 registers the aggregation result and the calculation result in the first index holding unit 108.

図９は、第１指標保持部１０８に保持されるデータの一例を示すデータ構造図である。第１指標保持部１０８は、登録ノードのノード名２２８と、登録ノードの本来用途２３０と、本来用途に対応するポート番号である主なポート番号２３２と、本来用途データ量２３４と、非本来用途データ量２３６と、集計部１３０で使用される所定の長さの期間すなわち時間帯に対応する計測期間２３８と、アクセス数２３９と、を対応付けて保持する。
本来用途データ量２３４は、登録ノードの本来用途に対応したパケットのデータ量を計測期間内で足し合わせた結果のデータ量である。
非本来用途データ量２３６は、登録ノードの非本来用途に対応したパケットのデータ量を計測期間内で足し合わせた結果のデータ量である。 FIG. 9 is a data structure diagram illustrating an example of data held in the first index holding unit 108. The first index holding unit 108 includes the node name 228 of the registered node, the original use 230 of the registered node, the main port number 232 that is a port number corresponding to the original use, the original use data amount 234, and the non-original use. A data amount 236, a period of a predetermined length used in the counting unit 130, that is, a measurement period 238 corresponding to a time zone, and the number of accesses 239 are stored in association with each other.
The original use data amount 234 is a data amount obtained as a result of adding the data amount of the packet corresponding to the original use of the registered node within the measurement period.
The non-original use data amount 236 is a data amount obtained as a result of adding the data amount of the packet corresponding to the non-original use of the registered node within the measurement period.

図６に戻る。使用状況取得部１１０は、内部ネットワーク３を通じて所定の時間間隔もしくはリアルタイムで、情報処理システム２のノードから、そのノードの使用状況に関する情報を取得する。ノードの使用状況は、例えばノードがフロントエンドサーバ（ＷＥＢサーバ）１０２４やアプリケーションサーバ１０２６であれば消費電力（単位はＷ）や機器の温度（単位は℃）やＣＰＵ使用率（単位は％）であり、ノードがデータベースサーバ１０２８であればデータ転送量（単位はＭＢ／秒）や消費電力や機器の温度であり、ノードがロードバランサ１０３０などのネットワーク機器であればデータ転送量や消費電力や機器の温度である。 Returning to FIG. The usage status acquisition unit 110 acquires information on the usage status of the node from the node of the information processing system 2 at a predetermined time interval or in real time through the internal network 3. For example, if the node is a front-end server (WEB server) 1024 or an application server 1026, power consumption (unit: W), device temperature (unit: ° C), and CPU usage rate (unit:%). Yes, if the node is the database server 1028, the data transfer amount (unit: MB / second), power consumption, and device temperature. If the node is a network device such as the load balancer 1030, the data transfer amount, power consumption, and device. Temperature.

使用状況取得部１１０は、サーバの基本ＯＳに搭載されているシステムパフォーマンスモニタのデータを取得してもよい。
使用状況取得部１１０は、各ノードに電力を供給するバスバー方式のテーブルタップ型のＰＤＵから各ノードに供給された電力の計測値を消費電力として取得してもよい。
使用状況取得部１１０は、ノードの温度を計測できる市販の装置（温度センサや、光ファイバ方式の温度センサ）と、その温度データを記憶して報告するデータ収集機能（市販されているソフトウエアパッケージ、またはサーバ組み込み型の特殊ハードウエアパッケージ）と、を利用して機器の温度を取得してもよい。 The usage status acquisition unit 110 may acquire data of a system performance monitor installed in the basic OS of the server.
The usage status acquisition unit 110 may acquire, as power consumption, a measured value of power supplied to each node from a bus bar table tap type PDU that supplies power to each node.
The usage status acquisition unit 110 includes a commercially available device (a temperature sensor or an optical fiber temperature sensor) that can measure the temperature of the node, and a data collection function (a commercially available software package) that stores and reports the temperature data. Or a server-embedded special hardware package), the temperature of the device may be acquired.

あるいはまた、使用状況取得部１１０は、米国ＥＰＡのEnergy Star for Server 2で登録し認定されるノードからは、下記の３つの報告機能を使用して消費電力と機器の温度とＣＰＵ使用率とを取得する。すなわち、米国のＥＰＡが行っているEnergy Star for Server 2では、登録し認定される機器は、３つの報告機能をハードウエア本体のマイクロコード（ファームウエア）のレベルで提供でき、基本ＯＳや他の計測ソフト、アプリケーションや本来のサーバの処理能力（本体のＣＰＵ・ＭＰＵ（Micro Processing Unit））には、影響を与えないハードウエアの管理機能を持つこととされている。この３つの報告機能は、１）サーバの実電力消費数値、２）サーバの本体ＣＰＵ・ＭＰＵ等の処理用プロセッサの使用率、３）サーバの実稼動の温度、である。 Alternatively, the usage status acquisition unit 110 uses the following three reporting functions to obtain power consumption, device temperature, and CPU usage rate from nodes registered and certified by Energy Star for Server 2 of the US EPA. get. In other words, in Energy Star for Server 2 conducted by EPA in the United States, registered and certified devices can provide three reporting functions at the microcode (firmware) level of the hardware itself, and the basic OS and other It is supposed to have a hardware management function that does not affect the measurement software, the application, and the processing capacity of the original server (main body CPU / MPU (Micro Processing Unit)). The three reporting functions are 1) the actual power consumption value of the server, 2) the usage rate of the processing processor such as the main body CPU / MPU of the server, and 3) the actual operating temperature of the server.

使用状況取得部１１０は、取得した使用状況に関する情報と、それを取得した時刻に対応する時間帯と、取得先のノード名と、ＩＰアドレスと、を対応付けて使用状況保持部１１２に登録する。使用状況取得部１１０が所定の時間間隔で情報を取得する場合は、時間帯は例えば取得した時刻から所定の時間間隔が経過するまでの期間に設定される。あるいはまた、使用状況取得部１１０がリアルタイムで情報を取得する場合は、所定の長さを有する期間を時間帯とし、使用状況に関する情報をその期間で時間平均する。 The usage status acquisition unit 110 registers the acquired usage status information, the time zone corresponding to the time when the usage status is acquired, the node name of the acquisition destination, and the IP address in association with each other in the usage status holding unit 112. . When the usage status acquisition unit 110 acquires information at a predetermined time interval, for example, the time zone is set to a period from the acquired time until the predetermined time interval elapses. Alternatively, when the usage status acquisition unit 110 acquires information in real time, a period having a predetermined length is set as a time zone, and information on the usage status is averaged over the period.

図１０は、使用状況保持部１１２に保持されるデータの一例を示すデータ構造図である。使用状況保持部１１２は、使用状況取得部１１０によって使用状況が取得されたノードのノード名２４４と、ＩＰアドレス２４５と、使用状況が取得された時刻に対応する記録時間帯２４６と、取得された消費電力２４８と、取得された機器温度２５０と、取得されたＣＰＵ使用率２５２と、取得されたデータ転送量２５３と、を対応付けて保持する。 FIG. 10 is a data structure diagram illustrating an example of data held in the usage status holding unit 112. The usage status holding unit 112 has acquired the node name 244 of the node whose usage status has been acquired by the usage status acquisition unit 110, the IP address 245, and the recording time zone 246 corresponding to the time when the usage status was acquired. The power consumption 248, the acquired device temperature 250, the acquired CPU usage rate 252 and the acquired data transfer amount 253 are stored in association with each other.

一般的に、ＷＥＢサーバにおいて、所定の量、例えば１ｋＢのＷＥＢ通信用のデータを処理して送るにはどれだけのＣＰＵ使用率が必要かは測定できる。つまり、ＷＥＢサーバについて、ＷＥＢ通信用のデータ量とＣＰＵ使用率との関係は事前に測定できる。係数保持部１１４はその関係を示す係数（例えば、％・分／ＧＢの単位）を保持する。この係数が１である場合、例えば１ＧＢ分のＷＥＢ通信用のデータを処理して送るためにＷＥＢサーバは１％のＣＵＰ使用率で１分間稼動しなければならないことを意味する。 Generally, in a WEB server, it is possible to measure how much CPU usage is required to process and send data for a predetermined amount, for example, 1 kB of WEB communication. That is, for the WEB server, the relationship between the data amount for WEB communication and the CPU usage rate can be measured in advance. The coefficient holding unit 114 holds a coefficient indicating the relationship (for example, unit of% · minute / GB). When this coefficient is 1, it means that, for example, in order to process and send data for 1 GB of WEB communication, the WEB server must operate for 1 minute at a CUP usage rate of 1%.

図１１は、係数保持部１１４に保持されるデータの一例を示すデータ構造図である。係数保持部１１４は、ＷＥＢサーバのサーバ名２４０と、係数２４２と、を対応付けて保持する。 FIG. 11 is a data structure diagram illustrating an example of data held in the coefficient holding unit 114. The coefficient holding unit 114 holds the server name 240 of the WEB server and the coefficient 242 in association with each other.

図６に戻る。第１解析部１１６は、第１算出部１３４と、第２算出部１３６と、第１合成部１４２と、を含む。
第１算出部１３４は、第１指標保持部１０８に登録される登録ノードのうちＷＥＢサーバ（以下、登録ＷＥＢサーバと称す）について、本来用途データ量すなわちＷＥＢ通信用のデータ量を対応する計測期間の長さ、例えば分を単位とする長さ、で除し、ＷＥＢ通信用の時間平均データ量（例えば、ＧＢ／分の単位）を得る。第１算出部１３４は、係数保持部１１４からその登録ＷＥＢサーバに対応する係数を取得する。第１算出部１３４は、その登録ＷＥＢサーバについて、ＷＥＢ通信用の時間平均データ量と係数とを乗算し、乗算の結果得られる値（例えば、％の単位）を、その登録ＷＥＢサーバおよび計測期間における、ＷＥＢ通信に関する処理によるＣＰＵ使用率の推測値（以下、第１推測値と称す）とする。 Returning to FIG. The first analysis unit 116 includes a first calculation unit 134, a second calculation unit 136, and a first synthesis unit 142.
The first calculation unit 134 is a measurement period corresponding to the original use data amount, that is, the data amount for WEB communication, for a WEB server (hereinafter referred to as a registered WEB server) among the registered nodes registered in the first index holding unit 108. The time average data amount (for example, unit of GB / min) for WEB communication is obtained by dividing by the length of, for example, the length in minutes. The first calculation unit 134 acquires a coefficient corresponding to the registered WEB server from the coefficient holding unit 114. The first calculation unit 134 multiplies the registration web server by the time average data amount for web communication and a coefficient, and obtains a value (for example, unit of%) obtained as a result of the multiplication by the registration web server and the measurement period. The CPU usage rate estimated value (hereinafter referred to as the first estimated value) by the processing related to WEB communication in FIG.

例えば、図９の第１指標保持部１０８に登録されるノード名「第１フロントエンドサーバ」に着目すると、第１算出部１３４は、計測期間「02/15/10 11:10:00~11:20:00」におけるＷＥＢ通信用のデータ量として「５００（ＧＢ）」を得る。第１算出部１３４は、「５００（ＧＢ）」を計測期間の長さ「１０（分）」で除し、ＷＥＢ通信用の時間平均データ量「５０（ＧＢ／分）」を得る。第１算出部１３４は、図１１の係数保持部１１４を参照し、係数「１．５（％・分／ＧＢ）」を得る。第１算出部１３４は、ＷＥＢ通信用の時間平均データ量「５０（ＧＢ／分）」と係数「１．５（％・分／ＧＢ）」とを乗算し「７５（％）」を得る。第１算出部１３４は、この値「７５（％）」をノード名「第１フロントエンドサーバ」のＷＥＢサーバにおける計測期間「02/15/10 11:10:00~11:20:00」中のＷＥＢ通信に関する処理による第１推測値とする。 For example, when paying attention to the node name “first front-end server” registered in the first index holding unit 108 in FIG. 9, the first calculation unit 134 detects the measurement period “02/15/10 11:10:00 to 11”. : 20: 00 ",“ 500 (GB) ”is obtained as the data amount for WEB communication. The first calculator 134 divides “500 (GB)” by the length of the measurement period “10 (minutes)” to obtain a time average data amount “50 (GB / minute)” for WEB communication. The first calculation unit 134 refers to the coefficient holding unit 114 in FIG. 11 and obtains a coefficient “1.5 (% · min / GB)”. The first calculation unit 134 multiplies the time average data amount “50 (GB / min)” for the WEB communication by the coefficient “1.5 (% · min / GB)” to obtain “75 (%)”. The first calculation unit 134 calculates this value “75 (%)” during the measurement period “02/15/10 11:10:00 to 11:20:00” on the WEB server of the node name “first front-end server”. It is set as the 1st guess value by the process regarding the WEB communication.

第２算出部１３６は、第１算出部１３４における第１推測値の計算で対象とされた登録ＷＥＢサーバのサーバ名と計測期間とをキーとして使用状況保持部１１２を検索し、対応する消費電力と機器温度とＣＰＵ使用率とを抽出する。第２算出部１３６は、第１推測値を使用状況保持部１１２から抽出したＣＰＵ使用率から減じ、ＷＥＢ通信に関する処理以外の処理によるＣＰＵ使用率の推測値（以下、第２推測値と称す）とする。
ＷＥＢ通信に関する処理以外の処理は、例えば他の通信に関する処理や、ウイルスチェッカーによるウイルススキャンや、ハードディスクドライブへのデータの書き込みである。あるいはまた、アイドル状態にあってなにも仕事をしていないことも考えられる。 The second calculation unit 136 searches the usage status holding unit 112 using the server name and measurement period of the registered WEB server targeted by the calculation of the first estimated value in the first calculation unit 134 as keys, and the corresponding power consumption And device temperature and CPU usage rate are extracted. The second calculation unit 136 subtracts the first estimated value from the CPU usage rate extracted from the usage status holding unit 112, and estimates the CPU usage rate by processing other than the processing related to WEB communication (hereinafter referred to as second estimated value). And
Processing other than processing related to WEB communication is, for example, processing related to other communication, virus scanning by a virus checker, and writing of data to a hard disk drive. Or, you may be idle and not working.

第２算出部１３６は、抽出された消費電力を第１推測値と第２推測値との比に分け、第１推測値に対応する値をＷＥＢ通信に関する処理のために使用された電力の推測値、第２推測値に対応する値をそうでない処理のために使用された電力の推測値とする。機器温度についても、機器温度と外気温との差を第１推測値と第２推測値との比に分ける点で同様である。 The second calculation unit 136 divides the extracted power consumption into the ratio between the first estimated value and the second estimated value, and estimates the value corresponding to the first estimated value for the power used for processing related to WEB communication. The value corresponding to the value and the second estimated value is the estimated value of the power used for the processing that is not so. The device temperature is the same in that the difference between the device temperature and the outside air temperature is divided into the ratio between the first estimated value and the second estimated value.

例えば、図９の第１指標保持部１０８に登録されるノード名「第１フロントエンドサーバ」に着目すると、第２算出部１３６は図１０に示される使用状況保持部１１２から、ノード名「第１フロントエンドサーバ」と計測期間「02/15/10 11:10:00~11:20:00」とに対応する消費電力「３０（Ｗ）」、機器温度「３５（℃）」、ＣＰＵ使用率「８０（％）」を抽出する。第２算出部１３６は、第１推測値「７５（％）」をＣＰＵ使用率「８０（％）」から減じ、第２推測値「５（％）」を得る。第２算出部１３６は、消費電力「３０（Ｗ）」を第１推測値「７５（％）」と第２推測値「５（％）」との比すなわち１５：１に分け、ＷＥＢ通信に関する処理のために使用された電力の推測値「２８（Ｗ）」とそれ以外の処理のために使用された電力の推測値「２（Ｗ）」と、を得る。 For example, focusing on the node name “first front-end server” registered in the first index holding unit 108 in FIG. 9, the second calculation unit 136 receives the node name “first” from the usage status holding unit 112 shown in FIG. 1 front-end server ”and measurement period“ 02/15/10 11: 10: 00 ~ 11: 20: 00 ”Power consumption“ 30 (W) ”, device temperature“ 35 (° C) ”, CPU usage The rate “80 (%)” is extracted. The second calculation unit 136 subtracts the first estimated value “75 (%)” from the CPU usage rate “80 (%)” to obtain the second estimated value “5 (%)”. The second calculation unit 136 divides the power consumption “30 (W)” into a ratio between the first estimated value “75 (%)” and the second estimated value “5 (%)”, that is, 15: 1, and relates to WEB communication. The estimated power value “28 (W)” used for processing and the estimated power value “2 (W)” used for other processing are obtained.

第１合成部１４２は、時刻およびノード名をキーとして、第１指標保持部１０８に保持されるデータと、使用状況保持部１１２に保持されるデータと、第２算出部１３６による計算結果（登録ＷＥＢサーバについての、ＷＥＢ通信処理由来の消費電力、ＷＥＢ通信処理由来の上昇温度、ＷＥＢ通信処理由来のＣＰＵ使用率）と、を対応付けて第２指標保持部１１８に登録する。 The first synthesizing unit 142 uses the time and node name as keys, the data held in the first index holding unit 108, the data held in the usage status holding unit 112, and the calculation result (registration) by the second calculation unit 136 The power consumption derived from the WEB communication process, the rising temperature derived from the WEB communication process, and the CPU usage rate derived from the WEB communication process) are registered in the second index holding unit 118 in association with each other.

図１２は、第２指標保持部１１８に保持されるデータの一例を示すデータ構造図である。第２指標保持部１１８は、ノード名２５４と、本来用途２５６と、主なポート番号２５８と、本来用途データ量２６０と、非本来用途データ量２６２と、計測期間２６４と、消費電力２６６と、機器温度２６８と、ＣＰＵ使用率２７０と、データ転送量２７１と、ＷＥＢ通信由来消費電力２７２と、ＷＥＢ通信由来上昇温度２７４と、ＷＥＢ通信由来ＣＰＵ使用率２７６と、を対応付けて保持する。
ＷＥＢ通信由来消費電力２７２は、第２算出部１３６によって算出された、ＷＥＢ通信に関する処理のために使用された電力の推測値である。ＷＥＢ通信由来上昇温度２７４は、第２算出部１３６によって算出された、機器温度と外気温との差のうちＷＥＢ通信に関する処理によって上昇したと予測される分である。ここでは外気温は２５（℃）とされている。ＷＥＢ通信由来ＣＰＵ使用率２７６は、第１算出部１３４によって算出された第１推測値である。 FIG. 12 is a data structure diagram illustrating an example of data held in the second index holding unit 118. The second index holding unit 118 includes a node name 254, an original use 256, a main port number 258, an original use data amount 260, a non-original use data amount 262, a measurement period 264, power consumption 266, The device temperature 268, the CPU usage rate 270, the data transfer amount 271, the WEB communication-derived power consumption 272, the WEB communication-derived rising temperature 274, and the WEB communication-derived CPU usage rate 276 are stored in association with each other.
The WEB communication-derived power consumption 272 is an estimated value of power used for processing related to WEB communication, calculated by the second calculation unit 136. The WEB communication-derived increased temperature 274 is a portion that is predicted to have been increased by the process related to WEB communication among the difference between the device temperature and the outside air temperature calculated by the second calculation unit 136. Here, the outside air temperature is 25 (° C.). The WEB communication-derived CPU usage rate 276 is a first estimated value calculated by the first calculation unit 134.

図６に戻る。第２解析部１５２は、第２指標保持部１１８に保持されるデータのうちＷＥＢサーバ（フロントエンドサーバ１０２４）に関するデータを、使用状況取得部１１０における所定の時間間隔よりも長い時間間隔、例えば１ヶ月や半年の時間間隔で統計処理する。
第２解析部１５２は、第２指標保持部１１８に登録されているノードのうちＷＥＢサーバについて、消費電力の時間平均である平均消費電力と、統計処理対象の期間内における消費電力の最高値である最高消費電力と、を算出する。
第２解析部１５２は、第２指標保持部１１８に登録されているノードのうちＷＥＢサーバについて、ＷＥＢ通信由来の消費電力の時間平均を本来用途データ量（ＷＥＢ通信用のデータ量）の時間平均で除し、電力コスト（例えば、Ｗ／ＧＢの単位）を算出する。
第２解析部１５２は、サーバ名と、平均消費電力と、最高消費電力と、電力コストと、を対応付けて第３指標保持部１５８に登録する。 Returning to FIG. The second analysis unit 152 uses the data related to the WEB server (front-end server 1024) among the data stored in the second index storage unit 118 for a time interval longer than a predetermined time interval in the usage status acquisition unit 110, for example, 1 Statistical processing is performed at time intervals of months and half years.
The second analysis unit 152 uses the average power consumption that is the time average of power consumption and the maximum value of power consumption within the statistical processing target period for the WEB server among the nodes registered in the second index holding unit 118. A certain maximum power consumption is calculated.
The second analysis unit 152 calculates the time average of the power consumption derived from WEB communication for the WEB server among the nodes registered in the second index holding unit 118 as the time average of the original use data amount (data amount for WEB communication). The power cost (for example, unit of W / GB) is calculated.
The second analysis unit 152 registers the server name, average power consumption, maximum power consumption, and power cost in the third index holding unit 158 in association with each other.

図１３は、第３指標保持部１５８に保持されるデータの一例を示すデータ構造図である。第３指標保持部１５８は、サーバ名３０４と、平均消費電力３０６と、最高消費電力３０８と、電力コスト３１０と、を対応付けて保持する。 FIG. 13 is a data structure diagram illustrating an example of data held in the third index holding unit 158. The third index holding unit 158 holds the server name 304, the average power consumption 306, the maximum power consumption 308, and the power cost 310 in association with each other.

図６に戻る。表示制御部１２０は、第１指標保持部１０８、第２指標保持部１１８、第３指標保持部１５８のうちの少なくともひとつに基づき、ノードごとに各種パラメータを示す画面をモニタ１４０に表示させる。 Returning to FIG. The display control unit 120 causes the monitor 140 to display a screen showing various parameters for each node based on at least one of the first index holding unit 108, the second index holding unit 118, and the third index holding unit 158.

指標送信部１５４は、第１サーバ群状態保持部１３２２の更新のために、第３指標保持部１５８に登録されている電力コストに関する情報をロードバランサ１０３０に送信する。 The index transmission unit 154 transmits information regarding the power cost registered in the third index holding unit 158 to the load balancer 1030 in order to update the first server group state holding unit 1322.

学習部１１３０は、更新部１１３２と、負荷予測部１１３４と、を含む。学習部１１３０は、更新部１１３２によって総アクセス数の履歴を負荷履歴保持部１１１２に記憶し、その記憶をふまえて負荷予測部１１３４によって今後発生しうる総アクセス数を予測する。これにより、運用実績が長いほど省エネの観点からより賢くなる運用管理が可能となる。 The learning unit 1130 includes an update unit 1132 and a load prediction unit 1134. The learning unit 1130 stores the history of the total number of accesses in the load history holding unit 1112 by the update unit 1132, and predicts the total number of accesses that can occur in the future by the load prediction unit 1134 based on the storage. As a result, the longer the operation results, the smarter the operation management from the viewpoint of energy saving.

更新部１１３２は、第１指標保持部１０８から総アクセス数を取得する。更新部１１３２は、取得した総アクセス数をもとに負荷履歴保持部１１１２を更新する。負荷履歴保持部１１１２はしたがって、総アクセス数の履歴すなわち過去に要求された総アクセス数を保持する。 The update unit 1132 acquires the total number of accesses from the first index holding unit 108. The updating unit 1132 updates the load history holding unit 1112 based on the acquired total number of accesses. Therefore, the load history holding unit 1112 holds the history of the total number of accesses, that is, the total number of accesses requested in the past.

図１４は、負荷履歴保持部１１１２を示すデータ構造図である。負荷履歴保持部１１１２は、時間帯１２１８と、その時間帯が属する属性１２２０と、その時間帯に取得された総アクセス数１２２２と、その時間帯に稼動状態に設定される稼動サーバ群の数１２２４と、その時間帯の平均稼動率１２２６と、を対応付けて記憶する。時間帯が属する属性１２２０とは、例えば月曜日、火曜日などの曜日や、平日休日の別や、年末年始とそれ以外の別や、午前午後の別や、それらの任意の組み合わせである。平均稼働率１２２６は、稼動状態に設定されるサーバ群の稼働率の平均値である。
更新部１１３２は、稼動状態に設定される稼動サーバ群の数１２２４を取得するために第２サーバ群状態保持部１１１４を参照してもよい。
時間帯１２１８は、運用管理装置１０１０のモードを更新する基準となる時間間隔を有する期間である。 FIG. 14 is a data structure diagram showing the load history holding unit 1112. The load history holding unit 1112 includes a time zone 1218, an attribute 1220 to which the time zone belongs, the total number of accesses 1222 acquired in the time zone, and the number of active server groups 1224 set to the operating state in the time zone. And the average operating rate 1226 for that time period are stored in association with each other. The attribute 1220 to which the time zone belongs is, for example, a day of the week such as Monday, Tuesday, a weekday holiday, a year-end / New Year holiday, a morning / afternoon, or any combination thereof. The average operation rate 1226 is an average value of the operation rates of the server group set in the operation state.
The update unit 1132 may refer to the second server group state holding unit 1114 in order to obtain the number 1224 of active server groups set to the operation state.
The time zone 1218 is a period having a time interval that serves as a reference for updating the mode of the operation management apparatus 1010.

図６に戻る。負荷予測部１１３４は、過去に要求された総アクセス数をもとに今後発生する総アクセス数を予測する。負荷予測部１１３４は、未来の時間帯ごとに総アクセス数を予測する。後述の状態設定部１１４０もまた、この時間帯ごとに必要に応じてサーバ群１０２２の状態を設定する。特に負荷予測部１１３４は、総アクセス数予測の対象となる時間帯（以下、予測対象の時間帯と称す）について負荷履歴保持部１１１２を参照して総アクセス数を予測する。以下、負荷予測部１１３４によって予測される総アクセス数を予測総アクセス数と称す。 Returning to FIG. The load prediction unit 1134 predicts the total number of accesses that will occur in the future based on the total number of accesses requested in the past. The load prediction unit 1134 predicts the total number of accesses for each future time zone. The state setting unit 1140, which will be described later, also sets the state of the server group 1022 as necessary for each time period. In particular, the load predicting unit 1134 predicts the total number of accesses with reference to the load history holding unit 1112 for a time zone (hereinafter referred to as a time zone to be predicted) for which the total number of accesses is predicted. Hereinafter, the total number of accesses predicted by the load prediction unit 1134 is referred to as a predicted total number of accesses.

負荷予測部１１３４では、情報処理システム２が使用される目的に応じた総アクセス数の傾向（トレンド）を負荷履歴保持部１１１２から抽出する処理と、抽出された傾向に基づいて予測対象の時間帯の総アクセス数を予測する処理とが行われる。特に負荷予測部１１３４では、時間帯の属性をキーとして総アクセス数の傾向が抽出される。 The load prediction unit 1134 extracts a trend of the total number of accesses according to the purpose for which the information processing system 2 is used from the load history holding unit 1112, and the time zone to be predicted based on the extracted trend The total number of accesses is predicted. In particular, the load prediction unit 1134 extracts the tendency of the total number of accesses using the time zone attribute as a key.

負荷予測部１１３４は予測対象の時間帯が属する属性に対応する総アクセス数を負荷履歴保持部１１１２から取得してその時間帯に対する予測総アクセス数とする。
例えば、曜日がキーとして設定された場合、負荷予測部１１３４は負荷履歴保持部１１１２を参照し、曜日毎の総アクセス数の平均値を演算する。負荷予測部１１３４は、これらの平均値のうち予測対象の時間帯の曜日にマッチする曜日の平均値を予測総アクセス数とする。また、例えば平日休日の別がキーとして設定された場合、負荷予測部１１３４は負荷履歴保持部１１１２を参照し、平日の総アクセス数の平均値と休日の総アクセス数の平均値を演算する。負荷予測部１１３４は、これらの平均値のうち予測対象の時間帯が平日であれば平日の、休日であれば休日の平均値を予測総アクセス数とする。
なお、ここでは一例として平均値を用いたが、代表値などの他の統計的なパラメータであってもよい。 The load predicting unit 1134 obtains the total number of accesses corresponding to the attribute to which the prediction target time zone belongs from the load history holding unit 1112 and sets the total access number for the time zone.
For example, when a day of the week is set as a key, the load prediction unit 1134 refers to the load history holding unit 1112 and calculates an average value of the total number of accesses for each day of the week. Of these average values, the load prediction unit 1134 sets the average value of the day of the week that matches the day of the week to be predicted as the predicted total access count. Also, for example, when the type of weekday holiday is set as a key, the load prediction unit 1134 refers to the load history holding unit 1112 and calculates the average value of the total number of accesses on weekdays and the average value of the total number of accesses on holidays. The load predicting unit 1134 uses the average value of these average values as the predicted total access number on weekdays if the time zone to be predicted is a weekday and on holidays if it is a holiday.
Note that although the average value is used here as an example, other statistical parameters such as a representative value may be used.

負荷予測部１１３４は予測総アクセス数を乖離判定部１１５０および状態設定部１１４０の負荷比較部１１４２に渡す。 The load prediction unit 1134 passes the predicted total number of accesses to the deviation determination unit 1150 and the load comparison unit 1142 of the state setting unit 1140.

状態設定部１１４０は、予測総アクセス数が所定のアクセスしきい値Ｍｏより少ない場合、予測対象の時間帯において情報処理装置１０２０に含まれる少なくともひとつのサーバ群１０２２を省電力状態に設定する。 When the predicted total number of accesses is smaller than the predetermined access threshold Mo, the state setting unit 1140 sets at least one server group 1022 included in the information processing apparatus 1020 in the power saving state in the prediction target time zone.

状態設定部１１４０におけるモアクセスしきい値Ｍｏは、情報処理装置１０２０の性能が落ちない範囲に設定される。ここで性能とは、例えばどれだけのアクセス数をどの程度の速さで処理できるかということである。あるいは性能とは、ひとつのアクセスが処理されるのにかかる時間などのレスポンスタイムであってもよい。また、情報処理装置１０２０の性能が落ちる、とは、例えばあるサーバ群１０２２に対してピークアクセス数を越える数のアクセスが割り当てられ、その結果そのサーバ群１０２２の処理速度が落ちることにより情報処理装置１０２０全体のアクセスの処理速度が落ちることである。 The mo access threshold value Mo in the state setting unit 1140 is set in a range where the performance of the information processing apparatus 1020 does not deteriorate. Here, the performance means, for example, how many accesses can be processed at what speed. Alternatively, the performance may be a response time such as a time taken to process one access. Further, the performance of the information processing apparatus 1020 is reduced. For example, the number of accesses exceeding the peak access number is assigned to a certain server group 1022, and as a result, the processing speed of the server group 1022 is reduced. That is, the access processing speed of the entire 1020 decreases.

例えば第１サーバ群１０２２ａから第５サーバ群１０２２ｅのピークアクセス数が全て２０００であるとする。この場合アクセスしきい値Ｍｏを９０００に設定すると、予測総アクセス数が８５００であっても予測対象の時間帯で少なくともひとつのサーバ群を省電力状態に設定しなくてはならない。ここでは第５サーバ群１０２２ｅを省電力状態に設定したとする。残りの４つのサーバ群１０２２ａ〜２２ｄのトータルのピークアクセス数は８０００であり、予測総アクセス数９５００よりも少ない。したがってこの場合、予測対象の時間帯が到来したときに、残りの４つのサーバ群１０２２ａ〜２２ｄのうちの少なくともひとつのサーバ群がピークアクセス数以上のアクセスを処理しなくてはならなくなる可能性が高くなり、その場合そのサーバ群の処理速度は低下する。これにより情報処理装置１０２０全体の処理速度が落ちる可能性がある。このような状況を避けるために、アクセスしきい値Ｍｏは情報処理装置１０２０の性能が落ちない範囲に設定される。上述の例ではアクセスしきい値Ｍｏは８０００以下に設定されればよい。 For example, it is assumed that the peak access counts of the first server group 1022a to the fifth server group 1022e are all 2000. In this case, if the access threshold Mo is set to 9000, at least one server group must be set in the power saving state in the prediction target time zone even if the predicted total number of accesses is 8500. Here, it is assumed that the fifth server group 1022e is set to the power saving state. The total peak access number of the remaining four server groups 1022a to 22d is 8000, which is smaller than the predicted total access number 9500. Therefore, in this case, when the time zone to be predicted arrives, there is a possibility that at least one server group among the remaining four server groups 1022a to 22d has to process accesses exceeding the peak access number. In this case, the processing speed of the server group decreases. As a result, the processing speed of the entire information processing apparatus 1020 may decrease. In order to avoid such a situation, the access threshold value Mo is set in a range where the performance of the information processing apparatus 1020 does not deteriorate. In the above example, the access threshold Mo may be set to 8000 or less.

状態設定部１１４０は、サーバ群１０２２の最大性能を発揮せしめる前提で、アクセスを処理させるサーバ群を決定し、予測対象の時間帯において残りのサーバ群を省電力状態に設定する。この場合サーバ群１０２２の最大性能を発揮せしめる、とは、例えばサーバ群１０２２にピークアクセス数でアクセス処理を行わせることであり、言い換えるとサーバ群を１００％の稼働率で使用することである。さらに状態設定部１１４０は、予測総アクセス数の変動により情報処理装置１０２０の性能が落ちると予測される場合には、予測対象の時間帯において省電力状態に設定されている少なくともひとつのサーバ群を稼動状態に設定する。 The state setting unit 1140 determines a server group to process access on the premise that the maximum performance of the server group 1022 is exhibited, and sets the remaining server group to the power saving state in the prediction target time zone. In this case, maximizing the performance of the server group 1022 is, for example, causing the server group 1022 to perform access processing with the peak number of accesses, in other words, using the server group at an operation rate of 100%. Further, when it is predicted that the performance of the information processing apparatus 1020 will deteriorate due to a change in the predicted total number of accesses, the state setting unit 1140 selects at least one server group that is set in the power saving state in the prediction target time zone. Set to operational state.

サーバ群１０２２を省電力状態または稼動状態に設定することに関して、状態設定部１１４０では、予測対象の時間帯において稼動状態にするサーバ群の数に応じた予測総アクセス数の範囲が定められている。状態設定部１１４０は例えば予測総アクセス数が０から第１しきい値Ｔ１の範囲にあれば予測対象の時間帯においてひとつのサーバ群のみを稼動状態とし、他のサーバ群を省電力状態とする。表１は、状態設定部１１４０における状態設定に関して、稼動状態とするサーバ群の数と、ＯＳ休眠状態とするサーバ群の数と、電源オフ状態とするサーバ群の数と、予測総アクセス数の範囲と、の関係を示す。Ｔ２は第２しきい値、Ｔ３は第３しきい値であり、Ｔ１＜Ｔ２＜Ｔ３である。個々のしきい値は予め情報処理システム２の管理者によって設定される。

Regarding the setting of the server group 1022 to the power saving state or the operating state, the state setting unit 1140 defines a range of the predicted total number of accesses according to the number of server groups to be in the operating state in the prediction target time zone. . For example, if the predicted total number of accesses is in the range from 0 to the first threshold value T1, the state setting unit 1140 sets only one server group to the operating state and sets the other server groups to the power saving state in the prediction target time zone. . Table 1 shows the number of server groups that are in an active state, the number of server groups that are in an OS sleep state, the number of server groups that are in a power-off state, and the predicted total number of accesses for the state setting in the state setting unit 1140 The relationship with the range is shown. T2 is a second threshold value, T3 is a third threshold value, and T1 <T2 <T3. Individual threshold values are set in advance by an administrator of the information processing system 2.

第１しきい値Ｔ１、第２しきい値Ｔ２、第３しきい値Ｔ３はそれぞれサーバ群１０２２を１００％の稼働率で使用することを前提に設定される。つまり上述の第１サーバ群１０２２ａから第５サーバ群１０２２ｅのピークアクセス数が全て２０００であるとする例では、Ｔ１＝２０００、Ｔ２＝４０００、Ｔ３＝６０００である。この場合、予測総アクセス数のアクセスを処理するのに必要最低限の数のサーバ群が予測対象の時間帯において稼動状態とされる。また、例えば第１しきい値Ｔ１と第２しきい値Ｔ２との間にあった予測総アクセス数が増大して第２しきい値Ｔ２を越えた場合、そのままだと予測対象の時間帯において少なくともひとつのサーバ群の稼働率が１００％を上回ると予測されるので、予測対象の時間帯において稼動状態とするサーバ群の数をひとつ増やして情報処理装置１０２０の処理速度の低下を回避する。 The first threshold value T1, the second threshold value T2, and the third threshold value T3 are set on the assumption that the server group 1022 is used at an operation rate of 100%. That is, in the example in which all the peak access numbers of the first server group 1022a to the fifth server group 1022e are 2000, T1 = 2000, T2 = 4000, and T3 = 6000. In this case, the minimum number of server groups necessary for processing the access of the predicted total access count is set in the operating state in the prediction target time zone. For example, when the predicted total number of accesses between the first threshold value T1 and the second threshold value T2 increases and exceeds the second threshold value T2, if it remains as it is, at least one in the prediction target time zone. Because the operating rate of the server group is predicted to exceed 100%, the number of server groups that are in the operating state in the prediction target time zone is increased by one to avoid a decrease in the processing speed of the information processing apparatus 1020.

状態設定部１１４０は、負荷比較部１１４２と、稼動サーバ群決定部１１４４と、状態信号生成部１１４６と、を含む。
負荷比較部１１４２は、負荷予測部１１３４から取得した予測総アクセス数と、第１しきい値Ｔ１、第２しきい値Ｔ２、第３しきい値Ｔ３、アクセスしきい値Ｍｏとの大小関係を判別する。この大小関係は例えば「Ｔ２＜予測総アクセス数＜Ｔ３」という情報である。負荷比較部１１４２はこの大小関係に関する情報を稼動サーバ群決定部１１４４に渡す。 The state setting unit 1140 includes a load comparison unit 1142, an active server group determination unit 1144, and a state signal generation unit 1146.
The load comparison unit 1142 determines the magnitude relationship between the predicted total number of accesses acquired from the load prediction unit 1134 and the first threshold value T1, the second threshold value T2, the third threshold value T3, and the access threshold value Mo. Determine. This magnitude relationship is, for example, information “T2 <predicted total access count <T3”. The load comparison unit 1142 passes information on the magnitude relationship to the active server group determination unit 1144.

稼動サーバ群決定部１１４４は、この情報を基に表１のストラテジにしたがい、次の時間帯における状態の切替の必要性を判定する。稼動サーバ群決定部１１４４は、状態の切替が必要な場合には、指標算出部１００から取得する指標に基づいて、稼動状態とするサーバ群とＯＳ休眠状態とするサーバ群と電源オフ状態とするサーバ群を決定する。稼動サーバ群決定部１１４４は次の時間帯の到来に合わせて、この決定に基づき第２サーバ群状態保持部１１１４およびロードバランサ１０３０の第１サーバ群状態保持部１３２２を更新する。稼動サーバ群決定部１１４４は、次の時間帯の到来に合わせて、状態の切り替えが必要なサーバ群の情報を状態信号生成部１１４６に渡す。稼動サーバ群決定部１１４４は、状態の切り替えが必要ない場合には処理を中断または終了し、次の情報を待ち受ける。
第２サーバ群状態保持部１１１４は図５に示される第１サーバ群状態保持部１３２２と同様のテーブルである。なお、運用管理装置１０１０またはロードバランサ１０３０のいずれか一方がサーバ群状態テーブルを備え、そのテーブルを備えない方が備える方のテーブルを参照する構成としてもよい。 Based on this information, the active server group determination unit 1144 determines the necessity of switching the state in the next time zone according to the strategy shown in Table 1. When it is necessary to switch the state, the active server group determination unit 1144 sets the server group to be in the active state, the server group to be in the OS sleep state, and the power-off state based on the index acquired from the index calculation unit 100. Determine the server group. The active server group determination unit 1144 updates the second server group state holding unit 1114 and the first server group state holding unit 1322 of the load balancer 1030 based on this determination in accordance with the arrival of the next time zone. The active server group determination unit 1144 passes information on a server group that needs to be switched to the state signal generation unit 1146 when the next time zone arrives. The active server group determination unit 1144 suspends or terminates the process when it is not necessary to switch the state, and waits for the next information.
The second server group state holding unit 1114 is a table similar to the first server group state holding unit 1322 shown in FIG. Note that either the operation management apparatus 1010 or the load balancer 1030 may include a server group state table and refer to a table provided by a person who does not include the table.

稼動サーバ群決定部１１４４は、表１のストラテジから稼動状態、ＯＳ休眠状態および電源オフ状態とするサーバ群の数をまず決める。次に稼動サーバ群決定部１１４４は第２サーバ群状態保持部１１１４を参照し、次の時間帯でサーバ群の状態を切り替える必要があるか、言い換えると負荷予測部１１３４が予測した次の時間帯での予測総アクセス数に対応する各状態のサーバ群の数と第２サーバ群状態保持部１１１４に登録されている現在の各状態のサーバ群の数とが一致するか否かを判断する。そこで一致する場合は稼動サーバ群決定部１１４４は処理を中断または終了する。 The active server group determination unit 1144 first determines the number of server groups to be set in the active state, the OS sleep state, and the power-off state from the strategy shown in Table 1. Next, the active server group determination unit 1144 refers to the second server group state holding unit 1114, and it is necessary to switch the state of the server group in the next time zone. In other words, the next time zone predicted by the load prediction unit 1134 It is determined whether or not the number of server groups in each state corresponding to the predicted total number of accesses in the server matches the number of server groups in each current state registered in the second server group state holding unit 1114. If they match, the active server group determination unit 1144 interrupts or ends the processing.

一致しない場合は、稼動サーバ群決定部１１４４は次の時間帯でそれぞれの状態にするサーバ群を決める。稼動サーバ群決定部１１４４は、指標算出部１００の第３指標保持部１５８に保持されるデータを取得する。稼動サーバ群決定部１１４４は、取得したデータを基に各サーバ群１０２２に対して省電力状態とされる順番を示す省エネ停止可能順位を設定する。省エネ停止可能順位は「１」が最も高く、数が増えるにつれて低くなる順位である。
稼動サーバ群決定部１１４４では、第３指標保持部１５８に保持されるフロントエンドサーバ１０２４の平均消費電力、最高消費電力、電力コストを、そのフロントエンドサーバ１０２４が属するサーバ群１０２２の平均消費電力、最高消費電力、電力コストのそれぞれとみなす。 If they do not match, the active server group determination unit 1144 determines a server group to be in each state in the next time zone. The active server group determination unit 1144 acquires data held in the third index holding unit 158 of the index calculation unit 100. The active server group determination unit 1144 sets an energy saving stoppable order indicating the order in which the power saving state is set for each server group 1022 based on the acquired data. The order of possible energy saving stops is “1”, which is the highest and decreases as the number increases.
In the active server group determination unit 1144, the average power consumption, the maximum power consumption, and the power cost of the front-end server 1024 held in the third index holding unit 158 are changed to the average power consumption of the server group 1022 to which the front-end server 1024 belongs, It is considered as the maximum power consumption and power cost.

表２は、サーバ群１０２２の電力コストに基づき省エネ停止可能順位を定める場合の、サーバ群名と平均消費電力と最高消費電力と電力コストと省エネ停止可能順位との関係を示す。

表２のストラテジでは、サーバ群１０２２の電力コストが大きいほど優先的にそのサーバ群１０２２が省電力状態とされるように省エネ停止可能順位が設定される。電力コストが遜色ない２つのサーバ群が存在する場合は、平均消費電力や最高消費電力が高いほうのサーバ群の省エネ停止可能順位をより高く設定する（表２の第４サーバ群と第５サーバ群）。 Table 2 shows the relationship among the server group name, the average power consumption, the maximum power consumption, the power cost, and the energy saving stoppable order when the energy saving stoppable order is determined based on the power cost of the server group 1022.

In the strategy of Table 2, the energy saving stoppage priority order is set so that the higher the power cost of the server group 1022 is, the more preferentially the server group 1022 is put into the power saving state. If there are two server groups with similar power costs, the higher energy saving stoppage order is set for the server group with the higher average power consumption or maximum power consumption (the fourth server group and the fifth server in Table 2). group).

表３は、サーバ群１０２２の消費電力に基づき省エネ停止可能順位を定める場合の、サーバ群名と平均消費電力と最高消費電力と電力コストと省エネ停止可能順位との関係を示す。

表３のストラテジでは、サーバ群１０２２の平均消費電力と最高消費電力との和が大きいほど優先的にそのサーバ群１０２２が省電力状態とされるように省エネ停止可能順位が設定される。
情報処理装置１０２０において各サーバ群１０２２が主にＷＥＢ通信をおこなっており他の用途の処理は無視できる程度であり、またロードバランサ１０３０において稼動状態にあるサーバ群にアクセスが均等に割り当てられる設定となっている場合は、稼動状態にあるサーバ群には同じ負荷がかけられれていると考えることができる。したがって、稼動状態にある各サーバ群の平均消費電力は、同じ負荷がかけられれている場合に消費される電力の平均値と同一視することができ、また、稼動状態にある各サーバ群の最高消費電力は、同じ負荷がかけられれている場合に消費される電力の最高値と同一視することができる。この場合、表３のストラテジでは、サーバ群１０２２に同じ負荷がかけられた場合に消費される電力が大きいほど優先的にそのサーバ群１０２２が省電力状態とされるように省エネ停止可能順位が設定されているといえる。 Table 3 shows the relationship among the server group name, the average power consumption, the maximum power consumption, the power cost, and the energy saving stoppable order when the energy saving stoppable order is determined based on the power consumption of the server group 1022.

In the strategy of Table 3, the energy saving stoppage priority order is set so that the server group 1022 is preferentially put into the power saving state as the sum of the average power consumption and the maximum power consumption of the server group 1022 increases.
In the information processing apparatus 1020, each server group 1022 mainly performs WEB communication, and processing for other uses is negligible. In addition, the load balancer 1030 is configured so that access is evenly allocated to the server group in the operating state. In this case, it can be considered that the same load is applied to the server group in the operating state. Therefore, the average power consumption of each server group in the operating state can be equated with the average value of the power consumed when the same load is applied, and the maximum power of each server group in the operating state is the same. The power consumption can be equated with the highest value of power consumed when the same load is applied. In this case, according to the strategy shown in Table 3, the energy saving stoppage priority order is set so that the larger the power consumed when the same load is applied to the server group 1022, the higher the power consumption of the server group 1022 is. It can be said that.

稼動サーバ群決定部１１４４は、稼動状態にあるサーバ群のひとつを省電力状態に切り替える必要がある場合、稼動状態にあるサーバ群のうち表２または表３で設定される省エネ停止可能順位が最も高いサーバ群を省電力状態とすべきサーバ群として選択する。
稼動サーバ群決定部１１４４は、省電力状態にあるサーバ群のひとつを稼動状態に切り替える必要がある場合、省電力状態にあるサーバ群のうち表２または表３で設定される省エネ停止可能順位が最も低いサーバ群を稼動状態とすべきサーバ群として選択する。 When one of the server groups in the operating state needs to be switched to the power saving state, the active server group determining unit 1144 has the highest order of possible energy saving stoppages set in Table 2 or Table 3 among the server groups in the operating state. A high server group is selected as a server group that should be in a power saving state.
When it is necessary to switch one of the server groups in the power saving state to the operating state, the active server group determination unit 1144 has the energy saving stop possibility order set in Table 2 or Table 3 among the server groups in the power saving state. The lowest server group is selected as the server group that should be in operation.

なお、稼動サーバ群決定部１１４４でのサーバ群１０２２を決める上述のアルゴリズムでは、特に稼動状態を省電力状態に切り替える場合は、アクセスの同一性が考慮されてもよい。例えば、稼動サーバ群決定部１１４４は、稼動状態を省電力状態に切り替える場合、次の時間帯の到来に合わせて第２サーバ群状態保持部１１１４および第１サーバ群状態保持部１３２２を更新して省電力状態に切り替えるべきサーバ群への新規アクセスの割り当てを制限する一方、ロードバランサ１０３０の接続保持部１３２４を参照し、省電力状態に切り替えるべきサーバ群へのアクセスがなくなったことを確認してから状態の切り替えが必要なサーバ群の情報を状態信号生成部１１４６に渡してもよい。これによりアクセスの同一性が保証されうる。 In the above-described algorithm for determining the server group 1022 in the active server group determination unit 1144, access identity may be taken into consideration particularly when the operating state is switched to the power saving state. For example, when switching the operating state to the power saving state, the active server group determining unit 1144 updates the second server group state holding unit 1114 and the first server group state holding unit 1322 as the next time zone arrives. While restricting assignment of new access to the server group that should be switched to the power saving state, refer to the connection holding unit 1324 of the load balancer 1030 and confirm that there is no access to the server group that should be switched to the power saving state. The information on the server group that needs to be switched from one state to another may be passed to the state signal generation unit 1146. As a result, access identity can be guaranteed.

状態信号生成部１１４６は内部ネットワーク３を介して第１サーバ群１０２２ａ〜第５サーバ群１０２２ｅに接続される。
状態信号生成部１１４６は、状態の切り替えが必要なサーバ群の情報に基づきそのサーバ群に対して切替に対応する休眠導入信号、休眠解除信号、電源オフ信号、および電源オン信号のうちのいずれかを送る。例えば第３サーバ群１０２２ｃを稼動状態（ＯＳ休眠状態）からＯＳ休眠状態（稼動状態）とする必要がある場合、状態信号生成部１１４６は第３サーバ群１０２２ｃに対して休眠導入信号（休眠解除信号）を送出する。また、第３サーバ群１０２２ｃを稼動状態（電源オフ状態）から電源オフ状態（稼動状態）とする必要がある場合、状態信号生成部１１４６は第３サーバ群１０２２ｃに対して電源オフ信号（電源オン信号）を送出する。
状態信号生成部１１４６によって省電力状態から稼動状態に設定されたサーバ群は、それが稼動状態であることが稼動サーバ群決定部１１４４によってロードバランサ１０３０の第１サーバ群状態保持部１３２２に記録されるので、ロードバランサ１０３０の要求割当部１０３６によって新規のアクセスが割り当てられる。 The status signal generator 1146 is connected to the first server group 1022a to the fifth server group 1022e via the internal network 3.
The state signal generation unit 1146 is one of a dormancy introduction signal, a dormancy release signal, a power off signal, and a power on signal corresponding to switching for the server group based on the information of the server group that needs to be switched. Send. For example, when it is necessary to change the third server group 1022c from the operating state (OS sleep state) to the OS sleep state (operating state), the state signal generation unit 1146 sends a sleep introduction signal (sleep release signal) to the third server group 1022c. ). Further, when it is necessary to change the third server group 1022c from the operating state (power off state) to the power off state (operating state), the state signal generation unit 1146 sends a power off signal (power on) to the third server group 1022c. Signal).
The server group set from the power saving state to the operating state by the status signal generation unit 1146 is recorded in the first server group state holding unit 1322 of the load balancer 1030 by the operating server group determination unit 1144 as being in the operating state. Therefore, a new access is allocated by the request allocation unit 1036 of the load balancer 1030.

運用管理装置１０１０は、予測総アクセス数を基に状態設定部１１４０でサーバ群１０２２の状態を設定する上述の負荷予測モードの他に、第１指標保持部１０８に保持される総アクセス数を基に状態設定部１１４０でサーバ群１０２２の状態を適応的に設定する適応モードを有する。以下、この適応モードについて説明する。 The operation management apparatus 1010 is based on the total number of accesses held in the first index holding unit 108 in addition to the above-described load prediction mode in which the state setting unit 1140 sets the state of the server group 1022 based on the predicted total number of accesses. The state setting unit 1140 has an adaptive mode in which the state of the server group 1022 is adaptively set. Hereinafter, this adaptive mode will be described.

乖離判定部１１５０は、予測された総アクセス数と実際の総アクセス数との差が大きい、つまりその差の絶対値が所定の乖離値よりも大きい場合、状態設定部１１４０に実際の総アクセス数に基づいてサーバ群の状態を設定せしめる。
乖離判定部１１５０は、第１指標保持部１０８から現在の時間帯に対応する総アクセス数を取得する。また乖離判定部１１５０は負荷予測部１１３４から現在の時間帯に対して予測された予測総アクセス数も取得する。そして乖離判定部１１５０は、両者の差を演算する。その差の絶対値が乖離値より大きい場合は、乖離判定部１１５０は状態設定部１１４０の負荷比較部１１４２に、予測総アクセス数の代わりに第１指標保持部１０８からの総アクセス数を使用させる。この場合負荷比較部１１４２は、第１指標保持部１０８からの総アクセス数と、第１しきい値Ｔ１、第２しきい値Ｔ２、第３しきい値Ｔ３、アクセスしきい値Ｍｏとの大小関係を判別する。状態設定部１１４０はかかる大小関係を使用して上述の処理と同様の処理を行う。これにより状態設定部１１４０は、予測された総アクセス数と実際の総アクセス数との差が大きい場合に、実際の総アクセス数に応じて適応的にサーバ群１０２２の状態を設定することができる。 When the difference between the predicted total number of accesses and the actual total number of accesses is large, that is, when the absolute value of the difference is larger than a predetermined divergence value, the divergence determination unit 1150 indicates to the state setting unit 1140 the actual total number of accesses. Let the server group state be set based on
The deviation determination unit 1150 acquires the total number of accesses corresponding to the current time zone from the first index holding unit 108. The divergence determination unit 1150 also acquires the predicted total number of accesses predicted for the current time zone from the load prediction unit 1134. Then, the deviation determination unit 1150 calculates the difference between the two. When the absolute value of the difference is larger than the deviation value, the deviation determination unit 1150 causes the load comparison unit 1142 of the state setting unit 1140 to use the total number of accesses from the first index holding unit 108 instead of the predicted total number of accesses. . In this case, the load comparison unit 1142 compares the total number of accesses from the first index holding unit 108 with the first threshold value T1, the second threshold value T2, the third threshold value T3, and the access threshold value Mo. Determine the relationship. The state setting unit 1140 performs processing similar to the above-described processing using the magnitude relationship. Thereby, the state setting unit 1140 can adaptively set the state of the server group 1022 according to the actual total number of accesses when the difference between the predicted total number of accesses and the actual total number of accesses is large. .

また、運用管理装置１０１０は、情報処理システム２の管理者がマニュアルでサーバ群１０２２の状態を設定できるマニュアル設定モードも有する。
オーバライド部１１６０は、運用管理装置１０１０に付随するキーボードなどの入力装置１０１２から管理者によるマニュアル設定を受け付ける。オーバライド部１１６０はこのマニュアル設定を受け付けると状態設定部１１４０に、このマニュアル設定に基づいてサーバ群１０２２の状態を設定せしめる。これにより急激な使用環境の変化にも対応可能となる。 The operation management apparatus 1010 also has a manual setting mode in which the administrator of the information processing system 2 can set the state of the server group 1022 manually.
The override unit 1160 receives a manual setting by an administrator from an input device 1012 such as a keyboard attached to the operation management device 1010. When the override unit 1160 receives this manual setting, the override unit 1160 causes the state setting unit 1140 to set the state of the server group 1022 based on the manual setting. This makes it possible to cope with sudden changes in the usage environment.

表示制御部１２０は、第１指標保持部１０８から現在の時間帯における総アクセス数を、第２サーバ群状態保持部１１１４から現在のサーバ群１０２２の状態を、取得する。表示制御部１２０は、運用管理装置１０１０に付随するモニタ１４０に総アクセス数とサーバ群１０２２の状態とを示すステータス画面１４００ａ〜１４００ｄを表示させる。また、表示制御部１２０は、乖離判定部１１５０において予測された総アクセス数と実際の総アクセス数との差の絶対値が乖離値よりも大きいと判断された場合、モニタ１４０に警告画面１４０２を表示させる。また、表示制御部１２０は、管理者によるマニュアル設定のための状態設定画面１４０４をモニタ１４０に表示させる。 The display control unit 120 acquires the total number of accesses in the current time zone from the first index holding unit 108 and the current state of the server group 1022 from the second server group state holding unit 1114. The display control unit 120 displays status screens 1400 a to 1400 d indicating the total number of accesses and the state of the server group 1022 on the monitor 140 associated with the operation management apparatus 1010. When the absolute value of the difference between the total number of accesses predicted by the divergence determination unit 1150 and the actual total number of accesses is determined to be larger than the divergence value, the display control unit 120 displays a warning screen 1402 on the monitor 140. Display. Further, the display control unit 120 causes the monitor 140 to display a state setting screen 1404 for manual setting by the administrator.

図１５（ａ）〜（ｄ）は、ステータス画面１４００ａ〜１４００ｄの代表画面図である。ここでは、第１サーバ群１０２２ａから第５サーバ群１０２２ｅのピークアクセス数が全て２０００であるとする。また、稼動サーバ群決定部１１４４では表２のストラテジが使用され、サーバ群選択部１３６４では傾斜配分モードが使用される場合を考える。図１５（ａ）〜図１５（ｄ）はそれぞれ現在の時間帯における総アクセス数が１６００、２８００、４４００、７０００の場合に対応する。
図１５（ａ）は、現在の時間帯における総アクセス数が１６００の場合に対応するステータス画面１４００ａの代表画面図である。ステータス画面１４００ａは、総アクセス数領域１４０６と、グラフ１４０８と、オーバライドボタン１４１０と、を含む。総アクセス数領域１４０６は、現在の時間帯における総アクセス数を示す。グラフ１４０８は、各サーバ群１０２２の稼働率を示す。グラフ１４０８において「△」はＯＳ休眠状態を示し、「×」は電源オフ状態を示す。オーバライドボタン１４１０は、押し下げられるとオーバライド部１１６０をトリガする。オーバライドボタン１４１０が押し下げられると表示制御部１２０は状態設定画面１４０４をモニタ１４０に表示させる。図１５（ｂ）〜図１５（ｄ）についても同様である。 15A to 15D are representative screen diagrams of the status screens 1400a to 1400d. Here, it is assumed that the peak access counts of the first server group 1022a to the fifth server group 1022e are all 2000. Further, consider a case where the strategy shown in Table 2 is used in the active server group determination unit 1144 and the inclination distribution mode is used in the server group selection unit 1364. FIGS. 15A to 15D correspond to cases where the total number of accesses in the current time zone is 1600, 2800, 4400, and 7000, respectively.
FIG. 15A is a representative screen diagram of the status screen 1400a corresponding to the case where the total number of accesses in the current time zone is 1600. The status screen 1400a includes a total access number area 1406, a graph 1408, and an override button 1410. The total access number area 1406 indicates the total number of accesses in the current time zone. A graph 1408 indicates the operation rate of each server group 1022. In the graph 1408, “Δ” indicates an OS sleep state, and “X” indicates a power-off state. Override button 1410 triggers override portion 1160 when depressed. When the override button 1410 is pressed, the display control unit 120 displays a state setting screen 1404 on the monitor 140. The same applies to FIGS. 15B to 15D.

図１６は、警告画面１４０２の代表画面図である。
図１７は、状態設定画面１４０４の代表画面図である。状態設定画面１４０４は、設定領域１４１２と、設定ボタン１４１４と、を含む。状態設定画面１４０４を開いた直後の状態では、設定領域１４１２には各サーバ群の現在の状態がラジオボタン方式で示されている。設定領域１４１２では第１サーバ群１０２２ａ〜第５サーバ群１０２２ｅはそれぞれサーバ群Ａ〜サーバ群Ｅという名称で表示されている。管理者はマウスなどの入力装置１０１２を使用して各サーバ群の状態を選択し、設定ボタン１４１４を押し下げる。すると設定領域１４１２に設定された各サーバ群１０２２の状態が所望のマニュアル設定としてオーバライド部１１６０に送られる。これにより、第１サーバ群１０２２ａ〜第５サーバ群１０２２ｅのそれぞれを稼動状態とするか、ＯＳ休眠状態とするか、または電源オフ状態とするかを設定できる。 FIG. 16 is a representative screen diagram of the warning screen 1402.
FIG. 17 is a representative screen diagram of the state setting screen 1404. The state setting screen 1404 includes a setting area 1412 and a setting button 1414. In the state immediately after the state setting screen 1404 is opened, the current state of each server group is indicated in the setting area 1412 by a radio button method. In the setting area 1412, the first server group 1022a to the fifth server group 1022e are displayed with the names server group A to server group E, respectively. The administrator uses the input device 1012 such as a mouse to select the state of each server group and presses the setting button 1414. Then, the state of each server group 1022 set in the setting area 1412 is sent to the override unit 1160 as a desired manual setting. Accordingly, it is possible to set whether each of the first server group 1022a to the fifth server group 1022e is in an operating state, an OS sleep state, or a power-off state.

上述のロードバランサ１０３０や運用管理装置１０１０において、保持部の例は、ハードディスクやメモリである。また、本明細書の記載に基づき、各部を、図示しないＣＰＵや、インストールされたアプリケーションプログラムのモジュールや、システムプログラムのモジュールや、ハードディスクから読み出したデータの内容を一時的に記憶するメモリなどにより実現できることは本明細書に触れた当業者には理解されるところである。 In the load balancer 1030 and the operation management apparatus 1010 described above, examples of the holding unit are a hard disk and a memory. Based on the description in this specification, each unit is realized by a CPU (not shown), an installed application program module, a system program module, a memory that temporarily stores the contents of data read from the hard disk, and the like. It is understood by those skilled in the art who have touched this specification that they can do this.

以上の構成による運用管理装置１０１０の動作を説明する。図１８は、指標算出部１００における一連の処理を示すフローチャートである。傍受部１２２は、ネットワークからパケットを傍受する（Ｓ４０２）。抽出部１２４は、傍受されたパケットから始点アドレスとポート番号とを抽出する（Ｓ４０４）。登録部１２８は、抽出された始点アドレスとポート番号とパケットのデータ量とを通信情報保持部１０４に登録する（Ｓ４０６）。分類部１３２は、通信情報保持部１０４に保持されるデータを始点アドレスでソーティングする（Ｓ４０８）。分類部１３２は、ソーティングされたデータを本来用途と非本来用途とに分類する（Ｓ４１０）。集計部１３０は、分類されたデータを集計する（Ｓ４１２）。第２解析部１５２は、集計結果から各サーバ群１０２２の電力コストを算出する（Ｓ４１４）。 The operation of the operation management apparatus 1010 having the above configuration will be described. FIG. 18 is a flowchart showing a series of processing in the index calculation unit 100. The intercepting unit 122 intercepts the packet from the network (S402). The extraction unit 124 extracts the start point address and the port number from the intercepted packet (S404). The registration unit 128 registers the extracted start point address, port number, and packet data amount in the communication information holding unit 104 (S406). The classification unit 132 sorts the data held in the communication information holding unit 104 by the start point address (S408). The classification unit 132 classifies the sorted data into an original use and a non-original use (S410). The tabulation unit 130 tabulates the classified data (S412). The second analysis unit 152 calculates the power cost of each server group 1022 from the aggregation result (S414).

図１９は、学習部１１３０および状態設定部１１４０における一連の処理を時系列に沿って示すチャートである。更新部１１３２が第１指標保持部１０８から総アクセス数を取得する時間間隔は１５分間隔である場合を考える。２００９年９月１６日の９：１５に、更新部１１３２は第１指標保持部１０８から総アクセス数を取得する（Ｓ５０２）。更新部１１３２は、取得された総アクセス数をもとに負荷履歴保持部１１１２を更新する（Ｓ５０４）。一方、２００９年９月１６日の９：１５−９：３０の時間帯内でステップＳ５０４の負荷履歴保持部１１１２更新と重ならないときに、負荷予測部１１３４は負荷履歴保持部１１１２を参照して次の時間帯（１００９：３０−９：４５）の総アクセス数を予測する（Ｓ５０６）。状態設定部１１４０は、予測総アクセス数とアクセスしきい値Ｍｏ、第１しきい値Ｔ１、第２しきい値Ｔ２、第３しきい値Ｔ３との大小比較を行う（Ｓ５０８）。状態設定部１１４０は、その大小比較を基に次の時間帯においてサーバ群１０２２の状態の切替が必要か否かを判断する（Ｓ５１０）。状態の切替が必要でない場合（Ｓ５１０のＮ）、次の時間帯が到来してもサーバ群１０２２の状態の設定は行わない。状態の切替が必要な場合（Ｓ５１０のＹ）、状態設定部１１４０は省エネ停止可能順位に基づき、次の時間帯におけるサーバ群１０２２の状態を決定する（Ｓ５１２）。次の時間帯が到来すると、つまり２００９年９月１６日の９：３０になると、状態設定部１１４０はサーバ群の状態をステップＳ５１２で決定されたように設定する。また、状態設定部１１４０は平行して第２サーバ群状態保持部１１１４およびロードバランサ１０３０の第１サーバ群状態保持部１３２２を更新する（Ｓ５１４）。運用管理装置１０１０はこの処理を時間帯単位で繰り返す。 FIG. 19 is a chart showing a series of processes in the learning unit 1130 and the state setting unit 1140 in time series. Consider a case in which the time interval for the update unit 1132 to acquire the total number of accesses from the first index holding unit 108 is 15 minutes. At 9:15 on September 16, 2009, the update unit 1132 acquires the total number of accesses from the first index holding unit 108 (S502). The updating unit 1132 updates the load history holding unit 1112 based on the acquired total number of accesses (S504). On the other hand, the load prediction unit 1134 refers to the load history holding unit 1112 when it does not overlap with the update of the load history holding unit 1112 in step S504 within the time zone of 9: 15-9: 30 on September 16, 2009. The total number of accesses in the next time zone (1009: 30-9: 45) is predicted (S506). The state setting unit 1140 compares the predicted total number of accesses with the access threshold value Mo, the first threshold value T1, the second threshold value T2, and the third threshold value T3 (S508). The state setting unit 1140 determines whether or not it is necessary to switch the state of the server group 1022 in the next time zone based on the size comparison (S510). When switching of the state is not necessary (N in S510), the state of the server group 1022 is not set even when the next time zone comes. When the state needs to be switched (Y in S510), the state setting unit 1140 determines the state of the server group 1022 in the next time zone based on the energy-saving stoppage priority order (S512). When the next time zone arrives, that is, at 9:30 on September 16, 2009, the state setting unit 1140 sets the state of the server group as determined in step S512. In addition, the state setting unit 1140 updates the second server group state holding unit 1114 and the first server group state holding unit 1322 of the load balancer 1030 in parallel (S514). The operation management apparatus 1010 repeats this process for each time zone.

本実施の形態に係る運用管理装置１０１０によると、データセンタなどの情報処理システム２が使用される目的、例えば証券業務や検索エンジンなど、に応じた総アクセス数の傾向を把握し、それを用いて今後発生しうる総アクセス数を予測できる。そしてこの予測総アクセス数が少ない場合は、予測対象の時間帯において少なくともひとつのサーバ群が省電力状態に設定される。 According to the operation management apparatus 1010 according to the present embodiment, the trend of the total number of accesses according to the purpose for which the information processing system 2 such as a data center is used, for example, securities business or search engine is grasped and used. The total number of accesses that can occur in the future can be predicted. When the predicted total number of accesses is small, at least one server group is set in the power saving state in the prediction target time zone.

上述した通り稼動状態のサーバ群の消費電力は、アイドル状態であってもピーク時のおよそ６０％である。これに対して省電力状態のサーバ群の消費電力はピーク時のおよそ０〜１０％である。本実施の形態では、アクセス数が少ないと予測され、したがってアクセスを処理する必要がないと予測されるサーバ群がある場合はそれらのサーバ群をアイドル状態ではなく省電力状態としている。これにより、情報処理装置１０２０全体の消費電力を低減でき、電力の無駄遣いを抑え、省エネ化を図ることができる。 As described above, the power consumption of the server group in the operating state is approximately 60% of the peak time even in the idle state. On the other hand, the power consumption of the server group in the power saving state is approximately 0 to 10% at the peak time. In the present embodiment, when there are server groups that are predicted to have a small number of accesses and are therefore predicted not to be processed, the server groups are set in a power saving state instead of an idle state. As a result, power consumption of the entire information processing apparatus 1020 can be reduced, wasteful use of power can be suppressed, and energy saving can be achieved.

また本実施の形態に係る運用管理装置１０１０では、情報処理システム２が使用される目的に応じた総アクセス数の傾向を把握し、それを用いて今後発生しうる総アクセス数を予測しているので、個々の使用目的に対して最適な消費電力低減化、省エネ化を実現できる。 In addition, the operation management apparatus 1010 according to the present embodiment grasps the tendency of the total number of accesses according to the purpose for which the information processing system 2 is used, and predicts the total number of accesses that can occur in the future using the trend. Therefore, optimal power consumption reduction and energy saving can be realized for each purpose of use.

さらに、本実施の形態に係る運用管理装置１０１０によると、指標算出部１００はネットワークを傍受して各サーバ群１０２２の特性の違いを示す指標を算出する。状態設定部１１４０は、少なくともひとつのサーバ群１０２２を省電力状態に設定する際、指標算出部１００によって算出されたこの指標に基づいて、省電力状態に設定するサーバ群を選択する。したがって、各サーバ群１０２２の特性の違いの解析結果を取り入れた形で、稼動状態にあるサーバ群のなかから省電力状態とするサーバ群を選択できる。これにより、全てのサーバ群１０２２を特性上同等と見なした上で省電力状態とするサーバ群を選択する場合と比べて、選択に各サーバ群１０２２の特性の違いを反映できるので、省エネの観点からより好適なサーバ群１０２２の状態制御が可能となる。その結果、情報処理システム２の省エネ化を一層進めることができる。 Furthermore, according to the operation management apparatus 1010 according to the present embodiment, the index calculation unit 100 intercepts the network and calculates an index indicating a difference in characteristics of each server group 1022. When setting at least one server group 1022 in the power saving state, the state setting unit 1140 selects a server group to be set in the power saving state based on the index calculated by the index calculation unit 100. Therefore, it is possible to select a server group in the power saving state from among the server groups in the operating state in the form of incorporating the analysis result of the difference in characteristics of each server group 1022. As a result, it is possible to reflect the difference in the characteristics of each server group 1022 in the selection as compared with the case of selecting the server group in which the power saving state is considered after considering all the server groups 1022 to be equivalent in characteristics. It is possible to control the state of the server group 1022 that is more suitable from the viewpoint. As a result, energy saving of the information processing system 2 can be further promoted.

また、稼動サーバ群決定部１１４４は、電力コストが大きいサーバ群を省電力状態とするサーバ群として優先的に選択する。したがって、結果として稼動状態とされるサーバ群の電力コストは、省電力状態とされるサーバ群の電力コストよりも小さくなるので、情報処理システム２全体で見たときに有効な仕事をより少ない消費電力で処理できる。 In addition, the active server group determination unit 1144 preferentially selects a server group having a large power cost as a server group in which the power saving state is set. As a result, the power cost of the server group that is in the operating state is smaller than the power cost of the server group that is in the power saving state, so that less work is consumed effectively when viewed from the information processing system 2 as a whole. Can be processed with electricity.

また、本実施の形態に係る運用管理装置１０１０によると、パケットをネットワークから傍受し、傍受したパケットを始点ＩＰアドレスとポート番号とで分類する。したがって、始点ＩＰアドレスごとおよびポート番号ごとに通信データ量を計測できる。これにより、所望の解析対象のノードについて、どの通信用途（メール通信やＷＥＢ閲覧など）に対応する仕事をどの程度行ったかを、そのノードに直接問い合わせることなしにネットワーク上を流れるパケットから追跡できる。ノードへ問い合わせないので、仕事量の解析に伴うノード側の負担はほとんどない。運用管理装置１０１０はネットワークの傍受を行うので、運用管理装置１０１０自身の測定への寄与は小さく、また把握できる。 Further, according to the operation management apparatus 1010 according to the present embodiment, packets are intercepted from the network, and the intercepted packets are classified by the start IP address and the port number. Therefore, the communication data amount can be measured for each starting point IP address and each port number. Accordingly, it is possible to trace how much work corresponding to which communication application (e-mail communication, WEB browsing, etc.) has been performed for a desired node to be analyzed from a packet flowing on the network without directly inquiring the node. Since the node is not inquired, there is almost no burden on the node side accompanying the analysis of the workload. Since the operation management apparatus 1010 intercepts the network, the contribution of the operation management apparatus 1010 itself to the measurement is small and can be grasped.

さらに本実施の形態では、更新部１１３２は指標算出部１００によって算出された総アクセス数に基づいて負荷履歴保持部１１１２を更新する。したがって、総アクセス数を予測するための履歴データを、サーバ側の負担をほとんど増やすことなしに得ることができる。 Further, in the present embodiment, the update unit 1132 updates the load history holding unit 1112 based on the total number of accesses calculated by the index calculation unit 100. Therefore, history data for predicting the total number of accesses can be obtained without increasing the load on the server side.

また、本実施の形態では、稼動サーバ群決定部１１４４は、予測総アクセス数が所定の値より多くなると予測される場合には、指標算出部１００によって算出された指標に基づいて、省電力状態に設定されている複数のサーバ群のうちの少なくともひとつのサーバ群を選択する。状態信号生成部１１４６は、稼動サーバ群決定部１１４４によって選択された少なくともひとつのサーバ群を稼動状態に設定する。これにより、まず、サーバ群１０２２をピークアクセス数以上で使用しなければならない状況を回避し、アクセス処理の遅滞を避けることができる。さらに、各サーバ群１０２２の特性の違いの解析結果を取り入れた形で、省電力状態にあるサーバ群のなかから稼動状態とするサーバ群を選択できる。これにより、全てのサーバ群を特性上同等と見なした上で稼動状態とするサーバ群を選択する場合と比べて、選択に各サーバ群１０２２の特性の違いを反映できるので、省エネの観点からより好適なサーバ群１０２２の状態制御が可能となる。 In the present embodiment, the active server group determination unit 1144 determines the power saving state based on the index calculated by the index calculation unit 100 when the predicted total number of accesses is predicted to be greater than a predetermined value. At least one server group is selected from among a plurality of server groups set to. The status signal generation unit 1146 sets at least one server group selected by the active server group determination unit 1144 to the operating state. Thereby, first, it is possible to avoid a situation where the server group 1022 has to be used at a peak access number or more, and to avoid a delay in access processing. Furthermore, a server group that is in an active state can be selected from among the server groups that are in a power saving state in a form that incorporates an analysis result of the difference in characteristics of each server group 1022. As a result, it is possible to reflect the difference in the characteristics of each server group 1022 in the selection as compared with the case of selecting the server group to be in an operating state after regarding all the server groups as equivalent in characteristics. More favorable state control of the server group 1022 is possible.

また、稼動サーバ群決定部１１４４は、電力コストが小さいサーバ群を稼動状態とするサーバ群として優先的に選択する。したがって、結果として稼動状態とされるサーバ群の電力コストは、省電力状態とされるサーバ群の電力コストよりも小さくなるので、情報処理システム２全体で見たときに有効な仕事をより少ない消費電力で処理できる。 In addition, the operating server group determination unit 1144 preferentially selects a server group with a low power cost as a server group that is in an operating state. As a result, the power cost of the server group that is in the operating state is smaller than the power cost of the server group that is in the power saving state, so that less work is consumed effectively when viewed from the information processing system 2 as a whole. Can be processed with electricity.

また、本実施の形態では、ロードバランサ１０３０は、取得されたアクセスを、電力コストが小さいサーバ群に優先的に割り当てる。したがって、結果としてアクセスがより多く割り当てられるサーバ群の電力コストは、そうでないサーバ群の電力コストよりも小さくなるので、情報処理システム２全体で見たときに有効な仕事をより少ない消費電力で処理できる。 In the present embodiment, the load balancer 1030 preferentially assigns the acquired access to a server group with a low power cost. Therefore, as a result, the power cost of the server group to which more accesses are allocated is smaller than the power cost of the server group that is not so, so that effective work can be processed with less power consumption when viewed from the entire information processing system 2. it can.

また、本実施の形態に係る運用管理装置１０１０では、負荷履歴保持部１１１２は、取得された総アクセス数と、その総アクセス数が取得された時間帯が属する属性と、を対応付けて記憶する。したがって、総アクセス数の傾向を属性を基準にして把握することができる。さらに運用管理装置１０１０では、予測対象の時間帯が属する属性に対応する総アクセス数を負荷履歴保持部１１１２から取得してその時間帯に対して予測される総アクセス数とする。したがって、属性を情報処理システム２の使用目的に応じて適切に設定することにより、予測される総アクセス数の精度を向上することができる。 Moreover, in the operation management apparatus 1010 according to the present embodiment, the load history holding unit 1112 stores the acquired total access count and the attribute to which the time zone from which the total access count belongs, in association with each other. . Therefore, the tendency of the total number of accesses can be grasped on the basis of attributes. Further, the operation management apparatus 1010 obtains the total number of accesses corresponding to the attribute to which the prediction target time zone belongs from the load history holding unit 1112 and sets the total number of accesses predicted for the time zone. Therefore, the accuracy of the predicted total number of accesses can be improved by appropriately setting the attributes according to the purpose of use of the information processing system 2.

図２０は、本実施の形態に係る情報処理システム２が証券会社のインターネット株取引システムを提供するデータセンタとして使用される場合の、負荷履歴保持部１１１２ａの一例を示すデータ構造図である。ここでは稼動サーバ群の数および平均稼動率は説明を明瞭とするため省略される。この場合の総アクセス数の傾向としては、証券取引所の取引時間内にアクセスが集中することがある。また、証券取引所が閉まってからしばらくはいわゆるバッチ処理を行うためにいくらかの仕事が発生する。それ以外で証券取引所が閉まっている時間帯、例えば休日などにはアクセスはほとんどない。あっても証券会社の顧客が自己の口座の情報を参照する程度である。したがって、図２０に示されるように、時間帯の属性として、証券取引所の取引時間にあたる時間帯に「取引時間」という属性を、バッチ処理が行われる時間帯に「バッチ処理」という属性を、それ以外の時間帯に「時間外」もしくは土日祝日の場合は「休日」を、それぞれ与えてもよい。このような属性を付与することで、証券会社のインターネット株取引システムを提供するデータセンタに発生する総アクセス数をより適切に予測することができる。 FIG. 20 is a data structure diagram showing an example of the load history holding unit 1112a when the information processing system 2 according to the present embodiment is used as a data center that provides an Internet stock trading system of a securities company. Here, the number of operating servers and the average operating rate are omitted for the sake of clarity. As a tendency of the total number of accesses in this case, access may be concentrated within the trading hours of the stock exchange. In addition, some work is required to perform so-called batch processing for a while after the stock exchange closes. Other than that, there is almost no access during times when the stock exchange is closed, such as holidays. Even so, the brokerage firm's customers refer to their account information. Therefore, as shown in FIG. 20, as a time zone attribute, an attribute “transaction time” in a time zone corresponding to the trading time of a stock exchange, an attribute “batch processing” in a time zone in which batch processing is performed, “Holiday” may be given in the case of “out of hours” or weekends and holidays in other time zones. By assigning such attributes, it is possible to more appropriately predict the total number of accesses occurring in the data center that provides the securities company's Internet stock trading system.

図２１は、本実施の形態に係る情報処理システム２が検索サービスを提供するデータセンタとして使用される場合の、負荷履歴保持部１１１２ｂの一例を示すデータ構造図である。ここでは稼動サーバ群の数および平均稼動率は説明を明瞭とするため省略される。この場合の総アクセス数の傾向としては、平日よりも休日の方が利用が多く、また平日でも午前よりも午後の方が利用が多いことがある。したがって、図２１に示されるように、時間帯の属性として、平日休日の別と午前午後の別との組み合わせを与えてもよい。このような属性を付与することで、検索サービスを提供するデータセンタに発生する総アクセス数をより適切に予測することができる。 FIG. 21 is a data structure diagram showing an example of the load history holding unit 1112b when the information processing system 2 according to the present embodiment is used as a data center that provides a search service. Here, the number of operating servers and the average operating rate are omitted for the sake of clarity. As a tendency of the total number of accesses in this case, a holiday is more frequently used than a weekday, and a weekday is more frequently used in the afternoon than in the morning. Therefore, as shown in FIG. 21, a combination of a weekday holiday and a morning / afternoon may be given as the attribute of the time zone. By assigning such attributes, the total number of accesses occurring in the data center that provides the search service can be predicted more appropriately.

本実施の形態に係る運用管理装置１０１０では、省電力モード（予測総アクセス数＜アクセスしきい値Ｍｏ）においては、表１に示される通りＯＳ休眠状態とするサーバ群と電源オフ状態とするサーバ群との両方を設けている。これにより、予測対象の時間帯において突然総アクセス数が増大した場合は、適応モードに移行した後復帰のためのオーバヘッドが小さいＯＳ休眠状態にあるサーバ群を稼動状態に戻すことで対応できる。また、そのように対応できる限りにおいては他のサーバ群は電力を消費しない電源オフ状態とし、情報処理装置１０２０全体の消費電力をさらに低減している。なお、表１ではＯＳ休眠状態とするサーバ群をひとつだけ確保しているが、この数はオーバヘッドと消費電力とのかねあいで定められればよく、適宜増減可能であることは本明細書に触れた当業者には理解される。 In the operation management apparatus 1010 according to the present embodiment, in the power saving mode (predicted total access count <access threshold Mo), as shown in Table 1, a server group that is in an OS sleep state and a server that is in a power-off state Both groups are provided. As a result, when the total number of accesses suddenly increases in the prediction target time zone, it can be dealt with by returning the server group in the OS sleep state in which the overhead for returning after shifting to the adaptive mode is small to the operating state. Further, as long as it can cope with such a situation, the other server groups are in a power-off state in which power is not consumed, and the power consumption of the entire information processing apparatus 1020 is further reduced. In Table 1, only one server group that is in the OS sleep state is secured, but this number may be determined according to the balance between overhead and power consumption, and it is mentioned in the present specification that it can be appropriately increased or decreased. It will be understood by those skilled in the art.

本実施の形態に係る運用管理装置１０１０では、状態設定部１１４０はサーバ群の最大性能を発揮せしめる前提で、予測対象の時間帯において稼動状態とするサーバ群を決定する。このサーバ群の決定方式によると、所与の予測総アクセス数に対してより多くの数のサーバ群を省電力状態とすることができる。したがって、情報処理装置１０２０全体の消費電力をより低減できる。なお、稼働率によってサーバ群の消費電力が異なるのも事実ではあるが、上述の通りアイドル状態でもピーク時のおよそ６０％の電力が消費されることを考えると、稼働率を下げることによる電力削減効果よりもアイドル状態を省電力状態とすることによる電力削減効果のほうが大きいと考えられる。 In the operation management apparatus 1010 according to the present embodiment, the state setting unit 1140 determines a server group to be in an operating state in a prediction target time zone on the premise that the maximum performance of the server group is exhibited. According to this server group determination method, a larger number of server groups can be put into a power saving state with respect to a given predicted total number of accesses. Therefore, the power consumption of the entire information processing apparatus 1020 can be further reduced. It is also true that the power consumption of the server group differs depending on the operating rate, but considering the fact that about 60% of the power is consumed even in the idle state as described above, power reduction by lowering the operating rate It is considered that the power reduction effect by setting the idle state to the power saving state is greater than the effect.

また、アクセスしきい値Ｍｏは情報処理装置１０２０の性能が落ちない範囲に設定される。これにより、予測総アクセス数が多い場合は通常モードで情報処理装置１０２０の並列処理能力をいかんなく発揮させ、予測総アクセス数が少なくなると省電力モードに移行させて性能を保ちつつ電力消費量を低減できる。 Further, the access threshold Mo is set in a range where the performance of the information processing apparatus 1020 does not deteriorate. As a result, when the predicted total number of accesses is large, the parallel processing capability of the information processing apparatus 1020 is fully exhibited in the normal mode, and when the predicted total number of accesses decreases, the power consumption mode is reduced while shifting to the power saving mode to maintain the performance. Can be reduced.

また、本実施の形態に係る運用管理装置１０１０では、ＩＰアドレスは仕事の主体を示し、ポート番号は通信用途を示すので、それらを用いてノードで行われている仕事の内訳を導出し、特に有効仕事量を計測できる。この点、現実的に有効仕事量の計測が困難であった従来の技術とは大きく異なる。運用管理装置１０１０によると、ノードにおける仕事量を解析することで、そのノードが有効な仕事をどの程度行ったかを計測でき、情報処理システム２を省エネ化するためのよい指標が得られる。 Further, in the operation management apparatus 1010 according to the present embodiment, the IP address indicates the subject of work, and the port number indicates the communication use. Therefore, the breakdown of the work being performed at the node is derived using them. Effective work can be measured. In this respect, it is greatly different from the conventional technique in which it is practically difficult to measure the effective work amount. According to the operation management apparatus 1010, by analyzing the amount of work in a node, it is possible to measure how much effective work the node has performed, and a good index for saving the information processing system 2 can be obtained.

また、本実施の形態に係る運用管理装置１０１０では、通信情報保持部１０４に保持される傍受されたパケットの情報を本来用途と非本来用途とで分類する。したがって、図９の第１指標保持部１０８に示されるように、所望の解析対象のノードについて、本来用途データ量と非本来用途データ量とを得ることができる。これにより、両者の比率などから解析対象のノードがどの程度本来の用途で使用されているかを追跡できる。これは情報処理システム２を省エネ化するためのよい指標のひとつである。例えば、本来の用途で使用されていないノードを発見し、適切な処置を施すことで情報処理システム２の効率を向上できる。 Also, in the operation management apparatus 1010 according to the present embodiment, the intercepted packet information held in the communication information holding unit 104 is classified into an original use and a non-original use. Therefore, as shown in the first index holding unit 108 in FIG. 9, it is possible to obtain the original use data amount and the non-original use data amount for a desired analysis target node. Thereby, it can be tracked how much the node to be analyzed is used for the original purpose from the ratio of both. This is one of the good indexes for saving energy in the information processing system 2. For example, the efficiency of the information processing system 2 can be improved by finding a node that is not used for the original purpose and applying an appropriate measure.

また、運用管理装置１０１０では、第２指標保持部１１８は、第１指標保持部１０８と使用状況保持部１１２とが対応付けられた形となっている。したがって、情報処理システム２の省エネ化を考える際に、ノードにおける仕事量と使用状況とを対応付けて把握できる。すなわち、ノードが何の仕事をいつどれだけ行ったことにより、どれだけのＣＰＵの処理能力を使用し、どれだけの電力を消費し、どれだけの温度上昇があったかを解析できる。 In the operation management apparatus 1010, the second index holding unit 118 has a form in which the first index holding unit 108 and the usage status holding unit 112 are associated with each other. Therefore, when considering the energy saving of the information processing system 2, it is possible to grasp the workload at the node and the usage status in association with each other. In other words, it is possible to analyze how much CPU processing power is used, how much power is consumed, and how much temperature has been increased by what and when the node has done what work.

例えば、ＣＰＵ使用率が１００％とされているサーバの仕事量の内訳を見ることにより、実際１００％のＣＰＵ使用率のうちどれだけが有効な仕事のために使用されたかを知ることができる。１００％のＣＰＵ使用率でもその大半が有効な仕事のために使用されていなければ、そのサーバは省エネ化のための検討対象とすべきである。ＰＵＥ（Power Usage Effectiveness）（非特許文献１参照）などを使用する従来の技術では、仕事の内訳が見えないのでこのような判断をすることができなかったが、運用管理装置１０１０を使用すると可能となる。 For example, by looking at the breakdown of the workload of the server whose CPU usage rate is 100%, it is possible to know how much of the CPU usage rate of 100% is actually used for effective work. If most of the CPU usage rate is not used for effective work, the server should be considered for energy saving. In the conventional technology using PUE (Power Usage Effectiveness) (see Non-Patent Document 1) or the like, it is impossible to make such a determination because the breakdown of work cannot be seen, but it is possible to use the operation management apparatus 1010. It becomes.

また例えば、消費電力および有効仕事量の両者が共に低いサーバがある場合、使用状況だけを見ていると消費電力が低いので省エネ的によいサーバに見え、このサーバに対しては何ら対策がなされない可能性が高い。しかしながら、運用管理装置１０１０を使用して使用状況と仕事量とが対比可能な形で提示される場合、有効仕事量も低いことが分かるので、データセンタの効率を向上してさらなる省エネ化を進めるために、例えば仮想化により他の高負荷サーバから仕事を回す等の対策を取ることができる。したがって、データセンタの省エネ化への寄与は大きい。 Also, for example, if there is a server with both low power consumption and effective workload, it looks like a server that is good for energy saving because of low power consumption when looking only at the usage status, and no measures are taken against this server. It is likely not. However, if the operation management apparatus 1010 is used to present the usage status and the workload so that they can be compared, it can be seen that the effective workload is also low. Therefore, the efficiency of the data center is improved and further energy saving is promoted. Therefore, it is possible to take measures such as turning work from another high-load server by virtualization, for example. Therefore, the contribution to energy saving of the data center is large.

また例えば、有効仕事量的には問題ないが排熱の大きなサーバがある場合にそれを見つけることができる。したがって、そのようなサーバがホットアイルに存在する場合には他の位置に移設したほうがよいことをサーバの持ち主に提案できる。 In addition, for example, when there is a server with a large exhaust heat although there is no problem in the effective work amount, it can be found. Therefore, if such a server exists in the hot aisle, it can be proposed to the server owner that it is better to move it to another location.

また、運用管理装置１０１０では、フィルタ部１２６は、フィルタリングモードでは、抽出部１２４によって抽出された始点ＩＰアドレスおよび終点ＩＰアドレスを基に、計測対象のノードのＩＰアドレスが含まれたパケットのみを選択して、登録部１２８に出力する。したがって、計測に影響を与えない範囲で通信情報保持部１０４に保持されるデータの量を低減できる。 Also, in the operation management apparatus 1010, in the filtering mode, the filter unit 126 selects only packets including the IP address of the measurement target node based on the start point IP address and the end point IP address extracted by the extraction unit 124. And output to the registration unit 128. Therefore, the amount of data held in the communication information holding unit 104 can be reduced within a range that does not affect measurement.

また、フィルタ部１２６は、計測対象のノードを、情報処理システム２に含まれる複数のノードから順番に選択する。つまり、一度に全てのノードを計測対象とするのではなく、例えば月曜日は第１サーバ群１０２２ａ、火曜日は第２サーバ群１０２２ｂ、等のように計測対象のノードを順番に変えてゆく。これにより、計測に影響を与えない範囲で通信情報保持部１０４に保持されるデータの量を低減した上で、ネットワークの全てのノードを計測対象とすることができる。 In addition, the filter unit 126 sequentially selects a measurement target node from a plurality of nodes included in the information processing system 2. That is, not all the nodes are measured at one time, but the nodes to be measured are changed in order, for example, the first server group 1022a on Monday, the second server group 1022b on Tuesday, and the like. As a result, it is possible to reduce all the data held in the communication information holding unit 104 within a range that does not affect the measurement and set all nodes of the network as measurement targets.

仮想化が行なわれているサーバについて考える。従来では主に、ＶＭＯＴＩＯＮ機能等を使用して、親（ホスト）仮想化ＯＳの上での個々のサーバイメージ（個々のゲストＯＳ）のプロセッサ使用率を取得して比較する。そして、ゲストＯＳを他のプロセッサ使用率的に空いている仮想化ホストＯＳサーバへ移設させている。しかしながら、このように単にプロセッサ使用率のみを尺度として仮想化サーバ間のゲストＯＳの移動を行う場合には、実際にゲストＯＳで行われている仕事の内訳までは考慮されていない。したがって、有効仕事量に基づいた移設が行われているとは言い難い。 Consider a server that is being virtualized. Conventionally, the processor usage rate of each server image (individual guest OS) on the parent (host) virtualized OS is mainly obtained and compared using the VMOTION function or the like. Then, the guest OS is relocated to another virtual host OS server that is vacant in terms of processor utilization. However, in the case where the guest OS is moved between the virtualization servers using only the processor usage rate as a scale in this way, the breakdown of work actually performed by the guest OS is not taken into consideration. Therefore, it cannot be said that the relocation based on the effective work volume is performed.

運用管理装置１０１０を使用して仮想化されたサーバにおける仕事を解析する場合、仮想化のプラットフォームごとに仕事量が分かるので、有効仕事量に基づいた精度の高いゲストＯＳの移設が可能となる。
また、例えば仮想化したことによってどの程度状況が改善されているかを知ることができる。つまり、有効仕事量と消費電力とについて仮想化の前後で比較することで、改善の度合いを知ることができる。また、仮想化後は個々のゲストＯＳが行っている仕事についてはあまり注意されないのが現状であるが、解析装置１００で各ゲストＯＳについて有効仕事量を追跡することにより、例えば時と共に使用されなくなったゲストＯＳを特定できる。したがって、そのように特定されたゲストＯＳを外すことで他のゲストＯＳの性能を改善できる。これはデータセンタの効率の向上に貢献する。 When analyzing the work in the virtualized server using the operation management apparatus 1010, the work amount is known for each virtualization platform, so that it is possible to transfer the guest OS with high accuracy based on the effective work amount.
In addition, for example, it is possible to know how much the situation has been improved by virtualization. That is, the degree of improvement can be known by comparing the effective work amount and power consumption before and after virtualization. In addition, after virtualization, the current situation is that little attention is paid to the work performed by each guest OS, but the analysis apparatus 100 keeps track of the effective work amount for each guest OS, so that it is not used with time, for example. Guest OS can be specified. Therefore, the performance of other guest OSs can be improved by removing the guest OS thus identified. This contributes to improving the efficiency of the data center.

従来ではシステム運用者は、システムを運用するにあたり、主にサーバのプロセッサ使用率やメモリの利用度を使用してサーバの性能を評価し、その性能評価を軸にして、サーバの統合、アプリケーションの移設、分配、統合、サーバの更新（新しい機器に交換する・買い換える）を行っている。しかしながら、特に昨今の景気低迷期では、サーバのさらなる有効利用やさらなる消費電力の低減によって一段とデータセンタ全体の運用コストを下げることが求められている。
そこで、運用管理装置１０１０を使用することにより、サーバにおける仕事が内向け（対サーバ）か、外向け（サーバ発）なのかを詳細に解析でき、また、その仕事がデータセンタ外部への仕事なのかデータセンタ内の他のサーバやアプリケーションに対する仕事なのかも解析できる。したがって、この解析を元にして、省エネ化のための精度の高いサーバ統合、分配、アプリケーション統合、分配、機器の更新、増強、廃止などが行える。 In the past, system operators have mainly evaluated server performance using server processor usage and memory utilization in system operation, and server integration and application Relocation, distribution, integration, and server update (exchange / replace with new equipment). However, particularly in the recent economic downturn, there is a need to further reduce the operational cost of the entire data center by further effective use of servers and further reduction of power consumption.
Therefore, by using the operation management apparatus 1010, it is possible to analyze in detail whether the work at the server is inward (to the server) or outward (from the server), and the work is a work to the outside of the data center. You can also analyze whether it is work for other servers and applications in the data center. Therefore, based on this analysis, high-accuracy server integration, distribution, application integration, distribution, equipment update, enhancement, abolition, etc. can be performed for energy saving.

以上、実施の形態に係る運用管理装置１０１０の構成と動作について説明した。この実施の形態は例示であり、その各構成要素や各処理の組み合わせにいろいろな変形例が可能なこと、またそうした変形例も本発明の範囲にあることは当業者に理解されるところである。 The configuration and operation of the operation management apparatus 1010 according to the embodiment have been described above. This embodiment is an exemplification, and it is understood by those skilled in the art that various modifications can be made to each component and combination of processes, and such modifications are within the scope of the present invention.

実施の形態では、ユーザ端末１００６からサーバ群１０２２へのパケットの流れを基に説明したが、サーバ群１０２２からユーザ端末１００６へパケットを返すときもロードバランサ１０３０が適宜接続保持部１３２４を参照してアドレス変換できることは本明細書に触れた当業者には明らかである。 In the embodiment, the description has been made based on the flow of packets from the user terminal 1006 to the server group 1022. However, when returning a packet from the server group 1022 to the user terminal 1006, the load balancer 1030 refers to the connection holding unit 1324 as appropriate. It will be apparent to those skilled in the art who have touched this specification that address translation is possible.

実施の形態では、アクセスしきい値Ｍｏ、第１しきい値Ｔ１、第２しきい値Ｔ２、および第３しきい値Ｔ３が稼動状態のサーバ群の数を決めるしきい値となる場合について説明したが、これに限られない。例えば、それぞれのしきい値にヒステリシスを持たせてもよい。この場合、予測総アクセス数がしきい値付近で変動しても、しきい値をまたぐ毎にサーバ群の状態を切り替えなくてもよいので、状態切替に伴うオーバヘッドを低減できる。その結果情報処理システム２全体のレスポンスが向上しうる。 In the embodiment, a case where the access threshold value Mo, the first threshold value T1, the second threshold value T2, and the third threshold value T3 are threshold values that determine the number of servers in the active state will be described. However, it is not limited to this. For example, each threshold value may have hysteresis. In this case, even if the predicted total number of accesses fluctuates in the vicinity of the threshold value, it is not necessary to switch the state of the server group every time the threshold value is crossed, so that the overhead associated with the state switching can be reduced. As a result, the overall response of the information processing system 2 can be improved.

実施の形態では、サーバ群１０２２をどの状態に置くかについて、状態設定部１１４０において表１に示されるストラテジが使用される場合について説明したが、これに限られない。例えば、省電力状態としてＯＳ休眠状態を使用し、電源オフ状態を使用しなくてもよい。この場合、電源オンオフにかかる比較的長いオーバヘッドがなくなるので、より早いレスポンスが期待できる。また、処理が簡素化される。別の例としては、省電力状態として電源オフ状態を使用し、ＯＳ休眠状態を使用しなくてもよい。この場合、消費電力をより低減できる。 In the embodiment, the state in which the server group 1022 is placed has been described in the case where the strategy shown in Table 1 is used in the state setting unit 1140. However, the present invention is not limited to this. For example, the OS sleep state may be used as the power saving state, and the power off state may not be used. In this case, since there is no relatively long overhead for power on / off, a faster response can be expected. Further, the processing is simplified. As another example, the power-off state may be used as the power saving state, and the OS sleep state may not be used. In this case, power consumption can be further reduced.

実施の形態では、フロントエンドサーバ１０２４とアプリケーションサーバ１０２６とデータベースサーバ１０２８とは別個のサーバであり、この順に直列に接続されている場合について説明したが、これに限られない。個々のサーバ群は少なくともひとつのサーバを含めばよく、例えば、サーバ群はフロントエンドサーバとアプリケーションサーバとデータベースサーバの機能を全て併せ持つ１台のサーバを含んでもよい。また、サーバ群は、それら３つのサーバの機能のうちの任意の２つの機能を併せ持つサーバと、残りの機能を持つサーバと、を含んでもよい。 In the embodiment, the front end server 1024, the application server 1026, and the database server 1028 are separate servers, and the case where they are connected in series in this order has been described. However, the present invention is not limited to this. Each server group may include at least one server. For example, the server group may include one server having all functions of a front-end server, an application server, and a database server. The server group may include a server having any two functions of the functions of the three servers and a server having the remaining functions.

実施の形態では、負荷予測部１１３４は属性をキーとする場合について説明したが、時間帯であってもよい。負荷予測部１１３４は、負荷履歴保持部１１１２を参照し、予測対象の時間帯に対して過去の同月同日の同じ時間帯の負荷履歴を取得し、これを基に予測対象の時間帯の予測総アクセス数を決定する。例えば、負荷予測部１１３４が２００９年７月２８日の９：１５〜９：３０における総アクセス数を予測する場合、負荷予測部１１３４は２００８年７月２８日の９：１５〜９：３０における総アクセス数（図１４の場合、５４００）を負荷履歴保持部１１１２から取得し、それを予測総アクセス数とする。 In the embodiment, the case where the load prediction unit 1134 uses the attribute as a key has been described, but it may be a time zone. The load prediction unit 1134 refers to the load history holding unit 1112, acquires the load history of the same time in the same month in the past for the prediction target time zone, and based on this, the prediction total of the prediction target time zone Determine the number of accesses. For example, when the load prediction unit 1134 predicts the total number of accesses at 9:15 to 9:30 on July 28, 2009, the load prediction unit 1134 at 9:15 to 9:30 on July 28, 2008 The total access number (5400 in the case of FIG. 14) is acquired from the load history holding unit 1112 and is set as the predicted total access number.

実施の形態では、情報処理システム２が使用される例として証券業務や検索エンジンを挙げたが、これに限られない。本実施の形態に係る技術的思想は、例えば不特定多数のユーザが利用するコンピュータセンタの機器の管理に適用されうる。このコンピュータセンタの例としては、大学等の学術関係で、多くの生徒や研究者が利用するサーバとストレージのシステムがある。
図２２は、本発明のある実施の形態を大学の共用データセンタに適用した場合の負荷履歴保持部１１１２ｃを示すデータ構造図である。この場合、仕事量の傾向をより正確に把握するために大学のスケジュールに応じた属性が選択されることが望ましい。 In the embodiment, as an example in which the information processing system 2 is used, a securities business or a search engine is given, but the present invention is not limited to this. The technical idea according to the present embodiment can be applied to, for example, management of computer center equipment used by an unspecified number of users. An example of this computer center is a server and storage system used by many students and researchers in academic relations such as universities.
FIG. 22 is a data structure diagram showing the load history holding unit 1112c when an embodiment of the present invention is applied to a shared data center of a university. In this case, it is desirable to select an attribute according to the university schedule in order to grasp the tendency of the workload more accurately.

実施の形態では、要求処理ユニットがサーバ群である場合について説明したが、これに限られない。本実施の形態に係る技術思想は例えばＧＳＬＢ（Global Server Load Balance）にも応用されうる。そこでは、要求処理ユニットはそれ自体が複数の並列に配されたサーバ群を有するシステムであってもよい。また、実施の形態では要求はユーザからのアクセスである場合について説明したが、これに限られず、本実施の形態に係る技術思想が適用されるシステムによって異なってもよい。要求とは処理主体への処理の指示であるとも言える。 In the embodiment, the case where the request processing unit is a server group has been described. However, the present invention is not limited to this. The technical idea according to the present embodiment can be applied to, for example, GSLB (Global Server Load Balance). There, the request processing unit may itself be a system having a plurality of servers arranged in parallel. Further, in the embodiment, the case where the request is an access from a user has been described. However, the request is not limited to this, and may be different depending on a system to which the technical idea according to the present embodiment is applied. It can be said that the request is an instruction of processing to the processing subject.

実施の形態では、サーバ群１０２２の平均消費電力、最高消費電力、電力コストのそれぞれとして、そのサーバ群１０２２のフロントエンドサーバ（ＷＥＢサーバ）１０２４の平均消費電力、最高消費電力、電力コストを採用する場合について説明したが、これに限られない。例えば、サーバ群１０２２に含まれる各サーバの電力コストを算出し、それを平均した値をそのサーバ群１０２２の電力コストとして採用してもよい。 In the embodiment, the average power consumption, the maximum power consumption, and the power cost of the front-end server (WEB server) 1024 of the server group 1022 are adopted as the average power consumption, the maximum power consumption, and the power cost of the server group 1022, respectively. Although the case has been described, the present invention is not limited to this. For example, the power cost of each server included in the server group 1022 may be calculated, and an average value thereof may be adopted as the power cost of the server group 1022.

実施の形態では、例えば図１８に示されるように、指標算出部１００においてパケットの傍受（Ｓ４０２）からデータの集計（Ｓ４１２）が一連の処理として行われる場合について説明したが、これに限られない。例えば、指標算出部は、パケットの傍受からデータの分類までを随時行って分類結果のデータを蓄積し、所定の条件が満たされると分類結果のデータを集計してもよい。ここで所定の条件は、例えば前回の集計から所定の期間が経過したこと、もしくは分類結果のデータの量が所定の量に達したことである。 In the embodiment, for example, as illustrated in FIG. 18, the case has been described in which the index calculation unit 100 performs data aggregation (S412) from packet interception (S402) as a series of processes, but the present invention is not limited thereto. . For example, the index calculation unit may perform from packet interception to data classification at any time, accumulate the classification result data, and aggregate the classification result data when a predetermined condition is satisfied. Here, the predetermined condition is, for example, that a predetermined period has elapsed since the previous aggregation, or that the amount of classification result data has reached a predetermined amount.

図２３は、変形例に係る指標算出部における一連の処理を示すフローチャートである。変形例に係る指標算出部は、傍受部と、抽出部と、登録部と、通信情報保持部と、分類部と、分類結果保持部と、集計部と、を備える。傍受部は、ネットワークからパケットを傍受する（Ｓ５０２）。抽出部は、傍受されたパケットから始点アドレスとポート番号とを抽出する（Ｓ５０４）。登録部は、抽出された始点アドレスとポート番号とパケットのデータ量とを通信情報保持部に登録する（Ｓ５０６）。分類部は、通信情報保持部に保持されるデータを始点アドレスでソーティングする（Ｓ５０８）。分類部は、ソーティングされたデータを本来用途と非本来用途とに分類する（Ｓ５１０）。分類部は、分類結果を分類結果保持部に登録する（Ｓ５１２）。所定の条件が満たされていない場合（Ｓ５１４のＮ）、ステップＳ５０２に処理が戻る。これにより、所定の条件が満たされるまで分類結果が分類結果保持部に蓄積されてゆく。所定の条件が満たされた場合（Ｓ５１４のＹ）、集計部は、分類結果保持部に蓄積されたデータを集計する（Ｓ５１６）。
本変形例によると、所定の条件を変えることにより、集計の母集団として使用する分類結果データの量を調整できる。したがって、集計の結果得られる指標に対して求められている精度に応じて柔軟に分類結果データの量を調整できる。 FIG. 23 is a flowchart showing a series of processes in the index calculation unit according to the modification. The index calculation unit according to the modification includes an intercept unit, an extraction unit, a registration unit, a communication information holding unit, a classification unit, a classification result holding unit, and a counting unit. The intercepting unit intercepts the packet from the network (S502). The extraction unit extracts the start point address and the port number from the intercepted packet (S504). The registration unit registers the extracted start point address, port number, and packet data amount in the communication information holding unit (S506). The classification unit sorts the data held in the communication information holding unit with the start point address (S508). The classification unit classifies the sorted data into an original use and a non-original use (S510). The classification unit registers the classification result in the classification result holding unit (S512). If the predetermined condition is not satisfied (N in S514), the process returns to step S502. Thereby, the classification result is accumulated in the classification result holding unit until a predetermined condition is satisfied. When the predetermined condition is satisfied (Y in S514), the totaling unit totals the data accumulated in the classification result holding unit (S516).
According to the present modification, the amount of classification result data used as a total population can be adjusted by changing predetermined conditions. Therefore, it is possible to flexibly adjust the amount of classification result data according to the accuracy required for the index obtained as a result of aggregation.

以上、実施の形態にもとづき本発明を説明したが、実施の形態は、本発明の原理、応用を示しているにすぎないことはいうまでもなく、実施の形態には、請求の範囲に規定された本発明の思想を逸脱しない範囲において、多くの変形例や配置の変更が可能であることはいうまでもない。 Although the present invention has been described based on the embodiments, the embodiments merely show the principle and application of the present invention, and the embodiments are defined in the claims. Needless to say, many modifications and arrangements can be made without departing from the spirit of the present invention.

２情報処理システム、１００指標算出部、１０２ネットワークインタフェース部、１０４通信情報保持部、１０８第１指標保持部、１１０使用状況取得部、１１２使用状況保持部、１１４係数保持部、１１６第１解析部、１１８第２指標保持部、１２０表示制御部、１３０集計部、１３２分類部、１３８ノード情報保持部、１４０モニタ、１５２第２解析部、１５８第３指標保持部、１００６ユーザ端末、１０１０運用管理装置、１０２０情報処理装置、１０３０ロードバランサ、１１３０学習部、１１４０状態設定部、１１５０乖離判定部、１１６０オーバライド部。 2 Information processing system, 100 index calculation unit, 102 network interface unit, 104 communication information holding unit, 108 first index holding unit, 110 usage status acquisition unit, 112 usage status holding unit, 114 coefficient holding unit, 116 first analysis unit 118 Second index holding unit, 120 Display control unit, 130 Totaling unit, 132 Classification unit, 138 Node information holding unit, 140 Monitor, 152 Second analysis unit, 158 Third index holding unit, 1006 User terminal, 1010 Operation management Apparatus, 1020 information processing apparatus, 1030 load balancer, 1130 learning unit, 1140 state setting unit, 1150 deviation determination unit, 1160 override unit.

Claims

A load history holding unit for holding a load history of requests for an information processing apparatus configured by connecting a plurality of servers via a network;
A load prediction unit that predicts a load that will occur in the future with reference to the load history holding unit;
A state setting unit configured to set at least one server in a second state of power saving rather than the first state capable of accepting a request when the load predicted by the load prediction unit is less than a predetermined first threshold; ,
A usage status acquisition unit that acquires the temperature of the server from information obtained from the network,
The state setting unit preferentially selects a server having a high temperature acquired by the use state acquisition unit as a server to be set to the second state.

A load history holding unit for holding a load history of requests for an information processing apparatus configured by connecting a plurality of servers via a network;
A load prediction unit that predicts a load that will occur in the future with reference to the load history holding unit;
A state setting unit configured to set at least one server in a second state of power saving rather than the first state capable of accepting a request when the load predicted by the load prediction unit is less than a predetermined first threshold; ,
Based on the information obtained from the network, the amount of work of the server is classified into the amount of work for the original purpose predetermined for the server and the amount of work for the non-original purpose that is a use other than the original purpose. An index calculation unit that calculates an increase in the temperature of the server due to processing of one unit of work for use, and
The operation management apparatus, wherein the state setting unit preferentially selects a server having a large increase value calculated by the index calculation unit as a server to be set to the second state.

A load management device that obtains a request for an information processing device configured by connecting a plurality of servers through a network from outside, and assigns the obtained request to at least one of the plurality of servers;
An analysis device that acquires the temperature of the server from information obtained from the network,
The analysis device includes:
A unit selection unit that selects at least one server that does not require execution of processing corresponding to the request among a plurality of servers;
A unit setting unit that sets at least one server selected by the unit selection unit from an operating state in which a request can be received to a power saving state that is lower than the operating state; and
The unit selection unit sets the order in which the power saving state is set for each server so that the higher the temperature acquired by the analysis device is, the higher the temperature is, the power saving state is set.
When the requested load is less than a predetermined value, the unit selection unit selects a server in the operating state that has the highest power saving state as a server that does not need to execute processing corresponding to the request. An information processing system characterized by

A load management device that obtains a request for an information processing device configured by connecting a plurality of servers through a network from outside, and assigns the obtained request to at least one of the plurality of servers;
From the information obtained from the network, for each server, the amount of work for the server is determined in advance for the amount of work for the original use and the amount of work for the non-original use that is a use other than the original use. An analysis device that classifies and calculates an increase in the temperature of the server due to the processing of one unit of work for intended use,
The analysis device includes:
A unit selection unit that selects at least one server that does not require execution of processing corresponding to the request among a plurality of servers;
A unit setting unit that sets at least one server selected by the unit selection unit from an operating state in which a request can be received to a power saving state that is lower than the operating state; and
The unit selection unit sets the order in which the power saving state is set for each server so that the increase value calculated by the analysis device is larger and the power saving state is preferentially set.
When the requested load is less than a predetermined value, the unit selection unit selects a server in the operating state that has the highest power saving state as a server that does not need to execute processing corresponding to the request. An information processing system characterized by