JP2021101278A

JP2021101278A - Information processing device, information processing system, information processing method, and program

Info

Publication number: JP2021101278A
Application number: JP2019232537A
Authority: JP
Inventors: 公敬山崎; Kimitaka Yamazaki
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-12-24
Filing date: 2019-12-24
Publication date: 2021-07-08
Anticipated expiration: 2039-12-24
Also published as: JP7360036B2

Abstract

To provide an information processing device which reduces a communication volume for collecting operation information.SOLUTION: An information processing device 10 includes a communication unit 11, a storage unit 12, and a processing unit 13. The communication unit receives operation information 23 indicating a usage state of resources from a virtual machine 21, and receives operation information 24 indicating a usage state of resources from a virtual machine 22. The storage unit stores histories of the received operation information 23 and 24. The processing unit determines a correlation 14 of the resource usage states of the virtual machine 21 and the virtual machine 22, and when the correlation satisfies a prescribed condition, performs control so that a reception frequency of the operation information 24 is lower than a reception frequency of the operation information 23. The processing unit estimates the resource usage state in the virtual machine 22 on the basis of the operation information 23 received from the virtual machine 21 and the correlation.SELECTED DRAWING: Figure 1

Description

本発明は情報処理装置、情報処理システム、情報処理方法およびプログラムに関する。 The present invention relates to an information processing device, an information processing system, an information processing method and a program.

物理的なコンピュータである物理マシンに、仮想的なコンピュータである仮想マシンを配置して実行させるコンピュータ仮想化技術がある。コンピュータ仮想化技術によれば、仮想マシンの動的な追加や削除や移動が可能であり、情報処理の運用管理が容易となる。仮想マシンが配置される物理マシンは、データセンタなどの大規模情報処理施設が有するものであることもあり、いわゆるクラウドシステムが有するものであることもある。 There is a computer virtualization technology in which a virtual machine, which is a virtual computer, is placed and executed on a physical machine, which is a physical computer. According to computer virtualization technology, virtual machines can be dynamically added, deleted, and moved, facilitating the operation management of information processing. The physical machine in which the virtual machine is placed may be owned by a large-scale information processing facility such as a data center, or may be owned by a so-called cloud system.

異なる物理マシンに配置された複数の仮想マシンが分散してアプリケーションプログラムを実行し、それら複数の仮想マシンが連携して、ユーザからの要求に応じた情報処理サービスを提供するようにすることもできる。複数の仮想マシンを、異なるデータセンタや異なるクラウドシステムに分割して配置することも可能である。複数の仮想マシンを利用した情報処理サービスの運用管理では、プロセッサやメモリなどのリソースの使用状況を示す稼働情報を各仮想マシンから収集することが好ましい。収集した稼働情報に基づいて、新たな仮想マシンの追加、一部の仮想マシンの削除、仮想マシンに割り当てるリソースの増強、仮想マシンを配置するクラウドシステムの変更などを検討することができる。 Multiple virtual machines located on different physical machines can be distributed to execute application programs, and these multiple virtual machines can work together to provide information processing services in response to user requests. .. It is also possible to divide and arrange multiple virtual machines in different data centers and different cloud systems. In the operation management of an information processing service using a plurality of virtual machines, it is preferable to collect operation information indicating the usage status of resources such as processors and memories from each virtual machine. Based on the collected operation information, it is possible to consider adding new virtual machines, deleting some virtual machines, increasing the resources allocated to virtual machines, and changing the cloud system in which virtual machines are placed.

例えば、物理マシン全体のリソース使用率と仮想マシン毎のリソース使用率とを示す測定データを収集し、リソース不足を検出した場合に一部の仮想マシンを他の物理マシンに移動する仮想マシン管理装置が提案されている。また、ある物理マシンから別の物理マシンに２つの仮想マシンを移動する場合に、２つの仮想マシンの移動を並行して行うか否か決定する仮想マシン制御装置が提案されている。提案の仮想マシン制御装置は、一方の仮想マシンの移動を開始したとき、２つの仮想マシンの処理状況を監視して相関係数を算出し、相関係数が正である場合、他方の仮想マシンの移動も開始する。 For example, a virtual machine management device that collects measurement data showing the resource usage rate of the entire physical machine and the resource usage rate of each virtual machine, and moves some virtual machines to other physical machines when a resource shortage is detected. Has been proposed. Further, when moving two virtual machines from one physical machine to another, a virtual machine control device for determining whether or not to move the two virtual machines in parallel has been proposed. When the proposed virtual machine controller starts moving one virtual machine, it monitors the processing status of the two virtual machines and calculates the correlation coefficient. If the correlation coefficient is positive, the other virtual machine The movement of is also started.

国際公開第２００８／０６２８６４号International Publication No. 2008/062864 特開２０１２−８８８０８号公報Japanese Unexamined Patent Publication No. 2012-888808

収集する稼働情報の精度という観点からは、短い周期で複数の仮想マシンの稼働情報を収集することが好ましい。しかし、情報処理サービスで使用されている全ての仮想マシンと短い周期で通信すると、稼働情報を収集する際の通信量が多くなってしまうという問題がある。例えば、複数の仮想マシンの少なくとも一部がデータセンタやクラウドシステムに配置されている場合、インターネットなどの広域データ通信ネットワークを介して短い周期で通信することになり、その通信負荷が問題となる。 From the viewpoint of the accuracy of the operation information to be collected, it is preferable to collect the operation information of a plurality of virtual machines in a short cycle. However, if communication is performed with all the virtual machines used in the information processing service in a short cycle, there is a problem that the amount of communication when collecting operation information increases. For example, when at least a part of a plurality of virtual machines is arranged in a data center or a cloud system, communication is performed in a short cycle via a wide area data communication network such as the Internet, and the communication load becomes a problem.

１つの側面では、本発明は、稼働情報を収集する際の通信量を削減できる情報処理装置、情報処理システム、情報処理方法およびプログラムを提供することを目的とする。 In one aspect, it is an object of the present invention to provide an information processing device, an information processing system, an information processing method and a program capable of reducing the amount of communication when collecting operation information.

１つの態様では、通信部と記憶部と処理部とを有する情報処理装置が提供される。通信部は、第１の仮想マシンから第１の仮想マシンにおけるリソースの使用状況を示す第１の稼働情報を受信し、第２の仮想マシンから第２の仮想マシンにおけるリソースの使用状況を示す第２の稼働情報を受信する。記憶部は、受信された第１の稼働情報および第２の稼働情報の履歴を記憶する。処理部は、履歴に基づいて、第１の仮想マシンと第２の仮想マシンとの間のリソースの使用状況の相関関係を判定し、相関関係が所定条件を満たす場合、第２の稼働情報の受信頻度が第１の稼働情報の受信頻度よりも低くなるよう制御する。処理部は、第１の仮想マシンから受信された第１の稼働情報と相関関係とに基づいて、第２の仮想マシンにおけるリソースの使用状況を推定する。 In one aspect, an information processing device having a communication unit, a storage unit, and a processing unit is provided. The communication unit receives the first operation information indicating the resource usage status in the first virtual machine from the first virtual machine, and indicates the resource usage status in the second virtual machine from the second virtual machine. Receive the operation information of 2. The storage unit stores the received first operation information and the history of the second operation information. The processing unit determines the correlation of the resource usage status between the first virtual machine and the second virtual machine based on the history, and if the correlation satisfies a predetermined condition, the second operation information The reception frequency is controlled to be lower than the reception frequency of the first operation information. The processing unit estimates the resource usage status in the second virtual machine based on the first operation information received from the first virtual machine and the correlation.

また、１つの態様では、第１の仮想マシンを実行する第１の情報処理装置と第２の仮想マシンを実行する第２の情報処理装置と第３の情報処理装置とを有する情報処理システムが提供される。また、１つの態様では、コンピュータが実行する情報処理方法が提供される。また、１つの態様では、コンピュータに実行させるプログラムが提供される。 Further, in one embodiment, an information processing system having a first information processing device that executes a first virtual machine, a second information processing device that executes a second virtual machine, and a third information processing device. Provided. Also, in one aspect, an information processing method executed by a computer is provided. Also, in one aspect, a program to be executed by a computer is provided.

１つの側面では、稼働情報を収集する際の通信量を削減できる。 On one side, the amount of communication when collecting operation information can be reduced.

第１の実施の形態の情報処理システムの例を説明する図である。It is a figure explaining the example of the information processing system of 1st Embodiment. 第２の実施の形態の情報処理システムの例を示す図である。It is a figure which shows the example of the information processing system of the 2nd Embodiment. 情報収集サーバのハードウェア例を示すブロック図である。It is a block diagram which shows the hardware example of an information collection server. サービスと仮想マシンとシステムの対応例を示す図である。It is a figure which shows the correspondence example of a service, a virtual machine, and a system. 情報収集サーバと仮想マシンの通信例を示すシーケンス図である。It is a sequence diagram which shows the communication example of an information collection server and a virtual machine. 仮想マシンのグループ化例を示す図である。It is a figure which shows the grouping example of a virtual machine. 仮想マシンのグループ化例を示す図（続き１）である。It is a figure which shows the grouping example of a virtual machine (continuation 1). 仮想マシンのグループ化例を示す図（続き２）である。It is a figure which shows the grouping example of a virtual machine (continuation 2). 情報収集サーバと仮想マシンの機能例を示すブロック図である。It is a block diagram which shows the functional example of an information collection server and a virtual machine. サービス構造テーブルと稼働履歴テーブルの例を示す図である。It is a figure which shows the example of the service structure table and the operation history table. 相関関係テーブルと稼働状況テーブルの例を示す図である。It is a figure which shows the example of the correlation table and the operation status table. グループ管理テーブルの例を示す図である。It is a figure which shows the example of the group management table. サーバ開始処理の手順例を示すフローチャートである。It is a flowchart which shows the procedure example of a server start processing. グループ判定の手順例を示すフローチャートである。It is a flowchart which shows the procedure example of group determination. 仮想マシン開始処理の手順例を示すフローチャートである。It is a flowchart which shows the procedure example of the virtual machine start processing. 仮想マシン継続処理の手順例を示すフローチャートである。It is a flowchart which shows the procedure example of virtual machine continuation processing. サーバ継続処理の手順例を示すフローチャートである。It is a flowchart which shows the procedure example of a server continuation processing.

以下、本実施の形態を図面を参照して説明する。
［第１の実施の形態］
第１の実施の形態を説明する。 Hereinafter, the present embodiment will be described with reference to the drawings.
[First Embodiment]
The first embodiment will be described.

図１は、第１の実施の形態の情報処理システムの例を説明する図である。
第１の実施の形態の情報処理システムは、複数の仮想マシンを異なる情報処理装置に配置し、それら複数の仮想マシンそれぞれのリソースの使用状況を監視する。第１の実施の形態の情報処理システムは、情報処理装置１０，２０，２０ａを含む。 FIG. 1 is a diagram illustrating an example of an information processing system according to the first embodiment.
In the information processing system of the first embodiment, a plurality of virtual machines are arranged in different information processing devices, and the resource usage status of each of the plurality of virtual machines is monitored. The information processing system of the first embodiment includes information processing devices 10, 20, 20a.

情報処理装置１０は、仮想マシンにおけるリソースの使用状況を示す稼働情報を収集して、仮想マシンを監視するコンピュータである。情報処理装置１０は、クライアントコンピュータでもよいしサーバコンピュータでもよい。 The information processing device 10 is a computer that monitors a virtual machine by collecting operation information indicating the usage status of resources in the virtual machine. The information processing device 10 may be a client computer or a server computer.

情報処理装置２０，２０ａは、仮想マシンを実行可能なサーバコンピュータである。情報処理装置２０，２０ａは、情報処理装置１０とネットワークを介して通信することができる。情報処理装置２０，２０ａは、データセンタのコンピュータでもよく、クラウド事業者が所有するパブリッククラウドシステムのコンピュータでもよい。また、情報処理装置２０，２０ａは、仮想マシンを利用してユーザにサービスを提供するサービス事業者が所有するプライベートクラウドシステムまたはオンプレミスシステムのコンピュータでもよい。情報処理装置２０と情報処理装置２０ａとは、異なるシステムに属していてもよい。情報処理装置２０，２０ａと情報処理装置１０とが、インターネットなどの広域データ通信ネットワークを介して通信するようにしてもよい。 The information processing devices 20 and 20a are server computers capable of executing virtual machines. The information processing devices 20 and 20a can communicate with the information processing device 10 via a network. The information processing devices 20 and 20a may be computers in a data center or computers in a public cloud system owned by a cloud operator. Further, the information processing devices 20 and 20a may be computers of a private cloud system or an on-premises system owned by a service provider that provides services to users by using a virtual machine. The information processing device 20 and the information processing device 20a may belong to different systems. The information processing devices 20 and 20a and the information processing device 10 may communicate with each other via a wide area data communication network such as the Internet.

情報処理装置２０には、仮想マシン２１（第１の仮想マシン）が配置されている。情報処理装置２０ａには、仮想マシン２２（第２の仮想マシン）が配置されている。情報処理装置１０は、仮想マシン２１，２２それぞれのリソースの使用状況を監視する。例えば、情報処理装置１０は、仮想マシン２１，２２におけるリソースの使用状況を、表示装置に表示するなどの方法によってサービス管理者に提示する。リソースの使用状況を監視することで、新たな仮想マシンの追加、一部の仮想マシンの削除、仮想マシンに割り当てるリソースの増強、仮想マシンを配置するシステムの変更などの構成変更を検討できる。 A virtual machine 21 (first virtual machine) is arranged in the information processing device 20. A virtual machine 22 (second virtual machine) is arranged in the information processing device 20a. The information processing device 10 monitors the resource usage status of each of the virtual machines 21 and 22. For example, the information processing device 10 presents the resource usage status in the virtual machines 21 and 22 to the service manager by a method such as displaying it on the display device. By monitoring resource usage, you can consider configuration changes such as adding new virtual machines, deleting some virtual machines, increasing the resources allocated to virtual machines, and changing the system where virtual machines are located.

リソースは、仮想マシンに割り当てられて情報処理に用いられるハードウェア資源である。リソースの使用状況として、ＣＰＵ（Central Processing Unit）などのプロセッサの使用率、ＲＡＭ（Random Access Memory）などのメモリの使用量、ＨＤＤ（Hard Disk Drive）などのストレージのアクセス量、ネットワーク帯域使用率などが挙げられる。 A resource is a hardware resource that is allocated to a virtual machine and used for information processing. Resource usage includes processor usage such as CPU (Central Processing Unit), memory usage such as RAM (Random Access Memory), storage access such as HDD (Hard Disk Drive), and network bandwidth usage. Can be mentioned.

仮想マシン２１は、仮想マシン２１におけるリソースの使用状況を示す稼働情報２３（第１の稼働情報）を送信することができる。仮想マシン２２は、仮想マシン２２におけるリソースの使用状況を示す稼働情報２４（第２の稼働情報）を送信することができる。仮想マシン２１，２２は、情報処理装置１０からの要求に応じて稼働情報を送信してもよいし、情報処理装置１０から直接的または間接的に指定された頻度で自律的に稼働情報を送信してもよい。後述する送信頻度の低下には、稼働情報の送信を停止することも含まれる。なお、仮想マシン２１，２２は、それぞれゲストオペレーティングシステム（ＯＳ：Operating System）を実行する狭義の仮想マシンでもよいし、コンテナ型コンピュータ仮想化技術におけるコンテナなどゲストＯＳを実行しない仮想情報処理主体でもよい。 The virtual machine 21 can transmit operation information 23 (first operation information) indicating the resource usage status in the virtual machine 21. The virtual machine 22 can transmit operation information 24 (second operation information) indicating the resource usage status in the virtual machine 22. The virtual machines 21 and 22 may transmit operation information in response to a request from the information processing device 10, or autonomously transmit operation information from the information processing device 10 at a frequency specified directly or indirectly. You may. The decrease in transmission frequency, which will be described later, includes stopping the transmission of operation information. The virtual machines 21 and 22 may be virtual machines in a narrow sense that execute a guest operating system (OS), respectively, or may be a virtual information processing entity that does not execute a guest OS such as a container in container-type computer virtualization technology. ..

情報処理装置１０は、通信部１１、記憶部１２および処理部１３を有する。通信部１１は、情報処理装置２０，２０ａとネットワークを介して通信する通信インタフェースである。通信部１１は、スイッチやルータなどの有線通信装置に接続される有線通信インタフェースでもよいし、アクセスポイントや無線基地局などの無線通信装置に接続される無線通信インタフェースでもよい。記憶部１２は、ＲＡＭなどの揮発性半導体メモリでもよいし、ＨＤＤやフラッシュメモリなどの不揮発性ストレージでもよい。 The information processing device 10 includes a communication unit 11, a storage unit 12, and a processing unit 13. The communication unit 11 is a communication interface that communicates with the information processing devices 20 and 20a via a network. The communication unit 11 may be a wired communication interface connected to a wired communication device such as a switch or a router, or a wireless communication interface connected to a wireless communication device such as an access point or a wireless base station. The storage unit 12 may be a volatile semiconductor memory such as a RAM, or a non-volatile storage such as an HDD or a flash memory.

処理部１３は、例えば、ＣＰＵ、ＧＰＵ（Graphics Processing Unit）、ＤＳＰ（Digital Signal Processor）などのプロセッサである。ただし、処理部１３は、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）などの特定用途の電子回路を含んでもよい。プロセッサは、ＲＡＭなどのメモリ（記憶部１２でもよい）に記憶されたプログラムを実行する。複数のプロセッサの集合を「マルチプロセッサ」または単に「プロセッサ」と言うこともある。 The processing unit 13 is, for example, a processor such as a CPU, a GPU (Graphics Processing Unit), or a DSP (Digital Signal Processor). However, the processing unit 13 may include an electronic circuit for a specific purpose such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array). The processor executes a program stored in a memory (may be a storage unit 12) such as a RAM. A set of multiple processors is sometimes referred to as a "multiprocessor" or simply a "processor."

通信部１１は、仮想マシン２１から稼働情報２３を受信し、仮想マシン２２から稼働情報２４を受信する。記憶部１２は、仮想マシン２１から受信された稼働情報２３および仮想マシン２２から受信された稼働情報２４の履歴を記憶する。稼働情報２３，２４の履歴は、異なる複数の時刻それぞれのリソースの使用状況を示す。記憶部１２に十分な履歴が蓄積されるまでは、例えば、通信部１１は、低い所定頻度（例えば、１分間隔）で仮想マシン２１，２２それぞれから稼働情報を受信する。処理部１３は、送信頻度の初期値を仮想マシン２１，２２に通知することで、仮想マシン２１，２２に定期的に稼働情報を送信させるようにしてもよい。また、通信部１１は、所定頻度で仮想マシン２１，２２に稼働情報を要求することで、稼働情報を収集するようにしてもよい。 The communication unit 11 receives the operation information 23 from the virtual machine 21 and the operation information 24 from the virtual machine 22. The storage unit 12 stores the history of the operation information 23 received from the virtual machine 21 and the operation information 24 received from the virtual machine 22. The history of the operation information 23 and 24 shows the usage status of the resource at each of a plurality of different times. Until a sufficient history is accumulated in the storage unit 12, for example, the communication unit 11 receives operation information from each of the virtual machines 21 and 22 at a low predetermined frequency (for example, at 1-minute intervals). The processing unit 13 may notify the virtual machines 21 and 22 of the initial value of the transmission frequency so that the virtual machines 21 and 22 may periodically transmit the operation information. Further, the communication unit 11 may collect the operation information by requesting the virtual machines 21 and 22 for the operation information at a predetermined frequency.

処理部１３は、記憶部１２に記憶された履歴に基づいて、仮想マシン２１と仮想マシン２２との間のリソースの使用状況の相関関係１４を判定する。稼働情報２３，２４が２以上の種類のリソースの使用状況を示す場合、処理部１３は、リソースの種類毎に相関関係１４を判定してもよい。例えば、処理部１３は、記憶部１２に記憶された履歴の中から、時刻が近い稼働情報２３と稼働情報２４とを対応付け、複数のペアに基づいて、稼働情報２３に含まれる数値と稼働情報２４に含まれる数値の間の相関係数を算出する。また、例えば、処理部１３は、直近の一定期間の稼働情報２３，２４に基づいて、稼働情報２３に含まれる数値に対する稼働情報２４に含まれる数値の平均倍率を算出する。 The processing unit 13 determines the correlation 14 of the resource usage status between the virtual machine 21 and the virtual machine 22 based on the history stored in the storage unit 12. When the operation information 23 and 24 indicate the usage status of two or more types of resources, the processing unit 13 may determine the correlation 14 for each type of resource. For example, the processing unit 13 correlates the operation information 23 and the operation information 24 whose times are close to each other from the history stored in the storage unit 12, and based on a plurality of pairs, the numerical value included in the operation information 23 and the operation. The correlation coefficient between the numerical values included in the information 24 is calculated. Further, for example, the processing unit 13 calculates the average magnification of the numerical value included in the operation information 24 with respect to the numerical value included in the operation information 23 based on the operation information 23 and 24 in the latest fixed period.

相関関係１４が所定条件を満たす場合、処理部１３は、仮想マシン２２の稼働情報２４の受信頻度が、仮想マシン２１の稼働情報２３の受信頻度よりも低くなるよう制御する。所定条件は、例えば、相関係数の絶対値が閾値（例えば、０．８）より大きいことである。このとき、処理部１３は、稼働情報２４の受信頻度を、相関関係１４を判定する前の初期値より低くしてもよく、仮想マシン２２からの稼働情報２４の受信を停止してもよい。また、処理部１３は、稼働情報２３の受信頻度を、１分間隔をその半分の３０秒間隔にするなど、相関関係１４を判定する前の初期値より高くしてもよい。 When the correlation 14 satisfies a predetermined condition, the processing unit 13 controls so that the reception frequency of the operation information 24 of the virtual machine 22 is lower than the reception frequency of the operation information 23 of the virtual machine 21. The predetermined condition is, for example, that the absolute value of the correlation coefficient is larger than the threshold value (for example, 0.8). At this time, the processing unit 13 may set the reception frequency of the operation information 24 to be lower than the initial value before determining the correlation 14, or may stop the reception of the operation information 24 from the virtual machine 22. Further, the processing unit 13 may set the reception frequency of the operation information 23 to be higher than the initial value before determining the correlation 14, such as setting the 1-minute interval to the half of the 30-second interval.

受信頻度の制御では、処理部１３は、仮想マシン２２に関する情報を仮想マシン２１に通知することで、仮想マシン２１を通じて仮想マシン２２に稼働情報２４の送信頻度の低下を指示するようにしてもよい。また、処理部１３は、稼働情報２４の送信頻度の低下を仮想マシン２２に直接指示するようにしてもよい。仮想マシン２２に対して直接的または間接的に行われる指示は、送信停止の指示であってもよい。また、通信部１１から仮想マシン２２に稼働情報２４を要求する頻度を下げるようにしてもよい。 In the control of the reception frequency, the processing unit 13 may notify the virtual machine 21 of the information about the virtual machine 22 to instruct the virtual machine 22 to reduce the transmission frequency of the operation information 24 through the virtual machine 21. .. Further, the processing unit 13 may directly instruct the virtual machine 22 to reduce the transmission frequency of the operation information 24. The instruction given directly or indirectly to the virtual machine 22 may be an instruction to stop transmission. Further, the frequency of requesting the operation information 24 from the communication unit 11 to the virtual machine 22 may be reduced.

相関関係１４が所定条件を満たす場合に受信頻度を低くする仮想マシンは、仮想マシン２１，２２のうちデータフローの下流側に位置する仮想マシンとしてもよい。例えば、通信部１１がテストリクエストを送信し、仮想マシン２１，２２の通信ログを収集するなどの方法により、処理部１３が複数の仮想マシンの間のメッセージ通信の順序を検出する。仮想マシン２１がメッセージの送信元であり仮想マシン２２が当該メッセージの送信先である場合、処理部１３は、仮想マシン２１，２２のうちデータフローの下流側に位置する仮想マシン２２の受信頻度を低くすることが考えられる。 The virtual machine whose reception frequency is lowered when the correlation 14 satisfies a predetermined condition may be a virtual machine located on the downstream side of the data flow among the virtual machines 21 and 22. For example, the communication unit 11 transmits a test request and collects the communication logs of the virtual machines 21 and 22, and the processing unit 13 detects the order of message communication between the plurality of virtual machines. When the virtual machine 21 is the source of the message and the virtual machine 22 is the destination of the message, the processing unit 13 determines the reception frequency of the virtual machine 22 located on the downstream side of the data flow among the virtual machines 21 and 22. It is possible to lower it.

稼働情報２４の受信頻度が稼働情報２３の受信頻度より低くなると、仮想マシン２１のリソースの使用状況を示す稼働情報２３は収集される一方、仮想マシン２２のリソースの使用状況を示す稼働情報２４は収集されない期間が生じる。そこで、処理部１３は、稼働情報２４が収集されない期間について、仮想マシン２２のリソースの使用状況を推定する。具体的には、処理部１３は、仮想マシン２１から受信された稼働情報２３と相関関係１４とに基づいて、仮想マシン２２のリソースの使用状況を推定する。例えば、処理部１３は、稼働情報２３に含まれる数値に、相関関係１４を判定する際に算出しておいた倍率を乗じたものを、仮想マシン２２のリソースの使用状況を示す数値と推定する。 When the reception frequency of the operation information 24 becomes lower than the reception frequency of the operation information 23, the operation information 23 indicating the resource usage status of the virtual machine 21 is collected, while the operation information 24 indicating the resource usage status of the virtual machine 22 is collected. There will be periods when it will not be collected. Therefore, the processing unit 13 estimates the resource usage status of the virtual machine 22 during the period when the operation information 24 is not collected. Specifically, the processing unit 13 estimates the resource usage status of the virtual machine 22 based on the operation information 23 received from the virtual machine 21 and the correlation 14. For example, the processing unit 13 estimates that the numerical value included in the operation information 23 is multiplied by the magnification calculated when determining the correlation 14 as a numerical value indicating the resource usage status of the virtual machine 22. ..

処理部１３は、稼働情報２３に含まれる数値と推定した数値とを混在させてサービス管理者に提示してもよい。例えば、処理部１３は、仮想マシン２１のリソースの使用状況については、最新の稼働情報２３に含まれる数値を表示し、仮想マシン２２のリソースの使用状況については、相関関係１４に基づいて推定した数値を表示する。 The processing unit 13 may mix and present the numerical value included in the operation information 23 and the estimated numerical value to the service manager. For example, the processing unit 13 displays the numerical value included in the latest operation information 23 for the resource usage status of the virtual machine 21, and estimates the resource usage status of the virtual machine 22 based on the correlation 14. Display the numerical value.

第１の実施の形態の情報処理装置１０によれば、仮想マシン２１から受信された稼働情報２３および仮想マシン２２から受信された稼働情報２４の履歴が蓄積される。その履歴に基づいて、仮想マシン２１と仮想マシン２２との間のリソースの使用状況の相関関係１４が判定され、相関関係１４が所定条件を満たす場合、仮想マシン２２の稼働情報２４の受信頻度が低くなる。そして、仮想マシン２１から受信された稼働情報２３と相関関係１４とに基づいて、仮想マシン２２のリソースの使用状況が推定される。 According to the information processing apparatus 10 of the first embodiment, the history of the operation information 23 received from the virtual machine 21 and the operation information 24 received from the virtual machine 22 is accumulated. Based on the history, the correlation 14 of the resource usage status between the virtual machine 21 and the virtual machine 22 is determined, and when the correlation 14 satisfies a predetermined condition, the reception frequency of the operation information 24 of the virtual machine 22 is high. It gets lower. Then, the resource usage status of the virtual machine 22 is estimated based on the operation information 23 received from the virtual machine 21 and the correlation 14.

これにより、一部の仮想マシンからの稼働情報の収集頻度が低くなり、全ての仮想マシンから高頻度で稼働情報を収集する場合と比べて通信量を削減することができる。特に、複数の仮想マシンが異なるシステム（例えば、異なるクラウドシステム）に配置されている場合においては、インターネットなどの広域データ通信ネットワークのトラフィックを削減することができる。また、収集頻度が高い仮想マシンの稼働情報と相関関係から、収集頻度が低い仮想マシン（収集を停止している仮想マシンを含む）の稼働情報が推定される。よって、全ての仮想マシンの収集頻度を一律に低くする場合と比べて、リソースの使用状況を監視する監視精度を向上させることができる。 As a result, the frequency of collecting operation information from some virtual machines becomes low, and the amount of communication can be reduced as compared with the case of collecting operation information from all virtual machines at high frequency. In particular, when a plurality of virtual machines are arranged in different systems (for example, different cloud systems), it is possible to reduce the traffic of a wide area data communication network such as the Internet. In addition, the operation information of virtual machines with low collection frequency (including virtual machines for which collection is stopped) is estimated from the correlation with the operation information of virtual machines with high collection frequency. Therefore, it is possible to improve the monitoring accuracy of monitoring the resource usage status as compared with the case where the collection frequency of all virtual machines is uniformly reduced.

なお、第１の実施の形態の情報処理システムでは、仮想マシン２２のリソースの使用状況を推定する推定処理が行われており、推定処理の追加は情報処理システムの負荷を若干増大させる可能性がある。一方、推定処理の採用によって、仮想マシン２２から情報処理装置１０に稼働情報２４を送信して稼働情報２４を処理する監視処理が減少し、監視処理の減少は情報処理システムの負荷を大きく低減させる。よって、推定処理の負荷増大よりも監視処理の負荷減少の効果が大きく、情報処理システムの全体的負荷は低減する。 In the information processing system of the first embodiment, an estimation process for estimating the resource usage status of the virtual machine 22 is performed, and the addition of the estimation process may slightly increase the load on the information processing system. is there. On the other hand, by adopting the estimation process, the monitoring process of transmitting the operation information 24 from the virtual machine 22 to the information processing device 10 and processing the operation information 24 is reduced, and the reduction of the monitoring process greatly reduces the load on the information processing system. .. Therefore, the effect of reducing the load of the monitoring process is greater than the increase of the load of the estimation process, and the overall load of the information processing system is reduced.

また、仮想マシン２１を通じて仮想マシン２２に各種の指示を送信するなど、収集頻度を下げた仮想マシン２２を仮想マシン２１に従属させ、仮想マシン２２の監視を仮想マシン２１に委ねる分散管理を採用することもできる。これにより、情報処理装置１０と仮想マシン２２との間の通信を削減できる。例えば、仮想マシン２１，２２が同一のクラウドシステムに配置され、情報処理装置１０が当該クラウドシステムの外部に存在する場合、広域データ通信ネットワークの通信を削減することができる。 Further, a distributed management is adopted in which the virtual machine 22 whose collection frequency is reduced is subordinated to the virtual machine 21 by transmitting various instructions to the virtual machine 22 through the virtual machine 21, and the monitoring of the virtual machine 22 is entrusted to the virtual machine 21. You can also do it. As a result, communication between the information processing device 10 and the virtual machine 22 can be reduced. For example, when the virtual machines 21 and 22 are arranged in the same cloud system and the information processing device 10 exists outside the cloud system, the communication of the wide area data communication network can be reduced.

［第２の実施の形態］
次に、第２の実施の形態を説明する。
図２は、第２の実施の形態の情報処理システムの例を示す図である。 [Second Embodiment]
Next, a second embodiment will be described.
FIG. 2 is a diagram showing an example of the information processing system of the second embodiment.

第２の実施の形態の情報処理システムは、サービス提供に用いられる複数の仮想マシンを、異なるシステムに分散して配置するマルチクラウドシステムである。広義のマルチクラウドシステムは、狭義のマルチクラウドシステムとハイブリッドクラウドシステムを含む。狭義のマルチクラウドシステムは、複数の仮想マシンを、２以上のパブリッククラウドシステムに分散して配置するものである。ハイブリッドクラウドシステムは、複数の仮想マシンを、パブリッククラウドシステムと自社システム（オンプレミスシステム）に分散して配置するものである。複数のシステムを利用することで、サービス停止リスクを低減できると共に、システム特有の機能を活用することができる。 The information processing system of the second embodiment is a multi-cloud system in which a plurality of virtual machines used for providing services are distributed and arranged in different systems. A broad-sense multi-cloud system includes a narrow-sense multi-cloud system and a hybrid cloud system. A multi-cloud system in a narrow sense is a system in which a plurality of virtual machines are distributed and arranged in two or more public cloud systems. In the hybrid cloud system, a plurality of virtual machines are distributed and arranged in a public cloud system and an in-house system (on-premises system). By using multiple systems, it is possible to reduce the risk of service outages and utilize system-specific functions.

第２の実施の形態の情報処理システムは、クラウドシステム３１，３２、自社システム３３および情報収集サーバ１００を含む。クラウドシステム３１，３２、自社システム３３および情報収集サーバ１００は、ネットワーク３０に接続されている。ネットワーク３０は、インターネットなどの広域データ通信ネットワークである。 The information processing system of the second embodiment includes cloud systems 31, 32, an in-house system 33, and an information collection server 100. The cloud systems 31 and 32, the in-house system 33, and the information collection server 100 are connected to the network 30. The network 30 is a wide area data communication network such as the Internet.

クラウドシステム３１は、あるクラウド事業者によって所有され、そのクラウド事業者との契約に従ってサービス事業者が仮想マシンを配置することができるパブリッククラウドシステムである。クラウドシステム３１は、仮想マシンを実行する物理マシンであるサーバ２００，２００ａなどの複数のサーバを含む。 The cloud system 31 is a public cloud system that is owned by a cloud operator and allows the service provider to place virtual machines in accordance with a contract with the cloud operator. The cloud system 31 includes a plurality of servers such as servers 200 and 200a, which are physical machines that execute virtual machines.

クラウドシステム３２は、クラウドシステム３１とは異なるクラウド事業者によって所有され、そのクラウド事業者との契約に従ってサービス事業者が仮想マシンを配置することができるパブリッククラウドシステムである。クラウドシステム３２は、仮想マシンを実行する物理マシンであるサーバ２００ｂ，２００ｃなどの複数のサーバを有する。 The cloud system 32 is a public cloud system that is owned by a cloud operator different from the cloud system 31 and in which the service provider can arrange virtual machines according to a contract with the cloud operator. The cloud system 32 has a plurality of servers such as servers 200b and 200c, which are physical machines that execute virtual machines.

自社システム３３は、仮想マシンを用いてサービスを提供するサービス事業者によって所有されるオンプレミスシステムである。自社システム３３は、仮想マシンを実行するサーバ２００ｄ，２００ｅなどの複数のサーバを含む。サービス事業者は、複数の仮想マシンを、クラウドシステム３１，３２および自社システム３３のうちの２以上のシステムに分散して配置する。それら複数の仮想マシンは、ユーザからのリクエストに応じて連携して情報処理を行い、ユーザに対してレスポンスを送信する。例えば、１つの仮想マシンがリクエストを受信し、複数の仮想マシンがメッセージを木構造に転送しながら分散してデータ処理を行い、当該１つの仮想マシンが最終結果を集約してレスポンスを送信する。サービスの管理者によって、それら複数の仮想マシンの配置が決定される。 The in-house system 33 is an on-premises system owned by a service provider that provides services using virtual machines. The in-house system 33 includes a plurality of servers such as servers 200d and 200e that execute virtual machines. The service provider distributes and arranges a plurality of virtual machines in two or more of the cloud systems 31 and 32 and the in-house system 33. The plurality of virtual machines cooperate with each other to process information in response to a request from the user, and send a response to the user. For example, one virtual machine receives a request, a plurality of virtual machines transfer messages to a tree structure and perform data processing in a distributed manner, and the one virtual machine aggregates the final result and sends a response. The service administrator determines the placement of these multiple virtual machines.

情報収集サーバ１００は、サービスに用いられる複数の仮想マシンにおけるリソースの使用状況を監視するサーバコンピュータである。情報収集サーバ１００は、クラウドシステム３１，３２の外側に位置する。情報収集サーバ１００は、サービス事業者によって所有されてもよく、自社システム３３に含まれていてもよい。 The information collection server 100 is a server computer that monitors the usage status of resources in a plurality of virtual machines used for services. The information collection server 100 is located outside the cloud systems 31 and 32. The information collection server 100 may be owned by the service provider or may be included in the company's system 33.

情報収集サーバ１００は、複数の仮想マシンのうちの少なくとも一部から、ネットワーク３０を介して、リソースの使用状況を示す稼働情報を定期的に受信する。リソースの使用状況は、仮想マシンに割り当てられたハードウェア資源の使用状況である。リソースの使用状況として、ＣＰＵ使用率、ＲＡＭ使用量、ＨＤＤなどのストレージのアクセス量、ネットワーク帯域使用率などが挙げられる。情報収集サーバ１００は、複数の仮想マシンにおけるリソースの使用状況を、サービスの管理者に対して報告する。例えば、情報収集サーバ１００は、複数の仮想マシンのリソースの使用状況を列挙した画面を表示する。 The information collection server 100 periodically receives operation information indicating the resource usage status from at least a part of the plurality of virtual machines via the network 30. The resource usage is the usage of the hardware resources allocated to the virtual machine. Examples of the resource usage include CPU usage, RAM usage, storage access such as HDD, and network bandwidth usage. The information collection server 100 reports the resource usage status in the plurality of virtual machines to the service administrator. For example, the information collection server 100 displays a screen listing the resource usage status of a plurality of virtual machines.

複数の仮想マシンのリソースの使用状況は、サービスの運用管理のために参照される。例えば、新たな仮想マシンの追加、一部の仮想マシンの削除、仮想マシンに割り当てるリソースの増強、仮想マシンを配置するシステムの変更などの構成変更が検討される。なお、サーバ２００は、第１の実施の形態の情報処理装置２０に対応する。サーバ２００ｂは、第１の実施の形態の情報処理装置２０ａに対応する。情報収集サーバ１００は、第１の実施の形態の情報処理装置１０に対応する。 The resource usage status of multiple virtual machines is referred to for service operation management. For example, configuration changes such as adding new virtual machines, deleting some virtual machines, increasing resources allocated to virtual machines, and changing the system in which virtual machines are placed are considered. The server 200 corresponds to the information processing device 20 of the first embodiment. The server 200b corresponds to the information processing device 20a of the first embodiment. The information collection server 100 corresponds to the information processing device 10 of the first embodiment.

図３は、情報収集サーバのハードウェア例を示すブロック図である。
情報収集サーバ１００は、ＣＰＵ１０１、ＲＡＭ１０２、ＨＤＤ１０３、画像インタフェース１０４、入力インタフェース１０５、媒体リーダ１０６および通信インタフェース１０７を有する。情報収集サーバ１００が有するこれらのユニットは、バスに接続されている。ＣＰＵ１０１は、第１の実施の形態の処理部１３に対応する。ＲＡＭ１０２またはＨＤＤ１０３は、第１の実施の形態の記憶部１２に対応する。通信インタフェース１０７は、第１の実施の形態の通信部１１に対応する。サーバ２００，２００ａ，２００ｂ，２００ｃ，２００ｄ，２００ｅも、情報収集サーバ１００と同様のハードウェアを有する。 FIG. 3 is a block diagram showing a hardware example of the information collection server.
The information collection server 100 includes a CPU 101, a RAM 102, an HDD 103, an image interface 104, an input interface 105, a medium reader 106, and a communication interface 107. These units included in the information collection server 100 are connected to the bus. The CPU 101 corresponds to the processing unit 13 of the first embodiment. The RAM 102 or the HDD 103 corresponds to the storage unit 12 of the first embodiment. The communication interface 107 corresponds to the communication unit 11 of the first embodiment. The servers 200, 200a, 200b, 200c, 200d, and 200e also have the same hardware as the information collection server 100.

ＣＰＵ１０１は、プログラムの命令を実行するプロセッサである。ＣＰＵ１０１は、ＨＤＤ１０３に記憶されたプログラムやデータの少なくとも一部をＲＡＭ１０２にロードし、プログラムを実行する。ＣＰＵ１０１は複数のプロセッサコアを備えてもよく、情報収集サーバ１００は複数のプロセッサを備えてもよい。複数のプロセッサの集合を「マルチプロセッサ」または単に「プロセッサ」と言うことがある。 The CPU 101 is a processor that executes a program instruction. The CPU 101 loads at least a part of the programs and data stored in the HDD 103 into the RAM 102 and executes the program. The CPU 101 may include a plurality of processor cores, and the information collection server 100 may include a plurality of processors. A collection of multiple processors is sometimes referred to as a "multiprocessor" or simply a "processor."

ＲＡＭ１０２は、ＣＰＵ１０１が実行するプログラムやＣＰＵ１０１が演算に使用するデータを一時的に記憶する揮発性半導体メモリである。情報収集サーバ１００は、ＲＡＭ以外の種類のメモリを備えてもよく、複数のメモリを備えてもよい。 The RAM 102 is a volatile semiconductor memory that temporarily stores a program executed by the CPU 101 and data used by the CPU 101 for calculation. The information collection server 100 may include a type of memory other than RAM, or may include a plurality of memories.

ＨＤＤ１０３は、ＯＳ（Operating System）やミドルウェアやアプリケーションソフトウェアなどのソフトウェアのプログラム、および、データを記憶する不揮発性ストレージである。情報収集サーバ１００は、フラッシュメモリやＳＳＤ（Solid State Drive）など他の種類のストレージを備えてもよく、複数のストレージを備えてもよい。 The HDD 103 is a non-volatile storage that stores software programs such as an OS (Operating System), middleware, and application software, and data. The information collection server 100 may include other types of storage such as a flash memory and an SSD (Solid State Drive), or may include a plurality of storages.

画像インタフェース１０４は、ＣＰＵ１０１からの命令に従って、情報収集サーバ１００に接続された表示装置１１１に画像を出力する。表示装置１１１として、ＣＲＴ（Cathode Ray Tube）ディスプレイ、液晶ディスプレイ（ＬＣＤ：Liquid Crystal Display）、有機ＥＬ（ＯＥＬ：Organic Electro-Luminescence）ディスプレイ、プロジェクタなど、任意の種類の表示装置を使用することができる。情報収集サーバ１００に、プリンタなど表示装置１１１以外の出力デバイスが接続されてもよい。 The image interface 104 outputs an image to the display device 111 connected to the information collection server 100 in accordance with a command from the CPU 101. As the display device 111, any kind of display device such as a CRT (Cathode Ray Tube) display, a liquid crystal display (LCD), an organic EL (OEL: Organic Electro-Luminescence) display, and a projector can be used. .. An output device other than the display device 111 such as a printer may be connected to the information collection server 100.

入力インタフェース１０５は、情報収集サーバ１００に接続された入力デバイス１１２から入力信号を受け付ける。入力デバイス１１２として、マウス、タッチパネル、タッチパッド、キーボードなど、任意の種類の入力デバイスを使用することができる。情報収集サーバ１００に複数種類の入力デバイスが接続されてもよい。 The input interface 105 receives an input signal from the input device 112 connected to the information collection server 100. As the input device 112, any kind of input device such as a mouse, a touch panel, a touch pad, and a keyboard can be used. A plurality of types of input devices may be connected to the information collection server 100.

媒体リーダ１０６は、記録媒体１１３に記録されたプログラムやデータを読み取る読み取り装置である。記録媒体１１３として、フレキシブルディスク（ＦＤ：Flexible Disk）やＨＤＤなどの磁気ディスク、ＣＤ（Compact Disc）やＤＶＤ（Digital Versatile Disc）などの光ディスク、半導体メモリなど、任意の種類の記録媒体を使用することができる。媒体リーダ１０６は、例えば、記録媒体１１３から読み取ったプログラムやデータを、ＲＡＭ１０２やＨＤＤ１０３などの他の記録媒体にコピーする。読み取られたプログラムは、例えば、ＣＰＵ１０１によって実行される。なお、記録媒体１１３は可搬型記録媒体であってもよく、プログラムやデータの配布に用いられることがある。また、記録媒体１１３やＨＤＤ１０３を、コンピュータ読み取り可能な記録媒体と言うことがある。 The medium reader 106 is a reading device that reads programs and data recorded on the recording medium 113. As the recording medium 113, any kind of recording medium such as a magnetic disk such as a flexible disk (FD) or HDD, an optical disk such as a CD (Compact Disc) or a DVD (Digital Versatile Disc), or a semiconductor memory is used. Can be done. The medium reader 106 copies, for example, a program or data read from the recording medium 113 to another recording medium such as the RAM 102 or the HDD 103. The read program is executed by, for example, the CPU 101. The recording medium 113 may be a portable recording medium, and may be used for distribution of programs and data. Further, the recording medium 113 and the HDD 103 may be referred to as a computer-readable recording medium.

通信インタフェース１０７は、ネットワーク３０に接続され、ネットワーク３０を介してサーバ２００，２００ａ，２００ｂ，２００ｃ，２００ｄ，２００ｅと通信する。通信インタフェース１０７は、例えば、スイッチやルータなどの有線通信装置に接続される有線通信インタフェースである。ただし、通信インタフェース１０７が、基地局やアクセスポイントなどの無線通信装置に接続される無線通信インタフェースであってもよい。 The communication interface 107 is connected to the network 30 and communicates with the servers 200, 200a, 200b, 200c, 200d, 200e via the network 30. The communication interface 107 is a wired communication interface connected to, for example, a wired communication device such as a switch or a router. However, the communication interface 107 may be a wireless communication interface connected to a wireless communication device such as a base station or an access point.

次に、複数の仮想マシンの配置および接続関係について説明する。
図４は、サービスと仮想マシンとシステムの対応例を示す図である。
第２の実施の形態で運用されるサービスの１つの例は、プロセス４１，４２，４３，４４，４５，４６を含む。プロセス４１，４２，４３，４４，４５，４６の間の通信のトポロジは木構造である。プロセス４１は、ユーザからリクエストを受信する。プロセス４１は、プロセス４２，４３にそれぞれメッセージを送信して、プロセス４２，４３にデータ処理を要求する。プロセス４３は、プロセス４１からメッセージを受信した後、プロセス４４にメッセージを送信して、プロセス４４にデータ処理を要求する。プロセス４４は、プロセス４３からメッセージを受信した後、プロセス４５にメッセージを送信して、プロセス４５にデータ処理を要求する。プロセス４２は、プロセス４１からメッセージを受信した後、プロセス４６にメッセージを送信して、プロセス４６にデータ処理を要求する。 Next, the arrangement and connection relationship of a plurality of virtual machines will be described.
FIG. 4 is a diagram showing an example of correspondence between a service, a virtual machine, and a system.
One example of a service operated in the second embodiment includes processes 41, 42, 43, 44, 45, 46. The topology of communication between processes 41, 42, 43, 44, 45, 46 is tree-structured. Process 41 receives a request from the user. Process 41 sends a message to processes 42 and 43, respectively, and requests processes 42 and 43 to process data. After receiving the message from the process 41, the process 43 sends a message to the process 44 to request the process 44 to process the data. After receiving the message from the process 43, the process 44 sends a message to the process 45 to request the process 45 to process data. After receiving the message from the process 41, the process 42 sends a message to the process 46 to request the process 46 to process data.

プロセス４５は、要求されたデータ処理を行う。プロセス４４は、送信したメッセージに対する応答として、プロセス４５からデータ処理結果を受信し、自身のデータ処理を行う。プロセス４３は、送信したメッセージに対する応答として、プロセス４４からデータ処理結果を受信し、自身のデータ処理を行う。プロセス４６は、要求されたデータ処理を行う。プロセス４２は、送信したメッセージに対する応答として、プロセス４６からデータ処理結果を受信し、自身のデータ処理を行う。プロセス４１は、送信したメッセージに対する応答として、プロセス４２，４３からデータ処理結果を受信する。最後に、プロセス４１は、自身のデータ処理を行い、ユーザにレスポンスを送信する。 Process 45 performs the requested data processing. The process 44 receives the data processing result from the process 45 as a response to the transmitted message, and performs its own data processing. The process 43 receives the data processing result from the process 44 as a response to the transmitted message, and performs its own data processing. Process 46 performs the requested data processing. The process 42 receives the data processing result from the process 46 as a response to the transmitted message, and performs its own data processing. The process 41 receives the data processing result from the processes 42 and 43 as a response to the transmitted message. Finally, the process 41 processes its own data and sends a response to the user.

プロセス４１は、仮想マシン５１で実行される。プロセス４２は、仮想マシン５２で実行される。プロセス４３は、仮想マシン５３で実行される。プロセス４４は、仮想マシン５４で実行される。プロセス４５は、仮想マシン５５で実行される。プロセス４６は、仮想マシン５６で実行される。プロセス４１，４２，４３，４４，４５，４６の処理手順は、アプリケーションプログラムによって規定される。第２の実施の形態の仮想マシン５１，５２，５３，５４，５５，５６は、ゲストＯＳおよびアプリケーションプログラムを実行する狭義の仮想マシンである。ただし、仮想マシン５１，５２，５３，５４，５５，５６を、コンテナ型コンピュータ仮想化技術のコンテナとしてもよく、仮想マシン５１，５２，５３，５４，５５，５６がゲストＯＳを実行していなくてもよい。 The process 41 is executed by the virtual machine 51. The process 42 is executed in the virtual machine 52. Process 43 is executed in virtual machine 53. Process 44 is executed in virtual machine 54. Process 45 is executed in virtual machine 55. Process 46 is executed in virtual machine 56. The processing procedure of processes 41, 42, 43, 44, 45, 46 is defined by the application program. The virtual machines 51, 52, 53, 54, 55, 56 of the second embodiment are virtual machines in a narrow sense that execute the guest OS and the application program. However, the virtual machines 51, 52, 53, 54, 55, 56 may be used as containers for container-type computer virtualization technology, and the virtual machines 51, 52, 53, 54, 55, 56 are not executing the guest OS. You may.

仮想マシン５１，５３，５４，５５は、クラウドシステム３１に配置される。クラウドシステム３１の中では、仮想マシン５１，５３，５４，５５が互いに異なるサーバに配置されてもよいし、仮想マシン５１，５３，５４，５５のうちの２つ以上が同一のサーバに配置されてもよい。仮想マシン５２，５６は、クラウドシステム３２に配置される。クラウドシステム３２の中では、仮想マシン５２，５６が異なるサーバに配置されてもよいし同一のサーバに配置されてもよい。この例では、仮想マシン５１，５２，５３，５４，５５，５６が２つのパブリッククラウドシステムに分割して配置されている。ただし、仮想マシン５１，５２，５３，５４，５５，５６の一部を、自社システム３３に配置することも可能である。なお、仮想マシン５１は、第１の実施の形態の仮想マシン２１に対応する。仮想マシン５２は、第１の実施の形態の仮想マシン２２に対応する。 The virtual machines 51, 53, 54, 55 are arranged in the cloud system 31. In the cloud system 31, virtual machines 51, 53, 54, 55 may be arranged on different servers, or two or more of virtual machines 51, 53, 54, 55 are arranged on the same server. You may. The virtual machines 52 and 56 are arranged in the cloud system 32. In the cloud system 32, the virtual machines 52 and 56 may be arranged on different servers or may be arranged on the same server. In this example, the virtual machines 51, 52, 53, 54, 55, 56 are divided and arranged in two public cloud systems. However, it is also possible to arrange a part of the virtual machines 51, 52, 53, 54, 55, 56 in the in-house system 33. The virtual machine 51 corresponds to the virtual machine 21 of the first embodiment. The virtual machine 52 corresponds to the virtual machine 22 of the first embodiment.

ここで、情報収集サーバ１００は、仮想マシン５１，５２，５３，５４，５５，５６におけるリソースの使用状況を監視する。リソースの使用状況の監視精度を高くし、監視情報のリアルタイム性を確保するには、情報収集サーバ１００が高頻度で稼働情報を受信することが好ましい。しかし、情報収集サーバ１００が仮想マシン５１，５２，５３，５４，５５，５６の全てから高頻度で稼働情報を受信すると、広域データ通信ネットワークであるネットワーク３０のトラフィックが多くなるという問題がある。 Here, the information collection server 100 monitors the resource usage status of the virtual machines 51, 52, 53, 54, 55, and 56. In order to improve the monitoring accuracy of the resource usage status and ensure the real-time property of the monitoring information, it is preferable that the information collection server 100 receives the operation information with high frequency. However, if the information collection server 100 receives operation information from all of the virtual machines 51, 52, 53, 54, 55, and 56 at high frequency, there is a problem that the traffic of the network 30, which is a wide area data communication network, increases.

そこで、第２の実施の形態の情報処理システムは、仮想マシン５１，５２，５３，５４，５５，５６のうち、一部の仮想マシンから情報収集サーバ１００への稼働情報の送信を停止する。稼働情報の送信を停止する仮想マシンは、リソースの使用状況が他の特定の仮想マシンと強い相関をもつ仮想マシンとする。稼働情報を収集しない仮想マシンのリソースの使用状況は、情報収集サーバ１００が、稼働情報を収集する仮想マシンのリソースの使用状況と予め算出しておいた相関関係とに基づいて推定する。これにより、監視精度を維持しつつネットワーク３０のトラフィックを削減できる。 Therefore, the information processing system of the second embodiment stops the transmission of operation information from some of the virtual machines 51, 52, 53, 54, 55, 56 to the information collection server 100. The virtual machine that stops sending operation information is a virtual machine whose resource usage has a strong correlation with other specific virtual machines. The resource usage status of the virtual machine that does not collect the operation information is estimated based on the correlation calculated in advance with the resource usage status of the virtual machine that the information collection server 100 collects the operation information. As a result, the traffic of the network 30 can be reduced while maintaining the monitoring accuracy.

図５は、情報収集サーバと仮想マシンの通信例を示すシーケンス図である。
ここでは、仮想マシン５１，５２，５３，５４，５５，５６のうち、代表して仮想マシン５１，５３の通信について説明する。図５のシーケンス例では、仮想マシン５１が情報収集サーバ１００への稼働情報の送信を継続し、仮想マシン５３が情報収集サーバ１００への稼働情報の送信を一時的に停止している。 FIG. 5 is a sequence diagram showing an example of communication between the information collection server and the virtual machine.
Here, among the virtual machines 51, 52, 53, 54, 55, 56, the communication of the virtual machines 51, 53 will be described as a representative. In the sequence example of FIG. 5, the virtual machine 51 continues to transmit the operation information to the information collection server 100, and the virtual machine 53 temporarily stops the transmission of the operation information to the information collection server 100.

まず、情報収集サーバ１００は、仮想マシン５１，５３にテスト開始通知を送信する（Ｓ１００）。テスト開始通知を受けて、仮想マシン５１，５３は、他の仮想マシンとの通信の監視を開始する。通信の監視は、パケットレベルの送信パケットの監視でもよいし、アプリケーションレベルの送信メッセージの監視でもよい。仮想マシン５１，５３は、送信メッセージの宛先および時刻を少なくとも含む通信ログを採取する。 First, the information collection server 100 transmits a test start notification to the virtual machines 51 and 53 (S100). Upon receiving the test start notification, the virtual machines 51 and 53 start monitoring communication with other virtual machines. The communication monitoring may be packet-level transmission packet monitoring or application-level transmission message monitoring. The virtual machines 51 and 53 collect a communication log including at least the destination and time of the transmitted message.

情報収集サーバ１００は、サービスの入り口である仮想マシン５１にテストリクエストを送信する（Ｓ１０１）。仮想マシン５１は、テストリクエストを受けて、テストメッセージを仮想マシン５３に送信して仮想マシン５３にデータ処理を要求する（Ｓ１０２）。このとき、仮想マシン５１は、仮想マシン５３へのテストメッセージの送信を検出し、仮想マシン５３へのテストメッセージの送信を示す通信ログを採取している。テストリクエストの処理が完了すると、仮想マシン５１，５３は、情報収集サーバ１００に通信ログを送信する（Ｓ１０３）。仮想マシン５１，５３は、自律的に通信ログを送信する。ただし、情報収集サーバ１００が仮想マシン５１，５３に通信ログを要求してもよい。 The information collection server 100 transmits a test request to the virtual machine 51, which is the entrance of the service (S101). Upon receiving the test request, the virtual machine 51 sends a test message to the virtual machine 53 to request the virtual machine 53 to process data (S102). At this time, the virtual machine 51 detects the transmission of the test message to the virtual machine 53, and collects the communication log indicating the transmission of the test message to the virtual machine 53. When the processing of the test request is completed, the virtual machines 51 and 53 transmit the communication log to the information collection server 100 (S103). The virtual machines 51 and 53 autonomously transmit the communication log. However, the information collection server 100 may request the communication logs from the virtual machines 51 and 53.

情報収集サーバ１００は、他の仮想マシンからも同様に通信ログを受信する。情報収集サーバ１００は、これらの通信ログに基づいて仮想マシン間のメッセージ送信の順序を判定し、図４に示すようなサービスのデータフローを判定する。また、情報収集サーバ１００は、稼働情報の収集頻度の初期値として、低い収集頻度を決定する。低い収集頻度は、例えば、１分間隔とする。情報収集サーバ１００は、サービスに参加する仮想マシンの総数に応じて収集頻度の初期値を決定してもよい。その場合、仮想マシンの総数が少ないほど収集頻度を高くし、仮想マシンの総数が多いほど収集頻度を低くする。後述のグループ分類の前は、情報収集サーバ１００が低い収集頻度で稼働情報を収集することで、情報処理システム全体の通信量が一時的に過大になるのを抑制することができる。 The information collection server 100 also receives communication logs from other virtual machines. The information collection server 100 determines the order of message transmission between virtual machines based on these communication logs, and determines the data flow of the service as shown in FIG. Further, the information collection server 100 determines a low collection frequency as an initial value of the operation information collection frequency. The low collection frequency is, for example, 1 minute intervals. The information collection server 100 may determine the initial value of the collection frequency according to the total number of virtual machines participating in the service. In that case, the smaller the total number of virtual machines, the higher the collection frequency, and the larger the total number of virtual machines, the lower the collection frequency. Before the group classification described later, the information collection server 100 collects the operation information at a low collection frequency, so that it is possible to prevent the communication volume of the entire information processing system from becoming excessive temporarily.

情報収集サーバ１００は、仮想マシン５１，５３に収集頻度の初期値を通知する（Ｓ１０４）。仮想マシン５１，５３は、情報収集サーバ１００から通知された低い頻度で、リソースの使用状況を取得して情報収集サーバ１００に稼働情報を送信することを開始する（Ｓ１０５）。リソースの使用状況としては、ＣＰＵ使用率、ＲＡＭ使用量、ＨＤＤアクセス量、ネットワーク帯域使用率などが挙げられる。第２の実施の形態では、リソースとして主にＣＰＵとＲＡＭに着目するものとする。 The information collection server 100 notifies the virtual machines 51 and 53 of the initial value of the collection frequency (S104). The virtual machines 51 and 53 start acquiring the resource usage status and transmitting the operation information to the information collection server 100 at a low frequency notified from the information collection server 100 (S105). Examples of the resource usage include CPU usage, RAM usage, HDD access, network bandwidth usage, and the like. In the second embodiment, the CPU and RAM are mainly focused on as resources.

情報収集サーバ１００は、仮想マシン５１，５３から受信する稼働情報を履歴として少なくとも一定期間保持する。一定期間の稼働情報の履歴が蓄積されると、情報収集サーバ１００は、稼働情報の履歴と複数の仮想マシンの間のデータフローに基づいて、それら複数の仮想マシンをグループに分類する。２つの仮想マシンがデータフロー上で親子関係にあり、当該２つの仮想マシンのリソースの使用状況に強い相関がある場合、当該２つの仮想マシンが同一グループに分類される。２つの仮想マシンの間で、稼働情報に含まれる計測値の相関係数の絶対値が所定の閾値（例えば、０．８）より大きい場合、強い相関があると判断される。親子関係は、一方の仮想マシンが他方の仮想マシンにメッセージを送信する関係である。図４の例によれば、仮想マシン５１，５３に着目すると、仮想マシン５１が親仮想マシンであり仮想マシン５３が子仮想マシンである。 The information collection server 100 retains the operation information received from the virtual machines 51 and 53 as a history for at least a certain period of time. When the history of operation information for a certain period of time is accumulated, the information collection server 100 classifies the plurality of virtual machines into groups based on the history of operation information and the data flow between the plurality of virtual machines. When two virtual machines have a parent-child relationship on the data flow and there is a strong correlation between the resource usage status of the two virtual machines, the two virtual machines are classified into the same group. When the absolute value of the correlation coefficient of the measured value included in the operation information is larger than a predetermined threshold value (for example, 0.8) between the two virtual machines, it is determined that there is a strong correlation. A parent-child relationship is one in which one virtual machine sends a message to the other virtual machine. According to the example of FIG. 4, focusing on the virtual machines 51 and 53, the virtual machine 51 is a parent virtual machine and the virtual machine 53 is a child virtual machine.

子仮想マシンと更に親子関係にある孫仮想マシンのリソースの使用状況が、親仮想マシンと強い相関をもつ場合、孫仮想マシンも連続的に親仮想マシンと同一グループに分類される。情報収集サーバ１００は、各グループの中で最も上流側の仮想マシンを代表仮想マシンとして選択し、代表仮想マシン以外の仮想マシンを従属仮想マシンとして扱う。後述するように、代表仮想マシンのみが情報収集サーバ１００に稼働情報を送信し、従属仮想マシンは情報収集サーバ１００に稼働情報を送信しないことになる。なお、情報収集サーバ１００は、同一グループに分類する仮想マシンを、クラウドシステム３１，３２および自社システム３３のうち同一のシステムに配置された仮想マシンに限定してもよい。 If the resource usage of the grandchild virtual machine that has a parent-child relationship with the child virtual machine has a strong correlation with the parent virtual machine, the grandchild virtual machine is also continuously classified into the same group as the parent virtual machine. The information collection server 100 selects the most upstream virtual machine in each group as the representative virtual machine, and treats the virtual machines other than the representative virtual machine as the subordinate virtual machines. As will be described later, only the representative virtual machine transmits the operation information to the information collection server 100, and the subordinate virtual machine does not transmit the operation information to the information collection server 100. The information collection server 100 may limit the virtual machines classified into the same group to the virtual machines arranged in the same system among the cloud systems 31 and 32 and the in-house system 33.

仮想マシンのグループ化は、リソースの種類毎に行われる。よって、２つの仮想マシンが、ＣＰＵ使用率については強い相関をもつため同一グループに分類される一方、ＲＡＭ使用量については相関が低いため別グループに分類されることが有り得る。ここでは、仮想マシン５１が代表仮想マシンであり、仮想マシン５３が従属仮想マシンであるとする。また、情報収集サーバ１００は、従属仮想マシンについて、代表仮想マシンのリソースの使用状況に対する倍率を算出する。倍率は、例えば、一定期間の従属仮想マシンの計測値の平均を代表仮想マシンの計測値の平均で割ることで算出できる。ただし、倍率は、代表仮想マシンの計測値を表す変数を用いた関数として表現されてもよい。 Virtual machines are grouped by resource type. Therefore, two virtual machines may be classified into the same group because they have a strong correlation with respect to CPU usage, while they may be classified into different groups because they have a low correlation with respect to RAM usage. Here, it is assumed that the virtual machine 51 is the representative virtual machine and the virtual machine 53 is the subordinate virtual machine. In addition, the information collection server 100 calculates a magnification for the resource usage status of the representative virtual machine for the subordinate virtual machine. The magnification can be calculated, for example, by dividing the average of the measured values of the dependent virtual machines for a certain period by the average of the measured values of the representative virtual machine. However, the magnification may be expressed as a function using a variable representing the measured value of the representative virtual machine.

情報収集サーバ１００は、代表仮想マシンである仮想マシン５１に、グループ通知を送信する（Ｓ１０６）。グループ通知は、従属仮想マシンである仮想マシン５３の識別情報と、仮想マシン５１の計測値に対する仮想マシン５３の計測値の倍率とを含む。グループ通知を受信すると、仮想マシン５１は、仮想マシン５１と同一グループに属する従属仮想マシンである仮想マシン５３に、停止通知を送信する（Ｓ１０７）。仮想マシン５３は、停止通知を受信すると、情報収集サーバ１００への稼働情報の送信を停止する。 The information collection server 100 transmits a group notification to the virtual machine 51, which is the representative virtual machine (S106). The group notification includes the identification information of the virtual machine 53, which is a subordinate virtual machine, and the magnification of the measured value of the virtual machine 53 with respect to the measured value of the virtual machine 51. Upon receiving the group notification, the virtual machine 51 transmits a stop notification to the virtual machine 53, which is a subordinate virtual machine belonging to the same group as the virtual machine 51 (S107). Upon receiving the stop notification, the virtual machine 53 stops transmitting the operation information to the information collection server 100.

また、仮想マシン５１は、稼働情報を情報収集サーバ１００に送信する頻度を初期値よりも高く変更する。仮想マシン５１は、変更後の高い収集頻度で、リソースの使用状況を取得して情報収集サーバ１００に稼働情報を送信することを開始する（Ｓ１０８）。 Further, the virtual machine 51 changes the frequency of transmitting the operation information to the information collection server 100 to be higher than the initial value. The virtual machine 51 starts to acquire the resource usage status and transmit the operation information to the information collection server 100 at a high collection frequency after the change (S108).

変更後の収集頻度は、同一グループに属する従属仮想マシンの数に応じて決定される。グループ内の仮想マシンの総数をｎ、従属仮想マシンの数をｍとすると、代表仮想マシンの送信頻度を、初期値のｎ÷（ｎ−ｍ）×α倍になるよう上げる。すなわち、代表仮想マシンの送信間隔を、初期値の（ｎ−ｍ）÷ｎ÷α倍になるよう短縮する。稼働情報を送信しない従属仮想マシンが多いほど、代表仮想マシンの送信頻度が高くなる。調整パラメータαは、１より小さい正の実数である。これにより、グループ内の稼働情報の総データ量を削減しつつ代表仮想マシンの収集頻度を上げることができる。 The changed collection frequency depends on the number of dependent virtual machines that belong to the same group. Assuming that the total number of virtual machines in the group is n and the number of dependent virtual machines is m, the transmission frequency of the representative virtual machine is increased so as to be n ÷ (nm) × α times the initial value. That is, the transmission interval of the representative virtual machine is shortened so as to be (nm) ÷ n ÷ α times the initial value. The more dependent virtual machines that do not send operation information, the higher the frequency of sending the representative virtual machine. The adjustment parameter α is a positive real number less than 1. As a result, it is possible to increase the collection frequency of representative virtual machines while reducing the total amount of operation information in the group.

ただし、収集頻度を高くし過ぎることは、監視精度の向上に寄与しない上に、ネットワーク負荷を無駄に高くしてしまう。そこで、代表仮想マシンの収集頻度は、所定の上限より高くならないようにする。収集頻度の上限は、例えば、１秒間隔とする。仮想マシン５１，５３の２つの仮想マシンが１つのグループを形成する場合、例えば、代表仮想マシンである仮想マシン５１の収集頻度が３０秒と１分の間で調整される。 However, if the collection frequency is too high, it does not contribute to the improvement of monitoring accuracy and the network load is unnecessarily increased. Therefore, the collection frequency of the representative virtual machine should not be higher than the predetermined upper limit. The upper limit of the collection frequency is, for example, an interval of 1 second. When two virtual machines of virtual machines 51 and 53 form one group, for example, the collection frequency of the virtual machine 51, which is a representative virtual machine, is adjusted between 30 seconds and 1 minute.

情報収集サーバ１００は、仮想マシン５１からは稼働情報を受信する一方、仮想マシン５３からは稼働情報を受信しなくなる。そこで、情報収集サーバ１００は、仮想マシン５１から受信した最新の稼働情報が示す計測値に、予め算出しておいた仮想マシン５３の倍率を乗じることで、仮想マシン５３の最新のリソースの使用状況を推定する。倍率の算出およびリソースの使用状況の推定は、リソースの種類毎に行われる。 The information collection server 100 receives the operation information from the virtual machine 51, but does not receive the operation information from the virtual machine 53. Therefore, the information collection server 100 multiplies the measured value indicated by the latest operation information received from the virtual machine 51 by the magnification of the virtual machine 53 calculated in advance, so that the latest resource usage status of the virtual machine 53 is used. To estimate. The calculation of the magnification and the estimation of the resource usage status are performed for each resource type.

また、仮想マシン５１は、同一グループに属する従属仮想マシンである仮想マシン５３のリソースの使用状況を監視する。仮想マシン５１は、低い所定頻度で仮想マシン５３に稼働情報を要求する（Ｓ１０９）。低い所定頻度は、ステップＳ１０４で通知された収集頻度の初期値と同じでもよく、例えば、１分間隔とする。仮想マシン５３は、仮想マシン５１からの要求に応じて仮想マシン５３におけるリソースの使用状況を取得し、仮想マシン５１に稼働情報を送信する（Ｓ１１０）。 In addition, the virtual machine 51 monitors the resource usage status of the virtual machine 53, which is a subordinate virtual machine belonging to the same group. The virtual machine 51 requests operation information from the virtual machine 53 at a low predetermined frequency (S109). The low predetermined frequency may be the same as the initial value of the collection frequency notified in step S104, for example, at 1-minute intervals. The virtual machine 53 acquires the resource usage status in the virtual machine 53 in response to the request from the virtual machine 51, and transmits the operation information to the virtual machine 51 (S110).

仮想マシン５１は、仮想マシン５３から稼働情報を受信すると、仮想マシン５１と仮想マシン５３との間でリソースの使用状況に強い相関が維持されているか判定する。強い相関が維持されているかは、直近の複数回の仮想マシン５３の稼働情報とそれに対応する仮想マシン５１の稼働情報とを対比して判断される。 When the virtual machine 51 receives the operation information from the virtual machine 53, the virtual machine 51 determines whether a strong correlation between the virtual machine 51 and the virtual machine 53 in the resource usage status is maintained. Whether or not the strong correlation is maintained is determined by comparing the latest operation information of the virtual machine 53 with the operation information of the corresponding virtual machine 51.

例えば、仮想マシン５１は、仮想マシン５３からの稼働情報の受信時刻に最も近い仮想マシン５１の稼働情報を履歴から抽出し、仮想マシン５１の稼働情報に含まれる計測値と仮想マシン５３の稼働情報に含まれる計測値とのペアを作成する。仮想マシン５１は、複数のペアから相関係数を算出し、相関係数の絶対値が閾値より大きい場合は強い相関が維持されていると判断し、相関係数の絶対値が閾値以下である場合は相関が低くなったと判断する。また、仮想マシン５１は、強い相関が維持されていると判断した場合、仮想マシン５１に対する仮想マシン５３のリソースの使用状況の倍率を再計算する。例えば、仮想マシン５１は、直近の複数回の仮想マシン５３の計測値の平均を、それに対応する仮想マシン５１の計測値の平均で割ることで算出する。 For example, the virtual machine 51 extracts the operation information of the virtual machine 51 closest to the reception time of the operation information from the virtual machine 53 from the history, and the measured value included in the operation information of the virtual machine 51 and the operation information of the virtual machine 53. Create a pair with the measured value included in. The virtual machine 51 calculates the correlation coefficient from a plurality of pairs, determines that a strong correlation is maintained when the absolute value of the correlation coefficient is larger than the threshold value, and the absolute value of the correlation coefficient is equal to or less than the threshold value. In that case, it is judged that the correlation is low. Further, when the virtual machine 51 determines that the strong correlation is maintained, the virtual machine 51 recalculates the ratio of the resource usage status of the virtual machine 53 to the virtual machine 51. For example, the virtual machine 51 is calculated by dividing the average of the measured values of the most recent plurality of virtual machines 53 by the average of the measured values of the corresponding virtual machine 51.

ここでは、仮想マシン５１と仮想マシン５３との間に強い相関が維持されているとする。すると、仮想マシン５１は、再計算した倍率を含む相関更新通知を情報収集サーバ１００に送信する（Ｓ１１１）。また、仮想マシン５１は、引き続き高い収集頻度で、リソースの使用状況を取得して情報収集サーバ１００に稼働情報を送信する（Ｓ１１２）。情報収集サーバ１００は、引き続き、仮想マシン５３から稼働情報を受信しない。そこで、情報収集サーバ１００は、ステップＳ１１１で通知された最新の倍率を用いて、仮想マシン５３におけるリソースの使用状況を推定する。 Here, it is assumed that a strong correlation is maintained between the virtual machine 51 and the virtual machine 53. Then, the virtual machine 51 transmits a correlation update notification including the recalculated magnification to the information collection server 100 (S111). Further, the virtual machine 51 continues to acquire the resource usage status and transmits the operation information to the information collection server 100 at a high collection frequency (S112). The information collection server 100 does not continue to receive operation information from the virtual machine 53. Therefore, the information collection server 100 estimates the resource usage status in the virtual machine 53 by using the latest magnification notified in step S111.

再び、仮想マシン５１は、仮想マシン５３に稼働情報を要求し（Ｓ１１３）、仮想マシン５３から稼働情報を受信する（Ｓ１１４）。ここでは、相関係数の絶対値が閾値以下になり、仮想マシン５１と仮想マシン５３との間の相関が失われたとする。すると、仮想マシン５１は、仮想マシン５３に再開通知を送信する（Ｓ１１５）。再開通知を受信すると、仮想マシン５３は、定期的にリソースの使用状況を取得して情報収集サーバ１００に送信することを再開する。このときの仮想マシン５３の収集頻度は、低い頻度でよく、例えば、ステップＳ１０４で通知された初期値とする。 Again, the virtual machine 51 requests the virtual machine 53 for operation information (S113), and receives the operation information from the virtual machine 53 (S114). Here, it is assumed that the absolute value of the correlation coefficient becomes equal to or less than the threshold value and the correlation between the virtual machine 51 and the virtual machine 53 is lost. Then, the virtual machine 51 transmits a restart notification to the virtual machine 53 (S115). Upon receiving the resume notification, the virtual machine 53 periodically acquires the resource usage status and resumes transmitting the resource usage status to the information collection server 100. The collection frequency of the virtual machine 53 at this time may be low, and is, for example, the initial value notified in step S104.

仮想マシン５１は、情報収集サーバ１００に、仮想マシン５３がグループから外れたことを示すグループ解除通知を送信する（Ｓ１１６）。情報収集サーバ１００は、グループ解除通知を受信すると、グループから外れた仮想マシン５３のリソースの使用状況を、推定値ではなく計測値に基づいて把握するように切り替える。 The virtual machine 51 transmits a group release notification indicating that the virtual machine 53 has left the group to the information collection server 100 (S116). When the information collection server 100 receives the group release notification, the information collection server 100 switches to grasp the resource usage status of the virtual machine 53 out of the group based on the measured value instead of the estimated value.

また、仮想マシン５１は、グループに属する従属仮想マシンが減少したことに伴い、仮想マシン５１が稼働情報を情報収集サーバ１００に送信する頻度を再計算する。これにより、仮想マシン５１の収集頻度が低下する。その結果、仮想マシン５１，５３は、低い頻度で、リソースの使用状況を取得して情報収集サーバ１００に稼働情報を送信するようになる（Ｓ１１７）。収集頻度は、例えば、１分間隔に戻る。 Further, the virtual machine 51 recalculates the frequency with which the virtual machine 51 transmits the operation information to the information collection server 100 as the number of dependent virtual machines belonging to the group decreases. As a result, the collection frequency of the virtual machine 51 is reduced. As a result, the virtual machines 51 and 53 frequently acquire the resource usage status and transmit the operation information to the information collection server 100 (S117). The collection frequency returns, for example, to 1 minute intervals.

このように、サービスの中でメッセージを送信する親仮想マシンと当該メッセージを受信してデータ処理を行う子仮想マシンとの間には、ＣＰＵやＲＡＭなどのリソースの使用状況に強い相関があることがある。この場合、子仮想マシンの稼働情報の収集を停止し、親仮想マシンの稼働情報から子仮想マシンのリソースの使用状況を推定することで、ネットワーク３０のトラフィックを削減することができる。また、ネットワーク３０の総トラフィックが削減される範囲で、親仮想マシンの収集頻度を高くすることで、リソースの使用状況をリアルタイムに監視することができ、監視精度を向上させることができる。 In this way, there is a strong correlation between the parent virtual machine that sends a message in the service and the child virtual machine that receives the message and processes data, in terms of resource usage such as CPU and RAM. There is. In this case, the traffic of the network 30 can be reduced by stopping the collection of the operation information of the child virtual machine and estimating the resource usage status of the child virtual machine from the operation information of the parent virtual machine. Further, by increasing the collection frequency of the parent virtual machine within the range in which the total traffic of the network 30 is reduced, the resource usage status can be monitored in real time, and the monitoring accuracy can be improved.

なお、情報収集サーバ１００は、従属仮想マシンの稼働情報の収集を停止した後も、稼働情報を収集している代表仮想マシンの間で相関関係を定期的に判定する。新たに２つの代表仮想マシンの間で強い相関が検出された場合、情報収集サーバ１００は、データフローの下流側の代表仮想マシンを従属仮想マシンに変更する。その場合、情報収集サーバ１００は、ステップＳ１０６と同様に、上流側の代表仮想マシンに対して、従属仮想マシンが追加されたことを示すグループ通知を送信する。 The information collection server 100 periodically determines the correlation between the representative virtual machines that are collecting the operation information even after the collection of the operation information of the subordinate virtual machines is stopped. When a strong correlation is newly detected between the two representative virtual machines, the information collection server 100 changes the representative virtual machine on the downstream side of the data flow to the dependent virtual machine. In that case, the information collection server 100 transmits a group notification indicating that the subordinate virtual machine has been added to the representative virtual machine on the upstream side, as in step S106.

図６は、仮想マシンのグループ化例を示す図である。
前述のように、情報収集サーバ１００は、仮想マシン５１にテストリクエストを送信する。すると、図４に示す構造に従い、仮想マシン５１は仮想マシン５２，５３にテストメッセージを送信する。仮想マシン５３は仮想マシン５４にテストメッセージを送信し、仮想マシン５４は仮想マシン５５にテストメッセージを送信する。仮想マシン５２は仮想マシン５６にテストメッセージを送信する。この間、仮想マシン５１，５２，５３，５４，５５，５６は、パケットを監視するなどの方法により、テストメッセージの送信先の仮想マシンおよびテストメッセージの送信時刻を含む通信ログを採取する。 FIG. 6 is a diagram showing an example of grouping virtual machines.
As described above, the information collection server 100 transmits a test request to the virtual machine 51. Then, according to the structure shown in FIG. 4, the virtual machine 51 sends a test message to the virtual machines 52 and 53. The virtual machine 53 sends a test message to the virtual machine 54, and the virtual machine 54 sends a test message to the virtual machine 55. The virtual machine 52 sends a test message to the virtual machine 56. During this time, the virtual machines 51, 52, 53, 54, 55, 56 collect a communication log including the virtual machine to which the test message is sent and the transmission time of the test message by a method such as monitoring packets.

仮想マシン５１，５２，５３，５４，５５，５６は、通信ログを情報収集サーバ１００に送信する。情報収集サーバ１００は、受信した通信ログに基づいて、仮想マシン５１，５２，５３，５４，５５，５６の間のメッセージ送信の順序を判定してデータフローを把握する。また、情報収集サーバ１００は、仮想マシン５１，５２，５３，５４，５５，５６から低頻度（例えば、１分間隔）で、ＣＰＵ使用率を示す稼働情報であるＣＰＵ情報と、ＲＡＭ使用量を示す稼働情報であるＲＡＭ情報の両方を受信する。 The virtual machines 51, 52, 53, 54, 55, 56 transmit the communication log to the information collection server 100. The information collection server 100 determines the order of message transmission between the virtual machines 51, 52, 53, 54, 55, and 56 based on the received communication log, and grasps the data flow. Further, the information collection server 100 uses the virtual machines 51, 52, 53, 54, 55, and 56 at low frequency (for example, at 1-minute intervals) to obtain CPU information indicating the CPU usage rate and RAM usage amount. Receives both RAM information, which is the operation information shown.

図７は、仮想マシンのグループ化例を示す図（続き１）である。
情報収集サーバ１００は、リソースの種類毎、すなわち、ＣＰＵとＲＡＭそれぞれについて、仮想マシン５１，５２，５３，５４，５５，５６をグループ化する。 FIG. 7 is a diagram (continuation 1) showing an example of grouping virtual machines.
The information collection server 100 groups virtual machines 51, 52, 53, 54, 55, 56 for each resource type, that is, for each of the CPU and RAM.

ＣＰＵについて、情報収集サーバ１００は、低頻度で仮想マシン５１，５２，５３，５４，５５，５６から受信したＣＰＵ情報を用いて、データフローの先頭の仮想マシン５１から下流に向かって相関関係を順に判定する。 Regarding the CPU, the information collection server 100 uses the CPU information received from the virtual machines 51, 52, 53, 54, 55, and 56 infrequently to correlate from the virtual machine 51 at the head of the data flow toward the downstream. Judge in order.

まず、情報収集サーバ１００は、仮想マシン５１と仮想マシン５３の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きいとする。すると、情報収集サーバ１００は、仮想マシン５３を仮想マシン５１と同一グループに分類する。次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５１と仮想マシン５４の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きいとする。すると、情報収集サーバ１００は、仮想マシン５４を仮想マシン５１，５３と同一グループに分類する。 First, the information collection server 100 calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 53. Here, it is assumed that the absolute value of the correlation coefficient is larger than the threshold value. Then, the information collection server 100 classifies the virtual machine 53 into the same group as the virtual machine 51. Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 54. Here, it is assumed that the absolute value of the correlation coefficient is larger than the threshold value. Then, the information collection server 100 classifies the virtual machine 54 into the same group as the virtual machines 51 and 53.

次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５１と仮想マシン５５の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下であるとする。すると、情報収集サーバ１００は、仮想マシン５５を仮想マシン５１，５３，５４と別グループに分類する。また、情報収集サーバ１００は、仮想マシン５１と仮想マシン５２の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下であるとする。すると、情報収集サーバ１００は、仮想マシン５２を仮想マシン５１と別グループに分類する。次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５２と仮想マシン５６の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きいとする。すると、情報収集サーバ１００は、仮想マシン５６を仮想マシン５２と同一グループに分類する。 Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 55. Here, it is assumed that the absolute value of the correlation coefficient is equal to or less than the threshold value. Then, the information collection server 100 classifies the virtual machine 55 into a different group from the virtual machines 51, 53, 54. In addition, the information collection server 100 calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 52. Here, it is assumed that the absolute value of the correlation coefficient is equal to or less than the threshold value. Then, the information collection server 100 classifies the virtual machine 52 into a different group from the virtual machine 51. Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the CPU usage rate between the virtual machine 52 and the virtual machine 56. Here, it is assumed that the absolute value of the correlation coefficient is larger than the threshold value. Then, the information collection server 100 classifies the virtual machine 56 into the same group as the virtual machine 52.

これにより、ＣＰＵについては、仮想マシン５１，５２，５５が代表仮想マシンになり、仮想マシン５３，５４，５６が従属仮想マシンになる。仮想マシン５１，５２，５５は、情報収集サーバ１００に定期的にＣＰＵ情報を送信する。仮想マシン５３，５４，５６は、情報収集サーバ１００へのＣＰＵ情報の送信を停止する。また、仮想マシン５３，５４がＣＰＵ情報の送信を停止したことに伴い、初期値の３倍を上限として（例えば、送信間隔の下限を２０秒として）、仮想マシン５１の収集頻度が高くなる。また、仮想マシン５６がＣＰＵ情報の送信を停止したことに伴い、初期値の２倍を上限として（例えば、送信間隔の下限を３０秒として）、仮想マシン５２の収集頻度が高くなる。 As a result, with respect to the CPU, the virtual machines 51, 52, 55 become representative virtual machines, and the virtual machines 53, 54, 56 become subordinate virtual machines. The virtual machines 51, 52, and 55 periodically transmit CPU information to the information collection server 100. The virtual machines 53, 54, and 56 stop transmitting CPU information to the information collection server 100. Further, as the virtual machines 53 and 54 stop transmitting CPU information, the collection frequency of the virtual machines 51 increases with the upper limit being three times the initial value (for example, the lower limit of the transmission interval being 20 seconds). Further, as the virtual machine 56 stops transmitting CPU information, the collection frequency of the virtual machine 52 increases with the upper limit being twice the initial value (for example, the lower limit of the transmission interval being 30 seconds).

同様に、ＲＡＭについて、情報収集サーバ１００は、低頻度で仮想マシン５１，５２，５３，５４，５５，５６から受信したＲＡＭ情報を用いて、データフローの先頭の仮想マシン５１から下流に向かって相関関係を順に判定する。 Similarly, with respect to the RAM, the information collection server 100 uses the RAM information received from the virtual machines 51, 52, 53, 54, 55, 56 infrequently from the virtual machine 51 at the head of the data flow toward the downstream. Determine the correlation in order.

まず、情報収集サーバ１００は、仮想マシン５１と仮想マシン５３の間のＲＡＭ使用量の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きいとする。すると、情報収集サーバ１００は、仮想マシン５３を仮想マシン５１と同一グループに分類する。次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５１と仮想マシン５４の間のＲＡＭ使用量の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下であるとする。すると、情報収集サーバ１００は、仮想マシン５４を仮想マシン５１，５３と別グループに分類する。 First, the information collection server 100 calculates the correlation coefficient of the RAM usage between the virtual machine 51 and the virtual machine 53. Here, it is assumed that the absolute value of the correlation coefficient is larger than the threshold value. Then, the information collection server 100 classifies the virtual machine 53 into the same group as the virtual machine 51. Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the RAM usage between the virtual machine 51 and the virtual machine 54. Here, it is assumed that the absolute value of the correlation coefficient is equal to or less than the threshold value. Then, the information collection server 100 classifies the virtual machine 54 into a different group from the virtual machines 51 and 53.

次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５４と仮想マシン５５の間のＲＡＭ使用量の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きいとする。すると、情報収集サーバ１００は、仮想マシン５５を仮想マシン５４と同一グループに分類する。また、情報収集サーバ１００は、仮想マシン５１と仮想マシン５２の間のＲＡＭ使用量の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下であるとする。すると、情報収集サーバ１００は、仮想マシン５２を仮想マシン５１と別グループに分類する。次に、情報収集サーバ１００は、データフローを下流に向かって辿り、仮想マシン５２と仮想マシン５６の間のＲＡＭ使用量の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下であるとする。すると、情報収集サーバ１００は、仮想マシン５６を仮想マシン５２と別グループに分類する。 Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the RAM usage between the virtual machine 54 and the virtual machine 55. Here, it is assumed that the absolute value of the correlation coefficient is larger than the threshold value. Then, the information collection server 100 classifies the virtual machine 55 into the same group as the virtual machine 54. In addition, the information collection server 100 calculates the correlation coefficient of the RAM usage between the virtual machine 51 and the virtual machine 52. Here, it is assumed that the absolute value of the correlation coefficient is equal to or less than the threshold value. Then, the information collection server 100 classifies the virtual machine 52 into a different group from the virtual machine 51. Next, the information collection server 100 traces the data flow downstream and calculates the correlation coefficient of the RAM usage between the virtual machine 52 and the virtual machine 56. Here, it is assumed that the absolute value of the correlation coefficient is equal to or less than the threshold value. Then, the information collection server 100 classifies the virtual machine 56 into a group different from the virtual machine 52.

これにより、ＲＡＭについては、仮想マシン５１，５２，５４，５６が代表仮想マシンになり、仮想マシン５３，５５が従属仮想マシンになる。仮想マシン５１，５２，５４，５６は、情報収集サーバ１００に定期的にＲＡＭ情報を送信する。仮想マシン５３，５５は、情報収集サーバ１００へのＲＡＭ情報の送信を停止する。また、仮想マシン５３がＲＡＭ情報の送信を停止したことに伴い、初期値の２倍を上限として、仮想マシン５１の収集頻度が高くなる。また、仮想マシン５５がＲＡＭ情報の送信を停止したことに伴い、初期値の２倍を上限として、仮想マシン５４の収集頻度が高くなる。 As a result, with respect to RAM, virtual machines 51, 52, 54, and 56 become representative virtual machines, and virtual machines 53 and 55 become subordinate virtual machines. The virtual machines 51, 52, 54, and 56 periodically transmit RAM information to the information collection server 100. The virtual machines 53 and 55 stop transmitting RAM information to the information collection server 100. Further, as the virtual machine 53 stops transmitting RAM information, the collection frequency of the virtual machine 51 increases up to twice the initial value. Further, as the virtual machine 55 stops transmitting RAM information, the collection frequency of the virtual machine 54 increases up to twice the initial value.

図８は、仮想マシンのグループ化例を示す図（続き２）である。
ここでは、ＣＰＵ情報のグループ変更について説明する。ＲＡＭ情報については、ＣＰＵ情報とは独立にグループ変更が行われる。代表仮想マシンである仮想マシン５１は、低い頻度で、同一グループに属する従属仮想マシンである仮想マシン５３，５４にＣＰＵ情報を要求し、仮想マシン５３，５４からＣＰＵ情報を受信する。また、代表仮想マシンである仮想マシン５２は、低い頻度で、同一グループに属する従属仮想マシンである仮想マシン５６にＣＰＵ情報を要求し、仮想マシン５６からＣＰＵ情報を受信する。 FIG. 8 is a diagram (continuation 2) showing an example of grouping virtual machines.
Here, the group change of CPU information will be described. The RAM information is group-changed independently of the CPU information. The virtual machine 51, which is a representative virtual machine, frequently requests CPU information from virtual machines 53, 54, which are subordinate virtual machines belonging to the same group, and receives CPU information from the virtual machines 53, 54. Further, the virtual machine 52, which is a representative virtual machine, frequently requests CPU information from the virtual machine 56, which is a subordinate virtual machine belonging to the same group, and receives the CPU information from the virtual machine 56.

仮想マシン５１は、仮想マシン５１と仮想マシン５３の間のＣＰＵ使用率の相関係数を算出し、仮想マシン５１と仮想マシン５４の間のＣＰＵ使用率の相関係数を算出する。ここでは、何れの相関係数の絶対値も閾値より大きいとする。すると、仮想マシン５３，５４は、引き続き仮想マシン５１に対する従属仮想マシンとして動作する。また、仮想マシン５２は、仮想マシン５２と仮想マシン５６の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値以下に低下したとする。すると、仮想マシン５６は、仮想マシン５２のグループから外れ、代表仮想マシンとして動作する。よって、仮想マシン５６は、情報収集サーバ１００へのＣＰＵ情報の送信を再開する。また、仮想マシン５６の送信再開に伴い、仮想マシン５２の収集頻度が低下する。 The virtual machine 51 calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 53, and calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 54. Here, it is assumed that the absolute value of any correlation coefficient is larger than the threshold value. Then, the virtual machines 53 and 54 continue to operate as subordinate virtual machines with respect to the virtual machine 51. In addition, the virtual machine 52 calculates the correlation coefficient of the CPU usage rate between the virtual machine 52 and the virtual machine 56. Here, it is assumed that the absolute value of the correlation coefficient drops below the threshold value. Then, the virtual machine 56 is removed from the group of the virtual machine 52 and operates as a representative virtual machine. Therefore, the virtual machine 56 restarts the transmission of the CPU information to the information collection server 100. Further, as the transmission of the virtual machine 56 is resumed, the collection frequency of the virtual machine 52 decreases.

一方、情報収集サーバ１００は、仮想マシン５１，５２，５５から定期的に受信するＣＰＵ情報の相関関係を監視する。情報収集サーバ１００は、データフローに従い、仮想マシン５１とそのグループに隣接する仮想マシン５５とを選択する。情報収集サーバ１００は、仮想マシン５１と仮想マシン５５の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が閾値より大きくなったとする。すると、仮想マシン５５は、仮想マシン５１，５３，５４のグループに入り、従属仮想マシンとして動作する。よって、仮想マシン５５は、情報収集サーバ１００へのＣＰＵ情報の送信を停止する。 On the other hand, the information collection server 100 monitors the correlation of CPU information periodically received from the virtual machines 51, 52, 55. The information collection server 100 selects the virtual machine 51 and the virtual machines 55 adjacent to the group according to the data flow. The information collection server 100 calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 55. Here, it is assumed that the absolute value of the correlation coefficient becomes larger than the threshold value. Then, the virtual machine 55 enters the group of the virtual machines 51, 53, 54 and operates as a subordinate virtual machine. Therefore, the virtual machine 55 stops transmitting CPU information to the information collection server 100.

また、情報収集サーバ１００は、データフローに従い、仮想マシン５１とそのグループに隣接する仮想マシン５２とを選択する。情報収集サーバ１００は、仮想マシン５１と仮想マシン５２の間のＣＰＵ使用率の相関係数を算出する。ここでは、相関係数の絶対値が引き続き閾値以下であるとする。すると、仮想マシン５２は、引き続き代表仮想マシンとして動作し、情報収集サーバ１００へのＣＰＵ情報の送信を継続する。 Further, the information collection server 100 selects the virtual machine 51 and the virtual machines 52 adjacent to the group according to the data flow. The information collection server 100 calculates the correlation coefficient of the CPU usage rate between the virtual machine 51 and the virtual machine 52. Here, it is assumed that the absolute value of the correlation coefficient is still below the threshold value. Then, the virtual machine 52 continues to operate as a representative virtual machine, and continues to transmit CPU information to the information collection server 100.

このように、情報収集サーバ１００の判断により、既存の代表仮想マシンが従属仮想マシンに変更されることがある。また、代表仮想マシンの判断により、既存の従属仮想マシンが代表仮想マシンに変更されることがある。 In this way, the existing representative virtual machine may be changed to the subordinate virtual machine at the discretion of the information collection server 100. In addition, the existing subordinate virtual machine may be changed to the representative virtual machine at the discretion of the representative virtual machine.

次に、情報収集サーバ１００および各仮想マシンの機能について説明する。
図９は、情報収集サーバと仮想マシンの機能例を示すブロック図である。
情報収集サーバ１００は、構造情報記憶部１２１、稼働履歴記憶部１２２および相関情報記憶部１２３を有する。これらの記憶部は、例えば、ＲＡＭ１０２またはＨＤＤ１０３の記憶領域を用いて実現される。また、情報収集サーバ１００は、テスト実行部１２４、稼働情報受信部１２５、相関検出部１２６、相関更新部１２７および稼働状況表示部１２８を有する。これらの処理部は、例えば、プログラムを用いて実現される。 Next, the functions of the information collection server 100 and each virtual machine will be described.
FIG. 9 is a block diagram showing a functional example of the information collection server and the virtual machine.
The information collection server 100 has a structural information storage unit 121, an operation history storage unit 122, and a correlation information storage unit 123. These storage units are realized by using, for example, the storage area of the RAM 102 or the HDD 103. Further, the information collection server 100 includes a test execution unit 124, an operation information reception unit 125, a correlation detection unit 126, a correlation update unit 127, and an operation status display unit 128. These processing units are realized, for example, by using a program.

構造情報記憶部１２１は、サービスに用いられる複数の仮想マシンの間のデータフローを表すサービス構造情報を記憶する。サービス構造情報は、複数の仮想マシンを木構造に接続したトポロジを表すものである。稼働履歴記憶部１２２は、仮想マシン５１，５２，５３，５４，５５，５６から受信された稼働情報の履歴を記憶する。稼働履歴記憶部１２２は、時刻と仮想マシンとリソースの組に対応付けて計測値を記録する。稼働履歴記憶部１２２は、受信された稼働情報を少なくとも一定期間履歴として保持しておく。相関情報記憶部１２３は、リソースの使用状況が強い相関をもつ代表仮想マシンと従属仮想マシンとの間の関係を示す相関情報を記憶する。相関情報は、代表仮想マシンの計測値から従属仮想マシンの数値を推定するための倍率を含む。 The structure information storage unit 121 stores service structure information representing a data flow between a plurality of virtual machines used for the service. The service structure information represents a topology in which multiple virtual machines are connected to a tree structure. The operation history storage unit 122 stores the history of operation information received from the virtual machines 51, 52, 53, 54, 55, 56. The operation history storage unit 122 records the measured value in association with the time, the virtual machine, and the resource set. The operation history storage unit 122 holds the received operation information as a history for at least a certain period of time. The correlation information storage unit 123 stores the correlation information indicating the relationship between the representative virtual machine and the dependent virtual machine having a strong correlation in the resource usage status. The correlation information includes a magnification for estimating the numerical value of the dependent virtual machine from the measured value of the representative virtual machine.

テスト実行部１２４は、仮想マシン５１，５２，５３，５４，５５，５６のデータフローを判定するためのテストを実行する。テスト実行部１２４は、仮想マシン５１，５２，５３，５４，５５，５６にテスト開始通知し、サービスの入り口である仮想マシン５１にテストリクエストを送信する。テスト実行部１２４は、仮想マシン５１，５２，５３，５４，５５，５６から、テストメッセージの送信先の仮想マシンおよび送信時刻を示す通信ログを収集し、通信ログを分析してデータフローを判定する。テスト実行部１２４は、データフローを示す構造情報を生成して構造情報記憶部１２１に格納する。 The test execution unit 124 executes a test for determining the data flow of the virtual machines 51, 52, 53, 54, 55, 56. The test execution unit 124 notifies the virtual machines 51, 52, 53, 54, 55, and 56 of the start of the test, and sends a test request to the virtual machine 51, which is the entrance of the service. The test execution unit 124 collects communication logs indicating the virtual machine to which the test message is sent and the transmission time from the virtual machines 51, 52, 53, 54, 55, 56, analyzes the communication log, and determines the data flow. To do. The test execution unit 124 generates structural information indicating a data flow and stores it in the structural information storage unit 121.

稼働情報受信部１２５は、仮想マシン５１，５２，５３，５４，５５，５６の少なくとも一部が定期的に送信する稼働情報を受信する。稼働情報はリソースの種類毎に分けられており、何れのリソースについての稼働情報を送信するかは仮想マシンによって異なる。初期状態では、仮想マシン５１，５２，５３，５４，５５，５６の全てが全てのリソースについての稼働情報を低頻度で送信する。稼働情報受信部１２５は、受信した稼働情報に含まれる最新の計測値を相関情報記憶部１２３に上書きする。また、稼働情報受信部１２５は、受信した稼働情報に含まれる計測値を、送信元の仮想マシンの識別情報と受信時刻と計測値が示すリソースの組に対応付けて、稼働履歴記憶部１２２に追記する。 The operation information receiving unit 125 receives operation information periodically transmitted by at least a part of the virtual machines 51, 52, 53, 54, 55, 56. The operation information is divided according to the type of resource, and which resource the operation information is transmitted depends on the virtual machine. In the initial state, all of the virtual machines 51, 52, 53, 54, 55, 56 transmit operation information about all resources at low frequency. The operation information receiving unit 125 overwrites the correlation information storage unit 123 with the latest measured value included in the received operation information. Further, the operation information receiving unit 125 associates the measured value included in the received operation information with the identification information of the source virtual machine, the reception time, and the set of resources indicated by the measured value, and stores the measured value in the operation history storage unit 122. Append.

相関検出部１２６は、定期的に、構造情報記憶部１２１に記憶された構造情報と、稼働履歴記憶部１２２に記憶された稼働情報の履歴とに基づいて、リソース毎に、稼働情報を送信する仮想マシンの間の相関関係を判定する。相関検出部１２６は、強い相関が検出された２つの仮想マシンのうち、上流側の仮想マシンを代表仮想マシンに指定し、下流側の仮想マシンを従属仮想マシンに指定する。また、相関検出部１２６は、代表仮想マシンに対する従属仮想マシンのリソースの使用状況の倍率を算出する。相関検出部１２６は、代表仮想マシンと従属仮想マシンの関係および倍率を、相関情報記憶部１２３に記録する。 The correlation detection unit 126 periodically transmits operation information for each resource based on the structural information stored in the structural information storage unit 121 and the history of the operation information stored in the operation history storage unit 122. Determine the correlation between virtual machines. The correlation detection unit 126 designates the upstream virtual machine as the representative virtual machine and the downstream virtual machine as the dependent virtual machine among the two virtual machines in which the strong correlation is detected. In addition, the correlation detection unit 126 calculates the ratio of the resource usage status of the subordinate virtual machine to the representative virtual machine. The correlation detection unit 126 records the relationship and magnification between the representative virtual machine and the dependent virtual machine in the correlation information storage unit 123.

具体的には、相関検出部１２６は、リソースの種類を選択し、データフローの上流と下流の関係にある２つの仮想マシンを選択し、当該リソースおよび当該２つの仮想マシンに関する直近の一定期間の稼働情報を稼働履歴記憶部１２２から抽出する。相関検出部１２６は、一方の仮想マシンの計測値と、受信時刻が最も近い他方の仮想マシンの計測値とを対応付け、異なるタイミングの複数の計測値ペアから相関係数を算出する。相関検出部１２６は、相関係数の絶対値が閾値より大きい場合は相関が強いと判断し、相関係数の絶対値が閾値以下である場合は相関が弱いと判断する。相関が強い場合、相関検出部１２６は、代表仮想マシンの直近の一定期間の計測値の平均と、従属仮想マシンの直近の一定期間の計測値の平均とを算出し、後者を前者で割って倍率を算出する。 Specifically, the correlation detection unit 126 selects the resource type, selects two virtual machines that are upstream and downstream of the data flow, and the resource and the two virtual machines for the most recent fixed period of time. The operation information is extracted from the operation history storage unit 122. The correlation detection unit 126 associates the measured value of one virtual machine with the measured value of the other virtual machine having the closest reception time, and calculates the correlation coefficient from a plurality of measured value pairs at different timings. The correlation detection unit 126 determines that the correlation is strong when the absolute value of the correlation coefficient is larger than the threshold value, and determines that the correlation is weak when the absolute value of the correlation coefficient is equal to or less than the threshold value. When the correlation is strong, the correlation detection unit 126 calculates the average of the measured values of the representative virtual machine for the most recent fixed period and the average of the measured values of the dependent virtual machine for the most recent fixed period, and divides the latter by the former. Calculate the magnification.

相関更新部１２７は、代表仮想マシンから、倍率が更新されたことを示す相関更新通知を受信する。すると、相関更新部１２７は、更新された倍率を相関情報記憶部１２３に記録する。また、相関更新部１２７は、代表仮想マシンから、ある従属仮想マシンがグループから外れたことを示すグループ解除通知を受信する。すると、相関更新部１２７は、相関情報記憶部１２３に記録された相関関係を変更する。 The correlation update unit 127 receives a correlation update notification indicating that the magnification has been updated from the representative virtual machine. Then, the correlation update unit 127 records the updated magnification in the correlation information storage unit 123. In addition, the correlation update unit 127 receives a group release notification from the representative virtual machine indicating that a certain subordinate virtual machine has left the group. Then, the correlation update unit 127 changes the correlation recorded in the correlation information storage unit 123.

稼働状況表示部１２８は、ユーザからの要求に応じて、仮想マシン５１，５２，５３，５４，５５，５６のリソースの使用状況を列挙した管理画面を表示装置１１１に表示する。このとき、稼働状況表示部１２８は、相関情報記憶部１２３に計測値が記録されている仮想マシンとリソースの組については、その計測値を表示すればよい。一方、稼働状況表示部１２８は、相関情報記憶部１２３に計測値が記録されていない仮想マシンとリソースの組については、関連する代表仮想マシンの計測値に倍率を乗じて推定値を算出し、計測値に代えて推定値を表示する。なお、稼働状況表示部１２８は、ユーザからの要求を待たずに継続的に管理画面を表示するようにしてもよい。また、稼働状況表示部１２８は、リソースの使用状況を列挙した管理情報を、ＨＤＤ１０３などのストレージ装置に保存してもよく、他の情報処理装置に送信してもよい。 The operation status display unit 128 displays a management screen listing the resource usage status of the virtual machines 51, 52, 53, 54, 55, and 56 on the display device 111 in response to a request from the user. At this time, the operation status display unit 128 may display the measured value for the set of the virtual machine and the resource in which the measured value is recorded in the correlation information storage unit 123. On the other hand, the operation status display unit 128 calculates an estimated value by multiplying the measured value of the related representative virtual machine by a magnification for the set of the virtual machine and the resource for which the measured value is not recorded in the correlation information storage unit 123. Display the estimated value instead of the measured value. The operation status display unit 128 may continuously display the management screen without waiting for a request from the user. Further, the operation status display unit 128 may store the management information enumerating the resource usage status in a storage device such as the HDD 103, or may transmit the management information to another information processing device.

仮想マシン５１は、グループ情報記憶部２２１を有する。グループ情報記憶部２２１は、例えば、ＲＡＭまたはＨＤＤの記憶領域を用いて実現される。また、仮想マシン５１は、サービス処理部２２２、通信ログ取得部２２３、稼働情報取得部２２４およびグループ制御部２２５を有する。これらの処理部は、例えば、プログラムを用いて実現される。他の仮想マシンも、仮想マシン５１と同様の機能を有する。 The virtual machine 51 has a group information storage unit 221. The group information storage unit 221 is realized by using, for example, a storage area of a RAM or an HDD. Further, the virtual machine 51 has a service processing unit 222, a communication log acquisition unit 223, an operation information acquisition unit 224, and a group control unit 225. These processing units are realized, for example, by using a program. Other virtual machines also have the same functions as the virtual machine 51.

グループ情報記憶部２２１は、仮想マシン５１が代表仮想マシンである場合に、同一グループに属する従属仮想マシンを管理するためのグループ情報を記憶する。グループ情報は、情報収集サーバ１００から通知された従属仮想マシンの識別情報を含む。また、グループ情報は、従属仮想マシンから収集した計測値を含む。また、グループ情報は、情報収集サーバ１００から通知された倍率または更新された最新の倍率を含む。 The group information storage unit 221 stores group information for managing subordinate virtual machines belonging to the same group when the virtual machine 51 is a representative virtual machine. The group information includes the identification information of the subordinate virtual machine notified from the information collection server 100. In addition, the group information includes the measured values collected from the subordinate virtual machines. In addition, the group information includes the magnification notified from the information collection server 100 or the latest updated magnification.

サービス処理部２２２は、アプリケーションプログラムに従ってサービスのデータ処理を実行する。仮想マシン５１がサービスの入り口に相当する場合、サービス処理部２２２は、情報収集サーバ１００からテストリクエストを受信する。サービス処理部２２２は、他の仮想マシンにメッセージを送信することがある。また、サービス処理部２２２は、他の仮想マシンからメッセージを受信することがある。 The service processing unit 222 executes the data processing of the service according to the application program. When the virtual machine 51 corresponds to the entrance of the service, the service processing unit 222 receives the test request from the information collection server 100. The service processing unit 222 may send a message to another virtual machine. In addition, the service processing unit 222 may receive a message from another virtual machine.

通信ログ取得部２２３は、情報収集サーバ１００からテスト開始の通知を受信すると、仮想マシン５１が送信するパケットを監視する。または、通信ログ取得部２２３は、アプリケーションレベルの送信メッセージを監視する。通信ログ取得部２２３は、テスト実行時におけるメッセージの送信先の仮想マシンと送信時刻を検出し、送信先の仮想マシンおよび送信時刻を示す通信ログを生成して情報収集サーバ１００に送信する。 When the communication log acquisition unit 223 receives the test start notification from the information collection server 100, the communication log acquisition unit 223 monitors the packet transmitted by the virtual machine 51. Alternatively, the communication log acquisition unit 223 monitors the application-level transmitted message. The communication log acquisition unit 223 detects the transmission destination virtual machine and the transmission time of the message at the time of test execution, generates a communication log indicating the transmission destination virtual machine and the transmission time, and transmits the communication log to the information collection server 100.

稼働情報取得部２２４は、仮想マシン５１における現在のリソースの使用状況を示す計測値を取得する。計測値として、ＣＰＵ使用率、ＲＡＭ使用量、ＨＤＤアクセス量、ネットワーク帯域使用率などが挙げられる。仮想マシン５１でゲストＯＳが実行されている場合、稼働情報取得部２２４は、ゲストＯＳから計測値を取得してもよい。 The operation information acquisition unit 224 acquires a measured value indicating the current resource usage status in the virtual machine 51. Examples of the measured value include CPU usage rate, RAM usage amount, HDD access amount, network bandwidth usage rate, and the like. When the guest OS is executed in the virtual machine 51, the operation information acquisition unit 224 may acquire the measured value from the guest OS.

情報収集サーバ１００から収集頻度の初期値が通知されると、稼働情報取得部２２４は、指定された収集頻度で、全てのリソースの計測値を取得して情報収集サーバ１００に送信する。グループ制御部２２５によって、あるリソースの収集頻度の変更が決定されると、稼働情報取得部２２４は、変更後の収集頻度で計測値の取得および送信を行う。あるリソースについて他の仮想マシンから停止通知を受信すると、稼働情報取得部２２４は、計測値の取得および送信を停止する。あるリソースについて他の仮想マシンから再開通知を受信すると、稼働情報取得部２２４は、計測値の取得および送信を再開する。あるリソースについて他の仮想マシンから稼働情報が要求されると、稼働情報取得部２２４は、計測値を取得して当該他の仮想マシンに返信する。 When the information collection server 100 notifies the initial value of the collection frequency, the operation information acquisition unit 224 acquires the measured values of all resources at the specified collection frequency and transmits them to the information collection server 100. When the group control unit 225 determines that the collection frequency of a certain resource is changed, the operation information acquisition unit 224 acquires and transmits the measured value at the changed collection frequency. When a stop notification is received from another virtual machine for a certain resource, the operation information acquisition unit 224 stops the acquisition and transmission of the measured value. When a restart notification is received from another virtual machine for a certain resource, the operation information acquisition unit 224 resumes the acquisition and transmission of the measured value. When operation information is requested from another virtual machine for a certain resource, the operation information acquisition unit 224 acquires the measured value and returns it to the other virtual machine.

グループ制御部２２５は、仮想マシン５１が代表仮想マシンである場合に、従属仮想マシンを管理する。グループ制御部２２５は、情報収集サーバ１００から通知された従属仮想マシンの識別情報および倍率を、グループ情報記憶部２２１に記録する。また、グループ制御部２２５は、従属仮想マシンに停止通知を送信する。また、グループ制御部２２５は、従属仮想マシンの数に応じて、仮想マシン５１の収集頻度を変更する。 The group control unit 225 manages the subordinate virtual machine when the virtual machine 51 is the representative virtual machine. The group control unit 225 records the identification information and the magnification of the subordinate virtual machine notified from the information collection server 100 in the group information storage unit 221. In addition, the group control unit 225 sends a stop notification to the subordinate virtual machine. Further, the group control unit 225 changes the collection frequency of the virtual machines 51 according to the number of dependent virtual machines.

また、グループ制御部２２５は、定期的に従属仮想マシンに稼働情報を要求して、従属仮想マシンの稼働情報を収集する。グループ制御部２２５は、仮想マシン５１の稼働情報と従属仮想マシンの稼働情報との間に強い相関が維持されているか判定する。強い相関が維持されている場合、グループ制御部２２５は、最新の倍率を算出し、グループ情報記憶部２２１に記録された倍率を更新すると共に、最新の倍率を含む相関更新通知を情報収集サーバ１００に送信する。強い相関が維持されていない場合、グループ制御部２２５は、その従属仮想マシンに再開通知を送信し、グループ情報記憶部２２１のグループ情報を更新し、グループ解除通知を情報収集サーバ１００に送信する。 Further, the group control unit 225 periodically requests the operation information of the subordinate virtual machine and collects the operation information of the subordinate virtual machine. The group control unit 225 determines whether a strong correlation is maintained between the operation information of the virtual machine 51 and the operation information of the subordinate virtual machine. When a strong correlation is maintained, the group control unit 225 calculates the latest magnification, updates the magnification recorded in the group information storage unit 221 and sends a correlation update notification including the latest magnification to the information collection server 100. Send to. If the strong correlation is not maintained, the group control unit 225 sends a restart notification to the subordinate virtual machine, updates the group information of the group information storage unit 221 and sends a group release notification to the information collection server 100.

図１０は、サービス構造テーブルと稼働履歴テーブルの例を示す図である。
サービス構造テーブル１３１は、構造情報記憶部１２１に記憶される。サービス構造テーブル１３１は、仮想マシンＩＤ、親仮想マシンおよび子仮想マシンの項目を含む。仮想マシンＩＤの項目には、仮想マシンの識別情報が登録される。親仮想マシンの項目には、当該仮想マシンにメッセージを送信する送信元である上流側の仮想マシンの識別情報が登録される。子仮想マシンの項目には、当該仮想マシンがメッセージを送信する宛先である下流側の仮想マシンの識別情報が登録される。 FIG. 10 is a diagram showing an example of a service structure table and an operation history table.
The service structure table 131 is stored in the structure information storage unit 121. The service structure table 131 includes items for the virtual machine ID, the parent virtual machine, and the child virtual machine. The virtual machine identification information is registered in the virtual machine ID item. In the item of the parent virtual machine, the identification information of the upstream virtual machine that is the source of sending the message to the virtual machine is registered. In the child virtual machine item, the identification information of the downstream virtual machine to which the virtual machine sends a message is registered.

サービス構造テーブル１３１によって、図４に示すような仮想マシン５１，５２，５３，５４，５５，５６の間の木構造のデータフローが表現される。例えば、仮想マシン５１（ＶＭ１）の子仮想マシンは、仮想マシン５２（ＶＭ２）および仮想マシン５３（ＶＭ３）である。仮想マシン５２（ＶＭ２）の親仮想マシンは、仮想マシン５１（ＶＭ１）である。仮想マシン５２（ＶＭ２）の子仮想マシンは、仮想マシン５６（ＶＭ６）である。 The service structure table 131 represents a tree-structured data flow between virtual machines 51, 52, 53, 54, 55, 56 as shown in FIG. For example, the child virtual machines of the virtual machine 51 (VM1) are the virtual machine 52 (VM2) and the virtual machine 53 (VM3). The parent virtual machine of the virtual machine 52 (VM2) is the virtual machine 51 (VM1). The child virtual machine of the virtual machine 52 (VM2) is the virtual machine 56 (VM6).

稼働履歴テーブル１３２は、稼働履歴記憶部１２２に記憶される。稼働履歴テーブル１３２は、時刻、仮想マシンＩＤ、リソースおよび計測値の項目を含む。時刻の項目には、情報収集サーバ１００が稼働情報を受信した時刻が登録される。ただし、時刻として、稼働情報に含まれている時刻を用いてもよく、仮想マシンが計測値を取得した時刻や稼働情報を送信した時刻を用いてもよい。仮想マシンＩＤの項目には、稼働情報を送信した仮想マシンの識別情報が登録される。リソースの項目には、計測値が示すリソースの種類が登録される。リソースの種類として、ＣＰＵ、ＲＡＭ、ストレージ、ネットワークなどが挙げられる。計測値の項目には、稼働情報に含まれる計測値が登録される。 The operation history table 132 is stored in the operation history storage unit 122. The operation history table 132 includes items of time, virtual machine ID, resource, and measured value. In the time item, the time when the information collection server 100 receives the operation information is registered. However, as the time, the time included in the operation information may be used, or the time when the virtual machine acquires the measured value or the time when the operation information is transmitted may be used. In the virtual machine ID item, the identification information of the virtual machine that sent the operation information is registered. In the resource item, the resource type indicated by the measured value is registered. Examples of resource types include CPU, RAM, storage, and network. The measured value included in the operation information is registered in the measured value item.

図１１は、相関関係テーブルと稼働状況テーブルの例を示す図である。
相関関係テーブル１３３は、相関情報記憶部１２３に記憶される。相関関係テーブル１３３は、仮想マシンＩＤ、リソース、計測値、従属仮想マシンおよび倍率の項目を含む。仮想マシンＩＤの項目には、稼働情報を送信している代表仮想マシンの識別情報が登録される。リソースの項目には、ＣＰＵやＲＡＭなどのリソースの種類が登録される。計測値の項目には、当該代表仮想マシンおよび当該リソースについて受信された最新の稼働情報に含まれる計測値が登録される。従属仮想マシンの項目には、当該リソースについて当該代表仮想マシンと強い相関のある従属仮想マシンの識別情報が登録される。倍率は、代表仮想マシンの計測値に対する従属仮想マシンの数値の倍率が登録される。 FIG. 11 is a diagram showing an example of a correlation table and an operation status table.
The correlation table 133 is stored in the correlation information storage unit 123. The correlation table 133 includes items such as virtual machine ID, resource, measured value, dependent virtual machine, and magnification. In the virtual machine ID item, the identification information of the representative virtual machine that is transmitting the operation information is registered. In the resource item, the type of resource such as CPU and RAM is registered. In the measurement value item, the measurement value included in the latest operation information received for the representative virtual machine and the resource is registered. In the item of the dependent virtual machine, the identification information of the dependent virtual machine having a strong correlation with the representative virtual machine for the resource is registered. As the magnification, the magnification of the numerical value of the dependent virtual machine with respect to the measured value of the representative virtual machine is registered.

例えば、相関関係テーブル１３３には、仮想マシン５１の最新のＣＰＵの計測値が５０であり、仮想マシン５１に対する仮想マシン５３のＣＰＵの倍率が０．５０であり、仮想マシン５１に対する仮想マシン５４のＣＰＵの倍率が０．３０であることが登録される。また、例えば、仮想マシン５１の最新のＲＡＭの計測値が３０であり、仮想マシン５１に対する仮想マシン５３のＲＡＭの倍率が０．４０であることが登録される。また、例えば、仮想マシン５２の最新のＣＰＵの計測値が７０であり、仮想マシン５２に対する仮想マシン５６のＣＰＵの倍率が１．２０であることが登録される。 For example, in the correlation table 133, the latest CPU measurement value of the virtual machine 51 is 50, the CPU magnification of the virtual machine 53 with respect to the virtual machine 51 is 0.50, and the virtual machine 54 with respect to the virtual machine 51. It is registered that the magnification of the CPU is 0.30. Further, for example, it is registered that the latest measured value of the RAM of the virtual machine 51 is 30, and the magnification of the RAM of the virtual machine 53 with respect to the virtual machine 51 is 0.40. Further, for example, it is registered that the latest measured value of the CPU of the virtual machine 52 is 70, and the magnification of the CPU of the virtual machine 56 with respect to the virtual machine 52 is 1.20.

稼働状況テーブル１３４は、稼働状況表示部１２８によって生成されて表示装置１１１に表示される。稼働状況テーブル１３４は、仮想マシンＩＤ、リソースおよび現在値の項目を含む。仮想マシンＩＤの項目には、仮想マシンの識別情報が登録される。リソースの項目には、ＣＰＵやＲＡＭなどのリソースの種類が登録される。現在値の項目には、最新のリソースの使用状況を示す数値が登録される。現在値の項目に登録される数値は、代表仮想マシンについては最新の計測値であり、従属仮想マシンについては推定値である。稼働状況テーブル１３４は、推定値については推定値である旨を付記してもよい。 The operation status table 134 is generated by the operation status display unit 128 and displayed on the display device 111. The operation status table 134 includes items of virtual machine ID, resource, and current value. The virtual machine identification information is registered in the virtual machine ID item. In the resource item, the type of resource such as CPU and RAM is registered. In the current value item, a numerical value indicating the latest resource usage status is registered. The numerical value registered in the current value item is the latest measured value for the representative virtual machine and the estimated value for the dependent virtual machine. It may be added that the operation status table 134 is an estimated value.

例えば、仮想マシン５１のＣＰＵ使用率として、計測値である「５０」が表示される。また、仮想マシン５１のＲＡＭ使用量として、計測値である「３０」が表示される。一方、仮想マシン５３のＣＰＵ使用率として、仮想マシン５１のＣＰＵの計測値である５０に倍率０．５０を乗じて算出される推定値である「２５」が表示される。また、仮想マシン５３のＲＡＭ使用量として、仮想マシン５１のＲＡＭの計測値である３０に倍率０．４０を乗じて算出される推定値である「１２」が表示される。 For example, the measured value "50" is displayed as the CPU usage rate of the virtual machine 51. Further, the measured value "30" is displayed as the RAM usage amount of the virtual machine 51. On the other hand, as the CPU usage rate of the virtual machine 53, "25", which is an estimated value calculated by multiplying 50, which is the measured value of the CPU of the virtual machine 51, by 0.50, is displayed. Further, as the amount of RAM used by the virtual machine 53, "12", which is an estimated value calculated by multiplying 30 which is the measured value of the RAM of the virtual machine 51 by 0.40, is displayed.

図１２は、グループ管理テーブルの例を示す図である。
グループ管理テーブル２３１は、グループ情報記憶部２２１に記憶される。グループ管理テーブル２３１は、リソース、自計測値、従属仮想マシン、他計測値および倍率の項目を含む。リソースの項目には、ＣＰＵやＲＡＭなどのリソースの種類が登録される。自計測値の項目には、仮想マシン５１で計測された当該リソースの使用状況を示す数値が登録される。従属仮想マシンの項目には、仮想マシン５１と同一グループに属する従属仮想マシンの識別情報が登録される。他計測値の項目には、従属仮想マシンで計測された当該リソースの使用状況を示す数値が登録される。倍率の項目には、仮想マシン５１の計測値に対する従属仮想マシンの計測値の最新の平均倍率が登録される。 FIG. 12 is a diagram showing an example of a group management table.
The group management table 231 is stored in the group information storage unit 221. The group management table 231 includes items of resources, self-measured values, dependent virtual machines, other measured values, and magnifications. In the resource item, the type of resource such as CPU and RAM is registered. In the self-measured value item, a numerical value indicating the usage status of the resource measured by the virtual machine 51 is registered. In the item of the subordinate virtual machine, the identification information of the subordinate virtual machine belonging to the same group as the virtual machine 51 is registered. In the item of other measured values, a numerical value indicating the usage status of the resource measured by the dependent virtual machine is registered. In the item of magnification, the latest average magnification of the measured value of the dependent virtual machine with respect to the measured value of the virtual machine 51 is registered.

次に、情報収集サーバ１００および代表仮想マシンの処理手順について説明する。
図１３は、サーバ開始処理の手順例を示すフローチャートである。
（Ｓ１０）テスト実行部１２４は、所定の仮想マシンにテストリクエストを送信する。テストリクエストに応じて、複数の仮想マシンの間でメッセージが送信される。 Next, the processing procedure of the information collection server 100 and the representative virtual machine will be described.
FIG. 13 is a flowchart showing a procedure example of the server start process.
(S10) The test execution unit 124 transmits a test request to a predetermined virtual machine. A message is sent between multiple virtual machines in response to a test request.

（Ｓ１１）テスト実行部１２４は、各仮想マシンから通信ログを受信する。通信ログは、メッセージの送信先の仮想マシンと送信時刻とを含んでいる。
（Ｓ１２）テスト実行部１２４は、ステップＳ１１で受信した通信ログに基づいて、複数の仮想マシンの間のデータフローを判定し、サービス構造テーブル１３１を生成する。 (S11) The test execution unit 124 receives the communication log from each virtual machine. The communication log contains the virtual machine to which the message is sent and the transmission time.
(S12) The test execution unit 124 determines the data flow between the plurality of virtual machines based on the communication log received in step S11, and generates the service structure table 131.

（Ｓ１３）稼働情報受信部１２５は、サービスに参加する仮想マシンの総数に応じて、稼働情報を収集する収集頻度の初期値を決定する。
（Ｓ１４）稼働情報受信部１２５は、収集頻度を各仮想マシンに通知する。これにより、稼働情報受信部１２５は、通知した頻度で各仮想マシンから稼働情報を受信する。稼働情報受信部１２５は、受信した稼働情報を稼働履歴テーブル１３２に保存する。 (S13) The operation information receiving unit 125 determines the initial value of the collection frequency for collecting the operation information according to the total number of virtual machines participating in the service.
(S14) The operation information receiving unit 125 notifies each virtual machine of the collection frequency. As a result, the operation information receiving unit 125 receives the operation information from each virtual machine at the notified frequency. The operation information receiving unit 125 stores the received operation information in the operation history table 132.

（Ｓ１５）相関検出部１２６は、サービス構造テーブル１３１および稼働履歴テーブル１３２に基づいて、リソース毎に複数の仮想マシンをグループ分けする。グループ内の１つの仮想マシンが稼働情報を送信する代表仮想マシンとなり、他の仮想マシンが稼働情報を送信しない従属仮想マシンとなる。グループ判定の詳細は後述する。 (S15) The correlation detection unit 126 groups a plurality of virtual machines for each resource based on the service structure table 131 and the operation history table 132. One virtual machine in the group becomes a representative virtual machine that transmits operation information, and another virtual machine becomes a subordinate virtual machine that does not transmit operation information. Details of group determination will be described later.

（Ｓ１６）相関検出部１２６は、ステップＳ１５で判定された代表仮想マシンに、同一グループに属する従属仮想マシンの識別情報およびその倍率を通知する。グループ分けはリソース毎に行われるため、当該通知もリソース毎に行われる。 (S16) The correlation detection unit 126 notifies the representative virtual machine determined in step S15 of the identification information of the subordinate virtual machines belonging to the same group and the magnification thereof. Since grouping is performed for each resource, the notification is also performed for each resource.

図１４は、グループ判定の手順例を示すフローチャートである。
グループ判定は、上記のステップＳ１５および後述するステップＳ６４で実行される。
（Ｓ２０）相関検出部１２６は、リソースを１つ選択する。監視対象のリソースは、ＣＰＵ、ＲＡＭ、ＨＤＤ、ネットワークなど予め決まっている。 FIG. 14 is a flowchart showing an example of a procedure for group determination.
The group determination is executed in step S15 described above and step S64 described later.
(S20) The correlation detection unit 126 selects one resource. The resources to be monitored are predetermined such as CPU, RAM, HDD, and network.

（Ｓ２１）相関検出部１２６は、サービス構造テーブル１３１が示す木構造のデータフローの中でルート（始点）の仮想マシンを選択し、代表仮想マシンに指定する。
（Ｓ２２）相関検出部１２６は、木構造のデータフローの中で、稼働情報を収集中の仮想マシンが残っているか判断する。該当する仮想マシンが残っている場合はステップＳ２３に進み、該当する仮想マシンが残っていない場合はステップＳ２９に進む。 (S21) The correlation detection unit 126 selects a root (starting point) virtual machine from the tree-structured data flow shown in the service structure table 131, and designates it as a representative virtual machine.
(S22) The correlation detection unit 126 determines whether or not a virtual machine for which operation information is being collected remains in the tree-structured data flow. If the corresponding virtual machine remains, the process proceeds to step S23, and if the corresponding virtual machine does not remain, the process proceeds to step S29.

（Ｓ２３）相関検出部１２６は、現在選択している仮想マシンからデータフローを下流方向に辿り、次の仮想マシンを選択する。データフローが分岐している場合、何れか１つの分岐方向を辿ればよい。木構造の末端まで到達した場合、相関検出部１２６は、直前の分岐点まで戻った上で未選択の分岐方向を辿るようにする。 (S23) The correlation detection unit 126 traces the data flow downstream from the currently selected virtual machine and selects the next virtual machine. When the data flow is branched, one of the branching directions may be followed. When the end of the tree structure is reached, the correlation detection unit 126 returns to the immediately preceding branch point and follows the unselected branch direction.

（Ｓ２４）相関検出部１２６は、稼働履歴テーブル１３２から、ステップＳ２３で選択した仮想マシンについて、ステップＳ２０で選択したリソースの稼働履歴を抽出する。また、相関検出部１２６は、稼働履歴テーブル１３２から、当該仮想マシンの上流側にある直前の代表仮想マシンについて、ステップＳ２０で選択したリソースの稼働履歴を抽出する。相関検出部１２６は、直前の代表仮想マシンと選択した仮想マシンとの間で、抽出した稼働情報に含まれる計測値の相関係数を算出する。 (S24) The correlation detection unit 126 extracts the operation history of the resource selected in step S20 for the virtual machine selected in step S23 from the operation history table 132. Further, the correlation detection unit 126 extracts the operation history of the resource selected in step S20 for the immediately preceding representative virtual machine on the upstream side of the virtual machine from the operation history table 132. The correlation detection unit 126 calculates the correlation coefficient of the measured value included in the extracted operation information between the immediately preceding representative virtual machine and the selected virtual machine.

（Ｓ２５）相関検出部１２６は、ステップＳ２４で算出した相関係数の絶対値と所定の閾値とを比較する。相関係数の絶対値が閾値より大きい場合はステップＳ２６に進み、相関係数の絶対値が閾値以下である場合はステップＳ２８に進む。 (S25) The correlation detection unit 126 compares the absolute value of the correlation coefficient calculated in step S24 with a predetermined threshold value. If the absolute value of the correlation coefficient is greater than the threshold value, the process proceeds to step S26, and if the absolute value of the correlation coefficient is equal to or less than the threshold value, the process proceeds to step S28.

（Ｓ２６）相関検出部１２６は、ステップＳ２３で選択した仮想マシンを、直前の代表仮想マシンと同一グループに分類し、従属仮想マシンに指定する。
（Ｓ２７）相関検出部１２６は、ステップＳ２４で稼働履歴テーブル１３２から抽出した稼働情報を用いて、代表仮想マシンの計測値に対する従属仮想マシンの計測値の倍率を算出する。相関検出部１２６は、代表仮想マシンと従属仮想マシンとが強い相関をもつ旨および倍率を、相関関係テーブル１３３に登録する。そして、ステップＳ２２に戻る。 (S26) The correlation detection unit 126 classifies the virtual machine selected in step S23 into the same group as the immediately preceding representative virtual machine and designates it as a dependent virtual machine.
(S27) The correlation detection unit 126 calculates the ratio of the measured value of the dependent virtual machine to the measured value of the representative virtual machine by using the operation information extracted from the operation history table 132 in step S24. The correlation detection unit 126 registers in the correlation table 133 that the representative virtual machine and the dependent virtual machine have a strong correlation and the magnification. Then, the process returns to step S22.

（Ｓ２８）相関検出部１２６は、ステップＳ２３で選択した仮想マシンを代表仮想マシンに指定する。そして、ステップＳ２２に戻る。
（Ｓ２９）相関検出部１２６は、ステップＳ２０において全てのリソースを選択したか判断する。全てのリソースを選択した場合はグループ判定が終了し、未選択のリソースが残っている場合はステップＳ２０に戻る。 (S28) The correlation detection unit 126 designates the virtual machine selected in step S23 as the representative virtual machine. Then, the process returns to step S22.
(S29) The correlation detection unit 126 determines whether all the resources have been selected in step S20. When all resources are selected, the group determination is completed, and when unselected resources remain, the process returns to step S20.

図１５は、仮想マシン開始処理の手順例を示すフローチャートである。
（Ｓ３０）通信ログ取得部２２３は、テスト実行中に通信ログを取得する。通信ログは、仮想マシン５１が送信するメッセージの宛先と送信時刻とを含む。 FIG. 15 is a flowchart showing a procedure example of the virtual machine start process.
(S30) The communication log acquisition unit 223 acquires the communication log during the test execution. The communication log includes the destination and the transmission time of the message transmitted by the virtual machine 51.

（Ｓ３１）通信ログ取得部２２３は、情報収集サーバ１００に通信ログを送信する。
（Ｓ３２）稼働情報取得部２２４は、情報収集サーバ１００から収集頻度通知を受信する。収集頻度通知は、収集頻度の初期値として、所定の低い収集頻度を示している。 (S31) The communication log acquisition unit 223 transmits the communication log to the information collection server 100.
(S32) The operation information acquisition unit 224 receives a collection frequency notification from the information collection server 100. The collection frequency notification indicates a predetermined low collection frequency as an initial value of the collection frequency.

（Ｓ３３）稼働情報取得部２２４は、ステップＳ３２で通知された低い収集頻度で、全てのリソースの稼働情報を取得して情報収集サーバ１００に送信することを開始する。
（Ｓ３４）グループ制御部２２５は、情報収集サーバ１００からグループ通知を受信したか判断する。グループ通知は、同一グループに属する従属仮想マシンの識別情報とその倍率を含む。グループ通知を受信した場合はステップＳ３５に進み、グループ通知を受信しなかった場合はステップＳ３８に進む。 (S33) The operation information acquisition unit 224 starts acquiring operation information of all resources and transmitting the operation information to the information collection server 100 at the low collection frequency notified in step S32.
(S34) The group control unit 225 determines whether or not the group notification has been received from the information collection server 100. The group notification includes the identification information of the subordinate virtual machines belonging to the same group and the magnification thereof. If the group notification is received, the process proceeds to step S35, and if the group notification is not received, the process proceeds to step S38.

（Ｓ３５）グループ制御部２２５は、グループ通知に基づいて、グループ内の仮想マシンの収集対象比率を算出する。グループに属する仮想マシンの総数をｎ、稼働情報を収集しない従属仮想マシンの数をｍとすると、収集対象比率は（ｎ−ｍ）÷ｎとなる。 (S35) The group control unit 225 calculates the collection target ratio of the virtual machines in the group based on the group notification. Assuming that the total number of virtual machines belonging to the group is n and the number of dependent virtual machines that do not collect operation information is m, the collection target ratio is (nm) ÷ n.

（Ｓ３６）グループ制御部２２５は、ステップＳ３５で算出した収集対象比率に応じて、稼働情報取得部２２４の収集頻度を、ステップＳ３２で通知された初期値から上げる。収集対象比率が小さいほど、収集頻度が高くなるようにする。変更後の収集頻度は、初期値をｎ÷（ｎ−ｍ）×α倍したものである。よって、変更後の送信間隔は、初期の送信間隔を（ｎ−ｍ）÷ｎ÷α倍したものに短縮される。ただし、この計算式で算出される収集頻度が所定の上限を超える場合、当該上限を収集頻度とする。 (S36) The group control unit 225 raises the collection frequency of the operation information acquisition unit 224 from the initial value notified in step S32 according to the collection target ratio calculated in step S35. The smaller the collection target ratio, the higher the collection frequency. The changed collection frequency is the initial value multiplied by n ÷ (nm) × α. Therefore, the transmission interval after the change is shortened to the initial transmission interval multiplied by (nm) ÷ n ÷ α. However, if the collection frequency calculated by this formula exceeds a predetermined upper limit, the upper limit shall be the collection frequency.

（Ｓ３７）グループ制御部２２５は、従属仮想マシンに停止通知を送信する。
（Ｓ３８）稼働情報取得部２２４は、代表仮想マシンから停止通知を受信したか判断する。停止通知を受信した場合はステップＳ３９に進み、停止通知を受信しなかった場合は仮想マシン開始処理が終了する。なお、代表仮想マシンと従属仮想マシンの分類は、リソース毎に行われる。よって、仮想マシン５１は、あるリソースについては代表仮想マシンに指定され、別のリソースについては従属仮想マシンに指定されることがある。その場合、仮想マシン５１は、あるリソースについては情報収集サーバ１００からグループ通知を受信し、別のリソースについては代表仮想マシンから停止通知を受信することがある。 (S37) The group control unit 225 transmits a stop notification to the subordinate virtual machine.
(S38) The operation information acquisition unit 224 determines whether or not the stop notification has been received from the representative virtual machine. If the stop notification is received, the process proceeds to step S39, and if the stop notification is not received, the virtual machine start process ends. The representative virtual machine and the subordinate virtual machine are classified for each resource. Therefore, the virtual machine 51 may be designated as a representative virtual machine for a certain resource and as a subordinate virtual machine for another resource. In that case, the virtual machine 51 may receive a group notification from the information collection server 100 for a certain resource and a stop notification from the representative virtual machine for another resource.

（Ｓ３９）稼働情報取得部２２４は、稼働情報の取得および送信を停止する。
図１６は、仮想マシン継続処理の手順例を示すフローチャートである。
（Ｓ４０）グループ制御部２２５は、仮想マシン５１が代表仮想マシンであるか判断する。代表仮想マシンである場合はステップＳ４１に進み、代表仮想マシンでない場合はステップＳ５０に進む。なお、代表仮想マシンであるか否かは、リソース毎に判断される。 (S39) The operation information acquisition unit 224 stops the acquisition and transmission of the operation information.
FIG. 16 is a flowchart showing a procedure example of virtual machine continuous processing.
(S40) The group control unit 225 determines whether the virtual machine 51 is a representative virtual machine. If it is a representative virtual machine, the process proceeds to step S41, and if it is not a representative virtual machine, the process proceeds to step S50. Whether or not it is a representative virtual machine is determined for each resource.

（Ｓ４１）グループ制御部２２５は、グループ管理テーブル２３１を参照して、仮想マシン５１と同一グループに属する従属仮想マシンを特定する。グループ制御部２２５は、特定した従属仮想マシンに稼働情報を要求し、稼働情報を受信する。 (S41) The group control unit 225 identifies a subordinate virtual machine that belongs to the same group as the virtual machine 51 with reference to the group management table 231. The group control unit 225 requests the operation information from the specified subordinate virtual machine and receives the operation information.

（Ｓ４２）グループ制御部２２５は、ステップＳ４１で受信した従属仮想マシンの稼働情報に含まれる計測値と、稼働情報取得部２２４で取得された仮想マシン５１の最新の計測値とを対応付けて記録する。グループ制御部２２５は、直近の一定期間の記録に基づいて、仮想マシン５１の計測値と従属仮想マシンの計測値との間の相関係数を算出する。 (S42) The group control unit 225 records the measured value included in the operation information of the dependent virtual machine received in step S41 in association with the latest measured value of the virtual machine 51 acquired by the operation information acquisition unit 224. To do. The group control unit 225 calculates the correlation coefficient between the measured value of the virtual machine 51 and the measured value of the dependent virtual machine based on the record of the latest fixed period.

（Ｓ４３）グループ制御部２２５は、ステップＳ４２で算出した相関係数の絶対値と所定の閾値とを比較する。相関係数の絶対値が閾値より大きい場合はステップＳ４４に進み、相関係数の絶対値が閾値以下である場合はステップＳ４６に進む。 (S43) The group control unit 225 compares the absolute value of the correlation coefficient calculated in step S42 with a predetermined threshold value. If the absolute value of the correlation coefficient is greater than the threshold value, the process proceeds to step S44, and if the absolute value of the correlation coefficient is equal to or less than the threshold value, the process proceeds to step S46.

（Ｓ４４）グループ制御部２２５は、直近の一定期間の記録に基づいて、仮想マシン５１の計測値に対する従属仮想マシンの計測値の倍率を算出する。グループ制御部２２５は、グループ管理テーブル２３１に記録された倍率を更新する。 (S44) The group control unit 225 calculates the magnification of the measured value of the dependent virtual machine with respect to the measured value of the virtual machine 51 based on the record of the latest fixed period. The group control unit 225 updates the magnification recorded in the group management table 231.

（Ｓ４５）グループ制御部２２５は、ステップＳ４４で更新した倍率を含む相関更新通知を、情報収集サーバ１００に送信する。そして、ステップＳ５０に進む。
（Ｓ４６）グループ制御部２２５は、従属仮想マシンに再開通知を送信する。 (S45) The group control unit 225 transmits a correlation update notification including the magnification updated in step S44 to the information collection server 100. Then, the process proceeds to step S50.
(S46) The group control unit 225 transmits a restart notification to the subordinate virtual machine.

（Ｓ４７）グループ制御部２２５は、１つの従属仮想マシンがグループから外れたことに伴って、グループ内の仮想マシンの収集対象比率を更新する。直前の時点のグループに属する仮想マシンの総数をｎ、直前の時点の稼働情報を収集しない従属仮想マシンの数をｍとすると、収集対象比率は（ｎ−ｍ）÷（ｎ−１）に増大する。 (S47) The group control unit 225 updates the collection target ratio of the virtual machines in the group when one subordinate virtual machine leaves the group. Assuming that the total number of virtual machines belonging to the group at the immediately preceding point in time is n and the number of dependent virtual machines that do not collect operation information at the immediately preceding point in time is m, the collection target ratio increases to (nm) ÷ (n-1). To do.

（Ｓ４８）グループ制御部２２５は、ステップＳ４７で算出した収集対象比率に応じて、稼働情報取得部２２４の収集頻度を直前よりも下げる。
（Ｓ４９）グループ制御部２２５は、グループから外れた仮想マシンの識別情報を含むグループ解除通知を、情報収集サーバ１００に送信する。 (S48) The group control unit 225 lowers the collection frequency of the operation information acquisition unit 224 from the immediately preceding value according to the collection target ratio calculated in step S47.
(S49) The group control unit 225 transmits a group release notification including the identification information of the virtual machine out of the group to the information collection server 100.

（Ｓ５０）稼働情報取得部２２４は、代表仮想マシンから再開通知を受信したか判断する。再開通知を受信した場合はステップＳ５１に進み、再開通知を受信しなかった場合は仮想マシン継続処理が終了する。なお、仮想マシン５１は、あるリソースについて代表仮想マシンに指定され、別のリソースについて従属仮想マシンに指定されることがある。よって、仮想マシン５１は、あるリソースについて従属仮想マシンを監視する一方、別のリソースについて代表仮想マシンから再開通知を受信することがある。 (S50) The operation information acquisition unit 224 determines whether or not the restart notification has been received from the representative virtual machine. If the restart notification is received, the process proceeds to step S51, and if the restart notification is not received, the virtual machine continuation process ends. The virtual machine 51 may be designated as a representative virtual machine for a certain resource and as a subordinate virtual machine for another resource. Therefore, the virtual machine 51 may monitor the dependent virtual machine for one resource while receiving a restart notification from the representative virtual machine for another resource.

（Ｓ５１）稼働情報取得部２２４は、稼働情報の取得および送信を再開する。再開後の収集頻度は、例えば、ステップＳ３２で通知された低頻度の初期値とする。
図１７は、サーバ継続処理の手順例を示すフローチャートである。 (S51) The operation information acquisition unit 224 resumes acquisition and transmission of operation information. The collection frequency after resumption is, for example, the low frequency initial value notified in step S32.
FIG. 17 is a flowchart showing an example of a procedure for continuous server processing.

（Ｓ６０）相関更新部１２７は、各代表仮想マシンから相関更新通知を受信する。相関更新通知は、従属仮想マシンの識別情報および更新後の倍率を含む。
（Ｓ６１）相関更新部１２７は、相関関係テーブル１３３の倍率を更新する。 (S60) The correlation update unit 127 receives the correlation update notification from each representative virtual machine. The correlation update notification includes the identification information of the dependent virtual machine and the updated magnification.
(S61) The correlation update unit 127 updates the magnification of the correlation table 133.

（Ｓ６２）相関更新部１２７は、何れかの代表仮想マシンからグループ解除通知を受信したか判断する。グループ解除通知を受信した場合はステップＳ６３に進み、グループ解除通知を受信していない場合はステップＳ６４に進む。 (S62) The correlation update unit 127 determines whether the group release notification has been received from any of the representative virtual machines. If the group release notification is received, the process proceeds to step S63, and if the group release notification is not received, the process proceeds to step S64.

（Ｓ６３）相関更新部１２７は、相関関係テーブル１３３から、グループ解除通知を送信した代表仮想マシンと通知された従属仮想マシンとの間の相関関係を削除する。
（Ｓ６４）相関検出部１２６は、図１４に示すフローチャートに従い、現在の代表仮想マシンの中にグループ化できるものがあるか再判定する。 (S63) The correlation update unit 127 deletes the correlation between the representative virtual machine that sent the group release notification and the notified dependent virtual machine from the correlation table 133.
(S64) The correlation detection unit 126 redetermines whether or not any of the current representative virtual machines can be grouped according to the flowchart shown in FIG.

（Ｓ６５）相関検出部１２６は、ステップＳ６３の再判定により、代表仮想マシンが従属仮想マシンに変更されることで従属仮想マシンが増加したか判断する。従属仮想マシンが増加した場合はステップＳ６６に進み、増加していない場合はステップＳ６７に進む。 (S65) The correlation detection unit 126 determines whether or not the number of subordinate virtual machines has increased due to the change of the representative virtual machine to the subordinate virtual machine by the redetermination in step S63. If the number of dependent virtual machines has increased, the process proceeds to step S66, and if the number has not increased, the process proceeds to step S67.

（Ｓ６６）相関検出部１２６は、相関関係テーブル１３３を更新する。また、相関検出部１２６は、新たな従属仮想マシンが属するグループの代表仮想マシンに、当該従属仮想マシンの識別情報およびその倍率を含むグループ通知を送信する。 (S66) The correlation detection unit 126 updates the correlation table 133. In addition, the correlation detection unit 126 transmits a group notification including the identification information of the subordinate virtual machine and its magnification to the representative virtual machine of the group to which the new subordinate virtual machine belongs.

（Ｓ６７）稼働状況表示部１２８は、ユーザから稼働状況の要求を受け付けたか判断する。稼働状況の要求を受け付けた場合はステップＳ６８に進み、稼働状況の要求を受け付けていない場合はサーバ継続処理が終了する。 (S67) The operation status display unit 128 determines whether the request for the operation status has been received from the user. If the request for the operating status is accepted, the process proceeds to step S68, and if the request for the operating status is not accepted, the server continuation process ends.

（Ｓ６８）稼働状況表示部１２８は、従属仮想マシンの稼働状況を示す数値を、相関関係テーブル１３３に登録された代表仮想マシンの計測値に倍率を乗じることで推定する。
（Ｓ６９）稼働状況表示部１２８は、各仮想マシンの稼働状況の数値を列挙した稼働状況テーブル１３４を生成し、表示装置１１１に表示する。稼働状況テーブル１３４の数値には、収集された計測値とステップＳ６８で算出された推定値とが含まれる。 (S68) The operation status display unit 128 estimates a numerical value indicating the operation status of the dependent virtual machine by multiplying the measured value of the representative virtual machine registered in the correlation table 133 by a multiplication factor.
(S69) The operation status display unit 128 generates an operation status table 134 that lists the numerical values of the operation status of each virtual machine, and displays the operation status table 134 on the display device 111. The numerical values in the operation status table 134 include the collected measured values and the estimated values calculated in step S68.

第２の実施の形態の情報処理システムによれば、一部の仮想マシンからの稼働情報の収集が停止され、全体のデータ通信量が減少する範囲で、残った仮想マシンからの稼働情報の収集頻度が引き上げられる。稼働情報を収集しない仮想マシンについては、稼働情報を収集する仮想マシンとの相関関係に基づいて、リソースの使用状況が推定される。これにより、ネットワークトラフィックを削減しつつ、仮想マシンの監視のリアルタイム性を確保し監視精度を向上させることができる。特に、マルチクラウド環境において、インターネットなどの広域データ通信ネットワークのトラフィックを削減することができる。よって、仮想マシンの監視コストを削減し、監視の遅延を抑制することができる。 According to the information processing system of the second embodiment, the collection of operation information from some virtual machines is stopped, and the collection of operation information from the remaining virtual machines is collected within the range in which the total data communication volume is reduced. The frequency will be increased. For virtual machines that do not collect operation information, resource usage is estimated based on the correlation with the virtual machines that collect operation information. As a result, it is possible to ensure real-time monitoring of virtual machines and improve monitoring accuracy while reducing network traffic. In particular, in a multi-cloud environment, it is possible to reduce the traffic of a wide area data communication network such as the Internet. Therefore, it is possible to reduce the monitoring cost of the virtual machine and suppress the delay in monitoring.

また、テストリクエストを用いて、サービス内での複数の仮想マシンの間のデータフローが判定され、データフローに沿って親仮想マシンと子仮想マシンとの間で相関関係が判定される。サービスでは親仮想マシンから子仮想マシンにメッセージが送信されているため、親仮想マシンの負荷と子仮想マシンの負荷の間には強い相関がある可能性が高い。そのため、複数の仮想マシンの中で相関関係を効率的に判定することができる。また、代表仮想マシンと従属仮想マシンとの間の相関関係の変化は、代表仮想マシンによって分散して監視される。よって、広域データ通信ネットワークのトラフィックを削減できる。 In addition, the test request is used to determine the data flow between a plurality of virtual machines in the service, and the correlation between the parent virtual machine and the child virtual machine is determined along the data flow. Since the service sends messages from the parent virtual machine to the child virtual machine, there is likely a strong correlation between the load on the parent virtual machine and the load on the child virtual machine. Therefore, the correlation can be efficiently determined among a plurality of virtual machines. Also, changes in the correlation between the representative virtual machine and the dependent virtual machines are distributed and monitored by the representative virtual machines. Therefore, the traffic of the wide area data communication network can be reduced.

１０，２０，２０ａ情報処理装置
１１通信部
１２記憶部
１３処理部
１４相関関係
２１，２２仮想マシン
２３，２４稼働情報 10, 20, 20a Information processing device 11 Communication unit 12 Storage unit 13 Processing unit 14 Correlation 21,22 Virtual machine 23,24 Operation information

Claims

A second operation information indicating the resource usage status in the first virtual machine is received from the first virtual machine, and a second operation information indicating the resource usage status in the second virtual machine is received from the second virtual machine. Communication unit that receives operation information and
A storage unit that stores the received history of the first operation information and the second operation information, and
Based on the history, the correlation of the resource usage status between the first virtual machine and the second virtual machine is determined, and when the correlation satisfies a predetermined condition, the second operation information Is controlled so that the reception frequency of the first operation information is lower than the reception frequency of the first operation information, and based on the first operation information received from the first virtual machine and the correlation, the second operation information A processing unit that estimates resource usage in a virtual machine,
Information processing device with.

When the correlation satisfies the predetermined condition, the transmission of the second operation information from the second virtual machine to the information processing device is controlled to be stopped.
The information processing device according to claim 1.

When the correlation satisfies the predetermined condition, it is instructed to reduce the frequency of transmitting the second operation information to the second virtual machine through the first virtual machine.
The information processing device according to claim 1.

Before determining the correlation, the processing unit receives the first operation information and the first operation information at a predetermined frequency lower than the reception frequency of the first operation information when the correlation satisfies the predetermined condition. Control so that the operation information of 2 is received,
The information processing device according to claim 1.

When the correlation satisfies the predetermined condition, the reception frequency of the second operation information is lower than that before the correlation is determined, and the reception frequency of the first operation information is before the correlation is determined. Controlled to be higher than
The information processing device according to claim 1.

Of the plurality of virtual machines including the first virtual machine and the second virtual machine, the frequency of receiving operation information of some of the virtual machines including the second virtual machine is higher than before the correlation is determined. Also, the frequency of receiving operation information of other virtual machines including the first virtual machine becomes higher than before the correlation is determined.
The reception frequency after the change of the first operation information is determined based on the ratio of the number of the other virtual machines to the number of the plurality of virtual machines.
The information processing device according to claim 5.

The reception frequency of the second operation information is updated according to the change in the correlation.
The information processing device according to claim 1.

The change in the correlation is monitored by the first virtual machine, and the first virtual machine notifies the information processing apparatus.
The information processing device according to claim 7.

The processing unit detects the order of message communication between the first virtual machine and a plurality of virtual machines including the second virtual machine by sending a test request, and the virtual machine from which the message is sent. Is selected as the first virtual machine, and the virtual machine to which the message is sent is selected as the second virtual machine.
The information processing device according to claim 1.

Based on the first operation information and the second operation information within a predetermined time included in the history, the processing unit has a time series of resource usage status in the first virtual machine as the correlation. A correlation coefficient between the change and the time-series change of the resource usage in the second virtual machine is calculated, and it is determined whether the correlation coefficient is larger than the threshold value.
The information processing device according to claim 1.

The first information processing device that executes the first virtual machine,
A second information processing device that executes the second virtual machine,
A third information processing device that communicates with the first virtual machine and the second virtual machine,
The third information processing apparatus has
The first operation information indicating the resource usage status in the first virtual machine is received from the first virtual machine, and the resource usage status in the second virtual machine is shown from the second virtual machine. Received the operation information of 2 and
The received history of the first operation information and the second operation information is retained, and the history is retained.
Based on the history, the correlation of the resource usage status between the first virtual machine and the second virtual machine is determined, and when the correlation satisfies a predetermined condition, the second operation information Is controlled so that the reception frequency of the first operation information is lower than the reception frequency of the first operation information.
Based on the first operation information received from the first virtual machine and the correlation, the resource usage status in the second virtual machine is estimated.
Information processing system.

The computer
The first operation information indicating the resource usage status in the first virtual machine received from the first virtual machine and the resource usage status in the second virtual machine received from the second virtual machine are shown. Acquire the history with the second operation information shown,
Based on the history, the correlation of the resource usage status between the first virtual machine and the second virtual machine is determined, and when the correlation satisfies a predetermined condition, the second operation information Is controlled so that the reception frequency of the first operation information is lower than the reception frequency of the first operation information.
Based on the first operation information received from the first virtual machine and the correlation, the resource usage status in the second virtual machine is estimated.
Information processing method.

On the computer
The first operation information indicating the resource usage status in the first virtual machine received from the first virtual machine and the resource usage status in the second virtual machine received from the second virtual machine are shown. Acquire the history with the second operation information shown,
Based on the history, the correlation of the resource usage status between the first virtual machine and the second virtual machine is determined, and when the correlation satisfies a predetermined condition, the second operation information Is controlled so that the reception frequency of the first operation information is lower than the reception frequency of the first operation information.
Based on the first operation information received from the first virtual machine and the correlation, the resource usage status in the second virtual machine is estimated.
A program that executes processing.