JP6775022B2

JP6775022B2 - Learning data processing device and method

Info

Publication number: JP6775022B2
Application number: JP2018536651A
Authority: JP
Inventors: 中島　淳; 淳中島; 峰義増田; 裕教江丸
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2016-09-02
Filing date: 2016-09-02
Publication date: 2020-10-28
Anticipated expiration: 2036-09-02
Also published as: WO2018042637A1; JPWO2018042637A1

Description

本発明は、ＩＴシステム運用管理において取得されるデータの処理方法に関する。 The present invention relates to a method of processing data acquired in IT system operation management.

仮想化機構の普及やクラウド等の新たなシステム提供形態の出現に伴い、ＩＴシステムの運用管理は複雑化している。また、ＩＴシステムで扱われるデータ量の爆発的な増加に伴い、ＩＴシステムの規模は年々拡大し、ＩＴシステムを管理する管理ソフトウェアが扱うオブジェクト数（例えば、ストレージ装置が提供するボリュームの数）も増大している。複雑かつ大量なデータを持つＩＴシステムを、管理コストを抑えて管理することが求められている。 With the spread of virtualization mechanisms and the emergence of new system provision forms such as the cloud, the operation management of IT systems has become complicated. In addition, with the explosive increase in the amount of data handled by IT systems, the scale of IT systems is expanding year by year, and the number of objects handled by management software that manages IT systems (for example, the number of volumes provided by storage devices) is also increasing. It is increasing. It is required to manage an IT system having a complicated and large amount of data at a low management cost.

ＩＴシステムの管理コストを抑えるために、管理を自動化することが考えられる。ＩＴシステム管理の自動化において活用可能な技術の１つに、機械学習技術が存在する。ＩＴシステムの各オブジェクトにおける各種情報を収集し、学習データとして学習することで、ＩＴシステム内の任意の要素とそれ以外の要素との関連について、学習データに最も良くあてはまる関数を特定することが可能となる。 In order to reduce the management cost of the IT system, it is conceivable to automate the management. Machine learning technology is one of the technologies that can be used in the automation of IT system management. By collecting various information in each object of the IT system and learning it as learning data, it is possible to identify the function that best applies to the learning data regarding the relationship between any element in the IT system and other elements. It becomes.

例えば、非特許文献１には、ＩＴシステムで実行する処理の応答性能を、処理の並列数などの処理をおこなう設定に関するパラメータと、処理するデータのサイズなどの処理対象に関するパラメータから予測することを可能にする関数を、学習によって求める技術について記載されている。この関数を利用することで、例えば処理の並列数とデータサイズから処理の応答時間が推定できるようになるため、処理の実行スケジュールの決定をおこなうことができる。また、必要な応答性能を出すために必要な処理の並列数を推定できるようになるため、必要な応答性能を得るために必要となるリソース量を推定することができる。 For example, Non-Patent Document 1 states that the response performance of processing executed in an IT system is predicted from parameters related to settings for processing such as the number of parallel processes and parameters related to processing targets such as the size of data to be processed. It describes the technique for finding the functions that enable it by learning. By using this function, for example, the response time of the process can be estimated from the number of parallel processes and the data size, so that the execution schedule of the process can be determined. In addition, since the number of parallel processes required to obtain the required response performance can be estimated, the amount of resources required to obtain the required response performance can be estimated.

また、インターネットを介して、ＩＴシステムを事業者が提供し、利用に応じて利用者に課金するクラウドサービスが普及している。クラウドサービスの形態として、ＩａａＳ（ＩｎｆｒａｓｔｒｕｃｔｕｒｅａｓａＳｅｒｖｉｃｅ）、ＰａａＳ（ＰｌａｔｆｏｒｍａｓａＳｅｒｖｉｃｅ）、ＳａａＳ（ＳｏｆｔｗａｒｅａｓａＳｅｒｖｉｃｅ）などの形態がある。 In addition, a cloud service in which an IT system is provided by a business operator via the Internet and the user is charged according to the usage is widespread. As a form of cloud service, there are forms such as IaaS (Infrastructure as a Service), PasaS (Platform as a Service), and SaaS (Software as a Service).

加えて、クラウドサービスは、機密性が高いデータや、リアルタイム性を要求するアプリケーションのデータ格納場所として利用するには向かないものの、管理業務は、そのＩＴシステムの本来の機能から切り離せることからも、また利用に応じて費用を支払いたいというニーズからもクラウドサービスに好適である。 In addition, cloud services are not suitable for use as data storage locations for highly confidential data and applications that require real-time performance, but management operations can be separated from the original functions of the IT system. It is also suitable for cloud services because of the need to pay for usage.

このような背景から、これまでオンプレミスで稼働していた管理ソフトのＳａａＳ（ＳｏｆｔｗａｒｅａｓａＳｅｒｖｉｃｅ）提供や、運用業務の一部をサービスとして請け負うといった動きがある。オンプレミスにあるストレージを事業者がネットワーク経由で監視し、イベントが発生した場合に構成変更やディスク交換等の保守業務を行う方式は特許文献１で開示されている。 Against this background, there are movements such as providing SaaS (Software as a Service), a management software that has been operating on-premises, and undertaking part of the operation work as a service. Patent Document 1 discloses a method in which a business operator monitors on-premises storage via a network and performs maintenance work such as configuration change and disk replacement when an event occurs.

特開２００６−１０７０８０号公報Japanese Unexamined Patent Publication No. 2006-107080

Statistics-driven workload modeling for the cloud，Archana Ganapathi，University of California at Berkeley, ICDE 2010Statistics-driven workload modeling for the cloud, Archana Ganapathi, University of California at Berkeley, ICDE 2010

特許文献１及び非特許文献１に開示されている技術は比較的変化の少ない環境での利用を想定しており、システム構成が頻繁に変更されることを想定していない。一方で、仮想化機構の普及や、クラウド等の新たなシステム提供形態の出現に伴い、ＩＴシステムの構成を比較的容易に変更できようになっており、システム構成の変更頻度も上がることが考えられる。 The techniques disclosed in Patent Document 1 and Non-Patent Document 1 are intended to be used in an environment with relatively little change, and are not intended to be frequently changed in the system configuration. On the other hand, with the spread of virtualization mechanisms and the emergence of new system provision forms such as the cloud, it is becoming relatively easy to change the IT system configuration, and it is thought that the frequency of system configuration changes will increase. Be done.

機械学習の精度を上げるには大量の学習データが必要である。ＩＴシステムの管理においては、ＩＴシステムの各オブジェクトから長期間にわたって性能情報や容量情報などの各種履歴情報を取得する必要がある。しかし、ＩＴシステムにおいて構成変更が発生すると、構成変更が発生した後に再び長期間の学習をおこなう必要が生じる。そして、構成変更後しばらくは機械学習の精度が上がらず効率の良い管理業務を行うことができないことが考えられる。 A large amount of learning data is required to improve the accuracy of machine learning. In IT system management, it is necessary to acquire various historical information such as performance information and capacity information from each object of the IT system over a long period of time. However, when a configuration change occurs in the IT system, it becomes necessary to perform long-term learning again after the configuration change occurs. Then, it is conceivable that the accuracy of machine learning does not improve for a while after the configuration change, and efficient management work cannot be performed.

本発明の目的は、構成が変更されるシステムを対象とした機械学習の精度を高める技術を提供することである。 An object of the present invention is to provide a technique for improving the accuracy of machine learning for a system whose configuration is changed.

本発明の一態様によれば、学習データ処理装置は、対象システムが運用される間に対象システムの構成要素から学習データを取得する情報収集部と、学習データに基づいて対象システムの構成要素間の関係を目的情報と説明情報の関係により表現した予測式を生成する予測式生成部と、対象システムの構成を変更する設定内容を決定する設定変更内容決定部と、対象システムの構成を変更する構成変更部と、を有し、設定変更内容決定部が、対象システムの構成を変更する場合に取り得る構成要素の状態の範囲において、学習データが十分に取得されていない状態をデータ不足状態として抽出し、構成要素がデータ不足状態となるように対象システムの構成を変更する設定内容を決定し、構成変更部が、決定された設定内容に従って対象システムの構成を変更し、情報収集部が、対象システムの構成要素からデータ不足状態のときの学習データを取得する。 According to one aspect of the present invention, the learning data processing device is between an information collecting unit that acquires learning data from the components of the target system while the target system is operated and the components of the target system based on the learning data. Change the configuration of the target system, the predictive formula generator that generates the predictive formula that expresses the relationship between the objective information and the explanatory information, and the setting change content determination unit that determines the setting contents that change the configuration of the target system. A state in which training data is not sufficiently acquired within the range of the state of the components that can be taken when the setting change content determination unit changes the configuration of the target system, which has a configuration change unit, is regarded as a data shortage state. Extract and determine the setting contents to change the configuration of the target system so that the components are in the data shortage state, the configuration change unit changes the configuration of the target system according to the determined setting content, and the information collection unit Acquire training data when data is insufficient from the components of the target system.

将来の対象システムの構成変更時に不足となる学習データを、予め対象システムの構成を一時的に変更して取得しておくことができるので、実際に構成が変更されたときに生じる学習データの不足が低減され、機械学習の精度が早期に向上する。これを構成の変更が比較的頻繁に行われる対象システムにおける機械学習を適用した場合、構成が変更されたときに生じる機械学習の精度低下を抑制することができる。 Since the learning data that will be insufficient when the configuration of the target system is changed in the future can be acquired by temporarily changing the configuration of the target system in advance, the lack of learning data that occurs when the configuration is actually changed Is reduced, and the accuracy of machine learning is improved at an early stage. When machine learning in a target system in which the configuration is changed relatively frequently is applied to this, it is possible to suppress a decrease in the accuracy of machine learning that occurs when the configuration is changed.

実施例１による計算機システムの概略を説明するための図である。It is a figure for demonstrating the outline of the computer system according to Example 1. FIG. 実施例１に係る計算機システムの一例の構成図である。It is a block diagram of an example of the computer system which concerns on Example 1. FIG. 実施例１に係る関連情報テーブルの一例を示す図である。It is a figure which shows an example of the related information table which concerns on Example 1. FIG. 実施例１に係る性能履歴情報テーブル１１２０の一例を示す図である。It is a figure which shows an example of the performance history information table 1120 which concerns on Example 1. FIG. 実施例１に係る構成情報テーブル１１３０の一例を示す図である。It is a figure which shows an example of the configuration information table 1130 which concerns on Example 1. FIG. 実施例１に係る構成情報テーブル１１３０の一例を示す図である。It is a figure which shows an example of the configuration information table 1130 which concerns on Example 1. FIG. 実施例１に係る構成情報テーブル１１３０の一例を示す図である。It is a figure which shows an example of the configuration information table 1130 which concerns on Example 1. FIG. 実施例１に係る予測式元情報テーブル１１４０の一例を示す図である。It is a figure which shows an example of the prediction formula original information table 1140 which concerns on Example 1. FIG. 実施例１に係る予測式テーブル１１５０の一例を示す図である。It is a figure which shows an example of the prediction formula table 1150 which concerns on Example 1. FIG. 実施例１に係わる予測式を生成する処理のフローチャートである。It is a flowchart of the process which generates the prediction formula which concerns on Example 1. FIG. 実施例１に係わる学習データを取得するための設定変更内容を決定する処理のフローチャートである。It is a flowchart of the process of determining the setting change content for acquiring the learning data which concerns on Example 1. 学習データ取得用設定変更を実行する処理のフローチャートである。It is a flowchart of the process which executes the setting change for learning data acquisition. 実施例２に係る業務特性管理テーブル１８００の一例を示す図である。It is a figure which shows an example of the business characteristic management table 1800 which concerns on Example 2. FIG. 実施例２に係る学習用データ共有を実行する処理のフローチャートである。It is a flowchart of the process which executes learning data sharing which concerns on Example 2.

幾つかの実施例を、図面を参照して説明する。なお、以下に説明する実施例は特許請求の範囲にかかる発明を限定するものではなく、また実施例の中で説明されている諸要素及びその組み合わせの全てが発明の解決手段に必須であるとは限らない。これらの図面において、複数の図を通じて同一の符号は同一の構成要素を示している。なお、以後の説明では「ａａａテーブル」等の表現にて本発明の情報を説明するが、これら情報はテーブル等のデータ構造以外で表現されていてもよい。そのため、データ構造に依存しないことを示すために「ａａａテーブル」等について「ａａａ情報」と呼ぶことがある。さらに、各情報の内容を説明する際に、「識別情報」、「識別子」、「名称」、「ＩＤ」という表現を用いるが、これらについてはお互いに置換が可能である。 Some embodiments will be described with reference to the drawings. It should be noted that the examples described below do not limit the inventions claimed in the claims, and all of the elements and combinations thereof described in the examples are essential for the means for solving the invention. Is not always. In these drawings, the same reference numerals indicate the same components throughout the drawings. In the following description, the information of the present invention will be described by expressions such as "aaa table", but these information may be expressed by other than the data structure such as a table. Therefore, the "aaa table" and the like may be referred to as "aaa information" to show that it does not depend on the data structure. Further, when explaining the contents of each information, the expressions "identification information", "identifier", "name", and "ID" are used, but these can be replaced with each other.

以後の説明では「プログラム」を主語として説明を行う場合があるが、プログラムはプロセッサによって実行されることで定められた処理をメモリ及び通信ポート（通信デバイス、管理Ｉ／Ｆ、データＩ／Ｆ）を用いながら行うため、プロセッサを主語とした説明としてもよい。また、プログラムを主語として開示された処理は管理サーバ（管理計算機）等の計算機、情報処理装置が行う処理としてもよい。また、プログラムの一部または全ては専用ハードウェアによって実現されてもよい。また、各種プログラムはプログラム配布サーバや、計算機が読み取り可能な記憶メディアによって各計算機にインストールされてもよい。 In the following explanation, "program" may be used as the subject, but the program performs the processing defined by being executed by the processor in the memory and communication port (communication device, management I / F, data I / F). Since it is performed while using, the explanation may be based on the processor. Further, the process disclosed with the program as the subject may be a process performed by a computer such as a management server (management computer) or an information processing device. In addition, part or all of the program may be realized by dedicated hardware. In addition, various programs may be installed in each computer by a program distribution server or a storage medium that can be read by the computer.

以後、計算機システムを管理し、本発明の表示用情報を表示する一つ以上の計算機の集合を管理システムと呼ぶことがある。管理サーバが表示用情報を表示する場合は管理サーバが管理システムである。また、管理サーバと表示用計算機との組み合わせも管理システムである。また、管理処理の高速化や高信頼化のために複数の計算機で管理サーバと同等の処理を実現してもよく、この場合は当該複数の計算機（表示を表示用計算機が行う場合は表示用計算機も含め）が管理システムである。 Hereinafter, a set of one or more computers that manages a computer system and displays display information of the present invention may be referred to as a management system. When the management server displays display information, the management server is the management system. The combination of the management server and the display computer is also a management system. Further, in order to speed up and improve the reliability of the management process, a plurality of computers may realize the same processing as the management server. In this case, the plurality of computers (for display when the display is performed by the display computer). (Including the computer) is the management system.

本実施例に係る計算機システムについて説明する。 The computer system according to this embodiment will be described.

図１は、実施例１による計算機システムの概略を説明するための図である。ここで説明する動作は主に設定変更内容決定プログラム１１８０により実行される。 FIG. 1 is a diagram for explaining the outline of the computer system according to the first embodiment. The operation described here is mainly executed by the setting change content determination program 1180.

（１）設定変更内容決定プログラム１１８０は、まず、予測式テーブル１１５０及び構成情報テーブル１１３０を参照し、予測式のパラメータのうち計算機システムの構成を示す構成情報にあたるパラメータを抽出する。（２）続いて、設定変更内容決定プログラム１１８０は、予測式元情報テーブル１１４０を参照し、抽出したパラメータの構成上取り得る範囲内で、学習データの情報が不足している範囲を特定する。（３）続いて、設定変更内容決定プログラム１１８０は、抽出したパラメータを、特定した範囲に設定するようにパラメータの値を決定し、パラメータの設定変更を行う。（４）設定を変更した状態で計算機システムを運用することにより、不足していた学習データを取得することができる。 (1) The setting change content determination program 1180 first refers to the prediction formula table 1150 and the configuration information table 1130, and extracts the parameters corresponding to the configuration information indicating the configuration of the computer system from the parameters of the prediction formula. (2) Subsequently, the setting change content determination program 1180 refers to the prediction formula source information table 1140, and identifies a range in which the training data information is insufficient within a range that can be taken in terms of the configuration of the extracted parameters. (3) Subsequently, the setting change content determination program 1180 determines the value of the parameter so as to set the extracted parameter in the specified range, and changes the setting of the parameter. (4) By operating the computer system with the settings changed, the missing learning data can be acquired.

図２は、実施例１に係る計算機システムの一例の構成図である。本実施例に係る計算機システムは、１台以上の管理サーバ１０００、１台以上のストレージ装置２０００、及び１台以上のサーバ３０００を備える。サーバ３０００及びストレージ装置２０００は、ＳＡＮ（ＳｔｏｒａｇｅＡｒｅａＮｅｔｗｏｒｋ）４０００を介して互いに接続される。ＳＡＮの具体例としてファイバチャネルがある。管理サーバ１０００、ストレージ装置２０００、およびサーバ３０００は、管理用ネットワーク５０００を介して互いに接続される。 FIG. 2 is a configuration diagram of an example of a computer system according to the first embodiment. The computer system according to this embodiment includes one or more management servers 1000, one or more storage devices 2000, and one or more servers 3000. The server 3000 and the storage device 2000 are connected to each other via a SAN (Storage Area Network) 4000. A specific example of SAN is Fiber Channel. The management server 1000, the storage device 2000, and the server 3000 are connected to each other via the management network 5000.

管理サーバ１０００は、メモリ１１００、通信デバイス１２００、プロセッサ１３００、出力デバイス１４００、入力デバイス１５００、および記憶デバイス１６００を備えている。これらは管理サーバ１０００内の内部バス１７００を介して互いに接続される。 The management server 1000 includes a memory 1100, a communication device 1200, a processor 1300, an output device 1400, an input device 1500, and a storage device 1600. These are connected to each other via the internal bus 1700 in the management server 1000.

メモリ１１００には、関連情報テーブル１１１０、性能履歴情報テーブル１１２０、構成情報テーブル１１３０、予測式元情報テーブル１１４０、予測式テーブル１１５０、情報収集プログラム１１６０、予測式生成プログラム１１７０、設定変更内容決定プログラム１１８０、構成変更プログラム１１９０が格納される。 The memory 1100 includes a related information table 1110, a performance history information table 1120, a configuration information table 1130, a prediction formula source information table 1140, a prediction formula table 1150, an information collection program 1160, a prediction formula generation program 1170, and a setting change content determination program 1180. , The configuration change program 1190 is stored.

通信デバイス１２００は、管理サーバ１０００を管理用ネットワーク５０００に接続するためのデバイスである。管理サーバ１０００は、管理用ネットワーク５０００を通して、サーバ３０００上で動作するプログラムと通信できる。プロセッサ１３００は、メモリ１１００上に展開されている各種プログラムを実行する。出力デバイス１４００は、管理サーバ１０００が実行した処理結果を出力するデバイスであり、例えばディスプレイ等である。入力デバイス１５００は、管理者が管理サーバ１０００に指示を入力するためのデバイスであり、例えばキーボード等である。記憶デバイス１６００は、情報を格納するＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等である。 The communication device 1200 is a device for connecting the management server 1000 to the management network 5000. The management server 1000 can communicate with a program running on the server 3000 through the management network 5000. The processor 1300 executes various programs expanded on the memory 1100. The output device 1400 is a device that outputs a processing result executed by the management server 1000, such as a display. The input device 1500 is a device for the administrator to input an instruction to the management server 1000, such as a keyboard. The storage device 1600 is an HDD (Hard Disk Drive), an SSD (Solid State Drive), or the like that stores information.

なお、図２に示す例では、各種プログラム及びテーブルは、メモリ１１００に格納されているが、記憶デバイス１６００または他の記憶媒体（図示しない）に格納されても良い。この場合、プロセッサ１３００は、プログラム実行時にメモリ１１００上に対象のプログラムを読みだし、読みだしたプログラムを実行する。 In the example shown in FIG. 2, various programs and tables are stored in the memory 1100, but may be stored in the storage device 1600 or another storage medium (not shown). In this case, the processor 1300 reads the target program on the memory 1100 when the program is executed, and executes the read program.

また、ストレージ装置２０００のメモリ２１００に、前述のプログラム及びテーブルが格納され、ストレージ装置２０００または物理サーバ３０００が、格納されたプログラムを実行しても良い。また、他のサーバ３０００またはスイッチ（図示しない）等の他の装置が、前述のプログラム及びテーブルを格納し、実行しても良い。 Further, the above-mentioned programs and tables may be stored in the memory 2100 of the storage device 2000, and the storage device 2000 or the physical server 3000 may execute the stored program. In addition, another server 3000 or another device such as a switch (not shown) may store and execute the above-mentioned programs and tables.

ストレージ装置２０００は、メモリ２１００、論理ボリューム提供部２２００、ディスクＩ／Ｆコントローラ２３００、管理Ｉ／Ｆ２４００、プロセッサ２５００、及びデータＩ／Ｆ２６００を備えている。これらはストレージ装置２０００内の内部バス等の通信路２７００を介して接続される。メモリ２１００は、ディスクキャッシュ２１１０を有する。また、メモリ２１００は、構成性能情報収集プログラム２１２０を格納する。ディスクキャッシュ２１１０は、情報を一時格納するための記憶領域である。構成性能情報収集プログラム２１２０は、ストレージ装置２０００の管理情報及び性能情報等を管理サーバ１０００との間で送受信するためのプログラムである。構成変更プログラム２１３０は、管理サーバ１０００の構成変更プログラム１１９０から呼び出され、ストレージ装置２０００の構成変更をおこなうためのプログラムである。 The storage device 2000 includes a memory 2100, a logical volume providing unit 2200, a disk I / F controller 2300, a management I / F 2400, a processor 2500, and a data I / F 2600. These are connected via a communication path 2700 such as an internal bus in the storage device 2000. The memory 2100 has a disk cache 2110. In addition, the memory 2100 stores the configuration performance information collection program 2120. The disk cache 2110 is a storage area for temporarily storing information. The configuration performance information collection program 2120 is a program for transmitting and receiving management information and performance information of the storage device 2000 to and from the management server 1000. The configuration change program 2130 is a program called from the configuration change program 1190 of the management server 1000 to change the configuration of the storage device 2000.

論理ボリューム提供部２２００は、物理領域２２３０によって構成されるディスクプール２２２０を備え、ディスクプール２２２０の記憶領域を論理的に分割し、当該論理的に分割された記憶領域をボリューム２２１０として提供する。ここで物理領域２２３０は、物理ディスクや複数の物理ディスクから構成されるパリティグループなどである。当該ストレージ装置２０００外の装置からはボリューム２２１０経由で物理領域にアクセスすることが可能である。 The logical volume providing unit 2200 includes a disk pool 2220 composed of a physical area 2230, logically divides the storage area of the disk pool 2220, and provides the logically divided storage area as the volume 2210. Here, the physical area 2230 is a physical disk, a parity group composed of a plurality of physical disks, or the like. It is possible to access the physical area from a device other than the storage device 2000 via the volume 2210.

なお、物理領域２２３０には物理領域番号が付され、ディスクプール２２２０にはディスクプール番号が付され、ボリューム２２１０にはボリューム番号が付される。これによって、ストレージ装置２０００は、物理領域２２３０、ディスクプール２２２０及びボリューム２２１０をそれぞれ一意に識別することができる。 The physical area 2230 is assigned a physical area number, the disk pool 2220 is assigned a disk pool number, and the volume 2210 is assigned a volume number. As a result, the storage device 2000 can uniquely identify the physical area 2230, the disk pool 2220, and the volume 2210, respectively.

図２に示す例では、１つの物理領域（パリティグループＰＧ１）から構成されるディスクプール２２２０（ＰＯＯＬ１）が論理的に分割され、１つのボリューム２２１０（Ｖｏｌ１）がストレージ装置２０００外の装置、例えばサーバ３０００に提供される。 In the example shown in FIG. 2, the disk pool 2220 (POOL1) composed of one physical area (parity group PG1) is logically divided, and one volume 2210 (Vol1) is a device other than the storage device 2000, for example, a server. Provided to 3000.

ディスクＩ／Ｆコントローラ２３００は、論理ボリューム提供部２２００に接続するためのインタフェースデバイスである。管理Ｉ／Ｆ２４００は管理用ネットワーク５０００に接続するためのインタフェースデバイスである。プロセッサ２５００は、メモリ２１００上に展開されたプログラムを実行する。 The disk I / F controller 2300 is an interface device for connecting to the logical volume providing unit 2200. The management I / F 2400 is an interface device for connecting to the management network 5000. The processor 2500 executes a program expanded on the memory 2100.

データＩ／Ｆ２６００は、ＳＡＮ４０００に接続するためのインタフェースデバイスである。図２示す例では、構成性能情報収集プログラム２１２０は、及び構成変更プログラム２１３０はメモリ２１００に格納されているが、他の記憶装置（図示しない）または、他の記憶媒体（図示しない）に格納されても良い。この場合、プロセッサ２５００は、処理実行時にメモリ２１００上に構成性能情報収集プログラム２１２０及び構成変更プログラム２１３０を読みだし、読みだしたプログラムを実行する。 The data I / F 2600 is an interface device for connecting to the SAN 4000. In the example shown in FIG. 2, the configuration performance information collection program 2120 and the configuration change program 2130 are stored in the memory 2100, but are stored in another storage device (not shown) or another storage medium (not shown). You may. In this case, the processor 2500 reads the configuration performance information collection program 2120 and the configuration change program 2130 on the memory 2100 at the time of processing execution, and executes the read programs.

また、論理ボリューム提供部２２００は、１つのディスクプール２２２０の全記憶領域を１つのボリューム２２１０として作成しても良い。また、論理ボリューム提供部２２００は、物理領域２２３０としてパリティグループ以外、例えば物理ディスクそのものや、フラッシュメモリ等の記憶媒体でも良い。 Further, the logical volume providing unit 2200 may create the entire storage area of one disk pool 2220 as one volume 2210. Further, the logical volume providing unit 2200 may be a physical area 2230 other than the parity group, for example, a physical disk itself or a storage medium such as a flash memory.

サーバ３０００は、メモリ３１００、データＩ／Ｆ３２００、プロセッサ３３００、及び管理Ｉ／Ｆ３４００を備えた物理サーバである。これらはサーバ３０００の内部バス等の通信路３５００を介して互いに接続される。 Server 3000 is a physical server with memory 3100, data I / F 3200, processor 3300, and management I / F 3400. These are connected to each other via a communication path 3500 such as an internal bus of the server 3000.

メモリ３１００は、構成情報収集プログラム３１１０、業務プログラム３１２０、構成変更プログラム３１３０を格納する。構成情報収集プログラム３１１０は、サーバ３０００の管理情報、性能情報等を管理サーバ１０００との間で送受信するためのプログラムである。業務プログラム３１２０は、３０００が実行する業務を実現するためのプログラムであり、例えば、ＤＢＭＳ（ＤａｔａＢａｓｅＭａｎａｇｅｍｅｎｔＳｙｓｔｅｍ）やファイルシステム等である。構成変更プログラム３１３０は、管理サーバ１０００の構成変更プログラム１１９０から予備だされ、サーバ３０００の構成変更をおこなうためのプログラムである。 The memory 3100 stores the configuration information collection program 3110, the business program 3120, and the configuration change program 3130. The configuration information collection program 3110 is a program for transmitting and receiving management information, performance information, and the like of the server 3000 to and from the management server 1000. The business program 3120 is a program for realizing the business executed by 3000, and is, for example, a DBMS (Data Base Management System), a file system, or the like. The configuration change program 3130 is a program reserved from the configuration change program 1190 of the management server 1000 and for changing the configuration of the server 3000.

サーバ３０００は、ストレージ装置２０００から提供されたボリューム２２１０を用いて、各種業務を実行する。図２に示す例では、各種プログラムはメモリ３１００上に格納されているが、他の記憶装置（図示しない）に格納されていても良い。この場合、プロセッサ３３００は、処理実行時にメモリ３１００上の対象のプログラムを読みだし、読みだしたプログラムを実行する。図２に示す例では、サーバＡとストレージ装置Ａとは、ＳＡＮ４０００を介して互いに接続される。ストレージ装置２０００と物理サーバ３０００との間の接続は、ファイバチャネルを介して直接接続されるものに限定されず、１台以上のファイバチャネルスイッチ等のネットワーク機器を介して接続されても良い。また、ストレージ装置２０００と物理サーバ３０００との間の接続は、データ通信用のネットワークであれば良く、ＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）ネットワークでも良い。 The server 3000 uses the volume 2210 provided by the storage device 2000 to execute various tasks. In the example shown in FIG. 2, various programs are stored in the memory 3100, but may be stored in another storage device (not shown). In this case, the processor 3300 reads the target program on the memory 3100 at the time of processing execution, and executes the read program. In the example shown in FIG. 2, the server A and the storage device A are connected to each other via the SAN 4000. The connection between the storage device 2000 and the physical server 3000 is not limited to those directly connected via Fiber Channel, and may be connected via network devices such as one or more Fiber Channel switches. Further, the connection between the storage device 2000 and the physical server 3000 may be any network for data communication, and may be an IP (Internet Protocol) network.

図３は、実施例１に係る関連情報テーブルの一例を示す図である。関連情報テーブルには、その性能を目的情報とする管理対象オブジェクトと、それに論理的に関連づけられた管理対象オブジェクトとを示す関連情報が格納される。オブジェクトは計算機システムの構成要素である。なお、構成要素には、物理的に存在する構成要素と、論理的に定義された構成要素が含まれる。一例として、関連情報テーブル１１１０は、サーバ３０００上で動作している業務プログラム３１２０から、サーバ３０００が使用しているボリュームを構成する物理領域までのＩ／Ｏ（入出力）経路上に存在する物理／仮想の装置、デバイスを示す情報、すなわち、Ｉ／Ｏ経路に基づく装置及びデバイスの論理的な関係を示す情報を管理する。ここで、論理的な関係は、「ボリューム」と「ボリュームを構成するプール」、「ボリューム」と「ボリュームへのＩ／Ｏ処理を担当するプロセッサ」、「ボリューム」と「ボリュームへのＩ／Ｏを一時的に格納するキャッシュ」など、設定に基づいて格納される。 FIG. 3 is a diagram showing an example of a related information table according to the first embodiment. The related information table stores related information indicating the managed object whose purpose information is its performance and the managed object logically associated with the managed object. Objects are components of a computer system. The component includes a physically existing component and a logically defined component. As an example, the related information table 1110 is a physical existing on an I / O (input / output) path from the business program 3120 running on the server 3000 to the physical area constituting the volume used by the server 3000. / Manages information indicating virtual devices and devices, that is, information indicating logical relationships between devices and devices based on I / O routes. Here, the logical relationship is "volume" and "pool constituting the volume", "volume" and "processor in charge of I / O processing to the volume", "volume" and "I / O to the volume". It is stored based on the settings such as "cache that temporarily stores".

関連情報テーブル１１１０には、装置ＩＤ１１１１、ボリュームＩＤ１１１２、プロセッサＩＤ１１１３、キャッシュＩＤ１１１４、プールＩＤ１１１５、物理領域ＩＤ１１１６のフィールドがある。 Related information The table 1110 has fields for device ID 1111, volume ID 1112, processor ID 1113, cache ID 1114, pool ID 1115, and physical area ID 1116.

装置ＩＤ１１１１にはストレージ２０００を一意に識別するための識別子が格納される。ボリュームＩＤ１１１２には、ボリューム２２１０を一意に識別するための識別子が格納される。プロセッサＩＤ１１１３には、ボリュームＩＤ１１１２で示されるボリュームへの処理を担当するプロセッサ２５００の識別子が格納される。キャッシュＩＤ１１１４には、ボリュームＩＤ１１１２で示されるボリュームへの処理がキャッシュされるディスクキャッシュ２１１０を一意に示す識別子が格納される。プールＩＤ１１１５には、ボリューム２２１０が作成されているディスクプール２２２０を一意に識別するための識別子が格納される。物理領域ＩＤ１１１６には、ディスクプールを構成する物理領域２２３０、例えばパリティグループやディスク等を一意に識別するための識別子が格納される。以上の各カラムのフィールドには計算機システムから収集された情報が格納される。情報の収集および格納の方法は特に限定されない。 An identifier for uniquely identifying the storage 2000 is stored in the device ID 1111. The volume ID 1112 stores an identifier for uniquely identifying the volume 2210. The processor ID 1113 stores the identifier of the processor 2500 in charge of processing the volume indicated by the volume ID 1112. The cache ID 1114 stores an identifier uniquely indicating the disk cache 2110 in which the processing for the volume indicated by the volume ID 1112 is cached. The pool ID 1115 stores an identifier for uniquely identifying the disk pool 2220 in which the volume 2210 is created. The physical area ID 1116 stores an identifier for uniquely identifying the physical area 2230 constituting the disk pool, for example, a parity group or a disk. Information collected from the computer system is stored in the fields of each of the above columns. The method of collecting and storing information is not particularly limited.

ここで、本実施例に係る関連情報テーブル１１１０は、装置ＩＤ１１１１とボリュームＩＤ１１１２とボリュームに係る管理対象オブジェクトとして、プロセッサ２５００、ディスクキャッシュ２１１０、ディスクプール２２２０、物理領域２２３０の情報を含んでいるが、本発明がこれに限定されることは無い。ＩＴシステムにおける如何なる管理対象オブジェクトであっても同様に扱うことができる。 Here, the related information table 1110 according to the present embodiment includes information on the processor 2500, the disk cache 2110, the disk pool 2220, and the physical area 2230 as managed objects related to the device ID 1111 and the volume ID 1112. However, the present invention is not limited to this. Any managed object in the IT system can be handled in the same way.

他の例として、業務アクセスの際に利用される管理対象オブジェクトである、サーバ３０００のマウントポイントをサーバ内で一意に識別するためのドライブ、サーバ３０００がボリュームＩＤ１１３６によって示されるボリューム２２１０にアクセスする際に利用されるサーバ３０００のデータＩ／Ｆ３２００を一意に識別するためのサーバデータＩ／Ｆ、などの物理、仮想含むその他の管理対象オブジェクトを識別するための識別子などが格納されても良い。 As another example, a drive for uniquely identifying the mount point of the server 3000, which is a managed object used for business access, the server 3000 accesses the volume 2210 indicated by the volume ID 1136. An identifier for identifying other managed objects including physical and virtual objects such as server data I / F for uniquely identifying the data I / F 3200 of the server 3000 used at the time may be stored. ..

また、スイッチのデータＩ／Ｆ等の情報を含んでもよく、また、業務サーバであるサーバ３０００上の業務プログラム（ＤＢＭＳ等）の情報等を関連付けて格納してもよい。また、業務プログラムの実行する処理単位の情報等を関連付けて格納しても良く、例えば、業務プログラムにおける処理Ａと、当該処理の実行に利用されるサーバ、当該サーバのＣＰＵ、メモリなどを関連付けて格納しても良い。 Further, information such as switch data I / F may be included, and information of a business program (DBMS or the like) on the server 3000, which is a business server, may be associated and stored. Further, information of a processing unit executed by the business program may be stored in association with each other. For example, processing A in the business program may be associated with a server used for executing the processing, a CPU of the server, memory, and the like. You may store it.

図４は、実施例１に係る性能履歴情報テーブル１１２０の一例を示す図である。性能履歴情報テーブル１１２０計算機システムの運用により各管理対象オブジェクトから取得された性能の履歴が格納される。性能履歴情報テーブル１１２０は、管理対象オブジェクトの性能、例えばストレージ装置２０００におけるボリューム２２１０、ディスクプール２２２０等に関する性能の情報を管理する。性能履歴情報テーブル１１２０にはエントリを追加するができる。 FIG. 4 is a diagram showing an example of the performance history information table 1120 according to the first embodiment. Performance history information table 1120 The performance history acquired from each managed object by the operation of the computer system is stored. The performance history information table 1120 manages performance information regarding the performance of the managed object, for example, the volume 2210, the disk pool 2220, and the like in the storage device 2000. An entry can be added to the performance history information table 1120.

性能履歴情報テーブル１１２０は、時刻１１２１、装置ＩＤ１１２２、デバイスＩＤ１１２３、メトリック１１２４、性能値１１２５のフィールドを含む。 The performance history information table 1120 includes fields for time 1121, device ID 1122, device ID 1123, metric 1124, and performance value 1125.

時刻１１２１には、情報を管理対象オブジェクトから収集した日時のデータが格納される。装置ＩＤ１１２２には、装置を一意に特定する識別子（装置ＩＤ）が格納される。デバイスＩＤ１１２３には、性能情報の取得対象となるデバイスを一意に特定するための識別子（デバイスＩＤ）が格納される。 At time 1121, data on the date and time when the information was collected from the managed object is stored. An identifier (device ID) that uniquely identifies the device is stored in the device ID 1122. The device ID 1123 stores an identifier (device ID) for uniquely identifying the device for which performance information is to be acquired.

メトリック１１２４には、ＣＰＵ使用率、記憶装置に対する単位時間（例えば１秒）あたりのＩ／Ｏ回数（ＩＯＰＳ）、リクエストに対するレスポンスの時間等の、性能情報の種類を示す情報が格納される。性能値１１２５には、デバイスＩＤ１１２３によって示されたデバイスの、メトリック１１２４によって示された種類の性能情報の値が、デバイスを含む装置から取得されて格納される。 The metric 1124 stores information indicating the type of performance information, such as the CPU usage rate, the number of I / Os (IOPS) per unit time (for example, 1 second) for the storage device, and the response time to the request. In the performance value 1125, the value of the performance information of the type indicated by the metric 1124 of the device indicated by the device ID 1123 is acquired from the device including the device and stored.

ここで、図４に示す性能履歴情報テーブル１１２０では、装置ＩＤ１１２２とデバイスＩＤ１１２３によって示される、性能情報の取得対象のデバイスとして、ストレージのボリューム２２１０、プロセッサ２５００、ディスクキャッシュ２１１０をあげたが、これらに限定されない。ＶＭ（図示しない）、ストレージのデータＩ／Ｆ２６００、サーバのデータＩ／Ｆ３２００や、スイッチやスイッチのポート（図示しない）等でもよい。 Here, in the performance history information table 1120 shown in FIG. 4, the storage volume 2210, the processor 2500, and the disk cache 2110 are listed as the devices for which the performance information is acquired, which are indicated by the device ID 1122 and the device ID 1123. Not limited to these. It may be a VM (not shown), a storage data I / F 2600, a server data I / F 3200, a switch or a switch port (not shown), or the like.

また、図４には、メトリックの一例として、リクエストに対する応答性能、ＣＰＵ使用率、キャッシュ使用率、ＩＯＰＳ、リクエストに対するレスポンスの時間等を示したが、これらに限定されることはない。Ｉ／Ｏビジー率、転送レート、スループット、データベース管理ソフトのバッファヒット率、挿入、更新、あるいは削除されたレコード数、Ｗｅｂサーバのレスポンスの時間、ファイルシステムあるいはディスクの空き容量あるいは利用率、入出力データ量、ネットワークインタフェースのエラー回数、バッファのオーバーフロー、及びフレームのエラー等の他の性能指標がメトリックとして用いられてもよい。 Further, FIG. 4 shows, as an example of the metric, the response performance to the request, the CPU usage rate, the cache usage rate, the IOPS, the response time to the request, and the like, but the present invention is not limited thereto. I / O busy rate, transfer rate, throughput, buffer hit rate of database management software, number of records inserted, updated or deleted, response time of Web server, free space or utilization rate of file system or disk, input / output Other performance indicators such as data volume, number of network interface errors, buffer overflows, and frame errors may be used as metrics.

図５Ａ、図５Ｂ、図５Ｃは、実施例１に係る構成情報テーブル１１３０の一例を示す図である。図５Ａ、図５Ｂには、後述する図１０のステップ３０１における構成変更プログラム１１９０による操作実行前の状態が示されている。図５Ｃには、図１０のステップ３０１における構成変更プログラム１１９０による操作実行後の状態が示されている。 5A, 5B, and 5C are diagrams showing an example of the configuration information table 1130 according to the first embodiment. 5A and 5B show a state before the operation is executed by the configuration change program 1190 in step 301 of FIG. 10, which will be described later. FIG. 5C shows the state after the operation is executed by the configuration change program 1190 in step 301 of FIG.

構成情報テーブル１１３０には、管理対象オブジェクトの構成情報が格納される。例えば、管理対象オブジェクトであるストレージ装置２０００についての構成情報であるディスクキャッシュ２１１０のキャッシュサイズが格納される。また、物理領域（パリティグループ）２２３０のディスク構成が格納される。構成情報テーブル１１３０には一般的な手段によりエントリが追加される。 The configuration information table 1130 stores the configuration information of the managed object. For example, the cache size of the disk cache 2110, which is the configuration information for the storage device 2000, which is a managed object, is stored. In addition, the disk configuration of the physical area (parity group) 2230 is stored. Entries are added to the configuration information table 1130 by common means.

構成情報テーブル１１３０には、装置ＩＤ１１３１、デバイスＩＤ１１３２、メトリック１１３３、値１１３４のフィールドが含まれている。装置ＩＤ１１３１には装置を一意に特定するための識別子が格納される。デバイスＩＤ１１３２には、構成情報の取得対象となるデバイスを一意に特定するための識別子が格納される。メトリック１１３３には、記憶容量や処理能力など構成情報の種類を示す情報が格納される。値１１３４には、デバイスＩＤ１１３２によって示されたデバイスの、メトリック１１３３によって示された種類の構成情報についての値が格納される。この値はデバイスを含む装置から取得されたものである。 The configuration information table 1130 includes fields for device ID 1131, device ID 1132, metric 1133, and value 1134. The device ID 1131 stores an identifier for uniquely identifying the device. The device ID 1132 stores an identifier for uniquely identifying the device for which the configuration information is to be acquired. Information indicating the type of configuration information such as storage capacity and processing capacity is stored in the metric 1133. The value 1134 stores values for the type of configuration information indicated by the metric 1133 for the device indicated by device ID 1132. This value is obtained from the device including the device.

ここで、図５Ａ〜図５Ｃに示す構成情報テーブル１１３０において装置ＩＤ１１３１およびデバイスＩＤ１１３２によって示されているデバイスが構成情報を取得する対象となる。ここでは、構成情報の取得対象のデバイスとして、ストレージ２０００のディスクキャッシュ２１１０（Ｃａｃｈｅ１）、物理領域２２３０（ＰＧ１、ＰＧ５）をあげたが、これらに限定されることはない。その他の管理対象オブジェクトの構成情報を保持しても良い。また、ここではメトリックの一例として、キャッシュのサイズ、パリティグループのＲＡＩＤレベル、およびディスク種別を挙げたが、これに限定されない。 Here, in the configuration information table 1130 shown in FIGS. 5A to 5C, the devices indicated by the device ID 1131 and the device ID 1132 are the targets for acquiring the configuration information. Here, the disk cache 2110 (Cache1) and the physical area 2230 (PG1, PG5) of the storage 2000 are mentioned as the devices for which the configuration information is acquired, but the devices are not limited thereto. The configuration information of other managed objects may be retained. Further, as an example of the metric, the cache size, the RAID level of the parity group, and the disk type are mentioned here, but the metric is not limited thereto.

図６は、実施例１に係る予測式元情報テーブル１１４０の一例を示す図である。予測式元情報テーブル１１４０は、予測式を生成するための元になる情報を管理するためのテーブルである。予測式元情報テーブル１１４０には、予測したい管理対象オブジェクトおよびそのパラメータと、その予測したい管理対象オブジェクトとＩ／Ｏパス上において関連を持つ他の管理対象オブジェクトおよびそのパラメータとが管理される。予測したい管理対象オブジェクトおよびそのパラメータが予測式の目的情報となり、関連する管理対象オブジェクトおよびそのパラメータが説明情報となる。 FIG. 6 is a diagram showing an example of the prediction formula source information table 1140 according to the first embodiment. The prediction formula source information table 1140 is a table for managing the information that is the basis for generating the prediction formula. In the prediction formula source information table 1140, the managed object to be predicted and its parameters, and other managed objects and their parameters related to the managed object to be predicted on the I / O path are managed. The managed object to be predicted and its parameters are the objective information of the prediction formula, and the related managed objects and their parameters are explanatory information.

予測式元情報テーブル１１４０には、時刻情報１１４１、目的情報１１４１１、および関連情報１１４１２のフィールドが含まれる。時刻情報１１４１には、情報を管理対象オブジェクトから収集した日時のデータが格納される。目的情報１１４１１には、予測したい管理対象オブジェクト識別情報と、当該管理対象オブジェクトのパラメータの値が格納される。関連情報１１４１２には、予測したい管理対象オブジェクトとＩ／Ｏパス上において関連を持つ、他管理対象オブジェクトのパラメータの値の情報が格納される。本実施例では、目的情報１１４１１として、装置ＩＤ１１４２、ボリュームＩＤ１１４３、およびボリューム応答性能１１４４が格納される。関連情報１１４１２には、ＰｒｏｃｅｓｓｏｒＢｕｓｙ１１４５、ＣａｃｈｅＵｓａｇｅ１１４６、ＣａｃｈｅＳｉｚｅ１１４７、ＰｏｏｌＢｕｓｙ１１４８、ＰＧ数１１４９のフィールドが含まれている。 The prediction formula source information table 1140 includes fields for time information 1141, purpose information 11411, and related information 11412. The time information 1141 stores data on the date and time when the information was collected from the managed object. The management target object identification information to be predicted and the parameter values of the management target object are stored in the target information 11411. The related information 11412 stores information on the parameter values of other managed objects that are related to the managed object to be predicted on the I / O path. In this embodiment, the device ID 1142, the volume ID 1143, and the volume response performance 1144 are stored as the target information 11411. Related information 11412 includes fields of Processor Busy 1145, Cache Use 1146, Cache Size 1147, Pool Busy 1148, and PG number 1149.

装置ＩＤ１１４２には、装置を一意に特定する識別子（装置ＩＤ）が格納される。ボリュームＩＤ１１４３には、管理対象オブジェクトを一意に特定するための識別子が格納される。ボリューム応答性能１１４４には、ボリュームでのＩ／Ｏ要求受信から処理完了までにかかる時間情報が格納される。ここでは、目的情報１１４１１の一例としてボリュームの応答性能を挙げ、関連情報の一例として、ＰｒｏｃｅｓｓｏｒＢｕｓｙ１１４５、ＣａｃｈｅＵｓａｇｅ１１４６等を挙げたが、これに限定されない。 An identifier (device ID) that uniquely identifies the device is stored in the device ID 1142. The volume ID 1143 stores an identifier for uniquely identifying the managed object. The volume response performance 1144 stores information on the time required from the reception of the I / O request on the volume to the completion of processing. Here, the response performance of the volume is given as an example of the target information 11411, and the Processor Busy1145, the Cache Use 1146, and the like are given as an example of the related information, but the present invention is not limited thereto.

図６に示したテーブルに格納されている値および情報のうち、時刻情報１１４１が１０：０１の情報および１０：０２の情報は、後述する図１０のステップ３０１における構成変更プログラム１１９０による操作実行前の状態を示し、時刻情報１１４１が１５：１０の情報および１５：１１の情報は、図１０のステップ３０１における構成変更プログラム１１９０による操作実行後の状態を示す。 Among the values and information stored in the table shown in FIG. 6, the information in which the time information 1141 is 10:01 and the information in 10:02 are before the operation execution by the configuration change program 1190 in step 301 of FIG. The information in which the time information 1141 is 15:10 and the information in 15:11 indicate the state after the operation is executed by the configuration change program 1190 in step 301 of FIG.

図７は、実施例１に係る予測式テーブル１１５０の一例を示す図である。予測式テーブル１１５０は、予測式を表す情報を管理するためのテーブルである。予測式テーブル１１５０には、予測式で用いられるメトリック、及び各メトリックにかかる係数などが格納される。予測式ｈ、具体的には、目的情報＝説明情報１＋説明情報２＋説明情報３＋説明情報４・・・と表すことができる。より具体的には、ストレージＡのボリューム１の応答性能＝係数１×ＰｒｏｃｅｓｓｏｒＢｕｓｙ＋係数２×ＣａｃｈｅＳｉｚｅ＋係数３×ＰｏｏｌＢｕｓｙ＋係数４×ＰＧ数という学習により得られる関数の情報である。 FIG. 7 is a diagram showing an example of the prediction formula table 1150 according to the first embodiment. The prediction formula table 1150 is a table for managing information representing the prediction formula. The prediction formula table 1150 stores the metrics used in the prediction formula, the coefficients related to each metric, and the like. The prediction formula h, specifically, objective information = explanatory information 1 + explanatory information 2 + explanatory information 3 + explanatory information 4 ... Can be expressed. More specifically, it is the information of the function obtained by learning that the response performance of the volume 1 of the storage A = coefficient 1 × Processor Busy + coefficient 2 × Cache Size + coefficient 3 × Pool Busy + coefficient 4 × number of PGs.

予測式テーブル１１５０は、目的情報１１５１１と説明情報１１５１２のフィールドを含む。目的情報１１５１１には、予測したい管理対象オブジェクトの識別情報と、当該管理対象オブジェクトのパラメータの値が格納される。説明情報１１５１２には、予測したい管理対象オブジェクトのパラメータの値を説明可能なその他管理対象オブジェクトのパラメータおよびその値の情報が格納される。本実施例には、目的情報１１５１１として、装置ＩＤ１１５１、デバイスＩＤ１１５２、メトリック１１５３が管理され、説明情報１１５１２として、ＰｒｏｃｅｓｓｏｒＢｕｓｙ１１５４、ＣａｃｈｅＳｉｚｅ１１５５、ＰｏｏｌＢｕｓｙ１１５６、ＰＧ数１１５７、及び各メトリックに対する係数を表すフィールドを含む。ここでは、目的情報１１４１１の一例としてボリュームの応答性能を、関連情報の一例として、ＰｒｏｃｅｓｓｏｒＢｕｓｙ１１５４、ＣａｃｈｅＳｉｚｅ１１５５等を挙げたが、これに限定されない。 The prediction table 1150 includes fields for objective information 11511 and explanatory information 11512. The objective information 11511 stores the identification information of the managed object to be predicted and the value of the parameter of the managed object. The explanatory information 11512 stores information on the parameters of other managed objects and their values that can explain the values of the parameters of the managed object to be predicted. In this embodiment, the device ID 1151, the device ID 1152, and the metric 1153 are managed as the target information 11511, and the processor Busy 1154, the Cache Size 1155, the Pool Busy 1156, the PG number 1157, and the fields representing the coefficients for each metric are used as the explanatory information 11512. Including. Here, the response performance of the volume is given as an example of the target information 11411, and the Processor Busy1154, Cache Size1155, and the like are given as an example of the related information, but the present invention is not limited thereto.

また、ここでは予測式は線形関係を表す式であるものとし、予測式テーブル１１５０は、データに最も良くあてはまる線形関係を特定するための回帰分析の式を表すものとしたが、これに限定されない。他の例として、予測式は多項式であるものとし、予測式テーブル１１５０は多項式を表す情報を管理することにしても良い。 Further, here, the prediction formula is assumed to be a formula expressing a linear relationship, and the prediction formula table 1150 is assumed to represent a regression analysis formula for identifying the linear relation that best fits the data, but the present invention is not limited to this. .. As another example, the prediction formula may be a polynomial, and the prediction formula table 1150 may manage information representing the polynomial.

次に、管理サーバ１０００が実行する各処理について説明する。 Next, each process executed by the management server 1000 will be described.

図８は、実施例１に係わる予測式を生成する処理のフローチャートである。予測式の生成とは、各オブジェクトにおける各種情報を学習データとして収集し、学習することで、目的とする要素とそれ以外の要素との関連について、学習データに最も良くあてはまる関数を特定することである。本予測式生成処理は、管理サーバ１０００のプロセッサ１３００が、メモリ１１００上に展開された予測式生成プログラム１１７０を実行することによっておこなわれる。以下、本フローチャートの具体例を示す。 FIG. 8 is a flowchart of a process for generating a prediction formula according to the first embodiment. Prediction formula generation is to collect various information in each object as learning data and learn it to identify the function that best applies to the training data regarding the relationship between the target element and other elements. is there. This prediction formula generation process is performed by the processor 1300 of the management server 1000 executing the prediction formula generation program 1170 expanded on the memory 1100. A specific example of this flowchart is shown below.

まず、予測式生成プログラム１１７０は、図３に例示した関連情報テーブル１１１０を参照し、予測式生成対象とする構成要素と、それに関連する構成要素とを特定する（ステップ１０１）。ここで、予測式生成対象の構成要素は、ユーザにより選択される、あるいは予測式生成プログラムにより自動的に選定される（例えば全てのボリューム応答性能について実行するなど）など、どのような方法によって選択され、指定されても良い。また、予測式生成プログラム１１７０が起動するタイミングは、定期的な実行、ユーザが指定した任意のタイミングで実行など任意である。 First, the prediction formula generation program 1170 refers to the related information table 1110 illustrated in FIG. 3 and identifies the component to be generated for the prediction formula and the component related thereto (step 101). Here, the component of the prediction formula generation target is selected by any method, such as being selected by the user or automatically selected by the prediction formula generation program (for example, executing for all volume response performances). And may be specified. Further, the timing at which the prediction formula generation program 1170 is started is arbitrary, such as periodic execution or execution at an arbitrary timing specified by the user.

ここでは具体例として、ユーザによりボリュームＩＤ“Ｖｏｌ１”で表されるボリュームが予測式を生成する対象として選択されたとする。この場合、予測式生成プログラム１１７０は、図３の関連情報テーブル１１１０に格納されている情報から、Ｖｏｌ１（Ｖｏｌｕｍｅ１）に関連する構成要素として、Ｐｒｏｃｅｓｓｏｒ１、Ｃａｃｈｅ１、Ｐｏｏｌ１、およびＰＧ１が特定される。 Here, as a specific example, it is assumed that the volume represented by the volume ID “Vol1” is selected by the user as the target for generating the prediction formula. In this case, the prediction formula generation program 1170 identifies Processor1, Cache1, Pool1, and PG1 as the components related to Vol1 (Volume1) from the information stored in the related information table 1110 of FIG.

図８に戻り、次に、予測式生成プログラム１１７０は、図４に例示した性能履歴情報テーブル１１２０を参照し、予測式を生成する対象の構成要素、及び、それに関連するものとして、ステップ１０１で特定した構成要素の性能履歴情報を取得する（ステップ１０２）。例えば、時刻１０：０１に取得されたＶｏｌｕｍｅ１の応答時間が１０．２ｍｓｅｃ、Ｐｒｏｃｅｓｓｏｒ１の使用率（Ｂｕｓｙ％）が４０％、Ｃａｃｈｅ１の使用率（Ｕｓａｇｅ％）が８０％、Ｐｏｏｌ１の単位時間当たりのＩ／Ｏ回数が７００ＩＯＰＳで使用率（Ｂｕｓｙ％）が３５％であったという性能の情報が取得される。 Returning to FIG. 8, the prediction formula generation program 1170 refers to the performance history information table 1120 illustrated in FIG. 4, and sets the target component for generating the prediction formula, and as related to the component, in step 101. Acquire the performance history information of the specified component (step 102). For example, the response time of Volume 1 acquired at 10:01 is 10.2 msec, the usage rate of Processor 1 (Busy%) is 40%, the usage rate of Cake 1 (Usage%) is 80%, and I per unit time of Pool 1. Performance information that the number of times / O was 700 IOPS and the usage rate (Busy%) was 35% is acquired.

次に、予測式生成プログラム１１７０は、図５Ａ、図５Ｂに例示した構成情報テーブル１１３０を参照し、予測式生成対象の構成要素、およびステップ１０１で特定した構成要素の構成情報を取得する（ステップ１０３）。例えば、図５Ａからは、ストレージＡのＣａｃｈｅ１のサイズが８ＧＢであるという構成情報が取得される。また、図５Ｂから、例えば、ストレージＡの物理領域ＰＧ１のＲＡＩＤレベルがＲＡＩＤ５（３Ｄ＋１Ｐ）である等の構成情報が取得される。 Next, the prediction formula generation program 1170 refers to the configuration information table 1130 illustrated in FIGS. 5A and 5B, and acquires the component of the prediction formula generation target and the component information of the component specified in step 101 (step). 103). For example, from FIG. 5A, configuration information that the size of Cache1 of the storage A is 8 GB is acquired. Further, from FIG. 5B, configuration information such as, for example, the RAID level of the physical area PG1 of the storage A is RAID5 (3D + 1P) is acquired.

次に、予測式生成プログラム１１７０は、ステップ１０２およびステップ１０３で取得した予測式生成に関連する情報を、図６に例示した予測式元情報テーブル１１４０に格納する（ステップ１０４）。図６を参照すると、Ｖｏｌｕｍｅ１の予測式元情報テーブル１１４０に、例えば、時刻１０：０１に取得された性能情報が格納されている。 Next, the prediction formula generation program 1170 stores the information related to the prediction formula generation acquired in steps 102 and 103 in the prediction formula source information table 1140 illustrated in FIG. 6 (step 104). With reference to FIG. 6, the performance information acquired at, for example, 10:01 is stored in the prediction formula source information table 1140 of Volume 1.

最後に、予測式生成プログラム１１７０は、ステップ１０４で生成した予測式元情報テーブル１１４０の情報から予測式を生成し、図７に例示した予測式テーブル１１５０に格納する（ステップ１０５）。例えば、図７の予測式テーブル１１５０には、（ストレージＡのＶｏｌｕｍｅ１の応答性能）＝３３．７６（係数１）×プロセッサ使用率＋７．２７（係数２）×キャッシュサイズ＋５．１（係数３）×Ｐｏｏｌの使用率＋０．８０（係数４）×物理領域ＰＧ数という予測式が格納されている。 Finally, the prediction formula generation program 1170 generates a prediction formula from the information in the prediction formula source information table 1140 generated in step 104, and stores it in the prediction formula table 1150 illustrated in FIG. 7 (step 105). For example, in the prediction formula table 1150 of FIG. 7, (response performance of Volume 1 of storage A) = 33.76 (coefficient 1) × processor usage rate +7.27 (coefficient 2) × cache size +5.1 (coefficient 3). A prediction formula of × Pool usage rate + 0.80 (coefficient 4) × number of physical region PGs is stored.

ステップ１０５にて予測式を生成する手法は特に限定されず、回帰分析などの一般的な手法を含め、どのような手法であってもよい。回帰分析の場合、例えば、予測式元情報テーブル１１４０に示された関連情報１１４１２の全てを説明変数として設定した上で、目的情報との関連性の低い変数を説明変数から外していくなどの方法で予測式を生成すればよい。本実施例では、図６に示した予測式元情報テーブル１１４０に格納された関連情報のうちＣａｃｈｅＵｓａｇｅ１１４６は説明変数から外され、図７に示した予測式テーブル１１５０に格納された情報には含まれていない。 The method for generating the prediction formula in step 105 is not particularly limited, and any method may be used, including a general method such as regression analysis. In the case of regression analysis, for example, after setting all the related information 11412 shown in the prediction formula source information table 1140 as explanatory variables, variables having low relevance to the target information are excluded from the explanatory variables. You can generate a prediction formula with. In this embodiment, among the related information stored in the prediction formula source information table 1140 shown in FIG. 6, Cache Use 1146 is excluded from the explanatory variables and is included in the information stored in the prediction formula table 1150 shown in FIG. Not done.

図９は、実施例１に係わる学習データを取得するための設定変更内容を決定する処理のフローチャートである。本設定変更内容決定処理２００は、例えば、図８に示した予測式を生成する処理の後に実施される。本処理は、管理サーバ１０００のプロセッサ１３００が、メモリ１１００上に展開された設定変更内容決定プログラム１１８０を実行することによっておこなわれる。 FIG. 9 is a flowchart of a process for determining the setting change contents for acquiring the learning data according to the first embodiment. The setting change content determination process 200 is performed, for example, after the process of generating the prediction formula shown in FIG. This process is performed by the processor 1300 of the management server 1000 executing the setting change content determination program 1180 expanded on the memory 1100.

以下、本フローチャートの具体例を示す。 A specific example of this flowchart is shown below.

はじめに、設定変更内容決定プログラム１１８０は、図７に例示した予測式テーブル１１５０における説明情報１１５１２のメトリックを抽出し、メトリックごとに以下の処理を実施する。 First, the setting change content determination program 1180 extracts the metric of the explanatory information 11512 in the prediction formula table 1150 illustrated in FIG. 7, and performs the following processing for each metric.

まず、設定変更内容決定プログラム１１８０は、メトリックが構成情報テーブル１１３０に含まれているかどうかをチェックする（ステップ２０１）。メトリックが構成情報テーブル１１３０に含まれていない場合、設定変更内容決定プログラム１１８０は、予測式テーブル１１５０における次のメトリックに対する処理に進む。メトリックが構成情報テーブル１１３０に含まれている場合、設定変更内容決定プログラム１１８０は、メトリックの取り得る範囲の情報を取得する（ステップ２０２）。メトリックがストレージのキャッシュサイズの場合、例えば、ハードウェアスペック上、キャッシュサイズとして取り得る値の範囲の情報を取得する。例えば、キャッシュサイズが１ＧＢ〜７２ＧＢの範囲であるといった情報が取得される。また、メトリックがストレージのパリティグループの場合、ＲＡＩＤレベルの範囲の情報を取得する。例えば、取り得るＲＡＩＤレベルが、ＲＡＩＤ０（２Ｄ）、ＲＡＩＤ１（１Ｄ＋１Ｐ）、ＲＡＩＤ５（３Ｄ＋１Ｐ）であるといった情報が取得される。これらメトリックの取り得る範囲を取得する方法は特に限定されない。例えば、各メトリックの取り得る範囲の情報をあらかじめテーブル（図示しない）に格納しておき、設定変更内容決定プログラム１１８０が適宜そのテーブルから必要な情報を取得することにしてもよい。あるいは、設定変更内容決定プログラム１１８０がストレージなどのハードウェアに要求を出して取得することにしてもよい。 First, the setting change content determination program 1180 checks whether or not the metric is included in the configuration information table 1130 (step 201). If the metric is not included in the configuration information table 1130, the setting change content determination program 1180 proceeds to process for the next metric in the prediction expression table 1150. When the metric is included in the configuration information table 1130, the setting change content determination program 1180 acquires information in a range that the metric can take (step 202). When the metric is the cache size of the storage, for example, information on the range of values that can be taken as the cache size is acquired in terms of hardware specifications. For example, information that the cache size is in the range of 1 GB to 72 GB is acquired. If the metric is a storage parity group, information on the RAID level range is acquired. For example, information that the possible RAID levels are RAID0 (2D), RAID1 (1D + 1P), and RAID5 (3D + 1P) is acquired. The method for obtaining the possible range of these metrics is not particularly limited. For example, information in a range that can be taken by each metric may be stored in a table (not shown) in advance, and the setting change content determination program 1180 may appropriately acquire necessary information from the table. Alternatively, the setting change content determination program 1180 may issue a request to hardware such as storage and acquire it.

次に、設定変更内容決定プログラム１１８０は、ステップ２０２で取得した範囲の中で、データの不足している定義域を探索する（ステップ２０３）。次に、データの不足している定義域が存在するかどうかを判定し（ステップ２０４）、存在しない場合、予測式テーブル１１５０における次のメトリックに対する処理に進む。ステップ２０４において不足している定義域が存在する場合、設定変更内容決定プログラム１１８０は、不足している定義域のデータの取得が可能となる設定変更操作のためのパラメータを生成する（ステップ２０５）。 Next, the setting change content determination program 1180 searches for a domain lacking data in the range acquired in step 202 (step 203). Next, it is determined whether or not there is a domain lacking data (step 204), and if it does not exist, the process proceeds to the next metric in the prediction expression table 1150. If there is a missing domain in step 204, the setting change content determination program 1180 generates a parameter for a setting change operation that enables acquisition of data in the missing domain (step 205). ..

例えば、メトリックであるキャッシュサイズに着目すると、図６に示した予測式元情報テーブル１１４０のＣａｃｈｅＳｉｚｅ１１４７として８ＧＢ設定時以外のデータが存在しないとする。その場合、設定変更内容決定プログラム１１８０は、８ＧＢ以外に設定したときのデータを取得しようと試みる。例えば、設定変更内容決定プログラム１１８０は、キャッシュサイズを１６ＧＢに設定変更するパラメータを生成する。 For example, focusing on the cache size, which is a metric, it is assumed that there is no data other than when 8 GB is set as the Cache Size 1147 of the prediction formula source information table 1140 shown in FIG. In that case, the setting change content determination program 1180 tries to acquire the data when the setting is set to other than 8GB. For example, the setting change content determination program 1180 generates a parameter for changing the setting of the cache size to 16 GB.

ここで、設定変更内容決定プログラム１１８０は、ステップ２０５で生成したパラメータの設定を変更した場合に、ＳＬＡ（ＳｅｒｖｉｃｅＬｅｖｅｌＡｇｒｅｅｍｅｎｔ）を満たすかどうかチェックし、変更後のパラメータがＳＬＡを満たさなくなる場合にはそのパラメータの設定範囲から除外するなどしても良い。例えば、キャッシュサイズの８ＧＢを４ＧＢに変更した場合に、ボリュームの性能や、そのボリュームを利用するサーバ３０００上で動作している業務アプリケーションの性能として、あらかじめ定められた要件（応答時間１秒以内など）を満たさなくなる場合には、パラメータの設定を４ＧＢへ変更することを実施しないことにしてもよい。 Here, the setting change content determination program 1180 checks whether or not the SLA (Service Level Agreement) is satisfied when the setting of the parameter generated in step 205 is changed, and if the changed parameter does not satisfy the SLA. It may be excluded from the setting range of the parameter. For example, when the cache size of 8GB is changed to 4GB, predetermined requirements (response time within 1 second, etc.) are set as the performance of the volume and the performance of the business application running on the server 3000 that uses the volume. ) Satisfaction is not satisfied, the parameter setting may not be changed to 4GB.

次に、設定変更内容決定プログラム１１８０は、学習データ取得用設定変更処理を実行する（ステップ２０６）。ステップ２０６については図１０を参照して詳細に説明する。 Next, the setting change content determination program 1180 executes the learning data acquisition setting change process (step 206). Step 206 will be described in detail with reference to FIG.

図１０は、学習データ取得用設定変更を実行する処理のフローチャートである。本学習データ取得要設定変更処理３００（図９の学習データ取得要設定変更処理２０６）は、管理サーバ１０００のプロセッサ１３００が、メモリ１１００上に展開された設定変更内容決定プログラム１１８０を実行することによっておこなわれる。以下、本フローチャートの具体例を示す。 FIG. 10 is a flowchart of a process for executing the setting change for learning data acquisition. The learning data acquisition required setting change process 300 (learning data acquisition required setting change process 206 in FIG. 9) is performed by the processor 1300 of the management server 1000 executing the setting change content determination program 1180 expanded on the memory 1100. It is carried out. A specific example of this flowchart is shown below.

まず、設定変更内容決定プログラム１１８０は、構成変更プログラム１１９０に設定変更の操作実行を要求し、実行結果を取得する（ステップ３０１）。次に、設定変更内容決定プログラム１１８０は、予測式元情報テーブル１１４０に新規時刻のエントリが追加されたかどうかを確認する（ステップ３０２）。 First, the setting change content determination program 1180 requests the configuration change program 1190 to execute the setting change operation, and acquires the execution result (step 301). Next, the setting change content determination program 1180 confirms whether or not a new time entry has been added to the prediction formula source information table 1140 (step 302).

新規時刻のエントリが追加されている場合、設定変更内容決定プログラム１１８０は、予測式元情報テーブル１１４０における対象定義域の取得データ数を取得し（ステップ３０３）、データを十分に取得できたかどうかをチェックする（ステップ３０４）。 When the entry of the new time is added, the setting change content determination program 1180 acquires the number of acquired data in the target domain in the prediction formula source information table 1140 (step 303), and determines whether or not the data can be sufficiently acquired. Check (step 304).

ここで、学習データが十分に取得できたかどうかの判定に、あらかじめデータ数の閾値を設定しておく、予測式テーブルに示す説明情報の個数を閾値として設定しておくなど、どのような方法にておいても良い。学習データを十分に取得できている場合には、設定変更内容決定プログラム１１８０は、次のステップ３０５へ進む。学習データを十分に取得できていない場合には、設定変更内容決定プログラム１１８０は、ステップ３０２から再び処理を実行する。 Here, in order to determine whether or not the training data has been sufficiently acquired, a threshold value for the number of data is set in advance, or the number of explanatory information shown in the prediction formula table is set as a threshold value. You can keep it. When sufficient training data has been acquired, the setting change content determination program 1180 proceeds to the next step 305. If sufficient training data has not been acquired, the setting change content determination program 1180 executes the process again from step 302.

ステップ３０５では、設定変更内容決定プログラム１１８０は、ステップ３０１実行前に戻す設定変更操作の実行を構成変更プログラム１１９０に要求し、実行結果を取得する。ステップ３０１、ステップ３０５において要求した設定変更操作が成功しなかった場合は、本処理を中断する。 In step 305, the setting change content determination program 1180 requests the configuration change program 1190 to execute the setting change operation returned to before the execution of step 301, and acquires the execution result. If the setting change operation requested in steps 301 and 305 is not successful, this process is interrupted.

図１０を実行して十分な学習データを取得した後に、図８に示した予測式生成処理１００を実行することで、新しい構成において、学習データの不足が無い状態の予測式元情報テーブル１１４０から、高い精度の予測式を示す予測式テーブル１１５０を生成することができる。 By executing the prediction formula generation process 100 shown in FIG. 8 after executing FIG. 10 to acquire sufficient training data, from the prediction formula source information table 1140 in a state where there is no shortage of training data in the new configuration. , It is possible to generate a prediction formula table 1150 showing a prediction formula with high accuracy.

本実施例では、図９に示した設定変更内容決定処理２００のステップ２０１〜ステップ２０４において、構成上取り得る範囲からデータが不足している定義域を全て抽出し、その後、図１０に示した学習データ取得用設定変更処理により、データが不足している定義域のデータを取得し、更にその後、図８に示した予測式生成処理１００で予測式を生成している。しかし、これに限定されることはない。他の例として、データが不足している定義域を１つ抽出するごとにその定義域のデータを取得しデータが取得できたらその段階で予測式を生成するという処理を、データが不足している定義域の個数だけ繰り返すことにしてもよい。 In this embodiment, in steps 201 to 204 of the setting change content determination process 200 shown in FIG. 9, all the domain areas in which data is insufficient are extracted from the range that can be configured, and then shown in FIG. The training data acquisition setting change process acquires the data in the domain where the data is insufficient, and then the prediction formula generation process 100 shown in FIG. 8 generates the prediction formula. However, it is not limited to this. As another example, every time a domain with insufficient data is extracted, the data in that domain is acquired, and if the data can be acquired, a prediction formula is generated at that stage. It may be repeated for the number of domains in which it exists.

具体例を示す。ここではキャッシュサイズの取り得る範囲が１ＧＢから７２ＧＢであるとする。例えば、キャッシュサイズの取り得る範囲のうちデータが不足している定義域を例えば１ＧＢ単位で抽出し、抽出した全ての定義域のデータを取得しきってから予測式を生成してもよい。あるいは他の例として、１ＧＢ単位で抽出したデータが不足している各定義域について、データを取得して予測式を生成し次の定義域に進むという処理を繰り返すことにしてもよい。 A specific example is shown. Here, it is assumed that the possible range of the cache size is 1 GB to 72 GB. For example, a domain in which data is insufficient in the range of possible cache size may be extracted in units of, for example, 1 GB, and a prediction formula may be generated after all the extracted domain data have been acquired. Alternatively, as another example, the process of acquiring data, generating a prediction formula, and proceeding to the next domain may be repeated for each domain in which the data extracted in 1 GB units is insufficient.

また、本実施例では、図９に示した設定変更内容決定処理２００において、予測式テーブル１１５０に含まれる項目のうち、構成情報テーブル１１３０に含まれている全ての項目に対して、データが不足している定義域を抽出した後、学習データ取得用設定変更処理２０６を実行している。そのため、予測式テーブル１１５０および構成情報テーブル１１３０に含まれる全ての項目についてデータを収集した後に予測式を生成することとなる。しかし、これに限定されることは無い。他の例として、予測式テーブル１１５０および構成情報テーブル１１３０に含まれる１つの項目に対して学習データ取得用設定変更処理および予測式生成処理を実行して次の項目に進むという処理を繰り返すことにしてもよい。 Further, in this embodiment, in the setting change content determination process 200 shown in FIG. 9, data is insufficient for all the items included in the configuration information table 1130 among the items included in the prediction formula table 1150. After extracting the defined domain, the training data acquisition setting change process 206 is executed. Therefore, the prediction formula is generated after collecting data for all the items included in the prediction formula table 1150 and the configuration information table 1130. However, it is not limited to this. As another example, it is decided to repeat the process of executing the setting change process for learning data acquisition and the predictive expression generation process for one item included in the predictive expression table 1150 and the configuration information table 1130 and proceeding to the next item. You may.

以上、本実施例によれば、計算機システムが構成上取り得る範囲において不足している学習データを予め能動的に収集しておくことにより、構成変更が行われたとき早期に精度の高い予測式を得ることができ、学習時間を短縮し、構成変更後すぐに機械学習技術に基づく効率の良い管理を実施可能とする。 As described above, according to this embodiment, by actively collecting the learning data that is insufficient in the range that the computer system can take in the configuration, a highly accurate prediction formula is performed at an early stage when the configuration is changed. It is possible to shorten the learning time and implement efficient management based on machine learning technology immediately after the configuration change.

例えば、関数を予兆監視に利用する場合、構成変更直後や新規構築された構成であっても、ＩＴシステムから取得した実測値が、関数で示される関係とかけ離れている場合、ＩＴシステムに問題が発生したと判断することが可能となる。 For example, when using a function for predictive monitoring, if the measured value obtained from the IT system is far from the relationship indicated by the function even immediately after the configuration change or even if the configuration is newly constructed, there is a problem with the IT system. It is possible to determine that it has occurred.

また、関数を障害原因切り分けに利用する場合、構成変更直後や新規構築された構成であっても、ＩＴシステムから取得した実測値が、関数で示される関係とかけ離れている場合、各説明情報の中で最も変動幅が大きい説明情報に問題が発生した可能性が高いとして、根本原因と判断することが可能となる。また、これにより、障害発生時に即座に根本原因特定を自動でおこなうことが可能となる。 In addition, when the function is used to isolate the cause of failure, if the measured value acquired from the IT system is far from the relationship indicated by the function even immediately after the configuration change or even if the configuration is newly constructed, each explanatory information It is possible to determine that the root cause is that there is a high possibility that a problem has occurred in the explanatory information that has the largest fluctuation range. In addition, this makes it possible to automatically identify the root cause immediately when a failure occurs.

また、関数をＷｈａｔ−ｉｆ分析に利用する場合、構成変更直後や新規構築された構成であっても、試行したい値を関数に代入することによって、代入した値の状況での、関数に現れる他のメトリックの値をシミュレートすることが可能となる。 Also, when using a function for What-if analysis, even if the configuration is changed immediately or newly constructed, by assigning the value you want to try to the function, it will appear in the function in the situation of the assigned value. It is possible to simulate the value of the metric of.

このように、本発明により、構成変更直後や新規構築された構成であっても、障害発生、あるいは管理要件を満たせなくなるなどの未然防止や、障害発生時の迅速な障害回復といった効果を得ることが可能となる。本発明は、前述の各種クラウド形態にも適用可能であり、管理ソフトのＳａａＳや運用管理業務をサービスとして請け負う形態においても適用可能である。 As described above, according to the present invention, it is possible to obtain effects such as prevention of failure occurrence or failure to satisfy management requirements even immediately after configuration change or newly constructed configuration, and quick failure recovery in the event of failure occurrence. Is possible. The present invention can be applied to the above-mentioned various cloud forms, and can also be applied to a form in which the management software SaaS and operation management work are undertaken as a service.

実施例２に係る計算機システムは基本的には実施例１のものと同様の構成を有し、同様の動作を行う。ただし、実施例２は、目的情報に関連する関連情報だけでなく、目的情報が対象とする業務と類似する特性を有する業務の計算機システムにおいて取得された情報を予測式の生成に利用する点で実施例１と異なる。 The computer system according to the second embodiment basically has the same configuration as that of the first embodiment, and performs the same operation. However, the second embodiment is to use not only the related information related to the target information but also the information acquired in the computer system of the business whose target information has characteristics similar to the target business to generate the prediction formula. Different from Example 1.

図１１は、実施例２に係る業務特性管理テーブル１８００の一例を示す図である。業務特性管理テーブル１８００は、業務単位の業務特性情報を管理する。 FIG. 11 is a diagram showing an example of the business characteristic management table 1800 according to the second embodiment. The business characteristic management table 1800 manages business characteristic information for each business unit.

業務特性管理テーブル１８００には、業務単位１８０１１と業務特性１８０１２のデータが格納される。本実施例では、業務単位をボリュームに対応づけ、業務特性として、各業務についてＩ／Ｏ回数および各Ｉ／Ｏパタンの割合などＩ／Ｏに関する情報を管理している例を示している。図１１を参照すると、業務特性管理テーブル１８００には業務単位１８０１１と業務特性１８０１２のフィールドが対応づけられている。業務単位１８０１１にはボリュームＩＤ１８０１が含まれている。業務特性１８０１２には、Ｉ／Ｏ数１８０２、Ｉ／Ｏ増減率１８０３、高頻度のアクセス１８０４、Ｉ／Ｏパタン１８０５のフィールドが含まれている。 The business characteristic management table 1800 stores data of the business unit 18011 and the business characteristic 18012. In this embodiment, an example is shown in which business units are associated with volumes and information related to I / O such as the number of I / Os and the ratio of each I / O pattern is managed as business characteristics for each business. Referring to FIG. 11, the business characteristic management table 1800 is associated with the fields of the business unit 18011 and the business characteristic 18012. The business unit 18011 includes the volume ID 1801. The business characteristics 18012 include fields of I / O number 1802, I / O increase / decrease rate 1803, high frequency access 1804, and I / O pattern 1805.

ボリュームＩＤ１８０１には、ボリューム２２１０を一意に識別するための識別子が格納される。Ｉ／Ｏ数１８０２には、Ｉ／Ｏ数が記録される。例えば、前月のＩＯＰＳの平均値や中間値などを記録する。Ｉ／Ｏ増減率１８０３には、過去一定期間にＩＯＰＳがどれだけ変化したかの割合を記録する。例えば、半年間あるいは１年間において、ＩＯＰＳの１か月の平均を算出し、各月の平均値の前月の平均値に対する増減率を算出する。Ｉ／Ｏパタン１８０５には、ＲａｎｄｏｍＲｅａｄ、ＲａｎｄｏｍＷｒｉｔｅ、ＳｅｑｕｅｎｔｉａｌＲｅａｄ、ＳｅｑｕｅｎｔｉａｌＷｒｉｔｅの各Ｉ／Ｏパタンの発生割合が記録される。その中で最も割合が高かったＩ／Ｏパタンが高頻度のアクセス１８０４に記録される。なお、ここでは業務単位がボリュームに対応する例を挙げたが、これに限定されない。他の例として、業務単位をＶＭにしてもよく、サーバ３０００上の業務プログラムにしてもよく、あるいは業務プログラムの実行する処理単位の情報などにしても良い。 The volume ID 1801 stores an identifier for uniquely identifying the volume 2210. The I / O number is recorded in the I / O number 1802. For example, record the average value or median value of IOPS of the previous month. In the I / O increase / decrease rate 1803, the rate of how much IOPS has changed in the past fixed period is recorded. For example, in half a year or one year, the average of IOPS for one month is calculated, and the rate of increase / decrease of the average value of each month with respect to the average value of the previous month is calculated. In the I / O pattern 1805, the generation ratio of each I / O pattern of Random Read, Random Write, Sequential Read, and Sequential Write is recorded. The I / O pattern with the highest proportion is recorded in the frequent access 1804. Here, an example in which a business unit corresponds to a volume is given, but the present invention is not limited to this. As another example, the business unit may be a VM, a business program on the server 3000, or information on a processing unit executed by the business program.

図１２は、実施例２に係る学習用データ共有を実行する処理のフローチャートである。本学習要データ共有処理４００は、実施例１における図８の予測式生成処理１００のステップ１０５に相当する実施例２における処理である。本処理は、管理サーバ１０００のプロセッサ１３００が、メモリ１１００上に展開された予測式生成プログラム１１７０を実行することによっておこなわれる。以下、本フローチャートの具体例を示す。 FIG. 12 is a flowchart of a process for executing learning data sharing according to the second embodiment. The learning-required data sharing process 400 is the process in Example 2 corresponding to step 105 of the predictive expression generation process 100 in FIG. 8 in Example 1. This process is performed by the processor 1300 of the management server 1000 executing the predictive expression generation program 1170 expanded on the memory 1100. A specific example of this flowchart is shown below.

予測式生成プログラム１１７０は、まず、業務特性管理テーブル１８００の情報を取得する（ステップ４０１）。次に、予測式生成プログラム１１７０は、予測式を生成する対象の業務と類似する類似業務で利用されている予測式生成対象の構成要素が存在するか否かをチェックする（ステップ４０２）。ここで、類似業務で利用しているかどうかの判定には、ステップ４０１で取得した業務特性管理テーブル１８００の情報が利用される。類似する業務を予めグループ化しておき、同じグループに属する業務で利用されている構成要素の有無をチェックすればよい。 The prediction formula generation program 1170 first acquires the information of the business characteristic management table 1800 (step 401). Next, the prediction formula generation program 1170 checks whether or not there is a component of the prediction formula generation target used in a similar business similar to the business of the target for which the prediction formula is generated (step 402). Here, the information in the business characteristic management table 1800 acquired in step 401 is used to determine whether or not the product is used in similar business. Similar businesses may be grouped in advance, and the presence or absence of components used in businesses belonging to the same group may be checked.

グループ化の例として、高頻度アクセス情報が同一な業務を業務類似グループとしてもよい。あるいは、Ｉ／Ｏ増減率に関して、５％以上の減少率、プラスマイナス５％以内の増減率、５％以上の増加率等をそれぞれ業務類似グループとすることにしてもよい。あるいは、ｋ平均法を用いて、業務をいくつのグループに分類しておいてもよい。あるいは、予めグループ数を入力しておきソノグループ数に適切にグループ分けすることにしてもよい。あるいは、上記グループ分けの方法を組み合わせて業務をグループ化してもよい。このようにグループ分けはどのような方法によっても良く、特に限定されない。 As an example of grouping, businesses with the same high-frequency access information may be grouped as business-like groups. Alternatively, regarding the I / O increase / decrease rate, a decrease rate of 5% or more, an increase / decrease rate of plus or minus 5% or less, an increase rate of 5% or more, etc. may be set as business-like groups. Alternatively, the k-means method may be used to classify the work into any number of groups. Alternatively, the number of groups may be input in advance and grouped appropriately according to the number of sono groups. Alternatively, the business may be grouped by combining the above grouping methods. As described above, the grouping may be performed by any method and is not particularly limited.

図１１に例示した業務特性管理テーブル１８００の場合、Ｖｏｌｕｍｅ１とＶｏｌｕｍｅ３は高頻度アクセス１８０４が「ＲＷ」で同一であり、Ｉ／Ｏ増減率１８０３が「５％以上」で同一であり、かつＩ／Ｏ数１８０２が「１００００以上」で同一である。Ｖｏｌｕｍｅ１とＶｏｌｕｍｅ３が同じ業務類似グループと判断されるように業務をグループ化をしておいてもよい。 In the case of the business characteristic management table 1800 illustrated in FIG. 11, Volume 1 and Volume 3 have the same high-frequency access 1804 for "RW", the same I / O increase / decrease rate 1803 for "5% or more", and I / O. The number of O's 1802 is "10000 or more" and is the same. Businesses may be grouped so that Volume 1 and Volume 3 are determined to be the same business-like group.

ステップ４０２において、類似業務で利用している構成要素が存在する場合、予測式生成プログラム１１７０は、類似業務で利用している各構成要素の予測式元情報テーブル１１４０の情報を利用して予測式を生成し、予測式テーブル１１５０に格納する（ステップ４０３）。ステップ４０２において、類似業務で利用している構成要素が存在しない場合、予測式生成プログラム１１７０は、構成要素単位で予測式元情報テーブルの情報から、予測式を生成し、予測式テーブルに格納する（ステップ４０４）。 In step 402, when there are components used in the similar business, the prediction formula generation program 1170 uses the information in the prediction formula source information table 1140 of each component used in the similar business to formulate the prediction formula. Is generated and stored in the prediction expression table 1150 (step 403). In step 402, when there is no component used in the similar business, the prediction formula generation program 1170 generates a prediction formula from the information in the prediction formula source information table for each component and stores it in the prediction formula table. (Step 404).

以上のように、業務の類似しているグループ間で学習データを共有することで、学習時間を短縮し、迅速に機械学習技術に基づく精度の高い効率の管理を実施可能とする。例えば、新しく環境を作成する場合に、通常であれば長期間、例えば数カ月にわたる性能情報および容量情報などの各種履歴情報を取得する必要がある。しかし、新たに作成する業務の環境に類似する環境の業務があれば、その類似業務グループのデータを活用し、短期間、例えば３日間で類似業務グループの判定をおこなうだけで、機械学習技術に基づく効率の良い管理を実施可能となる。また、図９に示した設定変更内容決定処理２００のステップ２０３においても、類似業務グループそれぞれで取ったことのある構成に基づき、不足している定義域を探索できるため、不足しているデータの収集にかかる時間を短縮することが可能となり、機械学習技術に基づく効率の良い管理を実施可能となる。 As described above, by sharing learning data between groups with similar tasks, it is possible to shorten the learning time and quickly implement highly accurate and efficient management based on machine learning technology. For example, when creating a new environment, it is usually necessary to acquire various historical information such as performance information and capacity information for a long period of time, for example, several months. However, if there is a business in an environment similar to the environment of the newly created business, the data of the similar business group can be used to determine the similar business group in a short period of time, for example, 3 days, and the machine learning technology can be used. Efficient management based on this becomes possible. Further, also in step 203 of the setting change content determination process 200 shown in FIG. 9, since the missing domain can be searched based on the configuration taken by each similar business group, the missing data can be searched. It is possible to shorten the time required for collection, and it is possible to carry out efficient management based on machine learning technology.

以上説明した各実施例による計算機システムは以下のような態様に整理することもできる。 The computer system according to each of the above-described embodiments can be organized in the following aspects.

（態様１）
対象システムが運用される間に前記対象システムの構成要素から学習データを取得する情報収集部と、前記学習データに基づいて前記対象システムの構成要素間の関係を目的情報と説明情報の関係により表現した予測式を生成する予測式生成部と、前記対象システムの構成を変更する設定内容を決定する設定変更内容決定部と、前記対象システムの構成を変更する構成変更部と、を有し、前記設定変更内容決定部が、前記対象システムの構成を変更する場合に取り得る前記構成要素の状態の範囲において、前記学習データが十分に取得されていない状態をデータ不足状態として抽出し、前記構成要素が前記データ不足状態となるように前記対象システムの構成を変更する設定内容を決定し、前記構成変更部が、前記決定された設定内容に従って前記対象システムの構成を変更し、前記情報収集部が、前記対象システムの前記構成要素から前記データ不足状態のときの学習データを取得する、学習データ処理装置。(Aspect 1)
The relationship between the information collecting unit that acquires learning data from the components of the target system while the target system is operated and the components of the target system based on the learning data is expressed by the relationship between the target information and the explanatory information. It has a prediction formula generation unit that generates the prediction formula, a setting change content determination unit that determines the setting content that changes the configuration of the target system, and a configuration change unit that changes the configuration of the target system. The setting change content determination unit extracts a state in which the training data is not sufficiently acquired within the range of the states of the components that can be taken when changing the configuration of the target system as a data shortage state, and the components. Determines the setting content to change the configuration of the target system so that the data is insufficient, the configuration changing unit changes the configuration of the target system according to the determined setting content, and the information collecting unit , A learning data processing device that acquires training data when the data is insufficient from the constituent elements of the target system.

（態様２）
前記設定変更内容決定部は、前記対象システムが取り得る前記構成要素の状態の範囲のうち、学習データが所定データ量以上に取得されていない構成を抽出し、該構成に相当する設定変更を決定する、態様１に記載の学習データ処理装置。(Aspect 2)
The setting change content determination unit extracts a configuration in which learning data is not acquired in excess of a predetermined amount of data from the range of states of the component that the target system can take, and determines a setting change corresponding to the configuration. The learning data processing apparatus according to the first aspect.

対象システムが取りうる構成のうち学習データが十分でない構成を一時的に設定し、学習データを収集しておくことが可能となり、対象システムの取り得る構成の学習データを網羅することができる。 It is possible to temporarily set a configuration in which the learning data is not sufficient among the configurations that the target system can take and collect the learning data, and it is possible to cover the learning data of the configuration that the target system can take.

（態様３）
前記設定変更内容決定部は、前記構成要素が前記データ不足状態となるように前記対象システムの構成を一時的に変更し、前記構成変更部が、前記決定された設定内容に従って前記対象システムの構成を変更し、前記情報収集部が、前記対象システムの前記構成要素から前記データ不足状態のときの学習データを取得して蓄積し、前記対象システムの構成が変更されたとき、前記予測式生成部は、前記構成要素が前記データ不足状態となる構成においては前記情報収集部により蓄積された学習データを用いて前記予測式を生成する、態様１に記載の学習データ処理装置。(Aspect 3)
The setting change content determination unit temporarily changes the configuration of the target system so that the component is in the data shortage state, and the configuration change unit configures the target system according to the determined setting content. When the information collecting unit acquires and accumulates the learning data in the data shortage state from the component of the target system and the configuration of the target system is changed, the predictive expression generation unit Is the learning data processing apparatus according to the first aspect, wherein the prediction formula is generated using the learning data accumulated by the information collecting unit in a configuration in which the component is in the data shortage state.

対象システムの構成が変更されたとき、変更後の構成での学習データを予め取得しておきその変更後の構成での学習データで予測式を生成するので、構成が変更されたとき短期間で精度の高い予測式を得ることができる。 When the configuration of the target system is changed, the training data in the changed configuration is acquired in advance and the prediction formula is generated from the learning data in the changed configuration, so when the configuration is changed, it takes a short time. A highly accurate prediction formula can be obtained.

（請求項４）
前記予測式は、該予測式の対象となる構成要素である対象構成要素の性能を目的情報とし、前記対象構成要素と論理的に関連づけられた１つ以上の関連構成要素の性能を説明情報とし、前記目的情報を前記説明情報の関数で示すものであり、前記情報収集部は、前記対象システムの構成要素が発揮した性能を性能履歴情報として蓄積し、前記予測式生成部は、前記性能履歴情報を前記学習データとして前記関数を算出する、態様１に記載の学習データ処理装置。(Claim 4)
The prediction formula uses the performance of the target component, which is the target component of the prediction formula, as the target information, and the performance of one or more related components logically associated with the target component as explanatory information. The target information is indicated by a function of the explanatory information, the information collecting unit accumulates the performance exhibited by the components of the target system as performance history information, and the prediction formula generation unit generates the performance history. The learning data processing device according to aspect 1, wherein the function is calculated using the information as the learning data.

測定して蓄積した性能履歴情報に基づいて予測式を生成するので、十分な性能履歴情報を蓄積しておくことにより、良好な予測式を生成することが可能になる。 Since the prediction formula is generated based on the measured and accumulated performance history information, it is possible to generate a good prediction formula by accumulating sufficient performance history information.

（態様５）
前記目的情報がストレージにおけるボリュームの応答時間であり、前記説明情報には、前記ボリュームへのアクセスに用いられるプロセッサの使用率およびキャッシュのサイズが含まれる、態様４に記載の学習データ処理装置。(Aspect 5)
The learning data processing apparatus according to aspect 4, wherein the target information is the response time of the volume in the storage, and the explanatory information includes the utilization rate of the processor used to access the volume and the size of the cache.

（態様６）
前記予測式は、前記目的情報を、前記説明情報と係数の積の和で示すものであり、前記予測式生成部は、前記性能履歴情報を前記学習データとして、前記関連構成要素毎の係数を算出する、態様４に記載の学習データ処理装置。(Aspect 6)
In the prediction formula, the target information is represented by the sum of the products of the explanatory information and the coefficients, and the prediction formula generation unit uses the performance history information as the learning data and calculates the coefficients for each of the related components. The learning data processing device according to aspect 4, which is calculated.

予測式を説明情報と係数の積和で表わし、その係数を算出するので、目的情報と説明情報の関係を示す関数を容易に算出することができる。 Since the prediction formula is represented by the sum of products of the explanatory information and the coefficient and the coefficient is calculated, a function showing the relationship between the target information and the explanatory information can be easily calculated.

（態様７）
前記予測式生成部は、前記対象システムに対して所定の類似条件を満たす類似システムにて取得された学習データを用いて前記予測式を生成する、態様１に記載の学習データ処理装置。(Aspect 7)
The learning data processing device according to aspect 1, wherein the prediction formula generation unit generates the prediction formula using learning data acquired by a similar system that satisfies a predetermined similarity condition with respect to the target system.

対象システムに類似システムがある場合には類似システムの学習データを利用して予測式を生成するので、利用できる学習データを増やして構成変更後に早期の段階から精度の高い機械学習が可能となる。 When the target system has a similar system, the prediction formula is generated using the learning data of the similar system. Therefore, the learning data that can be used is increased, and highly accurate machine learning becomes possible from an early stage after the configuration change.

（態様８）
前記目的情報が前記ストレージのボリュームの性能であり、前記類似条件は、前記ボリュームに対するランダムリード、ランダムライト、シーケンシャルリード、およびシーケンシャルライトを含むＩ／Ｏパタンの類似度合いで定められる、態様７に記載の学習データ処理装置。(Aspect 8)
The target information is the performance of the volume of the storage, and the similarity condition is defined by the degree of similarity of the I / O pattern including random read, random write, sequential read, and sequential write to the volume, according to the seventh aspect. Learning data processing device.

Ｉ／Ｏパタンの類似度で類似判断を行うので、Ｉ／Ｏパタンが類似する業務で収集された学習データを他の業務の構成変更あるいは新規構築にて利用することが可能である。 Since the similarity is judged based on the similarity of the I / O patterns, it is possible to use the learning data collected in the tasks with similar I / O patterns in the configuration change or new construction of other tasks.

（態様９）
前記設定変更内容決定部は、前記キャッシュサイズの取りうる範囲のうち、十分な学習データが得られていないサイズに変更することを決定する、態様５に記載の学習データ処理装置。(Aspect 9)
The learning data processing device according to the fifth aspect, wherein the setting change content determining unit determines to change the cache size to a size in which sufficient learning data is not obtained within a possible range.

説明情報にキャッシュサイズがある場合、キャッシュサイズの取りうる範囲の学習データを予め網羅しておくことができるので、対象システムのキャッシュサイズを変更しても学習データ不足で予測式の精度を高く維持することができる。 If the explanatory information has a cache size, the training data within the range that the cache size can take can be covered in advance, so even if the cache size of the target system is changed, the training data is insufficient and the accuracy of the prediction formula is maintained high. can do.

１０００…管理サーバ、１１００…メモリ、１１１０…関連情報テーブル、１１２０…性能履歴情報テーブル、１１２１…時刻、１１２４…メトリック、１１２５…性能値、１１３０…構成情報テーブル、１１３３…メトリック、１１３４…値、１１４０…予測式元情報テーブル、１１６０…情報収集プログラム、１１７０…予測式生成プログラム、１１８０…設定変更内容決定プログラム、１１９０…構成変更プログラム、１２００…通信デバイス、１３００…プロセッサ、１４００…出力デバイス、１５００…入力デバイス、１６００…記憶デバイス、１７００…内部バス、１８００…業務特性管理テーブル、２０００…ストレージ装置、２１００…メモリ、２１１０…ディスクキャッシュ、２１２０…構成性能情報収集プログラム、２１３０…構成変更プログラム、２２００…論理ボリューム提供部、２２１０…ボリューム、２２２０…ディスクプール、２２３０…物理領域、
２３００…ディスクＩ／Ｆコントローラ、２５００…プロセッサ、２６００…データＩ／Ｆ、２７００…通信路、３０００…サーバ、３１００…メモリ、３１１０…構成情報収集プログラム、３１２０…業務プログラム、３１３０…構成変更プログラム、３３００…プロセッサ、３５００…通信路、４０００…ＳＡＮ、５０００…管理用ネットワーク1000 ... Management server, 1100 ... Memory, 1110 ... Related information table, 1120 ... Performance history information table, 1121 ... Time, 1124 ... Metric, 1125 ... Performance value, 1130 ... Configuration information table, 1133 ... Metric, 1134 ... Value, 1140 ... Prediction formula source information table, 1160 ... Information collection program, 1170 ... Prediction formula generation program, 1180 ... Setting change content determination program, 1190 ... Configuration change program, 1200 ... Communication device, 1300 ... Processor, 1400 ... Output device, 1500 ... Input device, 1600 ... Storage device, 1700 ... Internal bus, 1800 ... Business characteristic management table, 2000 ... Storage device, 2100 ... Memory, 2110 ... Disk cache, 2120 ... Configuration performance information collection program, 2130 ... Configuration change program, 2200 ... Logical volume provider, 2210 ... Volume, 2220 ... Disk pool, 2230 ... Physical area,
2300 ... Disk I / F controller, 2500 ... Processor, 2600 ... Data I / F, 2700 ... Communication path, 3000 ... Server, 3100 ... Memory, 3110 ... Configuration information collection program, 3120 ... Business program, 3130 ... Configuration change program, 3300 ... processor, 3500 ... communication path, 4000 ... SAN, 5000 ... management network

Claims

An information collection unit that acquires learning data from the components of the target system while the target system is in operation.
A prediction formula generation unit that generates a prediction formula that expresses the relationship between the components of the target system based on the learning data by the relationship between the target information and the explanatory information.
The setting change content determination unit that determines the setting content that changes the configuration of the target system,
It has a configuration change unit that changes the configuration of the target system.
The setting change content determination unit extracts a state in which the learning data is not sufficiently acquired within the range of the states of the components that can be taken when changing the configuration of the target system as a data shortage state, and the configuration is described. Determine the setting contents to change the configuration of the target system so that the element is in the data shortage state.
The configuration change unit changes the configuration of the target system according to the determined setting contents.
The information collecting unit acquires learning data in the data shortage state from the component of the target system.
Learning data processing device.

The setting change content determination unit extracts a configuration in which learning data is not acquired in excess of a predetermined amount of data from the range of states of the component that can be taken by the target system, and determines a setting change corresponding to the configuration. To do
The learning data processing device according to claim 1.

The setting change content determination unit temporarily changes the configuration of the target system so that the component is in the data shortage state.
The configuration change unit changes the configuration of the target system according to the determined setting contents.
The information collecting unit acquires and accumulates learning data in the data shortage state from the component of the target system.
When the configuration of the target system is changed
The prediction formula generation unit generates the prediction formula using the learning data accumulated by the information collection unit in a configuration in which the component is in the data shortage state.
The learning data processing device according to claim 1.

The prediction formula uses the performance of the target component, which is the target component of the prediction formula, as the objective information, and the performance of one or more related components logically associated with the target component as explanatory information. , The target information is indicated by a function of the explanatory information.
The information gathering unit accumulates the performance exhibited by the components of the target system as performance history information.
The prediction formula generation unit calculates the function using the performance history information as the learning data.
The learning data processing device according to claim 1.

The learning data processing apparatus according to claim 4, wherein the target information is the response time of the volume in the storage, and the explanatory information includes the usage rate of the processor used for accessing the volume and the size of the cache.

In the prediction formula, the target information is represented by the sum of the products of the explanatory information and the coefficients.
The prediction formula generation unit calculates the coefficient for each of the related components using the performance history information as the learning data.
The learning data processing device according to claim 4.

The learning data processing device according to claim 1, wherein the prediction formula generation unit generates the prediction formula using learning data acquired by a similar system that satisfies a predetermined similarity condition with respect to the target system.

A performance of a volume of the object information gas storage,
The similarity condition is determined by the degree of similarity of the I / O pattern including random read, random write, sequential read, and sequential write to the volume.
The learning data processing device according to claim 7.

The learning data processing device according to claim 5, wherein the setting change content determination unit determines to change the cache size to a size within which sufficient learning data has not been obtained.

A prediction formula that acquires learning data from the components of the target system while the target system is in operation and expresses the relationship between the components of the target system based on the learning data by the relationship between the target information and the explanatory information. It is a training data processing method for generating
The means for determining the content of setting changes realized by a computer is
Within the range of states of the components that can be taken when the configuration of the target system is changed, a state in which the training data is not sufficiently acquired is extracted as a data shortage state.
The setting content for changing the configuration of the target system is determined so that the component is in the data shortage state.
The configuration changing means realized by the computer changes the configuration of the target system according to the determined setting contents.
The information collecting means realized by the computer acquires the learning data in the data shortage state from the component of the target system.
Training data processing method.