JP2016197450A

JP2016197450A - Operation management device, operation management system, information processing method, and operation management program

Info

Publication number: JP2016197450A
Application number: JP2016145114A
Authority: JP
Inventors: 清志加藤; Kiyoshi Kato
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2016-07-25
Filing date: 2016-07-25
Publication date: 2016-11-24
Anticipated expiration: 2028-02-25
Also published as: JP6168209B2

Abstract

PROBLEM TO BE SOLVED: To provide an operation management device capable of detecting a sign of failure and identifying an occurrence place.SOLUTION: An operation management device manages the operation of a system including a plurality of devices to be managed. The operation management device includes a generation unit, an analysis unit, and a display instruction unit. The generation unit generates a correlation function representing a correlation between time series of performance information obtained from the devices to be managed. The analysis unit applies newly obtained performance information to the correlation function and analyzes whether or not the correlation is maintained. The display instruction unit provides an instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each of the devices to be managed which is calculated on the basis of an analysis result.SELECTED DRAWING: Figure 4

Description

本発明は、運用管理装置、運用管理システム、情報処理方法、及び運用管理プログラムに関し、特に、情報通信サービスを提供するシステムの性能劣化を正確に検知および局所化する運用管理装置などに関する。 The present invention relates to an operation management apparatus, an operation management system, an information processing method, and an operation management program, and more particularly to an operation management apparatus that accurately detects and localizes performance degradation of a system that provides an information communication service.

企業情報システムやＩＤＣ（ＩｎｔｅｒｎｅｔＤａｔａＣｅｎｔｅｒ）など比較的大規模なシステムなどにおいて、ＷＥＢサービスや業務サービスといった情報通信サービスの社会インフラとしての重要性が高まるにつれて、そのサービスを提供するサーバなどの装置の安定稼動が重要となっている。このような装置の運用管理は、管理者が手作業で行っていた。装置が大規模・複雑化するにつれて、管理者には知識および操作の面での負担が飛躍的に増大し、判断ミスや操作ミスによるサービス停止といった事態も発生している。 In relatively large-scale systems such as corporate information systems and IDCs (Internet Data Centers), as the importance of information communication services such as web services and business services as social infrastructure increases, devices such as servers that provide such services Stable operation is important. The operation management of such an apparatus has been performed manually by an administrator. As the devices become larger and more complicated, the burden on the manager is drastically increased in terms of knowledge and operation, and a situation such as service interruption due to a determination error or an operation error has also occurred.

このような事態に対処するため、システムを構成するハードウェアあるいはソフトウェアを一元的に状態監視し制御する統合運用管理システムが提供されている。 In order to cope with such a situation, an integrated operation management system that centrally monitors and controls the hardware or software constituting the system is provided.

この統合管理システムでは、管理対象となる複数のハードウェアまたはソフトウェアの稼動状況に関する情報をオンラインで取得し、統合管理システムに接続した運用管理装置に出力する。管理対象となるシステムの障害を判別するには、あらかじめ稼動情報に閾値を設定しておく方法や、平均値からのずれを評価する方法などがあり、障害と判定された場合、該箇所が報告される。 In this integrated management system, information regarding the operating status of a plurality of hardware or software to be managed is acquired online and output to an operation management apparatus connected to the integrated management system. To determine the failure of the system to be managed, there are a method of setting a threshold value in the operation information in advance and a method of evaluating a deviation from the average value. Is done.

例えば、このような統合運用管理システムの運用管理装置では、性能情報毎に閾値を設定し、各々の性能情報が閾値越えしたことを検出して障害を検知する。予め異常であることが明確な値を閾値に設定して、個々の要素の性能の異常を検出する。 For example, in the operation management apparatus of such an integrated operation management system, a threshold is set for each piece of performance information, and a failure is detected by detecting that each piece of performance information exceeds the threshold. A value that is clearly abnormal in advance is set as a threshold value, and an abnormality in the performance of each element is detected.

異常検出が報告された場合、それがメモリ容量不足が原因なのか、ＣＰＵ負荷が原因なのか、ネットワーク負荷が原因なのか等、解決のために原因を絞り込む必要がある。一般に原因の解明には関係がありそうな計算機のシステムログやパラメータの調査、さらにはシステムエンジニアの経験と勘に頼る必要があり、解決に時間と労力を要する。 When abnormality detection is reported, it is necessary to narrow down the cause for resolution, such as whether it is caused by insufficient memory capacity, CPU load, or network load. In general, investigation of computer system logs and parameters that are likely to be related to the elucidation of the cause, as well as relying on the experience and intuition of system engineers, require time and effort to solve.

このため、統合運用管理システムでは、複数の機器から収集したイベントデータ（状態通知）に基づいて異常状態の組み合わせ等の分析を自動的に行い、大局的な問題点や原因を推定して管理者に通知し、対処支援を行うことが重要となる。 For this reason, the integrated operation management system automatically analyzes combinations of abnormal conditions based on event data (status notifications) collected from multiple devices, estimates global problems and causes, and It is important to notify and provide support.

特に、サービスの長期連続運用での信頼性確保には、発生した異常だけでなく、明確な異常になっていない性能劣化や将来発生が予想される障害の兆候といった状態を検出し、計画的な設備増強を行うことが求められている。 In particular, in order to ensure reliability in long-term continuous service operation, not only abnormalities that have occurred but also conditions such as performance deterioration that is not clearly abnormal and signs of failure that are expected to occur in the future are detected and planned. There is a demand for equipment enhancement.

このような統合運用管理システムに関連する技術として、例えば以下に示す特許文献１乃至特許文献７などが挙げられる。 As technologies related to such an integrated operation management system, for example, Patent Documents 1 to 7 shown below can be cited.

特許文献１では、システム要素の処理の履歴と性能情報の変化の履歴とを比較することによって、特定の処理に起因する負荷量を特定し、将来の処理量での負荷を分析する。この運用管理装置では、予め処理と負荷との関係が把握できている場合には、システムの挙動を特定することができる。 In Patent Document 1, a load amount resulting from a specific process is specified by comparing a history of processing of system elements and a history of changes in performance information, and a load at a future processing amount is analyzed. In this operation management apparatus, the behavior of the system can be specified when the relationship between the processing and the load can be grasped in advance.

特許文献２では、システムの構成要素間の関係の大きさを稼動情報を元に定量化することにより障害の原因となる構成要素を特定する。この運用管理装置では、異常となった要素に対して、その時点で性能値に相関のある要素が重み付けされて一覧表示されることで、原因の候補を列挙する。 In Patent Document 2, the size of the relationship between the system components is quantified based on the operation information to identify the component causing the failure. In this operation management apparatus, the cause candidates are listed by weighting the elements that have a correlation with the performance values at that point in time and displaying them in a list.

すなわち、特許文献２では、管理対象システムと、ネットワークと、運用管理サーバを備え、管理対象システムから稼動情報収集アダプタを介して収集された各構成要素の稼動情報は稼動情報格納部に格納される。分析演算部では、任意の、もしくはあらかじめ設定した値の範囲を越えた稼動情報を1つ選択し、それ以外の稼動情報との関連の大きさを定量化する。定量化の演算の際には、稼動情報収集部から逐次必要な稼動情報を抽出する。演算の対象となった稼動情報のうち、定量化された関連の値があらかじめ設定した値の範囲を越えたものについて、性能のボトルネックや障害の原因となっている可能性が高いとし、入出力部に報告する。 That is, Patent Document 2 includes a management target system, a network, and an operation management server, and the operation information of each component collected from the management target system via the operation information collection adapter is stored in the operation information storage unit. . The analysis calculation unit selects one piece of operation information that exceeds an arbitrary or preset value range, and quantifies the magnitude of the relationship with other pieces of operation information. During the quantification calculation, necessary operation information is sequentially extracted from the operation information collection unit. Of the operation information that is subject to calculation, if the quantified related value exceeds the preset value range, it is highly likely that it will cause a performance bottleneck or failure. Report to output section.

特許文献３では、稼働情報収集部は、監視対象システム内の複数の装置から、ＩＣＭＰ、ＳＮＭＰ、ｒｓｈなどを利用して、ＣＰＵ、ＮｅｔｗｏｒｋＩＯ（ネットワークＩｎｐｕｔ／Ｏｕｔｐｕｔ）などのハードウェア稼動情報、Ｗｅｂサーバのアクセス量、ＤＢサーバの処理クエリ量などのアプリケーション稼動情報を一定時間間隔で取得し、稼動情報ＤＢに保存する。 In Patent Literature 3, an operation information collection unit uses a plurality of devices in a monitoring target system using ICMP, SNMP, rsh, etc., and hardware operation information such as CPU, Network IO (network input / output), Web Application operation information such as the server access amount and DB server processing query amount is acquired at regular time intervals and stored in the operation information DB.

前処理部は、稼働情報ＤＢに格納されている各構成要素の稼働情報間の統計的分析値を求める統計処理を行う。前処理部は、例えば、各稼働情報間の相関係数を求めたり、各稼働情報間で主成分分析を行ったりして、統計的分析値を求める。この統計的分析値は、所定時刻における各装置の稼働情報間の関連度を示す値となっている。例えば特許文献５の図２では、「サーバ１のＣＰＵ使用率」と「サーバ２のＣＰＵ使用率」との相関係数が「０．９３」等となっている。相関係数は、２つの変量間の相関関係の程度を表す数値である。 The preprocessing unit performs a statistical process for obtaining a statistical analysis value between the pieces of operation information of each component stored in the operation information DB. For example, the preprocessing unit obtains a statistical analysis value by obtaining a correlation coefficient between pieces of operation information or performing principal component analysis between pieces of operation information. This statistical analysis value is a value indicating the degree of association between the operation information of each device at a predetermined time. For example, in FIG. 2 of Patent Document 5, the correlation coefficient between “CPU usage rate of server 1” and “CPU usage rate of server 2” is “0.93” or the like. The correlation coefficient is a numerical value indicating the degree of correlation between two variables.

このような運用管理装置では、まず、監視対象となるサーバ、ネットワーク機器等から、ＣＰＵ使用率のようなハードウェア稼動情報、またＷｅｂサーバであれば、アクセスの状況といったアプリケーションレベルの情報を定期的に取得するようにし、正常なアクセス時や、障害発生時といった各状況における稼動情報から、各状況を特徴付ける“取得値間の関連”を相関分析・主成分分析といった統計的手法を用いて算出し、各状況のモデルを定義してモデル情報ＤＢに保持しておく。 In such an operation management apparatus, first, hardware operation information such as a CPU usage rate from a monitoring target server or network device, or application level information such as an access status for a Web server is periodically transmitted. Based on the operation information in each situation such as normal access or failure occurrence, the “relation between acquired values” that characterizes each situation is calculated using statistical methods such as correlation analysis and principal component analysis. The model of each situation is defined and stored in the model information DB.

そして、運用時には、定期的に、あるいは障害のアラートや、提供サービスのレスポンス低下などをトリガとして、現在の稼働情報に対して定義したモデルと同様の統計的手法を行い、モデル情報ＤＢに定義したモデルと比較して、マッチしたモデルの状況を現在置かれている状況として識別する。 At the time of operation, a statistical method similar to the model defined for the current operation information is performed periodically or triggered by a failure alert or a decrease in response of the provided service, and defined in the model information DB. Compare the model to identify the situation of the matched model as the current situation.

特許文献４では、モニタ部は、ＡＣ環境及び非ＡＣ環境の状態に係る状態情報を取得し、分析部又はモデル診断部は、取得された状態情報に基づいて、ＡＣ環境の装置の状態を判定する。シミュレーション部は、その判定結果に対応する対策リストを参照し、対策リストに含まれる少なくとも一つの対策夫々によるシミュレーション処理を実行し、各対策の効果を評価する。 In Patent Document 4, the monitor unit acquires state information regarding the states of the AC environment and the non-AC environment, and the analysis unit or the model diagnosis unit determines the state of the device in the AC environment based on the acquired state information. To do. The simulation unit refers to the countermeasure list corresponding to the determination result, executes a simulation process by each of at least one countermeasure included in the countermeasure list, and evaluates the effect of each countermeasure.

モデル抽出部は、時間に対するＣＰＵの使用率の関係を表す座標系において、時間１〜時間３の監視データをプロットし、プロットした各監視データの線形近似式（ｆａ（ｘ）＝αｘ＋β）を求めることによって、ＣＰＵ使用率の時系的変化を表すモデルを抽出する。モデル抽出部は、抽出したモデルを知識情報蓄積部に対して蓄積する。 The model extraction unit plots the monitoring data from time 1 to time 3 in a coordinate system representing the relationship of the CPU usage rate with respect to time, and obtains a linear approximate expression (fa (x) = αx + β) for each plotted monitoring data. As a result, a model representing a temporal change in the CPU usage rate is extracted. The model extraction unit stores the extracted model in the knowledge information storage unit.

同様にして、時間に対するスループットの関係を表す座標系においてもモデルを求める。 Similarly, a model is also obtained in a coordinate system representing the relationship of throughput with respect to time.

そして、モデル抽出部は、これらの２つのモデルに対して相関分析及び多変量解析を行うことで、処理Ａと処理Ｂとの夫々について、ＣＰＵ使用率とスループットとの相関を表す線形近似式（ｆＴＡ（ｘ）＝ρ１ｘ＋θ１、ｆＴＢ（ｘ）＝ρ２ｘ＋θ２）を求め、ＣＰＵ使用率とスループットとの相関を示すモデルを抽出する。モデル診断部は、各モデルに該当するポリシーを夫々参照し診断を実行する（特許文献６の段落番号００６０〜００６２）。 Then, the model extraction unit performs a correlation analysis and a multivariate analysis on these two models, so that a linear approximate expression (corresponding to the CPU usage rate and the throughput for each of the processing A and the processing B) ( fTA (x) = ρ1x + θ1, fTB (x) = ρ2x + θ2) is obtained, and a model indicating the correlation between the CPU usage rate and the throughput is extracted. The model diagnosis unit performs diagnosis by referring to the policy corresponding to each model (paragraph numbers 0060 to 0062 of Patent Document 6).

特許文献５では、コンピュータ上でどのようなワークロードが稼動中かを判断し、ワークロードのタイプに基づいてコレクタを始動し、ワークロード・ミックスに基づいてメトリックスのためのしきい値を設定し、メトリックスがしきい値を超えるときを（現在のワークロードおよび予測されるワークロードの両方で）判断し、メトリックスの相関をとってハードウェア・キャパシティが問題の原因かどうかを判断する。 In Patent Document 5, it is determined what kind of workload is running on the computer, a collector is started based on the type of workload, and a threshold for metrics is set based on the workload mix. , Determine when a metric exceeds a threshold (both current and predicted workload) and correlate the metric to determine if hardware capacity is the cause of the problem.

特開２００６−０２４０１７号公報JP 2006-024017 A 特開２００２−３４２１８２号公報JP 2002-342182 A 特開２００６−１４６６６８号公報JP 2006-146668 A 特開２００７−２０７１１７号公報JP 2007-207117 A 特表２００５−５２４８８６号公報JP 2005-524886 A

しかしながら、関連技術の運用管理装置においては、次のような不具合がある。 However, the related technology operation management apparatus has the following problems.

すなわち、関連技術の運用管理装置では、閾値を低く設定してしまうと、性能情報の変動が大きい場合などに誤報が多発して管理者が混乱する、という不具合があった。また、閾値を高く設定してしまうと、重大な障害以外検出できなくなり、システムは動作しているものの応答速度が劣化しているなどの性能異常の検出が困難となる、という不具合があった。 That is, in the related art operation management apparatus, if the threshold value is set low, there is a problem that the manager is confused because many false alarms occur when the performance information fluctuates greatly. Further, if the threshold value is set high, there is a problem that it becomes difficult to detect other than a serious failure, and it becomes difficult to detect a performance abnormality such as a system operating but a response speed deteriorated.

さらに、個々の要素毎の異常値は検出できるものの、ボトルネックなど入出力の関係にある他の要素の性能値との関係に起因する異常を検出することができない、という不具合があった。 Furthermore, although an abnormal value for each element can be detected, there is a problem that an abnormality caused by a relationship with a performance value of another element having an input / output relationship such as a bottleneck cannot be detected.

このように、関連技術の性能情報の閾値監視では、応答劣化などの性能異常を正確に検出し場所を特定することができない、という不具合があった。 As described above, in the threshold information monitoring of the performance information of the related technology, there is a problem in that it is impossible to accurately detect a performance abnormality such as response deterioration and specify a location.

さらに、異常時に性能情報の相関関係を算出する手法では、その相関関係が異常時のみ発生するものなのか平常時にも存在するものなのかを判断することが困難である、という不具合があった。 Furthermore, the method of calculating the correlation of performance information at the time of abnormality has a problem that it is difficult to determine whether the correlation is generated only at the time of abnormality or is present at normal times.

また、特許文献１では、正確な負荷を予測するためには、関係しうるすべての処理の履歴を収集して分析しなければならず、システムが大規模化したり、他システムと連携したりする場合には、処理と負荷の関係が極めて複雑になり、データ収集および分析の負荷が大きく、それを分析するための高度な知識も必要となる、という不具合があった。 Further, in Patent Document 1, in order to predict an accurate load, it is necessary to collect and analyze the history of all processes that can be related, and the system becomes large-scale or cooperates with other systems. In some cases, the relationship between the processing and the load is extremely complicated, the load of data collection and analysis is large, and a high level of knowledge for analyzing it is also required.

さらに、特許文献２では、障害が発生した時点の性能情報間の関係を提示するものであるため、ある異常と因果関係があり得る性能情報の中から、実際にどれが原因なのかを管理者が検証しなければならない、という不具合があった。 Furthermore, since Patent Document 2 presents a relationship between performance information at the time when a failure occurs, the administrator can determine which is actually the cause from performance information that may have a causal relationship with a certain abnormality. However, there was a defect that had to be verified.

また、これらの相関関係が、異常時のみ発生するものなのか、平常時にも存在するものなのかを判断することが困難である、という不具合があった。 In addition, there is a problem that it is difficult to determine whether these correlations occur only during an abnormality or exist during normal times.

特許文献３では、相関係数は値であるため、ある時点（異常時）の値の相関から、関係する異常原因を提示できるが、実在しない将来値の相関を算出することはできないので、ある異常と因果関係があり得る性能情報の中から、実際にどれが原因なのかを管理者が検証しなければならない、という不具合があった。また、障害の予兆を検出できない、という不具合があった。 In Patent Document 3, since the correlation coefficient is a value, a related abnormality cause can be presented from the correlation of the value at a certain point in time (at the time of abnormality), but the correlation of a future value that does not exist cannot be calculated. There was a problem that the administrator had to verify which is actually the cause from the performance information that could have a causal relationship with the abnormality. In addition, there was a problem that a sign of failure could not be detected.

特許文献４では、個々の性能情報の関数を推定するもの（第３の関連技術と同様）である。そして、ｙ＝ｆ（ｘ）の式において、ｘは時間で、１つのｙの時間変化を表す。この式を２つ用意して、別途与えた相関関係ルールで２つの関係を判定する。ルールは、自動生成されるわけではないので、システムを構成する各要素について全ての性能情報間のルールを別途与えなければ、ある異常と因果関係があり得る性能情報の中から、実際にどれが原因の要素なのかを正確に予測できない、という不具合があった。 In Patent Document 4, a function of individual performance information is estimated (similar to the third related technique). In the equation y = f (x), x is time and represents one time change of y. Two expressions are prepared, and the two relations are determined by the correlation rule given separately. Since the rules are not automatically generated, unless there is a separate rule between all pieces of performance information for each element that makes up the system, which of the pieces of performance information may have a causal relationship with a certain abnormality There was a problem that the cause could not be accurately predicted.

すなわち、ＣＰＵ使用率とスループットとの相関は、一の要素と他の要素との間の相関のみであるため、システムを構成する全要素についてのそれぞれ相関が不明であるため、ある異常と因果関係があり得る性能情報の中から、実際にどれが原因なのかをシステムの要素の中から管理者が検証しなければならない、という不具合があった。 In other words, the correlation between the CPU usage rate and the throughput is only the correlation between one element and the other elements, so the correlation for all the elements that make up the system is unknown. There is a problem that the administrator must verify from the system elements which is actually the cause from the possible performance information.

特許文献５では、ワークロードやメトリックスの変換は行われるが、いずれも値を算出しているものであるため、モデルを用いないため、ある異常と因果関係があり得る性能情報の中から、実際にどれが原因の要素なのかを正確に予測できず、検証には、これらの変換方法の全てを手作業で入力しなければならない、という不具合があった。 In Patent Document 5, although the workload and metrics are converted, since both values are calculated, since no model is used, performance information that may have a causal relationship with a certain abnormality is actually used. In the verification, all of these conversion methods have to be input manually.

本発明は、上記した技術の不具合を解決することを課題としてなされたものであって、その目的とするところは、障害の予兆を検出し、発生場所の特定が可能な運用管理装置、運用管理システム、情報処理方法、及び運用管理プログラムを提供することにある。 The present invention has been made to solve the problems of the above-described technology, and its purpose is to detect an indication of a failure and identify an occurrence location, an operation management device, and an operation management A system, an information processing method, and an operation management program are provided.

本発明の運用管理装置は、複数の被管理装置を含むシステムを運用管理する運用管理装置であって、前記被管理装置から取得された性能情報の時系列間の相関関係を表す相関関数を生成する生成部と、新たに取得された前記性能情報を前記相関関数に適用して当該相関関係が維持されているか否かを分析する分析部と、前記分析の結果に基づいて算出された被管理装置それぞれの異常度合を示す第１異常度とともに、前記システムの異常度合を示す第２異常度を表示する指示を行う表示指示部と、を備える。 An operation management apparatus according to the present invention is an operation management apparatus that operates and manages a system including a plurality of managed devices, and generates a correlation function representing a correlation between time series of performance information acquired from the managed device. A generating unit that performs analysis of whether or not the correlation is maintained by applying the newly acquired performance information to the correlation function, and a managed object calculated based on the result of the analysis A display instructing unit for giving an instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each device.

本発明の運用管理システムは、複数の被管理装置を含むシステムと、運用管理装置と、を備え、前記運用管理装置は、前記被管理装置から取得された性能情報の時系列間の相関関係を表す相関関数を生成する生成部と、新たに取得された前記性能情報を前記相関関数に適用して当該相関関係が維持されているか否かを分析する分析部と、前記分析の結果に基づいて算出された被管理装置それぞれの異常度合を示す第１異常度とともに、前記システムの異常度合を示す第２異常度を表示する指示を行う表示指示部と、を含む。 An operation management system according to the present invention includes a system including a plurality of managed devices and an operation management device, and the operation management device indicates a correlation between time series of performance information acquired from the managed device. A generating unit that generates a correlation function to represent, an analysis unit that analyzes whether or not the correlation is maintained by applying the newly acquired performance information to the correlation function, and based on the result of the analysis A display instruction unit for giving an instruction to display a second abnormality degree indicating the abnormality degree of the system together with the first abnormality degree indicating the calculated abnormality degree of each managed device.

本発明の情報処理方法は、システムに含まれる複数の被管理装置から取得された性能情報の時系列間の相関関係を表す相関関数を生成し、新たに取得された前記性能情報を前記相関関数に適用して当該相関関係が維持されているか否かを分析し、前記分析の結果に基づいて算出された被管理装置それぞれの異常度合を示す第１異常度とともに、前記システムの異常度合を示す第２異常度を表示する指示を行う。 The information processing method of the present invention generates a correlation function representing a correlation between time series of performance information acquired from a plurality of managed devices included in the system, and the newly acquired performance information is used as the correlation function. And whether or not the correlation is maintained, and indicates the degree of abnormality of the system together with the degree of abnormality of each managed device calculated based on the result of the analysis. An instruction to display the second abnormality degree is given.

本発明の運用管理プログラムは、コンピュータに、システムに含まれる複数の被管理装置から取得された性能情報の時系列間の相関関係を表す相関関数を生成し、新たに取得された前記性能情報を前記相関関数に適用して当該相関関係が維持されているか否かを分析し、前記分析の結果に基づいて算出された被管理装置それぞれの異常度合を示す第１異常度とともに、前記システムの異常度合を示す第２異常度を表示する指示を行う、処理を実行させる。 The operation management program of the present invention generates a correlation function representing a correlation between time series of performance information acquired from a plurality of managed devices included in the system, and newly acquires the performance information. Analyzing whether or not the correlation is maintained by applying to the correlation function, along with the first abnormality degree indicating the degree of abnormality of each managed device calculated based on the result of the analysis, the abnormality of the system An instruction to display the second degree of abnormality indicating the degree is executed.

本発明によれば、相関モデル生成部が、任意の２つの各要素の各性能情報の各時系列（性能系列情報）に対して相関関数を導出することで相関モデルを生成すし、相関変化分析部が、新たに性能情報を検出した場合に、この相関モデルの相関関数に従った性能情報であるか否か変化を分析し、相関モデルの相関関係の変化（相関関係の維持ないしは崩壊）を分析する。 According to the present invention, the correlation model generation unit generates a correlation model by deriving a correlation function for each time series (performance series information) of each piece of performance information of any two elements, and performs a correlation change analysis. When the unit detects new performance information, it analyzes whether the performance information conforms to the correlation function of this correlation model and changes the correlation of the correlation model (maintenance or collapse of the correlation). analyse.

これにより、正常時に生成した相関関係が崩れているかどうかで異常の発生場所（異常のある要素）を特定でき、検出された性能情報の相関関係をモデル化し、そのモデルの変化を監視することで、応答劣化などの性能異常、障害の予兆を正確に検出し発生場所を特定できるという、関連技術にない優れた運用管理装置、運用管理システム、情報処理方法、及び運用管理プログラムを提供することができる。 As a result, it is possible to identify the location of the abnormality (the element with the abnormality) depending on whether the correlation generated during normal operation is broken, model the correlation of the detected performance information, and monitor the changes in the model. Providing an excellent operation management device, operation management system, information processing method, and operation management program that are not available in related technologies, which can accurately detect performance anomalies such as response degradation and failure signs and identify the location of failure. it can.

本発明の第１の実施の形態による運用管理装置を含む運用管理システムの全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the operation management system containing the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置の前提となる構成の一例を示すブロック図である。It is a block diagram which shows an example of the structure used as the premise of the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置で利用する性能情報の一例を示す説明図である。It is explanatory drawing which shows an example of the performance information utilized with the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置の全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置の全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、変換関数同定を説明するための一例を示す説明図である。It is explanatory drawing which shows an example for demonstrating conversion function identification in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関モデルのデータ構造の一例を示す説明図である。It is explanatory drawing which shows an example of the data structure of a correlation model in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関モデルの他のデータ構造の一例を示す説明図である。It is explanatory drawing which shows an example of the other data structure of a correlation model in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関モデル選別の一例を示す説明図である。It is explanatory drawing which shows an example of correlation model selection in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関変化検出の一例を示す説明図である。It is explanatory drawing which shows an example of a correlation change detection in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において処理される全体の処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the whole process sequence processed in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関モデル生成の詳細処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the detailed process sequence of a correlation model production | generation in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、相関変化分析の詳細処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the detailed process sequence of a correlation change analysis in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第１の実施の形態による運用管理装置において、表示される表示画面の一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the display screen displayed in the operation management apparatus by the 1st Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置の全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置の全体構成の一例を示すブロック図である。It is a block diagram which shows an example of the whole structure of the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置において、相関モデル無効化の一例を示す説明図である。It is explanatory drawing which shows an example of correlation model invalidation in the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置において、相関モデルのデータ構造の一例を示す説明図である。It is explanatory drawing which shows an example of the data structure of a correlation model in the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置において処理される全体の処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the whole process sequence processed in the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置において、定常変化分析の詳細処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the detailed process sequence of a steady change analysis in the operation management apparatus by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による運用管理装置において、表示される表示画面の一例を説明するための説明図である。It is explanatory drawing for demonstrating an example of the display screen displayed in the operation management apparatus by the 2nd Embodiment of this invention.

〔運用管理装置の基本的構成〕
先ず、運用管理装置の基本的構成について説明する。本発明の運用管理装置（例えば図４に示す符号１００など）は、システムを構成する複数の被管理装置から複数種の性能種目毎の性能情報を取得して、前記被管理装置を運用管理するものを対象とするものである。 [Basic configuration of operation management device]
First, the basic configuration of the operation management apparatus will be described. The operation management apparatus (for example, reference numeral 100 shown in FIG. 4) of the present invention acquires performance information for each of a plurality of types of performance items from a plurality of managed apparatuses constituting the system, and manages the managed apparatus. It is intended for things.

この運用管理装置は、前記性能情報の時系列変化を示す性能系列情報から前記性能情報間の相関関数を含む相関モデルを生成する相関モデル生成部（例えば図４に示す符号１２３など）と、新たに検出された前記性能情報を前記相関関数に適用して当該相関関数で示される相関関係が維持されているか否かを分析して該分析結果に関する情報を出力する相関変化分析部（例えば図４に示す符号１２４など）と、を含む構成としている。 The operation management apparatus includes a correlation model generation unit (for example, reference numeral 123 shown in FIG. 4) that generates a correlation model including a correlation function between the performance information from the performance sequence information indicating the time series change of the performance information, and a new A correlation change analysis unit (for example, FIG. 4) that applies the detected performance information to the correlation function, analyzes whether the correlation indicated by the correlation function is maintained, and outputs information on the analysis result. The reference numeral 124 shown in FIG.

このような運用管理装置では、前記相関モデル生成部が、前記性能系列情報のうちの一方と他方の相関関数の係数を算出する処理を前記性能系列情報の全ての組み合わせについて行って前記相関モデルを生成する。 In such an operation management apparatus, the correlation model generation unit performs a process of calculating a coefficient of the correlation function of one of the performance series information and the other for all the combinations of the performance series information to obtain the correlation model. Generate.

また、前記相関変化分析部が、前記新たに検出された性能情報のうちの１項目を該性能情報に関連する前記相関関数に適用して予測性能情報を算出し、この予測性能情報と同一の項目について実際に検出された性能情報とを比較して予測誤差を算出する。そして、前記予測誤差が一定の誤差範囲内を満たす場合に、前記相関関数で示される相関関係が維持されていると判断する処理を、前記新たに検出された性能情報と関係する前記相関モデルに含まれる全ての相関関数について行う。 In addition, the correlation change analysis unit calculates predicted performance information by applying one item of the newly detected performance information to the correlation function related to the performance information, and is identical to the predicted performance information. The prediction error is calculated by comparing the performance information actually detected for the item. When the prediction error satisfies a certain error range, a process for determining that the correlation indicated by the correlation function is maintained is applied to the correlation model related to the newly detected performance information. Perform for all the correlation functions included.

これにより、正常時に生成した相関関係が崩れているかどうかで異常の発生場所（異常のある要素）を特定でき、検出された性能情報の相関関係をモデル化し、そのモデルの変化を監視することで、応答劣化などの性能異常、障害の予兆を正確に検出し発生場所を特定できる。 As a result, it is possible to identify the location of the abnormality (the element with the abnormality) depending on whether the correlation generated during normal operation is broken, model the correlation of the detected performance information, and monitor the changes in the model. In addition, it is possible to accurately detect performance abnormalities such as response deterioration and signs of failures, and to identify the location of occurrence.

以下、このような本発明の「運用管理装置」を、「運用管理システム」に適用した好適な実施の形態の一例について、図面を参照して具体的に説明する。 Hereinafter, an example of a preferred embodiment in which the “operation management apparatus” of the present invention is applied to an “operation management system” will be specifically described with reference to the drawings.

〔第１の実施の形態〕
（運用管理システムの全体構成）
先ず、本実施の形態の運用管理システムの具体的構成について、全体構成から説明し、続いて各部の詳細構成について説明することとする。 [First Embodiment]
(Overall configuration of operation management system)
First, the specific configuration of the operation management system of the present embodiment will be described from the overall configuration, and then the detailed configuration of each unit will be described.

図１は、本発明における第１実施の形態の運用管理装置を含む運用管理システムの全体の概略構成の一例を示すブロック図である。 FIG. 1 is a block diagram showing an example of an overall schematic configuration of an operation management system including an operation management apparatus according to the first embodiment of the present invention.

図１に示すように、本実施の形態の運用管理システム１は、複数の被管理装置の一例である各コンピュータ２と、これらの各コンピュータ２とネットワークＮを介して通信可能に形成され各コンピュータ２を運用管理する運用管理装置３と、を含んで構成される。 As shown in FIG. 1, the operation management system 1 of the present embodiment is configured such that each computer 2 which is an example of a plurality of managed devices and each computer 2 can communicate with each computer 2 via a network N. And an operation management apparatus 3 that manages the operation of the system 2.

運用管理装置３は、複数のコンピュータ２から複数種の性能種目毎（例えばＣＰＵ利用率やメモリ残量など）の性能情報を取得可能に構成される。 The operation management apparatus 3 is configured to be able to acquire performance information for each of a plurality of types of performance items (for example, CPU usage rate, remaining memory capacity, etc.) from a plurality of computers 2.

コンピュータ２、運用管理装置３は、プログラム制御により動作するものであり、ネットワーク関連の機能を有していれば、デスクトップ、ラップトップコンピュータ、サーバー、その他無線・有線通信機能を有する情報機器、またはこれに類するコンピュータなどいかなるコンピュータでもよく、移動式・固定式を問わない。 The computer 2 and the operation management device 3 operate under program control, and if they have network-related functions, they are desktops, laptop computers, servers, other information devices having wireless / wired communication functions, or the like. Any computer, such as a computer similar to the above, may be used.

運用管理装置３のハードウエア構成は、種々の情報等を表示するための表示部（スクリーン）、この表示部の表示画面上（の各種入力欄等）にデータを操作入力するための操作入力部（例えばキーボード・マウス等）、各種信号・データを送受信するための送受信部（通信部）、各種プログラム・各種データを記憶しておく記憶部（例えばメモリ、ハードディスク等）、これらの制御を司る制御部（例えばＣＰＵ等）などを有している。 The hardware configuration of the operation management apparatus 3 includes a display unit (screen) for displaying various information and the like, and an operation input unit for operating and inputting data on the display screen (various input fields thereof). (For example, keyboard / mouse), transmission / reception unit (communication unit) for transmitting / receiving various signals / data, storage unit (for example, memory, hard disk) for storing various programs / various data, control for controlling these Part (for example, CPU etc.) etc.

また、コンピュータ２は、ネットワーク機器やその他の機器、メインフレームなどであってもよい。 The computer 2 may be a network device, other devices, a main frame, or the like.

（前提となる構成）
ここで、本実施の形態の特徴的構成を説明する前に、本実施の形態の前提となる運用管理装置の構成について、図２、図３を用いて説明する。 (Prerequisite configuration)
Here, before describing the characteristic configuration of the present embodiment, the configuration of the operation management apparatus which is the premise of the present embodiment will be described with reference to FIGS. 2 and 3.

図２を参照すると、本実施の形態の前提となる構成を示す運用管理装置３は、サービス実行部２１と、性能情報蓄積処理部１２と、情報収集部２２と、分析設定蓄積処理部１４と、障害分析部２６と、管理者対話部２７と、対処実行部２８と、を含んで構成される。 Referring to FIG. 2, the operation management apparatus 3 showing the precondition of the present embodiment includes a service execution unit 21, a performance information accumulation processing unit 12, an information collection unit 22, an analysis setting accumulation processing unit 14, , A failure analysis unit 26, an administrator dialogue unit 27, and a countermeasure execution unit 28.

サービス実行部２１は、ＷＥＢサービスや業務サービスといった情報通信サービスを提供するものである。このサービス実行部２１は、他の独立したコンピュータなどで構成することもできる。 The service execution unit 21 provides information communication services such as a WEB service and a business service. The service execution unit 21 can also be configured by another independent computer or the like.

性能情報蓄積処理部１２は、サービス実行部２１の各々の要素の性能情報を蓄積する。 The performance information accumulation processing unit 12 accumulates performance information of each element of the service execution unit 21.

情報収集部２２は、サービス実行部２１の動作状態を検出して出力するとともに、動作状態に含まれる性能情報を性能情報蓄積処理部１２に蓄積する。 The information collection unit 22 detects and outputs the operation state of the service execution unit 21 and accumulates performance information included in the operation state in the performance information accumulation processing unit 12.

分析設定蓄積処理部１４は、サービス実行部２１の異常を検出するための分析設定を蓄積する。 The analysis setting accumulation processing unit 14 accumulates analysis settings for detecting an abnormality in the service execution unit 21.

障害分析部２６は、情報収集部２２から動作状態を受け取って分析設定蓄積処理部１４の分析設定に従って障害分析を行う。 The failure analysis unit 26 receives the operation state from the information collection unit 22 and performs failure analysis according to the analysis setting of the analysis setting accumulation processing unit 14.

管理者対話部２７は、障害分析部２６から障害分析の結果を受け取って管理者に提示し、管理者の入力を受け付け、管理者の入力に従って対処実行部２８に対処を指示する。 The administrator dialogue unit 27 receives the result of the failure analysis from the failure analysis unit 26, presents it to the administrator, receives the administrator's input, and instructs the countermeasure execution unit 28 according to the administrator's input.

対処実行部２８は、管理者対話部２７の指示に応じてサービス実行部２１上で障害の対処となる処理を実行する。 The coping execution unit 28 executes a process for coping with a failure on the service execution unit 21 in accordance with an instruction from the administrator dialogue unit 27.

図３は、情報収集部２２が出力し、性能情報蓄積処理部１２に蓄積される性能情報の例を示す。性能情報１２ａは、サービス実行部２８の状態変化に伴って順次収集される性能情報の一覧を示す。性能情報１２ａを参照すると、個々の性能情報は、同一時刻の各々の性能の値で構成され、それらが一定時間間隔でリストアップされたものとなっている。 FIG. 3 shows an example of performance information output from the information collection unit 22 and accumulated in the performance information accumulation processing unit 12. The performance information 12a indicates a list of performance information that is sequentially collected as the service execution unit 28 changes its state. Referring to the performance information 12a, each piece of performance information is composed of performance values at the same time, and these are listed at regular time intervals.

上述のような前提となる構成を有する運用管理装置３の動作について、図２、図３を用いて説明する。 The operation of the operation management apparatus 3 having the above-described configuration will be described with reference to FIGS.

まず、図２の情報収集部２２がサービス実行部２１の動作状態を検出し、性能情報蓄積処理部１２に性能情報を蓄積する。例えば、サービス実行部２１でＷＥＢサービスが実行されている場合、ＷＥＢサービスを提供する各サーバのＣＰＵ使用率やメモリ残量を一定時間間隔で検出する。 First, the information collection unit 22 in FIG. 2 detects the operating state of the service execution unit 21 and accumulates performance information in the performance information accumulation processing unit 12. For example, when the WEB service is executed by the service execution unit 21, the CPU usage rate and the remaining memory capacity of each server providing the WEB service are detected at regular time intervals.

図３の性能情報１２ａは、このようにして検出された性能情報の例である。例えば、「ＳＶ１．ＣＰＵ」は、１つのサーバのＣＰＵ利用率の値を示し、２００７年１０月５日の１７時２５分の値が１２である。さらに、１分間隔で１７時２６分から１５、３４、６３といった値が検出されている。同様に、「ＳＶ１．ＭＥＭ」は同じサーバのメモリ残量の値を、「ＳＶ２．ＣＰＵ」は別のサーバのＣＰＵ利用率の値を、それぞれ同時刻に検出したものである。 The performance information 12a in FIG. 3 is an example of performance information detected in this way. For example, “SV1.CPU” indicates the value of the CPU usage rate of one server, and the value of 17:25 on October 5, 2007 is 12. Further, values such as 15, 34, 63 are detected from 17:26 at intervals of 1 minute. Similarly, “SV1.MEM” detects the value of the remaining memory capacity of the same server, and “SV2.CPU” detects the value of the CPU usage rate of another server at the same time.

次に、障害分析部２６は、分析設定蓄積処理部１４に蓄積されている分析設定に従って障害分析を行う。分析設定としては、例えば、ＣＰＵ利用率が一定値以上であれば管理者に警告メッセージを提示するといったことが指定されており、これに従って、情報収集部２２で検出された性能情報の値から、特定のサーバの負荷が高くなっているかどうかを閾値判定する。 Next, the failure analysis unit 26 performs failure analysis according to the analysis settings stored in the analysis setting storage processing unit 14. As the analysis setting, for example, it is specified that a warning message is presented to the administrator if the CPU usage rate is equal to or higher than a certain value, and according to this, from the value of the performance information detected by the information collecting unit 22, A threshold is used to determine whether the load on a specific server is high.

管理者対話部２７は、このような障害分析の結果を管理者に提示し、管理者が何らかの対処を指示する入力を行った場合、対処実行部２８を介してサービス実行部２１上で対処コマンドを実行する。 The administrator dialogue unit 27 presents the result of such failure analysis to the administrator, and when the administrator inputs an instruction for some countermeasure, the countermeasure command is executed on the service execution unit 21 via the countermeasure execution unit 28. Execute.

例えば、管理者は、ＣＰＵ負荷が高くなっていることを知って、業務量を減らしたり、負荷分散を行うための構成変更を行ったりすることができる。 For example, the administrator knows that the CPU load is high, and can reduce the amount of work or change the configuration for load distribution.

この後、一定時間間隔で情報収集部２２によって収集された性能情報の値が低下していれば、障害分析部２６で障害が回復したと判断され、その結果が管理者対話部２７を介して管理者に提示されることになる。このような情報収集、分析、対処の処理の繰り返しにより、サービス実行部２１の障害対処が継続して行われる。 Thereafter, if the value of the performance information collected by the information collecting unit 22 is decreased at regular time intervals, the failure analyzing unit 26 determines that the failure has been recovered, and the result is sent via the administrator dialogue unit 27. It will be presented to the administrator. By repeating such information collection, analysis, and handling processes, the service execution unit 21 continues to handle failures.

このような前提となる構成に加えて、本実施の形態では、以下に示す特徴的構成を有する。 In addition to the presupposed configuration, the present embodiment has the following characteristic configuration.

（本実施の形態の特徴的構成）
ここで、本発明の第１の実施の形態の運用管理装置の特徴的構成について、図４を参照しつつ説明する。図４は、本発明の第１の実施の形態の運用管理装置の特徴的構成の一例を示すブロック図である。 (Characteristic configuration of the present embodiment)
Here, a characteristic configuration of the operation management apparatus according to the first embodiment of this invention will be described with reference to FIG. FIG. 4 is a block diagram illustrating an example of a characteristic configuration of the operation management apparatus according to the first embodiment of this invention.

図４に示すように、本発明の第１の実施の形態の運用管理装置１００は、図２に示す運用管理装置３と同様の構成である、サービス実行部１２１、性能情報蓄積処理部１１２、情報収集部１２２、分析設定蓄積処理部１１４、障害分析部１２６、管理者対話部１２７、対処実行部１２８に加えて、相関モデル生成部１２３と、相関モデル情報蓄積処理部１１６と、相関変化分析部１２４と、を含んで構成される。 As shown in FIG. 4, the operation management apparatus 100 according to the first embodiment of the present invention has the same configuration as that of the operation management apparatus 3 shown in FIG. 2, and includes a service execution unit 121, a performance information accumulation processing unit 112, In addition to the information collection unit 122, the analysis setting accumulation processing unit 114, the failure analysis unit 126, the administrator dialogue unit 127, and the countermeasure execution unit 128, the correlation model generation unit 123, the correlation model information accumulation processing unit 116, and the correlation change analysis Part 124.

相関モデル生成部１２３は、性能情報蓄積処理部１１６から一定期間の性能情報を取り出し、任意の２つの性能情報の値の時系列に対して、一方を入力とし他方を出力とした場合の変換関数を導出するとともに、この変換関数で生成された性能値の系列と出力となる性能情報の実際の検出値の系列とを比較し、その値の差から前記変換関数の重み情報を算出する。さらに、これらの処理をすべての性能情報に対して繰り返すことで、サービス実行部１２１の全体的な稼動状態の相関モデルを生成する。 The correlation model generation unit 123 extracts performance information for a certain period from the performance information accumulation processing unit 116, and converts one of the two time performance information values as an input and the other as an output. And a sequence of performance values generated by this conversion function and an actual detection value sequence of performance information to be output are compared, and weight information of the conversion function is calculated from the difference between the values. Furthermore, by repeating these processes for all performance information, a correlation model of the overall operating state of the service execution unit 121 is generated.

相関モデル蓄積処理部１１６は、相関モデル生成部１２３が生成した相関モデルを蓄積処理する。 The correlation model accumulation processing unit 116 accumulates the correlation model generated by the correlation model generation unit 123.

相関変化分析部１２４は、情報収集部１２２から新たに検出された性能情報を受け取り、この性能情報に含まれる性能値が相関モデル情報蓄積処理部１１６に蓄積される相関モデルの各々の性能情報間の変換関数で示された関係を一定の誤差範囲内で満たしているかを分析して、その結果を出力する。 The correlation change analysis unit 124 receives the newly detected performance information from the information collection unit 122, and the performance value included in the performance information is stored between the performance information of the correlation models stored in the correlation model information storage processing unit 116. Analyzes whether the relationship represented by the conversion function is satisfied within a certain error range, and outputs the result.

障害分析部１２６は、この相関変化分析部１２４の分析結果を受け取って、閾値判定などの他の分析とともに障害分析を行う。 The failure analysis unit 126 receives the analysis result of the correlation change analysis unit 124 and performs failure analysis together with other analysis such as threshold determination.

ここで、これらの各部は、図５に示すように、制御部が発揮する複数の機能として構成することもできる。 Here, as shown in FIG. 5, each of these units can be configured as a plurality of functions exhibited by the control unit.

また、前記相関変化分析部１２４は、前記新たに検出された前記第１の要素に関する性能情報と前記相関関数とに基づいて前記第２の要素に関する予測性能情報を算出し、前記新たに検出された前記第２の要素に関する性能情報と前記予測性能情報とを比較して予測誤差を算出し、この予測誤差が一定の誤差範囲内を満たすか否かを分析することもできる。 The correlation change analysis unit 124 calculates predicted performance information about the second element based on the newly detected performance information about the first element and the correlation function, and the newly detected It is also possible to compare the performance information related to the second element and the predicted performance information to calculate a prediction error, and analyze whether or not the prediction error satisfies a certain error range.

さらに、前記相関変化分析部１２４は、前記予測誤差が前記誤差範囲外となる場合に、前記第１の要素と前記第２の要素との相関関係が破壊されていると判断し、それぞれの要素の異常スコアを算出することもできる。 Further, the correlation change analysis unit 124 determines that the correlation between the first element and the second element is broken when the prediction error is outside the error range, and each element An abnormal score can also be calculated.

また、前記相関変化分析部１２４は、前記異常スコアに基づいて、前記各要素を順位付けして提示可能に制御することもできる。 In addition, the correlation change analysis unit 124 may control to rank and present each element based on the abnormality score.

（相関モデル生成について）
ここで、相関モデル生成部１２３による相関モデル生成の概要について、図６を参照して説明する。図６は、本実施の形態にかかる運用管理装置の相関関数生成の概要の一例を示す説明図である。 (Correlation model generation)
Here, an outline of correlation model generation by the correlation model generation unit 123 will be described with reference to FIG. FIG. 6 is an explanatory diagram showing an example of the outline of correlation function generation of the operation management apparatus according to the present embodiment.

相関関数生成は、図１２に示す相関関数（変換関数）を生成するステップＳ１０３（相関関数生成機能）および誤差を算出するステップＳ１０４（重み情報算出機能）の処理により行うことができる。 The correlation function generation can be performed by processing of step S103 (correlation function generation function) for generating a correlation function (conversion function) and step S104 (weight information calculation function) for calculating an error shown in FIG.

図６に示すように、変換関数Ｇ３００は、グラフＧ１０１に示す「ＳＶ１．ＣＰＵ」の性能値の系列（第１の性能系列情報）を入力とした場合に、グラフＧ１０１に示す「ＳＶ１．ＭＥＭ」の性能値の系列（第２の性能系列情報）を出力するものである。 As shown in FIG. 6, the conversion function G300 receives “SV1.MEM” shown in the graph G101 when the “SV1.CPU” performance value series (first performance series information) shown in the graph G101 is input. A series of performance values (second performance series information) is output.

この変換関数Ｇ３００を、システム同定処理Ｇ３０１に示す処理によって算出する。 This conversion function G300 is calculated by the process shown in the system identification process G301.

一例として、「ｙ＝Ａｘ＋Ｂ」の式で示される変換関数では、「Ａ＝−０．６」、「Ｂ＝１００」の値が算出される。 As an example, in the conversion function represented by the expression “y = Ax + B”, values of “A = −0.6” and “B = 100” are calculated.

さらに、グラフＧ３０２で示すように、この変換関数でグラフＧ１０１から生成された性能値の予測値の系列と、グラフＧ１０２で示される実際の性能値の差分から重みｗが生成される。 Further, as shown by a graph G302, a weight w is generated from the difference between the predicted performance value series generated from the graph G101 by this conversion function and the actual performance value difference shown by the graph G102.

図７は、本実施の形態にかかる運用管理装置の相関モデルの例である。相関モデル１１６ａは、変換関数の入力となる性能値の系列の名称と、出力となる性能値の系列の名称と、変換関数を特定する各々係数の値と、重み情報と、相関関係が有効か無効かを示す相関関係判定情報と、を含んで構成される。例えば、図６に示す変換関数を算出した結果として、「ＳＶ１．ＣＰＵ」を入力とし、「ＳＶ１．ＭＥＭ」を出力とし、「ｙ＝Ａｘ＋Ｂ」の式で示される変換関数は、係数Ａの値が「−０．６」、係数Ｂの値が「１００」、重みが「０．８８」となる相関関係が蓄積されている。さらに、相関関係が「有効」となっており、相関関係が破壊されておらず、維持されていることを意味する。 FIG. 7 is an example of a correlation model of the operation management apparatus according to this embodiment. In the correlation model 116a, the name of the performance value series that is the input of the conversion function, the name of the performance value series that is the output, the value of each coefficient that specifies the conversion function, the weight information, and whether the correlation is valid And correlation determination information indicating invalidity. For example, as a result of calculating the conversion function shown in FIG. 6, “SV1.CPU” is input, “SV1.MEM” is output, and the conversion function represented by the expression “y = Ax + B” is the value of coefficient A Is stored as “−0.6”, the coefficient B is “100”, and the weight is “0.88”. Furthermore, the correlation is “valid”, which means that the correlation is not broken and maintained.

（相関変化分析について）
相関変化分析部１２４による相関変化分析の概要について、図９を参照して説明する。図９は、本実施の形態にかかる運用管理装置の相関変化分析の概要の一例を示す説明図である。 (About correlation change analysis)
The outline of the correlation change analysis by the correlation change analysis unit 124 will be described with reference to FIG. FIG. 9 is an explanatory diagram showing an example of the outline of the correlation change analysis of the operation management apparatus according to the present embodiment.

図９の相関グラフＧ３１０は、相関モデル情報蓄積処理部１１６の相関モデルの例であり、ＳＶ１〜ＳＶ３の３台のサーバのＣＰＵ利用率とＣＰＵ負荷を、それぞれ性能情報の要素Ａ〜Ｆで表している。 A correlation graph G310 in FIG. 9 is an example of a correlation model of the correlation model information accumulation processing unit 116. The CPU usage rate and the CPU load of the three servers SV1 to SV3 are represented by elements A to F of the performance information, respectively. ing.

例えば、要素Ａは、「ＳＶ１．ＣＰＵ」とあり、要素Ａに関する性能情報は、第１のサーバのＣＰＵ利用率であることを意味する。また、要素Ｄは、「ＳＶ３．ＭＥＭ」とあり、要素Ｄに関する性能情報は、第２のサーバのＣＰＵ負荷であることを意味する。 For example, the element A is “SV1.CPU”, and the performance information related to the element A means the CPU usage rate of the first server. The element D is “SV3.MEM”, and the performance information regarding the element D means that the CPU load of the second server.

そして、それぞれの要素の間を結ぶ線が、相関モデルの変換関数で表される関係であり、０〜１の範囲で表される重み情報が０．５以上の関係を太線で、それ以外のものを細線で表している。 A line connecting each element is a relationship represented by a transformation function of the correlation model, a weight information represented by a range of 0 to 1 is a relationship of 0.5 or more by a bold line, and other than that Things are represented by thin lines.

例えば、要素Ａと要素Ｂとの相関関係は、太線となっており、相関モデルの重み情報が０．５以上であることを意味する。また、要素Ａと要素Ｆとの相関関係は、細線となっており、相関モデルの重み情報が０．５未満であることを意味する。 For example, the correlation between the element A and the element B is a thick line, which means that the weight information of the correlation model is 0.5 or more. The correlation between element A and element F is a thin line, which means that the weight information of the correlation model is less than 0.5.

重み情報は、変換関数の誤差によって算出されるため、この線の太さが関係の強さを表している。相関変化分析部１２４は、例えば、相関グラフＧ３１０から重みが０．５以上であるような安定した関係のみを抽出し、相関モデルＧ３１１のような関係を得る。 Since the weight information is calculated from the error of the conversion function, the thickness of this line represents the strength of the relationship. For example, the correlation change analysis unit 124 extracts only a stable relationship having a weight of 0.5 or more from the correlation graph G310, and obtains a relationship such as a correlation model G311.

図１０は、本実施の形態にかかる運用管理装置において、新たに性能情報が検出された場合に、相関関係が破壊された様子の一例を示す説明図である。図１０に示す相関グラフＧ３１２では、相関グラフＧ３１１に示す相関関係のうち、要素Ａと要素Ｃ、要素Ｂと要素Ｃの相関が破壊（点線で示す）されている。 FIG. 10 is an explanatory diagram showing an example of how the correlation is destroyed when new performance information is detected in the operation management apparatus according to the present embodiment. In the correlation graph G312 illustrated in FIG. 10, among the correlations illustrated in the correlation graph G311, the correlation between the element A and the element C and the correlation between the element B and the element C is broken (indicated by a dotted line).

（処理手順について）
（全体処理）
次に、上述のような構成を有する運用管理装置における各部の処理は、方法としても実現可能であり、情報処理方法としての各種の処理手順について、図１１乃至図１３を参照しつつ説明する。 (About processing procedure)
(Overall processing)
Next, the processing of each unit in the operation management apparatus having the above-described configuration can be realized as a method, and various processing procedures as an information processing method will be described with reference to FIGS. 11 to 13.

図１１は、本発明の第１の実施の形態による運用管理装置における処理手順の一例を示すフローチャートである。 FIG. 11 is a flowchart illustrating an example of a processing procedure in the operation management apparatus according to the first embodiment of the present invention.

本実施の形態に係る情報処理方法は、システムを構成する複数の被管理装置から取得される複数種の性能種目毎の性能情報に基づいて、前記被管理装置を運用管理する制御する情報処理を（例えば１又は複数のコンピュータやその他の装置など）が行うものを対象とするものである。 The information processing method according to the present embodiment performs information processing for controlling and managing the managed device based on performance information for each of a plurality of types of performance items acquired from a plurality of managed devices constituting the system. (For example, one or a plurality of computers or other devices).

この情報処理方法は、基本的構成として、前記性能種目又は前記被管理装置を要素とした場合に、少なくとも第１の要素に関する性能情報の時系列変化を示す第１の性能系列情報と、第２の要素に関する性能情報の時系列変化を示す第２の性能系列情報との相関関数を導出し、この相関関数に基づいて相関モデルを生成し、この相関モデルを前記各要素間の組み合わせについて求める相関モデル生成ステップ（例えば図１１に示すステップＳ１１など）と、前記被管理装置から新たに検出し取得される前記性能情報に基づいて、前記相関モデルの変化を分析する相関変化分析ステップ（例えば図１１に示すステップＳ１２など）と、含むことができる。 In this information processing method, as a basic configuration, when the performance item or the managed device is used as an element, first performance series information indicating a time series change of performance information related to at least the first element; A correlation function with the second performance sequence information indicating the time series change of the performance information related to the elements of the first, a correlation model is generated based on the correlation function, and the correlation model is obtained for the combination between the elements. Based on the model generation step (for example, step S11 shown in FIG. 11) and the performance information newly detected and acquired from the managed device, a correlation change analysis step for analyzing the change of the correlation model (for example, FIG. 11). Step S12 shown in FIG.

以下、これらの「相関モデル生成」、「相関変化分析」の詳細処理について説明する。 Hereinafter, detailed processing of these “correlation model generation” and “correlation change analysis” will be described.

（相関モデル生成の詳細処理）
図１２は、本発明の第１の実施の形態による運用管理装置における相関モデル生成の詳細処理手順の一例を示すフローチャートである。 (Detailed processing of correlation model generation)
FIG. 12 is a flowchart illustrating an example of a detailed processing procedure for generating a correlation model in the operation management apparatus according to the first embodiment of the present invention.

本実施の形態における相関モデル生成の詳細処理では、まず、情報収集部１２２によってサービス実行部１２１の動作状態が収集され、性能情報蓄積処理部１２２に図３に示す性能情報１２ａが蓄積される。 In the detailed processing of generating the correlation model in the present embodiment, first, the operation state of the service execution unit 121 is collected by the information collecting unit 122, and the performance information 12a shown in FIG.

ここで、相関モデル生成部１２３は、性能情報蓄積処理部１１２から性能情報１２ａを読み込む（図１２に示すステップＳ１０１）。 Here, the correlation model generation unit 123 reads the performance information 12a from the performance information accumulation processing unit 112 (step S101 shown in FIG. 12).

次に、未分析の性能情報の有無を判定する（ステップＳ１０２）。 Next, the presence / absence of unanalyzed performance information is determined (step S102).

相関モデルが生成されていない状態では、未分析の性能情報があるため、性能情報間の変換関数を算出する処理（ステップＳ１０３）に移る。 In the state where the correlation model has not been generated, there is unanalyzed performance information, and therefore the process proceeds to processing for calculating a conversion function between performance information (step S103).

最初に、性能情報１２ａの「ＳＶ１．ＣＰＵ」の性能値の系列と「ＳＶ１．ＭＥＭ」の性能値の系列の変換関数を算出する。 First, a conversion function of the performance value series “SV1.CPU” and the performance value series “SV1.MEM” of the performance information 12a is calculated.

図６を参照すると、「ＳＶ１．ＣＰＵ」を入力ｘとし、「ＳＶ２．ＭＥＭ」を出力ｙとする変換関数Ｇ３００を、システム同定処理Ｇ３０１によって決定することになる。 Referring to FIG. 6, a conversion function G300 having “SV1.CPU” as an input x and “SV2.MEM” as an output y is determined by the system identification process G301.

一般的に、このようなシステム同定にはいくつかの手法があり、例えば、「ｙ＝Ａｘ（ｔ）＋Ｂｘ（ｔ−１）＋Ｃｘ（ｔ−２）＋Ｄｙ（ｔ−１）＋Ｅｙ（ｔ−２）＋Ｆ」といった式を用いて、ｘから算出したｙの時系列の値が実際に検出された値に最も近くなるように、変数Ａ〜Ｆの値を決定することで変換関数が算出できる。 In general, there are several methods for such system identification. For example, “y = Ax (t) + Bx (t−1) + Cx (t−2) + Dy (t−1) + Ey (t−2) ) + F ”, the conversion function can be calculated by determining the values of the variables A to F so that the time series value of y calculated from x is closest to the actually detected value.

以下では、説明を簡略化するために、式「ｙ＝Ａｘ＋Ｂ」のＡとＢを決定する形で説明するが、その例に限定されるものではなく、他のシステム同定手法を用いても１つの性能値の系列から他の性能値の系列が算出できる変換関数であれば同様の効果が得られるものである。 In the following, in order to simplify the description, description will be made in the form of determining A and B of the expression “y = Ax + B”, but the present invention is not limited to this example, and even if another system identification method is used, 1 is used. A similar effect can be obtained as long as the conversion function can calculate another performance value series from one performance value series.

図６のシステム同定処理Ｇ３０１を参照すると、関数として「ｙ＝Ａｘ＋Ｂ」を選択し、グラフＧ１０１からグラフＧ１０２を近似できるＡとＢの値として、それぞれ「−０．６」、「１００」を決定する（図１２に示すステップＳ１０３）。 Referring to the system identification process G301 in FIG. 6, “y = Ax + B” is selected as a function, and “−0.6” and “100” are determined as values of A and B that can approximate the graph G102 from the graph G101, respectively. (Step S103 shown in FIG. 12).

さらに、グラフＧ３０２に示すように、グラフＧ１０１をこの変換関数で算出した予測値の系列と、グラフＧ１０２の性能値の系列を比較して、その差分である変換誤差から、この変換関数の重み情報を算出する（図１２に示すステップＳ１０４）＜重み情報算出ステップないしは重み情報算出機能＞。 Further, as shown in the graph G302, the weight value information of the conversion function is calculated from the conversion error that is the difference between the predicted value series calculated by the conversion function of the graph G101 and the performance value series of the graph G102. (Step S104 shown in FIG. 12) <weight information calculation step or weight information calculation function>.

この後、算出された変換関数と重み情報を相関モデルに追加する（ステップＳ１０５）。 Thereafter, the calculated conversion function and weight information are added to the correlation model (step S105).

図７は、このようにして追加された相関モデルの例であり、「ＳＶ１．ＣＰＵ」と「ＳＶ２．ＭＥＭ」の相関として、Ａ、Ｂ、Ｗの値が蓄積されている。 FIG. 7 is an example of the correlation model added in this way, and the values of A, B, and W are accumulated as the correlation between “SV1.CPU” and “SV2.MEM”.

以降、同様にして、ステップＳ１０３からステップＳ１０５の処理を、性能情報１２ａに含まれる性能値の系列のすべての組合せに対して行うことで、相関モデル蓄積処理部１１６に現在のシステムの性能情報に関する相関モデルが完成する。 Thereafter, similarly, the processing from step S103 to step S105 is performed on all combinations of the performance value series included in the performance information 12a, so that the correlation model accumulation processing unit 116 is informed about the performance information of the current system. The correlation model is completed.

図８の相関モデル１１６ｂは、このようにして生成された相関モデルの例であり、図７の相関モデル１１６ａに加えて、「ＳＶ３．ＣＰＵ」や「ＳＶ２．ＣＰＵ」に関する変換関数が追加されている。 The correlation model 116b in FIG. 8 is an example of the correlation model generated in this way. In addition to the correlation model 116a in FIG. 7, conversion functions relating to “SV3.CPU” and “SV2.CPU” are added. Yes.

（相関変化分析の詳細処理）
次に、本実施の形態における相関変化分析の詳細処理について、図１３、図９、図１０を参照して説明する。図１３は、本実施の形態にかかる運用管理装置の相関変化分析の詳細処理手順の一例を示すフローチャートである。 (Detailed processing of correlation change analysis)
Next, detailed processing of correlation change analysis in the present embodiment will be described with reference to FIG. 13, FIG. 9, and FIG. FIG. 13 is a flowchart illustrating an example of a detailed processing procedure of correlation change analysis of the operation management apparatus according to the present embodiment.

まず、相関変化分析部１２４は、図１３に示すように、相関モデル情報蓄積処理部１１６から相関モデルを読み込み（図１３に示すステップＳ２０１）、相関モデルに含まれる重み情報に応じて相関を選別する（ステップＳ２０２）。 First, as shown in FIG. 13, the correlation change analysis unit 124 reads the correlation model from the correlation model information accumulation processing unit 116 (step S201 shown in FIG. 13), and selects the correlation according to the weight information included in the correlation model. (Step S202).

図１０に示す相関グラフＧ３１２では、相関グラフＧ３１１に示す相関関係のうち、要素Ａと要素Ｃ、要素Ｂと要素Ｃの相関が破壊（点線で示す）されている。 In the correlation graph G312 illustrated in FIG. 10, among the correlations illustrated in the correlation graph G311, the correlation between the element A and the element C and the correlation between the element B and the element C is broken (indicated by a dotted line).

次に、相関変化分析部１２４は、情報収集部１２２から新たに検出し取得した性能情報を取得する（ステップＳ２０３）。 Next, the correlation change analysis unit 124 acquires performance information newly detected and acquired from the information collection unit 122 (step S203).

例えば、図３の性能情報１２ａにおいて、最下行にある「２００７／１１／０７８：３０」時点の性能情報を得た場合、図８に示す相関モデル１１６ｂに記載された変換関数を順次探索する。 For example, in the performance information 12a of FIG. 3, when the performance information at the time “2007/11/07 8:30” in the bottom row is obtained, the conversion functions described in the correlation model 116b shown in FIG. 8 are sequentially searched. .

すなわち、未探索の相関モデルがあるか否かの判定処理を行う（ステップＳ２０４）。この判定処理において、未探索の相関モデルがないと判定された場合には、ステップＳ２０８に進み、破壊された相関の詳細を出力する。 That is, it is determined whether or not there is an unsearched correlation model (step S204). In this determination process, when it is determined that there is no unsearched correlation model, the process proceeds to step S208, and details of the destroyed correlation are output.

一方、この判定処理において、未探索の相関モデルがあると判定された場合には、ステップＳ２０５に進み、性能値の変換誤差を算出する。 On the other hand, in this determination process, when it is determined that there is an unsearched correlation model, the process proceeds to step S205, and a conversion error of the performance value is calculated.

例えば、「ＳＶ１．ＣＰＵ」の値「２０」に対して、（−０．６）＊（２０）＋１００を計算して「８８」という予測値を算出し、「ＳＶ１．ＭＥＭ」の検出値である「７９」と比較して誤差「９」という値を得る（ステップＳ２０５）。 For example, for the value “20” of “SV1.CPU”, (−0.6) * (20) +100 is calculated to calculate a predicted value of “88”, and the detected value of “SV1.MEM” A value of error “9” is obtained as compared with a certain “79” (step S205).

続いて、この誤差が検出値に占める割合を算出する。そして、この誤差が一定値以上あるか否か（一定範囲内にあるか否か）判定する処理を行う（ステップＳ２０６）。 Subsequently, the ratio of this error to the detected value is calculated. Then, a process for determining whether or not this error is equal to or greater than a certain value (whether or not it is within a certain range) is performed (step S206).

この判定処理において、この誤差が一定値以上ないと判定された場合には、ステップＳ２０４に戻り、以降の処理を繰り返す。 In this determination process, if it is determined that this error is not equal to or greater than a certain value, the process returns to step S204 and the subsequent processes are repeated.

一方、この判定処理において、この誤差が一定値以上あると判定された場合には、ステップＳ２０７に進み、相関破壊の異常スコアを算出し、ステップＳ２０４に戻る。 On the other hand, in this determination process, when it is determined that the error is greater than or equal to a certain value, the process proceeds to step S207, the correlation destruction abnormality score is calculated, and the process returns to step S204.

例えば、ステップＳ２０６で予め決めた値である２０％より小さい場合には、相関が維持されているとしてステップＳ２０４に戻る。 For example, if the value is smaller than 20% that is predetermined in step S206, the correlation is maintained and the process returns to step S204.

同様にして、「ＳＶ１．ＣＰＵ」と「ＳＶ２．ＣＰＵ」の予測誤差を算出し（ステップＳ２０５）、その値が２０％をオーバしていることを検知すると（ステップＳ２０６）、相関が崩れていると判断して、それぞれの要素の異常スコアを算出する（ステップＳ２０７）。 Similarly, the prediction error of “SV1.CPU” and “SV2.CPU” is calculated (step S205), and when it is detected that the value exceeds 20% (step S206), the correlation is broken. And the abnormal score of each element is calculated (step S207).

以降、順次すべての相関関係を探索して（ステップＳ２０４）、破壊された相関の一覧や異常スコアを含む分析結果を障害分析部１２６に出力する（ステップＳ２０８）。 Thereafter, all correlations are sequentially searched (step S204), and an analysis result including a list of destroyed correlations and an abnormal score is output to the failure analysis unit 126 (step S208).

図１０は、このようにして検出された相関破壊の様子を示す。相関グラフＧ３１２では、相関グラフＧ３１１に示す相関のうち、要素Ａと要素Ｃ、要素Ｂと要素Ｃの相関が破壊（点線で示す）されている。 FIG. 10 shows the state of correlation destruction detected in this way. In the correlation graph G312, among the correlations shown in the correlation graph G311, the correlation between the element A and the element C and the correlation between the element B and the element C is broken (indicated by a dotted line).

障害分析部１２６は、このような結果を受け取り、他の障害分析の結果と合わせて管理者に提示する。 The failure analysis unit 126 receives such a result and presents it to the administrator together with other failure analysis results.

このような相関変化分析の結果を示す表示画面として、例えば図１４に示すものが挙げられる。図１４は、運用管理装置の表示部に表示される表示画面の一例が示されている。同図では、相関変化分析における表示画面（相関変化分析結果画面）の一例が示されている。 An example of a display screen showing the result of such correlation change analysis is shown in FIG. FIG. 14 shows an example of a display screen displayed on the display unit of the operation management apparatus. In the figure, an example of a display screen (correlation change analysis result screen) in correlation change analysis is shown.

図１４に示すように、表示部に表示される表示画面Ｕ１００（相関変化分析結果画面）は、相関グラフを表示する相関グラフ表示部Ｕ１４０を有する。相関グラフ表示部Ｕ１４０は、先に述べた図９や図１０に示す相関グラフの状態、遷移状況を表示することができる。この例では、相関破壊が太線で、異常スコアの高い要素が太丸で示されている。 As shown in FIG. 14, the display screen U100 (correlation change analysis result screen) displayed on the display unit includes a correlation graph display unit U140 that displays a correlation graph. The correlation graph display unit U140 can display the state and transition state of the correlation graph shown in FIGS. 9 and 10 described above. In this example, the correlation destruction is indicated by a bold line, and the element having a high abnormality score is indicated by a bold circle.

さらに、表示画面Ｕ１００は、異常スコアの高い要素を順にリストアップした異常スコア要素リスト表示部Ｕ１２０を有する。異常スコア要素リスト表示部Ｕ１２０は、要素（性能種目）、その要素の異常スコアの量、その他の情報などを表示することができる。 Furthermore, the display screen U100 includes an abnormal score element list display unit U120 that lists elements with high abnormal scores in order. The abnormal score element list display unit U120 can display an element (performance item), the amount of abnormal score of the element, other information, and the like.

さらに、表示画面Ｕ１００は、相関グラフ表示部Ｕ１４０の相関グラフにおいて、相関破壊の割合や、異常スコア要素リスト表示部Ｕ１２０の異常スコア要素リストのうち、異常スコアが最大の要素などの相関変化分析結果を表示する分析結果表示部Ｕ１１０を有する。 Further, the display screen U100 displays the correlation change analysis result of the correlation failure rate in the correlation graph of the correlation graph display unit U140 and the element having the maximum abnormal score in the abnormal score element list of the abnormal score element list display unit U120. Has an analysis result display unit U110.

さらに、表示画面Ｕ１００は、相関破壊数の経時変化をグラフ化して表示する相関破壊数変化グラフ表示部Ｕ１３０を有する。 Furthermore, the display screen U100 includes a correlated destruction number change graph display unit U130 that displays the change over time in the number of correlated destructions as a graph.

またさらに、表示画面Ｕ１００は、破壊相関のリストを表示するための第１の表示操作部Ｕ１５２を有する。表示画面Ｕ１００は、選択要素の詳細情報を表示するための第２の表示操作部Ｕ１５４を有する。表示画面Ｕ１００は、相関変化分析結果画面の表示を終了するための第３の表示操作部Ｕ１５６を有する。 Furthermore, the display screen U100 includes a first display operation unit U152 for displaying a list of destruction correlations. The display screen U100 includes a second display operation unit U154 for displaying detailed information of the selected element. The display screen U100 includes a third display operation unit U156 for ending the display of the correlation change analysis result screen.

また、表示画面Ｕ１００は、図１４に示すように、相関変化分析の結果として、相関破壊として、相関グラフＧ３１１に示す相関関係の２５％である２つの相関が破壊されていることや、その２つの相関はどちらも要素Ｃに関係するものであることから、異常スコアの順序付け一覧では、要素Ｃである「ＳＶ２．ＣＰＵ」の異常スコアが高くなっていることが示される。 Further, as shown in FIG. 14, the display screen U100 shows that, as a result of the correlation change analysis, two correlations, which are 25% of the correlation shown in the correlation graph G311, are destroyed as a result of the correlation destruction. Since the two correlations are both related to the element C, the abnormal score ordering list indicates that the abnormal score of the element C “SV2.CPU” is high.

管理者は、この結果を参照し、性能値の異常が発生していること、および、それが「ＳＶ２．ＣＰＵ」に起因するものであることを知ることができる。 By referring to this result, the administrator can know that an abnormality in the performance value has occurred and that it is caused by “SV2.CPU”.

以上のステップＳ２０１乃至ステップＳ２０８により、前記相関変化分析ステップを行うことができる。 The correlation change analysis step can be performed by the above steps S201 to S208.

この相関変化分析ステップでは、前記新たに検出された前記第１の要素に関する性能情報と前記相関関数とに基づいて前記第２の要素に関する予測性能情報を算出し、前記新たに検出された前記第２の要素に関する性能情報と前記予測性能情報とを比較して予測誤差を算出し、この予測誤差が一定の誤差範囲内を満たすか否かを分析することができる。 In the correlation change analysis step, predicted performance information about the second element is calculated based on the newly detected performance information about the first element and the correlation function, and the newly detected first A prediction error can be calculated by comparing the performance information on the second element and the prediction performance information, and it can be analyzed whether or not the prediction error satisfies a certain error range.

また、前記相関変化分析ステップでは、前記予測誤差が前記誤差範囲外となる場合に、前記第１の要素と前記第２の要素との相関関係が破壊されていると判断し、それぞれの要素の異常スコアを算出することができる。 In the correlation change analysis step, when the prediction error is outside the error range, it is determined that the correlation between the first element and the second element is broken, and An abnormal score can be calculated.

さらに、前記相関変化分析ステップでは、前記異常スコアに基づいて、前記各要素を順位付けして提示可能に制御することができる。 Furthermore, in the correlation change analysis step, the respective elements can be ranked and controlled to be presented based on the abnormality score.

以上のように本実施の形態によれば、相関モデル生成部が、任意の２つの性能情報の値の時系列に対して、一方を入力とし他方を出力とした場合の変換関数を導出することで相関モデルを生成する。相関変化分析部が、新たに性能情報を検出した場合に、この相関モデルの変換関数に従った性能値であるかどうかを判定し、相関関係の崩れた数と量を含む情報を障害分析部に出力する。このように、正常時に学習した相関関係が崩れているかどうかで異常の発生場所を特定することができる。 As described above, according to the present embodiment, the correlation model generation unit derives a conversion function when one is input and the other is output with respect to a time series of two arbitrary performance information values. To generate a correlation model. When the correlation change analysis unit newly detects performance information, it determines whether or not the performance value is in accordance with the conversion function of this correlation model, and the failure analysis unit stores information including the number and amount of broken correlations. Output to. In this way, the location of occurrence of an abnormality can be specified based on whether or not the correlation learned during normal operation is broken.

これにより、関連技術の性能情報の閾値監視に比べて、応答劣化などの性能異常を正確に検出し場所を特定することができるという効果がある。 As a result, there is an effect that it is possible to accurately detect performance anomalies such as response deterioration and specify a place, as compared with threshold information monitoring of performance information of related technology.

また、関連技術の異常時に性能情報の相関関係を算出する手法に比べて、平常時の関係と異常時の関係を区別して提示することができるという効果がある。 In addition, compared to a method of calculating the correlation of performance information when the related technology is abnormal, there is an effect that the normal relationship and the abnormal relationship can be distinguished and presented.

さらに、これらの分析には、事前に知識となるデータを用意する必要がなく、性能情報以外の処理履歴等を収集する必要がないことから、管理者の負担を軽減し、システムの処理量の増大を防止することができる。 Furthermore, these analyzes do not require the preparation of knowledge data in advance, and it is not necessary to collect processing histories other than performance information, reducing the burden on the administrator and reducing the amount of system processing. An increase can be prevented.

また、関連技術では、１つの性能情報の時間変化関数としてモデルを生成しているため、１つの性能情報が、前回予測された値と同じかどうかを判断するものである。 In the related art, since a model is generated as a time change function of one piece of performance information, it is determined whether one piece of performance information is the same as a previously predicted value.

これに対して本実施の形態は、２つの性能情報の間の変換関数としてモデルを生成することで、任意の２つの性能情報の値の関係が維持されているかを判断できる。 In contrast, in the present embodiment, it is possible to determine whether the relationship between the values of any two pieces of performance information is maintained by generating a model as a conversion function between the two pieces of performance information.

さらに、関連技術では、２つの性能情報の間の「相関ルール」が用いられているが、このルールをどう生成するのかは一切記述されておらず、特異点を発見するためのルール生成の負担が大きい、という不具合があった。 Furthermore, in the related technology, a “correlation rule” between two pieces of performance information is used, but how to generate this rule is not described at all, and the burden of rule generation for finding singularities is not described. There was a problem that was large.

また、関連技術では、相関係数は値であって、２つの性能情報間の変換関数を算出していない。変換関数を導出する手法と、他の分析手法では特性が異なる。関連技術では、演算の結果として、予め用意しておくどのモデルと類似しているかを導出できるが、用意するモデルの中身を決める手法については不明である。 In the related art, the correlation coefficient is a value, and a conversion function between two pieces of performance information is not calculated. The characteristics of the method for deriving the conversion function are different from those of other analysis methods. In the related art, as a result of the calculation, it is possible to derive which model is similar in advance, but the method for determining the contents of the prepared model is unknown.

これに対して本実施の形態では、図９に示すように強い相関を抽出することで、誤報の少ないモデルを生成することができる。また、１対１の変換関数から分析することで、図１４に示すような要素別の異常度のランキングや異常要素の図示ができる。 On the other hand, in the present embodiment, a model with few false alarms can be generated by extracting a strong correlation as shown in FIG. Further, by analyzing from a one-to-one conversion function, it is possible to rank the degree of abnormality for each element as shown in FIG.

ここで、図４に示すブロック図における一部の各ブロック（例えば符号１２３、１２４、１２１、１２２、１２６、１２７、１２８等）は、コンピュータが適宜なメモリに格納された各種プログラムを実行することにより、該プログラムにより機能化された状態を示すソフトウエアモジュール構成であってもよい。 Here, some of the blocks (for example, reference numerals 123, 124, 121, 122, 126, 127, and 128) in the block diagram shown in FIG. 4 execute various programs stored in appropriate memories by the computer. Therefore, a software module configuration indicating a state functionalized by the program may be used.

すなわち、物理的構成は例えば一又は複数のＣＰＵ（或いは一又は複数のＣＰＵと一又は複数のメモリ）等ではあるが、各部（回路・手段）によるソフトウエア構成は、プログラムの制御によってＣＰＵが発揮する複数の機能を、それぞれ複数の部（手段）による構成要素として表現したものである。 That is, the physical configuration is, for example, one or a plurality of CPUs (or one or a plurality of CPUs and one or a plurality of memories), etc., but the software configuration by each unit (circuit / means) is exhibited by the CPU by controlling the program. A plurality of functions are expressed as components by a plurality of units (means).

ＣＰＵがプログラムによって実行されている動的状態（プログラムを構成する各手順を実行している状態）を機能表現した場合、ＣＰＵ内に各部（手段）が構成されることになる。プログラムが実行されていない静的状態にあっては、各手段の構成を実現するプログラム全体（或いは各手段の構成に含まれるプログラム各部）は、メモリなどの記憶領域に記憶されている。 When the CPU dynamically expresses a dynamic state (a state in which each procedure constituting the program is executed) executed by the program, each unit (means) is configured in the CPU. In a static state in which the program is not executed, the entire program (or each program part included in the configuration of each unit) that realizes the configuration of each unit is stored in a storage area such as a memory.

以上に示した各部（手段）の説明は、プログラムにより機能化されたコンピュータをプログラムの機能と共に説明したものと解釈することも出来るし、また、固有のハードウエアにより恒久的に機能化された複数の電子回路ブロックからなる装置を説明したものとも解釈することが出来ることは、当然である。したがって、これらの機能ブロックがハードウェアのみ、ソフトウェアのみ、またはそれらの組合せによっていろいろな形で実現でき、いずれかに限定されるものではない。 The description of each part (means) described above can be interpreted as a computer functionalized by a program together with the function of the program, or a plurality of functions permanently functioning by specific hardware. Naturally, it can be interpreted that the device comprising the electronic circuit block is described. Therefore, these functional blocks can be realized in various forms by hardware only, software only, or a combination thereof, and is not limited to any one.

また、各部は、通信可能な専用のコンピュータからなる装置としてそれぞれ構成し、これらの各装置により運用管理システムを構成してもよい。 In addition, each unit may be configured as a device including a dedicated computer capable of communication, and the operation management system may be configured by these devices.

［第２の実施の形態］
次に、本発明にかかる第２の実施の形態について、図１５に基づいて説明する。以下には、前記第１の実施の形態の実質的に同様の構成に関しては説明を省略し、異なる部分についてのみ述べる。図１５は、本発明の運用管理装置の第２の実施の形態の一例を示すブロック図である。 [Second Embodiment]
Next, a second embodiment according to the present invention will be described with reference to FIG. In the following, description of the substantially similar configuration of the first embodiment will be omitted, and only different parts will be described. FIG. 15 is a block diagram showing an example of the second embodiment of the operation management apparatus of the present invention.

本実施の形態における構成は、第１の実施の形態の図４を用いて説明した構成に加えて、変化履歴情報蓄積処理部２１８と、定常変化分析部２３１とを含む構成としている。 The configuration in the present embodiment includes a change history information accumulation processing unit 218 and a steady change analysis unit 231 in addition to the configuration described with reference to FIG. 4 of the first embodiment.

具体的には、本実施の形態の運用管理装置２００は、図１５に示すように、前記第１の実施の形態の構成である、サービス実行部２２１、性能情報蓄積処理部２１２、情報収集部２２２、分析設定蓄積処理部２１４、障害分析部２２６、管理者対話部２２７、対処実行部２２８、相関モデル生成部２２３、相関モデル情報蓄積処理部２１６、相関変化分析部２２４に加えて、変化履歴情報蓄積処理部２１８と、定常変化分析部２３１とを含んで構成される。 Specifically, as shown in FIG. 15, the operation management apparatus 200 according to the present embodiment includes a service execution unit 221, a performance information accumulation processing unit 212, and an information collection unit that are configured according to the first embodiment. 222, analysis setting accumulation processing unit 214, failure analysis unit 226, administrator dialogue unit 227, coping execution unit 228, correlation model generation unit 223, correlation model information accumulation processing unit 216, correlation change analysis unit 224, change history An information accumulation processing unit 218 and a steady change analysis unit 231 are included.

変化履歴情報蓄積処理部２１８は、相関変化分析部２２４により分析された相関変化の履歴情報を蓄積処理する。 The change history information accumulation processing unit 218 accumulates the correlation change history information analyzed by the correlation change analysis unit 224.

定常変化分析部２３１は、変化履歴情報蓄積処理部２１８から相関破壊の履歴を読み出し、一定期間連続して破壊されている相関関係を発見した場合に、相関モデル情報蓄積処理部２１６に蓄積される相関モデルの該当する相関関係を無効化する。 The steady-state change analysis unit 231 reads the correlation destruction history from the change history information accumulation processing unit 218, and accumulates the correlation model information accumulation processing unit 216 when it finds a correlation that has been destroyed for a certain period of time. Invalidate the corresponding correlation in the correlation model.

また、定常変化分析部２３１は、無効化された相関関係が一定の割合になった場合に、相関モデル生成部２２３に相関モデルの再生成を指示する。 In addition, the steady change analysis unit 231 instructs the correlation model generation unit 223 to regenerate the correlation model when the invalidated correlation reaches a certain ratio.

ここで、これらの各部は、図１６に示すように、制御部が発揮する複数の機能として構成することもできる。 Here, as shown in FIG. 16, each of these units can be configured as a plurality of functions exhibited by the control unit.

また、定常変化分析部２３１は、前記相関モデルの前記相関関係が定常的に破壊されているか否かを分析することができる。 Further, the steady change analysis unit 231 can analyze whether or not the correlation of the correlation model is constantly destroyed.

さらに、前記定常変化分析部２３１は、前記相関関係が定常的に破壊されている前記相関モデルを無効化することができる。 Further, the steady change analysis unit 231 can invalidate the correlation model in which the correlation is constantly destroyed.

また、前記定常変化分析部２３１は、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデル生成部に前記相関モデルの再生成を指示することができる。 The steady change analysis unit 231 may instruct the correlation model generation unit to regenerate the correlation model when the invalidated correlation model occupies a certain ratio or more in all correlation models. it can.

さらに、前記定常変化分析部２３１は、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデルの再生成が必要である旨を提示可能に制御することができる。 Further, the steady change analysis unit 231 controls to indicate that the correlation model needs to be regenerated when the invalidated correlation model occupies a certain ratio or more in all correlation models. be able to.

（定常分析における相関破壊について）
ここで、定常変化分析部２３１による定常分析における相関破壊の概要について、図１７を参照して説明する。図１７は、本実施の形態にかかる運用管理装置の定常分析における相関破壊の概要の一例を示す説明図である。 (About correlation destruction in steady-state analysis)
Here, an overview of correlation destruction in steady analysis by the steady change analysis unit 231 will be described with reference to FIG. FIG. 17 is an explanatory diagram showing an example of the outline of correlation destruction in the steady analysis of the operation management apparatus according to the present embodiment.

図１７に示すように、相関グラフＧ３２１では、要素Ｄと要素Ｅ、要素Ｅと要素Ｆの間の相関が定常的に破壊されている（点線で示す）。 As shown in FIG. 17, in the correlation graph G321, the correlation between the element D and the element E and between the element E and the element F is constantly broken (indicated by a dotted line).

定常変化分析部２３１が、図８に示す相関モデル１１６ｂにおいて、「ＳＶ３．ＣＰＵ」と「ＳＶ３．ＭＥＭ」の間、および、「ＳＶ２．ＭＥＭ」と「ＳＶ３．ＭＥＭ」の間の相関の有効欄を「×」とすることで、図１８に示す相関モデル２１６ｂに示すような相関モデルに修正される。 In the correlation model 116b shown in FIG. 8, the steady-state change analysis unit 231 has a correlation valid column between “SV3.CPU” and “SV3.MEM” and between “SV2.MEM” and “SV3.MEM”. By setting “x” to “x”, the correlation model as shown in the correlation model 216b shown in FIG. 18 is corrected.

この相関モデル２１６ｂに対応するグラフとして、例えば図１７に示すグラフＧ３２２などが挙げられる。 An example of the graph corresponding to the correlation model 216b is a graph G322 shown in FIG.

この後、相関変化分析部２２４がこの相関モデルを読み込み、無効化されていない相関関係のみで分析を行うことにより、これらの相関の破壊が常時検出されるのを防止することができる。 Thereafter, the correlation change analysis unit 224 reads this correlation model and performs analysis only with the correlation that has not been invalidated, thereby preventing the destruction of these correlations from being constantly detected.

（処理手順について）
次に、上述のような構成を有する運用管理装置における各部の処理は、方法としても実現可能であり、運用管理方法としての各種の処理手順について、図１９乃至図２０を参照しつつ説明する。図１９は、本発明の第２の実施の形態による運用管理装置における処理手順の一例を示すフローチャートである。 (About processing procedure)
Next, the processing of each unit in the operation management apparatus having the above-described configuration can be realized as a method, and various processing procedures as the operation management method will be described with reference to FIGS. 19 to 20. FIG. 19 is a flowchart illustrating an example of a processing procedure in the operation management apparatus according to the second embodiment of the present invention.

この情報処理方法は、基本的構成として、前記性能種目又は前記被管理装置を要素とした場合に、少なくとも第１の要素に関する性能情報の時系列変化を示す第１の性能系列情報と、第２の要素に関する性能情報の時系列変化を示す第２の性能系列情報との相関関数を導出し、この相関関数に基づいて相関モデルを生成し、この相関モデルを前記各要素間の組み合わせについて求める相関モデル生成ステップ（例えば図１９に示すステップＳ２１など）と、前記被管理装置から新たに検出し取得される前記性能情報に基づいて、前記相関モデルの変化を分析する相関変化分析ステップ（例えば図１９に示すステップＳ２２など）と、含むことができる。 In this information processing method, as a basic configuration, when the performance item or the managed device is used as an element, first performance series information indicating a time series change of performance information related to at least the first element; A correlation function with the second performance sequence information indicating the time series change of the performance information related to the elements of the first, a correlation model is generated based on the correlation function, and the correlation model is obtained for the combination between the elements. Based on the model generation step (for example, step S21 shown in FIG. 19) and the performance information newly detected and acquired from the managed device, a correlation change analysis step for analyzing the change of the correlation model (for example, FIG. 19). Step S22 shown in FIG.

さらに、この情報処理方法は、前記相関モデルの前記相関関係が定常的に破壊されているか否かを分析する定常変化分析ステップ（例えば図１９に示すステップＳ２３など）を含むことができる。 Furthermore, this information processing method can include a steady change analysis step (for example, step S23 shown in FIG. 19) for analyzing whether or not the correlation of the correlation model is constantly destroyed.

ここで、これらの「相関モデル生成」、「相関変化分析」の詳細処理については、第１の実施の形態で説明したものと同じであるため、以下には「定常変化分析」の詳細処理について説明する。 Here, since the detailed processing of “correlation model generation” and “correlation change analysis” is the same as that described in the first embodiment, the detailed processing of “steady change analysis” will be described below. explain.

（定常変化分析の詳細処理）
図２０は、本実施の形態にかかる運用管理装置の定常変化分析の詳細処理手順の一例を示すフローチャートである。 (Detailed processing of steady change analysis)
FIG. 20 is a flowchart illustrating an example of a detailed processing procedure of steady change analysis of the operation management apparatus according to the present embodiment.

本実施の形態の相関変化分析部２２４は、相関関係の破壊を検出した場合に、その履歴を変化履歴情報蓄積処理部２１８に蓄積処理する。 The correlation change analysis unit 224 according to the present embodiment accumulates the history in the change history information accumulation processing unit 218 when the destruction of the correlation is detected.

図２０に示すように、運用管理装置が備えたコンピュータの定常変化分析部２３１は、この相関破壊の履歴を読み出して相関破壊の履歴情報を取得する（ステップＳ３０１）。 As illustrated in FIG. 20, the steady change analysis unit 231 of the computer provided in the operation management apparatus reads the correlation destruction history and acquires the correlation destruction history information (step S301).

続いて、定常的に破壊される相関があるか否かの判定処理を行う（ステップＳ３０２）。 Subsequently, it is determined whether or not there is a correlation that is constantly destroyed (step S302).

この判定処理において、定常的に破壊される相関がないと判定された場合には、処理を終了する。 In this determination process, when it is determined that there is no correlation that is constantly destroyed, the process ends.

一方、この判定処理において、定常的に破壊される相関があると判定された場合には、ステップＳ３０３に進む。 On the other hand, in this determination process, when it is determined that there is a correlation that is constantly destroyed, the process proceeds to step S303.

すなわち、連続して破壊されている相関関係があった場合に（ステップＳ３０２）、相関モデル情報蓄積処理部２１６に蓄積されている相関モデルのうち、該当する相関関係を無効化する（ステップＳ３０３）。 That is, when there is a correlation that is continuously destroyed (step S302), the corresponding correlation among the correlation models stored in the correlation model information storage processing unit 216 is invalidated (step S303). .

図１７は、このような相関破壊の例を示す。図１７に示すように、相関グラフＧ３２１では、要素Ｄと要素Ｅ、要素Ｅと要素Ｆの間の相関が定常的に破壊されている（点線で示す）。 FIG. 17 shows an example of such correlation destruction. As shown in FIG. 17, in the correlation graph G321, the correlation between the element D and the element E and between the element E and the element F is constantly broken (indicated by a dotted line).

これにより、この相関モデル２１６ｂに対応するように、例えば図１７に示すグラフＧ３２２のようなグラフとなる。 Thus, a graph such as a graph G322 shown in FIG. 17 is obtained so as to correspond to the correlation model 216b.

さらに、定常変化分析部２３１は、相関モデル内で、このようにして無効化した相関関係が一定割合以上になったか否かの判定処理を行う（ステップＳ３０４）。 Further, the steady change analysis unit 231 performs a determination process as to whether or not the correlation thus invalidated exceeds a certain ratio in the correlation model (step S304).

この判定処理において、無効化した相関関係が一定割合以上にならないと判定された場合には、処理を終了する。 In this determination process, if it is determined that the invalidated correlation does not exceed a certain ratio, the process ends.

一方、この判定処理において、無効化した相関関係が一定割合以上になったと判定された場合には、ステップＳ３０５に進む。 On the other hand, in this determination process, if it is determined that the invalidated correlation has reached a certain ratio or more, the process proceeds to step S305.

すなわち、相関モデル内で、このようにして無効化した相関関係が一定割合以上になった場合に（ステップＳ３０４）、定常変化分析部２３１は、相関モデル生成部２２３に指示することで新たな相関モデルを生成する（ステップＳ３０５）。 That is, in the correlation model, when the correlation invalidated in this way becomes a certain ratio or more (step S304), the steady change analysis unit 231 instructs the correlation model generation unit 223 to create a new correlation. A model is generated (step S305).

このように相関モデルの無効化が多くなった場合の対話画面の一例を図２１に示す。図２１は、運用管理装置の表示部に表示される表示画面の一例が示されている。 FIG. 21 shows an example of the dialogue screen when the invalidation of the correlation model increases in this way. FIG. 21 shows an example of a display screen displayed on the display unit of the operation management apparatus.

図２１に示すように、表示部に表示される表示画面Ｕ２００（モデル再作成画面）は、モデル名、作成日時、相関数、定常破壊などのモデル関連情報を表示するモデル関連情報表示部Ｕ２２０を有する。 As shown in FIG. 21, the display screen U200 (model re-creation screen) displayed on the display unit includes a model-related information display unit U220 that displays model-related information such as a model name, creation date, correlation number, and steady fracture. Have.

さらに、表示画面Ｕ２００は、モデル作成に関するメッセージを表示した表示部Ｕ２１０を有する。さらに、表示画面Ｕ２００は、モデルを参照するための第１の表示操作部Ｕ２４２を有する。表示画面Ｕ２００は、特定の再作成条件にてモデルの再作成を行うための第２の表示操作部Ｕ２４４を有する。表示画面Ｕ２００は、モデル再作成画面の表示を終了するための第３の表示操作部Ｕ２４６を有する。 Furthermore, the display screen U200 includes a display unit U210 that displays a message related to model creation. Further, the display screen U200 includes a first display operation unit U242 for referring to the model. The display screen U200 includes a second display operation unit U244 for recreating a model under specific recreation conditions. The display screen U200 includes a third display operation unit U246 for ending the display of the model recreation screen.

このように、管理者は、システムからの情報により、分析に用いている相関モデルが現在の動作状況にそぐわなくなったことを知ることができる。 In this way, the administrator can know from the information from the system that the correlation model used for the analysis is no longer suitable for the current operation status.

以上のステップＳ３０１乃至ステップＳ３０５により、前記定常変化分析ステップを行うことができる。この定常変化分析ステップでは、前記相関関係が定常的に破壊されている前記相関モデルを無効化することができる。 The steady change analysis step can be performed by the above steps S301 to S305. In the steady change analysis step, the correlation model in which the correlation is constantly destroyed can be invalidated.

さらに、前記定常変化分析ステップでは、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデルの再生成を指示することができる。 Further, in the steady change analysis step, when the proportion of the invalidated correlation model in all the correlation models becomes equal to or greater than a certain level, the regeneration of the correlation model can be instructed.

また、前記定常変化分析ステップでは、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデルの再生成が必要である旨を提示可能に制御することができる。 Further, in the steady change analysis step, when the ratio of the invalidated correlation model to all correlation models becomes a certain ratio or more, control is performed so as to indicate that the correlation model needs to be regenerated. Can do.

以上のように本実施の形態によれば、前記第１の実施の形態と同様の作用効果を奏しながらも、定常変化分析部が一度作成した相関モデルで定常的に破壊される要素を無効化する。 As described above, according to the present embodiment, the elements that are constantly destroyed by the correlation model once created by the steady-state change analysis unit are invalidated while having the same operational effects as the first embodiment. To do.

これにより、システムの特性が徐々に変化する場合などにおいても、不要な異常検知を抑止して正確な性能異常検出を行うことができる。 Thereby, even when the characteristics of the system gradually change, unnecessary abnormality detection can be suppressed and accurate performance abnormality detection can be performed.

また、無効化される要素が多くなった場合に、相関モデルを再作成する必要性を管理者に提示することができるため、常に精度の高い分析を維持することができる。 Further, when the number of elements to be invalidated increases, the necessity of recreating the correlation model can be presented to the administrator, so that highly accurate analysis can always be maintained.

その他の構成およびその他のステップないしは機能並びにその作用効果については、前述した実施の形態の場合と同一となっている。また、上記の説明において、上述した各ステップの動作内容及び各部の構成要素並びにそれらによる各機能をプログラム化し、コンピュータに実行させてもよい。 Other configurations, other steps or functions, and the effects thereof are the same as those in the above-described embodiment. In the above description, the operation content of each step described above, the components of each unit, and the functions thereof may be programmed and executed by a computer.

[その他の各種変形例]
また、本発明にかかる装置及び方法は、そのいくつかの特定の実施の形態に従って説明してきたが、本発明の主旨および範囲から逸脱することなく本発明の本文に記述した実施の形態に対して種々の変形が可能である。 [Other variations]
Also, although the apparatus and method according to the present invention have been described according to some specific embodiments thereof, the embodiments described in the text of the present invention can be used without departing from the spirit and scope of the present invention. Various modifications are possible.

例えば、上記構成部材の数、位置、形状等は上記実施の形態に限定されず、本発明を実施する上で好適な数、位置、形状等にすることができる。すなわち、上記実施の形態では、相関変化分析における異常スコアを算出する際の相関が崩れているか否かの判断は、予測誤差が２０％をオーバする場合を示したが、本発明は、これらの数を制限するものではない。 For example, the number, position, shape, and the like of the constituent members are not limited to the above-described embodiment, and can be set to a suitable number, position, shape, and the like in practicing the present invention. That is, in the above embodiment, the judgment whether or not the correlation when calculating the abnormal score in the correlation change analysis is broken has shown the case where the prediction error exceeds 20%. It does not limit the number.

また、相関破壊あり、相関破壊なしの２分類する場合に限らず、複数段階に分類する場合であってもよい。 Further, the classification is not limited to two cases with correlation destruction and without correlation destruction, and may be classified into a plurality of stages.

本発明の運用管理ソフトウエアは、１台のＰＣにインストールする場合であっても、クライアント・サーバシステムにおける端末及びサーバ、Ｐ２Ｐで利用可能な構成であっても構わない。また、各種表示画面などはＷｅｂ上でアクセス可能な構成であっても構わない。 The operation management software of the present invention may be installed on a single PC, or may be configured to be used by terminals and servers in a client / server system and P2P. Various display screens may be accessible on the Web.

（プログラム）
また、前述した実施形態の機能を実現する本発明のソフトウエアのプログラムは、前述した各実施の形態における各種ブロック図などに示された処理部（処理手段）、機能などに対応したプログラムや、フローチャートなどに示された処理手順、処理手段、機能などに対応したプログラムなどにおいて各々処理される各処理プログラム、本明細書で全般的に記述される方法（ステップ）、説明された処理、データの全体もしくは各部を含む。 (program)
Further, the software program of the present invention that realizes the functions of the above-described embodiments is a program corresponding to the processing unit (processing means), functions, etc. shown in the various block diagrams in each of the above-described embodiments, Each processing program processed in the processing procedure, processing means, function, etc. shown in the flowchart etc., the method (step) generally described in this specification, the processing described, the data Including the whole or each part.

具体的には、本発明の運用管理プログラムは、システムを構成する複数の被管理装置から複数種の性能種目毎の性能情報を取得して、前記被管理装置を運用管理する運用管理装置が備えたコンピュータに諸機能を実現させることが可能なものである。 Specifically, the operation management program of the present invention is provided with an operation management apparatus that acquires performance information for each of a plurality of types of performance items from a plurality of managed apparatuses constituting the system, and operates and manages the managed apparatus. It is possible to realize various functions on a computer.

この運用管理プログラムは、前記性能種目又は前記被管理装置を要素とした場合に、少なくとも第１の要素に関する性能情報の時系列変化を示す第１の性能系列情報と、第２の要素に関する性能情報の時系列変化を示す第２の性能系列情報との相関関数を導出し、この相関関数に基づいて相関モデルを生成し、この相関モデルを前記各要素間の組み合わせについて求める相関モデル生成機能（例えば図４に示す符号１２３などの構成、図１１に示すステップＳ１１の機能など）と、前記被管理装置から新たに検出し取得される前記性能情報に基づいて、前記相関モデルの変化を分析する相関変化分析機能（例えば図４に示す符号１２４などの構成、図１１に示すステップＳ１２の機能など）と、を含む機能をコンピュータに実現させることができる。 The operation management program includes, when the performance item or the managed device is an element, first performance series information indicating a time series change of performance information relating to at least the first element, and performance information relating to the second element. A correlation model generation function (for example, a function for generating a correlation model based on the correlation function and obtaining a correlation model for the combination between the elements) Correlation for analyzing changes in the correlation model based on the configuration such as reference numeral 123 shown in FIG. 4 and the function of step S11 shown in FIG. 11 and the performance information newly detected and acquired from the managed device. A computer can realize a function including a change analysis function (for example, a configuration such as reference numeral 124 shown in FIG. 4 and a function of step S12 shown in FIG. 11). Kill.

また、この運用管理プログラムは、前記相関変化分析機能では、前記新たに検出された前記第１の要素に関する性能情報と前記相関関数とに基づいて前記第２の要素に関する予測性能情報を算出し、前記新たに検出された前記第２の要素に関する性能情報と前記予測性能情報とを比較して予測誤差を算出し、この予測誤差が一定の誤差範囲内を満たすか否かを分析する機能をコンピュータに実現させることができる。 In the correlation change analysis function, the operation management program calculates predicted performance information about the second element based on the newly detected performance information about the first element and the correlation function. The computer has a function of calculating a prediction error by comparing performance information relating to the newly detected second element and the predicted performance information, and analyzing whether the prediction error satisfies a certain error range. Can be realized.

さらに、この運用管理プログラムは、前記相関変化分析機能では、前記予測誤差が前記誤差範囲外となる場合に、前記第１の要素と前記第２の要素との相関関係が破壊されていると判断し、それぞれの要素の異常スコアを算出する機能をコンピュータに実現させることができる。 Further, the operation management program determines that the correlation between the first element and the second element is destroyed in the correlation change analysis function when the prediction error is outside the error range. In addition, it is possible to cause the computer to realize the function of calculating the abnormality score of each element.

また、この運用管理プログラムは、前記相関変化分析機能では、前記異常スコアに基づいて、前記各要素を順位付けして提示可能に制御する機能をコンピュータに実現させることができる。 In addition, the operation management program can cause the computer to realize a function of ranking and controlling the elements based on the abnormality score in the correlation change analysis function.

さらに、この運用管理プログラムは、前記相関モデルの前記相関関係が定常的に破壊されているか否かを分析する定常変化分析機能（例えば図１５に示す符号２３１などの構成、図１９に示すステップＳ２３の機能など）を含む機能をコンピュータに実現させることができる。 Further, the operation management program is configured to analyze whether or not the correlation of the correlation model is constantly destroyed (for example, a configuration such as reference numeral 231 shown in FIG. 15, step S23 shown in FIG. 19). Can be realized by a computer.

また、この運用管理プログラムは、前記定常変化分析機能では、前記相関関係が定常的に破壊されている前記相関モデルを無効化する機能をコンピュータに実現させることができる。 In addition, the operation management program can cause the computer to realize a function of invalidating the correlation model in which the correlation is constantly destroyed in the steady change analysis function.

さらに、この運用管理プログラムは、前記定常変化分析機能では、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデルの再生成を指示する機能をコンピュータに実現させることができる。 Further, the operation management program includes a function for instructing regeneration of the correlation model when the ratio of the invalidated correlation model to all correlation models exceeds a certain level in the steady change analysis function. Can be realized.

また、この運用管理プログラムは、前記定常変化分析機能では、無効化された前記相関モデルが全相関モデルに占める割合が一定以上となった場合に、前記相関モデルの再生成が必要である旨を提示可能に制御する機能をコンピュータに実現させることができる。 Further, the operation management program indicates that, in the steady change analysis function, the correlation model needs to be regenerated when the invalidated correlation model occupies a certain ratio or more in all correlation models. The function of controlling to be able to present can be realized in the computer.

プログラムは、オブジェクトコード、インタプリタにより実行されるプログラム、ＯＳに供給するスクリプトデータ等、プログラムの形態を問わない。プログラムは、高水準プロシージャ型またはオブジェクト指向プログラミング言語で、あるいは必要に応じてアセンブリまたはマシン言語で実装することができる。いずれの場合も、言語はコンパイラ型またはインタープリタ型言語であってもよい。上述のプログラムを、一般のパソコンや携帯型情報端末などで動作可能なアプリケーションソフトに組み込んだものも含む。 The program may be in any form such as an object code, a program executed by an interpreter, or script data supplied to the OS. The program can be implemented in a high level procedural or object oriented programming language, or in assembly or machine language as required. In either case, the language may be a compiler or interpreted language. Also included is a program in which the above-described program is incorporated into application software that can be operated on a general personal computer or a portable information terminal.

プログラムを供給する手法としては、電気通信回線（有線、無線を問わない）によってコンピュータと通信可能に接続された外部の機器から前記電気通信回線を通じて提供することも可能である。例えば、コンピュータのブラウザを用いてインターネットのホームページに接続し、該ホームページからプログラムそのもの、もしくは圧縮され自動インストール機能を含むファイルをハードディスク等の記録媒体にダウンロードすることによっても供給できる。また、プログラムを構成するプログラムコードを複数のファイルに分割し、それぞれのファイルを異なるホームページからダウンロードすることによっても実現可能である。つまり、本発明の機能処理をコンピュータで実現するためのプログラムファイルを複数のユーザに対してダウンロードさせるサーバも、本発明の範囲に含まれるものである。 As a method of supplying the program, it is also possible to provide the program from an external device that is communicably connected to the computer via an electric communication line (whether wired or wireless). For example, the program can be supplied by connecting to a homepage on the Internet using a browser on a computer and downloading the program itself or a compressed file including an automatic installation function from the homepage to a recording medium such as a hard disk. It can also be realized by dividing the program code constituting the program into a plurality of files and downloading each file from a different home page. That is, a server that allows a plurality of users to download a program file for realizing the functional processing of the present invention on a computer is also included in the scope of the present invention.

本発明のプログラムによれば、当該制御プログラムを格納するＲＯＭ等の記憶媒体から、当該制御プログラムをコンピュータ（ＣＰＵ）に読み込んで実行させれば、或いは、当該制御プログラムを、通信手段を介してコンピュータにダウンロードさせた後に実行させれば、上述した本発明に係る装置を比較的簡単に実現できる。発明の思想の具現化例として装置のソフトウェアとなる場合には、かかるソフトウェアを記憶した記憶媒体上においても当然に存在し、利用される。 According to the program of the present invention, if the control program is read into a computer (CPU) from a storage medium such as a ROM storing the control program and executed, or the control program is transmitted to the computer via communication means. If the device is executed after being downloaded, the above-described apparatus according to the present invention can be realized relatively easily. When the software of the apparatus is embodied as an embodiment of the idea of the invention, it naturally exists and is used on a storage medium storing such software.

また、プログラムは、一次複製品、二次複製品などの複製段階については全く問う余地無く同等である。プログラムの供給方法として通信回線を利用して行なう場合であれば通信回線が伝送媒体となって本発明が利用されることになる。むろん、プログラムの発明として特定することもできる。さらに、装置における従属請求項は、方法，プログラムにおいて従属請求項に対応した構成にすることも可能である。 Moreover, the program is the same without any question about the copying stage of the primary copy product, the secondary copy product, etc. If the program is supplied using a communication line, the communication line becomes a transmission medium and the present invention is used. Of course, it can also be specified as a program invention. Furthermore, the dependent claims in the apparatus may be configured to correspond to the dependent claims in the method and the program.

（情報記録媒体）
また、上述のプログラムを、情報記録媒体に記録した構成であってもよい。情報記録媒体には、上述のプログラムを含むアプリケーションプログラムが格納されており、コンピュータが当該情報記録媒体からアプリケーションプログラムを読み出し、当該アプリケーションプログラムをハードディスクにインストールすることが可能である。これにより、上述のプログラムは、磁気記録媒体、光記録媒体あるいはＲＯＭなどの情報記録媒体に記録してプログラムを提供することができる。そのようなプログラムが記録された情報記録媒体を、コンピュータにおいて使用することは、好都合な情報処理装置を構成する。 (Information recording medium)
Moreover, the structure which recorded the above-mentioned program on the information recording medium may be sufficient. The information recording medium stores an application program including the above-described program, and the computer can read the application program from the information recording medium and install the application program on the hard disk. Thus, the program can be provided by being recorded on an information recording medium such as a magnetic recording medium, an optical recording medium, or a ROM. Use of an information recording medium in which such a program is recorded in a computer constitutes a convenient information processing apparatus.

プログラムを供給するための情報記録媒体としては、例えばＲＯＭ、ＲＡＭ、フラッシュメモリやＳＲＡＭ等の半導体メモリ並びに集積回路、あるいはそれらを含むＵＳＢメモリやメモリカード、光ディスク、光磁気ディスク、磁気記録媒体等を用いてよく、さらに、フレキシブルディスク、ＣＤ−ＲＯＭ、ＣＤ―Ｒ、ＣＤ―ＲＷ、ＦＤ、ＤＶＤＲＯＭ、ＨＤＤＶＤ（ＨＤＤＶＤ−Ｒ−ＳＬ＜1層＞、ＨＤＤＶＤ−Ｒ−ＤＬ＜２層＞、ＨＤＤＶＤ−ＲＷ−ＳＬ、ＨＤＤＶＤ−ＲＷ−ＤＬ、ＨＤＤＶＤ−ＲＡＭ−ＳＬ）、ＤＶＤ±Ｒ−ＳＬ、ＤＶＤ±Ｒ−ＤＬ、ＤＶＤ±ＲＷ−ＳＬ、ＤＶＤ±ＲＷ−ＤＬ、ＤＶＤ−ＲＡＭ、Ｂｌｕ−ＲａｙＤｉｓｋ＜登録商標＞（ＢＤ−ＲーＳＬ、ＢＤ−Ｒ−ＤＬ、ＢＤ−ＲＥ−ＳＬ、ＢＤ−ＲＥ−ＤＬ）、ＭＯ、ＺＩＰ、磁気カード、磁気テープ、ＳＤカード、メモリスティック、不揮発性メモリカード、ＩＣカード、等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置、等に記録して構成して用いてよい。 As an information recording medium for supplying the program, for example, ROM, RAM, semiconductor memory such as flash memory and SRAM, and an integrated circuit, or a USB memory, memory card, optical disk, magneto-optical disk, magnetic recording medium and the like including them. Further, flexible disk, CD-ROM, CD-R, CD-RW, FD, DVDROM, HDDVD (HDDVD-R-SL <1 layer>, HDDVD-R-DL <2 layers>, HDDVD-RW) -SL, HDDVD-RW-DL, HDDVD-RAM-SL), DVD ± R-SL, DVD ± R-DL, DVD ± RW-SL, DVD ± RW-DL, DVD-RAM, Blu-Ray Disk <registration Trademark> (BD-R-SL, BD-R-DL, BD-RE-SL, BD-RE-DL), MO It is recorded on a portable medium such as ZIP, magnetic card, magnetic tape, SD card, memory stick, non-volatile memory card, IC card, etc., a storage device such as a hard disk built in a computer system, etc. Good.

さらに「情報記録媒体」は、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの（伝送媒体ないしは伝送波）、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。 Furthermore, the “information recording medium” is a medium that dynamically holds a program for a short time (transmission medium), such as a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. Or a transmission wave), a volatile memory inside a computer system serving as a server or a client in that case, and those holding a program for a certain period of time.

また、コンピュータ上で稼働しているＯＳ、端末（例えば携帯電話など）上のＲＴＯＳ等が処理の一部又は全部を行う場合にも、上記実施の形態と同等の機能を実現できると共に、同等の効果を得ることができる。 In addition, when an OS running on a computer, an RTOS on a terminal (for example, a mobile phone) performs part or all of the processing, the same functions as those in the above embodiment can be realized and An effect can be obtained.

さらに、プログラムを暗号化してＣＤ−ＲＯＭ等の記録媒体に格納してユーザに配布し、所定の条件をクリアしたユーザに対し、インターネットを介してホームページから暗号化を解く鍵情報をダウンロードさせ、その鍵情報を使用することにより暗号化されたプログラムを実行してコンピュータにインストールさせて実現することも可能である。この場合、本発明の構成は、プログラムの各構成要素（各種の手段、ステップ及びデータ）と、前記プログラム（各種の手段、ステップ及びデータ）を暗号化する暗号化手段と、を含んでよい。 Furthermore, the program is encrypted, stored in a recording medium such as a CD-ROM, distributed to the user, and the user who clears the predetermined condition is allowed to download key information for decryption from the homepage via the Internet, and It is also possible to execute the encrypted program by using the key information and install the program on a computer. In this case, the configuration of the present invention may include each component (various means, steps and data) of the program and encryption means for encrypting the program (various means, steps and data).

また、上記実施の形態では、クライアントサーバーシステムを例に説明したが、サーバを介さずに端末同士がネットワークを組み、相互にデータを送受信するピアツーピア（ＰｅｅｒＴｏＰｅｅｒ）通信によるシステムであってもよい。その際、管理装置は、ピア・ツゥ・ピア方式におけるマスタ端末であればよいまた、上述の実施の形態の「システム」を、他の「情報処理システム」と統合したシステムとして、これら全体を本発明の「システム」として構成することも一向に構わない。「情報処理システム」には、ＯＳや周辺機器等のハードウェアを含むものとする。 In the above embodiment, the client server system has been described as an example. However, a system based on peer-to-peer (Peer To Peer) communication in which terminals form a network without passing through a server and transmit / receive data to / from each other may be used. . At this time, the management apparatus may be a master terminal in the peer-to-peer system. Also, the “system” in the above-described embodiment is integrated with other “information processing system”, and the entire system is the main terminal. The configuration as the “system” of the invention may be performed in any way. The “information processing system” includes an OS and hardware such as peripheral devices.

また、前記実施の形態における「システム」とは、複数の装置が論理的に集合した物をいい、各構成の装置が同一筐体中にあるか否かは問わない。このため、本発明は、複数の機器から構成されるシステムに適用しても良いし、また、一つの機器からなる装置に適用しても良い。「システム」には、ＯＳや周辺機器等のハードウェアを含んでもよい。 In addition, the “system” in the above embodiments refers to a logical collection of a plurality of devices, and it does not matter whether the devices of each configuration are in the same housing. For this reason, this invention may be applied to the system comprised from a some apparatus, and may be applied to the apparatus which consists of one apparatus. The “system” may include hardware such as an OS and peripheral devices.

さらに、上述のプログラムなどが搭載される情報処理装置としては、サーバは、例えばパーソナルコンピュータに限らず、各種サーバー、ＥＷＳ（エンジニアリングワークステーション）、中型コンピュータ、メインフレームなどが挙げられる。情報端末は、以上の例に加えて、携帯型情報端末、各種モバイル端末、ＰＤＡ、携帯電話機、ウエアラブル情報端末、種々の（携帯型などの）テレビ・ＤＶＤレコーダ・各種音響機器及びそのリモコン、各種情報通信機能を搭載した家電機器、ネットワーク機能を有するゲーム機器等からも利用できる構成としても構わない。あるいは、これらの端末に表示されるアプリケーションとして改良されたものも本発明の範囲に含めることができる。 Furthermore, as an information processing apparatus in which the above-described program or the like is installed, the server is not limited to a personal computer, for example, but includes various servers, EWS (engineering workstation), medium-sized computers, mainframes, and the like. In addition to the above examples, information terminals include portable information terminals, various mobile terminals, PDAs, mobile phones, wearable information terminals, various (such as portable) televisions, DVD recorders, various acoustic devices and their remote controllers, A configuration that can be used from home appliances equipped with an information communication function, game devices having a network function, and the like may also be used. Or what was improved as an application displayed on these terminals can also be included in the scope of the present invention.

また、上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であっても良い。 Further, the program may be for realizing a part of the above-described functions, and further, a program that can realize the above-described functions in combination with a program already recorded in a computer system, a so-called difference file ( Difference program).

さらに、本明細書において、フローチャートに示されるステップは、記載された手順に従って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理を含むものである。また、実装では、プログラム手順（ステップ）が実行される順序を変更することができる。さらに、実装の必要に応じて、本明細書で説明した特定の手順（ステップ）を、組み合わされた手順（ステップ）として実装、除去、追加、または再配置することができる。 Further, in the present specification, the steps shown in the flowchart include processes that are executed in parallel or individually even if they are not necessarily processed in time series, as well as processes that are executed in time series according to the described procedure. It is a waste. In the implementation, the order in which the program procedures (steps) are executed can be changed. Further, certain procedures (steps) described herein can be implemented, removed, added, or rearranged as a combined procedure (step) as needed for implementation.

さらに、装置の各手段、各機能、各ステップの手順の機能などのプログラムの機能を、専用のハードウエア（例えば専用の半導体回路等）によりその機能を達成してもよく、プログラムの全機能のうち一部の機能をハードウエアで処理し、全機能のうちさらに他の機能をソフトウエアで処理するようにしてもよい。専用のハードウエアの場合、各部を集積回路例えばＬＳＩにて形成されてもよい。これらは個別に１チップ化されても良いし、一部または全部を含むように１チップ化されても良い。また、ＬＳＩには、ストリーミングエンジンなど他の機能ブロックが含まれていても良い。また、集積回路化の手法はＬＳＩに限るものではなく、専用回路又は汎用プロセサで実現してもよい。さらには、半導体技術の進歩又は派生する別技術によりＬＳＩに置き換わる集積回路化の技術が登場すれば、当然、その技術を用いて機能ブロックの集積化を行ってもよい。 Furthermore, the functions of the program such as each means of the apparatus, each function, and the procedure function of each step may be achieved by dedicated hardware (for example, a dedicated semiconductor circuit). Some of these functions may be processed by hardware, and other functions among all functions may be processed by software. In the case of dedicated hardware, each unit may be formed by an integrated circuit such as an LSI. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them. Further, the LSI may include other functional blocks such as a streaming engine. Further, the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible. Further, if integrated circuit technology comes out to replace LSI's as a result of the advancement of semiconductor technology or a derivative other technology, it is naturally also possible to carry out function block integration using this technology.

さらに、「通信」では、無線通信および有線通信は勿論、無線通信と有線通信とが混在した通信、即ち、ある区間では無線通信が行われ、他の区間では有線通信が行われるようなものであってもよい。さらに、ある装置から他の装置への通信が有線通信で行われ、他の装置からある装置への通信が無線通信で行われるようなものであってもよい。 Further, in “communication”, wireless communication and wired communication as well as communication in which wireless communication and wired communication are mixed, that is, wireless communication is performed in a certain section and wired communication is performed in another section. There may be. Further, communication from one device to another device may be performed by wired communication, and communication from another device to one device may be performed by wireless communication.

そして、この通信には通信網が含まれる。通信網を構成するネットワークとしては、例えば携帯電話回線網（基地局及び交換システムを含む）、公衆電話回線網、ＩＰ電話網、ＩＳＤＮ回線網などこれに類する各種回線網、インターネット（乃ち、ＴＣＰ・ＩＰプロトコルを用いた通信態様）やイントラネット、ＬＡＮ[イーサネット（登録商標）、やギガビットイーサネット（登録商標）などを含む]、ＷＡＮ、光ファイバー通信網、電力線通信網、ブロードバンド対応可能な各種専用回線網などいずれのハードウエア構成でもよい。さらに、ネットワークは、ＴＣＰ・ＩＰプロトコルの他、種々の通信プロトコルを用いたネットワークあるいはソフトウエア的に構築された仮想ネットワークやこれに類するあらゆるネットワークを含むネットワークなどいかなる通信プロトコルであってもよい。また、ネットワークは、有線に限らず、無線（衛星通信、各種高周波通信手段等を含む）ネットワーク（例えば、簡易電話システムや携帯電話のようなシングルキャリア通信システム、Ｗ―ＣＤＭＡやＩＥＥＥ８０２．１１ｂに準拠した無線ＬＡＮのようなスペクトラム拡散通信システム、ＩＥＥＥ８０２．１１ａやＨｉｐｅｒＬＡＮ／２のようなマルチキャリア通信システム、などを含むネットワーク）であっても構わず、これらの組み合わせを利用してもよく、他のネットワークと接続されたシステムであってもよい。さらに、ネットワークは、ポイントツーポイント、ポイントツーマルチポイント、マルチポイントツーマルチポイントなど如何なる形態でもよい。 This communication includes a communication network. As a network constituting the communication network, for example, a cellular phone network (including a base station and an exchange system), a public phone network, an IP phone network, an ISDN network such as various network networks, the Internet (Nochi, TCP. Communication mode using IP protocol), intranet, LAN [including Ethernet (registered trademark), gigabit Ethernet (registered trademark), etc.], WAN, optical fiber communication network, power line communication network, various dedicated line networks compatible with broadband, etc. Any hardware configuration may be used. In addition to the TCP / IP protocol, the network may be any communication protocol such as a network using various communication protocols, a virtual network constructed in software, or a network including any network similar thereto. Further, the network is not limited to a wired network, but includes a wireless (including satellite communication, various high-frequency communication means, etc.) network (for example, a single carrier communication system such as a simple telephone system or a mobile phone, W-CDMA or IEEE 802.11b). Network including a spread spectrum communication system such as a wireless LAN, a multi-carrier communication system such as IEEE802.11a and HiperLAN / 2, etc., or a combination of these may be used. It may be a system connected to a network. Further, the network may take any form such as point-to-point, point-to-multipoint, multipoint-to-multipoint.

また、運用管理装置と他の被管理装置との間の通信構造に際し、いずれか一方又は双方に形成されるインタフェースの種類は、例えばパラレルインタフェース、ＵＳＢインタフェース、ＩＥＥＥ１３９４、ＬＡＮやＷＡＮ等のネットワークやその他これに類するもの、もしくは今後開発される如何なるインタフェースであっても構わない。 In addition, in the communication structure between the operation management apparatus and another managed apparatus, the types of interfaces formed on one or both of them are, for example, a parallel interface, a USB interface, IEEE 1394, a network such as a LAN or WAN, and the like. Any interface that is similar to this or that will be developed in the future may be used.

さらに、相関モデルを生成し、相関変化分析を行う手法は、必ずしも実体のある装置に限られる必要はなく、その方法としても機能することは容易に理解できる。このため、方法にかかる発明も、必ずしも実体のある装置に限らず、その方法としても有効であることに相違はない。この場合、方法を実現するための一例として運用管理装置、運用管理システムなども含めることができる。 Furthermore, the method for generating the correlation model and performing the correlation change analysis is not necessarily limited to a substantial apparatus, and it can be easily understood that the method also functions as the method. For this reason, the invention relating to the method is not necessarily limited to a substantial apparatus, and there is no difference that the method is also effective. In this case, an operation management apparatus, an operation management system, or the like can be included as an example for realizing the method.

ところで、このような運用管理装置は、単独で存在する場合もあるし、ある機器に組み込まれた状態で利用されることもあるなど、発明の思想としてはこれに限らず、各種の態様を含むものである。従って、ソフトウェアであったりハードウェアであったりするなど、適宜、変更可能である。発明の思想の具現化例として装置のソフトウェアとなる場合には、かかるソフトウェアを記憶した記憶媒体上においても当然に存在し、利用されるといわざるをえない。 By the way, such an operation management apparatus may exist alone, or may be used in a state of being incorporated in a certain device, but the idea of the invention is not limited to this and includes various aspects. It is a waste. Therefore, it can be changed as appropriate, such as software or hardware. When the software of the apparatus is embodied as an embodiment of the idea of the invention, it naturally exists on the storage medium storing the software and is used.

さらに、一部がソフトウェアであって、一部がハードウェアで実現されている場合であってもよく、一部を記憶媒体上に記憶しておいて必要に応じて適宜読み込まれるような形態のものとしてあってもよい。本発明をソフトウェアで実現する場合、ハードウェアやオペレーティングシステムを利用する構成とすることも可能であるし、これらと切り離して実現することもできる。 Furthermore, it may be a case where a part is software and a part is realized by hardware, and a part is stored on a storage medium and is read as needed. It may be as a thing. When the present invention is realized by software, a configuration using hardware or an operating system may be used, or may be realized separately from these.

また、発明の範囲は、図示例に限定されないものとする。 The scope of the invention is not limited to the illustrated example.

さらに、上記各実施の形態には種々の段階が含まれており、開示される複数の構成要件における適宜な組み合わせにより種々の発明が抽出され得る。つまり、上述の各実施の形態同士、あるいはそれらのいずれかと各変形例のいずれかとの組み合わせによる例をも含む。この場合において、本実施形態において特に記載しなくとも、各実施の形態及びそれらの変形例に開示した各構成から自明な作用効果については、当然のことながら実施の形態の作用効果として含めることができる。逆に、本実施の形態に記載されたすべての作用効果を奏することのできる構成が、本発明の本質的特徴部分の必須構成要件であるとは限らない。また、実施の形態に示される全構成要件から幾つかの構成要件が削除された構成による実施の形態並びにその構成に基づく技術的範囲も発明になりうる。 Further, the above embodiments include various stages, and various inventions can be extracted by appropriately combining a plurality of disclosed constituent elements. That is, examples include combinations of the above-described embodiments, or any of them and any of the modifications. In this case, even if not specifically described in the present embodiment, the obvious effects from the respective configurations disclosed in the embodiments and their modifications are naturally included as the effects of the embodiments. it can. On the contrary, the configuration capable of exhibiting all the effects described in the present embodiment is not necessarily an essential component of the essential features of the present invention. In addition, an embodiment based on a configuration in which some of the configuration requirements are deleted from all the configuration requirements shown in the embodiment, and a technical scope based on the configuration may be an invention.

そして、各実施の形態及びそれらの変形例を含むこれまでの記述は、本発明の理解を容易にするために、本発明の多様な実施の形態のうちの一例の開示、すなわち、何れも本発明を実施するにあたっての具体化の例を示したものに過ぎず、例証するものであり、制限するものではなく、適宜変形及び／又は変更が可能である。本発明は、その技術思想、またはその主要な特徴に基づいて、様々な形で実施することができ、各実施の形態及びその変形例によって本発明の技術的範囲が限定的に解釈されてはならないものである。 In addition, the description so far including each of the embodiments and the modifications thereof is intended to facilitate the understanding of the present invention. The embodiments of the invention are merely shown as examples of implementation, are illustrative, not limiting, and can be modified and / or modified as appropriate. The present invention can be implemented in various forms based on its technical idea or its main features, and the technical scope of the present invention should not be construed in a limited manner by each embodiment and its modifications. It will not be.

従って、上記に開示された各要素は、本発明の技術的範囲に属する全ての設計変更や均等物を含む趣旨である。 Therefore, each element disclosed above is intended to include all design changes and equivalents belonging to the technical scope of the present invention.

本発明は、コンピュータ全般に適用可能である。 The present invention is applicable to all computers.

１運用管理システム
２コンピュータ（被管理装置）
３、１００、２００運用管理装置
１２、１１２、２１２性能情報蓄積処理部
１４、１１４、２１４分析設定蓄積処理部
２１、１２１、２２１サービス実行部
２２、１２２、２２２情報収集部
２６、１２６、２２６障害分析部
２７、１２７、２２７管理者対話部
２８、１２８、２２８対処実行部
１１６、２１６相関モデル蓄積処理部
１２３、２２３相関モデル生成部
１２４、２２４相関変化分析部
２１８変化履歴情報蓄積処理部
２３１定常変化分析部 1 Operation management system 2 Computer (managed device)
3, 100, 200 Operation management device 12, 112, 212 Performance information storage processing unit 14, 114, 214 Analysis setting storage processing unit 21, 121, 221 Service execution unit 22, 122, 222 Information collection unit 26, 126, 226 Failure Analysis unit 27, 127, 227 Administrator dialogue unit 28, 128, 228 Countermeasure execution unit 116, 216 Correlation model accumulation processing unit 123, 223 Correlation model generation unit 124, 224 Correlation change analysis unit 218 Change history information accumulation processing unit 231 Stationary Change analysis department

Claims

An operation management device for managing a system including a plurality of managed devices,
A generating unit that generates a correlation function representing a correlation between time series of performance information acquired from the managed device;
An analysis unit that analyzes whether the correlation is maintained by applying the newly acquired performance information to the correlation function;
A display instruction unit for giving an instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each managed device calculated based on the analysis result;
An operation management apparatus comprising:

The first degree of abnormality indicates the degree of abnormality of each component included in the managed device.
The operation management apparatus according to claim 1.

The display instruction unit gives an instruction to rank and display the components based on the first degree of abnormality.
The operation management apparatus according to claim 2.

The display instruction unit performs an instruction to distinguish and display a correlation in which the correlation is maintained and a correlation in which the correlation is not maintained in a graph representing a correlation between the components.
The operation management apparatus according to claim 2 or 3.

The display instruction unit performs an instruction to distinguish and display a component having the first degree of abnormality larger than other components in a graph representing a correlation between the components.
The operation management apparatus according to claim 2.

A system including a plurality of managed devices;
An operation management device;
With
The operation management device includes:
A generating unit that generates a correlation function representing a correlation between time series of performance information acquired from the managed device;
An analysis unit that analyzes whether the correlation is maintained by applying the newly acquired performance information to the correlation function;
A display instruction unit for giving an instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each managed device calculated based on the analysis result;
Operation management system including

Generate a correlation function representing a correlation between time series of performance information acquired from a plurality of managed devices included in the system,
Analyzing whether the correlation is maintained by applying the newly acquired performance information to the correlation function,
An instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each managed device calculated based on the result of the analysis,
Information processing method.

The first degree of abnormality indicates the degree of abnormality of each component included in the managed device.
The information processing method according to claim 7.

When performing the instruction, an instruction to rank and display the components based on the first abnormality degree is given.
The information processing method according to claim 8.

When performing the instruction, in the graph representing the correlation between the components, an instruction to distinguish and display the correlation in which the correlation is maintained and the correlation in which the correlation is not maintained,
The information processing method according to claim 8 or 9.

In the case of performing the instruction, in the graph representing the correlation between the constituent elements, an instruction to distinguish and display the constituent elements having the first abnormality degree larger than the other constituent elements is performed.
The information processing method according to claim 8.

On the computer,
Generate a correlation function representing a correlation between time series of performance information acquired from a plurality of managed devices included in the system,
Analyzing whether the correlation is maintained by applying the newly acquired performance information to the correlation function,
An instruction to display a second abnormality degree indicating the abnormality degree of the system together with a first abnormality degree indicating the abnormality degree of each managed device calculated based on the result of the analysis,
An operation management program that executes processing.