JP6311329B2

JP6311329B2 - Information processing apparatus, monitoring method, and program

Info

Publication number: JP6311329B2
Application number: JP2014013895A
Authority: JP
Inventors: 善行後藤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2014-01-29
Filing date: 2014-01-29
Publication date: 2018-04-18
Anticipated expiration: 2034-01-29
Also published as: JP2015141547A

Description

本発明は、情報処理装置、監視方法、及び、プログラムに関する。 The present invention relates to an information processing apparatus, a monitoring method, and a program.

近年、ＳａａＳ（Software as a Service）やＰａａＳ（Platform as a Service）と呼ばれるクラウドコンピューティングを用いたシステムの構築が広まっている。クラウドコンピューティング環境では、仮想化技術により、ユーザ数やデータ量に応じて仮想マシン等のリソースを増減することで、システムにおける負荷の変動が吸収される。クラウドコンピューティング環境においては、リソース使用に対する課金体系は従量課金となっていることが多い。このため、リソース使用量を最小に保ちながら、負荷の変動に応じて、迅速にリソースを増減することが要求される。 In recent years, the construction of systems using cloud computing called SaaS (Software as a Service) and PaaS (Platform as a Service) has become widespread. In a cloud computing environment, fluctuations in the load on the system are absorbed by increasing or decreasing resources such as virtual machines according to the number of users and the amount of data using virtualization technology. In a cloud computing environment, a billing system for resource use is often a pay-per-use charge. For this reason, it is required to quickly increase / decrease resources according to load fluctuations while keeping the resource usage to a minimum.

例えば、非特許文献１に開示されているようなサービスバスを介して通信を行う業務システムが、クラウドコンピューティングを用いて構築された場合、サービスバスの負荷に応じて、サービスバスを構成するリソースの増減が行われる。この場合、監視装置等は、サービスバスの負荷を計測、監視し、その結果に応じて、サービスバスに対する要求ＴＡＴや要求スループットを満たすように、リソースの割り当てを行う。しかしながら、単に負荷の増減に応じて、リソースの増減を行う方法では、突発的な負荷の増加に即時に対応することができず、要求ＴＡＴ（Turn Around Time）や要求スループットを満たせなくなる可能性がある。このため、例えば、サービスバスの負荷の増加の前兆を予め検出し、負荷が増加する前にリソースを追加することが要求される。 For example, when a business system that communicates via a service bus as disclosed in Non-Patent Document 1 is constructed using cloud computing, resources that configure the service bus according to the load on the service bus Increase / decrease is performed. In this case, the monitoring device or the like measures and monitors the load of the service bus, and allocates resources so as to satisfy the request TAT and the request throughput for the service bus according to the result. However, the method of simply increasing / decreasing resources according to the increase / decrease of the load cannot respond immediately to the sudden increase in load, and may not satisfy the required TAT (Turn Around Time) and the required throughput. is there. For this reason, for example, it is required to detect in advance a sign of an increase in service bus load and add resources before the load increases.

このように、監視対象のシステムにおける特定の事象の前兆を検出する監視装置の一例が、特許文献１に開示されている。特許文献１に開示された監視装置は、監視対象システムから収集された時系列データの特徴が、障害の前兆として検出すべき選定条件に適合する場合、当該特徴と時系列データとを、過去のメタデータと時系列データとして、関連づけて格納する。そして、監視装置は、現在の時系列データの特徴が選定条件に適合する場合、当該特徴と一致する過去のメタデータに関連づけられた時系列データを参照し、現在の時系列データの今後の変化を予想する。 Thus, an example of a monitoring device that detects a precursor of a specific event in a system to be monitored is disclosed in Patent Document 1. When the characteristics of the time-series data collected from the monitoring target system match the selection condition to be detected as a precursor of a failure, the monitoring device disclosed in Patent Literature 1 displays the characteristics and time-series data in the past. Store metadata and time-series data in association with each other. Then, when the feature of the current time-series data meets the selection condition, the monitoring device refers to the time-series data associated with the past metadata that matches the feature, and the future change of the current time-series data Expect.

特開２００９−２８９２２１号公報JP 2009-289221 A

"機能詳細: SOA基盤 WebOTX Enterprise Service Bus"、日本電気株式会社、［online］、［平成26年1月16日検索］、インターネット〈URL：http://www.nec.co.jp/WebOTX/integration/function/index.html〉"Function details: SOA-based WebOTX Enterprise Service Bus", NEC Corporation, [online], [searched January 16, 2014], Internet <URL: http://www.nec.co.jp/WebOTX/ integration / function / index.html>

しかしながら、特許文献１に記載されて技術においては、障害の前兆として検出すべき時系列データの特徴等を、管理者等が、予め選定条件として設定する必要がある。 However, in the technique described in Patent Document 1, it is necessary for an administrator or the like to set the characteristics of time-series data to be detected as a sign of failure as selection conditions in advance.

本発明の目的は、上述の課題を解決し、監視対象システムにおける特定の事象の前兆を検出するための条件を管理者等が設定することなく、容易に、前兆を検出できることである。 An object of the present invention is to solve the above-mentioned problems and to easily detect a precursor without an administrator or the like setting conditions for detecting a precursor of a specific event in a monitored system.

本発明の情報処理装置は、入力されたメッセージに対して所定の処理を行う監視対象システムの入力メッセージのログを記憶するメッセージログ記憶手段と、前記ログをもとに、前記監視対象システムにおいて所定の事象が発生した期間の前の期間における入力メッセージの特徴を表す照合定義を生成する解析手段と、新たな期間における前記監視対象システムの入力メッセージの特徴と前記照合定義とを比較し、前記監視対象システムにおける前記所定の事象の前兆の有無を判定する照合手段と、を備える。 The information processing apparatus according to the present invention includes a message log storage unit that stores a log of an input message of a monitoring target system that performs a predetermined process on an input message, and a predetermined number in the monitoring target system based on the log. The analysis means for generating the collation definition representing the characteristics of the input message in the period before the period of occurrence of the event, the characteristics of the input message of the monitored system in the new period and the collation definition are compared, and the monitoring Collating means for determining the presence or absence of a precursor of the predetermined event in the target system.

本発明の監視方法は、入力されたメッセージに対して所定の処理を行う監視対象システムの入力メッセージのログを記憶し、前記ログをもとに、前記監視対象システムにおいて所定の事象が発生した期間の前の期間における入力メッセージの特徴を表す照合定義を生成し、新たな期間における前記監視対象システムの入力メッセージの特徴と前記照合定義とを比較し、前記監視対象システムにおける前記所定の事象の前兆の有無を判定する。 The monitoring method of the present invention stores a log of an input message of a monitoring target system that performs predetermined processing on an input message, and a period during which a predetermined event occurs in the monitoring target system based on the log Generating a collation definition representing the characteristics of the input message in the previous period, comparing the characteristics of the input message of the monitored system with the collation definition in the new period, and indicating the precursor of the predetermined event in the monitored system The presence or absence of is determined.

本発明のプログラムは、コンピュータに、入力されたメッセージに対して所定の処理を行う監視対象システムの入力メッセージのログを記憶し、前記ログをもとに、前記監視対象システムにおいて所定の事象が発生した期間の前の期間における入力メッセージの特徴を表す照合定義を生成し、新たな期間における前記監視対象システムの入力メッセージの特徴と前記照合定義とを比較し、前記監視対象システムにおける前記所定の事象の前兆の有無を判定する、処理を実行させる。 The program of the present invention stores a log of an input message of a monitoring target system that performs predetermined processing on an input message in a computer, and a predetermined event occurs in the monitoring target system based on the log Generating a collation definition that represents the characteristics of the input message in the period before the selected period, comparing the characteristics of the input message of the monitored system in the new period with the collation definition, and the predetermined event in the monitored system The process of determining the presence or absence of a sign is executed.

本発明の効果は、監視対象システムにおける特定の事象の前兆を、容易に検出できることである。 The effect of the present invention is that a precursor of a specific event in the monitored system can be easily detected.

本発明の実施の形態の特徴的な構成を示すブロック図である。It is a block diagram which shows the characteristic structure of embodiment of this invention. 本発明の実施の形態における、業務システムの構成を示すブロック図である。It is a block diagram which shows the structure of the business system in embodiment of this invention. 本発明の実施の形態における、サービスバス２００が受信するメッセージの例を示す図である。It is a figure which shows the example of the message which the service bus 200 receives in embodiment of this invention. 本発明の実施の形態における、メッセージ処理を示すフローチャートである。It is a flowchart which shows a message process in embodiment of this invention. 本発明の実施の形態における、解析処理を示すフローチャートである。It is a flowchart which shows the analysis process in embodiment of this invention. 本発明の実施の形態における、メッセージログの例を示す図である。It is a figure which shows the example of the message log in embodiment of this invention. 本発明の実施の形態における、解析定義ファイル３２１の例を示す図である。It is a figure which shows the example of the analysis definition file 321 in embodiment of this invention. 本発明の実施の形態における、メッセージログ集合３１１の例を示す図である。It is a figure which shows the example of the message log set 311 in embodiment of this invention. 本発明の実施の形態における、平均ＴＡＴ、及び、スループットの算出結果の例を示す図である。It is a figure which shows the example of the calculation result of average TAT and throughput in embodiment of this invention. 本発明の実施の形態における、特徴情報の算出結果の例を示す図である。It is a figure which shows the example of the calculation result of the feature information in embodiment of this invention. 本発明の実施の形態における、照合定義ファイル３４１の例を示す図である。It is a figure which shows the example of the collation definition file 341 in embodiment of this invention. 本発明の実施の形態における、照合処理を示すフローチャートである。It is a flowchart which shows the collation process in embodiment of this invention. 本発明の実施の形態における、メッセージログ集合３１１の他の例を示す図である。It is a figure which shows the other example of the message log set 311 in embodiment of this invention. 本発明の実施の形態における、特徴情報の算出結果の他の例を示す図である。It is a figure which shows the other example of the calculation result of the feature information in embodiment of this invention. 本発明の実施の形態における、メッセージログ集合３１１のさらに他の例を示す図である。It is a figure which shows the further another example of the message log set 311 in embodiment of this invention. 本発明の実施の形態における、特徴情報の算出結果のさらに他の例を示す図である。It is a figure which shows the further another example of the calculation result of the feature information in embodiment of this invention. 本発明の実施の形態における、照合履歴テーブル３６１の例を示す図である。It is a figure which shows the example of the collation history table 361 in embodiment of this invention.

次に、本発明の実施の形態について説明する。 Next, an embodiment of the present invention will be described.

はじめに、本発明の実施の形態の構成について説明する。図２は、本発明の実施の形態における、業務システムの構成を示すブロック図である。 First, the configuration of the embodiment of the present invention will be described. FIG. 2 is a block diagram showing the configuration of the business system in the embodiment of the present invention.

図２を参照すると、本発明の実施の形態における業務システムは、１以上のサービスシステム１００、サービスバス２００、監視装置３００、及び、リソース制御装置４００を含む。サービスバス２００は、本発明の監視対象システムの一実施形態である。また、監視装置３００は、本発明の情報処理装置の一実施形態である。 Referring to FIG. 2, the business system in the embodiment of the present invention includes one or more service systems 100, a service bus 200, a monitoring device 300, and a resource control device 400. The service bus 200 is an embodiment of the monitoring target system of the present invention. The monitoring device 300 is an embodiment of the information processing device of the present invention.

１以上のサービスシステム１００の各々は、業務システムにおける様々なサービスを提供する。サービスシステム１００は、例えば、１以上のコンピュータ上で実行されるプログラムにより実現される。 Each of the one or more service systems 100 provides various services in the business system. The service system 100 is realized by a program executed on one or more computers, for example.

サービスバス２００は、１以上のサービスシステム１００と接続される。サービスバス２００は、要求元のサービスシステム１００から受信したメッセージを、他のサービスシステム１００に転送し、他のサービスシステム１００による処理結果を応答メッセージとして、当該要求元のサービスシステム１００に返信する。ここで、サービスバス２００は、サービスシステム１００間で、メッセージのプロトコル変換、データ形式の変換等も行う。本発明の実施の形態では、サービスバス２００が受信したメッセージに対して実行する、これらの転送、プロトコル変換、データ形式の変換等を、サービスバス２００の所定の処理と呼ぶ。サービスバス２００は、例えば、１以上の仮想マシン上で実行されるプログラムにより実現される。 The service bus 200 is connected to one or more service systems 100. The service bus 200 transfers the message received from the request source service system 100 to another service system 100, and returns the processing result by the other service system 100 as a response message to the request source service system 100. Here, the service bus 200 also performs message protocol conversion, data format conversion, and the like between the service systems 100. In the embodiment of the present invention, the transfer, protocol conversion, data format conversion, and the like executed on the message received by the service bus 200 are referred to as predetermined processing of the service bus 200. The service bus 200 is realized by a program executed on one or more virtual machines, for example.

サービスバス２００は、マスタ２１０、及び、１以上のスレーブ２２０を含む。マスタ２１０は、サービスシステム１００から受信したメッセージを、１以上のスレーブ２２０のいずれかに振り分けることに、負荷分散を行う。各スレーブ２２０は、上述の所定の処理を行う。各スレーブ２２０は、リソース制御装置４００により、有効化／無効化（追加／削除）される。 The service bus 200 includes a master 210 and one or more slaves 220. The master 210 distributes the load by distributing the message received from the service system 100 to one of the one or more slaves 220. Each slave 220 performs the predetermined processing described above. Each slave 220 is enabled / disabled (added / deleted) by the resource control device 400.

図３は、本発明の実施の形態における、サービスバス２００が受信するメッセージの例を示す図である。メッセージは、request要素の配下にmessagetype要素とbodyと要素を含む。 FIG. 3 is a diagram showing an example of a message received by the service bus 200 in the embodiment of the present invention. The message includes a messagetype element, a body, and an element under the request element.

監視装置３００は、サービスバス２００における高負荷状態の前兆を検出する（サービスバス２００が高負荷状態となることを予測する）。監視装置３００は、当該前兆の検出有無に応じて、リソース制御装置４００に、スレーブ２２０の有効化／無効化を指示する。 The monitoring device 300 detects a precursor of a high load state in the service bus 200 (predicts that the service bus 200 will be in a high load state). The monitoring apparatus 300 instructs the resource control apparatus 400 to enable / disable the slave 220 according to whether or not the precursor is detected.

監視装置３００は、メッセージログ記憶部３１０、解析定義記憶部３２０、解析部３３０、照合定義記憶部３４０、照合部３５０、及び、照合履歴記憶部３６０を含む。 The monitoring device 300 includes a message log storage unit 310, an analysis definition storage unit 320, an analysis unit 330, a collation definition storage unit 340, a collation unit 350, and a collation history storage unit 360.

メッセージログ記憶部３１０は、メッセージログ集合３１１を記憶する。メッセージログ集合３１１は、サービスバス２００に対して入出力されるメッセージのログ（メッセージログ）の時系列である。 The message log storage unit 310 stores a message log set 311. The message log set 311 is a time series of logs (message logs) of messages input to and output from the service bus 200.

図６は、本発明の実施の形態における、メッセージログの例を示す図である。メッセージログは、メッセージの受信日時、メッセージ長、送信元、送信先、メッセージ本文、及び、返信日時を含む。ここで、受信日時は、要求元のサービスシステム１００からのメッセージの受信日時を示す。返信日時は、当該要求元のサービスシステム１００への応答メッセージの送信日時を示す。 FIG. 6 is a diagram showing an example of a message log in the embodiment of the present invention. The message log includes a message reception date and time, a message length, a transmission source, a transmission destination, a message body, and a reply date and time. Here, the reception date / time indicates the reception date / time of the message from the requesting service system 100. The reply date / time indicates the transmission date / time of the response message to the service system 100 of the request source.

図８は、本発明の実施の形態における、メッセージログ集合３１１の例を示す図である。メッセージログ集合３１１は、メッセージログの時系列を含む。 FIG. 8 is a diagram showing an example of the message log set 311 in the embodiment of the present invention. The message log set 311 includes a time series of message logs.

解析定義記憶部３２０は、解析定義ファイル３２１を記憶する。解析定義ファイル３２１は、メッセージログ３１１の解析、及び、照合に係る設定を定義する。 The analysis definition storage unit 320 stores an analysis definition file 321. The analysis definition file 321 defines settings related to analysis and verification of the message log 311.

図７は、本発明の実施の形態における、解析定義ファイル３２１の例を示す図である。解析定義ファイル３２１は、解析間隔、統計間隔、検査対象、最大許容ＴＡＴ、最小許容スループット、平均メッセージ長マージン、メッセージ比率マージン、及び、予測失敗許容回数を含む。ここで、解析間隔は、後述する解析処理を行う間隔を示す。統計間隔は、後述する照合処理を行う間隔を示す。検査対象は、照合処理において検査する対象である、メッセージ本文中の要素を示す。最大許容ＴＡＴ、最小許容スループットは、サービスバス２００が高負荷状態にあるかどうかを判定するための、ＴＡＴ、スループットの許容範囲を示す。平均メッセージ長マージン、メッセージ比率マージンは、照合処理において、メッセージの特徴情報である、平均メッセージ長、メッセージ比率を比較するときに適用されるマージンを示す。予測失敗許容回数は、予測失敗の回数の許容範囲を示す。予測失敗とは、高負荷状態の前兆の検出（高負荷状態の予測）に対して、実際に高負荷状態が発生しないケースである。 FIG. 7 is a diagram showing an example of the analysis definition file 321 in the embodiment of the present invention. The analysis definition file 321 includes an analysis interval, a statistical interval, an inspection target, a maximum allowable TAT, a minimum allowable throughput, an average message length margin, a message ratio margin, and an allowable number of prediction failures. Here, the analysis interval indicates an interval for performing an analysis process described later. The statistical interval indicates an interval at which collation processing described later is performed. The inspection object indicates an element in the message body that is an object to be inspected in the collation process. The maximum allowable TAT and the minimum allowable throughput indicate an allowable range of TAT and throughput for determining whether or not the service bus 200 is in a high load state. The average message length margin and the message ratio margin indicate margins that are applied when comparing the average message length and the message ratio, which are message feature information, in the verification process. The allowable number of prediction failures indicates an allowable range of the number of prediction failures. A prediction failure is a case where a high load state does not actually occur in response to detection of a precursor of a high load state (prediction of a high load state).

解析部３３０は、メッセージログをもとに、サービスバス２００が高負荷状態にあった統計期間（統計間隔ごとの期間）を検出する。そして、解析部３３０は、当該高負荷状態にあった統計期間の直前の統計期間においてサービスバス２００が受信したメッセージの特徴を定義した照合定義ファイル３４１を生成する。 The analysis unit 330 detects a statistical period (period for each statistical interval) in which the service bus 200 is in a high load state based on the message log. Then, the analysis unit 330 generates a collation definition file 341 that defines the characteristics of the message received by the service bus 200 in the statistical period immediately before the statistical period in the high load state.

照合定義記憶部３４０は、照合定義ファイル３４１を記憶する。 The collation definition storage unit 340 stores a collation definition file 341.

図１１は、本発明の実施の形態における、照合定義ファイル３４１の例を示す図である。照合定義ファイル３４１は、照合定義ＩＤ（Identifier）、平均メッセージ長、メッセージ比率を含む。 FIG. 11 is a diagram showing an example of the collation definition file 341 in the embodiment of the present invention. The collation definition file 341 includes a collation definition ID (Identifier), an average message length, and a message ratio.

照合部３５０は、新たな統計期間においてサービスバス２００に入力されたメッセージの特徴と照合定義ファイル３４１とを比較し、サービスバス２００における高負荷状態の前兆の有無を判定する。 The collation unit 350 compares the characteristics of the message input to the service bus 200 in the new statistical period with the collation definition file 341, and determines whether there is a sign of a high load state in the service bus 200.

照合履歴記憶部３６０は、照合履歴テーブル３６１を記憶する。照合履歴テーブル３６１は、検出された高負荷状態の前兆に係る情報を示す。図１７は、本発明の実施の形態における、照合履歴テーブル３６１の例を示す図である。照合履歴テーブル３６１は、照合定義ＩＤ、照合日時、及び、予測成否を含む。照合定義ＩＤは、高負荷状態の前兆が検出されたときに用いられた、照合定義ファイル３４１の照合定義ＩＤを示す。照合日時は、高負荷状態の前兆が検出された統計期間を示す。予測成否は、高負荷状態の前兆の検出（高負荷状態の予測）に対して、実際に高負荷状態が発生した（予測が成功した）かどうかを、「ＯＫ」または「ＮＧ」で示す。 The verification history storage unit 360 stores a verification history table 361. The collation history table 361 shows information related to the detected sign of the high load state. FIG. 17 is a diagram showing an example of the collation history table 361 according to the embodiment of the present invention. The collation history table 361 includes a collation definition ID, a collation date and time, and a prediction success / failure. The collation definition ID indicates the collation definition ID of the collation definition file 341 used when a precursor of a high load state is detected. The collation date and time indicates a statistical period in which a precursor of a high load state is detected. The prediction success / failure indicates by “OK” or “NG” whether or not a high load state actually occurred (prediction was successful) in response to detection of a precursor of a high load state (prediction of a high load state).

リソース制御装置４００は、監視装置３００からの指示に従って、スレーブ２２０の有効化／無効化を行う。リソース制御装置４００は、監視装置３００に含まれていてもよい。 The resource control device 400 validates / invalidates the slave 220 in accordance with an instruction from the monitoring device 300. The resource control device 400 may be included in the monitoring device 300.

なお、監視装置３００はＣＰＵ（Central Processing Unit）とプログラムを記憶した記憶媒体を含み、プログラムに基づく制御によって動作するコンピュータであってもよい。またメッセージログ記憶部３１０、解析定義記憶部３２０、照合定義記憶部３４０、及び、照合履歴記憶部３６０は、それぞれ個別の記憶媒体でも、１つの記憶媒体によって構成されてもよい。 Note that the monitoring device 300 may be a computer that includes a CPU (Central Processing Unit) and a storage medium that stores a program, and that operates by control based on the program. Further, the message log storage unit 310, the analysis definition storage unit 320, the collation definition storage unit 340, and the collation history storage unit 360 may be configured as individual storage media or a single storage medium.

次に、本発明の実施の形態の動作について説明する。 Next, the operation of the embodiment of the present invention will be described.

はじめに、本発明の実施の形態における、メッセージ処理について説明する。 First, message processing in the embodiment of the present invention will be described.

図４は、本発明の実施の形態における、メッセージ処理を示すフローチャートである。 FIG. 4 is a flowchart showing message processing in the embodiment of the present invention.

要求元のサービスシステム１００は、サービスバス２００へメッセージを送信する（ステップＳ１０１）。 The requesting service system 100 transmits a message to the service bus 200 (step S101).

サービスバス２００のマスタ２１０は、受信したメッセージを、有効化されているスレーブ２２０のいずれかへ送信する（ステップＳ１０２）。 The master 210 of the service bus 200 transmits the received message to one of the activated slaves 220 (step S102).

メッセージを受信したスレーブ２２０は、当該メッセージに対して所定の処理を行い（ステップＳ１０３）、応答メッセージをマスタ２１０へ返信する（ステップＳ１０４）。 The slave 220 that has received the message performs predetermined processing on the message (step S103), and returns a response message to the master 210 (step S104).

マスタ２１０は、受信した応答メッセージを要求元のサービスシステム１００へ送信する（ステップＳ１０５）。 The master 210 transmits the received response message to the requesting service system 100 (step S105).

また、マスタ２１０は、メッセージログを生成し、監視装置３００へ送信する（ステップＳ１０６）。 In addition, the master 210 generates a message log and transmits it to the monitoring device 300 (step S106).

例えば、マスタ２１０は、図６のようなメッセージログを監視装置３００へ送信する。 For example, the master 210 transmits a message log as shown in FIG.

次に、本発明の実施の形態における、解析処理について説明する。 Next, analysis processing in the embodiment of the present invention will be described.

ここでは、図７のような解析定義ファイル３２１が、管理者等により設定され、解析定義記憶部３２０に記憶されていると仮定する。また、サービスバス２００から受信したメッセージログをもとに、図８のようなメッセージログ集合３１１が、メッセージログ記憶部３１０に記憶されていると仮定する。さらに、後述する照合処理により生成された、図１７のような照合履歴テーブル３６１（照合日時「9/8 00:50〜01:00」以前）が、照合履歴記憶部３６０に記憶されていると仮定する。 Here, it is assumed that an analysis definition file 321 as shown in FIG. 7 is set by an administrator or the like and stored in the analysis definition storage unit 320. Further, it is assumed that a message log set 311 as shown in FIG. 8 is stored in the message log storage unit 310 based on the message log received from the service bus 200. Furthermore, a collation history table 361 (before collation date “9/8 00:50 to 01:00”) as shown in FIG. 17 generated by the collation process described later is stored in the collation history storage unit 360. Assume.

図５は、本発明の実施の形態における、解析処理を示すフローチャートである。 FIG. 5 is a flowchart showing the analysis processing in the embodiment of the present invention.

解析処理は、解析定義ファイル３２１に定義された解析間隔で、実行される。 The analysis processing is executed at an analysis interval defined in the analysis definition file 321.

解析部３３０は、解析定義記憶部３２０から解析定義ファイル３２１を取得する（ステップＳ２０１）。 The analysis unit 330 acquires the analysis definition file 321 from the analysis definition storage unit 320 (step S201).

例えば、解析部３３０は、図７のような解析定義ファイル３２１を取得する。 For example, the analysis unit 330 acquires an analysis definition file 321 as shown in FIG.

解析部３３０は、メッセージログ記憶部３１０からメッセージログ集合３１１を取得する（ステップＳ２０２）。 The analysis unit 330 acquires the message log set 311 from the message log storage unit 310 (step S202).

例えば、解析部３３０は、図８のようなメッセージログ集合３１１を取得する。 For example, the analysis unit 330 acquires a message log set 311 as illustrated in FIG.

解析部３３０は、ステップＳ２０２で取得したメッセージログ集合３１１に含まれる期間内の、統計期間を１つ選択する（ステップＳ２０３）。ここでは、選択した統計期間を対象解析期間と呼ぶ。 The analysis unit 330 selects one statistical period within the period included in the message log set 311 acquired in step S202 (step S203). Here, the selected statistical period is called a target analysis period.

解析部３３０は、対象解析期間におけるメッセージログをもとに、当該対象解析期間において、サービスバス２００が高負荷状態にあったかどうかを判定する（ステップＳ２０４）。 Based on the message log in the target analysis period, the analysis unit 330 determines whether or not the service bus 200 was in a high load state in the target analysis period (step S204).

ここで、解析部３３０は、対象解析期間のメッセージについて、サービスバス２００において実行された所定の処理の平均ＴＡＴ、及び、スループットを算出する。解析部３３０は、メッセージログの受信時刻と返信時刻の差により各メッセージの処理のＴＡＴを算出し、当該対象解析期間のメッセージについて平均値を算出する。また、解析部３３０は、対象解析期間に処理されたメッセージ数を統計間隔で割ることにより、スループットを算出する。 Here, the analysis unit 330 calculates an average TAT and throughput of predetermined processing executed in the service bus 200 for the message in the target analysis period. The analysis unit 330 calculates the TAT for processing each message based on the difference between the message log reception time and the reply time, and calculates an average value for the messages in the target analysis period. The analysis unit 330 calculates the throughput by dividing the number of messages processed in the target analysis period by the statistical interval.

そして、解析部３３０は、算出された平均ＴＡＴ、スループットと、解析定義ファイル３２１に定義された最大許容ＴＡＴ、最小許容スループットを用いて、サービスバス２００が対象解析期間において高負荷状態であったかどうかを、以下のように判定する。解析部３３０は、平均ＴＡＴが、最大許容ＴＡＴの許容範囲内で、かつ、スループットが最小許容スループットの許容範囲内の場合、高負荷状態になかったと判定する。一方、解析部３３０は、平均ＴＡＴが、最大許容ＴＡＴの範囲外、あるいは、スループットが最小許容スループットの範囲外の場合、高負荷状態にあったと判定する。 Then, the analysis unit 330 uses the calculated average TAT and throughput and the maximum allowable TAT and minimum allowable throughput defined in the analysis definition file 321 to determine whether or not the service bus 200 was in a high load state during the target analysis period. The determination is as follows. When the average TAT is within the allowable range of the maximum allowable TAT and the throughput is within the allowable range of the minimum allowable throughput, the analysis unit 330 determines that the load is not high. On the other hand, when the average TAT is out of the range of the maximum allowable TAT or the throughput is out of the range of the minimum allowable throughput, the analysis unit 330 determines that the load is high.

図９は、本発明の実施の形態における、平均ＴＡＴ、及び、スループットの算出結果の例を示す図である。 FIG. 9 is a diagram illustrating an example of calculation results of average TAT and throughput in the embodiment of the present invention.

例えば、対象解析期間が「9/8 00:40〜00:50」の場合、図９の算出結果における平均ＴＡＴ「10ms」は最大許容ＴＡＴ「100ms」の許容範囲内である。また、スループット「10件/sec」は、最小許容スループット「50件/sec」の許容範囲外であるが、受信メッセージ数が「10件/sec」であるため、問題ないとみなす。したがって、この場合、解析部３３０は、高負荷状態になかったと判定する。また、対象解析期間が「9/8 00:50〜01:00」の場合、平均ＴＡＴ「75ms」は最大許容ＴＡＴの許容範囲内であり、スループット「75件/sec」も、最小許容スループットの許容範囲内である。したがって、この場合も、解析部３３０は、高負荷状態になかったと判定する。一方、対象解析期間が「9/8 01:00〜01:10」の場合、平均ＴＡＴ「200ms」は最大許容ＴＡＴの許容範囲外であり、スループット「25件/sec」も最小許容スループットの許容範囲外である。したがって、この場合、解析部３３０は、高負荷状態にあったと判定する。 For example, when the target analysis period is “9/8 00:40 to 00:50”, the average TAT “10 ms” in the calculation result of FIG. 9 is within the allowable range of the maximum allowable TAT “100 ms”. Further, the throughput “10 messages / sec” is outside the allowable range of the minimum allowable throughput “50 messages / sec”, but the number of received messages is “10 messages / sec”, so it is considered that there is no problem. Therefore, in this case, the analysis unit 330 determines that the load is not high. When the target analysis period is “9/8 00: 50-01: 00”, the average TAT “75 ms” is within the allowable range of the maximum allowable TAT, and the throughput “75 cases / sec” is also the minimum allowable throughput. It is within the allowable range. Therefore, also in this case, the analysis unit 330 determines that the load is not high. On the other hand, when the target analysis period is “9/8 01:00 to 01:10”, the average TAT “200ms” is outside the allowable range of the maximum allowable TAT, and the throughput “25 cases / sec” is also the allowable allowable minimum throughput. Out of range. Therefore, in this case, the analysis unit 330 determines that the load is high.

ステップＳ２０４において、高負荷状態にあったと判定された場合（ステップＳ２０４／Ｙ）、解析部３３０は、対象解析期間の直前の統計期間の入力メッセージの特徴情報を生成する（ステップＳ２０５）。 If it is determined in step S204 that the load is high (step S204 / Y), the analysis unit 330 generates feature information of the input message in the statistical period immediately before the target analysis period (step S205).

ここで、解析部３３０は、特徴情報として、平均メッセージ長、及び、メッセージ比率を算出する。解析部３３０は、直前の統計期間にサービスバス２００が受信したメッセージの長さの平均を「平均メッセージ長」として算出する。また、解析部３３０は、直前の統計期間にサービスバス２００が受信したメッセージで用いられた、解析定義ファイル３２１で定義された検査対象の要素の各文字列の比率を「メッセージ比率」として算出する。 Here, the analysis unit 330 calculates an average message length and a message ratio as the feature information. The analysis unit 330 calculates the average message length received by the service bus 200 during the immediately preceding statistical period as the “average message length”. Further, the analysis unit 330 calculates, as a “message ratio”, the ratio of each character string of the element to be inspected defined in the analysis definition file 321 used in the message received by the service bus 200 in the immediately preceding statistical period. .

図１０は、本発明の実施の形態における、特徴情報の算出結果の例を示す図である。 FIG. 10 is a diagram illustrating an example of the calculation result of the feature information in the embodiment of the present invention.

例えば、対象解析期間が「9/8 01:00〜01:10」の場合、解析部３３０は、直前の統計期間「9/8 00:50〜01:00」について、図１０のような特徴情報を生成する。 For example, when the target analysis period is “9/8 01:00 to 01:10”, the analysis unit 330 performs the characteristics as shown in FIG. 10 for the immediately preceding statistical period “9/8 00:50 to 01:00”. Generate information.

解析部３３０は、ステップＳ２０５で生成された、直前の統計期間の入力メッセージの特徴情報をもとに、照合定義ファイル３４１を生成する（ステップＳ２０６）。 The analysis unit 330 generates the collation definition file 341 based on the feature information of the input message of the immediately preceding statistical period generated in step S205 (step S206).

ここで、解析部３３０は、特徴情報の平均メッセージ長に、解析定義ファイル３２１に定義されたメッセージ長マージンを乗じた値を、照合定義ファイル３４１における「平均メッセージ長」に設定する。また、解析部３３０は、特徴情報の「メッセージ比率」に、解析定義ファイル３２１に定義されたメッセージ比率マージンを加算／減算した値を、照合定義ファイル３４１における「メッセージ比率」に設定する。なお、マージンを加算した値が100%を超えた場合は、算出結果を「100%」としてもよい。また、マージンを減算した値が0%未満の場合も、算出結果を「0%」としてもよい。 Here, the analysis unit 330 sets a value obtained by multiplying the average message length of the feature information by the message length margin defined in the analysis definition file 321 as the “average message length” in the collation definition file 341. Further, the analysis unit 330 sets a value obtained by adding / subtracting the message ratio margin defined in the analysis definition file 321 to the “message ratio” of the feature information as the “message ratio” in the collation definition file 341. When the value obtained by adding the margin exceeds 100%, the calculation result may be “100%”. Also, when the value obtained by subtracting the margin is less than 0%, the calculation result may be “0%”.

例えば、解析部３３０は、図１０の特徴情報、及び、図７の解析定義ファイル３２１における各マージンをもとに、対象解析期間「9/8 01:00〜01:10」について、図１１のように、照合定義ＩＤ「201309080100」を付与した照合定義ファイル３４１を生成する。 For example, based on the feature information in FIG. 10 and each margin in the analysis definition file 321 in FIG. 7, the analysis unit 330 performs the analysis for the target analysis period “9/8 01:00 to 01:10” in FIG. In this way, the collation definition file 341 assigned with the collation definition ID “201309080100” is generated.

さらに、解析部３３０は、照合履歴テーブル３６１を参照し、事前に実行された、対象解析期間の直前の統計期間における照合処理（後述）で、高負荷状態の前兆が検出済みかどうかを判定する（ステップＳ２０７）。ここで、照合履歴テーブル３６１に、照合日時が直前の統計期間であるレコードが存在する場合は、前兆が検出済みであることを表す。前兆が検出済みの場合（ステップＳ２０７／Ｙ）、解析部３３０は、照合履歴テーブル３６１における、当該前兆に係るレコードの予測成否に「ＯＫ」を設定する（ステップＳ２０８）。 Furthermore, the analysis unit 330 refers to the matching history table 361 and determines whether or not a precursor of a high load state has been detected in the matching process (described later) performed in advance in the statistical period immediately before the target analysis period. (Step S207). Here, if there is a record in the collation history table 361 whose collation date / time is the immediately preceding statistical period, this indicates that a precursor has been detected. When the precursor has been detected (step S207 / Y), the analysis unit 330 sets “OK” to the prediction success / failure of the record related to the precursor in the verification history table 361 (step S208).

例えば、対象解析期間が「9/8 01:00〜01:10」の場合、図１７の照合履歴テーブル３６１に、照合日時「9/8 00:50〜01:00」のレコードが存在する。解析部３３０は、図１７のように、当該レコードの予測成否に「ＯＫ」を設定する。 For example, when the target analysis period is “9/8 01:00 to 01:10”, the collation history table 361 in FIG. 17 has a record of the collation date “9/8 00:50 to 01:00”. As shown in FIG. 17, the analysis unit 330 sets “OK” to the prediction success / failure of the record.

解析部３３０は、ステップＳ２１３の処理に進む。 The analysis unit 330 proceeds to the process of step S213.

一方、ステップＳ２０４において、高負荷状態になかったと判定された場合（ステップＳ２０４／Ｎ）、解析部３３０は、ステップＳ２０７と同様に、高負荷状態の前兆が検出済みかどうかを判定する（ステップＳ２０９）。前兆が検出済みの場合（ステップＳ２０９／Ｙ）、解析部３３０は、照合履歴テーブル３６１における、当該前兆に係るレコードの予測成否に「ＮＧ」を設定する（ステップＳ２１０）。 On the other hand, when it is determined in step S204 that the high load state has not been reached (step S204 / N), the analysis unit 330 determines whether or not a precursor of the high load state has been detected, similarly to step S207 (step S209). ). When the precursor has been detected (step S209 / Y), the analysis unit 330 sets “NG” to the prediction success / failure of the record related to the precursor in the verification history table 361 (step S210).

さらに、解析部３３０は、照合履歴テーブル３６１において、ステップＳ２１０で予測成否「ＮＧ」が設定されたレコードと同一の照合定義ファイル３４１、及び、予測成否「ＮＧ」が設定されたレコードの数を求める。そして、求めた数が、解析定義ファイル３２１に定義された予測失敗許容回数を超えたか判定する（ステップＳ２１１）。予測失敗許容回数を超えた場合（ステップＳ２１１／Ｙ）、解析部３３０は、当該照合定義ファイル３４１を照合定義記憶部３４０から削除する（ステップＳ２１２）。これにより、不適切な照合定義ファイル３４１は削除され、予測精度の高い照合定義ファイル３４１が残る。また、サービスシステム１００から受信するメッセージのパターンが変化した場合も、古い照合定義ファイル３４１が削除される。したがって、管理者等による設定変更が無くても、照合定義ファイル３４１による予測精度が改善する。 Further, the analysis unit 330 obtains, in the matching history table 361, the same matching definition file 341 as the record for which the prediction success / failure “NG” is set in step S210 and the number of records for which the prediction success / failure “NG” is set. . Then, it is determined whether the obtained number exceeds the allowable number of prediction failures defined in the analysis definition file 321 (step S211). When the allowable number of prediction failures is exceeded (step S211 / Y), the analysis unit 330 deletes the matching definition file 341 from the matching definition storage unit 340 (step S212). Thereby, the inappropriate collation definition file 341 is deleted, and the collation definition file 341 with high prediction accuracy remains. Also, when the pattern of a message received from the service system 100 changes, the old collation definition file 341 is deleted. Therefore, even if there is no setting change by the administrator or the like, the prediction accuracy by the collation definition file 341 is improved.

解析部３３０は、ステップＳ２０２で取得したメッセージログ集合３１１に含まれる、全ての統計期間について、ステップＳ２０３からの処理を繰り返す（ステップＳ２１３）。 The analysis unit 330 repeats the processing from step S203 for all statistical periods included in the message log set 311 acquired in step S202 (step S213).

解析部３３０は、生成された照合定義ファイル３４１を、照合定義記憶部３４０に保存する（ステップＳ２１４）。 The analysis unit 330 stores the generated collation definition file 341 in the collation definition storage unit 340 (step S214).

なお、解析部３３０は、生成された照合定義ファイル３４１を複製して、外部の記憶装置や他の監視装置へ出力してもよい。この場合、他の監視装置が、出力された照合定義ファイル３４１を用いて、業務システム内の他のサービスバスや、他の業務システムのサービスバスの監視を行ってもよい。 Note that the analysis unit 330 may duplicate the generated collation definition file 341 and output it to an external storage device or another monitoring device. In this case, another monitoring apparatus may monitor other service buses in the business system or service buses of other business systems using the output collation definition file 341.

解析部３３０は、解析済みのメッセージログをメッセージログ集合３１１から削除する（ステップＳ２１５）。 The analysis unit 330 deletes the analyzed message log from the message log set 311 (step S215).

次に、本発明の実施の形態における、照合処理について説明する。 Next, collation processing in the embodiment of the present invention will be described.

ここでは、図７のような解析定義ファイル３２１が、管理者等により設定され、解析定義記憶部３２０に記憶されていると仮定する。また、解析処理により、図１１の照合定義ファイル３４１が、照合定義記憶部３４０に記憶されていると仮定する。さらに、図１７のような照合履歴テーブル３６１（照合日時「9/8 00:50〜01:00」以前）が、照合履歴記憶部３６０に記憶されていると仮定する。 Here, it is assumed that an analysis definition file 321 as shown in FIG. 7 is set by an administrator or the like and stored in the analysis definition storage unit 320. Further, it is assumed that the collation definition file 341 in FIG. 11 is stored in the collation definition storage unit 340 by the analysis processing. Furthermore, it is assumed that a collation history table 361 (before collation date “9/8 00:50 to 01:00”) as shown in FIG. 17 is stored in the collation history storage unit 360.

図１２は、本発明の実施の形態における、照合処理を示すフローチャートである。 FIG. 12 is a flowchart showing the collation process in the embodiment of the present invention.

照合処理は、解析定義ファイル３２１に定義された統計間隔で、実行される。 The collation process is executed at a statistical interval defined in the analysis definition file 321.

照合部３５０は、解析定義記憶部３２０から解析定義ファイル３２１を、照合定義記憶部３４０から全ての照合定義ファイル３４１を取得する（ステップＳ３０１）。 The collation unit 350 acquires the analysis definition file 321 from the analysis definition storage unit 320 and all the collation definition files 341 from the collation definition storage unit 340 (step S301).

照合部３５０は、メッセージログ記憶部３１０のメッセージログ集合３１１から、最新の統計期間のメッセージログを取得する（ステップＳ３０２）。ここでは、最新の統計期間を、対象照合期間と呼ぶ。 The collation unit 350 acquires the message log of the latest statistical period from the message log set 311 in the message log storage unit 310 (step S302). Here, the latest statistical period is referred to as a target verification period.

照合部３５０は、ステップＳ２０５と同様の方法で、対象照合期間のメッセージの特徴情報を生成する（ステップＳ３０３）。 The collation unit 350 generates the feature information of the message in the target collation period by the same method as in step S205 (step S303).

照合部３５０は、照合定義ファイル３４１を１つ選択する（ステップＳ３０４）。 The collation unit 350 selects one collation definition file 341 (step S304).

照合部３５０は、ステップＳ３０３で生成された、対象照合期間のメッセージの特徴情報と、ステップＳ３０４で選択された照合定義ファイル３４１とを比較する。そして、照合部３５０は、サービスバス２００における高負荷状態の前兆があるかどうかを判定する（ステップＳ３０５）。 The collation unit 350 compares the feature information of the message in the target collation period generated in step S303 with the collation definition file 341 selected in step S304. Then, the collation unit 350 determines whether there is a precursor of a high load state in the service bus 200 (step S305).

ここで、照合部３５０は、対象照合期間のメッセージの特徴情報と照合定義ファイル３４１とを用いて、高負荷状態の前兆の有無を、以下のように判定する。照合部３５０は、特徴情報の平均メッセージ長が、照合定義ファイル３４１の平均メッセージ長の範囲内で、かつ、特徴情報のメッセージ比率が照合定義ファイル３４１のメッセージ比率の範囲内の場合、前兆ありと判定する。一方、照合部３５０は、特徴情報の平均メッセージ長が、照合定義ファイル３４１の平均メッセージ長の範囲外、または、特徴情報のメッセージ比率が照合定義ファイル３４１のメッセージ比率の範囲外の場合、前兆なしと判定する。 Here, the collation unit 350 uses the feature information of the message in the target collation period and the collation definition file 341 to determine whether or not there is a sign of a high load state as follows. When the average message length of the feature information is within the range of the average message length of the verification definition file 341 and the message ratio of the feature information is within the range of the message ratio of the verification definition file 341, the verification unit 350 determines that there is a precursor. judge. On the other hand, when the average message length of the feature information is outside the range of the average message length of the verification definition file 341, or the message ratio of the feature information is outside the range of the message ratio of the verification definition file 341, the verification unit 350 has no warning. Is determined.

高負荷状態の前兆がある場合（ステップＳ３０５／Ｙ）、照合部３５０は、照合履歴テーブル３６１に、対象照合期間のレコードを追加する（ステップＳ３０６）。ここで、照合部３５０は、比較に用いた照合定義ファイル３４１の照合定義ＩＤを当該レコードの照合定義ＩＤに設定する。また、照合部３５０は、対象照合期間を当該レコードの照合日時に設定する。レコードにおける予測成否は、解析処理（前述）のステップＳ２０８、Ｓ２１０にて設定される。 When there is a sign of a high load state (step S305 / Y), the collation unit 350 adds a record of the target collation period to the collation history table 361 (step S306). Here, the collation unit 350 sets the collation definition ID of the collation definition file 341 used for the comparison to the collation definition ID of the record. The collation unit 350 sets the target collation period to the collation date and time of the record. The prediction success or failure in the record is set in steps S208 and S210 of the analysis process (described above).

図１３は、本発明の実施の形態における、メッセージログ集合３１１の他の例を示す図である。また、図１４は、本発明の実施の形態における、特徴情報の算出結果の他の例を示す図である。 FIG. 13 is a diagram showing another example of the message log set 311 in the embodiment of the present invention. FIG. 14 is a diagram showing another example of the calculation result of the feature information in the embodiment of the present invention.

例えば、照合部３５０は、図１３のメッセージログ集合３１１における対象照合期間「9/9 01:40〜01:50」について、図１４のような特徴情報を算出する。 For example, the collation unit 350 calculates feature information as shown in FIG. 14 for the target collation period “9/9 01:40 to 01:50” in the message log set 311 in FIG.

図１４の特徴情報と図１１の照合定義ファイル３４１を比較した場合、特徴情報における平均メッセージ長「1101」は照合定義ファイル３４１における平均メッセージ長「1323〜1617」の範囲外である。また、特徴情報における平均メッセージ比率「TYPE1:60%, TYPE2:40%」も照合定義ファイル３４１におけるメッセージ比率「TYPE1:10%〜30%, TYPE2:70%〜90%」の範囲外である。したがって、この場合、照合部３５０は、高負荷状態の前兆なしと判定する。 When the feature information of FIG. 14 is compared with the collation definition file 341 of FIG. 11, the average message length “1101” in the feature information is outside the range of the average message length “1323 to 1617” in the collation definition file 341. Further, the average message ratio “TYPE1: 60%, TYPE2: 40%” in the feature information is also outside the range of the message ratio “TYPE1: 10% to 30%, TYPE2: 70% to 90%” in the collation definition file 341. Therefore, in this case, the collation unit 350 determines that there is no sign of a high load state.

図１５は、本発明の実施の形態における、メッセージログ集合３１１のさらに他の例を示す図である。また、図１６は、本発明の実施の形態における、特徴情報の算出結果のさらに他の例を示す図である。 FIG. 15 is a diagram showing still another example of the message log set 311 in the embodiment of the present invention. FIG. 16 is a diagram showing still another example of the calculation result of the feature information in the embodiment of the present invention.

例えば、照合部３５０は、図１５のメッセージログ集合３１１における対象照合期間「9/9 01:50〜02:00」について、図１６のような特徴情報を算出する。 For example, the collation unit 350 calculates feature information as shown in FIG. 16 for the target collation period “9/9 01:50 to 02:00” in the message log set 311 in FIG.

図１６の特徴情報と図１１の照合定義ファイル３４１を比較した場合、特徴情報における平均メッセージ長「1615」は照合定義ファイル３４１における平均メッセージ長の範囲内である。また、特徴情報における平均メッセージ比率「TYPE1:20%, TYPE2:80%」も照合定義ファイル３４１におけるメッセージ比率の範囲内である。したがって、この場合、照合部３５０は、高負荷状態の前兆ありと判定する。そして、照合部３５０は、図１７のように、図１１の照合定義ファイル３４１の識別子「201309080100」、及び、対象照合期間「9/9 01:50〜02:00」が設定されたレコードを、照合履歴テーブル３６１に追加する。 When the feature information of FIG. 16 and the collation definition file 341 of FIG. 11 are compared, the average message length “1615” in the feature information is within the range of the average message length in the collation definition file 341. Further, the average message ratio “TYPE1: 20%, TYPE2: 80%” in the feature information is also within the range of the message ratio in the collation definition file 341. Therefore, in this case, the collation unit 350 determines that there is a sign of a high load state. Then, as shown in FIG. 17, the collation unit 350 records records in which the identifier “201309080100” and the target collation period “9/9 01:50 to 02:00” of the collation definition file 341 in FIG. It adds to the collation history table 361.

照合部３５０は、全ての照合定義ファイル３４１について、ステップＳ３０４〜Ｓ３０６を繰り返す（ステップＳ３０７）。 The collation unit 350 repeats steps S304 to S306 for all the collation definition files 341 (step S307).

ステップＳ３０４〜Ｓ３０７により、いずれかの照合定義ファイル３４１に対して、前兆が検出された場合（ステップＳ３０８／Ｙ）、照合部３５０は、リソース制御装置４００に、スレーブ２２０の有効化を指示する（ステップＳ３０９）。リソース制御装置４００は、例えば、リソースプールにおける仮想マシンを用いて、スレーブ２２０を１つ有効化（追加）する。なお、リソース制御装置４００は、スレーブ２２０の数が所定の上限値に達している場合は、スレーブ２２０の有効化を行わない。 When a precursor is detected for any of the collation definition files 341 in steps S304 to S307 (step S308 / Y), the collation unit 350 instructs the resource control device 400 to validate the slave 220 ( Step S309). The resource control apparatus 400 enables (adds) one slave 220 using, for example, a virtual machine in the resource pool. Note that the resource control device 400 does not validate the slaves 220 when the number of slaves 220 has reached a predetermined upper limit value.

一方、どの照合定義ファイル３４１に対しても、前兆が検出されない場合（ステップＳ３０８／Ｎ）、照合部３５０は、ステップＳ２０４と同様に、当該対象照合期間において、サービスバス２００が高負荷状態にあったかどうかを判定する（ステップＳ３１０）。高負荷状態になかった場合（ステップＳ３１０／Ｎ）、照合部３５０は、リソース制御装置４００に、スレーブ２２０の無効化を指示する（ステップＳ３１１）。リソース制御装置４００は、スレーブ２２０を１つ無効化（削除）する。なお、リソース制御装置４００は、スレーブ２２０の数が所定の下限値に達している場合は、スレーブ２２０の無効化を行わない。 On the other hand, if no precursor is detected for any collation definition file 341 (step S308 / N), the collation unit 350 determines whether the service bus 200 was in a high load state during the target collation period, as in step S204. It is determined whether or not (step S310). When the load is not high (step S310 / N), the collation unit 350 instructs the resource control device 400 to invalidate the slave 220 (step S311). The resource control device 400 invalidates (deletes) one slave 220. Note that the resource control device 400 does not invalidate the slaves 220 when the number of slaves 220 has reached a predetermined lower limit value.

以上により、本発明の実施の形態の動作が完了する。 Thus, the operation of the embodiment of the present invention is completed.

次に、本発明の実施の形態の特徴的な構成を説明する。図１は、本発明の実施の形態の特徴的な構成を示すブロック図である。 Next, a characteristic configuration of the embodiment of the present invention will be described. FIG. 1 is a block diagram showing a characteristic configuration of an embodiment of the present invention.

本発明の情報処理装置（監視装置３００）は、メッセージログ記憶部３１０、解析部３３０、及び、照合部３５０を含む。メッセージログ記憶部３１０は、入力されたメッセージに対して所定の処理を行う監視対象システム（サービスバス２００）の入力メッセージのログを記憶する。解析部３３０は、ログをもとに、監視対象システムにおいて所定の事象が発生した期間の前の期間における入力メッセージの特徴を表す照合定義（照合定義ファイル３４１）を生成する。照合部３５０は、新たな期間における監視対象システムの入力メッセージの特徴と照合定義とを比較し、監視対象システムにおける所定の事象の前兆の有無を判定する。 The information processing apparatus (monitoring apparatus 300) of the present invention includes a message log storage unit 310, an analysis unit 330, and a verification unit 350. The message log storage unit 310 stores a log of input messages of the monitoring target system (service bus 200) that performs predetermined processing on the input messages. Based on the log, the analysis unit 330 generates a collation definition (collation definition file 341) representing the characteristics of the input message in the period before the period in which the predetermined event has occurred in the monitoring target system. The collation unit 350 compares the characteristics of the input message of the monitoring target system in the new period with the collation definition, and determines whether there is a precursor of a predetermined event in the monitoring target system.

本発明の実施の形態によれば、監視対象システムにおける特定の事象の前兆を容易に検出できる。その理由は、解析部３３０が、所定の事象が発生した期間の前の期間の入力メッセージの特徴を表す照合定義ファイル３４１を生成し、照合部３５０が、新たな期間の入力メッセージの特徴と照合定義ファイル３４１とを比較し、前兆の有無を判定するためである。 According to the embodiment of the present invention, it is possible to easily detect a precursor of a specific event in the monitored system. The reason is that the analysis unit 330 generates a collation definition file 341 that represents the characteristics of the input message in the period before the period in which the predetermined event occurs, and the collation unit 350 collates with the characteristics of the input message in the new period. This is for comparing with the definition file 341 and determining the presence or absence of a precursor.

また、本発明の実施の形態によれば、監視対象システムにおけるリソースの使用量を必要最小限に抑えることができる。その理由は、リソース制御装置４００が、所定の事象の前兆の検出有無に応じて、監視対象システムのリソースの追加、または、削除を行うためである。 Further, according to the embodiment of the present invention, it is possible to minimize the amount of resources used in the monitoring target system. The reason is that the resource control device 400 adds or deletes resources of the monitoring target system according to whether or not a precursor of a predetermined event is detected.

また、本発明の実施の形態によれば、監視対象システムの性能を劣化させることなく、前兆の有無の判定ができる。その理由は、監視対象システムの外部の監視装置３００が、監視対象システムの入力メッセージのログをもとに、照合定義ファイル３４１の生成、及び、生成された照合定義ファイル３４１を用いた前兆の有無の判定を行うためである。 Further, according to the embodiment of the present invention, it is possible to determine the presence or absence of a sign without degrading the performance of the monitored system. The reason is that the monitoring device 300 outside the monitoring target system generates the collation definition file 341 based on the log of the input message of the monitoring target system and whether there is a precursor using the generated collation definition file 341 This is for making the determination.

また、本発明の実施の形態によれば、同様の監視対象システムを複数含むような大規模分散環境や、同様の監視対象システムを含む他の類似の環境においても、特定の事象の前兆を効率的に検出できる。その理由は、解析部３３０が、照合定義ファイル３４１を複製して、外部の記憶装置や他の監視装置へ出力するためである。 In addition, according to the embodiment of the present invention, the sign of a specific event can be efficiently detected even in a large-scale distributed environment including a plurality of similar monitoring target systems or in another similar environment including a similar monitoring target system. Can be detected. The reason is that the analysis unit 330 duplicates the collation definition file 341 and outputs it to an external storage device or another monitoring device.

また、本発明の実施の形態によれば、監視対象システムのＴＡＴやスループットのみを監視する場合に比べて、特定の事象の発生を、より高精度に予測できる。その理由は、監視装置３００が、平均メッセージ長や、メッセージ比率等、監視対象システムの入力メッセージの特徴の統計量をもとに、特定の事象の前兆の有無を判定するためである。 Further, according to the embodiment of the present invention, it is possible to predict the occurrence of a specific event with higher accuracy than in the case where only the TAT and throughput of the monitoring target system are monitored. This is because the monitoring apparatus 300 determines the presence or absence of a precursor of a specific event based on the statistics of the characteristics of the input message of the monitoring target system such as the average message length and the message ratio.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 While the present invention has been described with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

例えば、本発明の実施の形態においては、前兆の検出対象の事象をサービスバス２００の高負荷状態とし、前兆の検出有無に応じた制御対象を負荷分散のためのリソース（スレーブ２２０）とした。しかしながら、前兆の検出対象の事象や、前兆の検出有無に応じた制御対象は、他の事象や制御対象でもよい。例えば、前兆の検出対象の事象を大きなメッセージ等、所定のメッセージの受信、前兆の検出有無に応じた制御対象を、当該所定のメッセージの処理に必要なハードディスク等のリソースとしてもよい。この場合、解析部３３０は、サービスバス２００が所定のメッセージを受信した統計期間の直前の統計期間における特徴情報をもとに、照合定義ファイル３４１を生成する。そして、照合部３５０は、生成した照合定義ファイル３４１を用いて所定のメッセージの受信の前兆を検出し、ハードディスクを追加する。これにより、例えば、数百ＭＢや数ＧＢといった大きなメッセージの処理に一時的にハードディスクの容量を必要とする場合、当該大きなメッセージの受信の前兆を検出し、予め、ハードディスクの容量を増やすことができる。 For example, in the embodiment of the present invention, an event that is a detection target of a warning is set to a high load state of the service bus 200, and a control target according to the presence or absence of detection of a warning is a resource (slave 220) for load distribution. However, the event that is the detection target of the precursor and the control target according to the presence or absence of the detection of the precursor may be another event or the control target. For example, an event that is a detection target of a sign may be a resource such as a hard disk that is necessary for receiving a predetermined message, such as a large message, and a target that is detected according to whether or not a sign is detected. In this case, the analysis unit 330 generates the collation definition file 341 based on the feature information in the statistical period immediately before the statistical period in which the service bus 200 receives the predetermined message. Then, the collation unit 350 detects a precursor of receiving a predetermined message using the generated collation definition file 341, and adds a hard disk. As a result, for example, when the capacity of the hard disk is temporarily required for processing a large message such as several hundred MB or several GB, the sign of reception of the large message can be detected and the capacity of the hard disk can be increased in advance. .

また、本発明の実施の形態においては、監視装置３００は、メッセージの特徴情報として、平均メッセージ長、及び、メッセージ比率を用いた。しかしながら、検出対象の事象の前兆が検出できれば、監視装置３００は、特徴情報として、メッセージに係る他の情報を用いてもよい。例えば、監視装置３００は、特徴情報として、メッセージの送信元や送信先等、メッセージ内の他のデータ項目に設定される各値（テキストを含む）の比率を用いてもよい。 Further, in the embodiment of the present invention, the monitoring apparatus 300 uses the average message length and the message ratio as the message feature information. However, if the precursor of the detection target event can be detected, the monitoring apparatus 300 may use other information related to the message as the feature information. For example, the monitoring apparatus 300 may use, as the feature information, the ratio of each value (including text) set in other data items in the message such as the message transmission source and the transmission destination.

また、本発明の実施の形態においては、解析部３３０は、検出対象の事象が発生した統計期間の直前の統計期間の特徴情報をもとに照合定義ファイル３４１を生成した。しかしながら、検出対象の事象の前兆が検出できれば、解析部３３０は、例えば、所定時間前の統計期間等、直前の統計期間よりさらに前の統計期間の特徴情報をもとに照合定義ファイル３４１を生成してもよい。 In the embodiment of the present invention, the analysis unit 330 generates the collation definition file 341 based on the characteristic information of the statistical period immediately before the statistical period in which the detection target event has occurred. However, if the precursor of the detection target event can be detected, the analysis unit 330 generates the matching definition file 341 based on the feature information of the statistical period prior to the immediately preceding statistical period, such as the statistical period of a predetermined time, for example. May be.

また、本発明の実施の形態においては、解析部３３０は、検出対象の事象の前兆が検出された統計期間の直後の統計期間で当該検出対象の事象が発生したかどうかにより、予測成否を判定した。しかしながら、解析部３３０は、所定時間後までの各統計期間等、直後の統計期間よりさらに後の統計期間までに当該検出対象の事象が発生したかどうかにより、予測成否を判定してもよい。 Further, in the embodiment of the present invention, the analysis unit 330 determines the success or failure of the prediction depending on whether or not the event to be detected has occurred in the statistical period immediately after the statistical period in which the precursor of the event to be detected has been detected. did. However, the analysis unit 330 may determine the success or failure of the prediction depending on whether or not the detection target event has occurred before the statistical period immediately after the statistical period immediately after, such as each statistical period until a predetermined time.

また、本発明の実施の形態においては、監視対象システムをサービスバス２００とした。しかしながら、入出力されるメッセージの特徴から特定の事象の前兆が検出できるシステムであれば、監視対象システムは、サービスバス２００以外の他のシステムでもよい。 In the embodiment of the present invention, the monitoring target system is the service bus 200. However, the system to be monitored may be a system other than the service bus 200 as long as it can detect a precursor of a specific event from the characteristics of input / output messages.

本発明は、例えば、鉄道、航空、区役所に係るシステム等、季節や時間帯などによって処理量が大幅に増減する業務システムに適用できる。 The present invention can be applied to a business system in which the amount of processing greatly increases or decreases depending on the season or time zone, such as a system related to railways, airlines, and ward offices.

１００サービスシステム
２００サービスバス
２１０マスタ
２２０スレーブ
３００監視装置
３１０メッセージログ記憶部
３１１メッセージログ集合
３２０解析定義記憶部
３２１解析定義ファイル
３３０解析部
３４０照合定義記憶部
３４１照合定義ファイル
３５０照合部
３６０照合履歴記憶部
３６１照合履歴テーブル
４００リソース制御装置 100 Service System 200 Service Bus 210 Master 220 Slave 300 Monitoring Device 310 Message Log Storage Unit 311 Message Log Set 320 Analysis Definition Storage Unit 321 Analysis Definition File 330 Analysis Unit 340 Verification Definition Storage Unit 341 Verification Definition File 350 Verification Unit 360 Verification History Storage 361 Matching history table 400 Resource control device

Claims

Message log storage means for storing a log of an input message to a monitoring target system that performs predetermined processing on the input message;
Analyzing means for generating a collation definition representing characteristics of an input message to the monitored system in a period before a period in which a predetermined event has occurred in the monitored system based on the log;
Comparing means for comparing the characteristics of the input message to the monitoring target system in the new period and the matching definition, and determining the presence or absence of a precursor of the predetermined event in the monitoring target system;
Equipped with a,
The input message characteristics of the monitored system include at least one of an average message length and a ratio of values set in predetermined data items in the message.
Information processing device.

Further, it comprises resource control means for adding or deleting resources of the monitoring target system according to whether or not a precursor of the predetermined event has been detected.
The information processing apparatus according to claim 1.

The monitored system includes a plurality of subsystems, and a message input to the monitored system is processed by any of the plurality of subsystems.
The resource control means adds or deletes the subsystem according to the presence or absence of detection of a precursor of the predetermined event.
The information processing apparatus according to claim 2.

The analysis means, when it is determined that there is a precursor of the predetermined event in the new period, the number of times that the predetermined event has not occurred in a period after the new period is greater than or equal to a predetermined threshold , Delete the matching definition,
The information processing apparatus according to any one of claims 1 to 3.

Said predetermined event is the monitored system is that in a high load state, the information processing apparatus according to any one of claims 1 to 4.

Said predetermined event is the an input of a predetermined message to the monitored system, information processing apparatus according to any one of claims 1 to 4.

Store a log of input messages to the monitored system that performs predetermined processing on the input messages,
Based on the log, generate a collation definition that represents the characteristics of the input message to the monitored system in the period before the period in which the predetermined event occurred in the monitored system ,
Comparing the characteristics of the input message to the monitored system in the new period with the matching definition to determine the presence or absence of a precursor of the predetermined event in the monitored system;
A monitoring method,
The input message characteristics of the monitored system include at least one of an average message length and a ratio of values set in predetermined data items in the message.
Monitoring method.

Furthermore, depending on whether or not a precursor of the predetermined event is detected, the resource of the monitored system is added or deleted,
The monitoring method according to claim 7 .

On the computer,
Store a log of input messages to the monitored system that performs predetermined processing on the input messages,
Based on the log, generate a collation definition that represents the characteristics of the input message to the monitored system in the period before the period in which the predetermined event occurred in the monitored system ,
Comparing the characteristics of the input message to the monitored system in the new period with the matching definition to determine the presence or absence of a precursor of the predetermined event in the monitored system;
A program for executing processing ,
The input message characteristics of the monitored system include at least one of an average message length and a ratio of values set in predetermined data items in the message.
program.

Further, depending on whether or not a precursor of the predetermined event has been detected, a process for adding or deleting the resource of the monitoring target system is executed.
The program according to claim 9.