JP7027912B2

JP7027912B2 - Order control program, order control method, and information processing device

Info

Publication number: JP7027912B2
Application number: JP2018012552A
Authority: JP
Inventors: 貴史昆野
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2018-01-29
Filing date: 2018-01-29
Publication date: 2022-03-02
Anticipated expiration: 2038-01-29
Also published as: JP2019133246A

Description

本件は、順序制御プログラム、順序制御方法、及び情報処理装置に関する。 This case relates to an order control program, an order control method, and an information processing apparatus.

サーバ装置、ストレージ装置、ネットワーク装置などを含むInformation and Communication Technology（ＩＣＴ）システムが知られている。また、例えばサーバ装置からCentral Processing Unit（ＣＰＵ）やメモリの使用状況に関する時系列データを採取して、使用状況の時間変化をモニタにグラフで表示する観測装置も知られている。特に、観測装置がＩＣＴシステムのアノマリ事象を検知し、アノマリ事象の発生状況をモニタに表示する技術も知られている（以上、例えば特許文献１参照）。 Information and Communication Technology (ICT) systems including server devices, storage devices, network devices, etc. are known. Further, for example, an observation device that collects time-series data related to the usage status of a Central Processing Unit (CPU) or a memory from a server device and displays the time change of the usage status on a monitor as a graph is also known. In particular, there is also known a technique in which an observation device detects an anomaly event in an ICT system and displays the occurrence status of the anomaly event on a monitor (see, for example, Patent Document 1).

特開２０１７－０７２８８２号公報Japanese Unexamined Patent Publication No. 2017-072882

ところで、アノマリ事象が大量に検知された場合、ＩＣＴシステムの運用者又は管理者（以下、単に運用者という。）には、アノマリ事象に対する対応の優先順位を判断し、そのアノマリ事象に対応することが求められる。特に、運用者にはアノマリ事象が検知されてからそのアノマリ事象に起因する問題が発生するまでの時間内で優先順位を判断し、そのアノマリ事象に対応することが求められる。 By the way, when a large number of anomaly events are detected, the operator or administrator of the ICT system (hereinafter, simply referred to as an operator) should determine the priority of the response to the anomaly event and respond to the anomaly event. Is required. In particular, the operator is required to determine the priority within the time from the detection of the anomaly event to the occurrence of the problem caused by the anomaly event and respond to the anomaly event.

運用者が優先順位の判断を誤り、アノマリ事象に対する対応が遅延すると、そのアノマリ事象に起因する問題が発生する。例えば、システム障害を引き起こす可能性があるアノマリ事象が検知された場合に、運用者がそのアノマリ事象に対する対応の優先順位の判断を誤ると、システム障害が発生する可能性がある。仮に、システム障害が現実に発生すると、ＩＣＴシステムを利用する多くの利用者が影響を受けるおそれがある。 If the operator makes a mistake in determining the priority and the response to the anomaly event is delayed, a problem caused by the anomaly event occurs. For example, if an anomaly event that may cause a system failure is detected and the operator makes a mistake in determining the priority of response to the anomaly event, a system failure may occur. If a system failure actually occurs, many users of the ICT system may be affected.

そこで、１つの側面では、アノマリ事象対応の遅延による問題の発生を抑制できる順序制御プログラム、順序制御方法、及び情報処理装置を提供することを目的とする。 Therefore, in one aspect, it is an object of the present invention to provide an order control program, an order control method, and an information processing apparatus capable of suppressing the occurrence of a problem due to a delay in responding to an anomaly event.

１つの実施態様では、順序制御プログラムは、システムに含まれる複数のマシンの監視データに基づき、前記複数のマシンのいずれかで発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、検知された前記複数のアノマリ事象それぞれの発生元のマシンごとに、該マシンで実行されるサービスに対して特定の関係性を有する他のサービスの数を特定し、前記複数のアノマリ事象を検知してから経過した時間を計測し、検知された前記複数のアノマリ事象に関する情報を、特定した前記他のサービスの数に応じた順序で表示部に表示し、計測した前記時間が短いほど、検知された前記複数のアノマリ事象に関する情報を上位に表示する、処理をコンピュータに実行させる。 In one embodiment, the sequence control program performs anomaly event detection processing for detecting anomaly events occurring in any of the plurality of machines based on monitoring data of a plurality of machines included in the system, and performs anomaly event detection processing, and the plurality of anomaly events. When is detected, the machine refers to a storage unit that stores information indicating the relationship between the plurality of services executed by the system, and the machine is used for each of the detected machines that generate the plurality of anomaly events. The number of other services that have a specific relationship to the service executed in is specified, the time elapsed since the detection of the plurality of anomaly events is measured, and the information regarding the detected multiple anomaly events is measured. Is displayed on the display unit in the order corresponding to the number of the specified other services, and the shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed on the computer. Let me.

１つの実施態様では、順序制御プログラムは、システムに含まれる複数の監視対象の監視データに基づき、前記複数の監視対象のいずれかについて発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元の監視対象ごとに、該監視対象について発生したアノマリ事象の検知頻度を算出し、前記複数のアノマリ事象を検知してから経過した時間を計測し、検知された前記複数のアノマリ事象に関する情報を、算出した前記検知頻度に応じた順序で表示部に表示し、計測した前記時間が短いほど、検知された前記複数のアノマリ事象に関する情報を上位に表示する、処理をコンピュータに実行させる。 In one embodiment, the sequence control program performs anomaly event detection processing for detecting anomaly events that have occurred in any of the plurality of monitoring targets based on the monitoring data of the plurality of monitoring targets included in the system, and a plurality of monitoring targets. When an anomaly event is detected, the detection frequency of the anomaly event that occurred for the monitored object is calculated for each monitoring target of the source of each of the plurality of detected anomaly events, and the plurality of anomaly events are detected. The time elapsed from the measurement is measured, and the information regarding the detected plurality of anomaly events is displayed on the display unit in the order corresponding to the calculated detection frequency. The shorter the measured time, the more the plurality of detected events are detected. Anomaly Causes the computer to perform a process that displays information about the event at the top .

アノマリ事象対応の遅延による問題の発生を抑制することができる。 It is possible to suppress the occurrence of problems due to delays in responding to anomaly events.

図１は情報処理システムの一例である。FIG. 1 is an example of an information processing system. 図２はＣＭＤＢの一例である。FIG. 2 is an example of CMDB. 図３は分析サーバのハードウェア構成の一例である。FIG. 3 is an example of the hardware configuration of the analysis server. 図４（ａ）は分析サーバのブロック図の一例である。図４（ｂ）はデータ記憶部の一例である。FIG. 4A is an example of a block diagram of the analysis server. FIG. 4B is an example of a data storage unit. 図５は第１データ管理テーブルの一例である。FIG. 5 is an example of the first data management table. 図６（ａ）は第２データ管理テーブル及び第３データ管理テーブルの一例である。図６（ｂ）は第４データ管理テーブルの一例である。FIG. 6A is an example of the second data management table and the third data management table. FIG. 6B is an example of the fourth data management table. 図７は第５データ管理テーブルの一例である。FIG. 7 is an example of the fifth data management table. 図８はデータ収集部が実行する第１処理の一例を示すフローチャートである。FIG. 8 is a flowchart showing an example of the first process executed by the data collection unit. 図９はアノマリ検知部及び表示順序制御部が実行する第２処理の一例を示すフローチャートである。FIG. 9 is a flowchart showing an example of the second process executed by the anomaly detection unit and the display order control unit. 図１０は表示優先度の一例を説明する図である。FIG. 10 is a diagram illustrating an example of display priority. 図１１はデータ可視化部が実行する第３処理の一例を示すフローチャートである。FIG. 11 is a flowchart showing an example of the third process executed by the data visualization unit. 図１２はアノマリ一覧画面の一例である。FIG. 12 is an example of the anomaly list screen. 図１３はアノマリ詳細画面の一例である。FIG. 13 is an example of an anomaly detail screen. 図１４はアノマリ一覧画面の他の一例である。FIG. 14 is another example of the anomaly list screen. 図１５はアノマリ一覧画面の他の一例である。FIG. 15 is another example of the anomaly list screen. 図１６はアノマリ詳細画面の他の一例である。FIG. 16 is another example of the anomaly detail screen.

以下、本件を実施するための形態について図面を参照して説明する。 Hereinafter, a mode for carrying out this case will be described with reference to the drawings.

（第１実施形態）
図１は情報処理システムＳの一例である。図２はConfiguration Management Database（ＣＭＤＢ）１４０の一例である。情報処理システムＳは、監視対象システム１００と構成収集サーバ１３０と記憶部としてのＣＭＤＢ１４０とを備えている。ＣＭＤＢ１４０は構成収集サーバ１３０に含まれていてもよい。また、情報処理システムＳは、性能監視サーバ１５０とイベント監視サーバ１７０とを備えている。さらに、情報処理システムＳは、管理サーバ１９０と情報処理装置としての分析サーバ２００と運用端末３００とを備えている。 (First Embodiment)
FIG. 1 is an example of the information processing system S. FIG. 2 is an example of the Configuration Management Database (CMDB) 140. The information processing system S includes a monitoring target system 100, a configuration collection server 130, and a CMDB 140 as a storage unit. The CMDB 140 may be included in the configuration collection server 130. Further, the information processing system S includes a performance monitoring server 150 and an event monitoring server 170. Further, the information processing system S includes a management server 190, an analysis server 200 as an information processing device, and an operation terminal 300.

尚、情報処理システムＳの構成要素から監視対象システム１００、構成収集サーバ１３０、ＣＭＤＢ１４０、性能監視サーバ１５０、イベント監視サーバ１７０、運用端末３００の少なくとも１つを除外してもよい。すなわち、情報処理システムＳは少なくとも管理サーバ１９０と分析サーバ２００とを備えていればよい。また、運用端末３００は、図１に示すように、入力部３１０、表示部３２０、及び制御部（不図示）を含むPersonal Computer（ＰＣ）によって実現することができる。運用端末３００をタブレット端末といったスマートデバイスによって実現してもよい。運用端末３００は運用者によって操作される。 At least one of the monitoring target system 100, the configuration collection server 130, the CMDB 140, the performance monitoring server 150, the event monitoring server 170, and the operation terminal 300 may be excluded from the components of the information processing system S. That is, the information processing system S may include at least a management server 190 and an analysis server 200. Further, as shown in FIG. 1, the operation terminal 300 can be realized by a personal computer (PC) including an input unit 310, a display unit 320, and a control unit (not shown). The operation terminal 300 may be realized by a smart device such as a tablet terminal. The operation terminal 300 is operated by the operator.

監視対象システム１００は複数のホストマシン１１０を含んでいる。複数のホストマシン１１０の一部又は全部は互いに接続されている。複数のホストマシン１１０は通信ネットワークＮＷを介して構成収集サーバ１３０、性能監視サーバ１５０、及びイベント監視サーバ１７０と接続されている。通信ネットワークＮＷとしては、例えばインターネットやLocal Area Network（ＬＡＮ）などがある。ホストマシン１１０は例えばＷｅｂサーバ、アプリケーションサーバ、データベースサーバ、仮想化サーバといった各種のサーバ装置、ストレージ装置、ネットワーク装置などが該当する。したがって、監視対象システム１００は計算機システム又はＩＣＴシステムと呼ばれる場合もある。情報処理システムＳ又は監視対象システム１００のエンドユーザである利用者（不図示）は、監視対象システム１００にアクセスして、監視対象システム１００が実行又は提供（以下、単に実行という。）する複数のサービスを利用することができる。複数のサービスとしては、例えばデータ処理サービスや画像化サービスなど、業務支援に関するサービスがある。 The monitored system 100 includes a plurality of host machines 110. Some or all of the plurality of host machines 110 are connected to each other. The plurality of host machines 110 are connected to the configuration collection server 130, the performance monitoring server 150, and the event monitoring server 170 via the communication network NW. The communication network NW includes, for example, the Internet and a Local Area Network (LAN). The host machine 110 corresponds to, for example, various server devices such as a Web server, an application server, a database server, and a virtualization server, a storage device, a network device, and the like. Therefore, the monitored system 100 may be referred to as a computer system or an ICT system. A user (not shown) who is an end user of the information processing system S or the monitored system 100 accesses the monitored system 100 and executes or provides the monitored system 100 (hereinafter, simply referred to as execution). You can use the service. As a plurality of services, there are services related to business support such as a data processing service and an imaging service.

尚、監視対象システム１００にはクラウド基盤又はクラウド環境（以下、単にクラウド基盤という。）を構築するソフトウェア群（例えばOpen Stackなど）が実装されている。より詳しくは、上述したホストマシン１１０によってはKeystoneと呼ばれるソフトウェアが実装されている。Keystoneは利用者の管理やアクセス制御といった認証機能を発揮する。また、ホストマシン１１０によってはNovaと呼ばれるソフトウェアが実装されている。Novaは仮想マシンを実行するコンピュート機能を発揮する。その他、Swift、Glance、Cinderなどのソフトウェアも知られている。これらのソフトウェアが互いに連携することによって、監視対象システム１００はクラウド基盤を実現する。 The monitored system 100 is equipped with a cloud platform or a software group (for example, Open Stack) for constructing a cloud environment (hereinafter, simply referred to as a cloud platform). More specifically, some of the host machines 110 described above implement software called Keystone. Keystone demonstrates authentication functions such as user management and access control. Further, depending on the host machine 110, software called Nova is implemented. Nova has the ability to compute the ability to run virtual machines. Other software such as Swift, Glance, and Cinder are also known. By linking these softwares with each other, the monitored system 100 realizes a cloud platform.

構成収集サーバ１３０は複数のホストマシン１１０にアクセスし、各ホストマシン１１０が有する種々の構成情報を動的に収集してＣＭＤＢ１４０に格納する。構成情報としては、ホストマシン１１０のハードウェア構成に関する情報（具体的にはホストマシンの名称やＩＰアドレスなど）や、仮想環境に関する情報（具体的にはハイパーバイザーの版数、ＣＰＵやメモリの使用状況など）がある。また、構成情報としては、Operating System（ＯＳ）やアプリケーションソフトウェア（以下、単にアプリという）に関する情報や、パッチ適用の状況及びアプリのパラメータ設定状況に関する情報もある。その他、構成収集サーバ１３０はアクセスしたホストマシン１１０が実行するサービスとそのサービスに関連する一世代上のサービスと一世代下のサービスを特定して構成情報として格納する。また、構成収集サーバ１３０は複数のホストマシン１１０の親子関係又は主従関係（以下、単に親子関係という。）を特定して構成情報として格納する。 The configuration collection server 130 accesses a plurality of host machines 110, dynamically collects various configuration information possessed by each host machine 110, and stores them in the CMDB 140. The configuration information includes information on the hardware configuration of the host machine 110 (specifically, the name and IP address of the host machine, etc.) and information on the virtual environment (specifically, the version number of the hypervisor, CPU and memory usage). Situation etc.). Further, the configuration information includes information on the Operating System (OS) and application software (hereinafter, simply referred to as an application), patch application status, and application parameter setting status. In addition, the configuration collection server 130 identifies the service executed by the accessed host machine 110, the service one generation higher and the service one generation lower related to the service, and stores the service as configuration information. Further, the configuration collection server 130 identifies a parent-child relationship or a master-slave relationship (hereinafter, simply referred to as a parent-child relationship) of a plurality of host machines 110 and stores them as configuration information.

これにより、ＣＭＤＢ１４０は、図２に示すように、各ホストマシン１１０の構成情報をホストマシン１１０毎に記憶する。例えば、「ホスト２」の名称を有するホストマシン１１０は「ホスト１」の名称を有するホストマシン１１０が親であり、「ホスト３」の名称を有するホストマシン１１０が子の関係を有する。また、「ホスト２」の名称を有するホストマシン１１０はサービス３、サービス４、及びサービス５を実行する。一方で、「ホスト２」の名称を有するホストマシン１１０が実行するサービスに関連するサービスとして、親の関係にあるホストマシン１１０が実行するサービス１及びサービス２と、「ホスト３」の名称を有するホストマシン１１０が実行するサービス６とがある。このように、複数のサービス１～６にはホストマシン１１０の親子関係に伴う特定の関係性がある。したがって、例えば「ホスト２」の名称を有するホストマシン１１０に障害が発生すると、サービス３～５及び一世代下のサービス６の実行が停止するが、一世代上のサービス１～２の実行は継続する。尚、運用者が構成情報をＣＭＤＢ１４０に格納する操作を行って、運用端末３００がＣＭＤＢ１４０に構成情報を格納してもよい。 As a result, the CMDB 140 stores the configuration information of each host machine 110 for each host machine 110, as shown in FIG. For example, the host machine 110 having the name "host 2" has a parent relationship with the host machine 110 having the name "host 1", and the host machine 110 having the name "host 3" has a child relationship. Further, the host machine 110 having the name of "host 2" executes the service 3, the service 4, and the service 5. On the other hand, as services related to the service executed by the host machine 110 having the name "host 2", the services 1 and 2 executed by the host machine 110 having a parental relationship and the name "host 3" are included. There is a service 6 executed by the host machine 110. As described above, the plurality of services 1 to 6 have a specific relationship associated with the parent-child relationship of the host machine 110. Therefore, for example, if a failure occurs in the host machine 110 having the name "host 2", the execution of services 3 to 5 and the service 6 one generation below is stopped, but the execution of services 1 and 2 one generation above continues. do. The operator may perform an operation to store the configuration information in the CMDB 140, and the operation terminal 300 may store the configuration information in the CMDB 140.

性能監視サーバ１５０は通信ネットワークＮＷを介して各ホストマシン１１０及び各ホストマシン１１０のリソースを監視する。具体的には、性能監視サーバ１５０はハードウェア資源（例えばＣＰＵやメモリなど）を監視する。性能監視サーバ１５０はソフトウェア資源を監視してもよい。性能監視サーバ１５０は監視対象であるリソースから性能や負荷といったデータを監視データとして取得する。監視データには、ＣＰＵ使用率やメモリ使用率、ディスクスループットなどがリソースの性能や負荷として含まれている。性能監視サーバ１５０は取得した監視データを様々な角度で対比して管理する。 The performance monitoring server 150 monitors the resources of each host machine 110 and each host machine 110 via the communication network NW. Specifically, the performance monitoring server 150 monitors hardware resources (for example, CPU, memory, etc.). The performance monitoring server 150 may monitor software resources. The performance monitoring server 150 acquires data such as performance and load from the resource to be monitored as monitoring data. The monitoring data includes CPU usage rate, memory usage rate, disk throughput, etc. as resource performance and load. The performance monitoring server 150 manages the acquired monitoring data by comparing them at various angles.

イベント監視サーバ１７０は通信ネットワークＮＷを介して監視対象システム１００で発生する種々のイベント（事象）を監視する。例えば、イベント監視サーバ１７０は、監視対象システム１００で実行される複数のサービスのサービスレベルの低下の原因となり得るイベントが発生すると、そのイベントの発生を検知し、インシデント情報（以下、ＩＮＣチケットという。）を発行して管理サーバ１９０に通知する。詳細は後述するが、ＩＮＣチケットは、チケットの識別子、チケットの発行日時、発生したイベントに関連するサービスの名称、そのサービスを実行するホストマシン１１０の名称、発生したイベントの内容などを含んでいる。 The event monitoring server 170 monitors various events (events) that occur in the monitored system 100 via the communication network NW. For example, when an event that can cause a decrease in the service level of a plurality of services executed by the monitored system 100 occurs, the event monitoring server 170 detects the occurrence of the event and causes incident information (hereinafter referred to as INC ticket). ) Is issued to notify the management server 190. Although details will be described later, the INC ticket includes the ticket identifier, the ticket issuance date and time, the name of the service related to the event that occurred, the name of the host machine 110 that executes the service, the content of the event that occurred, and the like. ..

管理サーバ１９０は不図示のチケットＤＢを備え、イベント監視サーバ１７０から通知されたＩＮＣチケットをチケットＤＢにより記憶して管理する。また、管理サーバ１９０は利用者端末（不図示）が発行して通知したＩＮＣチケットや運用端末３００が発行して通知した変更要求チケットをチケットＤＢにより記憶して管理する。利用者端末は利用者が利用する端末装置である。尚、利用者端末が発行したＩＮＣチケットには、チケットの識別情報、チケットの発行日時のほか、監視対象システム１００に関する問い合わせなどが含まれている。一方、運用端末３００が発行した変更要求チケットには、チケットの識別子、チケットの発行日時、ホストマシン１１０の変更により影響を受けるサービスの名称、そのサービスを実行するホストマシン１１０の名称、改変や修正といった変更内容などを含んでいる。 The management server 190 includes a ticket DB (not shown), and stores and manages the INC ticket notified from the event monitoring server 170 by the ticket DB. Further, the management server 190 stores and manages the INC ticket issued and notified by the user terminal (not shown) and the change request ticket issued and notified by the operation terminal 300 in the ticket DB. The user terminal is a terminal device used by the user. The INC ticket issued by the user terminal includes ticket identification information, the ticket issuance date and time, and inquiries regarding the monitored system 100. On the other hand, in the change request ticket issued by the operation terminal 300, the identifier of the ticket, the issue date and time of the ticket, the name of the service affected by the change of the host machine 110, the name of the host machine 110 that executes the service, modification or modification. It includes changes such as.

分析サーバ２００はＣＭＤＢ１４０、性能監視サーバ１５０、管理サーバ１９０、及び運用端末３００と接続されている。分析サーバ２００は、図１に示すように、ＣＭＤＢ１４０、性能監視サーバ１５０、管理サーバ１９０、及び運用端末３００と直接的に接続されていてもよいし、通信ネットワークＮＷを介してＣＭＤＢ１４０、性能監視サーバ１５０、管理サーバ１９０、及び運用端末３００と間接的に接続されていてもよい。 The analysis server 200 is connected to the CMDB 140, the performance monitoring server 150, the management server 190, and the operation terminal 300. As shown in FIG. 1, the analysis server 200 may be directly connected to the CMDB 140, the performance monitoring server 150, the management server 190, and the operation terminal 300, or the CMDB 140 and the performance monitoring server via the communication network NW. It may be indirectly connected to the 150, the management server 190, and the operation terminal 300.

分析サーバ２００は性能監視サーバ１５０が取得した監視データを収集し、収集した監視データに基づいてアノマリ事象検知処理を実行する。アノマリ事象検知処理は、監視データを分析し、ホストマシン１１０やそのリソースの通常時、正常時、又は定常時（以下、単に通常時という。）の状態と違う状態を検知する処理である。アノマリ事象検知処理により通常時の状態と違う状態が検知されると、ホストマシン１１０やそのリソース、ホストマシン１１０を含む監視対象システム１００に問題が発生する可能性がある。 The analysis server 200 collects the monitoring data acquired by the performance monitoring server 150, and executes the anomaly event detection process based on the collected monitoring data. The anomaly event detection process is a process of analyzing monitoring data and detecting a state different from the normal time, normal time, or steady state (hereinafter, simply referred to as normal time) of the host machine 110 and its resources. If a state different from the normal state is detected by the anomaly event detection process, a problem may occur in the monitored system 100 including the host machine 110, its resources, and the host machine 110.

また、分析サーバ２００は、アノマリ事象が検知された監視データ（以下、アノマリデータという。）と、ＣＭＤＢ１４０が記憶する構成情報とに基づいて、アノマリ事象に関するアノマリ情報の表示順序を決定する。分析サーバ２００は、アノマリデータと、アノマリ事象の検知頻度とに基づいて、アノマリ情報の表示順序を決定してもよい。検知頻度はアノマリ事象を連続して検知した連続検知数であってもよいし、単位時間あたりの検知回数であってもよい。本実施形態では、検知頻度の一例として連続検知数を使用して説明する。 Further, the analysis server 200 determines the display order of the anomaly information regarding the anomaly event based on the monitoring data in which the anomaly event is detected (hereinafter referred to as anomaly data) and the configuration information stored in the CMDB 140. The analysis server 200 may determine the display order of the anomaly information based on the anomaly data and the detection frequency of the anomaly event. The detection frequency may be the number of continuous detections in which anomaly events are continuously detected, or may be the number of detections per unit time. In this embodiment, the number of continuous detections will be described as an example of the detection frequency.

さらに、分析サーバ２００は、アノマリデータと、構成情報と、アノマリ事象の検知頻度と、管理サーバ１９０が管理するＩＮＣチケットの発行数とに基づいて、アノマリ事象情報の表示順序を決定してもよい。表示順序を決定する際に、ＩＮＣチケットの発行数を利用することで、アノマリ事象に起因する問題が発生する可能性の確度が向上する。例えば、ＩＮＣチケットが全く発行されていなければ、アノマリ事象が検知されても、そのアノマリ事象に起因する問題が発生する可能性は低いと想定される。尚、詳細は後述するが、分析サーバ２００は運用端末３００から各種の要求や指示を受け付けると、アノマリ情報を、決定した表示順序で、運用端末３００の表示部３２０に表示する。 Further, the analysis server 200 may determine the display order of the anomaly event information based on the anomaly data, the configuration information, the detection frequency of the anomaly event, and the number of INC tickets issued by the management server 190. .. By using the number of INC tickets issued when determining the display order, the probability that a problem due to an anomaly event will occur is improved. For example, if no INC ticket is issued, even if an anomaly event is detected, it is unlikely that a problem caused by the anomaly event will occur. Although the details will be described later, when the analysis server 200 receives various requests and instructions from the operation terminal 300, the analysis server 200 displays the anomaly information on the display unit 320 of the operation terminal 300 in the determined display order.

次に、図３を参照して、分析サーバ２００のハードウェア構成について説明する。尚、上述した複数のホストマシン１１０、構成収集サーバ１３０、ＣＭＤＢ１４０、性能監視サーバ１５０、イベント監視サーバ１７０、管理サーバ１９０、及び運用端末３００については基本的に分析サーバ２００と同様のハードウェア構成であるため、説明を省略する。 Next, the hardware configuration of the analysis server 200 will be described with reference to FIG. The plurality of host machines 110, the configuration collection server 130, the CMDB 140, the performance monitoring server 150, the event monitoring server 170, the management server 190, and the operation terminal 300 described above basically have the same hardware configuration as the analysis server 200. Therefore, the description is omitted.

図３は分析サーバ２００のハードウェア構成の一例である。図３に示すように、分析サーバ２００は、少なくともハードウェアプロセッサとしてのＣＰＵ２００Ａ、Random Access Memory（ＲＡＭ）２００Ｂ、Read Only Memory（ＲＯＭ）２００Ｃ、及びネットワークＩ／Ｆ（インタフェース）２００Ｄを含んでいる。分析サーバ２００は、必要に応じて、Hard Disk Drive（ＨＤＤ）２００Ｅ、入力Ｉ／Ｆ２００Ｆ、出力Ｉ／Ｆ２００Ｇ、入出力Ｉ／Ｆ２００Ｈ、ドライブ装置２００Ｉの少なくとも１つを含んでいてもよい。ＣＰＵ２００Ａからドライブ装置２００Ｉまでは、内部バス２００Ｊによって互いに接続されている。すなわち、分析サーバ２００はコンピュータによって実現することができる。尚、ＣＰＵ２００Ａに代えてMicro Processing Unit（ＭＰＵ）をハードウェアプロセッサとして利用してもよい。 FIG. 3 is an example of the hardware configuration of the analysis server 200. As shown in FIG. 3, the analysis server 200 includes at least a CPU 200A as a hardware processor, a Random Access Memory (RAM) 200B, a Read Only Memory (ROM) 200C, and a network I / F (interface) 200D. The analysis server 200 may include at least one of a Hard Disk Drive (HDD) 200E, an input I / F200F, an output I / F200G, an input / output I / F200H, and a drive device 200I, if necessary. The CPU 200A to the drive device 200I are connected to each other by an internal bus 200J. That is, the analysis server 200 can be realized by a computer. A Micro Processing Unit (MPU) may be used as a hardware processor instead of the CPU 200A.

入力Ｉ／Ｆ２００Ｆには、入力装置７１０が接続される。入力装置７１０としては、例えばキーボードやマウスなどがある。出力Ｉ／Ｆ２００Ｇには、表示装置７２０が接続される。表示装置７２０としては、例えば液晶ディスプレイがある。入出力Ｉ／Ｆ２００Ｈには、半導体メモリ７３０が接続される。半導体メモリ７３０としては、例えばUniversal Serial Bus（ＵＳＢ）メモリやフラッシュメモリなどがある。入出力Ｉ／Ｆ２００Ｈは、半導体メモリ７３０に記憶されたプログラムやデータを読み取る。入力Ｉ／Ｆ２００Ｆ及び入出力Ｉ／Ｆ２００Ｈは、例えばＵＳＢポートを備えている。出力Ｉ／Ｆ２００Ｇは、例えばディスプレイポートを備えている。 An input device 710 is connected to the input I / F 200F. The input device 710 includes, for example, a keyboard and a mouse. A display device 720 is connected to the output I / F 200G. The display device 720 includes, for example, a liquid crystal display. A semiconductor memory 730 is connected to the input / output I / F 200H. Examples of the semiconductor memory 730 include a Universal Serial Bus (USB) memory and a flash memory. The input / output I / F 200H reads programs and data stored in the semiconductor memory 730. The input I / F 200F and the input / output I / F 200H include, for example, a USB port. The output I / F 200G includes, for example, a display port.

ドライブ装置２００Ｉには、可搬型記録媒体７４０が挿入される。可搬型記録媒体７４０としては、例えばCompact Disc（ＣＤ）－ＲＯＭ、Digital Versatile Disc（ＤＶＤ）といったリムーバブルディスクがある。ドライブ装置２００Ｉは、可搬型記録媒体７４０に記録されたプログラムやデータを読み込む。ネットワークＩ／Ｆ２００Ｄは、例えばＬＡＮポートを備えている。ネットワークＩ／Ｆ２００Ｄは上述したＣＭＤＢ１４０、性能監視サーバ１５０、管理サーバ１９０、及び運用端末３００と接続される。 A portable recording medium 740 is inserted into the drive device 200I. Examples of the portable recording medium 740 include removable discs such as Compact Disc (CD) -ROM and Digital Versatile Disc (DVD). The drive device 200I reads programs and data recorded on the portable recording medium 740. The network I / F200D includes, for example, a LAN port. The network I / F200D is connected to the CMDB 140, the performance monitoring server 150, the management server 190, and the operation terminal 300 described above.

上述したＲＡＭ２００Ｂには、ＲＯＭ２００ＣやＨＤＤ２００Ｅに記憶されたプログラムがＣＰＵ２００Ａによって一時的に格納される。ＲＡＭ２００Ｂには、可搬型記録媒体７４０に記録されたプログラムがＣＰＵ２００Ａによって一時的に格納される。格納されたプログラムをＣＰＵ２００Ａが実行することにより、ＣＰＵ２００Ａは後述する各種の機能を実現し、また、後述する各種の処理を実行する。尚、プログラムは後述するフローチャートに応じたものとすればよい。 In the RAM 200B described above, the program stored in the ROM 200C or the HDD 200E is temporarily stored by the CPU 200A. The program recorded on the portable recording medium 740 is temporarily stored in the RAM 200B by the CPU 200A. When the CPU 200A executes the stored program, the CPU 200A realizes various functions described later and also executes various processes described later. The program may be adapted to the flowchart described later.

次に、図４から図７を参照して、分析サーバ２００の機能構成について説明する。 Next, the functional configuration of the analysis server 200 will be described with reference to FIGS. 4 to 7.

図４（ａ）は分析サーバ２００のブロック図の一例である。図４（ｂ）はデータ記憶部２２０の一例である。図５は第１データ管理テーブルＴ１の一例である。図６（ａ）は第２データ管理テーブルＴ２及び第３データ管理テーブルＴ３の一例である。図６（ｂ）は第４データ管理テーブルＴ４の一例である。図７は第５データ管理テーブルＴ５の一例である。図４（ａ）に示すように、分析サーバ２００は、処理部としての、データ収集部２１０、アノマリ検知部２３０、表示順序制御部２４０、及びデータ可視化部２５０を含んでいる。また、分析サーバ２００はデータ記憶部２２０を含んでいる。データ記憶部２２０は、図４（ｂ）に示すように、第１データ管理テーブルＴ１、第２データ管理テーブルＴ２、第３データ管理テーブルＴ３、第４データ管理テーブルＴ４、及び第５データ管理テーブルＴ５により管理された種々のデータを記憶する。 FIG. 4A is an example of a block diagram of the analysis server 200. FIG. 4B is an example of the data storage unit 220. FIG. 5 is an example of the first data management table T1. FIG. 6A is an example of the second data management table T2 and the third data management table T3. FIG. 6B is an example of the fourth data management table T4. FIG. 7 is an example of the fifth data management table T5. As shown in FIG. 4A, the analysis server 200 includes a data collection unit 210, an anomaly detection unit 230, a display order control unit 240, and a data visualization unit 250 as processing units. Further, the analysis server 200 includes a data storage unit 220. As shown in FIG. 4B, the data storage unit 220 includes a first data management table T1, a second data management table T2, a third data management table T3, a fourth data management table T4, and a fifth data management table. Stores various data managed by T5.

尚、データ収集部２１０は、公知のログ収集転送ツール（例えばFluentdやLogstashなどのソフトウェア）と後述する第１処理とをＣＰＵ２００Ａが実行することによって実現することができる。データ記憶部２２０は、Open Source Software（ＯＳＳ）の分散型検索リアルタイム分析エンジン（例えばElasticsearchなどのソフトウェア）をＣＰＵ２００Ａが実行し、ＨＤＤ２００Ｅと協同することによって実現することができる。アノマリ検知部２３０及び表示順序制御部２４０は後述する第２処理をＣＰＵ２００Ａが実行することによって実現することができる。データ可視化部２５０は、公知のデータ可視化ツール（例えばkibanaなどのソフトウェア）と後述する第３処理とをＣＰＵ２００Ａが実行することによって実現することができる。 The data collection unit 210 can be realized by the CPU 200A executing a known log collection / transfer tool (for example, software such as Fluentd or Logstash) and the first process described later. The data storage unit 220 can be realized by the CPU 200A executing a distributed search real-time analysis engine (for example, software such as Elasticsearch) of Open Source Software (OSS) and cooperating with the HDD 200E. The anomaly detection unit 230 and the display order control unit 240 can be realized by the CPU 200A executing the second process described later. The data visualization unit 250 can be realized by the CPU 200A executing a known data visualization tool (for example, software such as kibana) and a third process described later.

データ収集部２１０は性能監視サーバ１５０から監視データを収集する。より詳しくは、データ収集部２１０は数分から十数分の単位で定期的に性能監視サーバ１５０にアクセスし、性能監視サーバ１５０が複数のホストマシン１１０のそれぞれから取得した監視データを収集する。データ収集部２１０は監視データを収集すると、収集した監視データをデータ記憶部２２０に格納する。これにより、データ記憶部２２０は監視データを記憶する。特に、監視データは第１データ管理テーブルＴ１により管理される。 The data collection unit 210 collects monitoring data from the performance monitoring server 150. More specifically, the data collection unit 210 periodically accesses the performance monitoring server 150 in units of several minutes to ten and several minutes, and the performance monitoring server 150 collects monitoring data acquired from each of the plurality of host machines 110. When the data collection unit 210 collects the monitoring data, the collected monitoring data is stored in the data storage unit 220. As a result, the data storage unit 220 stores the monitoring data. In particular, the monitoring data is managed by the first data management table T1.

ここで、図５に示すように、第１データ管理テーブルＴ１はホストマシン１１０と監視対象リソースの組み合わせ毎に複数の監視データを管理する。第１データ管理テーブルＴ１は収集日時、ホストマシン、監視対象リソース、データ値、単位、アノマリ分析結果、及びアノマリ検知日時といった複数のフィールドを有する。収集日時のフィールドには監視データを収集した日時が登録される。ホストマシンのフィールドにはホストマシン１１０の名称又は識別子が登録される。監視対象リソースのフィールドには監視対象リソースの名称が登録される。データ値と単位のフィールドには監視データの性能を表す実測値とその実測値の単位を表す記号又は文字がそれぞれ登録される。例えば監視対象リソース「ＣＰＵ」に対し単位「％」が登録されている場合、データ値「７８」はＣＰＵ使用率を表している。同様に、監視対象リソース「メモリ」に対し単位「％」が登録されている場合、データ値「６４」はメモリ使用率を表している。 Here, as shown in FIG. 5, the first data management table T1 manages a plurality of monitoring data for each combination of the host machine 110 and the monitored resource. The first data management table T1 has a plurality of fields such as a collection date and time, a host machine, a monitored resource, a data value, a unit, an anomaly analysis result, and an anomaly detection date and time. The date and time when the monitoring data was collected is registered in the collection date and time field. The name or identifier of the host machine 110 is registered in the field of the host machine. The name of the monitored resource is registered in the field of the monitored resource. In the data value and unit fields, the measured value indicating the performance of the monitoring data and the symbol or character indicating the unit of the measured value are registered, respectively. For example, when the unit "%" is registered for the monitored resource "CPU", the data value "78" represents the CPU usage rate. Similarly, when the unit "%" is registered for the monitored resource "memory", the data value "64" represents the memory usage rate.

アノマリ分析結果のフィールドにはアノマリ事象を検知したか否かを識別する識別フラグが登録される。例えば識別フラグ「１」はアノマリ事象を検知したことを表し、識別フラグ「０」はアノマリ事象を検知しなかったことを表す。アノマリ検知日時のフィールドにはアノマリ事象を検知した日時が登録される。アノマリ事象を検知した日時は、識別フラグ「１」が登録された監視データ（以下、アノマリデータという。）に対して登録され、識別フラグ「０」が登録された監視データ（以下、非アノマリデータという。）に対しては登録されない。このように、第１データ管理テーブルＴ１によりホストマシン１１０と監視対象リソースの組み合わせ毎に複数のアノマリデータ及び非アノマリデータが管理される。 An identification flag that identifies whether or not an anomaly event has been detected is registered in the field of the anomaly analysis result. For example, the identification flag "1" indicates that an anomaly event has been detected, and the identification flag "0" indicates that an anomaly event has not been detected. The date and time when the anomaly event was detected is registered in the anomaly detection date and time field. The date and time when the anomaly event is detected is registered for the monitoring data in which the identification flag "1" is registered (hereinafter referred to as anomaly data), and the monitoring data in which the identification flag "0" is registered (hereinafter, non-anomaly data). It is not registered for.). In this way, a plurality of anomaly data and non-anomaly data are managed for each combination of the host machine 110 and the monitored resource by the first data management table T1.

また、データ収集部２１０は管理サーバ１９０からＩＮＣチケット及び変更要求チケットを収集する。より詳しくは、データ収集部２１０は数分から十数分の単位で定期的に管理サーバ１９０にアクセスし、管理サーバ１９０が管理するＩＮＣチケット及び変更要求チケットを収集する。データ収集部２１０はＩＮＣチケット及び変更要求チケットを収集すると、収集したＩＮＣチケット及び変更要求チケットをデータ記憶部２２０に格納する。これにより、データ記憶部２２０はＩＮＣチケット及び変更要求チケットを記憶する。ＩＮＣチケットは第２データ管理テーブルＴ２及び第３データ管理テーブルＴ３により発行元毎に管理される。また、変更要求チケットは第４データ管理テーブルＴ４により管理される。 In addition, the data collection unit 210 collects INC tickets and change request tickets from the management server 190. More specifically, the data collection unit 210 periodically accesses the management server 190 in units of several minutes to a dozen minutes, and collects INC tickets and change request tickets managed by the management server 190. When the data collection unit 210 collects the INC ticket and the change request ticket, the collected INC ticket and the change request ticket are stored in the data storage unit 220. As a result, the data storage unit 220 stores the INC ticket and the change request ticket. The INC ticket is managed for each issuer by the second data management table T2 and the third data management table T3. Further, the change request ticket is managed by the fourth data management table T4.

ここで、図６（ａ）に示すように、第２データ管理テーブルＴ２はイベント監視サーバ１７０が発行した複数のＩＮＣチケットを管理する。また、第３データ管理テーブルＴ３は利用者端末が発行した複数のＩＮＣチケットを管理する。第２データ管理テーブルＴ２はチケットＩＤ、発行日時、サービス名、ホストマシン、及びタイトルといった複数のフィールドを有する。チケットＩＤのフィールドにはＩＮＣチケットを識別する識別子が登録される。発行日時のフィールドにはＩＮＣチケットの発行日時が登録される。サービス名のフィールドには発生したイベントに関連するサービスの名称が登録される。ホストマシンのフィールドにはそのサービスを実行するホストマシン１１０の名称又は識別子が登録される。タイトルのフィールドには発生したイベントの内容が登録される。尚、第３データ管理テーブルＴ３については、基本的に、第２データ管理テーブルＴ２と同様であるため、詳細な説明は省略する。 Here, as shown in FIG. 6A, the second data management table T2 manages a plurality of INC tickets issued by the event monitoring server 170. Further, the third data management table T3 manages a plurality of INC tickets issued by the user terminal. The second data management table T2 has a plurality of fields such as a ticket ID, an issue date and time, a service name, a host machine, and a title. An identifier that identifies an INC ticket is registered in the ticket ID field. The issue date and time of the INC ticket is registered in the issue date and time field. The name of the service related to the event that occurred is registered in the service name field. The name or identifier of the host machine 110 that executes the service is registered in the field of the host machine. The content of the event that occurred is registered in the title field. Since the third data management table T3 is basically the same as the second data management table T2, detailed description thereof will be omitted.

一方、図６（ｂ）に示すように、第４データ管理テーブルＴ４は運用端末３００が発行した複数の変更要求チケットを管理する。第４データ管理テーブルＴ４はチケットＩＤ、発行日時、サービス名、ホストマシン、及びタイトルといった複数のフィールドを有する。チケットＩＤのフィールドには変更要求チケットを識別する識別子が登録される。発行日時のフィールドには変更要求チケットの発行日時が登録される。サービス名のフィールドにはホストマシン１１０の変更により影響を受けるサービスの名称が登録される。ホストマシンのフィールドにはそのサービスを実行するホストマシン１１０の名称又は識別子が登録される。タイトルのフィールドには改変や修正といった変更内容が登録される。 On the other hand, as shown in FIG. 6B, the fourth data management table T4 manages a plurality of change request tickets issued by the operation terminal 300. The fourth data management table T4 has a plurality of fields such as a ticket ID, an issue date and time, a service name, a host machine, and a title. An identifier that identifies the change request ticket is registered in the ticket ID field. The issue date and time of the change request ticket is registered in the issue date and time field. In the service name field, the name of the service affected by the change of the host machine 110 is registered. The name or identifier of the host machine 110 that executes the service is registered in the field of the host machine. Changes such as modifications and corrections are registered in the title field.

アノマリ検知部２３０はデータ記憶部２２０から監視データを取得し、取得した監視データについてアノマリ事象検知処理を実行する。例えば、アノマリ検知部２３０はデータ記憶部２２０から取得した監視データに基づいて、監視データ（具体的には監視データのデータ値）の正規分布をホストマシン１１０と監視対象リソースとの組み合わせ毎に生成し、正規分布の平均値μから２σ以上離れた監視データをアノマリ事象として検知する。アノマリ検知部２３０はアノマリ事象を検知した監視データについて、第１データ管理テーブルＴ１に含まれるアノマリ分析結果のフィールドに識別フラグ「１」を分析結果として登録する。アノマリ検知部２３０はアノマリ事象を検知した監視データについて、第１データ管理テーブルＴ１に含まれるアノマリ検知日時のフィールドに検知日時を登録する。一方、アノマリ検知部２３０はアノマリ事象を検知しなかった監視データについて、第１データ管理テーブルＴ１に含まれるアノマリ分析結果に識別フラグ「０」を分析結果として登録する。すなわち、アノマリ検知部２３０は、監視データをアノマリデータと非アノマリデータに分類する。 The anomaly detection unit 230 acquires monitoring data from the data storage unit 220, and executes anomaly event detection processing for the acquired monitoring data. For example, the anomaly detection unit 230 generates a normal distribution of monitoring data (specifically, the data value of the monitoring data) for each combination of the host machine 110 and the monitored resource based on the monitoring data acquired from the data storage unit 220. Then, the monitoring data separated by 2σ or more from the average value μ of the normal distribution is detected as an anomaly event. The anomaly detection unit 230 registers the identification flag "1" as an analysis result in the field of the anomaly analysis result included in the first data management table T1 for the monitoring data in which the anomaly event is detected. The anomaly detection unit 230 registers the detection date and time in the field of the anomaly detection date and time included in the first data management table T1 for the monitoring data in which the anomaly event is detected. On the other hand, the anomaly detection unit 230 registers the identification flag "0" in the anomaly analysis result included in the first data management table T1 as the analysis result for the monitoring data in which the anomaly event is not detected. That is, the anomaly detection unit 230 classifies the monitoring data into anomaly data and non-anomaly data.

表示順序制御部２４０はアノマリ情報の表示順序を制御する。より詳しくは、表示順序制御部２４０はアノマリ検知部２３０が監視データをアノマリデータと非アノマリデータに分類すると、データ記憶部２２０からアノマリデータとＩＮＣチケットと変更要求チケットとを取得する。特に、表示順序制御部２４０はイベント監視サーバ１７０が発行したＩＮＣチケットと利用者端末が発行したＩＮＣチケットの両方を取得する。表示順序制御部２４０はアノマリデータとＩＮＣチケットと変更要求チケットとを取得すると、ＣＭＤＢ１４０が記憶する構成情報を参照し、アノマリデータとＩＮＣチケットと変更要求チケットと構成情報とに基づいて、種々の処理を実行して、アノマリ情報の表示順序を決定する。尚、表示順序制御部２４０が実行する処理の詳細については後述する。表示順序制御部２４０は表示順序を決定すると、表示順序が登録された表示対象データを生成してデータ記憶部２２０に格納する。表示対象データは第５データ管理テーブルＴ５によって管理される。 The display order control unit 240 controls the display order of the anomaly information. More specifically, when the anomaly detection unit 230 classifies the monitoring data into anomaly data and non-anomaly data, the display order control unit 240 acquires the anomaly data, the INC ticket, and the change request ticket from the data storage unit 220. In particular, the display order control unit 240 acquires both the INC ticket issued by the event monitoring server 170 and the INC ticket issued by the user terminal. When the display order control unit 240 acquires the anomaly data, the INC ticket, and the change request ticket, the display order control unit 240 refers to the configuration information stored in the CMDB 140, and performs various processes based on the anomaly data, the INC ticket, the change request ticket, and the configuration information. To determine the display order of the anomaly information. The details of the processing executed by the display order control unit 240 will be described later. When the display order control unit 240 determines the display order, the display order control unit 240 generates the display target data in which the display order is registered and stores it in the data storage unit 220. The display target data is managed by the fifth data management table T5.

ここで、図７に示すように、第５データ管理テーブルＴ５はアノマリＩＤ、検知日時、経過時間、ホストマシン、監視対象リソースといった複数のフィールドを有する。アノマリＩＤのフィールドには表示対象データを識別する識別子が登録される。アノマリＩＤは表示対象データが生成される度に表示順序制御部２４０によって付与される。検知日時のフィールドにはアノマリデータのアノマリ検知日時が登録される。経過時間のフィールドにはアノマリ検知日時から現在日時まで経過した経過時間が登録される。すなわち、経過時間はアノマリ検知日時と現在日時との差を表している。ホストマシンのフィールドにはアノマリデータのホストマシン１１０の名称又は識別子が登録される。監視対象リソースのフィールドにはアノマリデータの監視対象リソースの名称が登録される。 Here, as shown in FIG. 7, the fifth data management table T5 has a plurality of fields such as anomaly ID, detection date and time, elapsed time, host machine, and monitored resource. An identifier that identifies the data to be displayed is registered in the field of the anomaly ID. The anomaly ID is assigned by the display order control unit 240 each time the display target data is generated. The anomaly detection date and time of the anomaly data is registered in the detection date and time field. In the elapsed time field, the elapsed time elapsed from the anomaly detection date and time to the current date and time is registered. That is, the elapsed time represents the difference between the anomaly detection date and time and the current date and time. The name or identifier of the host machine 110 of the anomaly data is registered in the field of the host machine. The name of the monitored resource of the anomaly data is registered in the field of the monitored resource.

また、図７に示すように、第５データ管理テーブルＴ５はＩＮＣチケット数（サーバ発行）、変更要求チケット数、ＩＮＣチケット数（ユーザ発行）といった複数のフィールドを有する。ＩＮＣチケット数（サーバ発行）のフィールドには、イベント監視サーバ１７０が発行したＩＮＣチケットの数が登録される。変更要求チケット数のフィールドには、運用端末３００が発行した変更要求チケットの数が登録される。ＩＮＣチケット数（ユーザ発行）のフィールドには、利用者端末が発行したＩＮＣチケットの数が登録される。尚、イベント監視サーバ１７０及び利用者端末がそれぞれ発行した各ＩＮＣチケットには、いずれもホストマシン１１０の名称が含まれているため、ホストマシン１１０毎に各ＩＮＣチケットの数を特定することができる。また、変更要求チケットにもホストマシン１１０の名称が含まれているため、ホストマシン１１０毎に変更要求チケットの数を特定することができる。 Further, as shown in FIG. 7, the fifth data management table T5 has a plurality of fields such as the number of INC tickets (issued by the server), the number of change request tickets, and the number of INC tickets (issued by the user). The number of INC tickets issued by the event monitoring server 170 is registered in the field of the number of INC tickets (issued by the server). In the field of the number of change request tickets, the number of change request tickets issued by the operation terminal 300 is registered. The number of INC tickets issued by the user terminal is registered in the field of the number of INC tickets (issued by the user). Since each INC ticket issued by the event monitoring server 170 and the user terminal includes the name of the host machine 110, the number of each INC ticket can be specified for each host machine 110. .. Further, since the name of the host machine 110 is included in the change request ticket, the number of change request tickets can be specified for each host machine 110.

さらに、図７に示すように、第５データ管理テーブルＴ５は連続検知数、関連サービス数、表示順序といった複数のフィールドを有する。連続検知数のフィールドには、監視対象リソースについて発生したアノマリ事象を連続して検知した回数が登録される。例えば、監視対象リソースについて発生したアノマリ事象が１分間隔で４分間に渡って連続して検知された場合、連続検知数のフィールドには数値「４」が登録される。尚、連続検知数は検知頻度の一例であって、単位時間あたりにアノマリ事象を検知した回数を検知頻度として利用してもよい。関連サービス数のフィールドには、そのホストマシン１１０が実行するサービスと関連する他のサービスの数が登録される。例えば、構成情報（図２参照）に基づけば、「ホスト１」の名称を有するホストマシン１１０は、サービス１及びサービス２の２つを実行し、「ホスト２」の名称を有するホストマシン１１０がサービス３～サービス５の３つを実行する。このため、図７に示すように、「ホスト１」の名称を有するホストマシン１１０に対応する関連サービス数のフィールドには、数値「３」が登録される。表示順序のフィールドには表示順序制御部２４０がアノマリデータとＩＮＣチケットと変更要求チケットと構成情報とに基づいて決定した表示順序が登録される。尚、本実施形態では、１２３件の表示対象データが第５データ管理テーブルＴ５によって管理されている。 Further, as shown in FIG. 7, the fifth data management table T5 has a plurality of fields such as the number of continuous detections, the number of related services, and the display order. In the field of the number of continuous detections, the number of times that the anomaly event that occurred for the monitored resource is continuously detected is registered. For example, when the anomaly event generated for the monitored resource is continuously detected for 4 minutes at 1-minute intervals, the numerical value "4" is registered in the field of the number of continuous detections. The number of continuous detections is an example of the detection frequency, and the number of times anomaly events are detected per unit time may be used as the detection frequency. In the field of the number of related services, the number of other services related to the service executed by the host machine 110 is registered. For example, based on the configuration information (see FIG. 2), the host machine 110 having the name "host 1" executes the service 1 and the service 2, and the host machine 110 having the name "host 2" executes the service 1. The three services 3 to 5 are executed. Therefore, as shown in FIG. 7, the numerical value "3" is registered in the field of the number of related services corresponding to the host machine 110 having the name "host 1". The display order determined by the display order control unit 240 based on the anomaly data, the INC ticket, the change request ticket, and the configuration information is registered in the display order field. In this embodiment, 123 display target data are managed by the fifth data management table T5.

データ可視化部２５０は運用端末３００からアノマリ一覧画面の表示を要求する画面表示要求を受け付けると、データ記憶部２２０から表示対象データを取得し、取得した表示対象データを可視化する。例えば、データ可視化部２５０は取得した表示対象データを表示順序に基づいて並び替え、並び替えた表示対象データから表示順序のフィールドを除外したアノマリ情報を生成する。データ可視化部２５０はアノマリ情報を生成すると、生成したアノマリ情報を含むアノマリ一覧画面を運用端末３００の表示部３２０に表示する。 When the data visualization unit 250 receives a screen display request for displaying the anomaly list screen from the operation terminal 300, the data visualization unit 250 acquires display target data from the data storage unit 220 and visualizes the acquired display target data. For example, the data visualization unit 250 sorts the acquired display target data based on the display order, and generates anomaly information excluding the display order field from the sorted display target data. When the data visualization unit 250 generates the anomaly information, the data visualization unit 250 displays the anomaly list screen including the generated anomaly information on the display unit 320 of the operation terminal 300.

続いて、分析サーバ２００の動作について説明する。 Subsequently, the operation of the analysis server 200 will be described.

図８はデータ収集部２１０が実行する第１処理の一例を示すフローチャートである。データ収集部２１０は第１処理を所定の時間毎に定期的に実行する。まず、データ収集部２１０は性能監視サーバ１５０から監視データを収集し（ステップＳ１０１）、収集した監視データをデータ記憶部２２０に格納する（ステップＳ１０２）。ステップＳ１０２の処理が完了すると、データ収集部２１０は管理サーバ１９０から各種のチケットを収集し（ステップＳ１０３）、収集したチケットをデータ記憶部２２０に格納する（ステップＳ１０４）。すなわち、データ収集部２１０は発行元が異なる各ＩＮＣチケットと変更要求チケットとを収集し、収集したＩＮＣチケット及び変更要求チケットをデータ記憶部２２０に格納する。尚、データ収集部２１０はチケットを収集して格納してから、監視データを収集して格納してもよい。ステップＳ１０４の処理が完了すると、データ収集部２１０は処理を終了する。 FIG. 8 is a flowchart showing an example of the first process executed by the data collection unit 210. The data collection unit 210 periodically executes the first process at predetermined time intervals. First, the data collection unit 210 collects monitoring data from the performance monitoring server 150 (step S101), and stores the collected monitoring data in the data storage unit 220 (step S102). When the process of step S102 is completed, the data collection unit 210 collects various tickets from the management server 190 (step S103), and stores the collected tickets in the data storage unit 220 (step S104). That is, the data collection unit 210 collects each INC ticket and a change request ticket from different issuers, and stores the collected INC ticket and the change request ticket in the data storage unit 220. The data collection unit 210 may collect and store the ticket and then collect and store the monitoring data. When the process of step S104 is completed, the data collection unit 210 ends the process.

図９はアノマリ検知部２３０及び表示順序制御部２４０が実行する第２処理の一例を示すフローチャートである。図１０は表示優先度の一例を説明する図である。アノマリ検知部２３０及び表示順序制御部２４０は第２処理を定期的に実行する。まず、図９に示すように、アノマリ検知部２３０は設定時間が到来したか否かを判断する（ステップＳ２０１）。設定時間は上述した所定の時間より長い時間（例えば１時間といった数時間）である。例えばデータ収集部２１０は１０分毎に監視データと各種のチケットを収集してデータ記憶部２２０に格納し、アノマリ検知部２３０は１時間毎にステップＳ２０１の処理以後の処理を実行するか否かを判断する。 FIG. 9 is a flowchart showing an example of the second process executed by the anomaly detection unit 230 and the display order control unit 240. FIG. 10 is a diagram illustrating an example of display priority. The anomaly detection unit 230 and the display order control unit 240 periodically execute the second process. First, as shown in FIG. 9, the anomaly detection unit 230 determines whether or not the set time has arrived (step S201). The set time is a time longer than the above-mentioned predetermined time (for example, several hours such as one hour). For example, the data collection unit 210 collects monitoring data and various tickets every 10 minutes and stores them in the data storage unit 220, and the anomaly detection unit 230 determines whether or not to execute the processing after the processing of step S201 every hour. To judge.

例えば、アノマリ検知部２３０がステップＳ２０１の処理以後の処理を前回実行してから設定時間が到来していない場合（ステップＳ２０１：ＮＯ）、後続のステップＳ２０２からＳ２１３の処理をスキップする。これにより、データ収集部２１０は監視データ及び各種のチケットを収集してから１０分経過する度に、監視データ及び各種のチケットを収集してデータ記憶部２２０に格納する。これにより、データ記憶部２２０には時系列の監視データと各種のチケットが蓄積される。 For example, if the set time has not arrived since the anomaly detection unit 230 last executed the processing after the processing of step S201 (step S201: NO), the processing of subsequent steps S202 to S213 is skipped. As a result, the data collection unit 210 collects the monitoring data and various tickets and stores them in the data storage unit 220 every 10 minutes after collecting the monitoring data and various tickets. As a result, time-series monitoring data and various tickets are stored in the data storage unit 220.

一方、アノマリ検知部２３０がステップＳ２０１の処理以後の処理を前回実行してから設定時間が到来している場合（ステップＳ２０１：ＹＥＳ）、アノマリ検知部２３０はデータ記憶部２２０から監視データを取得する（ステップＳ２０２）。ステップＳ２０２の処理が完了すると、アノマリ検知部２３０はアノマリ事象検知処理を実行し（ステップＳ２０３）、分析結果と検知日時を第１データ管理テーブルＴ１に登録する（ステップＳ２０４）。これにより、監視データはアノマリデータと非アノマリデータに分類される。 On the other hand, when the set time has arrived since the anomaly detection unit 230 last executed the processing after the processing of step S201 (step S201: YES), the anomaly detection unit 230 acquires monitoring data from the data storage unit 220. (Step S202). When the process of step S202 is completed, the anomaly detection unit 230 executes the anomaly event detection process (step S203), and registers the analysis result and the detection date and time in the first data management table T1 (step S204). As a result, the monitoring data is classified into anomaly data and non-anomaly data.

ステップＳ２０４の処理が完了すると、次いで、表示順序制御部２４０はデータ記憶部２２０からアノマリデータを取得し（ステップＳ２０５）、次いで、各種のチケットを取得する（ステップＳ２０６）。ステップＳ２０４の処理が完了すると、次いで、表示順序制御部２４０はＣＭＤＢ１４０が記憶する構成情報を参照し（ステップＳ２０７）、関連サービス数を特定する（ステップＳ２０８）。ステップＳ２０８の処理が完了すると、次いで、表示順序制御部２４０は連続検知数を算出し（ステップＳ２０９）、チケット数を特定する（ステップＳ２１０）。特に、表示順序制御部２４０はホストマシン１１０毎のチケット数をチケットの種別毎に特定する。これにより、イベント監視サーバ１７０が特定のホストマシン１１０に対して発行したＩＮＣチケットの数と、利用者端末が特定のホストマシン１１０に対して発行したＩＮＣチケットの数と、運用端末３００が特定のホストマシン１１０に対して発行した変更要求チケットの数が特定される。 When the process of step S204 is completed, the display order control unit 240 then acquires anomaly data from the data storage unit 220 (step S205), and then acquires various tickets (step S206). When the process of step S204 is completed, the display order control unit 240 then refers to the configuration information stored in the CMDB 140 (step S207) and specifies the number of related services (step S208). When the process of step S208 is completed, the display order control unit 240 then calculates the number of continuous detections (step S209) and specifies the number of tickets (step S210). In particular, the display order control unit 240 specifies the number of tickets for each host machine 110 for each type of ticket. As a result, the number of INC tickets issued by the event monitoring server 170 to the specific host machine 110, the number of INC tickets issued by the user terminal to the specific host machine 110, and the operation terminal 300 are specified. The number of change request tickets issued to the host machine 110 is specified.

ステップＳ２１０の処理が完了すると、表示順序制御部２４０は経過時間を計測し（ステップＳ２１１）、次いで、表示順序を決定する（ステップＳ２１２）。より詳しくは、表示順序制御部２４０は、関連サービス数、連続検知数、イベント監視サーバ１７０が発行したＩＮＣチケットのチケット数、利用者端末が発行したＩＮＣチケットのチケット数、経過時間、及び予め定めた表示優先度に基づいて、アノマリ情報の表示順序を決定する。表示順序制御部２４０は、表示順序を決定する際に、運用端末３００が発行した変更要求チケットのチケット数を利用してもよい。 When the process of step S210 is completed, the display order control unit 240 measures the elapsed time (step S211), and then determines the display order (step S212). More specifically, the display order control unit 240 determines the number of related services, the number of continuous detections, the number of INC ticket tickets issued by the event monitoring server 170, the number of INC ticket tickets issued by the user terminal, the elapsed time, and predetermined time. The display order of the anomaly information is determined based on the display priority. The display order control unit 240 may use the number of change request tickets issued by the operation terminal 300 when determining the display order.

ここで、図１０に示すように、最も高い表示優先度は関連サービス数に定められている。そして、連続検知数、利用者端末が発行したＩＮＣチケットのチケット数、イベント監視サーバ１７０が発行したＩＮＣチケットのチケット数の順に表示優先度が定められ、経過時間に最も低い表示優先度が定められている。これらの優先度は例えば問題が発生した際の影響の大きさなどに従って定めればよい。したがって、例えば関連サービス数に数値「５」が登録され、連続検知数に数値「１」が登録された表示対象データと、関連サービス数に数値「３」が登録され、連続検知数に数値「９」が登録された表示対象データとが存在する場合、前者の表示対象データに応じたアノマリ情報が後者の表示対象データに応じたアノマリ情報より上位に表示される。 Here, as shown in FIG. 10, the highest display priority is defined in the number of related services. Then, the display priority is determined in the order of the number of continuous detections, the number of INC ticket tickets issued by the user terminal, and the number of INC ticket tickets issued by the event monitoring server 170, and the lowest display priority is determined for the elapsed time. ing. These priorities may be determined according to, for example, the magnitude of the impact when a problem occurs. Therefore, for example, the display target data in which the numerical value "5" is registered in the number of related services and the numerical value "1" is registered in the number of continuous detections, and the numerical value "3" is registered in the number of related services, and the numerical value "3" is registered in the number of continuous detections. When the display target data in which "9" is registered exists, the anomaly information corresponding to the display target data of the former is displayed higher than the anomaly information corresponding to the display target data of the latter.

また、関連サービス数、連続検知数、利用者端末が発行したＩＮＣチケットのチケット数、及びイベント監視サーバ１７０が発行したＩＮＣチケットのチケット数は、いずれも、その数が多いほど、その表示対象データに応じたアノマリ情報は上位に表示される。逆に、関連サービス数、連続検知数、利用者端末が発行したＩＮＣチケットのチケット数、及びイベント監視サーバ１７０が発行したＩＮＣチケットのチケット数は、いずれも、その数が少ないほど、その表示対象データに応じたアノマリ情報は下位に表示される。例えば、関連サービス数にいずれも数値「３」が登録された２つの表示対象データが存在する場合、一方の表示対象データに連続検知数に数値「９」が登録され、他方の表示対象データの連続検知数に数値「７」が登録されている場合、前者の表示対象データに応じたアノマリ情報が後者の表示対象データに応じたアノマリ情報より上位に表示される。 Further, the number of related services, the number of continuous detections, the number of INC ticket tickets issued by the user terminal, and the number of INC ticket tickets issued by the event monitoring server 170 are all the data to be displayed as the number increases. Anomaly information according to is displayed at the top. Conversely, the smaller the number of related services, the number of continuous detections, the number of INC ticket tickets issued by the user terminal, and the number of INC ticket tickets issued by the event monitoring server 170, the more the display target. Anomaly information according to the data is displayed at the bottom. For example, when there are two display target data in which the numerical value "3" is registered in the number of related services, the numerical value "9" is registered in the continuous detection number in one display target data and the other display target data. When the numerical value "7" is registered in the number of continuous detections, the anomaly information corresponding to the display target data of the former is displayed higher than the anomaly information corresponding to the display target data of the latter.

ただし、経過時間については、経過時間が短いほど、アノマリ情報が上位に表示される。アノマリ事象が検知されてから現在時刻までの時間が短いアノマリ事象ほど運用者は急いで対応することが求められる。これはアノマリ事象が検知されてから現在時刻までの時間が長いアノマリ事象については問題が発生していないと想定されるからである。 However, with regard to the elapsed time, the shorter the elapsed time, the higher the anomaly information is displayed. The shorter the time from the detection of the anomaly event to the current time, the sooner the operator is required to respond. This is because it is assumed that no problem has occurred for anomaly events that take a long time from the detection of the anomaly event to the current time.

図９に戻り、ステップＳ２１２の処理が完了すると、表示順序制御部２４０は決定した表示順序を登録した表示対象データを生成し、生成した表示対象データをデータ記憶部２２０に格納する（ステップＳ２１３）。これにより、データ記憶部２２０は第５データ管理テーブルＴ５により管理された表示対象データを記憶する（図７参照）。ステップＳ２１３の処理が完了すると、アノマリ検知部２３０及び表示順序制御部２４０は処理を終了する。アノマリ検知部２３０及び表示順序制御部２４０は以上説明した処理を動的に繰り返し実行する。特に、表示順序制御部２４０は関連サービス数や連続検知数、各種のＩＮＣチケットの数などに基づいて表示順序を定期的に決定するため、表示順序は動的に変化する。 Returning to FIG. 9, when the process of step S212 is completed, the display order control unit 240 generates display target data in which the determined display order is registered, and stores the generated display target data in the data storage unit 220 (step S213). .. As a result, the data storage unit 220 stores the display target data managed by the fifth data management table T5 (see FIG. 7). When the process of step S213 is completed, the anomaly detection unit 230 and the display order control unit 240 end the process. The anomaly detection unit 230 and the display order control unit 240 dynamically and repeatedly execute the processes described above. In particular, since the display order control unit 240 periodically determines the display order based on the number of related services, the number of continuous detections, the number of various INC tickets, and the like, the display order dynamically changes.

図１１はデータ可視化部２５０が実行する第３処理の一例を示すフローチャートである。図１２はアノマリ一覧画面の一例である。図１３はアノマリ詳細画面の一例である。尚、アノマリ一覧画面及びアノマリ詳細画面の画面レイアウトについては適宜変更してもよい。 FIG. 11 is a flowchart showing an example of the third process executed by the data visualization unit 250. FIG. 12 is an example of the anomaly list screen. FIG. 13 is an example of an anomaly detail screen. The screen layouts of the anomaly list screen and the anomaly detail screen may be changed as appropriate.

まず、図１１に示すように、データ可視化部２５０は運用端末３００から画面表示要求を受け付けるまで待機する（ステップＳ３０１：ＮＯ）。データ可視化部２５０は画面表示要求を受け付けると（ステップＳ３０１：ＹＥＳ）、アノマリ一覧画面を表示する（ステップＳ３０２）。より詳しくは、データ可視化部２５０はデータ記憶部２２０が記憶する表示対象データの中からアノマリ検知日時と現在日時との差である経過時間が予め定めた特定の時間（例えば２４時間）以内の表示対象データを抽出し、抽出した表示対象データに基づいてアノマリ一覧画面を生成して表示する。 First, as shown in FIG. 11, the data visualization unit 250 waits until the screen display request is received from the operation terminal 300 (step S301: NO). When the data visualization unit 250 receives the screen display request (step S301: YES), the data visualization unit 250 displays the anomaly list screen (step S302). More specifically, the data visualization unit 250 displays the elapsed time, which is the difference between the anomaly detection date and time and the current date and time, within a predetermined specific time (for example, 24 hours) from the display target data stored in the data storage unit 220. The target data is extracted, and the anomaly list screen is generated and displayed based on the extracted display target data.

アノマリ一覧画面は、図１２に示すように、アノマリ事象の検知数と、優先順位に並んだ複数のアノマリ情報の一覧を含んでいる。図１２では、上位数件のアノマリ情報が示されているが、縦方向のスクロールバーを設けることにより、下位のアノマリ情報が表示されるようにしてもよい。また、アノマリ一覧画面は、経過時間の割合を表す円グラフと経過時間の内訳、ホストマシン１１０の割合を表す円グラフとホストマシン１１０の名称の内訳、監視対象リソースの割合を表す円グラフと監視対象リソースの名称の内訳を含んでいる。特に、ホストマシン１１０の名称と監視対象リソースの名称はアノマリ事象が検知された数が多い順に表示されている。ホストマシン１１０の名称及び監視対象リソースの名称が指定されると、データ可視化部２５０は表示されたアノマリ情報の一覧から指定された名称を含むアノマリ情報に絞り込んで表示する。このように、大量のアノマリ事象が検知されても、これらのアノマリ事象に関するアノマリ情報が優先順位に並んで表示されるため、運用者はアノマリ事象に対する対応の優先順位の判断を誤る可能性が減少する。この結果、アノマリ事象に対する対応の遅延により問題が発生することを抑制することができる。尚、アノマリ情報に関連する情報も併せて表示されるため、優先順位の判断を誤る可能性がさらに減少する。 As shown in FIG. 12, the anomaly list screen includes the number of detected anomaly events and a list of a plurality of anomaly information arranged in order of priority. Although the upper-order anomaly information is shown in FIG. 12, the lower-level anomaly information may be displayed by providing a vertical scroll bar. In addition, the anomaly list screen has a pie chart showing the percentage of elapsed time and a breakdown of elapsed time, a pie chart showing the percentage of host machine 110 and a breakdown of the names of host machines 110, and a pie chart and monitoring showing the percentage of monitored resources. Contains a breakdown of the names of the target resources. In particular, the name of the host machine 110 and the name of the monitored resource are displayed in descending order of the number of detected anomaly events. When the name of the host machine 110 and the name of the monitored resource are specified, the data visualization unit 250 narrows down and displays the anomaly information including the specified name from the displayed anomaly information list. In this way, even if a large number of anomaly events are detected, the anomaly information related to these anomaly events is displayed in order of priority, so that the operator is less likely to make a mistake in determining the priority of response to the anomaly event. do. As a result, it is possible to prevent problems from occurring due to delays in responding to anomaly events. Since the information related to the anomaly information is also displayed, the possibility of erroneous determination of the priority is further reduced.

図１１に戻り、ステップＳ３０２の処理が完了すると、次いで、データ可視化部２５０は運用端末３００からアノマリ情報の指定を受け付けるまで待機する（ステップＳ３０３：ＮＯ）。データ可視化部２５０はアノマリ情報の指定を受け付けると（ステップＳ３０３：ＹＥＳ）、アノマリ詳細画面を表示する（ステップＳ３０４）。より詳しくは、データ可視化部２５０は指定されたアノマリ情報に関するアノマリ詳細画面を表示する。 Returning to FIG. 11, when the process of step S302 is completed, the data visualization unit 250 then waits until the operation terminal 300 accepts the designation of the anomaly information (step S303: NO). When the data visualization unit 250 receives the designation of the anomaly information (step S303: YES), the anomaly detail screen is displayed (step S304). More specifically, the data visualization unit 250 displays an anomaly detail screen regarding the specified anomaly information.

例えば、図１２に示すように、アノマリ詳細画面に表示された複数のアノマリ情報の中からポインタＰｔによりいずれか１つのアノマリ情報を指定して指示（例えばクリックやタップなど）すると、データ可視化部２５０は、アノマリ情報の指定を受け付ける。本実施形態ではアノマリＩＤ「８８」のアノマリ情報がポインタＰｔにより指定されている。データ可視化部２５０は、アノマリ情報の指定を受け付けると、図１３に示すように、指定されたアノマリ情報に関するアノマリ詳細画面を表示する。 For example, as shown in FIG. 12, when any one of the anomaly information is specified and instructed (for example, click or tap) from a plurality of anomaly information displayed on the anomaly detail screen by the pointer Pt, the data visualization unit 250 Accepts the designation of anomaly information. In this embodiment, the anomaly information of the anomaly ID "88" is designated by the pointer Pt. When the data visualization unit 250 receives the designation of the anomaly information, as shown in FIG. 13, the data visualization unit 250 displays the anomaly detail screen regarding the designated anomaly information.

特に、データ可視化部２５０は指定されたアノマリ情報に含まれるホストマシン１１０の名称及び監視対象リソースの名称に基づいて、データ記憶部２２０からこれらの名称を含む監視データを抽出してグラフ化し、グラフ化した監視データを表すグラフＧＲ１をアノマリ詳細画面内に表示する。抽出した監視データにはアノマリデータと非アノマリデータが混在するため、データ可視化部２５０はアノマリデータをグラフ化した部分に対しマークＭ１を付与して強調して表示する。 In particular, the data visualization unit 250 extracts monitoring data including these names from the data storage unit 220 based on the name of the host machine 110 and the name of the monitored resource included in the designated anomaly information, graphs the data, and graphs the data. The graph GR1 representing the converted monitoring data is displayed in the anomaly detail screen. Since the extracted monitoring data contains both anomaly data and non-anomaly data, the data visualization unit 250 assigns a mark M1 to the graphed portion of the anomaly data and highlights it.

また、データ可視化部２５０は指定されたアノマリ情報に含まれるホストマシン１１０の名称に基づいて、ＣＭＤＢ１４０からその名称と関連付けられた関連サービス名全てを抽出してアノマリ詳細画面内に表示する。さらに、データ可視化部２５０は指定されたアノマリ情報に含まれるホストマシン１１０の名称に基づいて、データ記憶部２２０からその名称を含むＩＮＣチケット及び変更要求チケットを抽出してアノマリ詳細画面内に表示する。特に、尚、データ可視化部２５０はアノマリ事象の検知日時から前後１時間以内の発行日時が登録されたＩＮＣチケット及び変更要求チケットを抽出してアノマリ詳細画面内に表示する。尚、図１３では、イベント監視サーバ１７０が発行したＩＮＣチケットが示されているが、利用者端末が発行したＩＮＣチケットであってもよい。 Further, the data visualization unit 250 extracts all the related service names associated with the name from the CMDB 140 based on the name of the host machine 110 included in the designated anomaly information and displays it in the anomaly detail screen. Further, the data visualization unit 250 extracts an INC ticket and a change request ticket including the name from the data storage unit 220 based on the name of the host machine 110 included in the designated anomaly information and displays it in the anomaly detail screen. .. In particular, the data visualization unit 250 extracts the INC ticket and the change request ticket in which the issue date and time within one hour before and after the detection date and time of the anomaly event are registered and displays them in the anomaly detail screen. Although the INC ticket issued by the event monitoring server 170 is shown in FIG. 13, it may be an INC ticket issued by the user terminal.

以上、第１実施形態によれば、分析サーバ２００はアノマリ検知部２３０、表示順序制御部２４０、及びデータ可視化部２５０を含んでいる。アノマリ検知部２３０は監視対象システム１００に含まれる複数のホストマシン１１０の監視データに基づき、複数のホストマシン１１０のいずれかで発生したアノマリ事象を検知するアノマリ事象検知処理を実行する。複数のアノマリ事象が検知された場合、表示順序制御部２４０は監視対象システム１００で実行される複数のサービス間の関係を示す構成情報を記憶するＣＭＤＢ１４０を参照する。そして、表示順序制御部２４０は検知された複数のアノマリ事象それぞれの発生元のホストマシン１１０ごとに、該ホストマシン１１０で実行されるサービスに対して特定の関係性を有する他のサービスの数を関連サービス数として特定する。データ可視化部２５０は検知された複数のアノマリ事象に関するアノマリ情報を、特定した関連サービス数に応じた表示順序で運用端末３００の表示部３２０に表示する。これにより、アノマリ事象対応の遅延による問題の発生を抑制することができる。 As described above, according to the first embodiment, the analysis server 200 includes an anomaly detection unit 230, a display order control unit 240, and a data visualization unit 250. The anomaly detection unit 230 executes anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of host machines 110 based on the monitoring data of the plurality of host machines 110 included in the monitored system 100. When a plurality of anomaly events are detected, the display order control unit 240 refers to the CMDB 140 that stores the configuration information indicating the relationship between the plurality of services executed by the monitored system 100. Then, the display order control unit 240 determines the number of other services having a specific relationship to the service executed by the host machine 110 for each host machine 110 from which each of the detected plurality of anomaly events occurs. Specify as the number of related services. The data visualization unit 250 displays the anomaly information about the detected plurality of anomaly events on the display unit 320 of the operation terminal 300 in the display order according to the specified number of related services. As a result, it is possible to suppress the occurrence of problems due to the delay in responding to anomaly events.

また、アノマリ検知部２３０は監視対象システム１００に含まれる複数の監視対象リソースの監視データに基づき、複数の監視対象リソースのいずれかについて発生したアノマリ事象を検知するアノマリ事象検知処理を実行する。複数のアノマリ事象が検知された場合、表示順序制御部２４０は検知された複数のアノマリ事象それぞれの発生元の監視対象リソースごとに、該監視対象リソースについて発生したアノマリ事象の連続検知数を算出する。データ可視化部２５０は検知された複数のアノマリ事象に関するアノマリ情報を、算出した連続検知数に応じた表示順序で運用端末３００の表示部３２０に表示する。このような処理によっても同様に、アノマリ事象対応の遅延による問題の発生を抑制することができる。 Further, the anomaly detection unit 230 executes anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of monitored resources based on the monitoring data of the plurality of monitored resources included in the monitored system 100. When a plurality of anomaly events are detected, the display order control unit 240 calculates the number of consecutive detections of anomaly events that have occurred for the monitored resource for each monitored resource that is the source of each of the detected anomaly events. .. The data visualization unit 250 displays anomaly information about a plurality of detected anomaly events on the display unit 320 of the operation terminal 300 in a display order according to the calculated number of continuous detections. Similarly, such processing can suppress the occurrence of problems due to the delay in responding to anomaly events.

（第２実施形態）
続いて、図１４から図１６を参照して、本件の第２実施形態について説明する。 (Second Embodiment)
Subsequently, the second embodiment of the present case will be described with reference to FIGS. 14 to 16.

図１４及び図１５はアノマリ一覧画面の他の一例である。図１６はアノマリ詳細画面の他の一例である。第２実施形態では、表示順序制御部２４０は、複数のアノマリ事象が検知された場合、検知された複数のアノマリ事象それぞれの発生元のホストマシン１１０を特定する。一方、データ可視化部２５０は、表示順序制御部２４０が特定したホストマシン１１０が共通し、かつ、特定したホストマシン１１０で発生したアノマリ事象の検知日時の差が閾値時間以内であると判断した場合、検知された複数のアノマリ事象に関するアノマリ情報を集約した代表のアノマリ情報を表示する。 14 and 15 are other examples of the anomaly list screen. FIG. 16 is another example of the anomaly detail screen. In the second embodiment, when a plurality of anomaly events are detected, the display order control unit 240 identifies the host machine 110 from which each of the detected plurality of anomaly events occurs. On the other hand, when the data visualization unit 250 determines that the host machine 110 specified by the display order control unit 240 is common and the difference in the detection date and time of the anomaly event generated in the specified host machine 110 is within the threshold time. , Displays the representative anomaly information that aggregates the anomaly information about multiple detected anomaly events.

これにより、図１４に示すように、「ホスト５」の名称を有する２つの共通するホストマシン１１０のアノマリ情報について、検知日時の差が閾値時間（例えば５分）以内である場合には、データ可視化部２５０は、アノマリ情報を集約して表示することが可能であることを提示する集約マークＧ１を表示する。そして、ポインタＰｔにより集約マークＧ１が指定されて指示されると、データ可視化部２５０は、その指示に基づいて、図１５に示すように、優先順位が高い方のアノマリ情報に低い方のアノマリ情報を集約し、優先順位が高い方のアノマリ情報を代表のアノマリ情報として表示する。 As a result, as shown in FIG. 14, when the difference between the detection dates and times is within the threshold time (for example, 5 minutes) for the anomaly information of the two common host machines 110 having the name of "host 5", the data is obtained. The visualization unit 250 displays the aggregation mark G1 indicating that the anomaly information can be aggregated and displayed. Then, when the aggregation mark G1 is designated and instructed by the pointer Pt, the data visualization unit 250, based on the instruction, attaches the higher priority anomaly information to the lower anomaly information as shown in FIG. Are aggregated, and the anomaly information with the higher priority is displayed as the representative anomaly information.

そして、データ可視化部２５０は代表のアノマリ情報を表示するとともに、その代表のアノマリ情報に集約マークＧ２を付与して表示する。ポインタＰｔにより集約マークＧ２が指定されて指示されると、データ可視化部２５０は、その指示に基づいて、図１４に示すように、代表のアノマリ情報を展開する。尚、アノマリ情報を集約した場合、データ可視化部２５０は、集約前の監視対象リソースの名称全てを表示するが、監視対象リソース以外の表示項目（例えば連続検知数や関連サービス数など）については、代表のアノマリ情報の表示項目を優先して表示する。 Then, the data visualization unit 250 displays the representative anomaly information, and adds the aggregation mark G2 to the representative anomaly information and displays it. When the aggregation mark G2 is designated and instructed by the pointer Pt, the data visualization unit 250 develops representative anomaly information as shown in FIG. 14 based on the instruction. When the anomaly information is aggregated, the data visualization unit 250 displays all the names of the monitored resources before aggregation, but for display items other than the monitored resources (for example, the number of continuous detections and the number of related services), Priority is given to the display items of the representative anomaly information.

一方、図１５に示すように、集約マークＧ２が表示されたアノマリ情報の表示領域から集約マークＧ２の表示領域を除いた残りの領域がポインタＰｔにより指定されて指示されると、データ可視化部２５０は、指定されたアノマリ情報の指定を受け付ける。データ可視化部２５０は、アノマリ情報の指定を受け付けると、図１６に示すように、指定されたアノマリ情報に関するアノマリ詳細画面を表示する。この場合、アノマリ詳細画面は集約前の監視対象リソースのそれぞれの監視データを抽出してグラフ化し、グラフ化したそれぞれの監視データを表すグラフＧＲ１，ＧＲ２をアノマリ詳細画面内に表示する。第１実施形態でも説明したように、抽出した監視データにはアノマリデータと非アノマリデータが混在するため、データ可視化部２５０はアノマリデータをグラフ化した部分に対しマークＭ１，Ｍ２を付与して強調して表示する。これにより、共通するホストマシン１１０に発生した複数のアノマリ事象に対し運用者はまとめて対応することができる。 On the other hand, as shown in FIG. 15, when the remaining area excluding the display area of the aggregate mark G2 from the display area of the anomaly information on which the aggregate mark G2 is displayed is designated and instructed by the pointer Pt, the data visualization unit 250 Accepts the specification of the specified anomaly information. When the data visualization unit 250 receives the designation of the anomaly information, as shown in FIG. 16, the data visualization unit 250 displays the anomaly detail screen regarding the designated anomaly information. In this case, the anomaly detail screen extracts and graphs each monitoring data of the monitored resource before aggregation, and displays graphs GR1 and GR2 representing the graphed monitoring data in the anomaly detail screen. As described in the first embodiment, since the extracted monitoring data contains anomaly data and non-anomaly data, the data visualization unit 250 assigns marks M1 and M2 to the graphed portion of the anomaly data to emphasize it. And display. As a result, the operator can collectively respond to a plurality of anomaly events that occur in the common host machine 110.

尚、データ可視化部２５０は、構成情報を記憶するＣＭＤＢ１４０を参照して、特定したホストマシン１１０それぞれで実行されるサービス間に特定の関係性があると判断した場合、特定の関係性があると判断されたサービスを実行するホストマシン１１０それぞれで発生したアノマリ事象に関するアノマリ情報を並べて表示してもよい。特定の関係性とは、例えば関連するサービスの有無、及び検知日時が閾値時間以内であるか否かなどがある。 When the data visualization unit 250 refers to the CMDB 140 that stores the configuration information and determines that there is a specific relationship between the services executed by each of the specified host machines 110, the data visualization unit 250 determines that there is a specific relationship. Anomaly information about anomaly events that have occurred in each of the host machines 110 that execute the determined service may be displayed side by side. Specific relationships include, for example, the presence or absence of related services and whether or not the detection date and time is within the threshold time.

これにより、例えば「ホスト５」の名称を有するホストマシン１１０が実行するサービスと「ホスト４」の名称を有するホストマシン１１０が実行するサービスが関連しており、かつ、検知日時の差が閾値時間（例えば５分）以内である場合には、データ可視化部２５０は特定の関係性があると判断し、図１４に示すように、「ホスト５」の名称を有するアノマリ情報と「ホスト４」の名称を有するアノマリ情報を優先度の高い方に並べて表示する。尚、アノマリ情報を並べて表示する場合、並べる前のアノマリ情報を残しておいてもよいし、表示対象から除外してもよい。本実施形態では、「ホスト４」の名称を有するアノマリ情報を残して表示している。これにより、サービスが関連する複数のアノマリ事象に対し運用者はまとめて対応することができる。 As a result, for example, the service executed by the host machine 110 having the name "host 5" and the service executed by the host machine 110 having the name "host 4" are related, and the difference between the detection dates and times is the threshold time. If it is within (for example, 5 minutes), the data visualization unit 250 determines that there is a specific relationship, and as shown in FIG. 14, the anomaly information having the name of "host 5" and the "host 4" Anomaly information with a name is displayed side by side in the order of priority. When displaying the anomaly information side by side, the anomaly information before the arrangement may be left or excluded from the display target. In the present embodiment, the anomaly information having the name of "host 4" is left and displayed. As a result, the operator can collectively respond to a plurality of anomaly events related to the service.

以上、本発明の好ましい実施形態について詳述したが、本発明に係る特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。例えば、上述した第１処理から第３処理の少なくとも１つは、管理サーバ１９０など、分析サーバ２００以外の各種サーバが実行してもよいし、運用端末３００が実行してもよい。 Although the preferred embodiments of the present invention have been described in detail above, the present invention is not limited to the specific embodiments of the present invention, and various modifications are made within the scope of the gist of the present invention described in the claims.・ Can be changed. For example, at least one of the first to third processes described above may be executed by various servers other than the analysis server 200, such as the management server 190, or may be executed by the operation terminal 300.

なお、以上の説明に関して更に以下の付記を開示する。
（付記１）システムに含まれる複数のマシンの監視データに基づき、前記複数のマシンのいずれかで発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、検知された前記複数のアノマリ事象それぞれの発生元のマシンごとに、該マシンで実行されるサービスに対して特定の関係性を有する他のサービスの数を特定し、検知された前記複数のアノマリ事象に関する情報を、特定した前記他のサービスの数に応じた順序で表示部に表示する、処理をコンピュータに実行させることを特徴とする順序制御プログラム。
（付記２）システムに含まれる複数の監視対象の監視データに基づき、前記複数の監視対象のいずれかについて発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元の監視対象ごとに、該監視対象について発生したアノマリ事象の検知頻度を算出し、検知された前記複数のアノマリ事象に関する情報を、算出した前記検知頻度に応じた順序で表示部に表示する、処理をコンピュータに実行させることを特徴とする順序制御プログラム。
（付記３）前記システムで実行される複数のサービスのサービスレベルの低下の原因となり得る事象に基づくインシデント情報の数を特定し、検知された前記複数のアノマリ事象に関する情報を、特定した前記インシデント情報の数にも応じた順序で表示する、処理を実行させることを特徴とする付記１又は２に記載の順序制御プログラム。
（付記４）前記アノマリ事象を検知してから経過した時間を計測し、検知された前記複数のアノマリ事象に関する情報を、計測した前記時間にも応じた順序で表示する、処理を実行させることを特徴とする付記１から３のいずれか１項に記載の順序制御プログラム。
（付記５）計測した前記時間が短いほど、検知された前記複数のアノマリ事象に関する情報を上位に表示する、処理を実行させることを特徴とする付記４に記載の順序制御プログラム。
（付記６）複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元のマシンを特定し、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、特定した前記マシンそれぞれで実行されるサービス間に特定の関係性があると判断した場合、前記特定の関係性があると判断されたサービスを実行するマシンそれぞれで発生したアノマリ事象に関する情報を並べて表示する、処理を実行させることを特徴とする付記１から５のいずれか１項に記載の順序制御プログラム。
（付記７）複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元のマシンを特定し、特定した前記マシンが共通し、かつ、特定した前記マシンで発生したアノマリ事象の検知時刻の差が閾値時間以内であると判断した場合、検知された前記複数のアノマリ事象に関する情報を集約した代表の情報を表示する、処理を実行させることを特徴とする付記１から６のいずれか１項に記載の順序制御プログラム。
（付記８）システムに含まれる複数のマシンの監視データに基づき、前記複数のマシンのいずれかで発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、検知された前記複数のアノマリ事象それぞれの発生元のマシンごとに、該マシンで実行されるサービスに対して特定の関係性を有する他のサービスの数を特定し、検知された前記複数のアノマリ事象に関する情報を、特定した前記他のサービスの数に応じた順序で表示部に表示する、処理をコンピュータが実行することを特徴とする順序制御方法。
（付記９）システムに含まれる複数の監視対象の監視データに基づき、前記複数の監視対象のいずれかについて発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元の監視対象ごとに、該監視対象について発生したアノマリ事象の検知頻度を算出し、検知された前記複数のアノマリ事象に関する情報を、算出した前記検知頻度に応じた順序で表示部に表示する、処理をコンピュータが実行することを特徴とする順序制御方法。
（付記１０）システムに含まれる複数のマシンの監視データに基づき、前記複数のマシンのいずれかで発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、検知された前記複数のアノマリ事象それぞれの発生元のマシンごとに、該マシンで実行されるサービスに対して特定の関係性を有する他のサービスの数を特定し、検知された前記複数のアノマリ事象に関する情報を、特定した前記他のサービスの数に応じた順序で表示部に表示する、処理を実行する処理部を有することを特徴とする情報処理装置。
（付記１１）システムに含まれる複数の監視対象の監視データに基づき、前記複数の監視対象のいずれかについて発生したアノマリ事象を検知するアノマリ事象検知処理を行い、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元の監視対象ごとに、該監視対象について発生したアノマリ事象の検知頻度を算出し、検知された前記複数のアノマリ事象に関する情報を、算出した前記検知頻度に応じた順序で表示部に表示する、処理を実行する処理部を有することを特徴とする情報処理装置。
（付記１２）前記処理部は、前記システムで実行される複数のサービスのサービスレベルの低下の原因となり得る事象に基づくインシデント情報の数を特定し、検知された前記複数のアノマリ事象に関する情報を、特定した前記インシデント情報の数にも応じた順序で表示する、ことを特徴とする付記１０又は１１に記載の情報処理装置。
（付記１３）前記処理部は、前記アノマリ事象を検知してから経過した時間を計測し、検知された前記複数のアノマリ事象に関する情報を、計測した前記時間にも応じた順序で表示する、ことを特徴とする付記１０から１２のいずれか１項に記載の情報処理装置。
（付記１４）前記処理部は、計測した前記時間が短いほど、検知された前記複数のアノマリ事象に関する情報を上位に表示する、ことを特徴とする付記１３に記載の情報処理装置。
（付記１５）前記処理部は、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元のマシンを特定し、前記システムで実行される複数のサービス間の関係を示す情報を記憶する記憶部を参照して、特定した前記マシンそれぞれで実行されるサービス間に特定の関係性があると判断した場合、前記特定の関係性があると判断されたサービスを実行するマシンそれぞれで発生したアノマリ事象に関する情報を並べて表示する、ことを特徴とする付記１０から１４のいずれか１項に記載の情報処理装置。
（付記１６）前記処理部は、複数のアノマリ事象が検知された場合、検知された前記複数のアノマリ事象それぞれの発生元のマシンを特定し、特定した前記マシンが共通し、かつ、特定した前記マシンで発生したアノマリ事象の検知時刻の差が閾値時間以内であると判断した場合、検知された前記複数のアノマリ事象に関する情報を集約した代表の情報を表示する、ことを特徴とする付記１０から１５のいずれか１項に記載の情報処理装置。 The following additional notes will be further disclosed with respect to the above explanation.
(Appendix 1) Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event generated in any of the plurality of machines is performed, and when a plurality of anomaly events are detected, the above With reference to a storage unit that stores information indicating the relationship between multiple services executed in the system, for each of the machines that originated each of the detected multiple anomaly events, for the service executed on that machine. The computer processes the process of identifying the number of other services having a specific relationship and displaying the detected information on the plurality of anomaly events on the display unit in an order according to the number of the identified other services. An order control program characterized by being executed by a computer.
(Appendix 2) When anomaly event detection processing is performed to detect anomaly events that have occurred in any of the multiple monitoring targets based on the monitoring data of multiple monitoring targets included in the system, and multiple anomaly events are detected. , The detection frequency of the anomaly event that occurred for the monitored target is calculated for each of the monitoring targets of the occurrence source of each of the plurality of detected anomaly events, and the information about the detected plurality of anomaly events is calculated. An order control program characterized by having a computer execute a process that is displayed on the display unit in an order according to the frequency.
(Appendix 3) The incident information that identifies the number of incident information based on an event that can cause a decrease in the service level of a plurality of services executed by the system, and identifies information on the detected plurality of anomaly events. The order control program according to Appendix 1 or 2, wherein the processing is displayed in an order corresponding to the number of the above.
(Appendix 4) A process of measuring the time elapsed since the detection of the anomaly event and displaying the detected information on the plurality of anomaly events in the order corresponding to the measured time is executed. The order control program according to any one of Supplementary note 1 to 3, which is characterized.
(Appendix 5) The sequence control program according to Appendix 4, wherein the shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed, and the process is executed.
(Appendix 6) When a plurality of anomaly events are detected, the machine from which each of the detected plurality of anomaly events is generated is identified, and information indicating the relationship between the plurality of services executed by the system is stored. When it is determined that there is a specific relationship between the services executed by each of the specified machines by referring to the storage unit, it occurs in each machine executing the service determined to have the specific relationship. The order control program according to any one of Supplementary note 1 to 5, wherein information related to anomaly events is displayed side by side, and processing is executed.
(Appendix 7) When a plurality of anomaly events are detected, the machine from which each of the detected plurality of anomaly events is generated is specified, and the identified machines are common and the anomalies generated in the specified machine are common. When it is determined that the difference in the detection time of the event is within the threshold time, the representative information that aggregates the information on the plurality of detected anomaly events is displayed, and the processing is executed. The order control program according to any one of the above items.
(Appendix 8) Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event generated in any of the plurality of machines is performed, and when a plurality of anomaly events are detected, the above-mentioned With reference to a storage unit that stores information indicating the relationship between multiple services executed in the system, for each of the machines that originated each of the detected multiple anomaly events, for the service executed on that machine. The computer processes the process of identifying the number of other services having a specific relationship and displaying the detected information on the plurality of anomaly events on the display unit in an order according to the number of the identified other services. An order control method characterized by the execution of.
(Appendix 9) When anomaly event detection processing is performed to detect anomaly events that have occurred in any of the plurality of monitoring targets based on the monitoring data of a plurality of monitoring targets included in the system, and a plurality of anomaly events are detected. , The detection frequency of the anomaly event that occurred for the monitored target is calculated for each of the monitored targets of the source of each of the detected plurality of anomaly events, and the information about the detected plurality of anomaly events is calculated. An order control method characterized in that a computer executes processing, which is displayed on a display unit in an order according to frequency.
(Appendix 10) Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event generated in any of the plurality of machines is performed, and when a plurality of anomaly events are detected, the above-mentioned With reference to a storage unit that stores information indicating the relationship between a plurality of services executed by the system, for each of the machines from which each of the plurality of anomaly events detected is generated, for the service executed by the machine. The process of identifying the number of other services having a specific relationship and displaying the detected information on the plurality of anomaly events on the display unit in the order according to the number of the identified other services is executed. An information processing device characterized by having a processing unit for processing.
(Appendix 11) When anomaly event detection processing is performed to detect anomaly events that have occurred in any of the plurality of monitoring targets based on the monitoring data of a plurality of monitoring targets included in the system, and a plurality of anomaly events are detected. , The detection frequency of the anomaly event that occurred for the monitored target was calculated for each of the monitored targets of the occurrence source of each of the plurality of detected anomaly events, and the information about the detected plurality of anomaly events was calculated. An information processing apparatus having a processing unit for executing processing, which is displayed on the display unit in an order according to frequency.
(Appendix 12) The processing unit identifies the number of incident information based on an event that can cause a decrease in the service level of a plurality of services executed by the system, and obtains information on the detected plurality of anomaly events. The information processing apparatus according to Appendix 10 or 11, wherein the information processing apparatus is displayed in an order corresponding to the number of the specified incident information.
(Appendix 13) The processing unit measures the time elapsed since the detection of the anomaly event, and displays the information regarding the detected plurality of anomaly events in the order corresponding to the measured time. The information processing apparatus according to any one of Supplementary note 10 to 12, wherein the information processing apparatus is characterized by the above-mentioned item 1.
(Supplementary Note 14) The information processing apparatus according to Supplementary Note 13, wherein the processing unit displays information on the detected plurality of anomaly events at a higher level as the measured time becomes shorter.
(Appendix 15) When a plurality of anomaly events are detected, the processing unit identifies the machine from which each of the detected plurality of anomaly events is generated, and determines the relationship between the plurality of services executed by the system. When it is determined that there is a specific relationship between the services executed on each of the specified machines by referring to the storage unit that stores the indicated information, the service determined to have the specific relationship is executed. The information processing apparatus according to any one of Supplementary note 10 to 14, wherein information on anomaly events generated in each machine is displayed side by side.
(Appendix 16) When a plurality of anomaly events are detected, the processing unit identifies the machine from which each of the detected plurality of anomaly events is generated, and the identified machines are common and specified. When it is determined that the difference in the detection time of the anomaly event generated in the machine is within the threshold time, the representative information that aggregates the information on the plurality of detected anomaly events is displayed. The information processing apparatus according to any one of 15.

Ｓ情報処理システム
１００監視対象システム
１１０ホストマシン
１３０構成収集サーバ
１４０ＣＭＤＢ
１５０性能監視サーバ
１７０イベント監視サーバ
１９０管理サーバ
２００分析サーバ
２１０データ収集部
２２０データ記憶部
２３０アノマリ検知部
２４０表示順序制御部
２５０データ可視化部
３００運用端末
S Information processing system 100 Monitored system 110 Host machine 130 Configuration collection server 140 CMDB
150 Performance monitoring server 170 Event monitoring server 190 Management server 200 Analysis server 210 Data collection unit 220 Data storage unit 230 Anomaly detection unit 240 Display order control unit 250 Data visualization unit 300 Operation terminal

Claims

Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event occurring in any of the plurality of machines is performed.
When a plurality of anomaly events are detected, each of the machines from which the plurality of anomaly events are detected is referred to by referring to a storage unit that stores information indicating the relationship between the plurality of services executed by the system. To identify the number of other services that have a particular relationship to the services running on that machine.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information about the detected plurality of anomaly events is displayed on the display unit in an order according to the number of the specified other services.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An order control program characterized by having a computer perform processing.

Based on the monitoring data of a plurality of monitoring targets included in the system, anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of monitoring targets is performed.
When a plurality of anomaly events are detected, the detection frequency of the anomaly events that have occurred for the monitored target is calculated for each monitoring target of the source of each of the detected plurality of anomaly events.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information on the detected plurality of anomaly events is displayed on the display unit in the order according to the calculated detection frequency.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An order control program characterized by having a computer perform processing.

Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event occurring in any of the plurality of machines is performed.
When multiple anomaly events are detected, the machine from which each of the detected multiple anomaly events is generated is identified.
When it is determined that the specified machines are common and the difference in the detection time of the anomaly events that occurred in the specified machine is within the threshold time, the representative that aggregates the information on the plurality of detected anomaly events. Display information on the display
An order control program characterized by having a computer perform processing.

Based on the monitoring data of a plurality of monitoring targets included in the system, anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of monitoring targets is performed.
When multiple anomaly events are detected, the machine from which each of the detected multiple anomaly events is generated is identified.
When it is determined that the specified machines are common and the difference in the detection time of the anomaly events that occurred in the specified machine is within the threshold time, the representative that aggregates the information on the plurality of detected anomaly events. Display information on the display
An order control program characterized by having a computer perform processing.

Identify the number of incident information based on events that can cause service level degradation for multiple services running on the system.
Information about the plurality of detected anomaly events is displayed in an order according to the number of the identified incident information.
The order control program according to any one of claims 1 to 4, wherein the process is executed.

When multiple anomaly events are detected, the machine from which each of the detected multiple anomaly events is generated is identified.
When it is determined that there is a specific relationship between the services executed on each of the specified machines by referring to the storage unit that stores the information indicating the relationship between the plurality of services executed by the system, the specification is performed. Display side-by-side information about anomaly events that occurred on each machine running a service that was determined to be related to
The order control program according to any one of claims 1 to 5, wherein the process is executed.

Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event occurring in any of the plurality of machines is performed.
When a plurality of anomaly events are detected, each of the machines from which the plurality of anomaly events are detected is referred to by referring to a storage unit that stores information indicating the relationship between the plurality of services executed by the system. To identify the number of other services that have a particular relationship to the services running on that machine.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information about the detected plurality of anomaly events is displayed on the display unit in an order according to the number of the specified other services.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An order control method characterized by a computer performing processing.

Based on the monitoring data of a plurality of monitoring targets included in the system, anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of monitoring targets is performed.
When a plurality of anomaly events are detected, the detection frequency of the anomaly events that have occurred for the monitored target is calculated for each monitoring target of the source of each of the detected plurality of anomaly events.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information on the detected plurality of anomaly events is displayed on the display unit in the order according to the calculated detection frequency.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An order control method characterized by a computer performing processing.

Based on the monitoring data of a plurality of machines included in the system, anomaly event detection processing for detecting an anomaly event occurring in any of the plurality of machines is performed.
When a plurality of anomaly events are detected, each of the machines from which the plurality of anomaly events are detected is referred to by referring to a storage unit that stores information indicating the relationship between the plurality of services executed by the system. To identify the number of other services that have a particular relationship to the services running on that machine.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information about the detected plurality of anomaly events is displayed on the display unit in an order according to the number of the specified other services.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An information processing device characterized by having a processing unit that executes processing.

Based on the monitoring data of a plurality of monitoring targets included in the system, anomaly event detection processing for detecting an anomaly event that has occurred in any of the plurality of monitoring targets is performed.
When a plurality of anomaly events are detected, the detection frequency of the anomaly events that have occurred for the monitored target is calculated for each monitoring target of the source of each of the detected plurality of anomaly events.
The time elapsed since the detection of the multiple anomaly events was measured, and the time elapsed was measured.
Information on the detected plurality of anomaly events is displayed on the display unit in the order according to the calculated detection frequency.
The shorter the measured time, the higher the information about the detected plurality of anomaly events is displayed .
An information processing device characterized by having a processing unit that executes processing.