JP2019028878A

JP2019028878A - Information processing device and program

Info

Publication number: JP2019028878A
Application number: JP2017149995A
Authority: JP
Inventors: 章二大嶋; Shoji Oshima; 宏和松林; Hirokazu Matsubayashi
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2017-08-02
Filing date: 2017-08-02
Publication date: 2019-02-21
Anticipated expiration: 2037-08-02
Also published as: JP6974703B2

Abstract

To provide an information processing device for suppressing the collection of logs which are not useful to analysis.SOLUTION: A storage part 1a of an information processing device 1 stores a time range of log records to be an extraction object and the priority level of each type of the log records in operation information 2 including a plurality of log records about a component of a prescribed device as management information 3 in each message. A processing part 1b extracts log records #n-3, #n-1, #n from in the operation information to an extraction size upper limit=3 on the basis of a time range ΔT1=10 minutes from the current time=18:10:00 corresponding to a message (ID:M1) and a priority level type1>type2>type3 with reference to the storage part when the message is detected.SELECTED DRAWING: Figure 1

Description

本発明は情報処理装置およびプログラムに関する。 The present invention relates to an information processing apparatus and a program.

情報処理システムでは、運用管理用のコンピュータにより、情報処理システムに含まれる装置の動作に関する動作情報を取得し、当該装置の稼働状況を把握可能にすることがある。動作情報は、当該装置のハードウェアやソフトウェアなどのコンポーネントによって出力されるログを含む。 In an information processing system, an operation management computer may acquire operation information related to the operation of an apparatus included in the information processing system and make it possible to grasp the operating status of the apparatus. The operation information includes a log output by a component such as hardware or software of the device.

例えば、一連の通信パスの状態変化を表すログ情報を一括して収集し、一連の通信パスの状態変化を表すログ情報に統一したフォーマットで編集して外部記憶装置に出力するデータ通信処理装置の提案がある。 For example, the data communication processing apparatus collects log information representing a series of communication path status changes in a batch, edits the log information representing a series of communication path status changes in a unified format, and outputs the log information to an external storage device. I have a suggestion.

また、複数の監視対象が正常に稼働しているか監視し、複数の監視対象各々の稼働状況をまとめてディスプレイに表示する監視装置の提案もある。この提案では、監視装置は、ディスプレイに時間軸を表示するとともに、時間軸上に、所定数を上限として複数のイベント情報をイベント発生順またはイベント情報発生順に並べて表示する。監視装置は、新たなイベント情報を取得すると、当該イベント情報を時間軸条の所定の位置に、他のイベント情報と並べて表示する。 There is also a proposal for a monitoring device that monitors whether a plurality of monitoring targets are operating normally and displays the operating status of each of the plurality of monitoring targets on a display. In this proposal, the monitoring apparatus displays a time axis on the display, and displays a plurality of event information on the time axis in a sequence of event occurrence or event information occurrence with a predetermined number as an upper limit. When the monitoring device acquires new event information, the monitoring device displays the event information side by side with other event information at a predetermined position on the time axis.

特開平２−２１２９５６号公報Japanese Laid-Open Patent Publication No. Hei 2-212156 国際公開第２０１３／０２１５３０号International Publication No. 2013/021530

障害などの事象に対して、装置に保存されているログ情報を全て収集しようとすると、当該事象との関係が薄く、当該事象の解析に有用でないログも収集されるという問題がある。 When all the log information stored in the apparatus is collected for an event such as a failure, there is a problem that a log that is not related to the event and is not useful for analyzing the event is collected.

１つの側面では、本発明は、解析に有用でないログの収集を抑えることを目的とする。 In one aspect, the present invention aims to suppress the collection of logs that are not useful for analysis.

１つの態様では、情報処理装置が提供される。情報処理装置は、記憶部と処理部とを有する。記憶部は、所定の装置の構成部品に関する複数のログレコードを含む動作情報のうち、抽出対象とするログレコードの時間範囲とログレコードのタイプ毎の優先レベルとを、メッセージ毎に記憶する。処理部は、メッセージを検出すると、記憶部を参照して、メッセージに応じた現時刻からの時間範囲および優先レベルに基づき、動作情報の中からログレコードを抽出する。 In one aspect, an information processing apparatus is provided. The information processing apparatus includes a storage unit and a processing unit. A memory | storage part memorize | stores for every message the time range of the log record made into extraction object, and the priority level for every type of log record among the operation information containing the several log record regarding the component of a predetermined | prescribed apparatus. When the processing unit detects the message, the processing unit refers to the storage unit and extracts a log record from the operation information based on the time range and priority level from the current time according to the message.

１つの側面では、解析に有用でないログの収集を抑えることができる。 In one aspect, collection of logs that are not useful for analysis can be suppressed.

第１の実施の形態の情報処理装置を示す図である。It is a figure which shows the information processing apparatus of 1st Embodiment. 第２の実施の形態のストレージシステムの例を示す図である。It is a figure which shows the example of the storage system of 2nd Embodiment. 第２の実施の形態のストレージ装置の接続例を示す図である。It is a figure which shows the example of a connection of the storage apparatus of 2nd Embodiment. 第２の実施の形態のストレージ装置のハードウェア例を示す図である。3 is a diagram illustrating an example of hardware of a storage device according to a second embodiment. FIG. 第２の実施の形態のＣＭの機能例を示す図である。It is a figure which shows the function example of CM of 2nd Embodiment. 第２の実施の形態のページの例を示す図である。It is a figure which shows the example of the page of 2nd Embodiment. 第２の実施の形態のページリストの例を示す図である。It is a figure which shows the example of the page list of 2nd Embodiment. 第２の実施の形態の割り当て方式管理テーブルの例を示す図である。It is a figure which shows the example of the allocation system management table of 2nd Embodiment. 第２の実施の形態のログ抽出管理テーブルの例を示す図である。It is a figure which shows the example of the log extraction management table of 2nd Embodiment. 第２の実施の形態のログ収集例を示す図である。It is a figure which shows the log collection example of 2nd Embodiment. 第２の実施の形態のログ収集例を示すフローチャートである。It is a flowchart which shows the log collection example of 2nd Embodiment. 第２の実施の形態のＣＭ単位のログ抽出例を示すフローチャートである。It is a flowchart which shows the log extraction example of CM unit of 2nd Embodiment. 第２の実施の形態の時間範囲内のログ抽出例を示すフローチャートである。It is a flowchart which shows the log extraction example within the time range of 2nd Embodiment. 第２の実施の形態の優先レベル単位のログ抽出例を示すフローチャートである。It is a flowchart which shows the log extraction example of the priority level unit of 2nd Embodiment. 第２の実施の形態のログ抽出例（その１）を示す図である。It is a figure which shows the log extraction example (the 1) of 2nd Embodiment. 第２の実施の形態のログ抽出例（その２）を示す図である。It is a figure which shows the log extraction example (the 2) of 2nd Embodiment. 第２の実施の形態のログ抽出例（その３）を示す図である。It is a figure which shows the log extraction example (the 3) of 2nd Embodiment. 第２の実施の形態のログ抽出例（その４）を示す図である。It is a figure which shows the log extraction example (the 4) of 2nd Embodiment. 第３の実施の形態のログ抽出管理テーブルの例を示す図である。It is a figure which shows the example of the log extraction management table of 3rd Embodiment. 第３の実施の形態のＣＭ単位のログ抽出例を示すフローチャートである。15 is a flowchart illustrating an example of CM-unit log extraction according to the third embodiment. 第３の実施の形態のログ抽出例を示す図である。It is a figure which shows the log extraction example of 3rd Embodiment. 第３の実施の形態のログ抽出管理テーブルの第１具体例を示す図である。It is a figure which shows the 1st specific example of the log extraction management table of 3rd Embodiment. 第３の実施の形態のログ抽出の第１具体例を示す図である。It is a figure which shows the 1st specific example of the log extraction of 3rd Embodiment. 第３の実施の形態のログ抽出管理テーブルの第２具体例を示す図である。It is a figure which shows the 2nd specific example of the log extraction management table of 3rd Embodiment. 第３の実施の形態のログ抽出の第２具体例を示す図である。It is a figure which shows the 2nd specific example of log extraction of 3rd Embodiment.

以下、本実施の形態について図面を参照して説明する。
［第１の実施の形態］
図１は、第１の実施の形態の情報処理装置を示す図である。情報処理装置１は、所定の装置の構成部品の動作に関する動作情報を取得する。所定の装置は、情報処理装置１でもよいし、情報処理装置１以外の他の装置でもよい。情報処理装置１は、障害などのイベントに対して解析用の動作情報を収集する機能を提供する。情報処理装置１は、記憶部１ａおよび処理部１ｂを有する。 Hereinafter, the present embodiment will be described with reference to the drawings.
[First Embodiment]
FIG. 1 is a diagram illustrating the information processing apparatus according to the first embodiment. The information processing apparatus 1 acquires operation information regarding the operation of the component parts of a predetermined apparatus. The predetermined device may be the information processing device 1 or another device other than the information processing device 1. The information processing apparatus 1 provides a function of collecting operation information for analysis for an event such as a failure. The information processing apparatus 1 includes a storage unit 1a and a processing unit 1b.

記憶部１ａは、ＲＡＭ（Random Access Memory）などの揮発性記憶装置でもよいし、ＨＤＤ（Hard Disk Drive）やフラッシュメモリなどの不揮発性記憶装置でもよい。処理部１ｂは、ＣＰＵ（Central Processing Unit）、ＤＳＰ（Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field Programmable Gate Array）などを含み得る。処理部１ｂはプログラムを実行するプロセッサでもよい。「プロセッサ」には、複数のプロセッサの集合（マルチプロセッサ）も含まれ得る。 The storage unit 1a may be a volatile storage device such as a RAM (Random Access Memory) or a non-volatile storage device such as an HDD (Hard Disk Drive) or a flash memory. The processing unit 1b may include a CPU (Central Processing Unit), a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array), and the like. The processing unit 1b may be a processor that executes a program. The “processor” may include a set of multiple processors (multiprocessor).

記憶部１ａは、動作情報２および管理情報３を記憶する。動作情報２は、所定の装置の構成部品に関する複数のログレコードを含む。構成部品は、例えば、該当の装置が備えるハードウェアやソフトウェアなどのコンポーネントである。あるいは、構成部品は、当該コンポーネントにおいて所定の機能を実現するモジュールでもよい。１つのログレコードは、レコード番号（図中“＃”と表記）、タイムスタンプ、ログタイプおよびログ内容を含む。レコード番号は、ログレコードの識別番号である。タイムスタンプは、ログ内容が記録された日時である。ログタイプは、ログ内容の分類を示す識別情報である。分類は、例えば、ログに関連するハードウェアの種類（記憶デバイスや通信デバイスなど）、ソフトウェアの種類（ＯＳ（Operating system）、ミドルウェアおよびアプリケーションなど）に応じて分けられる。動作情報２の例では、ログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”、“ｔｙｐｅ３”の３種類ある。ログ内容は、記録されたログの内容を示す情報である。 The storage unit 1a stores operation information 2 and management information 3. The operation information 2 includes a plurality of log records related to components of a predetermined device. The component is, for example, a component such as hardware or software included in the corresponding device. Alternatively, the component may be a module that realizes a predetermined function in the component. One log record includes a record number (indicated as “#” in the figure), a time stamp, a log type, and log contents. The record number is an identification number of the log record. The time stamp is the date and time when the log content was recorded. The log type is identification information indicating the classification of log contents. The classification is classified according to, for example, the type of hardware (storage device, communication device, etc.) and the type of software (OS (Operating system), middleware, application, etc.) related to the log. In the example of the operation information 2, there are three log types, “type 1”, “type 2”, and “type 3”. The log content is information indicating the content of the recorded log.

例えば、動作情報２は、ログ番号（＃）“ｎ−５”（ｎは６以上の整数）、タイムスタンプ“２０１７／６／３０１７：５８：５０”、ログタイプ“ｔｙｐｅ１”、ログ内容“ｆａｕｌｔａａａａ”というログレコードを含む。動作情報２の例では、ログ番号（＃）“ｎ”のログレコードが最新である。 For example, the operation information 2 includes a log number (#) “n-5” (n is an integer of 6 or more), a time stamp “2017/6/30 17:58:50”, a log type “type1”, and a log content “ It contains a log record called “fault aaa”. In the example of the operation information 2, the log record with the log number (#) “n” is the latest.

管理情報３は、動作情報２のうち、抽出対象とするログレコードの時間範囲とログレコードのタイプ（ログタイプ）毎の優先レベルとが、事象の発生を示すメッセージに対して登録された情報である。メッセージは、所定の装置（情報処理装置１または他の装置）におけるハードウェアやソフトウェアなどのコンポーネントにより発行される。管理情報の１つのレコードは、メッセージＩＤ（IDentifier）、時間範囲、およびログタイプ優先レベルを含む。メッセージＩＤは、メッセージの識別情報である。時間範囲は、ログレコードの抽出対象とする時間範囲を示す情報である。ログタイプ優先レベルは、ログタイプ毎の優先度を示す情報である。メッセージの発行元のコンポーネントに対して関連性が高いログタイプほど、優先度が高くなるように予め設定される。 The management information 3 is information in which the time range of the log record to be extracted and the priority level for each type of log record (log type) are registered with respect to the message indicating the occurrence of the event. is there. The message is issued by a component such as hardware or software in a predetermined device (the information processing device 1 or another device). One record of management information includes a message ID (IDentifier), a time range, and a log type priority level. The message ID is message identification information. The time range is information indicating a time range to be extracted from the log record. The log type priority level is information indicating the priority for each log type. The log type that is more relevant to the component that issued the message is set in advance so that the priority is higher.

例えば、管理情報３は、メッセージＩＤ“Ｍ１”、時間範囲“ΔＴ１”、ログタイプ優先レベル“ｔｙｐｅ１＞ｔｙｐｅ２＞ｔｙｐｅ３”というレコードを含む。ここで、ログタイプ優先レベルの記号“＞”は、当該記号の左側のログタイプの方が、当該記号の右側のログタイプよりも優先度が高いことを示す。例えば、“ｔｙｐｅ１＞ｔｙｐｅ２＞ｔｙｐｅ３”の表記は、３つのログタイプのうち、“ｔｙｐｅ１”が最も優先度が高く、次いで“ｔｙｐｅ２”の優先度が高く、“ｔｙｐｅ３”の優先度が最も低いことを示す。 For example, the management information 3 includes a record of message ID “M1”, time range “ΔT1”, and log type priority level “type1> type2> type3”. Here, the symbol “>” of the log type priority level indicates that the log type on the left side of the symbol has higher priority than the log type on the right side of the symbol. For example, in the notation “type1> type2> type3”, among the three log types, “type1” has the highest priority, “type2” has the highest priority, and “type3” has the lowest priority. Indicates.

処理部１ｂは、管理情報３を取得し、記憶部１ａに格納する。管理情報３は、例えば、ユーザにより情報処理装置１に対して予め入力される。
処理部１ｂは、メッセージを検出すると、記憶部１ａを参照して、当該メッセージに応じた現時刻からの時間範囲および優先レベル（ログタイプ優先レベル）に基づき、動作情報２の中からログレコードを抽出する。 The processing unit 1b acquires the management information 3 and stores it in the storage unit 1a. For example, the management information 3 is input in advance to the information processing apparatus 1 by the user.
When the processing unit 1b detects the message, the processing unit 1b refers to the storage unit 1a, and extracts a log record from the operation information 2 based on the time range and priority level (log type priority level) from the current time according to the message. Extract.

例えば、処理部１ｂは、メッセージＩＤ“Ｍ１”を含むメッセージを受信する。管理情報３によれば、メッセージＩＤ“Ｍ１”に応じた時間範囲は“ΔＴ１”である。管理情報３によれば、メッセージＩＤ“Ｍ１”に応じたログタイプ優先レベルは、“ｔｙｐｅ１＞ｔｙｐｅ２＞ｔｙｐｅ３”である。したがって、処理部１ｂは、現時刻からの時間範囲“ΔＴ１”およびログタイプ優先レベル“ｔｙｐｅ１＞ｔｙｐｅ２＞ｔｙｐｅ３”に基づいて動作情報２の中からログレコードを抽出する。 For example, the processing unit 1b receives a message including the message ID “M1”. According to the management information 3, the time range corresponding to the message ID “M1” is “ΔT1”. According to the management information 3, the log type priority level corresponding to the message ID “M1” is “type1> type2> type3”. Accordingly, the processing unit 1b extracts a log record from the operation information 2 based on the time range “ΔT1” from the current time and the log type priority level “type1> type2> type3”.

より具体的には、抽出条件の一例として、（１）ΔＴ１が１０分（ΔＴ１＝１０分）であり、（２）現時刻が２０１７／６／３０の１８：１０：００であり、（３）抽出するログレコードの合計サイズの上限が“３”（抽出サイズ上限＝３）の場合を考える。ここで、例えば、動作情報２のログレコード１つ当たりのサイズを１とする。 More specifically, as an example of the extraction condition, (1) ΔT1 is 10 minutes (ΔT1 = 10 minutes), (2) the current time is 18/60: 00 of 2017/6/30, (3 ) Consider a case where the upper limit of the total size of log records to be extracted is “3” (extraction size upper limit = 3). Here, for example, the size of one log record of the operation information 2 is assumed to be 1.

この場合、処理部１ｂは、例えば、次のようにログレコードの抽出を行う。
まず、処理部１ｂは、現時刻（１８：１０：００）から時間範囲“ΔＴ１＝１０分”だけ遡った時刻１８：００：００を計算する。そして、処理部１ｂは、時刻１８：００：００から現時刻までの時間範囲に属するログレコードを抽出対象候補とする。動作情報２の例では、抽出対象候補は、レコード番号“ｎ−４”〜“ｎ”までのログレコードである。 In this case, the processing unit 1b extracts log records as follows, for example.
First, the processing unit 1b calculates a time 18:00 that is back by the time range “ΔT1 = 10 minutes” from the current time (18:10:00). Then, the processing unit 1b sets log records belonging to the time range from time 18:00 to the current time as extraction target candidates. In the example of the operation information 2, the extraction target candidates are log records with record numbers “n−4” to “n”.

次に、処理部１ｂは、抽出対象候補のログレコードのうち、最高の優先レベルであるログタイプ“ｔｙｐｅ１”のログレコードを新しい方から古い方へ順に抽出する。まず、処理部１ｂは、ログタイプ“ｔｙｐｅ１”であるレコード番号“ｎ”のログレコードを抽出する。処理部１ｂは、ログレコードを１つ抽出するたびに、抽出したログレコードの合計サイズが上限“３”に達したか否かを判定する。この段階では、抽出したログレコードの合計サイズは“１”であり、上限“３”に達していない。次に、処理部１ｂは、ログタイプ“ｔｙｐｅ１”であるレコード番号“ｎ−３”のログレコードを抽出する。この段階では、抽出したログレコードの合計サイズは“２”であり、上限“３”に達していない。 Next, the processing unit 1b sequentially extracts the log records of the log type “type1”, which is the highest priority level, from the newest to the oldest among the extraction target candidate log records. First, the processing unit 1b extracts a log record with a record number “n” having a log type “type1”. Each time the processing unit 1b extracts one log record, the processing unit 1b determines whether or not the total size of the extracted log records has reached the upper limit “3”. At this stage, the total size of the extracted log records is “1” and does not reach the upper limit “3”. Next, the processing unit 1b extracts a log record with the record number “n-3” having the log type “type1”. At this stage, the total size of the extracted log records is “2” and does not reach the upper limit “3”.

次に、処理部１ｂは、抽出対象候補のログレコードのうち、２番目に高い優先レベルであるログタイプ“ｔｙｐｅ２”のログレコードを新しい方から古い方へ順に抽出する。まず、処理部１ｂは、ログタイプ“ｔｙｐｅ２”であるレコード番号“ｎ−１”のログレコードを抽出する。この段階で、抽出したログレコードの合計サイズは“３”であり、上限“３”に達する。したがって、処理部１ｂは、抽出対象候補であるレコード番号“ｎ−４”〜“ｎ”までのログレコードのうち、レコード番号“ｎ−３”、“ｎ−１”、“ｎ”のログレコードを抽出する。一方、処理部１ｂは、抽出対象候補であるレコード番号“ｎ−４”〜“ｎ”までのログレコードのうち、レコード番号“ｎ−４”（ログタイプ“ｔｙｐｅ２”）、“ｎ−２”（ログタイプ“ｔｙｐｅ３”）のレコードを抽出しない。 Next, the processing unit 1b sequentially extracts the log records of the log type “type2”, which is the second highest priority level, from the newest to the oldest among the log records of extraction target candidates. First, the processing unit 1b extracts a log record having a record number “n−1” having a log type “type 2”. At this stage, the total size of the extracted log records is “3” and reaches the upper limit “3”. Therefore, the processing unit 1b selects the log records with the record numbers “n-3”, “n−1”, and “n” from the log records with the record numbers “n-4” to “n” that are candidates for extraction. To extract. On the other hand, the processing unit 1b records the record number “n-4” (log type “type2”), “n-2” among the log records from the record numbers “n-4” to “n” that are extraction target candidates. The record of (log type “type3”) is not extracted.

処理部１ｂは、抽出したレコード番号“ｎ−３”、“ｎ−１”、“ｎ”のログレコードを出力する。例えば、処理部１ｂは、抽出したログレコードを、情報処理装置１に接続された表示装置（図１では図示を省略している）に出力し、表示装置により抽出したログレコードに含まれるログ内容を表示させてもよい。また、処理部１ｂは、抽出したログレコードを解析することで、障害などのイベントの原因特定を支援してもよい。あるいは、処理部１ｂは、ログレコードの解析を行う他の装置（図１では図示を省略している）に、抽出したログレコードを送信してもよい。 The processing unit 1b outputs log records having the extracted record numbers “n-3”, “n−1”, and “n”. For example, the processing unit 1b outputs the extracted log record to a display device (not shown in FIG. 1) connected to the information processing device 1, and the log contents included in the log record extracted by the display device May be displayed. Further, the processing unit 1b may support the identification of the cause of an event such as a failure by analyzing the extracted log record. Alternatively, the processing unit 1b may transmit the extracted log record to another device (not shown in FIG. 1) that analyzes the log record.

このように、情報処理装置１によれば、解析に有用でないログの収集を抑えることができる。
ここで、例えば、障害などの事象（イベント）に対して、動作情報２を全て収集することも考えられる。しかし、動作情報２には、新しいものや古いもの、ハードウェアやソフトウェアなどに関する種々のログレコードが含まれる。このため、動作情報２を全て収集すると、発生した事象との関係が薄く、当該事象の解析に有用でないログレコードも収集されるという問題がある。余計なログレコードの収集は、ログレコードを他の装置に送信する際の通信量の増加や、有用でないログレコードによる解析量の増加などの要因になる。 Thus, according to the information processing apparatus 1, it is possible to suppress the collection of logs that are not useful for analysis.
Here, for example, it is also conceivable to collect all the operation information 2 for an event such as a failure. However, the operation information 2 includes various log records related to new and old items, hardware and software. For this reason, when all of the operation information 2 is collected, there is a problem that the relationship with the generated event is thin, and log records that are not useful for analyzing the event are also collected. The collection of unnecessary log records causes an increase in the amount of communication when the log records are transmitted to other devices, and an increase in the amount of analysis due to unusable log records.

例えば、収集するログサイズを小さくするために、単に、抽出対象サイズに上限を設けることも考えられる。しかし、抽出対象サイズに上限を設けただけでは、該当の事象に対して有用でないログレコードが抽出される可能性は大きい。なぜなら、発生する事象に応じて、当該事象と関連性の高いコンポーネントは異なるからである。また、現時点に対して古いログレコードほど、現時点の事象との関連が薄くなるからである。 For example, in order to reduce the log size to be collected, it is possible to simply set an upper limit on the extraction target size. However, simply setting an upper limit on the extraction target size is likely to extract log records that are not useful for the event. This is because components highly relevant to the event differ depending on the event that occurs. Also, the older the log record with respect to the current time, the less the relationship with the current event.

そこで、情報処理装置１は、事象の発生を示すメッセージ毎に抽出対象ログレコードの時間範囲とログタイプ別の優先レベルと（管理情報３）を取得し、記憶部１ａにより記憶する。情報処理装置１は、メッセージを検出すると、当該メッセージに応じた時間範囲とログタイプ別の優先レベルとを記憶部１ａに記憶された管理情報３から検索する。そして、情報処理装置１は、現時点以前の時間範囲とログタイプ別の優先レベルとを基にログレコードを抽出する。これにより、情報処理装置１は、動作情報２のうち、障害解析に有用なログレコードのみを得ることができる。 Therefore, the information processing apparatus 1 acquires the time range of the extraction target log record and the priority level for each log type (management information 3) for each message indicating the occurrence of an event, and stores it in the storage unit 1a. When the information processing apparatus 1 detects a message, the information processing apparatus 1 searches the management information 3 stored in the storage unit 1a for a time range corresponding to the message and a priority level for each log type. Then, the information processing apparatus 1 extracts log records based on the time range before the current time and the priority level for each log type. Thereby, the information processing apparatus 1 can obtain only the log record useful for the failure analysis in the operation information 2.

以下では、情報処理装置１の機能を有するストレージ装置を例示して、当該機能を更に具体的に説明する。
［第２の実施の形態］
図２は、第２の実施の形態のストレージシステムの例を示す図である。第２の実施の形態のストレージシステムは、ストレージ装置１０，２０を含む。ストレージ装置１０，２０は、フロントエンドエンクロージャ（ＦＥ：Front-end Enclosure）３０を介して接続されている。ストレージ装置１０は、ローカルエリアネットワーク（ＬＡＮ：Local Area Network）４０に接続されている。ＬＡＮ４０は、インターネット５０に接続されている。 Hereinafter, the storage apparatus having the function of the information processing apparatus 1 will be exemplified to describe the function more specifically.
[Second Embodiment]
FIG. 2 illustrates an example of the storage system according to the second embodiment. The storage system according to the second embodiment includes storage apparatuses 10 and 20. The storage apparatuses 10 and 20 are connected via a front-end enclosure (FE) 30. The storage device 10 is connected to a local area network (LAN) 40. The LAN 40 is connected to the Internet 50.

ストレージ装置１０は、ＬＡＮ４０およびインターネット５０を介して、サポートサーバ６０と通信する。サポートサーバ６０は、ストレージシステムの保守に用いられるサーバコンピュータである。サポートサーバ６０は、ストレージシステムで障害が発生した場合に、ストレージ装置１０，２０のログを取得し、取得したログに基づく保守員による保守作業（例えば、障害の原因究明や対策立案など）を支援する。 The storage device 10 communicates with the support server 60 via the LAN 40 and the Internet 50. The support server 60 is a server computer used for maintenance of the storage system. The support server 60 acquires logs of the storage devices 10 and 20 when a failure occurs in the storage system, and supports maintenance work (for example, investigation of the cause of the failure and planning of countermeasures) based on the acquired logs. To do.

ストレージ装置１０，２０は、ストレージエリアネットワーク（ＳＡＮ：Storage Area Network）７０に接続されている。ＳＡＮ７０には、業務サーバ８０が接続されている。業務サーバ８０は、ユーザの業務を支援するソフトウェアを実行するサーバコンピュータである。ストレージ装置１０，２０は、業務サーバ８０の業務処理に用いられるデータを記憶する。ストレージ装置１０，２０はＳＡＮ７０を介して、業務サーバ８０によるデータアクセスを受け付ける。 The storage devices 10 and 20 are connected to a storage area network (SAN) 70. A business server 80 is connected to the SAN 70. The business server 80 is a server computer that executes software that supports a user's business. The storage devices 10 and 20 store data used for business processing of the business server 80. The storage apparatuses 10 and 20 accept data access by the business server 80 via the SAN 70.

図３は、第２の実施の形態のストレージ装置の接続例を示す図である。ストレージ装置１０は、コントローラモジュール（ＣＭ：Controller Module）１００，２００およびドライブエンクロージャ（ＤＥ：Drive Enclosure）１１，１２を有する。 FIG. 3 is a diagram illustrating a connection example of the storage apparatus according to the second embodiment. The storage apparatus 10 includes controller modules (CM) 100 and 200 and drive enclosures (DE) 11 and 12.

ＣＭ１００，２００は、ＤＥ１１，１２に収納されたＨＤＤやＳＳＤ（Solid State Drive）などの記憶装置に対するデータアクセスを制御するストレージ制御装置である。ＣＭ１００，２００は、コントローラエンクロージャ（ＣＥ：Controller Enclosure）と呼ばれる１つの筐体に収められる。ＣＭ１００，２００は、ＦＥ３０に接続されている。ＣＭ１００は、ＤＥ１１，１２に接続されている。ＣＭ２００は、ＤＥ１１，１２に接続されている。 The CMs 100 and 200 are storage control devices that control data access to storage devices such as HDDs and SSDs (Solid State Drives) housed in the DEs 11 and 12. The CMs 100 and 200 are housed in a single housing called a controller enclosure (CE). The CMs 100 and 200 are connected to the FE 30. The CM 100 is connected to the DEs 11 and 12. The CM 200 is connected to the DEs 11 and 12.

ＤＥ１１，１２は、ＨＤＤやＳＳＤなどの記憶装置を複数収容する。ＤＥ１１，１２は、ＣＭ１００，２００とは別筐体でも（ＣＭ１００，２００に対して外付けされても）よいし、ＣＭ１００，２００と同じ筐体に収められてもよい。ＣＭ１００は、第１の実施の形態の情報処理装置１の一例である。 The DEs 11 and 12 accommodate a plurality of storage devices such as HDDs and SSDs. The DEs 11 and 12 may be separate from the CMs 100 and 200 (externally attached to the CMs 100 and 200) or may be housed in the same housing as the CMs 100 and 200. The CM 100 is an example of the information processing apparatus 1 according to the first embodiment.

ストレージ装置２０は、ＣＭ３００，４００およびＤＥ２１，２２を有する。
ＣＭ３００，４００は、ＤＥ２１，２２に収納されたＨＤＤやＳＳＤなどの記憶装置に対するデータアクセスを制御するストレージ制御装置である。ＣＭ３００，４００は、ＦＥ３０に接続されている。ＣＭ３００は、ＤＥ２１，２２に接続されている。ＣＭ４００は、ＤＥ２１，２２に接続されている。 The storage apparatus 20 includes CMs 300 and 400 and DEs 21 and 22.
The CMs 300 and 400 are storage control devices that control data access to storage devices such as HDDs and SSDs stored in the DEs 21 and 22. The CMs 300 and 400 are connected to the FE 30. The CM 300 is connected to the DEs 21 and 22. The CM 400 is connected to the DEs 21 and 22.

ＤＥ２１，２２は、ＨＤＤやＳＳＤなどの記憶装置を複数収容する。ＤＥ２１，２２は、ＣＭ３００，４００とは別筐体でも（ＣＭ３００，４００に対して外付けされても）よいし、ＣＭ３００，４００と同じ筐体に収められてもよい。 The DEs 21 and 22 accommodate a plurality of storage devices such as HDDs and SSDs. The DEs 21 and 22 may be separate from the CMs 300 and 400 (externally attached to the CMs 300 and 400), or may be housed in the same case as the CMs 300 and 400.

ここで、ＣＭ１００は、ＣＭ２００，３００，４００に対するマスタＣＭとして機能する。マスタＣＭは、ストレージシステムの運用管理機能を統括するＣＭであり、ＦＥ３０を介して、他のＣＭ（ＣＭ２００，３００，４００）からログなどの情報を収集し、収集したログをサポートサーバ６０に送信する機能を担う。 Here, the CM 100 functions as a master CM for the CMs 200, 300, and 400. The master CM is a CM that supervises the operation management function of the storage system, collects information such as logs from other CMs (CMs 200, 300, and 400) via the FE 30, and transmits the collected logs to the support server 60. It bears the function to do.

図４は、第２の実施の形態のストレージ装置のハードウェア例を示す図である。ＣＭ１００は、プロセッサ１０１、ＲＡＭ１０２、ＮＡ（Network Adapter）１０３、ＣＡ（Channel Adapter）１０４、ＮＴＢ（Non-Transparent Bridge）１０５、ＢＵＤ（Boot-up and Utility Device）１０６、ＤＩ（Drive Interface）１０７、ＣＭ−ＩＦ（InterFace）１０８および媒体リーダ１０９を有する。これらのハードウェアは、ＣＭ１００の内部バスに接続されている。ＣＭ２００，３００，４００も同様のハードウェアにより実現される。 FIG. 4 is a diagram illustrating a hardware example of the storage apparatus according to the second embodiment. The CM 100 includes a processor 101, RAM 102, NA (Network Adapter) 103, CA (Channel Adapter) 104, NTB (Non-Transparent Bridge) 105, BUD (Boot-up and Utility Device) 106, DI (Drive Interface) 107, CM An IF (InterFace) 108 and a media reader 109 are included. These hardwares are connected to the internal bus of the CM 100. The CMs 200, 300, and 400 are also realized by similar hardware.

プロセッサ１０１は、ＣＭ１００の情報処理を制御するハードウェアである。プロセッサ１０１は、マルチプロセッサであってもよい。プロセッサ１０１は、例えばＣＰＵ、ＤＳＰ、ＡＳＩＣまたはＦＰＧＡなどである。プロセッサ１０１は、ＣＰＵ、ＤＳＰ、ＡＳＩＣ、ＦＰＧＡなどのうちの２以上の要素の組み合わせであってもよい。 The processor 101 is hardware that controls information processing of the CM 100. The processor 101 may be a multiprocessor. The processor 101 is, for example, a CPU, DSP, ASIC, or FPGA. The processor 101 may be a combination of two or more elements of CPU, DSP, ASIC, FPGA, and the like.

ＲＡＭ１０２は、ＣＭ１００の主記憶装置である。ＲＡＭ１０２は、揮発性の半導体メモリである。ＲＡＭ１０２として、例えば、ＳＲＡＭ（Static RAM）やＤＲＡＭ（Dynamic RAM）などが用いられる。ＲＡＭ１０２は、プロセッサ１０１に実行させるＯＳやファームウェアのプログラムの少なくとも一部を一時的に記憶する。また、ＲＡＭ１０２は、プロセッサ１０１による処理に用いられる各種データを記憶する。 The RAM 102 is a main storage device of the CM 100. The RAM 102 is a volatile semiconductor memory. For example, SRAM (Static RAM) or DRAM (Dynamic RAM) is used as the RAM 102. The RAM 102 temporarily stores at least part of an OS or firmware program to be executed by the processor 101. The RAM 102 stores various data used for processing by the processor 101.

ＮＡ１０３は、ＬＡＮ４０を介してサポートサーバ６０と通信する通信インタフェースである。ＮＡ１０３として、例えばイーサネット（登録商標）のインタフェースを用いることができる。 The NA 103 is a communication interface that communicates with the support server 60 via the LAN 40. As the NA 103, for example, an Ethernet (registered trademark) interface can be used.

ＣＡ１０４は、ＳＡＮ５０を介して業務サーバ８０と通信する通信インタフェースである。ＣＡ１０４は、業務サーバ８０からＤＥ２１，２２へのブロックアクセスに用いられる。ＣＡ１０４として、例えばＦＣ（Fibre Channel）のインタフェースを用いることができる。ＣＡ１０４として、ＦＣ以外のインタフェース（例えば、ＳＡＳ（Serial Attached SCSI、ＳＣＳＩはSmall Computer System Interfaceの略）など）が用いられることもある。 The CA 104 is a communication interface that communicates with the business server 80 via the SAN 50. The CA 104 is used for block access from the business server 80 to the DEs 21 and 22. As the CA 104, for example, an FC (Fibre Channel) interface can be used. An interface other than FC (for example, SAS (Serial Attached SCSI; SCSI is an abbreviation for Small Computer System Interface)) may be used as the CA 104.

ＮＴＢ１０５は、ＦＥ３０と接続する通信インタフェースである。ＮＴＢ１０５は、ＦＥ３０を介して、ＣＭ２００，３００，４００と通信する。
ＢＵＤ１０６は、ＣＭ１００の補助記憶装置である。ＢＵＤ１０６は、不揮発性の半導体メモリである。例えば、ＢＵＤ１０６として、ＳＳＤが用いられる。ＢＵＤ１０６は、ＯＳやファームウェアを含むプログラムや各種データなどを記憶する。ＢＵＤ１０６は、ＣＭ１００において動作するハードウェアやソフトウェアなどのコンポーネントにより出力されたログの保存にも用いられる。 The NTB 105 is a communication interface connected to the FE 30. The NTB 105 communicates with the CMs 200, 300, and 400 via the FE 30.
The BUD 106 is an auxiliary storage device of the CM 100. The BUD 106 is a nonvolatile semiconductor memory. For example, an SSD is used as the BUD 106. The BUD 106 stores programs including OS and firmware, various data, and the like. The BUD 106 is also used for storing logs output by components such as hardware and software that operate in the CM 100.

ＤＩ１０７は、ＤＥ２１，２２と通信するためのインタフェースである。例えば、ＤＩ１０７として、ＳＡＳなどのインタフェースを用いることができる。
ＣＭ−ＩＦ１０８は、ＣＭ２００と接続するためのインタフェースである。ＣＭ１００は、ＣＭ−ＩＦ１０８を用いて、ＣＭ２００と連携してデータアクセスを行える。例えば、ＣＭ１００を運用系、ＣＭ２００を待機系としてもよい。あるいは、ＣＭ１００，２００の両方を運用系として、データアクセスを分散して行ってもよい。何れの場合も、一方の故障時に他方でデータアクセスを引き継ぐことができ、ユーザの業務が停止されることを防げる。 The DI 107 is an interface for communicating with the DEs 21 and 22. For example, an interface such as SAS can be used as the DI 107.
The CM-IF 108 is an interface for connecting to the CM 200. The CM 100 can perform data access in cooperation with the CM 200 using the CM-IF 108. For example, CM 100 may be the active system and CM 200 may be the standby system. Alternatively, the data access may be distributed by using both the CMs 100 and 200 as the active system. In either case, data access can be taken over by the other in the event of a failure, and the user's business can be prevented from being stopped.

媒体リーダ１０９は、記録媒体９１に記憶されたプログラムやデータを読み取る装置である。記録媒体９１として、例えば、フラッシュメモリカードなどの不揮発性の半導体メモリを使用することができる。媒体リーダ１０９は、例えば、プロセッサ１０１からの命令に従って、記録媒体９１から読み取ったプログラムやデータを、ＲＡＭ１０２やＢＵＤ１０６に格納することもできる。 The medium reader 109 is a device that reads programs and data stored in the recording medium 91. As the recording medium 91, for example, a non-volatile semiconductor memory such as a flash memory card can be used. For example, the medium reader 109 can store the program and data read from the recording medium 91 in the RAM 102 and the BUD 106 in accordance with an instruction from the processor 101.

図５は、第２の実施の形態のＣＭの機能例を示す図である。ＣＭ１００は、記憶部１１０、メッセージ生成部１２０、通知制御部１３０、ログ収集部１４０およびログ抽出部１５０を有する。 FIG. 5 is a diagram illustrating a function example of the CM according to the second embodiment. The CM 100 includes a storage unit 110, a message generation unit 120, a notification control unit 130, a log collection unit 140, and a log extraction unit 150.

記憶部１１０は、ＲＡＭ１０２やＢＵＤ１０６の記憶領域を用いて実現される。
メッセージ生成部１２０、通知制御部１３０、ログ収集部１４０およびログ抽出部１５０は、プロセッサ１０１によって実現される。例えば、プロセッサ１０１は、ＲＡＭ１０２に記憶されたプログラムを実行することで、メッセージ生成部１２０、通知制御部１３０、ログ収集部１４０およびログ抽出部１５０の機能を発揮してもよい。あるいは、メッセージ生成部１２０、通知制御部１３０、ログ収集部１４０およびログ抽出部１５０は、ＦＰＧＡやＡＳＩＣなどのハードワイヤードロジックにより実現されてもよい。 The storage unit 110 is realized using a storage area of the RAM 102 or the BUD 106.
The message generation unit 120, the notification control unit 130, the log collection unit 140, and the log extraction unit 150 are realized by the processor 101. For example, the processor 101 may exhibit the functions of the message generation unit 120, the notification control unit 130, the log collection unit 140, and the log extraction unit 150 by executing a program stored in the RAM 102. Alternatively, the message generation unit 120, the notification control unit 130, the log collection unit 140, and the log extraction unit 150 may be realized by a hard wired logic such as an FPGA or an ASIC.

記憶部１１０は、ＣＭ１００におけるハードウェア（ＤＥ２１，２２を含む）、ＯＳ、ミドルウェアおよびアプリケーションなどのコンポーネントの動作に関するログを記憶する。また、記憶部１１０は、ＣＭ１００，２００，３００，４００のコンポーネントにより生成されるメッセージに対して、ログの抽出に用いられる管理情報を予め記憶する。管理情報は、ＣＭ１００，２００，３００，４００それぞれで抽出するログの上限サイズ（抽出量の上限値）を決定するための情報を含む。また、管理情報は、抽出候補とする時間範囲や抽出候補とするログの分類（ログタイプ）の優先レベルの情報を含む。 The storage unit 110 stores a log regarding the operation of components such as hardware (including the DEs 21 and 22), the OS, middleware, and applications in the CM 100. Further, the storage unit 110 stores in advance management information used for log extraction for messages generated by the components of the CMs 100, 200, 300, and 400. The management information includes information for determining the upper limit size (the upper limit value of the extraction amount) of the log extracted by each of the CMs 100, 200, 300, and 400. Further, the management information includes information on a priority range of a time range as an extraction candidate and a log classification (log type) as an extraction candidate.

メッセージ生成部１２０は、通知制御部１３０に対して障害の発生を示すメッセージを通知する。メッセージ生成部１２０は、ＣＭ１００のコンポーネントにおける障害通知用のモジュールでもよい。また、メッセージ生成部１２０は、障害発生時以外にも、当該コンポーネントの動作に関するログを記憶部１１０に格納してもよい。 The message generation unit 120 notifies the notification control unit 130 of a message indicating the occurrence of a failure. The message generation unit 120 may be a failure notification module in the CM 100 component. Further, the message generation unit 120 may store a log related to the operation of the component in the storage unit 110 in addition to when a failure occurs.

通知制御部１３０は、メッセージ生成部１２０およびＣＭ２００，３００，４００によるメッセージの通知を監視する。通知制御部１３０は、メッセージ生成部１２０およびＣＭ２００，３００，４００の何れかにより通知されたメッセージを取得すると、取得したメッセージをサポートサーバ６０に送信する。通知制御部１３０は、一定時間後に、ログ収集部１４０に対して、障害調査用のログ収集を依頼する。ここで、「一定時間」は、例えば、障害事象に応じた後処理を実行するための時間である。障害事象に応じた後処理の一例としては、異常部品の切り離しや再組み込みなどが挙げられる。 The notification control unit 130 monitors message notifications by the message generation unit 120 and the CMs 200, 300, and 400. When the notification control unit 130 acquires the message notified by any of the message generation unit 120 and the CMs 200, 300, and 400, the notification control unit 130 transmits the acquired message to the support server 60. The notification control unit 130 requests the log collection unit 140 to collect a log for failure investigation after a certain time. Here, the “certain time” is, for example, a time for executing post-processing according to a failure event. As an example of post-processing according to a failure event, separation or reassembly of an abnormal part can be cited.

通知制御部１３０は、ログ収集部１４０からログ収集結果を取得する。通知制御部１３０は、取得したログ収集結果をサポートサーバ６０に送信する。ここで、サポートサーバ６０へのログ収集結果の送信可能なサイズには上限が設けられる。例えば、サポートサーバ６０へのログ収集結果の送信可能なサイズの上限は、１ＭＢ（Mega Bytes）である。 The notification control unit 130 acquires the log collection result from the log collection unit 140. The notification control unit 130 transmits the acquired log collection result to the support server 60. Here, an upper limit is set for the size of the log collection result that can be transmitted to the support server 60. For example, the upper limit of the transmittable size of the log collection result to the support server 60 is 1 MB (Mega Bytes).

ログ収集部１４０は、通知制御部１３０によるログ収集の依頼に応じて、記憶部１１０に記憶された管理情報を参照し、今回のメッセージに対してＣＭ１００，２００，３００，４００それぞれで抽出するログのサイズ（抽出量）の上限値を決定する。ログ収集部１４０は、今回のメッセージのメッセージＩＤと決定した上限とをログ抽出部１５０およびＣＭ２００，３００，４００に通知し、通知した抽出量の上限値でのログ抽出を指示する。 The log collection unit 140 refers to the management information stored in the storage unit 110 in response to a log collection request from the notification control unit 130, and logs extracted by the CMs 100, 200, 300, and 400 for the current message, respectively. The upper limit value of the size (extraction amount) is determined. The log collection unit 140 notifies the log extraction unit 150 and the CMs 200, 300, and 400 of the message ID of the current message and the determined upper limit, and instructs log extraction with the upper limit value of the notified extraction amount.

なお、ログ収集部１４０は、ＣＭ１００，２００，３００，４００それぞれにＣＭ番号と呼ばれる識別番号を付与している。ＣＭ１００のＣＭ番号は“１”である。ＣＭ２００のＣＭ番号は“２”である。ＣＭ３００のＣＭ番号は“３”である。ＣＭ４００のＣＭ番号は“４”である。 The log collection unit 140 assigns an identification number called a CM number to each of the CMs 100, 200, 300, and 400. The CM number of the CM 100 is “1”. The CM number of the CM 200 is “2”. The CM number of the CM 300 is “3”. The CM number of the CM 400 is “4”.

ログ収集部１４０は、ログ抽出部１５０およびＣＭ２００，３００，４００により抽出されたログのレコード群を取得し、取得したレコード群をログ収集結果として、通知制御部１３０に提供する。 The log collection unit 140 acquires a record group of logs extracted by the log extraction unit 150 and the CMs 200, 300, and 400, and provides the acquired record group to the notification control unit 130 as a log collection result.

ログ抽出部１５０は、ログ収集部１４０のログ抽出の指示に応じて、記憶部１１０に記憶されたログから障害調査用のレコードを抽出する。ログ抽出部１５０は、記憶部１１０に記憶された管理情報を参照し、今回のメッセージＩＤに対して、抽出候補とする時間範囲や抽出候補とするログタイプの優先レベルを特定する。ログ抽出部１５０は、レコードの抽出に、ログ収集部１４０により通知された抽出量の上限値、および、特定した時間範囲やログタイプの優先レベルの情報を用いる。ログ抽出部１５０は、抽出したレコードをログ収集部１４０に提供する。 The log extraction unit 150 extracts a failure investigation record from the log stored in the storage unit 110 in accordance with the log extraction instruction from the log collection unit 140. The log extraction unit 150 refers to the management information stored in the storage unit 110 and identifies the time range that is the extraction candidate and the priority level of the log type that is the extraction candidate for the current message ID. The log extraction unit 150 uses the upper limit value of the extraction amount notified by the log collection unit 140 and information on the priority level of the identified time range and log type for record extraction. The log extraction unit 150 provides the extracted record to the log collection unit 140.

ＣＭ２００は、記憶部２１０、メッセージ生成部２２０およびログ抽出部２３０を有する。記憶部２１０は、ＣＭ２００が備えるＲＡＭやＢＵＤの記憶領域を用いて実現される。メッセージ生成部２２０およびログ抽出部２３０は、ＣＭ２００が備えるプロセッサを用いて実現される。例えば、ＣＭ２００のプロセッサは、ＣＭ２００のＲＡＭに記憶されたプログラムを実行することで、メッセージ生成部２２０およびログ抽出部２３０の機能を発揮してもよい。あるいは、メッセージ生成部２２０およびログ抽出部２３０は、ＦＰＧＡやＡＳＩＣなどのハードワイヤードロジックにより実現されてもよい。 The CM 200 includes a storage unit 210, a message generation unit 220, and a log extraction unit 230. The storage unit 210 is realized using a RAM or BUD storage area included in the CM 200. The message generator 220 and the log extractor 230 are realized using a processor included in the CM 200. For example, the processor of the CM 200 may exhibit the functions of the message generation unit 220 and the log extraction unit 230 by executing a program stored in the CM 200 RAM. Alternatively, the message generation unit 220 and the log extraction unit 230 may be realized by hard wired logic such as FPGA or ASIC.

記憶部２１０は、ＣＭ２００におけるハードウェア、ＯＳ、ミドルウェアおよびアプリケーションなどのコンポーネントの動作に関するログを記憶する。
メッセージ生成部２２０は、通知制御部１３０に対して障害の発生を示すメッセージを通知する。メッセージ生成部２２０は、ＣＭ２００のコンポーネントにおける障害通知用のモジュールでもよい。また、メッセージ生成部２２０は、障害発生時以外にも、当該コンポーネントの動作に関するログを記憶部２１０に格納してもよい。 The storage unit 210 stores a log regarding the operation of components such as hardware, OS, middleware, and applications in the CM 200.
The message generation unit 220 notifies the notification control unit 130 of a message indicating the occurrence of a failure. The message generation unit 220 may be a failure notification module in the CM 200 component. Further, the message generation unit 220 may store a log related to the operation of the component in the storage unit 210 other than when a failure occurs.

ログ抽出部２３０は、ログ収集部１４０のログ抽出の指示に応じて、記憶部２１０に記憶されたログから障害調査用のレコードを抽出する。ログ抽出部２３０は、記憶部２１０に記憶された管理情報を参照し、今回のメッセージＩＤに対して、抽出候補とする時間範囲や抽出候補とするログタイプの優先レベルを特定する。ログ抽出部２３０は、レコードの抽出に、ログ収集部１４０により通知された抽出量の上限値、および、特定した時間範囲やログタイプの優先レベルの情報を用いる。ログ抽出部２３０は、抽出したレコードをログ収集部１４０に送信する。 The log extraction unit 230 extracts a record for failure investigation from the log stored in the storage unit 210 in accordance with the log extraction instruction from the log collection unit 140. The log extraction unit 230 refers to the management information stored in the storage unit 210 and identifies the time range that is the extraction candidate and the priority level of the log type that is the extraction candidate for the current message ID. The log extraction unit 230 uses the upper limit value of the extraction amount notified by the log collection unit 140 and information on the priority level of the identified time range and log type for record extraction. The log extraction unit 230 transmits the extracted record to the log collection unit 140.

ＣＭ３００は、記憶部３１０、メッセージ生成部３２０およびログ抽出部３３０を有する。記憶部３１０は、ＣＭ３００が備えるＲＡＭやＢＵＤの記憶領域を用いて実現される。メッセージ生成部３２０およびログ抽出部３３０は、ＣＭ３００が備えるプロセッサを用いて実現される。例えば、ＣＭ３００のプロセッサは、ＣＭ３００のＲＡＭに記憶されたプログラムを実行することで、メッセージ生成部３２０およびログ抽出部３３０の機能を発揮してもよい。あるいは、メッセージ生成部３２０およびログ抽出部３３０は、ＦＰＧＡやＡＳＩＣなどのハードワイヤードロジックにより実現されてもよい。 The CM 300 includes a storage unit 310, a message generation unit 320, and a log extraction unit 330. The storage unit 310 is realized using a RAM or BUD storage area included in the CM 300. The message generation unit 320 and the log extraction unit 330 are realized using a processor included in the CM 300. For example, the processor of the CM 300 may exhibit the functions of the message generation unit 320 and the log extraction unit 330 by executing a program stored in the RAM of the CM 300. Alternatively, the message generation unit 320 and the log extraction unit 330 may be realized by hard wired logic such as FPGA or ASIC.

記憶部３１０は、ＣＭ３００におけるハードウェア、ＯＳ、ミドルウェアおよびアプリケーションなどのコンポーネントの動作に関するログを記憶する。
メッセージ生成部３２０は、通知制御部１３０に対して障害の発生を示すメッセージを通知する。メッセージ生成部３２０は、ＣＭ３００のコンポーネントにおける障害通知用のモジュールでもよい。また、メッセージ生成部３２０は、障害発生時以外にも、当該コンポーネントの動作に関するログを記憶部３１０に格納してもよい。 The storage unit 310 stores a log related to the operation of components such as hardware, OS, middleware, and applications in the CM 300.
The message generation unit 320 notifies the notification control unit 130 of a message indicating the occurrence of a failure. The message generation unit 320 may be a failure notification module in the CM 300 component. Further, the message generation unit 320 may store a log related to the operation of the component in the storage unit 310 other than when a failure occurs.

ログ抽出部３３０は、ログ収集部１４０のログ抽出の指示に応じて、記憶部３１０に記憶されたログから障害調査用のレコードを抽出する。ログ抽出部３３０は、記憶部３１０に記憶された管理情報を参照し、今回のメッセージＩＤに対して、抽出候補とする時間範囲や抽出候補とするログタイプの優先レベルを特定する。ログ抽出部３３０は、レコードの抽出に、ログ収集部１４０により通知された抽出量の上限値、および、特定した時間範囲やログタイプの優先レベルの情報を用いる。ログ抽出部３３０は、抽出したレコードをログ収集部１４０に送信する。 The log extraction unit 330 extracts a failure investigation record from the log stored in the storage unit 310 in response to a log extraction instruction from the log collection unit 140. The log extraction unit 330 refers to the management information stored in the storage unit 310 and identifies the time range that is the extraction candidate and the priority level of the log type that is the extraction candidate for the current message ID. The log extraction unit 330 uses the upper limit value of the extraction amount notified by the log collection unit 140 and information on the specified time range and priority level of the log type for record extraction. The log extraction unit 330 transmits the extracted record to the log collection unit 140.

ＣＭ４００は、記憶部４１０、メッセージ生成部４２０およびログ抽出部４３０を有する。記憶部４１０は、ＣＭ４００が備えるＲＡＭやＢＵＤの記憶領域を用いて実現される。メッセージ生成部４２０およびログ抽出部４３０は、ＣＭ４００が備えるプロセッサを用いて実現される。例えば、ＣＭ４００のプロセッサは、ＣＭ４００のＲＡＭに記憶されたプログラムを実行することで、メッセージ生成部４２０およびログ抽出部４３０の機能を発揮してもよい。あるいは、メッセージ生成部４２０およびログ抽出部４３０は、ＦＰＧＡやＡＳＩＣなどのハードワイヤードロジックにより実現されてもよい。 The CM 400 includes a storage unit 410, a message generation unit 420, and a log extraction unit 430. The storage unit 410 is realized using a RAM or BUD storage area included in the CM 400. The message generation unit 420 and the log extraction unit 430 are realized using a processor included in the CM 400. For example, the processor of the CM 400 may exhibit the functions of the message generation unit 420 and the log extraction unit 430 by executing a program stored in the RAM of the CM 400. Alternatively, the message generation unit 420 and the log extraction unit 430 may be realized by hard wired logic such as FPGA or ASIC.

記憶部４１０は、ＣＭ４００におけるハードウェア、ＯＳ、ミドルウェアおよびアプリケーションなどのコンポーネントの動作に関するログを記憶する。
メッセージ生成部４２０は、通知制御部１３０に対して障害の発生を示すメッセージを通知する。メッセージ生成部４２０は、ＣＭ４００のコンポーネントにおける障害通知用のモジュールでもよい。また、メッセージ生成部４２０は、障害発生時以外にも、当該コンポーネントの動作に関するログを記憶部４１０に格納してもよい。 The storage unit 410 stores a log regarding the operation of components such as hardware, OS, middleware, and applications in the CM 400.
The message generation unit 420 notifies the notification control unit 130 of a message indicating the occurrence of a failure. The message generation unit 420 may be a failure notification module in the CM 400 component. Further, the message generation unit 420 may store a log related to the operation of the component in the storage unit 410 other than when a failure occurs.

ログ抽出部４３０は、ログ収集部１４０のログ抽出の指示に応じて、記憶部４１０に記憶されたログから障害調査用のレコードを抽出する。ログ抽出部４３０は、記憶部４１０に記憶された管理情報を参照し、今回のメッセージＩＤに対して、抽出候補とする時間範囲や抽出候補とするログタイプの優先レベルを特定する。ログ抽出部４３０は、レコードの抽出に、ログ収集部１４０により通知された抽出量の上限値、および、特定した時間範囲やログタイプの優先レベルの情報を用いる。ログ抽出部４３０は、抽出したレコードをログ収集部１４０に送信する。 The log extraction unit 430 extracts a failure investigation record from the log stored in the storage unit 410 in response to a log extraction instruction from the log collection unit 140. The log extraction unit 430 refers to the management information stored in the storage unit 410 and identifies the time range that is the extraction candidate and the priority level of the log type that is the extraction candidate for the current message ID. The log extraction unit 430 uses the upper limit value of the extraction amount notified by the log collection unit 140 and information on the priority level of the identified time range and log type for record extraction. The log extraction unit 430 transmits the extracted record to the log collection unit 140.

図６は、第２の実施の形態のページの例を示す図である。ページＰ１は、ログのレコード（ログレコード）の集合である。ページＰ１のサイズは、固定サイズである。ページＰ１のサイズは、例えば、６４ＫＢ（Kilo Bytes）である。１つのページＰ１に含まれるレコードの数は、１つでもよいし、２以上でもよい。ページＰ１に含まれるレコードの数が１つの場合、ページＰ１とレコードとは同義である。ページＰ１の例では、ページＰ１の３行目以降の１行が１つのレコードである。 FIG. 6 is a diagram illustrating an example of a page according to the second embodiment. The page P1 is a set of log records (log records). The size of the page P1 is a fixed size. The size of the page P1 is, for example, 64 KB (Kilo Bytes). The number of records included in one page P1 may be one or two or more. When the number of records included in page P1 is one, page P1 and the record are synonymous. In the example of page P1, one line after the third line of page P1 is one record.

例えば、１つのレコードは、タイムスタンプ（time stamp）、ログタイプ（log type）、モジュール（module）、ログテキスト（log text）のフィールドを含む。
タイムスタンプは、レコードが記録された日時（年月日時分秒）である。ログタイプは、ログの種別である。例えば、ログタイプとして、発行元のハードウェアやソフトウェアおよび障害の内容などに応じて種々の種別が予め定められる。モジュールは、レコードの発行元のモジュール（例えば、ハードウェアやソフトウェアなどのコンポーネントにおける構成部品）の識別名である。ログテキストは、コンポーネントの動作に関するログの具体的な内容を示す情報である。 For example, one record includes fields of time stamp, log type, module, and log text.
The timestamp is the date and time (year / month / day / hour / minute / second) when the record was recorded. The log type is a log type. For example, as the log type, various types are determined in advance according to the hardware and software of the issuer and the content of the failure. The module is an identification name of the module that issued the record (for example, a component in a component such as hardware or software). The log text is information indicating specific contents of the log regarding the operation of the component.

例えば、ページＰ１には、タイムスタンプ“２０１７／６／３０１８：００：００”、ログタイプ“ｔｙｐｅ１”、モジュール“Ｍ１”、ログテキスト“ｆａｕｌｔｘｘｘｘｘｘ”というレコードが登録されている。このレコードは、２０１７年６月３０日１８時００分００秒に、ログタイプが“ｔｙｐｅ１”、発行元のモジュールが“Ｍ１”、ログテキスト“ｆａｕｌｔｘｘｘｘｘｘ”という情報が記録されたことを示す。 For example, a record of a time stamp “2017/6/30 18:00:00”, a log type “type1”, a module “M1”, and a log text “fault xxxxxxx” is registered in the page P1. This record indicates that information of the log type “type 1”, the issuing module “M1”, and the log text “fault xxxxxxx” was recorded on June 30, 2017 at 18:00:00.

各ＣＭは、ログタイプ毎に、時系列のリスト構造により複数のページを管理する。ページのリスト構造は、各ページを時系列にリンクしたデータ構造である。例えば、あるページの時刻は、当該ページに属するレコードのうちの最も古い時刻（例えば、ページＰ１であれば“２０１７／６／３０１８：００：００”）である。１つのログタイプに関する一連のページを、ページリストと呼ぶこととする。 Each CM manages a plurality of pages with a time-series list structure for each log type. The page list structure is a data structure in which each page is linked in time series. For example, the time of a certain page is the oldest time of records belonging to the page (for example, “2017/6/30 18:00:00” for page P1). A series of pages related to one log type will be referred to as a page list.

図７は、第２の実施の形態のページリストの例を示す図である。ページリストＺ１は、ログタイプ“ｔｙｐｅ１”に関するログである。ページリストＺ１は、ページＡ１，Ａ２，Ａ３，Ａ４，Ａ５，Ａ６，Ａ７を含む。ページリストＺ１に属する各ページのうち、ページＡ１が最も古く、ページＡ２，Ａ３，Ａ４，Ａ５，Ａ６の順に新しくなり、ページＡ７が最も新しい。ここで、図中、古いページほど上側に、新しいページほど下側に記載する。すなわち、図面の上側から下側へ向かう方向が時系列の正方向である。ページリストＺ１のうち、ページＡ１は、ｔｏｐ（最古）である。ページリストＺ１のうち、ページＡ７は、ｂｏｔｔｏｍ（最新）である。 FIG. 7 is a diagram illustrating an example of a page list according to the second embodiment. The page list Z1 is a log related to the log type “type1”. The page list Z1 includes pages A1, A2, A3, A4, A5, A6, and A7. Of the pages belonging to the page list Z1, the page A1 is the oldest, the pages A2, A3, A4, A5, A6 are newest, and the page A7 is the newest. Here, in the figure, older pages are listed on the upper side, and newer pages are listed on the lower side. That is, the direction from the upper side to the lower side of the drawing is the time-series positive direction. Of the page list Z1, page A1 is top (oldest). Of page list Z1, page A7 is bottom (latest).

このように、ＣＭ１００におけるメッセージ生成部１２０などのログ生成機能は、複数のページを、ログタイプ毎に時系列にリンクさせる。そして、ログ抽出部１５０は、ページ間のリンクに基づき、各ログタイプのページの抽出順を決定する。ページには１以上のログレコードが含まれる。このため、ログ抽出部１５０は、ログレコード間のリンクに基づき、各ログタイプのログレコードの抽出順を決定するともいえる。このようなリスト構造によってログを管理することで、ログ抽出部１５０は、ログ抽出を高速に行える。 As described above, the log generation function such as the message generation unit 120 in the CM 100 links a plurality of pages in time series for each log type. Then, the log extraction unit 150 determines the extraction order of pages of each log type based on the link between pages. A page contains one or more log records. For this reason, it can be said that the log extraction unit 150 determines the extraction order of the log records of each log type based on the link between the log records. By managing logs with such a list structure, the log extraction unit 150 can perform log extraction at high speed.

図８は、第２の実施の形態の割り当て方式管理テーブルの例を示す図である。割り当て方式管理テーブル１１１は、各ＣＭで抽出するページのサイズの上限値を決定するために用いられる情報である。割り当て方式管理テーブル１１１は、記憶部１１０に予め記憶されている。割り当て方式管理テーブル１１１は、マスタＣＭにより用いられる情報であるが、記憶部２１０，３１０，４１０にも記憶されていてもよい。割り当て方式管理テーブル１１１は、メッセージＩＤおよび割り当て方式の項目を含む。 FIG. 8 is a diagram illustrating an example of an allocation method management table according to the second embodiment. The allocation method management table 111 is information used to determine the upper limit value of the page size extracted by each CM. The allocation method management table 111 is stored in the storage unit 110 in advance. The allocation method management table 111 is information used by the master CM, but may also be stored in the storage units 210, 310, and 410. The assignment method management table 111 includes items of message ID and assignment method.

メッセージＩＤの項目には、メッセージ生成部１２０（あるいは、他ＣＭのメッセージ生成部２２０，３２０，４２０）により生成されるメッセージに含まれ得るメッセージＩＤが登録される。割り当て方式の項目には、各ＣＭで抽出するページのサイズの決定方法（割り当て方式）の識別情報が登録される。 In the message ID item, a message ID that can be included in a message generated by the message generation unit 120 (or the message generation unit 220, 320, or 420 of another CM) is registered. In the allocation method item, identification information of a method for determining the size of a page (allocation method) to be extracted by each CM is registered.

ここで、一例では、割り当て方式を、割り当て方式Ａ，Ｂ，Ｃの３種類とする。
割り当て方式Ａは、標準の割り当て方式である。割り当て方式Ａでは、各ＣＭに対する割り当てサイズ（抽出量の上限値に相当）を同じにする。サポートサーバ６０に送信可能な収集ログのサイズの上限が１ＭＢで、ＣＭ数が４の場合、ＣＭ毎に２５６ＫＢを割り当てる。この場合、１つのＣＭは、抽出量の上限値２５６ＫＢまでページを抽出する。 Here, in one example, there are three types of allocation methods A, B, and C.
The allocation method A is a standard allocation method. In the allocation method A, the allocation size (corresponding to the upper limit value of the extraction amount) for each CM is the same. When the upper limit of the size of the collected log that can be transmitted to the support server 60 is 1 MB and the number of CMs is 4, 256 KB is allocated for each CM. In this case, one CM extracts pages up to the upper limit 256 KB of the extraction amount.

割り当て方式Ｂは、マスタＣＭ優先の割り当て方式である。割り当て方式Ｂでは、マスタＣＭに対する割り当てを、他ＣＭの２倍にする。マスタＣＭは、ストレージシステム全体を管理するＣＭであり、全体動作の調査を要する障害の場合に、割り当て方式Ｂを採用する。例えば、サポートサーバ６０に送信可能な収集ログのサイズの上限が１ＭＢで、ＣＭ数が４の場合、マスタＣＭの割り当てサイズは４１０ＫＢであり、他ＣＭの割り当てサイズは２０５ＫＢである。 The allocation method B is a master CM priority allocation method. In the allocation method B, the allocation to the master CM is made twice that of other CMs. The master CM is a CM that manages the entire storage system, and adopts the allocation method B in the case of a failure that requires an investigation of the overall operation. For example, when the upper limit of the size of the collection log that can be transmitted to the support server 60 is 1 MB and the number of CMs is 4, the allocation size of the master CM is 410 KB, and the allocation size of other CMs is 205 KB.

割り当て方式Ｃは、障害検出ＣＭ優先の割り当て方式である。割り当て方式Ｃでは、障害を検出したＣＭの割り当てを、他ＣＭの２倍にする。特定の機能に関する障害であり、当該機能の処理を行っていたＣＭの情報をより多く要する場合に、割り当て方式Ｃを採用する。例えば、サポートサーバ６０に送信可能な収集ログのサイズの上限が１ＭＢで、ＣＭ数が４の場合、障害検出ＣＭの割り当てサイズは４１０ＫＢであり、他ＣＭの割り当てサイズ２０５ＫＢである。 Allocation method C is a failure detection CM priority allocation method. In the allocation method C, the allocation of CMs in which a failure has been detected is twice that of other CMs. Allocation method C is employed when a failure is related to a specific function and more information on the CM that has been processing the function is required. For example, when the upper limit of the size of the collected log that can be transmitted to the support server 60 is 1 MB and the number of CMs is 4, the allocation size of the failure detection CM is 410 KB and the allocation size of other CMs is 205 KB.

例えば、割り当て方式管理テーブル１１１には、メッセージＩＤが“ａ０００００００１”、割り当て方式が“Ａ（標準）”という情報が登録される。これは、メッセージＩＤ“ａ０００００００１”を含むメッセージが検出された場合に、割り当て方式Ａにより各ＣＭに対するサイズ割り当てを行うことを示す。 For example, information that the message ID is “a00000001” and the assignment method is “A (standard)” is registered in the assignment method management table 111. This indicates that when a message including the message ID “a00000001” is detected, size allocation is performed for each CM by the allocation method A.

図９は、第２の実施の形態のログ抽出管理テーブルの例を示す図である。ログ抽出管理テーブル１１２は、メッセージＩＤに応じたログ抽出対象の時間範囲およびログタイプ毎の優先レベルが登録された情報である。ログ抽出管理テーブル１１２は、記憶部１１０に予め記憶されている。ログ抽出管理テーブル１１２は、記憶部２１０，３１０，４１０にも予め記憶されている。ログ抽出管理テーブル１１２は、メッセージＩＤ、時間範囲およびログタイプの優先レベルの項目を含む。 FIG. 9 illustrates an example of a log extraction management table according to the second embodiment. The log extraction management table 112 is information in which a log extraction target time range corresponding to a message ID and a priority level for each log type are registered. The log extraction management table 112 is stored in the storage unit 110 in advance. The log extraction management table 112 is also stored in advance in the storage units 210, 310, and 410. The log extraction management table 112 includes items of message ID, time range, and log type priority level.

メッセージＩＤの項目には、メッセージ生成部１２０（あるいは、他ＣＭのメッセージ生成部２２０，３２０，４２０）により生成されるメッセージに含まれ得るメッセージＩＤが登録される。時間範囲の項目には、ログ抽出対象の時間範囲が登録される。当該時間範囲は、障害発生時から何時間前のログまでを抽出対象とするかを示す。すなわち、障害発生時から当該時間範囲の分だけ遡った時刻までがログ抽出対象の時間範囲である。時間範囲の単位は、例えば、時間（hour）である。ログタイプの優先レベルは、ログタイプ毎の優先レベルである。優先レベルは、レベル“１”が最も優先順位が高く、レベル“２”、“３”、・・・とレベルの数値が大きくなるほど、優先順位が低くなる。なお、優先レベル“０”は、抽出しないことを示す。また、優先レベルが同じ複数のログタイプについては、時刻（タイムスタンプ）が新しいページを優先して抽出する。 In the message ID item, a message ID that can be included in a message generated by the message generation unit 120 (or the message generation unit 220, 320, or 420 of another CM) is registered. In the time range item, a time range for log extraction is registered. The time range indicates how many hours before the failure occurs to be extracted. That is, the time range for log extraction is from the time of failure occurrence to the time that is back by the time range. The unit of the time range is, for example, time (hour). The log type priority level is a priority level for each log type. As for the priority level, the level “1” has the highest priority, and the higher the level “2”, “3”,..., The lower the priority. The priority level “0” indicates that no extraction is performed. For a plurality of log types having the same priority level, a page with a new time (time stamp) is extracted with priority.

例えば、ログ抽出管理テーブル１１２には、メッセージＩＤが“ａ０００００００１”、時間範囲が“１２”、ログタイプ“ｔｙｐｅ１”の優先レベル“１”、ログタイプ“ｔｙｐｅ２”の優先レベル“１”、ログタイプ“ｔｙｐｅ３”の優先レベル“１”、ログタイプ“ｔｙｐｅ４”の優先レベル“１”、ログタイプ“ｔｙｐｅ５”の優先レベル“１”、・・・という情報が登録される。これは、メッセージＩＤ“ａ０００００００１”を含むメッセージが検出された場合、当該検出時（障害発生時）から１２時間前に遡った時刻までをログ抽出対象の時間範囲とすることを示す。また、各ログタイプの優先レベルにしたがって、ログ抽出を行うことを示す（この場合、ログタイプ“ｔｙｐｅ１”〜“ｔｙｐｅ５”までの優先レベルは“１”で同じである）。なお、各ログタイプの優先レベルにしたがったログ抽出方法の具体例は後述される。 For example, in the log extraction management table 112, the message ID is “a00000001”, the time range is “12”, the priority level is “1” for the log type “type1”, the priority level is “1” for the log type “type2”, and the log type. Information such as a priority level “1” of “type 3”, a priority level “1” of the log type “type 4”, a priority level “1” of the log type “type 5”, and the like are registered. This indicates that when a message including the message ID “a00000001” is detected, the time range from the time of the detection (at the time of the failure) to the time that goes back 12 hours is set as the log extraction target time range. It also indicates that log extraction is performed according to the priority level of each log type (in this case, the priority levels of log types “type1” to “type5” are the same as “1”). A specific example of the log extraction method according to the priority level of each log type will be described later.

図１０は、第２の実施の形態のログ収集例を示す図である。マスタＣＭであるＣＭ１００は、ＣＭ１００，２００，３００，４００の何れかから障害に関する所定のメッセージを受け付けると、割り当て方式管理テーブル１１１に基づいて、各ＣＭのログの抽出量の上限値を決定する。また、ＣＭ１００は、ログ抽出管理テーブル１１２に基づいて、ログ抽出対象の時間範囲およびログタイプ毎の優先レベルを決定する。ＣＭ１００は、決定した上限値、時間範囲および優先レベルによるログ抽出を、ＣＭ２００，３００，４００に指示する。また、ＣＭ１００は、自装置においてもログ抽出を行う。 FIG. 10 is a diagram illustrating an example of log collection according to the second embodiment. When the CM 100 as the master CM receives a predetermined message regarding a failure from any one of the CMs 100, 200, 300, and 400, the CM 100 determines an upper limit value of the log extraction amount of each CM based on the allocation method management table 111. Further, the CM 100 determines a priority level for each log type and a time range for log extraction based on the log extraction management table 112. The CM 100 instructs the CMs 200, 300, and 400 to perform log extraction based on the determined upper limit value, time range, and priority level. Further, the CM 100 also performs log extraction in its own device.

例えば、抽出ログＬ１は、ＣＭ１００においてＢＵＤ１０６に記憶されたログから抽出されたページ群である。抽出ログＬ２は、ＣＭ２００においてＢＵＤ２０６に記憶されたログから抽出されたページ群である。抽出ログＬ３は、ＣＭ３００においてＢＵＤ３０６に記憶されたログから抽出されたページ群である。抽出ログＬ４は、ＣＭ４００においてＢＵＤ４０６に記憶されたログから抽出されたページ群である。 For example, the extracted log L1 is a page group extracted from a log stored in the BUD 106 in the CM 100. The extracted log L2 is a page group extracted from the log stored in the BUD 206 in the CM 200. The extracted log L3 is a group of pages extracted from the log stored in the BUD 306 in the CM 300. The extracted log L4 is a group of pages extracted from the log stored in the BUD 406 in the CM 400.

ＣＭ１００は、抽出ログＬ１，Ｌ２，Ｌ３，Ｌ４を収集する。収集ログＬ０は、抽出ログＬ１，Ｌ２，Ｌ３，Ｌ４の収集結果である。ＣＭ１００は、ＬＡＮ４０およびインターネット５０を介して、サポートサーバ６０に収集ログＬ０を送信する。 The CM 100 collects the extraction logs L1, L2, L3, and L4. The collection log L0 is a collection result of the extraction logs L1, L2, L3, and L4. The CM 100 transmits the collection log L0 to the support server 60 via the LAN 40 and the Internet 50.

次に、上記の各ＣＭによる処理手順を具体的に説明する。
図１１は、第２の実施の形態のログ収集例を示すフローチャートである。以下、図１１に示す処理をステップ番号に沿って説明する。ログ収集部１４０は、通知制御部１３０から障害通知のメッセージの検出結果を受け付けると下記の手順を行う。 Next, the processing procedure by each CM will be specifically described.
FIG. 11 is a flowchart illustrating an example of log collection according to the second embodiment. In the following, the process illustrated in FIG. 11 will be described in order of step number. When the log collection unit 140 receives the detection result of the failure notification message from the notification control unit 130, the log collection unit 140 performs the following procedure.

（Ｓ１１）ログ収集部１４０は、記憶部１１０に記憶された割り当て方式管理テーブル１１１から、今回のメッセージに対応する割り当て方式を取得する。具体的には、ログ収集部１４０は、障害通知のメッセージに含まれるメッセージＩＤに対応する割り当て方式を、割り当て方式管理テーブル１１１から取得する。 (S11) The log collection unit 140 acquires an allocation method corresponding to the current message from the allocation method management table 111 stored in the storage unit 110. Specifically, the log collection unit 140 acquires an assignment method corresponding to the message ID included in the failure notification message from the assignment method management table 111.

（Ｓ１２）ログ収集部１４０は、ステップＳ１１で取得した割り当て方式にしたがって、各ＣＭのログ抽出量の上限値を計算する。
（Ｓ１３）ログ収集部１４０は、ＣＭ番号Ｎを、Ｎ＝０に設定する。ログ収集部１４０は、ＣＭ番号Ｎ＝０に対応するＣＭ１００のログ抽出部１５０にログ抽出を指示する。ログ抽出の指示は、ログ抽出量の上限値を含む。 (S12) The log collection unit 140 calculates the upper limit value of the log extraction amount of each CM according to the allocation method acquired in step S11.
(S13) The log collection unit 140 sets the CM number N to N = 0. The log collection unit 140 instructs the log extraction unit 150 of the CM 100 corresponding to the CM number N = 0 to perform log extraction. The log extraction instruction includes an upper limit value of the log extraction amount.

（Ｓ１４）ログ抽出部１５０，２３０，３３０，４３０は、ログ収集部１４０のログ抽出の指示に応じて、ＣＭ単位のログ抽出処理を行う。ＣＭ単位のログ抽出処理の詳細は後述される。 (S14) The log extraction units 150, 230, 330, and 430 perform CM-unit log extraction processing in accordance with the log extraction instruction from the log collection unit 140. Details of the CM-unit log extraction processing will be described later.

（Ｓ１５）ログ収集部１４０は、全ＣＭ（ＣＭ１００，２００，３００，４００）のログ（抽出ログ）を収集済であるか否かを判定する。全ＣＭのログを収集済である場合、ログ収集部１４０は、ステップＳ１７に処理を進める。全ＣＭのログを収集済でない場合、ログ収集部１４０は、ステップＳ１６に処理を進める。 (S15) The log collection unit 140 determines whether logs (extraction logs) of all CMs (CMs 100, 200, 300, 400) have been collected. If all CM logs have been collected, the log collection unit 140 proceeds to step S17. If the logs of all CMs have not been collected, the log collection unit 140 proceeds to step S16.

（Ｓ１６）ログ収集部１４０は、ＣＭ番号Ｎを、Ｎ＝Ｎ＋１に設定する（ＣＭ番号をインクリメントする）。そして、ログ収集部１４０は、ＣＭ番号Ｎに対応するＣＭのログ抽出部（ログ抽出部２３０，３３０，４３０の何れか）に対してログ抽出を指示して、ステップＳ１４に処理を進める。 (S16) The log collection unit 140 sets the CM number N to N = N + 1 (increments the CM number). Then, the log collection unit 140 instructs the log extraction unit (any one of the log extraction units 230, 330, and 430) of the CM corresponding to the CM number N to perform log extraction, and proceeds to step S14.

（Ｓ１７）ログ収集部１４０は、通知制御部１３０に収集ログを提供する。収集ログは、各ＣＭから収集された抽出された抽出ログの集合である。通知制御部１３０は、サポートサーバ６０に収集ログを送信する。 (S17) The log collection unit 140 provides the collection log to the notification control unit 130. The collection log is a set of extracted extraction logs collected from each CM. The notification control unit 130 transmits the collection log to the support server 60.

このように、記憶部１１０は、抽出するページの合計サイズの上限値の算出方法（割り当て方式）をメッセージ毎に登録した割り当て方式管理テーブル１１１を記憶する。ログ収集部１４０は、メッセージを検出すると、記憶部１１０に記憶された割り当て方式管理テーブル１１１を参照して、当該メッセージに応じた算出方法に基づき、上限値を算出する。特に、ログ収集部１４０は、メッセージに応じて、複数のＣＭ（ＣＭ１００，２００，３００，４００）それぞれによるＣＭ毎（情報処理装置毎）のログ（動作情報）からのページの抽出を指示する。ログ収集部１４０は、ページの抽出を指示する際に、メッセージに応じた算出方法に基づき、ＣＭ毎の抽出ログのサイズの上限値を決定し、決定した上限値を、各ＣＭに通知する。これにより、障害に応じて、障害解析に有用なログを収集可能となる。 As described above, the storage unit 110 stores the allocation method management table 111 in which the calculation method (allocation method) of the upper limit value of the total size of pages to be extracted is registered for each message. When the log collection unit 140 detects a message, the log collection unit 140 refers to the allocation method management table 111 stored in the storage unit 110 and calculates an upper limit value based on a calculation method according to the message. In particular, the log collection unit 140 instructs extraction of a page from a log (operation information) for each CM (for each information processing apparatus) by each of a plurality of CMs (CMs 100, 200, 300, and 400) according to a message. When instructing page extraction, the log collection unit 140 determines the upper limit value of the size of the extracted log for each CM based on the calculation method according to the message, and notifies each CM of the determined upper limit value. Thereby, logs useful for failure analysis can be collected according to the failure.

図１２は、第２の実施の形態のＣＭ単位のログ抽出例を示すフローチャートである。以下、図１２に示す処理をステップ番号に沿って説明する。以下の手順は、図１１のステップＳ１４に相当する。ここで、以下の説明では、ログ抽出部１５０の処理手順を例示するが、ログ抽出部２３０，３３０，４３０も同様の処理手順となる。 FIG. 12 is a flowchart illustrating an example of CM-unit log extraction according to the second embodiment. In the following, the process illustrated in FIG. 12 will be described in order of step number. The following procedure corresponds to step S14 in FIG. Here, in the following description, the processing procedure of the log extraction unit 150 is illustrated, but the log extraction units 230, 330, and 430 also have the same processing procedure.

（Ｓ２１）ログ抽出部１５０は、記憶部１１０に記憶されたログ抽出管理テーブル１１２から、今回のメッセージに対応する時間範囲を取得する。具体的には、ログ抽出部１５０は、障害通知のメッセージに含まれるメッセージＩＤに対応する時間範囲を、ログ抽出管理テーブル１１２から取得する。 (S21) The log extraction unit 150 acquires a time range corresponding to the current message from the log extraction management table 112 stored in the storage unit 110. Specifically, the log extraction unit 150 acquires a time range corresponding to the message ID included in the failure notification message from the log extraction management table 112.

（Ｓ２２）ログ抽出部１５０は、取得した時間範囲内のログ抽出処理を実行する。時間範囲内のログ抽出処理の詳細は後述される。
（Ｓ２３）ログ抽出部１５０は、ステップＳ２２で抽出したログ（抽出ログ）をログ収集部１４０に提供する。 (S22) The log extraction unit 150 executes log extraction processing within the acquired time range. Details of the log extraction processing within the time range will be described later.
(S23) The log extraction unit 150 provides the log collection unit 140 with the log (extraction log) extracted in step S22.

図１３は、第２の実施の形態の時間範囲内のログ抽出例を示すフローチャートである。以下、図１３に示す処理をステップ番号に沿って説明する。以下の手順は、図１２のステップＳ２２に相当する。 FIG. 13 is a flowchart illustrating an example of log extraction within a time range according to the second embodiment. In the following, the process illustrated in FIG. 13 will be described in order of step number. The following procedure corresponds to step S22 in FIG.

（Ｓ３１）ログ抽出部１５０は、優先レベルＰを、Ｐ＝１に設定する。
（Ｓ３２）ログ抽出部１５０は、優先レベル単位のログ抽出処理を行う。優先レベル単位のログ抽出処理の詳細は、後述される。 (S31) The log extraction unit 150 sets the priority level P to P = 1.
(S32) The log extraction unit 150 performs log extraction processing in units of priority levels. Details of the log extraction processing in units of priority levels will be described later.

（Ｓ３３）ログ抽出部１５０は、ログ抽出部１５０による抽出ログの抽出量の合計が上限値に達したか否かを判定する。抽出量の合計が上限値に達した場合、ログ抽出部１５０は、処理を終了する。抽出量の合計が上限値に達していない場合、ログ抽出部１５０は、処理をステップＳ３４に進める。 (S33) The log extraction unit 150 determines whether the total extraction amount of the extracted logs by the log extraction unit 150 has reached the upper limit value. When the total of the extraction amounts reaches the upper limit value, the log extraction unit 150 ends the process. When the total of the extraction amounts has not reached the upper limit value, the log extraction unit 150 proceeds with the process to step S34.

（Ｓ３４）ログ抽出部１５０は、全優先レベルのページの抽出を行ったか否かを判定する。全優先レベルのページの抽出を行った場合、ログ抽出部１５０は、処理を終了する。全優先レベルのページの抽出を行っていない場合、ログ抽出部１５０は、ステップＳ３５に処理を進める。全優先レベルのページの抽出を行った場合とは、優先レベルＰの値が最高値（優先順位が最低であることに相当）に達した場合である。 (S34) The log extraction unit 150 determines whether pages of all priority levels have been extracted. When the pages of all priority levels are extracted, the log extraction unit 150 ends the process. If the extraction of all priority level pages has not been performed, the log extraction unit 150 proceeds to step S35. The case where the pages of all priority levels are extracted is when the value of the priority level P reaches the highest value (corresponding to the lowest priority order).

（Ｓ３５）ログ抽出部１５０は、優先レベルＰを、Ｐ＝Ｐ＋１に設定する（優先レベルＰをインクリメントする）。そして、ログ抽出部１５０は、ステップＳ３２に処理を進める。 (S35) The log extraction unit 150 sets the priority level P to P = P + 1 (increments the priority level P). Then, the log extraction unit 150 proceeds with the process to step S32.

図１４は、第２の実施の形態の優先レベル単位のログ抽出例を示すフローチャートである。以下、図１４に示す処理をステップ番号に沿って説明する。以下の手順は、図１３のステップＳ３２に相当する。 FIG. 14 is a flowchart illustrating an example of log extraction in priority level units according to the second embodiment. In the following, the process illustrated in FIG. 14 will be described in order of step number. The following procedure corresponds to step S32 in FIG.

（Ｓ４１）ログ抽出部１５０は、着目する優先レベルＰのログタイプのｂｏｔｔｏｍページ（最新のページ）のタイムスタンプを取得する。なお、優先レベルＰであるログタイプが複数の場合、複数のログタイプの各ｂｏｔｔｏｍページのうち、最新のタイムスタンプを取得する。 (S41) The log extraction unit 150 acquires the time stamp of the bottom page (latest page) of the log type of the priority level P of interest. When there are a plurality of log types having the priority level P, the latest time stamp is acquired from each bottom page of the plurality of log types.

（Ｓ４２）ログ抽出部１５０は、タイムスタンプが全て時間範囲外であるか否かを判定する。タイムスタンプが全て時間範囲外である場合、ログ抽出部１５０は、処理を終了する。タイムスタンプが全て時間範囲外でない場合、ログ抽出部１５０は、ステップＳ４３に処理を進める。タイムスタンプが全て時間範囲外である場合とは、ステップＳ４１で取得したタイムスタンプが、現時刻から当該時間範囲分だけ遡った時刻よりも過去の時刻を示している場合である。 (S42) The log extraction unit 150 determines whether all time stamps are out of the time range. When all the time stamps are out of the time range, the log extracting unit 150 ends the process. When all the time stamps are not out of the time range, the log extraction unit 150 proceeds with the process to step S43. The case where all the time stamps are out of the time range is a case where the time stamp acquired in step S41 indicates a time that is in the past from a time that is back by the time range from the current time.

（Ｓ４３）ログ抽出部１５０は、最新のタイムスタンプのページを抽出し、当該ページが属するページリストのリンクから当該ページを外す。
（Ｓ４４）ログ抽出部１５０は、抽出量の合計が上限値に達したか否かを判定する。抽出量の合計が上限値に達した場合、ログ抽出部１５０は、処理を終了する。抽出量の合計が上限値に達していない場合、ログ抽出部１５０は、ステップＳ４５に処理を進める。 (S43) The log extraction unit 150 extracts the page with the latest time stamp, and removes the page from the link of the page list to which the page belongs.
(S44) The log extraction unit 150 determines whether or not the total amount of extraction has reached the upper limit. When the total of the extraction amounts reaches the upper limit value, the log extraction unit 150 ends the process. When the total of the extraction amounts has not reached the upper limit value, the log extraction unit 150 proceeds with the process to step S45.

（Ｓ４５）ログ抽出部１５０は、着目する優先レベルＰのログタイプのページが残っているか否かを判定する。該当のログタイプのページが残っている場合、ログ抽出部１５０は、ステップＳ４１に処理を進める。該当のログタイプのページが残っていない場合、ログ抽出部１５０は、処理を終了する。 (S45) The log extraction unit 150 determines whether or not a log type page with a priority level P of interest remains. When the log type page remains, the log extraction unit 150 advances the process to step S41. If no page of the corresponding log type remains, the log extraction unit 150 ends the process.

このように、ログ抽出部１５０は、現時刻から過去の時間範囲に属するページ群（ログレコード群ともいえる）のうち、第１の優先レベルに対応する第１のページ（第１のログレコード）を、第１の優先レベルで示される優先順位よりも低い優先順位を示す第２の優先レベルに対応する第２のページ（第２のログレコード）よりも優先的に抽出する。これにより、限られたサイズの中で、抽出されるページ（ログレコード）を、障害解析に有用なページ（ログレコード）に適切に絞り込むことができる。 As described above, the log extraction unit 150 selects the first page (first log record) corresponding to the first priority level among the page groups (also referred to as log record groups) belonging to the past time range from the current time. Are extracted with higher priority than the second page (second log record) corresponding to the second priority level indicating a lower priority level than the priority level indicated by the first priority level. As a result, the pages (log records) to be extracted can be appropriately narrowed down to pages (log records) useful for failure analysis within a limited size.

次に、ログ抽出部１５０によるログ抽出の具体例を説明する。ログ抽出部１５０について主に説明するが、ログ抽出部２３０，３３０，４３０も同様にしてログ抽出を行う。
図１５は、第２の実施の形態のログ抽出例（その１）を示す図である。図１５の例では、あるメッセージに対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１１個分（例えば、１ページのサイズが６４ＫＢの場合、６４ＫＢ×１１＝７０４ＫＢ）である。ログ抽出の時間範囲はｘ時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”および“ｔｙｐｅ３”である。ログタイプ“ｔｙｐｅ１”、“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは何れも“１”である。 Next, a specific example of log extraction by the log extraction unit 150 will be described. Although the log extracting unit 150 will be mainly described, the log extracting units 230, 330, and 430 perform log extraction in the same manner.
FIG. 15 is a diagram illustrating a log extraction example (part 1) according to the second embodiment. In the example of FIG. 15, the following conditions are considered for log extraction for a certain message. The upper limit of the extraction amount is 11 pages (for example, when the size of one page is 64 KB, 64 KB × 11 = 704 KB). The time range for log extraction is x hours. The log types to be extracted are “type 1”, “type 2”, and “type 3”. The priority levels of the log types “type 1”, “type 2”, and “type 3” are all “1”.

また、ページリストＺ１は、ログタイプ“ｔｙｐｅ１”のページリストである。ページリストＺ１は、タイムスタンプの古い方から新しい方へ向かって、ページＡ１，Ａ２，Ａ３，Ａ４，Ａ５，Ａ６，Ａ７を含む。ページリストＺ２は、ログタイプ“ｔｙｐｅ２”のページリストである。ページリストＺ２は、タイムスタンプの古い方から新しい方へ向かって、ページＢ１，Ｂ２，Ｂ３，Ｂ４，Ｂ５，Ｂ６，Ｂ７を含む。ページリストＺ３は、ログタイプ“ｔｙｐｅ３”のページリストである。ページリストＺ３は、タイムスタンプの古い方から新しい方へ向かって、ページＣ１，Ｃ２，Ｃ３，Ｃ４，Ｃ５，Ｃ６，Ｃ７を含む。 The page list Z1 is a page list of the log type “type1”. The page list Z1 includes pages A1, A2, A3, A4, A5, A6, and A7 from the oldest time stamp to the newest time stamp. The page list Z2 is a page list of the log type “type2”. The page list Z2 includes pages B1, B2, B3, B4, B5, B6, and B7 from the oldest time stamp to the newest time stamp. The page list Z3 is a page list of the log type “type3”. The page list Z3 includes pages C1, C2, C3, C4, C5, C6, and C7 from the oldest time stamp to the newest time stamp.

この場合、メッセージの検出時（障害発生時）を現在とすると、現在からｘ時間前までがログ抽出対象の時間範囲である。図１５の例では、ページＡ３，Ｂ３，Ｃ３以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 In this case, assuming that the time when a message is detected (when a failure occurs) is present, the time range from the present to x hours before is the log extraction target time range. In the example of FIG. 15, the time stamps of pages A3, B3, and C3 and subsequent pages are included in the time range for log extraction.

ここで、図１５における各ページの左側に付した数字は、ログ抽出処理において該当のページが抽出される順番を示す（以降の図に関しても同様）。
上記のように、各ログタイプの優先レベルは“１”であり、ページリストＺ１，Ｚ２，Ｚ３に属する各ページのうちの最新のページＢ７は、現在からｘ時間前の時刻よりも後の時刻である。このため、ログ抽出部１５０は、ページＢ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ２からページＢ７を外す。あるページが、あるページリストから外されると当該ページは、当該ページリストに属するページではなくなる。 Here, the numbers attached to the left side of each page in FIG. 15 indicate the order in which the corresponding page is extracted in the log extraction process (the same applies to the following figures).
As described above, the priority level of each log type is “1”, and the latest page B7 among the pages belonging to the page lists Z1, Z2, and Z3 is a time later than the time x hours before the current time. It is. For this reason, the log extraction unit 150 extracts the page B7. Then, the log extraction unit 150 removes the page B7 from the page list Z2. When a page is removed from a page list, the page is no longer a page belonging to the page list.

以降の処理でも、ログ抽出部１５０は、抽出候補のページがｘ時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ１，Ｚ２，Ｚ３に属する各ページのうち、最新のページＡ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ７を外す。 Also in the subsequent processing, the log extraction unit 150 confirms that the extraction candidate page is a time later than the time before x hours.
Second, the log extraction unit 150 extracts the latest page A7 from the pages belonging to the page lists Z1, Z2, and Z3. Then, the log extraction unit 150 removes the page A7 from the page list Z1.

３番目に、ログ抽出部１５０は、ページリストＺ１，Ｚ２，Ｚ３に属する各ページのうち、最新のページＣ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ３からページＣ７を外す。 Third, the log extraction unit 150 extracts the latest page C7 from the pages belonging to the page lists Z1, Z2, and Z3. Then, the log extraction unit 150 removes the page C7 from the page list Z3.

４番目に、ログ抽出部１５０は、ページリストＺ１，Ｚ２，Ｚ３に属する各ページのうち、最新のページＡ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ６を外す。 Fourth, the log extraction unit 150 extracts the latest page A6 from the pages belonging to the page lists Z1, Z2, and Z3. Then, the log extraction unit 150 removes the page A6 from the page list Z1.

以降、同様にして、ログ抽出部１５０は、ページの抽出を行う。５番目に抽出されるページは、ページＢ６である。６番目に抽出されるページは、ページＣ６である。７番目に抽出されるページは、ページＣ５である。８番目に抽出されるページは、ページＢ５である。９番目に抽出されるページは、ページＡ５である。１０番目に抽出されるページは、ページＡ４である。１１番目に抽出されるページは、ページＢ４である。 Thereafter, similarly, the log extraction unit 150 performs page extraction. The fifth extracted page is page B6. The sixth page to be extracted is page C6. The seventh extracted page is page C5. The eighth extracted page is page B5. The ninth extracted page is page A5. The tenth page extracted is page A4. The eleventh extracted page is page B4.

ログ抽出部１５０は、ページＢ４を抽出すると、抽出量の上限値に達したことを検出して、ログ抽出を終了する。抽出ログＬ１ａは、上記の処理によってログ抽出部１５０により抽出されたページＢ７，Ａ７，Ｃ７，Ａ６，Ｂ６，Ｃ６，Ｃ５，Ｂ５，Ａ５，Ａ４，Ｂ４を含む。 When extracting the page B4, the log extraction unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction. The extracted log L1a includes pages B7, A7, C7, A6, B6, C6, C5, B5, A5, A4, and B4 extracted by the log extracting unit 150 by the above processing.

図１６は、第２の実施の形態のログ抽出例（その２）を示す図である。図１６の例では、あるメッセージに対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１１個分である。ログ抽出の時間範囲はｘ時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”および“ｔｙｐｅ３”である。ログタイプ“ｔｙｐｅ１”の優先レベルは“１”である。ログタイプ“ｔｙｐｅ２”の優先レベルは“２”である。ログタイプ“ｔｙｐｅ３”の優先レベルは“３”である。ページリストＺ１，Ｚ２，Ｚ３に属する各ページは、図１５と同様である。 FIG. 16 is a diagram illustrating a log extraction example (part 2) according to the second embodiment. In the example of FIG. 16, the following conditions are considered for log extraction for a certain message. The upper limit of the extraction amount is 11 pages. The time range for log extraction is x hours. The log types to be extracted are “type 1”, “type 2”, and “type 3”. The priority level of the log type “type1” is “1”. The priority level of the log type “type2” is “2”. The priority level of the log type “type 3” is “3”. Each page belonging to the page list Z1, Z2, Z3 is the same as that shown in FIG.

メッセージの検出時（障害発生時）を現在とすると、現在からｘ時間前までがログ抽出対象の時間範囲である。図１６の例では、ページＡ３，Ｂ３，Ｃ３以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 Assuming that the time when a message is detected (when a failure occurs) is the time range from the present to x hours before the log extraction target. In the example of FIG. 16, the time stamps in the pages after pages A3, B3, and C3 are included in the time range for log extraction.

最も優先順位の高いログタイプ“ｔｙｐｅ１”の最新のページＡ７は、現在からｘ時間前の時刻よりも後の時刻である。このため、ログ抽出部１５０は、ページＡ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ７を外す。 The latest page A7 of the log type “type1” with the highest priority is a time later than the time x hours before the present time. For this reason, the log extraction unit 150 extracts the page A7. Then, the log extraction unit 150 removes the page A7 from the page list Z1.

以降の処理でも、ログ抽出部１５０は、抽出候補のページがｘ時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ６を外す。 Also in the subsequent processing, the log extraction unit 150 confirms that the extraction candidate page is a time later than the time before x hours.
Second, the log extraction unit 150 extracts the latest page A6 from the pages belonging to the page list Z1. Then, the log extraction unit 150 removes the page A6 from the page list Z1.

３番目に、ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ５を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ５を外す。 Third, the log extraction unit 150 extracts the latest page A5 from the pages belonging to the page list Z1. Then, the log extraction unit 150 removes the page A5 from the page list Z1.

４番目に、ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ４を外す。 Fourth, the log extraction unit 150 extracts the latest page A4 from the pages belonging to the page list Z1. Then, the log extraction unit 150 removes the page A4 from the page list Z1.

５番目に、ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ１からページＡ３を外す。 Fifth, the log extraction unit 150 extracts the latest page A3 from the pages belonging to the page list Z1. Then, the log extraction unit 150 removes the page A3 from the page list Z1.

ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ２のタイムスタンプが、現在からｘ時間前の時刻よりも前の時刻を示すことを確認し、ページリストＺ１からのログ抽出を完了する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、次に優先順位の高いログタイプ“ｔｙｐｅ２”のページリストＺ２からのログ抽出に移る。 The log extraction unit 150 confirms that the time stamp of the latest page A2 among the pages belonging to the page list Z1 indicates a time before x hours before the current time, and logs from the page list Z1. Complete the extraction. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to log extraction from the page list Z2 of the log type “type 2” having the next highest priority.

６番目に、ログ抽出部１５０は、ページリストＺ２に属する各ページのうち、最新のページＢ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ２からページＢ７を外す。 Sixth, the log extraction unit 150 extracts the latest page B7 from the pages belonging to the page list Z2. Then, the log extraction unit 150 removes the page B7 from the page list Z2.

以降、同様にして、ログ抽出部１５０は、ページＢ６，Ｂ５，Ｂ４，Ｂ３をページリストＺ２から順番に抽出する。そして、ログ抽出部１５０は、ページリストＺ２に属する各ページのうち、最新のページＢ２のタイムスタンプが現在からｘ時間前の時刻よりも前の時刻を示すことを確認し、ページリストＺ２からのログ抽出を完了する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、次に優先順位の高いログタイプ“ｔｙｐｅ３”のページリストＺ３からのログ抽出に移る。 Thereafter, similarly, the log extraction unit 150 sequentially extracts pages B6, B5, B4, and B3 from the page list Z2. Then, the log extraction unit 150 confirms that the time stamp of the latest page B2 among the pages belonging to the page list Z2 indicates a time before x hours before the current time, and from the page list Z2. Complete log extraction. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to log extraction from the page list Z3 of the log type “type3” having the next highest priority.

１１番目に、ログ抽出部１５０は、ページリストＺ３からページＣ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ３からページＣ７を外す。
ログ抽出部１５０は、ページＣ７を抽出すると、抽出量の上限値に達したことを検出して、ログ抽出を終了する。抽出ログＬ１ｂは、上記の処理によってログ抽出部１５０により抽出されたページＡ７，Ａ６，Ａ５，Ａ４，Ａ３，Ｂ７，Ｂ６，Ｂ５，Ｂ４，Ｂ３，Ｃ７を含む。 Eleventh, the log extraction unit 150 extracts a page C7 from the page list Z3. Then, the log extraction unit 150 removes the page C7 from the page list Z3.
When extracting the page C7, the log extraction unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction. The extracted log L1b includes pages A7, A6, A5, A4, A3, B7, B6, B5, B4, B3, and C7 extracted by the log extracting unit 150 by the above processing.

図１７は、第２の実施の形態のログ抽出例（その３）を示す図である。図１７の例では、あるメッセージに対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１１個分である。ログ抽出の時間範囲はｘ時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”および“ｔｙｐｅ３”である。ログタイプ“ｔｙｐｅ１”の優先レベルは“１”である。ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは何れも“２”である。ページリストＺ１，Ｚ２，Ｚ３に属する各ページは、図１５と同様である。 FIG. 17 is a diagram illustrating a log extraction example (part 3) according to the second embodiment. In the example of FIG. 17, the following conditions are considered for log extraction for a certain message. The upper limit of the extraction amount is 11 pages. The time range for log extraction is x hours. The log types to be extracted are “type 1”, “type 2”, and “type 3”. The priority level of the log type “type1” is “1”. The priority levels of the log types “type 2” and “type 3” are both “2”. Each page belonging to the page list Z1, Z2, Z3 is the same as that shown in FIG.

メッセージの検出時（障害発生時）を現在とすると、現在からｘ時間前までがログ抽出対象の時間範囲である。図１７の例では、ページＡ３，Ｂ３，Ｃ３以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 Assuming that the time when a message is detected (when a failure occurs) is the time range from the present to x hours before the log extraction target. In the example of FIG. 17, the time stamps in the pages subsequent to pages A3, B3, and C3 are included in the time range for log extraction.

ログ抽出部１５０は、ページリストＺ１に属する各ページのうち、最新のページＡ２のタイムスタンプが、現在からｘ時間前の時刻よりも前の時刻を示すことを確認し、ページリストＺ１からのログ抽出を完了する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、次に優先順位の高いログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”のページリストＺ２，Ｚ３からのログ抽出に移る。 The log extraction unit 150 confirms that the time stamp of the latest page A2 among the pages belonging to the page list Z1 indicates a time before x hours before the current time, and logs from the page list Z1. Complete the extraction. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to log extraction from the page lists Z2 and Z3 of the log types “type 2” and “type 3” having the next highest priority.

上記のように、ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは“２”であり、ページリストＺ２，Ｚ３に属する各ページのうちの最新のページＢ７は、現在からｘ時間前の時刻よりも後の時刻である。このため、６番目に、ログ抽出部１５０は、ページＢ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ２からページＢ７を外す。 As described above, the priority level of the log types “type2” and “type3” is “2”, and the latest page B7 among the pages belonging to the page lists Z2 and Z3 is the time before x hours from the current time. Is a later time. For this reason, sixth, the log extraction unit 150 extracts the page B7. Then, the log extraction unit 150 removes the page B7 from the page list Z2.

７番目に、ログ抽出部１５０は、ページリストＺ２，Ｚ３に属する各ページのうち、最新のページＣ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ３からページＣ７を外す。 Seventh, the log extraction unit 150 extracts the latest page C7 from the pages belonging to the page lists Z2 and Z3. Then, the log extraction unit 150 removes the page C7 from the page list Z3.

８番目に、ログ抽出部１５０は、ページリストＺ２，Ｚ３に属する各ページのうち、最新のページＢ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ２からページＢ６を外す。 Eighth, the log extraction unit 150 extracts the latest page B6 from the pages belonging to the page lists Z2 and Z3. Then, the log extraction unit 150 removes the page B6 from the page list Z2.

９番目に、ログ抽出部１５０は、ページリストＺ２，Ｚ３に属する各ページのうち、最新のページＣ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ３からページＣ６を外す。 Ninth, the log extraction unit 150 extracts the latest page C6 from the pages belonging to the page lists Z2 and Z3. Then, the log extraction unit 150 removes the page C6 from the page list Z3.

以降、同様にして、ログ抽出部１５０は、ページの抽出を行う。１０番目に抽出されるページは、ページＣ５である。１１番目に抽出されるページは、ページＢ５である。
ログ抽出部１５０は、ページＢ５を抽出すると、抽出量の上限値に達したことを検出して、ログ抽出を終了する。抽出ログＬ１ｃは、上記の処理によってログ抽出部１５０により抽出されたページＡ７，Ａ６，Ａ５，Ａ４，Ａ３，Ｂ７，Ｃ７，Ｂ６，Ｃ６，Ｃ５，Ｂ５を含む。 Thereafter, similarly, the log extraction unit 150 performs page extraction. The tenth extracted page is page C5. The eleventh extracted page is page B5.
When extracting the page B5, the log extraction unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction. The extracted log L1c includes pages A7, A6, A5, A4, A3, B7, C7, B6, C6, C5, and B5 extracted by the log extracting unit 150 by the above processing.

図１８は、第２の実施の形態のログ抽出例（その４）を示す図である。図１８の例では、あるメッセージに対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１０個分（例えば、１ページのサイズが６４ＫＢの場合、６４ＫＢ×１０＝６４０ＫＢ）である。ログ抽出の時間範囲はｘ時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”および“ｔｙｐｅ３”である。ログタイプ“ｔｙｐｅ１”の優先レベルは“１”である。ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは何れも“２”である。 FIG. 18 is a diagram illustrating a log extraction example (part 4) according to the second embodiment. In the example of FIG. 18, the following conditions are considered for log extraction for a certain message. The upper limit of the extraction amount is 10 pages (for example, when the size of one page is 64 KB, 64 KB × 10 = 640 KB). The time range for log extraction is x hours. The log types to be extracted are “type 1”, “type 2”, and “type 3”. The priority level of the log type “type1” is “1”. The priority levels of the log types “type 2” and “type 3” are both “2”.

また、ページリストＺ４は、ログタイプ“ｔｙｐｅ１”のページリストである。ページリストＺ４は、タイムスタンプの古い方から新しい方へ向かって、ページＡ１，Ａ２，Ａ３，Ａ４，Ａ５，Ａ６，Ａ７，Ａ８を含む。ページリストＺ５は、ログタイプ“ｔｙｐｅ２”のページリストである。ページリストＺ５は、タイムスタンプの古い方から新しい方へ向かって、ページＢ１，Ｂ２，Ｂ３，Ｂ４を含む。ページリストＺ６は、ログタイプ“ｔｙｐｅ３”のページリストである。ページリストＺ６は、タイムスタンプの古い方から新しい方へ向かって、ページＣ１，Ｃ２，Ｃ３，Ｃ４を含む。 The page list Z4 is a page list of the log type “type1”. The page list Z4 includes pages A1, A2, A3, A4, A5, A6, A7, and A8 from the oldest time stamp to the newest time stamp. The page list Z5 is a page list of the log type “type2”. The page list Z5 includes pages B1, B2, B3, and B4 from the oldest time stamp to the newest time stamp. The page list Z6 is a page list of the log type “type3”. The page list Z6 includes pages C1, C2, C3, and C4 from the oldest time stamp to the newest time stamp.

ページリストＺ４，Ｚ５，Ｚ６に属する各ページのタイムスタンプは、図１５〜図１７の場合とは異なっている。図１８の例では、ページＡ１，Ｂ１，Ｃ１以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 The time stamps of the pages belonging to the page lists Z4, Z5 and Z6 are different from those in FIGS. In the example of FIG. 18, the time stamps of pages A1, B1, C1 and subsequent pages are included in the time range for log extraction.

最も優先順位の高いログタイプ“ｔｙｐｅ１”の最新のページＡ８は、現在からｘ時間前の時刻よりも後の時刻である。このため、ログ抽出部１５０は、ページＡ８を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ８を外す。 The latest page A8 of the log type “type 1” with the highest priority is a time later than the time x hours before the present time. For this reason, the log extraction unit 150 extracts the page A8. Then, the log extraction unit 150 removes the page A8 from the page list Z4.

以降の処理でも、ログ抽出部１５０は、抽出候補のページがｘ時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ４に属する各ページのうち、最新のページＡ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ７を外す。 Also in the subsequent processing, the log extraction unit 150 confirms that the extraction candidate page is a time later than the time before x hours.
Second, the log extraction unit 150 extracts the latest page A7 from the pages belonging to the page list Z4. Then, the log extraction unit 150 removes the page A7 from the page list Z4.

以降、同様にして、ログ抽出部１５０は、ページリストＺ４のページＡ６からページＡ１までを順に抽出し、ページリストＺ４に残りのページ（未抽出のページ）がなくなったことを検出する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、次に優先順位の高いログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”のページリストＺ５，Ｚ６からのログ抽出に移る。 Thereafter, similarly, the log extraction unit 150 sequentially extracts pages A6 to A1 of the page list Z4, and detects that there are no remaining pages (unextracted pages) in the page list Z4. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to log extraction from the page lists Z5 and Z6 of the log types “type2” and “type3” having the next highest priority.

上記のように、ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは“２”であり、ページリストＺ５，Ｚ６に属する各ページのうちの最新のページＢ４は、現在からｘ時間前の時刻よりも後の時刻である。このため、９番目に、ログ抽出部１５０は、ページＢ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ５からページＢ４を外す。 As described above, the priority level of the log types “type2” and “type3” is “2”, and the latest page B4 among the pages belonging to the page lists Z5 and Z6 is from the time x hours before the current time. Is a later time. For this reason, ninthly, the log extraction unit 150 extracts the page B4. Then, the log extraction unit 150 removes the page B4 from the page list Z5.

１０番目に、ログ抽出部１５０は、ページリストＺ５，Ｚ６に属する各ページのうち、最新のページＣ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ６からページＣ４を外す。 Tenth, the log extraction unit 150 extracts the latest page C4 from the pages belonging to the page lists Z5 and Z6. Then, the log extraction unit 150 removes the page C4 from the page list Z6.

ログ抽出部１５０は、ページＣ４を抽出すると、抽出量の上限値に達したことを検出して、ログ抽出を終了する。抽出ログＬ１ｄは、上記の処理によってログ抽出部１５０により抽出されたページＡ８，Ａ７，Ａ６，Ａ５，Ａ４，Ａ３，Ａ２，Ａ１，Ｂ４，Ｃ４を含む。 When extracting the page C4, the log extracting unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction. The extracted log L1d includes pages A8, A7, A6, A5, A4, A3, A2, A1, B4, and C4 extracted by the log extracting unit 150 by the above processing.

このようにして、ＣＭ１００によれば、解析に有用でないログの収集を抑えることができる。
ここで、例えば、障害などの事象（イベント）に対して、ＣＭ１００，２００，３００，４００におけるログを全て収集することも考えられる。しかし、ログには、新しいものや古いもの、ハードウェアやソフトウェアなどに関する種々のログレコードが含まれる。このため、ログを全て収集すると、発生した事象との関係が薄く、当該事象の解析に有用でないログレコードも収集されるという問題がある。余計なログレコードの収集は、収集したログレコードを他の装置に送信する際の通信量の増加や、有用でないログレコードによる解析量の増加などの要因になる。 In this way, according to the CM 100, collection of logs that are not useful for analysis can be suppressed.
Here, for example, it is also conceivable to collect all logs in the CMs 100, 200, 300, and 400 for events such as failures. However, the log includes various log records related to new and old items, hardware and software. For this reason, when all the logs are collected, there is a problem that the relationship with the occurred event is thin, and log records that are not useful for analyzing the event are also collected. The collection of unnecessary log records causes an increase in the amount of communication when the collected log records are transmitted to other devices, and an increase in the amount of analysis due to unusable log records.

そこで、ＣＭ１００は、障害の発生を示すメッセージ毎に抽出対象のページの時間範囲とログタイプ別の優先レベルとをログ抽出管理テーブル１１２により保持する。ＣＭ１００は、メッセージを検出すると、当該メッセージに応じた時間範囲とログタイプ別の優先レベルとをログ抽出管理テーブル１１２から検索する。そして、ＣＭ１００は、現時点以前の時間範囲とログタイプ別の優先レベルとを基に、ページを抽出する。これにより、ＣＭ１００は、ＣＭ１００のログのうち、障害解析に有用なログのみを抽出することができる。ＣＭ２００，３００，４００も同様にして、障害解析に有用なログのみを抽出することができる。更に、ＣＭ１００は、ＣＭ１００，２００，３００，４００における抽出ログを収集し、サポートサーバ６０に収集ログを送信することで、障害解析に有用なログのみを、サポートサーバ６０に送信することができる。すなわち、ＣＭ１００は、サポートサーバ６０に対して収集ログを送信する際の通信量の増加を抑えつつ、有用なログに絞った情報提供を行える。その結果、サポートサーバ６０側での解析量の低減を図れる。 Therefore, the CM 100 holds the time range of the page to be extracted and the priority level for each log type in the log extraction management table 112 for each message indicating the occurrence of a failure. When the CM 100 detects a message, the CM 100 searches the log extraction management table 112 for a time range corresponding to the message and a priority level for each log type. Then, the CM 100 extracts pages based on the time range before the current time and the priority level for each log type. As a result, the CM 100 can extract only logs useful for failure analysis from the CM 100 logs. Similarly, the CMs 200, 300, and 400 can extract only logs useful for failure analysis. Further, the CM 100 collects the extracted logs in the CMs 100, 200, 300, and 400 and transmits the collected logs to the support server 60, so that only the logs useful for failure analysis can be transmitted to the support server 60. In other words, the CM 100 can provide information focused on useful logs while suppressing an increase in the amount of communication when transmitting a collection log to the support server 60. As a result, the amount of analysis on the support server 60 side can be reduced.

［第３の実施の形態］
以下、第３の実施の形態を説明する。前述の第２の実施の形態と相違する事項を主に説明し、共通する事項の説明を省略する。 [Third Embodiment]
Hereinafter, a third embodiment will be described. Items that differ from the second embodiment described above will be mainly described, and descriptions of common items will be omitted.

図１８で例示したように、ログ抽出対象の時間範囲の設定によっては、特定のログタイプのページ（図１８の例では、ログタイプ“ｔｙｐｅ１”のページ）に偏ってログ抽出が行われる。また、抽出対象の時間範囲を広げた場合に、優先レベルの高いログの量が多いと、図１８で例示したように、優先レベルの低いログをほとんど収集できないことも考えられる。一方、障害の内容によっては、特定のログタイプのページを重点的に抽出しながら、他のログタイプのページもある程度取得して解析を行いたいこともある。そこで、第３の実施の形態では、各メッセージに対して複数の時間範囲の設定を許容することで、ログ抽出の柔軟化を図る機能を提供する。 As illustrated in FIG. 18, depending on the setting of the time range for log extraction, log extraction is performed with a bias toward a specific log type page (in the example of FIG. 18, the log type “type1” page). Further, when the time range to be extracted is expanded, if the amount of logs having a high priority level is large, it is conceivable that logs having a low priority level can hardly be collected as illustrated in FIG. On the other hand, depending on the content of the failure, there may be a case where a specific log type page is focused on and other log type pages are acquired to some extent for analysis. In view of this, the third embodiment provides a function for making log extraction flexible by allowing a plurality of time ranges to be set for each message.

第３の実施の形態のストレージシステムのハードウェアおよび機能構成は、図２〜図５で例示した第２の実施の形態のストレージシステムのハードウェアおよび機能構成と同様である。このため、第３の実施の形態でも、第２の実施の形態と同様の名称および符号により各要素を指し示すこととする。第３の実施の形態では、ログ抽出管理テーブル１１２の代わりに、ログ抽出管理テーブル１１３を用いる点が、第２の実施の形態と異なる。 The hardware and functional configuration of the storage system of the third embodiment are the same as the hardware and functional configuration of the storage system of the second embodiment illustrated in FIGS. For this reason, in the third embodiment, each element is indicated by the same name and symbol as in the second embodiment. The third embodiment is different from the second embodiment in that a log extraction management table 113 is used instead of the log extraction management table 112.

図１９は、第３の実施の形態のログ抽出管理テーブルの例を示す図である。ログ抽出管理テーブル１１３は、記憶部１１０に予め記憶されている。ログ抽出管理テーブル１１３は、メッセージＩＤに応じたログ抽出対象の時間範囲およびログタイプ毎の優先レベルが登録された情報である。ログ抽出管理テーブル１１３では、ログ抽出対象の時間範囲を２種類登録可能である点が、ログ抽出管理テーブル１１２と異なる。ログ抽出管理テーブル１１３は、メッセージＩＤ、時間範囲１（ｘ）、時間範囲２（ｙ）およびログタイプの優先レベルの項目を含む。 FIG. 19 is a diagram illustrating an example of a log extraction management table according to the third embodiment. The log extraction management table 113 is stored in the storage unit 110 in advance. The log extraction management table 113 is information in which a log extraction target time range corresponding to a message ID and a priority level for each log type are registered. The log extraction management table 113 is different from the log extraction management table 112 in that two types of log extraction target time ranges can be registered. The log extraction management table 113 includes items of message ID, time range 1 (x), time range 2 (y), and log type priority level.

メッセージＩＤおよびログタイプの優先レベルの項目の設定内容は、ログ抽出管理テーブル１１２における同名の項目の設定内容と同様である。
時間範囲１（ｘ）の項目には、ログ抽出対象の第１の時間範囲ｘが登録される。時間範囲２（ｙ）の項目には、ログ抽出対象の第２の時間範囲ｙが登録される。第１の時間範囲ｘおよび第２の時間範囲ｙの何れも、単位は、例えば、時間（hour）である。また、第２の時間範囲ｙは、第１の時間範囲ｘよりも新しい時刻である。時間範囲１（ｘ）の項目における第１の時間範囲ｘの設定は、必須である。時間範囲２（ｙ）の項目における第２の時間範囲ｙの設定は、任意である（時間範囲２（ｙ）の項目は設定なしでもよい）。時間範囲２（ｙ）の項目が設定なしの場合、図ではハイフン記号“−”を表記する。 The setting contents of the item of the priority level of the message ID and the log type are the same as the setting contents of the item of the same name in the log extraction management table 112.
In the item of time range 1 (x), the first time range x to be subjected to log extraction is registered. In the item of time range 2 (y), the second time range y to be extracted from the log is registered. The unit of both the first time range x and the second time range y is, for example, hour. The second time range y is a newer time than the first time range x. The setting of the first time range x in the item of the time range 1 (x) is indispensable. The setting of the second time range y in the item of the time range 2 (y) is arbitrary (the item of the time range 2 (y) may not be set). When the item of the time range 2 (y) is not set, a hyphen symbol “-” is shown in the figure.

例えば、ログ抽出管理テーブル１１２には、メッセージＩＤが“ａ０００００００５”、時間範囲１（ｘ）が“４８”、時間範囲２（ｙ）が“３”、ログタイプ“ｔｙｐｅ１”の優先レベル“１”、ログタイプ“ｔｙｐｅ２”の優先レベル“２”、ログタイプ“ｔｙｐｅ３”の優先レベル“３”、ログタイプ“ｔｙｐｅ４”の優先レベル“０”，・・・という情報が登録される。これは、メッセージＩＤ“ａ０００００００５”を含むメッセージが検出された場合、当該検出時（障害発生時）から３時間前に遡った時刻までを第１段階のログ抽出対象の時間範囲とすることを示す。また、第１段階のログ抽出が完了した後に、当該検出時（障害発生時）から４８時間前に遡った時刻までを第２段階のログ抽出対象の時間範囲とすることを示す。また、各ログタイプの優先レベルにしたがって、ログ抽出を行うことを示す。 For example, in the log extraction management table 112, the message ID is “a00000005”, the time range 1 (x) is “48”, the time range 2 (y) is “3”, and the priority level “1” is the log type “type1”. , The priority level “2” of the log type “type 2”, the priority level “3” of the log type “type 3”, the priority level “0” of the log type “type 4”, and so on are registered. This indicates that, when a message including the message ID “a00000005” is detected, the time range from the time of the detection (at the time of failure) to the time that goes back three hours ago is set as the time range of the log extraction target in the first stage. . In addition, after the log extraction at the first stage is completed, the time range from the time of the detection (at the time of failure) to the time that goes back 48 hours ago is set as the time range of the log extraction target of the second stage. It also indicates that log extraction is performed according to the priority level of each log type.

次に、第３の実施の形態におけるログ抽出部１５０によるログ抽出の手順を説明する。第３の実施の形態では、図１２で例示したＣＭ単位のログ抽出処理の手順に代えて、ログ抽出部１５０が以下に示す手順を実行する点が異なる。他の処理の手順について、第２の実施の形態で例示した手順と同様であるため、説明を省略する。また、以下では、ログ抽出部１５０について主に説明するが、ログ抽出部２３０，３３０，４３０も同様の手順によりログ抽出を行う。 Next, the log extraction procedure by the log extraction unit 150 in the third embodiment will be described. The third embodiment is different in that the log extraction unit 150 executes the following procedure instead of the CM-unit log extraction processing procedure illustrated in FIG. The other processing procedures are the same as the procedures exemplified in the second embodiment, and thus description thereof is omitted. In the following, the log extraction unit 150 will be mainly described, but the log extraction units 230, 330, and 430 also perform log extraction according to the same procedure.

図２０は、第３の実施の形態のＣＭ単位のログ抽出例を示すフローチャートである。以下、図２０に示す処理をステップ番号に沿って説明する。以下に示す手順は、図１１のステップＳ１４に相当する。 FIG. 20 is a flowchart illustrating an example of CM-unit log extraction according to the third embodiment. In the following, the process illustrated in FIG. 20 will be described in order of step number. The procedure shown below corresponds to step S14 in FIG.

（Ｓ５１）ログ抽出部１５０は、記憶部１１０に記憶されたログ抽出管理テーブル１１３から、今回のメッセージに対応する時間範囲２（ｙ）の値を取得する。具体的には、ログ抽出部１５０は、障害通知のメッセージに含まれるメッセージＩＤに対応する時間範囲２（ｙ）を、ログ抽出管理テーブル１１３から取得する。 (S51) The log extraction unit 150 acquires the value of the time range 2 (y) corresponding to the current message from the log extraction management table 113 stored in the storage unit 110. Specifically, the log extraction unit 150 acquires the time range 2 (y) corresponding to the message ID included in the failure notification message from the log extraction management table 113.

（Ｓ５２）ログ抽出部１５０は、ステップＳ５１の結果を基に、時間範囲２（ｙ）が設定なしであるか否かを判定する。時間範囲２（ｙ）が設定なしの場合、ログ抽出部１５０は、ステップＳ５６に処理を進める。時間範囲２（ｙ）が設定ありの場合、ログ抽出部１５０は、ステップＳ５３に処理を進める。 (S52) The log extraction unit 150 determines whether the time range 2 (y) is not set based on the result of step S51. When the time range 2 (y) is not set, the log extraction unit 150 proceeds with the process to step S56. When the time range 2 (y) is set, the log extraction unit 150 proceeds with the process to step S53.

（Ｓ５３）ログ抽出部１５０は、時間範囲を時間範囲２（ｙ）に設定する。
（Ｓ５４）ログ抽出部１５０は、時間範囲を時間範囲２（ｙ）に設定した状態で、時間範囲内のログ抽出処理を実行する。時間範囲内のログ抽出処理の手順は、図１３の手順と同様である。 (S53) The log extraction unit 150 sets the time range to the time range 2 (y).
(S54) The log extraction unit 150 executes log extraction processing within the time range in a state where the time range is set to the time range 2 (y). The procedure of log extraction processing within the time range is the same as the procedure of FIG.

（Ｓ５５）ログ抽出部１５０は、抽出量の合計が上限値に達したか否かを判定する。抽出量の合計が上限値に達した場合、ログ抽出部１５０は、処理をステップＳ５８に進める。抽出量の合計が上限値に達していない場合、ログ抽出部１５０は、ステップＳ５６に処理を進める。 (S55) The log extraction unit 150 determines whether or not the total extraction amount has reached the upper limit. When the total of the extraction amounts reaches the upper limit value, the log extraction unit 150 proceeds with the process to step S58. If the total extraction amount has not reached the upper limit value, the log extraction unit 150 proceeds with the process to step S56.

（Ｓ５６）ログ抽出部１５０は、ログ抽出管理テーブル１１３から、今回のメッセージに対応する時間範囲１（ｘ）の値を取得する。具体的には、ログ抽出部１５０は、障害通知のメッセージに含まれるメッセージＩＤに対応する時間範囲１（ｘ）を、ログ抽出管理テーブル１１３から取得する。 (S56) The log extraction unit 150 acquires the value of the time range 1 (x) corresponding to the current message from the log extraction management table 113. Specifically, the log extraction unit 150 acquires the time range 1 (x) corresponding to the message ID included in the failure notification message from the log extraction management table 113.

（Ｓ５７）ログ抽出部１５０は、時間範囲を時間範囲１（ｘ）に設定した状態で、時間範囲内のログ抽出処理を実行する。時間範囲内のログ抽出処理の手順は、図１３の手順と同様である。 (S57) The log extraction unit 150 executes log extraction processing within the time range in a state where the time range is set to the time range 1 (x). The procedure of log extraction processing within the time range is the same as the procedure of FIG.

（Ｓ５８）ログ抽出部１５０は、ステップＳ５４，Ｓ５７の両方または何れか一方により抽出したログ（抽出ログ）をログ収集部１４０に提供する。
このように、記憶部１１０は、時間範囲２（ｙ）および時間範囲２（ｙ）よりも長い期間を示す時間範囲１（ｘ）（他の時間範囲）をメッセージ毎に登録したログ抽出管理テーブル１１３を記憶する。そして、ログ抽出部１５０は、障害発生を示すメッセージを検出すると、記憶部１１０に記憶されたログ抽出管理テーブル１１３を参照して、メッセージに応じた現時刻から過去の時間範囲２（ｙ）および優先レベルに基づき、ログ（動作情報）の中からページ（ログレコード）を抽出する。その後、ログ抽出部１５０は、メッセージ応じた現時刻から過去の時間範囲１（ｘ）および優先レベルに基づき、ログ（動作情報）の中から他のページ（他のログレコード）を抽出する。これにより、障害に応じて、抽出ログの内容を柔軟に調整可能になる。 (S58) The log extraction unit 150 provides the log collection unit 140 with the log (extraction log) extracted in both or one of steps S54 and S57.
As described above, the storage unit 110 stores the time range 2 (y) and the time range 1 (x) (other time range) indicating a period longer than the time range 2 (y) for each message. 113 is stored. Then, when the log extraction unit 150 detects a message indicating the occurrence of a failure, the log extraction unit 150 refers to the log extraction management table 113 stored in the storage unit 110 and refers to the past time range 2 (y) and the past time range corresponding to the message. Based on the priority level, a page (log record) is extracted from the log (operation information). Thereafter, the log extraction unit 150 extracts other pages (other log records) from the log (operation information) based on the past time range 1 (x) and the priority level from the current time according to the message. Thereby, the contents of the extraction log can be flexibly adjusted according to the failure.

図２１は、第３の実施の形態のログ抽出例を示す図である。図２１の例では、あるメッセージに対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１０個分（例えば、１ページのサイズが６４ＫＢの場合、６４ＫＢ×１０＝６４０ＫＢ）である。ログ抽出の時間範囲１（ｘ）はｘ時間である。ログ抽出の時間範囲２（ｙ）はｙ時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ２”および“ｔｙｐｅ３”である。ログタイプ“ｔｙｐｅ１”の優先レベルは“１”である。ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは何れも“２”である。ページリストＺ４，Ｚ５，Ｚ６に属する各ページは、図１８と同様である。 FIG. 21 is a diagram illustrating a log extraction example according to the third embodiment. In the example of FIG. 21, the following conditions are considered for log extraction for a certain message. The upper limit of the extraction amount is 10 pages (for example, when the size of one page is 64 KB, 64 KB × 10 = 640 KB). The log extraction time range 1 (x) is x hours. The log extraction time range 2 (y) is y hours. The log types to be extracted are “type 1”, “type 2”, and “type 3”. The priority level of the log type “type1” is “1”. The priority levels of the log types “type 2” and “type 3” are both “2”. Each page belonging to the page list Z4, Z5, Z6 is the same as that shown in FIG.

この場合、メッセージの検出時（障害発生時）を現在とすると、現在からｙ時間前までが第１段階のログ抽出対象の時間範囲である。図２１の例では、ページＡ５，Ｂ３，Ｃ３以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 In this case, assuming that the time when a message is detected (when a failure occurs) is now, the time range from the present to y hours before is the first stage of log extraction target. In the example of FIG. 21, the time stamps in the pages subsequent to pages A5, B3, and C3 are included in the time range for log extraction.

最も優先順位の高いログタイプ“ｔｙｐｅ１”の最新のページＡ８は、現在からｙ時間前の時刻よりも後の時刻である。このため、ログ抽出部１５０は、ページＡ８を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ８を外す。 The latest page A8 of the log type “type 1” with the highest priority is a time later than the time y hours before the present time. For this reason, the log extraction unit 150 extracts the page A8. Then, the log extraction unit 150 removes the page A8 from the page list Z4.

以降の第１段階のログ抽出処理でも、ログ抽出部１５０は、抽出候補のページがｙ時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ４に属する各ページのうち、最新のページＡ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ７を外す。 Also in the subsequent first-stage log extraction processing, the log extraction unit 150 confirms that the extraction candidate page is later than the time before y hours.
Second, the log extraction unit 150 extracts the latest page A7 from the pages belonging to the page list Z4. Then, the log extraction unit 150 removes the page A7 from the page list Z4.

３番目に、ログ抽出部１５０は、ページリストＺ４に属する各ページのうち、最新のページＡ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ６を外す。 Third, the log extraction unit 150 extracts the latest page A6 from the pages belonging to the page list Z4. Then, the log extraction unit 150 removes the page A6 from the page list Z4.

４番目に、ログ抽出部１５０は、ページリストＺ４に属する各ページのうち、最新ページＡ５を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ５を外す。 Fourth, the log extraction unit 150 extracts the latest page A5 from the pages belonging to the page list Z4. Then, the log extraction unit 150 removes the page A5 from the page list Z4.

ログ抽出部１５０は、ページリストＺ４の最新のページＡ４のタイムスタンプが現在からｙ時間前の時刻よりも前の時刻を示すことを確認し、ページリストＺ４からの第１段階のログ抽出を完了する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、次に優先順位の高いログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”のページリストＺ５，Ｚ６からの第１段階のログ抽出に移る。 The log extraction unit 150 confirms that the time stamp of the latest page A4 in the page list Z4 indicates a time before the time y hours before the current time, and completes the first stage log extraction from the page list Z4. To do. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to the first-stage log extraction from the page lists Z5 and Z6 of the log types “type2” and “type3” having the next highest priority.

上記のように、ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ３”の優先レベルは“２”であり、ページリストＺ５，Ｚ６に属する各ページのうちの最新のページＢ４は、現在からｙ時間前の時刻よりも後の時刻である。このため、５番目に、ログ抽出部１５０は、ページＢ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ５からページＢ４を外す。 As described above, the priority level of the log types “type2” and “type3” is “2”, and the latest page B4 among the pages belonging to the page lists Z5 and Z6 is from the time y hours before the current time. Is a later time. For this reason, fifthly, the log extraction unit 150 extracts the page B4. Then, the log extraction unit 150 removes the page B4 from the page list Z5.

６番目に、ログ抽出部１５０は、ページリストＺ５，Ｚ６に属する各ページのうち、最新のページＣ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ６からページＣ４を外す。 Sixth, the log extraction unit 150 extracts the latest page C4 from the pages belonging to the page lists Z5 and Z6. Then, the log extraction unit 150 removes the page C4 from the page list Z6.

７番目に、ログ抽出部１５０は、ページリストＺ５，Ｚ６に属する各ページのうち、最新のページＢ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ５からページＢ３を外す。 Seventh, the log extraction unit 150 extracts the latest page B3 from the pages belonging to the page lists Z5 and Z6. Then, the log extraction unit 150 removes the page B3 from the page list Z5.

８番目に、ログ抽出部１５０は、ページリストＺ５，Ｚ６に属する各ページのうち、最新のページＣ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ６からページＣ３を外す。 Eighth, the log extraction unit 150 extracts the latest page C3 from the pages belonging to the page lists Z5 and Z6. Then, the log extraction unit 150 removes the page C3 from the page list Z6.

ログ抽出部１５０は、ページリストＺ５，Ｚ６に属する各ページのうち、最新のページＣ２のタイムスタンプが現在からｙ時間前の時刻よりも前の時刻を示すことを確認し、ページリストＺ５，Ｚ６からの第１段階のログ抽出を完了する。ログ抽出部１５０は、抽出量の上限値に未だ達していないため、第２段階のログ抽出に移る。第２段階のログ抽出の時間範囲は、現在からｘ時間前の時刻までである。 The log extraction unit 150 confirms that among the pages belonging to the page lists Z5 and Z6, the time stamp of the latest page C2 indicates a time before y hours before the current time, and the page lists Z5 and Z6. Complete the first stage log extraction from. Since the log extraction unit 150 has not yet reached the upper limit of the extraction amount, the log extraction unit 150 proceeds to the second-stage log extraction. The time range of the second stage log extraction is from the present time to the time x hours before.

最も優先順位の高いログタイプ“ｔｙｐｅ１”の最新のページＡ４は、現在からｘ時間前の時刻よりも後の時刻である。このため、９番目に、ログ抽出部１５０は、ページＡ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ４を外す。 The latest page A4 of the log type “type1” with the highest priority is a time later than the time x hours before the present time. For this reason, ninthly, the log extraction unit 150 extracts the page A4. Then, the log extraction unit 150 removes the page A4 from the page list Z4.

１０番目に、ログ抽出部１５０は、ページＡ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ４からページＡ３を外す。
ログ抽出部１５０は、ページＡ３を抽出すると、抽出量の上限値に達したことを検出して、第２段階のログ抽出を終了する。抽出ログＬ１ｅは、上記の処理によってログ抽出部１５０により抽出されたページＡ８，Ａ７，Ａ６，Ａ５，Ｂ４，Ｃ４，Ｂ３，Ｃ３，Ａ４，Ａ３を含む。 Tenth, the log extraction unit 150 extracts the page A3. Then, the log extraction unit 150 removes the page A3 from the page list Z4.
When extracting the page A3, the log extraction unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction in the second stage. The extracted log L1e includes pages A8, A7, A6, A5, B4, C4, B3, C3, A4, and A3 extracted by the log extracting unit 150 by the above processing.

次に、第３の実施の形態のログ抽出方法について、更に具体的な例を説明する。以下の説明では、具体的な障害内容と、具体的なログタイプとを例示することで、ＣＭ１００，２００，３００，４００によるログ抽出例を更に具体的に説明する。 Next, a more specific example of the log extraction method according to the third embodiment will be described. In the following description, an example of log extraction by the CMs 100, 200, 300, and 400 will be described more specifically by illustrating specific failure contents and specific log types.

図２２は、第３の実施の形態のログ抽出管理テーブルの第１具体例を示す図である。ログ抽出管理テーブル１１４は、ＣＭ１００，２００，３００，４００それぞれが備える筐体内の冷却用のファン（fan）の故障に対する時間範囲１（ｘ）、時間範囲２（ｙ）およびログタイプの優先レベルを例示している。例えば、ファンの故障を示すメッセージのメッセージＩＤを“ＦＡＮＦａｕｌｔ”とする。ログ抽出管理テーブル１１４には、当該メッセージＩＤに対して、時間範囲１（ｘ）が“４８”、時間範囲２（ｙ）が“１”という情報が登録されている。また、当該メッセージＩＤに対して、ログタイプ“ｔｙｐｅ１”の優先レベル“１”、ログタイプ“ｔｙｐｅ２”の優先レベル“０”、ログタイプ“ｔｙｐｅ３”の優先レベル“０”、ログタイプ“ｔｙｐｅ４”の優先レベル“１”、ログタイプ“ｔｙｐｅ５”の優先レベル“２”、ログタイプ“ｔｙｐｅ６”の優先レベル“０”、ログタイプ“ｔｙｐｅ７”の優先レベル“０”、ログタイプ“ｔｙｐｅ８”の優先レベル“０”という情報が登録されている。 FIG. 22 is a diagram illustrating a first specific example of the log extraction management table according to the third embodiment. The log extraction management table 114 indicates the time range 1 (x), the time range 2 (y), and the priority level of the log type with respect to the failure of the cooling fan in the casing of each of the CMs 100, 200, 300, and 400. Illustrated. For example, the message ID of a message indicating a fan failure is “FAN Fault”. In the log extraction management table 114, information that the time range 1 (x) is “48” and the time range 2 (y) is “1” is registered for the message ID. Also, for this message ID, the priority level “1” of the log type “type1”, the priority level “0” of the log type “type2”, the priority level “0” of the log type “type3”, and the log type “type4”. Priority level “1”, log type “type5” priority level “2”, log type “type6” priority level “0”, log type “type7” priority level “0”, log type “type8” priority Information of level “0” is registered.

ここで、ログタイプ“ｔｙｐｅ１”は、ハードウェアエラー（ハードエラー）である。ログタイプ“ｔｙｐｅ２”は、データのコピー機能に関するソフトウェアエラー（ソフトエラー）である。ログタイプ“ｔｙｐｅ３”は、データの重複排除／圧縮機能に関するソフトエラーである。ログタイプ“ｔｙｐｅ４”は、温度などの環境に関する情報である。ログタイプ“ｔｙｐｅ５”は、電源オン／オフや消費電力などの電源制御に関する情報である。ログタイプ“ｔｙｐｅ６”は、ＭＭＩ（Man Machine Interface）に対する操作（ＭＭＩ操作）に関する情報である。ログタイプ“ｔｙｐｅ７”は、データのコピー機能に関するイベントである。ログタイプ“ｔｙｐｅ８”は、データの重複排除／圧縮機能に関するイベントである。 Here, the log type “type1” is a hardware error (hardware error). The log type “type2” is a software error (soft error) related to the data copy function. The log type “type 3” is a soft error related to the data deduplication / compression function. The log type “type 4” is information about the environment such as temperature. The log type “type 5” is information relating to power control such as power on / off and power consumption. The log type “type 6” is information related to an operation (MMI operation) for an MMI (Man Machine Interface). The log type “type 7” is an event related to a data copy function. The log type “type 8” is an event related to the data deduplication / compression function.

ＦＡＮ故障の解析に当たっては、故障の直接の原因を解析するために故障発生時付近のログを取得する。また、ＦＡＮ故障を加速するような間接的な要因（例えば、温度異常など）の有無を解析するために、故障発生前の比較的長時間に亘る環境ログを抽出することが好ましい。そこで、故障発生から１時間前までのログを抽出し、更に、故障発生から４８時間前までの範囲でハードエラーと環境情報のログを優先して抽出するように、ログ抽出管理テーブル１１４の設定を行う。 In analyzing a FAN failure, a log near the time of failure occurrence is acquired in order to analyze the direct cause of the failure. In addition, in order to analyze the presence or absence of an indirect factor (for example, temperature abnormality) that accelerates the FAN failure, it is preferable to extract an environmental log for a relatively long time before the failure occurs. Therefore, the log extraction management table 114 is set so that logs up to 1 hour before the occurrence of the failure are extracted, and further, logs of hardware errors and environmental information are preferentially extracted up to 48 hours before the occurrence of the failure. I do.

図２３は、第３の実施の形態のログ抽出の第１具体例を示す図である。図２３では、ログ抽出管理テーブル１１４に基づくログ抽出部１５０によるログ抽出を例示する。
図２３の例では、メッセージ“ＦＡＮＦａｕｌｔ”に対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１０個分（例えば、１ページのサイズが６４ＫＢの場合、６４ＫＢ×１０＝６４０ＫＢ）である。ログ抽出の時間範囲１（ｘ）は４８時間である。ログ抽出の時間範囲２（ｙ）は１時間である。抽出対象のログタイプは、“ｔｙｐｅ１”、“ｔｙｐｅ４”および“ｔｙｐｅ５”である。ただし、図２３では、比較のために、ログタイプ“ｔｙｐｅ６”も図示している。ログタイプ“ｔｙｐｅ１”、“ｔｙｐｅ４”の優先レベルは何れも“１”である。ログタイプ“ｔｙｐｅ５”の優先レベルは“２”である。 FIG. 23 is a diagram illustrating a first specific example of log extraction according to the third embodiment. FIG. 23 illustrates log extraction by the log extraction unit 150 based on the log extraction management table 114.
In the example of FIG. 23, the following conditions are considered for log extraction for the message “FAN Fault”. The upper limit of the extraction amount is 10 pages (for example, when the size of one page is 64 KB, 64 KB × 10 = 640 KB). The log extraction time range 1 (x) is 48 hours. The log extraction time range 2 (y) is one hour. The log types to be extracted are “type1”, “type4”, and “type5”. However, in FIG. 23, the log type “type6” is also illustrated for comparison. The priority levels of the log types “type 1” and “type 4” are both “1”. The priority level of the log type “type5” is “2”.

また、ページリストＺ７は、ログタイプ“ｔｙｐｅ１”のページリストである。ページリストＺ７は、タイムスタンプの古い方から新しい方へ向かって、ページＡ１，Ａ２，Ａ３を含む。ページリストＺ８は、ログタイプ“ｔｙｐｅ４”のページリストである。ページリストＺ８は、タイムスタンプの古い方から新しい方へ向かって、ページＢ１，Ｂ２，Ｂ３，Ｂ４，Ｂ５を含む。ページリストＺ９は、ログタイプ“ｔｙｐｅ５”のページリストである。ページリストＺ９は、タイムスタンプの古い方から新しい方へ向かって、ページＣ１，Ｃ２，Ｃ３，Ｃ４を含む。ページリストＺ１０は、ログタイプ“ｔｙｐｅ６”のページリストである。ページリストＺ１０は、タイムスタンプの古い方から新しい方へ向かって、ページＤ１，Ｄ２，Ｄ３，Ｄ４を含む。ただし、前述のように、ページリストＺ１０は、比較のために図示したものであり、ページの抽出対象ではない。 The page list Z7 is a page list of the log type “type1”. The page list Z7 includes pages A1, A2, and A3 from the oldest time stamp to the newest time stamp. The page list Z8 is a page list of the log type “type4”. The page list Z8 includes pages B1, B2, B3, B4, and B5 from the oldest time stamp to the newest time stamp. The page list Z9 is a page list of the log type “type5”. The page list Z9 includes pages C1, C2, C3, and C4 from the oldest time stamp to the newest time stamp. The page list Z10 is a page list of the log type “type6”. The page list Z10 includes pages D1, D2, D3, and D4 from the oldest time stamp to the newest time stamp. However, as described above, the page list Z10 is illustrated for comparison and is not a page extraction target.

この場合、メッセージの検出時（障害発生時）を現在とすると、現在から１時間前までが第１段階のログ抽出対象の時間範囲である。図２３の例では、ページＡ２，Ｂ５，Ｃ４以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 In this case, assuming that the time when a message is detected (at the time of occurrence of a failure) is the current time range from the present to one hour before the log extraction target. In the example of FIG. 23, time stamps in pages A2, B5, and C4 and subsequent pages are included in the time range for log extraction.

最も優先順位の高いログタイプ“ｔｙｐｅ１”、“ｔｙｐｅ４”の最新のページＢ５は、現在から１時間前の時刻よりも後の時刻である。このため、ログ抽出部１５０は、ページＢ５を抽出する。そして、ログ抽出部１５０は、ページリストＺ８からページＢ５を外す。 The latest page B5 of the log types “type1” and “type4” with the highest priority is a time later than the time one hour before the current time. For this reason, the log extraction unit 150 extracts the page B5. Then, the log extraction unit 150 removes the page B5 from the page list Z8.

以降の第１段階のログ抽出処理でも、ログ抽出部１５０は、抽出候補のページが１時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ７，Ｚ８に属する各ページのうち、最新のページＡ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ７からページＡ３を外す。 Also in the subsequent first-stage log extraction processing, the log extraction unit 150 confirms that the extraction candidate page is later than the time one hour before.
Second, the log extraction unit 150 extracts the latest page A3 from the pages belonging to the page lists Z7 and Z8. Then, the log extraction unit 150 removes the page A3 from the page list Z7.

３番目に、ログ抽出部１５０は、ページリストＺ７，Ｚ８に属する各ページのうち、最新のページＡ２を抽出する。そして、ログ抽出部１５０は、ページリストＺ７からページＡ２を外す。 Third, the log extraction unit 150 extracts the latest page A2 from the pages belonging to the page lists Z7 and Z8. Then, the log extraction unit 150 removes the page A2 from the page list Z7.

ログ抽出部１５０は、ページリストＺ７，Ｚ８に属する各ページのうち、最新のページＢ４のタイムスタンプが１時間前の時刻よりも前の時刻であることを確認する。すると、ログ抽出部１５０は、次の優先レベルであるログタイプ“ｔｙｐｅ５”のページリストＺ９からの第１段階のログ抽出に移る。 The log extraction unit 150 confirms that the time stamp of the latest page B4 among the pages belonging to the page lists Z7 and Z8 is a time before the time one hour before. Then, the log extraction unit 150 proceeds to the first stage log extraction from the page list Z9 of the log type “type 5” which is the next priority level.

４番目に、ログ抽出部１５０は、ページリストＺ９に属する各ページのうち、最新のページＣ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ９からページＣ４を外す。 Fourth, the log extraction unit 150 extracts the latest page C4 from the pages belonging to the page list Z9. Then, the log extraction unit 150 removes the page C4 from the page list Z9.

ログ抽出部１５０は、ページリストＺ９に属する各ページのうち、最新のページＣ３のタイムスタンプが１時間前の時刻よりも前の時刻であることを確認する。すると、ログ抽出部１５０は、抽出対象の全てのログタイプについて第１段階のログ抽出処理を終えたので、第２段階のログ抽出処理に移る。 The log extraction unit 150 confirms that the time stamp of the latest page C3 among the pages belonging to the page list Z9 is a time before the time one hour before. Then, since the log extraction unit 150 has finished the first-stage log extraction process for all the log types to be extracted, the log-extraction unit 150 proceeds to the second-stage log extraction process.

５番目に、ログ抽出部１５０は、ページリストＺ７，Ｚ８に属する各ページのうち、最新のページＢ４を抽出する。そして、ログ抽出部１５０は、ページリストＺ８からページＢ４を外す。 Fifth, the log extraction unit 150 extracts the latest page B4 from the pages belonging to the page lists Z7 and Z8. Then, the log extraction unit 150 removes the page B4 from the page list Z8.

６番目に、ログ抽出部１５０は、ページリストＺ７，Ｚ８に属する各ページのうち、最新のページＡ１を抽出する。そして、ログ抽出部１５０は、ページリストＺ７からページＡ１を外す。この段階では、ページリストＺ７には、未抽出のページがなくなる。 Sixth, the log extraction unit 150 extracts the latest page A1 from the pages belonging to the page lists Z7 and Z8. Then, the log extraction unit 150 removes the page A1 from the page list Z7. At this stage, there are no unextracted pages in the page list Z7.

７番目に、ログ抽出部１５０は、ページリストＺ８に属する各ページのうち、最新のページＢ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ８からページＢ３を外す。 Seventh, the log extraction unit 150 extracts the latest page B3 from the pages belonging to the page list Z8. Then, the log extraction unit 150 removes the page B3 from the page list Z8.

８番目に、ログ抽出部１５０は、ページリストＺ８に属する各ページのうち、最新のページＢ２を抽出する。そして、ログ抽出部１５０は、ページリストＺ８からページＢ２を外す。 Eighth, the log extraction unit 150 extracts the latest page B2 from the pages belonging to the page list Z8. Then, the log extraction unit 150 removes the page B2 from the page list Z8.

９番目に、ログ抽出部１５０は、ページリストＺ８に属する各ページのうち、最新のページＢ１を抽出する。そして、ログ抽出部１５０は、ページリストＺ８からページＢ１を外す。ページリストＺ８にも未抽出のページがなくなったので、ログ抽出部１５０は、次の優先レベルであるページリストＺ９からの第２段階のログ抽出処理に移る。 Ninth, the log extraction unit 150 extracts the latest page B1 from the pages belonging to the page list Z8. Then, the log extraction unit 150 removes the page B1 from the page list Z8. Since there are no unextracted pages in the page list Z8, the log extraction unit 150 proceeds to the second stage log extraction processing from the page list Z9 which is the next priority level.

１０番目に、ログ抽出部１５０は、ページリストＺ９に属する各ページのうち、最新のページＣ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ９からページＣ３を外す。 Tenth, the log extraction unit 150 extracts the latest page C3 from the pages belonging to the page list Z9. Then, the log extraction unit 150 removes the page C3 from the page list Z9.

ログ抽出部１５０は、ページＣ３を抽出すると、抽出量の上限値に達したことを検出して、第２段階のログ抽出を終了する。抽出ログＬ１ｆは、上記の処理によってログ抽出部１５０により抽出されたページＢ５，Ａ３，Ａ２，Ｃ４，Ｂ４，Ａ１，Ｂ３，Ｂ２，Ｂ１，Ｃ３を含む。 When extracting the page C3, the log extraction unit 150 detects that the upper limit of the extraction amount has been reached, and ends the log extraction in the second stage. The extracted log L1f includes pages B5, A3, A2, C4, B4, A1, B3, B2, B1, and C3 extracted by the log extracting unit 150 by the above processing.

こうして、ＣＭ１００，２００，３００，４００は、ＦＡＮ故障の障害調査に適した調査用ログを抽出することができる。また、ＣＭ１００は、抽出された調査用ログを収集して、サポートサーバ６０に送信することで、ＦＡＮ故障の障害調査に有用な情報に絞った情報提供を行うことができる。また、余計な情報を送るよりも通信量を減らすことができる。 In this way, the CMs 100, 200, 300, and 400 can extract investigation logs suitable for FAN failure investigations. Further, the CM 100 collects the extracted investigation logs and transmits them to the support server 60, thereby providing information focused on information useful for trouble investigation of FAN failure. In addition, the amount of communication can be reduced compared to sending extra information.

図２４は、第３の実施の形態のログ抽出管理テーブルの第２具体例を示す図である。ログ抽出管理テーブル１１５は、ＣＭ１００，２００，３００，４００それぞれにおけるデータのコピーセッションにおけるエラー（copy session error）に対する時間範囲１（ｘ）、時間範囲２（ｙ）およびログタイプの優先レベルを例示している。例えば、コピーセッションエラーのメッセージＩＤを“ｃｏｐｙｓｅｓｓｉｏｎｅｒｒｏｒ”とする。ログ抽出管理テーブル１１５には、当該メッセージＩＤに対して、時間範囲１（ｘ）が“６４”、時間範囲２（ｙ）が“−”（設定なし）という情報が登録されている。また、当該メッセージＩＤに対して、ログタイプ“ｔｙｐｅ１”、“ｔｙｐｅ３”、“ｔｙｐｅ４”、“ｔｙｐｅ５”、“ｔｙｐｅ６”、“ｔｙｐｅ８”の優先レベル“０”という情報が登録されている。更に、当該メッセージＩＤに対して、ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ７”の優先レベル“１”という情報が登録されている。 FIG. 24 is a diagram illustrating a second specific example of the log extraction management table according to the third embodiment. The log extraction management table 115 exemplifies time range 1 (x), time range 2 (y), and log type priority level for an error (copy session error) in a data copy session in each of the CMs 100, 200, 300, and 400. ing. For example, the message ID of the copy session error is “copy session error”. In the log extraction management table 115, information that the time range 1 (x) is “64” and the time range 2 (y) is “−” (no setting) is registered for the message ID. In addition, information of the priority level “0” of the log types “type1”, “type3”, “type4”, “type5”, “type6”, and “type8” is registered for the message ID. Furthermore, information of the priority level “1” of the log types “type 2” and “type 7” is registered for the message ID.

ここで、ログ抽出管理テーブル１１５におけるログタイプは、ログ抽出管理テーブル１１４で例示したログタイプと同様である。
データのコピー機能のエラーの解析に当たっては、エラーに至るまでの経緯から原因を特定するために、事象発生からできるだけ長時間に亘るコピー機能に関するログを抽出することが好ましい。そこで、事象発生から６４時間前までの時間範囲でコピー機能のログを優先して抽出するように、ログ抽出管理テーブル１１５の設定を行う。 Here, the log type in the log extraction management table 115 is the same as the log type exemplified in the log extraction management table 114.
In analyzing the error of the data copy function, it is preferable to extract a log relating to the copy function for as long as possible from the occurrence of the event in order to identify the cause from the background up to the error. Therefore, the log extraction management table 115 is set so that the log of the copy function is preferentially extracted in the time range from the occurrence of the event to 64 hours ago.

図２５は、第３の実施の形態のログ抽出の第２具体例を示す図である。図２５では、ログ抽出管理テーブル１１５に基づくログ抽出部１５０によるログ抽出を例示する。
図２５の例では、メッセージ“ｃｏｐｙｓｅｓｓｉｏｎｅｒｒｏｒ”に対するログ抽出について次の条件を考える。抽出量の上限値は、ページ１０個分（例えば、１ページのサイズが６４ＫＢの場合、６４ＫＢ×１０＝６４０ＫＢ）である。ログ抽出の時間範囲１（ｘ）は６４時間である。ログ抽出の時間範囲２（ｙ）は設定なしである。抽出対象のログタイプは、“ｔｙｐｅ２”および“ｔｙｐｅ７”である。ただし、図２５では、比較のために、ログタイプ“ｔｙｐｅ１”および“ｔｙｐｅ４”も図示している。ログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ７”の優先レベルは何れも“１”である。 FIG. 25 is a diagram illustrating a second specific example of log extraction according to the third embodiment. FIG. 25 illustrates log extraction by the log extraction unit 150 based on the log extraction management table 115.
In the example of FIG. 25, the following conditions are considered for log extraction for the message “copy session error”. The upper limit of the extraction amount is 10 pages (for example, when the size of one page is 64 KB, 64 KB × 10 = 640 KB). The log extraction time range 1 (x) is 64 hours. The log extraction time range 2 (y) is not set. The log types to be extracted are “type 2” and “type 7”. However, in FIG. 25, log types “type1” and “type4” are also illustrated for comparison. The priority levels of the log types “type 2” and “type 7” are both “1”.

また、ページリストＺ１１は、ログタイプ“ｔｙｐｅ１”のページリストである。ページリストＺ１１は、タイムスタンプの古い方から新しい方へ向かって、ページＡ１，Ａ２，Ａ３を含む。ページリストＺ１２は、ログタイプ“ｔｙｐｅ２”のページリストである。ページリストＺ１２は、タイムスタンプの古い方から新しい方へ向かって、ページＢ１，Ｂ２，Ｂ３を含む。ページリストＺ１３は、ログタイプ“ｔｙｐｅ４”のページリストである。ページリストＺ１３は、タイムスタンプの古い方から新しい方へ向かって、ページＣ１，Ｃ２，Ｃ３，Ｃ４を含む。ページリストＺ１４は、ログタイプ“ｔｙｐｅ７”のページリストである。ページリストＺ１４は、タイムスタンプの古い方から新しい方へ向かって、ページＤ１，Ｄ２，Ｄ３，Ｄ４，Ｄ５，Ｄ６，Ｄ７を含む。ただし、前述のように、ページリストＺ１１，Ｚ１３は、比較のために図示したものであり、ページの抽出対象ではない。 The page list Z11 is a page list of the log type “type1”. The page list Z11 includes pages A1, A2, and A3 from the oldest time stamp to the newest time stamp. The page list Z12 is a page list of the log type “type2”. The page list Z12 includes pages B1, B2, and B3 from the oldest time stamp to the newest time stamp. The page list Z13 is a page list of the log type “type4”. The page list Z13 includes pages C1, C2, C3, and C4 from the oldest time stamp to the newest time stamp. The page list Z14 is a page list of the log type “type7”. The page list Z14 includes pages D1, D2, D3, D4, D5, D6, and D7 from the oldest time stamp to the newest time stamp. However, as described above, the page lists Z11 and Z13 are illustrated for comparison, and are not pages to be extracted.

この場合、メッセージの検出時（障害発生時）を現在とすると、現在から６４時間前までがログ抽出対象の時間範囲である。なお、図２５の例では、ログ抽出の時間範囲２（ｙ）は設定なしなので、時間範囲２（ｙ）を用いたログ抽出は行われずに、時間範囲１（ｘ）を用いたログ抽出が行われる。図２５の例では、ページＢ１，Ｄ１以降のページにおけるタイムスタンプがログ抽出対象の時間範囲に含まれる。 In this case, assuming that the current message detection time (failure occurrence time) is 64 hours before the current time is the log extraction target time range. In the example of FIG. 25, since log extraction time range 2 (y) is not set, log extraction using time range 2 (y) is not performed, and log extraction using time range 1 (x) is performed. Done. In the example of FIG. 25, the time stamps in the pages after page B1, D1 are included in the time range for log extraction.

最も優先順位の高いログタイプ“ｔｙｐｅ２”、“ｔｙｐｅ７”の最新のページＤ７は、現在から６４時間前よりも後の時刻である（ただし、ここでは、優先レベル“１”のログタイプのみがログの抽出元候補である）。このため、ログ抽出部１５０は、ページＤ７を抽出する。そして、ログ抽出部１５０は、ページリストＺ１４からページＤ７を外す。 The latest page D7 of the log types “type 2” and “type 7” with the highest priority is the time after 64 hours from the current time (however, only the log type with the priority level “1” is logged here) Is a source candidate). For this reason, the log extraction unit 150 extracts the page D7. Then, the log extraction unit 150 removes the page D7 from the page list Z14.

以降のログ抽出処理でも、ログ抽出部１５０は、抽出候補のページが６４時間前の時刻よりも後の時刻であることを確認する。
２番目に、ログ抽出部１５０は、ページリストＺ１２，Ｚ１４に属する各ページのうち、最新のページＢ３を抽出する。そして、ログ抽出部１５０は、ページリストＺ１２からページＢ３を外す。 In the subsequent log extraction process, the log extraction unit 150 confirms that the extraction candidate page is later than the time 64 hours ago.
Second, the log extraction unit 150 extracts the latest page B3 from the pages belonging to the page lists Z12 and Z14. Then, the log extraction unit 150 removes the page B3 from the page list Z12.

３番目に、ログ抽出部１５０は、ページリストＺ１２，Ｚ１４に属する各ページのうち、最新のページＤ６を抽出する。そして、ログ抽出部１５０は、ページリストＺ１４からページＤ６を外す。 Third, the log extraction unit 150 extracts the latest page D6 from the pages belonging to the page lists Z12 and Z14. Then, the log extraction unit 150 removes the page D6 from the page list Z14.

以降、同様にして、ログ抽出部１５０は、ページリストＺ１２，Ｚ１４に属する各ページのうち、新しいページから古いページへ順に抽出する。
９番目に、ログ抽出部１５０は、ページリストＺ１２，Ｚ１４に属する各ページのうち、最新のページＤ１を抽出する。そして、ログ抽出部１５０は、ページリストＺ１４からページＤ１を外す。この段階で、ページリストＺ１４には、未抽出のページがなくなる。 Thereafter, similarly, the log extraction unit 150 sequentially extracts from the new page to the old page among the pages belonging to the page lists Z12 and Z14.
Ninth, the log extraction unit 150 extracts the latest page D1 from the pages belonging to the page lists Z12 and Z14. Then, the log extraction unit 150 removes the page D1 from the page list Z14. At this stage, there are no unextracted pages in the page list Z14.

１０番目に、ログ抽出部１５０は、ページリストＺ１２に属する各ページのうち、最新のページＢ１を抽出する。そして、ログ抽出部１５０は、ページリストＺ１２からページＢ１を外す。この段階で、ページリストＤ１２には、未抽出のページがなくなる。 Tenth, the log extraction unit 150 extracts the latest page B1 from the pages belonging to the page list Z12. Then, the log extraction unit 150 removes the page B1 from the page list Z12. At this stage, there are no unextracted pages in the page list D12.

ログ抽出部１５０は、ページリストＺ１２，Ｚ１４において、未抽出のページがなくなったことを検出し、ログ抽出を完了する。抽出ログＬ１ｇは、上記の処理によってログ抽出部１５０により抽出されたページＤ７，Ｂ３，Ｄ６，Ｂ２，Ｄ５，Ｄ４，Ｄ３，Ｄ２，Ｄ１，Ｂ１を含む。 The log extraction unit 150 detects that there are no unextracted pages in the page lists Z12 and Z14, and completes the log extraction. The extracted log L1g includes pages D7, B3, D6, B2, D5, D4, D3, D2, D1, and B1 extracted by the log extracting unit 150 by the above processing.

こうして、ＣＭ１００，２００，３００，４００は、コピー機能のエラーに適した調査用ログを抽出することができる。また、ＣＭ１００は、抽出された調査用ログを収集して、サポートサーバ６０に送信することで、コピー機能のエラー解析に有用な情報に絞った情報提供を行うことができる。また、余計な情報を送るよりも通信量を減らすことができる。 In this way, the CMs 100, 200, 300, and 400 can extract the investigation log suitable for the copy function error. Further, the CM 100 collects the extracted investigation logs and transmits them to the support server 60, thereby providing information focused on information useful for error analysis of the copy function. In addition, the amount of communication can be reduced compared to sending extra information.

なお、第１の実施の形態の情報処理は、処理部１ｂにプログラムを実行させることで実現できる。また、第２，第３の実施の形態の情報処理は、プロセッサ１０１にプログラムを実行させることで実現できる。ＣＭ１００は、プロセッサ１０１とＲＡＭ１０２とを備えたコンピュータを含むといえる。プログラムは、コンピュータ読み取り可能な記録媒体９１に記録できる。 The information processing according to the first embodiment can be realized by causing the processing unit 1b to execute a program. The information processing according to the second and third embodiments can be realized by causing the processor 101 to execute a program. It can be said that the CM 100 includes a computer including a processor 101 and a RAM 102. The program can be recorded on a computer-readable recording medium 91.

例えば、プログラムを記録した記録媒体９１を配布することで、プログラムを流通させることができる。また、プログラムを他のコンピュータに格納しておき、ネットワーク経由でプログラムを配布してもよい。コンピュータは、例えば、記録媒体９１に記録されたプログラムまたは他のコンピュータから受信したプログラムを、ＲＡＭ１０２やＢＵＤ１０６などの記憶装置に格納し（インストールし）、当該記憶装置からプログラムを読み込んで実行してもよい。 For example, the program can be distributed by distributing the recording medium 91 on which the program is recorded. Alternatively, the program may be stored in another computer and distributed via a network. For example, the computer stores (installs) a program recorded in the recording medium 91 or a program received from another computer in a storage device such as the RAM 102 or the BUD 106, and reads and executes the program from the storage device. Good.

１情報処理装置
１ａ記憶部
１ｂ処理部
２動作情報
３管理情報 DESCRIPTION OF SYMBOLS 1 Information processing apparatus 1a Storage part 1b Processing part 2 Operation information 3 Management information

Claims

Among the operation information including a plurality of log records related to a component of a predetermined device, a storage unit that stores, for each message, a time range of the log record to be extracted and a priority level for each type of the log record;
When detecting a message, referring to the storage unit, based on the time range from the current time according to the message and the priority level, a processing unit for extracting the log record from the operation information,
An information processing apparatus.

The information processing apparatus according to claim 1, wherein the processing unit links the plurality of log records in time series for each type, and determines an extraction order of the log records of each type based on a link between the log records. .

The storage unit stores, for each message, information indicating a calculation method of the upper limit value of the total size of the log records to be extracted,
When the processing unit detects the message, the processing unit refers to the storage unit and calculates the upper limit value based on the calculation method according to the message.
The information processing apparatus according to claim 1 or 2.

The processing unit prioritizes a first log record corresponding to a first priority level among log record groups belonging to the time range in the past from the current time, the priority order being lower than the priority order indicated by the first priority level. The information processing apparatus according to claim 3, wherein the information processing apparatus extracts the information more preferentially than the second log record corresponding to the second priority level indicating the ranking.

Upon receipt of the message, the processing unit instructs a plurality of information processing devices to extract the log record from the operation information for each information processing device,
When instructing the extraction of the log record, the upper limit value for each information processing device is determined based on the calculation method according to the message, and the determined upper limit value is notified to the plurality of information processing devices. ,
The information processing apparatus according to claim 3 or 4.

The storage unit stores, for each message, another time range indicating a period longer than the time range,
When the processing unit detects the message, the processing unit refers to the storage unit and extracts the log record from the operation information based on the past time range and the priority level from the current time according to the message. Then, another log record is extracted from the operation information based on the other time range and the priority level in the past from the current time according to the message.
The information processing apparatus according to any one of claims 1 to 5.

When a message is detected, the time range of the log record to be extracted and the priority level for each type of the log record are stored for each message among the operation information including a plurality of log records related to the components of the predetermined device. With reference to the storage unit, the log record is extracted from the operation information based on the time range and the priority level according to the message.
A program that causes a computer to execute processing.