JP2010198410A

JP2010198410A - Server failure prediction system

Info

Publication number: JP2010198410A
Application number: JP2009043520A
Authority: JP
Inventors: Nobukazu Shinomiya; 伸和篠宮
Original assignee: NEC Computertechno Ltd
Current assignee: NEC Computertechno Ltd
Priority date: 2009-02-26
Filing date: 2009-02-26
Publication date: 2010-09-09

Abstract

PROBLEM TO BE SOLVED: To prevent server failures by predicting in advance that a server may have an abnormality.

SOLUTION: The server records logs, each including the time at which an event occurred and the contents of the event, in a log area in time series. A maintenance server monitors the log area and collects a first log, which includes the time at which the server was powered on, and a second log, which includes the time at which the server became usable. The maintenance server calculates the difference between the time of the first log and the time of the second log as the time from power-on until the server becomes usable. When this time is equal to or longer than a set time, the maintenance server notifies maintenance personnel that the server may have an abnormality. The server failure prediction system thus predicts a possible abnormality in advance, taking into account the deterioration caused by long-term use of the server. Notifying maintenance personnel of that possibility makes it possible to prevent a server failure.

COPYRIGHT: (C)2010, JPO&INPIT

Description

The present invention relates to a server failure prediction system that monitors a server and predicts its failure.

FIG. 1 shows the configuration of a general system. This system includes a server 110 and a maintenance server 120. The maintenance server 120 is connected to the server 110. The server 110 includes a storage device 112, and the storage device 112 is provided with a log area 113.

The server 110 records logs 113-1, 113-2, ..., each including the time at which an event occurred and the contents of the event, in the log area 113 of the storage device 112 in time series. The maintenance server 120 monitors the log area 113. Suppose that a log whose event contents indicate an error is present; examples of such errors include a voltage abnormality or a temperature abnormality in a device. In this case, the maintenance server 120 notifies maintenance personnel of the abnormality of the server 110.

Conventionally, whether the server 110 is abnormal is determined on the basis of a single event. That is, maintenance personnel are notified only when an abnormality has already occurred. As a result, maintenance personnel begin maintenance work, such as replacing or repairing devices, only after an error has occurred. It is therefore desirable to predict in advance that a server may have an abnormality and thereby prevent the server from failing.

Documents relating to failure prediction and device monitoring are introduced below.

Japanese Patent Application Laid-Open No. 2001-312375 describes a failure prediction system for an external storage device. The system includes an external storage device, a customer computer that uses the external storage device, and a service provider computer connected to the customer computer via a communication network. The customer computer acquires inspection data on the usage status of the external storage device and transmits the data to the service provider computer via the communication network. The service provider computer predicts failure of the external storage device on the basis of the inspection data and transmits the result to the customer computer via the communication network.

Japanese Patent Application Laid-Open No. 2004-213621 describes a remote monitoring system. The remote monitoring system includes a first means that receives event information, including the normal or abnormal status of a monitored system, and reports an e-mail containing the received event information over a network path at regular or irregular intervals, and a second means that monitors both the monitored system and the state of the network path by receiving the e-mail from the first means.

Japanese Patent Application Laid-Open No. 2002-259130 describes an information processing system. The information processing system includes means for starting an operating system, means for detecting completion of the operating system startup, and means that measures the time elapsed since a start signal for starting the operating system was generated and controls switching of the boot device from which the operating system is started, based on whether completion of the startup was detected within a predetermined elapsed time after the start signal was generated.

Japanese Patent Application Laid-Open No. 2006-236524 describes an image processing apparatus. The image processing apparatus includes a magnetic storage means that can store programs and image data of the image processing apparatus and has an internal drive mechanism, and a control means that controls the image processing apparatus. The control means transmits a command to the magnetic storage means to acquire its failure diagnosis information and predicts failure of the internal drive mechanism of the magnetic storage means on the basis of the acquired information.

JP 2001-312375 A
JP 2004-213621 A
JP 2002-259130 A
JP 2006-236524 A

An object of the present invention is to provide a server failure prediction system that can prevent a server failure by predicting in advance that the server may have an abnormality.

The server failure prediction system of the present invention includes a server and a maintenance server connected to the server. The server includes a storage device and records logs, each including the time at which an event occurred and the contents of the event, in a log area of the storage device in time series. The maintenance server includes a monitoring unit, a calculation unit, and a notification unit. The monitoring unit monitors the log area and collects a first log including the time at which the server was powered on and a second log including the time at which the server became available. The calculation unit calculates a first time, which is the difference between the time of the first log and the time of the second log, as the time from when the server is powered on until the server becomes available. When the first time is equal to or longer than a first set time, the notification unit notifies maintenance personnel that the server may have an abnormality.

In the server failure prediction system of the present invention, the maintenance server predicts in advance whether the server may have an abnormality by determining whether the time from when the server is powered on until the server becomes available is equal to or longer than a set time. That is, the prediction takes into account the deterioration that results from long-term use of the server (an increase in the total startup time of the server caused by, for example, longer hard disk seek times or slower device response times due to heat). Notifying maintenance personnel that the server may have an abnormality therefore makes it possible to prevent a server failure.

FIG. 1 shows a general system configuration.
FIG. 2 shows the configuration of the server failure prediction system according to the embodiment of the present invention.
FIG. 3 is a flowchart showing the operation of the server failure prediction system according to the embodiment of the present invention.
FIG. 4 is a diagram for explaining the operation of the server failure prediction system according to the embodiment of the present invention.

Hereinafter, a server failure prediction system according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

FIG. 2 shows the configuration of the server failure prediction system according to the embodiment of the present invention. The system includes a server 10 and a maintenance server 20. The maintenance server 20 is connected to the server 10.

The server 10 is a computer and includes a CPU (Central Processing Unit) 11, a storage device 12, and a plurality of devices. Examples of the devices include memory devices such as a hard disk, and a chipset.

The storage device 12 is provided with an area that stores a computer program to be executed by the server 10, and with a log area 13. The CPU 11 reads the computer program from the storage device 12 and executes it, for example at startup.

The maintenance server 20 is a computer and includes a CPU 21, a storage device 22, a display device 27, and a speaker 28.

The storage device 22 is provided with an area that stores a computer program 23 to be executed by the CPU 21. The CPU 21 reads the computer program 23 from the storage device 22 and executes it, for example at startup.

The computer program 23 includes a monitoring unit 24, a calculation unit 25, and a notification unit 26.

FIG. 3 is a flowchart showing the operation of the server failure prediction system according to the embodiment of the present invention.

First, the operation of the server 10 will be described.

When the user powers on the server 10, the CPU 11 starts up the server 10 (step S1).

The CPU 11 records logs 13-1, 13-2, ..., each including the time at which an event occurred and the contents of the event, in the log area 13 of the storage device 12 in time series (step S2).
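As a rough illustration of step S2, the sketch below models the log area 13 as an append-only list of timestamped records. The names `LogEntry` and `LogArea` and the event labels are assumptions made for illustration only; the specification does not define a log format.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List


@dataclass(frozen=True)
class LogEntry:
    """One log record: when the event occurred and what it was."""
    time: datetime
    event: str  # hypothetical event label, e.g. "POWER_ON"


@dataclass
class LogArea:
    """Append-only stand-in for the log area 13 (illustrative only)."""
    entries: List[LogEntry] = field(default_factory=list)

    def record(self, time: datetime, event: str) -> None:
        # Entries are appended as events occur, so the list stays in time order.
        self.entries.append(LogEntry(time, event))


log_area = LogArea()
log_area.record(datetime(2009, 2, 26, 9, 0, 0), "POWER_ON")
log_area.record(datetime(2009, 2, 26, 9, 2, 30), "BOOT_COMPLETE")
```

Keeping the time and the event contents together in one record is what later allows the maintenance server to pick out the two logs it needs and subtract their times.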

While the user has not instructed the server 10 to shut down (step S3-NO), the CPU 11 repeats step S2. When the user instructs the server 10 to shut down, the CPU 11 shuts down the server 10 (step S3-YES).

Next, the operation of the maintenance server 20 will be described.

In addition to the operation of the maintenance server 120 described above (hereinafter referred to as error processing), the maintenance server 20 executes the following operation (hereinafter referred to as prediction processing).

In the error processing, the monitoring unit 24 monitors the log area 13. Suppose that a log whose event contents indicate an error is present; examples of such errors include a voltage abnormality or a temperature abnormality in a device. In this case, the notification unit 26 notifies maintenance personnel of the abnormality of the server 10.
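The error processing can be pictured as a simple filter over the recorded logs. The following is a minimal sketch under the assumption that error events carry recognizable labels; `ERROR_EVENTS`, `find_errors`, and the label strings are hypothetical and do not come from the specification.

```python
from typing import Iterable, List, Tuple

# Hypothetical labels for the error events named in the text.
ERROR_EVENTS = {"VOLTAGE_ABNORMAL", "TEMPERATURE_ABNORMAL"}


def find_errors(logs: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Return the (time, event) logs whose event contents indicate an error."""
    return [(time, event) for time, event in logs if event in ERROR_EVENTS]


logs = [
    ("2009-02-26T09:00:00", "POWER_ON"),
    ("2009-02-26T09:05:10", "TEMPERATURE_ABNORMAL"),
]
for time, event in find_errors(logs):
    print(f"Server abnormality at {time}: {event}")
```

Each log is judged on its own here, which is exactly the single-event behavior that the prediction processing is meant to supplement.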

The prediction processing is described next.

The monitoring unit 24 monitors the log area 13 and, as shown in FIG. 2, collects a first log that includes the time at which the user powered on the server 10 (hereinafter referred to as power-on log 13-1) and a second log that includes the time at which the server 10 became available (hereinafter referred to as startup completion log 13-j) (step S11).

The calculation unit 25 calculates a first time (hereinafter referred to as time Δt1), which is the difference between the time of the power-on log 13-1 and the time of the startup completion log 13-j, as the time from when the server 10 is powered on until the server 10 becomes available (the time required for a specific event) (step S12).
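Steps S11 and S12 amount to finding two logs and subtracting their timestamps. A minimal sketch follows, assuming logs are `(time, event)` pairs in time order; the function name `elapsed_between` and the event labels are illustrative assumptions, not part of the specification.

```python
from datetime import datetime, timedelta
from typing import Iterable, Optional, Tuple

Log = Tuple[datetime, str]  # (time, event contents)


def elapsed_between(logs: Iterable[Log], start_event: str,
                    end_event: str) -> Optional[timedelta]:
    """Return the time from a start log to the first end log that follows it.

    Returns None when no complete start/end pair is present.
    """
    start: Optional[datetime] = None
    for time, event in logs:
        if event == start_event:
            start = time  # remember the most recent start seen so far
        elif event == end_event and start is not None:
            return time - start
    return None


logs = [
    (datetime(2009, 2, 26, 9, 0, 0), "POWER_ON"),       # power-on log
    (datetime(2009, 2, 26, 9, 0, 40), "BIOS_INIT"),     # unrelated event
    (datetime(2009, 2, 26, 9, 2, 30), "BOOT_COMPLETE"), # startup completion log
]
dt1 = elapsed_between(logs, "POWER_ON", "BOOT_COMPLETE")
print(dt1.total_seconds())  # 150.0
```

Other events recorded between the two logs are simply skipped, so the calculation does not depend on how many intermediate logs the server writes.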

The notification unit 26 compares the time Δt1 with a predetermined first set time (hereinafter referred to as set time t1) (step S13).

When the time Δt1 is less than the set time t1 (step S13-NO), the maintenance server 20 executes step S11 again.

On the other hand, when the time Δt1 is equal to or longer than the set time t1 (step S13-YES), the notification unit 26 displays in text on the display device 27 that the server 10 may have an abnormality, outputs the same as sound from the speaker 28, and thereby notifies maintenance personnel (step S14).

If the maintenance server 20 is specified to end the prediction processing once maintenance personnel have been notified, it ends the prediction processing after executing step S14. Alternatively, if it is specified to keep detecting abnormalities of the server 10 even after notifying maintenance personnel once, it executes step S11 again (not shown).
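The loop over steps S11 to S14, including the two termination policies just described, might look like the sketch below. It assumes the startup times have already been computed in seconds; `predict`, `stop_after_first`, and the message wording are hypothetical names chosen for illustration, and the `notify` callback stands in for the display device and speaker.

```python
from typing import Callable, Iterable, List


def predict(startup_times_s: Iterable[float], set_time_s: float,
            notify: Callable[[str], None],
            stop_after_first: bool = True) -> int:
    """Compare each startup time with the set time; return the notification count."""
    count = 0
    for dt1 in startup_times_s:   # each item is one result of S11 and S12
        if dt1 < set_time_s:      # S13-NO: go back to monitoring
            continue
        # S13-YES: "equal to or longer than" the set time triggers S14.
        notify(f"Server may have an abnormality: startup took {dt1:.0f} s "
               f"(set time {set_time_s:.0f} s)")
        count += 1
        if stop_after_first:      # policy: end after the first notification
            break
    return count


messages: List[str] = []
sent = predict([95.0, 110.0, 180.0, 200.0], set_time_s=180.0,
               notify=messages.append)
print(sent, messages[0])
```

Passing `stop_after_first=False` corresponds to the alternative in which monitoring continues after a notification, so the 200-second startup in this example would also be reported.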

As described above, in the server failure prediction system according to the embodiment of the present invention, the maintenance server 20 predicts in advance whether the server 10 may have an abnormality by determining whether the time Δt1 from when the server 10 is powered on until the server 10 becomes available is equal to or longer than the set time t1. That is, the prediction takes into account the deterioration that results from long-term use of the server 10 (an increase in the total startup time of the server 10 caused by, for example, longer hard disk seek times or slower device response times due to heat). Notifying maintenance personnel that the server 10 may have an abnormality therefore makes it possible to prevent a failure of the server 10.

In addition, the maintenance server 20 executes the following operation for a specific device among the plurality of devices.

The monitoring unit 24 monitors the log area 13 and, as shown in FIG. 4, collects a third log that includes the time at which the specific device started up (hereinafter referred to as startup start log 13-x) and a fourth log that includes the time at which the specific device became available (hereinafter referred to as startup completion log 13-y) (step S11).

The calculation unit 25 calculates a second time (hereinafter referred to as time Δt2), which is the difference between the time of the startup start log 13-x and the time of the startup completion log 13-y, as the time from when the specific device starts up until it becomes available (the time required for a specific event) (step S12).

The notification unit 26 of the maintenance server 20 compares the time Δt2 with a predetermined second set time (hereinafter referred to as set time t2) (step S13).

When the time Δt2 is less than the set time t2 (step S13-NO), the maintenance server 20 executes step S11 again.

On the other hand, when the time Δt2 is equal to or longer than the set time t2 (step S13-YES), the notification unit 26 displays in text on the display device 27 that the specific device may have an abnormality, as a possible abnormality of the server 10, outputs the same as sound from the speaker 28, and thereby notifies maintenance personnel (step S14).

As described above, in the server failure prediction system according to the embodiment of the present invention, whether the specific device may have an abnormality is predicted in advance by determining whether the time Δt2 from when the specific device starts up until it becomes available is equal to or longer than the set time t2. Notifying maintenance personnel that the specific device may have an abnormality therefore makes it possible to prevent a failure of the server 10.
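The per-device check follows the same pattern with one set time per device. The sketch below assumes each log carries a device name and a phase marker; `slow_devices`, the `"START"`/`"READY"` markers, and the device names are hypothetical and serve only to illustrate the comparison of Δt2 with t2.

```python
from datetime import datetime, timedelta
from typing import Dict, Iterable, Tuple

DeviceLog = Tuple[datetime, str, str]  # (time, device name, "START" or "READY")


def slow_devices(logs: Iterable[DeviceLog],
                 set_times_s: Dict[str, float]) -> Dict[str, float]:
    """Map each device whose startup time reached its set time to that time."""
    started: Dict[str, datetime] = {}
    flagged: Dict[str, float] = {}
    for time, device, phase in logs:
        if phase == "START":
            started[device] = time
        elif phase == "READY" and device in started:
            dt2 = (time - started.pop(device)).total_seconds()
            limit = set_times_s.get(device)  # devices without a set time are skipped
            if limit is not None and dt2 >= limit:
                flagged[device] = dt2
    return flagged


t0 = datetime(2009, 2, 26, 9, 0, 0)
logs = [
    (t0, "hdd", "START"),
    (t0 + timedelta(seconds=5), "chipset", "START"),
    (t0 + timedelta(seconds=6), "chipset", "READY"),
    (t0 + timedelta(seconds=50), "hdd", "READY"),
]
print(slow_devices(logs, {"hdd": 30.0, "chipset": 5.0}))  # {'hdd': 50.0}
```

Tracking start times per device lets interleaved logs from different devices be handled in a single pass, and the result names the device that maintenance personnel should inspect.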

10 server,
11 CPU,
12 storage device,
13 log area,
13-1, 13-2, 13-j, 13-x, 13-y log,
20 maintenance server,
21 CPU,
22 storage device,
23 computer program,
24 monitoring unit,
25 calculation unit,
26 notification unit,
27 display device,
28 speaker,
110 server,
112 storage device,
113 log area,
113-1, 113-2 log,
120 maintenance server

Claims

A server comprising a storage device, and recording a log including the time when the event occurred and the contents of the event in a time series in the log area of the storage device;
A maintenance server connected to the server,
The maintenance server
A monitoring unit that monitors the log area and collects a first log including a time when the server is powered on and a second log including a time when the server is available;
A calculation unit that calculates a first time that is a difference between the time of the first log and the second log as a time from when the server is turned on until the server becomes available;
A server failure prediction system comprising: a notification unit that notifies a maintenance person that there is a possibility of an abnormality in the server when the first time is equal to or longer than a first set time.

The server further comprises a plurality of devices,
The monitoring unit monitors the log area and collects a third log including a time when a specific device of the plurality of devices is activated and a fourth log including a time when the specific device becomes available,
The calculation unit calculates a second time, which is a difference between the time of the third log and the time of the fourth log, as the time from when the specific device is activated until it becomes available,
The notification unit notifies maintenance personnel, when the second time is equal to or longer than a second set time, that the specific device may have an abnormality, as a possibility that the server has an abnormality. The server failure prediction system according to claim 1.

The maintenance server further comprises
A display device;
The server failure prediction system according to claim 1, wherein the notification unit displays in text on the display device that the server may have an abnormality, thereby notifying maintenance personnel.

The maintenance server
A speaker,
The server failure prediction system according to any one of claims 1 to 3, wherein the notification unit outputs a sound from the speaker that there is a possibility that the server is abnormal, and notifies a maintenance staff.

A maintenance server connected to a server that records a time series of logs including the time when an event occurs and the contents of the event in its own log area,
A monitoring unit that monitors the log area and collects a first log including a time when the server is powered on and a second log including a time when the server is available;
A calculation unit that calculates a first time that is a difference between the time of the first log and the second log as a time from when the server is turned on until the server becomes available;
A maintenance server comprising: a notification unit for notifying maintenance personnel that there is a possibility of an abnormality in the server when the first time is equal to or longer than a first set time.

The monitoring unit monitors the log area and collects a third log including a time when a specific device of a plurality of devices of the server is activated and a fourth log including a time when the specific device becomes available,
The calculation unit calculates a second time, which is a difference between the time of the third log and the time of the fourth log, as the time from when the specific device is activated until it becomes available,
The notification unit notifies maintenance personnel, when the second time is equal to or longer than a second set time, that the specific device may have an abnormality, as a possibility that the server has an abnormality. The maintenance server described in.

A display device;
The maintenance server according to claim 5 or 6, wherein the notification unit displays on the display device by text that there is a possibility that the server has an abnormality, and notifies a maintenance staff.

A speaker,
The maintenance server according to any one of claims 5 to 7, wherein the notification unit outputs a sound from the speaker that there is a possibility of an abnormality in the server, and notifies a maintenance staff.

A method of using a computer connected to a server that records a log including a time when an event occurs and contents of the event in a time series in its own log area,
Monitoring the log area and collecting a first log including a time when the server is powered on and a second log including a time when the server is available;
Calculating a first time, which is a difference between the time of the first log and the second log, as a time from when the server is turned on until the server becomes available;
A server failure prediction method comprising: notifying maintenance personnel that the server may be abnormal when the first time is equal to or longer than a first set time.

Monitoring the log area and collecting a third log including a time when a specific device of a plurality of devices of the server is activated and a fourth log including a time when the specific device becomes available;
Calculating a second time, which is a difference between the time of the third log and the time of the fourth log, as the time from when the specific device is activated until it becomes available;
Notifying maintenance personnel, when the second time is equal to or longer than a second set time, that the specific device may have an abnormality, as a possibility that the server has an abnormality. The server failure prediction method according to claim 9, further comprising the steps above.

The step of notifying the maintenance staff includes:
The server failure prediction method according to claim 9 or 10, wherein a message indicating that there is a possibility of an abnormality in the server is displayed on the display device and notified to maintenance personnel.

The step of notifying the maintenance staff includes:
The server failure prediction method according to any one of claims 9 to 11, wherein the possibility that the server has an abnormality is output as sound from the speaker to notify maintenance personnel.

A computer program that causes the computer to execute the server failure prediction method according to any one of claims 9 to 12.