JP2006268281A - Abnormality detection system in disconnecting power source input - Google Patents

Abnormality detection system in disconnecting power source input Download PDF

Info

Publication number
JP2006268281A
JP2006268281A JP2005083803A JP2005083803A JP2006268281A JP 2006268281 A JP2006268281 A JP 2006268281A JP 2005083803 A JP2005083803 A JP 2005083803A JP 2005083803 A JP2005083803 A JP 2005083803A JP 2006268281 A JP2006268281 A JP 2006268281A
Authority
JP
Japan
Prior art keywords
time
monitor
address
difference
memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
JP2005083803A
Other languages
Japanese (ja)
Inventor
Koji Nozu
光司 野津
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP2005083803A priority Critical patent/JP2006268281A/en
Publication of JP2006268281A publication Critical patent/JP2006268281A/en
Withdrawn legal-status Critical Current

Links

Images

Landscapes

  • Power Sources (AREA)
  • Debugging And Monitoring (AREA)

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem that since a message to be outputted by an OS is monitored in a conventional abnormality monitor system, it is impossible for the OS whose message output is small, or whose message output is not available to monitor the message (1), and that since it is necessary to start a prescribed program for monitor, and to secure an OS operating status for making it possible for the OS to communicate with the program for monitor, and particularly since a monitor program ends according to the start of the stop processing of the OS, it is impossible to detect any failure during stop processing (2). <P>SOLUTION: In this automatic operation control system for realizing the power source supply of each computer device 1 according to a schedule, when any abnormality is generated in the start and stop processing of the compute device whose power source is controlled, the abnormality is detected by a monitoring part, and a staff is notified of the result. Thus, it is possible to eliminate any influence on a scheduled system operation. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、コンピュータの異常監視処理に関し、特に、コンピュータの立ち上げ、立ち下げ処理において決められた主記憶メモリのアドレスについてそのアドレスへのアクセス時刻を前回のアクセス時刻と比較して異常の有無を検出する電源入力切断時の異常検知方式に関する。   The present invention relates to an abnormality monitoring process for a computer, and in particular, compares the access time to the address of the main memory determined in the startup and shutdown process of the computer with the previous access time to determine whether there is an abnormality. The present invention relates to an abnormality detection method at the time of power supply input detection to be detected.

従来のコンピュータの故障などの監視システムは、コンピュータの立ち上げ時に監視用プログラムを起動し立ち上げ時刻を基準時刻と比較することで異常を検出したりしている。例えば特許文献1にはOSの立ち上げおよび立ち下げ時に出力するメッセージを監視し、前回同一のメッセージが出力された時間との差分を調べ、差分があれば異常と判断し、保守センタに通知するという方法が開示されている
特開2000-330831号公報
In a conventional monitoring system for a computer failure or the like, an abnormality is detected by starting a monitoring program when the computer is started up and comparing the startup time with a reference time. For example, Patent Document 1 monitors messages output at the time of OS startup and shutdown, examines the difference from the time when the same message was output last time, and if there is a difference, determines that it is abnormal and notifies the maintenance center. The method is disclosed
JP 2000-330831 A

しかしながら、この特許文献1における従来技術はOSの出力するメッセージを監視しているため、メッセージの出力が少ないOSや、メッセージを出力しないOSなどでは適用できなかった。また、特定の監視用プログラムを起動し、かつOSが監視用プログラムと通信するためにOS稼働状態でなければならなく、特に立ち下げ時はOSの立ち下げ処理開始とともに監視プログラムも終了してしまうため、立ち下げ処理中の障害は検知不可能であった。   However, since the prior art in Patent Document 1 monitors messages output by the OS, it cannot be applied to an OS that outputs few messages or an OS that does not output messages. In addition, a specific monitoring program must be started and the OS must be in an operating state in order for the OS to communicate with the monitoring program. In particular, at the time of shutdown, the monitoring program ends with the start of OS shutdown processing. Therefore, the failure during the shutdown process could not be detected.

この発明は、自動運転を行うコンピュータシステムにおいて、コンピュータシステムの立ち上げおよび立ち下げ時に処理に必要なメモリアドレスにアクセスした時の立ち上げおよび立ち下げ処理の開始からの時間を記憶しておき、前回のシステム立ち上げおよび立ち下げ時に記憶していた時間を比較し、差分があった場合は差分情報をファイルに保存するとともに、障害と判断し保守センタに自動通報する。これによって、すべてのOSに対し、コンピュータシステムの立ち上げおよび立ち下げ時の状況を段階的に監視することが可能となり、特に立ち下げ時に関してはOS立ち下げ終了の直前まで監視することが可能とする電源入力切断時の異常検知方式を提供するものである。   The present invention, in a computer system that performs automatic operation, memorizes the time from the start of startup and shutdown processing when accessing a memory address required for processing during startup and shutdown of the computer system, The time stored at the time of system startup and shutdown is compared, and if there is a difference, the difference information is saved in a file, and it is determined that there is a failure and is automatically notified to the maintenance center. As a result, it is possible to monitor the startup and shutdown status of the computer system in stages for all the OSs, and in particular, it is possible to monitor the shutdown until immediately before the OS shutdown ends. An abnormality detection method at the time of power input disconnection is provided.

本発明の電源入力切断時の異常検知方式は、コンピュータの立ち上げ時又は立ち下げ時に、前記コンピュータの主記憶メモリの予め設定されたアドレスへのアクセス時刻を前記アドレスとともに不揮発性メモリに記憶する記憶手段と、立ち上げ立ち下げ処理の終了時に前回の立ち上げ立ち下げ処理で記憶された情報を前記不揮発性メモリから読み出す読み出し手段と、前記記憶手段により記憶した今回の情報と、前記読み出し手段で読み出した前回の情報について同じアドレスを持つ時刻の差分をとり前記差分を予め設定された閾値と比較する比較手段と、前記差分が前記閾値を越えると保守センタに前記差分を通知する通知手段と、を備えることを特徴とする。   According to the present invention, the abnormality detection method at the time of power-off is a memory that stores the access time to a preset address in the main memory of the computer together with the address in the nonvolatile memory when the computer is started up or shut down. Means, reading means for reading out information stored in the previous start-up / down process at the end of the start-up / down process from the non-volatile memory, current information stored in the storage means, and reading out by the read-out means Comparing means for taking a time difference having the same address for the previous information and comparing the difference with a preset threshold value, and notifying means for notifying the difference to the maintenance center when the difference exceeds the threshold value. It is characterized by providing.

本発明によれば、スケジュールに従って各コンピュータ装置電源制御を実現している自動運転制御システムにおいて、従来の監視用プログラムでは監視困難であったコンピュータ装置のOS立ち上げおよび立ち下げ処理中の異常を検知が可能となり、さらに、立ち上げおよび立ち下げ時に出力するメッセージがないOSでも異常を検知することが可能であり、予定していたシステム運用に支障をきたす事態を回避することができる。   According to the present invention, in an automatic operation control system that realizes power control of each computer device according to a schedule, an abnormality during the startup and shutdown processing of a computer device that has been difficult to monitor with a conventional monitoring program is detected. In addition, it is possible to detect an abnormality even in an OS that does not have a message to be output at the time of start-up and shut-down, and a situation in which a planned system operation is hindered can be avoided.

次に、本発明を実施するための最良の形態について図面を参照して詳細に説明する。   Next, the best mode for carrying out the present invention will be described in detail with reference to the drawings.

図1は本発明を使用した自動運転制御システムの一実施の形態を示す。同図において、コンピュータ装置1は、演算処理を行うEPU11と、主記憶メモリ12と、OS(オペレーティングシステム)の立ち上げおよび立ち下げ処理の開始の監視やEPUからのメモリアクセスを監視する監視部13と、磁気ディスク装置10と、コンピュータ1にケーブルで接続され、保守センタ3への異常の通知を行うSVP2と、を有する。   FIG. 1 shows an embodiment of an automatic driving control system using the present invention. In the figure, a computer apparatus 1 includes an EPU 11 that performs arithmetic processing, a main memory 12, and a monitoring unit 13 that monitors start of OS (operating system) startup and shutdown processing and memory access from the EPU. And a magnetic disk device 10 and an SVP 2 that is connected to the computer 1 with a cable and notifies the maintenance center 3 of an abnormality.

監視部13は、立ち上げ、立ち下げ処理においてアクセスされる予め決められた1以上のアドレスを記憶し、このアドレスへのアクセスを監視するメモリアクセス監視手段131と、メモリアクセス監視手段131からアクセスされたアドレスの通知を受けるとその時の時刻をOSの計時機構から取り出し、通知されたアドレスと時刻とを磁気ディスク装置10の時刻履歴ファイル134に記憶するメモリアドレス記憶手段132と、立ち上げ立ち下げ処理が終了すると、立ち上げおよび立ち下げ処理中での同じメモリアドレスをアクセスした今回と前回の情報を時刻履歴ファイル134から読み込み、同じアドレスについての時刻を比較するメモリアドレス比較手段133と、前回との差分がある場合、その差分情報を時刻履歴ファイル134に記憶するとともに、差分情報をSVP2に通知する差分情報通報手段135と、からなる。時刻履歴ファイル134は、複数回の立ち上げ立ち下げ時のアドレスと時刻情報が記憶でき、ファイルの最後まで使用されると、ファイルの先頭から記憶が行われるようになっている。   The monitoring unit 13 stores one or more predetermined addresses to be accessed in the startup and shutdown processes, and is accessed from the memory access monitoring unit 131 that monitors access to these addresses, and the memory access monitoring unit 131. When the notification of the address is received, the time at that time is taken out from the timing mechanism of the OS, the memory address storage means 132 for storing the notified address and time in the time history file 134 of the magnetic disk device 10, and start-up / down processing Is completed, the memory address comparing means 133 that reads the current and previous information accessed from the time history file 134 and compares the time for the same address, and the previous address, If there is a difference, the difference information is stored in the time history file 1 Stores 4, the difference information notification means 135 for notifying the difference information to the SVP2, consists. The time history file 134 can store the address and time information at the time of starting and falling a plurality of times, and when it is used to the end of the file, it is stored from the beginning of the file.

メモリアクセス監視手段131は、EPUファームウェア(FW)機能を含み、このEPUFWが立ち上げ時、立ち下げ時にOSによってアクセスされたアドレスを監視する。   The memory access monitoring unit 131 includes an EPU firmware (FW) function, and monitors an address accessed by the OS when the EPUFW is started up or shut down.

次に、本発明を実施するための最良の形態の動作について図面を参照して説明する。   Next, the operation of the best mode for carrying out the present invention will be described with reference to the drawings.

メモリアクセス監視手段131は、OSの立ち上げまたは立ち下げ処理が開始すると、EPU11内のEPUFWがOSの立ち上げおよび立ち下げ処理の実行時にアクセスが行われる、常駐領域の予め決められた1以上のメモリアドレスへのアクセスの監視を開始する。   When the OS startup or shutdown process is started, the memory access monitoring means 131 is accessed by the EPUFW in the EPU 11 when the OS startup or shutdown process is executed. Start monitoring access to memory addresses.

メモリアクセス監視手段131が、コンピュータ装置1の立ち上げの開始を検出し、予め決められたアドレスへのアクセスされたことを検出すると、メモリアドレス記憶手段132は、メモリアクセス監視手段131によりコンピュータ装置1の立ち上げ時のメモリのアドレスの記録の指示を受ける。   When the memory access monitoring unit 131 detects the start of startup of the computer apparatus 1 and detects that a predetermined address has been accessed, the memory address storage unit 132 uses the memory access monitoring unit 131 to detect the computer apparatus 1. Receives an instruction to record the address of the memory at the start-up.

メモリアドレス記憶手段132は、立ち上げ終了までにメモリアクセス監視手段132から通知されるアドレスを時刻情報とともに時刻履歴ファイル134に記憶する。   The memory address storage unit 132 stores the address notified from the memory access monitoring unit 132 by the time history file 134 together with the time information before the start-up is completed.

また、同様の要領で、メモリアクセス監視手段131からコンピュータ装置1の立ち下げ開始が検出された場合も、メモリアドレス記憶手段132は、メモリアクセス監視手段131により立ち下げ終了までのアクセスされたアドレスを時刻情報とともに記憶する。   In the same manner, when the memory access monitoring unit 131 detects that the computer device 1 has started to fall, the memory address storage unit 132 also stores the address accessed by the memory access monitoring unit 131 until the end of the fall. Store with time information.

メモリアドレス比較手段133は、立ち上げおよび立ち下げ処理が終了すると、時刻履歴ファイル134から前回の立ち上げ、立ち下げ時にアクセスしたメモリアドレスと時刻情報と、今回の立ち上げおよび立ち下げ時にアクセスしたメモリアドレスと時刻情報を取り出す。立ち上げ処理時には、前回の立ち上げ時の情報、立ち下げ処理時には、前回の立ち下げ処理時の情報となる。   When the startup and shutdown processing is completed, the memory address comparing means 133 reads the memory address and time information accessed at the previous startup and shutdown from the time history file 134, and the memory accessed at the current startup and shutdown. Retrieve address and time information. At the time of start-up processing, information at the time of the previous start-up processing is used, and at the time of the start-up processing, information at the time of the previous start-up processing is used.

メモリアドレス比較手段133は、同じアドレスについて時刻情報を比較し、予め決められた閾値以上の差分があった場合に時刻履歴ファイル134にその差分を格納する。   The memory address comparison unit 133 compares time information for the same address, and stores the difference in the time history file 134 when there is a difference equal to or greater than a predetermined threshold.

時刻履歴ファイル134に格納された閾値以上の差分情報がある場合、差分情報通報手段135はこの差分情報を故障、あるいは障害の予兆情報としてSVP2に通知する。SVP2はこれを保守センタ3に対して通知する。   If there is difference information equal to or greater than the threshold stored in the time history file 134, the difference information reporting unit 135 notifies the SVP 2 of this difference information as failure or failure sign information. The SVP 2 notifies this to the maintenance center 3.

スケジュールに従った運転を行うコンピュータの立ち上げ時、立ち下げ時の異常を検出する保守業務に適用することができる。   The present invention can be applied to maintenance work that detects an abnormality at the time of startup or shutdown of a computer that operates according to a schedule.

本発明の実施例の構成を説明するブロック図である。It is a block diagram explaining the structure of the Example of this invention. 本発明の実施例の動作を表した説明図である。It is explanatory drawing showing operation | movement of the Example of this invention.

符号の説明Explanation of symbols

1 コンピュータ装置
11 EPU
12 メモリ
13 監視部
131 メモリアクセス監視手段
132 メモリアドレス記憶手段
133 メモリアドレス比較手段
134 時刻履歴ファイル
135 差分情報通報手段
10 磁気ディスク装置
2 SVP
3 保守センタ
1 Computer equipment 11 EPU
DESCRIPTION OF SYMBOLS 12 Memory 13 Monitoring part 131 Memory access monitoring means 132 Memory address memory | storage means 133 Memory address comparison means 134 Time history file 135 Difference information reporting means 10 Magnetic disk apparatus 2 SVP
3 Maintenance Center

Claims (1)

コンピュータの立ち上げ時又は立ち下げ時に、前記コンピュータの主記憶メモリの予め設定されたアドレスへのアクセス時刻を前記アドレスとともに不揮発性メモリに記憶する記憶手段と、立ち上げ立ち下げ処理の終了時に前回の立ち上げ立ち下げ処理で記憶された情報を前記不揮発性メモリから読み出す読み出し手段と、前記記憶手段により記憶した今回の情報と、前記読み出し手段で読み出した前回の情報について同じアドレスを持つ時刻の差分をとり前記差分を予め設定された閾値と比較する比較手段と、前記差分が前記閾値を越えると保守センタに前記差分を通知する通知手段と、を備えることを特徴とする電源入力切断時の異常検知方式。
When starting up or shutting down the computer, storage means for storing the access time to a preset address in the main memory of the computer together with the address in the non-volatile memory; Read means for reading out the information stored in the start-up / down process from the nonvolatile memory, the current information stored by the storage means, and the difference between the times having the same address for the previous information read by the read means And a comparison means for comparing the difference with a preset threshold value, and a notification means for notifying the difference to the maintenance center when the difference exceeds the threshold value. method.
JP2005083803A 2005-03-23 2005-03-23 Abnormality detection system in disconnecting power source input Withdrawn JP2006268281A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2005083803A JP2006268281A (en) 2005-03-23 2005-03-23 Abnormality detection system in disconnecting power source input

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2005083803A JP2006268281A (en) 2005-03-23 2005-03-23 Abnormality detection system in disconnecting power source input

Publications (1)

Publication Number Publication Date
JP2006268281A true JP2006268281A (en) 2006-10-05

Family

ID=37204218

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005083803A Withdrawn JP2006268281A (en) 2005-03-23 2005-03-23 Abnormality detection system in disconnecting power source input

Country Status (1)

Country Link
JP (1) JP2006268281A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010092127A (en) * 2008-10-03 2010-04-22 Fujitsu Ltd Computer system, processor diagnostic method, and processor diagnosis control program
JP2013045154A (en) * 2011-08-22 2013-03-04 Nec Corp Information processing apparatus, information processing system, abnormality sign detection method for information processing apparatus, and abnormality sign detection program

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010092127A (en) * 2008-10-03 2010-04-22 Fujitsu Ltd Computer system, processor diagnostic method, and processor diagnosis control program
JP2013045154A (en) * 2011-08-22 2013-03-04 Nec Corp Information processing apparatus, information processing system, abnormality sign detection method for information processing apparatus, and abnormality sign detection program
US9256486B2 (en) 2011-08-22 2016-02-09 Nec Corporation Information processing apparatus detecting sign of abnormality, fault tolerant information processing system, method for detecting sign of abnormality in information processing apparatus and recording medium

Similar Documents

Publication Publication Date Title
JP2005092859A (en) Information processing device and method, recording medium and program
CN110445638B (en) Switch system fault protection method and device
US20200033928A1 (en) Method of periodically recording for events
JP2017187992A (en) Control device, control method and program
JP2010086364A (en) Information processing device, operation state monitoring device and method
JP5041290B2 (en) PROGRAMMABLE CONTROLLER AND ITS ERROR RECOVERY METHOD
US6598170B1 (en) Power supply control based on preset schedule with independent schedule monitor and backup system for executing schedule operation when malfunction occurs
JP4886558B2 (en) Information processing device
CN113468617A (en) Method, equipment and storage medium for power failure protection of equipment
JP2006268281A (en) Abnormality detection system in disconnecting power source input
JP2008225697A (en) Power source control device
JP2006221484A (en) Temperature anomaly monitoring method for computer
JP2007185022A (en) Fan controller
JPH10307635A (en) Computer system and temperature monitoring method applied to the same system
JP2010198410A (en) Server failure prediction system
JP2005122424A (en) Watchdog timer
JP5422426B2 (en) Information processing device
CN112084085B (en) System outage recording method
JP2009205276A (en) Watchdog timer control device, watchdog timer control method, program, and cpu system
JP2755159B2 (en) Self-diagnosis method for information processing equipment
JP2006085497A (en) Cooling method, cooling system, and cooling program
JP2020190800A (en) Monitoring control device, monitoring control method, computer, and program
JP6168238B2 (en) Video recording / reproducing apparatus, surveillance recorder apparatus, and video recording / reproducing method
JP6410015B2 (en) Information processing device
JP2022030028A (en) Information processing device, information processing method, information processing system, and program

Legal Events

Date Code Title Description
A300 Withdrawal of application because of no request for examination

Free format text: JAPANESE INTERMEDIATE CODE: A300

Effective date: 20080603