JP2000207291A

JP2000207291A - Method for diagnosing memory device

Info

Publication number: JP2000207291A
Application number: JP609199A
Authority: JP
Inventors: Kazuhiro Hiuga; 一弘日向; Hiroaki Fukumaru; 広昭福丸
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1999-01-13
Filing date: 1999-01-13
Publication date: 2000-07-28

Abstract

PROBLEM TO BE SOLVED: To obtain a method for diagnosing the fault of a memory element by referring to a memory address where single-bit error occurrence is recorded and deciding that a control circuit in a memory element has gone out of order, when the memory addresses for single-bit errors generated in a prescribed time are more than a prescribed number. SOLUTION: When the type of the latest error in an error log file is '01', a memory error file is referred to (30). The memory address of the last error log in the memory error file is compared with the memory address of the latest error (40), and when the addresses do not match each other, the current number of memory addresses is compared with a previously set prescribed value (60). When prescribed value is exceeded, it is decided that the control circuit of the memory element is out of order (70) on condition that the generation time difference between the oldest error log and the latest error log in the memory error file is within the previously set prescribed time, thereby outputting an alarm (80). Then all the error logs in the memory error file are erased (90).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は主メモリ及びＥＣＣ
訂正機構を備える計算機に係り、メモリの診断方法に関
する。The present invention relates to a main memory and an ECC.
The present invention relates to a computer having a correction mechanism and a method of diagnosing a memory.

【０００２】[0002]

【従来の技術】従来から電力，交通，公共などの産業分
野では信頼性の高い産業用計算機が利用されている。こ
のような計算機は故障が発生しないように工夫されると
ともに、仮に故障が発生したとしてもその故障が軽度の
場合は処理を続行すると共に重度故障に至る前に予防保
全を行う機能を持つ必要がある。このような要求に応え
るため、例えばメモリシングルビットエラーでは従来で
はＥＣＣ訂正機構により読み出したデータの訂正を行う
と共に訂正したデータをメモリに書き込み、再度同じメ
モリアドレスでシングルビットエラーが発生した場合は
素子故障と判断する診断手段を備えていた。2. Description of the Related Art Conventionally, highly reliable industrial computers have been used in industrial fields such as electric power, transportation, and the public. Such computers must be devised so that no failure occurs, and even if a failure occurs, if the failure is minor, it must have a function to continue processing and perform preventive maintenance before a severe failure occurs. is there. In order to respond to such a request, for example, in the case of a memory single-bit error, conventionally, data read by an ECC correction mechanism is corrected, and the corrected data is written to a memory. It had a diagnostic means for determining a failure.

【０００３】しかしながら、訂正したデータを書き込む
ための付加機構が必要であり、また書き込み動作による
性能低下が発生する。[0003] However, an additional mechanism for writing the corrected data is required, and the write operation causes a reduction in performance.

【０００４】近年、産業用計算機には信頼性に加えて性
能が求められており、汎用の高性能なプロセッサを利用
するケースが増えているが、これらのプロセッサは標準
のメモリインタフェースを備えており、多くの場合上記
のような付加機構は持たず、また性能重視のため従来の
ような診断方法は採れない。In recent years, industrial computers have been required to have high performance in addition to reliability. In many cases, general-purpose high-performance processors are used. These processors have a standard memory interface. However, in many cases, the above-mentioned additional mechanism is not provided, and the conventional diagnostic method cannot be adopted because performance is emphasized.

【０００５】また、特開平6−161796 号公報や特開平10
−133899号公報のように特別な付加機構を持たず診断プ
ログラムによりメモリＥＣＣエラーの発生回数をカウン
トし、しきい値監視する方法が知られているが、これら
は素子のソリッド故障を判断するには至っていない。Further, Japanese Patent Application Laid-Open Nos. Hei 6-161796 and
As described in JP-A-133899, there is known a method of counting the number of occurrences of a memory ECC error by a diagnostic program without a special additional mechanism and monitoring a threshold value. Has not been reached.

【０００６】[0006]

【発明が解決しようとする課題】本発明の目的は、ＥＣ
Ｃ訂正したデータを書き込むための付加機構等の特別な
機構を持たない汎用の計算機において、メモリの素子故
障を診断する方法を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide an EC
It is an object of the present invention to provide a method for diagnosing a memory element failure in a general-purpose computer having no special mechanism such as an additional mechanism for writing C-corrected data.

【０００７】[0007]

【課題を解決するための手段】本発明は、上記課題を解
決するために、シングルビットエラーの発生を時系列的
に記憶し、復電によっても揮発しない記憶手段，復電の
発生を時系列的に記憶する手段，シングルビットエラー
及び復電の発生を監視する手段，シングルビットエラー
が発生したアドレス，過去に発生したエラーのメモリア
ドレス，復電発生の有無によりメモリのシングルビット
エラーが素子故障によるものかα線等の外的要因による
ものかを判断する手段を持ち、請求項１に示す方法によ
り、ＥＣＣ訂正したデータを書き込むための付加機構等
の特別な機構を持たない汎用の計算機において、メモリ
の素子故障を診断することを可能とした。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, the present invention is to store a single bit error occurrence in a time-series manner, a storage means which does not volatilize even when the power is restored, and a time-series in which the occurrence of a power recovery is performed. Means for temporarily storing data, means for monitoring the occurrence of single-bit errors and power recovery, addresses at which single-bit errors have occurred, memory addresses for errors that have occurred in the past, and single-bit errors in memory depending on whether power recovery has occurred. A general-purpose computer having means for judging whether it is due to an external factor such as α-rays or the like, and having no special mechanism such as an additional mechanism for writing ECC-corrected data by the method according to claim 1. This makes it possible to diagnose a memory element failure.

【０００８】[0008]

【発明の実施の形態】図１に、本発明を適用する計算機
を示す。この計算機はプロセッサ１０，プロセッサから
のアクセスを制御するシステムコントローラ２０，主メ
モリ３０，入出力機器を制御する入出力コントローラ４
０，システムコントローラ２０とプロセッサ１０，主メ
モリ３０，入出力コントローラ４０をそれぞれ接続する
ためのプロセッサバス５０，メモリバス６０，システム
バス７０を有している。プロセッサ１０からの主メモリ
３０，入出力コントローラ４０へのアクセスはシステム
コントローラ２０を経由して行われる。FIG. 1 shows a computer to which the present invention is applied. The computer includes a processor 10, a system controller 20 for controlling access from the processor, a main memory 30, and an input / output controller 4 for controlling input / output devices.
0, a processor bus 50, a memory bus 60, and a system bus 70 for connecting the system controller 20 to the processor 10, the main memory 30, and the input / output controller 40, respectively. Access to the main memory 30 and the input / output controller 40 from the processor 10 is performed via the system controller 20.

【０００９】図２に、システムコントローラ２０，主メ
モリ３０の構成を示す。FIG. 2 shows the configuration of the system controller 20 and the main memory 30.

【００１０】システムコントローラ２０と主メモリ３０
はメモリバス６０で接続され、メモリバス６０は列アド
レス有効信号６１，行アドレス有効信号６２，読み出し
／書込み信号６３，行，列アドレス信号６４，データ信
号６５から構成される。The system controller 20 and the main memory 30
Are connected by a memory bus 60, which comprises a column address valid signal 61, a row address valid signal 62, a read / write signal 63, a row and column address signal 64, and a data signal 65.

【００１１】システムコントローラ２０はプロセッサバ
スのアドレス信号５１から列アドレス有効信号６１，行
アドレス有効信号６２，読み出し／書込み信号６３を生
成する制御回路２１，アドレスバッファ２２，プロセッ
サバスのデータ信号５２からＥＣＣコードを生成するＥ
ＣＣエンコード回路２４，書込みデータバッファ２３，
メモリバスのデータ信号６５からのデータ読み出しにお
いてＥＣＣによりシングルビットエラーの検出を行うＥ
ＣＣエラー検出回路２５，シングルビットエラーが発生
した場合にＥＣＣコードをデコードし、データの訂正を
行うＥＣＣデコード回路２６，読み出しデータバッファ
２７より構成される。The system controller 20 includes a control circuit 21 for generating a column address valid signal 61, a row address valid signal 62, and a read / write signal 63 from an address signal 51 of the processor bus, an address buffer 22, and an ECC from a data signal 52 of the processor bus. Generate code E
CC encoding circuit 24, write data buffer 23,
E for detecting a single bit error by ECC when reading data from the data signal 65 of the memory bus
It comprises a CC error detecting circuit 25, an ECC decoding circuit 26 for decoding an ECC code when a single bit error occurs and correcting data, and a read data buffer 27.

【００１２】主メモリ３０はダイナミックＲＡＭ等のメ
モリ素子より構成される。The main memory 30 comprises a memory element such as a dynamic RAM.

【００１３】次にシステムコントローラ２０と主メモリ
３０の動作について説明する。メモリへの書込みは図１
に示したプロセッサ１０がプロセッサバスのアドレス信
号５１及びデータ信号５２に書込み先のメモリアドレス
及びデータを出力することで行われる。アドレスは制御
回路２１によってメモリ素子を特定する列アドレス有効
信号６１，行アドレス有効信号６２に変換され、読み出
し／書込み信号６３は書込みを表わすＬレベルとなる。
データはＥＣＣエンコード回路２４によってＥＣＣコー
ドを付加され、書込みデータバッファ２３に一旦保持さ
れ、前記の列アドレス有効信号６１，行アドレス有効信
号６２，読み出し／書込み信号６３の確定に合わせて主
メモリ３０に書き込まれる。Next, the operation of the system controller 20 and the main memory 30 will be described. Figure 1
Is performed by outputting the write destination memory address and data to the processor bus address signal 51 and data signal 52. The address is converted by the control circuit 21 into a column address valid signal 61 and a row address valid signal 62 for specifying a memory element, and the read / write signal 63 goes to L level representing write.
The data is added with an ECC code by the ECC encoding circuit 24, temporarily stored in the write data buffer 23, and stored in the main memory 30 in accordance with the determination of the column address valid signal 61, the row address valid signal 62, and the read / write signal 63. Written.

【００１４】また読み出し時は読み出し／書込み信号６
３は読み出しを表わすＨレベルとなり、メモリバスのデ
ータ信号６５に出力されたデータはＥＣＣエラー検出回
路２５によりチェックされ、エラーが無ければそのまま
読み出し、エラーが有った場合にはＥＣＣデコード回路
２６によりデータを訂正し、データバッファ２７に保持
され、プロセッサ１０に読み込まれる。At the time of reading, a read / write signal 6
3 is at the H level indicating read, and the data output to the data signal 65 of the memory bus is checked by the ECC error detection circuit 25. If there is no error, the data is read as it is. If there is an error, the data is output by the ECC decode circuit 26. The data is corrected, held in the data buffer 27, and read by the processor 10.

【００１５】このようにシステムコントローラ２０はメ
モリシングルビットエラーが発生した場合にＥＣＣ訂正
機構により読み出しデータのみ訂正し、主メモリ上のデ
ータは訂正しない。As described above, when a memory single-bit error occurs, the system controller 20 corrects only read data by the ECC correction mechanism, and does not correct data in the main memory.

【００１６】図３に図１の計算機上で動作するプログラ
ムとデータの構成を示す。エラー発生監視プログラムは
メモリのシングルビットエラーの発生及び復電を監視し
ており、それらのイベントが発生すると発生時刻，エラ
ー種別，メモリアドレス等の補助情報をエラーログファ
イル３００に記録する。エラーログファイル３００は不
揮発性の記憶装置に構成され、本実施例の場合は固定磁
気ディスク装置である。FIG. 3 shows the structure of programs and data operating on the computer shown in FIG. The error occurrence monitoring program monitors the occurrence of a single-bit error in the memory and the restoration of power, and when such an event occurs, records auxiliary information such as an occurrence time, an error type, and a memory address in the error log file 300. The error log file 300 is configured in a nonvolatile storage device, and in the case of the present embodiment, is a fixed magnetic disk device.

【００１７】メモリ診断プログラム２００はエラーログ
ファイル３００を参照し、シングルビットエラーのログ
があった場合にはメモリエラーファイル４００に一時的
に記録する。メモリエラーファイル４００も同様に不揮
発性の記憶装置に構成され、本実施例の場合は固定磁気
ディスク装置である。The memory diagnostic program 200 refers to the error log file 300, and if there is a single-bit error log, temporarily records it in the memory error file 400. The memory error file 400 is similarly configured in a nonvolatile storage device, and in the case of the present embodiment, is a fixed magnetic disk device.

【００１８】図４にエラーログファイル３００の記録フ
ォーマットを示す。１件あたりのエラー情報はエラー発
生時刻，エラー発生時刻順に割り付けるエラーＮo.，エ
ラー種別，エラーに関する補助情報により構成される。
ここでエラー種別‘０１’はメモリシングルビットエラ
ー、‘０２’は復電ログを表わす。FIG. 4 shows the recording format of the error log file 300. The error information per case is composed of an error occurrence time, an error number assigned in the order of the error occurrence time, an error type, and auxiliary information on the error.
Here, the error type '01' indicates a memory single bit error, and '02' indicates a power recovery log.

【００１９】図５にメモリエラーファイル４００の記録
フォーマットを示す。１件あたりのエラー情報はエラー
発生時刻，エラー発生時刻順に割り付けるエラーＮo.，
メモリシングルビットエラーが発生したメモリアドレス
により構成される。ここでメモリアドレスとは物理アド
レスである。FIG. 5 shows a recording format of the memory error file 400. The error information for one case is the error occurrence time, the error number assigned in the order of the error occurrence time,
The memory is configured by a memory address at which a single bit error has occurred. Here, the memory address is a physical address.

【００２０】図６にメモリ診断プログラムのフローチャ
ートを示す。メモリ監視プログラムは計算機稼働中は常
に起動され、エラーログファイルを参照し（ステップ１
０）、シングルビットエラーの発生をエラー種別により
識別する（ステップ２０）。ステップ２０においてはエ
ラーログファイル中の最新のエラーのエラー種別が‘０
１’以外の場合はステップ１０に戻り、‘０１’の場合
はメモリエラーファイルを参照する（ステップ３０）。FIG. 6 shows a flowchart of the memory diagnosis program. The memory monitoring program is always started while the computer is running, and refers to the error log file (step 1).
0), the occurrence of a single bit error is identified by the error type (step 20). In step 20, the error type of the latest error in the error log file is "0".
If it is not "1", the process returns to Step 10, and if it is "01", the memory error file is referred to (Step 30).

【００２１】次にステップ４０に進み、メモリエラーフ
ァイルの前回エラーログのメモリアドレスと最新のエラ
ーのメモリアドレスを比較し、アドレスが不一致の場合
はステップ６０に進む。ステップ６０では現在のメモリ
アドレス件数と予め設定した規定値と比較する。Next, the process proceeds to step 40, where the memory address of the previous error log of the memory error file is compared with the memory address of the latest error. If the addresses do not match, the process proceeds to step 60. In step 60, the current number of memory addresses is compared with a preset specified value.

【００２２】ここで規定値未満の場合はステップ１２０
に進み、規定値を超えた場合はステップ７０にてメモリ
エラーファイル中の最も古いエラーログと最新のエラー
ログの発生時間差が予め設定した規定時間以内の場合、
主メモリを構成するメモリ素子の制御回路の故障と判断
し、ステップ８０にてアラームを出力する。その後、ス
テップ９０にてメモリエラーファイル内の全エラーログ
を消去する。ステップ７０にて発生時間差が規定時間を
超えている場合はステップ１２０に進み、最新のエラー
ログをメモリエラーファイルに追加する。If the value is less than the specified value, step 120 is executed.
If the difference exceeds the specified value, if the time difference between the oldest error log in the memory error file and the latest error log in the memory error file is within a predetermined time set in step 70,
It is determined that the control circuit of the memory element constituting the main memory has failed, and an alarm is output in step 80. Thereafter, in step 90, all error logs in the memory error file are deleted. If the occurrence time difference exceeds the specified time in step 70, the process proceeds to step 120, and the latest error log is added to the memory error file.

【００２３】ステップ４０にて前回エラーログのメモリ
アドレスと最新のエラーのメモリアドレスが一致した場
合はステップ１００に進み、再度エラーログファイルを
参照し、最新のメモリエラーログ以前に復電ログが無い
か確認する(ステップ１１０)。ステップ１１０にてエラ
ー種別‘０２’を確認した場合は主メモリを構成するメ
モリ素子のメモリアドレスにて特定されるメモリセルの
故障と判断し、ステップ８０に進み、アラームを出力す
る。ステップ１１０にて復電ログが確認できなかった場
合はステップ１２０に進む。If the memory address of the last error log matches the memory address of the latest error at step 40, the process proceeds to step 100, where the error log file is referred to again and there is no power recovery log before the latest memory error log. Is confirmed (step 110). If the error type "02" is confirmed in step 110, it is determined that the memory cell specified by the memory address of the memory element constituting the main memory has failed, and the process proceeds to step 80 to output an alarm. If the power recovery log cannot be confirmed in step 110, the process proceeds to step 120.

【００２４】図７に図６のステップ６０における規定値
を３回、ステップ７０における規定時間を５秒とした実
施例を示す。本図に従い、各ケースにおけるフローを説
明する。FIG. 7 shows an embodiment in which the prescribed value in step 60 of FIG. 6 is set to three times and the prescribed time in step 70 is set to 5 seconds. The flow in each case will be described with reference to FIG.

【００２５】１）図４，図５のケース図４においてＮo.１のエラーログが記録された時点から
の処理はまず、図７のステップ１０においてエラーログ
ファイルを参照し、ステップ２０にてＮo.１のエラーロ
グのエラー種別が‘０１’であることからステップ３０
に進む。ここでメモリエラーファイル（図５）を参照
し、前回ログであるＮo.２と比較する（ステップ４
０）。この結果アドレスが不一致であることからステッ
プ６０に進み、規定値と比較し、３回以上であることか
らステップ７０に進む。ステップ７０ではエラーログＮ
o.１からＮo.３までの発生時間を規定時間である５秒と
比較し、これを超えていることからメモリセルの故障と
は判断せず、ステップ９０に進み、メモリエラーファイ
ルを消去する。1) Cases of FIGS. 4 and 5 In FIG. 4, the processing from the time when the error log of No. 1 is recorded first refers to the error log file in step 10 of FIG. Step 30 because the error type of the error log of .1 is '01'
Proceed to. Here, the memory error file (FIG. 5) is referred to and compared with the previous log No. 2 (step 4).
0). As a result, since the addresses do not match, the routine proceeds to step 60, where the address is compared with the specified value. In step 70, the error log N
The occurrence time from o.1 to No.3 is compared with the specified time of 5 seconds, and since it exceeds this, it is not determined that the memory cell has failed, and the process proceeds to step 90 to erase the memory error file. .

【００２６】２）図４Ａ，図５Ａのケース図５ＡのＮo.１を記録した時点では図７のステップ６０
における規定値３を超えており、またステップ７０にお
いてエラーログＮo.１からＮo.３までの発生時間が規定
時間の５秒以内であることからメモリセル故障と判断
し、ステップ８０に進みアラームを出力する。2) Cases of FIGS. 4A and 5A When No. 1 in FIG. 5A is recorded, step 60 in FIG.
And the occurrence time from error log No. 1 to No. 3 is within 5 seconds of the specified time, it is determined that the memory cell has failed. Output.

【００２７】２）図４Ｂ，図５Ｂのケース図５ＢのＮo.１を記録した時点では図７のステップ４０
において前回ログのＮo.２とアドレスが一致しているた
めステップ１００に進む。ステップ１００でエラーログ
を参照し、図４ＢのＮo.２に復電ログがあるためメモリ
セル故障と判断し、ステップ１１０からステップ８０に
進みアラームを出力する。2) Cases of FIGS. 4B and 5B At the time when No. 1 in FIG. 5B is recorded, step 40 in FIG.
Since the address matches the address of No. 2 in the previous log, the process proceeds to step 100. In step 100, the error log is referred to, and since there is a power recovery log in No. 2 in FIG. 4B, it is determined that the memory cell has failed, and the process proceeds from step 110 to step 80 to output an alarm.

【００２８】[0028]

【発明の効果】本発明によれば、シングルビットエラー
の発生を時系列的に記憶し、復電によっても揮発しない
記憶手段，復電の発生を時系列的に記憶する手段，シン
グルビットエラー及び復電の発生を監視する手段，シン
グルビットエラーが発生したアドレス，過去に発生した
エラーのメモリアドレス，復電発生の有無によりメモリ
のシングルビットエラーが素子故障によるものかα線等
の外的要因によるものかを判断する手段を持ち、ＥＣＣ
訂正したデータを書き込むための付加機構等の特別な機
構を持たない汎用の計算機において、メモリの素子故障
を診断することができる。According to the present invention, storage means for storing the occurrence of a single bit error in a time series and not volatilizing even after power recovery, means for storing the occurrence of a power recovery in a time series, a single bit error and Means for monitoring the occurrence of power recovery, the address where a single-bit error occurred, the memory address of an error that occurred in the past, and whether a single-bit error in the memory was caused by an element failure depending on whether or not power recovery occurred. ECC has a means to determine whether
A general-purpose computer having no special mechanism such as an additional mechanism for writing corrected data can diagnose a memory element failure.

[Brief description of the drawings]

【図１】本発明の計算機の構成を示す図。FIG. 1 is a diagram showing a configuration of a computer according to the present invention.

【図２】システムコントローラ，主メモリの構成を示す
図。FIG. 2 is a diagram showing a configuration of a system controller and a main memory.

【図３】プログラムとデータの構成を示す図。FIG. 3 is a diagram showing a configuration of a program and data.

【図４】エラーログファイルのフォーマットを示す図。FIG. 4 is a diagram showing a format of an error log file.

【図５】メモリエラーファイルのフォーマットを示す
図。FIG. 5 is a view showing a format of a memory error file.

【図６】診断プログラムのフローを示す図。FIG. 6 is a diagram showing a flow of a diagnostic program.

【図７】診断プログラムのフローを示す図。FIG. 7 is a diagram showing a flow of a diagnostic program.

[Explanation of symbols]

１０…プロセッサ、２０…システムコントローラ、３０
…主メモリ、４０…入出力コントローラ、１００…エラ
ー発生監視プログラム、２００…メモリ診断プログラ
ム。10: Processor, 20: System controller, 30
... main memory, 40 ... input / output controller, 100 ... error occurrence monitoring program, 200 ... memory diagnostic program.

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5B018 GA03 GA05 GA07 HA14 HA25 HA31 JA22 KA01 MA01 NA02 QA13 RA01 RA03 RA04 RA11 5B025 AD13 AE09 5B042 GA33 JJ17 JJ31 MA08 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 5B018 GA03 GA05 GA07 HA14 HA25 HA31 JA22 KA01 MA01 NA02 QA13 RA01 RA03 RA04 RA11 5B025 AD13 AE09 5B042 GA33 JJ17 JJ31 MA08

Claims

[Claims]

A processor, a system controller, an input / output controller, a main memory, and a processor bus, a memory bus, and a system bus for connecting the main memory to each other, wherein a single-bit error occurs when reading data from the main memory. ECC (Error Checking and Corr
ecting feature) A computer that corrects only the read data by the correction mechanism and does not correct the data in the main memory, stores the occurrence of single-bit errors in chronological order, and a storage means that does not volatilize even after power recovery. It has means for storing in chronological order, means for monitoring the occurrence of a single-bit error, and means for diagnosing memory. The means for diagnosing memory refers to the memory address recorded by the storage means for single-bit error occurrence. If the memory address of the single-bit error that occurred within the specified time is equal to or greater than the specified number, it is determined that the control circuit in the memory element has failed, a replacement request is made, and the memory of the single-bit error that occurred within the specified time is If the number of addresses is less than the specified number, it is judged that it is due to external factors such as α rays, and no exchange request is made. Diagnostic method of the memory device to be.

2. A computer according to claim 1, wherein a storage means for storing the occurrence of a single-bit error in time series and which does not volatilize even after power recovery, a means for storing the occurrence of power recovery in time series, It has means for monitoring the occurrence of bit errors and power recovery, and means for diagnosing the memory. The means for diagnosing the memory refers to the memory address recorded by the storage means for single bit error occurrence, and
If there is a power recovery between the single bit errors and the memory addresses match, if the memory address of the single bit error that occurred within the specified time is more than the specified number, the specific memory in the memory element A method for diagnosing a memory device, comprising determining that a cell failure has occurred, making a replacement request, and if there is no power recovery, determining that an external factor such as α-rays has occurred, and not making a replacement request.