JPS63157251A - Restoring system for information stored in main memory - Google Patents

Restoring system for information stored in main memory

Info

Publication number
JPS63157251A
Authority
JP
Japan
Prior art keywords
main memory
information
storage device
external storage
memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61307018A
Other languages
Japanese (ja)
Inventor
Mitsue Iwamoto
岩本 光恵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1986-12-22
Filing date
1986-12-22
Publication date
1988-06-30
Application filed by NEC Corp
Priority to JP61307018A
Publication of JPS63157251A

Landscapes

  • Debugging And Monitoring (AREA)
  • Techniques For Improving Reliability Of Storages (AREA)

Abstract

PURPOSE: To allow an information processing system to continue operating even when main memory fails during operation, by isolating the faulty area, saving the contents of the normal areas to external memory, and restoring those contents to main memory after the system is restarted. CONSTITUTION: When main memory 2 fails, main memory fault isolation means 7 checks the normality check area 21 in each main memory unit 20 by reading and writing it. If an error is detected, that unit 20 is judged defective, and the contents of the other, normal units are written to external memory 4. When the main memory information has been saved, OS startup means 6 restarts the system. External memory reading means 5 then reads the saved information back into main memory 2. Continuity of operation is thus preserved across the restart.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to a fault handling method in an information processing system.

[Prior Art]

Conventionally, when a fault occurred in main memory, the system was simply started up again, and the contents of main memory outside the faulty part were disregarded even if they were still readable.

[Problem to Be Solved by the Invention]

As described above, once a main memory fault occurred, the system was started up again and the contents of main memory outside the faulty part were disregarded even if they were still readable. Information that requires continuity, such as file update information, was therefore invalidated, which was a serious obstacle to system operation.

[Means for Solving the Problem]

The method for restoring information in main memory according to the present invention comprises: main memory fault isolation means that, when a main memory fault occurs, accesses the main memory and isolates the faulty part; external storage writing means that writes the contents of the normal main memory other than the faulty part to an external storage device; OS startup means that restarts the operating system after the faulty part of main memory has been detached; and external storage reading means that transfers information from the external storage device to main memory after the operating system has started.
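The order in which the four means act can be sketched as a short simulation. This is only an illustrative sketch: the names `Unit`, `System`, `recover`, and the four step functions are hypothetical and do not appear in the patent, and the fault is modeled as a simple flag rather than detected by a real memory check.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Unit:
    """One main memory unit (hypothetical model)."""
    data: bytes
    faulty: bool = False


@dataclass
class System:
    units: Dict[int, Unit]
    external: Dict[int, bytes] = field(default_factory=dict)  # external storage
    steps: List[str] = field(default_factory=list)            # order of actions


def isolate_faulty_units(system: System) -> List[int]:
    """Fault isolation: detach faulty units, return ids of the normal ones."""
    bad = [uid for uid, unit in system.units.items() if unit.faulty]
    for uid in bad:
        del system.units[uid]
    system.steps.append("isolate")
    return sorted(system.units)


def write_to_external(system: System, normal_ids: List[int]) -> None:
    """External storage writing: save the contents of the normal units."""
    for uid in normal_ids:
        system.external[uid] = system.units[uid].data
    system.steps.append("save")


def restart_os(system: System) -> None:
    """OS startup: memory contents are assumed lost across the restart."""
    for unit in system.units.values():
        unit.data = bytes(len(unit.data))
    system.steps.append("restart")


def read_from_external(system: System) -> None:
    """External storage reading: copy the saved contents back."""
    for uid, saved in system.external.items():
        if uid in system.units:
            system.units[uid].data = saved
    system.steps.append("restore")


def recover(system: System) -> None:
    """Run the four steps in the order given in the text."""
    normal = isolate_faulty_units(system)
    write_to_external(system, normal)
    restart_os(system)
    read_from_external(system)
```

Calling `recover` on a system with one faulty unit leaves the remaining units holding the data they held before the restart, while the faulty unit is neither saved nor restored.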

[Operation]

Consequently, even when a main memory fault occurs, the loss of information essential to the continuity of system operation can be avoided.

[Embodiment]

Next, an embodiment of the present invention will be described with reference to the drawing.

FIG. 1 is a block diagram of an embodiment of an information processing system to which the main memory information restoration method of the present invention is applied.

This information processing system consists of a central processing unit 1, a main memory 2, a processing device 3, and an external storage device 4.

Central processing unit 1 contains external storage reading means 5. Processing device 3 contains OS startup means 6, main memory fault isolation means 7, and external storage writing means 8.

When a fault occurs in main memory 2 while the information processing system is in operation, main memory fault isolation means 7 in processing device 3 isolates the faulty part as follows. First, before the OS is started, a normality check area 21 of about one page (1024 bytes) is reserved in each main memory unit (bank or memory unit) 20 for confirming that main memory 2 is operating normally. Because check area 21 is dedicated to checking, it is made inaccessible to the OS: when OS startup means 6 notifies the OS of the usable main memory range, it excludes the addresses of the normality check areas 21. In addition, main memory fault isolation means 7 writes a special data pattern into each normality check area 21.

When a fault occurs in main memory 2 during operation, an error notification is sent from main memory 2 to processing device 3, and main memory fault isolation means 7 in processing device 3 is activated. Means 7 reads the contents of the normality check area 21 in each main memory unit 20 and checks that the previously written data pattern is read back correctly. It then writes the same data pattern again and reads it back once more to check it. If a data comparison error occurs, or if an error occurs during writing or reading, the main memory unit 20 containing the checked area is judged defective and is detached from main memory 2.
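The check described above (read the pre-written pattern, rewrite it, read it back) can be sketched with a simulated memory unit. Everything here is an assumption made for illustration: the class `MemoryUnit`, the pattern bytes `0xA5 0x5A`, the placement of the check area at offset 0 of each unit, and the way a fault is injected. The patent only specifies a "special data pattern" and an area of about 1024 bytes.

```python
PAGE = 1024  # size of one check area, per the text ("about one page")
PATTERN = bytes([0xA5, 0x5A]) * (PAGE // 2)  # assumed pattern, not from the patent


class MemoryUnit:
    """Simulated main memory unit; the check area sits at offset 0 (assumed)."""

    def __init__(self, size: int) -> None:
        self.cells = bytearray(size)
        self.stuck_at = None   # offset of a simulated faulty byte, if any
        self.detached = False

    def inject_fault(self, offset: int) -> None:
        """Simulate a cell that always reads zero from now on."""
        self.stuck_at = offset
        self.cells[offset] = 0x00

    def write(self, offset: int, data: bytes) -> None:
        self.cells[offset:offset + len(data)] = data
        if self.stuck_at is not None:
            self.cells[self.stuck_at] = 0x00  # the faulty cell ignores writes

    def read(self, offset: int, length: int) -> bytes:
        return bytes(self.cells[offset:offset + length])


def prepare_check_area(unit: MemoryUnit) -> None:
    """Done before OS startup: write the pattern into the check area."""
    unit.write(0, PATTERN)


def usable_ranges(units):
    """Address ranges reported to the OS, with every check area excluded."""
    ranges, base = [], 0
    for unit in units:
        ranges.append((base + PAGE, base + len(unit.cells)))
        base += len(unit.cells)
    return ranges


def unit_is_normal(unit: MemoryUnit) -> bool:
    """Read check, then rewrite and read back, as in the text."""
    if unit.read(0, PAGE) != PATTERN:
        return False
    unit.write(0, PATTERN)
    return unit.read(0, PAGE) == PATTERN


def isolate(units):
    """Detach every unit whose check fails; return the normal units."""
    normal = []
    for unit in units:
        if unit_is_normal(unit):
            normal.append(unit)
        else:
            unit.detached = True
    return normal
```

In this sketch a fault is only noticed if it affects the check area, because that is the only region the procedure in the text reads and writes.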

For the portion of main memory judged normal, external storage writing means 8 in processing device 3 reads out the contents and writes them to external storage device 4. When external storage device 4 is a magnetic disk, the OS, once its startup is complete, notifies external storage writing means 8 of processing device 3 in advance of the location to which the main memory contents are to be saved. The save location may be a fixed area on the magnetic disk. Alternatively, the fixed area may hold only a pointer to the save area, with the save area itself allocated at a variable location. To prevent data from being destroyed if external storage writing means 8 writes to a wrong magnetic disk address, the OS writes an identification code, agreed on in advance between the OS and processing device 3, into the first several words of the main memory save area so that the validity of the area can be checked. Before saving the main memory information, external storage writing means 8 reads this identification code and checks the validity of the area, which prevents inadvertent data destruction.

When the main memory information has been saved, an information-written flag is written to the first word of the save area, and OS startup means 6 then reconfigures main memory and restarts the OS. Once the OS has started, it checks the written flag in the first word of the main memory save area on external storage device 4 to determine whether information has been saved. If it has, external storage reading means 5 of the OS reads the saved information back into main memory 2. In this way, even when a main memory fault occurs, the portions of main memory that can still be read normally can be carried over after the OS is restarted.
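The save-area protocol in the two paragraphs above can be sketched as follows, with the magnetic disk modeled as a `bytearray`. The layout is an assumption made for this sketch: a 4-byte flag word, then an 8-byte identification code, then a 4-byte length and the saved image. The text only says that the flag is in the first word and the code is in the first several words. The code value `MSAVAREA` and all function names are hypothetical.

```python
import struct

WORD = 4                          # assumed word size in bytes
ID_CODE = b"MSAVAREA"             # hypothetical agreed identification code
FLAG_EMPTY, FLAG_WRITTEN = 0, 1   # assumed flag values
HEADER = WORD + len(ID_CODE)      # flag word followed by the code


def format_save_area(disk: bytearray, start: int) -> None:
    """OS side: mark the save area with the agreed code and clear the flag."""
    disk[start:start + HEADER] = struct.pack(">I", FLAG_EMPTY) + ID_CODE


def save_memory(disk: bytearray, start: int, image: bytes) -> bool:
    """Writing means: validate the area, write the image, set the flag last."""
    if bytes(disk[start + WORD:start + HEADER]) != ID_CODE:
        return False              # not a save area: refuse to overwrite it
    body = start + HEADER
    if body + 4 + len(image) > len(disk):
        return False              # image would not fit on the disk
    disk[body:body + 4] = struct.pack(">I", len(image))
    disk[body + 4:body + 4 + len(image)] = image
    disk[start:start + WORD] = struct.pack(">I", FLAG_WRITTEN)
    return True


def restore_memory(disk: bytearray, start: int):
    """OS side after restart: return the saved image, or None if none exists."""
    (flag,) = struct.unpack(">I", disk[start:start + WORD])
    if flag != FLAG_WRITTEN:
        return None
    body = start + HEADER
    (length,) = struct.unpack(">I", disk[body:body + 4])
    return bytes(disk[body + 4:body + 4 + length])
```

Setting the flag only after the image has been written means that a save interrupted partway through leaves the flag clear, so the restarted OS does not restore a partial image. That ordering follows the text. The variant in which a fixed area holds only a pointer to a variably located save area is not modeled here.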

[Effect of the Invention]

As described above, when a main memory fault occurs during operation of an information processing system, the present invention isolates the faulty part, temporarily writes out to an external storage device the main memory contents that can still be read normally, and restores that information to main memory from the external storage device when the OS is started up again. This makes it easy to carry over information for which continuity is especially important to system operation.

[Brief Description of the Drawing]

FIG. 1 is a block diagram of an embodiment of an information processing system to which the main memory information restoration method of the present invention is applied.

1 ... central processing unit
2 ... main memory
3 ... processing device
4 ... external storage device
5 ... external storage reading means
6 ... OS startup means
7 ... main memory fault isolation means
8 ... external storage writing means
20 ... main memory unit
21 ... normality check area

Claims (1)

[Scope of Claims]

A method for restoring information in main memory, comprising: main memory fault isolation means that, when a main memory fault occurs, accesses the main memory and isolates the faulty part; external storage writing means that writes the contents of the normal main memory other than the faulty part to an external storage device; OS startup means that restarts the operating system after the faulty part of main memory has been detached; and external storage reading means that transfers information from the external storage device to main memory after the operating system has started.
JP61307018A 1986-12-22 1986-12-22 Restoring system for information stored in main memory Pending JPS63157251A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61307018A JPS63157251A (en) 1986-12-22 1986-12-22 Restoring system for information stored in main memory

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61307018A JPS63157251A (en) 1986-12-22 1986-12-22 Restoring system for information stored in main memory

Publications (1)

Publication Number Publication Date
JPS63157251A true JPS63157251A (en) 1988-06-30

Family

ID=17964032

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61307018A Pending JPS63157251A (en) 1986-12-22 1986-12-22 Restoring system for information stored in main memory

Country Status (1)

Country Link
JP (1) JPS63157251A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5933298A (en) * 1995-03-24 1999-08-03 U.S. Philips Corporation System comprising a magnetic head, measuring device and a current device

Similar Documents

Publication Publication Date Title
US6622263B1 (en) Method and apparatus for achieving system-directed checkpointing without specialized hardware assistance
JP2790034B2 (en) Non-operational memory update method
JPS5913783B2 (en) Duplicate file method
JPS63157251A (en) Restoring system for information stored in main memory
JP2513060B2 (en) Failure recovery type computer
JPH10133926A (en) Mirror disk restoring method and restoring system
JPH0690683B2 (en) Fault handling method for multiprocessor system
JPH05233466A (en) Fault recovery system of doubled auxiliary storage device
JPH0341538A (en) Main storage device
JP2527964B2 (en) Backup system initial startup control method
JPS6027953A (en) Check point processing system
JPH07141120A (en) Processing method for fault in information storage medium
KR100249809B1 (en) A continuous memory backup apparatus and method
JP2526726B2 (en) Multiplexed file recovery method
JP3340284B2 (en) Redundant system
JPH02118745A (en) Memory back-up device
JPH04273516A (en) Magnetic disk device
JPH07287694A (en) Multiplex processing system and memory synchronous control method
JPH0552538B2 (en)
KR100426943B1 (en) How to handle operating system errors in the redundant system of exchanges
JPS6143739B2 (en)
JPS6130297B2 (en)
JPS6130296B2 (en)
JPH07248929A (en) Host device and restart system using the same
JPH02194444A (en) Restarting device for information processor