JPS63157251A - Restoring system for information stored in main memory - Google Patents
Restoring system for information stored in main memoryInfo
- Publication number
- JPS63157251A JPS63157251A JP61307018A JP30701886A JPS63157251A JP S63157251 A JPS63157251 A JP S63157251A JP 61307018 A JP61307018 A JP 61307018A JP 30701886 A JP30701886 A JP 30701886A JP S63157251 A JPS63157251 A JP S63157251A
- Authority
- JP
- Japan
- Prior art keywords
- main memory
- information
- storage device
- external storage
- memory
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000002955 isolation Methods 0.000 claims description 5
- 238000000034 method Methods 0.000 claims description 5
- 230000010365 information processing Effects 0.000 abstract description 8
- 230000002950 deficient Effects 0.000 abstract description 2
- 238000010408 sweeping Methods 0.000 abstract 1
- 230000000694 effects Effects 0.000 description 3
- 230000006378 damage Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
Landscapes
- Debugging And Monitoring (AREA)
- Techniques For Improving Reliability Of Storages (AREA)
Abstract
Description
【発明の詳細な説明】
〔産業上の利用分野〕
本発明は情報処理システムにおける障害処理方式に関す
る。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a fault handling method in an information processing system.
従来、主記憶装置の障害が発生すると、障害部位以外の
主記憶装置の内容が読める状態であっても無視してシス
テムの立上げが再度行なわれていた。Conventionally, when a failure occurs in a main memory, the system is restarted, ignoring the contents even if the contents of the main memory other than the failed part are readable.
上述したように、一度主記憶装置の障害が発生すると、
障害部位以外の主記憶装置の内容が読める状態であって
も無視して再度システムの立上げが行なわれていたので
、ファイルの更新情報等のように継続性が要求される情
報が無効となり、システム運用の重大な障害となってい
た。As mentioned above, once a main storage failure occurs,
Even if the contents of the main memory other than the faulty part were readable, they were ignored and the system was restarted, so information that required continuity, such as file update information, became invalid. This was a serious obstacle to system operation.
本発明の主記憶内情報復元方式は、主記憶装置障害時、
主記憶装置へアクセスし障害部位を切分ける主記憶障害
部位切分は手段と、前記障害部位以外の正常な主記憶装
置の内容を外部記憶装置へ書込む外部記憶装置書込手段
と、障害部位の主記憶が切離された後、オペレーティン
グシステムを再度立とげるO3立上げ手段と、オペレー
ティングシステム立上げ後前記外部記憶装置から主記憶
装置へ情報を転送する外部記憶装置読出手段とを有して
いる。The method for restoring information in main memory of the present invention is such that when a main memory failure occurs,
a main memory failure part isolating means for accessing the main memory and isolating the faulty part; an external storage writing means for writing the contents of the normal main memory other than the faulty part to the external storage; an O3 startup means for restarting the operating system after the main memory of the operating system is disconnected; and an external storage device reading means for transferring information from the external storage device to the main storage device after the operating system is started up. There is.
したがって、主記憶装置障害時でもシステム運用の連続
性に不可欠な情報の消失を回避することができる。Therefore, even in the event of a failure of the main storage device, it is possible to avoid loss of information essential for continuity of system operation.
次に、本発明の実施例について図面を参照して説明する
。Next, embodiments of the present invention will be described with reference to the drawings.
第1図は本発明の主記憶内情報復元方式が適用された情
報処理システムの一実施例のブロック図である。FIG. 1 is a block diagram of an embodiment of an information processing system to which the main memory information restoration method of the present invention is applied.
この情報処理システムは、中央処理装置1と、主記憶装
置2と、処理袋W3と、外部記憶装置4から構成されて
いる。This information processing system includes a central processing unit 1, a main storage device 2, a processing bag W3, and an external storage device 4.
中央処理装置l内には外部記憶装置読出手段5が含まれ
、処理装置3内にはoS立上げ手段6、主記憶障害部位
切分は手段7、外部記憶装置書込手段8が含まれている
。The central processing unit 1 includes an external storage device reading means 5, the processing device 3 includes an oS startup means 6, a main memory failure part isolation means 7, and an external storage device writing means 8. There is.
情報処理システム運用中、主記憶装置2に障害が発生し
た場合、処理装置3内の主記憶障害部位切分は手段7を
使って次のように障害部位の切分けを行なう。まず、O
S立上げ以前に、主記憶装置構成単位(バンクまたはメ
モリユニー/))20毎に主記憶装置2の正常動作確認
用として1ページ(1024バイト)程度の正常性チェ
ック用エリア21を前もって確保しておく、なお、この
チェック用エリア21はチェック専用エリアとするため
、OSからはアク゛セス出来ないようにO3立上げ手段
6がOSへ使用可能主記憶範囲を通知するとき正常性チ
ェック用エリア21のアドレスを除外しておき、かつ主
記憶障害部位切分は手段7により特殊なデータパターン
を正常性チェック用エリア21へ書込んでおく。運用中
主記憶装置2の障害が発生するとエラー発生通知が主記
憶装置1より処理装置3へ通知され、処理装置3内の主
記憶障害部位切分は手段7が起動される。主記憶障害部
位切分は手段7は、主記憶装置構成単位20内の各正常
性チェック用エリア21の内容を読出し、前もって書込
んだデータパターンが正しく読出されるかチェック後同
じデータパターンを再度書込みその後読出しチェックす
る。もし、データの比較エラーまたは書込みあるいは読
出しでエラーが発生した場合は、チェー2り対象を含む
主記憶装置構成単位20が不良と判定し主記憶装置2よ
り切離す。When a fault occurs in the main memory device 2 during operation of the information processing system, the faulty part of the main memory in the processing device 3 is isolated using means 7 as follows. First, O
Before starting S, a normality check area 21 of about 1 page (1024 bytes) is secured in advance for checking the normal operation of the main memory 2 for each 20 main memory unit (bank or memory unit). Note that this check area 21 is a check-only area, so when the O3 startup means 6 notifies the OS of the usable main memory range, the normality check area 21 is not accessed from the OS. Addresses are excluded, and a special data pattern is written in the normality check area 21 by the means 7 for main memory failure site isolation. When a fault occurs in the main storage device 2 during operation, an error occurrence notification is sent from the main storage device 1 to the processing device 3, and a means 7 for isolating the faulty part of the main memory in the processing device 3 is activated. The main memory failure site isolation means 7 reads the contents of each normality check area 21 in the main memory unit 20, checks whether the previously written data pattern is read correctly, and then writes the same data pattern again. After writing, read and check. If a data comparison error or an error occurs in writing or reading, the main storage unit 20 containing the target to be checked is determined to be defective and is separated from the main storage unit 2.
正常と判定した主記憶部分に対しては、処理装置3内の
外部記憶装置書込手段8を使って動作が正常と判定され
た主記憶の内容を読出した後外部記七〇装置4へ書込む
、なお、外部記憶装M4が磁気ディスクの場合はOSが
立上り完了後処理装置3の外部記憶装置書込手段8に主
記憶内容の退避位置をあらかじめ通知しておく、退避位
置としては磁気ディスク内の固定エリアとするか、また
は固定エリアには退避エリアのポインタ値のみを入れ退
避エリアは可変位置に確保してもよい、ただし、外部記
憶装置書込手段8によって誤まった磁気ディスクアドレ
スへの書込みによるデータ破壊を避けるため主記憶情報
退避用エリアの先頭数ワードには、エリアの妥当性チェ
ック用としてO3と処理装置3との間で前もって取決め
である識別コードをOSが書込んでおく、外部記憶装置
書込手段8は主記憶情報退避前に、この識別コードを読
出してエリアの妥当性チェックを行ない、不用意なデー
タ破壊を防ぐ、主記憶情報退避完了後、退避エリアの先
頭ワードに情報書込みフラグを書込んだ後、OS立上げ
手段6により主記憶を再構成後OSの再立上げを行なう
。O3は自身が立上がると外部記憶装置4の主記憶情報
退避用エリアの先頭ワードにある書込みフラグにより情
報が退避されているかどうかチェックする。もし、情報
が退避されていれば、退避主記憶情報をOSの外部記憶
装置読出手段5によって主記憶装置2へ読出すことによ
り、主記憶障害発生時の場合でも正常に読出せる主記憶
部分についてはO3の再立上げ後でも引継ぎ可能とする
。For the main memory portion determined to be normal, the contents of the main memory whose operation has been determined to be normal are read out using the external storage device writing means 8 in the processing device 3, and then written to the external storage device 4. In addition, when the external storage device M4 is a magnetic disk, the external storage device writing means 8 of the processing device 3 is notified in advance of the evacuation position of the main memory contents after the OS has completed startup. Alternatively, the fixed area may contain only the pointer value of the evacuation area and the evacuation area may be secured at a variable position. However, if the external storage device writing means 8 writes to a wrong magnetic disk address, In order to avoid data destruction due to writing, the OS writes an identification code agreed upon in advance between the O3 and the processing unit 3 in the first few words of the main memory information saving area for checking the validity of the area. , the external storage device writing means 8 reads this identification code and checks the validity of the area before saving the main memory information to prevent accidental data destruction.After the main memory information is saved, the first word of the save area is read. After writing the information write flag to, the OS startup means 6 reconfigures the main memory and then restarts the OS. When O3 starts up, it checks whether information has been saved using the write flag in the first word of the main memory information saving area of the external storage device 4. If the information has been saved, the part of the main memory that can be read normally even in the event of a main memory failure by reading the saved main memory information to the main memory device 2 by the external storage device reading means 5 of the OS. can be taken over even after restarting O3.
以上説明したように本発明は、情報処理システム運用中
主記憶装置の障害が発生した場合にも障害部位の切分け
を行ない、正常に読出せる主記憶の内容を一時外部記憶
装置に掃出しておき、再度O3立上げ時外部記憶装置よ
り情報を主記憶上に復元することにより、システム運用
に当って特に継続性が強く要求される情報の引継ぎが容
易に出来るという効果がある。As explained above, even if a failure occurs in the main memory during operation of an information processing system, the present invention isolates the faulty part and temporarily flushes out the contents of the main memory that can be read normally to an external storage. By restoring information from the external storage device to the main memory when the O3 is started up again, there is an effect that it is possible to easily take over information that particularly requires continuity in system operation.
第1図は本発明の主記憶情報復元方式が運用された情報
処理システムの一実施例のブロック図である。
l ・・・ 中央処理装置、2 ・・・ 主記憶装置、
3 ・・・ 処理装置、4 ・・・ 外部記憶装置、5
・・・ 外部記憶装置読出手段、
6 ・・・ oS立上げ手段、
7 ・・・ 主記憶障害部位切分は手段。
8 ・・・ 外部記憶装置書込手段、
20 ・・・ 主記憶装置構成単位、21 ・・・
正常性チェック用エリア。
第1rXJFIG. 1 is a block diagram of an embodiment of an information processing system in which the main memory information restoration method of the present invention is applied. l... central processing unit, 2... main storage device,
3... Processing device, 4... External storage device, 5
... External storage device reading means, 6 ... oS startup means, 7 ... Main memory failure part isolation means. 8... External storage device writing means, 20... Main storage device configuration unit, 21...
Area for health check. 1st rXJ
Claims (1)
切分ける主記憶障害部位切分け手段と、前記障害部位以
外の正常な主記憶装置の内容を外部記憶装置へ書込む外
部記憶装置書込手段と、障害部位の主記憶が切離された
後、オペレーティングシステムを再度立上げるOS立上
げ手段と、 オペレーティングシステム立上げ後前記外部記憶装置よ
り主記憶装置へ情報を転送する外部記憶装置読出手段と
を有する主記憶内情報復元方式。[Scope of Claims] Main memory fault isolation means that accesses the main memory and isolates the faulty part when the main storage fails, and writes the normal contents of the main storage other than the faulty part to an external storage. an external storage device writing means for re-starting the operating system after the failed main memory has been disconnected; A method for restoring information in main memory, comprising an external storage device reading means.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61307018A JPS63157251A (en) | 1986-12-22 | 1986-12-22 | Restoring system for information stored in main memory |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61307018A JPS63157251A (en) | 1986-12-22 | 1986-12-22 | Restoring system for information stored in main memory |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS63157251A true JPS63157251A (en) | 1988-06-30 |
Family
ID=17964032
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP61307018A Pending JPS63157251A (en) | 1986-12-22 | 1986-12-22 | Restoring system for information stored in main memory |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS63157251A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5933298A (en) * | 1995-03-24 | 1999-08-03 | U.S. Philips Corporation | System comprising a magnetic head, measuring device and a current device |
-
1986
- 1986-12-22 JP JP61307018A patent/JPS63157251A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5933298A (en) * | 1995-03-24 | 1999-08-03 | U.S. Philips Corporation | System comprising a magnetic head, measuring device and a current device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6622263B1 (en) | Method and apparatus for achieving system-directed checkpointing without specialized hardware assistance | |
JP2790034B2 (en) | Non-operational memory update method | |
JPS5913783B2 (en) | Duplicate file method | |
JPS63157251A (en) | Restoring system for information stored in main memory | |
JP2513060B2 (en) | Failure recovery type computer | |
JPH10133926A (en) | Mirror disk restoring method and restoring system | |
JPH0690683B2 (en) | Fault handling method for multiprocessor system | |
JPH05233466A (en) | Fault recovery system of doubled auxiliary storage device | |
JPH0341538A (en) | Main storage device | |
JP2527964B2 (en) | Backup system initial startup control method | |
JPS6027953A (en) | Check point processing system | |
JPH07141120A (en) | Processing method for fault in information storage medium | |
KR100249809B1 (en) | A continuous memory backup apparatus and method | |
JP2526726B2 (en) | Multiplexed file recovery method | |
JP3340284B2 (en) | Redundant system | |
JPH02118745A (en) | Memory back-up device | |
JPH04273516A (en) | Magnetic disk device | |
JPH07287694A (en) | Multiplex processing system and memory synchronous control method | |
JPH0552538B2 (en) | ||
KR100426943B1 (en) | How to handle operating system errors in the redundant system of exchanges | |
JPS6143739B2 (en) | ||
JPS6130297B2 (en) | ||
JPS6130296B2 (en) | ||
JPH07248929A (en) | Host device and restart system using the same | |
JPH02194444A (en) | Restarting device for information processor |