JPH01292552A - Peripheral system fault processing system for virtual computer system - Google Patents

Peripheral system fault processing system for virtual computer system

Info

Publication number
JPH01292552A
JPH01292552A JP63123154A JP12315488A JPH01292552A JP H01292552 A JPH01292552 A JP H01292552A JP 63123154 A JP63123154 A JP 63123154A JP 12315488 A JP12315488 A JP 12315488A JP H01292552 A JPH01292552 A JP H01292552A
Authority
JP
Japan
Prior art keywords
computer
virtual
virtual machine
real
channel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63123154A
Other languages
Japanese (ja)
Inventor
Naomi Yamamura
山村 直美
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP63123154A priority Critical patent/JPH01292552A/en
Publication of JPH01292552A publication Critical patent/JPH01292552A/en
Pending legal-status Critical Current

Links

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

PURPOSE:To attain the high reliability when a channel fault occurs with a real computer by inhibiting the information of this fault to a virtual computer and recovering an I/O under execution after the fault is processed. CONSTITUTION:A real device 26 of a real computer 1 is allocated as a virtual device 30 of a virtual computer 27. In case a host real channel 22 of the device 26 has a fault while an I/O instruction is carried out to the device 26 working as the device 30, a channel fault occurs at the computer 1. In this case, a real calculation channel fault processing means 2 of the computer 1 works to perform the I/O recovering processes to the real devices 24-26 of the computer 1. When the channel fault occurred at the computer 1 is reported to a virtual computer monitor 3 of the channel 22, a means 4 of the monitor 3 identifies the relevant virtual computer. At the same time, a means 5 discontinues the execution of the I/O instruction and a means 6 extracts the I/O information. Then a means 7 performs the recovery of the I/O instruction to the device 30 and means 8 restarts the execution of the I/O instruction.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は仮想計算機システムに関し、特に仮想計算機動
作中に発生しノこチャネル障害に対応する仮想計算機シ
ステムにおける障害処理方式に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a virtual computer system, and more particularly to a failure handling method in a virtual computer system that deals with a channel failure that occurs during the operation of a virtual computer.

〔従来の技術〕[Conventional technology]

従来、この種の仮想計算機システムにおいて仮想計算機
動作中に実計算機でチャネル障害が発生すると、以下に
述べる処理が行われていた。
Conventionally, in this type of virtual computer system, when a channel failure occurs in a real computer while the virtual computer is operating, the following processing has been performed.

まず、実計算機における実計算機チャネル障害処理、に
よって、実計算機で使用していた装置に対するI10回
復処理が行われた。その後、実計算機チャネル障害処理
から、仮想計算機モニタに対してチャネル障害発生と仮
想装置のI10情報が報告さ、れ、仮想計算機モニタは
、仮想計算機にチャネル障害発生を通知していた。チャ
ネル障害発生通知を受けた仮想計算機では、仮想計算機
チャネル障害処理による仮想装置のI10回復処理が行
われた。この時、仮想計算機モニタが実計算機チャネル
障害処理から得たI10情報を基に、仮想計算機の仮想
計算機チャネル障害処理が、仮想装置のI10回復処理
を行った。
First, I10 recovery processing for the device used in the actual computer was performed by actual computer channel failure processing in the actual computer. Thereafter, the actual computer channel failure process reported the occurrence of a channel failure and the I10 information of the virtual machine to the virtual machine monitor, and the virtual machine monitor notified the virtual machine of the occurrence of the channel failure. In the virtual machine that received the notification of the occurrence of a channel failure, I10 recovery processing of the virtual machine by virtual machine channel failure processing was performed. At this time, the virtual machine channel fault processing of the virtual machine performed the I10 recovery processing of the virtual machine based on the I10 information obtained by the virtual machine monitor from the real computer channel fault processing.

このように、実計算機の実計算機チャネル障害処理と仮
想計¥i、磯モニタ、仮想計算機モニタと仮想計算機の
仮想計算機チャネル障害処理、との間にそれぞれインタ
フェース全役けて、仮想装置のI10回復処理を矛盾な
く行っていた。
In this way, all interfaces are used between the real computer channel failure processing of the real computer and the virtual machine monitor, the virtual machine monitor, and the virtual computer channel failure processing of the virtual machine, and the I10 recovery of the virtual machine is performed. Processing was carried out without any contradiction.

偽り一部 〔発明が解決しようとする課題〕 上述した従来の仮想計算機システムにおいては、仮想計
算機動作中に発生したチャネル障害の回復処理を仮想計
算機が行わなければならず、仮想計算機モニタとインタ
フェースをもつチャネル障害、 処理番有する仮想計算
機でなければ、チャネル障害発生時に仮想計算機の実行
を継続できないという欠点があった。  、 、 〔課題を解決する為の手段〕 “複数の仮想計算機全実現する仮想計算機システム構成
において、仮想計算機の動作中に実装置を仮想装置とし
て割り当て使用中にチャネル障害が発生口だ場客に実計
算機におけるチャネル障害処理を実行する実計算機チャ
ネル障害処理手段′と、4害を受けだ実装置を仮想装置
として使用していた仮想計算機を識別する障害仮想計算
機識別手段と、障害仮想計算機識別手段によって識別さ
れた仮想計算機におけるI10実行処理を中断させる仮
想計算機I10中断手段と、チャネル障害発生時に障害
となった装置のI10情報を取り出す障害I10情報取
シ出し手段と、障害I10情報取り出し手段によって得
た情報を基に仮想計算機で実行していたIloの回復を
行う仮想計算機I10回復手段と、I10回復後当該仮
想計算機におけるI10実行処理を再開させる仮想計□
算機I10再開手段とを備えチャネル障害発生時に仮想
計算機モニタが仮想計算機で実行中だったIloの回復
処理を行って構成される。
False Part [Problem to be Solved by the Invention] In the conventional virtual machine system described above, the virtual machine must perform recovery processing for a channel failure that occurs while the virtual machine is operating, and the virtual machine monitor and interface The drawback is that unless the virtual machine has a channel failure and process number, the virtual machine cannot continue execution when a channel failure occurs. , , [Means for solving the problem] “In a virtual computer system configuration in which multiple virtual computers are fully implemented, it is possible to allocate a real device as a virtual device while the virtual computer is operating and cause a channel failure to occur during use. a real computer channel fault processing means for executing channel fault processing in a real computer; a faulty virtual computer identification means for identifying a virtual computer that was using the affected real device as a virtual device; and a faulty virtual computer identification means. virtual machine I10 interrupting means for interrupting the I10 execution process in the virtual machine identified by; failure I10 information retrieval means for retrieving I10 information of the failed device when a channel failure occurs; A virtual machine I10 recovery means that recovers Ilo that was being executed on the virtual machine based on the information obtained, and a virtual machine that resumes I10 execution processing in the virtual machine after I10 recovery.
A virtual machine monitor is provided with a computer I10 restart means, and is configured so that when a channel failure occurs, a virtual machine monitor performs a recovery process for Ilo that was being executed in the virtual machine.

し実施例〕 次に、本発明について図面を参照して説明子る。Example] Next, the present invention will be explained with reference to the drawings.

第1図は本発明の一実施例の構成を示すブロック図であ
り、第2図は本発明の一実施例のシステム構成の一部を
表す図であシ、第3図は本発明の一実施例の処理動作の
流れ図である。
FIG. 1 is a block diagram showing the configuration of one embodiment of the present invention, FIG. 2 is a diagram showing part of the system configuration of one embodiment of the present invention, and FIG. 3 is a block diagram showing the configuration of one embodiment of the present invention. It is a flow chart of processing operation of an example.

本実施例は、実計算機チャネル障害処理手段2と、仮想
計算機モニタ3とを備え、仮想計算機モニタ3は、障害
仮想計算機識別手段4と、仮想計算機I10中断手段5
と、障害I10情報取シ出し手段6と、仮想計算機I1
0回復手段7と、仮想計算機工10m開手段8とを備え
て構成される。
This embodiment includes a real computer channel failure processing means 2 and a virtual computer monitor 3, and the virtual computer monitor 3 includes a failed virtual computer identification means 4 and a virtual computer I10 interrupting means 5.
, failure I10 information extraction means 6, and virtual computer I1
0 recovery means 7 and virtual computer 10m opening means 8.

次に、本実施例の全体的な動作を具体的に説明−一へ する。Next, the overall operation of this embodiment will be explained in detail. do.

実計算機 1内の実装26を、仮想計算機27内の仮想
装置30として割り当て使用している仮想計算機におい
て、仮想計算機内で、仮想装置30として動作中の実装
置26に対してIloを実行している時、実装置26の
上位の実チャネル22で障害が発生すると、実計算機で
チャネル障害が発生する。この時、実計算機の実計算機
チャネル障害処理手段2が動作し、実計算−で使用して
いる実装置一対するI1010回復処理テップ41)が
行われる。
In a virtual computer in which the implementation 26 in the real computer 1 is allocated and used as the virtual device 30 in the virtual computer 27, Ilo is executed on the real device 26 operating as the virtual device 30 in the virtual computer. When a failure occurs in the real channel 22 of the real device 26, a channel failure occurs in the real computer. At this time, the real computer channel fault processing means 2 of the real computer operates, and the I1010 recovery process step 41) for the real device used in the real calculation is performed.

そして、実チャネル22の配下に仮想計算機で使用して
いる装置があると、仮想計算機モニタに実計算機+チャ
ネル障害が廃止したことが報告される。チャネル障害め
報告を受けた仮想計算機モニタは、障害仮想計算機識別
手段4によって、障害を受けた実装置26を仮想装置3
0として使用している仮想計算機を識別する(ステップ
42)。
If there is a device being used by a virtual computer under the real channel 22, the virtual computer monitor reports that the real computer + channel failure has been abolished. Upon receiving the channel failure report, the virtual machine monitor identifies the failed real device 26 as the virtual machine 3 using the failed virtual machine identifying means 4.
The virtual machine being used as 0 is identified (step 42).

さらに、障害となったIloの回復処理を行うため、該
当する仮想計算機に対して、仮想計算機I10中断手段
5によって仮想計算機における■10命令の実行を中断
しくステップ43)、I10情報格納域に格納されてお
シ、チャネル障害発生時に仮想計算機で実行されていた
実装置26のI10情報を、I10情報取シ出し手段6
によって取シ出す(ステップ44)。取シ出されたI1
0情報を基に、仮想計算機I10回復手段7によって、
仮想計算機で実行されていた仮想装置30に対するIl
oの回復処理全行う(ステップ45)。
Furthermore, in order to perform recovery processing for the failed Ilo, the execution of the ■10 instruction in the virtual machine is interrupted by the virtual machine I10 interrupting means 5 in step 43) and stored in the I10 information storage area. The I10 information of the real device 26 that was being executed on the virtual machine at the time of the channel failure is transferred to the I10 information extraction means 6.
(step 44). I1 taken out
Based on the 0 information, the virtual machine I10 recovery means 7
Il for the virtual machine 30 running on the virtual machine
All recovery processing of o is performed (step 45).

実行中I10回復処理完了後、仮想計算機のI10再開
手段8によって中断されていた仮想計算機におけるIl
oが再開され(ステップ46)、仮想計算機内でのIl
oの実行が続行される。
After the completion of the running I10 recovery process, the Il in the virtual machine that was suspended by the I10 restarting means 8 of the virtual machine
o is resumed (step 46), and Il in the virtual machine is restarted (step 46).
The execution of o continues.

以上のように、仮想計算機モニタによる障害処理によっ
て、仮想計算機で動作するシステムは、実計算機で障害
が発生しても障害eこ対する処理をすることなく、また
、障害のあったことを知る、ことなく、動作を続ける事
が可能である。
As described above, by handling faults using the virtual machine monitor, a system running on a virtual machine can detect a fault without having to take any action to deal with the fault even if a fault occurs in a real computer. It is possible to continue operating without any problems.

〔発明の効果〕〔Effect of the invention〕

以上説明したように、本発明は、仮想計算機力動作中て
実計算機でチャネル障害が発生した場合、実計算機にお
ける障害処理後、仮想計算機モニタが仮想計算機にチャ
ネル障害を通知することなく、障害発生時に仮想計算機
で実行中のI10回復を行うことによって、チャネル障
害を実装しない仮想計算機においても、チャネル障害発
生時に、高い信頼性を実現できるという効果がめる。
As explained above, in the present invention, when a channel failure occurs in a real computer while a virtual computer is operating, the virtual computer monitor does not notify the virtual machine of the channel failure after the failure is processed in the real computer. By performing I10 recovery during execution in a virtual machine, high reliability can be achieved even in a virtual machine that does not implement a channel fault when a channel fault occurs.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例の構成を示すブロック図、第
2図は本発明の一実施例の構成の一部を示すブロック図
、第3図は本発明の一実施例の処理動作の流れ図。 1・・・・・・実計算機、2・・・・・・実計算機チャ
ネル障害処理手段、3・・・・・・仮想針′n機モニタ
、4・・・・・・障害仮想計算機識別手段、訃・・・・
・仮想計算機I10中断手段、6・・・・・・障害I1
0情報取シ出し手段、7・・・・・・仮想計算機I10
回復手段、8・・・・・・仮想計算機I10再開手段、
27・・・・・・仮想計算機。
FIG. 1 is a block diagram showing a configuration of an embodiment of the present invention, FIG. 2 is a block diagram showing a part of the configuration of an embodiment of the present invention, and FIG. 3 is a processing operation of an embodiment of the present invention. flowchart. 1...Real computer, 2...Real computer channel failure processing means, 3...Virtual needle'n machine monitor, 4...Faulty virtual computer identification means , death...
・Virtual computer I10 interrupting means, 6... Failure I1
0 information retrieval means, 7...virtual computer I10
Recovery means, 8...Virtual computer I10 restart means,
27...Virtual computer.

Claims (1)

【特許請求の範囲】[Claims]  複数の仮想計算機を実現する仮想計算機システムにお
いて、仮想計算機の動作中に実計算機の装置(以下、実
装置という)を仮想計算機の装置(以下、仮想装置とい
う)として割り当て使用中にチャネル障害が発生した場
合に実計算機におけるチャネル障害処理を実行する実計
算機チャネル障害処理手段と、障害を受けた実装置を仮
想装置として使用していた仮想計算機を識別する障害仮
想計算機識別手段と、障害仮想計算機識別手段によって
識別された仮想計算機における入出力(以下、I/Oと
いう)実行処理を中断させる仮想計算機I/O中断手段
と、チャネル障害発生時に障害となった装置のI/O情
報を取り出す障害I/O情報取り出し手段と、障害I/
O情報取り出し手段によって得た情報を基に仮想計算機
で実行していたI/Oの回復を行う仮想計算機I/O回
復手段と、I/O回復後当該仮想計算機におけるI/O
実行処理を再開させる仮想計算機I/O再開手段を備え
チャネル障害発生時に仮想計算機モニタが仮想計算機で
実行中だったI/Oの回復処理を行って成ることを特徴
とする仮想計算機システムにおける周辺系障害処理方式
In a virtual machine system that realizes multiple virtual machines, a channel failure occurs while the real machine's device (hereinafter referred to as "real device") is allocated and used as a virtual machine's device (hereinafter referred to as "virtual device") while the virtual machine is operating. a real computer channel fault processing means for executing channel fault processing in a real computer when a fault occurs; a faulty virtual machine identification means for identifying a virtual machine that was using a faulty real device as a virtual device; and a faulty virtual computer identification means. a virtual machine I/O interrupting means for interrupting input/output (hereinafter referred to as I/O) execution processing in a virtual machine identified by the means; and a failure I/O for extracting I/O information of a faulty device when a channel fault occurs. /O information retrieval means and fault I/O
A virtual machine I/O recovery means that recovers the I/O being executed in the virtual machine based on the information obtained by the O information extraction means;
A peripheral system in a virtual computer system, comprising a virtual computer I/O restart means for restarting execution processing, and a virtual computer monitor performing recovery processing for I/O being executed in the virtual machine when a channel failure occurs. Failure handling method.
JP63123154A 1988-05-20 1988-05-20 Peripheral system fault processing system for virtual computer system Pending JPH01292552A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63123154A JPH01292552A (en) 1988-05-20 1988-05-20 Peripheral system fault processing system for virtual computer system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63123154A JPH01292552A (en) 1988-05-20 1988-05-20 Peripheral system fault processing system for virtual computer system

Publications (1)

Publication Number Publication Date
JPH01292552A true JPH01292552A (en) 1989-11-24

Family

ID=14853517

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63123154A Pending JPH01292552A (en) 1988-05-20 1988-05-20 Peripheral system fault processing system for virtual computer system

Country Status (1)

Country Link
JP (1) JPH01292552A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05265783A (en) * 1992-03-19 1993-10-15 Fujitsu Ltd Virtual computer system i/o control system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05265783A (en) * 1992-03-19 1993-10-15 Fujitsu Ltd Virtual computer system i/o control system

Similar Documents

Publication Publication Date Title
CN101377750B (en) System and method for cluster fault toleration
CN102810073B (en) Background real-time monitoring processing method for data
US7734956B2 (en) Process management system
CN104077220A (en) Method and device for debugging microprocessor without interlocked piped stages (MIPS) framework operating system kernel
CN101373450A (en) Method and system for processing CPU abnormity
CN104391777A (en) Cloud platform based on Linux operating system and operation monitoring method and device of cloud platform
JPH02294739A (en) Fault detecting system
JPH01292552A (en) Peripheral system fault processing system for virtual computer system
JPH07311693A (en) Debugging system
CN113297074B (en) Memory tracking method and device
CN115563070A (en) Method, apparatus, device, medium, and program product for collecting exception log
JPS62190543A (en) Control system for quick restoration from trouble of communication system
KR100206472B1 (en) Error manage & recover method of switching system
JP2695552B2 (en) Failure handling method
JPH02310633A (en) Fault restoration processing system for virtual computer system
JP2902769B2 (en) Robot sequence control method
JPH04102930A (en) Central processing unit
JPH02141831A (en) Peripheral system fault processing system in virtual computer system
JP2000172578A (en) Fault processing system and method
JPH0619720B2 (en) Information processing equipment
JPH0520106A (en) System function stop preventing system
JPH08137719A (en) Program test reexecution method
JPH05241886A (en) Operating system build-up type debug support system
JPS63742A (en) Patrol priority control system
JPH0628206A (en) Recovery system for fault in data processing station of cluster system