WO2017054626A1

WO2017054626A1 - Fault recovery method and device for virtual machine

Info

Publication number: WO2017054626A1
Application number: PCT/CN2016/098341
Authority: WO
Inventors: 谢军勇; 阳代平
Original assignee: 华为技术有限公司
Priority date: 2015-09-30
Filing date: 2016-09-07
Publication date: 2017-04-06
Also published as: CN105204955B; CN105204955A

Abstract

A fault recovery method and device for a virtual machine, the method comprising: when it is detected that an original virtual machine has failed, using an image template to create a new virtual machine (101); setting the MAC address of the original virtual machine to the MAC address of the new virtual machine, and mounting the data disk of the original virtual machine to the new virtual machine (102); and starting the new virtual machine (103), wherein when the data disk in the new virtual machine is partitioned, the reservation area where the service data in the data disk is located is not formatted. The method can be used to troubleshoot a VM under no DHCP service scenario.

Description

Virtual machine fault repairing method and device

The present application claims priority to Chinese Patent Application No. 201510638436.8, entitled "A Virtual Machine Fault Repair Method and Apparatus", filed on September 30, 2015, the entire contents of which are incorporated herein by reference. In the application.

Technical field

The present invention relates to the field of virtual technologies, and in particular, to a virtual machine fault repair method and apparatus.

Background technique

With the development of virtual technologies, virtual machines (VMs) are becoming more and more widely used. However, the VM may experience system disk failure during the running process. For example, the VM is faulty due to the guest operating system (Guest OS). And when the VM system disk fails, the VM cannot work normally, resulting in business damage. The current solution for handling VM failures is mainly through the following methods:

After the VM fails, the VM is set to boot from the network and apply for IP from the Dynamic Host Configuration Protocol (DHCP) server. The VM uses the applied IP connection to download the Trivial File Transfer Protocol (TFTP) server. Micro OS (VM); VM starts from micro OS and reinstalls the operating system (Operation System, OS); after installing the production OS, restart the VM, you can continue to install the app. Through the above steps, the entire VM can be reinstalled, and the service is automatically restored to recover the VM failure.

However, in some actual production environments (such as cloud built by operators), based on security considerations, the IP of the VM is required to be allocated by the cloud's infrastructure layer. In order not to cause conflicts, the tenant's VM cannot open the DHCP service. Therefore, the tenant's VM can no longer rely on the DHCP service for VM failure recovery. It can be seen that the VM fault repair method cannot repair the VM in the DHCP-free service scenario.

Summary of the invention

The embodiment of the invention provides a virtual machine fault repairing method and device, which can perform fault repair on a VM in a DHCP-free service scenario.

In a first aspect, an embodiment of the present invention provides a virtual machine fault repairing method, including:

Create a new virtual machine using the mirror template when it detects that the original virtual machine has failed.

Setting a Media Access Control (MAC) address of the original virtual machine to a MAC address of the new virtual machine, and mounting the data disk of the original virtual machine to the new virtual machine;

The new virtual machine is started, wherein when the new virtual machine partitions the data disk, the reserved area where the service data in the data disk is located is not formatted.

In a first possible implementation manner of the first aspect, the method further includes:

When it is detected that the original virtual machine and the high availability (HA) arbitration module stop transmitting the message packet exceed the first time threshold, reset the original virtual machine, and detect that after the original virtual machine is reset Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds a second time threshold, and if yes, determining that the original virtual machine is faulty.

With the first aspect or the first possible implementation of the first aspect, in a second possible implementation of the first aspect, the creating a new virtual machine by using the image template of the source virtual machine includes:

Use a mirroring template to create a new virtual machine that includes only system disks.

In conjunction with the first aspect, or the first possible implementation of the first aspect, in a third possible implementation manner of the first aspect, the data disk includes a partition table, where the data is indicated in the partition table. a reserved area where the service data of the disk is located. When the new virtual machine partitions the data disk, according to the partition table, the reserved area where the service data in the data disk is located is not formatted. The area of the data disk other than the reserved area where the service data is located is formatted.

In conjunction with the first aspect or the first possible implementation of the first aspect, in a fourth possible implementation manner of the first aspect, the data disk of the original virtual machine is mounted to the new virtual machine ,include:

Unloading the data disk of the original virtual machine, and mounting the unloaded data disk to the new virtual machine;

The method further includes:

Delete the original virtual machine.

In a second aspect, an embodiment of the present invention provides a virtual machine fault repair apparatus, including: a creating unit, a setting unit, and a starting unit, where:

The creating unit is configured to create a new virtual machine by using a mirror template when detecting that the original virtual machine is faulty;

The setting unit is configured to set a MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual machine;

The startup unit is configured to start the new virtual machine, where the new virtual machine does not format a reserved area of the service data in the data disk when the data disk is partitioned.

In a first possible implementation manner of the second aspect, the device further includes:

a detecting unit, configured to: when detecting that the original virtual machine and the high availability HA arbitration module stop transmitting the message packet exceed a first time threshold, reset the original virtual machine, and detect that after the original virtual machine is reset Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds a second time threshold, and if yes, determining that the original virtual machine is faulty.

In conjunction with the second aspect or the first possible implementation of the second aspect, in a second possible implementation of the second aspect, the creating unit is configured to create a new virtual machine that includes only the system disk by using the image template.

With reference to the second aspect, or the first possible implementation manner of the second aspect, in a third possible implementation manner of the second aspect, the data disk includes a partition table, where the data is indicated in the partition table a reserved area where the service data of the disk is located, when the new virtual machine partitions the data disk, according to the partition table, the service data in the data disk is not The reserved area is formatted to format an area of the data disk other than the reserved area in which the service data is located.

In conjunction with the second aspect, or the first possible implementation of the second aspect, in a fourth possible implementation manner of the second aspect, the setting unit is further configured to uninstall the data disk of the original virtual machine, and Mounting the unloaded data disk to the new virtual machine;

The device also includes:

A delete unit is used to delete the original virtual machine.

In the foregoing technical solution, when detecting that the original virtual machine fails, the mirroring template is used to create a new virtual machine; setting the media access control MAC address of the original virtual machine to the MAC address of the new virtual machine, and The data disk of the original virtual machine is mounted to the new virtual machine; the new virtual machine is started, wherein when the new virtual machine partitions the data disk, the service data in the data disk is not located. The reserved area is formatted. In this way, in the repair process, the faulty virtual machine is prevented from being repaired by the DHCP network, so that the VM in the DHCP-free service scenario can be repaired.

DRAWINGS

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any creative work.

1 is a schematic flowchart of a virtual machine fault repairing method according to an embodiment of the present invention;

2 is a schematic flowchart of another virtual machine fault repairing method according to an embodiment of the present invention;

3 is a schematic diagram of another virtual machine fault repairing method according to an embodiment of the present invention;

4 is a schematic structural diagram of a virtual machine fault repairing apparatus according to an embodiment of the present invention;

FIG. 5 is a schematic structural diagram of another virtual machine fault repair apparatus according to an embodiment of the present invention; FIG.

FIG. 6 is a schematic structural diagram of another virtual machine fault repair apparatus according to an embodiment of the present invention; FIG.

FIG. 7 is a schematic structural diagram of another virtual machine fault repair apparatus according to an embodiment of the present invention.

detailed description

The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

Referring to FIG. 1 , FIG. 1 is a schematic flowchart of a method for repairing a fault of a virtual machine according to an embodiment of the present invention. As shown in FIG. 1 , the method includes the following steps:

101. When detecting that the original virtual machine fails, use the image template to create a new virtual machine.

In this embodiment, the image template may be a mirror template used to create the original virtual machine. And the above image template may be pre-stored. In this way, step 101 can directly create a new virtual machine using the image template. In addition, the creation here may specifically be a system disk that creates a new virtual machine.

102. Set a MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual machine.

The MAC address of the new virtual machine is the same as the MAC address of the original virtual machine, and the data disk of the new virtual machine is the data disk of the original virtual machine, so that the new virtual machine and the original virtual machine can be created. The same virtual machine.

103. The new virtual machine is started, where the new virtual machine does not format a reserved area of the service data in the data disk when the data disk is partitioned.

After the new virtual machine is started, the data disk can be partitioned. Since the reserved area of the service data in the data disk is not formatted, the service data of the original virtual machine can be guaranteed to avoid loss of service data. In this way, when the new virtual machine runs, the business data of the original virtual machine can be used, so that it can be understood as repairing the original virtual machine.

In this embodiment, the foregoing method may be applied to a network function virtualization (NFV) distributed architecture, that is, the foregoing method may be implemented by one or more network devices in the NFV distributed architecture, for example: Network devices such as servers, computers, laptops, in-vehicle devices, and network televisions.

In this embodiment, when it is detected that the original virtual machine is faulty, the mirroring template is used to create a new virtual machine; the media access control MAC address of the original virtual machine is set to the MAC address of the new virtual machine, and the The data disk of the original virtual machine is mounted to the new virtual machine; the new virtual machine is started, wherein when the new virtual machine partitions the data disk, the service data in the data disk is not located. The reserved area is formatted. In this way, in the repair process, the faulty virtual machine is prevented from being repaired by the DHCP network, so that the VM in the DHCP-free service scenario can be repaired.

Referring to FIG. 2, FIG. 2 is a schematic flowchart of another virtual machine fault repairing method according to an embodiment of the present invention. As shown in FIG. 2, the method includes the following steps:

When the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds the first time threshold, the original virtual machine is reset, and the original virtual machine is detected after the original virtual machine is reset. Whether the time when the HA arbitration module stops transmitting the message packet exceeds the second time threshold, and if yes, determining that the original virtual machine has failed, the flow may be ended.

In this embodiment, the foregoing virtual machine and the HA arbitration module stop transmitting the message packet, which may be understood as the heartbeat interruption of the original virtual machine HA arbitration module. In addition, the message packet may be any message packet transmitted by the original virtual machine and the HA arbitration module. The arbitration module can be an HA arbitration module in the NFV distributed architecture.

Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds the second time threshold after the reset of the original virtual machine is understood to be that the original virtual machine is reset to the start time of the timer, when the timer reaches the above At the second time threshold, when the original virtual machine has not transmitted the message packet with the HA arbitration module, it determines that the original virtual machine has failed.

The first time threshold and the second time threshold may be preset time thresholds.

Through step 201, automatic fault detection of the virtual machine is realized, manual detection is avoided, and the timeliness of the fault alarm is improved.

202. When detecting that the original virtual machine fails, use the image template to create a new virtual machine.

In this embodiment, when the fault of the original virtual machine is detected, the HA arbitration module may detect that the original virtual machine is faulty, and the HA may manage and orchestrate to the NFV distributed architecture (Management And Orchestration, MANO). The module sends a request to rebuild the virtual machine message. After receiving the message, the MANO module can notify the infrastructure layer (I layer) of the NFV distributed architecture to create a new virtual machine, and the I layer uses the mirror template to create the new virtual machine.

In this embodiment, the creating a new virtual machine by using the image template of the source virtual machine may include:

In this embodiment, a new virtual machine that includes only system disks can be implemented.

203. Set a MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual.

In this embodiment, the data disk may be all data disks except the system disk in the original virtual machine. In addition, the above-mentioned mounting of the data disk of the original virtual machine to the new virtual may be understood as using the data disk of the original virtual machine as the data disk of the new virtual machine. The mount may be that the I layer mounts the data disk of the original virtual machine to the new virtual using an Application Programming Interface (API). Specifically, the I layer finds the data disk of the original virtual machine through the site identifier and the description file of the original virtual machine, and then uses the API interface to mount the data disk of the original virtual machine to the new virtual machine.

In this embodiment, the step of mounting the data disk of the original virtual machine to the new virtual machine may include:

Unloading the data disk of the original virtual machine, and mounting the unloaded data disk to the new virtual machine.

In addition, in this implementation manner, the foregoing method may further include:

Delete the original virtual machine.

In this way, the virtual machine can be automatically deleted during the virtual machine repair process, thus improving efficiency.

204. The new virtual machine is started, where the new virtual machine does not format a reserved area of service data in the data disk when partitioning the data disk.

In this embodiment, the data disk may include a partition table, and the reserved area in which the service data of the data disk is located may be indicated in the partition table. For example, the partition description module indicates, in the partition table, an area in which the service data of the data disk is located as a reserved area.

In addition, when the data disk is partitioned, the new virtual machine does not format the reserved area where the service data in the data disk is located according to the partition table, and the service data is divided in the data disk. The area outside the reserved area is formatted.

In addition, after the necessary formatting of the data disk, the new virtual machine can re-install the application (Application, APP) on the data disk, thereby restoring the entire virtual machine. For example, use the business data in the reserved area to install the APP installed on the original virtual machine. In addition, since the business data is reserved, the installed APP can also retain the data recorded on the original virtual machine.

In addition, in this embodiment, file system check and repair can also be performed on the reserved area. It can be formatted when the check and repair fails.

In addition, in the foregoing embodiment, the virtual machine is deployed for the service active/standby deployment, because the process of redistributing the service to the new virtual machine is not introduced during the virtual machine failure recovery process.

In this embodiment, a plurality of optional implementation manners are added on the basis of the embodiment shown in FIG. 1, and the VMs in the DHCP-free service scenario can be fault-repaired.

Referring to FIG. 3, FIG. 3 is a schematic diagram of another virtual machine fault repairing method according to an embodiment of the present invention. As shown in FIG. 3, the method includes the following steps:

301. The APP VM detects whether the heartbeat interruption of the original VM and the HA arbitration module exceeds TI seconds. If yes, step 302 is performed; the APP VM may be understood as a function module for installing the VM.

302; the original VM is reset;

303, the APP VM detects whether the heartbeat interruption of the original VM and the HA arbitration module exceeds T2 seconds, and if yes, step 304 is performed;

304. The APP VM sends a message to the MANO to rebuild the VM system disk. The step may be that the HA arbitration module notifies the MANO module to send a message for rebuilding the VM system disk.

305. The MANO module notifies the I layer to use the image to create the VM.

306. The I layer module creates a new VM with only the system disk;

307. The MANO module notifies the I layer module to change the MAC of the new VM to the original VM.

308. The I layer module changes the MAC of the new VM to be the same as the original VM.

309. The MANO module notifies the I layer module to unload the data disk from the original VM and hold the data disk to the new VM.

3010. The I layer module unloads the data disk from the original VM and holds the data disk to the new VM;

3011, the MANO module notifies the I layer module to delete the original VM;

3012. The I layer module deletes the original VM;

3013. The MANO module notifies the APP VM that the reconstruction is successful;

3014. The APP VM starts a new VM from the system disk.

3015. The APP VM installs the APP on the data disk of the new VM.

It should be noted that the APP VM, the MANO module, and the I layer module may be functional modules located in the same network device or in different network devices in the NFV distributed architecture.

The following is a device embodiment of the present invention. The device embodiment of the present invention is used to perform the method for implementing the first to second embodiments of the present invention. For the convenience of description, only parts related to the embodiment of the present invention are shown, and the specific technical details are not disclosed. Please refer to Embodiment 1 and Embodiment 2 of the present invention.

Please refer to FIG. 4. FIG. 4 is a schematic structural diagram of a virtual machine fault repairing apparatus according to an embodiment of the present invention, as shown in FIG. As shown, it includes: a creating unit 41, a setting unit 42 and a starting unit 43, wherein:

The creating unit 41 is configured to create a new virtual machine by using a mirror template when detecting that the original virtual machine fails.

In this embodiment, the image template may be a mirror template used to create the original virtual machine. And the above image template may be pre-stored. This way, the creation unit 41 can directly create a new virtual machine using the image template. In addition, the creation here may specifically be a system disk that creates a new virtual machine.

The setting unit 42 is configured to set a media access control MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual machine.

The setting unit 42 can realize that the MAC address of the new virtual machine is the same as the MAC address of the original virtual machine, and the data disk of the new virtual machine is the data disk of the original virtual machine, so that the created new virtual machine and the original virtual machine can be realized. For the same virtual machine.

The startup unit 43 is configured to start the new virtual machine, where the new virtual machine does not format the reserved area of the service data in the data disk when the data disk is partitioned.

In this embodiment, the foregoing apparatus may be applied to a network function virtualization (NFV) distributed architecture, that is, the foregoing apparatus may be implemented by one or more network devices in an NFV distributed architecture, for example, a server or a computer. , laptop computers, car equipment, network television and other network equipment.

Referring to FIG. 5, FIG. 5 is a schematic structural diagram of another virtual machine fault repairing apparatus according to an embodiment of the present invention. As shown in FIG. 5, the method includes: a detecting unit 51, a creating unit 52, a setting unit 53, and a starting unit 54, among them:

The detecting unit 51 is configured to: when detecting that the original virtual machine and the high availability HA arbitration module stop transmitting the message packet exceed a first time threshold, reset the original virtual machine, and detect that the original virtual machine is reset Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds the second time threshold, and if yes, determining that the original virtual machine is faulty.

Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet after the original virtual machine is reset exceeds The second time threshold can be understood as: the original virtual machine is reset to the start time of the timer. When the timer reaches the second time threshold, when the original virtual machine has not transmitted the message packet with the HA arbitration module, the original virtual machine is determined. malfunction.

The detection unit 51 automatically detects the failure of the virtual machine, avoids manual detection, and improves the timeliness of the failure alarm.

The creating unit 52 is configured to create a new virtual machine by using a mirror template when detecting that the original virtual machine fails.

In this embodiment, when the fault of the original virtual machine is detected, the HA arbitration module may detect that the original virtual machine is faulty, and the HA may send a request to rebuild the virtual machine message to the MANO module in the NFV distributed architecture, MANO After receiving the message, the module can notify the infrastructure layer (I layer) of the NFV distributed architecture to create a new virtual machine, and the I layer uses the mirror template to create the new virtual machine.

Additionally, the creation unit 52 can be used to create a new virtual machine that includes only system disks using a mirror template.

In this embodiment, it is possible to implement the creation of only new virtual machines that only include system disks.

The setting unit 53 is configured to set a media access control MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual machine.

In this embodiment, the data disk may be all data disks except the system disk in the original virtual machine. In addition, the above-mentioned mounting of the data disk of the original virtual machine to the new virtual may be understood as using the data disk of the original virtual machine as the data disk of the new virtual machine. The mount may be that the I layer uses the API to mount the data disk of the original virtual machine to the new virtual. Specifically, the I layer finds the data disk of the original virtual machine through the site identifier and the description file of the original virtual machine, and then uses the API interface to mount the data disk of the original virtual machine to the new virtual machine.

In this embodiment, the setting unit 53 is further configured to: uninstall the data disk of the original virtual machine, and mount the unloaded data disk to the new virtual machine;

As shown in FIG. 6, the foregoing apparatus may further include:

The deleting unit 55 is configured to delete the original virtual machine.

The startup unit 54 is configured to start the new virtual machine, where the new virtual machine does not format the reserved area of the service data in the data disk when the data disk is partitioned.

In addition, after the necessary formatting of the data disk, the new virtual machine can re-install the APP on the data disk, thereby restoring the entire virtual machine. For example, use the business data in the reserved area to install the APP installed on the original virtual machine. In addition, since the business data is reserved, the installed APP can also retain the data recorded on the original virtual machine.

In this embodiment, a plurality of optional implementation manners are added on the basis of the embodiment shown in FIG. 4, and the VMs in the DHCP-free service scenario can be fault-repaired.

Referring to FIG. 7, FIG. 7 is a schematic structural diagram of another virtual machine fault repair apparatus according to an embodiment of the present invention. As shown in FIG. 7, the processor 71 includes a processor 71, a network interface 72, a memory 73, and a communication bus 74. The communication bus 74 is configured to implement connection communication between the processor 71, the network interface 72, and the memory 73, and the processor 71 executes a program stored in the memory 73 for implementing the following method:

Setting a media access control MAC address of the original virtual machine to a MAC address of the new virtual machine, and mounting the data disk of the original virtual machine to the new virtual machine;

In this embodiment, the program executed by the processor 71 may further include:

When it is detected that the original virtual machine and the high availability HA arbitration module stop transmitting the message packet exceed the first time threshold, reset the original virtual machine, and detect the original virtual machine after the original virtual machine is reset. Whether the time when the HA arbitration module stops transmitting the message packet exceeds a second time threshold, and if so, determines that the original virtual machine has failed.

In this embodiment, the processor 71 executes a program for creating a new virtual machine by using the image template of the source virtual machine, and may include:

In this embodiment, the data disk may include a partition table, where the partition table indicates a reserved area where the service data of the data disk is located, and when the new virtual machine partitions the data disk, And formatting, according to the partition table, a reserved area where the service data in the data disk is located, and formatting an area of the data disk other than the reserved area where the service data is located.

In this embodiment, the program executed by the processor 71 to mount the data disk of the original virtual machine to the new virtual machine may include:

In addition, the program executed by the processor 71 may further include:

Delete the original virtual machine.

In this embodiment, when it is detected that the original virtual machine is faulty, the mirroring template is used to create a new virtual machine; the media access control MAC address of the original virtual machine is set to the MAC address of the new virtual machine, and the Data disk of the original virtual machine Mounting to the new virtual machine; starting the new virtual machine, wherein the new virtual machine does not format a reserved area of service data in the data disk when partitioning the data disk. In this way, in the repair process, the faulty virtual machine is prevented from being repaired by the DHCP network, so that the VM in the DHCP-free service scenario can be repaired.

One of ordinary skill in the art can understand that all or part of the process of implementing the foregoing embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, the flow of an embodiment of the methods as described above may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

The above is only the preferred embodiment of the present invention, and the scope of the present invention is not limited thereto, and thus equivalent changes made in the claims of the present invention are still within the scope of the present invention.

Claims

A virtual machine fault repairing method, comprising:

Create a new virtual machine using the mirror template when it detects that the original virtual machine has failed.

Setting a media access control MAC address of the original virtual machine to a MAC address of the new virtual machine, and mounting the data disk of the original virtual machine to the new virtual machine;

The new virtual machine is started, wherein when the new virtual machine partitions the data disk, the reserved area where the service data in the data disk is located is not formatted.
The method of claim 1 wherein the method further comprises:

When it is detected that the original virtual machine and the high availability HA arbitration module stop transmitting the message packet exceed the first time threshold, reset the original virtual machine, and detect the original virtual machine after the original virtual machine is reset. Whether the time when the HA arbitration module stops transmitting the message packet exceeds a second time threshold, and if so, determines that the original virtual machine has failed.
The method of claim 1 or 2, the creating a new virtual machine by using a mirror template of the source virtual machine, comprising:

Use a mirroring template to create a new virtual machine that includes only system disks.
The method according to claim 1 or 2, wherein the data disk comprises a partition table, and the reserved area in which the service data of the data disk is located is indicated in the partition table, and the new virtual machine is When the data disk is partitioned, the reserved area where the service data in the data disk is located is not formatted according to the partition table, and the data disk is excluded from the reserved area where the service data is located. The area is formatted.
The method of claim 1 or 2, wherein the mounting the data disk of the original virtual machine to the new virtual machine comprises:

Unloading the data disk of the original virtual machine, and mounting the unloaded data disk to the new virtual machine;

The method further includes:

Delete the original virtual machine.
A virtual machine fault repairing device, comprising: a creating unit, a setting unit and a starting unit, wherein:

The creating unit is configured to create a new virtual machine by using a mirror template when detecting that the original virtual machine is faulty;

The setting unit is configured to set a media access control MAC address of the original virtual machine to a MAC address of the new virtual machine, and mount the data disk of the original virtual machine to the new virtual machine;

The startup unit is configured to start the new virtual machine, where the new virtual machine does not format a reserved area of the service data in the data disk when the data disk is partitioned.
The device of claim 6 wherein said device further comprises:

a detecting unit, configured to: when detecting that the original virtual machine and the high availability HA arbitration module stop transmitting the message packet exceed a first time threshold, reset the original virtual machine, and detect that after the original virtual machine is reset Whether the time when the original virtual machine and the HA arbitration module stop transmitting the message packet exceeds a second time threshold, and if yes, determining that the original virtual machine is faulty.
The apparatus according to claim 6 or 7, said creating unit for creating a new virtual machine including only a system disk using a mirror template.
The device according to claim 6 or 7, wherein the data disk comprises a partition table, and the reserved area in which the service data of the data disk is located is indicated in the partition table, and the new virtual machine is When the data disk is partitioned, the reserved area where the service data in the data disk is located is not formatted according to the partition table, and the data disk is excluded from the reserved area where the service data is located. The area is formatted.
The device according to claim 6 or 7, wherein the setting unit is further configured to uninstall the data disk of the original virtual machine, and mount the unloaded data disk to the new virtual machine. ;

The device also includes:

A delete unit is used to delete the original virtual machine.