WO2016106756A1 - Disaster recovery method, system and apparatus - Google Patents

Disaster recovery method, system and apparatus

Info

Publication number
WO2016106756A1
Authority
WO
WIPO (PCT)
Prior art keywords
disaster recovery
virtual machines
storage
disaster
management platform
Prior art date
Application number
PCT/CN2014/096068
Other languages
French (fr)
Chinese (zh)
Inventor
邹锋哨
Original Assignee
华为技术有限公司
Priority date
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to PCT/CN2014/096068
Publication of WO2016106756A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/455 Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation

Abstract

Provided are a disaster recovery method, system and apparatus. The system comprises a production center and a disaster recovery center. The production center comprises a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center comprises a primary logical unit number (LUN). The disaster recovery center comprises a storage device, which comprises a secondary LUN. The disaster recovery management platform sends the virtualization platform a request to quiesce (perform silent processing on) one or more virtual machines of the primary LUN. The virtualization platform quiesces the one or more virtual machines according to the request and returns a quiesce response to the disaster recovery management platform. On receiving the response, the disaster recovery management platform sends a start-remote-replication request to the storage device of the production site.

Description

Disaster recovery method, system and apparatus
Technical field
The present invention relates to the field of information technology, and in particular to a disaster recovery method, system and apparatus.
Background
Many production systems today require data disaster recovery for important business data. Data disaster recovery, also known as remote data replication, means establishing a data system at a remote site that holds a usable copy of the local data. If a disaster strikes the local data and the entire application system, at least one usable copy of the critical business data is still kept at the remote site.
A typical data disaster recovery system includes a production center and a disaster recovery center. Hosts and storage devices deployed in the production center support normal service operation; hosts and storage devices deployed in the disaster recovery center take over those services after a disaster occurs in the production center. The storage devices of both the production center and the disaster recovery center contain multiple logical unit numbers (LUNs).
To ensure that the data at the disaster recovery center can support service takeover after a disaster, the data replicated to the disaster recovery center must be consistent. In the prior art, snapshot technology can be used to solve this problem. A snapshot is an image of the data at a point in time (the time at which the copy starts). Its purpose is to create a view of the LUN's state at a specific point in time, through which the LUN's data as of the moment of creation can be seen. Data can then be replicated from this snapshot view. In addition, to guarantee data consistency, client agent software must be installed in each protected virtual machine. During asynchronous remote replication, the storage device of the production center sends a quiesce request to the client agent software in the virtual machine; the agent suspends the write I/O received by the virtual machine and flushes to disk any data that the virtual machine has written to cache but not yet to disk. After the cache flush completes, the agent returns a flush-complete message to the storage device of the production center. On receiving that message, the storage device starts remote replication: it first takes a snapshot of the primary LUN and, once the snapshot succeeds, synchronizes the primary LUN snapshot to the secondary LUN of the storage device in the disaster recovery center.
It follows that the prior art requires client agent software to be installed in each protected virtual machine in order to implement disaster recovery, whereas virtualization scenarios generally require virtual machines to be agentless. Moreover, because storage devices from different vendors may differ, the client agent software becomes fragmented and cannot be unified.
Summary of the invention
Embodiments of the present invention provide a disaster recovery method, system and apparatus.
In a first aspect, a disaster recovery system is provided. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center includes a primary logical unit number (LUN). The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN:
The disaster recovery management platform is configured to send, to the virtualization platform, a request to quiesce one or more virtual machines of the primary LUN;
The virtualization platform is configured to quiesce the one or more virtual machines according to the quiesce request, and to return to the disaster recovery management platform a response for quiescing the one or more virtual machines;
The disaster recovery management platform is configured to receive the response for quiescing the one or more virtual machines, and to send a start-remote-replication request to the storage device of the production site;
The storage device of the production site is configured to take a snapshot of the primary LUN and to copy the primary LUN snapshot to the secondary LUN of the storage device at the disaster recovery site.
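The four roles in the first aspect can be sketched as a small in-memory model. This is a minimal illustration, not the patented implementation: every class, method and string below is a hypothetical name, and it assumes that quiescing a virtual machine means suspending its writes and flushing cached data to disk, as the background section describes for the agent-based approach.

```python
from dataclasses import dataclass, field


@dataclass
class VirtualMachine:
    vm_id: str
    quiesced: bool = False
    cached_writes: list = field(default_factory=list)  # data not yet flushed to disk


@dataclass
class StorageDevice:
    luns: dict = field(default_factory=dict)  # LUN id -> list of data blocks on disk

    def start_remote_replication(self, primary_lun, target, secondary_lun):
        snapshot = list(self.luns[primary_lun])   # point-in-time snapshot of the primary LUN
        target.luns[secondary_lun] = snapshot     # copy the snapshot to the secondary LUN
        return "replication-started"


class VirtualizationPlatform:
    def __init__(self, storage, vms_by_lun):
        self.storage = storage
        self.vms_by_lun = vms_by_lun              # LUN id -> VirtualMachine objects on that LUN

    def quiesce(self, lun_id):
        for vm in self.vms_by_lun[lun_id]:
            vm.quiesced = True                    # assumed: stop accepting new write I/O
            self.storage.luns[lun_id].extend(vm.cached_writes)  # assumed: flush cache to disk
            vm.cached_writes.clear()
        return "quiesced"


class DisasterRecoveryManager:
    def __init__(self, platform, production_storage, recovery_storage):
        self.platform = platform
        self.production_storage = production_storage
        self.recovery_storage = recovery_storage

    def replicate(self, primary_lun, secondary_lun):
        # Step 1: ask the virtualization platform (not an in-guest agent) to quiesce.
        if self.platform.quiesce(primary_lun) != "quiesced":
            raise RuntimeError("quiesce failed; replication not started")
        # Step 2: only then ask production storage to snapshot and replicate.
        return self.production_storage.start_remote_replication(
            primary_lun, self.recovery_storage, secondary_lun
        )


production = StorageDevice({"lun-1": ["block-a"]})
recovery = StorageDevice()
vm = VirtualMachine("vm-1", cached_writes=["block-b"])
manager = DisasterRecoveryManager(
    VirtualizationPlatform(production, {"lun-1": [vm]}), production, recovery
)
status = manager.replicate("lun-1", "lun-1-replica")
print(status, recovery.luns["lun-1-replica"])
```

Because the flush happens before the snapshot, the replica in this sketch contains both the data already on disk and the data that was still cached when the request arrived.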
With reference to the first aspect, in a first possible implementation of the first aspect, when the quiesce request contains the identifier of the primary LUN, the virtualization platform is further configured to obtain, before quiescing the one or more virtual machines, the identifiers of the one or more virtual machines of the primary LUN according to the identifier of the primary LUN.
With reference to the first aspect, in a second possible implementation of the first aspect, when the request contains the identifiers of the one or more virtual machines, the disaster recovery management platform is further configured to send a query request to the virtualization platform before sending the request to quiesce the one or more virtual machines of the primary LUN, the query request including the identifier of the primary LUN;
The disaster recovery management platform is further configured to obtain the virtual machine list of the primary LUN that the virtualization platform obtains based on the identifier of the primary LUN, the virtual machine list including the identifiers of the one or more virtual machines of the primary LUN.
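The two implementations above differ only in who resolves the primary LUN identifier into virtual machine identifiers. A minimal sketch of both request shapes follows; the class, the `query_vms` and `quiesce` method names and the dictionary lookup are illustrative assumptions, not an interface defined by the source.

```python
class VirtualizationPlatform:
    def __init__(self, vms_by_lun):
        self._vms_by_lun = vms_by_lun    # LUN id -> list of VM ids (hypothetical inventory)
        self.quiesced = set()

    def query_vms(self, lun_id):
        """Return the VM list for a LUN (the 'query request' path)."""
        return list(self._vms_by_lun.get(lun_id, []))

    def quiesce(self, lun_id=None, vm_ids=None):
        """Accept either a LUN id or explicit VM ids in the quiesce request."""
        if vm_ids is None:
            if lun_id is None:
                raise ValueError("request must carry a LUN id or VM ids")
            # First implementation: the platform resolves the LUN id itself.
            vm_ids = self.query_vms(lun_id)
        self.quiesced.update(vm_ids)
        return sorted(vm_ids)


platform = VirtualizationPlatform({"lun-1": ["vm-a", "vm-b"]})

# First implementation: the request carries only the primary LUN identifier.
by_lun = platform.quiesce(lun_id="lun-1")

# Second implementation: the manager queries the VM list first, then sends VM ids.
vm_list = platform.query_vms("lun-1")
by_vm_ids = platform.quiesce(vm_ids=vm_list)
```

Both calls end up quiescing the same set of virtual machines; the second shape simply moves the lookup to the disaster recovery management platform.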
With reference to the first aspect, the first possible implementation of the first aspect or the second possible implementation of the first aspect, in a third possible implementation of the first aspect, when the primary LUN has multiple virtual machines, the virtualization platform being configured to quiesce the multiple virtual machines and to return to the disaster recovery management platform a response for quiescing the multiple virtual machines specifically includes:
The virtualization platform is configured to notify each of the multiple virtual machines to quiesce, and to receive the quiesce response returned by each virtual machine;
When the virtualization platform determines that it has received the response returned by every one of the multiple virtual machines, the virtualization platform is configured to return to the disaster recovery management platform the response for quiescing the multiple virtual machines.
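The "respond only after every virtual machine has replied" rule can be sketched as a pending-set tracker. The `QuiesceTracker` class and its method name are hypothetical; the source does not say how the virtualization platform keeps track of outstanding responses.

```python
class QuiesceTracker:
    """Tracks which VMs still owe a quiesce response (illustrative helper)."""

    def __init__(self, vm_ids):
        self.pending = set(vm_ids)

    def on_vm_response(self, vm_id):
        """Record one VM's response; return True once all VMs have responded."""
        self.pending.discard(vm_id)
        return not self.pending


tracker = QuiesceTracker(["vm-a", "vm-b", "vm-c"])

# Responses may arrive in any order; only the last one completes the set.
results = [tracker.on_vm_response(vm) for vm in ["vm-b", "vm-a", "vm-c"]]
print(results)
```

Only when `on_vm_response` returns `True` would the platform send its single aggregated response to the disaster recovery management platform.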
With reference to the first aspect or any one of the first to third possible implementations of the first aspect, in a fourth possible implementation of the first aspect, the storage device of the production site is further configured to return a start-remote-replication response to the disaster recovery management platform;
The disaster recovery management platform is further configured to receive the start-remote-replication response and to send, to the virtualization platform, a request to unquiesce the one or more virtual machines of the primary LUN;
The virtualization platform is further configured to unquiesce the one or more virtual machines according to the unquiesce request, and to return to the disaster recovery management platform a response for unquiescing the one or more virtual machines.
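Putting the fourth implementation together with the basic flow gives a three-message cycle: quiesce, start remote replication, unquiesce. The sketch below only records the order of messages; all class names, method names and return strings are assumptions made for illustration.

```python
class VirtualizationPlatform:
    def __init__(self, log):
        self.log = log                       # shared message log (illustrative)

    def quiesce(self, lun_id):
        self.log.append(f"quiesce {lun_id}")
        return "quiesce-ok"

    def unquiesce(self, lun_id):
        self.log.append(f"unquiesce {lun_id}")
        return "unquiesce-ok"


class ProductionStorage:
    def __init__(self, log):
        self.log = log

    def start_remote_replication(self, lun_id):
        self.log.append(f"snapshot+replicate {lun_id}")
        return "replication-started"


def run_cycle(platform, storage, lun_id):
    """One replication cycle as driven by the disaster recovery management platform."""
    if platform.quiesce(lun_id) != "quiesce-ok":
        raise RuntimeError("quiesce failed")
    if storage.start_remote_replication(lun_id) != "replication-started":
        raise RuntimeError("remote replication did not start")
    # The unquiesce request is sent only after the storage device has responded.
    return platform.unquiesce(lun_id)


log = []
result = run_cycle(VirtualizationPlatform(log), ProductionStorage(log), "lun-1")
print(log)
```

The sketch raises on failure without unquiescing; how errors are handled is not covered by the text above, so that choice is purely a simplification.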
In a second aspect, an embodiment of the present invention provides another disaster recovery system. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center includes a primary logical unit number (LUN). The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN:
The disaster recovery management platform is configured to obtain the identifiers of multiple virtual machines of the primary LUN, and to send, to the virtualization platform, a request to quiesce each of the multiple virtual machines;
The virtualization platform is configured to quiesce each virtual machine according to the quiesce request, and to return to the disaster recovery management platform a response for quiescing each virtual machine;
The disaster recovery management platform is configured to receive the response for quiescing each virtual machine, and to send a start-remote-replication request to the storage device of the production site;
The storage device of the production site is configured to take a snapshot of the primary LUN and to copy the primary LUN snapshot to the secondary LUN of the storage device at the disaster recovery site.
With reference to the second aspect, in a first possible implementation of the second aspect, the disaster recovery management platform being configured to obtain the identifiers of the multiple virtual machines of the primary LUN specifically includes:
The disaster recovery management platform is configured to send a query request to the virtualization platform, the query request including the identifier of the primary LUN;
The disaster recovery management platform is further configured to obtain the virtual machine list of the primary LUN that the virtualization platform obtains based on the identifier of the primary LUN, the virtual machine list including the identifiers of the multiple virtual machines of the primary LUN.
With reference to the second aspect or the first possible implementation of the second aspect, in a second possible implementation of the second aspect, the disaster recovery management platform being configured to receive the response for quiescing each virtual machine and to send a start-remote-replication request to the storage device of the production site specifically includes:
The disaster recovery management platform is configured to receive the response for quiescing each virtual machine and, upon determining that it has received the response returned for every one of the multiple virtual machines, to send the start-remote-replication request to the storage device of the production site.
In a third aspect, an embodiment of the present invention provides a disaster recovery method applied to a disaster recovery system. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center includes a primary logical unit number (LUN). The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN:
The disaster recovery management platform sends, to the virtualization platform, a request to quiesce one or more virtual machines of the primary LUN;
The disaster recovery management platform receives the virtualization platform's response for quiescing the one or more virtual machines, and sends a start-remote-replication request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and copies the primary LUN snapshot to the secondary LUN of the storage device at the disaster recovery site.
With reference to the third aspect, in a first possible implementation of the third aspect, when the request contains the identifiers of the one or more virtual machines, the disaster recovery management platform further sends a query request to the virtualization platform before sending the request to quiesce the one or more virtual machines of the primary LUN, the query request including the identifier of the primary LUN;
The disaster recovery management platform obtains the virtual machine list of the primary LUN that the virtualization platform obtains based on the identifier of the primary LUN, the virtual machine list including the identifiers of the one or more virtual machines of the primary LUN.
With reference to the third aspect or the first possible implementation of the third aspect, in a second possible implementation of the third aspect, the disaster recovery management platform receives the start-remote-replication response sent by the storage device of the production site, and sends, to the virtualization platform, a request to unquiesce the one or more virtual machines of the primary LUN;
The disaster recovery management platform further receives the response, returned by the virtualization platform, for unquiescing the one or more virtual machines.
In a fourth aspect, an embodiment of the present invention provides a disaster recovery method applied to a disaster recovery system. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center includes a primary logical unit number (LUN):
The virtualization platform receives, from the disaster recovery management platform, a request to quiesce one or more virtual machines of the primary LUN;
The virtualization platform quiesces the one or more virtual machines according to the quiesce request, and returns to the disaster recovery management platform a response for quiescing the one or more virtual machines, so that the disaster recovery management platform sends a start-remote-replication request to the storage device of the production site.
With reference to the fourth aspect, in a first possible implementation of the fourth aspect, when the quiesce request contains the identifier of the primary LUN, the virtualization platform further obtains, before quiescing the one or more virtual machines, the identifiers of the one or more virtual machines of the primary LUN according to the identifier of the primary LUN.
With reference to the fourth aspect, in a second possible implementation of the fourth aspect, the virtualization platform further receives a query request sent by the disaster recovery management platform, the query request including the identifier of the primary LUN;
The virtualization platform obtains the virtual machine list of the primary LUN based on the identifier of the primary LUN, and returns the virtual machine list of the primary LUN to the disaster recovery management platform, the virtual machine list including the identifiers of the one or more virtual machines of the primary LUN.
With reference to the fourth aspect, the first possible implementation of the fourth aspect or the second possible implementation of the fourth aspect, in a third possible implementation of the fourth aspect, when the primary LUN has multiple virtual machines, the virtualization platform quiescing the multiple virtual machines and returning to the disaster recovery management platform a response for quiescing the multiple virtual machines specifically includes:
The virtualization platform notifies each of the multiple virtual machines to quiesce, and receives the quiesce response returned by each virtual machine;
When the virtualization platform determines that it has received the response returned by every one of the multiple virtual machines, the virtualization platform returns to the disaster recovery management platform the response for quiescing the multiple virtual machines.
In a fifth aspect, an embodiment of the present invention provides a disaster recovery method applied to a disaster recovery system. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management platform, a virtualization platform and a storage device, and the storage device of the production center includes a primary logical unit number (LUN). The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN:
The disaster recovery management platform obtains the identifiers of multiple virtual machines of the primary LUN, and sends, to the virtualization platform, a request to quiesce each of the multiple virtual machines;
The disaster recovery management platform receives the virtualization platform's response for quiescing each virtual machine, and sends a start-remote-replication request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and copies the primary LUN snapshot to the secondary LUN of the storage device at the disaster recovery site.
With reference to the fifth aspect, in a first possible implementation of the fifth aspect, the disaster recovery management platform obtaining the identifiers of the multiple virtual machines of the primary LUN specifically includes:
The disaster recovery management platform sends a query request to the virtualization platform, the query request including the identifier of the primary LUN;
The disaster recovery management platform obtains the virtual machine list of the primary LUN that the virtualization platform obtains based on the identifier of the primary LUN, the virtual machine list including the identifiers of the multiple virtual machines of the primary LUN.
The one or more technical solutions provided in the embodiments of the present invention have at least the following technical effects or advantages. By adding a quiesce interface, the disaster recovery management platform can send, through that interface, a request to the virtualization platform to quiesce one or more virtual machines of the primary LUN. Based on the quiesce request, the virtualization platform quiesces the one or more virtual machines and returns a quiesce response to the disaster recovery management platform. The disaster recovery management platform can then send a start-remote-replication request to the storage device of the production site to protect the primary LUN, without client agent software being installed in the virtual machines. This achieves agentless disaster recovery and improves the availability of the disaster recovery solution.
Brief description of the drawings
FIG. 1 is a schematic structural diagram of a disaster recovery system according to an embodiment of the present invention;
FIG. 2A is a schematic flowchart of a quiesce request according to an embodiment of the present invention;
FIG. 2B is a schematic flowchart of an unquiesce request according to an embodiment of the present invention;
FIG. 3 is a schematic flowchart of a disaster recovery method according to an embodiment of the present invention;
FIG. 4 is a schematic flowchart of another disaster recovery method according to an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of a disaster recovery management apparatus according to an embodiment of the present invention;
FIG. 6 is a schematic structural diagram of a disaster recovery apparatus according to an embodiment of the present invention;
FIG. 7 is a schematic structural diagram of another disaster recovery management apparatus according to an embodiment of the present invention;
FIG. 8 is a schematic structural diagram of a server according to an embodiment of the present invention.
Detailed description
Embodiments of the present invention provide a disaster recovery method, system and apparatus that solve the prior-art problem that client agent software must be installed in virtual machines in order to implement disaster recovery.
To make the objectives, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described below clearly and completely with reference to the accompanying drawings. The described embodiments are some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
System architecture of the embodiments of the present invention
Before the technical solutions of the embodiments of this application are described, the disaster recovery system is introduced first. The disaster recovery system includes a production center and a disaster recovery center. A person skilled in the art should understand that a specific implementation may have multiple production centers or multiple disaster recovery centers. In addition, the production center in the embodiments of this application may itself serve as the disaster recovery center of other centers. The production center and the disaster recovery center can transfer data over IP (Internet Protocol) or FC (Fibre Channel).
The basic structure of the production center and the disaster recovery center is described below with reference to FIG. 1.
The production center includes M storage devices for storing the production center's data, where M is a positive integer such as 3, 5, 8 or 73. The production center can write data to the M storage devices and read data from them. The production center has multiple servers. In a specific implementation, a storage device may be network attached storage (NAS) or a storage area network (SAN); a person skilled in the art may choose according to actual needs, and this application imposes no specific limitation.
The disaster recovery management platform provides disaster recovery management functions to users on top of the virtualization platform. Specifically, disaster recovery software can be deployed on the disaster recovery management platform to configure disaster recovery policies, replicate them to the disaster recovery center, or receive disaster recovery policies replicated from other centers that need the production center to provide disaster recovery capability. During disaster recovery, the platform provides the disaster recovery policy to its own center and restores the protected virtual machines according to that policy. The disaster recovery management platform can run on a host, which may be any computing device known in the art, such as a server or a desktop computer. The disaster recovery management platform and other applications are installed on the host.
The virtualization platform provides the ability to virtualize physical resources, specifically to provide virtualized compute, network and storage resources. The virtualization platform is installed on a physical server. It may also be called a virtual machine monitor (VMM). In the Xen architecture, the VMM function is performed by the hypervisor; in other architectures, it may be performed by vSphere or Hyper-V.
The storage device provides storage resources for virtual machines and may contain multiple logical unit numbers (LUNs).
Virtual machine (VM) technology virtualizes, on a single hardware platform, multiple independent virtual hardware systems whose instruction set architecture (ISA) is the same as that of the actual hardware. Each virtual hardware system can run a different operating system, namely a guest operating system (Guest OS). The guest operating system accesses the actual physical resources through the virtual machine monitor (VMM).
Before the method of the embodiments of this application is performed, an administrator configures the M storage devices in the production center. For ease of description, the configuration is described below for the case in which the M storage devices are SANs. A SAN contains multiple LUNs; for example, SAN1 in the production center contains LUN11, LUN12, and so on. The administrator divides the LUNs into multiple consistency groups as needed, each containing at least one LUN. In the embodiments of this application, a consistency group consists of one or more LUNs in a storage device that share the same storage replication information, and the data of a consistency group starts and stops replication together. The storage replication information indicates, among other things, the direction in which each storage device replicates data and the time interval at which it does so. A LUN in the production center may be called a primary (master) LUN, and a LUN in the disaster recovery center may be called a secondary (slave) LUN.
Further, the administrator configures the replication direction of each consistency group. In the embodiments of this application, configuring the replication direction means specifying the center whose storage device receives the data replicated from each LUN in the consistency group. The administrator also configures the replication time of each consistency group; here, the replication time is the interval between the time point of the current replication and the time point of the previous replication. Once configuration is complete, the data in the consistency group is replicated automatically according to the replication direction and the replication time.
After configuration is complete, the storage device generates the corresponding storage replication information, which may also be called configuration attributes.
For example, suppose the administrator places LUN11 and LUN12 in consistency group 1, the production center is located in Shenzhen, the replication direction of consistency group 1 is configured as from the production center to a center in Xi'an, and the replication time is 5 minutes. Storage replication information is then generated, for example:
ReplicateGroup1:{lunId=11,12};
ReplicateGroup1.replicateDirection:{from(shenzhen),to(xi'an)};
ReplicateGroup1.replicateInterval:{5min};
............
After configuration is complete, LUN11 and LUN12 in consistency group 1 replicate data every 5 minutes to two LUNs at the other center in Xi'an, for example LUN21 and LUN22.
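The storage replication information of the example above can be modelled as a small record. The following is a minimal Python sketch only; the class name `ConsistencyGroup` and its field names are illustrative assumptions and are not defined by the embodiment.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class ConsistencyGroup:
    """Hypothetical model of the storage replication information of one group."""
    name: str
    lun_ids: tuple        # LUNs that start and stop replication together
    source_site: str      # replication direction: from ...
    target_site: str      # ... to
    interval: timedelta   # time between two consecutive replications

    def next_replication(self, last: datetime) -> datetime:
        """Time point of the next replication, given the previous one."""
        return last + self.interval


# Values taken from the example: LUN11 and LUN12, Shenzhen to Xi'an, 5 minutes.
group1 = ConsistencyGroup(
    name="ReplicateGroup1",
    lun_ids=(11, 12),
    source_site="shenzhen",
    target_site="xi'an",
    interval=timedelta(minutes=5),
)
```

Keeping the direction and the interval on the group rather than on each LUN reflects the statement that all LUNs of a consistency group share the same storage replication information.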
During replication, the full data of the production center can be copied to the storage device of the other center each time. To reduce the data processing load, however, a preferred approach is to perform a full copy the first time and, from the second replication onward, to copy only the incremental data to the storage device of the other center. A person skilled in the art may choose according to actual needs, and this application imposes no specific limitation.
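The difference between the first (full) replication and later (incremental) replications can be illustrated as follows. This is a simplified sketch under assumptions: a LUN is represented as a dictionary of block numbers to contents, and the set `changed` stands in for whatever change tracking the storage device actually uses, which the text does not specify.

```python
def replicate(primary: dict, secondary: dict, changed: set, first_time: bool) -> int:
    """Copy blocks from the primary LUN to the secondary LUN.

    `primary` and `secondary` map block numbers to block contents; `changed`
    holds the block numbers written since the previous replication.
    Returns the number of blocks copied.
    """
    blocks = set(primary) if first_time else set(changed)
    for block in blocks:
        secondary[block] = primary[block]
    changed.clear()  # start tracking changes for the next cycle
    return len(blocks)


primary = {0: b"boot", 1: b"data-v1", 2: b"logs"}
secondary = {}
changed = set()

copied_first = replicate(primary, secondary, changed, first_time=True)    # full copy

primary[1] = b"data-v2"   # a write arrives after the first replication
changed.add(1)
copied_second = replicate(primary, secondary, changed, first_time=False)  # incremental
```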
Method flow of the embodiments of the present invention
To implement the solution of the embodiments of the present invention, the virtualization platform provides a quiesce (silence) interface and an unquiesce interface for virtual machines, where:
the quiesce interface is used to suspend the write IO operations of a virtual machine, to guarantee application consistency, and to flush to disk the data that has been written to the cache but not yet to disk, among other operations;
the unquiesce interface is used to resume the write IO operations of the virtual machine.
In the prior art, the virtualization platform does not provide separate quiesce and unquiesce interfaces; it provides only a snapshot interface. The processing after the quiesce and unquiesce interfaces are added is shown in FIG. 2A and FIG. 2B.
In FIG. 2A, the disaster recovery management platform (specifically, this may be the disaster recovery software on the platform) sends the virtualization platform a request to quiesce a virtual machine. On receiving the request, the virtualization platform quiesces the virtual machine, for example by suspending the write IO of the system in which the virtual machine runs and flushing to disk the data that has been written to the cache but not yet to disk. It then returns a quiesce response to the disaster recovery management platform.
In FIG. 2B, the disaster recovery management platform (specifically, this may be the disaster recovery software on the platform) sends the virtualization platform a request to unquiesce a virtual machine. On receiving the request, the virtualization platform unquiesces the virtual machine, for example by lifting the write IO suspension of the system in which the virtual machine runs, so that the virtual machine can read and write the disk normally and continue its normal services. It then returns an unquiesce response to the disaster recovery management platform.
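The two interfaces of FIG. 2A and FIG. 2B can be sketched as a pair of calls on the virtualization platform. The Python below is a toy model under assumptions: the class and method names are hypothetical, and the in-memory `cache` and `disk` dictionaries merely stand in for a real guest write cache and virtual disk.

```python
class VirtualMachine:
    """Toy VM with a write cache in front of its disk (illustrative only)."""

    def __init__(self, vm_id: str):
        self.vm_id = vm_id
        self.cache = {}               # data written but not yet on disk
        self.disk = {}
        self.writes_suspended = False

    def write(self, block: int, data: bytes) -> bool:
        if self.writes_suspended:
            return False              # write IO is held while quiesced
        self.cache[block] = data
        return True


class VirtualizationPlatform:
    """Hypothetical platform exposing separate quiesce and unquiesce interfaces."""

    def __init__(self, vms):
        self.vms = {vm.vm_id: vm for vm in vms}

    def quiesce(self, vm_id: str) -> str:
        vm = self.vms[vm_id]
        vm.writes_suspended = True    # suspend write IO
        vm.disk.update(vm.cache)      # flush cached data to disk
        vm.cache.clear()
        return "quiesced"

    def unquiesce(self, vm_id: str) -> str:
        self.vms[vm_id].writes_suspended = False  # resume write IO
        return "unquiesced"


vm = VirtualMachine("vm-1")
platform = VirtualizationPlatform([vm])
vm.write(0, b"pending")
quiesce_reply = platform.quiesce("vm-1")
write_accepted_while_quiesced = vm.write(1, b"late")
unquiesce_reply = platform.unquiesce("vm-1")
```

In this model a write issued while the VM is quiesced is simply rejected; a real platform would more likely hold the write until the unquiesce call, which the sketch does not attempt to reproduce.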
Another disaster recovery method provided by an embodiment of the present invention is described in detail below with reference to FIG. 3. The method can be applied to the system architecture shown in FIG. 1 and is described taking one primary LUN in the storage device of the production center as an example:
101. When the disaster recovery management platform determines, according to the disaster recovery policy, that remote replication is required, it sends the virtualization platform a request to quiesce (perform silence processing on) one or more virtual machines of the primary LUN.
Note that a virtual machine of the primary LUN is a virtual machine located on the primary LUN; there may be one such virtual machine or several.
Optionally, the disaster recovery management platform sends the virtualization platform a request to quiesce all virtual machines of the primary LUN. The embodiments of the present invention may quiesce all virtual machines of the primary LUN or only some of them; this is not limited.
The quiesce request may contain the identifier of the primary LUN, or it may contain the identifiers of the one or more virtual machines of the primary LUN.
When the request contains the identifiers of the one or more virtual machines, the disaster recovery management platform must, before sending the quiesce request, first send the virtualization platform a query request that contains the identifier of the primary LUN. The virtualization platform obtains the virtual machine list of the primary LUN based on that identifier, the list containing the identifiers of the one or more virtual machines of the primary LUN, and then returns the list to the disaster recovery management platform.
102. The virtualization platform quiesces the one or more virtual machines according to the quiesce request.
When the quiesce request contains the identifier of the primary LUN, the virtualization platform must, before quiescing the one or more virtual machines, obtain their identifiers from the identifier of the primary LUN.
The virtualization platform notifies the one or more virtual machines to quiesce. Specifically, it may notify the virtualization driver inside each virtual machine (such as VM TOOLS), and the virtualization driver calls the guest operating system (Guest OS) to quiesce IO. Quiescing means suspending IO and flushing to disk the data that has been written to the cache but not yet to disk. When processing is finished, the one or more virtual machines return a quiesce response to the virtualization platform.
103. When the primary LUN has multiple virtual machines, the virtualization platform returns the quiesce response for the multiple virtual machines to the disaster recovery management platform once it determines that it has received the response returned by each of them.
That is, once the virtualization platform determines that every one of the multiple virtual machines has been quiesced, it returns to the disaster recovery management platform the quiesce response for the multiple virtual machines of the primary LUN.
When the primary LUN has only one virtual machine, the virtualization platform can return the quiesce response for that virtual machine to the disaster recovery management platform as soon as it receives the response returned by the virtual machine.
104. The disaster recovery management platform receives the quiesce response for the one or more virtual machines and sends a start-remote-replication request to the storage device of the production site.
105. The storage device of the production site starts remote replication by first taking a snapshot of the primary LUN. Because the IO caches of the one or more virtual machines on the primary LUN have already been flushed to the primary LUN at this point, the disk data of these virtual machines on the primary LUN is guaranteed to be consistent.
106. After the snapshot of the primary LUN succeeds, the storage device of the production site returns a start-remote-replication response to the disaster recovery management platform.
110. After receiving the start-remote-replication response, the disaster recovery management platform sends the virtualization platform a request to unquiesce the one or more virtual machines of the primary LUN.
As in step 101, the unquiesce request may contain the identifier of the primary LUN, or it may contain the identifiers of the one or more virtual machines of the primary LUN. For details, see step 101.
111. The virtualization platform unquiesces the one or more virtual machines according to the unquiesce request, that is, it lifts the write IO suspension on the systems in which the one or more virtual machines run so that normal services continue.
Unquiescing is the reverse of quiescing and proceeds similarly; see step 102 for details.
112. When the virtualization platform determines that it has received the unquiesce responses returned by the one or more virtual machines, it returns the unquiesce response for the one or more virtual machines to the disaster recovery management platform.
115. The storage device of the production site replicates the snapshot of the primary LUN to the secondary LUN of the storage device of the disaster recovery site and, after data synchronization is complete, deletes the snapshot of the primary LUN.
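The ordering of steps 101 to 115 can be summarized in a short Python sketch. All class and function names here are hypothetical stand-ins, and two simplifications are assumed: the copy of step 115, which the storage device performs on its own, is invoked explicitly to keep the sketch sequential, and the `try`/`finally` that unquiesces the virtual machines even when the snapshot fails is a choice of this sketch rather than something the text states.

```python
class LunPlatform:
    """Stand-in virtualization platform; the VM placement per LUN is assumed."""

    def __init__(self, vms_by_lun):
        self.vms_by_lun = vms_by_lun
        self.quiesced = set()

    def quiesce_lun(self, lun_id):
        # Steps 102-103: quiesce every VM on the LUN, reply once all are done.
        vm_ids = self.vms_by_lun[lun_id]
        self.quiesced.update(vm_ids)
        return all(vm_id in self.quiesced for vm_id in vm_ids)

    def unquiesce_lun(self, lun_id):
        # Steps 111-112: resume write IO on every VM of the LUN.
        self.quiesced.difference_update(self.vms_by_lun[lun_id])
        return True


class ProductionStorage:
    """Stand-in storage device of the production site."""

    def __init__(self):
        self.snapshots = []
        self.secondary = []

    def start_remote_replication(self, lun_id):
        # Steps 105-106: snapshot the primary LUN, then acknowledge.
        self.snapshots.append(lun_id)
        return True

    def copy_snapshot_to_secondary(self, lun_id):
        # Step 115: copy the snapshot to the secondary LUN, then delete it.
        self.secondary.append(lun_id)
        self.snapshots.remove(lun_id)


def replication_cycle(platform, storage, lun_id):
    """Steps 101-115 as driven by the disaster recovery management platform."""
    trace = []
    if not platform.quiesce_lun(lun_id):                  # 101-103
        raise RuntimeError("quiesce failed")
    trace.append("quiesced")
    try:
        if not storage.start_remote_replication(lun_id):  # 104-106
            raise RuntimeError("snapshot failed")
        trace.append("snapshot")
    finally:
        platform.unquiesce_lun(lun_id)                    # 110-112
        trace.append("unquiesced")
    storage.copy_snapshot_to_secondary(lun_id)            # 115
    trace.append("copied")
    return trace


platform = LunPlatform({"LUN11": ["vm-a", "vm-b"]})
storage = ProductionStorage()
trace = replication_cycle(platform, storage, "LUN11")
```

The point of the ordering is that the virtual machines stay quiesced only for as long as the snapshot takes; the comparatively slow copy to the secondary LUN runs after write IO has resumed.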
With the method provided by this embodiment of the present invention, a quiesce interface is added so that the disaster recovery management platform can send the virtualization platform, through that interface, a request to quiesce one or more virtual machines of the primary LUN. The virtualization platform quiesces the one or more virtual machines based on the request and returns a quiesce response to the disaster recovery management platform, which can then send a start-remote-replication request to the storage device of the production site to perform disaster recovery for the primary LUN. No client agent software needs to be installed in the virtual machines, so agentless disaster recovery is achieved and the availability of the disaster recovery solution is improved.
Moreover, in the prior-art solution that implements disaster recovery through client agent software, the storage device sends a quiesce request to the client agent software in the virtual machine, and after the cache flush completes, the client agent software returns a flush-complete message to the storage device. This requires the virtual machine to communicate directly with the storage device, which poses a security risk. With the method provided by this embodiment, the virtual machine does not interact directly with the storage device; instead, the comparatively secure disaster recovery management platform communicates with the storage device, so the security of the data on the storage device is improved.
A disaster recovery method provided by an embodiment of the present invention is described in detail below with reference to FIG. 4. The method can be applied to the system architecture shown in FIG. 1 and is described taking one primary LUN in the storage device of the production center as an example:
201. When the disaster recovery management platform determines, according to the disaster recovery policy, that remote replication is required, it sends the virtualization platform a query request that contains the identifier of the primary LUN.
202. The virtualization platform obtains the virtual machine list of the primary LUN based on the identifier of the primary LUN; the list contains the identifiers of the multiple virtual machines of the primary LUN.
203. The virtualization platform returns the virtual machine list of the primary LUN to the disaster recovery management platform.
204. The disaster recovery management platform obtains the identifiers of the multiple virtual machines of the primary LUN from the virtual machine list and sends the virtualization platform a quiesce request for each of the multiple virtual machines, each quiesce request containing the identifier of one virtual machine.
Optionally, the disaster recovery management platform sends the virtualization platform a request to quiesce all virtual machines of the primary LUN. The embodiments of the present invention may quiesce all virtual machines of the primary LUN or only some of them; this is not limited.
205. The virtualization platform quiesces each virtual machine according to the quiesce requests.
The virtualization platform notifies each virtual machine to quiesce. Specifically, it may notify the virtualization driver inside the virtual machine (such as VM TOOLS), and the virtualization driver calls the guest operating system (Guest OS) to quiesce IO. Quiescing means suspending IO and flushing to disk the data that has been written to the cache but not yet to disk. When processing is finished, each virtual machine returns a quiesce response to the virtualization platform.
206. The virtualization platform returns the quiesce response for each virtual machine to the disaster recovery management platform.
When the virtualization platform finishes quiescing a given virtual machine, it returns the quiesce response for that virtual machine to the disaster recovery management platform. Alternatively, as in the method of FIG. 3, the virtualization platform may return a single quiesce response for the multiple virtual machines once it determines that it has received the response returned by each of them.
207. The disaster recovery management platform receives the quiesce response for each virtual machine and sends a start-remote-replication request to the storage device of the production site.
Specifically, after receiving the quiesce responses for the individual virtual machines, the disaster recovery management platform sends the start-remote-replication request to the storage device of the production site once it determines that it has received the response returned for each of the multiple virtual machines.
208. The storage device of the production site starts remote replication by first taking a snapshot of the primary LUN. Because the IO caches of the multiple virtual machines on the primary LUN have already been flushed to the primary LUN at this point, the disk data of these virtual machines on the primary LUN is guaranteed to be consistent.
209. After the snapshot of the primary LUN succeeds, the storage device of the production site returns a start-remote-replication response to the disaster recovery management platform.
210. After receiving the start-remote-replication response, the disaster recovery management platform sends the virtualization platform an unquiesce request for each of the multiple virtual machines.
As in step 204, each unquiesce request contains the identifier of one virtual machine; for details, see step 204.
211. The virtualization platform unquiesces each virtual machine according to the unquiesce requests, that is, it lifts the write IO suspension on the system in which each virtual machine runs so that normal services continue.
Unquiescing is the reverse of quiescing and proceeds similarly; see step 205 for details.
212. The virtualization platform returns the unquiesce response for each virtual machine to the disaster recovery management platform.
215. The storage device of the production site synchronizes the snapshot of the primary LUN to the secondary LUN of the storage device of the disaster recovery site and, after data synchronization is complete, deletes the snapshot of the primary LUN.
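The per-virtual-machine variant of steps 201 to 212 differs from the flow of FIG. 3 mainly in who waits for all responses: here the disaster recovery management platform tracks the outstanding virtual machines itself. A minimal Python sketch follows; the names `PerVmPlatform`, `per_vm_cycle`, and the `start_remote_replication` callback are assumptions made for illustration.

```python
class PerVmPlatform:
    """Stand-in virtualization platform that handles one VM per request."""

    def __init__(self, vms_by_lun):
        self.vms_by_lun = vms_by_lun
        self.quiesced = set()

    def list_vms(self, lun_id):      # steps 202-203
        return list(self.vms_by_lun[lun_id])

    def quiesce(self, vm_id):        # steps 205-206
        self.quiesced.add(vm_id)
        return vm_id                 # the response names the VM it refers to

    def unquiesce(self, vm_id):      # steps 211-212
        self.quiesced.discard(vm_id)
        return vm_id


def per_vm_cycle(platform, lun_id, start_remote_replication):
    """Steps 201-212 as driven by the disaster recovery management platform."""
    vm_ids = platform.list_vms(lun_id)              # 201-203: query by LUN id
    pending = set(vm_ids)
    for vm_id in vm_ids:                            # 204: one request per VM
        pending.discard(platform.quiesce(vm_id))    # 206: one response per VM
    if pending:                                     # 207: every VM must have answered
        raise RuntimeError(f"no quiesce response from {sorted(pending)}")
    quiesced_at_snapshot = set(platform.quiesced)
    start_remote_replication(lun_id)                # 207-209: snapshot is taken here
    for vm_id in vm_ids:                            # 210-212
        platform.unquiesce(vm_id)
    return quiesced_at_snapshot


snapshots = []
platform = PerVmPlatform({"LUN11": ["vm-a", "vm-b"]})
quiesced_at_snapshot = per_vm_cycle(platform, "LUN11", snapshots.append)
```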
With the method provided by this embodiment of the present invention, a quiesce interface is added so that the disaster recovery management platform can send the virtualization platform, through that interface, a quiesce request for each of the multiple virtual machines. The virtualization platform quiesces each virtual machine based on the request and returns the quiesce response for each virtual machine to the disaster recovery management platform, which can then send a start-remote-replication request to the storage device of the production site to perform disaster recovery for the primary LUN. No client agent software needs to be installed in the virtual machines, so agentless disaster recovery is achieved and the availability of the disaster recovery solution is improved.
Moreover, in the prior-art solution that implements disaster recovery through client agent software, the storage device sends a quiesce request to the client agent software in the virtual machine, and after the cache flush completes, the client agent software returns a flush-complete message to the storage device. This requires the virtual machine to communicate directly with the storage device, which poses a security risk. With the method provided by this embodiment, the virtual machine does not interact directly with the storage device; instead, the comparatively secure disaster recovery management platform communicates with the storage device, so the security of the data on the storage device is improved.
Apparatus of the embodiments of the present invention
An embodiment of the present invention further provides a disaster recovery management apparatus; see FIG. 5. The apparatus can implement the functions of the disaster recovery management platform in the foregoing embodiments and is applied to a disaster recovery system similar to that of FIG. 1. The system includes a production center and a disaster recovery center. The production center includes the disaster recovery management apparatus 50, a virtualization platform, and a storage device, and the storage device of the production center includes a primary LUN. The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN. The disaster recovery management apparatus includes an input unit 501 and an output unit 502:
the output unit 502 is configured to send the virtualization platform a request to quiesce one or more virtual machines of the primary LUN;
the input unit 501 is configured to receive the virtualization platform's response to quiescing the one or more virtual machines;
the output unit 502 is further configured to send a start-remote-replication request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and replicates the snapshot of the primary LUN to the secondary LUN of the storage device of the disaster recovery site.
When the request contains the identifiers of the one or more virtual machines, the output unit 502 is further configured to send the virtualization platform a query request containing the identifier of the primary LUN before sending the quiesce request for the one or more virtual machines of the primary LUN; the input unit 501 is further configured to obtain the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the list containing the identifiers of the one or more virtual machines of the primary LUN.
The input unit 501 is further configured to receive the start-remote-replication response sent by the storage device of the production site; the output unit 502 is further configured to send the virtualization platform a request to unquiesce the one or more virtual machines of the primary LUN; and the input unit 501 is further configured to receive the unquiesce response for the one or more virtual machines returned by the virtualization platform.
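One way to picture the division of work between the input unit 501 and the output unit 502 is as an inbox and an outbox around a small message handler. The sketch below is illustrative only; the message names, the queue-based units, and the class `DisasterRecoveryManagementApparatus` are assumptions rather than structures defined by the embodiment.

```python
from collections import deque


class DisasterRecoveryManagementApparatus:
    """Sketch of apparatus 50: output unit 502 sends, input unit 501 receives."""

    def __init__(self):
        self.outbox = deque()  # messages emitted by output unit 502
        self.inbox = deque()   # messages delivered to input unit 501

    def send(self, destination, message, lun_id):
        self.outbox.append((destination, message, lun_id))

    def start(self, lun_id):
        self.send("virtualization_platform", "quiesce_request", lun_id)

    def handle_next(self):
        """Consume one incoming message and emit the follow-up request, if any."""
        message, lun_id = self.inbox.popleft()
        if message == "quiesce_response":
            self.send("production_storage", "start_remote_replication_request", lun_id)
        elif message == "start_remote_replication_response":
            self.send("virtualization_platform", "unquiesce_request", lun_id)
        return message


apparatus = DisasterRecoveryManagementApparatus()
apparatus.start("LUN11")
for incoming in ("quiesce_response",
                 "start_remote_replication_response",
                 "unquiesce_response"):
    apparatus.inbox.append((incoming, "LUN11"))
    apparatus.handle_next()
sent = [(destination, message) for destination, message, _ in apparatus.outbox]
```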
本发明实施例还提供一种容灾装置,请参考图6,所述容灾装置可以实现前述实施例中虚拟化平台的功能,所述容灾装置应用类似于图1所示的容灾系统,所述系统包括生产中心和容灾中心,所述生产中心包括容灾管理平台、容灾装置60和存储设备,所述生产中心的存储设备包括主逻辑单元号LUN。所述容灾装置60包括输入单元601,输出单元602,处理单元603:The embodiment of the present invention further provides a disaster tolerance device. Referring to FIG. 6, the disaster recovery device can implement the function of the virtualization platform in the foregoing embodiment, and the disaster recovery device is similar to the disaster tolerance system shown in FIG. The system includes a production center and a disaster recovery center, and the production center includes a disaster recovery management platform, a disaster recovery device 60, and a storage device, and the storage device of the production center includes a primary logical unit number LUN. The disaster recovery device 60 includes an input unit 601, an output unit 602, and a processing unit 603:
所述输入单元601,用于从所述容灾管理平台接收对所述主LUN的一个或多个虚拟机进行静默处理的请求,The input unit 601 is configured to receive, from the disaster recovery management platform, a request for performing silent processing on one or more virtual machines of the primary LUN, where
所述处理单元603,用于根据所述静默处理的请求,对所述一个或多个虚拟机进行静默处理;The processing unit 603 is configured to perform silent processing on the one or more virtual machines according to the request of the silent processing;
所述输出单元602,用于向所述容灾管理平台返回对所述一个或多个虚拟机进行静默处理的响应,以使所述容灾管理平台向所述生产站点的存储设备发送启动远程复制请求。The output unit 602 is configured to return a response to the disaster recovery management platform to perform silent processing on the one or more virtual machines, so that the disaster recovery management platform sends a startup remote to the storage device of the production site. Copy the request.
When the silence processing request contains the identifier of the primary LUN, the processing unit 603 is further configured to, before performing silence processing on the one or more virtual machines, obtain the identifiers of the one or more virtual machines of the primary LUN according to the identifier of the primary LUN.
The input unit 601 is further configured to receive a query request sent by the disaster recovery management platform, the query request including the identifier of the primary LUN. The processing unit 603 is further configured to obtain a virtual machine list of the primary LUN based on the identifier of the primary LUN. The output unit 602 is further configured to return the virtual machine list of the primary LUN to the disaster recovery management platform, the virtual machine list including the identifiers of the one or more virtual machines of the primary LUN.
That the processing unit 603 is configured to perform silence processing on the one or more virtual machines, and that the output unit 602 is configured to return, to the disaster recovery management platform, the response to the silence processing of the one or more virtual machines, specifically includes:
The processing unit 603 is configured to notify the one or more virtual machines to perform silence processing and to receive the silence processing responses returned by the one or more virtual machines, and the output unit 602 is configured to return, to the disaster recovery management platform, the response to the silence processing of the one or more virtual machines. When there are multiple virtual machines, the output unit 602 returns the response to the silence processing of the multiple virtual machines to the disaster recovery management platform only after the processing unit 603 determines that a response has been received from each of the multiple virtual machines.
The input unit 601 is further configured to receive a request, sent by the disaster recovery management platform, to cancel silence processing on the one or more virtual machines of the primary LUN. The processing unit 603 is further configured to cancel silence processing on the one or more virtual machines according to that request. The output unit 602 is further configured to return, to the disaster recovery management platform, a response to the cancellation of silence processing on the one or more virtual machines.
The apparatus provided in this embodiment of the present invention adds a silence interface. Through the silence interface, the output unit 502 of the disaster recovery management apparatus 50 can send, to the input unit 601 of the disaster recovery apparatus 60, a request to perform silence processing on one or more virtual machines of the primary LUN. The processing unit 603 of the disaster recovery apparatus 60 performs silence processing on the one or more virtual machines based on the request, and the output unit 602 of the disaster recovery apparatus 60 returns a response to the silence processing of the one or more virtual machines to the input unit 501 of the disaster recovery management apparatus 50. The output unit 502 of the disaster recovery management apparatus 50 can then send a remote replication start request to the storage device of the production site to perform disaster recovery of the primary LUN. No client agent software needs to be installed in the virtual machines, so agentless disaster recovery is achieved and the availability of the disaster recovery solution is improved.
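The behaviour of the disaster recovery apparatus 60 described above can be sketched in Python. This is only an illustrative sketch, not the implementation of the embodiment: the class names `VirtualMachine` and `VirtualizationPlatform`, the method names, and the dictionary-shaped responses are all assumptions introduced here, and silence processing is reduced to setting a flag. It shows the three paths described: resolving virtual machines from the primary LUN identifier, responding to a silence processing request only after every virtual machine has acknowledged, and cancelling silence processing.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class VirtualMachine:
    """Hypothetical stand-in for a virtual machine stored on the primary LUN."""
    vm_id: str
    quiesced: bool = False

    def quiesce(self) -> bool:
        # A real platform would flush pending writes here; the sketch only sets a flag.
        self.quiesced = True
        return True

    def unquiesce(self) -> bool:
        self.quiesced = False
        return True


@dataclass
class VirtualizationPlatform:
    """Hypothetical model of apparatus 60 (input, processing and output units combined)."""
    vms_by_lun: Dict[str, List[VirtualMachine]] = field(default_factory=dict)

    def list_vms(self, lun_id: str) -> List[str]:
        # Query path: return the virtual machine list of the primary LUN.
        return [vm.vm_id for vm in self.vms_by_lun.get(lun_id, [])]

    def handle_quiesce(self, lun_id: str) -> dict:
        # The request carries the primary LUN identifier, so resolve its VMs first.
        vms = self.vms_by_lun.get(lun_id, [])
        acks = {vm.vm_id: vm.quiesce() for vm in vms}
        # Report success only when every virtual machine has acknowledged.
        return {"lun_id": lun_id, "acks": acks,
                "quiesced": bool(vms) and all(acks.values())}

    def handle_unquiesce(self, lun_id: str) -> dict:
        vms = self.vms_by_lun.get(lun_id, [])
        acks = {vm.vm_id: vm.unquiesce() for vm in vms}
        return {"lun_id": lun_id, "acks": acks,
                "unquiesced": bool(vms) and all(acks.values())}
```

In this sketch an unknown LUN identifier yields a negative response rather than an error; that is a design choice made for the example, since the text does not specify error handling.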
An embodiment of the present invention further provides a disaster recovery management apparatus. Referring to FIG. 7, the disaster recovery management apparatus can implement the functions of the disaster recovery management platform in the foregoing embodiments. The disaster recovery management apparatus is applied to a disaster recovery system similar to that of FIG. 1. The system includes a production center and a disaster recovery center. The production center includes a disaster recovery management apparatus 70, a virtualization platform, and a storage device, and the storage device of the production center includes a primary logical unit number (LUN). The disaster recovery center includes a storage device, and the storage device of the disaster recovery center includes a secondary LUN. The disaster recovery management apparatus includes an input unit 701, an output unit 702, and a processing unit 703:
The processing unit 703 is configured to obtain the identifiers of multiple virtual machines of the primary LUN, and the output unit 702 is configured to send, to the virtualization platform, a request to perform silence processing on each of the multiple virtual machines;
The input unit 701 is configured to receive the virtualization platform's response to the silence processing of each virtual machine, and the output unit 702 is further configured to send a remote replication start request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and replicates the primary LUN snapshot to the secondary LUN of the storage device of the disaster recovery site.
That the processing unit 703 is configured to obtain the identifiers of the multiple virtual machines of the primary LUN specifically includes: the processing unit 703 is configured to send a query request to the virtualization platform, the query request including the identifier of the primary LUN, and to obtain the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the virtual machine list including the identifiers of the multiple virtual machines of the primary LUN.
That the input unit 701 is configured to receive the response to the silence processing of each virtual machine, and that the output unit 702 is configured to send the remote replication start request to the storage device of the production site, specifically includes:
The input unit 701 is configured to receive the response to the silence processing of each virtual machine; when the processing unit 703 determines that a response has been received from each of the multiple virtual machines, the output unit 702 is configured to send the remote replication start request to the storage device of the production site.
The input unit 701 is further configured to receive a remote replication start response returned by the storage device of the production site. The output unit 702 is further configured to send, to the virtualization platform, a request to cancel silence processing on each of the multiple virtual machines. The input unit 701 is further configured to receive the response, returned by the virtualization platform, to the cancellation of silence processing on each virtual machine.
The apparatus provided in this embodiment of the present invention adds a silence interface. Through the silence interface, the output unit 702 of the disaster recovery management apparatus 70 can send, to the virtualization platform, a request to perform silence processing on each of the multiple virtual machines. The virtualization platform performs silence processing on each virtual machine based on the request and returns a response to the silence processing of each virtual machine to the input unit 701 of the disaster recovery management apparatus 70. The output unit 702 of the disaster recovery management apparatus 70 can then send a remote replication start request to the storage device of the production site to perform disaster recovery of the primary LUN. No client agent software needs to be installed in the virtual machines, so agentless disaster recovery is achieved and the availability of the disaster recovery solution is improved.
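The sequence carried out by the disaster recovery management apparatus 70 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the embodiment: `StubPlatform`, `StubStorage`, `protect_lun`, and every method name are hypothetical, and the stubs only record events instead of talking to a real virtualization platform or storage array. It shows the ordering described above: query the virtual machine list by primary LUN identifier, request silence processing for each virtual machine, start remote replication only after every virtual machine has responded, then cancel silence processing.

```python
from typing import Dict, List


class StubPlatform:
    """Hypothetical virtualization platform that records what it is asked to do."""

    def __init__(self, vms_by_lun: Dict[str, List[str]], events: List[str]):
        self.vms_by_lun = vms_by_lun
        self.events = events

    def query_vms(self, lun_id: str) -> List[str]:
        return list(self.vms_by_lun.get(lun_id, []))

    def quiesce(self, vm_id: str) -> bool:
        self.events.append(f"quiesce:{vm_id}")
        return True

    def unquiesce(self, vm_id: str) -> bool:
        self.events.append(f"unquiesce:{vm_id}")
        return True


class StubStorage:
    """Hypothetical production-site storage device."""

    def __init__(self, events: List[str]):
        self.events = events

    def start_remote_replication(self, lun_id: str) -> bool:
        # A real array would snapshot the primary LUN and copy it to the secondary LUN.
        self.events.append(f"replicate:{lun_id}")
        return True


def protect_lun(platform: StubPlatform, storage: StubStorage, lun_id: str) -> bool:
    """Silence every VM on the LUN, start remote replication, then cancel silence."""
    vm_ids = platform.query_vms(lun_id)
    quiesced: List[str] = []
    try:
        for vm_id in vm_ids:
            if not platform.quiesce(vm_id):
                return False  # replication is not started unless every VM responded
            quiesced.append(vm_id)
        return storage.start_remote_replication(lun_id)
    finally:
        # Assumption of this sketch: silence is also cancelled if a step fails.
        for vm_id in quiesced:
            platform.unquiesce(vm_id)
```

Cancelling silence processing in a `finally` block is a choice made for this example so that virtual machines are not left silenced after a failure; the text itself only describes cancellation after the remote replication start response is received.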
An embodiment of the present invention further provides a server. Referring to FIG. 8, the server 80 includes a processor 801, a memory 802, and a communication port 803.
The processor 801 is configured to execute a program. In this embodiment, the program may include program code, and the program code includes computer operation instructions. The processor may be a central processing unit (CPU), or one or more integrated circuits configured to implement the embodiments of the present invention. The program executed by the processor is the program corresponding to the steps performed by the disaster recovery management platform or the virtualization platform in the foregoing embodiments;
The memory 802 is configured to store the program executed by the processor;
The communication port 803 is configured to communicate with external devices.
For the specific functions, refer to the description of the disaster recovery management platform or the virtualization platform in the foregoing embodiments.
Those skilled in the art should understand that the embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage and optical storage) that contain computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or another programmable data processing device to operate in a particular manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or another programmable data processing device, so that a series of operational steps are performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Obviously, those skilled in the art can make various changes and variations to the present invention without departing from its spirit and scope. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their technical equivalents, the present invention is also intended to cover them.

Claims (24)

  1. A disaster recovery system, the system comprising a production center and a disaster recovery center, characterized in that the production center comprises a disaster recovery management platform, a virtualization platform, and a storage device, the storage device of the production center comprising a primary logical unit number (LUN); the disaster recovery center comprises a storage device, the storage device of the disaster recovery center comprising a secondary LUN:
    the disaster recovery management platform is configured to send, to the virtualization platform, a request to perform silence processing on one or more virtual machines of the primary LUN;
    the virtualization platform is configured to perform silence processing on the one or more virtual machines according to the silence processing request, and to return, to the disaster recovery management platform, a response to the silence processing of the one or more virtual machines;
    the disaster recovery management platform is configured to receive the response to the silence processing of the one or more virtual machines, and to send a remote replication start request to the storage device of the production site;
    the storage device of the production site is configured to take a snapshot of the primary LUN and to replicate the primary LUN snapshot to the secondary LUN of the storage device of the disaster recovery site.
  2. The system according to claim 1, characterized in that:
    when the silence processing request contains the identifier of the primary LUN, the virtualization platform is further configured to, before performing silence processing on the one or more virtual machines, obtain the identifiers of the one or more virtual machines of the primary LUN according to the identifier of the primary LUN.
  3. The system according to claim 1, characterized in that:
    when the request contains the identifiers of the one or more virtual machines, the disaster recovery management platform is further configured to, before sending to the virtualization platform the request to perform silence processing on the one or more virtual machines of the primary LUN, send a query request to the virtualization platform, the query request comprising the identifier of the primary LUN;
    the disaster recovery management platform is further configured to obtain the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the virtual machine list comprising the identifiers of the one or more virtual machines of the primary LUN.
  4. The system according to any one of claims 1 to 3, characterized in that, when the primary LUN has multiple virtual machines, that the virtualization platform is configured to perform silence processing on the multiple virtual machines and to return, to the disaster recovery management platform, a response to the silence processing of the multiple virtual machines specifically comprises:
    the virtualization platform is configured to notify each of the multiple virtual machines to perform silence processing, and to receive the silence processing response returned by each virtual machine;
    when the virtualization platform determines that a response has been received from each of the multiple virtual machines, the virtualization platform is configured to return, to the disaster recovery management platform, the response to the silence processing of the multiple virtual machines.
  5. The system according to any one of claims 1 to 4, characterized in that:
    the storage device of the production site is further configured to return a remote replication start response to the disaster recovery management platform;
    the disaster recovery management platform is further configured to receive the remote replication start response, and to send, to the virtualization platform, a request to cancel silence processing on the one or more virtual machines of the primary LUN;
    the virtualization platform is further configured to cancel silence processing on the one or more virtual machines according to the request to cancel silence processing, and to return, to the disaster recovery management platform, a response to the cancellation of silence processing on the one or more virtual machines.
  6. A disaster recovery system, the system comprising a production center and a disaster recovery center, characterized in that the production center comprises a disaster recovery management platform, a virtualization platform, and a storage device, the storage device of the production center comprising a primary logical unit number (LUN); the disaster recovery center comprises a storage device, the storage device of the disaster recovery center comprising a secondary LUN:
    the disaster recovery management platform is configured to obtain the identifiers of multiple virtual machines of the primary LUN, and to send, to the virtualization platform, a request to perform silence processing on each of the multiple virtual machines;
    the virtualization platform is configured to perform silence processing on each virtual machine according to the silence processing request, and to return, to the disaster recovery management platform, a response to the silence processing of each virtual machine;
    the disaster recovery management platform is configured to receive the response to the silence processing of each virtual machine, and to send a remote replication start request to the storage device of the production site;
    the storage device of the production site is configured to take a snapshot of the primary LUN and to replicate the primary LUN snapshot to the secondary LUN of the storage device of the disaster recovery site.
  7. The system according to claim 6, characterized in that, that the disaster recovery management platform is configured to obtain the identifiers of the multiple virtual machines of the primary LUN specifically comprises:
    the disaster recovery management platform is configured to send a query request to the virtualization platform, the query request comprising the identifier of the primary LUN;
    the disaster recovery management platform is further configured to obtain the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the virtual machine list comprising the identifiers of the multiple virtual machines of the primary LUN.
  8. The system according to claim 6 or 7, characterized in that, that the disaster recovery management platform is configured to receive the response to the silence processing of each virtual machine and to send the remote replication start request to the storage device of the production site specifically comprises:
    the disaster recovery management platform is configured to receive the response to the silence processing of each virtual machine and, upon determining that a response has been received from each of the multiple virtual machines, to send the remote replication start request to the storage device of the production site.
  9. The system according to any one of claims 6 to 8, characterized in that:
    the storage device of the production site is further configured to return a remote replication start response to the disaster recovery management platform;
    the disaster recovery management platform is further configured to receive the remote replication start response, and to send, to the virtualization platform, a request to cancel silence processing on each of the multiple virtual machines;
    the virtualization platform is further configured to cancel silence processing on each virtual machine according to the request to cancel silence processing, and to return, to the disaster recovery management platform, a response to the cancellation of silence processing on each virtual machine.
  10. A disaster recovery method applied to a disaster recovery system, the system comprising a production center and a disaster recovery center, characterized in that the production center comprises a disaster recovery management platform, a virtualization platform, and a storage device, the storage device of the production center comprising a primary logical unit number (LUN); the disaster recovery center comprises a storage device, the storage device of the disaster recovery center comprising a secondary LUN:
    the disaster recovery management platform sends, to the virtualization platform, a request to perform silence processing on one or more virtual machines of the primary LUN;
    the disaster recovery management platform receives the virtualization platform's response to the silence processing of the one or more virtual machines, and sends a remote replication start request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and replicates the primary LUN snapshot to the secondary LUN of the storage device of the disaster recovery site.
  11. The method according to claim 10, characterized in that:
    when the request contains the identifiers of the one or more virtual machines, the disaster recovery management platform, before sending to the virtualization platform the request to perform silence processing on the one or more virtual machines of the primary LUN, further sends a query request to the virtualization platform, the query request comprising the identifier of the primary LUN;
    the disaster recovery management platform obtains the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the virtual machine list comprising the identifiers of the one or more virtual machines of the primary LUN.
  12. The method according to claim 10 or 11, characterized in that the method further comprises:
    the disaster recovery management platform receives a remote replication start response sent by the storage device of the production site, and sends, to the virtualization platform, a request to cancel silence processing on the one or more virtual machines of the primary LUN;
    the disaster recovery management platform further receives the response, returned by the virtualization platform, to the cancellation of silence processing on the one or more virtual machines.
  13. A disaster recovery method applied to a disaster recovery system, the system comprising a production center and a disaster recovery center, characterized in that the production center comprises a disaster recovery management platform, a virtualization platform, and a storage device, the storage device of the production center comprising a primary logical unit number (LUN):
    the virtualization platform receives, from the disaster recovery management platform, a request to perform silence processing on one or more virtual machines of the primary LUN,
    the virtualization platform performs silence processing on the one or more virtual machines according to the silence processing request, and returns, to the disaster recovery management platform, a response to the silence processing of the one or more virtual machines, so that the disaster recovery management platform sends a remote replication start request to the storage device of the production site.
  14. The method according to claim 13, characterized in that:
    when the silence processing request contains the identifier of the primary LUN, the virtualization platform, before performing silence processing on the one or more virtual machines, further obtains the identifiers of the one or more virtual machines of the primary LUN according to the identifier of the primary LUN.
  15. The method according to claim 13, characterized in that:
    the virtualization platform further receives a query request sent by the disaster recovery management platform, the query request comprising the identifier of the primary LUN;
    the virtualization platform obtains the virtual machine list of the primary LUN based on the identifier of the primary LUN, and returns the virtual machine list of the primary LUN to the disaster recovery management platform, the virtual machine list comprising the identifiers of the one or more virtual machines of the primary LUN.
  16. The method according to any one of claims 13 to 15, characterized in that, when the primary LUN has multiple virtual machines, that the virtualization platform performs silence processing on the multiple virtual machines and returns, to the disaster recovery management platform, a response to the silence processing of the multiple virtual machines specifically comprises:
    the virtualization platform notifies each of the multiple virtual machines to perform silence processing, and receives the silence processing response returned by each virtual machine;
    when the virtualization platform determines that a response has been received from each of the multiple virtual machines, the virtualization platform returns, to the disaster recovery management platform, the response to the silence processing of the multiple virtual machines.
  17. The method according to any one of claims 13 to 16, characterized in that the method comprises:
    the virtualization platform receives a request, sent by the disaster recovery management platform, to cancel silence processing on the one or more virtual machines of the primary LUN, cancels silence processing on the one or more virtual machines according to the request, and returns, to the disaster recovery management platform, a response to the cancellation of silence processing on the one or more virtual machines.
  18. A disaster recovery method applied to a disaster recovery system, the system comprising a production center and a disaster recovery center, characterized in that the production center comprises a disaster recovery management platform, a virtualization platform, and a storage device, the storage device of the production center comprising a primary logical unit number (LUN); the disaster recovery center comprises a storage device, the storage device of the disaster recovery center comprising a secondary LUN:
    the disaster recovery management platform obtains the identifiers of multiple virtual machines of the primary LUN, and sends, to the virtualization platform, a request to perform silence processing on each of the multiple virtual machines;
    the disaster recovery management platform receives the virtualization platform's response to the silence processing of each virtual machine, and sends a remote replication start request to the storage device of the production site, so that the storage device of the production site takes a snapshot of the primary LUN and replicates the primary LUN snapshot to the secondary LUN of the storage device of the disaster recovery site.
  19. The method according to claim 18, characterized in that, that the disaster recovery management platform obtains the identifiers of the multiple virtual machines of the primary LUN specifically comprises:
    the disaster recovery management platform sends a query request to the virtualization platform, the query request comprising the identifier of the primary LUN;
    the disaster recovery management platform obtains the virtual machine list of the primary LUN that the virtualization platform obtained based on the identifier of the primary LUN, the virtual machine list comprising the identifiers of the multiple virtual machines of the primary LUN.
  20. The method according to claim 18 or 19, wherein the disaster recovery management platform receiving the response to performing silent processing on each virtual machine and sending the start remote replication request to the storage device of the production site specifically comprises:
    The disaster recovery management platform receives the responses to performing silent processing on each virtual machine and, upon determining that the response returned for every one of the plurality of virtual machines has been received, sends the start remote replication request to the storage device of the production site.
  21. The method according to any one of claims 18 to 20, wherein the method further comprises:
    The disaster recovery management platform receives a start remote replication response returned by the storage device of the production site, and sends to the virtualization platform a request to cancel silent processing on each of the plurality of virtual machines;
    The disaster recovery management platform further receives a response, returned by the virtualization platform, to canceling silent processing on each virtual machine.
  22. A server applied to a disaster recovery system, wherein a disaster recovery management platform runs on the server, and the server comprises a processor, a memory, and a communication port;
    The communication port is configured to communicate with a virtualization platform in the disaster recovery system;
    The memory is configured to store a program;
    The processor is configured to execute the program to implement the method according to any one of claims 10 to 12.
  23. A server applied to a disaster recovery system, wherein a virtualization platform runs on the server, and the server comprises a processor, a memory, and a communication port;
    The communication port is configured to communicate with a disaster recovery management platform in the disaster recovery system;
    The memory is configured to store a program;
    The processor is configured to execute the program to implement the method according to any one of claims 13 to 17.
  24. A server applied to a disaster recovery system, wherein a disaster recovery management platform runs on the server, and the server comprises a processor, a memory, and a communication port;
    The communication port is configured to communicate with a virtualization platform in the disaster recovery system;
    The memory is configured to store a program;
    The processor is configured to execute the program to implement the method according to any one of claims 18 to 21.
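
The message sequence recited in claims 18 to 21 can be sketched roughly as follows. This is only an illustrative sketch, not the applicant's implementation: the class names `FakeVirtualizationPlatform` and `FakeStorageDevice`, the function `run_replication_cycle`, and all method names are hypothetical, the platforms are modeled as in-memory objects rather than networked components, and canceling silent processing on failure is an added assumption that the claims do not recite.

```python
from dataclasses import dataclass, field


@dataclass
class FakeVirtualizationPlatform:
    """Hypothetical in-memory stand-in for the virtualization platform."""
    vms_by_lun: dict  # primary LUN identifier -> list of VM identifiers
    silenced: set = field(default_factory=set)

    def list_vms(self, lun_id):
        # Claim 19: return the VM list looked up from the primary LUN identifier.
        return list(self.vms_by_lun.get(lun_id, []))

    def silence(self, vm_id):
        self.silenced.add(vm_id)
        return True  # response to the silent processing request

    def cancel_silence(self, vm_id):
        self.silenced.discard(vm_id)
        return True  # response to the cancel request


@dataclass
class FakeStorageDevice:
    """Hypothetical stand-in for the production-site storage device."""
    platform: FakeVirtualizationPlatform
    snapshots: list = field(default_factory=list)

    def start_remote_replication(self, lun_id):
        # Record which VMs were silent at the moment the snapshot is taken.
        self.snapshots.append((lun_id, frozenset(self.platform.silenced)))
        return True  # start remote replication response


def run_replication_cycle(lun_id, platform, storage):
    """One cycle as seen from the disaster recovery management platform."""
    vm_ids = platform.list_vms(lun_id)
    silenced = []
    try:
        # Claim 18: request silent processing for each VM of the primary LUN.
        for vm_id in vm_ids:
            if not platform.silence(vm_id):
                raise RuntimeError(f"silent processing failed for {vm_id}")
            silenced.append(vm_id)
        # Claim 20: reached only after every VM has returned a response.
        if not storage.start_remote_replication(lun_id):
            raise RuntimeError("storage device rejected remote replication")
    finally:
        # Claim 21: cancel silent processing after the storage device responds.
        # Doing so on the failure path as well is an assumption of this sketch.
        for vm_id in silenced:
            platform.cancel_silence(vm_id)
    return vm_ids


platform = FakeVirtualizationPlatform({"lun-1": ["vm-a", "vm-b"]})
storage = FakeStorageDevice(platform)
run_replication_cycle("lun-1", platform, storage)
```

In this sketch the snapshot request is issued only after all silent processing responses are in, so the recorded snapshot reflects every virtual machine of the primary LUN in a silent state; the virtual machines are released as soon as the storage device acknowledges the request rather than after replication to the secondary LUN completes.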
PCT/CN2014/096068 2014-12-31 2014-12-31 Disaster recovery method, system and apparatus WO2016106756A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/096068 WO2016106756A1 (en) 2014-12-31 2014-12-31 Disaster recovery method, system and apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201480084424.9A CN107111530B (en) 2014-12-31 2014-12-31 A kind of disaster recovery method, system and device
PCT/CN2014/096068 WO2016106756A1 (en) 2014-12-31 2014-12-31 Disaster recovery method, system and apparatus

Publications (1)

Publication Number Publication Date
WO2016106756A1 (en) 2016-07-07

Family

ID=56284025

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/096068 WO2016106756A1 (en) 2014-12-31 2014-12-31 Disaster recovery method, system and apparatus

Country Status (2)

Country Link
CN (1) CN107111530B (en)
WO (1) WO2016106756A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109144803A (en) * 2018-10-24 2019-01-04 郑州云海信息技术有限公司 A kind of consistency characteristic test method, device, equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8566542B1 (en) * 2011-01-06 2013-10-22 Hewlett-Packard Development Company, L.P. Backup using storage array LUN level snapshot
US20140149696A1 (en) * 2012-11-28 2014-05-29 Red Hat Israel, Ltd. Virtual machine backup using snapshots and current configuration
CN103946807A (en) * 2013-11-20 2014-07-23 华为技术有限公司 Method, system and device for generating snapshot
CN104239166A (en) * 2014-09-11 2014-12-24 武汉噢易云计算有限公司 Method for realizing file backup for running virtual machine

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102761566B (en) * 2011-04-26 2015-09-23 国际商业机器公司 The method and apparatus of migration virtual machine
CN102306115B (en) * 2011-05-20 2014-01-08 华为数字技术(成都)有限公司 Asynchronous remote copying method, system and equipment
WO2013011541A1 (en) * 2011-07-20 2013-01-24 Hitachi, Ltd. Data storage apparatus and control method therefor
US8862883B2 (en) * 2012-05-16 2014-10-14 Cisco Technology, Inc. System and method for secure cloud service delivery with prioritized services in a network environment

Also Published As

Publication number Publication date
CN107111530A (en) 2017-08-29
CN107111530B (en) 2019-09-20

Similar Documents

Publication Publication Date Title
US11016857B2 (en) Microcheckpointing with service processor
US10346215B2 (en) Replication of a virtualized computing environment to a computing system with offline hosts
US9870291B2 (en) Snapshotting shared disk resources for checkpointing a virtual machine cluster
US9575894B1 (en) Application aware cache coherency
US8959323B2 (en) Remote restarting client logical partition on a target virtual input/output server using hibernation data in a cluster aware data processing system
US9753761B1 (en) Distributed dynamic federation between multi-connected virtual platform clusters
US8850146B1 (en) Backup of a virtual machine configured to perform I/O operations bypassing a hypervisor
US8458413B2 (en) Supporting virtual input/output (I/O) server (VIOS) active memory sharing in a cluster environment
US9489274B2 (en) System and method for performing efficient failover and virtual machine (VM) migration in virtual desktop infrastructure (VDI)
US8473692B2 (en) Operating system image management
US10614096B2 (en) Disaster recovery of mobile data center via location-aware cloud caching
US20150135003A1 (en) Replication of a write-back cache using a placeholder virtual machine for resource management
US10417096B2 (en) Multi-virtual machine time consistent snapshots
US8689054B1 (en) Increased distance of virtual machine mobility over asynchronous distances
US20150205542A1 (en) Virtual machine migration in shared storage environment
WO2015074200A1 (en) Method, system and apparatus for generating snapshot
WO2016045428A1 (en) Method for creating virtual machine and apparatus for creating virtual machine
CN107402839B (en) Method and system for backing up data
JP5966466B2 (en) Backup control method and information processing apparatus
WO2016106756A1 (en) Disaster recovery method, system and apparatus
JP6219514B2 (en) Computing device that provides virtual multipath state access, remote computing device for virtual multipath, method for providing virtual multipath state access, method for virtual multipath, computing device, multiple methods for computing device And a machine-readable recording medium
Haga et al. Windows server 2008 R2 hyper-V server virtualization
US10474394B2 (en) Persistent reservation emulation in shared virtual storage environments
US10620856B2 (en) Input/output (I/O) fencing with persistent reservation information in shared virtual storage environments
US10223402B1 (en) Multi-site block level write consistency

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 14909562; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: PCT application non-entry in European phase. Ref document number: 14909562; Country of ref document: EP; Kind code of ref document: A1