CN104683131A - Application stage virtualization high-reliability method and device - Google Patents

Application stage virtualization high-reliability method and device

Info

Publication number
CN104683131A
Authority
CN
China
Prior art keywords
service
status information
state information
processing instruction
state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310627348.9A
Other languages
Chinese (zh)
Inventor
林宗正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou DPTech Technologies Co Ltd
Original Assignee
Hangzhou DPTech Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou DPTech Technologies Co Ltd filed Critical Hangzhou DPTech Technologies Co Ltd
Priority to CN201310627348.9A priority Critical patent/CN104683131A/en
Publication of CN104683131A publication Critical patent/CN104683131A/en
Pending legal-status Critical Current

Abstract

The invention provides an application-level virtualization high-reliability method and device. The method comprises: collecting the service state information of the services that need monitoring in a virtual machine operating system, and selecting the state information of those monitored services whose service state is abnormal; analyzing the state information of the abnormal monitored services and generating a processing instruction for each abnormal service state, the processing instruction including several parameters; and looking up the API (application programming interface) corresponding to the name of the processing instruction, calling that API, passing in the processing instruction's parameters such as the service name, and completing the recovery of the abnormal state. Compared with the prior art, the method and device achieve high reliability for the application services inside a virtual machine: a service that stops or pauses because of a non-physical fault is handled in real time, so that the service can run without interruption.

Description

An application-level virtualization high-reliability method and device
Technical field
The present invention relates to the field of network communications, and in particular to an application-level virtualization high-reliability method and device.
Background
Virtualization technology first appeared in IBM mainframe systems in the 1960s. With the widespread deployment of multi-core systems, clusters, networks and even cloud computing in recent years, the advantages of virtualization in business applications have become increasingly apparent: it not only reduces IT cost but also enhances system security and reliability, and the concept of virtualization has gradually entered people's daily work and life.
More and more enterprises now use server virtualization to deliver services such as e-mail systems, database systems and web application servers. Deploying an IT data center with virtualization technology makes maximal use of server resources and saves IT construction cost.
In a conventional IT environment, a server that crashes because of hardware damage or for other reasons interrupts one important application and affects the other operations related to that application. In a virtualized environment, by contrast, each server supports 5 to 10 important applications. The crash of a server in a conventional environment interrupts one application, which is only a problem. In a virtualized environment, however, the crash of one server may interrupt the customer's database system, e-mail system, file server, e-commerce system, financial application system and so on, which is a disaster.
Therefore, as data centers move toward virtualization, such a failure directly affects an enterprise's production and profit once it occurs, and server virtualization reliability technology is extremely important for an enterprise to stay competitive in the current business environment. The availability of server virtualization reliability technology refers to the time for which a system or component can run continuously. Availability is measured as the percentage of time in a year during which the service is usable; the high-reliability target widely used at present is 99.999%. High reliability has thus become an increasingly important function in the implementation and deployment of data centers.
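To make the 99.999% figure concrete, the arithmetic below converts an availability percentage into the downtime it permits per year. This is an illustrative calculation only, assuming a 365-day year; the function name is hypothetical and is not part of the patent.

```python
# Illustrative arithmetic only: yearly downtime permitted by an availability target.
# Assumes a 365-day (non-leap) year; the function name is hypothetical.
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes


def allowed_downtime_minutes(availability: float) -> float:
    """Return the minutes per year a service may be unavailable."""
    return MINUTES_PER_YEAR * (1.0 - availability)


five_nines = allowed_downtime_minutes(0.99999)
print(f"{five_nines:.2f} minutes per year")
```

Under this assumption, a 99.999% target leaves a little over five minutes of downtime per year, which is why a slow migration or switchover is hard to reconcile with it.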
At present there are many solutions for virtualization high reliability (High Availability, HA), such as dual-machine hot standby, load sharing, and clustering. Dual-machine hot standby (dual-machine fault tolerance) uses two servers for an important service; the servers back each other up and jointly run the same service. When one server fails, the other takes over the service tasks, so the system automatically continues to provide service without manual intervention.
Shortcomings of dual-machine hot standby:
Reliability is relatively poor. When a service fails and the virtual machine is migrated, the real-time replication of data between the two servers is a fragile link. Because replication is carried out at the file and disk level, whether the copy made during virtual machine migration succeeds affects database transaction operations, so incomplete data easily results.
Shortcomings of clustering:
1. Building a cluster environment is complex and involves many aspects, which affects the stability of the system.
2. The cluster detects and confirms the fault. When a service fails, the virtual machine is migrated, and taking over the shared data area during the switchover takes a certain amount of time. The switchover time varies with the size of the application: the larger the application, the longer the switchover.
Summary of the invention
In view of this, the invention provides an application-level virtualization high-reliability method that achieves high reliability for the application services inside a virtual machine in a virtualized environment: when a service stops or pauses because of a non-physical fault, it is handled in real time so that the service runs without interruption.
Specifically, the invention provides an application-level virtualization high-reliability method, the method comprising:
A) obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
B) collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
C) analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
D) looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal state.
Further, the service state information comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
Further, the service state information of a monitored service whose service state is abnormal comprises paused state information and stopped state information.
Further, the processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
Further, the processing instruction parameters comprise the permission to operate the service, the service name, and the permission to start the service.
The invention also provides an application-level virtualization high-reliability device, the device comprising:
Service filter unit: for obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
Service state information collection unit: for collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
Service state information analysis unit: for analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
Service state information processing unit: for looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal state.
Further, the service state information comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
Further, the service state information of a monitored service whose service state is abnormal comprises paused state information and stopped state information.
Further, the processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
Further, the processing instruction parameters comprise the permission to operate the service, the service name, and the permission to start the service.
It can be seen that the application-level virtualization high-reliability method and device provided by the invention analyze the service state information of the application-level services to be monitored and promptly restart or start any service whose service state is abnormal. This achieves high reliability at the level of application services in a virtualized environment and makes up for the inability of traditional approaches to control application services. It avoids the problems of the prior art when a service fails and the virtual machine is migrated: incomplete data under dual-machine hot standby, instability caused by the complexity of building a cluster environment, and long switchover times for large application services. The invention provides the high-reliability guarantee from inside the virtual machine, at the application service level.
Brief description of the drawings
Fig. 1 is a flow chart of an application-level virtualization high-reliability method;
Fig. 2 is a logical structure diagram of an application-level virtualization high-reliability device.
Detailed description
To solve the above problems, an embodiment of the invention provides an application-level virtualization high-reliability method. Referring to Fig. 1, in a preferred embodiment the method is as follows:
A) obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
The service state information of each service comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
For example, the services running in the virtual machine operating system may include IIS (Internet Information Services), an Oracle database service and a MySQL database service. The open API (Application Programming Interface) of the operating system (Windows, Linux, etc.) can then be called. The list of all services is obtained and compared with the list of services to be monitored stored in the local database, and any service whose name matches is judged to be a service to be monitored. For example, if the IIS service matches a service name in the locally stored list, the service name W3SVC of the monitored IIS service and the service state information of that service are obtained.
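The name comparison in step A can be sketched as a simple set-membership filter. This is a minimal sketch, not the patented implementation: the in-memory dictionary stands in for the operating system's service list, the list stands in for the local database, and every service name other than W3SVC is a hypothetical example.

```python
# Minimal sketch of step A: keep only the services whose names appear in the
# locally stored list of services to be monitored.
# Assumptions: the dict stands in for the OS service list and the list stands
# in for the local database; names other than W3SVC are hypothetical.

def filter_monitored(all_services, monitored_names):
    """Return {name: state} for services that are on the monitored list."""
    monitored = set(monitored_names)
    return {name: state for name, state in all_services.items()
            if name in monitored}


all_services = {"W3SVC": "running", "Spooler": "running", "MySQL": "stopped"}
monitored_names = ["W3SVC", "MySQL", "OracleServiceORCL"]

selected = filter_monitored(all_services, monitored_names)
print(selected)
```

A monitored name that is absent from the operating system's list (here the hypothetical OracleServiceORCL) simply does not appear in the result; how such a case should be handled is not specified in the text.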
In a specific implementation, different operating systems provide different APIs. On Windows, for example, OpenSCManager is called to obtain a handle to the service control manager, EnumServicesStatus is called to enumerate information on all services, including service names and service state information, and OpenService() is called to open a service. On Linux, chkconfig --list lists all services, service <service name> status reports the status of the named service, and service <service name> start or service <service name> restart starts or restarts the service.
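For the Linux case, the commands named above can be assembled as argument lists before being handed to a process launcher. The sketch below only builds the command lines and does not execute them; the helper names and the example service name httpd are hypothetical.

```python
# Sketch: build the Linux command lines named in the text as argument lists.
# Nothing is executed here. The helper names and the example service name
# "httpd" are hypothetical and not taken from the patent.

ALLOWED_ACTIONS = ("status", "start", "restart")


def list_services_cmd():
    """Command that lists all services."""
    return ["chkconfig", "--list"]


def service_cmd(service_name, action):
    """Command that queries, starts or restarts one service."""
    if action not in ALLOWED_ACTIONS:
        raise ValueError(f"unsupported action: {action}")
    return ["service", service_name, action]


print(list_services_cmd())
print(service_cmd("httpd", "status"))
print(service_cmd("httpd", "restart"))
```

Passing arguments as a list rather than a single shell string avoids shell quoting problems if a service name contains unusual characters; a list like this could be handed to subprocess.run on a system where these commands are available.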
B) collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
The service state information of a monitored service whose service state is abnormal generally comprises paused state information and stopped state information. Taking the IIS service as an example, if this service is a service to be monitored, its service state information is collected. If the collected state information of the IIS service is stopped state information or paused state information, the IIS service is in an abnormal service state and needs to be handled accordingly.
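Step B amounts to classifying each collected state as normal or abnormal. The sketch below assumes the six states listed in the text, modeled as a Python enum whose member names are illustrative, and treats only stopped and paused as abnormal, as the text describes.

```python
# Sketch of step B: select monitored services whose state is abnormal.
# The enum member names are illustrative labels for the six states in the text;
# only STOPPED and PAUSED are treated as abnormal, as described there.
from enum import Enum


class ServiceState(Enum):
    STOPPED = "stopped"
    START_PENDING = "starting"
    RUNNING = "running"
    CONTINUE_PENDING = "continuing"
    PAUSE_PENDING = "pausing"
    PAUSED = "paused"


ABNORMAL_STATES = {ServiceState.STOPPED, ServiceState.PAUSED}


def abnormal_services(states):
    """Return {name: state} for services in an abnormal state."""
    return {name: state for name, state in states.items()
            if state in ABNORMAL_STATES}


collected = {"W3SVC": ServiceState.PAUSED, "MySQL": ServiceState.RUNNING}
print(abnormal_services(collected))
```

Transitional states such as starting or pausing are left alone in this sketch, since the text names only the stopped and paused states as abnormal.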
C) analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
The collected service state information of the monitored IIS service is analyzed, and once the service state has been determined the corresponding processing instruction is invoked. The processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
If the service state information of the IIS service is stopped (SERVICE_STOPPED) state information, the IIS service needs to be restarted and the restart-class processing instruction (open) is invoked. If the service state information of the IIS service is paused (SERVICE_PAUSED) state information, the service needs to be recovered and the recovery-class processing instruction (delete, open) is invoked.
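The mapping from an abnormal state to a processing instruction can be sketched as a small function. The instruction names (open; delete, open), the state constants and the SERVICE_ALL_ACCESS start permission come from the text; the ProcessingInstruction class, its field names and the placeholder for the operate permission are assumptions made for illustration.

```python
# Sketch of step C: derive a processing instruction from an abnormal state.
# From the text: SERVICE_STOPPED -> restart class (open); SERVICE_PAUSED ->
# recovery class (delete, open); parameters are the operate permission, the
# service name and the start permission.
# Assumptions: the class, field names and the operate-permission placeholder.
from dataclasses import dataclass


@dataclass
class ProcessingInstruction:
    kind: str      # "restart" or "recover"
    names: tuple   # instruction names used later to look up the API
    params: dict   # parameters passed to the API


def make_instruction(service_name, state):
    """Return the instruction for an abnormal state, or None otherwise."""
    params = {
        "operate_permission": "<operate permission>",  # value not given in the text
        "service_name": service_name,
        "start_permission": "SERVICE_ALL_ACCESS",
    }
    if state == "SERVICE_STOPPED":
        return ProcessingInstruction("restart", ("open",), params)
    if state == "SERVICE_PAUSED":
        return ProcessingInstruction("recover", ("delete", "open"), params)
    return None


print(make_instruction("W3SVC", "SERVICE_PAUSED"))
```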
D) looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal state.
When the analyzed service state information of the IIS service is stopped (SERVICE_STOPPED) state information, the restart-class processing instruction (open) is invoked: the API corresponding to the restart-class instruction name (open) is looked up, and the relevant parameters, such as the permission to operate the service, the service name (W3SVC) and the permission to start the service (SERVICE_ALL_ACCESS), are passed in to restart the service. When the analyzed service state information of the IIS service is paused (SERVICE_PAUSED) state information, the recovery-class processing instruction (delete, open) is invoked: the API corresponding to the recovery-class instruction names (delete, open) is looked up, and the relevant parameters, such as the permission to operate the service, the service name (W3SVC) and the permission to start the service, are passed in to start the service. Before a start or restart operation is performed, the close-service operation must be performed first.
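The lookup in step D can be sketched as a dispatch table keyed by instruction name. The handlers below are stand-ins that only record what they were asked to do; a real implementation would call the operating system's service API instead. Reading delete as the close-service step and open as the start-service step is an assumption drawn from the instruction classes described above.

```python
# Sketch of step D: look up the handler for each instruction name and call it
# with the instruction's parameters.
# Assumptions: the handlers are stand-ins that record calls instead of touching
# the OS, and "delete"/"open" are read as close-service/start-service.

calls = []  # records (operation, service name) in the order performed


def close_service(operate_permission, service_name, start_permission):
    calls.append(("close", service_name))


def start_service(operate_permission, service_name, start_permission):
    calls.append(("start", service_name))


API_TABLE = {"delete": close_service, "open": start_service}


def execute(instruction_names, params):
    """Call the API registered for each instruction name, in order."""
    for name in instruction_names:
        handler = API_TABLE[name]  # raises KeyError for an unknown name
        handler(**params)


execute(("delete", "open"), {
    "operate_permission": "<operate permission>",  # placeholder, not from the text
    "service_name": "W3SVC",
    "start_permission": "SERVICE_ALL_ACCESS",
})
print(calls)
```

Keeping the name-to-API mapping in one table means a new instruction name can be supported by registering one more handler, without changing the dispatch loop.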
By monitoring the application services of the virtual machine and judging the service state information of the services to be monitored, this embodiment automatically starts or restarts any service whose service state is abnormal. This ensures that tasks are executed and recovered, and achieves continuity, high reliability and fault tolerance (Fault-tolerant, FT) for the services in the virtual machine.
Referring to Fig. 2, based on the above method the invention provides a corresponding application-level virtualization high-reliability device. The device is applied on a host (for example a server), which serves as the carrier on which this logical device runs. The hardware environment of the host generally includes at least a CPU, memory, a non-volatile storage medium and other necessary hardware. The device comprises:
Service filter unit: for obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
The list of all services is obtained and compared with the list of services to be monitored stored in the local database, and any service whose name matches is judged to be a service to be monitored. For example, if the IIS service matches a service name in the locally stored list, the service name W3SVC of the monitored IIS service and the service state information of that service are obtained.
The service state information of each service comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
Service state information collection unit: for collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
The service state information of a monitored service whose service state is abnormal comprises paused state information and stopped state information. Taking the IIS service as an example, the service state information of the monitored IIS service is collected. If the collected state information of the IIS service is stopped state information or paused state information, the IIS service is in an abnormal service state and needs to be handled accordingly.
Service state information analysis unit: for analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
The collected service state information of the monitored IIS service is analyzed, and once the service state has been determined the corresponding processing instruction is invoked. The processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
Service state information processing unit: for looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal task.
When the analyzed service state information of the IIS service is stopped (SERVICE_STOPPED) state information, the restart-class processing instruction (open) is invoked: the API corresponding to the restart-class instruction name (open) is looked up, and the relevant parameters, such as the permission to operate the service, the service name (W3SVC) and the permission to start the service (SERVICE_ALL_ACCESS), are passed in to restart the service. When the analyzed service state information of the IIS service is paused (SERVICE_PAUSED) state information, the recovery-class processing instruction (delete, open) is invoked: the API corresponding to the recovery-class instruction names (delete, open) is looked up, and the relevant parameters, such as the permission to operate the service, the service name (W3SVC) and the permission to start the service, are passed in to start the service. Before a start or restart operation is performed, the close-service operation must be performed first.
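How the four units of the device fit together can be sketched as one monitoring pass over an in-memory stand-in for the operating system. Everything here is hypothetical scaffolding: the FakeOS class, the function names and the simplified state strings are assumptions, and the recovery step only flips the recorded state rather than calling a real service API.

```python
# Sketch of one monitoring pass through the four units of the device.
# Assumptions: FakeOS is an in-memory stand-in for the OS service API, state
# strings are simplified, and recovery just marks the service as running.

class FakeOS:
    def __init__(self, services):
        self.services = dict(services)

    def list_services(self):
        return dict(self.services)

    def start(self, name):
        self.services[name] = "running"


def service_filter_unit(os_api, monitored):
    return {n: s for n, s in os_api.list_services().items() if n in monitored}


def collection_unit(states):
    return {n: s for n, s in states.items() if s in ("stopped", "paused")}


def analysis_unit(abnormal):
    return [("restart" if s == "stopped" else "recover", n)
            for n, s in sorted(abnormal.items())]


def processing_unit(os_api, instructions):
    for _kind, name in instructions:
        os_api.start(name)


def run_once(os_api, monitored):
    """Run filter, collection, analysis and processing once; return instructions."""
    states = service_filter_unit(os_api, monitored)
    instructions = analysis_unit(collection_unit(states))
    processing_unit(os_api, instructions)
    return instructions


fake = FakeOS({"W3SVC": "paused", "MySQL": "stopped", "Spooler": "stopped"})
issued = run_once(fake, {"W3SVC", "MySQL"})
print(issued, fake.services)
```

In this sketch the unmonitored Spooler service stays stopped, which reflects the role of the service filter unit: only services on the stored list are ever recovered.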
As the above embodiments show, the object of virtualization high reliability shifts from the virtual machine itself to the application services inside the virtual machine, and control of an application service shifts from external control to internal processing instructions and API calls, which makes up for the inability of traditional approaches to control application services. The high-reliability guarantee is provided from inside the virtual machine, at the application service level.
The foregoing describes only preferred embodiments of the invention and is not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principles of the invention shall fall within its scope of protection.

Claims (10)

1. An application-level virtualization high-reliability method, characterized in that the method comprises:
A) obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
B) collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
C) analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
D) looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal state.
2. The method of claim 1, characterized in that the service state information comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
3. The method of claim 1, characterized in that the service state information of a monitored service whose service state is abnormal comprises paused state information and stopped state information.
4. The method of claim 1, characterized in that the processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
5. The method of claim 1, characterized in that the processing instruction parameters comprise the permission to operate the service, the service name, and the permission to start the service.
6. An application-level virtualization high-reliability device, the device comprising:
Service filter unit: for obtaining all services of the virtual machine operating system, comparing the service names of all operating system services with the service names of the services to be monitored that are stored in a local database, selecting the services to be monitored, and calling an operating system API to obtain the service state information of these services;
Service state information collection unit: for collecting the service state information of the services to be monitored in the virtual machine operating system, and selecting the service state information of those monitored services whose service state is abnormal;
Service state information analysis unit: for analyzing the service state information of the monitored services whose service state is abnormal, and generating a processing instruction corresponding to each abnormal service state, the processing instruction comprising several parameters;
Service state information processing unit: for looking up the API corresponding to the name of the processing instruction, calling that API, passing in the parameters of the processing instruction, and completing the recovery of the abnormal state.
7. The device of claim 6, characterized in that the service state information comprises stopped state information, starting state information, running state information, continuing state information, pausing state information, and paused state information.
8. The device of claim 6, characterized in that the service state information of a monitored service whose service state is abnormal comprises paused state information and stopped state information.
9. The device of claim 6, characterized in that the processing instructions comprise recovery-class instructions and restart-class instructions, where a recovery-class instruction comprises a close-service instruction and a start-service instruction, and a restart-class instruction comprises a close-service instruction and a restart-service instruction.
10. The device of claim 6, characterized in that the processing instruction parameters comprise the permission to operate the service, the service name, and the permission to start the service.
CN201310627348.9A 2013-11-27 2013-11-27 Application stage virtualization high-reliability method and device Pending CN104683131A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310627348.9A CN104683131A (en) 2013-11-27 2013-11-27 Application stage virtualization high-reliability method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310627348.9A CN104683131A (en) 2013-11-27 2013-11-27 Application stage virtualization high-reliability method and device

Publications (1)

Publication Number Publication Date
CN104683131A true CN104683131A (en) 2015-06-03

Family

ID=53317764

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310627348.9A Pending CN104683131A (en) 2013-11-27 2013-11-27 Application stage virtualization high-reliability method and device

Country Status (1)

Country Link
CN (1) CN104683131A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106559441A (en) * 2015-09-25 2017-04-05 华为技术有限公司 It is a kind of based on the virtual machine monitoring method of cloud computing service, apparatus and system
CN110750586A (en) * 2019-10-12 2020-02-04 北京浪潮数据技术有限公司 Operation information processing method and system of virtualization management platform
CN117032881A (en) * 2023-07-31 2023-11-10 广东保伦电子股份有限公司 Method, device and storage medium for detecting and recovering abnormality of virtual machine

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101136044A (en) * 2006-08-29 2008-03-05 联想(北京)有限公司 Software watchdog system and method
CN102708018A (en) * 2012-04-20 2012-10-03 华为技术有限公司 Method and system for exception handling, proxy equipment and control device
CN102819465A (en) * 2012-06-29 2012-12-12 华中科技大学 Failure recovery method in virtualization environment
CN103152414A (en) * 2013-03-01 2013-06-12 四川省电力公司信息通信公司 High available system based on cloud calculation and implementation method thereof
CN103152419A (en) * 2013-03-08 2013-06-12 中标软件有限公司 High availability cluster management method for cloud computing platform
CN103365758A (en) * 2013-08-05 2013-10-23 北京搜狐新媒体信息技术有限公司 Process monitoring method and system in virtualization environment

Similar Documents

Publication Publication Date Title
US8983961B2 (en) High availability for cloud servers
US9052935B1 (en) Systems and methods for managing affinity rules in virtual-machine environments
US10169173B2 (en) Preserving management services with distributed metadata through the disaster recovery life cycle
US11321197B2 (en) File service auto-remediation in storage systems
Bala et al. Fault tolerance-challenges, techniques and implementation in cloud computing
US8843717B2 (en) Maintaining consistency of storage in a mirrored virtual environment
US10162708B2 (en) Fault tolerance for complex distributed computing operations
WO2019152122A1 (en) Systems and methods for performing computing cluster node switchover
US20190235979A1 (en) Systems and methods for performing computing cluster node switchover
CN101895540B (en) For the system and method that application service process is guarded
US9317380B2 (en) Preserving management services with self-contained metadata through the disaster recovery life cycle
CN104516789A (en) Method and system for failover detection and treatment in checkpoint systems
CN103795742B (en) Isomery storage and disaster tolerance management system and method
CN104683131A (en) Application stage virtualization high-reliability method and device
CN105068899A (en) Automatic reboot stability test method for Vmware system
CN114035905A (en) Fault migration method and device based on virtual machine, electronic equipment and storage medium
CN114691304B (en) Method, device, equipment and medium for realizing high availability of cluster virtual machine
CN105391790A (en) Database high-availability method similar to RAC One Node
US20230101776A1 (en) Desired state configuration for virtual machines
CN103268271A (en) Disaster tolerance realizing method of all-in-one machine
CN112035295A (en) Virtual machine crash event processing method, system, terminal and storage medium
Cao et al. IT Operation and Maintenance Process improvement and design under virtualization environment
WO2023185355A1 (en) Method and apparatus for achieving high availability of clustered virtual machines, device, and medium
Singh et al. Fault tolerance-challenges, techniques and implementation in cloud computing
AU2015249127B2 (en) Fault tolerance for complex distributed computing operations

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Binjiang District and Hangzhou city in Zhejiang Province Road 310051 No. 68 in the 6 storey building

Applicant after: Hangzhou Dipu Polytron Technologies Inc

Address before: Binjiang District and Hangzhou city in Zhejiang Province Road 310051 No. 68 in the 6 storey building

Applicant before: Hangzhou Dipu Technology Co., Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20150603
