CN104683131A - Application stage virtualization high-reliability method and device - Google Patents
Application stage virtualization high-reliability method and device Download PDFInfo
- Publication number
- CN104683131A CN104683131A CN201310627348.9A CN201310627348A CN104683131A CN 104683131 A CN104683131 A CN 104683131A CN 201310627348 A CN201310627348 A CN 201310627348A CN 104683131 A CN104683131 A CN 104683131A
- Authority
- CN
- China
- Prior art keywords
- service
- status information
- state information
- processing instruction
- state
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention provides an application stage virtualization high-reliability method and device. The method comprises the following steps that service state information requiring monitoring service in a virtual machine operation system is collected, and in addition, service state information requiring the monitoring service and having the abnormal service state is filtered out; the service state information of the service requiring the monitoring and having the abnormal service state is analyzed, a processing instruction relative to each service state is generated, and the processing instruction includes a plurality of parameters; an API (application program interface) corresponding to the name of the processing instruction is found, the API interface is called, the service name of the processing instruction parameters is loaded in, and the abnormal state restoration operation is completed. Compared with the prior art, the method and the device have the advantages that the high reliability is realized by aiming at the application service in a virtual machine, and the real-time high-reliability processing of the service during stop or pause due to non-physical fault is guaranteed, so that the uninterrupted operation of the service can be realized.
Description
Technical field
The present invention relates to network communication field, particularly relate to a kind of virtual high reliability method of application layer and device.
Background technology
The IBM large computer system that virtual (VIrtualization) technology appears at the sixties in 20th century the earliest.Along with multiple nucleus system in recent years, cluster, the network even widespread deployment of cloud computing, the advantage of Intel Virtualization Technology in business application embodies day by day, not only reduce IT cost, but also enhancing security of system and reliability, virtualized concept is also deep into people's routine work gradually with life.
Increasing enterprise uses server virtualization to provide business at present, such as e-mail system, data base set is unified web application server, disposes IT data center by Intel Virtualization Technology, maximizedly can utilize server resource, save IT construction cost.
In the IT environment of routine, server due to hardware damage or the collapse of gaseous state reason can interrupt an important application program and have influence on other relevant to this application program operating, by contrast, in a virtualized environment, each station server is all by application program important for support 5 to 10.The collapse of the station server in conventional environment can interrupt an application, and this is only a problem.But in virtual environment, a station server collapse may interrupt the Database Systems of this client, and e-mail system, file server, e-commerce system and financial application system etc., this is a disaster.
So the heart is in the process of virtual development in the data, this situation is once occur the direct production to enterprise and profit to have an impact, and server virtualization reliability engineering keeps competitiveness to seem extremely important to enterprise in current business environment.The availability of server virtualization reliability engineering refers to that system or assembly can times of continuous service, availability use service in a year can the percentage of time weigh, the high reliability index be widely used at present is 99.999%.Thus, high reliability has become data center and has implemented, function more and more important in deployment.
Present stage is for virtualized high reliability (High Availability, HA), there is a lot of solution, such as two-node cluster hot backup mode, mode and trunking mode are shared in load equally, two-node cluster hot backup (dual computer fault-tolerant) is exactly for important service, use two-server, mutual backup, the same service of common execution, when a station server breaks down, can bear service role by another station server, thus when not needing manual intervention, automatic guarantee system can continue to provide service.
The shortcoming of two-node cluster hot backup:
Reliability is relatively poor, when there being service to break down, it is a more fragile link that data between migration two server carrying out virtual machine copy in real time, carry out at file and disk layer because it copies, whether successfully can affect db transaction operation when carrying out copying in virtual machine (vm) migration process, therefore easily occurring the incomplete situation of data.
The shortcoming of cluster:
1. build cluster environment complexity, relate to many aspects, the stability of the system of impact.
2. cluster detecting confirms fault, when there being service to break down, carry out the migration of virtual machine, take over the data field of sharing to need to spend the regular hour in the process switched, the time according to the switching that varies in size of application also can be different, and the time that larger application switches is longer.
Summary of the invention
In view of this, the invention provides a kind of virtual high reliability method of application layer, solve under virtualized environment, realize high reliability for application service in virtual machine, carry out highly reliable process in real time guarantee business causes stopping or suspending during due to non-physical fault and run without interruption to make business.
Specifically, the invention provides a kind of virtual high reliability method of application layer, described method comprises:
A) all services of VME operating system are obtained, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
B) gather the service status information needing monitor service in VME operating system, and filter out the service status information needing monitor service of wherein service state exception;
C) analyze the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
D) search processing instruction title corresponding A PI interface, call this api interface, import the parameter of this processing instruction into, complete the recovery operation of abnormality.
Further, described service status information comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
Further, the service status information of monitor service that needs of described service state exception comprises paused state information and halted state information.
Further, described processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
Further, described processing instruction parameter comprises authority, the service name of operate services and starts the authority of serving.
The present invention provides a kind of application layer virtual high reliability devices simultaneously, and this device comprises:
Service filter element: for obtaining all services of VME operating system, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
Service status information collecting unit: for gathering in VME operating system the service status information needing monitor service, and filter out the service status information needing monitor service of wherein service state exception;
Service status information analytic unit: for analyzing the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
Service status information processing unit: for searching processing instruction title corresponding A PI, calling this api interface, importing the parameter of this processing instruction into, complete the recovery operation of abnormality.
Further, described service status information comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
Further, the service status information of monitor service that needs of described service state exception comprises paused state information and halted state information.
Further, described processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
Further, described processing instruction parameter comprises authority, the service name of operate services and starts the authority of serving.
As can be seen here, the virtual high reliability method of a kind of application layer provided by the invention and device, analyze by needing the service status information of monitor service to application layer and realize timely restarting and starting service processing to the service of service state exception, achieve the high reliability based on service application level in virtualized environment, the deficiency of uncontrollable application service under compensate for traditional approach, avoid in prior art when service break down carry out virtual machine (vm) migration time two-node cluster hot backup easily occur that the imperfect and trunking mode complex environment of data is built in unsteadiness and virtual machine service handoff procedure because application service is large, the problem that switching time is long, the present invention realizes implementing highly reliable guarantee from application service rank by virtual machine internal.
Accompanying drawing explanation
Fig. 1 is the virtual high reliability method flow diagram of a kind of application layer;
Fig. 2 is the virtual high reliability devices building-block of logic of a kind of application layer.
Embodiment
In order to address this problem, embodiments provide a kind of virtual high reliability method of application layer.Please refer to Fig. 1, in a preferred embodiment, the inventive method is specific as follows:
A) all services of VME operating system are obtained, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
The service status information of each service comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
Service as run in VME operating system has IIS (Internet Information Services, Internet Information Service), oracle database and mySQL database to serve; Then now can call the open api interface (Application Programming Interface, application programming interface) of operating system (windows, linux etc.).Obtain all service lists to compare with the list of monitor service that needs of preserving in local data base, judge that service that service name is identical is for needing monitor service, that preserves with this locality as IIS service needs the service name in monitor service identical, then obtain the service status information of service name W3SVC and this service needing monitor service IIS to serve.
In specific implementation, different operating system has different api interfaces.For Windows system, service managerZ-HU control handle is obtained as called OpenSCManager interface under windows system, call EnumServicesStatus interface and enumerate all information on services, comprising the title, service status information etc. of service, call OpenService () interface and open service.For linux system, under linux system, obtain all service chkconfig--list, call the title that service service name status checks service, call service Service name (start restart) and realize the startup of service and restart.
B) gather the service status information needing monitor service in VME operating system, and filter out the service status information needing monitor service of wherein service state exception;
The service status information of monitor service that needs of service state exception generally includes paused state information and halted state information.For IIS service, if this service needs monitor service, then gather the service status information of IIS service, the service status information of the IIS service judging to get is as halted state information or paused state information, then illustrate that this IIS service state is the service status information of service state exception, respective handling need be carried out to this IIS service.
C) analyze the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
The service status information needing monitor service IIS to serve got is analyzed, judges that then service status information calls corresponding processing instruction.Processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
If judge that the service status information that IIS serves stops (SERVICE_STOPPED) state information, then need to restart IIS service, call and restart class processing instruction (open); The service status information that IIS serves if judge is paused (SERVICE_PAUSED) state information, need recover service, then call and recover class processing instruction (delete, open).
D) search processing instruction title corresponding A PI interface, call this api interface, import the parameter of this processing instruction into, complete the recovery operation of abnormality.
Service status information according to the IIS service analyzed stops (SERVICE_STOPPED) state information, that calls restarts class processing instruction (open), search the api interface corresponding with restarting class processing instruction title (open), import the authority of relevant parameter as operate services into, service name (W3SVC), start the authority (SERVICE_ALL_ACCESS) of service, realize restarting of service.As service status information paused (SERVICE_PAUSED) state information according to the IIS service analyzed, the recovery class processing instruction (delete called, open), search and recover class processing instruction title (delete, open) corresponding api interface, imports the authority of relevant parameter as operate services into, service name (W3SVC), start the authority of service, realize the startup of service.Start in realization service and need first to perform to close before reboot operation and serve.
This embodiment is by the monitoring to the application service of virtual machine, judge the service status information needing monitor service, automatically realize starting and restarting to the service of service state exception, ensure tasks carrying, recovery reaches the continuity and high reliability and fault-tolerant (Fault-tolerant that ensure business in virtual machine, FT), the high reliability of service is ensured.
Please refer to Fig. 2, based on said method, the invention provides the virtual high reliability devices of a kind of corresponding application layer, this application of installation is on main frame (such as server), as the operation carrier of this logic device, the hardware environment of described main process equipment at least all comprises CPU, internal memory, non-volatile memory medium and other necessary hardware usually, and this device comprises:
Service filter element: for obtaining all services of VME operating system, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
Obtain all service lists to compare with the list of monitor service that needs of preserving in local data base, judge that service that service name is identical is for needing monitor service, that preserves with this locality as IIS service needs the service name in monitor service identical, then obtain the service status information of service name W3SVC and this service needing monitor service IIS to serve.
The service status information of each service comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
Service status information collecting unit: for gathering in VME operating system the service status information needing monitor service, and filter out the service status information needing monitor service of wherein service state exception;
The service status information of monitor service that needs of service state exception comprises paused state information and halted state information.For IIS service, gather the service status information needing monitor service IIS to serve obtained, the service status information of the IIS service judging to get is as halted state information or paused state information, then illustrate that this IIS service state is the service status information of service state exception, respective handling need be carried out to this IIS service.
Service status information analytic unit: for analyzing the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
The service status information needing monitor service IIS to serve got is analyzed, judges that then service state calls corresponding processing instruction.Processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
Service status information processing unit: for searching processing instruction title corresponding A PI interface, calling this api interface, importing this processing instruction parameter into, completing the recovery operation of abnormal task.
Service status information according to the IIS service analyzed stops (SERVICE_STOPPED) state information, that calls restarts class processing instruction (open), search the api interface corresponding with restarting class processing instruction title (open), import the authority of relevant parameter as operate services into, service name (W3SVC), start the authority (SERVICE_ALL_ACCESS) of service, realize restarting of service.As service status information paused (SERVICE_PAUSED) state information according to the IIS service analyzed, the recovery class processing instruction (delete called, open), search and recover class processing instruction title (delete, open) corresponding api interface, imports the authority of relevant parameter as operate services into, service name (W3SVC), start the authority of service, realize the startup of service.Start in realization service and need first to perform to close before reboot operation and serve.
As can be seen from above execution mode, virtual highly reliable object-oriented is converted to the application service of virtual machine internal by virtual machine itself, by external control, inter-process instruction is converted to for the control of application service and api interface controls, under compensate for traditional approach, controls the deficiency of application service.Highly reliable guarantee is implemented by virtual machine internal from application service rank.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment made, equivalent replacement, improvement etc., all should be included within the scope of protection of the invention.
Claims (10)
1. the virtual high reliability method of application layer, it is characterized in that, the method comprises:
A) all services of VME operating system are obtained, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
B) gather the service status information needing monitor service in VME operating system, and filter out the service status information needing monitor service of wherein service state exception;
C) analyze the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
D) search processing instruction title corresponding A PI interface, call this api interface, import the parameter of this processing instruction into, complete the recovery operation of abnormality.
2. the method for claim 1, is characterized in that, described service status information comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
3. the method for claim 1, is characterized in that, the service status information of monitor service that needs of described service state exception comprises paused state information and halted state information.
4. the method for claim 1, is characterized in that, described processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
5. the method for claim 1, is characterized in that, described processing instruction parameter comprises authority, the service name of operate services and starts the authority of serving.
6. the virtual high reliability devices of application layer, this device comprises:
Service filter element: for obtaining all services of VME operating system, the service name of monitor service that needs of preserving in the service name of all for operating system services and local data base is compared, filter out and wherein need monitor service, call operation system api interface, obtains the service status information that these need monitor service;
Service status information collecting unit: for gathering in VME operating system the service status information needing monitor service, and filter out the service status information needing monitor service of wherein service state exception;
Service status information analytic unit: for analyzing the service status information of monitor service that needs of service state exception, produce the processing instruction corresponding relative to each exception service state, described processing instruction comprises some parameters;
Service status information processing unit: for searching processing instruction title corresponding A PI interface, calling this api interface, importing the parameter of this processing instruction into, completing the recovery operation of abnormality.
7. device as claimed in claim 6, is characterized in that, described service status information comprise halted state information, start in state information, run in state information, continue in state information, suspend in state information and paused state information.
8. device as claimed in claim 6, is characterized in that, the service status information of monitor service that needs of described service state exception comprises paused state information and halted state information.
9. device as claimed in claim 6, is characterized in that, described processing instruction comprises recovery class instruction and restarts class instruction; Wherein recover class instruction comprise closedown service order and start service order; Restart class instruction comprise closedown service order and restart service order.
10. device as claimed in claim 6, is characterized in that, described processing instruction parameter comprises authority, the service name of operate services and starts the authority of serving.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310627348.9A CN104683131A (en) | 2013-11-27 | 2013-11-27 | Application stage virtualization high-reliability method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310627348.9A CN104683131A (en) | 2013-11-27 | 2013-11-27 | Application stage virtualization high-reliability method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104683131A true CN104683131A (en) | 2015-06-03 |
Family
ID=53317764
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310627348.9A Pending CN104683131A (en) | 2013-11-27 | 2013-11-27 | Application stage virtualization high-reliability method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104683131A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106559441A (en) * | 2015-09-25 | 2017-04-05 | 华为技术有限公司 | It is a kind of based on the virtual machine monitoring method of cloud computing service, apparatus and system |
CN110750586A (en) * | 2019-10-12 | 2020-02-04 | 北京浪潮数据技术有限公司 | Operation information processing method and system of virtualization management platform |
CN117032881A (en) * | 2023-07-31 | 2023-11-10 | 广东保伦电子股份有限公司 | Method, device and storage medium for detecting and recovering abnormality of virtual machine |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101136044A (en) * | 2006-08-29 | 2008-03-05 | 联想(北京)有限公司 | Software watchdog system and method |
CN102708018A (en) * | 2012-04-20 | 2012-10-03 | 华为技术有限公司 | Method and system for exception handling, proxy equipment and control device |
CN102819465A (en) * | 2012-06-29 | 2012-12-12 | 华中科技大学 | Failure recovery method in virtualization environment |
CN103152414A (en) * | 2013-03-01 | 2013-06-12 | 四川省电力公司信息通信公司 | High available system based on cloud calculation and implementation method thereof |
CN103152419A (en) * | 2013-03-08 | 2013-06-12 | 中标软件有限公司 | High availability cluster management method for cloud computing platform |
CN103365758A (en) * | 2013-08-05 | 2013-10-23 | 北京搜狐新媒体信息技术有限公司 | Process monitoring method and system in virtualization environment |
-
2013
- 2013-11-27 CN CN201310627348.9A patent/CN104683131A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101136044A (en) * | 2006-08-29 | 2008-03-05 | 联想(北京)有限公司 | Software watchdog system and method |
CN102708018A (en) * | 2012-04-20 | 2012-10-03 | 华为技术有限公司 | Method and system for exception handling, proxy equipment and control device |
CN102819465A (en) * | 2012-06-29 | 2012-12-12 | 华中科技大学 | Failure recovery method in virtualization environment |
CN103152414A (en) * | 2013-03-01 | 2013-06-12 | 四川省电力公司信息通信公司 | High available system based on cloud calculation and implementation method thereof |
CN103152419A (en) * | 2013-03-08 | 2013-06-12 | 中标软件有限公司 | High availability cluster management method for cloud computing platform |
CN103365758A (en) * | 2013-08-05 | 2013-10-23 | 北京搜狐新媒体信息技术有限公司 | Process monitoring method and system in virtualization environment |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106559441A (en) * | 2015-09-25 | 2017-04-05 | 华为技术有限公司 | It is a kind of based on the virtual machine monitoring method of cloud computing service, apparatus and system |
CN110750586A (en) * | 2019-10-12 | 2020-02-04 | 北京浪潮数据技术有限公司 | Operation information processing method and system of virtualization management platform |
CN117032881A (en) * | 2023-07-31 | 2023-11-10 | 广东保伦电子股份有限公司 | Method, device and storage medium for detecting and recovering abnormality of virtual machine |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8983961B2 (en) | High availability for cloud servers | |
US9052935B1 (en) | Systems and methods for managing affinity rules in virtual-machine environments | |
US10169173B2 (en) | Preserving management services with distributed metadata through the disaster recovery life cycle | |
US11321197B2 (en) | File service auto-remediation in storage systems | |
Bala et al. | Fault tolerance-challenges, techniques and implementation in cloud computing | |
US8843717B2 (en) | Maintaining consistency of storage in a mirrored virtual environment | |
US10162708B2 (en) | Fault tolerance for complex distributed computing operations | |
WO2019152122A1 (en) | Systems and methods for performing computing cluster node switchover | |
US20190235979A1 (en) | Systems and methods for performing computing cluster node switchover | |
CN101895540B (en) | For the system and method that application service process is guarded | |
US9317380B2 (en) | Preserving management services with self-contained metadata through the disaster recovery life cycle | |
CN104516789A (en) | Method and system for failover detection and treatment in checkpoint systems | |
CN103795742B (en) | Isomery storage and disaster tolerance management system and method | |
CN104683131A (en) | Application stage virtualization high-reliability method and device | |
CN105068899A (en) | Automatic reboot stability test method for Vmware system | |
CN114035905A (en) | Fault migration method and device based on virtual machine, electronic equipment and storage medium | |
CN114691304B (en) | Method, device, equipment and medium for realizing high availability of cluster virtual machine | |
CN105391790A (en) | Database high-availability method similar to RAC One Node | |
US20230101776A1 (en) | Desired state configuration for virtual machines | |
CN103268271A (en) | Disaster tolerance realizing method of all-in-one machine | |
CN112035295A (en) | Virtual machine crash event processing method, system, terminal and storage medium | |
Cao et al. | IT Operation and Maintenance Process improvement and design under virtualization environment | |
WO2023185355A1 (en) | Method and apparatus for achieving high availability of clustered virtual machines, device, and medium | |
Singh et al. | Fault tolerance-challenges, techniques and implementation in cloud computing | |
AU2015249127B2 (en) | Fault tolerance for complex distributed computing operations |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: Binjiang District and Hangzhou city in Zhejiang Province Road 310051 No. 68 in the 6 storey building Applicant after: Hangzhou Dipu Polytron Technologies Inc Address before: Binjiang District and Hangzhou city in Zhejiang Province Road 310051 No. 68 in the 6 storey building Applicant before: Hangzhou Dipu Technology Co., Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150603 |
|
RJ01 | Rejection of invention patent application after publication |