CN107040406B - End cloud cooperative computing system and fault-tolerant method thereof - Google Patents

End cloud cooperative computing system and fault-tolerant method thereof Download PDF

Info

Publication number
CN107040406B
CN107040406B CN201710148783.1A CN201710148783A CN107040406B CN 107040406 B CN107040406 B CN 107040406B CN 201710148783 A CN201710148783 A CN 201710148783A CN 107040406 B CN107040406 B CN 107040406B
Authority
CN
China
Prior art keywords
cloud
task
module
tasks
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710148783.1A
Other languages
Chinese (zh)
Other versions
CN107040406A (en
Inventor
沈玉龙
张宇恒
司旭
保积元
王思怡
阮金清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN201710148783.1A priority Critical patent/CN107040406B/en
Publication of CN107040406A publication Critical patent/CN107040406A/en
Application granted granted Critical
Publication of CN107040406B publication Critical patent/CN107040406B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0659Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
    • H04L41/0661Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities by reconfiguring faulty entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer And Data Communications (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses a terminal cloud cooperative computing system and a fault-tolerant method thereof, which utilize the characteristic that the cloud capability is stronger than that of a terminal to realize two safe fault-tolerant strategies of cloud backup and cloud recovery at the cloud end, thereby ensuring the task safety and the correctness of terminal cloud cooperative computing. And an abnormal terminal node detection function is provided, so that the cloud node can replace a terminal node which does not meet the system computing standard in real time, and the stable operation of the whole service system is ensured. According to the invention, the terminal with certain computing capacity in the security certification access system is also added into the computing resource pool to form the terminal cluster, the terminal cluster has strong expansibility, the expansibility of the cloud is increased, the terminal cluster and the service nodes in the static cloud together complete the tasks required to be computed by the single cloud platform, the traditional cloud platform is optimized, the load of the cloud is effectively reduced, and the load balance is achieved.

Description

End cloud cooperative computing system and fault-tolerant method thereof
Technical Field
The invention relates to the technical field of cloud computing, in particular to a terminal cloud cooperative computing system and a fault-tolerant method thereof.
Background
Cloud computing is an increasing, usage and delivery model for internet-based related services, typically involving the provision of dynamically scalable and often virtualized resources over the internet. Cloud is a metaphor of network and internet. In the past, telecommunications networks were often represented by clouds and later also by the abstraction of the internet and the underlying infrastructure. Cloud computing can enable a user to experience computing power of 10 trillion times per second, and the powerful computing power can simulate nuclear explosion, forecast climate change and market development trend. The user can access the data center through a computer, a notebook, a mobile phone and the like and calculate according to the own requirements.
Cloud computing is a product of development and fusion of traditional computer and network technologies, such as distributed computing, parallel computing, utility computing, network storage, virtualization, load balancing, hot backup redundancy and the like. There are many references to the definition of cloud computing, and at the present stage, it is widely accepted that the definition of the national institute of standards and technology is: cloud computing is a pay-per-use model that provides available, convenient, on-demand network access into a configurable shared pool of computing resources (resources including networks, servers, storage, applications, services) that can be provisioned quickly with little administrative effort or interaction with service providers.
The cloud computing platform is also referred to as a cloud platform. Cloud platforms can be divided into 3 classes: the cloud computing platform comprises a storage type cloud platform taking data storage as a main part, a computing type cloud platform taking data processing as a main part and a comprehensive type cloud computing platform taking computing and data processing into consideration.
The terminal cloud cooperation platform is different from a traditional cloud platform, and mobile terminals providing certain computing capacity are added into a resource pool to jointly complete computing tasks. The method achieves effective utilization of idle resources, and responds to the call of the existing low-carbon technology and green technology.
Most of terminal mobile devices are mobile devices, most of access terminal cloud cooperative architectures are accessed in a wireless network mode, and certain difference exists between the access terminal cloud cooperative architectures and computing nodes on a cloud platform in terms of network stability and quality. Meanwhile, the terminal has relatively weak durability and is easily influenced by a plurality of external environment factors, and the possibility of abnormity or failure is greatly increased.
Disclosure of Invention
In view of the above defects in the prior art, the technical problem to be solved by the present invention is to provide an end cloud cooperative computing system and a fault tolerance method thereof, which implement a safe end cloud cooperative fault tolerance method by combining two strategies of cloud backup and cloud recovery, thereby ensuring high reliability of the architecture.
In order to achieve the above object, the present invention provides a peer cloud collaborative computing system, which is characterized in that: the system consists of a task management server module, an end cloud cooperative server module, a static cloud service node module and a mobile terminal computing service node module, wherein the task management server module comprises four modules:
a task management server module: the module is responsible for acquiring tasks submitted by a user, packaging the tasks, processing the tasks and sending the tasks to the end cloud cooperation server module;
the end cloud collaboration server module: the module is responsible for receiving tasks sent by the management server, formulating a task scheduling strategy and cooperatively managing the static cloud and the mobile terminal resources;
static cloud service node module: the module consists of a static cloud server and is responsible for calculating various calculation tasks sent by an end cloud cooperative server, setting check points for mobile terminal tasks and carrying out cloud backup on the tasks so as to prevent the tasks from being lost due to calculation failure on mobile terminal resources;
the mobile terminal computing service node module: the module consists of various hardware terminals and is responsible for computing tasks sent by the terminal cloud collaboration server.
Furthermore, the end cloud cooperation server module is also provided with an automatic discovery module, the end cloud cooperation server uses an active detection method for discovering available resources in time, can dynamically expand mobile terminal resources for the cloud platform, allocate tasks suitable for operation of the static cloud and the mobile terminal resources for the static cloud and the mobile terminal resources, and perform real-time communication with the static cloud and the mobile terminal resources to ensure normal execution of the fault-tolerant system.
Furthermore, the static cloud service node module is also provided with a dynamic monitoring module, and can monitor the uploading condition of the calculation result of the terminal resource check point in real time by the cooperation of the cloud and the server so as to ensure the operation of the fault-tolerant system.
Further, the mobile terminal computing service node module is also provided with a log storage and uploading module, and can store the calculation results of the check points and upload the calculation results of the check points to the terminal cloud coordination server on time.
Furthermore, the mobile terminals are all brands of pads and mobile phones based on the android system.
A fault tolerance method of an end cloud cooperative computing system is characterized by comprising the following specific steps:
the method comprises the following steps: after a user issues a task, the task management server receives the task, sorts all the tasks and sends the tasks to the end cloud coordination server;
step two: after receiving the task, the end cloud cooperation server carries out task scheduling and transmission;
step three: after receiving the task, the static cloud end performs checkpoint setting on the task, performs static cloud backup operation on the task after checkpoint setting, and sends the task to the end cloud coordination server after execution is completed;
step four: the end cloud cooperation server sends the tasks processed by the static cloud end to the mobile end equipment;
step five: and after receiving the task, the mobile terminal performs operation processing on the task.
Further, the second step is specifically: the tasks are classified and processed, the tasks are divided into priorities according to the end cloud distribution module, the task queues are sorted according to the first-come first-serve and priority sizes, a task distribution strategy is formulated, and the tasks suitable for being calculated on front-end equipment are sent to the cloud end first.
Further, the fifth step is specifically: in the processing process, if the operation is successful, the result is uploaded to the cooperative server, meanwhile, the cooperative server uploads the result to the cloud end, and then the backup part of the task is deleted by the cloud end; if the operation fails, the cooperative server executes the fault-tolerant method and returns the task to the cloud for execution.
The invention has the beneficial effects that:
according to the method, two safety fault-tolerant strategies of cloud backup and cloud recovery are realized at the cloud side by utilizing the characteristic that the cloud side capability is higher than that of the terminal, so that the task safety and the accuracy of end cloud cooperative computing are ensured. And an abnormal terminal node detection function is provided, so that the cloud node can replace a terminal node which does not meet the system computing standard in real time, and the stable operation of the whole service system is ensured. According to the invention, the terminal with certain computing capacity in the security certification access system is also added into the computing resource pool to form the terminal cluster, the terminal cluster has strong expansibility, the expansibility of the cloud is increased, the terminal cluster and the service nodes in the static cloud together complete the tasks required to be computed by the single cloud platform, the traditional cloud platform is optimized, the load of the cloud is effectively reduced, and the load balance is achieved.
The conception, the specific structure and the technical effects of the present invention will be further described with reference to the accompanying drawings to fully understand the objects, the features and the effects of the present invention.
Drawings
Fig. 1 is a structural block diagram of an end cloud cooperative computing system according to the present invention.
Fig. 2 is a task computing flow chart of an end cloud coordination system-based task computing with a fault tolerant system according to the present invention.
Fig. 3 is a flowchart of a fault tolerance method of an end cloud cooperative computing architecture according to the present invention.
Detailed Description
As shown in fig. 1, an end cloud collaborative computing system is characterized in that: the system consists of a task management server module, an end cloud cooperative server module, a static cloud service node module and a mobile terminal computing service node module, wherein the task management server module comprises four modules:
a task management server module: the module is responsible for acquiring tasks submitted by a user, packaging the tasks, processing the tasks and sending the tasks to the end cloud cooperation server module;
the end cloud collaboration server module: the module is responsible for receiving tasks sent by the management server, formulating a task scheduling strategy and cooperatively managing the static cloud and the mobile terminal resources;
static cloud service node module: the module consists of a static cloud server and is responsible for calculating various calculation tasks sent by an end cloud cooperative server, setting check points for mobile terminal tasks and carrying out cloud backup on the tasks so as to prevent the tasks from being lost due to calculation failure on mobile terminal resources;
the mobile terminal computing service node module: the module consists of various hardware terminals and is responsible for computing tasks sent by the terminal cloud collaboration server.
In this embodiment, the end cloud cooperation server module is further equipped with an automatic discovery module, and the end cloud cooperation server uses an active detection method to discover available resources in time, dynamically extend mobile terminal resources for the cloud platform, allocate tasks suitable for operations of static clouds and mobile terminal resources, and perform real-time communication with the static clouds and the mobile terminal resources to ensure normal execution of the fault-tolerant system.
In this embodiment, the static cloud service node module is further equipped with a dynamic monitoring module, and the dynamic monitoring module can monitor the uploading condition of the calculation result of the terminal resource check point in real time by the cooperation with the cloud coordination server, so as to ensure the operation of the fault-tolerant system.
In this embodiment, the mobile terminal computing service node module is further equipped with a log storage and uploading module, and can upload the computing results of each check point to the peer cloud cooperation server on time.
In this embodiment, the mobile terminals are all brands of pads and mobile phones based on the android system.
As shown in fig. 2, a fault tolerance method for a peer cloud cooperative computing system is characterized by comprising the following specific steps:
the method comprises the following steps: after a user issues a task, the task management server receives the task, sorts all the tasks and sends the tasks to the end cloud coordination server;
step two: after receiving the task, the end cloud cooperation server carries out task scheduling and transmission;
step three: after receiving the task, the cloud end performs checkpoint setting on the task, performs static cloud backup operation on the task after checkpoint setting, and sends the task to the end cloud coordination server after execution is completed;
step four: the end cloud cooperation server sends the task processed by the cloud end to the mobile end equipment;
step five: and after receiving the task, the mobile terminal performs operation processing on the task.
In this embodiment, the second step specifically includes: the tasks are classified and processed, the tasks are divided into priorities according to the end cloud distribution module, the task queues are sorted according to the first-come first-serve and priority sizes, a task distribution strategy is formulated, and the tasks suitable for being calculated on front-end equipment are sent to the cloud end first.
In this embodiment, the fifth step is specifically: in the processing process, if the operation is successful, the result is uploaded to the cooperative server, meanwhile, the cooperative server uploads the result to the cloud end, and then the backup part of the task is deleted by the cloud end; if the operation fails, the cooperative server executes the fault-tolerant method and returns the task to the cloud for execution.
Example one
With reference to fig. 1 and fig. 2, this example details a specific execution flow of a cloud-based cooperative computing architecture with a fault tolerance method according to the present invention, and includes the following steps:
step 1, a task management server of the end cloud coordination system sends a received task from a user to an end cloud coordination server;
step 2, the end cloud cooperative server receives the task of the task management server, then performs task allocation operation on the task, mainly based on the principle that priority and first-in first-out are set for the task, and respectively sends the task to the cloud end and the mobile end for calculation according to the category of the task;
step 3, sending the task suitable for the mobile terminal to calculate in the step 2 to a static cloud terminal, wherein the static cloud terminal obtains an estimated deadline T for the task, then setting a check point for the task, and after the setting is finished, performing static cloud backup on the task and sending the task to a cooperative server;
step 4, the cooperative server sends the task with the check point to the mobile terminal;
step 5, the mobile terminal executes the task, judges the task according to a fault-tolerant mechanism in the calculation process, and if the static cloud terminal receives the calculation result of each check point of the mobile terminal obtained from the cooperative server all the time in the execution process, the task is continuously executed to the completion by the mobile terminal;
step 6, if the static cloud end does not receive the check point calculation result of the mobile end acquired from the cooperative server in the execution process of the mobile end, the cloud server of the end requests the calculation of the mobile end to return to an available check point, recalculation is carried out, if the retry frequency exceeds 3 times and the static cloud end still does not receive the result, cloud recovery operation is executed, the task calculated on the mobile end is unloaded, the task is recovered to the static cloud end, and the static cloud end calculates according to the calculation result of the available check point;
and 7, ending.
Example two
With reference to fig. 3, this example describes in detail a fault tolerance method based on an end cloud cooperative computing architecture provided in the present invention, where the method is described as follows:
setting the total amount of one task as M, wherein a represents the number of check points which can tolerate e error tasks and are inserted at equal intervals, and the interval is N, and then N is M/(a + 1); wherein e is1Indicating an error occurred during checkpoint save, e2Indicating an error that occurred at checkpoint recovery, e3Indicating an error that occurred during the efficient computation; when e is generated1In case of error, the maximum time required for recovering to normal operation is' check point interval time N + time R for recovering one check pointc+ time S for saving a check pointc", when generating e2In the event of an error, the maximum time required to restore to normal operation is "time for restoring one checkpoint" Rc", when the b3 error is generated, the maximum time required to recover to normal operation is" checkpoint interval time N + time R for recovering a checkpointc". When a task is executed, its total response time is defined as "estimated task execution time without error + time required to save a checkpoints + e1Time + e at which a fault occurs during the retention period2One fault occurs during state recovery + e3A fault occurs during active execution. It can be seen that the worst case occurs when only e occurs1Type of error (when it occurs, the maximum time required to recover from a failure to the task functioning properly is the longest), and,the total response time should be less than a cutoff time T pre-calculated by the system, by which the number of checkpoints a can be finally calculated.
After the check point is set, cloud backup storage is carried out on the static cloud end, the task with the check point set is sent to the end cloud coordination server, and the task is sent to the mobile end by the end cloud coordination server to be calculated. In the calculation process of the mobile terminal, when each check point is reached, the task carries out calculation result backup operation once, the result is stored in a local log file and uploaded to the cooperative server, and the result is uploaded to the static cloud terminal by the cooperative server. And the static cloud end waits at each check point, the waiting time does not exceed the check point interval time and the task transmission time at most, if the overtime does not obtain the correspondence, the static cloud end sends an instruction to the cooperative server, and the mobile end is required to retry: the calculation is returned to the mobile for recalculation to an available checkpoint and the maximum number of retries is 3. And if the terminal cloud cooperation server does not obtain any response of the mobile terminal for more than 3 times or the total number of times of errors of the mobile terminal exceeds more than half of the number of the check points, executing cloud recovery operation, unloading the task of the mobile terminal, and calculating by the static cloud terminal according to the data of the check points available on the task. If the retry receives the response, the mobile terminal continues to execute the task, and after the task is completed, the static cloud terminal unloads the backup information of the task which is stored before.
The foregoing detailed description of the preferred embodiments of the invention has been presented. It should be understood that numerous modifications and variations could be devised by those skilled in the art in light of the present teachings without departing from the inventive concepts. Therefore, the technical solutions available to those skilled in the art through logic analysis, reasoning and limited experiments based on the prior art according to the concept of the present invention should be within the scope of protection defined by the claims.

Claims (7)

1. An end cloud collaborative computing system, characterized in that: the system consists of a task management server module, an end cloud cooperative server module, a static cloud service node module and a mobile terminal computing service node module, wherein the task management server module comprises four modules:
a task management server module: the module is responsible for acquiring tasks submitted by a user, packaging the tasks, processing the tasks and sending the tasks to the end cloud cooperation server module;
the end cloud collaboration server module: the module is responsible for receiving tasks sent by the management server, formulating a task scheduling strategy and cooperatively managing the static cloud and the mobile terminal resources;
static cloud service node module: the module consists of a static cloud server and is responsible for calculating various calculation tasks sent by an end cloud cooperative server, setting check points for mobile terminal tasks and carrying out cloud backup on the tasks so as to prevent the tasks from being lost due to calculation failure on mobile terminal resources;
the mobile terminal computing service node module: the module consists of various mobile terminals and is responsible for calculating tasks sent by the terminal cloud collaboration server;
the end cloud cooperation server module is also provided with an automatic discovery module, and the end cloud cooperation server uses an active detection method for discovering available resources in time, can dynamically expand mobile terminal resources for the cloud platform, allocate tasks suitable for the operation of the static cloud and the mobile terminal resources, and perform real-time communication with the static cloud and the mobile terminal resources to ensure the normal execution of the fault-tolerant system.
2. The peer-cloud collaborative computing system of claim 1, wherein: the static cloud service node module is also provided with a dynamic monitoring module, and can monitor the uploading condition of the calculation result of the terminal resource check point in real time by the cooperation of the cloud and the cloud cooperative server so as to ensure the operation of the fault-tolerant system.
3. The peer-cloud collaborative computing system of claim 1, wherein: the mobile terminal computing service node module is also provided with a log storage and uploading module, and can upload the computing results of each check point to the terminal cloud cooperation server on time.
4. The peer-cloud collaborative computing system of claim 1, wherein: the mobile terminals are all brands of pads and mobile phones based on the android system.
5. A fault tolerance method for applying the end cloud cooperative computing system according to any one of claims 1 to 4 is characterized by comprising the following specific steps:
the method comprises the following steps: after a user issues a task, the task management server receives the task, sorts all the tasks and sends the tasks to the end cloud coordination server;
step two: after receiving the task, the end cloud cooperation server carries out task scheduling and transmission;
step three: after receiving the task, the cloud end performs checkpoint setting on the task, performs static cloud backup operation on the task after checkpoint setting, and sends the task to the end cloud coordination server after execution is completed;
step four: the end cloud cooperation server sends the task processed by the cloud end to the mobile end equipment;
step five: and after receiving the task, the mobile terminal performs operation processing on the task.
6. The fault tolerance method for the cloud-side collaborative computing system according to claim 5, wherein the second step is specifically: the tasks are classified and processed, the tasks are divided into priorities according to the end cloud distribution module, the task queues are sorted according to the first-come first-serve and priority sizes, a task distribution strategy is formulated, and the tasks suitable for being calculated on front-end equipment are sent to the cloud end first.
7. The fault tolerance method for the cloud-side collaborative computing system according to claim 5, wherein the step five is specifically: in the processing process, if the operation is successful, the result is uploaded to the cooperative server, meanwhile, the cooperative server uploads the result to the cloud end, and then the backup part of the task is deleted by the cloud end; if the operation fails, the cooperative server executes the fault-tolerant method and returns the task to the cloud for execution.
CN201710148783.1A 2017-03-14 2017-03-14 End cloud cooperative computing system and fault-tolerant method thereof Active CN107040406B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710148783.1A CN107040406B (en) 2017-03-14 2017-03-14 End cloud cooperative computing system and fault-tolerant method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710148783.1A CN107040406B (en) 2017-03-14 2017-03-14 End cloud cooperative computing system and fault-tolerant method thereof

Publications (2)

Publication Number Publication Date
CN107040406A CN107040406A (en) 2017-08-11
CN107040406B true CN107040406B (en) 2020-08-11

Family

ID=59533962

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710148783.1A Active CN107040406B (en) 2017-03-14 2017-03-14 End cloud cooperative computing system and fault-tolerant method thereof

Country Status (1)

Country Link
CN (1) CN107040406B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107959708B (en) * 2017-10-24 2020-10-13 北京邮电大学 Cloud-end-edge-vehicle-end-based vehicle networking service collaborative computing method and system
CN109815003B (en) * 2017-11-21 2020-12-18 中国移动通信有限公司研究院 Cooperative computing method and system between intelligent devices, intelligent device and medium
CN110502220A (en) * 2018-05-16 2019-11-26 北京理工大学 A kind of packaging method of cloud application
CN108901046B (en) * 2018-06-14 2020-10-09 北京大学 Cooperative task unloading algorithm and system design scheme for mobile edge computing
WO2020133450A1 (en) * 2018-12-29 2020-07-02 驭势科技(北京)有限公司 System and method for sharing computing power by means of dynamic networking for mobile device
CN111683129A (en) * 2020-05-29 2020-09-18 电子科技大学 Dynamic ad hoc network mobile cloud computing system and method
CN112866404B (en) * 2021-02-03 2023-01-24 视若飞信息科技(上海)有限公司 Semi-cloud system and execution method
CN117056133B (en) * 2023-10-12 2024-02-06 杭州浩联智能科技有限公司 Data backup method, device and medium based on distributed Internet of things architecture

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101719931A (en) * 2009-11-27 2010-06-02 南京邮电大学 Multi-intelligent body-based hierarchical cloud computing model construction method
CN102916830A (en) * 2012-09-11 2013-02-06 北京航空航天大学 Implement system for resource service optimization allocation fault-tolerant management
CN103544064A (en) * 2013-10-28 2014-01-29 华为数字技术(苏州)有限公司 Cloud computing method, cloud management platform and client
CN103778031A (en) * 2014-01-15 2014-05-07 华中科技大学 Distributed system multilevel fault tolerance method under cloud environment

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10747620B2 (en) * 2013-11-15 2020-08-18 Netapp, Inc. Network storage management at scale using service level objectives

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101719931A (en) * 2009-11-27 2010-06-02 南京邮电大学 Multi-intelligent body-based hierarchical cloud computing model construction method
CN102916830A (en) * 2012-09-11 2013-02-06 北京航空航天大学 Implement system for resource service optimization allocation fault-tolerant management
CN103544064A (en) * 2013-10-28 2014-01-29 华为数字技术(苏州)有限公司 Cloud computing method, cloud management platform and client
CN103778031A (en) * 2014-01-15 2014-05-07 华中科技大学 Distributed system multilevel fault tolerance method under cloud environment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《Fault Tolerance Middleware for Cloud Computing》;Wenbing Zhao等;《2010 IEEE 3rd International Conference on Cloud Computing》;20100710;全文 *

Also Published As

Publication number Publication date
CN107040406A (en) 2017-08-11

Similar Documents

Publication Publication Date Title
CN107040406B (en) End cloud cooperative computing system and fault-tolerant method thereof
CN108632365B (en) Service resource adjusting method, related device and equipment
CN107087019B (en) Task scheduling method and device based on end cloud cooperative computing architecture
CN102819465B (en) Failure recovery method in virtualization environment
CN111913818A (en) Method for determining dependency relationship between services and related device
US20180157563A1 (en) Enabling a Cloud Controller to Communicate with Power Systems
CN109873714B (en) Cloud computing node configuration updating method and terminal equipment
CN102088490A (en) Data storage method, device and system
CN113904917A (en) Meteorological data service platform based on micro-service architecture
CN106470123A (en) Log collecting method, client, server and electronic equipment
CN102385536B (en) Method and system for realization of parallel computing
CN109656783A (en) System platform monitoring method and device
CN113867957A (en) Method and device for realizing elastic expansion of number of cross-cluster containers
CN106375102A (en) Service registration method, application method and correlation apparatus
CN115328741A (en) Exception handling method, device, equipment and storage medium
EP3633508A1 (en) Load distribution for integration scenarios
CN103634167A (en) Security configuration check method and system for target hosts in cloud environment
CN111680914B (en) Event distribution method, event distribution device, storage medium, and electronic device
CN113364857A (en) Service data processing method and device and server
CN111400241B (en) Data reconstruction method and device
CN112567687B (en) Availability towards network slicing
CN108154343B (en) Emergency processing method and system for enterprise-level information system
CN114936106A (en) Method, device and medium for processing host fault
CN111813621A (en) Data processing method, device, equipment and medium based on Flume data middlebox
CN116909758B (en) Processing method and device of calculation task and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant