CN107395733B

CN107395733B - Geographic distribution interactive service cloud resource collaborative optimization method

Info

Publication number: CN107395733B
Application number: CN201710639715.5A
Authority: CN
Inventors: 姚建国; 吴家宏; 管海兵
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2017-07-31
Filing date: 2017-07-31
Publication date: 2020-08-04
Anticipated expiration: 2037-07-31
Also published as: WO2019024445A1; CN107395733A

Abstract

The invention provides a collaborative optimization method for cloud resources of a geographically distributed interactive service, which comprises the following steps: step 1: determining available data centers for each area; step 2: determining that the interactive service can schedule and distribute all possible data center combinations; and step 3: designing and appointing a data center resource providing mode; and 4, step 4: determining a price demand probability distribution model corresponding to a stochastic programming algorithm; and 5: determining a price model according to the resource providing plan formulated in the step 3; step 6: defining tail delay of interactive service, and using the tail delay as delay constraint of optimization model; and 7: determining a data center resource scheduling optimization model; and 8: decomposing the overall problem into a main problem and a series of sub-problems by using the characteristics of random programming; and step 9: and (6) solving an optimization algorithm. The present invention aims to minimize resource configuration costs for geographically distributed interactive services by a multi-cloud service provider.

Description

Geographic distribution interactive service cloud resource collaborative optimization method

Technical Field

The invention relates to an optimization method, in particular to a collaborative optimization method for cloud resources of a geographically distributed interactive service.

Background

Geographically distributed interactive services are a class of delay sensitive computing services that need to be deployed to multiple data center areas (e.g., data center areas such as north america, asia-pacific, and europe). For interactive services such as Web search and real-time data analysis, a user sends a Web request to a data center of service deployment to acquire data or perform decision analysis. Since interactive services rely on the basic data provided by multiple data center areas, which are distributed globally, it is decided that geographically distributed interactive services require the selection of a suitable set of data centers for deployment, and that the set of data centers consists of at least one data center provided by each data center area. In addition, in order to meet the deployment requirement of the interactive service, the data center is required to provide sufficient resources to run the data center workload supporting the interactive service. Therefore, due to the dependence of basic data and the requirement of data center resources, the deployment position of the geographic distribution interactive service and the data center resource plan need to meet the tail delay requirement of users. The ultimate goal is to minimize the overall computational and communication costs of geographically distributed interactive services.

Today's cloud computing services are already well established. A certain number of data centers are deployed in regions on the global scale, such as Google computer Engine, Amazon AWS, Microsoft Azure, Ariicloud and the like, and all the data centers can provide complete cloud resource use schemes. Google computer Engine and Microsoft Azure provide policies for using data center resources on demand. They provide VM instances of differentiated computing and storage capabilities and corresponding prices. In particular, the actual usage cost of a resource is calculated in terms of usage time, e.g., a user rents a VM instance for 10 hours, then the total cost is the unit price of the VM instance multiplied by 10 hours. Amazon AWS not only provides a policy like Google computerengine and Microsoft Azure for on demand use of resources, but Amazon AWS also allows users to make a reservation resource contract, where users can reserve 1 or 3 year specifically configured VM instances and pay resource reservation fees in advance, and then bill for actual usage time at a lower price (around 50% of the price of on demand resources). On one hand, the resource prices of the cloud service providers are different, on the other hand, the resource prices of the data centers in different areas of the same cloud service provider are also different, and the resource prices are continuously changed according to market demands. Because the interactive service requests to access the data center to obtain the computing service through the Web request, the cloud service provider rates the use of the WAN network bandwidth according to the total amount of the used traffic, and the unit price of the traffic is generally in a ladder shape. For example, AWS is free for the first 1GB, more than 1G less than 10TB is charged for $0.090/GB, and more than 10TB less than 40TB is charged for $ 0.085/GB.

Since geographically distributed interactive services are a class of delay sensitive Web services, an appropriate amount of data center resources are required to provide the quality of service desired by the user for different service request rates. In the statistical information of the request rates of the Google computer Engine for the interactive service in different periods, the difference of the service request rates is great, and the difference between the lowest request rate and the highest request is 10³More than twice. Therefore, the uncertainty of the resource plan adapting to the requirement for the deployment of the interactive service is an urgent solutionTo a problem of (a). Although current cloud service providers offer policies based on reservation or on-demand use of resources, they all expose some drawbacks. If the user uses the resources based on reservation, in order to guarantee the service quality, the user must use the reserved large amount of resources to guarantee that the interactive service can still meet the service quality requirement of the user when the request rate is maximum. But because of the uncertainty and large differences in the demand, this results in a large amount of wasted resources, which is expensive and often does not work. When the user uses only the on-demand resource, although the resource demand of the service can be dynamically satisfied, since the price of the on-demand resource is higher than the price of the reserved resource by more than 50%, the resource cost is not much different from that of the strategy of utilizing the reserved resource. Based on the premise, a strategy capable of mixing reserved resources and resources on demand needs to be designed, so that the deployment cost of the geographically distributed interactive service is saved.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide a geographical distribution interactive service cloud resource collaborative optimization method, which aims to minimize the resource configuration cost of the geographical distribution interactive service through a multi-cloud service provider.

According to one aspect of the invention, a method for collaborative optimization of cloud resources of geographically distributed interactive services is provided, which is characterized by comprising the following steps:

step 1: determining available data centers for each area;

step 2: determining all available data center combinations that the interactive service can mobilize;

and step 3: designing and appointing a data center resource providing mode;

and 4, step 4: determining a price demand probability distribution model corresponding to a stochastic programming algorithm;

and 5: determining a price model according to the resource providing plan formulated in the step 3;

step 6: defining tail delay of the interactive service and using the tail delay as a delay constraint of an optimization model;

and 7: determining a data center resource scheduling optimization model;

decomposing the overall problem into a main problem and a series of sub-problems by using the characteristics of random programming;

and 8: solving an optimization algorithm, recording the resource demand of each interactive service and corresponding delay information, determining the relation between the resource quantity and the delay and predicting the primary workload scheduling; when the service is on-line, the resource demand and the delay are correspondingly changed, and the changes are used for random planning and finally applied to the dynamic adjustment of the resource plan.

Preferably, the step 3 specifically includes the following steps:

step 3.1: using reserved resource pattern only; the data center provides reserved resources, and the cloud consumer firstly makes a reserved contract and pays a prepaid fee according to the service deployment cycle and the compromised resource requirements; after the interactive service is on line, firstly, the reserved resources are distributed according to the actual requirements of the service resources, and if the reserved resources can meet the requirements of the service quality in the deployment period, the service quality requirements of the interactive service can be met only by reserving the resources;

step 3.2: using only on-demand resource patterns; the actual demand of the service is greater than the maximum available resource number of the lease due to the change of the request rate, so that the required service quality can not be achieved, and extra resources on demand are leased on the basis of reserved resources to achieve the requirement of the service quality, wherein the situation is called a hybrid phase;

step 3.3: in the mixed mode, the cloud consumers perform short-term deployment of services according to the resources on demand, and the data center only provides the resources on demand.

Preferably, the step 5 specifically includes the following steps:

step 5.1: determining a pre-paid price of the reserved contract;

step 5.2: determining a dynamic price of an actual running VM instance;

step 5.3: determining a network resource charging mode and price;

step 5.4: overlapping the cost of the computing resources and the cost of the network resources to obtain a total price;

step 5.5: a dynamic cost of the instance running in real time is determined.

Preferably, the step 6 specifically includes the following steps:

step 6.1: determining a calculation delay and a network transmission delay;

step 6.2: determining a delay threshold value of each request, and establishing a delay threshold value matrix;

step 6.3: and (4) utilizing a service level agreement to measure whether the tail delay meets the service quality requirement under the condition of determining the resource providing plan.

Preferably, said step 6.3 comprises in particular the steps of:

and 6.3.1, performing probability analysis on the historical data by using a probability statistical method to obtain a preliminary relation between the resource providing plan and the S L A tail delay.

Preferably, said step 6.3.1 comprises in particular the steps of:

step 6.3.1.1: calculating delay and network transmission delay probability statistical analysis;

step 6.3.1.2: calculating a probabilistic delay for each request;

step 6.3.1.3: and calculating probability tail delay.

Preferably, said step 7 comprises the steps of:

step 7.1: calculating the deployment cost of the candidate data center of each area;

step 7.2: and determining a main random planning problem and a sub-random planning problem.

Preferably, said step 8 comprises the steps of:

step 8.1: scheduling the workload;

step 8.2: the workload resource plan is adjusted in real time.

Preferably, said step 8.1 comprises the steps of:

step 8.1.1 firstly, generating probability distribution of VCPU (virtual central processing unit) requirements, memory requirements, disk requirements and network utilization rate according to the request record of the geographic distribution interactive service;

step 8.1.2, the generated probability distribution is used as configuration data, and the corresponding deployment cost of each schedulable data center is obtained;

step 8.1.3 initializes the scheduling decision variables.

Preferably, said step 8.2 comprises the steps of:

step 8.2.1, through the scheduling of the workload, obtaining target data center groups of all interactive services, but the obtained preliminary resource plan is coarse-grained, and the resource plan needs to be adjusted at any RPP time, so that corresponding resource demand probability distribution is generated at each RPP time;

step 8.2.2, solving by taking the generated probability distribution as configuration data to obtain a resource plan of each working load in the target data center, wherein the resource plan is fine-grained;

step 8.2.3 selects a target data center group for the geographically distributed interactive service per WDD time and reconfigures the resource plan at each RPP time, where each WDD time includes a plurality of RPP times.

Compared with the prior art, the method has the advantages that the method aims to minimize the resource allocation cost of the geographic distribution interactive service through multiple cloud service providers, because of the uncertainty of the resource price of the data center and the characteristics of the dynamic resource demand of the interactive service, a price model and an S L A delay model are designed, the scheduling of the workload and the resource plan are respectively described as a main method and a series of sub-problems based on the obtained constraint conditions and combined with random planning, the method is the essence of the method, the optimized design and the constraint method obtained through price modeling and S L A delay modeling have better universality and expansibility for the resource scheduling of most of the geographic distribution interactive service, the method saves the cost by 24% compared with the service deployment based on the resource demand, the cost by 10% compared with the service deployment based on the reserved resource, the delayed S L A constraint, the tail delay constraint based on negotiation can be met, the tail delay constraint based on the consideration can be met, the research aiming at reducing the service deployment cost of the global distribution is also lacked, and the research on the cost of some related research on the application of the global distribution of the data center is created and the corresponding optimization algorithm is created.

Drawings

Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:

FIG. 1 is a schematic diagram of a VCPU demand probability distribution.

FIG. 2 is a schematic diagram of memory demand probability distribution.

FIG. 3 is a schematic diagram of a disk demand probability distribution.

Fig. 4 is a schematic diagram of a network usage probability distribution.

Detailed Description

The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that variations and modifications can be made by persons skilled in the art without departing from the spirit of the invention. All falling within the scope of the present invention.

Geographically distributed interactive services require support from the underlying data, which is typically backed up between data centers in the same area in order to provide high availability. Because of the backup of the basic data, the interactive service can select a data center with lower price for deployment. However, the prices of the various cloud service providers are different for the same configured VM instance and the same network traffic. In addition, the pressure of requests for services by users of interactive services at different times varies, thus making the demand for resources by interactive services uncertain. Because of changing resource demands and changing data center resource prices, geographically distributed interactive services may change the anticipated resource plans and even be redeployed in lower priced data centers. Therefore, the design principle of the invention is to respectively optimize the distribution of the geographic distribution interactive service workload and adjust the corresponding resource plan according to the resource requirements.

The invention discloses a geographical distribution interactive service cloud resource collaborative optimization method which comprises the following steps:

step 1: determining the available data centers for each region. Assume that there are R data center regions that provide resources for the operation of the geographically distributed interactive services workload. In each region, there are P cloud service providers and each cloud service provider has deployed a DC_iA data center, wherein i ∈ P. order J_rRepresenting the number of data centers in region r, we can calculate all available data centers in region r using the following formula (1):

step 2: determining that the interactive service can schedule the distribution of all possible data center combinations. Assume that there are m geographically distributed interactive services, each containing more than | R | workloads that need to be distributed to a data center group. For example, a user sends a service request to n (n ≧ R |) data centers, which are referred to as a target data center group. In particular, the n data centers are provided by different regions, i.e., each region needs to provide at least one data center to complete the operation of the workload. Thus, let G denote all possible data center groups provided by the R zones. We use combinatorial mathematical theory to calculate all available data center groups as follows (2):

and step 3: designing and appointing a data center resource providing mode. Because the single use of reserved resources or the single use of on-demand resources requires a high resource cost, both reserved resources and on-demand resources are considered. There are three resource usage scenarios based on this premise: reservation only, on-demand and hybrid only. The step 3 specifically comprises the following steps:

step 3.1: using reserved resource pattern only; the data center provides reserved resources, and the cloud consumer firstly makes a reserved contract and pays a prepaid fee according to the service deployment cycle and the compromised resource requirements; after the interactive service is on line, firstly, the reserved resources are distributed according to the actual requirements of the service resources, and if the reserved resources can meet the requirements of the service quality in the deployment period, the service quality requirements of the interactive service can be met only by reserving the resources.

Step 3.2: using only on-demand resource patterns; the actual demand of the service, which is greater than the lease maximum available resource amount due to the change of the request rate, results in the service quality not reaching the requirement, and the additional on-demand resources can be leased on the basis of the reserved resources to reach the requirement of the service quality, which is called a hybrid phase (reserved and on-demand).

Step 3.3: a hybrid mode. Cloud consumers prefer short term deployment of on-demand resources, which only are provided by data centers.

Determining a probability distribution model of price demand corresponding to a stochastic programming algorithm, wherein the price of data center resources is dynamically changed, and the resource demand of geographic distribution interactive services is dynamically changed according to a service request rate, the uncertainty of the price and the demand can be processed by utilizing the stochastic programming, the stochastic programming adopts a group of uncertain parameters described by probability distribution to decompose a problem into a main problem and a series of sub-problems related to probability, solving the main problem can obtain a primary result, then solving the sub-problems one by one to compensate the error of the primary solution of the main problem, and the step 4 comprises the following steps of determining the probability distribution of each sub-problem of the stochastic programming, providing a plan according to the three resources, analyzing the probability attributes of the price and the demand, and leading Λ to be used for solving the sub-problems one by one to compensate the error of the_tRepresenting a set of price and demand cases in each RPP (resource plan) time T, Λ is defined as the following formula (3) for all price and demand cases in one WDD (workload schedule) time T:

Λ, e.g., set Λ represents a finite number of price and demand cases, denoted P_λ∈[0,1]Show every kind of situationProbability of occurrence of a condition, where λ is a complex variable, λ ═ λ (λ)₁,λ₂,…,λ_|T|)∈Λ。

And 5: and determining a price model according to the resource providing plan formulated in the step 3. The step 5 specifically comprises the following steps:

step 5.1: a reservation contract prepaid price is determined. Suppose a cloud consumer makes a reserved resource contract k at data center j for workload i, the contract k describing a particular configuration of VM instances, let

Indicating the VM instance to resource type

Is required by

Representing resource types

The unit price of (c). Adding the resource type prices required by the VM examples to obtain the prepayment price of the work load i for establishing the reservation contract k in the data center j

Step 5.2: a dynamic price for the actual running VM instance is determined. When the VM instance is actually used under the lambda condition, the actual use price of the reserved resources and the resources on demand in the time t can be obtained

And

because the actually used resources need to be charged according to the use time, wherein t is a charging period and is also an RPP period, as shown in the following formula (4):

step 5.3: and determining the charging mode and price of the network resources. WAN network resources are billed based on monthly transmission traffic. WAN network resources will be charged at different unit prices when data transmission exceeds the maximum traffic set in the network pricing policy. Let K_nRepresenting the relationship between the amount of data transferred and the price per unit. For example, Amazon EC 2's WAN network price policy is: k_n{ "first 1 GB": "$ 0.000/GB", "greater than 1G less than 10 TB": "$ 0.090/GB", "greater than 10T less than 40 TB": "$ 0.085/GB", "greater than 40TB less than 100 TB": "$ 0.070/GB", "greater than 100TB less than 350 TB": "$ 0.050/GB",. Thus, let

Representing the unit price of WAN network traffic.

If the geographically distributed interactive service is distributed to a data center group, a sufficient number of VM instances (e.g., 16 core CPU, 64G memory and 500G SSD × 10 per VM instance) must be allocated for the workload.

Wherein S denotes the request sources of m geographically distributed interactive services, I denotes the workload set of each interactive service, J_gRepresents all data center collections of data center group G (G ∈ G).

Representing the number of reserved VM instances that make a reservation contract k at data center j for workload i, a cloud consumer may make multiple reservation contracts at one data center.

Representing the amount of network data transmission per charging period of the network resource, where T_nTypically expressed as 12 network resource billing periods a year. X_ijRepresenting a binary variable equal to 1 if workload i is assigned to data center j, and 0 otherwise.

Step 5.5: a dynamic cost of the instance running in real time is determined. As can be seen from equation (5), the resource allocation cost includes three parts: reservation

Prepaid cost of the quantity VM instance; cost of WAN network usage; the cost of actually running the reserved VM instance and running the on-demand VM instance. Because the VM is run at startup as needed, the cost of actually running a VM instance is dynamically variable. We use C^YRepresenting the dynamic cost. If a reserved VM instance

Cannot meet the tail delay requirement of the interactive service: (

<

<Actual demand), then we need extra

The number of VM instances to bring the interactive service to the tail delay requirement is as follows (6):

step 6: the tail delay of the interactive service is defined and used as the delay constraint of the optimization model. The step 6 specifically comprises the following steps:

step 6.1: the computational delay and the network transmission delay are determined. Order to

And

respectively representing data processing delays and WAN network transmission delays. The total delay of the service is calculated using the following equation (7).

Step 6.2: determining a delay threshold value of each request and establishing a delay threshold value matrix. Each service request has a corresponding delay requirement, embodied as a delay threshold. We consider each service request independently and we consider a geographically distributed interactive service to meet the end-of-service delay requirement if all service requests meet the respective delay requirement. Thus, let

And

let L (t) denote the transmit to n (n ═ J), representing the processing delay threshold and WAN network transmission delay threshold, respectively_g|) delay constraint thresholds for service requests of several data centers, we can get the following delay threshold matrix, as shown in equation (8):

step 6.3. Using service level agreement to measure whether the tail delay meets QoS (quality of service) requirements under the resource provision plan determination conditions, since the overhead of predicting the delay of each service request in real time and finally reflecting as resource requirements is very large, especially for a large number of interactive service requests, it is difficult to realize in reality, therefore, we estimate and predict the resource requirements of interactive services using the high percentage tail delay as S L A (service level agreement), and this estimation is based on historical requests.for example, if we set x as the high percentile S L A tail delay constraint, the probability that the service delay does not exceed the threshold value is not less than x%, otherwise the service quality cannot be guaranteed.the high percentile S L A tail delay provides a method for the geographically distributed interactive services to meet the service quality requirements.6.3. specifically, step 6.3.1. using probability statistics to perform probability analysis on historical data to obtain the initial relation between the resource provision plan and S L A tail delay.considering that a request source completes the work load i within t time, the step 6.3.1. the step of performing the probability calculation of the final step of using probability statistics to calculate the following steps:

step 6.3.1.1: and (4) calculating the probability statistical analysis of the delay and the network transmission delay. Order to

Representing the probability that the computation delay of workload i at data center j does not exceed the computation delay threshold, order

Indicating the probability that the WAN network transmission delay does not exceed the transmission delay threshold. Operation of

Represents a method for calculating the processing probability delay of a data center when

If so, the result is 1, otherwise it is 0. Similarly, operate

For estimating the network probabilistic delay, the following equation (9) is used:

the average delay probability values obtained from a large number of requested delay records should be close to the expected value of delay according to the majority rule (LL N). when more requested records are considered in the estimation, the estimation of the delay probability will be more accurate.

Step 6.3.1.2: the probabilistic delay for each request is calculated. The delay of interactive services mainly consists of two parts: a) data center processing delays. b) WAN network transmission delay. The convolution function mathematically produces a third function representing the superposition of the first two functions and is therefore generally considered herein to be the superposition of the computation delay and the network delay. For workload i, the expected probability delay for each service request sent to data center j can be found using a convolution function, as follows (10):

where the operator "", represents a convolution method.

Step 6.3.1.3: and calculating probability tail delay. Interactive services require sending requests to a specific data center group to complete the corresponding workload, so the probability tail delay for all service requests of an interactive service s should be averaged across the data center groups as shown in the following equation (11):

wherein F_sg(t) probabilistic tail delay, F, for sending an Internet request for interactive service s to data center group g_sg(t) as a constrained function of the scheduling decisions and resource provision plans for the interactive services.

And 7: and determining a data center resource scheduling optimization model. Through price modeling and delay constraint modeling, the scheduling of interactive services and resource planning can be obtained by solving the following optimization equation as follows (12) and the like:

wherein (11a) indicates that the workload of each interactive service needs to be allocated to a target data center, (12b) indicates that the tail latency of the interactive service should meet the requirements of the tail latency SA L, (12c) indicates that the allocated resources must be less than or equal to the maximum resource constraint of the data center, (12d) indicates that the number of reserved VM instances actually used does not exceed the maximum constraint of the reservation contract, and (12e-12h) indicates that the number of VM instances is a natural number.

S L A (Service-L event acquisition) Service level Agreement resource allocation plan time (RPP), Workload Distribution Decision time (WDD)

And 7: the overall problem is decomposed into a main problem and a series of sub-problems by using the characteristics of stochastic programming.

The step 7 comprises the following steps:

step 7.1: the deployment cost of the candidate data center for each area is calculated using equation (13). Taking the obtained initial cost as a weight factor of workload scheduling, selecting a data center group with the least cost by utilizing integer programming, scheduling interactive services to an optimized data center group, and residing the interactive services in the data center group in each WDD time interval; meanwhile, at each RPP time, solving the resource plan of the target data center group in the time period by utilizing stochastic programming according to the change of the demand as the following formula (13) and the like:

step 7.2: and determining a main random planning problem and a sub-random planning problem. The main question dispatches the interactive service to the data center group with the lowest cost according to the weight value; a series of sub-questions regarding the probability distribution of price and demand dynamically adjust the resource offering plan according to the dynamic resource demand, as shown in equation (14):

because of the uncertainty in price, the above algorithm needs to be repeated at each WDD time for workload scheduling and resource planning.

And 8: solving an optimization algorithm, recording the resource demand of each interactive service and corresponding delay information, determining the relation between the resource quantity and the delay and predicting the primary workload scheduling; when the service is on-line, the resource demand and the delay are correspondingly changed, and the changes are used for random planning and finally applied to the dynamic adjustment of the resource plan. Step 8 comprises the following steps:

step 8.1: and (4) workload scheduling. Step 9.1 comprises the following steps:

and 8.1.1, firstly, generating probability distribution of VCPU requirements, memory requirements, disk requirements and network utilization rate according to the request records of the geographic distribution interactive service.

Step 8.1.2, taking the generated probability distribution as configuration data, solving a formula (13) and obtaining the corresponding deployment cost of each schedulable data center;

and 8.1.3, initializing a scheduling decision variable, using the initial cost obtained in the step 9.1.2 as a weight value, using a formula (14) to select the data center with the lowest price as a target data center group, wherein the distribution of the workload needs to meet the constraint of the maximum processing capacity of the data center.

Step 8.2: the workload resource plan is adjusted in real time. Step 8.2 comprises the following steps:

step 8.2.1, through scheduling the workload, we can obtain the target data center groups of all interactive services, but the obtained preliminary resource plan is coarse-grained, and the resource plan needs to be adjusted at any RPP time, so that corresponding resource demand probability distribution is generated at each RPP time;

step 8.2.2, taking the generated probability distribution as configuration data, and solving by using the subproblems defined by the formula (15), so as to obtain a resource plan of each workload in the target data center, wherein the resource plan is fine-grained;

step 8.2.3 our algorithm entails selecting a target data center group for the geographically distributed interactive service at each WDD time, and reconfiguring the resource plan at each RPP time, where each WDD time contains multiple RPP times.

The invention selects a plurality of data centers of three cloud service providers of Google computer Engine, Microsoft Azure and Amazon AWS as experimental configuration data, and selects three interactive services of Web Search, Real-time DataAnalysis and Big Query and related service request records as basic test data. The scheduling of workloads and the resource planning of interactive services were separately experimented.

The workload scheduling process comprises the following steps: firstly, generating probability distribution of VCPU requirements, memory requirements, disk requirements and network utilization rate according to request records of geographic distribution interactive services. The experimental results are shown in fig. 1 to 4; taking the generated probability distribution as configuration data, solving the formula (13) and obtaining the corresponding deployment cost of each schedulable data center; and initializing scheduling decision variables, using the obtained initial cost as a weighted value, using the data center with the lowest price as a target data center group, and meeting the constraint of the maximum processing capacity of the data center according to the distribution requirement of the workload.

The flow of the workload resource plan is as follows: through the dispatching of the workload, target data center groups of all interactive services can be obtained, but the obtained preliminary resource plan is coarse-grained, and the resource plan needs to be adjusted at any RPP time, so that corresponding resource demand probability distribution is generated at each RPP time; solving by using the generated probability distribution as configuration data and utilizing the subproblems defined by the formula (15), so as to obtain a resource plan of each workload in the target data center, wherein the resource plan is fine-grained; a target data center group needs to be selected for the geographically distributed interactive service at each WDD time and the resource plan is reconfigured at each RPP time, where each WDD time contains multiple RPP times.

The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes and modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention.

Claims

1. A geographical distribution interactive service cloud resource collaborative optimization method is characterized by comprising the following steps:

step 1: determining available data centers for each area;

and step 3: designing and appointing a data center resource providing mode;

and 5: determining a price model according to the resource providing mode plan formulated in the step 3;

and 7: determining a data center resource scheduling optimization model;

2. The method for collaborative optimization of cloud resources for geographic distribution interactive services according to claim 1, wherein the step 3 specifically includes the following steps:

3. The method for collaborative optimization of cloud resources for geographic distribution interactive services according to claim 1, wherein the step 5 specifically includes the following steps:

step 5.1: determining a pre-paid price of the reserved contract;

step 5.2: determining a dynamic price of an actual running VM instance;

step 5.3: determining a network resource charging mode and price;

step 5.5: a dynamic cost of the instance running in real time is determined.

4. The method for collaborative optimization of cloud resources for geographic distribution interactive services according to claim 1, wherein the step 6 specifically includes the following steps:

step 6.1: determining a calculation delay and a network transmission delay;

5. The method for collaborative optimization of cloud resources for geographic distribution interactive services according to claim 4, wherein the step 6.3 specifically includes the steps of:

and 6.3.1, performing probability analysis on the historical data by using a probability statistical method to obtain a preliminary relation between the resource providing plan and the service level agreement S L A tail delay.

6. The method for collaborative optimization of cloud resources for geographic distribution interactive services according to claim 5, wherein the step 6.3.1 specifically includes the steps of:

step 6.3.1.2: calculating a probabilistic delay for each request;

step 6.3.1.3: and calculating probability tail delay.

7. The method for collaborative optimization of cloud resources for geographically distributed interactive services according to claim 1, wherein the step 7 comprises the steps of:

8. The method for collaborative optimization of cloud resources for geographically distributed interactive services according to claim 1, wherein the step 8 comprises the steps of:

step 8.1: scheduling the workload;

step 8.2: the workload resource plan is adjusted in real time.

9. The method for collaborative optimization of cloud resources for geographically distributed interactive services according to claim 8, wherein the step 8.1 includes the steps of:

step 8.1.3 initializes the scheduling decision variables.

10. The method for collaborative optimization of cloud resources for geographically distributed interactive services according to claim 8, wherein the step 8.2 includes the steps of:

step 8.2.1, through the dispatching of the workload, the target data center groups of all the interactive services are obtained, but the obtained primary resource plan is coarse-grained, and the resource plan needs to be adjusted in the time of one resource plan RPP, so that corresponding resource demand probability distribution is generated in the time of each resource plan RPP;

step 8.2.3 selects a target data center group for the geographically distributed interactive service at each workload scheduled WDD time, and reconfigures the resource plan at each resource plan RPP time, where each workload scheduled WDD time includes a plurality of resource plan RPP times.