CN115150277B

CN115150277B - Energy-saving strategy based on dual-threshold hysteresis cluster scheduling mechanism in cloud data center

Info

Publication number: CN115150277B
Application number: CN202210665311.4A
Authority: CN
Inventors: 金顺福; 白小军; 武海星; 崔瑜; 魏士昌
Original assignee: Yanshan University
Current assignee: Yanshan University
Priority date: 2022-06-13
Filing date: 2022-06-13
Publication date: 2023-09-15
Anticipated expiration: 2042-06-13
Also published as: CN115150277A

Abstract

The application provides an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center, which comprises the following steps: dividing all isomorphic servers in a cloud data center into two clusters, namely a basic cluster and a standby cluster, setting a task classifier to classify and track tasks, constructing an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism, describing the working principle of the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism, evaluating the performance index of the working principle, analyzing the quantization behavior of the cloud data center under the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism, searching for the pareto optimal solution, and balancing the overall power consumption of the cloud data center with the average waiting time of real-time tasks. According to the method and the system, the partial servers are dynamically started and stopped according to the system load change, so that idle energy consumption can be effectively reduced, energy consumption expenditure of the cloud data center is greatly reduced, and cloud service providers are helped to select the best compromise solution among different targets according to the preference of the cloud service providers.

Description

Energy-saving strategy based on dual-threshold hysteresis cluster scheduling mechanism in cloud data center

Technical Field

The application belongs to the technical field of cloud computing, and particularly relates to an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center.

Background

In recent years, companies such as Huacheng, aliba, ten, google, amazon and the like invest huge amounts of funds to construct a super-large-scale cloud data center, and China communication institute 'data center white paper 2020' shows that the number of frames of the global data center reaches 750 ten thousand in 2019, and the number of frames of the global data center is increased by 4.3% in a same way. At the same time, the energy consumption of 2018 global data center is estimated to rise to 205TWh, accounting for about 1% of the global power consumption. As is well known, data centers are energy intensive enterprises, and the greatly increased and ever-increasing power costs of ultra-large scale cloud data centers result in more power expenditures, causing various energy conservation and efficiency problems. Achieving energy conservation and low carbon operation of cloud data centers is a key to continuously reducing Power Utilization Efficiency (PUE) and achieving the goal of "carbon neutralization".

To cope with the sudden traffic surge and server overload problems, the existing cloud data centers are built according to annual peak load prediction, however, due to over-allocation of resources, the average server utilization of the global data center is only 12% -18% according to statistics, and in addition, when the server is idle, the idle power consumption is only 50% -60% of the peak power. Thus, one key strategy to effectively reduce data center power consumption is to dynamically adjust the turning on and off of servers, which may be turned off temporarily by wake on lan (WoL) techniques, and to start up the servers when the cloud data center workload increases again substantially. It is clear that there is a significant tradeoff between powering down the idle server to save power consumption and keeping the idle server on to minimize the average latency of tasks.

The cloud computing is realized based on a server cluster technology, the server clusters are pooled through a software means, unified management and scheduling are carried out, and cloud services with the characteristics of elasticity, flexible charging on demand, high availability, high safety and the like are provided for the outside. A cluster is a collection of computing nodes organized together to serve, as if there were only one server, on a client side, and a cluster management system runs thousands of tasks on multiple clusters, which can have tens of thousands of nodes per cluster. Therefore, in order to achieve the energy consumption-performance tradeoff, designing a flexible cluster scheduling mechanism becomes a key place for the elastic expansion of the cloud data center. Different tasks are differentially served in cloud computing, and different service rates are provided for the different tasks, so that the tasks of multiple classes are reasonably scheduled, and a great amount of luxury energy consumption caused by unreasonable task scheduling is reduced. In order to track the whole process of the multi-class tasks being served in the cloud data center, a multi-class customer queuing system is utilized to describe a very close theoretical model of the random behavior of different classes of tasks in the cloud data center, and is very suitable for describing the problem that the multi-class customers compete for the same resources. In the existing research, the poisson process (PoissonArrival Process) is widely used for describing the arrival flows of different tasks, different customers in the poisson process are mutually independent and do not consider the correlation characteristic, however, the information flows in the real cloud data center are generally correlated, such as the common phenomena of sudden flow surge and server overload. For this reason, it is very important to extend the poisson process to a more general markov arrival process (Marked MarkovianArrival Process, MMAP) with its natural association with a multi-class customer queuing system with relevance, the first proposal of the markov arrival process being to characterize a random point process marking different classes of arrival. To date, the literature for modeling analysis of task random behavior, energy conservation, and performance tradeoff strategies in cloud data centers using multi-class customer queuing systems based on markov arrival processes is very sparse. Further, from the optimization approach of the energy saving strategy, some feasible solutions may be optimal from different angles for cloud users and cloud service providers when there are multiple evaluation criteria, while pareto frontier points typically select a set of potential optimal solutions for decision makers to choose the best solution according to their preferences. Therefore, the pareto optimal method is very suitable for solving the multi-objective optimization problem of energy consumption and performance trade-off in the cloud data center. In summary, for the above research background, it is urgent and necessary to find an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center to develop a construction method and a solution idea of an analytical model so that the analytical model is richer and closer to reality.

Disclosure of Invention

Aiming at the defects in the prior art, the application provides an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center. The method comprises the steps of dividing all isomorphic servers in a cloud data center into two clusters, namely a basic cluster and a standby cluster, setting a task classifier to classify and track tasks, constructing an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism, describing the working principle of the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism, evaluating performance indexes below the working principle, analyzing quantization behaviors of the cloud data center under the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism, searching for a pareto optimal solution, and balancing the overall power consumption of the cloud data center with the average waiting time of real-time tasks. According to the method and the system, the partial servers are dynamically started and stopped according to the system load change, so that idle energy consumption can be effectively reduced, energy consumption expenditure of the cloud data center is greatly reduced, and cloud service providers are helped to select the best compromise solution among different targets according to the preference of the cloud service providers.

The application provides an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center, which comprises the following steps:

s1, all isomorphic servers N in one cloud data center _total Dividing into basic clusters N ₁ And standby cluster N ₂ Two clusters, wherein the servers in the basic cluster are always in operation, while the servers in the standby cluster are dynamically turned on and off along with the load change and are controlled by a hysteresis mechanism of double thresholds, wherein the double thresholds comprise an on threshold T ₁ And a shutdown threshold T ₂ ；

S2, setting a task classifier to classify and track the tasks: setting a task classifier to divide tasks reaching a cloud data center into real-time tasks and non-real-time tasks, marking the tasks with different labels, and tracking the service process of each type of tasks in the system by using the labels;

s3, constructing an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism:

s31, when a task arrives at the cloud data center, as long as an idle server exists, the newly arrived task immediately occupies one of the idle servers; if all servers are busy and are real-time tasks, executing step S32, and if all servers are busy and are non-real-time tasks, executing step S33;

s32, if the real-time task queued in the buffer area does not reach the access threshold H, the newly arrived real-time task is added into the buffer area, otherwise, the real-time task leaves the cloud data center and is timely scheduled to other cloud data center nodes;

s33, when one non-real-time task arrives at the cloud data center, the arriving non-real-time task enters a buffer area to wait in a queuing manner without any limitation even if all servers are busy;

s34, when the service of one task is completed, if at least one real-time task exists in the buffer zone, once the server is idle, the first waiting real-time task is served; if the buffer area has only non-real-time tasks, the first waiting non-real-time task is served; for the same class of tasks, the service rule is first come first served;

s4, describing the working principle of the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism;

s5, evaluating performance indexes under the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism;

s6, analyzing quantization behaviors of the cloud data center under the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism: and searching the pareto optimal solution, namely the pareto front point, from the angles of cloud users and cloud service providers respectively, so that the overall power consumption of the cloud data center and the average waiting time of real-time tasks are balanced.

Further, the step S4 specifically includes the following steps:

s41, describing the arrival process of the multi-class task as a Markov arrival process (MMAP) with marks, and respectively obeying the parameters of service time of the real-time task and service time of the non-real-time task as mu ₁ and μ₂ Is used for constructing an MMAP [ K ]]/M[K]/N ₁ +N ₂ A non-preemptive priority queue;

s42, constructing a five-dimensional continuous time Markov chain by considering random behaviors of multiple tasks in the cloud data center:

{(i _t ,w _t ,r _t ,j _t ,v _t ),t≥0} (1)

wherein ,i_t Indicating the number of tasks in the system at time t, and i _t 0, called system waterLeveling; w (w) _t Representing the state of the standby cluster at time t, anIf w _t If 0 indicates that the standby cluster is off, if w _t =1 then indicates that the standby cluster is active; r is (r) _t Indicating the real-time task number in the buffer at time t, and +.>j _t Representing the number of real-time tasks being serviced at time t, and +.>The arrival of tasks is performed by a random process { v _t Guided by t.gtoreq.0 } the random process { v } _t T.gtoreq.0 } is an irreducible continuous time Markov chain, the underlying markov chain, called the MMAP arrival process, has a state space of {0, 1..m }, v _t Represents the underlying Markov chain { v } at time t _t State of t.gtoreq.0 }, and +.>

S43, adopting a pseudo-extinction process, a matrix geometric solution method and a Gaussian-Saidel method to obtain steady-state distribution n of the system:

Π＝(π ₀ ,π ₁ ,π ₂ ,...) (2)

wherein ,π_i A probability vector representing a system level i, and satisfies

π _i ＝(π(i,0),π(i,1)) (3)

Wherein pi (i, 0), pi (i, 1) respectively represent a probability vector when the standby cluster is not turned on and a probability vector when the standby cluster is turned on, and the following are satisfied:

π(i,0)＝(π(i,0,0),π(i,0,1),...,π(i,0,max{0,min{i-N ₁ ,H}})) (4)

π(i,1)＝(π(i,1,0),π(i,1,1),...,π(i,1,max{0,min{i-N ₁ -N ₂ ,H}})) (5)

wherein the method comprises the steps ofPi (i, 0, r), pi (i, 1, r) respectively represent a probability vector of a previous three-dimensional when the standby cluster is not turned on and a probability vector of a previous three-dimensional when the standby cluster is turned on, and r= {0, 1.. _t -N ₁ H }, satisfy:

π(i,0,r)＝(π(i,0,r,0),π(i,0,r,1),...,π(i,0,r,min{i,N ₁ })) (6)

π(i,1,r)＝(π(i,1,r,0),π(i,1,r,1),...,π(i,1,r,min{i,N ₁ +N ₂ })) (7)

where pi (i, w, r, j) represents the probability vector of the first four dimensions, and w= {0,1}, j= {0,1,.. ₁ +N ₂ -meeting the following:

π(i,w,r,j)＝(π(i,w,r,j,0),π(i,w,r,j,1),...,π(i,w,r,j,m)) (8)。

further, the step S5 specifically includes the following steps:

s51, from the perspective of cloud users, solving QoS indexes related to service quality, wherein the QoS indexes related to the service quality comprise average waiting time of real-time tasks, average waiting time of non-real-time tasks and loss rate of real-time tasks; average waiting time W of the real-time task and the non-real-time task _real ,W _non-real The method comprises the following steps of:

wherein ,λ₁ Representing the arrival rate of the real-time task; lambda (lambda) ₂ Representing the arrival rate of the non-real-time task;

loss rate of the real-time taskThe method comprises the following steps:

wherein ,representation (N) ₁ +1)×(N ₁ A unit vector of +1); d (D) ₁ Representing a real-time task arrival matrix; />Representation (N) ₁ +1) (m+1) ×1 column vectors with all elements 1; />Representation (N) ₁ +N ₂ +1)×(N ₁ +N ₂ A unit vector of +1);representation (N) ₁ +N ₂ +1) (m+1) ×1 column vectors with all elements 1; />Represents the kronecker product;

s52, from the perspective of cloud service providers, solving TCO indexes related to overall utilization, wherein the TCO indexes related to overall utilization comprise the activation rate, the system utilization rate and the overall system power consumption of the standby cluster, and the activation rate P of the standby cluster _act The method comprises the following steps:

wherein ,e_i Column vectors representing 1 for all elements of ix1;

the system utilization U _s The method comprises the following steps:

the overall power consumption E of the system is as follows:

wherein ,representing the execution power consumption of the server when processing the real-time task; />Representing the execution power consumption of the server when processing non-real-time tasks; e (E) ^idle Representing idle power consumption of the server; e (E) ^off Representing the power consumption of the server to shut down.

Preferably, the step S6 specifically includes the following steps:

s61, defining the overall power consumption of the cloud data center and the average waiting time of the real-time tasks as key parameter variables N ₁ 、N ₂ 、T ₁ 、T ₂ The functions of H, i.e. E (N) ₁ ,N ₂ ,T ₁ ,T ₂ ,H)、W _real (N ₁ ,N ₂ ,T ₁ ,T ₂ H) represented by a mathematical model of multi-objective optimization:

s62, obtaining the pareto optimal solution, namely the pareto leading edge point, by utilizing a non-dominant ordering genetic algorithm with an improved elite strategy, so as to provide a group of optimal solutions.

Preferably, in step S1, if the number of tasks in the buffer exceeds the threshold T ₁ The servers in the standby cluster will all be activated and remain operational until the total number of tasks in the cloud data center falls to the shutdown threshold T ₂ And migrating the task being serviced in the standby cluster to the basic cluster, and shutting down the standby cluster after all migration.

Preferably, said step S1 takes into account ensuring that once ready for useThe cluster is activated and all tasks in the buffer can be serviced immediately, setting an on threshold T ₁ <N ₂ The method comprises the steps of carrying out a first treatment on the surface of the Setting a shutdown threshold T in consideration of timely shutdown of the standby cluster when the workload of the cloud data center is low ₂ ＝N ₁ -1。

Preferably, in the step S2, a higher non-preemptive priority is given to the real-time task than to the non-real-time task, and an access threshold H (H<T ₁ )。

Compared with the prior art, the application has the technical effects that:

1. aiming at the problem of rapid increase of energy consumption caused by rapid increase of industrial scale of the global data center, the application designs an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in the cloud data center, and considers that partial servers are dynamically started and stopped according to system load change of the servers of the cloud data center so as to effectively reduce idle energy consumption, thereby greatly reducing energy consumption expenditure of the cloud data center; in order to balance the service quality of multi-class tasks and the energy consumption level of a cloud data center, an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism is provided, and a non-preemptive priority queuing model based on a markov arrival process with a mark is constructed aiming at the energy-saving strategy.

2. According to the energy-saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism in the cloud data center, qoS indexes such as average waiting time of real-time tasks and non-real-time tasks, loss rate of real-time tasks and the like, and TCO indexes such as activation probability of standby clusters, system utilization rate and overall system power consumption are obtained by adopting a simulated vanishing process, a matrix geometric solution method and a Gaussian-seidel method, and the average waiting time of real-time tasks and the overall power consumption of the cloud data center are weighed, the pareto front edge point is obtained by utilizing an improved non-dominant ordering genetic algorithm with elite strategy, and a set of optimal solutions are provided for cloud service providers, so that the cloud service providers are helped to select a solution representing optimal compromise among different targets according to preferences of the cloud service providers.

Drawings

Other features, objects and advantages of the present application will become more apparent upon reading of the detailed description of non-limiting embodiments made with reference to the following drawings.

FIG. 1 is a flow chart of a power saving strategy based on a dual threshold hysteresis cluster scheduling mechanism in a cloud data center of the present application;

FIG. 2 is a schematic scheduling diagram of a power saving strategy based on a dual threshold hysteresis cluster scheduling mechanism in a cloud data center of the present application;

fig. 3 is a pareto front point of the average latency of the real-time task versus the overall power consumption of the system of the present application.

Detailed Description

The application is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the application and are not limiting of the application. It should be noted that, for convenience of description, only the portions related to the present application are shown in the drawings.

It should be noted that, without conflict, the embodiments of the present application and features of the embodiments may be combined with each other. The application will be described in detail below with reference to the drawings in connection with embodiments.

Aiming at the problems of energy waste such as large idle energy consumption caused by idle computing nodes and large luxury energy consumption caused by unreasonable task scheduling in the operation process of a cloud data center, an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism is provided.

Fig. 1 and 2 illustrate a power saving strategy based on a dual threshold hysteresis cluster scheduling mechanism in a cloud data center of the present application, the method comprising the steps of:

s1, all isomorphic servers N in one cloud data center _total Dividing into basic clusters N ₁ And standby cluster N ₂ Two clusters, wherein the servers in the basic cluster are always in operation state, while the servers in the standby cluster are dynamically turned on and off along with the load change and are controlled by a hysteresis mechanism of double thresholds, wherein the double thresholds comprise an on threshold T ₁ And a shutdown threshold T ₂ The method comprises the steps of carrying out a first treatment on the surface of the If any of the buffersThe number of transactions exceeds the turn-on threshold T ₁ The servers in the standby cluster will all be activated and remain operational until the total number of tasks in the cloud data center falls to the shutdown threshold T ₂ And migrating the task being serviced in the standby cluster to the basic cluster, and shutting down the standby cluster after all migration.

The turn-on threshold T is set to ensure that all tasks in the buffer can be serviced immediately once the standby cluster is activated ₁ <N ₂ The method comprises the steps of carrying out a first treatment on the surface of the Setting a shutdown threshold T in consideration of timely shutdown of the standby cluster when the workload of the cloud data center is low ₂ ＝N ₁ -1。

S2, setting a task classifier to classify and track the tasks: setting a task classifier to divide tasks reaching a cloud data center into real-time tasks and non-real-time tasks, marking the tasks with different labels, and tracking the service process of each type of tasks in the system by using the labels; giving a higher non-preemptive priority to a real-time task than to a non-real-time task while setting an access threshold H (H<T ₁ )。

Real-time tasks are very sensitive to delay and delay jitter, and are typically performed in some applications where the expiration date is critical, such as autopilot, avionics, robotics, etc. Real-time tasks require quick service completion, while non-real-time tasks in certain application scenarios can tolerate longer delays, such as email systems, wild animal tracking networks, deep space communication networks, etc., requiring differentiated services to guarantee their service level agreements (Service Level Agreement, SLAs) based on the characteristics of each class of task. By differentially serving different classes of tasks, the two classes of tasks are scheduled reasonably at different service rates, improving their quality of service, and reducing luxury energy consumption.

s31, when a task arrives at the cloud data center, as long as an idle server exists, the newly arrived task immediately occupies one of the idle servers; if all servers are busy and are real-time tasks, step S32 is performed, and if all servers are busy and are non-real-time tasks, step S33 is performed.

S32, if the real-time tasks queued in the buffer area do not reach the access threshold H, the newly arrived real-time tasks are added into the buffer area, otherwise, the real-time tasks leave the cloud data center and are timely scheduled to other cloud data center nodes.

And S33, when one non-real-time task arrives at the cloud data center, the arriving non-real-time task enters a buffer zone to wait in a queuing manner without any limitation even if all servers are busy.

S34, when the service of one task is completed, if at least one real-time task exists in the buffer zone, once the server is idle, the first waiting real-time task is served; if the buffer area has only non-real-time tasks, the first waiting non-real-time task is served; for the same class of tasks, the service rules are first come first served.

S4, describing the working principle of an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism.

S41, describing the arrival process of the multi-class task as a Markov arrival process (MMAP) with marks, and respectively obeying the parameters of service time of the real-time task and service time of the non-real-time task as mu ₁ and μ₂ Is used for constructing an MMAP [ K ]]/M[K]/N ₁ +N ₂ Non-preemptive priority queues.

{(i _t ,w _t ,r _t ,j _t ,v _t ),t≥0} (1)

wherein ,i_t Indicating the number of tasks in the system at time t, and i _t 0, called system level; w (w) _t Representing the state of the standby cluster at time t, anIf w _t If 0 indicates that the standby cluster is off, if w _t =1 then indicates that the standby cluster is active; r is (r) _t Representing real-time tasks in a buffer at time tCount and->j _t Representing the number of real-time tasks being serviced at time t, and +.>The arrival of tasks is performed by a random process { v _t Guided by t.gtoreq.0 } the random process { v } _t T.gtoreq.0 } is an irreducible continuous time Markov chain, the underlying markov chain, called the MMAP arrival process, has a state space of {0, 1..m }, v _t Represents the underlying Markov chain { v } at time t _t State of t.gtoreq.0 }, and +.>

Π＝(π ₀ ,π ₁ ,π ₂ ,...) (2)

wherein ,π_i A probability vector representing a system level i, and satisfies

π _i ＝(π(i,0),π(i,1)) (3)

Wherein pi (i, 0), pi (i, 1) respectively represents a probability vector when the standby cluster is not turned on and a probability vector when the standby cluster is turned on, and the following conditions are satisfied:

π(i,0)＝(π(i,0,0),π(i,0,1),...,π(i,0,max{0,min{i-N ₁ ,H}})) (4)

π(i,1)＝(π(i,1,0),π(i,1,1),...,π(i,1,max{0,min{i-N ₁ -N ₂ ,H}})) (5)

wherein pi (i, 0, r), pi (i, 1, r) respectively represent a probability vector of a previous three-dimensional when the standby cluster is not turned on and a probability vector of a previous three-dimensional when the standby cluster is turned on, and r= {0, 1.. _t -N ₁ H }, satisfy:

π(i,0,r)＝(π(i,0,r,0),π(i,0,r,1),...,π(i,0,r,min{i,N ₁ })) (6)

π(i,1,r)＝(π(i,1,r,0),π(i,1,r,1),...,π(i,1,r,min{i,N ₁ +N ₂ })) (7)

π(i,w,r,j)＝(π(i,w,r,j,0),π(i,w,r,j,1),...,π(i,w,r,j,m)) (8)。

s5, evaluating performance indexes under an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism.

S51, from the perspective of cloud users, solving QoS indexes related to service quality, wherein the QoS indexes related to the service quality comprise average waiting time of real-time tasks, average waiting time of non-real-time tasks and loss rate of the real-time tasks; average waiting time W of real-time task and non-real-time task _real ,W _non-real The method comprises the following steps of:

wherein ,λ₁ Representing the arrival rate of the real-time task; lambda (lambda) ₂ Representing the arrival rate of non-real-time tasks.

Loss rate of real-time tasksThe method comprises the following steps:

wherein ,representation (N) ₁ +1)×(N ₁ A unit vector of +1); d (D) ₁ Representing a real-time task arrival matrix; />Representation (N) ₁ +1) (m+1) ×1 column vectors with all elements 1; />Representation (N) ₁ +N ₂ +1)×(N ₁ +N ₂ A unit vector of +1); />Representation (N) ₁ +N ₂ +1) (m+1) ×1 column vectors with all elements 1; />Representing the kronecker product.

S52, from the perspective of cloud service providers, solving TCO indexes related to overall utilization, wherein the TCO indexes related to overall utilization comprise the activation rate of the standby cluster, the utilization rate of the system and the overall power consumption of the system, and the activation rate P of the standby cluster _act The method comprises the following steps:

wherein ,e_i Representing a column vector with 1 for all elements of i x 1.

System utilization U _s The method comprises the following steps:

the overall power consumption E of the system is as follows:

wherein ,representing processing real-time tasksExecuting power consumption of the time server; />Representing the execution power consumption of the server when processing non-real-time tasks; e (E) ^idle Representing idle power consumption of the server; e (E) ^off Representing the power consumption of the server to shut down.

S6, analyzing quantization behaviors of the cloud data center under an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism: and searching the pareto optimal solution, namely the pareto front point, from the angles of cloud users and cloud service providers respectively, so that the overall power consumption of the cloud data center and the average waiting time of real-time tasks are balanced.

s62, obtaining the pareto optimal solution, namely the pareto leading edge point, by utilizing a non-dominant ordering genetic algorithm with an improved elite strategy, so as to provide a group of optimal solutions. In a specific embodiment, the pareto front point of the overall power consumption of the system-average waiting time of the real-time task shown in fig. 3 is obtained, and from the perspective of cloud users, the schemes can improve the service quality of the real-time task and simultaneously save energy consumption to the greatest extent; from the perspective of the cloud service provider, the cloud service provider can choose the desired energy savings effect according to a guaranteed energy savings agreement (Guaranteed Energy SavingsAgreement, GESA) and infer whether the corresponding real-time task latency meets the requirements of its service level agreement.

Aiming at the problem of rapid increase of energy consumption caused by rapid increase of industrial scale of the global data center, the application designs an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in the cloud data center, and considers that partial servers are dynamically started and stopped according to system load change of the servers of the cloud data center so as to effectively reduce idle energy consumption, thereby greatly reducing energy consumption expenditure of the cloud data center; in order to balance the service quality of multi-class tasks and the energy consumption level of a cloud data center, an energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism is provided, and a non-preemptive priority queuing model based on a Markov arrival process with a mark is constructed aiming at the energy-saving strategy; qoS indexes such as average waiting time of real-time tasks and non-real-time tasks in a steady state, loss rate of real-time tasks and the like, TCO indexes such as activation probability of standby clusters, system utilization rate, overall system power consumption and the like are obtained by adopting a pseudo-living process, a matrix geometric solution method and a Gaussian-Saider method, and the average waiting time of the real-time tasks and the overall power consumption of a cloud data center are balanced, and the pareto front edge point is obtained by utilizing an improved non-dominant ordering genetic algorithm with elite strategy, so that a set of optimal solutions are provided for cloud service providers, and the cloud service providers are helped to select solutions representing optimal trade-offs among different targets according to preferences of the cloud service providers.

Finally, what should be said is: the above embodiments are merely for illustrating the technical aspects of the present application, and it should be understood by those skilled in the art that although the present application has been described in detail with reference to the above embodiments: modifications and equivalents may be made thereto without departing from the spirit and scope of the application, which is intended to be encompassed by the claims.

Claims

1. An energy-saving strategy based on a dual-threshold hysteresis cluster scheduling mechanism in a cloud data center is characterized by comprising the following steps of:

s1, all isomorphic servers N in one cloud data center _total Dividing into basic clusters N ₁ And standby cluster N ₂ Two clusters, wherein servers in the base cluster are always in operation, and servers in the standby cluster are in serviceThe server is dynamically turned on and off along with the load change and is controlled by a hysteresis mechanism of double thresholds, wherein the double thresholds comprise an on threshold T ₁ And a shutdown threshold T ₂ ；

2. The energy saving strategy based on the dual threshold hysteresis cluster scheduling mechanism in the cloud data center according to claim 1, wherein the step S4 specifically comprises the steps of:

{(i _t ,w _t ,r _t ,j _t ,v _t ),t≥0} (1)

wherein ,i_t Indicating the number of tasks in the system at time t, and i _t 0, called system level; w (w) _t Representing the state of the standby cluster at time t, anIf w _t If 0 indicates that the standby cluster is off, if w _t =1 then indicates that the standby cluster is active; r is (r) _t Indicating the real-time task number in the buffer at time t, and +.>j _t Representing the number of real-time tasks being serviced at time t, and +.>The arrival of tasks is performed by a random process { v _t Guided by t.gtoreq.0 } the random process { v } _t T.gtoreq.0 } is an irreducible continuous time Markov chain, called MMAP-toThe state space of the bottom markov chain of the reach process is {0,1,..m }, v _t Represents the underlying Markov chain { v } at time t _t State of t.gtoreq.0 }, and

Π＝(π ₀ ,π ₁ ,π ₂ ,...) (2)

wherein ,π_i A probability vector representing a system level i, and satisfies

π _i ＝(π(i,0),π(i,1)) (3)

π(i,0)＝(π(i,0,0),π(i,0,1),...,π(i,0,max{0,min{i-N ₁ ,H}})) (4)

π(i,1)＝(π(i,1,0),π(i,1,1),...,π(i,1,max{0,min{i-N ₁ -N ₂ ,H}})) (5)

π(i,0,r)＝(π(i,0,r,0),π(i,0,r,1),...,π(i,0,r,min{i,N ₁ })) (6)

π(i,1,r)＝(π(i,1,r,0),π(i,1,r,1),...,π(i,1,r,min{i,N ₁ +N ₂ })) (7)

π(i,w,r,j)＝(π(i,w,r,j,0),π(i,w,r,j,1),...,π(i,w,r,j,m)) (8)。

3. the energy saving strategy based on the dual threshold hysteresis cluster scheduling mechanism in the cloud data center according to claim 1, wherein the step S5 specifically comprises the steps of:

loss rate of the real-time taskThe method comprises the following steps:

wherein ,e_i Column vectors representing 1 for all elements of ix1;

the system utilization U _s The method comprises the following steps:

the overall power consumption E of the system is as follows:

4. The energy saving strategy based on the dual threshold hysteresis cluster scheduling mechanism in the cloud data center according to claim 1, wherein the step S6 specifically comprises the steps of:

5. The energy saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism in the cloud data center according to claim 1, wherein in the step S1, if the number of tasks in the buffer exceeds the turn-on threshold T ₁ The servers in the standby cluster will all be activated and remain operational until the total number of tasks in the cloud data center falls to the shutdown threshold T ₂ And migrating the task being serviced in the standby cluster to the basic cluster, and shutting down the standby cluster after all migration.

6. The energy saving strategy based on a dual threshold hysteresis cluster scheduling mechanism in a cloud data center according to claim 1, wherein the step S1 is considered to ensure that all tasks in the buffer are guaranteed once the standby cluster is activatedCan be immediately serviced, set up a turn-on threshold T ₁ <N ₂ The method comprises the steps of carrying out a first treatment on the surface of the Setting a shutdown threshold T in consideration of timely shutdown of the standby cluster when the workload of the cloud data center is low ₂ ＝N ₁ -1。

7. The energy saving strategy based on the dual-threshold hysteresis cluster scheduling mechanism in the cloud data center according to claim 1, wherein the step S2 gives the real-time tasks a higher non-preemptive priority than the non-real-time tasks, while setting the access threshold H (H<T ₁ )。