CN107222540B - Negative feedback-based server cluster grouping scheduling method

Info

Publication number: CN107222540B
Application number: CN201710416047.XA
Authority: CN
Inventors: 张家琦; 贺欣; 邹昕; 王啸; 王子厚; 尚秋里; 刘培朋; 涂波; 刘丙双; 戴帅夫; 何清林; 马秀娟
Original assignee: National Computer Network and Information Security Management Center
Current assignee: National Computer Network and Information Security Management Center
Priority date: 2017-06-06
Filing date: 2017-06-06
Publication date: 2020-11-20
Anticipated expiration: 2037-06-06
Also published as: CN107222540A

Abstract

The invention discloses a server cluster grouping scheduling method based on negative feedback. The method comprises the following steps: 1) calculating the optimal server operation number of the k-th period according to the historical operation state of the server in the k-th period; 2) obtaining, in a negative feedback manner, the number of servers to be started in the (k+1)-th period according to the optimal server operation number and the operation state of the server in the k-th period. By collecting statistics on historical operating conditions and automatically adjusting the number of started servers through negative feedback, the invention improves the resource utilization and energy efficiency of the server cluster.

Description

Negative feedback-based server cluster grouping scheduling method

Technical Field

The invention relates to a scheduling method for a server cluster, and in particular to a negative feedback-based method and system for server clusters.

Background

With the rapid development of network technologies such as cloud computing and the Internet of Things, server cluster technology has emerged.

A server cluster is generally a system in which a plurality of servers are connected through a high-speed network. Because it offers high performance, high availability and a high performance-to-cost ratio, it is widely used. However, as data centers continue to grow in scale, server energy consumption accounts for a large share of enterprise investment, and energy-saving scheduling of server clusters has become a problem of wide concern in the industry.

A server cluster also faces the problem of dynamic resource management when load intensity changes. Without a grouping-based cluster scheduling method, servers may be switched on and off too frequently and load may become unbalanced, which reduces resource utilization and performance.

In server cluster grouping scheduling, the number of servers to be started is an important parameter: if too many servers are started, server utilization cannot be improved; if too few are started, the system requirements cannot be met.

Disclosure of Invention

In view of the above problems, the present invention provides a server cluster group scheduling method based on negative feedback, which improves resource utilization and energy efficiency of a server cluster by counting historical operating conditions and automatically adjusting the number of servers that are started in a negative feedback manner.

The technical scheme of the invention is as follows:

A server cluster grouping scheduling method based on negative feedback comprises the following steps:

1) calculating the optimal server operation number of the k-th period according to the historical operation state of the server in the k-th period;

2) obtaining, in a negative feedback manner, the number of servers to be started in the (k+1)-th period according to the optimal server operation number and the operation state of the server in the k-th period.

Further, the method for calculating the optimal server operation number comprises the following steps: let the CPU utilization and the processing performance of the servers acquired in the k-th period be $c = [c_1, c_2, \ldots, c_i, \ldots, c_n]$ and $p = [p_1, p_2, \ldots, p_i, \ldots, p_n]$ respectively, where $n$ is the total number of servers operating in the k-th period, $c_i$ is the CPU utilization of the i-th server, $p_i$ is the processing performance of the i-th server, $c_i \in (0, 1)$, and $p_i = 0$ or $1$; according to the value of $p_i$, divide the set $c$ into two sets $c^0$ and $c^1$, that is, add the CPU utilization of each server whose processing performance $p_i$ is 0 to the set $c^0$, and add the CPU utilization of each server whose processing performance $p_i$ is 1 to the set $c^1$; then divide the CPU utilization range into a plurality of intervals and calculate the optimal operating parameter $c_o$ of the k-th period according to $c_o = \arg\max_i \left[ N_i^0 / N_i^1 \right] \cdot h$, where $N_i^0$ is the number of samples in the set $c^0$ whose CPU utilization falls in the i-th interval, $N_i^1$ is the number of samples in the set $c^1$ whose CPU utilization falls in the i-th interval, and $h$ is the interval coefficient.

Further, the processing performance is a packet loss rate, and is set to be 0 when there is a packet loss, and is set to be 1 when there is no packet loss.

Further, the interval coefficient h takes a value of 0.1.

Further, the method for obtaining the number of servers to be started in the (k+1)-th period in a negative feedback manner comprises: calculating the difference between the average CPU utilization of the servers in the k-th period and the optimal CPU utilization of the k-th period; if the difference is greater than 0, increasing the number of started servers in the (k+1)-th period, and otherwise reducing the number of started servers.

Further, the number M(k+1) of servers to be started in the (k+1)-th period is calculated using a formula [formula not reproduced in the source text], wherein M(k) is the number of servers that have been turned on in the k-th period, [symbol not reproduced in the source text] is the average CPU utilization in the k-th period, $c_o$ is the optimal operating parameter of the k-th period, and α is the adjustment step size of the negative feedback.

The invention has two key technologies:

1) deriving the optimal parameters of server operation statistically from the historical operation state of the server;

2) obtaining, in a negative feedback manner, the number of servers to be started according to the optimal parameters of server operation and the current operation state.

In view of the above, the main contents of the present invention are as follows:

Acquiring the running state of the server in real time: the CPU utilization and the processing performance of each server are collected at fixed time intervals, where the processing performance includes, but is not limited to, the network card packet loss rate. The processing performance and the server running state information should be synchronized in time.
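The synchronization requirement above can be illustrated with a minimal Python sketch. It assumes two hypothetical reader callables, `read_cpu` and `read_packet_loss`, which stand in for whatever monitoring interface a deployment actually provides; the `Sample` record and the function name are likewise illustrative and do not come from the patent. Both readings for a collection round share one timestamp, which is one simple way to keep them aligned in time.

```python
import time
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Sample:
    """One synchronized observation of a server (hypothetical record type)."""
    timestamp: float        # shared by every reading taken in the same round
    server_id: str
    cpu_utilization: float  # c_i, expected to lie in (0, 1)
    packet_loss: bool       # True if the network card dropped packets


def collect_samples(
    server_ids: Sequence[str],
    read_cpu: Callable[[str], float],          # assumed monitoring hook
    read_packet_loss: Callable[[str], bool],   # assumed monitoring hook
    clock: Callable[[], float] = time.time,
) -> List[Sample]:
    """Read CPU utilization and packet-loss state for every server in one round."""
    now = clock()  # a single timestamp keeps both metrics aligned
    return [
        Sample(now, sid, read_cpu(sid), read_packet_loss(sid))
        for sid in server_ids
    ]
```

A scheduler would call `collect_samples` once per sampling interval and append the result to the history used for the statistics.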

Counting the optimal operation parameters of the server:

Let the CPU utilization and the processing performance of the servers acquired over a period of time be $c = [c_1, c_2, \ldots, c_i, \ldots, c_n]$ and $p = [p_1, p_2, \ldots, p_i, \ldots, p_n]$, where $c_i \in (0, 1)$ and $p_i = 0$ or $1$. According to the value of the corresponding $p_i$, $c$ is divided into two sets $c^0$ and $c^1$. The CPU utilization range is divided into 10 intervals, and the number of samples of each set that fall in each interval is counted.

The optimal operating parameter is:

$c_o = \arg\max_i \left[ N_i^0 / N_i^1 \right] \cdot h$, where $h$ is the interval coefficient, which takes the value 0.1 in the invention; this coefficient ensures that $c_o$ lies in (0, 1).
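The statistic above can be sketched in Python as follows. Several details are assumptions that the text does not fix: intervals are numbered from 1, the i-th interval covers utilizations from (i-1)·h up to i·h, and intervals with no samples in $c^1$ are skipped to avoid division by zero. The function name is illustrative. The code splits samples purely on the 0/1 flag $p_i$, so it does not depend on which flag value denotes packet loss.

```python
from collections import Counter
from typing import Optional, Sequence


def optimal_operating_parameter(
    c: Sequence[float], p: Sequence[int], h: float = 0.1
) -> Optional[float]:
    """Return c_o = argmax_i [N_i^0 / N_i^1] * h, or None if no ratio is defined."""
    n_bins = round(1 / h)
    n0: Counter = Counter()  # N_i^0: samples of c^0 (p_i == 0) per interval
    n1: Counter = Counter()  # N_i^1: samples of c^1 (p_i == 1) per interval
    for c_i, p_i in zip(c, p):
        # Assumed 1-based interval index; clamp so c_i near 1 stays in range.
        i = min(int(c_i * n_bins) + 1, n_bins)
        if p_i == 0:
            n0[i] += 1
        else:
            n1[i] += 1
    # Assumption: intervals with N_i^1 == 0 are skipped (ratio undefined).
    ratios = {i: n0[i] / n1[i] for i in range(1, n_bins + 1) if n1[i] > 0}
    if not ratios:
        return None
    best_interval = max(ratios, key=ratios.get)
    return best_interval * h


# Interval 4 holds three p=0 samples and one p=1 sample (ratio 3);
# interval 7 holds one p=0 sample and three p=1 samples (ratio 1/3).
c_hist = [0.35, 0.32, 0.38, 0.31, 0.65, 0.62, 0.68, 0.61]
p_hist = [0, 0, 0, 1, 0, 1, 1, 1]
c_o = optimal_operating_parameter(c_hist, p_hist)
```

In this example interval 4 has the largest ratio, so `c_o` is approximately 0.4. A production version would need an explicit policy for empty intervals, since skipping them is only one of several reasonable choices.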

Determining the current optimal server number in a negative feedback mode:

After each sampling is finished, the difference between the current average CPU utilization and the optimal CPU utilization is calculated. If the current average CPU utilization is higher than the optimal CPU utilization (the difference is greater than 0), the number of started servers is increased; otherwise, the number of servers is reduced. The specific calculation formula is as follows:

[formula not reproduced in the source text]

wherein M(k+1) is the number of servers that are turned on in the next operation cycle, M(k) is the number of servers that are currently turned on, [symbol not reproduced in the source text] is the average of the current CPU utilization, $c_o$ is the calculated optimal operating parameter, and α is the step size.
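Because the update formula itself is not reproduced in this text, the following Python sketch uses an assumed proportional form, M(k+1) = M(k) + α·(c̄ − c_o), where c̄ denotes the average CPU utilization. The rounding to an integer and the clamping to the range from 1 to the total number of servers are also assumptions added for illustration. The sketch only demonstrates the stated behavior: a positive difference raises the server count and a non-positive difference lowers or keeps it.

```python
from typing import Sequence


def next_server_count(
    m_k: int,
    cpu_utilizations: Sequence[float],
    c_o: float,
    alpha: float,
    total_servers: int,
) -> int:
    """Negative feedback step under an ASSUMED update rule (not from the source)."""
    avg = sum(cpu_utilizations) / len(cpu_utilizations)  # average utilization
    error = avg - c_o                 # > 0 means the cluster is overloaded
    proposed = m_k + alpha * error    # assumed proportional adjustment
    # Assumption: keep at least one server on and never exceed the cluster size.
    return max(1, min(total_servers, round(proposed)))


# Overloaded: average 0.72 against an optimum of 0.6, so the count goes up.
up = next_server_count(10, [0.70, 0.74], c_o=0.6, alpha=20, total_servers=20)
# Underloaded: average 0.45 against an optimum of 0.6, so the count goes down.
down = next_server_count(10, [0.40, 0.50], c_o=0.6, alpha=20, total_servers=20)
```

Here α acts as a gain in servers per unit of utilization error; a larger α reacts faster but risks oscillating between too many and too few servers.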

Compared with the prior art, the invention has the following positive effects:

The invention can adaptively learn the optimal system operating parameters and dynamically schedule the system so that it runs in its optimal working state, thereby achieving a better performance-to-power ratio.

Drawings

Fig. 1 is a system block diagram of the negative feedback-based server cluster grouping scheduling method of the present invention.

Fig. 2 is a schematic diagram of an optimal server number calculation algorithm for server grouping based on negative feedback in the present invention.

Detailed Description

The method of the present invention is described in detail below with reference to the accompanying drawings and examples.

As shown in fig. 1, the workflow of the method of the present invention in this embodiment is:

Step 101: in the system initialization stage, the load acquisition period and the adjustment step size α of the negative feedback are initialized, and all servers are started by default; the system then begins to synchronously acquire the running state and performance of each server.

Step 102: after the end of each acquisition cycle, the CPU utilization of each core is added to $c^0$ or $c^1$: CPU utilization without packet loss is added to $c^0$, and CPU utilization with packet loss is added to $c^1$. The processing performance is represented by 0 and 1 according to whether there is packet loss: 0 if there is no packet loss, and 1 if there is packet loss.

Step 103: according to c_o＝argmax_i[N_i ⁰/N_i ¹]0.1 calculate optimal server operational parameters.

Step 103: calculate the average CPU utilization of the current servers and record it as [symbol not reproduced in the source text].

Step 104: calculate the number of servers to be started in the next period according to the following formula: [formula not reproduced in the source text].

Step 105: use a dynamic load balancing method to concentrate the traffic onto the M(k+1) servers.
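The final step, concentrating traffic onto the M(k+1) selected servers, can be sketched as below. The text does not say which dynamic load balancing method is used, so this sketch substitutes a simple least-loaded assignment as a stand-in. Choosing the first M(k+1) servers of the list as the active group, and the function and parameter names, are illustrative assumptions.

```python
import heapq
from typing import Dict, Hashable, Iterable, Sequence


def assign_flows(
    server_ids: Sequence[str], m_next: int, flow_ids: Iterable[Hashable]
) -> Dict[Hashable, str]:
    """Send every flow to the least-loaded server within the active group."""
    # Assumption: the active group is simply the first m_next servers.
    active = list(server_ids[:m_next])
    if not active:
        raise ValueError("at least one server must be active")
    # Heap entries are (assigned flow count, tie-break index, server id).
    heap = [(0, idx, sid) for idx, sid in enumerate(active)]
    heapq.heapify(heap)
    assignment: Dict[Hashable, str] = {}
    for flow in flow_ids:
        load, idx, sid = heapq.heappop(heap)  # currently least-loaded server
        assignment[flow] = sid
        heapq.heappush(heap, (load + 1, idx, sid))
    return assignment


# Five servers exist, but only M(k+1) = 2 receive traffic in this cycle.
plan = assign_flows(["s1", "s2", "s3", "s4", "s5"], 2, range(6))
```

Servers outside the active group receive no flows, so they can be idled or powered down until a later cycle raises M(k+1) again.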

The above embodiments are only for illustrating the technical solution of the present invention and not for limiting the same, and a person skilled in the art can make modifications or equivalent substitutions to the technical solution of the present invention without departing from the spirit and scope of the present invention, and the scope of the present invention should be determined by the claims.

Claims

1. A server cluster grouping scheduling method based on negative feedback comprises the following steps:

1) calculating the optimal parameters of the server operation in the k-th period according to the historical operation state of the server in the k-th period; the method for calculating the optimal parameters of the server operation comprises the following steps: letting the CPU utilization and the processing performance of the servers acquired in the k-th period be $c = [c_1, c_2, \ldots, c_i, \ldots, c_n]$ and $p = [p_1, p_2, \ldots, p_i, \ldots, p_n]$ respectively, wherein $n$ is the total number of servers operating in the k-th period, $c_i$ is the CPU utilization of the i-th server, $p_i$ is the processing performance of the i-th server, $c_i \in (0, 1)$, and $p_i = 0$ or $1$; according to the value of $p_i$, dividing the set $c$ into two sets $c^0$ and $c^1$, that is, adding the CPU utilization of each server whose processing performance $p_i$ is 0 to the set $c^0$, and adding the CPU utilization of each server whose processing performance $p_i$ is 1 to the set $c^1$; then dividing the CPU utilization range into a plurality of intervals and calculating the optimal operating parameter $c_o$ of the k-th period according to $c_o = \arg\max_i \left[ N_i^0 / N_i^1 \right] \cdot h$; wherein $N_i^0$ is the number of samples in the set $c^0$ whose CPU utilization falls in the i-th interval, $N_i^1$ is the number of samples in the set $c^1$ whose CPU utilization falls in the i-th interval, and $h$ is an interval coefficient;

2) obtaining the number of servers to be started in the (k+1)-th period in a negative feedback manner according to the optimal parameters of the server operation and the operation state of the server in the k-th period; the method for obtaining the number of servers to be started in the (k+1)-th period in a negative feedback manner comprises: calculating the difference between the average CPU utilization of the servers in the k-th period and the optimal CPU utilization of the k-th period; if the difference is greater than 0, increasing the number of started servers in the (k+1)-th period, and otherwise reducing the number of started servers.

2. The method of claim 1, wherein the processing performance is a packet loss ratio, and when there is a packet loss, the processing performance takes a value of 0, and when there is no packet loss, the processing performance takes a value of 1.

3. The method of claim 1, wherein the interval coefficient h is 0.1.

4. The method of claim 1, wherein the number M(k+1) of servers to be started in the (k+1)-th period is calculated using a formula [formula not reproduced in the source text]; wherein M(k) is the number of servers turned on in the k-th period,