CN110995863B

CN110995863B - Data center load distribution method and system based on load demand characteristics

Info

Publication number: CN110995863B
Application number: CN201911317923.9A
Authority: CN
Inventors: 郑文立; 王浩翔; 李超; 姚斌; 过敏意
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2019-12-19
Filing date: 2019-12-19
Publication date: 2021-07-30
Anticipated expiration: 2039-12-19
Also published as: CN110995863A

Abstract

A data center load distribution method based on load demand characteristics is characterized in that interference classification is carried out on incoming loads through time sequence resource demand characteristics of the loads which are measured in advance, corresponding interference degree indexes are obtained according to resource use conditions of different computing nodes, the loads are distributed to the computing nodes enabling the sum of the interference degree indexes to be minimum through scheduling nodes, and therefore scheduling of interference among the loads is achieved to be minimum. The invention carries out the scheduling of the load by a mode of minimizing the interference between the loads. The method avoids the whole resource idleness of the server caused by the overload of the single resource, and the resource waste and the efficiency reduction caused by the idleness. And simultaneously, the service quality reduction caused by single resource overload is reduced. By calculating the interference generated by the load and avoiding the interference scheduling mode as much as possible, the utilization rates of different resource types are improved in a balanced manner, and the overall resource utilization rate of the data center is improved.

Description

Data center load distribution method and system based on load demand characteristics

Technical Field

The invention relates to a technology in the field of computers, in particular to a data center load distribution method and system based on load demand characteristics.

Background

In a typical data center environment, servers may distribute workloads to computing nodes in a network to perform related services. The administrator of the data center needs to utilize the resources in each computing node to the maximum extent, so that the utilization rate of the resources is maximized. In practice, however, when running a load that requires the same computational resources, resource contention occurs, resulting in a severe degradation of the performance of the program, a phenomenon also referred to as interference. Interference occurs because different loads at the same time request a large amount of the same resources, exceeding the supply of computers, resulting in resource contention. For a particular resource, such as a processor, memory, network bandwidth, etc., if the server allocates a number of workloads that are heavily dependent on the resource, the workloads available in the compute nodes may be overloaded, creating interference. As a result, the overall performance of the computing node is negatively impacted even if other resources in the computing node are nearly idle. In order to alleviate interference and meet the requirements of users in the prior art, a data center usually maintains a low resource utilization rate, which results in resource waste and increased calculation cost.

Disclosure of Invention

The invention provides a data center load distribution method and system based on load demand characteristics, aiming at the defects in the prior art, the load requiring different resources is distributed to the same computer, so that the interference is avoided, the program performance is improved under the condition of ensuring the same resource utilization rate, or the resource utilization rate is improved under the condition of maintaining the program performance without reduction.

The invention is realized by the following technical scheme:

the invention relates to a data center load distribution method based on load demand characteristics, which is characterized in that the interference classification is carried out on the coming loads through the time sequence resource demand characteristics of the measured loads in advance, the corresponding interference degree indexes are obtained according to the resource use conditions of different computing nodes, and the loads are distributed to the computing nodes with the minimum sum of the interference degree indexes by a scheduling node, so that the scheduling of the interference among the loads is minimized.

The time sequence resource demand characteristics are represented by a demand characteristic matrix, the demand characteristic matrix is a 4 x 3 matrix, and the time sequence resource demand characteristics specifically comprise the following components: dividing the life cycle of the load into three time intervals according to the time sequence, namely a loading time interval, an execution time interval and an ending time interval; the demand of the load for the resource is divided into four dimensions, namely: computing resources, memory resources, hard disk resources, and network resources.

And the values of the four dimensions are recorded in the matrix according to the respective demand of the load.

The interference classification is as follows: the method comprises the following steps of recording the resource use condition of future loads to obtain a demand characteristic matrix, namely the interference classification when the loads really arrive, wherein the specific operation process comprises the following steps: the load is independently operated on one physical server, other interferences are eliminated, an operating system instruction is called to read the resource use conditions of the four dimensions of the physical server, and a demand characteristic matrix of the type of load is recorded.

The future load refers to: the data center is working with all possible assigned workloads.

The resource use condition is represented by a maximum resource vector, an allocable resource vector and a used resource matrix, and specifically, the computing node is recorded in the four dimensions according to the actual number of physical computing resources to obtain a four-dimensional maximum resource vector to represent the maximum resource of the computing node; a 4 x N used resource matrix, wherein: n represents the number of working time periods, all elements in the matrix are initialized to zero, and the number of the used resources is obtained by adding the corresponding number of the resources used by the scheduled load, wherein the number of the used resources of the computing node is represented by the used resource matrix; the allocable resource vector at a certain moment is obtained by subtracting the used resource vector at a certain moment from the maximum resource vector.

The allocable resource vector at a certain time is: a column of four-dimensional vectors representing a time instant in the 4 x N used resource matrix.

The interference degree index is as follows: calculating the allocable resource vectors of the node at the time of arrival and two times after the time of arrival according to the load by the method, forming a 4 x 3 matrix by the three allocable resource vectors, and subtracting the matrix by using the demand characteristic matrix of the load, wherein an element larger than 0 represents the interference generated by the resource of the dimension when the load is scheduled on the physical server, and summing the values larger than 0 in the matrix to obtain the interference degree index. This is performed for all the compute nodes, the compute node with the smallest interference level index is selected, and the load is scheduled on it.

The arrival time of the load is as follows: the moment the load is distributed to the data center at which the corresponding computing resources are to be acquired and begin working.

Technical effects

The invention integrally solves the problems of resource idling and performance reduction caused by interference in the data center.

Compared with the prior art, the resource demand of the load is divided into three sections according to the life cycle of the load, and a 4 x 3 matrix is used for representing the resource demand of the life cycle of the load. And the interference calculation of the calculation node is carried out according to the interference calculation, and the load distribution is carried out according to the interference degree index.

The invention carries out the scheduling of the load by a mode of minimizing the interference between the loads. The method avoids the whole resource idleness of the server caused by the overload of the single resource, and the resource waste and the efficiency reduction caused by the idleness. And simultaneously, the service quality reduction caused by single resource overload is reduced. By calculating the interference generated by the load and avoiding the interference scheduling mode as much as possible, the utilization rates of different resource types are improved in a balanced manner, the condition that one resource is already distributed but other resources have a large amount of surplus can not occur, and the overall resource utilization rate of the data center is improved.

Drawings

FIG. 1 is a schematic diagram of a load distribution system of the present invention;

FIG. 2 is a timing diagram illustrating a load distribution method according to the present invention;

fig. 3 and 4 are schematic diagrams of embodiments of the present invention.

Detailed Description

As shown in fig. 1, the present embodiment relates to a data center load distribution system based on load demand characteristics, including: the system comprises a load demand characteristic analysis module, an interference classification module, a computing node resource analysis module, an interference calculation module, a scheduling module and resource monitoring modules located in all nodes, wherein the load demand characteristic analysis module, the interference classification module, the computing node resource analysis module, the interference calculation module and the scheduling module are located in a data center, and the resource monitoring modules are located in all nodes: the load demand characteristic analysis module is connected with the interference classification module and transmits the user load information acquired in real time to the interference classification module after analyzing the acquired demand characteristic information; the interference classification module reads the analysis result of the load demand characteristic analysis module and records the resource demand of each load; the computing node resource analysis module is connected with the resource monitoring module and records the resource use condition and the resource residual condition of the physical server in real time; the interference calculation module is respectively connected with the interference classification module and the calculation node resource analysis module and carries out interference degree index calculation on results obtained by the interference classification module and the calculation node resource analysis module; the scheduling module is connected with the interference calculation module and receives the calculation result, and the load is distributed to the physical server with the minimum interference degree index according to the calculation result.

The load demand characteristic analysis module divides the load into a loading time interval, an execution time interval and an ending time interval according to the life cycle, and each time interval corresponds to four dimensions of computing resources, memory resources, hard disk resources and network resources to form a 4 x 3 demand characteristic matrix.

And the interference classification module reads a demand characteristic matrix corresponding to the load demand characteristic analysis module according to the load.

The computing node resource analysis module commonly represents the resource use condition of each physical server by using a vector and a matrix, and specifically obtains a four-dimensional maximum resource vector to represent the maximum resource of the computing node according to the computing resource, namely the number of physical computing resources with four dimensions of processor frequency, memory size, hard disk speed and network bandwidth; representing the used resource matrix by a 4 x N matrix, wherein: n represents the number of working time periods, the used resource use condition is obtained by monitoring the operating system parameters of the physical server, the future resource use condition is obtained according to the used resources plus the corresponding resources used by the scheduled load, and the allocable resource vector at any moment is obtained by subtracting the corresponding column vector representing the moment in the used resource matrix from the maximum resource vector.

The interference calculation module calculates the allocable resource vectors of the node at the time of arrival and two subsequent times by the method according to the time of arrival of the load, forms a 4 x 3 matrix by the three allocable resource vectors, subtracts the matrix by using the demand characteristic matrix of the load, wherein an element larger than 0 represents the size of interference generated by the resource of the dimension when the load is scheduled on the physical server, and sums the values larger than 0 in the matrix to obtain the interference degree index.

And the scheduling module performs scheduling distribution according to the interference degree index of the load in each physical server, and optimizes the load in the scheduling process by using a greedy algorithm.

The greedy algorithm is as follows: and in each scheduling, the subsequent load is not considered, the current load is scheduled to the server with the minimum generated interference degree index, and when the interference degree indexes of a plurality of physical servers to the load to be scheduled are the same, one physical server is randomly selected for scheduling. This is performed for each incoming load.

As shown in fig. 2, the physical server of the data center obtains the resource usage after analyzing the resource usage: consisting of a maximum resource vector and a used resource matrix. And after the loads to be distributed are subjected to interference classification, a demand characteristic matrix is obtained, the interference degree index of each server is calculated according to the interference degree index calculation method, and then the load is dispatched to the server with the minimum interference degree index by the dispatcher.

Compared with the prior art, the method considers the resource demand change of the load, records according to different stages of the life cycle of the load, expresses the resource demand of the load by a matrix, and calculates the interference according to the matrix.

Through concrete actual experiment, at the concrete environment of data center simulation system, set up 10 ~ 30 physical servers, under the setting of 100 ~ 300 loads of distribution, the experimental data that can obtain are: as shown in fig. 3 and 4, the four technical solutions in the drawings are respectively expressed as (a) Best-Fit scheduling (BFS) preferentially schedules the load on the physical server closest to full load; (b) first time adaptive scheduling (FFS) is used for preferentially scheduling the load on a First physical server capable of meeting the load resource requirement, and if all the servers cannot completely meet the load requirement, random scheduling is performed for one time; (c) random Scheduling (RS) performs completely Random scheduling on all loads; (d) the interference minimization scheduling (MIS) is a scheduling algorithm for minimizing interference between loads, which is proposed in this example. The rate of Service Level Agreement (SLA) violation in fig. 4 represents the proportion of the load that is not completed within a specified time according to the customer requirements, which in practice is most likely caused by a performance degradation due to interference. According to experimental data, the load distribution method for minimizing the interference among the loads can effectively reduce the interference among the loads of the data center, and meanwhile, the SLA violation rate of the data center is effectively reduced.

Compared with the prior art, the method can reduce the inter-load interference by 34% on average and reduce the SLA violation rate by 17% on average.

The foregoing embodiments may be modified in many different ways by those skilled in the art without departing from the spirit and scope of the invention, which is defined by the appended claims and all changes that come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims

1. A data center load distribution method based on load demand characteristics is characterized in that interference classification is carried out on future loads through measuring time sequence resource demand characteristics of the loads in advance, corresponding interference degree indexes are obtained according to resource use conditions of different computing nodes, and the loads are distributed to the computing nodes enabling the sum of the interference degree indexes to be minimum through scheduling nodes, so that scheduling of interference among the loads is minimized;

the time sequence resource demand characteristics are represented by a demand characteristic matrix, the demand characteristic matrix is a 4 x 3 matrix, and the time sequence resource demand characteristics specifically comprise the following components: dividing the life cycle of the load into three time intervals according to the time sequence, namely a loading time interval, an execution time interval and an ending time interval; the demand of the load for the resource is divided into four dimensions, namely: computing resources, memory resources, hard disk resources and network resources;

the interference classification is as follows: the method comprises the following steps of recording the resource use condition of future loads to obtain a demand characteristic matrix, namely the interference classification when the loads really arrive, specifically comprising the following steps: the resource use condition of each physical server is represented by a vector and a matrix together, and specifically, a four-dimensional maximum resource vector is obtained by a computing node according to the number of computing resources, namely the physical computing resources with four dimensions of processor frequency, memory size, hard disk speed and network bandwidth, so as to represent the maximum resource of the computing node; representing the used resource matrix by a 4 x N matrix, wherein: n represents the number of working time periods, the used resource use condition is obtained by monitoring the operating system parameters of the physical server, the future resource use condition is obtained according to the used resources plus the corresponding resources used by the scheduled load, and the allocable resource vector at any moment is obtained by subtracting the corresponding column vector representing the moment in the used resource matrix from the maximum resource vector.

2. A system for implementing the method of any preceding claim, comprising: the system comprises a load demand characteristic analysis module, an interference classification module, a computing node resource analysis module, an interference calculation module, a scheduling module and resource monitoring modules located in all nodes, wherein the load demand characteristic analysis module, the interference classification module, the computing node resource analysis module, the interference calculation module and the scheduling module are located in a data center, and the resource monitoring modules are located in all nodes: the load demand characteristic analysis module is connected with the interference classification module and transmits the user load information acquired in real time to the interference classification module after analyzing the acquired demand characteristic information; the interference classification module reads the analysis result of the load demand characteristic analysis module and calculates and classifies the interference type of each load; the computing node resource analysis module is connected with the resource monitoring module and records the resource use condition and the resource residual condition of the physical server in real time; the interference calculation module is respectively connected with the interference classification module and the calculation node resource analysis module and carries out interference degree index calculation on results obtained by the interference classification module and the calculation node resource analysis module; the scheduling module is connected with the interference calculation module and receives the calculation result of the interference calculation module, and the load is distributed to the physical server with the minimum interference degree index according to the calculation result;

the interference degree index is as follows: and adding the load demand characteristic matrix to the used resource matrix of the corresponding calculation node, and calculating the difference value between each column in the new matrix and the maximum resource vector of the node, wherein the sum of the values of the difference value larger than 0 is the interference degree index.

3. The system of claim 2, wherein the load demand characteristic analysis module divides the load into a loading period, an execution period and an end period according to the life cycle, and each period corresponds to four dimensions of computing resources, memory resources, hard disk resources and network resources to form a 4 x 3 demand characteristic matrix;

the four dimensions are quantized into eleven grades from 0-10 according to respective demand degrees.

4. The system of claim 3, wherein the interference classification module reads the demand signature matrix corresponding to the load demand signature analysis module according to the load.

5. The system according to claim 2, wherein the compute node resource analysis module uses vectors and matrices to jointly represent the resource usage of each physical server, specifically, the compute node obtains a four-dimensional maximum resource vector according to the quantization level of the compute resource to represent the maximum resource of the compute node; and quantifying the distributed computing resources of the three time intervals to obtain a used resource matrix of 4 × N, wherein: n represents the number of working time periods, the used resource usage is obtained by monitoring operating system parameters of the physical server, the future resource usage is obtained according to the used resources plus the corresponding resources used by the scheduled load, and the allocable resource vector is obtained by subtracting the used resource vector at a certain moment from the maximum resource vector.

6. The system of claim 2, wherein the interference calculation module adds the requirement characteristic matrix of each load to the used resource matrix of the corresponding calculation node, and calculates the difference between each column in the new matrix and the maximum resource vector of the node, wherein the difference is greater than 0, and the sum of the values is the interference degree index.

7. The system of claim 2, wherein the scheduling module performs optimal scheduling according to the interference degree index of each load, a greedy algorithm is used, during each scheduling, the current load is scheduled to the server with the smallest generated interference degree index without considering the subsequent load, and when the interference degree indexes of a plurality of physical servers to the load to be scheduled are the same, one physical server is randomly selected for scheduling.