CN111143059B - Improved Kubernetes resource scheduling method - Google Patents

Improved Kubernetes resource scheduling method

Info

Publication number
CN111143059B
CN111143059B (application CN201911305242.0A)
Authority
CN
China
Prior art keywords
node
score
memory
millicpu
pod
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911305242.0A
Other languages
Chinese (zh)
Other versions
CN111143059A (en)
Inventor
杨晋生
熊衍捷
高镇
李�根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201911305242.0A priority Critical patent/CN111143059B/en
Publication of CN111143059A publication Critical patent/CN111143059A/en
Application granted granted Critical
Publication of CN111143059B publication Critical patent/CN111143059B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals

Abstract

The invention relates to Kubernetes (K8s) operation and maintenance technology and aims to improve network service quality while ensuring efficient utilization of resources. The technical scheme adopted by the invention is an improved Kubernetes resource scheduling method comprising the following steps: 1) the quality-of-service-oriented scheduling algorithm BalanceQoSPriority is used to remedy the platform's insufficient consideration of the distribution of quality of service (QoS) classes; 2) after the pre-selection (Predicates) screening, the preferred (Priorities) scoring algorithms are weighted and combined so that the two optimization targets are realized together. The invention is mainly applied to occasions requiring reasonable allocation of network resources.

Description

Improved Kubernetes resource scheduling method
Technical Field
The invention relates to Kubernetes (K8s) operation and maintenance technology, and in particular to an improved Kubernetes resource scheduling method.
Background
Cloud Computing integrates computing resources, storage resources, network resources, data resources and software resources, and provides computing services over the Internet in an on-demand manner. Users can rent different types of resources to meet their own computing needs, such as Virtual Machines (VMs), Containers, dedicated hardware or bare-metal resources. Container technology is a lightweight virtualization technology that packages code together with all of its dependencies, so that applications can be migrated quickly and reliably across different computing environments without starting any virtual machine.
Kubernetes, abbreviated as K8s (the numeral 8 replaces the eight letters "ubernete"), is an open-source system for managing containerized applications across multiple hosts in a cloud platform. K8s is an open-source container orchestration platform developed by Google; it organizes containers into a cluster and exposes containerized application services that are easy to manage and use. K8s adopts a master-slave cluster architecture in which a master node (Master) manages a plurality of child nodes (Nodes); the master node is the central nervous system of cluster management and the access entry through which the cluster is controlled. The Kubernetes Scheduler is a scheduler running on the master node that assigns Pods (a Pod is a combination of one or more containers) as the basic unit to suitable child nodes. The default scheduling algorithms can be divided into two categories according to the order in which they run: pre-selection algorithms (Predicates) and preferred algorithms (Priorities). The pre-selection algorithms are executed first and use hard criteria, such as whether a node has enough disk space, enough computing resources, and labels tolerated by the Pod, to form a list of potential candidate child nodes. The preferred algorithms then score the candidates, and the Pod is scheduled to the child node with the highest score. The preferred algorithms provided by default mainly include LeastRequestedPriority (least-request algorithm), BalancedResourceAllocation (balanced resource allocation algorithm), ImageLocality (node image score), and so on. LeastRequestedPriority takes the CPU and memory (MEM) requested by the Pod as input parameters; for each candidate child node it subtracts the requested amounts from the node's available computing resources, converts the remainders to percentages, and averages the two percentages as the total score. BalancedResourceAllocation emphasizes the balance of resource usage: the closer the used CPU percentage and memory percentage are to each other, the higher the score. ImageLocality is scored according to whether a child node already has the container images required by the Pod and the size of those images. In addition to the scheduling algorithms provided by the platform by default, many researchers and enterprises have proposed algorithms such as preemptive resource scheduling algorithms, neural-network-based resource scheduling algorithms, models based on ARIMA (Autoregressive Integrated Moving Average), and the quality-of-service-oriented BalanceQoSPriority resource scheduling algorithm.
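For illustration, the scoring behaviour of two of the default preferred algorithms described above can be sketched as follows. This is a minimal Python sketch written for this description, not the actual Kubernetes scheduler source; the function names and the 0-10 score scale are assumptions.

def least_requested_score(requested_cpu, requested_mem, allocable_cpu, allocable_mem):
    # LeastRequestedPriority: the larger the fraction of CPU and memory left
    # after subtracting the Pod's requests, the higher the score.
    cpu_fraction = (allocable_cpu - requested_cpu) / allocable_cpu
    mem_fraction = (allocable_mem - requested_mem) / allocable_mem
    return 10 * (cpu_fraction + mem_fraction) / 2

def balanced_resource_score(requested_cpu, requested_mem, allocable_cpu, allocable_mem):
    # BalancedResourceAllocation: the closer the CPU and memory utilization
    # fractions are to each other, the higher the score.
    cpu_used = requested_cpu / allocable_cpu
    mem_used = requested_mem / allocable_mem
    return 10 * (1 - abs(cpu_used - mem_used))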
Disclosure of Invention
In the prior art, the LeastRequestedPriority algorithm provided by Kubernetes by default distributes Pods to the Node nodes with the most abundant computing resources and spreads working units evenly, but it does not consider the degradation of service quality that may be caused by an unbalanced distribution of QoS (Quality of Service) classes; the existing BalanceQoSPriority, in turn, only tries to balance QoS classes as far as possible and cannot guarantee efficient utilization of resources. In order to overcome these problems, the technical scheme adopted by the invention is an improved Kubernetes resource scheduling method comprising the following steps:
1) using the quality-of-service-oriented scheduling algorithm BalanceQoSPriority to remedy the platform's insufficient consideration of the distribution of quality of service (QoS) classes;
2) after the pre-screening of the pre-selection (Predicates) algorithms, weighting the preferred (Priorities) scoring algorithms so that the two optimization targets are realized together.
The method comprises the following specific steps:
Step 1: read the pre-screening list produced by the pre-selection stage, and take the candidate child Nodes and the Pod (a combination of one or more containers) to be scheduled as the input of the improved scheduling method;
the resource scheduling algorithm of Kubernetes is divided into two flows, pre-selection screening and preferred (Priority) scoring; the pre-screening list produced by pre-selection is read to obtain the Node nodes that can potentially be scheduled, and the Pod to be scheduled is used as the input of the improved resource scheduling method; the QoS class qosClass field, the requested processor quantity requested.MilliCPU field and the requested memory quantity requested.Memory field of the Pod are read, and the allocatable processor quantity allocable.MilliCPU field and the allocatable memory quantity allocable.Memory field of the schedulable Nodes are read;
Step 2: LeastRequestedPriority scoring to obtain the score L_i corresponding to each Node_i;
According to the requested.MilliCPU, requested.Memory, allocable.MilliCPU and allocable.Memory fields extracted in step 1, if allocable.MilliCPU is larger than requested.MilliCPU and allocable.Memory is larger than requested.Memory, LeastRequestedPriority scoring is used to obtain the score L_i corresponding to Node_i, L_i = ((allocable.MilliCPU - requested.MilliCPU)/allocable.MilliCPU + (allocable.Memory - requested.Memory)/allocable.Memory)/2; otherwise, the scheduling procedure is ended and an error message is returned;
Step 3: BalanceQoSPriority scoring to obtain the score B_i corresponding to each Node_i;
Step 4: let Score_i = ω1·L_i + ω2·B_i and arrange the Nodes in descending order of score;
in order to obtain the final Node score ranking list, the scores L_i and B_i of Node_i obtained in step 2 and step 3 are summed with weights to obtain the total score Score_i of Node_i; the weights ω1 and ω2 can be decided by the user according to different scenarios, but they should add up to 1; the invention sets ω1 = ω2 = 0.5;
Step 5: the Node with the highest score is taken as the optimal destination Node, and the Master performs the Binding operation;
the Score_i values obtained in step 4 are arranged in descending order, and the Node with the highest score is taken as the optimal destination Node for Binding.
In step 3, the score B_i corresponding to each Node_i is obtained by BalanceQoSPriority scoring according to the qosClass field extracted in step 1; the specific score calculation steps are as follows:
Step 1.1: extract the QoS class of the Pod to be scheduled and denote it as P;
Step 1.2: traverse each Node, count the number of Pods of class P on the Node and the total number of Pods on the Node, denote them as P_L and P_all respectively, and calculate the corresponding proportion;
Step 1.3: traverse each Node, count the total number of Pods of class P in the cluster and the total number of Pods in the cluster, denote them as C_L and C_all respectively, and calculate the corresponding proportion;
Step 1.4: finally, the score of each Node is calculated as B = 10 × | 1 - (P_L + 1)/(P_all + 1) + (C_L + 1)/(C_all + 1) |, where | | is the rounding-to-integer symbol.
The invention has the characteristics and beneficial effects that:
the invention provides an improved Kubernetes resource scheduling method, which improves default leastrequest priority by comprehensively using leastrequest priority and balanceqosporicity algorithm. The method can not only meet the high-efficiency utilization of the Kubernetes resources and reduce single-point faults to enhance high availability, but also uniformly distribute Pods with different QoS levels and improve the service quality of the platform.
Description of the drawings:
FIG. 1 is a CPU/Memory dispersion distribution.
Fig. 2 is a QoS balance distribution.
FIG. 3 is a comprehensive dispersion distribution.
FIG. 4 is a flow chart of the steps of the present invention.
Detailed Description
In the prior art, the LeastRequestedPriority algorithm provided by Kubernetes by default distributes Pods to the Node nodes with the most abundant computing resources and spreads working units evenly, but the service quality may decrease because the QoS classes are unbalanced; BalanceQoSPriority, in turn, only tries to balance the QoS classes as far as possible and cannot guarantee efficient utilization of resources. The invention overcomes these problems.
The technical scheme of the invention is as follows:
1) The platform's insufficient consideration of the distribution of QoS classes is remedied by using BalanceQoSPriority.
2) The two algorithms are weighted and combined so that the two optimization targets are realized together, thereby not only achieving efficient utilization of computing resources but also improving the service quality of the platform.
Step 1: read the pre-screening list produced by the pre-selection stage, and take the candidate Nodes and the Pod to be scheduled as the input of the improved scheduling method
The resource scheduling algorithm of Kubernetes is divided into two flows, pre-selection screening and preferred (Priority) scoring. The pre-screening list produced by pre-selection is read to obtain the Node nodes that can potentially be scheduled, and the Pod to be scheduled is used as the input of the improved resource scheduling method. A shell script is written on the Master host as a custom scheduler: it obtains resource information by calling the K8s API, uses the jq tool (which can extract a specified JSON field) to read the qosClass field, the requested.MilliCPU field and the requested.Memory field of the Pod to be scheduled, and reads the allocable.MilliCPU field and the allocable.Memory field of the schedulable Nodes, as sketched below;
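As an illustration of this step, a minimal Python sketch of the field extraction is given below. It plays the role of the shell-plus-jq script described above by calling kubectl and parsing its JSON output; the function names, the namespace argument and the simplified unit handling are assumptions made for the example rather than part of the invention, and the Pod is assumed to declare CPU and memory requests.

import json
import subprocess

def kubectl_json(*args):
    # Run a kubectl command and parse its JSON output (stands in for the jq-based shell script).
    out = subprocess.run(["kubectl", *args, "-o", "json"],
                         check=True, capture_output=True, text=True)
    return json.loads(out.stdout)

def to_millicpu(cpu):
    # Convert a Kubernetes CPU quantity such as "500m" or "2" to milliCPU (simplified handling).
    return int(cpu[:-1]) if cpu.endswith("m") else int(float(cpu) * 1000)

def to_bytes(mem):
    # Convert a Kubernetes memory quantity to bytes; only the common Ki/Mi/Gi suffixes are handled.
    for suffix, factor in (("Ki", 1024), ("Mi", 1024 ** 2), ("Gi", 1024 ** 3)):
        if mem.endswith(suffix):
            return int(mem[:-2]) * factor
    return int(mem)

def read_pod_fields(pod_name, namespace="default"):
    # Read qosClass and the summed CPU/memory requests of the Pod to be scheduled.
    pod = kubectl_json("get", "pod", pod_name, "-n", namespace)
    requested_millicpu = sum(to_millicpu(c["resources"]["requests"]["cpu"])
                             for c in pod["spec"]["containers"])
    requested_memory = sum(to_bytes(c["resources"]["requests"]["memory"])
                           for c in pod["spec"]["containers"])
    return pod["status"]["qosClass"], requested_millicpu, requested_memory

def read_node_fields(node_name):
    # Read the allocatable CPU (as milliCPU) and memory (as bytes) of a schedulable Node.
    node = kubectl_json("get", "node", node_name)
    alloc = node["status"]["allocatable"]
    return to_millicpu(alloc["cpu"]), to_bytes(alloc["memory"])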
Step 2: LeastRequestedPriority scoring to obtain the score L_i corresponding to each Node_i
According to the requested.MilliCPU, requested.Memory, allocable.MilliCPU and allocable.Memory fields extracted in step 1, if allocable.MilliCPU is greater than requested.MilliCPU and allocable.Memory is greater than requested.Memory, LeastRequestedPriority scoring is used to obtain the score L_i corresponding to Node_i, L_i = ((allocable.MilliCPU - requested.MilliCPU)/allocable.MilliCPU + (allocable.Memory - requested.Memory)/allocable.Memory)/2; otherwise, the scheduling flow is ended and error information is returned. If the CPU quantity is labelled in numbers of cores it is multiplied by 1000 to obtain MilliCPU, otherwise it is kept unchanged, as in the sketch below;
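A minimal sketch of this step follows, reusing the milliCPU/byte conversions from the previous example; the function name and the use of an exception to signal the error case are assumptions of the example.

def least_requested_L(requested_millicpu, requested_memory,
                      allocable_millicpu, allocable_memory):
    # Step 2: feasibility check followed by the LeastRequestedPriority-style score L_i.
    if allocable_millicpu <= requested_millicpu or allocable_memory <= requested_memory:
        # The Node cannot hold the Pod: end the scheduling flow with an error message.
        raise RuntimeError("insufficient allocable resources on this Node")
    cpu_part = (allocable_millicpu - requested_millicpu) / allocable_millicpu
    mem_part = (allocable_memory - requested_memory) / allocable_memory
    return (cpu_part + mem_part) / 2  # L_i lies between 0 and 1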
Step 3: BalanceQoSPriority scoring to obtain the score B_i corresponding to each Node_i
The score B_i corresponding to each Node_i is obtained by BalanceQoSPriority scoring according to the qosClass field extracted in step 1. The specific score calculation steps are as follows (a sketch is given after the list):
Step 1.1: extract the QoS class of the Pod to be scheduled and denote it as P;
Step 1.2: traverse each Node, count the number of Pods of class P on the Node and the total number of Pods on the Node, denote them as P_L and P_all respectively, and calculate the corresponding proportion;
Step 1.3: traverse each Node, count the total number of Pods of class P in the cluster and the total number of Pods in the cluster, denote them as C_L and C_all respectively, and calculate the corresponding proportion;
Step 1.4: finally, the score of each Node is calculated as B = 10 × | 1 - (P_L + 1)/(P_all + 1) + (C_L + 1)/(C_all + 1) |, where | | is the rounding-to-integer symbol.
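A minimal Python sketch of this BalanceQoSPriority-style scoring, following steps 1.1 to 1.4 above. The input data structure (a mapping from Node name to the list of QoS classes of the Pods running on it) is an assumption of the example, and the "| |" rounding symbol is rendered with round() exactly where the reconstructed formula places it.

def balance_qos_scores(pod_qos_class, pods_by_node):
    # Step 1.1: pod_qos_class is the QoS class P of the Pod to be scheduled.
    # pods_by_node maps a Node name to the QoS classes of the Pods already on it,
    # e.g. {"node1": ["Guaranteed", "BestEffort"], "node2": ["Burstable"]}.
    all_pods = [qos for pods in pods_by_node.values() for qos in pods]
    c_l = sum(1 for qos in all_pods if qos == pod_qos_class)  # P-class Pods in the cluster
    c_all = len(all_pods)                                     # all Pods in the cluster

    scores = {}
    for node, pods in pods_by_node.items():
        p_l = sum(1 for qos in pods if qos == pod_qos_class)  # P-class Pods on this Node
        p_all = len(pods)                                     # all Pods on this Node
        # Step 1.4: B = 10 x | 1 - (P_L+1)/(P_all+1) + (C_L+1)/(C_all+1) |
        value = 1 - (p_l + 1) / (p_all + 1) + (c_l + 1) / (c_all + 1)
        scores[node] = 10 * round(value)
    return scores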
Step 4: let Score_i = ω1·L_i + ω2·B_i and arrange the Nodes in descending order of score
In order to obtain the final Node score ranking list, the scores L_i and B_i of Node_i obtained in step 2 and step 3 are summed with weights to obtain the total score Score_i of Node_i. The weights ω1 and ω2 can be determined by the user according to different scenarios, but they should add up to 1: when ω1 = 1 and ω2 = 0 the algorithm degenerates into the pure LeastRequestedPriority algorithm, and when ω1 = 0 and ω2 = 1 it degenerates into the pure BalanceQoSPriority algorithm. The invention proposes to set ω1 = ω2 = 0.5;
Step 5: the Node with the highest score is taken as the optimal destination Node and Binding is carried out (the Binding operation is executed by the Master)
The Score_i values obtained in step 4 are arranged in descending order, and the Node with the highest score is taken as the optimal destination Node for Binding; a sketch of steps 4 and 5 is given below.
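Finally, a minimal sketch of steps 4 and 5: the two scores are combined with the weights ω1 = ω2 = 0.5 and the Pod is bound to the highest-scoring Node. Creating a v1 Binding manifest through kubectl is shown as one possible way for a script to perform the Binding operation; it is an assumption of this example, not a requirement of the invention.

import subprocess

def schedule(pod_name, namespace, l_scores, b_scores, w1=0.5, w2=0.5):
    # Step 4: Score_i = w1 * L_i + w2 * B_i, arranged in descending order.
    totals = {node: w1 * l_scores[node] + w2 * b_scores[node] for node in l_scores}
    ranking = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)
    best_node = ranking[0][0]

    # Step 5: one possible Binding call, assigning the Pod to best_node.
    binding = (
        "apiVersion: v1\n"
        "kind: Binding\n"
        "metadata:\n"
        f"  name: {pod_name}\n"
        f"  namespace: {namespace}\n"
        "target:\n"
        "  apiVersion: v1\n"
        "  kind: Node\n"
        f"  name: {best_node}\n"
    )
    subprocess.run(["kubectl", "create", "-f", "-"], input=binding, text=True, check=True)
    return best_node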
The effectiveness of the invention is verified by simulation as follows:
the experiment adopts 1 Master (4 cores 4 GB), 3 Nodes (2 cores 2 GB), and deploys 40 Pods, wherein the Pods comprise 10 BestEfforts (lowest level), 10 Guaranideds (highest level) and 20 burst tables (medium level), and the CPUs and memories of the Pod applications are unequal.
The calculation of the CPU dispersion is as shown in formulas (1), (2) and (3).
The Memory dispersion is calculated as shown in formulas (4), (5) and (6).
the QoS balance is calculated using standard deviations of the same QoS level Pod on each Node.
The composite dispersion is defined as the mean of (a) the average of the normalized CPU dispersion and the normalized Memory dispersion and (b) the normalized QoS balance.
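Formulas (1) through (6) are not reproduced in this text; one plausible formalization consistent with the surrounding description, offered here only as an assumption, takes each dispersion as the standard deviation of the per-Node utilization ratios:

D_{cpu} = \sqrt{ \frac{1}{N} \sum_{i=1}^{N} \left( u_i^{cpu} - \bar{u}^{cpu} \right)^2 }, \quad u_i^{cpu} = \frac{\text{requested CPU on Node}_i}{\text{allocatable CPU on Node}_i}

where N is the number of Nodes and \bar{u}^{cpu} is the mean of the u_i^{cpu}; the Memory dispersion D_{mem} would be defined analogously from the per-Node memory utilization ratios.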
The 4 scheduling methods tested in the simulation experiment are: the default scheduling algorithm (shown as Default in the figures), the improved Kubernetes resource scheduling method (shown as QL in the figures), LeastRequestedPriority (shown as Least in the figures), and BalanceQoSPriority (shown as QoS in the figures). The obtained CPU/Memory dispersion is shown in FIG. 1, the obtained QoS balance is shown in FIG. 2, and the obtained composite dispersion is shown in FIG. 3. The experimental results show that the improved Kubernetes resource scheduling method has a smaller composite dispersion, so computing resources are utilized efficiently and QoS classes are distributed evenly.
The foregoing description of the preferred embodiments of the invention is not intended to limit the invention to the precise form disclosed, and any such modifications, equivalents, and alternatives falling within the spirit and scope of the invention are intended to be included within the scope of the invention.

Claims (2)

1. An improved Kubernetes resource scheduling method is characterized by comprising the following steps:
1) using the quality-of-service-oriented scheduling algorithm BalanceQoSPriority to remedy the platform's insufficient consideration of the distribution of quality of service (QoS) classes;
2) after the pre-screening of the pre-selection (Predicates) algorithms, weighting the preferred (Priorities) scoring algorithms so that the two optimization targets are realized together; the method comprises the following specific steps:
Step 1: read the pre-screening list produced by the pre-selection stage, and take the candidate child Nodes and the Pod (a combination of one or more containers) to be scheduled as the input of the improved scheduling method;
the resource scheduling algorithm of Kubernetes is divided into two flows, pre-selection screening and preferred (Priority) scoring; the pre-screening list produced by pre-selection is read to obtain the Node nodes that can potentially be scheduled, and the Pod to be scheduled is used as the input of the improved resource scheduling method; the QoS class qosClass field, the requested processor quantity requested.MilliCPU field and the requested memory quantity requested.Memory field of the Pod are read, and the allocatable processor quantity allocable.MilliCPU field and the allocatable memory quantity allocable.Memory field of the schedulable Nodes are read;
Step 2: LeastRequestedPriority scoring to obtain the score L_i corresponding to each Node_i;
according to the requested.MilliCPU, requested.Memory, allocable.MilliCPU and allocable.Memory fields extracted in step 1, if allocable.MilliCPU is larger than requested.MilliCPU and allocable.Memory is larger than requested.Memory, LeastRequestedPriority scoring is used to obtain the score L_i corresponding to Node_i,
L_i = ((allocable.MilliCPU - requested.MilliCPU)/allocable.MilliCPU + (allocable.Memory - requested.Memory)/allocable.Memory)/2; otherwise, the scheduling flow is ended and error information is returned;
Step 3: BalanceQoSPriority scoring to obtain the score B_i corresponding to each Node_i;
Step 4: let Score_i = ω1·L_i + ω2·B_i and arrange the Nodes in descending order of score;
in order to obtain the final Node score ranking list, the scores L_i and B_i of Node_i obtained in step 2 and step 3 are summed with weights to obtain the total score Score_i of Node_i; the weights ω1 and ω2 can be determined by the user according to different scenarios, but they should add up to 1, and ω1 = ω2 = 0.5 is set;
Step 5: the Node with the highest score is taken as the optimal destination Node, and the Master performs the Binding operation;
the Score_i values obtained in step 4 are arranged in descending order, and the Node with the highest score is taken as the optimal destination Node for Binding.
2. The improved Kubernetes resource scheduling method of claim 1, wherein in step 3 the score B_i corresponding to each Node_i is obtained by BalanceQoSPriority scoring according to the qosClass field extracted in step 1, and the specific score calculation steps are as follows:
Step 1.1: extract the QoS class of the Pod to be scheduled and denote it as P;
Step 1.2: traverse each Node, count the number of Pods of class P on the Node and the total number of Pods on the Node, denote them as P_L and P_all respectively, and calculate the corresponding proportion;
Step 1.3: traverse each Node, count the total number of Pods of class P in the cluster and the total number of Pods in the cluster, denote them as C_L and C_all respectively, and calculate the corresponding proportion;
Step 1.4: finally, the score of each Node is calculated as B = 10 × | 1 - (P_L + 1)/(P_all + 1) + (C_L + 1)/(C_all + 1) |, where | | is the rounding-to-integer symbol.
CN201911305242.0A 2019-12-17 2019-12-17 Improved Kubernetes resource scheduling method Active CN111143059B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911305242.0A CN111143059B (en) 2019-12-17 2019-12-17 Improved Kubernetes resource scheduling method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911305242.0A CN111143059B (en) 2019-12-17 2019-12-17 Improved Kubernetes resource scheduling method

Publications (2)

Publication Number Publication Date
CN111143059A CN111143059A (en) 2020-05-12
CN111143059B true CN111143059B (en) 2023-10-20

Family

ID=70518670

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911305242.0A Active CN111143059B (en) 2019-12-17 2019-12-17 Improved Kubernetes resource scheduling method

Country Status (1)

Country Link
CN (1) CN111143059B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112527486A (en) * 2020-12-17 2021-03-19 航天信息股份有限公司 Scheduling optimization method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106844051A (en) * 2017-01-19 2017-06-13 河海大学 The loading commissions migration algorithm of optimised power consumption in a kind of edge calculations environment
CN109167835A (en) * 2018-09-13 2019-01-08 重庆邮电大学 A kind of physics resource scheduling method and system based on kubernetes
CN109960585A (en) * 2019-02-02 2019-07-02 浙江工业大学 A kind of resource regulating method based on kubernetes
CN110389820A (en) * 2019-06-28 2019-10-29 浙江大学 A kind of private cloud method for scheduling task carrying out resources based on v-TGRU model

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106844051A (en) * 2017-01-19 2017-06-13 河海大学 The loading commissions migration algorithm of optimised power consumption in a kind of edge calculations environment
CN109167835A (en) * 2018-09-13 2019-01-08 重庆邮电大学 A kind of physics resource scheduling method and system based on kubernetes
CN109960585A (en) * 2019-02-02 2019-07-02 浙江工业大学 A kind of resource regulating method based on kubernetes
CN110389820A (en) * 2019-06-28 2019-10-29 浙江大学 A kind of private cloud method for scheduling task carrying out resources based on v-TGRU model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
徐正伦 et al. Research on a quality-of-service optimized scheduling algorithm based on the Kubernetes scheduler. 软件导刊 (Software Guide), 2018, Vol. 17, No. 10, pp. 73-76. *

Also Published As

Publication number Publication date
CN111143059A (en) 2020-05-12

Similar Documents

Publication Publication Date Title
US10701139B2 (en) Life cycle management method and apparatus
Mustafa et al. Resource management in cloud computing: Taxonomy, prospects, and challenges
CN111966500B (en) Resource scheduling method and device, electronic equipment and storage medium
CN108076156B (en) Mixed cloud system based on Chinese cloud product
WO2017080391A1 (en) Network service deployment method and device
Patel et al. Survey on resource allocation strategies in cloud computing
CN110661842B (en) Resource scheduling management method, electronic equipment and storage medium
CN108667859A (en) A kind of method and device for realizing scheduling of resource
Liu et al. Towards a multi-QoS human-centric cloud computing load balance resource allocation method
CN102622264A (en) Multi-virtualization computing platform method in cloud computing
US11042419B2 (en) Cooperative scheduling method and system for computing resource and network resource of container cloud platform
Ma et al. vLocality: Revisiting data locality for MapReduce in virtualized clouds
US11301284B2 (en) Method for managing VNF instantiation and device
CN103677983B (en) The dispatching method and device of application
Chien et al. An efficient virtual machine migration algorithm based on minimization of migration in cloud computing
CN110198364B (en) Container cloud distributed training data communication method based on designated DNS analysis
CN116089009A (en) GPU resource management method, system, equipment and storage medium
CN111143059B (en) Improved Kubernetes resource scheduling method
Saravanakumar et al. An Efficient On-Demand Virtual Machine Migration in Cloud Using Common Deployment Model.
CN111405072B (en) Hybrid cloud optimization method based on cloud manufacturer cost scheduling
CN116680078A (en) Cloud computing resource scheduling method, device, equipment and computer storage medium
Laha et al. Issues, Challenges and Techniques for Resource Provisioning in Computing Environment
Venkata Subba Reddy et al. A dynamic hierarchical load balancing service architecture for cloud data centre virtual machine migration
Du et al. A combined priority scheduling method for distributed machine learning
CN110300192B (en) Method for updating distributed training task connection parameters according to IP distribution table

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant