CN109960585B - Resource scheduling method based on kubernetes - Google Patents
- Publication number
- CN109960585B (application CN201910107749.9A)
- Authority
- CN
- China
- Prior art keywords
- pod
- node
- priority
- queue
- algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
- G06F2209/5021—Priority
Abstract
The invention relates to a resource scheduling method based on kubernetes. The scores of all Nodes in the cluster are calculated by a fixed rule to generate a first Node priority queue, while a Pod priority queue is obtained by a dynamic priority algorithm. The two queues filter out unschedulable Nodes to generate a second Node priority queue, from which the highest-priority Node is selected and bound with the Pod popped from the Pod priority queue. If binding succeeds, the scheduling cycle of the next Pod begins; if it fails, the preferred Node is chosen from the second Node priority queue by a priority algorithm and bound; if binding fails again, no suitable Node can run the Pod, and the scheduling cycle of the next Pod begins. The method comprises static scheduling and dynamic resource load balancing, improves scheduling efficiency, accelerates task deployment, improves task operation integrity and the load balance of the whole cluster, actively adjusts cluster load balance, and improves the resource utilization efficiency of the cluster.
Description
Technical Field
The invention belongs to the technical field of digital information transmission, such as telegraph communication, and particularly relates to a kubernetes-based resource scheduling method for accelerating the deployment efficiency of cluster tasks.
Background
Kubernetes is an open-source container orchestration engine from Google that supports automatic deployment, large-scale scaling and containerized application management, and can manage the state of multiple Nodes in a large-scale cluster and the Pods running on them. When an application is deployed in a production environment, multiple instances of the application are typically deployed to load-balance application requests.
In kubernetes, container virtualization technology is a way of sharing server resources: it can meet the requirement of building customized containers on demand and, unlike traditional virtualization technology, is more flexible and convenient. In addition, a Pod in kubernetes is a set of one or several containers and the minimum unit kubernetes deploys; it logically represents an instance of an application. Kubernetes can manage multiple Pod instances created by users, simplifying operation difficulty and reducing the operation and maintenance cost for personnel.
The most important concerns in the cloud computing field are the efficiency of resource scheduling and the load balance of resource scheduling. In the prior art, the default scheduler selects an optimal Node to run a Pod instance according to Node preselection and priority algorithms, and a preemptive priority scheduling algorithm designed for Pods also exists. However, the Priorities algorithm in the kubernetes default scheduler calculates a score for every Node according to the requirements of every Pod instance, and this calculation process reduces the efficiency of resource scheduling. Meanwhile, the sorting algorithm of the Pod priority queue in kubernetes adopts a static priority policy, which on one hand may cause a large business to monopolize part of the Nodes for a long time and reduce business deployment efficiency, and on the other hand may cause some low-priority Pods to fail to run for a long time and affect the operation of the whole business.
The patent with publication number CN107948330A discloses a load-balancing strategy based on dynamic priority in a cloud environment, which takes the Pod priority algorithm in kubernetes into consideration and improves it into a dynamic priority algorithm to make up for the defect of static priority in kubernetes. However, rescheduling for the kubernetes scheduler mainly occurs in cases such as Pod or Node abnormality, Pod scaling or upgrade, and Node increase or decrease; when the cluster runs stably and none of these abnormalities occur, the load of the cluster is not dynamically adjusted to make the cluster load more balanced.
The patent with publication number CN106790726A discloses a dynamic load-balancing method, but that method only divides all Nodes into a high-load and a low-load queue. Some of those Nodes are already at the average load of the cluster and do not need Pods rescheduled onto them, so the method performs unnecessary scheduling and reduces the efficiency of the system.
Disclosure of Invention
The invention addresses two problems in the prior art: the Priorities algorithm in the kubernetes default scheduler calculates a score for every Node according to the requirements of every Pod instance, reducing the efficiency of resource scheduling; and the sorting algorithm of the Pod priority queue in kubernetes adopts a static priority strategy, so a large business may monopolize part of the Nodes for a long time, reducing business deployment efficiency, while some low-priority Pods may fail to run for a long time, affecting the operation of the whole business. The invention provides an optimized resource scheduling method based on kubernetes.
The technical scheme adopted by the invention is a resource scheduling method based on kubernetes, comprising the following steps:
Step 1: initialization: calculate the scores of all Nodes in the cluster and add all Nodes into a first Node priority queue in order of score from high to low; monitor all Pods in the cluster;
Step 2: after the cluster runs for time T, if the NodeName field of any Pod is empty, add the current Pod into the Pod priority queue by a dynamic priority algorithm; otherwise return to step 1;
Step 3: filter out useless Nodes from the first Node priority queue for the priority-matched high-priority Pod through the preselection algorithm of kubernetes;
Step 4: if no Node meets the Pod's operating requirements, scheduling fails; return to step 3 and enter the scheduling cycle of the next Pod; if available Nodes exist, generate a filtered second Node priority queue and carry out the next step;
Step 5: select the Node with the highest priority from the second Node priority queue and bind it with the priority-matched high-priority Pod;
Step 6: if binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 and enter the scheduling cycle of the next Pod; otherwise binding fails and the next step is carried out;
Step 7: select the preferred Node from the second Node priority queue using the priority algorithm of kubernetes; bind the preferred Node with the priority-matched high-priority Pod;
Step 8: if binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 and enter the scheduling cycle of the next Pod; otherwise binding fails, and the scheduling cycle of the next Pod is still entered until a Node satisfying the Pod to be scheduled appears in the cluster.
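The scheduling loop of steps 3–8 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the callables `preselect`, `priority_score` and `bind`, and the list representation of the Node queue, are assumptions introduced for the sketch.

```python
def schedule_once(pod, node_queue, preselect, priority_score, bind):
    """One scheduling cycle for the highest-priority Pod (steps 3-8, sketch).

    node_queue is assumed to be ordered by priority, highest first.
    Returns the Node the Pod was bound to, or None if scheduling failed.
    """
    # Steps 3-4: filter out Nodes that cannot run the Pod (preselection).
    candidates = [n for n in node_queue if preselect(pod, n)]
    if not candidates:
        return None  # scheduling fails; the next Pod's cycle begins
    # Steps 5-6: try the highest-priority Node of the filtered queue first.
    best = candidates[0]
    if bind(pod, best):
        return best
    # Steps 7-8: fall back to the Node preferred by the priority algorithm.
    preferred = max(candidates, key=lambda n: priority_score(pod, n))
    return preferred if bind(pod, preferred) else None
```

Either way the loop then moves on to the next Pod, matching the "enter the scheduling cycle of the next Pod" wording of steps 4, 6 and 8.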
Preferably, in step 1, calculating the scores of all Nodes comprises the following steps:
Step 1.1: calculate the score score1 of all Nodes using the least-consumption algorithm; score1 comprises the sum of the CPU utilization, memory utilization and network bandwidth utilization terms;
Step 1.2: calculate the score score2 of all Nodes using the resource optimal-balance algorithm;
Step 1.3: add score1 and score2 to obtain the total score of each Node.
Preferably, in step 1.1, score1 = cpu((capacity − sum(requested)) × 10 / capacity) + memory((capacity − sum(requested)) × 10 / capacity) + network((capacity − sum(requested)) × 10 / capacity), wherein the first term is the CPU utilization term, the second the memory utilization term and the third the network bandwidth utilization term; capacity represents the total amount of each resource on each Node, and sum(requested) represents the corresponding total amount of resources required by the Pods.
Preferably, in step 2, the dynamic priority algorithm comprises the following steps:
Step 2.1: set the initial priority value initPodPriorityValue, the system high-priority threshold α, the system low-priority threshold β, the Pod escape-monitoring time Tescape, the minimum Pod number minPodAmount required for each Pod service to run, the Pod running-number weight W1, the queue-occupation time weight W2 and the Pod running-time weight W3;
Step 2.2: when the time spent waiting for the Pod to be created exceeds Tescape, carry out the next step;
Step 2.3: obtain the running number runningPodAmount of each Pod, the expected running number neededPodAmount of the corresponding Pod, the time Tqueue spent in the Pod priority queue, the position Num in the queue and the running time Truntime;
Step 2.4: calculate the priority value of each Pod in the Pod priority queue: runtimePodPriorityValue = W1·runningPodAmount + W2·Num·Tqueue + W3·Truntime;
Step 2.5: if runtimePodPriorityValue is greater than α, calculate decreasePodPriorityValue = initPodPriorityValue·(1 − runningPodAmount/neededPodAmount) and enter the priority-decrease queue ordered from large to small values; carry out the next step;
if runtimePodPriorityValue is less than β, calculate increasePodPriorityValue = initPodPriorityValue·(1 + minPodAmount/neededPodAmount) and enter the priority-increase queue ordered from small to large values; carry out the next step;
if runtimePodPriorityValue lies between the thresholds α and β, the dynamic priority is not adjusted; go to step 3;
Step 2.6: add all Pods into the Pod priority queue again and update the cluster information.
Preferably, in step 2.5, if decreasePodPriorityValue is not less than α, the priority of the corresponding Pod is decreased; if increasePodPriorityValue is not greater than β, the priority of the corresponding Pod is increased.
Preferably, in step 5, the binding operation changes the value of the NodeName field of the selected Pod to the name of the selected Node.
Preferably, in step 7, the priority algorithm comprises the LeastRequestedPriority, BalancedResourceAllocation, SelectorSpreadPriority, NodeAffinityPriority, TaintTolerationPriority and InterPodAffinityPriority algorithms; for each algorithm a Node satisfies, the algorithm's score is multiplied by the corresponding weight coefficient, all products are added to obtain the Node's score, and Nodes with higher scores are given higher priority.
Preferably, the resource scheduling method further includes a cluster dynamic resource load balancing method, where the cluster dynamic resource load balancing method includes the following steps:
Step 9.1: initialize the threshold η1 of high-load Nodes and the threshold η2 of low-load Nodes;
Step 9.2: monitoring cluster information in real time, and acquiring state information of all nodes according to a certain polling period T;
Step 9.3: calculate the average-load score of all Nodes, Avg_Score = (1 − Avg_CPU)(1 − Avg_network)(1 − Avg_storage), and the load score of each Node, Score(i) = (1 − CPU(i))(1 − network(i))(1 − storage(i)), where i is the Node sequence number, Avg_CPU, Avg_network and Avg_storage are the mean CPU, network bandwidth and memory utilization of all Nodes in the cluster, and CPU(i), network(i) and storage(i) are the CPU, network bandwidth and memory utilization of each Node;
Step 9.4: let η1 = λ1·Avg_Score and η2 = λ2·Avg_Score;
Step 9.5: using η1 and η2 as thresholds on each Node's Score(i), classify Nodes with Score(i) < η1 into the high-load queue, Nodes with Score(i) > η2 into the low-load queue, and Nodes with η1 ≤ Score(i) ≤ η2 into the balanced-load queue;
step 9.6: if the high load queue and the low load queue are not empty, the next step is carried out; otherwise, the dynamic load scheduling is not carried out, and the step 9.2 is returned to enter the next polling period;
Step 9.7: select a Pod running on the Node with the lowest score in the high-load queue, preselect the Nodes in the low-load queue, and migrate the Pod to a Node in the low-load queue to achieve cluster load balance.
Preferably, in step 9.2, the Node status information includes a CPU utilization rate, a memory utilization rate, a network bandwidth utilization rate, and a storage utilization rate of the Node.
The invention provides an optimized resource scheduling method based on kubernetes. Cluster information is read and the scores of all Nodes are calculated by a fixed rule to generate a first Node priority queue, while a Pod priority queue is obtained by a dynamic priority algorithm. The two queues filter out unschedulable Nodes through a preselection algorithm to generate a second Node priority queue, from which the Node with the highest priority is directly selected and bound with the Pod popped from the Pod priority queue. If binding succeeds, the scheduling cycle of the next Pod begins; if it fails, the preferred Node in the second Node priority queue is bound using the priority algorithm of kubernetes; if this binding succeeds, the scheduling cycle of the next Pod begins, and if scheduling fails, no suitable Node in the cluster can run the Pod.
The method comprises static scheduling and dynamic resource load balancing. By selecting a sufficiently appropriate rather than strictly optimal Node, it saves scheduling time for the kubernetes scheduler, improves scheduling efficiency, accelerates cluster task deployment, improves task operation integrity, and improves the load balance of the whole cluster. The dynamic load-balancing method migrates Pods on high-load Nodes to low-load Nodes, actively adjusts the load balance of the cluster, and improves the resource utilization efficiency of the cluster.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a flow chart of the dynamic priority algorithm of the present invention;
FIG. 3 is a flowchart of the cluster dynamic resource load balancing method of the present invention.
Detailed Description
The present invention will be described in further detail with reference to examples, but the scope of the present invention is not limited thereto.
The invention relates to a resource scheduling method based on kubernetes, which comprises static scheduling and dynamic resource load balancing.
The method comprises the following steps.
Step 1: initialization: calculate the scores of all Nodes in the cluster and add all Nodes into a first Node priority queue in order of score from high to low; monitor all Pods in the cluster.
In step 1, calculating the scores of all the nodes includes the following steps:
Step 1.1: calculate the score score1 of all Nodes using the least-consumption algorithm; score1 comprises the sum of the CPU utilization, memory utilization and network bandwidth utilization terms.
In step 1.1, score1 = cpu((capacity − sum(requested)) × 10 / capacity) + memory((capacity − sum(requested)) × 10 / capacity) + network((capacity − sum(requested)) × 10 / capacity), wherein the first term is the CPU utilization term, the second the memory utilization term and the third the network bandwidth utilization term; capacity represents the total amount of each resource on each Node, and sum(requested) represents the corresponding total amount of resources required by the Pods.
Step 1.2: calculate the score score2 of all Nodes using the resource optimal-balance algorithm.
Step 1.3: add score1 and score2 to obtain the total score of each Node.
In the invention, the two algorithms used to calculate the initial Node scores are the least-consumption algorithm LeastRequestedPriority and the resource most-balanced algorithm BalancedResourceAllocation. The former selects the Node with the least resource consumption, the resources comprising CPU utilization, memory utilization and network bandwidth utilization; the latter, a known algorithm, selects the Node whose resource use is most balanced, mainly the balance between memory and CPU utilization.
In the invention, score2 = 10 − variance(cpuFraction, memoryFraction) × 10, where cpuFraction and memoryFraction are the ratios of the Pod's requested resources to the resources available on the Node: cpuFraction = cpu(requested)/cpu(available) and memoryFraction = memory(requested)/memory(available). variance is kubernetes' own algorithm for computing the balance between two resources; requested represents the required amount and available the available amount.
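The two scoring formulas and their sum in step 1.3 can be sketched as follows. Treating the `variance` of the two fractions as their absolute difference is an assumption consistent with the BalancedResourceAllocation priority described above, and the dict-based resource representation is illustrative:

```python
def least_requested_score(capacity, requested):
    """score1 (step 1.1): free fraction of each resource scaled to 0-10, summed."""
    return sum((capacity[r] - requested[r]) * 10.0 / capacity[r]
               for r in ("cpu", "memory", "network"))

def balanced_allocation_score(requested, available):
    """score2: 10 - variance(cpuFraction, memoryFraction) x 10; the 'variance'
    of two values is taken here as their absolute difference (assumption)."""
    cpu_fraction = requested["cpu"] / available["cpu"]
    memory_fraction = requested["memory"] / available["memory"]
    return 10.0 - abs(cpu_fraction - memory_fraction) * 10.0

def node_score(capacity, requested, available):
    """Step 1.3: total Node score = score1 + score2."""
    return (least_requested_score(capacity, requested)
            + balanced_allocation_score(requested, available))
```

A Node that is both lightly loaded and evenly loaded across CPU and memory thus sorts toward the head of the first Node priority queue.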
Step 2: after the cluster runs for time T, if the NodeName field of any Pod is empty, add the current Pod into the Pod priority queue using the dynamic priority algorithm; otherwise return to step 1.
In the invention, the dynamic priority algorithm for Pods mainly solves the problem that a high-priority Pod occupying the head of the Pod priority queue for a long time may occupy the cluster's Node resources for a long time, so that low-priority Pod tasks cannot run. Specifically, the dynamic priority algorithm runs after the cluster has operated for a period of time and adjusts the priorities of high-priority and low-priority Pods while the cluster runs, lowering the priority of high-priority Pods and raising the priority of low-priority Pods.
In the invention, when no such Pod exists, the system returns to monitoring Pods and continues to calculate the Node scores.
In step 2, the dynamic priority algorithm includes the following steps:
Step 2.1: set the initial priority value initPodPriorityValue, the system high-priority threshold α, the system low-priority threshold β, the Pod escape-monitoring time Tescape, the minimum Pod number minPodAmount required for each Pod service to run, the Pod running-number weight W1, the queue-occupation time weight W2 and the Pod running-time weight W3;
Step 2.2: when the time spent waiting for the Pod to be created exceeds Tescape, carry out the next step;
Step 2.3: obtain the running number runningPodAmount of each Pod, the expected running number neededPodAmount of the corresponding Pod, the time Tqueue spent in the Pod priority queue, the position Num in the queue and the running time Truntime;
Step 2.4: calculate the priority value of each Pod in the Pod priority queue: runtimePodPriorityValue = W1·runningPodAmount + W2·Num·Tqueue + W3·Truntime;
Step 2.5: if runtimePodPriorityValue is greater than α, calculate decreasePodPriorityValue = initPodPriorityValue·(1 − runningPodAmount/neededPodAmount) and enter the priority-decrease queue ordered from large to small values; carry out the next step;
if runtimePodPriorityValue is less than β, calculate increasePodPriorityValue = initPodPriorityValue·(1 + minPodAmount/neededPodAmount) and enter the priority-increase queue ordered from small to large values; carry out the next step;
if runtimePodPriorityValue lies between the thresholds α and β, the dynamic priority is not adjusted; go to step 3;
In step 2.5, if decreasePodPriorityValue is not less than α, the priority of the corresponding Pod is decreased; if increasePodPriorityValue is not greater than β, the priority of the corresponding Pod is increased.
In the invention, the waiting time for the Pod to be created exceeding Tescape in step 2.2 indicates that the Pod was created successfully.
In the invention, decreasePodPriorityValue and increasePodPriorityValue are further checked so that the overall ordering of all Pod priorities does not have to change, preventing all Pod priorities from ending up between α and β and congesting resource scheduling.
Step 2.6: add all Pods into the Pod priority queue again and update the cluster information.
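Steps 2.4–2.5 can be sketched as the two functions below. This is a minimal illustration whose variable names mirror the patent's symbols; returning the unchanged initial value for the middle band (between α and β) is an assumption of the sketch:

```python
def runtime_priority(w1, w2, w3, running_pod_amount, num, t_queue, t_runtime):
    """Step 2.4: runtimePodPriorityValue =
    W1*runningPodAmount + W2*Num*Tqueue + W3*Truntime."""
    return w1 * running_pod_amount + w2 * num * t_queue + w3 * t_runtime

def adjusted_priority(runtime_value, init_value, alpha, beta,
                      running_pod_amount, needed_pod_amount, min_pod_amount):
    """Step 2.5: demote well-served Pods, promote starved ones."""
    if runtime_value > alpha:
        # decreasePodPriorityValue: the more replicas already running,
        # the larger the reduction.
        return init_value * (1 - running_pod_amount / needed_pod_amount)
    if runtime_value < beta:
        # increasePodPriorityValue: boost by the minimum required share.
        return init_value * (1 + min_pod_amount / needed_pod_amount)
    return init_value  # between alpha and beta: no adjustment
```

For example, a Pod with 4 of 8 expected replicas running and a runtime value above α has its priority halved, while a starved Pod below β with minPodAmount = 2 is boosted by a quarter of its initial value.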
Step 3: filter out useless Nodes from the first Node priority queue for the priority-matched high-priority Pod through the preselection algorithm of kubernetes.
In the invention, useless Nodes are Nodes that clearly do not meet requirements. The filtering includes, but is not limited to, removing Nodes with insufficient remaining CPU and memory resources, persistent storage volume conflicts, Nodes unschedulable because they carry taints, and Nodes violating Pod affinity or anti-affinity.
Step 4: if no Node meets the Pod's operating requirements, scheduling fails; return to step 3 and enter the scheduling cycle of the next Pod; if available Nodes exist, generate a filtered second Node priority queue and carry out the next step.
In the invention, meeting the demand of Pod operation means meeting the requirements of memory, bandwidth, CPU quantity and the like of Pod operation.
Step 5: select the Node with the highest priority from the second Node priority queue and bind it with the priority-matched high-priority Pod.
In step 5, the binding operation changes the value of the NodeName field of the selected Pod to the name of the selected Node.
In the invention, kubernetes only updates the Pod and Node information in its cache during the binding stage; the selected Node finally verifies whether the Pod can actually run on it.
In the invention, the Pod is popped from the queue and the name of the selected Node is written into the Pod's NodeName attribute as the binding.
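The binding write can be sketched as below, with plain dicts standing in for the Pod API object; this is an illustrative simplification, as the real scheduler issues a Bind request to the API server rather than mutating a local structure:

```python
def bind(pod, node_name):
    """Binding: write the chosen Node's name into the Pod's spec.nodeName
    field, which is what marks the Pod as scheduled."""
    pod.setdefault("spec", {})["nodeName"] = node_name
    return pod
```

Once NodeName is non-empty, the check in step 2 skips the Pod in later scheduling cycles.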
Step 6: if binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 and enter the scheduling cycle of the next Pod; if binding fails, carry out the next step.
Step 7: select the preferred Node from the second Node priority queue using the priority algorithm of kubernetes; bind the preferred Node with the priority-matched high-priority Pod.
In step 7, the priority algorithm comprises the LeastRequestedPriority, BalancedResourceAllocation, SelectorSpreadPriority, NodeAffinityPriority, TaintTolerationPriority and InterPodAffinityPriority algorithms; for each algorithm a Node satisfies, the algorithm's score is multiplied by the corresponding weight coefficient, all products are added to obtain the Node's score, and Nodes with higher scores are given higher priority.
In the invention, the scores of all Nodes in the second Node priority queue are calculated by the algorithms above; the more rules a Node satisfies, the higher its score.
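The weighted sum described above can be sketched as follows; the `(score_fn, weight)` pairs stand in for priority functions such as LeastRequestedPriority, and this pairing representation is an assumption of the sketch:

```python
def weighted_node_score(node, priorities):
    """Step 7: sum of (priority-function score x weight) over all the
    priority functions the Node is evaluated against."""
    return sum(score_fn(node) * weight for score_fn, weight in priorities)

def preferred_node(nodes, priorities):
    """Pick the Node with the highest weighted score as the preferred Node."""
    return max(nodes, key=lambda n: weighted_node_score(n, priorities))
```

Raising an algorithm's weight coefficient shifts the preference toward Nodes that score well on that criterion.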
Step 8: if binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 and enter the scheduling cycle of the next Pod; otherwise binding fails, and the scheduling cycle of the next Pod is still entered until a Node satisfying the Pod to be scheduled appears in the cluster.
In the invention, the two binding operations are the same.
The resource scheduling method also comprises a cluster dynamic resource load balancing method, and the cluster dynamic resource load balancing method comprises the following steps.
Step 9.1: initialize the threshold η1 of high-load Nodes and the threshold η2 of low-load Nodes.
In the present invention, η1 and η2 are set by a person skilled in the art according to the average load of all servers.
Step 9.2: monitor cluster information in real time and acquire the state information of all Nodes at a certain polling period T.
In step 9.2, the Node status information includes the CPU utilization, the memory utilization, the network bandwidth utilization, and the storage utilization of the Node.
Step 9.3: calculate the average-load score of all Nodes, Avg_Score = (1 − Avg_CPU)(1 − Avg_network)(1 − Avg_storage), and the load score of each Node, Score(i) = (1 − CPU(i))(1 − network(i))(1 − storage(i)), where i is the Node sequence number, Avg_CPU, Avg_network and Avg_storage are the mean CPU, network bandwidth and memory utilization of all Nodes in the cluster, and CPU(i), network(i) and storage(i) are the CPU, network bandwidth and memory utilization of each Node.
Step 9.4: let η1 = λ1·Avg_Score and η2 = λ2·Avg_Score.
Step 9.5: using η1 and η2 as thresholds on each Node's Score(i), classify Nodes with Score(i) < η1 into the high-load queue, Nodes with Score(i) > η2 into the low-load queue, and Nodes with η1 ≤ Score(i) ≤ η2 into the balanced-load queue.
Step 9.6: if the high load queue and the low load queue are not empty, the next step is carried out; otherwise, the dynamic load scheduling is not carried out, and the step 9.2 is returned to enter the next polling period.
Step 9.7: selecting the Pod running on the Node with the minimum value in the high load queue, preselecting the nodes in the low load queue, and running the Pod to the nodes in the low load queue to achieve cluster load balance.
In the present invention, 0 < λ1 < λ2 and η1 < η2.
In the present invention, the Nodes in the low-load queue are preselected in step 9.7 using a known algorithm built into kubernetes.
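Steps 9.3–9.5 can be sketched as below. For brevity the cluster-average score is computed here as the mean of the per-Node scores rather than from the mean utilizations, which is a simplifying assumption of the sketch:

```python
def load_score(cpu, network, storage):
    """Step 9.3: Score(i) = (1-CPU(i))(1-network(i))(1-storage(i)).
    Higher utilization gives a lower score, so LOW scores mean HIGH load."""
    return (1 - cpu) * (1 - network) * (1 - storage)

def classify_nodes(scores, lam1, lam2):
    """Steps 9.4-9.5: thresholds are multiples (0 < lambda1 < lambda2) of the
    average score; returns the (high_load, low_load, balanced) queues."""
    avg = sum(scores.values()) / len(scores)
    eta1, eta2 = lam1 * avg, lam2 * avg
    high = [n for n, s in scores.items() if s < eta1]        # heavily loaded
    low = [n for n, s in scores.items() if s > eta2]         # lightly loaded
    balanced = [n for n, s in scores.items() if eta1 <= s <= eta2]
    return high, low, balanced
```

Step 9.6 then proceeds only when both the high-load and low-load queues are non-empty, so Nodes near the cluster average never trigger unnecessary Pod migration.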
The invention reads cluster information and calculates the scores of all Nodes by a fixed rule to generate a first Node priority queue, while a Pod priority queue is obtained by a dynamic priority algorithm. The two queues filter out unschedulable Nodes through a preselection algorithm to generate a second Node priority queue, from which the Node with the highest priority is directly selected and bound with the Pod popped from the Pod priority queue. If binding succeeds, the scheduling cycle of the next Pod begins; if it fails, the preferred Node in the second Node priority queue is bound using the priority algorithm of kubernetes; if this binding succeeds, the scheduling cycle of the next Pod begins, and if scheduling fails, no suitable Node in the cluster can run the Pod and the scheduling cycle of the next Pod begins.
The method comprises static scheduling and dynamic resource load balancing. By selecting a sufficiently appropriate rather than strictly optimal Node, it saves scheduling time for the kubernetes scheduler, improves scheduling efficiency, accelerates cluster task deployment, improves task operation integrity, and improves the load balance of the whole cluster. The dynamic load-balancing method migrates Pods on high-load Nodes to low-load Nodes, actively adjusts the load balance of the cluster, and improves the resource utilization efficiency of the cluster.
Claims (9)
1. A resource scheduling method based on kubernetes, characterized in that the method comprises the following steps:
step 1: initializing, calculating the scores of all nodes in the cluster, and adding all nodes into a first Node priority queue according to the scores from high to low; monitoring all the Pod in the cluster;
step 2: the cluster runs time T, if the Nodename field of any Pod is empty, the current Pod is added into the Pod priority queue by a dynamic priority algorithm, otherwise, the operation returns to the step 1; the dynamic priority algorithm comprises the steps of:
step 2.1: setting initial priority value initPodPriorityValue, system high priority threshold value alpha, system low priority threshold value beta and Pod escaping monitoring time TescapeThe minimum required Pod number minPodAmount and the Pod operation number weight W required by each Pod service operation1Time weight W occupying Pod priority queue2Pod running time weight W3;
Step 2.2: when the time waiting for the Pod to be established is more than TescapeCarrying out the next step;
step 2.3: obtaining the running number runningPodAmount of each Pod, the expected running number needledPodAmount of the corresponding Pod, and the time T of occupying the Pod priority queuequeuePosition Num of queue and time T of operationruntime;
Step 2.4: calculating the priority value runtimedopriorityValue of each Pod in the Pod priority queue, wherein the value is W1*runningPodAmount+W2*Num*Tqueue+W3*Truntime;
Step 2.5: if runtimedopriorityvalue is greater than α, calculating deprease podpriorityvalue, which is initPodpriorityvalue (1-runningPodAmount/needPodAmount), according to the value going from large to small into the reduced priority cohort; carrying out the next step;
if runtimedopriorityvalue is less than β, calculating incrustedpoidpriorityvalue, which is the initPodpriorityvalue (1+ minPodAmount/needPodAmount), by entering the queue of increasing priority from small to large values; carrying out the next step;
if the runtimedopriorityvalue is between the threshold values alpha and beta, the dynamic priority is not adjusted, and the step 3 is carried out;
step 2.6: all the Pod are added into the Pod priority queue again, and cluster information is updated;
step 3: using the Kubernetes pre-selection (predicate) algorithm, filter out unsuitable Nodes for the preferentially matched high-priority Pod against the first Node priority queue;
step 4: if no Node meets the Pod's operating requirements, the scheduling fails; return to step 3 to enter the scheduling cycle of the next Pod; if available Nodes exist, generate a filtered second Node priority queue and proceed to the next step;
step 5: select the Node with the highest priority from the second Node priority queue and bind it to the preferentially matched high-priority Pod;
step 6: if the binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 to enter the scheduling cycle of the next Pod; otherwise the binding has failed, so proceed to the next step;
step 7: select a preferred Node from the second Node priority queue using the Kubernetes priority algorithm; bind the preferred Node to the preferentially matched high-priority Pod;
step 8: if the binding succeeds, the high-priority Pod runs on the selected Node; return to step 3 to enter the scheduling cycle of the next Pod; otherwise the binding has failed, and the scheduling cycle of the next Pod is still entered, until a Node satisfying the Pod to be scheduled appears in the cluster.
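The dynamic priority computation of steps 2.4 and 2.5 can be sketched as follows. This is an illustrative sketch, not the patented implementation: the weight values W1–W3 and the thresholds passed in are hypothetical placeholders, and only the formulas follow the claim text.

```python
def runtime_priority(running, num, t_queue, t_runtime, w1=1.0, w2=0.5, w3=0.2):
    """Step 2.4: runtimePodPriorityValue = W1*running + W2*Num*T_queue + W3*T_runtime.
    The default weights are hypothetical placeholders."""
    return w1 * running + w2 * num * t_queue + w3 * t_runtime

def adjust_priority(init_value, running, needed, min_pods, value, alpha, beta):
    """Step 2.5: adjust the Pod's priority relative to the thresholds alpha and beta.
    Returns the adjusted priority value and the direction of the change."""
    if value > alpha:  # above the high threshold: demote
        return init_value * (1 - running / needed), "decrease"
    if value < beta:   # below the low threshold: promote
        return init_value * (1 + min_pods / needed), "increase"
    return init_value, "unchanged"  # between beta and alpha: no adjustment
```

For example, a Pod with 4 replicas running out of 8 needed, occupying queue position 2 for 10 time units and running for 30, gets runtime_priority(4, 2, 10, 30) = 20.0 under the placeholder weights; with alpha = 15 it is demoted to half its initial priority.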
2. The Kubernetes-based resource scheduling method of claim 1, wherein in step 1, calculating the scores of all Nodes comprises the following steps:
step 1.1: calculate a score score1 for all Nodes using a minimum-consumption algorithm; score1 is the sum of the CPU utilization rate, the memory utilization rate, and the network bandwidth utilization rate;
step 1.2: calculate a score score2 for all Nodes using an optimal resource balance algorithm;
step 1.3: add score1 and score2 to obtain the total score of each Node.
3. The Kubernetes-based resource scheduling method of claim 2, wherein in step 1.1, score1 = cpuX + memoryX + networkX, where the first term cpuX is the CPU utilization rate, the second term memoryX is the memory utilization rate, and the third term networkX is the network bandwidth utilization rate; capacity denotes the total amount of CPU, memory, and network bandwidth on each Node, and sum(requested) denotes the corresponding total amount of CPU, memory, and network bandwidth resources required by the Pods.
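A minimal sketch of the score1 computation of claims 2 and 3. The exact per-term formula is not reproduced in the claim text, so treating each utilization rate as sum(requested)/capacity is an assumption, as are the node dictionary field names:

```python
def utilization(requested, capacity):
    # Assumed per-term form: total resources requested by Pods / Node capacity.
    return sum(requested) / capacity

def score1(node):
    # Claim 3: score1 = cpuX + memoryX + networkX, the sum of the CPU,
    # memory, and network bandwidth utilization rates of one Node.
    return (utilization(node["cpu_requested"], node["cpu_capacity"])
            + utilization(node["mem_requested"], node["mem_capacity"])
            + utilization(node["net_requested"], node["net_capacity"]))
```

For a Node at half CPU utilization and one quarter memory and network utilization, score1 evaluates to 0.5 + 0.25 + 0.25 = 1.0.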
4. The Kubernetes-based resource scheduling method of claim 1, wherein in step 2.5, if decreasePodPriorityValue is not less than α, the priority of the corresponding Pod is decreased; if increasePodPriorityValue is not greater than β, the priority of the corresponding Pod is increased.
5. The Kubernetes-based resource scheduling method of claim 1, wherein in step 5, the binding operation changes the value of the NodeName field of the selected Pod to the name of the selected Node.
6. The Kubernetes-based resource scheduling method of claim 1, wherein in step 7, the priority algorithm comprises a LeastRequestedPriority algorithm, a BalancedResourceAllocation algorithm, a SelectorSpreadPriority algorithm, a NodeAffinityPriority algorithm, a TaintTolerationPriority algorithm, and an InterPodAffinityPriority algorithm; the score a Node obtains from each algorithm it satisfies is multiplied by a corresponding weight coefficient, all products are added to obtain the Node's score, and a Node with a higher score has higher priority.
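The weighted scoring of claim 6 can be sketched as below. The per-algorithm weight values are hypothetical placeholders; only the multiply-and-sum combination follows the claim:

```python
# Hypothetical weight coefficients for the six priority algorithms of claim 6.
PRIORITY_WEIGHTS = {
    "LeastRequestedPriority": 1.0,
    "BalancedResourceAllocation": 1.0,
    "SelectorSpreadPriority": 1.0,
    "NodeAffinityPriority": 2.0,
    "TaintTolerationPriority": 1.0,
    "InterPodAffinityPriority": 2.0,
}

def node_priority_score(algorithm_scores, weights=PRIORITY_WEIGHTS):
    # Claim 6: multiply each satisfied algorithm's score for the Node by its
    # weight coefficient and add all products; higher total means higher priority.
    return sum(weights[name] * score for name, score in algorithm_scores.items())
```

A Node scoring 5 on LeastRequestedPriority and 3 on NodeAffinityPriority would total 1.0*5 + 2.0*3 = 11.0 under these placeholder weights.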
7. The Kubernetes-based resource scheduling method of claim 1, wherein the resource scheduling method further comprises a cluster dynamic resource load balancing method, comprising the following steps:
step 9.1: initialize the high-load Node threshold η1 and the low-load Node threshold η2;
step 9.2: monitor cluster information in real time, and acquire the state information of all Nodes at a polling period T;
step 9.3: calculate the average-load score of all Nodes, Avg_Score(i) = (1 - Avg_CPU(i)) * (1 - Avg_network(i)) * (1 - Avg_storage(i)), and the load score of each Node, Score(i) = (1 - CPU(i)) * (1 - network(i)) * (1 - storage(i)), where i is the Node index; Avg_CPU(i), Avg_network(i), and Avg_storage(i) are the averages of the CPU, network bandwidth, and memory utilizations of all Nodes in the cluster, and CPU(i), network(i), and storage(i) are the CPU, network bandwidth, and memory utilizations of each Node;
step 9.4: let η1 = λ1 * Avg_Score(i) and η2 = λ2 * Avg_Score(i), with 0 < λ1 < λ2;
step 9.5: with η1 and η2 as thresholds on the Score(i) of each Node, classify Nodes with Score(i) < η1 into a high-load queue, Nodes with Score(i) > η2 into a low-load queue, and Nodes with η1 ≤ Score(i) ≤ η2 into a balanced-load queue;
step 9.6: if the high-load queue and the low-load queue are both non-empty, proceed to the next step; otherwise perform no dynamic load scheduling and return to step 9.2 for the next polling period;
step 9.7: select the Pods running on the Node with the minimum score in the high-load queue, pre-select Nodes in the low-load queue, and migrate those Pods to the low-load Nodes to achieve cluster load balance.
8. The Kubernetes-based resource scheduling method of claim 7, wherein in step 9.2, the Node state information comprises the CPU utilization, the memory utilization, and the network bandwidth utilization of the Node.
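The load scoring and Node classification of steps 9.3 to 9.5 can be sketched as follows. The λ1 and λ2 values and the sample utilizations are hypothetical; Score(i) is the product of the idle fractions of CPU, network, and memory, so a lower score means a heavier load:

```python
def load_score(cpu, net, storage):
    # Step 9.3: Score(i) = (1-CPU(i)) * (1-network(i)) * (1-storage(i)).
    # Utilizations are fractions in [0, 1]; lower score = more heavily loaded.
    return (1 - cpu) * (1 - net) * (1 - storage)

def classify_nodes(scores, avg_score, lam1, lam2):
    # Steps 9.4-9.5: derive eta1/eta2 from the cluster average score
    # (0 < lam1 < lam2), then bucket each Node by its score.
    eta1, eta2 = lam1 * avg_score, lam2 * avg_score
    high, low, balanced = [], [], []
    for node, s in scores.items():
        if s < eta1:
            high.append(node)      # high-load queue: migration sources
        elif s > eta2:
            low.append(node)       # low-load queue: migration targets
        else:
            balanced.append(node)  # balanced-load queue: left alone
    return high, low, balanced
```

Step 9.6 then triggers migration only when both the high-load and low-load lists are non-empty.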
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910107749.9A CN109960585B (en) | 2019-02-02 | 2019-02-02 | Resource scheduling method based on kubernets |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109960585A CN109960585A (en) | 2019-07-02 |
CN109960585B true CN109960585B (en) | 2021-05-14 |
Family
ID=67023606
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910107749.9A Active CN109960585B (en) | 2019-02-02 | 2019-02-02 | Resource scheduling method based on kubernets |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109960585B (en) |
Families Citing this family (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112214303A (en) * | 2019-07-09 | 2021-01-12 | 上海交通大学 | Kubernetes cluster automatic scaling system |
CN112214288B (en) * | 2019-07-10 | 2023-04-25 | 中国移动通信集团上海有限公司 | Pod scheduling method, device, equipment and medium based on Kubernetes cluster |
CN110515730A (en) * | 2019-08-22 | 2019-11-29 | 北京宝兰德软件股份有限公司 | Resource secondary dispatching method and device based on kubernetes container arranging system |
CN110727512B (en) * | 2019-09-30 | 2020-06-26 | 星环信息科技(上海)有限公司 | Cluster resource scheduling method, device, equipment and storage medium |
CN110704165A (en) * | 2019-10-08 | 2020-01-17 | 浪潮云信息技术有限公司 | High-availability deployment method for container |
CN110727653B (en) * | 2019-10-12 | 2023-06-16 | 广州华多网络科技有限公司 | Multi-project load balancing method and device |
CN110719335B (en) * | 2019-10-21 | 2022-10-04 | 中国科学院空间应用工程与技术中心 | Resource scheduling method, system and storage medium under space-based cloud computing architecture |
CN110941393A (en) * | 2019-10-31 | 2020-03-31 | 北京浪潮数据技术有限公司 | Logical volume management-based LV supply method, device, equipment and medium |
CN110851236A (en) * | 2019-11-11 | 2020-02-28 | 星环信息科技(上海)有限公司 | Real-time resource scheduling method and device, computer equipment and storage medium |
CN110990121B (en) * | 2019-11-28 | 2023-07-14 | 中国—东盟信息港股份有限公司 | Kubernetes scheduling strategy based on application portraits |
CN111143059B (en) * | 2019-12-17 | 2023-10-20 | 天津大学 | Improved Kubernetes resource scheduling method |
CN111090523A (en) * | 2019-12-18 | 2020-05-01 | 浪潮云信息技术有限公司 | Resource scheduling method based on dynamic priority under kubernets environment |
CN113014611A (en) * | 2019-12-19 | 2021-06-22 | 华为技术有限公司 | Load balancing method and related equipment |
CN111258609B (en) * | 2020-01-19 | 2023-08-01 | 北京百度网讯科技有限公司 | Upgrading method and device of Kubernetes cluster, electronic equipment and medium |
CN111475251A (en) * | 2020-03-08 | 2020-07-31 | 苏州浪潮智能科技有限公司 | Cluster container scheduling method, system, terminal and storage medium |
CN111352717B (en) * | 2020-03-24 | 2023-04-07 | 广西梯度科技股份有限公司 | Method for realizing kubernets self-defined scheduler |
CN111522639B (en) * | 2020-04-16 | 2022-11-01 | 南京邮电大学 | Multidimensional resource scheduling method under Kubernetes cluster architecture system |
CN111694646B (en) * | 2020-05-29 | 2023-11-07 | 北京百度网讯科技有限公司 | Resource scheduling method, device, electronic equipment and computer readable storage medium |
CN111737003B (en) * | 2020-06-24 | 2023-04-28 | 重庆紫光华山智安科技有限公司 | Pod balanced scheduling method and device, master node and storage medium |
US11336588B2 (en) | 2020-06-26 | 2022-05-17 | Red Hat, Inc. | Metadata driven static determination of controller availability |
CN111813556B (en) * | 2020-07-21 | 2021-04-09 | 北京东方通软件有限公司 | Elastic expansion method of virtual cluster in cloud computing environment |
DE102021109546A1 (en) * | 2020-09-30 | 2022-03-31 | Hewlett Packard Enterprise Development Lp | PREVENTING A RESOURCE FROM SCHEDULING OR RUNNING ON AN INCONSISTENT HOST NODE |
CN112181661B (en) * | 2020-10-13 | 2023-10-24 | 极道科技(北京)有限公司 | Task scheduling method |
CN112286631B (en) * | 2020-10-23 | 2022-07-01 | 烽火通信科技股份有限公司 | Kubernetes resource scheduling method and device and electronic equipment |
CN112363811B (en) * | 2020-11-16 | 2023-04-07 | 中国电子科技集团公司电子科学研究院 | Artificial intelligence computing resource scheduling method and computer readable storage medium |
CN112527454A (en) * | 2020-12-04 | 2021-03-19 | 上海连尚网络科技有限公司 | Container group scheduling method and device, electronic equipment and computer readable medium |
US11474905B2 (en) | 2020-12-10 | 2022-10-18 | International Business Machines Corporation | Identifying harmful containers |
CN114625493B (en) * | 2020-12-14 | 2024-04-02 | 中国石油大学(华东) | Kubernetes cluster resource scheduling method based on improved longhorn beetle whisker intelligent method |
CN112286671B (en) * | 2020-12-29 | 2021-03-12 | 湖南星河云程信息科技有限公司 | Containerization batch processing job scheduling method and device and computer equipment |
CN112764906B (en) * | 2021-01-26 | 2024-03-15 | 浙江工业大学 | Cluster resource scheduling method based on user job type and node performance bias |
CN112988380B (en) * | 2021-02-25 | 2022-06-17 | 电子科技大学 | Kubernetes-based cluster load adjusting method and storage medium |
CN113391891B (en) * | 2021-05-20 | 2024-03-12 | 国网江苏省电力有限公司信息通信分公司 | Load balancing resource scheduling method based on Rete and character string pattern matching algorithm |
CN113485792B (en) * | 2021-07-08 | 2023-05-26 | 厦门服云信息科技有限公司 | Pod scheduling method in kubernetes cluster, terminal equipment and storage medium |
US11706105B2 (en) | 2021-12-14 | 2023-07-18 | International Business Machines Corporation | Selecting low priority pods for guaranteed runs |
CN114327886B (en) * | 2021-12-24 | 2022-12-16 | 国家石油天然气管网集团有限公司 | Dynamic resource scheduling method based on big data deep learning |
CN114064296B (en) * | 2022-01-18 | 2022-04-26 | 北京建筑大学 | Kubernetes scheduling method, Kubernetes scheduling device and storage medium |
CN115309501A (en) * | 2022-07-26 | 2022-11-08 | 天翼云科技有限公司 | Cluster resource planning method, device, apparatus and medium |
CN115297112A (en) * | 2022-07-31 | 2022-11-04 | 南京匡吉信息科技有限公司 | Dynamic resource quota and scheduling component based on Kubernetes |
CN117112239B (en) * | 2023-10-23 | 2024-02-09 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Extensible load balancing method and system on heterogeneous reasoning back end |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10191778B1 (en) * | 2015-11-16 | 2019-01-29 | Turbonomic, Inc. | Systems, apparatus and methods for management of software containers |
CN106027643B (en) * | 2016-05-18 | 2018-10-23 | 无锡华云数据技术服务有限公司 | A kind of resource regulating method based on Kubernetes container cluster management systems |
US20180351851A1 (en) * | 2016-12-15 | 2018-12-06 | Nutanix, Inc. | Managing validity periods for computing resource attributes |
CN106886451B (en) * | 2017-01-10 | 2020-10-27 | 广东石油化工学院 | Multi-workflow task allocation method based on virtualization container technology |
CN108829509A (en) * | 2018-05-03 | 2018-11-16 | 山东汇贸电子口岸有限公司 | Distributed container cluster framework resources management method based on domestic CPU and operating system |
CN108769254B (en) * | 2018-06-25 | 2019-09-20 | 星环信息科技(上海)有限公司 | Resource-sharing application method, system and equipment based on preemption scheduling |
CN109117265A (en) * | 2018-07-12 | 2019-01-01 | 北京百度网讯科技有限公司 | The method, apparatus, equipment and storage medium of schedule job in the cluster |
CN109167835B (en) * | 2018-09-13 | 2021-11-26 | 重庆邮电大学 | Physical resource scheduling method and system based on kubernets |
CN109298914A (en) * | 2018-10-12 | 2019-02-01 | 西安交通大学 | A kind of Docker based on three-tier architecture and virtual machine initial placement method |
- 2019-02-02 CN CN201910107749.9A patent/CN109960585B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN109960585A (en) | 2019-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109960585B (en) | Resource scheduling method based on kubernets | |
WO2021208546A1 (en) | Multi-dimensional resource scheduling method in kubernetes cluster architecture system | |
US11275609B2 (en) | Job distribution within a grid environment | |
CN107832153B (en) | Hadoop cluster resource self-adaptive allocation method | |
WO2016082370A1 (en) | Distributed node intra-group task scheduling method and system | |
CN109783225B (en) | Tenant priority management method and system of multi-tenant big data platform | |
CN114610474B (en) | Multi-strategy job scheduling method and system under heterogeneous supercomputing environment | |
CN114138486A (en) | Containerized micro-service arranging method, system and medium for cloud edge heterogeneous environment | |
CN108616394B (en) | Virtual network function backup and deployment method | |
US20230266999A1 (en) | Resource scheduling method, resource scheduling system, and device | |
CN112799817A (en) | Micro-service resource scheduling system and method | |
CN112114973A (en) | Data processing method and device | |
CN112799837A (en) | Container dynamic balance scheduling method | |
CN111767145A (en) | Container scheduling system, method, device and equipment | |
CN116010064A (en) | DAG job scheduling and cluster management method, system and device | |
CN115794337A (en) | Resource scheduling method and device, cloud platform, equipment and storage medium | |
CN113672391B (en) | Parallel computing task scheduling method and system based on Kubernetes | |
CN111796933A (en) | Resource scheduling method, device, storage medium and electronic equipment | |
CN115665157B (en) | Balanced scheduling method and system based on application resource types | |
CN115686826A (en) | Dynamic scheduling method and system based on space-based distributed satellite cluster | |
CN115562841A (en) | Cloud video service self-adaptive resource scheduling system and method | |
Hicham et al. | Deadline and energy aware task scheduling in cloud computing | |
CN115981806A (en) | Self-adaptive risk-resistant Kubernets cluster scheduling method based on penalty factors | |
CN115061811A (en) | Resource scheduling method, device, equipment and storage medium | |
CN114390106B (en) | Scheduling method, scheduler and scheduling system based on Kubernetes container resources |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||