CN110780998A

CN110780998A - Kubernetes-based dynamic load balancing resource scheduling method

Info

Publication number: CN110780998A
Application number: CN201910933130.3A
Authority: CN
Inventors: 陈晶; 何琨; 杜瑞颖; 叶琼州
Original assignee: Wuhan University WHU
Current assignee: Wuhan University WHU
Priority date: 2019-09-29
Filing date: 2019-09-29
Publication date: 2020-02-11

Abstract

The invention discloses a dynamic load balancing resource scheduling method based on Kubernetes, which improves a dynamic load balancing algorithm by combining the characteristics of a Kubernetes scheduling system, divides the scheduling algorithm into a static scheduling part and a dynamic scheduling part, and can still better maintain the load balancing of the system when the cluster environment changes. The Kubernetes container resource scheduling strategy provided by the invention solves the problems that the Kubernetes scheduling strategy is single, and the reasonable scheduling from the container to the machine node in the cluster can not be realized according to the constraint scheduling requirements of the container application on specific factors such as system kernels, network transmission speed and the like. And the dynamic load balancing scheduling strategy of migrating the Pod running on a certain working node to another new working node with higher matching degree with the scheduling strategy is realized.

Description

Kubernetes-based dynamic load balancing resource scheduling method

Technical Field

The invention belongs to the technical field of cloud computing, relates to a dynamic load balancing resource scheduling method, and particularly relates to a Kubernetes-based container resource scheduling method.

Background

The rapid development of cloud computing, big data and mobile technology and the continuous change of business requirements of enterprises lead to that the enterprise architecture needs to be changed at any time to meet the business requirements, and keep pace with the technology updating. These burdens will, of course, be placed on enterprise developers. How to coordinate efficiently, deliver products quickly, deploy applications quickly and meet business requirements of enterprises among teams is a problem which needs to be solved urgently by developers. The Docker technique may help developers solve these problems.

Docker is an open source application container engine, so that developers can package their applications and dependency packages into a portable container, and then distribute the container to any popular Linux machine, and also realize virtualization. The containers use a sandbox mechanism entirely without any interface between each other.

Kubernets is built on Docker container technology, provides an integral solution for containerization application for users, has strong container arrangement capacity, follows micro-service architecture theory, and is open source, and has become the most popular open source container cluster scheduling system of Docker ecosphere nowadays. Kubernetes uses Docker to pack, instantiate and run application programs, runs and manages containers across hosts in a cluster mode, and solves the communication problem between the containers running among different hosts. The Scheduler is a Scheduler loaded and operated in the Kubernetes container cluster management system, and is responsible for collecting, statistically analyzing resource usage of all nodes in the container cluster management system, and then allocating the newly-built Pod to an available Node with the highest priority for building according to the resource usage.

The main consideration factors of the Kubernetes scheduler in the prior art are CPU and memory when resource scheduling is carried out, but the resource scheduling mode ignores the influence of network factors on the startup and operation of Pod on a host. After the Pod to be scheduled and the host are bound by the Scheduler component of the control node, the host needs to consider the following two factors to start and normally operate the Pod:

1. the host computer needs to download the image files of all containers in the Pod to the network address specified by the Pod resource description file, and the network transmission speed between the host computer and the image storage system is directly related to the starting speed of the Pod;

the data in the Pod is temporary, and when the Pod is destroyed, the data in the Pod is lost, so the Pod needs to persist the data by means of a data volume. Therefore, after the Pod is started and operated, the Pod also needs to mount a persistent storage system for data access, and the IO speed of an application operated in the Pod is directly influenced by the data transmission speed between the host and the persistent storage.

The Kubernetes carries out resource scheduling according to the resource use condition reported by each working node and the resource amount requested during Pod creation. From the time a Pod is scheduled by the Scheduler onto the appropriate working node to the end of its lifecycle, the Pod will not migrate between working nodes. However, as time progresses, the resources on the worker nodes change, and the Pod is created and deleted, the scheduling choices made by the scheduler may not be available at this point in time. In an actual application scenario, a Pod running at a certain working node often needs to be migrated to a new working node with a higher matching degree with a scheduling policy.

Disclosure of Invention

The invention aims to provide a Kubernetes container resource scheduling method, which solves the problems that the Kubernetes scheduling strategy is single, and reasonable scheduling from a container to a machine node in a cluster cannot be realized according to specific factors such as system kernel, network transmission speed and the like in container application; and the dynamic load balancing scheduling of migrating the Pod running on a certain working node to another new working node with higher matching degree with the scheduling strategy is realized.

The technical scheme adopted by the invention is as follows: a dynamic load balancing resource scheduling method based on Kubernetes adopts a Kubernetes container cluster management system; the system loads and operates a plurality of nodes, wherein the nodes are working nodes of a Kubernetes container cluster;

characterized in that the method comprises the following steps:

step 1: a working node in the Kubernetes container cluster management system reports the resource use condition to a control node;

step 2: creating a Pod on a control node, indicating Pod requirements in a resource description file of the Pod, and placing the Pod requirements into a to-be-scheduled Pod queue;

and step 3: taking out the Pod from the Pod queue to be scheduled by the scheduler, and selecting the most appropriate node for the Pod according to the resource description file of the Pod;

and 4, step 4: scheduling and running the Pod to be scheduled on the target node;

and 5: the monitoring program regularly collects the performance information of the Pod and the host machine and stores the information into the persistent database etcd;

step 6: reading the performance data of the Pod and the host machine thereof from the persistent database etcd, performing processing operation, and feeding back the ratio of the CPU utilization rate, the memory utilization rate, the mirror image average transmission speed and the mirror image network load average value of the cluster to the scheduler after the processing operation;

and 7: and the scheduler dynamically adjusts the resources according to the integral load information of the cluster.

Compared with the existing scheduling strategy, the invention has the advantages that:

the invention improves the resource model of Kubernetes, and provides resource models of network bandwidth, storage space and the like added on the basis of CPU and memory as the weighing factors of scheduling;

2. a load balancing method based on double load queues is designed, a high load queue and a low load queue are realized in a binary heap mode, and through periodically detecting the load condition of a cluster, some Pod of nodes with higher load are migrated to nodes with lower load so as to ensure the load balancing of the cluster;

3. aiming at the defect that Kubernets do not support dynamic scheduling, the invention improves a dynamic load balancing algorithm by combining the characteristics of a Kubernets scheduling system, divides the scheduling algorithm into a static scheduling part and a dynamic scheduling part, and can still better maintain the load balancing of the system when the cluster environment changes.

Drawings

FIG. 1 is a method schematic of an embodiment of the invention;

FIG. 2 is a diagram illustrating a dynamic scheduling trigger condition according to an embodiment of the present invention;

FIG. 3 is a flow chart of dynamic control in an embodiment of the present invention;

fig. 4 is a diagram of a dynamic scheduling process in an embodiment of the present invention.

Detailed Description

In order to facilitate the understanding and implementation of the present invention for those of ordinary skill in the art, the present invention is further described in detail with reference to the accompanying drawings and examples, it is to be understood that the embodiments described herein are merely illustrative and explanatory of the present invention and are not restrictive thereof.

The invention provides a dynamic load balancing resource scheduling method based on Kubernetes, which adopts a Kubernetes container cluster management system; the system loads and operates a plurality of nodes, wherein the nodes are working nodes of a Kubernetes container cluster; the invention can improve the scheduling strategy of the scheduler in the prior Kubernets system, and can ensure that the Pod can be scheduled to the available Node (Node) with the highest priority, thereby improving the overall performance of the Kubernets container cluster, particularly the dynamic scheduling strategy, and realizing the load balance of the whole cluster resource.

Referring to fig. 1, the dynamic load balancing resource scheduling method based on Kubernetes provided by the present invention includes the following steps:

the most core component of the Kubernetes resource scheduling module is a Scheduler module on a control Node (Master Node), the module is responsible for managing Pods to be scheduled and all available nodes, establishing a high-load queue and a low-load queue according to cluster load information stored on a persistent database etcd, selecting the best host for the Pods to be scheduled according to a scheduling algorithm and a scheduling policy, binding the Pods and the nodes, and storing binding information into the persistent database etcd.

Step 2: creating a Pod on a control node, indicating Pod requirements in a resource description file of the Pod, and placing the Pod requirements into a to-be-scheduled Pod queue; the Pod requirements comprise the amount of applied resources, mounted storage volumes and the like;

and step 3: taking out the Pod from the Pod queue to be scheduled by the scheduler, and selecting a scheduling algorithm to select the most appropriate node for the Pod according to the resource description file of the Pod;

selecting the most appropriate node for the Pod in the embodiment, which is specifically implemented by selecting the most appropriate node for the Pod by adopting static scheduling or selecting the most appropriate node for the Pod by adopting dynamic scheduling;

static scheduling, when the Pod queue to be scheduled is not empty, taking out the Pod (scheduling the Pod in the Pod queue to be scheduled to a proper node according to a designed scheduling strategy according to a first-in first-out sequence) from the queue to select the most proper host;

and (3) dynamically scheduling, wherein the monitor regularly feeds back the relevant information of the host and the running task to the persistent database etcd, the scheduler reads data from the persistent database etcd, dynamically schedules according to the overall load condition of the cluster, and migrates some Pod of the nodes with the load higher than the preset value to the nodes with the load lower than the preset value.

step 6: reading the performance data of the Pod and the host thereof from the persistent database etcd, and performing processing operation (taking the load average value of each resource in the system and the load ratio of the node as each resource score of the node, and judging the resource condition of the node according to the comprehensive score, aiming at comprehensively considering four factors of the downloading speed among the CPU, the memory and the mirror image warehouse and the data transmission speed among the persistent storage system to measure the load state of the host), and feeding back the ratio of the CPU utilization rate, the memory utilization rate, the mirror image average transmission speed and the mirror image network load average value of the cluster to a scheduler after processing;

In this embodiment, to implement a preferred policy based on dynamic load balancing, a LoadBalancePriority policy is designed, which is different from a kubernets default policy, and the LoadBalancePriority policy takes a load average value of each resource in a system and a load ratio of a node as each resource score of the node, calculates a comprehensive score according to the resource scores, and evaluates a resource condition of the node according to the comprehensive score.

In Kubernetes, the preferred strategy for the default algorithm is to take the arithmetic mean of the individual resource scores as the composite score for the node. However, in the cluster environment, different types of resource use conditions have different influences on the performance of the host, different types of applications have different requirements on resources, some applications are high-CPU-consumption type applications, some applications are high-memory-consumption type applications, and even some applications do not need to perform data interaction with the mirror image storage system and the persistent data storage system. Therefore, a weight factor is introduced here to represent the degree of influence of each resource on the total score of the node, and the greater the influence of a certain resource on the operation of Pod, the greater the weight factor of the resource. The weight factor vector is represented as:

Δ＝(λ _cpu，λ _memory，λ _imageNet，λ _dataNet)

wherein

λ _cpu+λ _memory+λ _imageNet+λ _dataNet＝1

Each element of the vector represents a degree of contribution of the corresponding resource to the node composite score.

Composite score of node score _iThe calculation formula is as follows:

score _i＝10×[λ _cpu(cpuScore _i)+λ _memory(memoryScore _i)+λ _imageNet(imageNetScore _i)+λ _dataNet(dataNetScore _i)]

wherein the CPU score of the ith node is cpuScore _iEqual to the ratio of the CPU utilization of this host to the average CPU load of the cluster.

For the CPU and the memory, the value of the corresponding item in the formula is more than 0, which means that the load of the resource is less than the load average value of the cluster, and the value of the corresponding item in the formula is less than 0, which means that the load degree is inferior to the average condition of the cluster; for the mirror network transmission speed and the data network transmission speed, a value greater than 0 indicates that the average network transmission speed is faster than the average network transmission speed of the cluster, and conversely, the average network transmission speed is slower than the average network transmission speed of the cluster.

When the comprehensive score is greater than 0, the comprehensive load condition of the node is superior to the comprehensive load average condition of the cluster, and the node can be taken as a task immigration object; and when the comprehensive score is less than 0, the comprehensive load of the node is poor relative to the comprehensive load average condition of the cluster, and the node can be taken as a task emigration object and a part of Pod is selected for rescheduling.

And (3) realizing a load queue:

the idea of the invention for realizing dynamic load balancing is as follows: two load queues are established in the scheduler: and the high-load queue and the low-load queue divide all the nodes of the cluster into two queues according to the comprehensive score calculated by the LoadBalanancepriority, wherein the score of more than 0 indicates that the comprehensive load is idle relative to the cluster and is stored in the low-load queue, and the score of less than 0 indicates that the comprehensive load is heavy relative to the cluster and is stored in the high-load queue. And migrating some Pod of the nodes with higher load to the nodes with lower load by periodically detecting the load condition of the cluster so as to ensure the load balance of the cluster.

In the load queue, the scores of the nodes are sequentially calculated, the nodes are inserted into the queue, the node with the highest score (or the node with the lowest score) is searched, and the node is deleted. Therefore, the load queue is realized by adopting a priority queue, the priority queue is different from a common queue, the common queue is first-in first-out, the positions of elements in the queue can be dynamically adjusted by each insertion and deletion operation in the priority queue, and the node with the highest (or lowest) weight in the queue can be removed by each deletion operation.

Corresponding to the three basic operations in the load queue, insertion, deletion and search are required in the priority queue. There are three ways to implement priority queues: ordered tables, unordered tables, and binary heaps. The time complexity of the ordered list insertion is O (n), and the time complexity of the deletion is O (1), so that the method is suitable for queues with more insertion operations than deletion operations; the time complexity of the insertion and deletion of the unordered table is opposite to that of the ordered table, and the method is suitable for deleting the queue dominated by operation. In the load queue, the insertion and deletion operations are frequent, so the design adopts a third mode, namely a binary heap to realize the priority queue, the time complexity of the insertion and deletion of the priority queue realized by the binary heap is O (log2n), and the load queue realized by the binary heap can balance the complexity of the algorithm.

The high load queue and the low load queue need to carry out insertion, searching and deletion operations, the high load queue stores the nodes with the comprehensive scores smaller than 0 in the queue in an ascending order, and the low load queue stores the nodes with the comprehensive scores larger than 0 in a descending order.

In the current version of Kubernetes, the main reasons for triggering rescheduling are:

1, Pod operation is abnormal; the Pod terminates its operation at the host for various reasons (e.g., is not compatible with other pods in the Node) by the Kublet, recovering its occupied resources.

2, Node operation is abnormal; if the storage volume of the Pod is mounted in a distributed file system such as a GlusterFS, when the Node stops working due to power failure or damage, the Master Node extracts the Pod related information stored in the network file system and the etcd and recovers the pods on other nodes.

However, in the dynamic load balancing policy, this is obviously insufficient, as shown in fig. 2, two ways of triggering dynamic scheduling are added to the original kubernets way of the design:

1. triggering an external event; the external event may be, in addition to the above-described abnormal operation of Pod or Node, a reason such as cluster capacity expansion and capacity reduction, and may be dynamically scheduled according to an increase or decrease in the number of available nodes of the cluster.

2.Node overload protection; the load condition of each node is periodically detected through a timer, when the load of the nodes in the cluster is too heavy, rescheduling is triggered, and the over-loaded node part Pod is migrated out.

The specific implementation steps of the dynamic control are shown in fig. 3:

step 7.1: initializing a system, and setting parameters such as system resource specification coefficients, required weight factors, dynamic scheduling thresholds, load information collection periods and the like;

step 7.2: the monitor regularly acquires the load information of each working node according to a set period, the period is generally set to be 8-60 seconds, and the collected load information is stored in a persistent database etcd;

step 7.3: the control node reads the load information of the persistent database etc, and calculates the load mean value, each resource score and the comprehensive score of each working node;

step 7.4: a scheduler of the control node establishes a high-load queue and a low-load queue in a binary heap mode according to the comprehensive score;

step 7.5: and completing resource scheduling by the scheduler, and migrating the part of the Pod (sequentially migrating the Pod on the high-load queue to the low-load queue in a polling mode, and removing the node from the high-load queue when the score of the node is changed after migration) on the node with the comprehensive score lower than the lower threshold to the node with relatively lower load.

In this embodiment, the dynamic scheduling may be triggered by an external event, such as: pod or Node exceptions, cluster expansion and contraction, external control commands, etc.

The specific implementation of dynamic scheduling is divided into 4 steps: sorting, preselecting, building a team and scheduling, as shown in fig. 4, a scheduler reads load information of all available nodes from a persistent database etcd at regular time and calculates a comprehensive score, the nodes are divided into two queues according to the score condition, the nodes of each queue are filtered out some nodes which do not meet requirements through a preselection process, a high-load queue and a low-load queue are built for the filtered nodes according to the score value, and the scheduler selects a plurality of Pod from the head of the high-load queue to migrate to the nodes of the low-load queue, so that the overall load balance of the system can be maintained through a dynamic adjustment mode.

The invention discloses a dynamic load balancing resource scheduling strategy based on Kubernetes (an open source container cluster arrangement system). The invention improves the resource model of Kubernetes, and provides the resource models of network bandwidth, storage space and the like added on the basis of CPU and memory as the measurement factors of scheduling; through a load balancing method based on double load queues, through periodically detecting the load condition of a cluster, some Pods (Pod is the minimum unit which can be scheduled by a Kubernets container cluster management system) of nodes with higher load are migrated to nodes with lower load, so as to ensure the load balance of the cluster; aiming at the defect that Kubernets do not support dynamic scheduling, the invention improves a dynamic load balancing algorithm by combining the characteristics of a Kubernets scheduling system, divides the scheduling algorithm into a static scheduling part and a dynamic scheduling part, and can still better maintain the load balancing of the system when the cluster environment changes. The Kubernetes container resource scheduling strategy provided by the invention solves the problems that the Kubernetes scheduling strategy is single, and the reasonable scheduling from the container to the machine node in the cluster can not be realized according to the constraint scheduling requirements of the container application on specific factors such as system kernels, network transmission speed and the like. And the dynamic load balancing scheduling strategy of migrating the Pod running on a certain working node to another new working node with higher matching degree with the scheduling strategy is realized.

It should be understood that parts of the specification not set forth in detail are well within the prior art.

It should be understood that the above description of the preferred embodiments is given for clarity and not for any purpose of limitation, and that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A dynamic load balancing resource scheduling method based on Kubernetes adopts a Kubernetes container cluster management system; the system loads and operates a plurality of nodes, wherein the nodes are working nodes of a Kubernetes container cluster;

characterized in that the method comprises the following steps:

2. The dynamic load balancing resource scheduling method based on Kubernetes according to claim 1, characterized in that: the requirements in step 2 include the amount of resources requested and the mounted storage volume.

3. The dynamic load balancing resource scheduling method based on Kubernetes according to claim 1, characterized in that: selecting the most appropriate node for the Pod in the step 3, wherein the specific implementation comprises selecting the most appropriate node for the Pod by adopting static scheduling or selecting the most appropriate node for the Pod by adopting dynamic scheduling;

in the static scheduling, when the Pod queue to be scheduled is not empty, the pods in the Pod queue to be scheduled are scheduled to a proper node according to a designed scheduling strategy according to a first-in first-out sequence, and the pods are taken out from the node to select the most proper host;

and in the dynamic scheduling, the monitor regularly feeds back the relevant information of the host and the running task to the persistent database etcd, the scheduler reads data from the persistent database etcd, performs dynamic scheduling according to the overall load condition of the cluster, and migrates some Pod of the nodes with the load higher than the preset value to the nodes with the load lower than the preset value.

4. The dynamic load balancing resource scheduling method based on Kubernetes according to claim 1, characterized in that: and 6, performing the processing operation, wherein the load average value of each resource in the system and the load ratio of the node are used as each resource score of the node, and the resource condition of the node is judged according to the comprehensive score.

5. The Kubernetes-based dynamic load balancing resource scheduling method according to any one of claims 1-4, wherein the specific implementation of step 7 comprises the following sub-steps:

step 7.1: initializing a system and setting related parameters; the parameters comprise system resource specification coefficients, demand weight factors, dynamic scheduling thresholds and load information collection periods;

step 7.2: the monitor regularly acquires the load information of each working node according to a set period and stores the collected load information into a persistent database etcd;

step 7.3: the control node reads load information of the persistent database etcd, and calculates a load mean value, each resource score and a comprehensive score of each working node;

step 7.5: and completing resource scheduling by the scheduler, and migrating the part Pod on the node with the comprehensive score lower than the lower threshold value to the node with relatively low load.

6. The dynamic load balancing resource scheduling method based on Kubernetes according to any one of claims 1 to 4, characterized in that: in step 7, the dynamic scheduling is triggered by a timer, d or Node exception trigger, cluster capacity expansion and capacity reduction trigger or an external control command.