CN112363827A

CN112363827A - Multi-resource index Kubernetes scheduling method based on delay factors

Info

Publication number: CN112363827A
Application number: CN202011162367.5A
Authority: CN
Inventors: 李克文; 吴雪锋; 张达; 杨建涛
Original assignee: China University of Petroleum East China
Current assignee: China University of Petroleum East China
Priority date: 2020-10-27
Filing date: 2020-10-27
Publication date: 2021-02-12

Abstract

The invention discloses a multi-resource index Kubernets scheduling method based on delay factors, which is characterized in that the priority of a cloud computing task is evaluated according to a resource request of the cloud computing task and a specific evaluation criterion; setting scheduling delay for the cloud computing task according to the cloud computing task priority and the computing resource state; inserting the cloud computing task into a delay sequence according to the scheduling delay of the cloud computing task; and performing multi-resource index Node matching of dynamic weight according to the resource occupancy rate of the cloud computing task and the Node resource idle state, and distributing the cloud computing task to the optimal Node according to the matching degree. According to the cloud computing resource scheduling method and device, efficient resource scheduling of the cloud computing tasks for different types of resources is achieved through multi-resource index Node matching of the delay factors and the dynamic weights, timely response of the cloud computing tasks is guaranteed, the utilization rate of the cloud computing resources is improved, and load balancing among the resources is achieved.

Description

Multi-resource index Kubernetes scheduling method based on delay factors

Technical Field

The invention belongs to the technical field of cloud computing, and particularly relates to a multi-resource index Kubernetes scheduling method based on a delay factor.

Background

The cloud computing technology is the trend under the current wave and has huge market prospect. Cloud computing is going deep into every corner of life, leading to a thorough revolution in various industries. Kubernets is a container cloud arrangement management system which is most widely applied in the field of cloud computing, and is an important support technology of cloud computing services.

The traditional Kubernetes scheduling method carries out Node scheduling according to two resource indexes of a Node memory and a CPU, resources related to a cloud computing task comprise the CPU and the memory, and also comprise bandwidth, a magnetic disc and a GPU, and the traditional scheduling method cannot meet the resource scheduling requirement of multiple resource indexes in a cloud computing scene. In addition, the resource request response of the cloud computing task is time-ordered, while the traditional Kubernetes scheduling belongs to static resource scheduling, and resource requests in a period of time in the future are not fully considered, so that the problems that the computing task cannot be responded in time, the resource utilization rate is low, the resource load is unbalanced and the like can be caused.

Disclosure of Invention

In order to overcome the defects of the traditional Kubernetes scheduling method, the invention provides a multi-resource index Kubernetes scheduling method based on delay factors, which fully considers resource requests in a period of time in the future, sets scheduling delay for computing tasks according to task priorities and computing resource states, carries out multi-resource index Node matching of dynamic weights aiming at cloud computing tasks of various resource indexes, realizes efficient resource scheduling of the cloud computing tasks, ensures timely response of the cloud computing tasks, improves the utilization rate of the cloud computing resources and realizes load balance among the resources.

In order to achieve the purpose, the technical scheme of the invention is as follows:

A. evaluating the priority of the cloud computing task: calculating the required resource ratio s according to the CPU, memory, bandwidth, disk and GPU resource requests of the cloud computing task_iAccording to s_iPriority p of cloud computing task according to specific evaluation criterion_iEvaluation was carried out.

B. And (3) cloud computing task scheduling delay setting: and setting scheduling delay for the cloud computing task according to the cloud computing task priority and the computing resource state. Longer scheduling delays are set for low priority tasks and shorter or no delays are set for high priority tasks. Longer scheduling delays are set when computing resources are tight, and shorter or no delays are set when computing resources are abundant.

C. Cloud computing task delay: and inserting the cloud computing task into the delay sequence according to the scheduling delay of the cloud computing task, and waiting for the delay to finish the scheduling of the cloud computing task Node.

D. Scheduling a cloud computing task Node: node preselection is carried out according to the cloud computing task resource request, and nodes which do not meet the requirement are excluded; performing dynamic-weight multi-resource index Node matching in the alternative nodes, wherein the resource occupancy rate of a cloud computing task determines the dynamic weight size of a matching function, and the idle state of each resource of the nodes determines the matching degree; and distributing the cloud computing task to the optimal Node according to the matching degree, and if the distribution fails, re-entering a delay queue.

The invention has the beneficial effects that: resource requests in a period of time in the future are fully considered, computing task delay scheduling according to task priorities and computing resource states is achieved, dynamic-weight multi-resource-index Node matching is achieved for cloud computing tasks with multiple resource indexes, efficient resource scheduling of the cloud computing tasks is achieved, timely response of the cloud computing tasks is guaranteed, the utilization rate of the cloud computing resources is improved, and load balancing among resources is achieved.

Drawings

FIG. 1 is a model architecture diagram of the present invention

Detailed Description

A. Evaluating the priority of the cloud computing task: and evaluating the priority of the cloud computing task according to the specific resource request condition of the cloud computing task. Calculating the required resource ratio s according to the resource request of the cloud computing task_i，s_iThe calculation formula is as follows:

s_i＝(s_cpu+s_mem+s_bandwidth+s_disk+s_gpu)/5，

wherein, the cpu_i、mem_i、bandwidth_i、disk_i、gpu_iRespectively representing the computing resources requested by the cloud computing task i. CPU (Central processing Unit)_average、mem_average、bandwidth_average、disk_average、gpu_averageRespectively representing the average of the computing resources of all nodes. s_cpu、s_mem、s_bandwidth、s_disk、s_gpuRespectively representing the respective resource occupation ratios of the computing resources required by the cloud computing task i.

According to s_iPriority p of cloud computing task according to specific evaluation criterion_iAnd (4) evaluating, wherein evaluation criteria are as follows: s_iWhen < 5%, p_iIs 1; s is more than or equal to 5%_iWhen < 20%, p_iIs 2; s is more than or equal to 20%_iWhen < 35%, p_iIs 3; s is more than or equal to 35%_iWhen < 50%, p_iIs 4; s_iWhen p is more than or equal to 50 percent_iIs 5; p is a radical of_iThe larger the task priority.

B. And (3) cloud computing task scheduling delay setting: due to the time sequence of the cloud computing task request, the following two situations are considered: r₁For a cloud computing resource task request with priority 3, R₂For a cloud computing resource task request with priority 4,

(1) for R in case of shortage of computing resources₁、R₂Carry out Node matching and satisfy R₂The matched Node of the requirement is the Node₁Satisfy R₁The matched Node of the requirement is the Node₁、Node₂Corresponding to a degree of matching of m₁、m₂(m₁＞m₂). If T is at time R₁When coming, the system distributes it to Node₁Then T + T time R₂Coming, Node₁Resource deficiency, R₂No response is available.

(2) In the same case for R₁、R₂Conducting Node matching, R₁、R₂All the optimally matched nodes are nodes₁Corresponding to a degree of matching of m₃、m₄(m₃＜m₄)，R₁、R₂All the sub-optimal matching nodes are nodes₂Corresponding to a degree of matching of m₅、m₆(m₅＞m₆). If T is at time R₁When coming, the system distributes it to Node₁Time T + T R₂When coming, the system distributes it to Node₂。

Obviously, the traditional allocation scheme has various defects in the time sequence of cloud computing, and cannot ensure the timely response of cloud computing tasks, and cannot meet the efficient utilization of resources of service clusters and the load balance among the resources. In order to avoid the problems, scheduling delay is set for the cloud computing task according to the cloud computing task priority and the computing resource state. Longer scheduling delays are set for low priority tasks and shorter or no delays are set for high priority tasks. Longer scheduling delays are set when computing resources are tight, and shorter or no delays are set when computing resources are abundant. The delay calculation formula is as follows:

wherein t is_iThe delay set for task i, t is one unit time, q is the total resource usage,

the total occupation ratio of each computing resource.

D. Scheduling a cloud computing task Node: node preselection is carried out according to the cloud computing task resource request, and nodes which do not meet the requirement are excluded; and performing dynamic-weight multi-resource index Node matching in the alternative nodes, wherein the resource occupancy rate of the cloud computing task determines the dynamic weight of a matching function, and the idle state of each resource of the nodes determines the matching degree. The matching function calculation formula is as follows:

wherein, w_cpu、w_mem、w_bandwidth、w_disk、w_gpuIs a dynamic weight, w, of five resources_cpu：w_mem：w_bandwidth：w_disk：w_gpu＝s_cpu：smem：s_bandwidth：s_disk：s_gpuAnd is

Is the idle ratio of each computing resource in the Node.

And distributing the cloud computing task to the optimal Node according to the matching degree, and if the distribution fails, re-entering a delay queue.

The foregoing is only a preferred embodiment of this invention and any person skilled in the art may use the above-described solutions to modify or change the same into equivalent embodiments with equivalent variations. Any simple modification, change or amendment to the above-mentioned embodiments according to the technical solutions of the present invention without departing from the technical solutions of the present invention belong to the protection scope of the technical solutions of the present invention.

Claims

1. A multi-resource index Kubernets scheduling method based on delay factors is characterized by comprising the following steps:

A. evaluating the priority of the cloud computing task:

calculating the required resource ratio s according to the CPU, memory, bandwidth, disk and GPU resource requests of the cloud computing task_iAccording to s_iPriority p of cloud computing task according to specific evaluation criterion_iCarrying out evaluation;

B. and (3) cloud computing task scheduling delay setting:

and setting scheduling delay for the cloud computing task according to the cloud computing task priority and the computing resource state. Longer scheduling delays are set for low priority tasks and shorter or no delays are set for high priority tasks. Setting longer scheduling delay when the computing resources are short, and setting shorter delay or not when the computing resources are abundant;

C. cloud computing task delay:

inserting the cloud computing task into a delay sequence according to the scheduling delay of the cloud computing task, and scheduling the cloud computing task Node after the delay is finished;

D. scheduling a cloud computing task Node:

node preselection is carried out according to the cloud computing task resource request, and nodes which do not meet the requirement are excluded; performing dynamic-weight multi-resource index Node matching in the alternative nodes, wherein the resource occupancy rate of a cloud computing task determines the dynamic weight size of a matching function, and the idle state of each resource of the nodes determines the matching degree; and distributing the cloud computing task to the optimal Node according to the matching degree, and if the distribution fails, re-entering a delay queue.