CN112306631A

CN112306631A - Method, controller and host for scheduling kubernets resources

Info

Publication number: CN112306631A
Application number: CN202011098833.8A
Authority: CN
Inventors: 马磊
Original assignee: Suzhou Inspur Intelligent Technology Co Ltd
Current assignee: Suzhou Inspur Intelligent Technology Co Ltd
Priority date: 2020-10-14
Filing date: 2020-10-14
Publication date: 2021-02-02

Abstract

The invention discloses a method, a controller and a host for kubernets resource scheduling, which comprises the following steps of S100, deploying a cluster environment and rating according to multi-node resources; s200, creating a new pod, judging whether node resources in a cluster corresponding to the new pod are sufficient currently, and scheduling according to the rating; s300, the new pod meets the resource condition and is successfully scheduled and operated; the invention reasonably schedules resources by combining dynamic resource scheduling with node resource integration, can efficiently utilize resources, improve the utilization rate of resources, can quickly run services, does not need to queue and wait to save time of customers, simultaneously does not release resources in a pod clearing mode, effectively saves services, ensures that the services can be continuously and effectively, and improves the use experience of the customers.

Description

Method, controller and host for scheduling kubernets resources

Technical Field

The invention relates to a method, a controller and a host for kubernets resource scheduling, and belongs to the technical field of artificial intelligence.

Background

The continuous development of the field of artificial intelligence is widely applied to a plurality of fields. Kubernets is widely applied to a deployment environment of a multi-node cluster as a brand-new container technology-based distributed architecture leading scheme, and in the current kubernets scheduling mode, when resources are insufficient, tasks can continue to queue until enough resources are available, and can be scheduled for use; or the low-level pod clears the released resources, so that neither new service requirements can be issued, nor services possibly used can be deleted by mistake, and finally bad influence is caused.

Disclosure of Invention

In view of the above technical problems, the present invention aims to: a method, a controller and a host for dispatching kubernets resources are provided, so that when the resources are insufficient, the resources can be reasonably dispatched on the premise of not clearing pod.

The technical solution of the invention is realized as follows: a method for kubernets resource scheduling includes the following steps,

s100, deploying a cluster environment, and rating according to multi-node resources;

s200, creating a new pod, judging whether node resources in a cluster corresponding to the new pod are sufficient currently, and scheduling according to the rating;

and S300, the new pod meets the resource condition and is successfully scheduled and operated.

Preferably, the multi-node resource rating level includes the original resource quantity and the comprehensive rating of each performance index of the CPU, the GPU, the MEM, and the disk, and the multi-node is rated as a high level, a medium level, and a low level.

Preferably, when the resources are sufficient, the new pod is scheduled to run on the high-level node resources.

Preferably, when the resources of the high-level node are insufficient, the new pod is scheduled to the middle-level node to run according to the node level from high to low.

Preferably, when the situation that resources are insufficient is generated by traversing all the nodes, the original pod already running on the high-level node is judged, the original pod is graded and evaluated, the resources allocated by the original pod of the low level are dynamically deleted, and partial resources are sparsely released.

Preferably, the high-level node recovers the released resources, integrates the remaining resources of the high-level node, and judges whether the resources meet the requirement of new pod operation again, and if the resources are sufficient, the resources are scheduled to the high-level node for operation.

Preferably, when the original pod in the high-level node is still insufficient to schedule the new pod after releasing the resources through dynamic pruning but can run to the nodes of the middle and low levels, the original pod is scheduled from the high-level node to the nodes of the middle and low levels, and the resources of the high-level node are released at the same time.

Preferably, after the high-level node releases the resources, traversing the node resources again, and determining whether the resource condition required to be met by the new pod operation is met, if so, scheduling the node to the high-level node after releasing the resources for operation, and if not, continuing to dynamically delete the node to release the resources until the node has enough resources for operation.

A controller for kubernets resource scheduling comprises a plurality of controllers with different functions, and the controller executes the method for kubernets resource scheduling.

A host of kubernets resource scheduling, comprising a plurality of hosts for managing a cloud platform, the hosts performing a method of kubernets resource scheduling of claim.

Due to the application of the technical scheme, compared with the prior art, the invention has the following advantages:

according to the method, the controller and the host for kubernets resource scheduling, resources are reasonably scheduled in a mode of combining dynamic scheduling of resources and node integration of resources, the resources can be efficiently utilized, the utilization rate of the resources is improved, services can be rapidly operated, queuing is not needed, and time of customers is saved. Meanwhile, resources cannot be released in a pod clearing mode, the service is effectively stored, the service can be continuously and effectively, and the use experience of a client is improved.

Drawings

The technical scheme of the invention is further explained by combining the accompanying drawings as follows:

fig. 1 is a schematic flow chart of a method for kubernets resource scheduling according to the present invention.

Detailed Description

The invention is described below with reference to the accompanying drawings.

Example one

Fig. 1 shows a method for kubernets resource scheduling according to the present invention, which includes the following steps,

s100, deploying a cluster environment, and rating according to multi-node resources; the multi-node resource evaluation level comprises the original resource quantity and the comprehensive evaluation of each performance index of a CPU, a GPU, MEM and a disk, and the multi-node is evaluated as a high level, a medium level and a low level;

s200, creating a new pod, judging whether node resources in a cluster corresponding to the new pod are sufficient currently, and scheduling according to the rating; when the resources are sufficient, scheduling the new pod to the high-level node resources for operation; when the resources of the nodes at the high level are insufficient, scheduling the resources to the nodes at the middle level for operation by the new pod according to the sequence from the high level to the low level; when the situation that resources are insufficient is generated in all the nodes in traversal, judging the resource calling state of an original pod which is operated on a high-level node, carrying out hierarchical evaluation on the original pod, dynamically deleting the resources distributed by the original pod of a low level, and sparsely releasing partial resources; the high-level node recovers the released resources, integrates the residual resources of the high-level node, judges whether the resources meet the resource conditions required to be met by the new pod operation again, and dispatches the resources to the high-level node for operation if the resources meet the resource conditions; when the original pod in the high-level node releases resources through dynamic deletion, the original pod is not enough to be scheduled to a new pod, but can be operated to the nodes of middle and low levels, the original pod is scheduled to the nodes of middle and low levels from the high-level node; traversing the node resources again after the high-level node releases the resources, judging whether the resource conditions required to be met by the new pod operation are met, if so, scheduling the node to be operated after the resources are released, otherwise, continuing to dynamically delete the released resources until the node can have enough resources to operate

Example two

A method for dispatching kubernets resources comprises the following steps that S100, a cluster to which a current environment belongs is deployed, a plurality of nodes including a master, a node and the like are contained in the cluster, a user can add node grade labels in a configuration file according to deployment conditions, grades of the nodes are configured according to hardware conditions such as node resource configuration and performance data after deployment, and the nodes can be divided into different priority grades such as high, medium and low.

S200, when a user issues a service, a new pod is created in the bottom layer to first determine whether the current resource is sufficient? And at the moment, whether the resources of the high-level node are sufficient or not is preferentially judged, if so, the high-level node is directly scheduled to operate, and the new pod can be ensured to well utilize the high-efficiency resources.

S300, when the resources of the high-level node are judged to be insufficient during the creation of the new pod, whether the resources of the middle-level node are sufficient is judged according to the priority sequence from high to low, and when the resources are sufficient, the middle-level node is scheduled to run, so that the task can be guaranteed to run normally.

S400, when the new pod is created and all node resources are judged to be insufficient, the operated original pod on the high-level node is judged preferentially, the operated original pod is graded, and the performance indexes such as the CPU, the GPU and the mem are judged comprehensively according to the operation duration of the operated original pod.

S500, classifying the original pod according to the judgment condition, wherein the low-level pod can dynamically release the allocated resources, such as CPU, GPU, mem and the like, the original pod can continuously keep running by using the minimized resources, after the allocated resources are dynamically released, the current node recovers the released resources, whether the created new pod resource quota is met is judged, and if the created new pod resource quota is met, the node is scheduled to run.

S600, when the low-level original pod is used for dynamically releasing resources and the resources are recycled by the high-level node, the new pod judges that the resources do not meet the scheduling, and at the moment, the low-level original pod of the released resources judges whether enough resources can run on the middle-level node and the low-level node according to the released resource quota; and if the node can operate, scheduling the low-level original pod of the minimum resource to the middle-level node and the low-level node, releasing the resource of the high-level node, recovering the resource again, judging whether the resource is enough again by the new pod, and directly scheduling the node to operate at the high level when the resource is enough.

S700, when the low-level node of the high-level node dynamically releases and drives away the high-level node and the released resource still does not meet the scheduling resource, traversing the resources in the steps S500 and S600 again until the resource is sufficient, meeting the scheduling condition and being successfully scheduled.

EXAMPLE III

In order to protect the invention more fully, the application also protects a controller for kubernets resource scheduling, which comprises a plurality of controllers with different functions, and the controller executes the method for kubernets resource scheduling.

Example four

In order to protect the invention more fully, the application also protects a host of kubernets resource scheduling, which comprises a plurality of hosts for managing the cloud platform, and the host executes the method of kubernets resource scheduling.

The above-mentioned embodiments are merely illustrative of the technical idea and features of the present invention, and the purpose thereof is to enable those skilled in the art to understand the contents of the present invention and implement the present invention, and not to limit the scope of the present invention, and all equivalent changes or modifications made according to the spirit of the present invention should be covered in the scope of the present invention.

Claims

1. A method for resource scheduling of kubernets is characterized in that: comprises the following steps of (a) carrying out,

2. The method of claim 1, wherein: the multi-node resource evaluation level comprises the original resource quantity and the comprehensive evaluation of each performance index of a CPU, a GPU, MEM and a disk, and the multi-node is evaluated to be a high level, a medium level and a low level.

3. The method of claim 2, wherein: and when the resources are sufficient, scheduling the new pod to run on the high-level node resources.

4. The method of claim 2, wherein: and when the resources of the high-level nodes are insufficient, scheduling the resources to the middle-level nodes for operation by the new pod according to the node level from high to low.

5. The method of claim 2, wherein: when the situation that resources are insufficient is generated by traversing all the nodes, the original pod which is operated on the high-level node is judged, the original pod is graded and evaluated, the resources which are distributed by the original pod with the low level are dynamically deleted, and partial resources are sparsely released.

6. The method of claim 5, wherein: and the high-level node recovers the released resources, integrates the residual resources of the high-level node, judges whether the resources meet the requirement of new pod operation again, and dispatches the resources to the high-level node for operation if the resources are sufficient.

7. The method of claim 5, wherein: when the original pod in the high-level node releases resources through dynamic deletion, the new pod is still not enough to be scheduled, but the original pod can be scheduled to the nodes of the middle and low levels, and the resources of the high-level node are released at the same time.

8. The method of claim 7, wherein: and traversing the node resources again after the high-level node releases the resources, judging whether the resource conditions required to be met by the new pod operation are met, if so, scheduling the node to the high-level node after the resources are released, and if not, continuing dynamically deleting the node to release the resources until the node has enough resources to operate.

9. A controller for kubernets resource scheduling, comprising a plurality of controllers having different functions, characterized in that: the controller performing a method of kubernets resource scheduling of any of claims 1-8.

10. A host of kubernets resource scheduling, comprising a plurality of hosts for managing a cloud platform, characterized by: the host performing a method of kubernets resource scheduling of any of claims 1-8.