CN115297112A - Dynamic resource quota and scheduling component based on Kubernetes - Google Patents


Publication number
CN115297112A
Authority
CN
China
Prior art keywords
resource
node
pod
quota
scheduling
Prior art date
Legal status
Pending
Application number
CN202210912913.5A
Other languages
Chinese (zh)
Inventor
张贺
吕国骏
周鑫
荣国平
邵栋
Current Assignee
Nanjing Kuangji Information Technology Co ltd
Original Assignee
Nanjing Kuangji Information Technology Co ltd
Priority date
Filing date
Publication date
Application filed by Nanjing Kuangji Information Technology Co ltd
Priority to CN202210912913.5A
Publication of CN115297112A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/50 Testing arrangements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1001 Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004 Server selection for load balancing
    • H04L67/1008 Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a dynamic resource quota and scheduling component based on Kubernetes, which comprises: acquiring the historical resource usage of application services in a Kubernetes cluster; analyzing each application's resource usage with a combined ARIMA and LSTM prediction model and dynamically adjusting the quota; acquiring the actual usage of each resource on each Node; evaluating the fit between each Node and the Pod to be scheduled from multiple dimensions; and scheduling the Pod to the target Node. The component reworks the traditional Kubernetes resource quota mechanism, predicting Pod resource usage with a time-series prediction model and adjusting quotas dynamically; it also reworks the Kubernetes scheduling module, using each Node's actual resource usage to evaluate and schedule the Pod to the target Node across dimensions such as CPU, memory, network bandwidth, disk I/O and priority. This reduces the difficulty of determining resource usage when a Kubernetes user deploys an application, and alleviates the unbalanced scheduling of data-intensive and I/O-intensive applications.

Description

Dynamic resource quota and scheduling component based on Kubernetes
Technical Field
The invention belongs to the technical field of cloud computing, and particularly relates to a dynamic resource quota and scheduling component based on Kubernetes.
Background
Kubernetes is an open-source container cluster management system that provides orchestration, automatic deployment, service discovery and resource scheduling for large-scale container groups, offering users a complete container application solution. Its application scenarios are very wide, and it has become an industry standard. However, Kubernetes currently has the following shortcomings in resource allocation and scheduling:
1) Kubernetes gives users the right to allocate resources to a Pod, and users request resources according to their needs. In practice, statistics show that in about 70% of cases the resources requested by the user exceed what is required. Over-allocation of application resources reduces throughput and cluster resource utilization, while under-allocation, although it allows more services to be deployed on each node, causes applications to compete for resources when services are busy, increasing task latency.
2) When measuring node resource capacity, the Kubernetes default scheduler uses the sum of the resources requested by the applications already deployed on a node. This is a static value that cannot accurately represent the node's actual load, and the scheduler only considers CPU and memory during scheduling. As a result, when scheduling file storage, image registries and other data-intensive applications, several applications of this type may be scheduled onto a few of the same nodes, causing node resource bottlenecks.
Disclosure of Invention
The invention aims to provide a dynamic resource quota and scheduling component based on Kubernetes, so as to solve the problems in the background technology.
In order to solve the technical problems, the invention provides the following technical scheme: a dynamic resource quota and scheduling component based on Kubernetes, which predicts a Pod's resource usage over a future period through a time-series model and adjusts it dynamically, and which uses each Node's actual load information to select the most suitable Node for Pod deployment from five aspects: CPU, memory, network bandwidth, disk I/O, and the deployment of high-priority applications on the Node. The component further comprises:
the resource monitoring module, used for monitoring, alarming and data collection: it monitors the actual usage of each kind of resource on every node in the Kubernetes cluster, sends alarm information to the administrator when a node's load is too high, and provides users with a resource-load query function;
the dynamic resource quota module, used for predicting Pod resource usage: it compares the predicted value with the Pod's current resource quota and, if the predicted value is above or below the threshold interval, adjusts the amount of resources the Pod applies for, providing users with automatic quota adjustment for application services;
and the dynamic scheduling module, used for scheduling Pods not yet bound to a node: it measures the fit between the Pod and each Node from multiple dimensions using the Node information collected by the monitoring module, and finally selects the most suitable Node and deploys the Pod on it.
Preferably, the resource monitoring module includes:
the index monitoring unit, used for monitoring the real-time usage of CPU, memory, network bandwidth and disk I/O on each node in the Kubernetes cluster, as well as the actual resource usage and working state of each Pod on the node;
the alarm management unit, used for alarming on Nodes and Pods with abnormal monitoring data; the alarm rules established by the user include whether the resource usage of a Node or Pod exceeds a threshold and whether the Pod's interface can normally provide service externally;
and the data management unit, used for collecting the historical data of each monitored resource, processing abnormal data, packaging the data and providing a query interface for users.
Preferably, the dynamic resource quota module includes:
the resource prediction unit, used for predicting the Pod's resource usage over the next period of time with a time-series prediction model, based on the Pod's historical resource usage obtained from the monitoring module;
the dynamic adjustment unit, used for comparing the Pod resource usage predicted by the resource prediction unit with the Pod's current resource usage, and deciding from the error between the two whether to apply a dynamic quota to the Pod;
preferably, the resource prediction unit further includes:
acquiring resource usage including CPU, memory, network bandwidth and disk I/O;
linear prediction, namely analyzing the obtained usage of each Pod resource with an ARIMA prediction model;
nonlinear prediction, namely predicting the residual with an LSTM (long short-term memory) model, based on the difference between the true value and the linear predicted value;
model checking, namely evaluating the prediction model by root mean square error, mean absolute error and mean absolute percentage error;
and result prediction, obtaining the final predicted value as the sum of the linear and nonlinear predictions.
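The three model-checking metrics named above can be computed directly. A minimal sketch in Python, where the usage series are illustrative values, not data from the patent:

```python
import math

def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual) * 100

cpu_actual = [50.0, 55.0, 60.0, 58.0]  # observed CPU usage (illustrative)
cpu_pred   = [52.0, 54.0, 59.0, 60.0]  # hybrid-model predictions (illustrative)
print(round(rmse(cpu_actual, cpu_pred), 3))
print(round(mae(cpu_actual, cpu_pred), 3))
print(round(mape(cpu_actual, cpu_pred), 3))
```

MAPE assumes the true values are never zero, which holds for resource-usage series such as CPU or memory consumption of a running Pod.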
Preferably, the dynamic scheduling module includes:
the pre-selection scheduling unit, used for pre-checking all nodes (spare network bandwidth, port occupancy and sufficient disk space) and eliminating nodes that do not meet the requirements;
the optimal scheduling unit, used for quantitatively selecting, among the qualifying nodes, the node most suitable for the Pod, describing the load balance of each resource by its variance;
and the binding unit is used for deploying the Pod to the Node with the highest score.
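As a rough illustration of the pre-selection step, the sketch below filters nodes on spare bandwidth, port occupancy and free disk space. The field names (`free_bandwidth_mbps`, `used_ports`, `free_disk_gb`) are hypothetical, not taken from the patent:

```python
def preselect(nodes, pod_request):
    """Filter out nodes that fail the pre-checks: enough spare network
    bandwidth, no host-port conflict, sufficient free disk space."""
    survivors = []
    for node in nodes:
        if node["free_bandwidth_mbps"] < pod_request["bandwidth_mbps"]:
            continue  # not enough spare network bandwidth
        if pod_request["port"] in node["used_ports"]:
            continue  # host port already occupied
        if node["free_disk_gb"] < pod_request["disk_gb"]:
            continue  # insufficient disk space
        survivors.append(node["name"])
    return survivors

nodes = [
    {"name": "node-1", "free_bandwidth_mbps": 200, "used_ports": {80}, "free_disk_gb": 50},
    {"name": "node-2", "free_bandwidth_mbps": 10, "used_ports": set(), "free_disk_gb": 50},
    {"name": "node-3", "free_bandwidth_mbps": 200, "used_ports": {8080}, "free_disk_gb": 50},
]
pod = {"bandwidth_mbps": 50, "port": 8080, "disk_gb": 5}
print(preselect(nodes, pod))  # node-2 lacks bandwidth, node-3 has a port conflict
```

Only the survivors of this filter go on to the scoring stage, which keeps the scoring computation small.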
Compared with the prior art, the invention has the following beneficial effects: for the situation where the quota value cannot be determined before a Pod is deployed, a model is constructed to predict the Pod's resource usage over a future period and adjust it dynamically, ensuring service quality and improving user experience; for disk-I/O-intensive and data-intensive applications, which would otherwise be dispatched to the same node, the node's real-time load is obtained through monitoring and nodes are evaluated from more dimensions, so that each node's resource load is more balanced and the resource utilization and load balance of the enterprise cluster are improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate preferred embodiments of the invention and together with the description serve to explain the principles of the invention in which:
FIG. 1 is a general architecture diagram of a dynamic resource quota and scheduling component based on Kubernetes in accordance with the present invention;
FIG. 2 is a flow chart of a dynamic resource quota and scheduling component based on Kubernetes in accordance with the present invention;
FIG. 3 is a flow diagram of a resource monitoring module according to an embodiment of the present invention;
FIG. 4 is a flow diagram of a dynamic resource quota module in accordance with an embodiment of the present invention;
FIG. 5 is a flow chart of a dynamic scheduling module according to an embodiment of the present invention;
FIG. 6 is a flow chart of the ARIMA and LSTM combined prediction model.
Detailed Description
The technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings in the embodiments of the present invention, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the description in the present specification. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The term "Kubernetes" is used herein. Kubernetes is an open-source, cross-platform container cluster management system for automatically deploying, scaling and managing containerized applications. In its architectural design, Kubernetes defines a series of extension points so that users can extend it according to their own needs, allowing it to serve many different workloads.
the term "Prometheus" is used herein, which is a set of open source system monitoring alarm framework that can monitor resources in kubernets cluster and display data in a time-series database.
The term "Pod" is used herein. A Pod is the smallest deployable computing unit that can be created and managed in Kubernetes; it is a combination of one or more containers.
The term "Node" is used herein. A Node is a working node of a Kubernetes cluster, which may be a physical machine or a virtual machine.
The term "ARIMA" is used herein. ARIMA is a time-series modeling method that extends the autoregressive moving average (ARMA) model. The ARMA model requires the time series to be stationary, but most real-world data is not, so the ARMA model cannot be used directly. However, the original series can be differenced, and if the differenced series passes the stationarity check, the ARMA model can then be applied.
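The differencing operation that ARIMA adds to the ARMA model can be sketched in a few lines of Python; the trend series below is illustrative:

```python
def difference(series, order=1):
    """Apply order-d differencing: replace the series with consecutive
    differences, repeated `order` times."""
    for _ in range(order):
        series = [b - a for a, b in zip(series, series[1:])]
    return series

# A series with a linear trend is not stationary; one round of
# differencing removes the trend and leaves a constant series,
# so the difference count d would be recorded as 1.
trend = [3, 5, 7, 9, 11, 13]
print(difference(trend, order=1))
```

In practice the stationarity check after each round is a statistical test (e.g. a unit-root test) rather than a visual inspection, but the bookkeeping of d is the same.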
The term "LSTM" is used herein. The long short-term memory (LSTM) network is an improved model proposed in 1997 to address the vanishing- and exploding-gradient problems of recurrent neural networks.
Example 1: a dynamic resource quota and scheduling component based on Kubernetes. The overall architecture of the component is shown in figure 1. The component comprises four layers: an infrastructure layer, a data storage layer, a service logic layer and an interaction layer; the main part of the invention is located in the service logic layer. Its functions include predicting a Pod's resource usage over a period of time through a time-series model and adjusting it dynamically, and using each Node's actual load information to select the most suitable Node for Pod deployment from five aspects: CPU, memory, network bandwidth, disk I/O, and the deployment of high-priority applications on the Node. It further comprises:
the resource monitoring module, used for monitoring, alarming and data collection: it monitors the actual usage of each kind of resource on every node in the Kubernetes cluster, sends alarm information to the administrator when a node's load is too high, and provides users with a resource-load query function;
the dynamic resource quota module, used for predicting Pod resource usage: it compares the predicted value with the Pod's current resource quota and, if the predicted value is above or below the threshold interval, adjusts the amount of resources the Pod applies for, providing users with automatic quota adjustment for application services;
and the dynamic scheduling module, used for scheduling Pods not yet bound to a Node: it measures the fit between the Pod and each Node from multiple dimensions using the Node information collected by the monitoring module, and finally selects the most suitable Node and deploys the Pod on it.
The resource monitoring module includes:
the index monitoring unit, used for monitoring the real-time usage of CPU, memory, network bandwidth and disk I/O on each node in the Kubernetes cluster, as well as the actual resource usage and working state of each Pod on the node;
the alarm management unit, used for alarming on Nodes and Pods with abnormal monitoring data; the alarm rules established by the user include whether the resource usage of a Node or Pod exceeds a threshold and whether the Pod's interface can normally provide service externally;
and the data management unit, used for collecting the historical data of each monitored resource, processing abnormal data, packaging the data and providing a query interface for users.
The dynamic resource quota module includes:
the resource prediction unit, used for predicting the Pod's resource usage over the next period of time with a time-series prediction model, based on the Pod's historical resource usage obtained from the monitoring module;
the dynamic adjustment unit, used for comparing the Pod resource usage predicted by the resource prediction unit with the Pod's current resource usage, and deciding from the error between the two whether to apply a dynamic quota to the Pod;
the resource prediction unit further includes:
acquiring resource usage including CPU, memory, network bandwidth and disk I/O;
linear prediction, namely analyzing the acquired usage of each Pod resource with an ARIMA prediction model;
nonlinear prediction, namely predicting the residual with an LSTM (long short-term memory) model, based on the difference between the true value and the linear predicted value;
model checking, namely evaluating the prediction model by root mean square error, mean absolute error and mean absolute percentage error;
and result prediction, obtaining the final predicted value as the sum of the linear and nonlinear predictions.
The dynamic scheduling module includes:
the pre-selection scheduling unit, used for pre-checking all nodes (spare network bandwidth, port occupancy and sufficient disk space) and eliminating nodes that do not meet the requirements;
the optimal scheduling unit, used for quantitatively selecting, among the qualifying nodes, the node most suitable for the Pod, describing the load balance of each resource by its variance;
and the binding unit is used for deploying the Pod to the Node with the highest score.
Example 2: fig. 1 is the overall architecture diagram of the dynamic resource quota and scheduling component based on Kubernetes according to the present invention, and fig. 2 is a flowchart of the component. The component takes the user as the main actor, applies a dynamic quota to specific Pods according to user requirements, and provides the user with a disk-I/O- and data-intensive-aware scheduler. It comprises the following steps:
and S100, monitoring the resource usage amount of each node and Pod on the node in the cluster, giving an alarm according to a predefined rule, and collecting data required to be used.
As shown in fig. 3, step S100 specifically includes the steps of:
step S101, setting up the monitoring module and configuring the nodes and application metrics to be monitored, such as CPU, memory, network bandwidth and disk I/O;
step S102, configuring alarm names, alarm rules and alarm delivery channels, for example for high load on resources such as CPU and memory;
step S103, evaluating the alarm rules against the monitoring data and sending alarm information;
step S104, collecting the monitoring data and providing a query interface externally;
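Steps S102 and S103 amount to evaluating user-defined threshold rules against the latest monitoring samples. A minimal sketch, in which the metric names and rule format are hypothetical stand-ins for whatever the monitoring stack (e.g. Prometheus) provides:

```python
def evaluate_alerts(samples, rules):
    """Evaluate simple threshold alert rules against the latest samples
    and return the alert messages that should be sent."""
    alerts = []
    for rule in rules:
        value = samples.get(rule["metric"])
        if value is not None and value > rule["threshold"]:
            alerts.append(f'{rule["name"]}: {rule["metric"]}={value} exceeds {rule["threshold"]}')
    return alerts

samples = {"cpu_percent": 95.0, "memory_percent": 60.0}
rules = [
    {"name": "HighCPU", "metric": "cpu_percent", "threshold": 90.0},
    {"name": "HighMemory", "metric": "memory_percent", "threshold": 90.0},
]
print(evaluate_alerts(samples, rules))
```

A real deployment would express these rules in the monitoring framework's own rule language and route the messages through its alert manager; the sketch only shows the threshold comparison itself.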
step S200, inquiring the Pod needing dynamic quota in the cluster, obtaining the historical resource usage amount of the service from the monitoring module, predicting and dynamically adjusting through a prediction model, and specifically comprising the following steps:
as shown in fig. 4, step S200 specifically includes the steps of:
step S201, obtaining the historical usage of CPU, memory, network bandwidth and disk I/O for the Pods in the cluster;
step S210, predicting, with the combined prediction model, the time series formed by the Pod's historical resource usage;
as shown in fig. 6, step S210 further specifically includes the steps of:
step S211, firstly, carrying out stationarity check on the time sequence of resource usage, if not, carrying out difference operation until the stationarity check is passed, recording the difference times as d, determining P and q of ARIMA (p, d, q) types through an autocorrelation function and a partial autocorrelation function, carrying out parameter estimation and model diagnosis on the model to obtain a final model, finally, predicting historical resource usage by using a prediction model to obtain a predicted value, and recording the predicted value as a predicted value
Figure BDA0003774475010000061
Step S212, calculating the real value y and the predicted value of the previous step
Figure BDA0003774475010000062
Residual error between, noted as e t
Step S213, establishing LSTM model pair residual e t Predicting, wherein the model is divided into four layers, namely an input layer, an LSTM layer 1, an LSTM layer 2 and an output layer, the neuron of the output layer is 1, the input layer is selected to be more than 5, and the number of the intermediate layers is determined according to an empirical formula
Figure BDA0003774475010000063
To select, obtain the final model to predict the residual error, and record as
Figure BDA0003774475010000064
Step S214, the ARIMA model prediction value is obtained
Figure BDA0003774475010000065
And LSTM prediction value
Figure BDA0003774475010000066
Adding the obtained data to obtain a final predicted value
Figure BDA0003774475010000067
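Steps S211 to S214 combine the two models by simple addition. The sketch below shows the residual computation and the final sum, with illustrative numbers standing in for the ARIMA and LSTM outputs (fitting the actual models is outside the scope of this sketch):

```python
def hybrid_forecast(linear_pred, residual_pred):
    """Combine the ARIMA linear forecast with the LSTM residual forecast:
    y_hat = L_hat + e_hat (steps S211-S214)."""
    return [lin + res for lin, res in zip(linear_pred, residual_pred)]

actual = [10.0, 12.0, 11.0]        # observed usage (illustrative)
linear_pred = [9.5, 12.5, 10.2]    # what an ARIMA fit might output
residuals = [a - p for a, p in zip(actual, linear_pred)]  # e_t = y_t - L_hat_t
residual_pred = [0.4, -0.4, 0.7]   # what the LSTM might predict for e_t
print(residuals)
print(hybrid_forecast(linear_pred, residual_pred))
```

The ARIMA model captures the linear component, the LSTM is trained on the residual series e_t, and the final forecast is the element-wise sum of the two.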
Step S220, comparing the predicted value ŷ_t with the Pod's current resource application amount, with 0.2 set as the threshold; if the applied value is greater than ŷ_t multiplied by 1.2, or less than ŷ_t multiplied by 0.8, adjusting the quota; if it falls within this interval, no adjustment is needed;
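The threshold test of step S220 can be transcribed directly; the quota values below are illustrative:

```python
def needs_adjustment(applied, predicted, threshold=0.2):
    """Step S220's rule: adjust when the applied quota deviates from the
    prediction by more than the threshold, i.e. when
    applied > predicted * 1.2 or applied < predicted * 0.8."""
    return applied > predicted * (1 + threshold) or applied < predicted * (1 - threshold)

print(needs_adjustment(applied=1000, predicted=500))  # over-allocated
print(needs_adjustment(applied=450, predicted=500))   # within the band
print(needs_adjustment(applied=300, predicted=500))   # under-allocated
```

The 20% band gives the quota some hysteresis: small prediction fluctuations do not trigger an update, so Pods are not resized on every prediction cycle.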
step S230, updating the quota value of each resource for Pod, and waiting for entering the next dynamic quota cycle;
step S300, selecting an optimal node for deployment through a pre-selection strategy and an optimal strategy according to the resource amount applied by the Pod and the actual load of each node;
as shown in fig. 5, step S300 specifically includes the steps of:
step S301, acquiring the actual usage of CPU, memory, network bandwidth, disk I/O and so on for each node, and acquiring the resource application amount of the Pod to be scheduled;
step S302, in the pre-selection stage, filtering out nodes with insufficient resources or conflicts, such as insufficient disk space, port conflicts or insufficient network bandwidth, so that they do not take part in the next stage, reducing the amount of computation;
step S303, in the optimization stage, calculating, for each candidate node, the node's resource load balance degree as if the Pod to be scheduled were placed on it, where the balance degree is described by the variance of the node's per-resource utilization, and scoring the node accordingly;
and step S304, selecting the node with the highest score, deploying the Pod to the node, and updating the node information.
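One plausible quantification of step S303's balance degree and score (not necessarily the patent's exact formulas, which are not reproduced here) uses the variance of per-resource utilization mentioned in the optimal scheduling unit:

```python
def balance_degree(utilizations):
    """Variance of a node's per-resource utilizations (CPU, memory,
    bandwidth, disk I/O) after hypothetically placing the Pod; a lower
    variance means the resources are used more evenly."""
    mean = sum(utilizations) / len(utilizations)
    return sum((u - mean) ** 2 for u in utilizations) / len(utilizations)

def score(utilizations):
    """A lower balance degree (variance) yields a higher score."""
    return 1.0 / (1.0 + balance_degree(utilizations))

even = [0.50, 0.52, 0.48, 0.50]     # hypothetical post-placement utilizations
skewed = [0.90, 0.10, 0.50, 0.50]
print(score(even) > score(skewed))  # the evenly loaded node scores higher
```

Under this scoring, a node whose CPU is saturated while its disk sits idle is penalized relative to a node with moderate, even usage, which is exactly the behavior the optimization stage is after.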
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it should be understood by those skilled in the art that various changes, modifications, substitutions and equivalents can be made without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (9)

1. A dynamic resource quota and scheduling component based on Kubernetes, which dynamically adjusts the amount of resources applied for by applications using monitoring data from all nodes of the cluster together with a time-series prediction model, and which measures the fit between nodes and applications from multiple dimensions using the nodes' actual resource load when scheduling, characterized by comprising:
the resource monitoring module is used for monitoring the use condition of each resource of each node in the cluster, monitoring the use condition and the working state of the resource applied on the node, giving an alarm according to load and providing data for other modules;
the dynamic resource quota module is used for dynamically adjusting various resource application amounts of application services on the nodes, acquiring the actual use condition of each resource of the application, predicting by using a combined prediction model and determining whether to adjust;
and the dynamic scheduling module, used for scheduling application services not yet bound to a node onto the most suitable node, balancing the resource usage of each node by measuring the fit from multiple dimensions.
2. The dynamic resource quota and scheduling component based on Kubernetes of claim 1, wherein the resource monitoring module comprises:
the index monitoring unit is used for monitoring the real-time use conditions of the CPU, the memory, the network bandwidth and the disk I/O of each node and application in the cluster;
the alarm management unit is used for alarming Node nodes and Pod with abnormal monitoring data;
and the data management unit, used for collecting the monitored resource usage data, processing abnormal data, packaging the data and providing a query interface for users.
3. The dynamic resource quota and schedule component based on Kubernetes of claim 1, wherein the dynamic resource quota module comprises:
the resource prediction unit is used for predicting the resource usage of the Pod in the next period of time by using a time series prediction model according to the application historical resource usage obtained by the monitoring module;
and the dynamic adjusting unit is used for comparing and determining whether to adjust according to the predicted value and the current value.
4. The Kubernetes-based dynamic resource quota and scheduling component of claim 3, wherein the resource prediction unit comprises:
acquiring resource usage including CPU, memory, network bandwidth and disk I/O;
linear prediction, namely predicting the linear part of future resource usage with an ARIMA model;
Non-linear prediction, namely predicting residual errors by adopting an LSTM model according to the difference between a true value and a linear predicted value;
model checking for checking model accuracy;
and predicting the result, and obtaining a final predicted value through the sum of linearity and nonlinearity.
5. The dynamic resource quota and schedule component based on Kubernetes of claim 1, wherein the dynamic scheduling module comprises:
the pre-selection scheduling unit, used for pre-checking all nodes (spare network bandwidth, port occupancy and sufficient disk space) and eliminating nodes that do not meet the requirements;
the optimal scheduling unit is used for selecting the optimal node from the nodes meeting the conditions for deployment;
and the binding unit is used for binding the Pod with the optimal node and updating the binding information.
6. The dynamic resource quota and scheduling component based on Kubernetes of claim 1, wherein the component takes the user as the main actor, applies a dynamic quota to specific Pods according to user demand, and provides the user with a disk-I/O- and data-intensive-aware scheduler, comprising the steps of:
step S100, monitoring the resource usage amount of each node and Pod on the node in the cluster, giving an alarm according to a predefined rule, collecting data required to be used,
step S200, inquiring the Pod needing dynamic quota in the cluster, obtaining the historical resource usage amount of the service from the monitoring module, predicting and dynamically adjusting through a prediction model,
and step S300, selecting the optimal node for deployment through a pre-selection strategy and an optimal strategy according to the resource amount applied by the Pod and the actual load of each node.
7. The dynamic resource quota and scheduling component based on Kubernetes as claimed in claim 1, wherein step S100 specifically comprises the steps of:
step S101, setting up the monitoring module and configuring the nodes and application metrics to be monitored, such as CPU, memory, network bandwidth and disk I/O;
step S102, configuring alarm names, alarm rules and alarm delivery channels, for example for high load on resources such as CPU and memory;
step S103, evaluating the alarm rules against the monitoring data and sending alarm information;
and step S104, collecting the monitoring data and providing a query interface externally.
8. The dynamic resource quota and scheduling component based on Kubernetes according to claim 1, wherein step S200 specifically comprises the steps of:
step S201, obtaining the historical usage of CPU, memory, network bandwidth and disk I/O for each Pod in the cluster;
step S210, predicting the time series formed by the Pod's historical resource usage with a combined prediction model;
step S220, comparing the predicted result ŷ with the Pod's current resource application amount, taking 0.2 as the threshold: if the applied value is greater than ŷ × 1.2 or less than ŷ × 0.8, the quota is adjusted; if it lies within this interval, the quota does not need to be adjusted;
step S230, updating the quota value of each resource for the Pod and waiting for the next dynamic quota cycle;
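The 0.8×/1.2× band of step S220 can be sketched as follows. Resetting an out-of-band quota to the predicted value is an assumption on my part; the claim only states that the quota "is adjusted" without naming the target value:

```python
def adjust_quota(applied, predicted, threshold=0.2):
    """Step S220: if the applied amount falls outside the band
    [predicted * (1 - threshold), predicted * (1 + threshold)],
    reset the quota to the predicted value (assumed target);
    otherwise keep the applied amount unchanged (step S230)."""
    lower = predicted * (1 - threshold)
    upper = predicted * (1 + threshold)
    if applied > upper or applied < lower:
        return predicted   # out of band: adjust
    return applied         # within band: no adjustment needed

print(adjust_quota(applied=4.0, predicted=2.0))  # over-applied -> 2.0
print(adjust_quota(applied=2.1, predicted=2.0))  # within band  -> 2.1
```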
step S210 further specifically includes the steps of:
step S211, first performing a stationarity test on the resource-usage time series; if the series is not stationary, applying difference operations until the stationarity test passes, and recording the number of differences as d; determining p and q of the ARIMA(p, d, q) model through the autocorrelation function and the partial autocorrelation function; performing parameter estimation and model diagnosis to obtain the final model; and finally using the prediction model to forecast the historical resource usage, recording the predicted value as L̂_t;
step S212, calculating the residual between the real value y_t and the predicted value L̂_t of the previous step, denoted e_t = y_t − L̂_t;
step S213, establishing an LSTM model to predict the residual e_t, the model having four layers, namely an input layer, LSTM layer 1, LSTM layer 2 and an output layer; the output layer has 1 neuron, the input layer has more than 5 neurons, and the number of neurons in the intermediate layers is selected according to an empirical formula (given as an image in the original publication); the final model's residual prediction is recorded as ê_t;
step S214, adding the ARIMA prediction L̂_t and the LSTM residual prediction ê_t to obtain the final predicted value ŷ_t = L̂_t + ê_t;
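Steps S211-S214 describe a classic residual-correction hybrid: a linear model forecasts the series, a second model forecasts the linear model's residuals, and the two forecasts are summed. The sketch below shows only that structure; the trivial stand-in predictors replace the ARIMA model and the two-layer LSTM, which would come from statsmodels and a deep-learning framework in a real implementation:

```python
import numpy as np

def hybrid_predict(series, linear_fit, residual_fit):
    """Structure of steps S211-S214: sum a linear forecast and a
    forecast of its residuals. linear_fit and residual_fit stand in
    for ARIMA and the LSTM, which are assumed, not shown."""
    linear_pred = linear_fit(series)          # S211: ARIMA-style forecast L_t
    residuals = series - linear_pred          # S212: e_t = y_t - L_t
    residual_pred = residual_fit(residuals)   # S213: LSTM-style forecast of e_t
    return linear_pred + residual_pred        # S214: y_t = L_t + e_t

# Trivial stand-ins: one-step-lag forecast and mean-residual correction.
series = np.array([1.0, 2.0, 2.0, 3.0])
linear = lambda s: np.concatenate(([s[0]], s[:-1]))
resid = lambda e: np.full_like(e, e.mean())
print(hybrid_predict(series, linear, resid))  # [1.5 1.5 2.5 2.5]
```

The design rationale is that ARIMA captures the linear trend of resource usage while the LSTM captures the nonlinear pattern left in the residuals, so the sum can outperform either model alone.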
9. The dynamic resource quota and scheduling component based on Kubernetes as claimed in claim 1, wherein step S300 specifically includes the steps of:
step S301, acquiring the actual usage of CPU, memory, network bandwidth, disk I/O and other resources on each node, and acquiring the resource application amount of the Pod to be scheduled;
step S302, in the pre-selection stage, filtering out nodes with insufficient resources or conflicts, such as insufficient disk space, port conflicts and insufficient network bandwidth, so that they do not participate in the next stage, reducing the amount of computation;
step S303, in the optimization stage, calculating for each remaining node the resource load balance degree that would result from scheduling the Pod onto that node, and scoring each node accordingly (the balance-degree and score formulas are given as images in the original publication);
step S304, selecting the node with the highest score, deploying the Pod onto that node, and updating the node information.
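A minimal sketch of the pre-selection and scoring pipeline of steps S301-S304. Since the patent's balance-degree and score formulas are published only as images, the scoring rule here is an assumption: utilization spread after a hypothetical placement, measured as the population standard deviation across resource types, with lower spread scored higher:

```python
import statistics

def select_node(pod_request, nodes):
    """Filter nodes that cannot fit the Pod (S302), score the rest by
    how evenly resource utilization would be spread after placement
    (S303, assumed rule), and return the best node (S304)."""
    best_node, best_score = None, None
    for name, node in nodes.items():
        # Pre-selection: every requested resource needs enough headroom.
        if any(node["used"][r] + pod_request[r] > node["capacity"][r]
               for r in pod_request):
            continue
        # Optimization: utilization per resource type after placement.
        utils = [(node["used"][r] + pod_request[r]) / node["capacity"][r]
                 for r in pod_request]
        score = -statistics.pstdev(utils)  # higher score = more balanced
        if best_score is None or score > best_score:
            best_node, best_score = name, score
    return best_node

pod = {"cpu": 1.0, "mem": 2.0}
nodes = {
    "n1": {"capacity": {"cpu": 4.0, "mem": 8.0},
           "used":     {"cpu": 3.5, "mem": 1.0}},  # CPU would overflow
    "n2": {"capacity": {"cpu": 4.0, "mem": 8.0},
           "used":     {"cpu": 1.0, "mem": 2.0}},  # utils 0.5 / 0.5
    "n3": {"capacity": {"cpu": 4.0, "mem": 8.0},
           "used":     {"cpu": 3.0, "mem": 0.0}},  # utils 1.0 / 0.25
}
print(select_node(pod, nodes))  # -> n2 (n1 filtered, n2 best balanced)
```

This mirrors the predicates/priorities split of the upstream Kubernetes scheduler: cheap boolean filters first, then a scoring pass only over survivors.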
CN202210912913.5A 2022-07-31 2022-07-31 Dynamic resource quota and scheduling component based on Kubernetes Pending CN115297112A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210912913.5A CN115297112A (en) 2022-07-31 2022-07-31 Dynamic resource quota and scheduling component based on Kubernetes

Publications (1)

Publication Number Publication Date
CN115297112A true CN115297112A (en) 2022-11-04

Family

ID=83825824

Country Status (1)

Country Link
CN (1) CN115297112A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109960585A (en) * 2019-02-02 2019-07-02 浙江工业大学 A kind of resource regulating method based on kubernetes
CN110990159A (en) * 2019-12-25 2020-04-10 浙江大学 Historical data analysis-based container cloud platform resource quota prediction method
CN111984381A (en) * 2020-07-10 2020-11-24 西安理工大学 Kubernetes resource scheduling optimization method based on historical data prediction
CN113010270A (en) * 2021-04-08 2021-06-22 桂林电子科技大学 Kubernetes platform-based dynamic resource load balancing scheduling method and system
CN113391913A (en) * 2021-07-12 2021-09-14 中国科学技术大学 Distributed scheduling method and device based on prediction
CN113986479A (en) * 2021-09-29 2022-01-28 武汉纺织大学 Resource scheduling strategy optimization method based on Kubernetes cluster

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
YANG Pengfei, "Research and Implementation of Dynamic Resource Scheduling Based on Kubernetes", China Master's Theses Full-text Database, Information Science and Technology, no. 12, 15 December 2017 (2017-12-15), pages 19-30 *
MA Hang, "Research and Implementation of Dynamic Resource Scheduling for a Container Cloud Platform Based on Kubernetes", China Master's Theses Full-text Database, Information Science and Technology, no. 03, 15 March 2020 (2020-03-15), pages 28-34 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115665157A (en) * 2022-11-14 2023-01-31 杭州谐云科技有限公司 Balanced scheduling method and system based on application resource types
CN115550371A (en) * 2022-12-05 2022-12-30 安超云软件有限公司 Pod scheduling method and system based on Kubernetes and cloud platform
CN116112495A (en) * 2023-04-12 2023-05-12 中国人民解放军国防科技大学 Data-driven decentralized autonomous infrastructure service platform
CN116112495B (en) * 2023-04-12 2023-06-23 中国人民解放军国防科技大学 Data-driven decentralized autonomous infrastructure service platform
CN116244085A (en) * 2023-05-05 2023-06-09 江苏博云科技股份有限公司 Kubernetes cluster container group scheduling method, device and medium

Similar Documents

Publication Publication Date Title
CN115297112A (en) Dynamic resource quota and scheduling component based on Kubernetes
Hulshof et al. Tactical resource allocation and elective patient admission planning in care processes
US7054934B2 (en) Tailorable optimization using model descriptions of services and servers in a computing environment
US7072960B2 (en) Generating automated mappings of service demands to server capacities in a distributed computer system
US11579933B2 (en) Method for establishing system resource prediction and resource management model through multi-layer correlations
CN111045820B (en) Container scheduling method based on time sequence prediction
CN109753356A (en) A kind of container resource regulating method, device and computer readable storage medium
CN113110914A (en) Internet of things platform construction method based on micro-service architecture
CN108366082A (en) Expansion method and flash chamber
CN113806018A (en) Kubernetes cluster resource hybrid scheduling method based on neural network and distributed cache
CN114356587B (en) Calculation power task cross-region scheduling method, system and equipment
CN113037877A (en) Optimization method for time-space data and resource scheduling under cloud edge architecture
CN116467082A (en) Big data-based resource allocation method and system
CN114205317B (en) SDN and NFV-based service function chain SFC resource allocation method and electronic equipment
Xiao et al. Dscaler: A horizontal autoscaler of microservice based on deep reinforcement learning
CN117056018A (en) Resource scheduling method, apparatus, device, program product and storage medium
CN114978913B (en) Cross-domain deployment method and system for service function chains based on cut chains
CN112328395A (en) Cloud resource capacity planning method and system
CN115562841B (en) Cloud video service self-adaptive resource scheduling system and method
CN111796933A (en) Resource scheduling method, device, storage medium and electronic equipment
CN115525394A (en) Method and device for adjusting number of containers
CN115658319A (en) Resource scheduling method, system, device and storage medium
CN115658287A (en) Method, apparatus, medium, and program product for scheduling execution units
Toka et al. Resource provisioning for highly reliable and ultra-responsive edge applications
Li et al. An adaptive read/write optimized algorithm for Ceph heterogeneous systems via performance prediction and multi-attribute decision making

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination