CN112559186B

CN112559186B - Kubernetes container resource expansion and contraction method

Info

Publication number: CN112559186B
Application number: CN202011524569.XA
Authority: CN
Inventors: 王磊; 黄启功
Original assignee: Beijing Yunsi Imagination Technology Co ltd
Current assignee: Beijing Yunsi Imagination Technology Co ltd
Priority date: 2020-12-22
Filing date: 2020-12-22
Publication date: 2021-09-24
Anticipated expiration: 2040-12-22
Also published as: CN112559186A

Abstract

A Kubernetes container resource expansion and contraction method comprises the following specific steps: S1, acquiring Metrics Server monitoring information, and judging whether the resource usage reaches the capacity expansion standard or the capacity reduction standard; if the capacity expansion standard is reached, executing S2; S2, judging whether the number of instances has reached the upper limit; if yes, executing S3; S3, judging whether the resource usage has reached the upper limit; if not, executing S4; S4, judging whether the increased resources can be scheduled successfully; if yes, executing S5; S5, creating a new deployment set with increased resources to expand capacity; if the resource usage reaches the capacity reduction standard, executing S10; S10, judging whether the number of deployment sets has reached the minimum value; if not, executing S11; and S11, deleting the newly created deployment set to reduce capacity. The invention performs expansion and contraction at the level of the deployment set, so that horizontal and vertical expansion coexist; this provides a more comprehensive elastic expansion mechanism and improves the efficiency of expansion and contraction.

Description

Kubernetes container resource expansion and contraction method

Technical Field

The invention relates to a Kubernetes container resource arrangement platform, in particular to a Kubernetes container resource expansion and contraction method.

Background

One core function of Kubernetes is elastic expansion and contraction of container resources, which handles system load that changes over time. When the load is high, container resources are automatically expanded to preserve response speed and keep heavy traffic from bringing the system down; when the load is low, container resources are automatically reduced to save resources. The basic unit of Kubernetes resource scheduling is the container group (Pod). A group of identical Pods is managed through a deployment set (Deployment): the replica count (Replicas) of the Deployment is the number of Pods it manages, and the CPU and memory resources of the containers in each Pod are configured in the Pod template of the Deployment. Current Kubernetes elastic expansion and contraction of container resources comprises horizontal expansion and contraction (Horizontal Pod Autoscaler, HPA) and vertical expansion and contraction (Vertical Pod Autoscaler, VPA).
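As a minimal sketch of the relationship described above, the following Python snippet models a Deployment manifest as a plain dictionary and computes the total resources it requests (replica count multiplied by the per-Pod requests in the Pod template). The Deployment name, image, and values are hypothetical, and the quantity parsers handle only the few suffixes used here rather than the full Kubernetes quantity syntax.

```python
# Hypothetical Deployment manifest expressed as a Python dict.
# The name "web", the image, and all numbers are illustrative assumptions.
deployment = {
    "apiVersion": "apps/v1",
    "kind": "Deployment",
    "metadata": {"name": "web"},
    "spec": {
        "replicas": 3,  # number of Pods managed by this Deployment
        "selector": {"matchLabels": {"app": "web"}},
        "template": {  # Pod template: per-container CPU and memory live here
            "metadata": {"labels": {"app": "web"}},
            "spec": {
                "containers": [
                    {
                        "name": "web",
                        "image": "example/web:1.0",
                        "resources": {
                            "requests": {"cpu": "500m", "memory": "256Mi"},
                            "limits": {"cpu": "1", "memory": "512Mi"},
                        },
                    }
                ]
            },
        },
    },
}


def cpu_millicores(quantity: str) -> int:
    """Convert '500m' or '1' to millicores (simplified parser)."""
    if quantity.endswith("m"):
        return int(quantity[:-1])
    return int(float(quantity) * 1000)


def memory_mib(quantity: str) -> int:
    """Convert '256Mi' or '1Gi' to MiB (simplified parser, two suffixes only)."""
    if quantity.endswith("Gi"):
        return int(quantity[:-2]) * 1024
    if quantity.endswith("Mi"):
        return int(quantity[:-2])
    raise ValueError(f"unsupported memory quantity: {quantity}")


def total_requests(dep: dict) -> tuple:
    """Return (total CPU in millicores, total memory in MiB) requested by all replicas."""
    replicas = dep["spec"]["replicas"]
    containers = dep["spec"]["template"]["spec"]["containers"]
    cpu = sum(cpu_millicores(c["resources"]["requests"]["cpu"]) for c in containers)
    mem = sum(memory_mib(c["resources"]["requests"]["memory"]) for c in containers)
    return replicas * cpu, replicas * mem


print(total_requests(deployment))
```

Horizontal expansion changes `replicas`, while vertical expansion changes the `resources` entries of the Pod template; both act on the same Deployment object, which is where the conflict discussed next arises.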

In the prior art, horizontal expansion and vertical expansion of the same Deployment may conflict and cannot be used simultaneously. If only horizontal expansion is used, the system cannot handle the case in which, after running for some time, the minimum resource demand exceeds the currently configured value. If only vertical expansion is used, the expansion is bounded by node resources and therefore has an upper limit, so scenarios with extremely high resource demand cannot be met; in addition, vertical expansion requires deleting and rebuilding the existing Pods, which may interrupt existing service connections.

Disclosure of Invention

(I) Objects of the invention

In order to solve the technical problems in the background technology, the invention provides a Kubernetes container resource expansion and contraction method.

(II) Technical scheme

The invention provides a Kubernetes container resource expansion and contraction method, which comprises the following specific steps:

S1, acquiring Metrics Server monitoring information, and judging whether the resource usage reaches the capacity expansion standard or the capacity reduction standard:

if the resource usage reaches the capacity expansion standard, executing S2;

if the resource usage reaches the capacity reduction standard, executing S10;

S2, judging whether the number of instances has reached the upper limit;

if the number of instances has reached the upper limit, executing S3;

if the number of instances has not reached the upper limit, executing S6;

S3, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, returning to S1;

if the resource usage has not reached the upper limit, executing S4;

S4, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S5;

if pre-scheduling of the increased resources fails, returning to S1;

S5, creating a new deployment set and increasing the resources;

S6, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, executing S9;

if the resource usage has not reached the upper limit, executing S7;

S7, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S8;

if pre-scheduling of the increased resources fails, executing S9;

S8, creating a new deployment set, and increasing both the number of instances and the resources;

S9, creating a new deployment set, and increasing only the number of instances;

S10, judging whether the number of deployment sets has reached the minimum value;

if the number of deployment sets has reached the minimum value, returning to S1;

if the number of deployment sets has not reached the minimum value, executing S11;

and S11, deleting the newly created deployment set.
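The branching in steps S1 to S11 can be sketched as a single decision function. This is an illustrative reading of the flow, not the patented implementation: the `State` fields and the `Action` names are hypothetical, and each boolean is assumed to have been computed beforehand (for example from Metrics Server data and a pre-scheduling check).

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    """Outcome of one pass through the flow (names are illustrative)."""
    NONE = "S1"  # nothing to do; go back to monitoring
    NEW_DEPLOYMENT_MORE_RESOURCES = "S5"
    NEW_DEPLOYMENT_MORE_INSTANCES_AND_RESOURCES = "S8"
    NEW_DEPLOYMENT_MORE_INSTANCES = "S9"
    DELETE_NEWEST_DEPLOYMENT = "S11"


@dataclass
class State:
    """Hypothetical snapshot of the inputs to the decision."""
    needs_scale_up: bool               # S1: expansion standard reached
    needs_scale_down: bool             # S1: reduction standard reached
    instances_at_max: bool             # S2
    resources_at_max: bool             # S3 / S6
    can_schedule_more_resources: bool  # S4 / S7: pre-scheduling result
    deployments_at_min: bool           # S10


def decide(s: State) -> Action:
    if s.needs_scale_up:
        if s.instances_at_max:
            # S3: instances cannot grow, so only resources may be increased.
            if s.resources_at_max:
                return Action.NONE
            # S4: increase resources only if they can actually be scheduled.
            if s.can_schedule_more_resources:
                return Action.NEW_DEPLOYMENT_MORE_RESOURCES
            return Action.NONE
        # S6: instances can still grow.
        if s.resources_at_max:
            return Action.NEW_DEPLOYMENT_MORE_INSTANCES
        # S7: prefer growing both; fall back to instances only.
        if s.can_schedule_more_resources:
            return Action.NEW_DEPLOYMENT_MORE_INSTANCES_AND_RESOURCES
        return Action.NEW_DEPLOYMENT_MORE_INSTANCES
    if s.needs_scale_down:
        # S10: never go below the minimum number of deployment sets.
        if s.deployments_at_min:
            return Action.NONE
        return Action.DELETE_NEWEST_DEPLOYMENT
    return Action.NONE


# Example: load is high, instances are capped, resources can still grow.
print(decide(State(True, False, True, False, True, False)))
```

Keeping the decision separate from the code that talks to the cluster makes each branch of the flowchart easy to check in isolation.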

Preferably, the method further involves a Kubernetes container; the Kubernetes container includes a horizontal container group expansion and contraction device and a vertical container group expansion and contraction device.

Preferably, the horizontal container group expansion and contraction device is connected with the deployment set; the deployment set includes a container group replica count; the container group replicas of the deployment set include container group A1, container group A2, …, and container group AN.

Preferably, the vertical container group expansion and contraction device is connected with the deployment set; the deployment set includes container resources; the container resources of the deployment set cover container group B1, container group B2, …, and container group BN.

Compared with the prior art, the technical scheme of the invention has the following beneficial technical effects:

In the invention, capacity expansion is carried out not at the Pod level but at the Deployment level. The newly created Deployment can use values of Replicas, container CPU, and memory that differ from those of the original Deployment, so the two together achieve the effect of both horizontal and vertical expansion. This provides a more comprehensive elastic expansion mechanism, improves the efficiency of expansion and contraction, and performs vertical expansion without deleting and rebuilding the existing Pods, which avoids interrupting long-lived service connections.
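A minimal sketch of this effect, under stated assumptions: a service is modeled as a list of deployment sets, expansion appends a new set with its own replica count and per-Pod resources, and contraction removes the most recently created set. The `DeploymentSet` type, the naming scheme, and the reading of "newly created" as "most recently created" are illustrative assumptions, not details given in the source.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class DeploymentSet:
    """Hypothetical, simplified view of one Deployment."""
    name: str
    replicas: int
    cpu_millicores: int  # per-Pod CPU
    memory_mib: int      # per-Pod memory


def scale_up(group: List[DeploymentSet], replicas: int,
             cpu_millicores: int, memory_mib: int) -> List[DeploymentSet]:
    """Add a new deployment set; existing sets (and their Pods) are left untouched."""
    new = DeploymentSet(f"{group[0].name}-scale-{len(group)}",
                        replicas, cpu_millicores, memory_mib)
    return group + [new]


def scale_down(group: List[DeploymentSet], minimum: int = 1) -> List[DeploymentSet]:
    """Remove the most recently created set unless the minimum count is reached."""
    if len(group) <= minimum:
        return group
    return group[:-1]


def total_capacity(group: List[DeploymentSet]) -> Tuple[int, int, int]:
    """Return (Pods, CPU millicores, memory MiB) summed over all sets."""
    return (
        sum(d.replicas for d in group),
        sum(d.replicas * d.cpu_millicores for d in group),
        sum(d.replicas * d.memory_mib for d in group),
    )


original = DeploymentSet("web", replicas=3, cpu_millicores=500, memory_mib=256)
# The new set has more resources per Pod (vertical effect) and adds Pods (horizontal effect).
group = scale_up([original], replicas=2, cpu_millicores=1000, memory_mib=512)
print(total_capacity(group))
```

Because `scale_up` only appends, the original set is never modified, which mirrors the claim that vertical expansion happens without deleting and rebuilding existing Pods.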

Drawings

Fig. 1 is a flowchart of the Kubernetes container resource expansion and contraction method provided by the present invention.

Fig. 2 shows the horizontal container group expansion and contraction device of the Kubernetes container resource expansion and contraction method provided by the invention.

Fig. 3 shows the vertical container group expansion and contraction device of the Kubernetes container resource expansion and contraction method provided by the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the accompanying drawings in conjunction with the following detailed description. It should be understood that the description is intended to be exemplary only, and is not intended to limit the scope of the present invention. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present invention.

As shown in Figs. 1-3, the Kubernetes container resource expansion and contraction method provided by the present invention includes the following specific steps:

S1, acquiring Metrics Server monitoring information, and judging whether the resource usage reaches the capacity expansion standard or the capacity reduction standard:

if the resource usage reaches the capacity expansion standard, executing S2;

if the resource usage reaches the capacity reduction standard, executing S10;

S2, judging whether the number of instances has reached the upper limit;

if the number of instances has reached the upper limit, executing S3;

if the number of instances has not reached the upper limit, executing S6;

S3, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, returning to S1;

if the resource usage has not reached the upper limit, executing S4;

S4, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S5;

if pre-scheduling of the increased resources fails, returning to S1;

S5, creating a new deployment set and increasing the resources;

S6, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, executing S9;

if the resource usage has not reached the upper limit, executing S7;

S7, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S8;

if pre-scheduling of the increased resources fails, executing S9;

S8, creating a new deployment set, and increasing both the number of instances and the resources;

S9, creating a new deployment set, and increasing only the number of instances;

S10, judging whether the number of deployment sets has reached the minimum value;

if the number of deployment sets has reached the minimum value, returning to S1;

if the number of deployment sets has not reached the minimum value, executing S11;

and S11, deleting the newly created deployment set.
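The source does not specify how the pre-scheduling judgment in S4 and S7 is made. One possible sketch, offered purely as an assumption, is a first-fit simulation: each replica of the proposed deployment set is placed on the first node with enough free CPU and memory, and the check fails if any replica cannot be placed. The `Node` type and its fields are hypothetical, and the real Kubernetes scheduler considers many more constraints (affinity, taints, and so on).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    """Hypothetical view of a node's unallocated capacity."""
    name: str
    free_cpu_millicores: int
    free_memory_mib: int


def can_preschedule(nodes: List[Node], replicas: int,
                    cpu_millicores: int, memory_mib: int) -> bool:
    """First-fit simulation: True if every replica fits on some node.

    Works on a copy of the free capacities, so the input nodes are not modified.
    """
    free = [[n.free_cpu_millicores, n.free_memory_mib] for n in nodes]
    for _ in range(replicas):
        for slot in free:
            if slot[0] >= cpu_millicores and slot[1] >= memory_mib:
                slot[0] -= cpu_millicores
                slot[1] -= memory_mib
                break
        else:
            # No node could host this replica.
            return False
    return True


nodes = [Node("n1", 1000, 1024), Node("n2", 500, 512)]
print(can_preschedule(nodes, replicas=3, cpu_millicores=500, memory_mib=512))
```

Running the check before creating the new deployment set is what lets the flow fall back to S9 (instances only) or return to S1 instead of creating Pods that would stay unschedulable.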

In an alternative embodiment, a Kubernetes container is also included; the Kubernetes container includes a horizontal container group expansion and contraction device and a vertical container group expansion and contraction device.

In an alternative embodiment, the horizontal container group expansion and contraction device is connected with the deployment set; the deployment set includes a container group replica count; the container group replicas of the deployment set include container group A1, container group A2, …, and container group AN.

In an alternative embodiment, the vertical container group expansion and contraction device is connected with the deployment set; the deployment set includes container resources; the container resources of the deployment set cover container group B1, container group B2, …, and container group BN.

It is to be understood that the above-described embodiments of the present invention are merely illustrative of or explaining the principles of the invention and are not to be construed as limiting the invention. Therefore, any modification, equivalent replacement, improvement and the like made without departing from the spirit and scope of the present invention should be included in the protection scope of the present invention. Further, it is intended that the appended claims cover all such variations and modifications as fall within the scope and boundaries of the appended claims or the equivalents of such scope and boundaries.

Claims

1. A Kubernetes container resource expansion and contraction method, characterized by comprising the following specific steps:

S1, acquiring Metrics Server monitoring information, and judging whether the resource usage reaches the capacity expansion standard or the capacity reduction standard:

if the resource usage reaches the capacity expansion standard, executing S2;

if the resource usage reaches the capacity reduction standard, executing S10;

S2, judging whether the number of instances has reached the upper limit;

if the number of instances has reached the upper limit, executing S3;

if the number of instances has not reached the upper limit, executing S6;

S3, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, returning to S1;

if the resource usage has not reached the upper limit, executing S4;

S4, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S5;

if pre-scheduling of the increased resources fails, returning to S1;

S5, creating a new deployment set and increasing the resources;

S6, judging whether the resource usage has reached the upper limit;

if the resource usage has reached the upper limit, executing S9;

if the resource usage has not reached the upper limit, executing S7;

S7, judging whether the increased resources can be scheduled successfully;

if pre-scheduling of the increased resources succeeds, executing S8;

if pre-scheduling of the increased resources fails, executing S9;

S8, creating a new deployment set, and increasing both the number of instances and the resources;

S9, creating a new deployment set, and increasing only the number of instances;

S10, judging whether the number of deployment sets has reached the minimum value;

if the number of deployment sets has reached the minimum value, returning to S1;

if the number of deployment sets has not reached the minimum value, executing S11;

and S11, deleting the newly created deployment set.

2. The Kubernetes container resource expansion and contraction method according to claim 1, further comprising a Kubernetes container; the Kubernetes container includes a horizontal container group expansion and contraction device and a vertical container group expansion and contraction device.

3. The Kubernetes container resource expansion and contraction method according to claim 2, wherein the horizontal container group expansion and contraction device is connected with a deployment set; the deployment set includes a container group replica count; the container group replicas of the deployment set include container group A1, container group A2, …, and container group AN.

4. The Kubernetes container resource expansion and contraction method according to claim 2, wherein the vertical container group expansion and contraction device is connected with a deployment set; the deployment set includes container resources; the container resources of the deployment set cover container group B1, container group B2, …, and container group BN.