CN113742033A

CN113742033A - Kubernetes cluster federal system and implementation method thereof

Info

Publication number: CN113742033A
Application number: CN202111048076.8A
Authority: CN
Inventors: 韦克璐; 赵凯麟; 李玲容; 周东
Original assignee: Guangxi Dongxin Digital Construction Information Technology Co ltd
Current assignee: Guangxi Dongxin Digital Construction Information Technology Co ltd
Priority date: 2021-09-08
Filing date: 2021-09-08
Publication date: 2021-12-03

Abstract

The invention discloses a kubernets cluster federation realization method, which relates to the technical field of network communication and solves the problem of service discovery and arrangement of kubernets across cluster resources, and the method comprises the following steps: setting a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller; initializing a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller to a kubernets system to enable the kubernets to become a main cluster of a federal cluster, wherein the cluster controller, the state controller, the synchronous controller, the scheduler and the command controller can run at least 1 copy. The invention also discloses a kubernets cluster federal system. The invention runs a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller in a kubernets system, utilizes CRD resources and admission control of an administration webhook built in the kubernets to achieve the complete cluster federal function, realizes unified management of a plurality of kubernets clusters by a single cluster, and can effectively solve the problem of service discovery and arrangement of cross-cluster resources of the kubernets.

Description

Kubernetes cluster federal system and implementation method thereof

Technical Field

The invention relates to the technical field of network communication, in particular to a kubernetes cluster federal system and an implementation method thereof.

Background

With the continuous development of cloud computing, the technical architecture of IT infrastructure is undergoing drastic change, kubernets reduces the development, management, deployment and operation and maintenance costs of application programs from a brand-new perspective, and the architecture thereof becomes a de facto standard for rapid development, testing and deployment of application programs, and is deployed on a large scale in a production environment and adopted by more and more companies.

For enterprise-level customers, the data is required to be in safety compliance in the cloud uploading process, so that the cloud uploading is generally carried out in a multi-cloud and mixed-cloud manner. Before Kubernetes and container technology came into existence, it was necessary to customize development efforts for each cloud facilitator to achieve a cloudy and mixed cloud. After the Kubernets and the containers are arranged, the local data center and the Kubernets cluster of the cloud service provider can provide consistent interfaces, so that the application program does not need to be directly bound with a specific cloud service provider under most conditions, the cloud service provider is unlocked, and the situation that the service is only dependent on a certain cloud service provider is avoided. Meanwhile, the service availability is improved, a scheme with lower cost can be selected according to the price of the cloud service provider, the elastic expansion of service burst is guaranteed, and the method is expanded to other cloud service providers when the capacity of a certain cloud service provider is insufficient.

Building the Kubernetes cluster federation, however, presents a service discovery and orchestration problem across cluster resources for Kubernetes.

Disclosure of Invention

The technical problem to be solved by the invention is to solve the defects in the prior art, and the invention aims to provide a kubernets cluster federation realization method.

The invention also aims to provide a kubernets cluster federal system.

In order to achieve the first purpose, the invention provides a kubernetes cluster federation realization method, which comprises the steps of setting a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller;

initializing the cluster controller, the state controller, the synchronous controller, the scheduler and the command controller to a kubernets system to enable the kubernets to become a main cluster of a federal cluster, wherein the cluster controller, the state controller, the synchronous controller, the scheduler and the command controller can run at least 1 copy;

configuring a federal cluster through the command controller, and performing interactive operation with an administrator, wherein the interactive operation comprises initializing the federal cluster, adding a slave cluster and viewing information of the federal cluster;

managing, by the cluster controller, a lifecycle of the federated cluster and checking health of master and slave clusters;

sensing and caching the running state of the federated resource in the slave cluster through the state controller;

distributing, by the dispatcher, workload instances into different clusters of the federated cluster;

and monitoring the expected value and the current running state of the federated resource of the slave cluster through the synchronous controller, and when the expected value is inconsistent with the running state, the synchronous controller is responsible for operating the federated resource of the slave cluster so as to keep the running state of the resource consistent with the expected value.

As a further improvement, the command controller operates as follows:

step 1.1, the command controller is initialized and operated to the main cluster, communicates with a kube-apiserver of the main cluster, and acquires configuration information of the federal cluster; if the acquisition is successful, executing the step 1.3; otherwise, executing step 1.2;

step 1.2, the command controller initializes the configuration information of the federal cluster; if the initialization is successful, the command controller is started successfully, and the step 1.3 is executed; otherwise, re-executing step 1.1;

step 1.3, the command controller waits for a user to input a command;

step 1.4, when the command controller receives a slave cluster adding command, executing a slave cluster adding step;

step 1.5, when the command controller receives a command for deleting the slave cluster, executing the step of deleting the slave cluster;

step 1.6, when the command controller receives a request for displaying a slave cluster list, searching and displaying the slave cluster resources;

step 1.7, step 1.3 is performed.

Further, the step of performing an add slave cluster comprises:

step 1.4.1, the command controller receives configuration file information of a slave cluster;

step 1.4.2, the command controller generates CRD resources of kubernets according to the configuration file information of the slave cluster;

step 1.4.3, the command controller connects the kube-apiserver of the slave cluster according to the connection information of the CRD resource;

step 1.4.4. the command controller creates role resources on the slave cluster, and if the creation is successful, successful information is returned; otherwise, failure information is returned.

Further, the step of performing deletion slave clustering comprises:

step 1.5.1, the command controller receives a request for deleting the appointed slave cluster, and searches the connection information of the CRD resource in the resources in the slave cluster;

step 1.5.2, the command controller connects and appoints a slave cluster kube-apiserver according to the connection information in the step 1.5.1, and deletes relevant information appointed on the slave cluster;

step 1.5.3, the command controller deletes the CRD resource in the appointed slave cluster, and if the deletion is successful, the success information is returned; otherwise, failure information is returned.

Further, the cluster controller operates as follows:

step 2.1, the cluster controller initializes and runs to the main cluster, communicates with the kube-apiserver of the main cluster, and obtains configuration information of the federal cluster and a slave cluster list; if the acquisition is successful, executing the step 2.2; otherwise, continuing to execute the step;

step 2.2, the cluster controller records the current time T1, traverses the slave cluster list to obtain slave cluster access information, and then accesses the kube-apiserver of the slave cluster; when the request can be correctly returned and the return value is a set value, the slave cluster is in a healthy state, otherwise, the slave cluster is in an unhealthy state;

if the health state changes, writing new health information into the slave cluster list, and recording the current time T2 after traversing is completed;

and 2.3, the cluster controller acquires health check delay time T according to the configuration information of the federal cluster, if T < T2-T1, the step 2.2 is executed, and if not, the step is repeatedly executed.

Further, the state controller operates as follows:

step 3.1, the state controller initializes and runs to the main cluster, communicates with the kube-apiserver of the main cluster, and obtains configuration information and a cluster list of the federal cluster; if the acquisition is successful, executing the step 3.2; otherwise, continuing to execute the step;

step 3.2, the state controller acquires all running federatization resources in a traversing manner;

and 3.3, the state controller acquires the current single federal resource, uses a kubernetes Informer mode to subscribe the change of the current resource, caches the change to the main cluster, and circulates the step until all the currently running federal resources are monitored.

And 3.4, the state controller continuously monitors the change of all running federated resources, and when the federated resources change, the changed information is cached to the main cluster.

Further, the scheduler operates as follows:

step 4.1, the scheduler is initialized and operated to the main cluster, communicates with the kube-apiserver of the main cluster, and executes the step 4.2 if the communication is successful; otherwise, continuing to execute the step;

step 4.2, the scheduler records the current time T3, and acquires the currently running Deployment federal resource, replicase federal resource, Statefelset federal resource and Daemonset federal resource;

step 4.3, the scheduler acquires the current single federal resource mentioned in the step 4.2, compares the scheduling state of the current federal resource, and extracts newly created kubernets Pod, unscheduled kubernets Pod and failed kubernets Pod;

step 4.4, the scheduler calls a set scheduling algorithm to schedule the currently acquired resources to a proper cluster, the scheduling result is written into the resources, and the synchronous controller executes resource synchronization;

step 4.5. if the scheduler has traversed all the resources mentioned in step 4.2, then step 4.6 is performed; otherwise, the scheduler acquires the next resource and executes the step 4.3;

step 4.6. the scheduler obtains the current time T4, T5 ═ T4-T3, if T5> the set time, step 4.2 is performed; otherwise, step 4.2 is executed after waiting the set time-T5.

Further, the setting of the scheduling algorithm includes:

step 4.4.1, acquiring the current resource type, kubernets nodes, cluster information and the service condition of resources;

4.4.2, arranging the resources of kubernets to obtain a PSP file based on the total copy number and the definition strategy of the cluster;

4.4.3, performing priority ordering operation according to the PSP file, and performing priority ordering on nodes suitable for resource operation;

and 4.4.4, selecting one cluster with the highest final priority as a resource scheduling result.

Further, the synchronization controller operates as follows:

step 5.1, the synchronous controller is initialized to run to the main cluster, communicates with the kube-apiserver of the main cluster, and if the communication is successful, the step 5.2 is executed; otherwise, continuing to execute the step;

step 5.2, the synchronous controller records the current time T6 and traverses a resource list needing to be synchronized;

step 5.3, the synchronous controller acquires a single resource and the latest state of the resource, and if the expected state of the resource is inconsistent with the latest state, the step 5.4 is executed; otherwise, executing step 5.6;

step 5.4, the synchronous controller acquires the slave cluster name from the resource, inquires the connection information of the slave cluster from the master cluster according to the slave cluster name, communicates with the kube-apiserver of the slave cluster, and executes the step 5.5 if the communication is successful, otherwise executes the step 5.6;

step 5.5, the synchronous controller creates or modifies corresponding resource information on the slave cluster according to the acquired data of the resource, and executes step 5.6;

step 5.6, if the synchronous controller has traversed all resources, executing step 5.7; otherwise, executing step 5.3;

step 5.7. if the synchronization controller obtains the current time T7, T8 ═ T7-T6, if T8> set time, then step 5.2 is performed; otherwise, step 5.2 is executed after waiting the set time-T8.

In order to achieve the second purpose, the invention provides a kubernetes cluster federal system, which comprises a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller, wherein the cluster controller is used for controlling the state controller to be in a state of being in a state of being in a being in a state of being in a state of being in a being in a state of being in a being in a;

the cluster controller, the state controller, the synchronous controller, the scheduler and the command controller are initialized to a kubernets system and operate according to the kubernets cluster federal implementation method.

Advantageous effects

Compared with the prior art, the invention has the advantages that:

the invention runs a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller in a kubernets system, utilizes CRD resources and admission control of an administration webhook built in the kubernets to achieve the complete cluster federal function, realizes unified management of a plurality of kubernets clusters by a single cluster, and can effectively solve the problem of service discovery and arrangement of cross-cluster resources of the kubernets. The invention realizes the complete cluster federal function, is easy to maintain, and enables the invention to be unbound with a specific Kubernets version by skillfully utilizing Kubernets CRD resources, and to be backward compatible with a newer version.

Drawings

FIG. 1 is an architectural diagram of the present invention.

Detailed Description

The invention will be further described with reference to specific embodiments shown in the drawings.

Referring to fig. 1, a kubernets Cluster federation implementation method sets a Cluster Controller, a Status Controller, a synchronous Controller, a Scheduler, and a Command Controller;

initializing a Cluster Controller, a Status Controller, a synchronous Controller, a Scheduler, a Command Controller to a kubernets system, so that the kubernets becomes a main Cluster of a federal Cluster, and the Cluster Controller, the Status Controller, the synchronous Controller, the Scheduler, and the Command Controller can run at least 1 copy;

configuring a federal cluster through a Command Controller, and performing interactive operation with an administrator, wherein the operations comprise initializing the federal cluster, adding a slave cluster and viewing information of the federal cluster;

managing the life cycle of the federated Cluster through a Cluster Controller, and checking the health of a master Cluster and a slave Cluster;

sensing and caching the running state of the federated resource in the slave cluster through a state Controller Status Controller;

the method comprises the steps that a Scheduler disperses workload instances into different clusters of a federated cluster;

and monitoring the expected value and the current running state of the federated resource of the slave cluster through a synchronous Controller Sync Controller, and when the expected value is inconsistent with the running state, the synchronous Controller Sync Controller is responsible for operating the federated resource of the slave cluster so as to keep the running state of the resource consistent with the expected value.

The Command Controller operates as follows:

step 1.1, initializing and running a Command Controller to a main cluster, communicating with a kube-apiserver of the main cluster, and acquiring configuration information CaihFederalConfig of the federal cluster; if the acquisition is successful, executing the step 1.3; otherwise, executing step 1.2;

step 1.2, a Command Controller initializes configuration information of the federal cluster; if the initialization is successful, the Command Controller is successfully started, and the step 1.3 is executed; otherwise, re-executing step 1.1; the configuration information for initialization is as follows:

-Role, general Role information;

-ServiceAccount, service account information;

-clusterrollingbinding, cluster role and account binding information;

-RoleBinding, generic role and account binding information;

-a CRD resource of caihhfeldalcluster, kubernets, for holding connection information from the cluster, initialization declaring only this resource, containing no content;

-crid resources of caihhfeleralconfig, kubernets, describing federal cluster configuration information, including the following:

-a health check delay time;

-a failed retry time;

-whether domain name service is enabled;

-whether Scheduler is enabled;

-whether cross cluster service discovery is enabled;

-a ValidatingWebhookConfiguration and a muttatingwebhookconfiguration for unified format and associated calls when cluster federation generates resources;

-caihh federalcrds federate resources federatization kubernets resources, defamation of Namespace, Deployment, Service, ConfigMap, Secret, Ingress, Job, ReplicaSet, ClusterRole, Role, ServiceAccount, ClusterRole binding, rollering binding resources federatization, other resources may federate using command lines;

step 1.3, a Command Controller waits for a user to input a Command;

step 1.4, when the Command Controller receives the Command of adding the slave cluster, executing the step of adding the slave cluster;

step 1.6, when the Command Controller receives a request for displaying a slave cluster list, searching and displaying the slave cluster resources; searching and displaying the resources of the slave cluster in the CaihFederalCluster resources;

step 1.7, step 1.3 is performed.

The step of performing an add slave cluster comprises:

step 1.4.1, the Command Controller receives the information of the yaml configuration file added to the slave cluster; the yaml profile information is as follows:

-federally clustered proprietary namespace: caih-feeder

Cluster role information

Cluster url, ca authentication information and cluster name from the cluster

-user configuration information of the slave cluster, including user name, authentication information cert, authentication key

-association configuration of usernames and clusters

The file format is referenced as follows:

step 1.4.2, the Command Controller generates CRD resources of kubernets according to the yaml configuration file information of the slave cluster, namely, generates CRD resources of CaihFederalCluster;

step 1.4.3, the Command Controller connects the kube-apiserver of the slave cluster according to the connection information of the CRD resource (namely according to the connection information of the CaihFederalCluster);

step 1.4.4, the Command Controller creates role resources on the slave cluster, and if the creation is successful, successful information is returned; otherwise, returning failure information;

role resources are as follows:

-creating namespace where the federated cluster is located: caih-feed;

-ServiceAccount, service account information;

-ClusterRole, cluster role information;

-clusterrollingbinding, cluster role and account binding information;

-Secret, account key;

-caihhfeleralcrds, federatnetes resources federally, identical to caihhfeleralcrds initialized by the master cluster in step 1.2.

The step of performing deletion slave clustering comprises:

step 1.5.1, the Command Controller receives a request for deleting the appointed slave cluster, and searches the connection information of the CRD resource in the slave cluster, namely the connection information of the slave cluster in the CaihFederalCluster resource;

step 1.5.2, the Command Controller connects and appoints the slave cluster kube-apiserver according to the connection information in step 1.5.1, and deletes the relevant information appointed on the slave cluster;

step 1.5.3, the Command Controller deletes the CRD resource in the appointed slave cluster, namely, deletes the CaihFederalCluster resource, if the deletion is successful, the success information is returned; otherwise, failure information is returned.

The Cluster Controller operates as follows:

step 2.1, the Cluster Controller initializes and runs to the main Cluster, communicates with the kube-apiserver of the main Cluster, and obtains the configuration information CaihFederalConfig of the federal Cluster and the CaihFederal Cluster from the Cluster list; if the acquisition is successful, executing the step 2.2; otherwise, continuing to execute the step;

step 2.2, the Cluster Controller records the current time T1, traverses the slave Cluster list CaihFederalCluster to obtain the slave Cluster access information, and then accesses the kube-apiserver of the slave Cluster; when the request can be correctly returned and the return value is a set value, in this embodiment, the set value is 200, which indicates that the slave cluster is in a healthy state, otherwise, indicates that the slave cluster is in an unhealthy state;

if the health state changes, writing new health information into a subordinate cluster list CaihFederalCluster, and recording the current time T2 after traversing is completed;

and 2.3, the Cluster Controller acquires the health check delay time T according to the configuration information CaihFederalConfig of the federal Cluster, if T is less than T2-T1, the step 2.2 is executed, otherwise, the step is repeatedly executed.

The state Controller Status Controller operates as follows:

step 3.1, initializing and running a Status Controller to a main cluster, communicating with a kube-apiserver of the main cluster, and acquiring configuration information CaihFederalConfig of the federated cluster and a cluster list CaihFederalCluster; if the acquisition is successful, executing the step 3.2; otherwise, continuing to execute the step;

step 3.2, the Status Controller traverses and obtains all running federated resources;

and 3.3, the Status Controller acquires the current single federal resource, subscribes the change of the current resource by using a kubernets inform mode, caches the change to the main cluster, and circulates the step until all the currently running federal resources are monitored.

And 3.4, continuously monitoring the change of all running federal resources by a Status Controller, and caching the changed information to the main cluster when the change of the federal resources occurs.

The Scheduler runs as follows:

step 4.1, the Scheduler is initialized and operated to the main cluster, communicates with the kube-apiserver of the main cluster, if successful, then step 4.2 is executed; otherwise, continuing to execute the step;

step 4.2, the Scheduler records the current time T3, and acquires the currently running Delploymet federal resource, replicase federal resource, Statedulet federal resource and Daemonset federal resource;

step 4.3, the Scheduler obtains the current single federal resource mentioned in the step 4.2, compares the scheduling state of the current federal resource, and extracts newly created kubernets Pod, unscheduled kubernets Pod and failed kubernets Pod;

step 4.4, the Scheduler calls a set scheduling algorithm to schedule the currently acquired resources to a proper cluster, the scheduling result is written into the resources, and a synchronous Controller (Sync Controller) executes resource synchronization;

step 4.5, if the Scheduler has traversed all the resources mentioned in step 4.2, then step 4.6 is executed; otherwise, the Scheduler acquires the next resource and executes the step 4.3;

step 4.6, the Scheduler acquires the current time T4, T5 is T4-T3, if T5> the set time, step 4.2 is executed; otherwise, step 4.2 is executed after waiting the set time-T5.

The set scheduling algorithm comprises the following steps:

step 4.4.1, acquiring the current resource type, kubernets nodes, cluster information and the service condition of resources; such as how many scheduled pots are on the cluster, the resources that have been applied for, and the resources that can also be allocated; acquiring kubernets nodes, namely acquiring slave cluster information of the current resource and all node information of the slave cluster, and determining which node a pod related to the Deploymet, the replicase, the Stateful set and the Daemonset should be dispatched to;

step 4.4.2, arranging the resources of kubernets to obtain a PSP (PodSchedulingPreference) file based on the total copy number and the definition strategy of the cluster;

4.4.3, performing priority ordering operation according to the PSP file, and performing priority ordering on the nodes suitable for resource operation;

The Sync Controller operates as follows:

step 5.1, the synchronous Controller is initialized to operate to the main cluster, communicates with kube-apiserver of the main cluster, if successful, carry out step 5.2; otherwise, continuing to execute the step;

step 5.2, the synchronous Controller records the current time T6 and traverses the resource list to be synchronized;

step 5.3, the synchronous Controller obtains the single resource and the latest state of the resource, if the expected state of the resource is not consistent with the latest state, the step 5.4 is executed; otherwise, executing step 5.6;

step 5.4, the synchronous Controller Sync Controller acquires the names of the slave clusters from the resources, inquires the connection information of the slave clusters from the master cluster according to the names of the slave clusters, and communicates with the kube-apiserver of the slave clusters, if the communication is successful, the step 5.5 is executed, otherwise, the step 5.6 is executed;

step 5.5, the synchronous Controller creates or modifies the corresponding resource information on the slave cluster according to the acquired data of the resource, and executes step 5.6;

step 5.6, if the synchronous Controller Sync Controller has traversed all resources, then step 5.7 is executed; otherwise, executing step 5.3;

step 5.7, if the synchronization Controller obtains the current time T7, and T8 is T7-T6, if T8> the set time, step 5.2 is executed; otherwise, step 5.2 is executed after waiting the set time-T8.

A kubernets Cluster federation system comprises a Cluster Controller, a state Controller Status Controller, a synchronous Controller Sync Controller, a Scheduler and a Command Controller;

the Cluster Controller, the Status Controller, the synchronous Controller, the Scheduler and the Command Controller are initialized to the kubernets system and operate according to the kubernets Cluster federation realization method.

The above is only a preferred embodiment of the present invention, and it should be noted that it is obvious to those skilled in the art that several variations and modifications can be made without departing from the structure of the present invention, which will not affect the effect of the implementation of the present invention and the utility of the patent.

Claims

1. A kubernetes cluster federation realization method is characterized in that a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller are arranged;

2. The method of claim 1, wherein the command controller operates as follows:

step 1.3, the command controller waits for a user to input a command;

step 1.7, step 1.3 is performed.

3. The method of claim 2, wherein performing the add slave cluster step comprises:

4. The method of claim 2, wherein the step of performing a delete slave cluster comprises:

5. The method of claim 1, wherein the cluster controller operates as follows:

6. The method of claim 1, wherein the state controller operates as follows:

7. The method of claim 1, wherein the scheduler operates as follows:

8. The method of claim 7, wherein setting a scheduling algorithm comprises:

9. The method of claim 1, wherein the synchronization controller operates as follows:

10. A kubernetes cluster federation system is characterized by comprising a cluster controller, a state controller, a synchronous controller, a scheduler and a command controller;

the cluster controller, the state controller, the synchronization controller, the scheduler and the command controller are initialized to a kubernets system and operate according to a kubernets cluster federation implementation method of any one of claims 1 to 9.