CN111949383A

CN111949383A - Management method for maintaining working nodes of cloud computing system

Info

Publication number: CN111949383A
Application number: CN202011009779.5A
Authority: CN
Inventors: 晏东; 吴晓勇; 叶盼
Original assignee: Chengdu Ghostcloud Technology Co ltd
Current assignee: Chengdu Ghostcloud Technology Co ltd
Priority date: 2020-09-23
Filing date: 2020-09-23
Publication date: 2020-11-17

Abstract

The invention discloses a management method for maintaining a working node of a cloud computing system, and relates to the field of container node maintenance based on a cloud platform. The invention can solve the problems of troublesome maintenance and great difficulty of the current working nodes of the cloud computing system, and comprises the following steps: the system receives the node maintenance request and then informs the task scheduler and the node to be maintained, the task scheduler does not distribute new tasks to the node to be maintained, the application instance on the node to be maintained is migrated to other nodes which normally operate and stops the application instance on the node to be maintained, and the system receives the node quit maintenance request and then informs the task scheduler that the corresponding node normally operates. By the method, the difficulty degree of node maintenance work is greatly reduced, the user operation is simpler and more convenient, the system operation and maintenance cost is further reduced, and the user experience is better. The invention is particularly suitable for maintaining the working nodes in the cloud cluster environment.

Description

Management method for maintaining working nodes of cloud computing system

Technical Field

The invention relates to the field of container node maintenance based on a cloud platform, in particular to a management method for maintaining a working node of a cloud computing system.

Background

In a cluster environment of a cloud computing system, a maintenance work node exists as a basic requirement of an operation and maintenance cluster environment. In a traditional operation and maintenance management mode, because maintenance of working nodes in a cluster involves a large amount of detail work interacting with the cluster, operation and maintenance personnel need to know the cloud environment above the cluster very much and complete the work of maintaining the working nodes on the basis of the detail function provided by the cloud environment, and non-professional personnel cannot complete the work well, so that the maintenance of the working nodes of a cloud computing system is troublesome and difficult at present.

Disclosure of Invention

The invention provides a management method for maintaining working nodes of a cloud computing system, aiming at solving the problems of troublesome maintenance and high difficulty of the working nodes of the cloud computing system at present.

In order to solve the problems, the invention adopts the technical scheme that:

the management method for maintaining the working nodes of the cloud computing system comprises a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;

the entering the maintenance mode part comprises:

step S1, the cloud computing system management center monitors whether a maintenance request for any working node is received, and if the maintenance request for any working node is received, the step S2 is executed;

step S2, the cloud computing system management center informs the task scheduler that the working node enters a maintenance mode;

step S3, the cloud computing system management center informs the working node to enter the maintenance mode, and after receiving the notice of entering the maintenance mode, the working node migrates the application instance on the current node to the working node which does not enter the maintenance mode;

step S4, the working node stops the application instance on the current node;

the exit from the maintenance mode section includes:

the method comprises the steps that a cloud computing system management center monitors whether a maintenance quitting request for any working node is received or not, and if the maintenance quitting request for any working node is received, a task scheduler is informed that the working node does not enter a maintenance mode;

the task scheduling part comprises:

and when the task scheduler deploys the application instances to each working node, not deploying the application instances to the working nodes entering the maintenance mode according to the mode of each working node.

As a further optimization, the step S3 of migrating the application instance on the current node to the working node that does not enter the maintenance mode specifically adopts:

the working node requests the task scheduler to redeploy the application instance on the current node;

the method comprises the steps that a cloud computing system management center monitors whether application instances on a current working node are completely redeployed to the working node which does not enter a maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the working node, and if not, the first notification information is not sent to the working node;

the worker node, upon receiving the first notification information, executes step S4.

As a further optimization, the cloud computing system employs a cloud computing system based on container technology.

As a further optimization, the cloud computing system management center receives maintenance requests and quit maintenance requests for any working node by providing command input lines and/or web page buttons.

The invention has the beneficial effects that: according to the management method for maintaining the working nodes of the cloud computing system, when a user needs to maintain a certain working node, a request for maintaining the corresponding working node is directly input into the cloud computing system, then the cloud computing system stops distributing a new task to the working node needing to be maintained, an application instance on the working node needing to be maintained is migrated to other working nodes which normally operate for processing, the user can maintain the working node needing to be maintained after the application instance on the working node needing to be maintained stops operating, after the maintenance is completed, the user directly inputs a request for quitting the maintenance of the corresponding working node into the cloud computing system, then the task distributor can distribute the new task to the corresponding working node quitting the maintenance, and the corresponding working node quitting the maintenance operates normally. Therefore, through the management method provided by the invention, when a certain working node is maintained, the working node is enabled to enter the maintenance state and exit the maintenance state only by one simple request operation of a user without considering various details in the cloud environment, the difficulty degree of node maintenance work is greatly reduced, the user operation is simpler and more convenient, the system operation and maintenance cost is further reduced, and the user experience is better. The invention is particularly suitable for maintaining the working nodes in the cloud cluster environment.

Drawings

Fig. 1 is a flowchart of overall maintenance in the embodiment of the present invention.

Detailed Description

The technical solution of the present invention will be described in detail below with reference to the embodiments and the accompanying drawings.

The invention provides a management method for maintaining a working node of a cloud computing system, which comprises a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;

the enter maintenance mode part includes:

step S4, the working node stops the application instance on the current node;

the exit from maintenance mode portion includes:

the task scheduling part comprises:

By the management method, when a user needs to maintain a certain working node, a request for maintaining the corresponding working node is directly input into the cloud computing system, then the cloud computing system stops distributing a new task to the working node needing to be maintained, an application instance on the working node needing to be maintained is migrated to other working nodes which normally run for processing, after the application instance on the working node needing to be maintained stops running, the user can maintain the working node needing to be maintained, after maintenance is completed, the user directly inputs a request for quitting the corresponding working node to be maintained into the cloud computing system, then the task distributor can distribute the new task to the corresponding working node which quits maintenance, and the corresponding working node which quits maintenance runs normally.

The method is further optimized, and specifically, the following aspects can be provided:

on one hand, the step S3 of migrating the application instance on the current node to the working node that does not enter the maintenance mode may specifically adopt: the working node requests the task scheduler to redeploy the application instance on the current node; the method comprises the steps that a cloud computing system management center monitors whether application instances on a current working node are completely redeployed to the working node which does not enter a maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the working node, and if not, the first notification information is not sent to the working node; the worker node, upon receiving the first notification information, executes step S4. Through the optimization, the application instance to be stopped on the working node needing maintenance is stopped after the application instance to be stopped on the working node needing maintenance is ready on other working nodes again and can provide service to the outside, the phenomenon of service interruption can not occur on the application layer, and non-stop maintenance is realized.

On the other hand, the cloud computing system may adopt a cloud computing system based on container technology. By adopting the cloud environment based on the container technology, the workload of operation and maintenance personnel can be greatly reduced, and the safety and the portability of the system are better.

On the other hand, the cloud computing system management center receives a maintenance request and a maintenance quitting request for any working node by providing a command input line and/or a webpage button. The user can operate the device more conveniently and quickly.

Examples

The following specifically exemplifies the technical solution of the present invention.

In this example, the overall maintenance flow chart is shown in fig. 1, where the cloud computing system is a container cloud computing system, and it is assumed that the container cloud computing system in this example has a work node one, a work node two, and a work node three.

In the practical application process, when a user needs to maintain a first working node, the user selects to maintain the first working node through a webpage button provided by a container cloud computing system management center, and at the moment, the container cloud computing system management center receives a maintenance request for the first working node; then the management center of the container cloud computing system informs the task scheduler that the working node enters a maintenance mode, and the task scheduler does not distribute a new application instance to the working node; and then the container cloud computing system management center informs the first working node of entering a maintenance mode, the first working node requests the task scheduler to relocate the application instance on the current node after receiving the notification of entering the maintenance mode, the task scheduler responds to the request of the first working node to perform corresponding processing, in addition, the container cloud computing system management center starts to monitor whether all the application instances on the first working node are relocated to the working nodes which do not enter the maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the first working node, and if not, the first notification information is not sent to the first working node.

In this embodiment, the working nodes to which the application instance on the working node one can be redeployed are a working node two and a working node three; assuming that the application instance on the first working node is already redeployed and can provide corresponding service to the outside, the first working node receives the first notification message, and the first working node stops the application instance on the current node after receiving the first notification message. Therefore, the application instance on the first working node is safely migrated to the second working node and/or the third working node, and the operation and maintenance personnel can perform maintenance operation on the server in the first working node.

It should be noted that, after the maintenance is completed, the user selects to quit the maintenance on the first working node through a webpage button provided by the container cloud computing system management center, and at this time, the container cloud computing system management center receives a request for quitting the maintenance on the first working node; the container cloud computing system management center then notifies the task scheduler that the worker node will not enter the maintenance mode, and thereafter the task scheduler will normally allocate a new application instance to the worker node, and the worker node will normally process the application instance thereon.

Claims

1. The management method for maintaining the working nodes of the cloud computing system is characterized by comprising a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;

the entering the maintenance mode part comprises:

step S4, the working node stops the application instance on the current node;

the exit from the maintenance mode section includes:

the task scheduling part comprises:

2. The management method for maintaining a cloud computing system working node as claimed in claim 1, wherein the migrating the application instance on the current local node to the working node that does not enter the maintenance mode in step S3 specifically employs:

3. A management method for maintaining cloud computing system worker nodes as recited in claim 1, wherein the cloud computing system employs a cloud computing system based on container technology.

4. An administration method for maintaining working nodes of a cloud computing system according to any one of claims 1 to 3, wherein the cloud computing system management center receives maintenance requests and maintenance quit requests for any working node by providing command input lines and/or web page buttons.