CN111949383A - Management method for maintaining working nodes of cloud computing system - Google Patents

Management method for maintaining working nodes of cloud computing system Download PDF

Info

Publication number
CN111949383A
CN111949383A CN202011009779.5A CN202011009779A CN111949383A CN 111949383 A CN111949383 A CN 111949383A CN 202011009779 A CN202011009779 A CN 202011009779A CN 111949383 A CN111949383 A CN 111949383A
Authority
CN
China
Prior art keywords
node
working node
maintenance
computing system
cloud computing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011009779.5A
Other languages
Chinese (zh)
Inventor
晏东
吴晓勇
叶盼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Ghostcloud Technology Co ltd
Original Assignee
Chengdu Ghostcloud Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Ghostcloud Technology Co ltd filed Critical Chengdu Ghostcloud Technology Co ltd
Priority to CN202011009779.5A priority Critical patent/CN111949383A/en
Publication of CN111949383A publication Critical patent/CN111949383A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/4557Distribution of virtual machine instances; Migration and load balancing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45595Network integration; Enabling network access in virtual machine instances

Abstract

The invention discloses a management method for maintaining a working node of a cloud computing system, and relates to the field of container node maintenance based on a cloud platform. The invention can solve the problems of troublesome maintenance and great difficulty of the current working nodes of the cloud computing system, and comprises the following steps: the system receives the node maintenance request and then informs the task scheduler and the node to be maintained, the task scheduler does not distribute new tasks to the node to be maintained, the application instance on the node to be maintained is migrated to other nodes which normally operate and stops the application instance on the node to be maintained, and the system receives the node quit maintenance request and then informs the task scheduler that the corresponding node normally operates. By the method, the difficulty degree of node maintenance work is greatly reduced, the user operation is simpler and more convenient, the system operation and maintenance cost is further reduced, and the user experience is better. The invention is particularly suitable for maintaining the working nodes in the cloud cluster environment.

Description

Management method for maintaining working nodes of cloud computing system
Technical Field
The invention relates to the field of container node maintenance based on a cloud platform, in particular to a management method for maintaining a working node of a cloud computing system.
Background
In a cluster environment of a cloud computing system, a maintenance work node exists as a basic requirement of an operation and maintenance cluster environment. In a traditional operation and maintenance management mode, because maintenance of working nodes in a cluster involves a large amount of detail work interacting with the cluster, operation and maintenance personnel need to know the cloud environment above the cluster very much and complete the work of maintaining the working nodes on the basis of the detail function provided by the cloud environment, and non-professional personnel cannot complete the work well, so that the maintenance of the working nodes of a cloud computing system is troublesome and difficult at present.
Disclosure of Invention
The invention provides a management method for maintaining working nodes of a cloud computing system, aiming at solving the problems of troublesome maintenance and high difficulty of the working nodes of the cloud computing system at present.
In order to solve the problems, the invention adopts the technical scheme that:
the management method for maintaining the working nodes of the cloud computing system comprises a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;
the entering the maintenance mode part comprises:
step S1, the cloud computing system management center monitors whether a maintenance request for any working node is received, and if the maintenance request for any working node is received, the step S2 is executed;
step S2, the cloud computing system management center informs the task scheduler that the working node enters a maintenance mode;
step S3, the cloud computing system management center informs the working node to enter the maintenance mode, and after receiving the notice of entering the maintenance mode, the working node migrates the application instance on the current node to the working node which does not enter the maintenance mode;
step S4, the working node stops the application instance on the current node;
the exit from the maintenance mode section includes:
the method comprises the steps that a cloud computing system management center monitors whether a maintenance quitting request for any working node is received or not, and if the maintenance quitting request for any working node is received, a task scheduler is informed that the working node does not enter a maintenance mode;
the task scheduling part comprises:
and when the task scheduler deploys the application instances to each working node, not deploying the application instances to the working nodes entering the maintenance mode according to the mode of each working node.
As a further optimization, the step S3 of migrating the application instance on the current node to the working node that does not enter the maintenance mode specifically adopts:
the working node requests the task scheduler to redeploy the application instance on the current node;
the method comprises the steps that a cloud computing system management center monitors whether application instances on a current working node are completely redeployed to the working node which does not enter a maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the working node, and if not, the first notification information is not sent to the working node;
the worker node, upon receiving the first notification information, executes step S4.
As a further optimization, the cloud computing system employs a cloud computing system based on container technology.
As a further optimization, the cloud computing system management center receives maintenance requests and quit maintenance requests for any working node by providing command input lines and/or web page buttons.
The invention has the beneficial effects that: according to the management method for maintaining the working nodes of the cloud computing system, when a user needs to maintain a certain working node, a request for maintaining the corresponding working node is directly input into the cloud computing system, then the cloud computing system stops distributing a new task to the working node needing to be maintained, an application instance on the working node needing to be maintained is migrated to other working nodes which normally operate for processing, the user can maintain the working node needing to be maintained after the application instance on the working node needing to be maintained stops operating, after the maintenance is completed, the user directly inputs a request for quitting the maintenance of the corresponding working node into the cloud computing system, then the task distributor can distribute the new task to the corresponding working node quitting the maintenance, and the corresponding working node quitting the maintenance operates normally. Therefore, through the management method provided by the invention, when a certain working node is maintained, the working node is enabled to enter the maintenance state and exit the maintenance state only by one simple request operation of a user without considering various details in the cloud environment, the difficulty degree of node maintenance work is greatly reduced, the user operation is simpler and more convenient, the system operation and maintenance cost is further reduced, and the user experience is better. The invention is particularly suitable for maintaining the working nodes in the cloud cluster environment.
Drawings
Fig. 1 is a flowchart of overall maintenance in the embodiment of the present invention.
Detailed Description
The technical solution of the present invention will be described in detail below with reference to the embodiments and the accompanying drawings.
The invention provides a management method for maintaining a working node of a cloud computing system, which comprises a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;
the enter maintenance mode part includes:
step S1, the cloud computing system management center monitors whether a maintenance request for any working node is received, and if the maintenance request for any working node is received, the step S2 is executed;
step S2, the cloud computing system management center informs the task scheduler that the working node enters a maintenance mode;
step S3, the cloud computing system management center informs the working node to enter the maintenance mode, and after receiving the notice of entering the maintenance mode, the working node migrates the application instance on the current node to the working node which does not enter the maintenance mode;
step S4, the working node stops the application instance on the current node;
the exit from maintenance mode portion includes:
the method comprises the steps that a cloud computing system management center monitors whether a maintenance quitting request for any working node is received or not, and if the maintenance quitting request for any working node is received, a task scheduler is informed that the working node does not enter a maintenance mode;
the task scheduling part comprises:
and when the task scheduler deploys the application instances to each working node, not deploying the application instances to the working nodes entering the maintenance mode according to the mode of each working node.
By the management method, when a user needs to maintain a certain working node, a request for maintaining the corresponding working node is directly input into the cloud computing system, then the cloud computing system stops distributing a new task to the working node needing to be maintained, an application instance on the working node needing to be maintained is migrated to other working nodes which normally run for processing, after the application instance on the working node needing to be maintained stops running, the user can maintain the working node needing to be maintained, after maintenance is completed, the user directly inputs a request for quitting the corresponding working node to be maintained into the cloud computing system, then the task distributor can distribute the new task to the corresponding working node which quits maintenance, and the corresponding working node which quits maintenance runs normally.
The method is further optimized, and specifically, the following aspects can be provided:
on one hand, the step S3 of migrating the application instance on the current node to the working node that does not enter the maintenance mode may specifically adopt: the working node requests the task scheduler to redeploy the application instance on the current node; the method comprises the steps that a cloud computing system management center monitors whether application instances on a current working node are completely redeployed to the working node which does not enter a maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the working node, and if not, the first notification information is not sent to the working node; the worker node, upon receiving the first notification information, executes step S4. Through the optimization, the application instance to be stopped on the working node needing maintenance is stopped after the application instance to be stopped on the working node needing maintenance is ready on other working nodes again and can provide service to the outside, the phenomenon of service interruption can not occur on the application layer, and non-stop maintenance is realized.
On the other hand, the cloud computing system may adopt a cloud computing system based on container technology. By adopting the cloud environment based on the container technology, the workload of operation and maintenance personnel can be greatly reduced, and the safety and the portability of the system are better.
On the other hand, the cloud computing system management center receives a maintenance request and a maintenance quitting request for any working node by providing a command input line and/or a webpage button. The user can operate the device more conveniently and quickly.
Examples
The following specifically exemplifies the technical solution of the present invention.
In this example, the overall maintenance flow chart is shown in fig. 1, where the cloud computing system is a container cloud computing system, and it is assumed that the container cloud computing system in this example has a work node one, a work node two, and a work node three.
In the practical application process, when a user needs to maintain a first working node, the user selects to maintain the first working node through a webpage button provided by a container cloud computing system management center, and at the moment, the container cloud computing system management center receives a maintenance request for the first working node; then the management center of the container cloud computing system informs the task scheduler that the working node enters a maintenance mode, and the task scheduler does not distribute a new application instance to the working node; and then the container cloud computing system management center informs the first working node of entering a maintenance mode, the first working node requests the task scheduler to relocate the application instance on the current node after receiving the notification of entering the maintenance mode, the task scheduler responds to the request of the first working node to perform corresponding processing, in addition, the container cloud computing system management center starts to monitor whether all the application instances on the first working node are relocated to the working nodes which do not enter the maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the first working node, and if not, the first notification information is not sent to the first working node.
In this embodiment, the working nodes to which the application instance on the working node one can be redeployed are a working node two and a working node three; assuming that the application instance on the first working node is already redeployed and can provide corresponding service to the outside, the first working node receives the first notification message, and the first working node stops the application instance on the current node after receiving the first notification message. Therefore, the application instance on the first working node is safely migrated to the second working node and/or the third working node, and the operation and maintenance personnel can perform maintenance operation on the server in the first working node.
It should be noted that, after the maintenance is completed, the user selects to quit the maintenance on the first working node through a webpage button provided by the container cloud computing system management center, and at this time, the container cloud computing system management center receives a request for quitting the maintenance on the first working node; the container cloud computing system management center then notifies the task scheduler that the worker node will not enter the maintenance mode, and thereafter the task scheduler will normally allocate a new application instance to the worker node, and the worker node will normally process the application instance thereon.

Claims (4)

1. The management method for maintaining the working nodes of the cloud computing system is characterized by comprising a maintenance mode entering part, a maintenance mode exiting part and a task scheduling part;
the entering the maintenance mode part comprises:
step S1, the cloud computing system management center monitors whether a maintenance request for any working node is received, and if the maintenance request for any working node is received, the step S2 is executed;
step S2, the cloud computing system management center informs the task scheduler that the working node enters a maintenance mode;
step S3, the cloud computing system management center informs the working node to enter the maintenance mode, and after receiving the notice of entering the maintenance mode, the working node migrates the application instance on the current node to the working node which does not enter the maintenance mode;
step S4, the working node stops the application instance on the current node;
the exit from the maintenance mode section includes:
the method comprises the steps that a cloud computing system management center monitors whether a maintenance quitting request for any working node is received or not, and if the maintenance quitting request for any working node is received, a task scheduler is informed that the working node does not enter a maintenance mode;
the task scheduling part comprises:
and when the task scheduler deploys the application instances to each working node, not deploying the application instances to the working nodes entering the maintenance mode according to the mode of each working node.
2. The management method for maintaining a cloud computing system working node as claimed in claim 1, wherein the migrating the application instance on the current local node to the working node that does not enter the maintenance mode in step S3 specifically employs:
the working node requests the task scheduler to redeploy the application instance on the current node;
the method comprises the steps that a cloud computing system management center monitors whether application instances on a current working node are completely redeployed to the working node which does not enter a maintenance mode and can provide corresponding services for the outside, if yes, first notification information is sent to the working node, and if not, the first notification information is not sent to the working node;
the worker node, upon receiving the first notification information, executes step S4.
3. A management method for maintaining cloud computing system worker nodes as recited in claim 1, wherein the cloud computing system employs a cloud computing system based on container technology.
4. An administration method for maintaining working nodes of a cloud computing system according to any one of claims 1 to 3, wherein the cloud computing system management center receives maintenance requests and maintenance quit requests for any working node by providing command input lines and/or web page buttons.
CN202011009779.5A 2020-09-23 2020-09-23 Management method for maintaining working nodes of cloud computing system Pending CN111949383A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011009779.5A CN111949383A (en) 2020-09-23 2020-09-23 Management method for maintaining working nodes of cloud computing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011009779.5A CN111949383A (en) 2020-09-23 2020-09-23 Management method for maintaining working nodes of cloud computing system

Publications (1)

Publication Number Publication Date
CN111949383A true CN111949383A (en) 2020-11-17

Family

ID=73356487

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011009779.5A Pending CN111949383A (en) 2020-09-23 2020-09-23 Management method for maintaining working nodes of cloud computing system

Country Status (1)

Country Link
CN (1) CN111949383A (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108733533A (en) * 2017-04-18 2018-11-02 微软技术许可有限责任公司 The optional manual scheduling of host maintenance through plan

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108733533A (en) * 2017-04-18 2018-11-02 微软技术许可有限责任公司 The optional manual scheduling of host maintenance through plan

Similar Documents

Publication Publication Date Title
EP3522013B1 (en) Method and system for migration of containers in a container orchestration platform between compute nodes
CN108737270B (en) Resource management method and device for server cluster
CN102457906B (en) Load balancing control method and system of message queues
US10628273B2 (en) Node system, server apparatus, scaling control method, and program
US8032780B2 (en) Virtualization based high availability cluster system and method for managing failure in virtualization based high availability cluster system
CN107959705B (en) Distribution method of streaming computing task and control server
US8954982B2 (en) Resource management using reliable and efficient delivery of application performance information in a cloud computing system
CN106817408B (en) Distributed server cluster scheduling method and device
CN105743995A (en) Transplantable high-available container cluster deploying and managing system and method
JPWO2015146355A1 (en) Update management system and update management method
US10541901B2 (en) Methods, systems and computer readable media for optimizing placement of virtual network visibility components
CN112000434A (en) Kubernetes dynamic management service based governance rule configuration method and system
CN106790092B (en) Remote procedure call server control system and method
CN103810015A (en) Virtual machine establishing method and equipment
CN110958311A (en) YARN-based shared cluster elastic expansion system and method
CN108632106A (en) System for monitoring service equipment
CN102902594A (en) Resource management system and resource management method
CN110532060B (en) Hybrid network environment data acquisition method and system
CN109739634A (en) A kind of atomic task execution method and device
CN110618821A (en) Container cluster system based on Docker and rapid building method
CN106897128B (en) Distributed application quitting method, system and server
CN105208111A (en) Information processing method and physical machine
CN112698929A (en) Information acquisition method and device
CN111949383A (en) Management method for maintaining working nodes of cloud computing system
EP2775400B1 (en) Ressource management system a method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination