CN116909757A - Cluster management control system, method, electronic device and storage medium - Google Patents

Cluster management control system, method, electronic device and storage medium Download PDF

Info

Publication number
CN116909757A
CN116909757A CN202311178971.0A CN202311178971A CN116909757A CN 116909757 A CN116909757 A CN 116909757A CN 202311178971 A CN202311178971 A CN 202311178971A CN 116909757 A CN116909757 A CN 116909757A
Authority
CN
China
Prior art keywords
cluster
network element
target
module
management module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311178971.0A
Other languages
Chinese (zh)
Other versions
CN116909757B (en
Inventor
高阳
马成龙
吴仪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Suzhou Software Technology Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Suzhou Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Suzhou Software Technology Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202311178971.0A priority Critical patent/CN116909757B/en
Publication of CN116909757A publication Critical patent/CN116909757A/en
Application granted granted Critical
Publication of CN116909757B publication Critical patent/CN116909757B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5077Logical partitioning of resources; Management or configuration of virtualized resources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1029Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers using data related to the state of servers by a load balancer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1031Controlling of the operation of servers by a load balancer, e.g. adding or removing servers that serve requests
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1034Reaction to server failures by a load balancer
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5022Workload threshold
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/505Clust
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/508Monitor

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computer Hardware Design (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The disclosure discloses a cluster management control system, a method, an electronic device and a storage medium, wherein the cluster management control system comprises: the template configuration module generates a control instruction of the target cluster and sends the control instruction to the cluster management module; the cluster management module receives the first attribute information and the second attribute information, creates a corresponding target cluster according to the first attribute information, and sends the second attribute information to the virtual machine management module; the virtual machine management module receives the second attribute information sent by the cluster management module, creates a corresponding target network element according to the second attribute information, and sends the identification information corresponding to the target network element to the cluster management module. The target cluster comprising at least two types of target network elements is configured through the template configuration module, the target cluster is created by the cluster management module, the virtual machine management module creates the corresponding target network elements according to the second attribute information sent by the cluster management module, and the cluster creation with non-single functions is realized.

Description

Cluster management control system, method, electronic device and storage medium
Technical Field
The present disclosure relates to the field of data processing technologies, and in particular, to a control system, a method, an electronic device, and a storage medium for cluster management.
Background
In the cluster scheme of the related technology, the capacity expansion and contraction of the micro service can be performed according to the resource utilization rate of the micro service. But the technical scheme only relates to the construction, capacity expansion and capacity reduction of the single-function cluster.
Disclosure of Invention
The disclosure provides a cluster management control system, a cluster management control method, electronic equipment and a storage medium. The method mainly aims to solve the problems of construction, capacity expansion and capacity shrinkage of the single-function cluster.
According to a first aspect of the present disclosure, there is provided a control system for cluster management, comprising:
the template configuration module is used for generating a control instruction of a target cluster and sending the control instruction to the cluster management module, wherein the control instruction comprises first attribute information corresponding to the target cluster and second attribute information respectively corresponding to at least two types of target network elements;
the cluster management module is used for receiving the first attribute information and the second attribute information, creating a corresponding target cluster according to the first attribute information and sending the second attribute information to the virtual machine management module;
The virtual machine management module is configured to receive the second attribute information sent by the cluster management module, create a corresponding target network element according to the second attribute information, and send identification information corresponding to the target network element to the cluster management module.
In some embodiments, a monitoring module is also included;
the template configuration module is further used for configuring a monitoring strategy of the monitoring module and sending the monitoring strategy to the monitoring module;
the monitoring module is used for receiving the monitoring strategy sent by the template configuration module and monitoring the target cluster based on the monitoring strategy.
In some embodiments, an alert module is also included;
the template configuration module is further used for configuring an alarm sending strategy of the monitoring module and sending the alarm sending strategy to the monitoring module;
the monitoring module is further configured to send a first notification message to the alarm module based on the alarm sending policy when it is determined that the target cluster is abnormal, where the first notification message carries identification information of a faulty network element;
the alarm module is used for receiving and responding to the first notification message.
In some embodiments, a fault migration module is also included;
the monitoring module is further configured to send a second notification message to the fault migration module when it is determined that the target network element is abnormal, where the second notification message carries identification information of the fault network element;
the fault migration module is configured to receive a second notification message sent by the monitoring module, and send the second notification message to the cluster management module;
the cluster management module is further configured to receive a second notification message sent by the fault migration module, and send indication information for creating a new network element to the virtual machine management module;
the virtual machine management module is further configured to receive the indication information for creating the new network element sent by the cluster management module, create the new network element, and send the identification information of the new network element to the virtual machine management module;
the virtual machine management module is further configured to receive an identifier of a new network element sent by the virtual machine management module
Information and a migration instruction are sent to the virtual machine management module, wherein the migration instruction comprises a request address of the fault network element migration processing;
the virtual machine management module is further configured to receive the migration instruction sent by the virtual machine management module, and migrate the faulty network element to a new network element based on the request address of the faulty network element migration process.
In some embodiments of the present invention, in some embodiments,
the monitoring module is further configured to send a third notification message to the alarm module and send a fourth notification message for modifying the maximum number of network elements of the target cluster in the first attribute information to the template configuration module when it is determined that the load of the target cluster exceeds a first preset load threshold;
the template configuration module is further configured to receive the fourth notification message, and modify a maximum number of network elements of the target cluster in the first attribute information.
In some embodiments of the present invention, in some embodiments,
the monitoring module is further configured to send a capacity expansion notification message to the cluster management module when it is determined that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold;
the cluster management module is further configured to receive the capacity expansion notification message sent by the monitoring module, and send the capacity expansion notification message to the virtual machine management module;
the virtual machine management module is further configured to receive the capacity expansion notification message, and execute creation of a target network element based on the capacity expansion notification message.
In some embodiments of the present invention, in some embodiments,
the monitoring module is further configured to send a capacity reduction notification message to the cluster management module when it is determined that the load of the target cluster is lower than a fourth preset load threshold;
The cluster management module is further configured to receive the volume reduction notification message sent by the monitoring module, and send the volume reduction notification message to the virtual machine management module;
the virtual machine management module is further configured to receive the capacity reduction notification message, and perform deletion of the target network element in the target cluster based on the capacity reduction notification message.
According to a second aspect of the present disclosure, there is provided a control method of cluster management, including:
responding to a control instruction for generating a target cluster, and acquiring first attribute information and second attribute information in the control instruction, wherein the first attribute information is attribute information corresponding to the target cluster, and the second attribute information is attribute information respectively corresponding to at least two types of target network elements;
and creating a target network element according to the first attribute information, and creating the target cluster according to the second attribute information.
In some embodiments, after creating the target cluster from the second attribute information, the method further comprises:
calling a first preset configuration template to generate a monitoring strategy;
monitoring the target network element and the target cluster based on the monitoring strategy;
And under the condition that the target network element is monitored to be abnormal, outputting a first abnormal notification message, wherein the first notification message carries the identification information of the fault network element.
In some embodiments, the method further comprises:
under the condition that the target network element is monitored to be abnormal, creating a new network element which is the same as the second attribute information of the fault network element;
and acquiring the request address of the fault network element migration processing, and executing the migration from the abnormal network element to the new network element.
In some embodiments, the method further comprises:
judging whether the load of the target cluster exceeds a first preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and if the load of the target cluster exceeds a first preset load threshold, reconfiguring the maximum network element number of the target cluster in the first attribute information.
In some embodiments, the method further comprises:
judging whether the load of the target cluster is lower than a second preset load threshold value or not and whether a target network element in the target cluster is higher than a third preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and under the condition that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold, performing capacity expansion processing on the target network element in the target cluster.
In some embodiments, the method further comprises:
judging whether the load of the target cluster is lower than a fourth preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and under the condition that the load of the target cluster is lower than a fourth preset load threshold, carrying out capacity reduction processing on the target network elements in the target cluster.
In some embodiments, the monitoring the target network element and the target cluster based on the monitoring policy includes:
determining a corresponding target monitoring strategy according to the type of the target network element;
monitoring whether the target network element survives according to the target monitoring strategy;
and under the condition that the target network element is determined to survive, monitoring whether the function corresponding to the target network element exists.
According to a third aspect of the present disclosure, there is provided an electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of the second aspect.
According to a fourth aspect of the present disclosure, there is provided a non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of the preceding second aspect.
According to a fifth aspect of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements a method as described in the foregoing second aspect.
The present disclosure provides a cluster management control system, a method, an electronic device, and a storage medium, where the cluster management control system includes: the system comprises a template configuration module, a cluster management module and a virtual machine management module; the template configuration module is used for generating a control instruction of a target cluster and sending the control instruction to the cluster management module, wherein the control instruction comprises first attribute information corresponding to the target cluster and second attribute information respectively corresponding to at least two types of target network elements; the cluster management module is used for receiving the first attribute information and the second attribute information, creating a corresponding target cluster according to the first attribute information and sending the second attribute information to the virtual machine management module; the virtual machine management module is configured to receive the second attribute information sent by the cluster management module, create a corresponding target network element according to the second attribute information, and send identification information corresponding to the target network element to the cluster management module. Compared with the related art, the embodiment of the application configures the target cluster comprising at least two types of target network elements through the template configuration module, the target cluster is created by the cluster management module, and the virtual machine management module creates the corresponding target network element according to the second attribute information sent by the cluster management module, thereby realizing the cluster creation with non-single function.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the application or to delineate the scope of the application. Other features of the present application will become apparent from the description that follows.
Drawings
The drawings are for a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
fig. 1 is a schematic diagram of a control system for cluster management according to an embodiment of the disclosure;
FIG. 2 is a schematic diagram of a monitoring template provided by an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of a fault migration module provided by an embodiment of the present disclosure;
fig. 4 is a schematic diagram of creation or deletion of a target network element according to an embodiment of the present disclosure;
fig. 5 is a flow chart of a control method for cluster management according to an embodiment of the disclosure;
fig. 6 is a schematic block diagram of an example electronic device provided by an embodiment of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Text generation methods, apparatuses, electronic devices, and storage media of embodiments of the present disclosure are described below with reference to the accompanying drawings.
Fig. 1 is a schematic diagram of a cluster management control system according to an embodiment of the disclosure.
As shown in fig. 1, the system includes: the system comprises a template configuration module 11, a cluster management module 12 and a virtual machine management module 13;
the template configuration module 11 is configured to generate a control instruction of a target cluster, and send the control instruction to the cluster management module, where the control instruction includes first attribute information corresponding to the target cluster and second attribute information corresponding to at least two types of target network elements respectively; the first attribute information corresponding to the target cluster includes, but is not limited to, a minimum number of network elements, a maximum number of network elements, etc., and the second attribute information corresponding to the target network elements includes, but is not limited to, a mirror image, a CPU, a memory, a disk storage, etc., and the specific content of the first attribute information and the second attribute information is not limited in the embodiment of the present application.
The cluster management module 12 is configured to receive the first attribute information and the second attribute information, create a corresponding target cluster according to the first attribute information, and send the second attribute information to a virtual machine management module; the cluster management module 12 is composed of a plurality of target network elements with related functions, and one target cluster is composed of a plurality of types of network element clusters, and the cluster management module is responsible for performing management operations such as creation, deletion and the like on the plurality of clusters.
The virtual machine management module 13 is configured to receive the second attribute information sent by the cluster management module 12, create a corresponding target network element according to the second attribute information, and send identification information corresponding to the target network element to the cluster management module. The identification information includes, but is not limited to, id, ip, etc. information of the target network element.
The virtual machine management module 13 manages N managed target network elements (such as cloud hosts), where each cloud host is generated by a certain type of mirror image, and the mirror image includes a certain function, for example mysql or redis. The virtual machine management module 13 performs management operations such as creation and deletion of the target network element according to the issued second attribute information (related parameters such as mirror image, CPU, memory, disk storage, etc.).
In fig. 1, the cluster management module manages a plurality of target clusters, and target network elements involved in each target cluster may be different. Class e.g. target cluster 1, comprises class a network element clusters, which may be mysql databases, and class B network element clusters, which may be network element clusters of the service function class. Specifically, the category of the network element cluster in the embodiment of the application is not limited.
The present disclosure provides a cluster management control system, comprising: the system comprises a template configuration module, a cluster management module and a virtual machine management module; the template configuration module is used for generating a control instruction of a target cluster and sending the control instruction to the cluster management module, wherein the control instruction comprises first attribute information corresponding to the target cluster and second attribute information respectively corresponding to at least two types of target network elements; the cluster management module is used for receiving the first attribute information and the second attribute information, creating a corresponding target cluster according to the first attribute information and sending the second attribute information to the virtual machine management module; the virtual machine management module is configured to receive the second attribute information sent by the cluster management module, create a corresponding target network element according to the second attribute information, and send identification information corresponding to the target network element to the cluster management module. Compared with the related art, the embodiment of the application configures the target cluster comprising at least two types of target network elements through the template configuration module, the target cluster is created by the cluster management module, and the virtual machine management module creates the corresponding target network element according to the second attribute information sent by the cluster management module, thereby realizing the cluster creation with non-single function.
With continued reference to fig. 1 and 2, the system further includes a monitoring module 14;
the template configuration module 11 is further configured to configure a monitoring policy of the monitoring module, and send the monitoring policy to the monitoring module; the monitoring strategy includes, but is not limited to, configuration of a high load threshold, high load times or duration, a virtual machine availability detection mode, a virtual machine corresponding network element function detection mode, and the like, and specific numerical values are set according to actual requirements.
The monitoring module 14 is configured to receive the monitoring policy sent by the template configuration module, and monitor the target cluster based on the monitoring policy. The monitoring module 14 is configured to monitor the target network element maintained by the virtual machine management module 13 and monitor a target cluster state.
The monitoring module 14 monitors whether the target network element survives, and may monitor by but not limited to PING host IP, and then monitors whether a function corresponding to the target network element exists, for example, for mysql-type network element, TELNET host ip+mysql port. When the monitoring module 14 detects an abnormal network element, a certain monitoring period may be set to avoid the occasional network problem, and the abnormal network element may be used N times (e.g. 10 times) or the target network element may be kept in an abnormal state for a certain time (e.g. 10 minutes), that is, an alarm is triggered and/or the function of the target network element is migrated.
With continued reference to fig. 1, the system further includes an alarm module 15;
the template configuration module 11 is further configured to configure an alarm sending policy of the monitoring module 14, and send the alarm sending policy to the monitoring module 14;
the monitoring module 14 is further configured to send a first notification message to the alarm module 15 based on the alarm sending policy when it is determined that the target cluster is abnormal, where the first notification message carries identification information of a faulty network element;
the alarm module 15 is configured to receive and respond to the first notification message. In responding to the first notification message, the fault notification may be performed on the management user by, but not limited to, using a short message or mail.
With continued reference to fig. 1 and 3, the system further includes a fault migration module;
the monitoring module 14 is further configured to send a second notification message to the fault migration module 16 when it is determined that the target network element is abnormal, where the second notification message carries identification information of the faulty network element;
the fault migration module 16 is configured to receive the second notification message sent by the monitoring module 14, and send the second notification message to the cluster management module 12;
The cluster management module 12 is further configured to receive a second notification message sent by the fault migration module 16, and send indication information for creating a new network element to the virtual machine management module 13;
the virtual machine management module 13 is further configured to receive the indication information for creating a new network element sent by the cluster management module 12, create a new network element, and send the identification information of the new network element to the cluster management module;
the cluster management module 12 is further configured to receive identification information of a new network element sent by the virtual machine management module, and send a migration instruction to the virtual machine management module, where the migration instruction includes a request address of the migration processing of the faulty network element;
the virtual machine management module 13 is further configured to receive the migration instruction sent by the cluster management module 12, and migrate the faulty network element to a new network element based on the request address of the faulty network element migration process. It should be noted that, the request address of the fault network element migration processing includes, but is not limited to, port information of the target network element, when migration is executed, information of each network element is stored in the shared storage space, and migration of the new network element can be completed by obtaining corresponding information from the shared storage space according to the request address.
With continued reference to fig. 1 and 4, the monitoring module 14 also monitors the target cluster status. Three states are assigned to the target cluster and the target network element, namely high load (corresponding to a first preset load threshold value or a third preset load threshold value), low load (corresponding to a second preset load threshold value) and zero load (corresponding to a fourth preset load threshold value).
The monitoring module 14 is further configured to send a third notification message to the alarm module and send a fourth notification message for modifying the maximum number of network elements of the target cluster in the first attribute information to the template configuration module when it is determined that the load of the target cluster exceeds a first preset load threshold;
the template configuration module 11 is further configured to receive the fourth notification message, and modify a maximum number of network elements of the target cluster in the first attribute information.
Further, referring to fig. 4, the monitoring module 14 is further configured to send a capacity expansion notification message to the cluster management module when it is determined that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold; in a specific application process, the third preset load threshold and the first preset load threshold may be the same or different, and are not specifically limited.
The cluster management module is further configured to receive the capacity expansion notification message sent by the monitoring module, and send the capacity expansion notification message to the virtual machine management module;
the virtual machine management module is further configured to receive the capacity expansion notification message, and execute creation of a target network element based on the capacity expansion notification message.
Further, with continued reference to fig. 4, the monitoring module 14 is further configured to send a capacity reduction notification message to the cluster management module when it is determined that the load of the target cluster is lower than a fourth preset load threshold;
the cluster management module 12 is further configured to receive the volume reduction notification message sent by the monitoring module, and send the volume reduction notification message to the virtual machine management module;
the virtual machine management module 13 is further configured to receive the volume reduction notification message, and perform deletion of the target network element in the target cluster based on the volume reduction notification message.
Corresponding to the cluster management control system, the invention also provides a cluster management control method. Since the method embodiment of the present invention corresponds to the system embodiment described above, details not disclosed in the method embodiment may refer to the system embodiment described above, and the details are not described in detail in the present invention.
Fig. 5 is a flow chart of a control method for cluster management according to an embodiment of the present disclosure, where the method is applied to the control system for cluster management according to the foregoing embodiment, as shown in fig. 5, and includes:
step 101, responding to a control instruction for generating a target cluster, and acquiring first attribute information and second attribute information in the control instruction, wherein the first attribute information is attribute information corresponding to the target cluster, and the second attribute information is attribute information respectively corresponding to at least two types of target network elements;
step 102, creating a target network element according to the first attribute information, and creating the target cluster according to the second attribute information.
The control method for cluster management provided by the disclosure is used for responding to a control instruction for generating a target cluster, and acquiring first attribute information and second attribute information in the control instruction, wherein the first attribute information is attribute information corresponding to the target cluster, and the second attribute information is attribute information respectively corresponding to at least two types of target network elements; and creating a target network element according to the first attribute information, and creating the target cluster according to the second attribute information. Compared with the related art, the embodiment of the application creates the target cluster according to the first attribute information, wherein one target cluster comprises at least two types of target network elements, and creates the corresponding target network elements according to the second attribute information, thereby realizing the cluster creation with non-single function.
Further, after creating the target cluster according to the second attribute information, the method further includes: invoking a first preset configuration template to generate a monitoring strategy, and monitoring the target network element and the target cluster based on the monitoring strategy; and under the condition that the target network element is monitored to be abnormal, outputting a first abnormal notification message, wherein the first notification message carries the identification information of the fault network element.
In order to realize unified management of the target cluster, the target network element and the monitoring policy are respectively generated through the visualized first preset configuration template, so that unified management can be facilitated, and configuration time for configuring each policy can be saved.
Further, under the condition that the target network element is monitored to be abnormal, creating a new network element which is the same as the second attribute information of the fault network element; and acquiring the request address of the fault network element migration processing, and executing the migration from the abnormal network element to the new network element.
Further, under the condition that the target cluster is monitored to be abnormal, judging whether the load of the target cluster exceeds a first preset load threshold; and if the load of the target cluster exceeds a first preset load threshold, reconfiguring the maximum network element number of the target cluster in the first attribute information.
Further, under the condition that the target cluster is monitored to be abnormal, judging whether the load of the target cluster is lower than a second preset load threshold value or not, and whether a target network element in the target cluster is higher than a third preset load threshold value or not;
and under the condition that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold, performing capacity expansion processing on the target network element in the target cluster.
Further, under the condition that the target cluster is monitored to be abnormal, judging whether the load of the target cluster is lower than a fourth preset load threshold; and under the condition that the load of the target cluster is lower than a fourth preset load threshold, carrying out capacity reduction processing on the target network elements in the target cluster.
In performing the monitoring of the target network element and the target cluster based on the monitoring policy, the following manner may be adopted, but is not limited to: determining a corresponding target monitoring strategy according to the type of the target network element; according to the described
A target monitoring strategy monitors whether the target network element survives; and under the condition that the target network element is determined to survive, monitoring whether the function corresponding to the target network element exists.
The embodiment of the application has the following beneficial effects:
1. configuring specific features of the micro-service by using a template configuration module, for example, monitoring a target cluster or a target network element by combining a plurality of modes of acquiring a health state by using an api interface, acquiring a micro-service port state by using telnet, acquiring a host state where the micro-service is located by using ping and the like; when a fault occurs to a certain target network element, a fault migration module is used for creating a new target network element with the same function to bear the work of the target network element with the fault, so that the target network element is ensured to be in a continuously available state;
2. different templates are configured for different target clusters, the target clusters are created and managed according to the templates, clusters with different functions can be created, the clusters are combined into a whole target cluster, and combined class services are provided for the outside; meanwhile, aiming at the multifunctional cluster, an integrated monitoring, alarming and self-healing scheme is provided, and the friendliness, stability and high availability of external service are improved.
3. And constructing a multifunctional cluster by using a template mode, and calculating the load of the target cluster by using different calculation modes aiming at the target cluster with the function which is not communicated, so as to dynamically expand and contract the capacity of the multifunctional cluster.
The foregoing explanation of the method embodiment is also applicable to the apparatus of this embodiment, and the principle is the same, and this embodiment is not limited thereto.
According to embodiments of the present disclosure, the present disclosure also provides an electronic device, a readable storage medium and a computer program product.
Fig. 6 shows a schematic block diagram of an example electronic device 200 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 6, the apparatus 200 includes a computing unit 201 that can perform various appropriate actions and processes according to a computer program stored in a ROM (Read-Only Memory) 202 or a computer program loaded from a storage unit 208 into a RAM (Random Access Memory ) 203. In the RAM 203, various programs and data required for the operation of the device 200 can also be stored. The computing unit 201, ROM 202, and RAM 203 are connected to each other through a bus 204. An I/O (Input/Output) interface 205 is also connected to bus 204.
Various components in device 200 are connected to I/O interface 205, including: an input unit 206 such as a keyboard, a mouse, etc.; an output unit 207 such as various types of displays, speakers, and the like; a storage unit 208 such as a magnetic disk, an optical disk, or the like; and a communication unit 209 such as a network card, modem, wireless communication transceiver, etc. The communication unit 209 allows the device 200 to exchange information/data with other devices through a computer network such as the internet and/or various telecommunication networks.
The computing unit 201 may be a variety of general and/or special purpose processing components with processing and computing capabilities. Some examples of computing unit 201 include, but are not limited to, a CPU (Central Processing Unit ), a GPU (Graphic Processing Units, graphics processing unit), various dedicated AI (Artificial Intelligence ) computing chips, various computing units running machine learning model algorithms, a DSP (Digital Signal Processor ), and any suitable processor, controller, microcontroller, etc. The computing unit 201 performs the respective methods and processes described above, for example, a text generation method. For example, in some embodiments, the text generation method may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as the storage unit 208. In some embodiments, part or all of the computer program may be loaded and/or installed onto the device 200 via the ROM 202 and/or the communication unit 209. When the computer program is loaded into RAM 203 and executed by computing unit 201, one or more steps of the method described above may be performed. Alternatively, in other embodiments, the computing unit 201 may be configured to perform the aforementioned text generation method by any other suitable means (e.g. by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit System, FPGA (Field Programmable Gate Array ), ASIC (Application-Specific Integrated Circuit, application-specific integrated circuit), ASSP (Application Specific Standard Product, special-purpose standard product), SOC (System On Chip ), CPLD (Complex Programmable Logic Device, complex programmable logic device), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, RAM, ROM, EPROM (Electrically Programmable Read-Only-Memory, erasable programmable read-Only Memory) or flash Memory, an optical fiber, a CD-ROM (Compact Disc Read-Only Memory), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., CRT (Cathode-Ray Tube) or LCD (Liquid Crystal Display ) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: LAN (Local Area Network ), WAN (Wide Area Network, wide area network), internet and blockchain networks.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service ("Virtual Private Server" or simply "VPS") are overcome. The server may also be a server of a distributed system or a server that incorporates a blockchain.
It should be noted that, artificial intelligence is a subject of studying a certain thought process and intelligent behavior (such as learning, reasoning, thinking, planning, etc.) of a computer to simulate a person, and has a technology at both hardware and software level. Artificial intelligence hardware technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, and the like; the artificial intelligence software technology mainly comprises a computer vision technology, a voice recognition technology, a natural language processing technology, a machine learning/deep learning technology, a big data processing technology, a knowledge graph technology and the like.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps recited in the present disclosure may be performed in parallel, sequentially, or in a different order, provided that the desired results of the disclosed aspects are achieved, and are not limited herein.
The above detailed description should not be taken as limiting the scope of the present disclosure. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (17)

1. A cluster management control system, comprising: the system comprises a template configuration module, a cluster management module and a virtual machine management module;
the template configuration module is used for generating a control instruction of a target cluster and sending the control instruction to the cluster management module, wherein the control instruction comprises first attribute information corresponding to the target cluster and second attribute information respectively corresponding to at least two types of target network elements;
the cluster management module is used for receiving the first attribute information and the second attribute information, creating a corresponding target cluster according to the first attribute information and sending the second attribute information to the virtual machine management module;
the virtual machine management module is configured to receive the second attribute information sent by the cluster management module, create a corresponding target network element according to the second attribute information, and send identification information corresponding to the target network element to the cluster management module.
2. The system of claim 1, further comprising a monitoring module;
the template configuration module is further used for configuring a monitoring strategy of the monitoring module and sending the monitoring strategy to the monitoring module;
The monitoring module is used for receiving the monitoring strategy sent by the template configuration module and monitoring the target cluster based on the monitoring strategy.
3. The system of claim 2, further comprising an alert module;
the template configuration module is further used for configuring an alarm sending strategy of the monitoring module and sending the alarm sending strategy to the monitoring module;
the monitoring module is further configured to send a first notification message to the alarm module based on the alarm sending policy when it is determined that the target cluster is abnormal, where the first notification message carries identification information of a faulty network element;
the alarm module is used for receiving and responding to the first notification message.
4. The system of claim 2, further comprising a fault migration module;
the monitoring module is further configured to send a second notification message to the fault migration module when it is determined that the target network element is abnormal, where the second notification message carries identification information of the fault network element;
the fault migration module is configured to receive a second notification message sent by the monitoring module, and send the second notification message to the cluster management module;
The cluster management module is further configured to receive a second notification message sent by the fault migration module, and send indication information for creating a new network element to the virtual machine management module;
the virtual machine management module is further configured to receive the indication information for creating the new network element sent by the cluster management module, create the new network element, and send the identification information of the new network element to the cluster management module;
the cluster management module is further configured to receive an identifier of a new network element sent by the virtual machine management module
Information and a migration instruction are sent to the virtual machine management module, wherein the migration instruction comprises a request address of the fault network element migration processing;
the virtual machine management module is further configured to receive the migration instruction sent by the cluster management module, and migrate the faulty network element to a new network element based on a request address of the faulty network element migration process.
5. The system of claim 3, wherein the system further comprises a controller configured to control the controller,
the monitoring module is further configured to send a third notification message to the alarm module and send a fourth notification message for modifying the maximum number of network elements of the target cluster in the first attribute information to the template configuration module when it is determined that the load of the target cluster exceeds a first preset load threshold;
The template configuration module is further configured to receive the fourth notification message, and modify a maximum number of network elements of the target cluster in the first attribute information.
6. The system of claim 2, wherein the system further comprises a controller configured to control the controller,
the monitoring module is further configured to send a capacity expansion notification message to the cluster management module when it is determined that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold;
the cluster management module is further configured to receive the capacity expansion notification message sent by the monitoring module, and send the capacity expansion notification message to the virtual machine management module;
the virtual machine management module is further configured to receive the capacity expansion notification message, and execute creation of a target network element based on the capacity expansion notification message.
7. The system of claim 2, wherein the system further comprises a controller configured to control the controller,
the monitoring module is further configured to send a capacity reduction notification message to the cluster management module when it is determined that the load of the target cluster is lower than a fourth preset load threshold;
the cluster management module is further configured to receive the volume reduction notification message sent by the monitoring module, and send the volume reduction notification message to the virtual machine management module;
The virtual machine management module is further configured to receive the capacity reduction notification message, and perform deletion of the target network element in the target cluster based on the capacity reduction notification message.
8. A method for controlling cluster management, comprising:
responding to a control instruction for generating a target cluster, and acquiring first attribute information and second attribute information in the control instruction, wherein the first attribute information is attribute information corresponding to the target cluster, and the second attribute information is attribute information respectively corresponding to at least two types of target network elements;
and creating a target network element according to the first attribute information, and creating the target cluster according to the second attribute information.
9. The method of claim 8, wherein after creating the target cluster from the second attribute information, the method further comprises:
calling a first preset configuration template to generate a monitoring strategy;
monitoring the target network element and the target cluster based on the monitoring strategy;
and under the condition that the target network element is monitored to be abnormal, outputting a first abnormal notification message, wherein the first notification message carries the identification information of the fault network element.
10. The method according to claim 9, wherein the method further comprises:
under the condition that the target network element is monitored to be abnormal, creating a new network element which is the same as the second attribute information of the fault network element;
and acquiring the request address of the fault network element migration processing, and executing the migration from the abnormal network element to the new network element.
11. The method according to claim 9, wherein the method further comprises:
judging whether the load of the target cluster exceeds a first preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and if the load of the target cluster exceeds a first preset load threshold, reconfiguring the maximum network element number of the target cluster in the first attribute information.
12. The method according to claim 9, wherein the method further comprises:
judging whether the load of the target cluster is lower than a second preset load threshold value or not and whether a target network element in the target cluster is higher than a third preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and under the condition that the load of the target cluster is lower than a second preset load threshold and the target network element in the target cluster is higher than a third preset load threshold, performing capacity expansion processing on the target network element in the target cluster.
13. The method according to claim 9, wherein the method further comprises:
judging whether the load of the target cluster is lower than a fourth preset load threshold value or not under the condition that the target cluster is monitored to be abnormal;
and under the condition that the load of the target cluster is lower than a fourth preset load threshold, carrying out capacity reduction processing on the target network elements in the target cluster.
14. The method according to any of claims 9-13, wherein monitoring the target network element and the target cluster based on the monitoring policy comprises:
determining a corresponding target monitoring strategy according to the type of the target network element;
monitoring whether the target network element survives according to the target monitoring strategy;
and under the condition that the target network element is determined to survive, monitoring whether the function corresponding to the target network element exists.
15. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 8-14.
16. A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 8-14.
17. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any of claims 8-14.
CN202311178971.0A 2023-09-13 2023-09-13 Cluster management control system, method, electronic device and storage medium Active CN116909757B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311178971.0A CN116909757B (en) 2023-09-13 2023-09-13 Cluster management control system, method, electronic device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311178971.0A CN116909757B (en) 2023-09-13 2023-09-13 Cluster management control system, method, electronic device and storage medium

Publications (2)

Publication Number Publication Date
CN116909757A true CN116909757A (en) 2023-10-20
CN116909757B CN116909757B (en) 2024-01-26

Family

ID=88351554

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311178971.0A Active CN116909757B (en) 2023-09-13 2023-09-13 Cluster management control system, method, electronic device and storage medium

Country Status (1)

Country Link
CN (1) CN116909757B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110389836A (en) * 2019-07-17 2019-10-29 腾讯科技(深圳)有限公司 A kind of more cluster management methods, device, server and storage medium
CN112653577A (en) * 2020-12-14 2021-04-13 武汉绿色网络信息服务有限责任公司 Network element management method, device and storage medium
CN113821168A (en) * 2021-08-31 2021-12-21 济南浪潮数据技术有限公司 Shared storage migration system and method, electronic equipment and storage medium
WO2022247359A1 (en) * 2021-05-27 2022-12-01 北京百度网讯科技有限公司 Cluster access method and apparatus, electronic device, and medium
CN115729673A (en) * 2022-10-31 2023-03-03 浪潮通信技术有限公司 Cluster management method, management module, electronic device and medium
CN116225703A (en) * 2023-02-16 2023-06-06 中移动信息技术有限公司 Cluster processing method, device, equipment and computer readable storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110389836A (en) * 2019-07-17 2019-10-29 腾讯科技(深圳)有限公司 A kind of more cluster management methods, device, server and storage medium
CN112653577A (en) * 2020-12-14 2021-04-13 武汉绿色网络信息服务有限责任公司 Network element management method, device and storage medium
WO2022247359A1 (en) * 2021-05-27 2022-12-01 北京百度网讯科技有限公司 Cluster access method and apparatus, electronic device, and medium
CN113821168A (en) * 2021-08-31 2021-12-21 济南浪潮数据技术有限公司 Shared storage migration system and method, electronic equipment and storage medium
CN115729673A (en) * 2022-10-31 2023-03-03 浪潮通信技术有限公司 Cluster management method, management module, electronic device and medium
CN116225703A (en) * 2023-02-16 2023-06-06 中移动信息技术有限公司 Cluster processing method, device, equipment and computer readable storage medium

Also Published As

Publication number Publication date
CN116909757B (en) 2024-01-26

Similar Documents

Publication Publication Date Title
CN112667403B (en) Scheduling method and device of server and electronic equipment
CN111625195B (en) Method and device for server capacity expansion
US12034820B2 (en) Fusing and degradation method and apparatus for micro-service, device, and medium
US20230144949A1 (en) Virtual-machine cold migration method and apparatus, electronic device and storage medium
CN113312560B (en) Group detection method and device and electronic equipment
CN113037489B (en) Data processing method, device, equipment and storage medium
CN113468021B (en) Method, device, equipment and storage medium for monitoring performance data
CN112685256B (en) Method, equipment and medium for monitoring server
CN112925811B (en) Method, apparatus, device, storage medium and program product for data processing
CN116909757B (en) Cluster management control system, method, electronic device and storage medium
CN115514718B (en) Data interaction method, control layer and equipment based on data transmission system
CN116594563A (en) Distributed storage capacity expansion method and device, electronic equipment and storage medium
CN114327819B (en) Task management method, device, equipment and storage medium
CN116112342A (en) Alarm information processing method, device, electronic equipment and storage medium
CN115550363A (en) Node hierarchical management method and device and electronic equipment
CN114139605A (en) Distributed model training method, system, device and storage medium
CN113778645A (en) Task scheduling method, device and equipment based on edge calculation and storage medium
CN118170617B (en) Cluster anomaly detection method and device, electronic equipment and storage medium
CN118170324B (en) Cloud hard disk migration method and device, electronic equipment and storage medium
CN113342463B (en) Capacity adjustment method, device, equipment and medium of computer program module
CN112596922B (en) Communication management method, device, equipment and medium
CN117632600A (en) Fault management method and device and electronic equipment
CN117749656A (en) Cluster operation monitoring method, device, equipment and storage medium
CN117215862A (en) Early warning method, device and system for hard disk faults and related equipment thereof
CN114356505A (en) Method, apparatus, device, medium and product for scheduling groups of containers

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant