WO2021109125A1 - 一种弹性伸缩组的管理方法、装置 - Google Patents
一种弹性伸缩组的管理方法、装置 Download PDFInfo
- Publication number
- WO2021109125A1 WO2021109125A1 PCT/CN2019/123663 CN2019123663W WO2021109125A1 WO 2021109125 A1 WO2021109125 A1 WO 2021109125A1 CN 2019123663 W CN2019123663 W CN 2019123663W WO 2021109125 A1 WO2021109125 A1 WO 2021109125A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- elastic scaling
- scaling group
- elastic
- group
- information
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5077—Logical partitioning of resources; Management or configuration of virtualized resources
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0803—Configuration setting
- H04L41/0813—Configuration setting characterised by the conditions triggering a change of settings
- H04L41/0816—Configuration setting characterised by the conditions triggering a change of settings the condition being an adaptation, e.g. in response to network events
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0876—Aspects of the degree of configuration automation
- H04L41/0886—Fully automatic configuration
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0894—Policy-based network configuration management
Definitions
- This application relates to the computer field, and more specifically, to a method and device for managing an elastic scaling group.
- Auto scaling (AS) service is a service that automatically adjusts its business resources through strategies according to the user's business needs. Users can pre-define the elastic scaling group and the elastic scaling strategy information corresponding to the elastic scaling group according to the business requirements, without having to prepare a lot of business resources for their own business in advance.
- elastic scaling services can be deployed in public cloud scenarios.
- the public cloud due to the limitations of physical regions, most cloud computing services do not support cross-regional.
- the deployed elastic scaling service also does not support cross-regional.
- the set elastic scaling group can only be used in one region, and computing instances are also limited to expansion (creating new computing instances) and reduction (release or dormant computing) in one region. Examples).
- the present application provides a management method of an elastic scaling group, which supports the deployment of an elastic scaling group across levels, and the calculation instances of the elastic scaling group can also be operated in different levels.
- a method for managing an elastic scaling group is provided.
- the management method is applied to a service providing system.
- the service providing system includes an elastic scaling group management device and a plurality of levels. Each level includes at least one service server.
- Methods include:
- the elastic scaling group management apparatus receives a first elastic scaling group configuration message, the first elastic scaling group configuration message including the level information of the first elastic scaling group, the initialization configuration information of the calculation instance of the first elastic scaling group, and the first elastic scaling group configuration message.
- Elastic scaling policy information of an elastic scaling group the elastic scaling group management device is created in the business server included in the level indicated by the level information of the first elastic scaling group according to the initial configuration information of the calculation instance of the first elastic scaling group The calculation instance of the first elastic scaling group; the elastic scaling group management device operates the calculation instance of the first elastic scaling group according to the elastic scaling policy information of the first elastic scaling group.
- operations on computing instances in an elastic scaling group may include any one or more of the following: capacity expansion (creating a new computing instance in the elastic scaling group, or increasing the specifications of the computing instances in the elastic scaling group), Capacity reduction (release or hibernate computing instances in an elastic scaling group, or reduce the specifications of computing instances in an elastic scaling group).
- the elastic scaling group management apparatus generates the first elastic scaling group according to the running status of the calculation instance of the first elastic scaling group and the elastic scaling strategy information of the first elastic scaling group.
- Elastic scaling command the elastic scaling group management device sends the first elastic scaling command to the business server that created the computing instance of the first elastic scaling group.
- the elastic scaling group management device does not send the first elastic scaling command to all the computing instance service servers that create the first elastic scaling group, but sends it to the first elastic scaling command based on the resource scheduling policy information. Part or all of the business servers selected from the business servers of the calculation instances of the scaling group.
- the first elastic scaling group configuration message may also include resource scheduling policy information, and the resource scheduling policy information is used to determine the location of the initially created computing instance in the elastic scaling group The business server and the business server where the computing instance needs to be expanded or reduced. It should be understood that the resource scheduling policy information in the elastic scaling group is optional, and the user may or may not configure it. When the user does not perform configuration, the elastic scaling group management device uses preset resource scheduling policy information.
- the management method further includes: the elastic scaling group management apparatus receives a second elastic scaling group configuration message, and the second elastic scaling group configuration message includes the second elastic scaling group configuration message.
- the elastic scaling group management device is indicated by the level information of the second elastic scaling group Among the business servers included in the hierarchy, the calculation instance of the second elastic scaling group is created according to the initialization configuration information of the calculation instance of the second elastic scaling group, wherein the business server included in the hierarchy indicated by the hierarchy information of the first elastic scaling group There is overlap with the service server included in the level indicated by the level information of the second elastic scaling group; the elastic scaling group management device operates the calculation instance of the second elastic scaling group according to the elastic scaling strategy information of the second elastic scaling group .
- the elastic scaling group management apparatus generates a second elastic scaling group according to the running status of the calculation instance of the second elastic scaling group and the elastic scaling strategy information of the second elastic scaling group. Elastic scaling command; the elastic scaling group management device sends the second elastic scaling command to the business server that created the computing instance of the second elastic scaling group.
- the elastic scaling group management apparatus does not send the second elastic scaling command to all computing instance service servers that create the second elastic scaling group, but sends it to the second elastic scaling command based on the resource scheduling policy information. Part or all of the business servers selected from the business servers of the calculation instances of the scaling group.
- the second elastic scaling group configuration message further includes a conflict resolution strategy
- the conflict resolution strategy indicates the priority of the elastic scaling strategy information
- the management method further includes: The elastic scaling group management device sends the conflict resolution strategy to an overlapping service server, the overlapping service server being included in the level indicated by the level information of the first elastic scaling group and included in the level indicated by the level information of the second elastic scaling group; The overlapping service server receives the first elastic scaling command generated according to the first elastic scaling policy information; the overlapping service server receives the second elastic scaling command generated according to the second elastic scaling policy information; the overlapping service server resolves according to the conflict Strategy, select to execute the first elastic scaling command or the second elastic scaling command.
- the elastic scaling group management apparatus can identify the overlapping service servers of the level where the multiple elastic scaling groups created by the user are located, and can send the conflict resolution strategy to the identified overlapping service servers.
- the elastic scaling command generated by the elastic scaling group management device according to the high priority elastic scaling strategy information indicated in the conflict resolution strategy has a high priority, and the priority is high according to the low priority elastic scaling strategy information indicated in the conflict resolution strategy.
- the priority of the generated elastic scaling command is low.
- the elastic scaling group management apparatus may also send the conflict resolution strategy to each business server that has created the computing instance of the first elastic scaling group.
- an elastic scaling group management method is provided, the management method is applied to an elastic scaling group management device, and the management method includes: receiving a first elastic scaling group configuration message, the first elastic scaling group configuration message including the Level information of the first elastic scaling group, initialization configuration information of the calculation instance of the first elastic scaling group, and elastic scaling policy information of the first elastic scaling group; included in the level indicated by the level information of the first elastic scaling group In the business server, the calculation instance of the first elastic scaling group is created according to the initial configuration information of the calculation instance of the first elastic scaling group; the calculation of the first elastic scaling group according to the elastic scaling policy information of the first elastic scaling group Instance to operate.
- operations on computing instances in an elastic scaling group may include any one or more of the following: capacity expansion (creating a new computing instance in the elastic scaling group, or increasing the specifications of the computing instances in the elastic scaling group), Capacity reduction (release or hibernate computing instances in an elastic scaling group, or reduce the specifications of computing instances in an elastic scaling group).
- the first elastic scaling command is generated according to the running status of the calculation instance of the first elastic scaling group and the elastic scaling strategy information of the first elastic scaling group;
- the first elastic scaling command is sent to the business server that created the computing instance of the first elastic scaling group.
- the elastic scaling group management device does not send the first elastic scaling command to all the computing instance service servers that create the first elastic scaling group, but sends it to the one selected according to the resource scheduling policy information to create the first elastic scaling command. Part or all of the business servers in the business servers of the computing instances of an elastic scaling group.
- the first elastic scaling group configuration message may also include resource scheduling policy information, and the resource scheduling policy information is used to determine the location of the initially created computing instance in the elastic scaling group The business server and the business server where the computing instance needs to be expanded or reduced. It should be understood that the resource scheduling policy information in the elastic scaling group is optional, and the user may or may not configure it. When the user does not perform configuration, the elastic scaling group management device uses preset resource scheduling policy information.
- the management method further includes: receiving a second elastic scaling group configuration message, the second elastic scaling group configuration message including the level information of the second elastic scaling group, The initialization configuration information of the calculation instance of the second elastic scaling group and the elastic scaling strategy information of the second elastic scaling group; among the business servers included in the level indicated by the level information of the second elastic scaling group, according to the second elastic
- the initialization configuration information of the computing instance of the scaling group creates the computing instance of the second elastic scaling group, where the level indicated by the level information of the first elastic scaling group includes the business server and the level indicated by the level information of the second elastic scaling group
- the second elastic scaling command is generated according to the running status of the calculation instance of the second elastic scaling group and the elastic scaling strategy information of the second elastic scaling group;
- the second elastic scaling command is sent to the business server that created the computing instance of the second elastic scaling group.
- the second elastic scaling group configuration message further includes a conflict resolution strategy
- the conflict resolution strategy indicates the priority of the elastic scaling strategy information
- the management method further includes: sending The conflict resolution strategy is directed to an overlapping service server that is included in the level indicated by the level information of the first elastic scaling group and is included in the level indicated by the level information of the second elastic scaling group.
- the elastic scaling group management apparatus can identify the overlapping service servers of the level where the multiple elastic scaling groups created by the user are located, and can send the conflict resolution strategy to the identified overlapping service servers.
- a method for managing an elastic scaling group is provided.
- the management method is applied to a business server.
- the business server is included in the level indicated by the level information of the first elastic scaling group and is included in the level of the second elastic scaling group.
- the level indicated by the information, the management method includes: receiving a first elastic scaling command issued by an elastic scaling group management device, where the first elastic scaling command is based on the first elastic scaling group configuration message received by the elastic scaling group management device The first elastic scaling strategy information included; receiving a second elastic scaling command issued by the elastic scaling group management device, where the second elastic scaling command is configured by the elastic scaling group management device according to the received second elastic scaling group configuration Generated by the second elastic scaling strategy information included in the message; receiving the conflict resolution strategy issued by the elastic scaling group management device, the conflict resolution strategy indicating the priority of the elastic scaling strategy information; according to the conflict resolution strategy, selecting to execute the first An elastic scaling command or the second elastic scaling command.
- a device for managing an elastic scaling group including:
- the communication module is configured to receive a first elastic scaling group configuration message, where the first elastic scaling group configuration message includes the level information of the first elastic scaling group, the initialization configuration information of the calculation instance of the first elastic scaling group, and the first elastic scaling group configuration message.
- a processing module configured to create a calculation instance of the first elastic scaling group according to the initial configuration information of the calculation instance of the first elastic scaling group in the business server included in the level indicated by the layer information of the first elastic scaling group;
- the processing module is further configured to operate the calculation instance of the first elastic scaling group according to the elastic scaling strategy information of the first elastic scaling group.
- the processing module is specifically configured to: generate the first elastic scaling group according to the running status of the calculation instance of the first elastic scaling group and the elastic scaling strategy information of the first elastic scaling group.
- the communication module is specifically configured to send the first elastic scaling command to the business server that created the calculation instance of the first elastic scaling group.
- the communication module is further configured to: receive a second elastic scaling group configuration message, where the second elastic scaling group configuration message includes level information of the second elastic scaling group , The initialization configuration information of the calculation instance of the second elastic scaling group and the elastic scaling policy information of the second elastic scaling group;
- the processing module is further configured to: in the business server included in the level indicated by the level information of the second elastic scaling group, create a calculation instance of the second elastic scaling group according to the initial configuration information of the calculation instance of the second elastic scaling group , Wherein the business server included in the level indicated by the level information of the first elastic scaling group and the business server included in the level indicated by the level information of the second elastic scaling group overlap; according to the elastic scaling strategy of the second elastic scaling group The information performs operations on the calculation instance of the second elastic scaling group.
- the processing module is specifically configured to: generate the second elastic scaling group according to the running status of the calculation instance of the second elastic scaling group and the elastic scaling strategy information of the second elastic scaling group. 2. Elastic scaling commands;
- the communication module is specifically configured to send the second elastic scaling command to the business server that created the calculation instance of the second elastic scaling group.
- the communication module is specifically used for:
- the overlapping service server being included in the level indicated by the level information of the first elastic scaling group and included in the level indicated by the level information of the second elastic scaling group.
- a business server including the level indicated by the level information of the first elastic scaling group and the level indicated by the level information of the second elastic scaling group, the business server including:
- the communication module is configured to receive the first elastic scaling command issued by the elastic scaling group management device, and the first elastic scaling command is the first elastic scaling command included in the first elastic scaling group configuration message received by the elastic scaling group management device according to the received first elastic scaling group configuration message. Generated by scaling strategy information;
- the communication module is also configured to receive a second elastic scaling command issued by the elastic scaling group management device, where the second elastic scaling command is included in the second elastic scaling group configuration message received by the elastic scaling group management device according to Generated by the second elastic scaling strategy information;
- the communication module is further configured to receive a conflict resolution strategy issued by the elastic scaling group management device, where the conflict resolution strategy indicates the priority of the elastic scaling strategy information;
- the processing module is configured to select and execute the first elastic scaling command or the second elastic scaling command according to the conflict resolution strategy.
- a global auto-scaling server including a memory and at least one processor.
- the memory is used for program instructions.
- the at least one processor executes the program instructions in the memory to execute the first The second aspect or the method in any one of the possible implementation manners of the second aspect.
- a business server including a memory and at least one processor, where the memory is used for program instructions.
- the business server When the business server is running, at least one processor executes the program instructions in the memory to execute the third aspect or the third aspect.
- the method in any possible implementation of the aspect.
- a service provision system includes at least one global auto-scaling server and multiple levels, each level includes multiple business servers, and each global auto-scaling server and each business server includes storage and At least one processor, and the memory is used for program instructions.
- the processor of the at least one global auto-scaling server executes the program instructions in the memory to execute the second aspect or any one of the second aspects
- the processors of the multiple business servers execute the program instructions in the memory to execute the third aspect or the method in any one of the possible implementation manners of the third aspect .
- a non-transitory readable storage medium including program instructions.
- the program instructions When the program instructions are executed by a computer, the computer executes the second aspect or any one of the possible implementation manners of the second aspect. Methods.
- a non-transitory readable storage medium including program instructions.
- the program instructions When the program instructions are executed by a computer, the computer executes the third aspect or any one of the possible implementation manners of the third aspect. Methods.
- a computer program product including program instructions, and when the program instructions are executed by a computer, the computer executes the method in the second aspect or any one of the possible implementation manners of the second aspect.
- a computer program product including program instructions.
- the program instructions When the program instructions are executed by a computer, the computer executes the method in the third aspect or any one of the possible implementation manners of the third aspect.
- Fig. 1 is a schematic structural diagram of a service providing system that can be applied to an embodiment of the present application.
- FIG. 2 is a schematic diagram of a scenario in which an elastic scaling group management apparatus 700 provided by an embodiment of the present application provides a user with an elastic scaling group configuration message through a visual window.
- Fig. 3 is a schematic structural diagram of a service server provided by an embodiment of the present application.
- Fig. 4 is a flowchart of a method for creating an elastic scaling group provided by an embodiment of the present application.
- FIG. 5 is a schematic diagram of a scenario in which another elastic scaling group management apparatus 700 provided by an embodiment of the present application provides a user with an elastic scaling group configuration message through a visual window.
- FIG. 6 is a schematic flowchart of a method for a business server to elastically scale a computing instance according to a conflict resolution strategy according to an embodiment of the present application.
- FIG. 7 is a schematic structural diagram of an elastic scaling group management apparatus 700 provided by an embodiment of the present application.
- FIG. 8 is a schematic structural diagram of a global auto-scaling server 800 in a service providing system provided by an embodiment of the present application.
- FIG. 9 is a schematic structural diagram of a service server 900 in a service providing system provided by an embodiment of the present application.
- Auto scaling (AS) service is a service that automatically adjusts its business resources through strategies according to the user's business needs. Users can pre-define the elastic scaling group and the elastic scaling strategy information corresponding to the elastic scaling group according to the business requirements, without having to prepare a lot of business resources for their own business in advance.
- the service provision system can automatically adjust the cloud server resources in the elastic scaling group according to the set elastic scaling strategy information, thereby reducing the workload of artificially adjusting business resources to cope with business changes and peak pressures, saving users' resources and manpower costs, and providing users with Provide strategies for efficiently managing computing resources.
- the service providing system includes one or more service servers, and the service providing system can manage the number of computing instances (business resources) running on the one or more service servers in the elastic scaling group according to the elastic scaling strategy information, and Complete the environment deployment of the computing instance to ensure the smooth operation of the business.
- the elastic scaling service can automatically increase the number of computing instances in the elastic scaling group to ensure that performance is not affected.
- the number of computing instances in the elastic scaling group will be reduced to reduce costs.
- elastic scaling services can be provided in public clouds. In the public cloud, concepts such as regions and availability zones are involved.
- the region refers to the location of the data center, which can be a large area (for example, South China, North China), or a city (for example, Shenzhen, Dongguan).
- Available zone (AZ) refers to the physical area where the computer room or cloud data center is located, which has the characteristics of energy consumption and network independence.
- a region usually contains one or more low-latency interconnected availability zones, which are used for scenarios and services such as disaster recovery, backup and load balancing in the same region. Due to the limitations of physical regions, most existing cloud services do not support cross-regional deployment. Similarly, the elastic scaling service also does not support cross-regional. In other words, when deploying elastic scaling services in the public cloud, the set elastic scaling group can only be used in one region, and computing instances are also limited to expansion (creating new computing instances) and reduction (release or dormant computing) in one region. Examples).
- the computing instance may be a virtual machine (VM), a container, or a software module for running services.
- VM virtual machine
- container a software module for running services.
- each business server in the embodiments of the present application may run one or more computing instances, and the one or more computing instances may belong to one or more elastic scaling groups.
- Each elastic scaling group includes at least one calculation instance.
- the unit of capacity expansion/reduction in an elastic scaling group can be a computing instance or a computing instance group.
- the unit of capacity expansion/reduction is a computing instance
- the computing instances included in an elastic scaling group are the same.
- an elastic scaling group can include one or more types of calculation instances, for example, each calculation instance group includes N calculation instances of the same type, or each calculation
- the instance group includes multiple types of computing instances (for example, load balancing instance and rendering instance, each computing instance group includes 1 load balancing instance and 5 rendering instances).
- the unit of capacity expansion/reduction in each elastic scaling group is a calculation instance as an example for description.
- Edge cloud is a kind of cloud computing based on extensive coverage of business servers.
- Edge cloud service is a form of distributed computing in which computing services are provided nearby through business servers on the edge of the network close to the source of data, and computing resources (business servers) are distributed closer to the users of edge cloud services.
- edge cloud services can better meet the key needs of industry digitalization in terms of agile connection, real-time business, data optimization, application intelligence, security and privacy protection.
- the service provision system in this application can be supported by edge cloud, and can also be supported by public cloud, private cloud or hybrid cloud.
- Business servers are widely distributed in various places, and each business server can be divided into different levels according to administrative, geographical and other factors, for example, the first level, the second level, and the third level... Different levels can be nested with each other, where a first level can include one or more second levels, and a second level can include one or more third levels.
- the first level may be, for example, the Northeast Region, the Northwest Region, and the Central China Region.
- the second level may be a national district, for example, Guangdong province, Shaanxi province, etc.
- the third level may be a municipal district, for example, Shenzhen City, Xi'an City, etc. Users can directly deploy the desired elastic scaling group and corresponding elastic scaling strategy information at the specified level.
- first level may also be referred to as a large area
- second level may also be referred to as a secondary area
- third level may also be referred to as a secondary area.
- a large area may include one or more secondary areas
- a secondary area may include one or more secondary areas.
- multiple service servers are divided into different levels.
- multiple first levels can be divided according to administrative, geographic and other factors, for example, first level 1 and first level 2.
- first level 1 a plurality of second levels may be included, for example, the second level 1 and the second level 2.
- Each second level includes multiple third levels.
- the second level 1 includes multiple third levels, for example, the third level 1 and the third level 2.
- Each third level can include multiple service servers.
- the third level 1 may include multiple service servers, and the third level 2 may also include multiple service servers.
- Other first levels may also include multiple second levels, and each second level includes multiple third levels.
- the first level 2 is similar to the first level 1.
- each level may include one or more business servers, and the number of business servers included in each level in FIG. 1 is merely an example.
- the global auto-scaling server may run an elastic-scaling group management device 700, and the elastic-scaling group management device 700 may provide the user with an application programming interface (API) or a visualization window for managing the elastic-scaling group created by the user.
- API application programming interface
- the user can complete operations such as creation, configuration, and query of an elastic scaling group through the elastic scaling group management apparatus 700.
- the process of configuring the elastic scaling group by the user through the elastic scaling group management apparatus 700 will be described below.
- the user completes the configuration of the elastic scaling group by inputting an elastic scaling group configuration message to the elastic scaling group management apparatus 700.
- the configuration messages of the elastic scaling group include:
- the level may indicate any one or more of the levels described above, for example. For details, please refer to the above description, which will not be repeated here.
- the initialization configuration information of the calculation instances of the elastic scaling group may include the number parameters of the calculation instances in the elastic scaling group, for example, the initialization number of the calculation instances in the elastic scaling group, the maximum and minimum of the number of calculation instances in the elastic scaling group Value, the expected value of the calculation instance in the elastic scaling group.
- the initialization configuration information of the computing instance may also include the configuration information of the computing instance, for example, the specifications of the computing instance, the type of computing instance (e.g., rendering instance, load balancing instance), the network configuration of the computing instance, and the mirror image used by the computing instance, User ID and other information.
- the elastic scaling strategy information of the elastic scaling group includes a trigger condition and an operation on the computing instance in the elastic scaling group that is triggered when the set trigger condition is reached.
- the elastic scaling strategy information may include any one or more of the following: static scaling strategy information and dynamic scaling strategy information. among them,
- the static scaling strategy information includes set static trigger conditions, such as time conditions, and operations on computing instances in the elastic scaling group that are triggered when the set static trigger conditions are reached.
- Elastic scaling policy information reduce the capacity of 2 calculation instances in the layer where the elastic scaling group is located.
- the dynamic scaling strategy information includes the set dynamic trigger condition and the operation on the calculation instance in the elastic scaling group that is triggered when the set dynamic trigger condition is reached.
- Triggering condition The load of any computing instance in the elastic scaling group reaches 80% of the full load.
- Elastic scaling policy information Expand 2 computing instances in the level of the elastic scaling group.
- the elastic scaling strategy information further includes a cooling time, and the cooling time does not allow operations on the calculation instances in the elastic scaling group.
- the cooling time setting can prevent the calculation instances in the elastic scaling group from being frequently operated.
- operations on computing instances in the elastic scaling group may include any one or more of the following: capacity expansion (creating a new computing instance in the elastic scaling group, or improving the computing instance in the elastic scaling group Specifications (specification), capacity reduction (releasing or sleeping computing instances in an elastic scaling group, or reducing the specifications of computing instances in an elastic scaling group).
- the above-mentioned elastic scaling policy information further includes configuration information of the calculation instances for capacity expansion/reduction in the elastic scaling group.
- the elastic scaling policy information also includes the specifications of the computing instance to be expanded/reduced in the corresponding level, and the type of the computing instance to be expanded/reduced (for example, rendering instance, load balancing instance).
- the elastic scaling group configuration message may also include resource scheduling policy information.
- the resource scheduling policy information of the elastic scaling group is used to determine the business server where the computing instance created initially in the elastic scaling group is located and the business server where the computing instance needs to be expanded or reduced.
- the resource scheduling policy information of the elastic scaling group may include any one or more of the following: even distribution, designated cluster distribution, designated proportional distribution, automatic distribution according to load intensity, and so on.
- a certain number of calculation instances can be deployed in the corresponding service server according to one or more of the above resource scheduling policy information. For example, in the initialization phase, according to one or more of the maximum value, minimum value, and expected value of the number of calculation instances in the elastic scaling group, an average can be performed among multiple business servers in the corresponding tier of the elastic scaling group distribution.
- the expansion/reduction stage according to the number of calculation instances for expansion/reduction included in the elastic scaling strategy information in the elastic scaling group and the configuration information of the calculation instances, according to the number of tiers corresponding to the elastic scaling group
- the load intensity of each business server is allocated to the calculation instances for capacity expansion/reduction in each business server.
- the above level information, the initialization configuration information of the calculation instance, the elastic scaling policy information, and the resource scheduling policy information may be sent to the elastic scaling group management apparatus 700 through one or more elastic scaling group configuration messages.
- the user may first send one or more elastic scaling group configuration messages including hierarchical information and initialization configuration information of calculation instances to the elastic scaling group management apparatus 700 to create a calculation instance group in a specified hierarchy. Then, the user sends the elastic scaling policy information to the elastic scaling group management apparatus 700 through the elastic scaling group configuration message. Subsequently, the user instructs to apply the sent elastic scaling policy information to the created computing instance group to complete the creation of the elastic scaling group.
- each elastic scaling group includes one or more computing instance groups
- the method of deploying a certain number of computing instances in the corresponding business server according to the resource scheduling policy information is similar to the above method. For details, please refer to the above description. I won't repeat them here.
- FIG. 2 exemplarily shows a scenario in which the elastic scaling group management apparatus 700 provides a user with an elastic scaling group configuration message through a visual window.
- the user can choose according to the options of each type of information in the elastic scaling group configuration message.
- the resource scheduling policy information in the elastic scaling group is optional, and the user may or may not configure it.
- the elastic scaling group management apparatus 700 uses preset resource scheduling policy information.
- the business server includes a management device, an execution device, and at least one calculation instance running on it.
- the management device can be used as a decision-making entity of the level to which the business server belongs, responsible for managing the information of at least one computing instance running on all business servers in the level to which the business server belongs, and issuing scaling commands.
- the execution device is responsible for implementing the scaling commands issued by the management device.
- the management device and the execution device are described in detail below.
- Each level can include one or more business servers.
- the elastic scaling group management apparatus 700 can select a business server from multiple business servers included in a level as the level according to the distributed master selection algorithm.
- Other service servers included in this hierarchy may be referred to as slave service servers.
- the management device may be deployed on the main service server, so that the main service server can manage the instances or instance groups in the hierarchy to which it belongs through the management device.
- the management device can be deployed on all service servers in advance, and after one of the service servers is selected as the main service server in the hierarchy to which it belongs, the management device deployed in the main service server is activated , The remaining slave service server may not deploy the management device or the deployed management device may not be activated.
- the management device may be deployed in the main service server .
- the management device can include one or more of the following functions: storing the configuration messages of the elastic scaling group configured by the user at this level, monitoring the running status of the computing instances in this level, Determine the automatic scaling of the computing instance in the hierarchy according to the elastic scaling strategy information included in the elastic scaling group configuration message, determine the business server where the operated computing instance is located according to the resource scheduling policy information of the elastic scaling group, and report to the elastic scaling group management device 700 Synchronize the results of the elastic scaling group and calculate the results of the elastic scaling of the instance.
- the functions of the management device are described in detail below.
- the elastic scaling group configuration message configured by the user through the elastic scaling group management apparatus 700 may be forwarded by the elastic scaling group management apparatus 700 to the management apparatus in the main service server, and saved by the management apparatus.
- the management device in the main service server maintains the message channel with the slave service server in the hierarchy, and periodically receives the running status of the computing instance running on the slave service server sent from the service server. At the same time, the management device in the main service server also periodically receives the running status of the computing instances running on the main service server.
- the management device in the main service server may receive monitoring data of each computing instance.
- the monitoring data can include any one or more of the following: the number of abnormal computing instances, the number of abnormal processes, the usage rate of the central processing unit (CPU), the memory usage rate, the number of network connections, the bandwidth usage rate, and Other parameters that can reflect the operating conditions of the calculation instance.
- the management device in the main business server can determine the computing instance that needs to be operated in the elastic scaling group according to the stored elastic scaling group configuration information, for example, one or more of the elastic scaling strategy information and the initialization configuration information of the computing instance. The number, and the type of calculation instance that needs to be operated.
- the management device in the main service server determines that the elastic scaling group reaches the trigger condition in the elastic scaling policy information, it can determine the number of computing instances that need to be operated in the elastic scaling group and the number of computing instances that need to be operated according to the elastic scaling policy information.
- the management device in the main service server may determine the service server in the hierarchy where the computing instance to be operated is located according to the resource scheduling policy information, and send an elastic scaling command to the determined service server in the hierarchy.
- the elastic scaling command includes operations on the calculation instances in the elastic scaling group, the number of calculation instances that need to be operated, and the type of calculation instances that need to be operated.
- the management device in the main service server can actively synchronize the creation result and operation result of the elastic scaling group to the elastic scaling group management device 700 through the interface.
- the creation result of the elastic scaling group includes the initial creation situation of the computing instances in the elastic scaling group
- the operation result of the elastic scaling group includes the information about the execution of the elastic scaling strategy of the computing instances in the elastic scaling group.
- the execution device on each business server is responsible for monitoring the calculation instances running on the business server, and reporting the operation status of the calculation instances running on the business server to the management device in the main business server, and is responsible for implementing the management in the main business server
- the elastic scaling command issued by the device is responsible for monitoring the calculation instances running on the business server, and reporting the operation status of the calculation instances running on the business server to the management device in the main business server, and is responsible for implementing the management in the main business server The elastic scaling command issued by the device.
- the execution device can periodically obtain the monitoring data of the computing instance running on the service server.
- the execution device may also receive the elastic scaling command issued by the management device in the main service server.
- the elastic scaling command includes operations on the calculation instances in the elastic scaling group, the number of calculation instances that need to be operated, and the type of calculation instances that need to be operated.
- the execution device performs corresponding operations on the computing instance running on the business server according to the elastic scaling command.
- Fig. 4 is a flowchart of a method for creating an elastic scaling group provided by an embodiment of the present application. As shown in FIG. 4, the method may include steps 410-470, and steps 410-470 will be described in detail below.
- Step 410 The user sends an elastic scaling group configuration message to the elastic scaling group management apparatus 700.
- the user sends an elastic scaling group configuration message to the elastic scaling group management apparatus 700 through an API or a visual window to complete the creation of the elastic scaling group.
- the elastic scaling group configuration message may include, but is not limited to: hierarchical information, initial configuration information of a computing instance, and elastic scaling strategy information.
- the elastic scaling group configuration message further includes: resource scheduling policy information.
- Step 420 The elastic scaling group management device 700 sends an elastic scaling group configuration message to the management device in the main service server.
- the elastic scaling group management apparatus 700 may determine the level of the elastic scaling group that the user needs to create according to the level information in the elastic scaling group configuration message. And determine the main service server in the hierarchy, and send the above-mentioned elastic scaling group configuration message to the management device of the main service server. Optionally, if the management device on the main service server is not activated by default, the elastic scaling group management device 700 needs to activate the management device on the main service server before sending the elastic scaling group configuration message.
- Step 430 The management device in the main service server saves the elastic scaling group configuration message.
- the management device in the main service server may save the elastic scaling group configuration message. And according to the configuration message of the elastic scaling group, the number of calculation instances that need to be initialized for each business server in the hierarchy to which the created elastic scaling group belongs is determined.
- the management device in the main business server may determine the number of initial calculation instances created in the elastic scaling group according to the initial configuration information of the calculation instances in the elastic scaling group included in the elastic scaling group configuration message, and initialize the number of the calculation instances created Specifications, initialize the type of calculation instance created, etc.
- the management device can also determine the number of calculation instances that need to be initialized in each service server in the hierarchy to which the elastic scaling group belongs according to the resource scheduling policy information in the elastic scaling group included in the elastic scaling group configuration message.
- Step 440 The management device in the main service server respectively sends a command to create a calculation instance to the service server in the level to which the elastic scaling group belongs.
- the management device in the main business server determines which business servers in the tier to which the elastic scaling group belongs, the calculation instances that need to be initialized, as well as the types and numbers of calculation instances that each business server needs to initialize, can report to each of the determined calculation instances.
- the execution device of the business server sends a command to create a calculation instance.
- Each command to create a calculation instance includes the type and number of calculation instances that need to be created.
- each service server includes the main service server in the tier to which the elastic scaling group belongs, or includes the slave service server in the tier to which the elastic scaling group belongs, or includes the master service server and the slave service in the tier to which the elastic scaling group belongs server.
- Step 450 The execution device in each service server is created according to the command to create a calculation instance sent by the management device of the main service server.
- each business server in the tier to which the elastic scaling group belongs receives the command to create a calculation instance issued by the management device in the main business server
- the execution device in each business server creates it according to the needs carried in the command to create a calculation instance.
- Step 460 The execution device in each service server sends the creation result of the elastic scaling group to the management device in the main service server.
- the creation result of the elastic scaling group includes the creation result of the calculation instance, and the creation result of the calculation instance may include information about whether the creation process is successful, and information such as the identification (ID) of the created calculation instance.
- Step 470 The management device in the main service server sends the creation result of the elastic scaling group to the elastic scaling group management device 700.
- the management device in the main service server may actively send the creation result of the elastic scaling group to the elastic scaling group management device 700.
- the management device in the main service server may save the creation result of the elastic scaling group, and after receiving the user's query message, synchronize the creation result of the elastic scaling group with the elastic scaling group management device 700.
- multiple elastic scaling groups may be created by the method shown in FIG. 4.
- the following takes the creation of the first elastic scaling group and the second elastic scaling group as an example for description.
- the first elastic scaling group is deployed at the first level
- the second elastic scaling group is deployed at the second level
- a first level includes one or more second levels.
- the business servers of the first tier and the second tier overlap, and the business servers that belong to both the first tier and the second tier are called overlapping business servers.
- the overlapping service server executes the elastic scaling command corresponding to the first elastic scaling group and the elastic scaling command corresponding to the second elastic scaling group, conflicts may occur.
- the overlapping service server may conflict with computing instances and need to be operated. Conflicts in the number of calculation instances, conflicts in the execution order of elastic scaling commands, etc.
- the user may also send a conflict resolution strategy to the elastic scaling group management apparatus 700.
- the user sends a conflict resolution strategy to the elastic scaling group management apparatus 700 through an API or a visual window.
- FIG. 5 exemplarily shows another scenario in which the elastic scaling group management apparatus 700 provides the user with an elastic scaling group configuration message through a visual window.
- the configuration message of the elastic scaling group also includes a conflict resolution strategy.
- the priority of the elastic scaling strategy information can be determined. Specifically, the user can directly specify the priority of each elastic scaling strategy information in the conflict resolution strategy, or it can also indicate in the conflict resolution strategy to determine the priority of the elastic scaling strategy information according to the action time of the elastic scaling strategy information, for example, first
- the priority of the elastic scaling policy information entered into the elastic scaling group management apparatus 700 is higher (that is, the priority of the elastic scaling policy information of the first created elastic scaling group is higher), or the conflict resolution policy can be indicated in accordance with the elasticity
- the level information of the scaling group determines the priority, for example, the higher the level of the elastic scaling group, the priority of the elastic scaling strategy information is higher.
- the priority of the elastic scaling command generated according to the high priority elastic scaling strategy information is high, and the priority of the elastic scaling command generated according to the low priority elastic scaling strategy information is low.
- the overlapping service server can implement the elastic scaling command according to the pre-configured conflict resolution strategy.
- the overlapping service server if the overlapping service server first receives the high-priority elastic scaling command, it will scale the computing instance running on it according to the high-priority elastic scaling command, and within the cooling time. The received low-priority elastic scaling commands will be discarded and will not be implemented.
- the overlapping service server sends a notification that the elastic scaling command conflicts to the management device in the main service server that sent the discarded elastic scaling command, and the management device in the main service server sends a notification that the elastic scaling command conflicts to the elastic scaling group.
- the management device 700, the elastic scaling group management device 700 prompts the user in a notification manner.
- the overlapping service server if the overlapping service server has implemented the low priority elastic scaling command before receiving the high priority elastic scaling command (after the cooling time has elapsed), the overlapping service server will have the high priority
- the elastic scaling command is queued in the queue of the elastic scaling command to be implemented and waiting to be executed.
- the overlapping service server executes the high-priority elastic scaling command and discards the low-priority elastic scaling command.
- Elastic scaling command if the overlapping service server receives both the high-priority elastic scaling command and the low-priority elastic scaling command, the overlapping service server executes the high-priority elastic scaling command and discards the low-priority elastic scaling command. Elastic scaling command.
- the method may include steps 610-650, and steps 610-650 will be described in detail below.
- Step 610 The user sends a conflict resolution strategy to the elastic scaling group management apparatus 700.
- the user sends an elastic scaling group configuration message to the elastic scaling group management apparatus 700 through an API or a visual window, and the elastic scaling group configuration message includes a conflict resolution strategy.
- step 410 has been performed twice to complete the creation of two elastic scaling groups.
- Figure 6 uses the method shown in Figure 3 to create the first elastic scaling group and the second elastic scaling group as an example, where the first elastic scaling group is deployed at the first level, and the second elastic scaling group is deployed at The second level, a first level can include one or more second levels.
- the conflict resolution strategy specified by the user in the embodiment of the present application is: the priority of the elastic scaling policy information of the first elastic scaling group is higher than the priority of the elastic scaling policy information of the second elastic scaling group.
- Step 620 The elastic scaling group management device 700 sends the conflict resolution strategy to the execution device of each business server in the first level where the computing instance of the first elastic scaling group is deployed, and delivers the conflict resolution strategy to each of the second level.
- the execution device in the service server that has received the conflict resolution strategy can save the conflict resolution strategy.
- Step 630 The management device of the main service server in the first level sends the first elastic scaling command to the service server in the first level according to the first elastic scaling group configuration message.
- step 630 the initialization process of the first elastic scaling group and the second elastic scaling group has been completed, that is, step 450 in FIG. 4 has been executed.
- the management device of the main service server in the first level can send information to the service server in the first level according to the elastic scaling policy information and resource scheduling policy information included in the first elastic scaling group configuration message. Issue the first elastic scaling command.
- the elastic scaling strategy information of the first elastic scaling group includes: if the number of network connections of the computing instances in the first tier is greater than 80%, then expanding the capacity of 10 computing instances in the first tier.
- the management device in the main service server in the first level determines that the received monitoring data of the calculation instance of the first elastic scaling group meets the triggering condition in the elastic scaling policy information of the first elastic scaling group, it can be based on the first elastic scaling group.
- the resource scheduling policy information of the scaling group sends the first elastic scaling command to the service server in the first level.
- the first elastic scaling command includes operations on calculation instances in the first elastic scaling group, the number of calculation instances that need to be operated, and the number of calculation instances that need to be operated.
- Step 640 The management device of the main service server in the second level sends a second elastic scaling command to the service server in the second level according to the second elastic scaling group configuration message.
- the management apparatus of the main service server at the second level may send the second service server to the service server at the second level according to the elastic scaling strategy information and resource scheduling policy information included in the second elastic scaling group configuration message. Elastic scaling command.
- the elastic scaling policy information of the second elastic scaling group is: if the average CPU usage rate of the computing instances in the second tier is less than 20%, the capacity of the second tier is reduced by 2 computing instances.
- Step 630 the method for the management device related to the main service server in the second level to issue the second elastic scaling command to the service server in the second level according to the elastic scaling policy information and resource scheduling policy information of the second elastic scaling group is the same as Step 630 is similar.
- Step 630 please refer to the description in step 630, which will not be repeated here.
- Step 630 can be performed first, and then step 660; or step 640 can be performed first, and then step 630; or, step 630 and step 640 can be performed at the same time.
- Step 650 The overlapping service server selects to execute the first elastic scaling command or executes the second elastic scaling command according to the saved conflict resolution strategy.
- the overlapping server is a business server belonging to the first tier and a business server belonging to the second tier.
- the overlapping server may be a main service server or a secondary service server. In FIG. 6, the overlapping server is an example of a secondary service server.
- the user respectively configures a first elastic scaling group and a second elastic scaling group for service 1, and both the first elastic scaling group and the second elastic scaling group include at least one computing instance for running service 1.
- the first elastic scaling group is deployed at the first level, and the first level includes the business server 1 and the business server 2.
- the second elastic scaling group is deployed at the second level, and the second level includes the business server 2 and the business server 3.
- the first elastic scaling command issued by the main business server (for example, business server 1) in the first level to the business server 2 in the first level is to expand the capacity of 3 computing instances.
- the second elastic scaling command issued by the main business server (for example, business server 3) in the second level to the business server 2 in the second level is to reduce the capacity by one calculation instance.
- the execution device in the service server 2 needs to expand 3 computing instances if it follows the first elastic scaling command issued by the main service server in the first level , And if the business server 2 follows the second elastic scaling command issued by the main business server in the second level, it needs to reduce the capacity by one calculation instance.
- the execution device in the business server 2 can implement the elastic scaling command corresponding to the elastic scaling strategy information with higher priority according to the stored conflict resolution strategy.
- FIG. 7 is an elastic scaling group management apparatus 700 provided by an embodiment of the present application.
- the elastic scaling group management apparatus 700 is used to provide users with global AS group services.
- the elastic scaling group management apparatus 700 may include: a communication module 710 and a processing module 720.
- the communication module 710 is configured to receive an elastic scaling group configuration message sent by a user through an API or a visualization window.
- an elastic scaling group configuration message sent by a user through an API or a visualization window.
- the processing module 720 is configured to determine the level to which the elastic scaling group to be deployed belongs according to the level information in the elastic scaling group configuration message, and determine the main service server in the level.
- the communication module 710 is also configured to send the elastic scaling group configuration message to the main service server in the hierarchy.
- the communication module 710 may also receive a conflict resolution strategy sent by the user through an API or a visualization window.
- the communication module 710 sends the conflict resolution strategy to each service server in the hierarchy, including the main service server and the slave service server.
- the communication module 710 may also be configured to receive the creation result of the elastic scaling group sent by the management device in the main service server in the hierarchy.
- the communication module 710 may also be configured to receive notifications of conflicts of elastic scaling commands sent by the management device in the main service server that sent the elastic scaling commands discarded by the overlapping service server.
- the communication module 710 may also send notifications of conflicting elastic scaling commands to the user through an API or a visual window.
- the elastic scaling group management apparatus 700 here is embodied in the form of a functional module.
- module herein can be implemented in the form of software and/or hardware, which is not specifically limited.
- a “module” can be a software program, a hardware circuit, or a combination of the two that realize the above-mentioned functions.
- the software exists in the form of computer program instructions and is stored in the memory, and the processor can be used to execute the program instructions to implement the above method flow.
- the processor may include but is not limited to at least one of the following: a central processing unit (central processing unit, CPU), a microprocessor, a digital signal processing (digital signal processing, DSP), and a microcontroller (microcontroller unit, MCU) , Or artificial intelligence processors and other computing devices that run software.
- a central processing unit central processing unit, CPU
- a microprocessor central processing unit
- DSP digital signal processing
- microcontroller microcontroller unit, MCU
- Each computing device may include one or more cores for executing software instructions for calculation or processing.
- the processor can be a single semiconductor chip, or it can be integrated with other circuits to form a semiconductor chip.
- the processor can be combined with other circuits (such as codec circuits, hardware acceleration circuits, or various bus and interface circuits) to form a system-on-chip ( system on chip, SoC), or as an application-specific integrated circuit (ASIC) built-in processor integrated in the ASIC, the ASIC integrated with the processor can be packaged separately or can be combined with other The circuits are packaged together.
- the processor may also include necessary hardware accelerators, such as field programmable gate array (FPGA) and programmable logic device (FPGA). device, PLD), or a logic circuit that implements dedicated logic operations.
- FPGA field programmable gate array
- FPGA programmable logic device
- PLD programmable logic circuit that implements dedicated logic operations.
- the hardware circuits may be a general-purpose central processing unit (central processing unit, CPU), microcontroller (microcontroller unit, MCU), microprocessor (microprocessing unit, MPU), Digital signal processor (digital signal processing, DSP), system on chip (system on chip, SoC) to achieve, of course, it can also be implemented by application-specific integrated circuit (ASIC), or programmable logic device (programmable logic) device, PLD).
- the above-mentioned PLD can be a complex programmable logical device (CPLD), a field-programmable gate array (FPGA), a generic array logic (generic array logic, GAL) or its In any combination, it can run necessary software or does not rely on software to execute the above method flow.
- CPLD complex programmable logical device
- FPGA field-programmable gate array
- GAL generic array logic
- FIG. 8 is a schematic structural diagram of a global auto-scaling server 800 in a service providing system provided by an embodiment of the present application.
- the service providing system includes at least one global auto-scaling server 800 as shown in FIG. 8.
- the global auto-scaling server 800 includes a processor 802, a communication interface 803, and a memory 804.
- the global auto-scaling server 800 further includes a bus 801, and the processor 802, the memory 804, and the communication interface 803 communicate through the bus 801.
- the processor 802 may adopt a general-purpose central processing unit (CPU) to execute related program codes to implement the part executed on the side of the elastic scaling group management device in the method of the embodiment of the present application. .
- CPU central processing unit
- the memory 804 may include a volatile memory (volatile memory), such as a random access memory (random access memory, RAM).
- volatile memory such as a random access memory (random access memory, RAM).
- the memory 804 may also include non-volatile memory (non-volatile memory), such as read-only memory (ROM), flash memory, hard disk drive (HDD), solid state drive (solid state drive, SSD).
- ROM read-only memory
- HDD hard disk drive
- solid state drive solid state drive
- Executable code is stored in the memory 804, and the processor 802 executes the executable code to execute the aforementioned elastic scaling group management method.
- the memory 804 may also include an operating system and other software modules required for running processes.
- the operating system can be LINUX TM , UNIX TM , WINDO WS TM and so on.
- the memory 804 stores executable codes for implementing the processing module 720.
- the communication module 710 in the elastic scaling group management apparatus 700 is implemented through the communication interface 803.
- the communication module 710 in the elastic scaling group management apparatus 700 is implemented through the communication interface 803.
- At least one global auto-scaling server 800 in the service providing system establishes communication with each other through a communication network.
- FIG. 9 is a schematic structural diagram of a service server 900 in a service providing system provided by an embodiment of the present application.
- the service providing system includes at least one service server 90 as shown in FIG. 9.
- the service server 90 includes a processor 902, a communication interface 903 and a memory 904.
- the business server 900 further includes a bus 901, and the processor 902, the memory 904, and the communication interface 903 communicate through the bus 901.
- the processor 902 may adopt a general-purpose central processing unit for executing related programs to implement the part executed on the business server side in the method for managing an elastic scaling group in the method embodiment of the present application.
- the memory 904 may include a volatile memory (volatile memory), such as a random access memory (random access memory, RAM).
- the memory 904 may also include a non-volatile memory (non-volatile memory), such as read-only memory (ROM), flash memory, HDD or SSD.
- Executable code is stored in the memory 904, and the processor 902 executes the executable code to execute the aforementioned elastic scaling group management method.
- the memory 904 may also include an operating system and other software modules required for running processes.
- the operating system can be LINUX TM , UNIX TM , WINDOWS TM etc.
- the memory 904 stores executable codes for implementing the execution device 905 and the management device 906.
- the memory 904 also stores software modules required by other running processes, such as an operating system.
- the server in FIG. 8 or FIG. 9 may specifically be a blade server, a tower server, a personal computer, or other computers with computing functions.
- the above-mentioned embodiments may be implemented in whole or in part by software, hardware, firmware or any other combination.
- the above-mentioned embodiments may be implemented in the form of a computer program product in whole or in part.
- the computer program product includes one or more computer instructions or computer programs.
- the processes or functions described in the embodiments of the present application are generated in whole or in part.
- the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
- the computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another computer-readable storage medium.
- the computer instructions may be transmitted from a website, computer, server, or data center. Transmission to another website, computer, server or data center via wired (such as infrared, wireless, microwave, etc.).
- the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or a data center that includes one or more sets of available media.
- the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium.
- the semiconductor medium may be a solid state drive.
- At least one refers to one or more, and “multiple” refers to two or more.
- the following at least one item (a)” or similar expressions refers to any combination of these items, including any combination of single item (a) or plural items (a).
- at least one item (a) of a, b, or c can mean: a, b, c, ab, ac, bc, or abc, where a, b, and c can be single or multiple .
- the size of the sequence number of the above-mentioned processes does not mean the order of execution, and the execution order of each process should be determined by its function and internal logic, and should not correspond to the embodiments of the present application.
- the implementation process constitutes any limitation.
- the disclosed system, device, and method can be implemented in other ways.
- the device embodiments described above are only illustrative.
- the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or It can be integrated into another system, or some features can be ignored or not implemented.
- the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
- the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
- the technical solution of the present application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
- the aforementioned storage media include: U disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disk or optical disk and other media that can store program code .
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Automation & Control Theory (AREA)
- Computer And Data Communications (AREA)
- Debugging And Monitoring (AREA)
Abstract
一种弹性伸缩组的管理方法,包括:该弹性伸缩组管理装置接收弹性伸缩组配置消息,该弹性伸缩组配置消息包括该弹性伸缩组的层级信息,弹性伸缩组的计算实例的初始化配置信息和弹性伸缩组的弹性伸缩策略信息;弹性伸缩组管理装置在弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据弹性伸缩组的计算实例的初始化配置信息创建弹性伸缩组的计算实例;并根据该弹性伸缩组的弹性伸缩策略信息对弹性伸缩组的计算实例进行操作。该方法中,部署的弹性伸缩组可以支持跨层部署或操作。
Description
本申请涉及计算机领域,并且更具体地,涉及一种弹性伸缩组的管理方法、装置。
弹性伸缩(auto scaling,AS)服务是根据用户的业务需求,通过策略自动调整其业务资源的服务。用户可以根据业务需求预先自行定义弹性伸缩组以及该弹性伸缩组对应的弹性伸缩策略信息,无需提前为自己的业务准备大量业务资源。
目前在公有云的场景中可以部署弹性伸缩服务。在公有云中,由于物理区域的限制,绝大多数云计算服务不支持跨地区。在这种场景下,部署的弹性伸缩服务同样也不支持跨地区。也就是说,在公有云中部署弹性伸缩服务,设置的弹性伸缩组仅支持在一个地区中使用,计算实例也仅限于一个地区中扩容(创建新的计算实例)、减容(释放或休眠计算实例)。
发明内容
本申请提供一种弹性伸缩组的管理方法,该管理方法支持跨层级部署弹性伸缩组,弹性伸缩组的计算实例也可以在不同的层级中被操作。
第一方面,提供了一种弹性伸缩组的管理方法,该管理方法应用于业务提供系统,该业务提供系统包括弹性伸缩组管理装置和多个层级,每个层级包括至少一个业务服务器,该管理方法包括:
该弹性伸缩组管理装置接收第一弹性伸缩组配置消息,该第一弹性伸缩组配置消息包括该第一弹性伸缩组的层级信息,该第一弹性伸缩组的计算实例的初始化配置信息以及该第一弹性伸缩组的弹性伸缩策略信息;该弹性伸缩组管理装置在该第一弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据该第一弹性伸缩组的计算实例的初始化配置信息创建该第一弹性伸缩组的计算实例;该弹性伸缩组管理装置根据该第一弹性伸缩组的弹性伸缩策略信息对该第一弹性伸缩组的计算实例进行操作。
应理解,对弹性伸缩组中计算实例的操作可以包括以下任意一种或多种:扩容(在弹性伸缩组中创建新的计算实例,或者提升弹性伸缩组内计算实例的规格(specification))、减容(在弹性伸缩组中释放或休眠计算实例,或者降低弹性伸缩组内计算实例的规格)。
结合第一方面,在第一方面的某些实现方式中,该弹性伸缩组管理装置根据该第一弹性伸缩组的计算实例的运行情况和该第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;该弹性伸缩组管理装置将该第一弹性伸缩命令发送至创建了该第一弹性伸缩组的计算实例的业务服务器。
需要说明的是,弹性伸缩组管理装置并不是将第一弹性伸缩命令发送给了所有创建第一弹性伸缩组的计算实例业务服务器,而是发送给根据资源调度策略信息从创建了该第一 弹性伸缩组的计算实例的业务服务器中选择出来的部分或者全部业务服务器。
结合第一方面,在第一方面的某些实现方式中,第一弹性伸缩组配置消息中还可以包括资源调度策略信息,该资源调度策略信息用于确定弹性伸缩组内初始化创建的计算实例所在的业务服务器以及需要扩容或减容的计算实例所在的业务服务器。应理解,弹性伸缩组内资源调度策略信息为可选项,用户可以进行配置,也可以不进行配置。用户不进行配置时,弹性伸缩组管理装置采用预设的资源调度策略信息。
结合第一方面,在第一方面的某些实现方式中,该管理方法还包括:该弹性伸缩组管理装置接收第二弹性伸缩组配置消息,该第二弹性伸缩组配置消息包括该第二弹性伸缩组的层级信息,该第二弹性伸缩组的计算实例的初始化配置信息以及该第二弹性伸缩组的弹性伸缩策略信息;该弹性伸缩组管理装置在该第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据该第二弹性伸缩组的计算实例的初始化配置信息创建该第二弹性伸缩组的计算实例,其中,该第一弹性伸缩组的层级信息指示的层级包括的业务服务器和该第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;该弹性伸缩组管理装置根据该第二弹性伸缩组的弹性伸缩策略信息对该第二弹性伸缩组的计算实例进行操作。
结合第一方面,在第一方面的某些实现方式中,该弹性伸缩组管理装置根据该第二弹性伸缩组的计算实例的运行情况和该第二弹性伸缩组的弹性伸缩策略信息生成第二弹性伸缩命令;该弹性伸缩组管理装置将该第二弹性伸缩命令发送至创建了该第二弹性伸缩组的计算实例的业务服务器。
需要说明的是,弹性伸缩组管理装置并不是将第二弹性伸缩命令发送给了所有创建第二弹性伸缩组的计算实例业务服务器,而是发送给根据资源调度策略信息从创建了该第二弹性伸缩组的计算实例的业务服务器中选择出来的部分或者全部业务服务器。
结合第一方面,在第一方面的某些实现方式中,该第二弹性伸缩组配置消息还包括冲突解决策略,该冲突解决策略指示弹性伸缩策略信息的优先级,该管理方法还包括:该弹性伸缩组管理装置发送该冲突解决策略至重叠业务服务器,该重叠业务服务器包括于该第一弹性伸缩组的层级信息指示的层级且包括于该第二弹性伸缩组的层级信息指示的层级;该重叠业务服务器接收根据该第一弹性伸缩策略信息生成的该第一弹性伸缩命令;该重叠业务服务器接收根据该第二弹性伸缩策略信息生成的第二弹性伸缩命令;该重叠业务服务器根据该冲突解决策略,选择执行该第一弹性伸缩命令或该第二弹性伸缩命令。
应理解,弹性伸缩组管理装置可以识别用户创建的多个弹性伸缩组所在的层级的重叠业务服务器,并可以将冲突解决策略发给识别出的重叠业务服务器。
需要说明的是,弹性伸缩组管理装置根据冲突解决策略中指示的高优先级的弹性伸缩策略信息生成的弹性伸缩命令的优先级高,根据冲突解决策略中指示的低优先级的弹性伸缩策略信息生成的弹性伸缩命令的优先级低。
结合第一方面,在第一方面的某些实现方式中,弹性伸缩组管理装置还可以将冲突解决策略发送给创建了该第一弹性伸缩组的计算实例的每一个业务服务器。
第二方面,提供了一种弹性伸缩组的管理方法,该管理方法应用于弹性伸缩组管理装置,该管理方法包括:接收第一弹性伸缩组配置消息,该第一弹性伸缩组配置消息包括该第一弹性伸缩组的层级信息,该第一弹性伸缩组的计算实例的初始化配置信息以及该第一弹性伸缩组的弹性伸缩策略信息;在该第一弹性伸缩组的层级信息指示的层级包括的业务 服务器中,根据该第一弹性伸缩组的计算实例的初始化配置信息创建该第一弹性伸缩组的计算实例;根据该第一弹性伸缩组的弹性伸缩策略信息对该第一弹性伸缩组的计算实例进行操作。
应理解,对弹性伸缩组中计算实例的操作可以包括以下任意一种或多种:扩容(在弹性伸缩组中创建新的计算实例,或者提升弹性伸缩组内计算实例的规格(specification))、减容(在弹性伸缩组中释放或休眠计算实例,或者降低弹性伸缩组内计算实例的规格)。
结合第二方面,在第二方面的某些实现方式中,根据该第一弹性伸缩组的计算实例的运行情况和该第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;将该第一弹性伸缩命令发送至创建了该第一弹性伸缩组的计算实例的业务服务器。
需要说明的是,弹性伸缩组管理装置并不是将第一弹性伸缩命令发送给了所有创建第一弹性伸缩组的计算实例业务服务器,而是发送给根据资源调度策略信息选择出来的创建了该第一弹性伸缩组的计算实例的业务服务器中的部分或者全部业务服务器。
结合第二方面,在第二方面的某些实现方式中,第一弹性伸缩组配置消息中还可以包括资源调度策略信息,该资源调度策略信息用于确定弹性伸缩组内初始化创建的计算实例所在的业务服务器以及需要扩容或减容的计算实例所在的业务服务器。应理解,弹性伸缩组内资源调度策略信息为可选项,用户可以进行配置,也可以不进行配置。用户不进行配置时,弹性伸缩组管理装置采用预设的资源调度策略信息。
结合第二方面,在第二方面的某些实现方式中,该管理方法还包括:接收第二弹性伸缩组配置消息,该第二弹性伸缩组配置消息包括该第二弹性伸缩组的层级信息,该第二弹性伸缩组的计算实例的初始化配置信息以及该第二弹性伸缩组的弹性伸缩策略信息;在该第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据该第二弹性伸缩组的计算实例的初始化配置信息创建该第二弹性伸缩组的计算实例,其中,该第一弹性伸缩组的层级信息指示的层级包括的业务服务器和该第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;根据该第二弹性伸缩组的弹性伸缩策略信息对该第二弹性伸缩组的计算实例进行操作。
结合第二方面,在第二方面的某些实现方式中,根据该第二弹性伸缩组的计算实例的运行情况和该第二弹性伸缩组的弹性伸缩策略信息生成第二弹性伸缩命令;将该第二弹性伸缩命令发送至创建了该第二弹性伸缩组的计算实例的业务服务器。
结合第二方面,在第二方面的某些实现方式中,该第二弹性伸缩组配置消息还包括冲突解决策略,该冲突解决策略指示弹性伸缩策略信息的优先级,该管理方法还包括:发送该冲突解决策略至重叠业务服务器,该重叠业务服务器包括于该第一弹性伸缩组的层级信息指示的层级且包括于该第二弹性伸缩组的层级信息指示的层级。
应理解,弹性伸缩组管理装置可以识别用户创建的多个弹性伸缩组所在的层级的重叠业务服务器,并可以将冲突解决策略发给识别出的重叠业务服务器。
第三方面,提供了一种弹性伸缩组的管理方法,该管理方法应用于业务服务器,该业务服务器包括于第一弹性伸缩组的层级信息指示的层级且包括于该第二弹性伸缩组的层级信息指示的层级,该管理方法包括:接收弹性伸缩组管理装置下发的第一弹性伸缩命令,该第一弹性伸缩命令是该弹性伸缩组管理装置根据接收到的第一弹性伸缩组配置消息中包括的第一弹性伸缩策略信息生成的;接收该弹性伸缩组管理装置下发的第二弹性伸缩命 令,该第二弹性伸缩命令是该弹性伸缩组管理装置根据接收到的第二弹性伸缩组配置消息中包括的第二弹性伸缩策略信息生成的;接收该弹性伸缩组管理装置下发的冲突解决策略,该冲突解决策略指示弹性伸缩策略信息的优先级;根据该冲突解决策略,选择执行该第一弹性伸缩命令或该第二弹性伸缩命令。
第四方面,提供了一种弹性伸缩组的管理装置,包括:
通信模块,用于接收第一弹性伸缩组配置消息,该第一弹性伸缩组配置消息包括该第一弹性伸缩组的层级信息,该第一弹性伸缩组的计算实例的初始化配置信息以及该第一弹性伸缩组的弹性伸缩策略信息;
处理模块,用于在该第一弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据该第一弹性伸缩组的计算实例的初始化配置信息创建该第一弹性伸缩组的计算实例;
该处理模块,还用于根据该第一弹性伸缩组的弹性伸缩策略信息对该第一弹性伸缩组的计算实例进行操作。
结合第四方面,在第四方面的某些实现方式中,该处理模块具体用于:根据该第一弹性伸缩组的计算实例的运行情况和该第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;
该通信模块具体用于:将该第一弹性伸缩命令发送至创建了该第一弹性伸缩组的计算实例的业务服务器。
结合第四方面,在第四方面的某些实现方式中,该通信模块还用于:接收第二弹性伸缩组配置消息,该第二弹性伸缩组配置消息包括该第二弹性伸缩组的层级信息,该第二弹性伸缩组的计算实例的初始化配置信息以及该第二弹性伸缩组的弹性伸缩策略信息;
该处理模块还用于:在该第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据该第二弹性伸缩组的计算实例的初始化配置信息创建该第二弹性伸缩组的计算实例,其中,该第一弹性伸缩组的层级信息指示的层级包括的业务服务器和该第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;根据该第二弹性伸缩组的弹性伸缩策略信息对该第二弹性伸缩组的计算实例进行操作。
结合第四方面,在第四方面的某些实现方式中,该处理模块具体用于:根据该第二弹性伸缩组的计算实例的运行情况和该第二弹性伸缩组的弹性伸缩策略信息生成第二弹性伸缩命令;
该通信模块具体用于:将该第二弹性伸缩命令发送至创建了该第二弹性伸缩组的计算实例的业务服务器。
结合第四方面,在第四方面的某些实现方式中,该通信模块具体用于:
发送该冲突解决策略至重叠业务服务器,该重叠业务服务器包括于该第一弹性伸缩组的层级信息指示的层级且包括于该第二弹性伸缩组的层级信息指示的层级。
第五方面,提供了一种业务服务器,该业务服务器包括于第一弹性伸缩组的层级信息指示的层级且包括于该第二弹性伸缩组的层级信息指示的层级,该业务服务器包括:
通信模块,用于接收弹性伸缩组管理装置下发的第一弹性伸缩命令,该第一弹性伸缩命令是该弹性伸缩组管理装置根据接收到的第一弹性伸缩组配置消息中包括的第一弹性伸缩策略信息生成的;
该通信模块,还用于接收该弹性伸缩组管理装置下发的第二弹性伸缩命令,该第二弹 性伸缩命令是该弹性伸缩组管理装置根据接收到的第二弹性伸缩组配置消息中包括的第二弹性伸缩策略信息生成的;
该通信模块,还用于接收该弹性伸缩组管理装置下发的冲突解决策略,该冲突解决策略指示弹性伸缩策略信息的优先级;
处理模块,用于根据该冲突解决策略,选择执行该第一弹性伸缩命令或该第二弹性伸缩命令。
第六方面,提供了一种全局自动伸缩服务器,包括存储器和至少一个处理器,存储器用于程序指令,当弹性伸缩组管理装置运行时,至少一个处理器执行该存储器中的程序指令以执行第二方面或第二方面中任一种可能的实现方式中的方法。
第七方面,提供了一种业务服务器,包括存储器和至少一个处理器,存储器用于程序指令,该业务服务器运行时,至少一个处理器执行该存储器中的程序指令以执行第三方面或第三方面中任一种可能的实现方式中的方法。
第八方面,提供了一种业务提供系统,该系统包括至少一个全局自动伸缩服务器和多个层级,每个层级包括多个业务服务器,每个全局自动伸缩服务器和每个业务服务器均包括存储器和至少一个处理器,存储器用于程序指令,该至少一个全局自动伸缩服务器运行时,该至少一个全局自动伸缩服务器的处理器执行存储器中的程序指令以执行第二方面或第二方面中任一种可能的实现方式中的方法,该多个业务服务器运行时,该多个业务服务器的处理器执行存储器中的程序指令以执行第三方面或第三方面中任一种可能的实现方式中的方法。
第九方面,提供了一种非瞬态的可读存储介质,包括程序指令,当该程序指令被计算机运行时,该计算机执行如第二方面或第二方面中任一种可能的实现方式中的方法。
第十方面,提供了一种非瞬态的可读存储介质,包括程序指令,当该程序指令被计算机运行时,该计算机执行如第三方面或第三方面中任一种可能的实现方式中的方法。
第十一方面,提供了一种计算机程序产品,包括程序指令,当该程序指令被计算机运行时,该计算机执行如第二方面或第二方面中任一种可能的实现方式中的方法。
第十二方面,提供了一种计算机程序产品,包括程序指令,当该程序指令被计算机运行时,该计算机执行如第三方面或第三方面中任一种可能的实现方式中的方法。
图1是可应用于本申请实施例的业务提供系统的架构示意图。
图2是本申请实施例提供的一种弹性伸缩组管理装置700通过可视化窗口向用户提供弹性伸缩组配置消息的场景示意图。
图3是本申请实施例提供的一种业务服务器的示意性结构图。
图4是本申请实施例提供的一种创建弹性伸缩组的方法的流程图。
图5是本申请实施例提供的另一种弹性伸缩组管理装置700通过可视化窗口向用户提供弹性伸缩组配置消息的场景示意图。
图6是本申请实施例提供的一种业务服务器根据冲突解决策略对计算实例进行弹性伸缩的方法的示意性流程图。
图7是本申请实施例提供的一种弹性伸缩组管理装置700的示意性结构图。
图8是本申请实施例提供的一种业务提供系统中全局自动伸缩服务器800的示意性结构图。
图9是本申请实施例提供的一种业务提供系统中业务服务器900的示意性结构图。
下面将结合附图,对本申请中的技术方案进行描述。
弹性伸缩(auto scaling,AS)服务是根据用户的业务需求,通过策略自动调整其业务资源的服务。用户可以根据业务需求预先自行定义弹性伸缩组以及该弹性伸缩组对应的弹性伸缩策略信息,无需提前为自己的业务准备大量业务资源。业务提供系统能够根据设置的弹性伸缩策略信息自动调整弹性伸缩组中的云服务器资源,从而降低人为反复调整业务资源以应对业务变化和高峰压力的工作量,节省用户的资源和人力成本,为用户提供高效管理计算资源的策略。
具体的,业务提供系统中包括一个或多个业务服务器,业务提供系统可以根据弹性伸缩策略信息来管理弹性伸缩组中运行于这一个或多个业务服务器上的计算实例(业务资源)数量,并完成对计算实例的环境部署,保证业务平稳顺利运行。当用户业务量增大,对计算实例需求量较大时,弹性伸缩服务可自动增加弹性伸缩组中的计算实例数量,以保证性能不受影响。当用户业务量减小,对计算实例需求量较低时,则会减少弹性伸缩组中的计算实例数量,以降低成本。现有技术中,在公有云中可以提供弹性伸缩服务。公有云中会涉及地区、可用区等概念。地区(region)指数据中心所在地,可以是大区(例如,华南地区,华北地区),或者也可以是城市(例如,深圳、东莞)。可用区(available zone,AZ)指机房或云数据中心所在的物理区域,具有能耗、网络所独立的特点。一个地区通常包含一个或多个低时延互联的可用区,用于同地区内的容灾备份和负载均衡等场景与服务。由于物理区域的限制,现有的绝大多数云服务不支持跨地区部署。同样的,弹性伸缩服务同样也不支持跨地区。也就是说,在公有云中部署弹性伸缩服务,设置的弹性伸缩组仅支持在一个地区中使用,计算实例也仅限于一个地区中扩容(创建新的计算实例)、减容(释放或休眠计算实例)。
应理解,计算实例可以是虚拟机(virtual machine,VM),容器,或者运行业务的软件模块。
还应理解,本申请实施例中的每个业务服务器上可以运行有一个或多个计算实例,这一个或多个计算实例可以属于一个或多个弹性伸缩组。
每个弹性伸缩组内包括至少一个计算实例。弹性伸缩组中扩容/减容的单位可以是计算实例,也可以是计算实例组。扩容/减容的单位是计算实例的情况下,一个弹性伸缩组中包括的计算实例相同。扩容/减容的单位是计算实例组的情况下,一个弹性伸缩组中可以包括一种或多种类型的计算实例,例如每个计算实例组包括N个相同类型的计算实例,或者每个计算实例组包括多种类型的计算实例(例如负载均衡实例和渲染实例,每个计算实例组包括1个负载均衡实例和5个渲染实例)。
为了便于描述,下面中以每个弹性伸缩组中扩容/减容的单位是计算实例为例进行说明。
边缘云是基于广泛覆盖的业务服务器进行的一种云计算。边缘云服务是一种在靠近数 据源头的网络边缘侧通过业务服务器就近提供计算服务,将计算资源(业务服务器)分散到离边缘云服务的用户更近的地方的分布式计算形式。相比中心化的云计算服务,边缘云服务更能满足行业数字化在敏捷连接、实时业务、数据优化、应用智能、安全与隐私保护等方面的关键需求。本申请中的业务提供系统可以由边缘云支持,也可以由公有云、私有云或者混合云支持。
业务服务器广泛分布于各地,可以根据行政、地理等因素将各个业务服务器划分到不同的层级,例如,第一层级,第二层级,第三层级····。不同的层级之间可以相互嵌套,其中,一个第一层级中可以包括一个或多个第二层级,一个第二层级中可以包括一个或多个第三层级。具体的,作为示例,该第一层级例如可以是东北大区、西北大区、华中大区等。该第二层级可以是省级区,例如,广东省、陕西省等。该第三层级可以是市级区,例如,深圳市、西安市等。用户可以直接在指定的层级部署其所需的弹性伸缩组以及对应的弹性伸缩策略信息。
应理解,第一层级也可以称为大区,第二层级也可以称为次级区,第三层级也可以称为再次级区。一个大区中可以包括一个或多个次级区,一个次级区中可以包括一个或多个再次级区。
下面结合图1,对适用于本申请实施例的一种可能的应用场景进行详细描述。
参见图1,多个业务服务器被划分为不同的层级。作为示例,可以根据行政、地理等因素划分出多个第一层级,例如,第一层级1和第一层级2。在第一层级1中,可以包括多个第二层级,例如,第二层级1和第二层级2。每一个第二层级中包括多个第三层级,以第二层级1为例,第二层级1中包括多个第三层级,例如,第三层级1和第三层级2。每一个第三层级中可以包括多个业务服务器。例如,第三层级1中可以包括多个业务服务器,第三层级2中也可以包括多个业务服务器。其他的第一层级也可以包括多个第二层级,每一个第二层级中包括多个第三层级。例如,第一层级2,与第一层级1类似,具体的请参考对第一层级1的描述,此处不再赘述。
需要说明的是,每个层级中均可以包括一个或多个业务服务器,图1中每个层级中包括的业务服务器的数量仅仅是作为示例。
全局自动伸缩服务器中可以运行弹性伸缩组管理装置700,该弹性伸缩组管理装置700可以向用户提供该用户创建的管理弹性伸缩组的应用程序接口(application programming interface,API)或可视化窗口。用户可以通过弹性伸缩组管理装置700来完成弹性伸缩组的创建、配置、查询等操作。
下面对用户通过弹性伸缩组管理装置700对弹性伸缩组的配置的过程进行描述。用户通过向弹性伸缩组管理装置700输入弹性伸缩组配置消息完成弹性伸缩组的配置。弹性伸缩组配置消息包括:
(1)、层级信息
用户在配置弹性伸缩组时可以指定弹性伸缩组所属的层级,并在该层级上部署弹性伸缩组。该层级例如可以指示上文中描述的任意一个或多个层级。具体的请参见上文中的描述,此处不再赘述。
(2)、计算实例的初始化配置信息
弹性伸缩组的计算实例的初始化配置信息可以包括弹性伸缩组内的计算实例的数量 参数,例如,弹性伸缩组内的计算实例的初始化数量,弹性伸缩组内的计算实例的数量的最大值、最小值,弹性伸缩组内的计算实例的期望值。该计算实例的初始化配置信息还可以包括计算实例的配置信息,例如,计算实例的规格,计算实例的类型(例如,渲染实例,负载均衡实例),计算实例的网络配置,计算实例采用的镜像,用户标识等信息。
(3)、弹性伸缩策略信息
弹性伸缩组的弹性伸缩策略信息包括触发条件和达到该设定的触发条件的情况下触发的对弹性伸缩组中计算实例的操作。弹性伸缩策略信息可以包括以下任意一种或多种:静态伸缩策略信息,动态伸缩策略信息。其中,
静态伸缩策略信息中包括设定的静态触发条件,例如时间条件,以及达到设定的静态触发条件的情况下触发的对弹性伸缩组中计算实例的操作。
作为示例,列举一种可能的弹性伸缩策略信息。触发条件:下午11点-下午13点。弹性伸缩策略信息:在该弹性伸缩组所在的层级内减容2个计算实例。
动态伸缩策略信息中包括设定的动态触发条件,以及达到该设定的动态触发条件的情况下触发的对弹性伸缩组中计算实例的操作。
作为示例,列举一种可能的弹性伸缩策略信息。触发条件:弹性伸缩组内任意一个计算实例的负载达到满负载的80%。弹性伸缩策略信息:在该弹性伸缩组所在的层级内扩容2个计算实例。
可选地,该弹性伸缩策略信息中还包括冷却时间,该冷却时间不允许对弹性伸缩组内的计算实例进行操作。冷却时间的设置可以防止弹性伸缩组内的计算实例被频繁操作。
应理解,以上各个策略信息中,对弹性伸缩组中计算实例的操作可以包括以下任意一种或多种:扩容(在弹性伸缩组中创建新的计算实例,或者提升弹性伸缩组内计算实例的规格(specification))、减容(在弹性伸缩组中释放或休眠计算实例,或者降低弹性伸缩组内计算实例的规格)。
可选地,上述弹性伸缩策略信息中还包括弹性伸缩组中扩容/减容的计算实例的配置信息。作为示例,该弹性伸缩策略信息中还包括在对应的层级内扩容/减容的计算实例的规格,扩容/减容的计算实例的类型(例如,渲染实例,负载均衡实例)。
可选地,在一些实施例中,弹性伸缩组配置消息中还可以包括资源调度策略信息。
(4)、资源调度策略信息
弹性伸缩组的资源调度策略信息用于确定弹性伸缩组内初始化创建的计算实例所在的业务服务器以及需要扩容或减容的计算实例所在的业务服务器。
弹性伸缩组资源调度策略信息可以包括以下任意一种或多种:平均分配,指定集群分配,指定比例分配,按负载强度自动分配等。
以每个弹性伸缩组中的一个或多个计算实例为同一类计算实例为例。可以根据上述资源调度策略信息中的一种或多种,在相应的业务服务器中部署一定数量的计算实例。例如,在初始化阶段,可以根据弹性伸缩组内的计算实例的数量的最大值,最小值,期望值中的一个或多个,在该弹性伸缩组对应的层级中的多个业务服务器之间进行平均分配。又如,在扩容/减容阶段,可以根据弹性伸缩组内弹性伸缩策略信息中包括的扩容/减容的计算实例的数量以及计算实例的配置信息,按照该弹性伸缩组对应的层级中的多个业务服务器的负载强度,对各个业务服务器中扩容/减容的计算实例进行分配。
需要说明的是,以上层级信息、计算实例的初始化配置信息、弹性伸缩策略信息、资源调度策略信息可以通过一个或多个弹性伸缩组配置消息发送至弹性伸缩组管理装置700。示例性的,用户可以先向弹性伸缩组管理装置700发送包括层级信息和计算实例的初始化配置信息的一个或多个弹性伸缩组配置消息,以在指定的层级内创建一个计算实例组。然后,用户通过弹性伸缩组配置消息发送弹性伸缩策略信息至弹性伸缩组管理装置700。随后,用户指示将发送的弹性伸缩策略信息应用到创建的计算实例组上以完成弹性伸缩组的创建。
每个弹性伸缩组包括一个或多个计算实例组的场景中,根据资源调度策略信息在相应的业务服务器中部署一定数量的计算实例的方法与上述方法类似,具体的请参考上文中的描述,此处不再赘述。
图2示例性的展示了,弹性伸缩组管理装置700通过可视化窗口向用户提供弹性伸缩组配置消息的场景。参见图2,用户可以根据弹性伸缩组配置消息中每种信息的选项进行选择。应理解,弹性伸缩组内资源调度策略信息为可选项,用户可以进行配置,也可以不进行配置。用户不进行配置时,弹性伸缩组管理装置700采用预设的资源调度策略信息。
下面结合图3,对业务服务器进行详细描述。图3中,业务服务器中包括管理装置,执行装置,以及其上运行的至少一个计算实例。
参见图3,其中,管理装置可以作为业务服务器所属的层级的决策实体,负责管理业务服务器所属的层级中的所有业务服务器上运行的至少一个计算实例的信息,并下发伸缩命令。执行装置作为业务服务器的执行实体,负责实施管理装置下发的伸缩命令。
下面分别对管理装置和执行装置进行详细描述。
(1)管理装置
每个层级中可以包括一个或多个业务服务器,针对每个层级,弹性伸缩组管理装置700根据分布式选主算法,可以从一个层级中包括的多个业务服务器中选择一个业务服务器作为该层级中的主业务服务器。该层级中包括的其他业务服务器可以称为从业务服务器。
管理装置可以部署在主业务服务器,以便于该主业务服务器通过管理装置管理其所属的层级内的实例或实例组。
具体的,一种可能的实现方式中,管理装置可以事先部署在所有业务服务器上,在其中一个业务服务器被选择为所属的层级内的主业务服务器后,将主业务服务器中部署的管理装置激活,其余从业务服务器上可以不部署管理装置或者部署的管理装置不被激活。另一种可能的实现方式中,还可以根据分布式选主算法,从层级中的多个业务服务器中选择一个业务服务器作为该层级中的主业务服务器之后,在该主业务服务器中部署管理装置。
管理装置作为主业务服务器中的决策主体,其所具有的功能可以包括以下中的一种或多种:存储用户在该层级配置的弹性伸缩组配置消息、监控该层级内计算实例的运行情况、根据弹性伸缩组配置消息中包括的弹性伸缩策略信息确定该层级内计算实例的自动伸缩、根据弹性伸缩组的资源调度策略信息确定被操作的计算实例所在的业务服务器、向弹性伸缩组管理装置700同步弹性伸缩组的结果和计算实例弹性伸缩的操作结果。下面对管理装置所具有的功能进行详细描述。
1、存储用户在该层级配置的弹性伸缩组配置消息
用户通过弹性伸缩组管理装置700配置的弹性伸缩组配置消息可以由弹性伸缩组管理装置700转发至主业务服务器中的管理装置,并由该管理装置保存。
2、监控该层级内计算实例的运行情况
主业务服务器中的管理装置维护着与该层级内的从业务服务器之间的消息通道,并周期性地接收从业务服务器发送的从业务服务器上运行的计算实例的运行情况。同时,主业务服务器中的管理装置也周期性地接收主业务服务器上运行的计算实例的运行情况。
具体的,主业务服务器中的管理装置可以接收各计算实例的监控数据。监控数据可以包括以下任意一种或多种:异常计算实例的数量、异常进程的数量、中央处理器(central processing unit,CPU)的使用率、内存使用率、网络连接数、带宽使用率,以及其他能够体现计算实例的运行情况的参数。
3、根据弹性伸缩策略信息确定该层级内计算实例的自动伸缩。
主业务服务器中的管理装置可以根据保存的弹性伸缩组配置消息,例如,弹性伸缩策略信息、计算实例的初始化配置信息中的一个或多个,确定在该弹性伸缩组中需要被操作的计算实例的数量,以及需要被操作的计算实例的类型。
具体的,当主业务服务器中的管理装置确定弹性伸缩组达到弹性伸缩策略信息中的触发条件时,可以根据弹性伸缩策略信息确定在该弹性伸缩组中需要被操作的计算实例的数量,以及需要被操作的计算实例的类型,并生成弹性伸缩命令。
4、根据资源调度策略信息向确定出的该层级内的业务服务器发送弹性伸缩命令。
主业务服务器中的管理装置可以根据资源调度策略信息在该层级内确定需要被操作的计算实例所在的业务服务器,并向确定出的该层级内的业务服务器发送弹性伸缩命令。该弹性伸缩命令包括对弹性伸缩组中计算实例的操作,以及需要被操作的计算实例的数量,需要被操作的计算实例的类型。
5、向弹性伸缩组管理装置700同步弹性伸缩组的创建结果和操作结果。
主业务服务器中的管理装置可以通过接口主动向弹性伸缩组管理装置700同步弹性伸缩组的创建结果和操作结果。该弹性伸缩组的创建结果包括弹性伸缩组内计算实例的初始化创建情况,该弹性伸缩组的操作结果包括弹性伸缩组内计算实例执行弹性伸缩策略信息的情况。
(2)执行装置
每个业务服务器上的执行装置负责监控本业务服务器上运行的计算实例,并向主业务服务器中的管理装置上报本业务服务器上运行的计算实例的运行情况,以及负责实施主业务服务器中的管理装置下发的弹性伸缩命令。
执行装置可以周期性的获取本业务服务器上运行的计算实例的监控数据。
执行装置还可以接收主业务服务器中的管理装置下发的弹性伸缩命令。该弹性伸缩命令包括对弹性伸缩组中计算实例的操作,需要被操作的计算实例的数量,以及需要被操作的计算实例的类型。执行装置根据弹性伸缩命令对本业务服务器上运行的计算实例进行相应的操作。下面先对用户创建弹性伸缩组的过程进行详细描述。
图4是本申请实施例提供的一种创建弹性伸缩组的方法的流程图。如图4所示,该方法可以包括步骤410-470,下面分别对步骤410-470进行详细描述。
步骤410:用户向弹性伸缩组管理装置700发送弹性伸缩组配置消息。
用户通过API或可视化窗口向弹性伸缩组管理装置700发送弹性伸缩组配置消息以完成创建弹性伸缩组。
弹性伸缩组配置消息可以包括但不限于:层级信息、计算实例的初始化配置信息、弹性伸缩策略信息。可选地,弹性伸缩组配置消息中还包括:资源调度策略信息。
步骤420:弹性伸缩组管理装置700向主业务服务器中的管理装置发送弹性伸缩组配置消息。
具体的,弹性伸缩组管理装置700可以根据弹性伸缩组配置消息中的层级信息,确定用户需要创建的弹性伸缩组所属的层级。并确定该层级中的主业务服务器,并向该主业务服务器的管理装置发送上述弹性伸缩组配置消息。可选的,如果主业务服务器上的管理装置默认未被激活,弹性伸缩组管理装置700发送弹性伸缩组配置消息前还需要激活主业务服务器上的管理装置。
步骤430:主业务服务器中的管理装置保存弹性伸缩组配置消息。
主业务服务器中的管理装置在接收到弹性伸缩组管理装置700发送的弹性伸缩组配置消息后,可以保存该弹性伸缩组配置消息。并根据该弹性伸缩组配置消息确定创建的弹性伸缩组所属的层级中各个业务服务器需要初始化创建的计算实例的数量。
具体的,主业务服务器中的管理装置可以根据弹性伸缩组配置消息中包括的弹性伸缩组内计算实例的初始化配置信息,确定弹性伸缩组中初始化创建的计算实例的数量,初始化创建的计算实例的规格,初始化创建的计算实例的类型等。该管理装置还可以根据弹性伸缩组配置消息中包括的弹性伸缩组内资源调度策略信息确定弹性伸缩组所属的层级内各个业务服务器中需要初始化创建的计算实例的数量。
步骤440:主业务服务器中的管理装置分别向弹性伸缩组所属的层级中的业务服务器发送创建计算实例的命令。
主业务服务器中的管理装置在确定弹性伸缩组所属的层级中哪些业务服务器中需要初始化创建的计算实例,以及各业务服务器需要初始化创建的计算实例的类型和数量后,可以分别向确定出的各业务服务器的执行装置发送创建计算实例的命令。每个创建计算实例的命令包括需要创建的计算实例的类型和数量。
应理解,该各业务服务器包括弹性伸缩组所属的层级内的主业务服务器,或包括弹性伸缩组所属的层级内的从业务服务器,或包括弹性伸缩组所属的层级内的主业务服务器和从业务服务器。
步骤450:各个业务服务器中的执行装置根据主业务服务器的管理装置发送的创建计算实例的命令创建。
弹性伸缩组所属的层级中的各个业务服务器在接收到主业务服务器中的管理装置下发的创建计算实例的命令后,各个业务服务器中的执行装置根据该创建计算实例的命令中携带的需要创建的计算实例的类型和数量,创建计算实例。
步骤460:各个业务服务器中的执行装置向主业务服务器中的管理装置发送弹性伸缩组的创建结果。
弹性伸缩组的创建结果中包括计算实例的创建结果,该计算实例的创建结果可以包括创建过程是否成功的信息,以及所创建的计算实例的标识(identification,ID)等信息。
步骤470:主业务服务器中的管理装置向弹性伸缩组管理装置700发送弹性伸缩组的 创建结果。
具体的,可以是主业务服务器中的管理装置主动向弹性伸缩组管理装置700发送弹性伸缩组的创建结果。或者还可以是主业务服务器中的管理装置保存弹性伸缩组的创建结果,在接收到用户的查询消息之后,向弹性伸缩组管理装置700同步弹性伸缩组的创建结果。
本申请实施例中可以通过图4所示的方法创建多个弹性伸缩组。为了便于描述,下面以创建了第一弹性伸缩组和第二弹性伸缩组为例进行说明。其中,第一弹性伸缩组部署在第一层级,第二弹性伸缩组部署在第二层级,一个第一层级中包括一个或多个第二层级。
由于第二层级包括于第一层级,第一层级和第二层级的业务服务器存在重叠,将同时属于第一层级和第二层级的业务服务器称之为重叠业务服务器。重叠业务服务器在执行第一弹性伸缩组对应的弹性伸缩命令和第二弹性伸缩组对应的弹性伸缩命令时,可能会出现冲突,例如,重叠业务服务器可能会出现计算实例的操作冲突,需要被操作的计算实例的数量冲突,弹性伸缩命令的执行顺序冲突等情况。
本申请实施例中,用户还可以向弹性伸缩组管理装置700发送冲突解决策略。具体的,用户通过API或可视化窗口向弹性伸缩组管理装置700发送冲突解决策略。图5示例性的展示了,弹性伸缩组管理装置700通过可视化窗口向用户提供弹性伸缩组配置消息的另一种场景。参见图5,弹性伸缩组配置消息中还包括冲突解决策略。
根据冲突解决策略可以确定弹性伸缩策略信息的优先级。具体的,可以由用户在冲突解决策略中直接指定各弹性伸缩策略信息的优先级,或者还可以在冲突解决策略中指示根据弹性伸缩策略信息的作用时间决定弹性伸缩策略信息的优先级,例如先录入弹性伸缩组管理装置700的弹性伸缩策略信息的优先级更高(也即先被创建的弹性伸缩组的弹性伸缩策略信息的优先级越高),或者还可以在冲突解决策略中指示根据弹性伸缩组的层级信息决定优先级,例如层级越高的弹性伸缩组的弹性伸缩策略信息的优先级更高。根据高优先级的弹性伸缩策略信息生成的弹性伸缩命令的优先级为高,根据低优先级的弹性伸缩策略信息生成的弹性伸缩命令的优先级为低。重叠业务服务器可以根据事先配置的冲突解决策略来实施弹性伸缩命令。
一种可能的实现方式中,若重叠业务服务器先接收到高优先级的弹性伸缩命令,则按照高优先级的弹性伸缩命令对其上运行的计算实例实施伸缩,而在冷却时间渡过之内收到的低优先级的弹性伸缩命令会被丢弃,不再实施。重叠业务服务器向发送该被丢弃的弹性伸缩命令的主业务服务器中的管理装置发送弹性伸缩命令发生冲突的通知,该主业务服务器中的管理装置再发送弹性伸缩命令发生冲突的通知给弹性伸缩组管理装置700,弹性伸缩组管理装置700以通知的方式提示用户。
另一种可能的实现方式中,若重叠业务服务器已经实施过低优先级的弹性伸缩命令后才收到高优先级的弹性伸缩命令(冷却时间渡过之后),则重叠业务服务器将高优先级的弹性伸缩命令排入要实施的弹性伸缩命令的队列中等待执行。
另一种可能的实现方式中,若重叠业务服务器同时接收到高优先级的弹性伸缩命令和低优先级的弹性伸缩命令,则重叠业务服务器执行高优先级的弹性伸缩命令,丢弃低优先级的弹性伸缩命令。
下面结合图6,对重叠业务服务器根据冲突解决策略对其上运行的计算实例进行弹性 伸缩的过程进行详细描述。如图6所示,所述方法可以包括步骤610-650,下面分别对步骤610-650进行详细描述。
步骤610:用户向弹性伸缩组管理装置700发送冲突解决策略。
用户通过API或可视化窗口向弹性伸缩组管理装置700发送弹性伸缩组配置消息,该弹性伸缩组配置消息中包括冲突解决策略。
应理解,在执行步骤610之前,已经执行两次步骤410以完成两个弹性伸缩组的创建。为了便于描述,图6中以通过图3所示的方法创建第一弹性伸缩组和第二弹性伸缩组为例,其中,第一弹性伸缩组部署在第一层级,第二弹性伸缩组部署在第二层级,一个第一层级中可以包括一个或多个第二层级。
作为示例,本申请实施例中以用户指定的冲突解决策略为:第一弹性伸缩组的弹性伸缩策略信息的优先级高于第二弹性伸缩组的弹性伸缩策略信息的优先级。
步骤620:弹性伸缩组管理装置700将冲突解决策略发送至第一层级中每一个部署了第一弹性伸缩组的计算实例的业务服务器的执行装置,将冲突解决策略下发至第二层级中每一个部署了第二弹性伸缩组的计算实例的业务服务器的执行装置。
接收到冲突解决策略的业务服务器中的执行装置可以保存该冲突解决策略。
步骤630:第一层级中的主业务服务器的管理装置根据第一弹性伸缩组配置消息向第一层级中的业务服务器发送第一弹性伸缩命令。
应理解,在步骤630之前,已经完成第一弹性伸缩组和第二弹性伸缩组的初始化过程,即已执行图4中的步骤450。
参考图4对应的流程图,第一层级中的主业务服务器的管理装置可以根据第一弹性伸缩组配置消息中包括的弹性伸缩策略信息以及资源调度策略信息,向该第一层级中的业务服务器下发第一弹性伸缩命令。
作为示例,第一弹性伸缩组的弹性伸缩策略信息包括:如果第一层级中计算实例的网络连接数大于80%,则在第一层级中扩容10个计算实例。
当第一层级中的主业务服务器中的管理装置确定接收到的第一弹性伸缩组的计算实例的监控数据达到第一弹性伸缩组的弹性伸缩策略信息中的触发条件时,可以根据第一弹性伸缩组的资源调度策略信息向该第一层级内的业务服务器发送第一弹性伸缩命令。
该第一弹性伸缩命令包括对第一弹性伸缩组中计算实例的操作,需要被操作的计算实例的数量,以及需要被操作的计算实例的数量。
步骤640:第二层级中的主业务服务器的管理装置根据第二弹性伸缩组配置消息向该第二层级中的业务服务器发送第二弹性伸缩命令。
本申请实施例中,第二层级的主业务服务器的管理装置可以根据第二弹性伸缩组配置消息中包括的弹性伸缩策略信息以及资源调度策略信息,向该第二层级中的业务服务器发送第二弹性伸缩命令。
作为示例,第二弹性伸缩组的弹性伸缩策略信息为:如果第二层级中计算实例的CPU的平均使用率小于20%,则在第二层级中减容2个计算实例。
具体的有关第二层级中的主业务服务器的管理装置根据第二弹性伸缩组的弹性伸缩策略信息以及资源调度策略信息,向该第二层级中的业务服务器下发第二弹性伸缩命令的方法与步骤630类似,具体的请参考步骤630中的描述,此处不再赘述。
值得说明的是,步骤630和步骤640之间并无先后顺序关系,可以先执行步骤630,再执行步骤660;也可以先执行步骤640,再执行步骤630;或者,同时执行步骤630和步骤640,本申请对此不做限制。步骤650:重叠业务服务器根据保存的冲突解决策略来选择执行第一弹性伸缩命令或执行第二弹性伸缩命令。重叠服务器是属于第一层级的业务服务器且属于第二层级的业务服务器,重叠服务器可以是主业务服务器或者从业务服务器,图6中以重叠服务器为从业务服务器为例。
例如,用户分别为业务1分别配置第一弹性伸缩组和第二弹性伸缩组,第一弹性伸缩组和第二弹性伸缩组内均包括至少一个用于运行业务1的计算实例。其中,第一弹性伸缩组部署在第一层级,该第一层级中包括业务服务器1、业务服务器2。第二弹性伸缩组部署在第二层级,该第二层级中包括业务服务器2、业务服务器3。
第一层级中的主业务服务器(例如,业务服务器1)向第一层级中的业务服务器2下发的第一弹性伸缩命令为扩容3个计算实例。
第二层级中的主业务服务器(例如,业务服务器3)向第二层级中的业务服务器2下发的第二弹性伸缩命令为减容1个计算实例。
对于同时属于第一层级和第二层级的重叠业务服务器2而言,业务服务器2中的执行装置如果按照第一层级中的主业务服务器下发的第一弹性伸缩命令则需要扩容3个计算实例,而如果业务服务器2按照第二层级中的主业务服务器下发的第二弹性伸缩命令则需要减容1个计算实例。
此时,业务服务器2中的执行装置可以根据保存的冲突解决策略来实施优先级更高的弹性伸缩策略信息对应的弹性伸缩命令。
上文结合图1至图6,详细描述了本申请实施例提供的一种弹性伸缩组的管理方法,下面详细描述本申请的装置的实施例。应理解,方法实施例的描述与装置实施例的描述相互对应,因此,未详细描述的部分可以参见前面方法实施例。
图7是本申请实施例提供的一种弹性伸缩组管理装置700。弹性伸缩组管理装置700用于向用户提供全局AS组服务。该弹性伸缩组管理装置700可以包括:通信模块710,处理模块720。
具体的,通信模块710用于接收用户通过API或可视化窗口发送的弹性伸缩组配置消息。具体的请参见上文中的描述,此处不再赘述。
处理模块720用于根据弹性伸缩组配置消息中的层级信息确定待部署的弹性伸缩组所属的层级,并确定该层级中的主业务服务器。
通信模块710还用于将弹性伸缩组配置消息发送至该层级中的主业务服务器。
可选地,在一些实施例中,通信模块710还可以接收用户通过API或可视化窗口发送的冲突解决策略。通信模块710冲突解决策略发送给该层级中的每一个业务服务器,包括主业务服务器和从业务服务器。
可选地,在一些实施例中,通信模块710还可以用于接收该层级中的主业务服务器中的管理装置发送的弹性伸缩组的创建结果。
可选地,在一些实施例中,通信模块710还可以用于接收发送被重叠业务服务器丢弃的弹性伸缩命令的主业务服务器中的管理装置发送的弹性伸缩命令冲突的通知。
可选地,在一些实施例中,通信模块710还可以通过API或可视化窗口向用户发送弹 性伸缩命令冲突的通知。
应理解,这里的弹性伸缩组管理装置700以功能模块的形式体现。这里的术语“模块”可以通过软件和/或硬件形式实现,对此不作具体限定。例如,“模块”可以是实现上述功能的软件程序、硬件电路或二者结合。当以上任意一个模块以软件实现的时候,所述软件以计算机程序指令的方式存在,并被存储在存储器中,处理器可以用于执行所述程序指令以实现以上方法流程。所述处理器可以包括但不限于以下至少一种:中央处理单元(central processing unit,CPU)、微处理器、数字信号处理器(digital signal processing,DSP)、微控制器(microcontroller unit,MCU)、或人工智能处理器等各类运行软件的计算设备,每种计算设备可包括一个或多个用于执行软件指令以进行运算或处理的核。该处理器可以是个单独的半导体芯片,也可以跟其他电路一起集成为一个半导体芯片,例如,可以跟其他电路(如编解码电路、硬件加速电路或各种总线和接口电路)构成一个片上系统(system on chip,SoC),或者也可以作为一个专用集成电路(application-specific integrated circuit,ASIC)的内置处理器集成在所述ASIC当中,该集成了处理器的ASIC可以单独封装或者也可以跟其他电路封装在一起。该处理器除了包括用于执行软件指令以进行运算或处理的核外,还可进一步包括必要的硬件加速器,如现场可编程门阵列(field programmable gate array,FPGA)、可编程逻辑器件(programmable logic device,PLD)、或者实现专用逻辑运算的逻辑电路。
当以上模块以硬件电路实现的时候,所述硬件电路可能以通用中央处理器(central processing unit,CPU)、微控制器(micro controller unit,MCU)、微处理器(micro processing unit,MPU)、数字信号处理器(digital signal processing,DSP)、片上系统(system on chip,SoC)来实现,当然也可以采用专用集成电路(application-specific integrated circuit,ASIC)实现,或可编程逻辑器件(programmable logic device,PLD)实现,上述PLD可以是复杂程序逻辑器件(complex programmable logical device,CPLD),现场可编程门阵列(field-programmable gate array,FPGA),通用阵列逻辑(generic array logic,GAL)或其任意组合,其可以运行必要的软件或不依赖于软件以执行以上方法流程。
图8是本申请实施例提供的一种业务提供系统中的全局自动伸缩服务器800的示意性结构图。
业务提供系统中包括至少一个如图8所示的全局自动伸缩服务器800,所述全局自动伸缩服务器800包括:处理器802、通信接口803和存储器804。可选的,全局自动伸缩服务器800还包括总线801,处理器802、存储器804和通信接口803之间通过总线801通信。
处理器802可以采用通用的中央处理器(central processing unit,CPU),用于执行相关程序代码,以实现本申请方法侧实施例的弹性伸缩组的管理方法中弹性伸缩组管理装置侧执行的部分。
存储器804可以包括易失性存储器(volatile memory),例如随机存取存储器(random access memory,RAM)。存储器804还可以包括非易失性存储器(non-volatile memory),例如只读存储器(read-only memory,ROM),快闪存储器,硬盘(hard disk drive,HDD)、固态硬盘(solid state drive,SSD)。存储器804中存储有可执行代码,处理器802执行该可执行代码以执行前述弹性伸缩组的管理方法。存储器804中还可以包括操作系统等其他 运行进程所需的软件模块。操作系统可以为LINUX
TM,UNIX
TM,WINDO WS
TM等。
具体的,存储器804中存储有用于实现处理模块720的可执行代码。弹性伸缩组管理装置700中的通信模块710通过通信接口803实现。
弹性伸缩组管理装置700中的通信模块710通过通信接口803实现。所述业务提供系统中的至少一个全局自动伸缩服务器800之间通过通信网络互相建立通信。
图9是本申请实施例提供的一种业务提供系统中的业务服务器900的示意性结构图。
业务提供系统中包括至少一个如图9所示的业务服务器90,所述业务服务器90包括:处理器902、通信接口903和存储器904。可选的,业务服务器900还包括总线901,处理器902、存储器904和通信接口903之间通过总线901通信。
处理器902可以采用通用的中央处理器,用于执行相关程序,以实现本申请方法实施例的弹性伸缩组的管理方法中业务服务器侧执行的部分。
存储器904可以包括易失性存储器(volatile memory),例如随机存取存储器(random access memory,RAM)。存储器904还可以包括非易失性存储器(non-volatile memory),例如只读存储器(read-only memory,ROM),快闪存储器,HDD或SSD。存储器904中存储有可执行代码,处理器902执行该可执行代码以执行前述弹性伸缩组的管理方法。存储器904中还可以包括操作系统等其他运行进程所需的软件模块。操作系统可以为LINUX
TM,UNIX
TM,WINDOWS
TM等。
具体的,存储器904中存储用于实现执行装置905和管理装置906的可执行代码。存储器904中还存储包括操作系统等其他运行进程所需的软件模块。
图8或图9中的服务器,具体可以是刀片式服务器,塔式服务器,还可以是个人计算机,或其他具有计算功能的计算机。
上述实施例,可以全部或部分地通过软件、硬件、固件或其他任意组合来实现。当使用软件实现时,上述实施例可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令或计算机程序。在计算机上加载或执行所述计算机指令或计算机程序时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以为通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集合的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质。半导体介质可以是固态硬盘。
应理解,本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况,其中A,B可以是单数或者复数。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系,但也可能表示的是一种“和/或”的关系,具体可参考前后文进行理解。
本申请中,“至少一个”是指一个或者多个,“多个”是指两个或两个以上。“以下 至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。
应理解,在本申请的各种实施例中,上述各过程的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。
Claims (22)
- 一种弹性伸缩组的管理方法,其特征在于,所述管理方法应用于业务提供系统,所述业务提供系统包括弹性伸缩组管理装置和多个层级,每个层级包括至少一个业务服务器,所述管理方法包括:所述弹性伸缩组管理装置接收第一弹性伸缩组配置消息,所述第一弹性伸缩组配置消息包括所述第一弹性伸缩组的层级信息,所述第一弹性伸缩组的计算实例的初始化配置信息以及所述第一弹性伸缩组的弹性伸缩策略信息;所述弹性伸缩组管理装置在所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第一弹性伸缩组的计算实例的初始化配置信息创建所述第一弹性伸缩组的计算实例;所述弹性伸缩组管理装置根据所述第一弹性伸缩组的弹性伸缩策略信息对所述第一弹性伸缩组的计算实例进行操作。
- 如权利要求1所述的管理方法,其特征在于,所述弹性伸缩组管理装置根据所述第一弹性伸缩组的弹性伸缩策略信息对所述第一弹性伸缩组的计算实例进行操作包括:所述弹性伸缩组管理装置根据所述第一弹性伸缩组的计算实例的运行情况和所述第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;所述弹性伸缩组管理装置将所述第一弹性伸缩命令发送至创建了所述第一弹性伸缩组的计算实例的业务服务器。
- 如权利要求1或2所述的管理方法,其特征在于,所述管理方法还包括:所述弹性伸缩组管理装置接收第二弹性伸缩组配置消息,所述第二弹性伸缩组配置消息包括所述第二弹性伸缩组的层级信息,所述第二弹性伸缩组的计算实例的初始化配置信息以及所述第二弹性伸缩组的弹性伸缩策略信息;所述弹性伸缩组管理装置在所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第二弹性伸缩组的计算实例的初始化配置信息创建所述第二弹性伸缩组的计算实例,其中,所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器和所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;所述弹性伸缩组管理装置根据所述第二弹性伸缩组的弹性伸缩策略信息对所述第二弹性伸缩组的计算实例进行操作。
- 如权利要求3所述的管理方法,其特征在于,所述第二弹性伸缩组配置消息还包括冲突解决策略,所述冲突解决策略指示弹性伸缩策略信息的优先级,所述管理方法还包括:所述弹性伸缩组管理装置发送所述冲突解决策略至重叠业务服务器,所述重叠业务服务器包括于所述第一弹性伸缩组的层级信息指示的层级且包括于所述第二弹性伸缩组的层级信息指示的层级;所述重叠业务服务器接收根据所述第一弹性伸缩策略信息生成的所述第一弹性伸缩命令;所述重叠业务服务器接收根据所述第二弹性伸缩策略信息生成的第二弹性伸缩命令;所述重叠业务服务器根据所述冲突解决策略,选择执行所述第一弹性伸缩命令或所述第二弹性伸缩命令。
- 一种弹性伸缩组的管理方法,其特征在于,所述管理方法应用于弹性伸缩组管理装置,所述管理方法包括:接收第一弹性伸缩组配置消息,所述第一弹性伸缩组配置消息包括所述第一弹性伸缩组的层级信息,所述第一弹性伸缩组的计算实例的初始化配置信息以及所述第一弹性伸缩组的弹性伸缩策略信息;在所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第一弹性伸缩组的计算实例的初始化配置信息创建所述第一弹性伸缩组的计算实例;根据所述第一弹性伸缩组的弹性伸缩策略信息对所述第一弹性伸缩组的计算实例进行操作。
- 如权利要求5所述的管理方法,其特征在于,所述根据所述第一弹性伸缩组的弹性伸缩策略信息对所述第一弹性伸缩组的计算实例进行操作,包括:根据所述第一弹性伸缩组的计算实例的运行情况和所述第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;将所述第一弹性伸缩命令发送至创建了所述第一弹性伸缩组的计算实例的业务服务器。
- 如权利要求5或6所述的管理方法,其特征在于,所述管理方法还包括:接收第二弹性伸缩组配置消息,所述第二弹性伸缩组配置消息包括所述第二弹性伸缩组的层级信息,所述第二弹性伸缩组的计算实例的初始化配置信息以及所述第二弹性伸缩组的弹性伸缩策略信息;在所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第二弹性伸缩组的计算实例的初始化配置信息创建所述第二弹性伸缩组的计算实例,其中,所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器和所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;根据所述第二弹性伸缩组的弹性伸缩策略信息对所述第二弹性伸缩组的计算实例进行操作。
- 如权利要求7所述的管理方法,其特征在于,所述根据所述第二弹性伸缩组的弹性伸缩策略信息对所述第二弹性伸缩组的计算实例进行操作,包括:根据所述第二弹性伸缩组的计算实例的运行情况和所述第二弹性伸缩组的弹性伸缩策略信息生成第二弹性伸缩命令;将所述第二弹性伸缩命令发送至创建了所述第二弹性伸缩组的计算实例的业务服务器。
- 如权利要求7或8所述的管理方法,其特征在于,所述第二弹性伸缩组配置消息还包括冲突解决策略,所述冲突解决策略指示弹性伸缩策略信息的优先级,所述管理方法还包括:发送所述冲突解决策略至重叠业务服务器,所述重叠业务服务器包括于所述第一弹性伸缩组的层级信息指示的层级且包括于所述第二弹性伸缩组的层级信息指示的层级。
- 一种弹性伸缩组的管理方法,其特征在于,所述管理方法应用于业务服务器,所 述业务服务器包括于第一弹性伸缩组的层级信息指示的层级且包括于所述第二弹性伸缩组的层级信息指示的层级,所述管理方法包括:接收弹性伸缩组管理装置下发的第一弹性伸缩命令,所述第一弹性伸缩命令是所述弹性伸缩组管理装置根据接收到的第一弹性伸缩组配置消息中包括的第一弹性伸缩策略信息生成的;接收所述弹性伸缩组管理装置下发的第二弹性伸缩命令,所述第二弹性伸缩命令是所述弹性伸缩组管理装置根据接收到的第二弹性伸缩组配置消息中包括的第二弹性伸缩策略信息生成的;接收所述弹性伸缩组管理装置下发的冲突解决策略,所述冲突解决策略指示弹性伸缩策略信息的优先级;根据所述冲突解决策略,选择执行所述第一弹性伸缩命令或所述第二弹性伸缩命令。
- 一种弹性伸缩组的管理装置,其特征在于,包括:通信模块,用于接收第一弹性伸缩组配置消息,所述第一弹性伸缩组配置消息包括所述第一弹性伸缩组的层级信息,所述第一弹性伸缩组的计算实例的初始化配置信息以及所述第一弹性伸缩组的弹性伸缩策略信息;处理模块,用于在所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第一弹性伸缩组的计算实例的初始化配置信息创建所述第一弹性伸缩组的计算实例;根据所述第一弹性伸缩组的弹性伸缩策略信息对所述第一弹性伸缩组的计算实例进行操作。
- 如权利要求11所述的管理装置,其特征在于,所述处理模块,用于根据所述第一弹性伸缩组的计算实例的运行情况和所述第一弹性伸缩组的弹性伸缩策略信息生成第一弹性伸缩命令;所述通信模块,用于将所述第一弹性伸缩命令发送至创建了所述第一弹性伸缩组的计算实例的业务服务器。
- 如权利要求11或12所述的管理装置,其特征在于,所述通信模块,还用于接收第二弹性伸缩组配置消息,所述第二弹性伸缩组配置消息包括所述第二弹性伸缩组的层级信息,所述第二弹性伸缩组的计算实例的初始化配置信息以及所述第二弹性伸缩组的弹性伸缩策略信息;所述处理模块,还用于在所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器中,根据所述第二弹性伸缩组的计算实例的初始化配置信息创建所述第二弹性伸缩组的计算实例,其中,所述第一弹性伸缩组的层级信息指示的层级包括的业务服务器和所述第二弹性伸缩组的层级信息指示的层级包括的业务服务器有重叠;根据所述第二弹性伸缩组的弹性伸缩策略信息对所述第二弹性伸缩组的计算实例进行操作。
- 如权利要求13所述的管理装置,其特征在于,所述处理模块,用于根据所述第二弹性伸缩组的计算实例的运行情况和所述第二弹性伸缩组的弹性伸缩策略信息生成第二弹性伸缩命令;所述通信模块,用于将所述第二弹性伸缩命令发送至创建了所述第二弹性伸缩组的计算实例的业务服务器。
- 如权利要求13或14所述的管理装置,其特征在于,所述通信模块,用于发送所 述冲突解决策略至重叠业务服务器,所述重叠业务服务器包括于所述第一弹性伸缩组的层级信息指示的层级且包括于所述第二弹性伸缩组的层级信息指示的层级。
- 一种业务服务器,其特征在于,所述业务服务器包括于第一弹性伸缩组的层级信息指示的层级且包括于所述第二弹性伸缩组的层级信息指示的层级,所述业务服务器包括:通信模块,用于接收弹性伸缩组管理装置下发的第一弹性伸缩命令,所述第一弹性伸缩命令是所述弹性伸缩组管理装置根据接收到的第一弹性伸缩组配置消息中包括的第一弹性伸缩策略信息生成的;接收所述弹性伸缩组管理装置下发的第二弹性伸缩命令,所述第二弹性伸缩命令是所述弹性伸缩组管理装置根据接收到的第二弹性伸缩组配置消息中包括的第二弹性伸缩策略信息生成的;接收所述弹性伸缩组管理装置下发的冲突解决策略,所述冲突解决策略指示弹性伸缩策略信息的优先级;处理模块,用于根据所述冲突解决策略,选择执行所述第一弹性伸缩命令或所述第二弹性伸缩命令。
- 一种服务器,其特征在于,包括存储器和至少一个处理器,所述存储器用于程序指令,所述至少一个处理器执行所述存储器中的程序指令以执行权利要求5至9中任一项所述的方法。
- 一种服务器,其特征在于,包括存储器和至少一个处理器,所述存储器用于程序指令,所述至少一个处理器执行所述存储器中的程序指令以执行权利要求10所述的方法。
- 一种非瞬态的可读存储介质,其特征在于,包括程序指令,当所述程序指令被计算机运行时,所述计算机执行如权利要求5至9中任一项所述的方法。
- 一种非瞬态的可读存储介质,其特征在于,包括程序指令,当所述程序指令被计算机运行时,所述计算机执行如权利要求10所述的方法。
- 一种计算机程序产品,其特征在于,包括程序指令,当所述程序指令被计算机运行时,所述计算机执行如权利要求5至9中任一项所述的方法。
- 一种计算机程序产品,其特征在于,包括程序指令,当所述程序指令被计算机运行时,所述计算机执行如权利要求10所述的方法。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP19955293.6A EP3896912A4 (en) | 2019-12-06 | 2019-12-06 | Auto scaling group management method and apparatus |
PCT/CN2019/123663 WO2021109125A1 (zh) | 2019-12-06 | 2019-12-06 | 一种弹性伸缩组的管理方法、装置 |
CN201980022021.4A CN113228565A (zh) | 2019-12-06 | 2019-12-06 | 一种弹性伸缩组的管理方法、装置 |
US17/516,486 US20220050719A1 (en) | 2019-12-06 | 2021-11-01 | Auto-scaling group management method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/123663 WO2021109125A1 (zh) | 2019-12-06 | 2019-12-06 | 一种弹性伸缩组的管理方法、装置 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/516,486 Continuation US20220050719A1 (en) | 2019-12-06 | 2021-11-01 | Auto-scaling group management method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021109125A1 true WO2021109125A1 (zh) | 2021-06-10 |
Family
ID=76220856
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/123663 WO2021109125A1 (zh) | 2019-12-06 | 2019-12-06 | 一种弹性伸缩组的管理方法、装置 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20220050719A1 (zh) |
EP (1) | EP3896912A4 (zh) |
CN (1) | CN113228565A (zh) |
WO (1) | WO2021109125A1 (zh) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106209409A (zh) * | 2015-05-07 | 2016-12-07 | 中国移动通信集团公司 | 一种基于虚拟网络功能vnf的调度消息处理方法及装置 |
CN107786587A (zh) * | 2016-08-25 | 2018-03-09 | 华为软件技术有限公司 | 一种调整应用资源的方法以及云控制器 |
CN108540336A (zh) * | 2018-02-24 | 2018-09-14 | 国家计算机网络与信息安全管理中心 | 一种弹性伸缩调度方法和装置 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9161064B2 (en) * | 2012-08-23 | 2015-10-13 | Adobe Systems Incorporated | Auto-scaling management of web content |
US10355942B1 (en) * | 2014-09-29 | 2019-07-16 | Amazon Technologies, Inc. | Scaling of remote network directory management resources |
US10411960B1 (en) * | 2014-11-12 | 2019-09-10 | Amazon Technologies, Inc. | Detaching instances from auto-scaling group |
US9647889B1 (en) * | 2014-11-12 | 2017-05-09 | Amazon Technologies, Inc. | Standby instances for auto-scaling groups |
US10038640B2 (en) * | 2015-04-30 | 2018-07-31 | Amazon Technologies, Inc. | Managing state for updates to load balancers of an auto scaling group |
US10412020B2 (en) * | 2015-04-30 | 2019-09-10 | Amazon Technologies, Inc. | Background processes in update load balancers of an auto scaling group |
US9848041B2 (en) * | 2015-05-01 | 2017-12-19 | Amazon Technologies, Inc. | Automatic scaling of resource instance groups within compute clusters |
US9880880B2 (en) * | 2015-06-26 | 2018-01-30 | Amazon Technologies, Inc. | Automatic scaling of computing resources using aggregated metrics |
US10660361B1 (en) * | 2015-11-24 | 2020-05-26 | Zachary S. Drennan | Green smoking tips and methods of manufacture |
US10135712B2 (en) * | 2016-04-07 | 2018-11-20 | At&T Intellectual Property I, L.P. | Auto-scaling software-defined monitoring platform for software-defined networking service assurance |
US11038986B1 (en) * | 2016-09-29 | 2021-06-15 | Amazon Technologies, Inc. | Software-specific auto scaling |
CN109766174B (zh) * | 2018-12-24 | 2021-04-16 | 杭州数梦工场科技有限公司 | 资源调度方法、资源调度装置和计算机可读存储介质 |
CN110427250A (zh) * | 2019-07-30 | 2019-11-08 | 无锡华云数据技术服务有限公司 | 创建云主机实例、弹性伸缩组的方法、装置、设备及介质 |
US10985979B2 (en) * | 2019-08-13 | 2021-04-20 | Verizon Patent And Licensing Inc. | Method and system for resource management based on machine learning |
-
2019
- 2019-12-06 EP EP19955293.6A patent/EP3896912A4/en active Pending
- 2019-12-06 WO PCT/CN2019/123663 patent/WO2021109125A1/zh unknown
- 2019-12-06 CN CN201980022021.4A patent/CN113228565A/zh active Pending
-
2021
- 2021-11-01 US US17/516,486 patent/US20220050719A1/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106209409A (zh) * | 2015-05-07 | 2016-12-07 | 中国移动通信集团公司 | 一种基于虚拟网络功能vnf的调度消息处理方法及装置 |
CN107786587A (zh) * | 2016-08-25 | 2018-03-09 | 华为软件技术有限公司 | 一种调整应用资源的方法以及云控制器 |
CN108540336A (zh) * | 2018-02-24 | 2018-09-14 | 国家计算机网络与信息安全管理中心 | 一种弹性伸缩调度方法和装置 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3896912A4 * |
Also Published As
Publication number | Publication date |
---|---|
US20220050719A1 (en) | 2022-02-17 |
CN113228565A (zh) | 2021-08-06 |
EP3896912A1 (en) | 2021-10-20 |
EP3896912A4 (en) | 2022-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11429408B2 (en) | System and method for network function virtualization resource management | |
CN111796908B (zh) | 一种资源自动弹性伸缩的系统、方法及云平台 | |
WO2020135799A1 (zh) | Vnf服务实例化方法及装置 | |
US20150339156A1 (en) | Managing virtual machine migration | |
EP3503472B1 (en) | Method for managing slice instance and apparatus | |
EP3584998B1 (en) | Method for virtual machine capacity expansion and reduction and virtual management device | |
WO2018006676A1 (zh) | 加速资源处理方法、装置及网络功能虚拟化系统 | |
CN106856438B (zh) | 一种网络业务实例化的方法、装置及nfv系统 | |
WO2016183799A1 (zh) | 一种硬件加速方法以及相关设备 | |
US20240354150A1 (en) | Rightsizing virtual machine deployments in a cloud computing environment | |
JP2018512001A (ja) | 仮想化ネットワーク機能を管理するための方法及び装置 | |
CN105308553A (zh) | 动态提供存储 | |
WO2017011938A1 (zh) | 虚拟网络功能扩容的方法和装置 | |
WO2017041650A1 (zh) | 用于扩展分布式一致性服务的方法和设备 | |
WO2021109125A1 (zh) | 一种弹性伸缩组的管理方法、装置 | |
TWI608377B (zh) | 監控管理系統及方法 | |
WO2024051236A1 (zh) | 资源调度方法及其相关设备 | |
WO2017070963A1 (zh) | 一种虚拟资源的部署方法、装置及系统 | |
CN112241293A (zh) | 工业互联网云平台的应用管理方法、装置、设备及介质 | |
WO2017128820A1 (zh) | 一种虚拟化网络功能的管理方法、网络设备及系统 | |
WO2022142515A1 (zh) | 管理实例的方法、装置以及云应用引擎 | |
CN115328608A (zh) | 一种Kubernetes容器垂直伸缩调节方法和装置 | |
US20240348513A1 (en) | Slice-driven deployment of network functions | |
CN108885558B (zh) | 通过事件触发器合并来降低系统能耗 | |
WO2024002190A1 (zh) | 基于监控器的容器调整方法、设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2019955293 Country of ref document: EP Effective date: 20210712 |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19955293 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |