CN113032134A - Method and device for realizing cloud computing resource allocation and cloud management server - Google Patents

Method and device for realizing cloud computing resource allocation and cloud management server Download PDF

Info

Publication number
CN113032134A
CN113032134A CN201911344153.7A CN201911344153A CN113032134A CN 113032134 A CN113032134 A CN 113032134A CN 201911344153 A CN201911344153 A CN 201911344153A CN 113032134 A CN113032134 A CN 113032134A
Authority
CN
China
Prior art keywords
instance
inventory
instance specification
available area
specifications
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911344153.7A
Other languages
Chinese (zh)
Other versions
CN113032134B (en
Inventor
田永军
贺荣徽
余洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201911344153.7A priority Critical patent/CN113032134B/en
Publication of CN113032134A publication Critical patent/CN113032134A/en
Application granted granted Critical
Publication of CN113032134B publication Critical patent/CN113032134B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Stored Programmes (AREA)

Abstract

The application discloses a method and a device for realizing cloud computing resource allocation and a cloud management server, which realize that computing resources are automatically created for a user according to corresponding instance specifications acceptable by the user to complete computation according to the real-time inventory condition of public cloud services, fully utilize the elasticity of a single data center, namely a single available area, and effectively help the user to obtain the computing resources as soon as possible without the problem of waiting for resource release caused by the single instance specifications.

Description

Method and device for realizing cloud computing resource allocation and cloud management server
Technical Field
The present application relates to, but not limited to, cloud computing technologies, and in particular, to a method and an apparatus for implementing cloud computing resource allocation, and a cloud management server.
Background
The most important advantage of cloud computing is flexible expansion and contraction, which allows users to use computing resources, storage resources, etc. like water, electricity, coal, and in all, allows users to consider resources on the cloud as unlimited.
A public cloud computing platform generally deploys a plurality of data centers in a plurality of regions, the data centers are also called as Available Zones (AZ), each machine room may deploy different instance specifications, and the inventory of each instance specification changes in real time and is different. For a user of High Performance Computing (HPC), the cluster size is large, and in order to pursue Performance, resources requested by the user are generally deployed in the same data center in a centralized manner. Therefore, even on a cloud computing platform, the problem of cluster capacity expansion failure caused by insufficient inventory can be encountered, and the computing time of the computing task is prolonged.
In the existing scheme for realizing resource allocation, only one specification example can be automatically stretched, and the capacity expansion is limited in one region. In this case, for a user who cannot request the resource, the user waits until the resource in the data center is released and available.
Disclosure of Invention
The application provides a method and a device for realizing cloud computing resource allocation and a cloud management server, which can give full play to public cloud elasticity, thereby ensuring computing performance.
The embodiment of the invention provides a method for realizing cloud computing resource allocation, which comprises the following steps:
the cloud management server determines different types of instance specifications matched with the new task according to resource demand configuration information configured in advance by a user;
according to the determined priority sequence of the example specifications of different types, inquiring whether the inventory of the current priority example specification in the available area is sufficient;
when the sufficient inventory of the current priority instance specification in the available area is inquired, selecting the current priority instance specification to create computing resources for a new task of a user; when the query finds that the stock of the current priority instance specification in the available area is insufficient, the query of the stock of the next priority instance specification is continued.
In one illustrative example, the method further comprises, prior to:
the cloud management server receives the pre-configured resource demand configuration information;
the resource requirement configuration information comprises one or more resource configuration items, and each resource configuration item comprises one or more different types of instance specifications.
In one illustrative example, the usable area includes one usable area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined different types of instance specifications.
In one illustrative example, the method further comprises:
if the stock of the current priority instance specification in the available area is determined to be sufficient, establishing computing resources in the cluster according to the current instance specification, running the new operation, and ending the process;
if the stock of the current priority instance specification in the available area is determined to be insufficient, returning to the step of inquiring, inquiring the stock of the next priority instance specification of the current priority instance specification until the stock of all the different types of instance specifications in the resource configuration item is inquired, and ending the process.
In one illustrative example, the available regions include two or more and all belong to the same geographical area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory corresponding to the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined different types of instance specifications; the available area corresponding to each resource configuration item comprises one or more than one, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
In one illustrative example, the method further comprises:
if the inventory of the current priority instance specification in the corresponding available area is determined to be sufficient according to different resource configuration items, establishing computing resources in the cluster according to the current instance specification, running the new operation, and ending the process;
if the inventory of the current priority instance specification in the corresponding available area is determined to be insufficient, returning to the step of inquiring, inquiring the inventory of the next priority instance specification of the current priority instance specification until the inventory of all different types of instance specifications in the resource configuration item is inquired, and ending the process.
In one illustrative example, the available regions include two or more and are distributed over different geographical areas; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available area of the corresponding cluster is sufficient or not from the determined instance specification with the highest priority in the different types of instance specifications; and one or more available areas corresponding to each resource configuration item are distributed in one or more clusters, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
In one illustrative example, the method further comprises:
if the fact that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is sufficient is determined for different resource configuration items, computing resources are created in the cluster according to the current instance specification, the new operation is run, and the process is ended;
if the fact that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is insufficient is determined, the step of inquiring is returned, the inventory of the next priority instance specification of the current priority instance specification is inquired until the inventory of all different types of instance specifications in the resource configuration item is inquired, and the process is ended.
The application also provides a computer-readable storage medium, which stores computer-executable instructions for executing any one of the above methods for realizing cloud computing resource allocation.
The application further provides an apparatus for implementing cloud computing resource allocation, comprising a memory and a processor, wherein the memory stores the following instructions executable by the processor: for performing the steps of any of the above described methods of implementing cloud computing resource allocation.
The present application further provides a cloud management server, including: the device comprises a determining module, a query module and a processing module; wherein the content of the first and second substances,
the determining module is used for determining different types of example specifications of new task configurable according to the resource demand configuration information configured in advance by the user;
the query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient according to the determined priority sequence of the different types of instance specifications;
the processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the sufficient inventory of the current priority instance specification in the available area is inquired; when the query result shows that the inventory of the current priority instance specification in the available area is insufficient, the return query module continues to query the inventory of the next priority instance specification.
According to the method and the device, the computing resources are automatically created for the user according to the corresponding instance specification accepted by the user to complete the computation according to the real-time inventory condition of the public cloud service, the elasticity of a single data center, namely a single available area, is fully utilized, the problem of waiting for resource release caused by the single instance specification is solved, and the user is effectively helped to obtain the computing resources as soon as possible.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the claimed subject matter and are incorporated in and constitute a part of this specification, illustrate embodiments of the subject matter and together with the description serve to explain the principles of the subject matter and not to limit the subject matter.
FIG. 1 is a flow chart of a method for implementing cloud computing resource allocation according to the present application;
FIG. 2 is a schematic view of a first embodiment of a cloud computing resource allocation implementation according to the present application;
FIG. 3 is a schematic flow chart of a first embodiment of the present application;
FIG. 4 is a schematic view of a second embodiment of a cloud computing resource allocation implementation according to the present application;
FIG. 5 is a schematic view of a third embodiment of a cloud computing resource allocation implementation according to the present application;
FIG. 6 is a schematic view of a fourth embodiment of cloud computing resource allocation according to the present application;
fig. 7 is a schematic structural diagram of a cloud management server according to the present application;
fig. 8 is a schematic view of an application scenario for implementing cloud computing resource allocation according to the present application.
Detailed Description
To make the objects, technical solutions and advantages of the present application more apparent, embodiments of the present application will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments and features of the embodiments in the present application may be arbitrarily combined with each other without conflict.
In one exemplary configuration of the present application, a computing device includes one or more processors (CPUs), input/output interfaces, a network interface, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer readable media does not include non-transitory computer readable media (transient media), such as modulated data signals and carrier waves.
The steps illustrated in the flow charts of the figures may be performed in a computer system such as a set of computer-executable instructions. Also, while a logical order is shown in the flow diagrams, in some cases, the steps shown or described may be performed in an order different than here.
Fig. 1 is a flowchart of a method for implementing cloud computing resource allocation according to the present application, and as shown in fig. 1, the method includes:
step 100: and the cloud management server determines different types of instance specifications matched with the new task according to the resource requirement configuration information configured in advance by the user.
There are many types of instance specifications on the public cloud, such as general, computing, memory, high dominant frequency, etc. In an exemplary embodiment, the method further comprises the following steps:
the cloud management server receives pre-configured resource demand configuration information reported by a user (such as a cluster administrator), wherein the resource demand configuration information comprises one or more resource configuration items, and each resource configuration item comprises one or more different types of instance specifications.
In an exemplary embodiment, a user may set resource configuration items in a queue manner, and set multiple types of instance specifications that can be expanded/reduced for each queue, such as: example specifications for capacity expansion/contraction in the first queue include: specification 1, specification 2; the following steps are repeated: example specifications for capacity expansion/contraction in the second queue include: specification 3, Specification 4, etc.
In combination with practical application scenarios, such as: the first queue is configured as a 32-check example, then the user may sequentially expand the following examples according to the priority setting according to the characteristics of the computing task: general type (32-core 128GB memory) -high main frequency (32-core 128GB memory) -calculation type (32-core 64GB memory) -memory type (32-core 256GB memory). The following steps are repeated: the second queue is configured as a 16-core instance, and then, according to the characteristics of the computing task, the user can sequentially set the following instances of capacity expansion according to the priority: general type (16-core 64GB memory) -high main frequency (16-core 64GB memory) -calculation type (16-core 32GB memory) -memory type (16-core 128GB memory). It should be noted that the description is only for illustration and is not intended to limit the scope of the present application.
In an exemplary embodiment, the resource requirement configuration information includes two or more resource configuration items, that is, multiple instance specifications can be expanded/reduced in multiple available areas, so that the elastic advantage of the public cloud is fully exerted.
In an exemplary embodiment, step 100 may be preceded by:
the cloud management server acquires the job information in the cluster according to a preset period or a specified time period, and learns that a new job waits for the creation of computing resources to complete computing from the job information.
Step 101: and inquiring whether the inventory of the current priority instance specification in the available area is sufficient according to the determined priority order of the different types of instance specifications.
In an exemplary embodiment, for a case where one cluster corresponds to one usable area, such as cluster 1 in fig. 8, the usable area in this step includes one usable area; this step may include:
and for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the determined instance specification with the highest priority in the different types of instance specifications.
In an exemplary embodiment, for a case where one cluster corresponds to multiple available areas, such as the cluster 2 and the cluster 3 in fig. 8, the available areas in this step include two or more available areas, and both of the available areas belong to the same region; this step may include:
aiming at different resource configuration items, the cloud management server inquires whether the inventory of the current example specification in the corresponding available area is sufficient or not from the example specification with the highest priority in the determined example specifications of different types; the available area corresponding to each resource configuration item comprises one or more than one, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
In an exemplary embodiment, an available area may be used to expand/reduce an instance specification included in one resource configuration item, or may be used to expand/reduce an instance specification included in two or more resource configuration items.
In an exemplary example, for a case where a plurality of clusters, such as cluster 1, cluster 2, and cluster 3 shown in fig. 8, each cluster corresponds to one or more available areas, the available areas in this step include two or more available areas and are distributed in different geographical areas; this step may include:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available area of the corresponding cluster is sufficient or not from the determined instance specification with the highest priority in the different types of instance specifications; and one or more available areas corresponding to each resource configuration item are distributed in one or more clusters, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
Step 102: when the sufficient inventory of the current priority instance specification in the available area is inquired, selecting the current priority instance specification to create computing resources for a new task of a user; when the query finds that the inventory of the current priority instance specification in the available area is insufficient, the step 101 is returned to continue to query the inventory of the next priority instance specification.
In one illustrative example, the available region includes one available region; this step may include:
if the stock of the current priority instance specification in the available area is determined to be sufficient, establishing computing resources in the cluster according to the current instance specification, running new operation, and ending the process;
if the determination is made that the inventory of the current priority instance specification in the available area is not sufficient, the process returns to step 101 to query the inventory of the next priority instance specification of the current priority instance specification until all the inventories of the different types of instance specifications in the resource configuration item are queried, and the process is ended.
Therefore, according to the real-time inventory condition of the public cloud service, the computing resources are automatically created for the user according to the corresponding instance specification accepted by the user to complete the computation, the elasticity of a single data center, namely a single available area, is fully utilized, the problem of waiting for resource release caused by the single instance specification is avoided, and the user is effectively helped to obtain the computing resources as soon as possible.
In one illustrative example, the available regions include two or more and all belong to the same geographical area; this step may include:
aiming at different resource configuration items, if the fact that the inventory of the current priority instance specification in the corresponding available area is sufficient is determined, computing resources are created in the cluster according to the current instance specification, new operation is carried out, and the process is finished;
if the determination result shows that the inventory of the current priority instance specification in the corresponding available area is not sufficient, the process returns to step 101 to query the inventory of the next priority instance specification of the current priority instance specification until all the inventories of the different types of instance specifications in the resource configuration item are queried, and the process is ended.
Therefore, by configuring multiple data centers, namely multiple instance specifications of multiple available areas, in the resource demand configuration information, different capacity expansion/reduction demands of users are met by the multiple data centers of one cluster, and the performance of the cluster is ensured.
In one illustrative example, the available areas include two or more and are distributed over different geographical areas; this step may include:
aiming at different resource configuration items, if the fact that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is sufficient is determined, computing resources are created in the cluster according to the current instance specification, new operation is run, and the process is finished;
if the determination result shows that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is not sufficient, the process returns to step 101, the inventory of the next priority instance specification of the current priority instance specification is queried until the inventory of all different types of instance specifications in the resource configuration item is queried, and the process is ended.
Therefore, by configuring multiple instance specifications of multiple data centers in multiple regions, namely multiple instance specifications of multiple available areas of multiple clusters in the resource demand configuration information, different capacity expansion/capacity reduction demands of users are met by the multiple data centers in the multiple regions, and the performance of the clusters is ensured.
By the method for realizing cloud computing resource allocation, public cloud elasticity is fully exerted, and computing performance is guaranteed. Particularly, under the condition that a user has an urgent computing task, the method effectively helps the user to obtain computing resources as soon as possible.
The application also provides a computer-readable storage medium storing computer-executable instructions for executing any one of the above methods for implementing cloud computing resource allocation.
The present application further provides an apparatus for implementing cloud computing resource allocation, including a memory and a processor, where the memory stores the following instructions executable by the processor: for performing the steps of any of the above described methods of implementing cloud computing resource allocation.
The technical solution of the present application is described in detail below with reference to specific embodiments.
A first embodiment, fig. 2 is a schematic view of a scenario of a first embodiment for implementing cloud computing resource allocation according to the present application, where the first embodiment takes as an example that multiple instance specifications are expanded in an available area of a cluster (i.e. the same region), and it is assumed that a user sets multiple instance specifications in a queue, for example: the first queue (corresponding to the first resource configuration item) is configured as a 32-core instance, and it is assumed that a user sequentially expands the following instances according to the characteristics of the computing task and the priority setting: example specification 11 is a general-purpose type (32-core 128GB memory) -example specification 12 is a high main frequency (32-core 128GB memory), and as shown in the flowchart of the first embodiment shown in fig. 3, the method includes:
step 300: and setting resource requirement configuration information by a user (cluster administrator) and submitting the resource requirement configuration information to the cloud management server.
In this embodiment, the resource allocation items and the instance specification included in each queue are set in a queue manner, and it is assumed that, in the first embodiment, a first queue is set, and the instance specifications 11 and 12 are expanded according to the priority, and a second queue is expanded according to the priority, and the instance specifications 21 and 22 are expanded.
Step 301: if the user submits the job to the first queue of the cluster service, the cloud management server can acquire the job information on the first queue from the cluster service at regular time, and know that a new job waits for the creation of the computing resource in the first queue to complete the computation.
Step 302: according to resource demand configuration information configured by a user, the cloud management server determines an example specification 11 and an example specification 12 of a new task needing expansion; the cloud management server queries the cloud service for real-time inventory conditions of the instance specification 11 in the currently available region, assuming that the instance specification 11 returned by the cloud service is inventory-free.
Step 303: the cloud management server queries the cloud service for real-time inventory conditions of the instance specification 12 in the currently available region, assuming that the instance specification 12 returned by the cloud service is in inventory.
Step 304: the cloud management server initiates the creation of a compute node of the instance specification 12 to the cloud service and automatically joins the compute node to the cluster to run the new job.
In general, there are multiple types of instance specifications on a public cloud, such as general, computing, memory, high dominant frequency, and so on. The first embodiment realizes that the calculation is completed by automatically creating the calculation resources for the user according to the corresponding instance specification accepted by the user according to the inventory real-time condition of the public cloud service, fully utilizes the elasticity of a single data center, namely a single available area, and effectively helps the user to obtain the calculation resources as soon as possible without the problem of waiting for resource release caused by the single instance specification
A second embodiment, fig. 4 is a schematic view of a scenario of the second embodiment for implementing cloud computing resource allocation according to the present application, where the second embodiment uses, as an example, multiple instance specifications that are expanded in multiple available areas of a cluster, and the expanded instance specifications provided by the available areas are different, and it is assumed that a user sets resource requirement configuration information and submits the resource requirement configuration information to a cloud management server.
When the cloud management server learns that a user submits a new job to the first queue, the cloud management server sequentially inquires the instance specification 11 and the inventory condition of the instance specification 12 in the available area 1 of the cloud service according to a corresponding first resource configuration item in the resource demand configuration information, namely the instance specification sequence configured by the first queue, and then automatically expands the capacity; when the cloud management server learns that the user submits a new job to the second queue, the cloud management server inquires the instance specification 23 in the available area 3 from the cloud service in sequence according to a corresponding second resource configuration item in the resource demand configuration information, namely the instance specification sequence configured by the second queue, and then the inventory of the instance specification 24 is automatically expanded.
Typically, there are differences in the specifications of instances provided by different available areas on a public cloud, or there may be large differences in inventory, such as: in this embodiment, the size of the expansion instances provided by the available area 1 and the available area 2 in one cluster are different. The second embodiment configures multiple instance specifications of multiple data centers, that is, multiple available areas, in the resource demand configuration information, thereby implementing that multiple data centers of a cluster meet different capacity expansion/capacity reduction demands of users, and ensuring the performance of the cluster.
Fig. 5 is a schematic view of a third embodiment of implementing cloud computing resource allocation according to the present application, where the third embodiment is the same as the second embodiment in that multiple instance specifications are expanded in multiple available areas of a cluster, and different from the second embodiment, each instance specification that is expandable in an available area is included in one or more resource configuration items, that is, a single queue may attempt to expand multiple available areas.
It should be noted that, for inter-node communication across an available area, there is a network delay compared to inter-node communication within the same available area, and therefore, the present application further includes:
through the management of the cluster scheduling queue, a single queue is limited in an available area for expansion, and after a user submits a new job to the queue, the new job is guaranteed to be only operated on the node of the same available area through scheduling, so that the computing performance is further guaranteed.
Fig. 6 is a scene schematic diagram of a fourth embodiment for implementing cloud computing resource allocation according to the present application, and as shown in fig. 6, for a demand of a very large scale cluster, capacity expansion across regions may be performed, that is, a cloud management server may meet a capacity expansion demand of a user between different regions. The specific implementation in different regions, such as the region a and the region B in fig. 6, may adopt any combination of the implementation manners in the first embodiment to the third embodiment.
The fourth embodiment configures multiple instance specifications of multiple data centers in multiple regions, that is, multiple instance specifications of multiple available areas of multiple clusters, in the resource demand configuration information, thereby implementing different capacity expansion/reduction demands of users in multiple data centers in multiple regions, and ensuring cluster performance.
Fig. 7 is a schematic structural diagram of a cloud management server according to the present application, as shown in fig. 7, at least including: the device comprises a determining module, a query module and a processing module; wherein the content of the first and second substances,
the determining module is used for determining different types of example specifications of new task configurable according to the resource demand configuration information configured in advance by the user;
the query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient according to the determined priority sequence of the different types of instance specifications;
the processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the sufficient inventory of the current priority instance specification in the available area is inquired; when the query result shows that the inventory of the current priority instance specification in the available area is insufficient, the return query module continues to query the inventory of the next priority instance specification.
In one illustrative example, the determining module is further configured to:
receiving resource demand configuration information pre-configured by a user (such as a cluster administrator) reported by the user, wherein the resource demand configuration information comprises one or more resource configuration items, and each resource configuration item comprises one or more different types of instance specifications.
Although the embodiments disclosed in the present application are described above, the descriptions are only for the convenience of understanding the present application, and are not intended to limit the present application. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the disclosure as defined by the appended claims.

Claims (11)

1. A method of implementing cloud computing resource allocation, comprising:
the cloud management server determines different types of instance specifications matched with the new task according to resource demand configuration information configured in advance by a user;
according to the determined priority sequence of the example specifications of different types, inquiring whether the inventory of the current priority example specification in the available area is sufficient;
when the sufficient inventory of the current priority instance specification in the available area is inquired, selecting the current priority instance specification to create computing resources for a new task of a user; when the query finds that the stock of the current priority instance specification in the available area is insufficient, the query of the stock of the next priority instance specification is continued.
2. The method of claim 1, further comprising, prior to the method:
the cloud management server receives the pre-configured resource demand configuration information;
the resource requirement configuration information comprises one or more resource configuration items, and each resource configuration item comprises one or more different types of instance specifications.
3. The method of claim 2, wherein the available region comprises one available region; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined different types of instance specifications.
4. The method of claim 3, wherein the method further comprises:
if the stock of the current priority instance specification in the available area is determined to be sufficient, establishing computing resources in the cluster according to the current instance specification, running the new operation, and ending the process;
if the stock of the current priority instance specification in the available area is determined to be insufficient, returning to the step of inquiring, inquiring the stock of the next priority instance specification of the current priority instance specification until the stock of all the different types of instance specifications in the resource configuration item is inquired, and ending the process.
5. The method of claim 2, wherein the available regions comprise two or more and all belong to the same geographical area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory corresponding to the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined different types of instance specifications; the available area corresponding to each resource configuration item comprises one or more than one, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
6. The method of claim 5, wherein the method further comprises:
if the inventory of the current priority instance specification in the corresponding available area is determined to be sufficient according to different resource configuration items, establishing computing resources in the cluster according to the current instance specification, running the new operation, and ending the process;
if the inventory of the current priority instance specification in the corresponding available area is determined to be insufficient, returning to the step of inquiring, inquiring the inventory of the next priority instance specification of the current priority instance specification until the inventory of all different types of instance specifications in the resource configuration item is inquired, and ending the process.
7. The method of claim 2, wherein the available areas comprise two or more and are distributed over different geographical areas; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available area of the corresponding cluster is sufficient or not from the determined instance specification with the highest priority in the different types of instance specifications; and one or more available areas corresponding to each resource configuration item are distributed in one or more clusters, and the example specifications provided by the available areas corresponding to different resource configuration items are different.
8. The method of claim 7, wherein the method further comprises:
if the fact that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is sufficient is determined for different resource configuration items, computing resources are created in the cluster according to the current instance specification, the new operation is run, and the process is ended;
if the fact that the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is insufficient is determined, the step of inquiring is returned, the inventory of the next priority instance specification of the current priority instance specification is inquired until the inventory of all different types of instance specifications in the resource configuration item is inquired, and the process is ended.
9. A computer-readable storage medium storing computer-executable instructions for performing the method of implementing cloud computing resource allocation recited in any one of claims 1-8.
10. An apparatus for implementing cloud computing resource allocation, comprising a memory and a processor, wherein the memory has stored therein the following instructions executable by the processor: for performing the steps of any one of claims 1 to 8 of the method of implementing cloud computing resource allocation.
11. A cloud management server, comprising: the device comprises a determining module, a query module and a processing module; wherein the content of the first and second substances,
the determining module is used for determining different types of example specifications of new task configurable according to the resource demand configuration information configured in advance by the user;
the query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient according to the determined priority sequence of the different types of instance specifications;
the processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the sufficient inventory of the current priority instance specification in the available area is inquired; when the query result shows that the inventory of the current priority instance specification in the available area is insufficient, the return query module continues to query the inventory of the next priority instance specification.
CN201911344153.7A 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server Active CN113032134B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911344153.7A CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911344153.7A CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Publications (2)

Publication Number Publication Date
CN113032134A true CN113032134A (en) 2021-06-25
CN113032134B CN113032134B (en) 2024-06-18

Family

ID=76451449

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911344153.7A Active CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Country Status (1)

Country Link
CN (1) CN113032134B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115379000A (en) * 2022-10-26 2022-11-22 小米汽车科技有限公司 Service platform capacity expansion method and device, electronic equipment and storage medium
WO2023179170A1 (en) * 2022-03-24 2023-09-28 华为云计算技术有限公司 Instance configuration method and apparatus, and device
CN117971512A (en) * 2024-04-02 2024-05-03 杭州骋风而来数字科技有限公司 Intelligent power calculation scheduling system and method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
WO2018077079A1 (en) * 2016-10-31 2018-05-03 阿里巴巴集团控股有限公司 Application capacity enlargement method, apparatus and system
CN108337109A (en) * 2017-12-28 2018-07-27 中兴通讯股份有限公司 A kind of resource allocation methods and device and resource allocation system
WO2018149263A1 (en) * 2017-02-15 2018-08-23 华为技术有限公司 Method and apparatus for instantiating network service
CN109766175A (en) * 2018-12-28 2019-05-17 深圳晶泰科技有限公司 Resource elastic telescopic system and its dispatching method towards high-performance calculation on cloud

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
WO2018077079A1 (en) * 2016-10-31 2018-05-03 阿里巴巴集团控股有限公司 Application capacity enlargement method, apparatus and system
WO2018149263A1 (en) * 2017-02-15 2018-08-23 华为技术有限公司 Method and apparatus for instantiating network service
CN108337109A (en) * 2017-12-28 2018-07-27 中兴通讯股份有限公司 A kind of resource allocation methods and device and resource allocation system
CN109766175A (en) * 2018-12-28 2019-05-17 深圳晶泰科技有限公司 Resource elastic telescopic system and its dispatching method towards high-performance calculation on cloud

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023179170A1 (en) * 2022-03-24 2023-09-28 华为云计算技术有限公司 Instance configuration method and apparatus, and device
CN115379000A (en) * 2022-10-26 2022-11-22 小米汽车科技有限公司 Service platform capacity expansion method and device, electronic equipment and storage medium
CN115379000B (en) * 2022-10-26 2023-03-24 小米汽车科技有限公司 Service platform capacity expansion method and device, electronic equipment and storage medium
CN117971512A (en) * 2024-04-02 2024-05-03 杭州骋风而来数字科技有限公司 Intelligent power calculation scheduling system and method

Also Published As

Publication number Publication date
CN113032134B (en) 2024-06-18

Similar Documents

Publication Publication Date Title
CN113032134B (en) Method and device for realizing cloud computing resource allocation and cloud management server
CN108280080B (en) Data synchronization method and device and electronic equipment
CN108282514B (en) Distributed service establishing method and device
CN115328663A (en) Method, device, equipment and storage medium for scheduling resources based on PaaS platform
CN111399764B (en) Data storage method, data reading device, data storage equipment and data storage medium
CN110908774B (en) Resource scheduling method, equipment, system and storage medium
CN112202853B (en) Data synchronization method, system, computer device and storage medium
EP3786802A1 (en) Method and device for failover in hbase system
CN104410666A (en) Method and system for implementing heterogeneous storage resource management under cloud computing
US20240061712A1 (en) Method, apparatus, and system for creating training task on ai training platform, and medium
CN112463290A (en) Method, system, apparatus and storage medium for dynamically adjusting the number of computing containers
CN107562803B (en) Data supply system and method and terminal
CN113296931A (en) Resource control method, system, computing device and storage medium
US11863612B2 (en) Network edge computing and network edge computation scheduling method, device and medium
CN106682198B (en) Method and device for realizing automatic database deployment
CN106790354B (en) Communication method and device for preventing data congestion
CN111431951B (en) Data processing method, node equipment, system and storage medium
CN110018986B (en) Abnormal snapshot identification method and device
CN113630317B (en) Data transmission method and device, nonvolatile storage medium and electronic device
CN115134373A (en) Data synchronization method and device, storage medium and electronic equipment
CN105760215A (en) Map-reduce model based job running method for distributed file system
CN111722783B (en) Data storage method and device
CN113656496A (en) Data processing method and system
AU2018290375B2 (en) Resource allocation method and system
CN112559164A (en) Resource sharing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant