CN116610457B - Resource scheduling method for AI cloud computing server group - Google Patents

Resource scheduling method for AI cloud computing server group Download PDF

Info

Publication number
CN116610457B
CN116610457B CN202310891636.9A CN202310891636A CN116610457B CN 116610457 B CN116610457 B CN 116610457B CN 202310891636 A CN202310891636 A CN 202310891636A CN 116610457 B CN116610457 B CN 116610457B
Authority
CN
China
Prior art keywords
computing
client
computing resource
resource
virtual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310891636.9A
Other languages
Chinese (zh)
Other versions
CN116610457A (en
Inventor
吕超星
丁鹏
吴清忠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Wanjie Data Technology Co ltd Wuhan Branch
Beijing Wanjie Data Technology Co ltd
Original Assignee
Beijing Wanjie Data Technology Co ltd Wuhan Branch
Beijing Wanjie Data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Wanjie Data Technology Co ltd Wuhan Branch, Beijing Wanjie Data Technology Co ltd filed Critical Beijing Wanjie Data Technology Co ltd Wuhan Branch
Priority to CN202310891636.9A priority Critical patent/CN116610457B/en
Publication of CN116610457A publication Critical patent/CN116610457A/en
Application granted granted Critical
Publication of CN116610457B publication Critical patent/CN116610457B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5077Logical partitioning of resources; Management or configuration of virtualized resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5022Mechanisms to release resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5011Pool
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the field of data processing, and discloses a resource scheduling method for an AI cloud computing server group, which comprises the steps of obtaining historical computing task data of a client in a cloud database, obtaining computing task characteristics of the client according to the historical computing task data of the client, and generating a virtual computing resource pool corresponding to the client according to the computing task characteristics of the client; the method comprises the steps that the occupation of computing resources of computing tasks uploaded by clients is obtained, a computing resource module manager wakes up computing resource modules of the corresponding computing tasks, the computing resource modules apply for the corresponding computing resources from an AI cloud computing server group, the computing tasks are processed, after the processing is completed, the computing resource modules release the computing resources, and in a set period, the capacity of a virtual computing resource pool of each corresponding client is adjusted according to the utilization rate and the utilization ratio, so that the resource scheduling of the AI cloud computing server group is completed. The invention realizes reasonable scheduling of the computational resources.

Description

Resource scheduling method for AI cloud computing server group
Technical Field
The invention relates to the field of data processing, in particular to a resource scheduling method for an AI cloud computing server group.
Background
The computing power resource scheduling refers to reasonably distributing and managing available computing resources under a distributed computing environment so as to realize efficient task execution and system performance optimization. In the fields of cloud computing, big data processing, artificial intelligence, distributed computing and the like, computing power resource scheduling is important for improving computing efficiency, reducing cost and improving user experience.
Therefore, how to reasonably schedule the computational resources and maximally utilize the existing computational resources is a problem that needs to be studied by the skilled person.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a resource scheduling method for an AI cloud computing server group, which comprises the following steps:
step one, acquiring historical calculation task data of a client in a cloud database, obtaining calculation task characteristics of the client according to the historical calculation task data of the client, and generating a virtual calculation resource pool corresponding to the client according to the calculation task characteristics of the client;
step two, acquiring the computing resource occupation of the computing task uploaded by the client, if the computing resource occupation is in the capacity range of the virtual computing resource pool of the corresponding client, matching whether a computing resource module of the corresponding computing task exists in the computing resource module sequence of the virtual computing resource pool of the corresponding client, if so, entering the step three, otherwise, entering the step four; if the occupation of the computing resources is not in the capacity range of the virtual computing resource pool of the corresponding client, entering a step five;
thirdly, the computing resource module manager wakes up a computing resource module of a corresponding computing task, the computing resource module applies for the corresponding computing resource from the AI cloud computing server group, processes the computing task, releases the computing resource after the processing is completed, and enters into dormancy to enter into step six;
step four, a computing resource module manager generates a computing resource module of a corresponding computing task in a virtual computing resource pool of a corresponding client, the computing resource module of the corresponding computing task calls computing resources from an AI cloud computing server group, the computing tasks are distributed to the computing resource modules of the corresponding computing tasks, the computing tasks are processed, after the processing is completed, the computing resource modules release the computing resources, the generated computing resource modules of the corresponding computing tasks are stored in a computing resource module sequence of the virtual computing resource pool of the corresponding client, and the step six is entered into;
step five, the computing resource module manager establishes the communication between the temporary computing resource module and the AI cloud computing server group, acquires computing resources corresponding to the computing tasks, processes the computing tasks, releases the computing resources after the processing is completed, and deletes the temporary computing resource module;
and step six, acquiring the utilization rate and the utilization ratio of the computing resources of the virtual computing resource pools of each corresponding client in a set period, and adjusting the capacity of the virtual computing resource pools of each corresponding client according to the utilization rate and the utilization ratio to complete the resource scheduling of the AI cloud computing server group.
Further, the obtaining the computing task characteristics of the client according to the historical computing task data of the client, and generating the virtual computing resource pool corresponding to the client according to the computing task characteristics of the client, includes: and in the set time length, the maximum value of the calculated force used in the calculation task uploaded by the client is the calculation task characteristic of the client, the range from the minimum value of the calculated force used in the calculation task uploaded by the client to the maximum value of the calculated force is the capacity of the virtual calculation resource pool, and the minimum value of the calculated force and the maximum value of the calculated force used in the calculation task uploaded by the client respectively form the lower limit of the calculated force and the upper limit of the calculated force of the virtual calculation resource pool.
Further, the obtaining the computing resource occupation of the computing task uploaded by the client includes obtaining the computing power required by the computing task uploaded by the client, that is, the computing resource occupation.
Further, the computing resource module sequence is the generated computing resource modules, and the computing resource modules are ordered according to the used computing power.
Further, in the set period, the obtaining the usage rate and the usage ratio of the computing resources in the virtual computing resource pool of each corresponding client, and adjusting the capacity of the virtual computing resource pool of each corresponding client according to the usage rate and the usage ratio, includes:
and after the set period is completed, respectively acquiring the calculation force use ranges from the calculation force use minimum value to the calculation force maximum value of the virtual calculation resource pools of the corresponding clients, and adjusting the capacity of the virtual calculation resource pools of the corresponding clients to the calculation force use ranges.
The beneficial effects of the invention are as follows: by the technical scheme provided by the invention, the corresponding computing resource scheduling can be realized for the use conditions of computing resources of different clients, the reasonable scheduling of the computing resources is realized, and the existing computing resources are utilized to the greatest extent.
Drawings
Fig. 1 is a flow chart of a resource scheduling method for an AI cloud computing server group.
Detailed Description
The technical solution of the present invention will be described in further detail with reference to the accompanying drawings, but the scope of the present invention is not limited to the following description.
For the purpose of making the technical solution and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the particular embodiments described herein are illustrative only and are not intended to limit the invention, i.e., the embodiments described are merely some, but not all, of the embodiments of the invention. The components of the embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the invention, as presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be made by a person skilled in the art without making any inventive effort, are intended to be within the scope of the present invention. It is noted that relational terms such as "first" and "second", and the like, are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions.
Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises the element.
The features and capabilities of the present invention are described in further detail below in connection with the examples.
As shown in fig. 1, a resource scheduling method for an AI cloud computing server group includes the following steps:
step one, acquiring historical calculation task data of a client in a cloud database, obtaining calculation task characteristics of the client according to the historical calculation task data of the client, and generating a virtual calculation resource pool corresponding to the client according to the calculation task characteristics of the client;
step two, acquiring the computing resource occupation of the computing task uploaded by the client, if the computing resource occupation is in the capacity range of the virtual computing resource pool of the corresponding client, matching whether a computing resource module of the corresponding computing task exists in the computing resource module sequence of the virtual computing resource pool of the corresponding client, if so, entering the step three, otherwise, entering the step four; if the occupation of the computing resources is not in the capacity range of the virtual computing resource pool of the corresponding client, entering a step five;
thirdly, the computing resource module manager wakes up a computing resource module of a corresponding computing task, the computing resource module applies for the corresponding computing resource from the AI cloud computing server group, processes the computing task, releases the computing resource after the processing is completed, and enters into dormancy to enter into step six;
step four, a computing resource module manager generates a computing resource module of a corresponding computing task in a virtual computing resource pool of a corresponding client, the computing resource module of the corresponding computing task calls computing resources from an AI cloud computing server group, the computing tasks are distributed to the computing resource modules of the corresponding computing tasks, the computing tasks are processed, after the processing is completed, the computing resource modules release the computing resources, the generated computing resource modules of the corresponding computing tasks are stored in a computing resource module sequence of the virtual computing resource pool of the corresponding client, and the step six is entered into;
step five, the computing resource module manager establishes the communication between the temporary computing resource module and the AI cloud computing server group, acquires computing resources corresponding to the computing tasks, processes the computing tasks, releases the computing resources after the processing is completed, and deletes the temporary computing resource module;
and step six, acquiring the utilization rate and the utilization ratio of the computing resources of the virtual computing resource pools of each corresponding client in a set period, and adjusting the capacity of the virtual computing resource pools of each corresponding client according to the utilization rate and the utilization ratio to complete the resource scheduling of the AI cloud computing server group.
According to the historical calculation task data of the client, obtaining the calculation task characteristics of the client, and generating a virtual calculation resource pool corresponding to the client according to the calculation task characteristics of the client, wherein the method comprises the following steps: and in the set time length, the maximum value of the calculated force used in the calculation task uploaded by the client is the calculation task characteristic of the client, the range from the minimum value of the calculated force used in the calculation task uploaded by the client to the maximum value of the calculated force is the capacity of the virtual calculation resource pool, and the minimum value of the calculated force and the maximum value of the calculated force used in the calculation task uploaded by the client respectively form the lower limit of the calculated force and the upper limit of the calculated force of the virtual calculation resource pool.
The method comprises the steps of obtaining the computing resource occupation of the computing task uploaded by the client, wherein the computing resource occupation comprises the computing power required by the computing task uploaded by the client.
The computing resource module sequence is the generated computing resource modules, and is ordered according to the used computing power, and the ordering mode is descending order from big to small.
In a set period, acquiring the utilization rate and the utilization ratio of the computing resources of the virtual computing resource pools of each corresponding client, and adjusting the capacity of the virtual computing resource pools of each corresponding client according to the utilization rate and the utilization ratio, wherein the method comprises the following steps: and after the set period is completed, respectively acquiring the calculation force use ranges from the calculation force use minimum value to the calculation force maximum value of the virtual calculation resource pools of the corresponding clients, and adjusting the capacity of the virtual calculation resource pools of the corresponding clients to the calculation force use ranges.
The foregoing is merely a preferred embodiment of the invention, and it is to be understood that the invention is not limited to the form disclosed herein but is not to be construed as excluding other embodiments, but is capable of numerous other combinations, modifications and environments and is capable of modifications within the scope of the inventive concept, either as taught or as a matter of routine skill or knowledge in the relevant art. And that modifications and variations which do not depart from the spirit and scope of the invention are intended to be within the scope of the appended claims.

Claims (5)

1. The resource scheduling method for the AI cloud computing server group is characterized by comprising the following steps of:
step one, acquiring historical calculation task data of a client in a cloud database, obtaining calculation task characteristics of the client according to the historical calculation task data of the client, and generating a virtual calculation resource pool corresponding to the client according to the calculation task characteristics of the client;
step two, acquiring the computing resource occupation of the computing task uploaded by the client, if the computing resource occupation is in the capacity range of the virtual computing resource pool of the corresponding client, matching whether a computing resource module of the corresponding computing task exists in the computing resource module sequence of the virtual computing resource pool of the corresponding client, if so, entering the step three, otherwise, entering the step four; if the occupation of the computing resources is not in the capacity range of the virtual computing resource pool of the corresponding client, entering a step five;
thirdly, the computing resource module manager wakes up a computing resource module of a corresponding computing task, the computing resource module applies for the corresponding computing resource from the AI cloud computing server group, processes the computing task, releases the computing resource after the processing is completed, and enters into dormancy to enter into step six;
step four, a computing resource module manager generates a computing resource module of a corresponding computing task in a virtual computing resource pool of a corresponding client, the computing resource module of the corresponding computing task calls computing resources from an AI cloud computing server group, the computing tasks are distributed to the computing resource modules of the corresponding computing tasks, the computing tasks are processed, after the processing is completed, the computing resource modules release the computing resources, the generated computing resource modules of the corresponding computing tasks are stored in a computing resource module sequence of the virtual computing resource pool of the corresponding client, and the step six is entered into;
step five, the computing resource module manager establishes the communication between the temporary computing resource module and the AI cloud computing server group, acquires computing resources corresponding to the computing tasks, processes the computing tasks, releases the computing resources after the processing is completed, and deletes the temporary computing resource module;
and step six, acquiring the utilization rate and the utilization ratio of the computing resources of the virtual computing resource pools of each corresponding client in a set period, and adjusting the capacity of the virtual computing resource pools of each corresponding client according to the utilization rate and the utilization ratio to complete the resource scheduling of the AI cloud computing server group.
2. The method for scheduling resources for an AI cloud computing server group according to claim 1, wherein the obtaining computing task characteristics of the client according to historical computing task data of the client, and generating a virtual computing resource pool corresponding to the client according to the computing task characteristics of the client, comprises: and in the set time length, the maximum value of the calculated force used in the calculation task uploaded by the client is the calculation task characteristic of the client, the range from the minimum value of the calculated force used in the calculation task uploaded by the client to the maximum value of the calculated force is the capacity of the virtual calculation resource pool, and the minimum value of the calculated force and the maximum value of the calculated force used in the calculation task uploaded by the client respectively form the lower limit of the calculated force and the upper limit of the calculated force of the virtual calculation resource pool.
3. The method for scheduling resources for an AI cloud computing server group according to claim 2, wherein the obtaining of the computing resource occupation of the computing task uploaded by the client includes obtaining the computing power required by the computing task uploaded by the client, that is, the computing resource occupation.
4. The resource scheduling method for AI-oriented cloud computing server group of claim 3, wherein the sequence of computing resource modules is a sequence of generated computing resource modules, and the sequence is ordered according to the magnitude of the computing power used.
5. The method for scheduling resources for an AI-oriented cloud computing server group of claim 4, wherein the obtaining, in a set period, a computing resource usage rate and a usage ratio of a virtual computing resource pool of each corresponding client, and adjusting a capacity of the virtual computing resource pool of each corresponding client according to the usage rate and the usage ratio, includes:
and after the set period is completed, respectively acquiring the calculation force use ranges from the calculation force use minimum value to the calculation force maximum value of the virtual calculation resource pools of the corresponding clients, and adjusting the capacity of the virtual calculation resource pools of the corresponding clients to the calculation force use ranges.
CN202310891636.9A 2023-07-20 2023-07-20 Resource scheduling method for AI cloud computing server group Active CN116610457B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310891636.9A CN116610457B (en) 2023-07-20 2023-07-20 Resource scheduling method for AI cloud computing server group

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310891636.9A CN116610457B (en) 2023-07-20 2023-07-20 Resource scheduling method for AI cloud computing server group

Publications (2)

Publication Number Publication Date
CN116610457A CN116610457A (en) 2023-08-18
CN116610457B true CN116610457B (en) 2023-09-12

Family

ID=87678644

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310891636.9A Active CN116610457B (en) 2023-07-20 2023-07-20 Resource scheduling method for AI cloud computing server group

Country Status (1)

Country Link
CN (1) CN116610457B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
JP2017142838A (en) * 2017-04-06 2017-08-17 華為技術有限公司Huawei Technologies Co.,Ltd. Resource allocation method of central processing unit and calculation node
CN115328663A (en) * 2022-10-10 2022-11-11 亚信科技(中国)有限公司 Method, device, equipment and storage medium for scheduling resources based on PaaS platform
CN115348264A (en) * 2022-07-28 2022-11-15 招商局金融科技有限公司 Multi-tenant cloud service management method, device, equipment and storage medium
CN116263701A (en) * 2022-11-29 2023-06-16 中移(苏州)软件技术有限公司 Computing power network task scheduling method and device, computer equipment and storage medium
CN116450290A (en) * 2023-03-24 2023-07-18 阿里巴巴(中国)有限公司 Computer resource management method and device, cloud server and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
JP2017142838A (en) * 2017-04-06 2017-08-17 華為技術有限公司Huawei Technologies Co.,Ltd. Resource allocation method of central processing unit and calculation node
CN115348264A (en) * 2022-07-28 2022-11-15 招商局金融科技有限公司 Multi-tenant cloud service management method, device, equipment and storage medium
CN115328663A (en) * 2022-10-10 2022-11-11 亚信科技(中国)有限公司 Method, device, equipment and storage medium for scheduling resources based on PaaS platform
CN116263701A (en) * 2022-11-29 2023-06-16 中移(苏州)软件技术有限公司 Computing power network task scheduling method and device, computer equipment and storage medium
CN116450290A (en) * 2023-03-24 2023-07-18 阿里巴巴(中国)有限公司 Computer resource management method and device, cloud server and storage medium

Also Published As

Publication number Publication date
CN116610457A (en) 2023-08-18

Similar Documents

Publication Publication Date Title
CN111200528B (en) Intelligent linkage method for smart city with edge cloud cooperation
CN107038071B (en) Storm task flexible scheduling algorithm based on data flow prediction
CN104391918B (en) The implementation method of distributed networks database query priority management based on equity deployment
CN107040475B (en) Resource scheduling method and device
CN109240814A (en) A kind of deep learning intelligent dispatching method and system based on TensorFlow
CN109936606A (en) Method for allocating tasks and device
CN116149821A (en) Cluster multi-task sliding window scheduling processing method, system, equipment and medium
CN112363827A (en) Multi-resource index Kubernetes scheduling method based on delay factors
Sharma et al. A Dynamic optimization algorithm for task scheduling in cloud computing with resource utilization
CN116662376A (en) Data query method, device, electronic equipment and storage medium
CN116610457B (en) Resource scheduling method for AI cloud computing server group
CN109388501B (en) Communication matching method, device, equipment and medium based on face recognition request
CN114666335A (en) DDS-based distributed system load balancing device
CN103176850A (en) Electric system network cluster task allocation method based on load balancing
CN117608754A (en) Processing method and device for multi-resource pool application, electronic equipment and storage medium
WO2023066035A1 (en) Resource allocation method and resource allocation apparatus
CN117472551B (en) Cloud computing hardware acceleration control system and method based on GPU integration
Nazari et al. An optimal service composition algorithm in multi-cloud environment
CN113971115A (en) Method and device for determining demand of container resource, server and storage medium
CN110750608A (en) Spatial data analysis automatic scheduling method based on rule base
CN113220458B (en) Real-time scheduling method and device for realizing load balance of database connection pool
CN118152103B (en) Node scheduling method and related device applied to distributed system
CN117667409A (en) Data calculation power recovery and energy consumption control method based on AI and block chain
CN116431329A (en) GPU cluster scheduling method and device
CN117827446A (en) Interconnected multi-data center server cluster load space-time migration strategy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant