CN111796934A - Task issuing method and device, storage medium and electronic equipment - Google Patents

Task issuing method and device, storage medium and electronic equipment Download PDF

Info

Publication number
CN111796934A
CN111796934A CN202010601477.0A CN202010601477A CN111796934A CN 111796934 A CN111796934 A CN 111796934A CN 202010601477 A CN202010601477 A CN 202010601477A CN 111796934 A CN111796934 A CN 111796934A
Authority
CN
China
Prior art keywords
resource
target
quota
task
platform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010601477.0A
Other languages
Chinese (zh)
Other versions
CN111796934B (en
Inventor
褚向阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Pinecone Electronic Co Ltd
Original Assignee
Beijing Xiaomi Pinecone Electronic Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xiaomi Pinecone Electronic Co Ltd filed Critical Beijing Xiaomi Pinecone Electronic Co Ltd
Priority to CN202010601477.0A priority Critical patent/CN111796934B/en
Publication of CN111796934A publication Critical patent/CN111796934A/en
Application granted granted Critical
Publication of CN111796934B publication Critical patent/CN111796934B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The disclosure relates to a task issuing method, a device, a storage medium and an electronic device, wherein the method comprises the following steps: after receiving a resource request of a target task, determining the resource occupation amount, the name space, the priority and the resource model corresponding to the target task, wherein the target resource type is the equipment model of the computing resource of a resource platform required by executing the target task; determining a first quota limit of the namespace for the target task and a second quota limit of the resource platform for the target task; and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result. The quota limitation method has the advantages that quota limitation can be performed on the occupation of the resource quantity of the naming space and the total resource quantity of the platform resources by the computing tasks with different priorities and different resource models, the pertinence and the accuracy of resource allocation are improved, and the utilization rate of the platform resources is improved.

Description

Task issuing method and device, storage medium and electronic equipment
Technical Field
The present disclosure relates to the field of computing resource management, and in particular, to a task issuing method and apparatus, a storage medium, and an electronic device.
Background
With the development of artificial intelligence technology, the data volume and the calculation volume involved in the development and reference process related to artificial intelligence are larger and larger, and a single resource device cannot meet the large calculation volume required by artificial intelligence calculation. Therefore, a resource platform based on a cloud computing platform needs to be built for artificial intelligent computing. When the user scale and the resource scale of the resource platform are large, reasonable resource quota management needs to be adopted so as to meet the resource scheduling requirement under the multi-user multi-task scene under the condition that the whole resource platform is ensured to be stably available.
Disclosure of Invention
In order to overcome the problems in the related art, the disclosure provides a task issuing method, a device, a storage medium and an electronic device.
According to a first aspect of the embodiments of the present disclosure, a task issuing method is provided, which is applied to a cloud computing platform, and the method includes:
under the condition of receiving a resource request of a target task, determining a resource occupation amount, a target naming space, a target priority and a target resource model corresponding to the target task, wherein the target naming space is a naming space (namespace) where a service end sending the target task is located, and the target resource model is a device model of computing resources of a resource platform required for executing the target task;
determining a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform; wherein the first quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task among all target resources allocated for the target namespace, the second quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and the target resource is provided with the target resource model;
and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result.
Optionally, before determining the target namespace, the target priority, and the target resource model corresponding to the target task under the condition that the resource request of the target task is received, the method further includes:
identifying device information for each computing resource in the resource platform to determine a device model for the each computing resource from the device information;
and outputting the equipment model of each computing resource, so that the service end sets the target resource model of the target task according to the equipment model of each computing resource.
Optionally, the determining a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task includes:
acquiring a first resource quota table corresponding to the preset target namespace and a second resource quota table corresponding to the resource platform; the first resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the target namespace for each task, which are sent by all service terminals in the target namespace, and the second resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the resource platform for each task;
determining the first resource quota limit according to the target priority, the target resource model and the first resource quota table; and the number of the first and second groups,
and determining the second resource quota limit according to the target priority, the target resource model and the second resource quota table.
Optionally, the comparing the resource occupancy with the first quota limit and the second quota limit respectively to determine whether to issue the target task to the resource platform according to a comparison result includes:
determining whether the resource occupancy exceeds the first quota limit;
in an instance in which it is determined that the resource occupancy does not exceed the first quota limit, determining whether the resource occupancy exceeds the second quota limit;
under the condition that the resource occupancy does not exceed the second quota limit, the target task is issued to the resource platform to be executed; alternatively, the first and second electrodes may be,
and outputting information for indicating that the target task cannot be issued to the resource platform under the condition that the resource occupancy exceeds the first quota limit or the resource occupancy exceeds the second quota limit.
According to a second aspect of the embodiments of the present disclosure, there is provided a task issuing device applied to a cloud computing platform, the device including:
the information determining module is configured to determine resource occupation, a target naming space, a target priority and a target resource model corresponding to a target task under the condition that a resource request of the target task is received, wherein the target naming space is a naming space where a service end sending the target task is located, and the target resource model is a device model of computing resources of a resource platform required for executing the target task;
a quota determination module configured to determine a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform; wherein the first quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task among all target resources allocated for the target namespace, the second quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and the target resource is provided with the target resource model;
and the task issuing module is configured to compare the resource occupancy with the first quota limit and the second quota limit respectively, and determine whether to issue the target task to the resource platform according to a comparison result.
Optionally, the apparatus further comprises:
a device identification module configured to identify device information for each computing resource in the resource platform to determine a device model for the each computing resource from the device information;
and the resource model output module is configured to output the equipment model of each computing resource so that the service end sets the target resource model of the target task according to the equipment model of each computing resource.
Optionally, the quota determining module is configured to:
acquiring a first resource quota table corresponding to the preset target namespace and a second resource quota table corresponding to the resource platform; the first resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the target namespace for each task, which are sent by all service terminals in the target namespace, and the second resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the resource platform for each task;
determining the first resource quota limit according to the target priority, the target resource model and the first resource quota table; and the number of the first and second groups,
and determining the second resource quota limit according to the target priority, the target resource model and the second resource quota table.
Optionally, the task issuing module is configured to:
determining whether the resource occupancy exceeds the first quota limit;
in an instance in which it is determined that the resource occupancy does not exceed the first quota limit, determining whether the resource occupancy exceeds the second quota limit;
under the condition that the resource occupancy does not exceed the second quota limit, the target task is issued to the resource platform to be executed; alternatively, the first and second electrodes may be,
and outputting information for indicating that the target task cannot be issued to the resource platform under the condition that the resource occupancy exceeds the first quota limit or the resource occupancy exceeds the second quota limit.
According to a third aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer program instructions are stored, and the program instructions, when executed by a processor, implement the steps of the task issuing method provided by the first aspect of the present disclosure.
According to a fourth aspect of the embodiments of the present disclosure, there is provided an electronic apparatus including: the task issuing device provided by the second aspect of the disclosure.
According to the technical scheme provided by the embodiment of the disclosure, under the condition that a resource request of a target task is received, the resource occupation amount, the target name space, the target priority and the target resource model corresponding to the target task can be determined, wherein the target name space is a name space where a service end sending the target task is located, and the target resource type is a device model of computing resources of a resource platform required by executing the target task; determining a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task; the first quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and has the target resource model; and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result. The quota limitation method has the advantages that quota limitation can be performed on the occupation of the resource quantity corresponding to the naming space and the total resource quantity of the platform resources by the computing tasks with different priorities and different resource models, the pertinence and the accuracy of resource allocation are improved, and the utilization rate of the platform resources is improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flowchart illustrating a method of task delivery, according to an exemplary embodiment;
FIG. 2 is a flow diagram of another task delivery method according to that shown in FIG. 1;
FIG. 3 is a flow diagram of a method of determining a resource quota limit in accordance with the description of FIG. 1;
FIG. 4 is a flow chart of a task delivery method according to the one shown in FIG. 1;
FIG. 5 is a block diagram illustrating a task issuing device in accordance with an illustrative embodiment;
FIG. 6 is a block diagram of another task issuing device shown in FIG. 5;
FIG. 7 is a block diagram illustrating an electronic device in accordance with an example embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
In the related art of resource scheduling, resource quota management for multiple users (or clients) or user groups is usually implemented through a namespace. Namespaces provide a way to logically partition and isolate resources throughout a computing platform. Specifically, before a user issues a task, respective namespaces need to be created for different user groups, names of objects of the same kind of computing task instance are unique in the namespaces, and task instances in the namespaces have the same control strategy. In a multi-user scenario, in order to avoid that an individual user occupies too much resources and affects the use of other users of the platform, the limitation of the allowed resource amount of each namespace can be defined according to the actual service requirement of each user and the total resource amount of the resource platform through a resource quota concept.
However, the resource quota management method only distinguishes resource types such as GPU (Graphics Processing Unit) computing resources, CPU (Central Processing Unit) computing resources, and memory resources, that is, resource devices with the same resource type are all regarded as the same resource to perform quota management, and differences in parameters and performance of Graphics cards (GPU devices), memory banks, or processors (CPU devices) of different models are not considered, so that the pertinence of quota limitation on resources is poor, and further, the utilization rate of platform resources is reduced. And the quota limitation on the task is only performed according to the resource amount of each namespace, and the quota limitation on the actual resource amount of the whole resource platform is not considered, so that the accuracy of resource allocation is low, and the utilization rate of platform resources is reduced.
In view of the above, the present disclosure provides a task issuing method, which includes:
before introducing the task issuing method provided by the present disclosure, a target application scenario related to each embodiment in the present disclosure is introduced, where the target application scenario includes a resource platform, the resource platform is a cloud computing platform, the cloud computing platform includes at least one scheduler provided with a resource scheduling system, and a plurality of resource servers, and the resource servers are cloud servers that have GPU computing resources, CPU computing resources, and memory resources and can provide cloud data acquisition and cloud computing services for received computing tasks.
Fig. 1 is a flowchart illustrating a task issuing method according to an exemplary embodiment, and as shown in fig. 1, the method is applied to a cloud computing platform described in the foregoing application scenario, and includes the following steps:
in step 101, when a resource request of a target task is received, a resource occupancy, a target namespace, a target priority and a target resource model corresponding to the target task are determined.
The target name space is the name space where the service end sending the target task is located, and the target resource model is the equipment model of the computing resource of the resource platform required by executing the target task.
Illustratively, the resource platform corresponds to a plurality of namespaces, each namespace corresponds to a property end group (or user group), and each service end of the namespace can create tasks according to service requirements. After the resource request of the target task is acquired, the four resource information, namely the resource occupation amount, the target name space, the target priority and the target resource model, corresponding to the target task can be acquired from the resource request information. The method for acquiring the target name space comprises the following steps: and determining the service end group of the service end which sends the target task, and then determining the name space corresponding to the service end group as the target name space. The four items of information are resource information corresponding to the computing resources of the same resource type. The resource categories may include: GPU computing resources, memory resources, CPU computing resources, and the like. For example, the resource occupation amount may be a resource occupation amount of the target task for the GPU computing resources, the target priority is used to represent a priority order of occupation of the target task corresponding to the GPU computing resources, and the target resource model is a device model of the graphics card, or the resource occupation amount may be a resource occupation amount of the target task for the CPU computing resources, the target priority is used to represent a priority order of occupation of the target task corresponding to the CPU computing resources, and the target resource model is a device model of the processor chip. In the embodiment of the present disclosure, the task issuing method is described with the resource type of the computing resource as the quota management of the GPU computing resource.
In step 102, a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task are determined.
The first quota limit is used to represent a limit value of a resource amount of the target resource that is allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used to represent a limit value of a resource amount of the target resource that is allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and the target resource has the target resource model.
In step 103, the resource occupancy is compared with the first quota limit and the second quota limit, respectively, so as to determine whether to issue the target task to the resource platform according to the comparison result.
Illustratively, for a computing resource, e.g., a GPU computing resource, the resource platform corresponds to a set of quota limiting policies, and the resource platform corresponds to a set of quota limiting policies for each of the namespaces associated with the resource platform. And under the condition that the resource requirement of the target task simultaneously meets the quota limit policy of the resource platform and the quota limit policy corresponding to the namespace.
In the present embodiment, the expression "resource amount" and the "total resource amount" and the "resource occupation amount" related thereto may indicate the number of entity devices that calculate the resource, or the capacity or the calculation capability of the resource device. For example, the resource occupation of a task is 6 GPU computing resources, which actually means that 6 graphics cards (with the same computing power) are needed to support the running of the task. Or, the resource occupation amount of a certain task is 2 memory resources, which actually means that 2 capacity units of storage capacity are needed to support the operation of the task, and the capacity units may be mb (mbyte), gb (gigabyte), or larger storage capacity units.
In summary, according to the technical solution provided by the embodiments of the present disclosure, under the condition that a resource request of a target task is received, a resource occupancy amount, a target namespace, a target priority and a target resource model corresponding to the target task can be determined, where the target namespace is a namespace in which a service end that sends the target task is located, and the target resource type is a device model of a computing resource of a resource platform required for executing the target task; determining a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task; the first quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and has the target resource model; and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result. The quota limitation method has the advantages that quota limitation can be performed on the occupation of the resource quantity corresponding to the naming space and the total resource quantity of the platform resources by the computing tasks with different priorities and different resource models, the pertinence and the accuracy of resource allocation are improved, and the utilization rate of the platform resources is improved.
Fig. 2 is a flowchart of another task issuing method shown in fig. 1, and as shown in fig. 2, before the step 101, the method may further include:
in step 104, device information for each computing resource in the resource platform is identified to determine a device model for each computing resource based on the device information.
In step 105, the device model of each computing resource is output, so that the service end sets the target resource model of the target task according to the device model of each computing resource.
For example, the scheduling system of the resource platform may apply device information of all computing resources in the resource platform, such as performance of all video cards, size of video memory, production merchant, production number, and sales number, every preset time. The production number of each display card can be directly obtained, the display cards with the same production number are used as a resource model, or the equipment information can be clustered through a preset equipment identification model, and the computing resources corresponding to the equipment information which is clustered into one type have the same equipment model. After each device model is output or reported to the service end, the service end can determine a target resource model corresponding to the target task according to the number of each device model in the resource platform, the computing capacity of each device, and the computing amount of the target task. It should be noted that the setting procedure of the target resource model (also including the target priority) may include: the resource model division is realized through manual setting of operators at the service end, or the service end automatically divides the resource models through a preset classification algorithm based on three parameters, namely the number of each equipment model in the resource platform, the calculation capacity of each equipment and the calculation amount of the target task.
FIG. 3 is a flowchart of a method for determining a resource quota limit according to the method shown in FIG. 1, and as shown in FIG. 3, the step 102 may include:
in step 1021, a preset first resource quota table corresponding to the target namespace and a preset second resource quota table corresponding to the resource platform are obtained.
The first resource quota table is used for representing a corresponding relation among a priority, a resource model and a resource quota limit of the target namespace for each task, which are corresponding to each task sent by all service terminals in the target namespace, and the second resource quota table is used for representing a corresponding relation among a priority, a resource model and a resource quota limit of the resource platform for each task.
In step 1022, the first resource quota limit is determined according to the target priority, the target resource model, and the first resource quota table.
In step 1023, the second resource quota limit is determined according to the target priority, the target resource model, and the second resource quota table.
For example, the first resource quota table and the second resource quota table may be stored in the scheduling system of the resource platform in the form of the following table 1, where there is one resource quota table for the resource platform, and there is one namespace resource quota table for each namespace. In the process of initializing the resource platform or in the process of re-starting the resource platform after updating, the resource scheduling system of the resource platform can count and output the number of each type of computing resource in the resource platform, and an operator can set the second resource quota table according to the number of each type of computing resource. When each namespace is created, an operator can set a resource quota table corresponding to each namespace according to service requirements and the number of each computing resource.
TABLE 1
Resource quota limiting Resource type a Resource model b Resource model c
Priority A 3 4 2
Priority class B 4 5 4
Priority C 3 6 6
Priority D 2 5 7
For example, after determining the target namespace, a plurality of resource quota tables corresponding to the target namespace may be determined from a plurality of namespace resource quota tables. Taking the table 1 as the first resource quota table (or the second resource quota table) as an example, if the target priority corresponding to the target task is priority B and the target resource model is resource model c, the resource quota limit of the target namespace (or the resource platform) for the target task is 4.
It should be noted that the total amount of the resource quota corresponding to each namespace may be adjusted according to the task amount or the resource occupancy of the task actually received within a period of time, and the specific numerical value of the resource quota limit corresponding to different priorities in the resource quota table of each namespace.
For example, if it is monitored that the task amount or the resource occupancy amount corresponding to the namespace a is small and the task amount or the resource occupancy amount corresponding to the namespace B is large in one week, the total amount of the resource quotas corresponding to the namespace a may be decreased at the beginning of the next week, and the total amount of the resource quotas corresponding to the namespace B may be increased, but the sum of the total amount of the resource quotas corresponding to each namespace is kept larger than the total amount of the resources of the resource platform. Or, for any namespace C, if it is monitored that the task volume or the resource occupancy of the task with the priority a is small and the task volume or the resource occupancy of the task with the priority B is large in one week, the total amount of the resource quota corresponding to the priority a may be decreased at the beginning of the next week, and the total amount of the resource quota corresponding to the priority B may be increased, but the total amount of the resource quota of the namespace C is kept unchanged.
Fig. 4 is a flowchart of a task issuing method shown in fig. 1, and as shown in fig. 4, the step 103 may include: step 1031-1032, or steps 1031 and 1034, or steps 1031, 1032 and 1034.
In step 1031, it is determined whether the resource footprint exceeds the first quota limit.
In step 1032, in an instance in which it is determined that the resource occupancy does not exceed the first quota limit, it is determined whether the resource occupancy exceeds the second quota limit.
In step 1033, the target task is issued to the resource platform for execution under the condition that it is determined that the resource occupancy does not exceed the second quota limit.
In step 1034, when it is determined that the resource occupancy exceeds the first quota limit, or the resource occupancy exceeds the second quota limit, information indicating that the target task cannot be issued to the resource platform is output.
For example, in consideration of a situation that all namespaces issue tasks to a resource platform at the same time rarely occurs in an actual application process, when setting a resource quota corresponding to each namespace, a certain redundancy amount is generally considered, that is, the sum of the total amount of resource quotas corresponding to each namespace may be set to be greater than the total amount of resources of the resource platform. Therefore, a situation that the resource occupancy does not exceed the first quota limit but exceeds the second quota limit may occur, and therefore, in steps 1031 and 1032, the resource occupancy needs to be successively compared with the first quota limit and the second quota limit. If both the target task and the resource occupation amount can meet the requirement (the target task is not exceeded), allowing the target task to be issued to the resource platform and executing the target task through the computing resources improved by the resource platform; if any one of the two can not meet the resource occupation amount (is exceeded), the target task is rejected from being issued, and then information used for indicating that the target task can not be issued to the resource platform is output.
In summary, according to the technical solution provided by the embodiments of the present disclosure, under the condition that a resource request of a target task is received, a resource occupancy amount, a target namespace, a target priority and a target resource model corresponding to the target task can be determined, where the target namespace is a namespace in which a service end that sends the target task is located, and the target resource type is a device model of a computing resource of a resource platform required for executing the target task; determining a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task; the first quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and has the target resource model; and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result. The quota limitation method has the advantages that quota limitation can be performed on the occupation of the resource quantity corresponding to the naming space and the total resource quantity of the platform resources by the computing tasks with different priorities and different resource models, the pertinence and the accuracy of resource allocation are improved, and the utilization rate of the platform resources is improved.
Fig. 5 is a block diagram of a task issuing device according to an exemplary embodiment, as shown in fig. 5, the device 500 is applied to the cloud computing platform described in the foregoing application scenario, and includes:
an information determining module 510, configured to determine, in a case that a resource request of a target task is received, a resource occupancy amount, a target namespace, a target priority, and a target resource model corresponding to the target task, where the target namespace is a namespace in which a service end that sends the target task is located, and the target resource model is a device model of a computing resource of a resource platform required for executing the target task;
a quota determining module 520 configured to determine a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform; the first quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and has the target resource model;
the task issuing module 530 is configured to compare the resource occupancy with the first quota limit and the second quota limit, respectively, and determine whether to issue the target task to the resource platform according to a comparison result.
Fig. 6 is a block diagram of another task issuing device shown in fig. 5, and as shown in fig. 6, the device 500 may further include:
a device identification module 540 configured to identify device information of each computing resource in the resource platform, so as to determine a device model of each computing resource according to the device information;
and a resource model output module 550 configured to output the device model of each computing resource, so that the service end sets the target resource model of the target task according to the device model of each computing resource.
Optionally, the quota determining module 520 is configured to:
acquiring a preset first resource quota table corresponding to the target namespace and a preset second resource quota table corresponding to the resource platform; the first resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the target namespace for each task, which are corresponding to each task sent by all service terminals in the target namespace, and the second resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the resource platform for each task;
determining the first resource quota limit according to the target priority, the target resource model and the first resource quota table; and the number of the first and second groups,
and determining the second resource quota limit according to the target priority, the target resource model and the second resource quota table.
Optionally, the task issuing module 530 is configured to:
determining whether the resource occupancy exceeds the first quota limit;
determining whether the resource occupancy exceeds the second quota limit if it is determined that the resource occupancy does not exceed the first quota limit;
under the condition that the resource occupancy does not exceed the second quota limit, the target task is issued to the resource platform to be executed; alternatively, the first and second electrodes may be,
and outputting information for indicating that the target task cannot be issued to the resource platform under the condition that the resource occupancy exceeds the first quota limit or the resource occupancy exceeds the second quota limit.
In summary, according to the technical solution provided by the embodiments of the present disclosure, under the condition that a resource request of a target task is received, a resource occupancy amount, a target namespace, a target priority and a target resource model corresponding to the target task can be determined, where the target namespace is a namespace in which a service end that sends the target task is located, and the target resource type is a device model of a computing resource of a resource platform required for executing the target task; determining a first quota limit of the target namespace for the target task and a second quota limit of the resource platform for the target task; the first quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in all target resources allocated to the target namespace, the second quota limit is used for representing a limit value of the resource quantity of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and has the target resource model; and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result. The quota limitation method has the advantages that quota limitation can be performed on the occupation of the resource quantity corresponding to the naming space and the total resource quantity of the platform resources by the computing tasks with different priorities and different resource models, the pertinence and the accuracy of resource allocation are improved, and the utilization rate of the platform resources is improved.
FIG. 7 is a block diagram illustrating an electronic device in accordance with an example embodiment. For example, the electronic device 700 may be a scheduler or a server in a cloud computing platform. Referring to fig. 7, electronic device 700 includes a processing component 722 that further includes one or more processors, and memory resources, represented by memory 732, for storing instructions, such as applications, that are executable by processing component 722. The application programs stored in memory 732 may include one or more modules that each correspond to a set of instructions. In addition, the processing component 722 is configured to execute instructions to perform the task issuing method illustrated in fig. 1-4.
The electronic device 700 may also include a power component 726 configured to perform power management of the electronic device 700, a wired or wireless network interface 770 configured to connect the electronic device 700 to a network, and an input output (I/O) interface 778. The electronic device 700 may operate based on an operating system, such as a Windows Server, stored in memory 732TM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTMOr the like.
The electronic device provided by the embodiment of the disclosure can limit quota for occupation of resource amount corresponding to a namespace and total resource amount of platform resources by computing tasks with different priorities and different resource models, so that pertinence and accuracy of resource allocation are improved, and utilization rate of the platform resources is improved.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (10)

1. A task issuing method is applied to a cloud computing platform and is characterized by comprising the following steps:
under the condition of receiving a resource request of a target task, determining a resource occupation amount, a target name space, a target priority and a target resource model corresponding to the target task, wherein the target name space is a name space where a service end sending the target task is located, and the target resource model is a device model of computing resources of a resource platform required for executing the target task;
determining a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform; wherein the first quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task among all target resources allocated for the target namespace, the second quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and the target resource is provided with the target resource model;
and comparing the resource occupation with the first quota limit and the second quota limit respectively, and determining whether to issue the target task to the resource platform according to a comparison result.
2. The method according to claim 1, wherein before determining a target namespace, a target priority, and a target resource model corresponding to a target task in case of receiving a resource request of the target task, the method further comprises:
identifying device information for each computing resource in the resource platform to determine a device model for the each computing resource from the device information;
and outputting the equipment model of each computing resource, so that the service end sets the target resource model of the target task according to the equipment model of each computing resource.
3. The method of claim 1, wherein the determining a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform comprises:
acquiring a first resource quota table corresponding to the preset target namespace and a second resource quota table corresponding to the resource platform; the first resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the target namespace for each task, which are sent by all service terminals in the target namespace, and the second resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the resource platform for each task;
determining the first resource quota limit according to the target priority, the target resource model and the first resource quota table; and the number of the first and second groups,
and determining the second resource quota limit according to the target priority, the target resource model and the second resource quota table.
4. The method of claim 1, wherein the comparing the resource occupancy with the first quota limit and the second quota limit, respectively, to determine whether to issue the target task to the resource platform according to a comparison result comprises:
determining whether the resource occupancy exceeds the first quota limit;
in an instance in which it is determined that the resource occupancy does not exceed the first quota limit, determining whether the resource occupancy exceeds the second quota limit;
under the condition that the resource occupancy does not exceed the second quota limit, the target task is issued to the resource platform to be executed; alternatively, the first and second electrodes may be,
and outputting information for indicating that the target task cannot be issued to the resource platform under the condition that the resource occupancy exceeds the first quota limit or the resource occupancy exceeds the second quota limit.
5. A task issuing device is applied to a cloud computing platform and is characterized by comprising:
the information determining module is configured to determine resource occupation, a target naming space, a target priority and a target resource model corresponding to a target task under the condition that a resource request of the target task is received, wherein the target naming space is a naming space where a service end sending the target task is located, and the target resource model is a device model of computing resources of a resource platform required for executing the target task;
a quota determination module configured to determine a first quota limit for the target task for the target namespace and a second quota limit for the target task for the resource platform; wherein the first quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task among all target resources allocated for the target namespace, the second quota limit is used to characterize a limit of a resource amount of the target resource allowed to be occupied by the target task in the resource platform, the target resource corresponds to the target priority and the target resource is provided with the target resource model;
and the task issuing module is configured to compare the resource occupancy with the first quota limit and the second quota limit respectively, and determine whether to issue the target task to the resource platform according to a comparison result.
6. The apparatus of claim 5, further comprising:
a device identification module configured to identify device information for each computing resource in the resource platform to determine a device model for the each computing resource from the device information;
and the resource model output module is configured to output the equipment model of each computing resource so that the service end sets the target resource model of the target task according to the equipment model of each computing resource.
7. The apparatus of claim 5, wherein the quota determination module is configured to:
acquiring a first resource quota table corresponding to the preset target namespace and a second resource quota table corresponding to the resource platform; the first resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the target namespace for each task, which are sent by all service terminals in the target namespace, and the second resource quota table is used for representing the corresponding relation among the priority, the resource model and the resource quota limit of the resource platform for each task;
determining the first resource quota limit according to the target priority, the target resource model and the first resource quota table; and the number of the first and second groups,
and determining the second resource quota limit according to the target priority, the target resource model and the second resource quota table.
8. The apparatus of claim 5, wherein the task issuing module is configured to:
determining whether the resource occupancy exceeds the first quota limit;
in an instance in which it is determined that the resource occupancy does not exceed the first quota limit, determining whether the resource occupancy exceeds the second quota limit;
under the condition that the resource occupancy does not exceed the second quota limit, the target task is issued to the resource platform to be executed; alternatively, the first and second electrodes may be,
and outputting information for indicating that the target task cannot be issued to the resource platform under the condition that the resource occupancy exceeds the first quota limit or the resource occupancy exceeds the second quota limit.
9. A computer-readable storage medium, on which computer program instructions are stored, which program instructions, when executed by a processor, carry out the steps of the method according to any one of claims 1 to 4.
10. An electronic device, comprising: the task issuing device of any one of claims 5 to 8.
CN202010601477.0A 2020-06-28 2020-06-28 Task issuing method and device, storage medium and electronic equipment Active CN111796934B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010601477.0A CN111796934B (en) 2020-06-28 2020-06-28 Task issuing method and device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010601477.0A CN111796934B (en) 2020-06-28 2020-06-28 Task issuing method and device, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN111796934A true CN111796934A (en) 2020-10-20
CN111796934B CN111796934B (en) 2023-11-21

Family

ID=72804648

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010601477.0A Active CN111796934B (en) 2020-06-28 2020-06-28 Task issuing method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN111796934B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112801430A (en) * 2021-04-13 2021-05-14 贝壳找房(北京)科技有限公司 Task issuing method and device, electronic equipment and readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107040479A (en) * 2016-02-04 2017-08-11 华为软件技术有限公司 A kind of method and apparatus of cloud computing resources regulation
CN109542603A (en) * 2018-11-22 2019-03-29 北京航空航天大学 It is a kind of improve different priorities task between isolation multi dimensional resource shielding system
US20190132257A1 (en) * 2017-10-27 2019-05-02 EMC IP Holding Company Method, server system and computer program product of managing resources

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107040479A (en) * 2016-02-04 2017-08-11 华为软件技术有限公司 A kind of method and apparatus of cloud computing resources regulation
US20190132257A1 (en) * 2017-10-27 2019-05-02 EMC IP Holding Company Method, server system and computer program product of managing resources
CN109542603A (en) * 2018-11-22 2019-03-29 北京航空航天大学 It is a kind of improve different priorities task between isolation multi dimensional resource shielding system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘治国;金朦朦;刘庆利;: "基于改进Stackelberg模型的卫星网络虚拟资源分配算法", 计算机工程, no. 10 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112801430A (en) * 2021-04-13 2021-05-14 贝壳找房(北京)科技有限公司 Task issuing method and device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN111796934B (en) 2023-11-21

Similar Documents

Publication Publication Date Title
US10558498B2 (en) Method for scheduling data flow task and apparatus
US11496413B2 (en) Allocating cloud computing resources in a cloud computing environment based on user predictability
CN111225050B (en) Cloud computing resource allocation method and device
CN111309440B (en) Method and equipment for managing and scheduling multiple types of GPUs
CN111506398A (en) Task scheduling method and device, storage medium and electronic device
CN111488206A (en) Deep learning task scheduling method, system, terminal and storage medium
CN112395075A (en) Resource processing method and device and resource scheduling system
CN111796933A (en) Resource scheduling method, device, storage medium and electronic equipment
CN111796934B (en) Task issuing method and device, storage medium and electronic equipment
CN107193749B (en) Test method, device and equipment
CN114116173A (en) Method, device and system for dynamically adjusting task allocation
CN111475251A (en) Cluster container scheduling method, system, terminal and storage medium
CN113703945B (en) Micro service cluster scheduling method, device, equipment and storage medium
US20140047454A1 (en) Load balancing in an sap system
CN115421920A (en) Task management method and device for financial product, electronic equipment and storage medium
CN114237902A (en) Service deployment method and device, electronic equipment and computer readable medium
CN112988383A (en) Resource allocation method, device, equipment and storage medium
CN113918344A (en) Resource demand allocation method, device, equipment and computer readable medium
CN114090201A (en) Resource scheduling method, device, equipment and storage medium
CN112905351B (en) GPU and CPU load scheduling method, device, equipment and medium
CN113179308B (en) Service request processing method and device, electronic equipment and storage medium
CN115525425B (en) Federal learning calculation engine arrangement method and equipment based on cloud primordial technology
CN111798113B (en) Resource allocation method, device, storage medium and electronic equipment
CN117785486B (en) Environment resource allocation method, device, equipment and medium
WO2023065882A1 (en) Method and apparatus for resource allocation, and computer-readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant