CN111597025B - Edge calculation scheduling algorithm and system - Google Patents

Edge calculation scheduling algorithm and system Download PDF

Info

Publication number
CN111597025B
CN111597025B CN202010407602.4A
Authority
CN
China
Prior art keywords
tasks
edge computing
task
edge
computing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010407602.4A
Other languages
Chinese (zh)
Other versions
CN111597025A (en)
Inventor
张锐
兰毅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Planetary Computing Power Shenzhen Technology Co ltd
Original Assignee
Planetary Computing Power Shenzhen Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Planetary Computing Power Shenzhen Technology Co ltd filed Critical Planetary Computing Power Shenzhen Technology Co ltd
Priority to CN202010407602.4A priority Critical patent/CN111597025B/en
Publication of CN111597025A publication Critical patent/CN111597025A/en
Application granted granted Critical
Publication of CN111597025B publication Critical patent/CN111597025B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/502Proximity
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5021Priority

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multi Processors (AREA)
  • Computer And Data Communications (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses an edge computing scheduling algorithm and system. The edge computing scheduling algorithm comprises the following steps: acquiring X computing tasks and determining the priority of each computing task; distributing the X computing tasks into queues of different priority levels according to their priorities, forming Y ready queues of different priority levels; and allotting each priority-level queue an execution time Ty, such that a task not finished within Ty is suspended and placed at the tail of its priority ready queue to queue again, with the optimal n edge computing nodes selected whenever a task executes. Tasks of different priorities sit in different queues and are executed by polling, so no task is blocked for long and all tasks are guaranteed to execute normally. The invention quantifies the service capability of the edge computing nodes, ensuring that user tasks select the optimal nodes for execution.

Description

Edge calculation scheduling algorithm and system
Technical Field
The invention belongs to the technical field of edge computing, and relates to an edge computing scheduling algorithm and an edge computing scheduling system.
Background
Edge computing refers to providing near-end services close to the object or data source, using an open platform that integrates network, computing, storage, and application capabilities. An application is initiated in the cloud but runs at the edge, producing faster network service responses and meeting industry requirements for real-time business, application intelligence, security, and privacy protection. Edge computing adopts a distributed architecture in which data and computation are moved from the central network node to edge nodes: a large service that would otherwise be handled entirely by the central node is decomposed into smaller, more manageable parts and dispatched to edge nodes for processing, with the results finally assembled into the computation output. Selecting the most suitable edge nodes to provide intelligent analysis and processing reduces latency, improves efficiency, and strengthens security and privacy protection. In terms of industrial application, edge computing falls roughly into four types: Internet-of-Things edge computing, P2P edge computing, server edge computing, and operator edge computing.
Edge computing generally adopts a two-layer structure comprising a scheduling layer and an execution layer: scheduling is performed by a central node, while execution is distributed widely across edge computing nodes (Edge Computing Node, ECN) of different regions, types, and scales. The edge scheduling algorithm (Edge Scheduling Algorithm, ESA) plays a very important role in edge computing; it mainly accomplishes two tasks: determining the execution order of tasks, and selecting the optimal edge computing nodes, thereby meeting the requirements of computing tasks in different scenarios.
At present, regarding how to determine the execution order of tasks, one common method is the first-come-first-served scheduling algorithm, which schedules tasks in order of arrival, i.e. the task that has waited longest in the system is served first. With this method, a short task that arrives after a long task suffers a long waiting time and a large weighted turnaround time, so short tasks are hard to get executed. Another common method is priority scheduling: at each scheduling decision the system selects the task with the highest priority and allocates a computing node to it. With this method, if high-priority tasks keep arriving, a low-priority task may not be executed for a long time.
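To make the first-come-first-served drawback concrete: a short task queued behind a long one gets a weighted turnaround time (completion time divided by run time) far above 1. A minimal sketch, purely illustrative (the function name and scenario are assumptions, not part of the patent):

```python
# Hypothetical illustration: under FCFS, a short task stuck behind a long
# one has its weighted turnaround (turnaround / run time) blow up.
def fcfs_weighted_turnaround(run_times):
    t, result = 0, []
    for rt in run_times:          # tasks served strictly in arrival order
        t += rt                   # completion time (all tasks arrive at t=0)
        result.append(t / rt)     # weighted turnaround time
    return result

# A 100-unit task followed by a 1-unit task:
print(fcfs_weighted_turnaround([100, 1]))  # [1.0, 101.0]
```

The 1-unit task waits 100 units for 1 unit of work, which is exactly the imbalance the multilevel-queue scheme below is designed to avoid.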
Disclosure of Invention
The invention aims to solve at least the above technical problems in the prior art, and in particular creatively provides an edge computing scheduling algorithm and system.
In order to achieve the above object of the present invention, according to a first aspect of the present invention, there is provided an edge calculation scheduling algorithm comprising the steps of:
acquiring X computing tasks and determining the priority of each computing task, wherein X is a positive integer;
the X computing tasks are distributed to queues with different priority levels according to the priority levels to form Y ready queues with different priority levels, wherein Y is a positive integer greater than 1;
allotting an execution time Ty to each queue of a different priority level, where Ty is the total execution time for tasks in the y-th queue, y = 1, 2, ..., Y; if a task has not finished executing within Ty, it is suspended and placed at the tail of its priority ready queue to queue again; and selecting the optimal n edge computing nodes whenever a task executes, where n is a positive integer.
Tasks of different priorities sit in different queues and are executed by polling, so no task is blocked for long, ensuring that all tasks execute normally.
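The three steps above can be sketched as a small simulation; all names and data structures here are illustrative assumptions, not the patent's implementation, and the sketch assumes every budget Ty is positive:

```python
from collections import deque

def schedule(queues, ty):
    """queues: list of deques (highest priority first) of [name, remaining] tasks;
    ty: per-queue time budgets Ty. Returns the (task, time-run) execution order."""
    order = []
    while any(queues):                      # until every ready queue is empty
        for q, budget in zip(queues, ty):   # poll queues in priority order
            while q and budget > 0:
                task = q.popleft()
                ran = min(budget, task[1])  # run within this queue's budget Ty
                task[1] -= ran
                budget -= ran
                order.append((task[0], ran))
                if task[1] > 0:             # not finished within Ty: suspend and
                    q.append(task)          # requeue at the tail of its own queue
    return order
```

For example, with Ty = 2 for both queues, a 3-unit task in the high-priority queue runs for 2 units and is suspended, the 2-unit low-priority task runs, and the remaining unit runs in the next polling round, so neither queue starves.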
According to a preferred embodiment of the present invention, the method for selecting the optimal n edge computing nodes is as follows:
calculating the current service capability S[i] of each of the m edge computing nodes and the overall edge computing service capability ΣS[i] of the current system, where i is the serial number of the edge computing node, 0 < i ≤ m, and m is a positive integer;
the current service capability value of the i-th edge computing node is S[i] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × P[i], where p1, p2, p3 are weight coefficients with p1 + p2 + p3 = 1; P[i] denotes the current comprehensive index of the i-th edge computing node, P[i] = ΣFin, 0 < n ≤ N, where N is the total number of computing scenes and Fin is the index of the n-th computing scene supported by the i-th edge computing node;
the current service capability value of the i-th edge computing node for computing scene Fn is S[in] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × (P[i] · Fn);
when ΣS[i] is within the threshold, the n edge computing nodes with the largest S[i] are selected by sorting on S[i].
The invention quantifies the service capability of the edge computing node and ensures that the user task selects the optimal node to execute.
According to another preferred embodiment of the invention, the task execution times Ty of the different priority-level queues are the same, different, or not exactly the same. By setting the task execution time Ty, the execution time of tasks in different scenarios can be dynamically adjusted according to the task volume, improving efficiency.
According to a further preferred embodiment of the invention, the task execution time Ty of each priority-level queue is proportional to its priority level: a higher priority level gets a longer execution time, ensuring that high-priority tasks are executed first.
According to yet another preferred embodiment of the present invention, tasks nearer the head of the same priority-level queue are executed first; this first-come-first-served order improves service efficiency.
In order to achieve the above object of the present invention, according to a second aspect of the present invention, there is provided an edge computing scheduling system comprising a resource scheduling node and a plurality of edge computing nodes, the resource scheduling node receiving a plurality of computing tasks from a client and controlling the execution of each task according to the method of the present invention, the optimal n edge computing nodes being selected when a task executes.
The edge computing scheduling system of the invention keeps tasks of different priorities in different queues and executes them by polling, so no task is blocked for long and all tasks are guaranteed to execute normally.
In addition, the service capability of the edge computing node can be quantized, and the user task is guaranteed to select the optimal node to execute.
Drawings
FIG. 1 is a flow chart of an edge calculation scheduling algorithm in a preferred embodiment of the present invention.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention.
In the description of the present invention, it should be understood that the terms "longitudinal," "transverse," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like indicate orientations or positional relationships based on the orientation or positional relationships shown in the drawings, merely to facilitate describing the present invention and simplify the description, and do not indicate or imply that the devices or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and therefore should not be construed as limiting the present invention.
In the description of the present invention, unless otherwise specified and defined, it should be noted that the terms "mounted," "connected," and "coupled" are to be construed broadly, and may be, for example, mechanical or electrical, or may be in communication with each other between two elements, directly or indirectly through intermediaries, as would be understood by those skilled in the art, in view of the specific meaning of the terms described above.
The invention provides an edge computing scheduling algorithm which, as shown in FIG. 1, comprises the following steps:
the method comprises the steps of obtaining X computing tasks and determining the priority of each computing task, wherein X is a positive integer, for example, priority levels are set according to different task types (or computing scenes), for example, a plurality of computing scenes (including deep learning, reinforcement learning, countermeasure generation, internet of things, big data, cloud rendering, VASP and the like) are arranged, the priorities of the computing tasks can be set according to actual needs, for example, the deep learning, the reinforcement learning is set as a first level, the countermeasure generation, the Internet of things is set as a second level, the big data, the cloud rendering and the VASP are set as a third level, and the higher the priority of the technology is, the higher the priority is.
And distributing the X computing tasks into queues with different priority levels according to the priority levels to form Y ready queues with different priority levels, wherein Y is a positive integer greater than 1.
Allotting an execution time Ty to each queue of a different priority level, where Ty is the total execution time for tasks in the y-th queue, y = 1, 2, ..., Y; if a task has not finished executing within Ty, it is suspended and placed at the tail of its priority ready queue to queue again; and selecting the optimal n edge computing nodes whenever a task executes, where n is a positive integer.
In this embodiment, the method for selecting the optimal n edge computing nodes is as follows:
Let q[ij] denote the j-th task of the i-th queue; a task t = q[ij] needs to be processed on n of the m nodes (m > n).
Calculate the current service capability S[i] of each of the m edge computing nodes and the overall edge computing service capability ΣS[i] of the current system, where i is the serial number of the edge computing node, 0 < i ≤ m, and m is a positive integer.
The current service capability value of the i-th edge computing node is S[i] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × P[i], where p1, p2, p3 are weight coefficients with p1 + p2 + p3 = 1, for example p1 = p2 = p3 = 1/3. P[i] denotes the current comprehensive index of the i-th edge computing node, P[i] = ΣFin, 0 < n ≤ N, where N is the total number of computing scenes and Fin is the index of the n-th computing scene supported by the i-th edge computing node.
the edge computing nodes supporting different computing scenarios have different comprehensive indexes, and in general, the stronger the specialty, the lower the comprehensive index value. If the deep learning is f1=0.2, the reinforcement learning is f2=0.2, the generated antagonism is f3=0.15, the internet of things is f4=0.3, the big data is f5=0.35, the cloud rendering is f6=0.2, the vasp is f7=0.1, and if the ith node is supported by all application scenes, the node composite index is P [ i ] = Σfn (0 < n) =1.5.
The current service capability value of the i-th edge computing node for computing scene Fn is S[in] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × (P[i] · Fn).
When ΣS[i] is within the threshold, the n edge computing nodes with the largest S[i] are selected by sorting on S[i].
The invention quantifies the service capability of the edge computing node and ensures that the user task selects the optimal node to execute.
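The scoring and top-n selection can be sketched as follows; the field names, example hardware numbers, and equal weights p1 = p2 = p3 = 1/3 are assumptions for illustration, and the ΣS[i] threshold check is omitted:

```python
def service_capability(node, p=(1/3, 1/3, 1/3)):
    """S[i] = (cpu*p1 + gpu*p2 + mem*p3) * (1 - cur/max) * P[i]."""
    hardware = node["cpu"] * p[0] + node["gpu"] * p[1] + node["mem"] * p[2]
    headroom = 1 - node["cur_tasks"] / node["max_tasks"]   # load factor
    return hardware * headroom * node["P"]

def select_nodes(nodes, n):
    """Rank the m candidate nodes by S[i] and keep the top n."""
    return sorted(nodes, key=service_capability, reverse=True)[:n]

nodes = [
    {"name": "a", "cpu": 8, "gpu": 2, "mem": 16, "cur_tasks": 5, "max_tasks": 10, "P": 1.5},
    {"name": "b", "cpu": 16, "gpu": 4, "mem": 32, "cur_tasks": 9, "max_tasks": 10, "P": 1.0},
]
print([x["name"] for x in select_nodes(nodes, 1)])  # ['a']
```

Note how the load term dominates here: node "b" has twice the hardware, but at 90% load its S[i] falls below that of the half-loaded node "a".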
According to a preferred embodiment of the invention, the task execution times Ty of the different priority-level queues are the same, different, or not exactly the same. By setting the task execution time Ty, the execution time of tasks in different scenarios can be dynamically adjusted according to the task volume, improving efficiency. According to another preferred embodiment of the invention, the task execution time Ty of each priority-level queue is proportional to its priority level: a higher priority level gets a longer execution time, ensuring that high-priority tasks are executed first.
According to yet another preferred embodiment of the present invention, tasks nearer the head of the same priority-level queue are executed first; this first-come-first-served order improves service efficiency.
The invention also provides an edge computing scheduling system comprising a resource scheduling node and a plurality of edge computing nodes. The resource scheduling node receives a plurality of computing tasks from a client and controls the execution of each task according to the method of the invention, selecting the optimal n edge computing nodes when a task executes. The edge computing scheduling system of the invention keeps tasks of different priorities in different queues and executes them by polling, so no task is blocked for long and all tasks are guaranteed to execute normally. In addition, the service capability of the edge computing nodes can be quantified, ensuring that user tasks select the optimal nodes for execution.
In the description of the present specification, a description referring to terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
While embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that: many changes, modifications, substitutions and variations may be made to the embodiments without departing from the spirit and principles of the invention, the scope of which is defined by the claims and their equivalents.

Claims (6)

1. An edge computing scheduling algorithm, comprising the steps of:
acquiring X computing tasks and determining the priority of each computing task, wherein X is a positive integer;
the X computing tasks are distributed to queues with different priority levels according to the priority levels to form Y ready queues with different priority levels, wherein Y is a positive integer greater than 1;
the method comprises the steps of enabling queues with different preference levels to execute time Ty, wherein Ty is the total execution time of tasks in a Y-th queue, y=1, 2, … … and Y, if some tasks are not executed in Ty, suspending, putting the tasks into the tail of the priority ready queue for queuing again, and selecting optimal n edge computing nodes when each task is executed, wherein n is a positive integer;
the method for selecting the optimal n edge computing nodes comprises the following steps:
calculating the current service capability S[i] of each of the m edge computing nodes and the overall edge computing service capability ΣS[i] of the current system, where i is the serial number of the edge computing node, 0 < i ≤ m, and m is a positive integer;
the current service capability value of the i-th edge computing node is S[i] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × P[i], where p1, p2, p3 are weight coefficients with p1 + p2 + p3 = 1; P[i] denotes the current comprehensive index of the i-th edge computing node, P[i] = ΣFin, 0 < n ≤ N, where N is the total number of computing scenes and Fin is the index of the n-th computing scene supported by the i-th edge computing node;
the current service capability value of the i-th edge computing node for computing scene Fn is S[in] = (server CPU core count × p1 + server GPU core count × p2 + memory × p3) × (1 − current task count / maximum task count) × (P[i] · Fn);
when ΣS[i] is within the threshold, the n edge computing nodes with the largest S[i] are selected by sorting on S[i].
2. The edge computing scheduling algorithm of claim 1, wherein the task execution times Ty of different priority-level queues are the same, different, or not exactly the same.
3. The edge computing scheduling algorithm of claim 2, wherein the task execution time Ty of each priority-level queue is proportional to its priority level.
4. The edge computing scheduling algorithm of claim 1, wherein tasks nearer the head of the same priority-level queue are executed first.
5. The edge computing scheduling algorithm of claim 1, wherein the priority levels of computing tasks for different scenarios are different.
6. An edge computing scheduling system comprising a resource scheduling node and a plurality of edge computing nodes, the resource scheduling node receiving a plurality of computing tasks from a client and controlling the execution of each task according to the method of any one of claims 1 to 5, the optimal n edge computing nodes being selected when a task executes.
CN202010407602.4A 2020-05-14 2020-05-14 Edge calculation scheduling algorithm and system Active CN111597025B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010407602.4A CN111597025B (en) 2020-05-14 2020-05-14 Edge calculation scheduling algorithm and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010407602.4A CN111597025B (en) 2020-05-14 2020-05-14 Edge calculation scheduling algorithm and system

Publications (2)

Publication Number Publication Date
CN111597025A CN111597025A (en) 2020-08-28
CN111597025B true CN111597025B (en) 2024-02-09

Family

ID=72187358

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010407602.4A Active CN111597025B (en) 2020-05-14 2020-05-14 Edge calculation scheduling algorithm and system

Country Status (1)

Country Link
CN (1) CN111597025B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112148449B (en) * 2020-09-22 2024-06-04 行星算力(深圳)科技有限公司 Scheduling algorithm and system based on edge calculation for local area network
CN112148454A (en) * 2020-09-29 2020-12-29 行星算力(深圳)科技有限公司 Edge computing method supporting serial and parallel and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109684083A (en) * 2018-12-11 2019-04-26 北京工业大学 A kind of multilevel transaction schedule allocation strategy towards under edge-cloud isomery
CN110198278A (en) * 2019-04-15 2019-09-03 湖南大学 A kind of Lyapunov optimization method in car networking cloud and the scheduling of edge Joint Task
CN110570075A (en) * 2019-07-18 2019-12-13 北京邮电大学 Power business edge calculation task allocation method and device
CN110688213A (en) * 2018-07-05 2020-01-14 深圳先进技术研究院 Resource management method and system based on edge calculation and electronic equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110688213A (en) * 2018-07-05 2020-01-14 深圳先进技术研究院 Resource management method and system based on edge calculation and electronic equipment
CN109684083A (en) * 2018-12-11 2019-04-26 北京工业大学 A kind of multilevel transaction schedule allocation strategy towards under edge-cloud isomery
CN110198278A (en) * 2019-04-15 2019-09-03 湖南大学 A kind of Lyapunov optimization method in car networking cloud and the scheduling of edge Joint Task
CN110570075A (en) * 2019-07-18 2019-12-13 北京邮电大学 Power business edge calculation task allocation method and device

Also Published As

Publication number Publication date
CN111597025A (en) 2020-08-28

Similar Documents

Publication Publication Date Title
CN108965024B (en) Virtual network function scheduling method based on prediction for 5G network slice
US10474504B2 (en) Distributed node intra-group task scheduling method and system
CN104657221B (en) The more queue flood peak staggered regulation models and method of task based access control classification in a kind of cloud computing
CN111597043B (en) Full scene edge calculation method, device and system
CN111597025B (en) Edge calculation scheduling algorithm and system
CN111131421B (en) Method for interconnection and intercommunication of industrial internet field big data and cloud information
CN111400022A (en) Resource scheduling method and device and electronic equipment
CN112148454A (en) Edge computing method supporting serial and parallel and electronic equipment
WO2023124947A1 (en) Task processing method and apparatus, and related device
CN115629865B (en) Deep learning inference task scheduling method based on edge calculation
Saeidi et al. Determining the optimum time quantum value in round robin process scheduling method
Squillante et al. Threshold-based priority policies for parallel-server systems with affinity scheduling
CN115776524A (en) Internet of things mass data multistage scheduling transmission system for intelligent manufacturing
CN113986497B (en) Queue scheduling method, device and system based on multi-tenant technology
CN108170523A (en) A kind of Random Task sequence dispatching method of mobile cloud computing
CN112148449B (en) Scheduling algorithm and system based on edge calculation for local area network
Arefian et al. Delay reduction in MTC using SDN based offloading in Fog computing
CN117042047B (en) Resource allocation method, device, controller and system based on task priority
CN117056064A (en) Resource allocation method, device, server, storage medium and program product
CN106295117B (en) A kind of passive phased-array radar resource dynamic queuing management-control method
Stavrinides et al. Multi-criteria scheduling of complex workloads on distributed resources
CN107360483A (en) A kind of controller load-balancing algorithm for software definition optical-fiber network
CN117093342A (en) Task scheduling method and device for cloud platform, electronic equipment and storage medium
CN111538595B (en) MEC server task scheduling method based on task cutting
CN112217742B (en) Calculation migration method for cooperation between fog nodes

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant