CN112887345A

CN112887345A - Node load balancing scheduling method for edge computing environment

Info

Publication number: CN112887345A
Application number: CN201911196737.4A
Authority: CN
Inventors: 祝钰轩; 唐新怀
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2019-11-29
Filing date: 2019-11-29
Publication date: 2021-06-01

Abstract

A node load balancing scheduling method of an edge computing environment is disclosed, which comprises the steps of obtaining the CPU utilization rate and the I/O resource utilization rate of edge nodes around a base station according to the latest completion time of a task sent by a receiving terminal of the base station; calculating the obtained average load rate of the surrounding nodes, and calculating the load rate of each node; and putting the nodes with the load rates less than or equal to the average load rate into a first queue in a positive sequence mode, putting the nodes with the load rates greater than the average load rate into a second queue in a positive sequence mode, and finally putting the current task to the nodes of the first queue or the nodes of the second queue meeting the current task resources for execution. The invention screens the nodes, thereby avoiding the problem of unbalanced load caused by selecting the nearest node only by a common method; when the nodes are placed under the task, the adopted measurement index is the load rate rather than the load, so that the task can be distributed to the nodes with less resources under the condition of large resource difference among the nodes, and the load balance is really achieved.

Description

Node load balancing scheduling method for edge computing environment

Technical Field

The invention relates to a technology in the field of wireless communication, in particular to a node load balancing scheduling method of an edge computing environment.

Background

Edge computing is used as a novel distributed multi-deployment architecture, and compared with cloud computing, all terminals under the edge computing have similar distances and similar propagation delays from computing nodes. In the conventional task scheduling method, a node is selected from edge nodes which are close to a task initiating place as a node for processing the task, and factors such as propagation delay, transmission data volume, network environment, computing capacity of the node, latest completion time of the task and the like need to be considered while the node is selected. However, the scheduling method does not use the current load of the node as a node selection index, and due to the non-uniformity of terminal distribution, part of the nodes may be overloaded and part of the nodes may be idle. Because a single edge compute node has limited computational, network I/O, and storage resources, these resources should be utilized to the maximum extent possible during scheduling.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a node load balancing scheduling method for an edge computing environment, which can correctly distribute tasks according to the load rate of peripheral edge nodes in the edge computing environment.

The invention is realized by the following technical scheme:

the invention relates to a node load balancing scheduling method of an edge computing environment, which comprises the steps of calculating a propagation distance which can be reached before the task is completed at the latest according to the task type, the latest completion time, required computing resources and I/O resources; acquiring all node indexes within the distance that the task can propagate in the edge computing cluster monitoring system, computing the average value of the node indexes of all nodes, and computing the average load rate L according to the average value_aAnd calculating the load rate of the single node by the node indexes of the single node, respectively judging whether each node can load the current task and adaptively transferring the lower task to the corresponding node, thereby realizing balanced scheduling.

The invention relates to a system for realizing the method, which comprises the following steps: the task receiving module, the task allocation module, the task transfer module and the task recovery module, wherein: the task receiving module receives a task transmitted by a terminal through a network and distributes the task to the task distribution module, and the task distribution module calculates to obtain an optimal node for task transfer according to a distribution strategy and outputs task information and corresponding optimal node information to the task transfer module; the task transferring module transfers the task to the optimal node, and after the node finishes executing the task, the corresponding result information of the task is obtained and fed back to the task recycling module; and the task recovery module transmits the task result information back to the terminal through the network.

Technical effects

The invention integrally solves the problem of unbalanced load of the node server in the edge computing environment; the technical effects produced thereby include: the tasks are distributed according to the task type, the node load rate and other indexes, so that the possibility of overloading the server is greatly reduced, and the task execution efficiency is greatly improved.

Compared with the prior art, the method screens the nodes, and avoids the problem of load imbalance caused by selecting the nearest node only by a common method; when the nodes are placed under the task, the adopted measurement index is the load rate rather than the load, so that the task can be distributed to the nodes with less resources under the condition of large resource difference among the nodes, and the load balance is really achieved.

Drawings

FIG. 1 is a flow chart of a method of the present invention;

FIG. 2 is a flow chart of a discrimination of a node put in a queue;

fig. 3 is a flowchart of task selection drop node.

Detailed Description

As shown in fig. 1, this embodiment specifically includes the following steps:

step 1: and acquiring the task type, the latest completion time, the required computing resource and the I/O resource sent by the receiving terminal.

The task types comprise: compute intensive and I/O intensive, for the ith task, there is a computation coefficient c_iAnd I/O coefficient m_iWhen computationally intensive, then: c. C_i≥m_iWhen the I/O is intensive, then: m is_i≥c_iAnd c is and c_i+m_i＝2。

Step 2: the propagation distance that can be reached before the latest completion time of the task is calculated.

The propagation distance R ═ T_i-T_n) sv, wherein: v is the propagation velocity of electromagnetic wave in the optical fiber is 2.0 x 10⁵km/s，T_iFor the latest completion time of the task, T_nIs the current time, s(s)<1) Is the loss factor.

And step 3: and acquiring n node indexes within the distance which can be propagated by the task in the edge computing cluster monitoring system.

The propagation distance is particularly within a circle with a radius R around the base station.

And 4, step 4: and (3) calculating the average value of the node indexes of the n nodes obtained in the step (2), calculating the average load rate La according to the average value, wherein the load rate is used as a measurement index of the current load degree of the nodes, the load rate is determined by the current state of the nodes and the current task information, and tasks are distributed to correct nodes by comparing the load rate with the average load rate of the nodes in the radius R of the base station, so that load balance can be really achieved.

The node load rate is specifically determined according to the CPU utilization rate of the node, the I/O resource utilization rate of the node and the calculation coefficient c of the current task_iAnd I/O coefficient m_iDetermining:

wherein: n is a radical of_j(c) Denotes the CPU utilization, N, of the jth node_j(m) represents the I/O resource utilization of the jth node, c_iCoefficient of calculation, m, representing the current task_iRepresenting the I/O coefficient of the current task. The average load rate is composed of the CPU utilization rate of the node within the radius R by taking the base station receiving the current task as the center, the I/O resource utilization rate of the node and the calculation coefficient c of the current task_iAnd I/O coefficient m_iDetermining:

wherein:

N_j(c) denotes the CPU utilization, N, of the jth node_j(m) represents the I/O resource utilization of the jth node, c_iCoefficient of calculation, m, representing the current task_iI/O coefficient, C, representing the current task_aRepresenting the average CPU utilization, M, of a node_aRepresenting the average I/O resource usage of the node.

And 5: and (3) calculating the load rate of a single node j according to the node indexes of the single node j in the nodes obtained in the step (2), putting the nodes with the load rates smaller than or equal to the average load rate into a first queue in a positive sequence, and putting the other nodes into a second queue in a positive sequence.

The load rate of the single node j satisfies the following conditions:

wherein: n is a radical of_j(c) Denotes the CPU utilization, N, of the jth node_j(m) represents the I/O resource utilization of the jth node, c_iCoefficient of calculation, m, representing the current task_iRepresenting the I/O coefficient of the current task.

As shown in FIG. 2, the load rate L of a single node_jLess than or equal to the average load rate L_aInto a first queue Q1, the load rate L of a single node_jGreater than the average load rate L_aInto a second queue Q2.

Step 6: as shown in fig. 3, the nodes are sequentially taken out from the first queue, whether the node can load the current task is determined, when the node capable of loading the current task exists, the task is put to the node, and when the node capable of loading the current task does not exist, the node in the second queue is determined.

The judgment requires that: a is_i<a_jAnd b is_i<b_jWherein: a is_iRepresenting the computational resources required by the current task i, a_jRepresenting the remaining computational resources of node j, b_iIndicating the I/O resources required by the current task I, b_jRepresenting the remaining I/O resources of node j.

The node indexes are as follows: CPU utilization and node I/O resource usage.

In practical application, the situation that the current task can be loaded without nodes may be encountered, if the situation is encountered, the searching range of the nodes can be expanded, the nodes capable of loading the current task are searched, and then the task is put down, and the specific operation is as follows: expanding the propagation distance calculated in the step 2 by 2 times as follows: 2R, executing the step 2 to the step 6 again, if no node can load the current task, in order to avoid overlong time delay, the current task needs to be abandoned.

Compared with the prior art, the method reduces the number of idle nodes and the number of overload nodes in the edge computing environment, and because the method distributes the tasks according to the load rate and has a larger candidate range, the tasks can be uniformly distributed on the nodes around the base station receiving the tasks, so that the number of the idle nodes and the number of the overload nodes are reduced simultaneously. Due to the reduction of the number of the overload nodes, the task processing efficiency is improved, and the terminal user obtains better experience.

The foregoing embodiments may be modified in many different ways by those skilled in the art without departing from the spirit and scope of the invention, which is defined by the appended claims and all changes that come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims

1. A node load balancing scheduling method of an edge computing environment is characterized in that a propagation distance which can be reached before the task is completed at the latest is calculated according to the task type, the latest completion time, required computing resources and I/O resources; acquiring all node indexes within the distance that the task can propagate in the edge computing cluster monitoring system, computing the average value of the node indexes of all nodes, and computing the average load rate L according to the average value_aThe node indexes of the single nodes calculate the load rate of the single nodes, whether each node can load the current task or not is judged, and the adaptive lower task is transferred to the corresponding node, so that balanced scheduling is realized;

the node indexes are as follows: CPU utilization and node I/O resource utilization;

the task type packageComprises the following steps: compute intensive and I/O intensive, for the ith task, there is a computation coefficient c_iAnd I/O coefficient m_iWhen computationally intensive, then: c. C_i≥m_iWhen the I/O is intensive, then: m is_i≥c_iAnd c is_i+m_i＝2；

The load rate of each node meets the following conditions:

2. The method of claim 1, wherein said propagation distance R ═ (T)_i-T_n) sv, wherein: v is the propagation velocity of electromagnetic wave in the optical fiber is 2.0 x 10⁵km/s，T_iFor the latest completion time of the task, T_nS is the loss factor for the current time.

3. The method of claim 1, wherein the average load rate is

Wherein:

4. The method of claim 1, wherein said separate determinations are: according to the load rate of each node, putting the nodes with the load rate less than or equal to the average load rate into a first queue in a positive sequence mode, and putting the other nodes into a second queue in a positive sequence mode; traversing the nodes in sequence from the first queue, judging whether the nodes can load the current task, when the nodes capable of loading the current task exist, transferring the task to the nodes, when the nodes capable of loading the current task do not exist, judging in the nodes of the second queue, when the nodes capable of loading the current task still do not exist, expanding the searching range of the nodes to 2 times 2R of the original searching range and recalculating the propagation distance which can be reached before the latest completion time of the task, and when the nodes can not load the current task still, abandoning the current task until the nodes capable of loading the current task are found and then transferring the task.

5. The method of claim 1 or 4, wherein said determining requires that: a is_i<a_jAnd b is_i<b_jWherein: a is_iRepresenting the computational resources required by the current task i, a_jRepresenting the remaining computational resources of node j, b_iIndicating the I/O resources required by the current task I, b_jRepresenting the remaining I/O resources of node j.

6. A system for implementing the method of any preceding claim, comprising: the task receiving module, the task allocation module, the task transfer module and the task recovery module, wherein: the task receiving module receives a task transmitted by a terminal through a network and distributes the task to the task distribution module, and the task distribution module calculates to obtain an optimal node for task transfer according to a distribution strategy and outputs task information and corresponding optimal node information to the task transfer module; the task transferring module transfers the task to the optimal node, and after the node finishes executing the task, the corresponding result information of the task is obtained and fed back to the task recycling module; and the task recovery module transmits the task result information back to the terminal through the network.