CN113590335A

CN113590335A - Task load balancing method based on grouping and delay estimation in tree edge network

Info

Publication number: CN113590335A
Application number: CN202110916431.2A
Authority: CN
Inventors: 储诚贵; 叶保留; 陆桑璐
Original assignee: Nanjing University
Current assignee: Nanjing University
Priority date: 2021-08-11
Filing date: 2021-08-11
Publication date: 2021-11-02
Anticipated expiration: 2041-08-11
Also published as: CN113590335B

Abstract

The invention discloses a task load balancing method based on grouping and delay estimation in a tree-shaped edge network, which is a layered balancing strategy for executing efficient balancing on task loads of multi-edge servers. The method comprises the following steps: considering the constraints of two resources of calculation and bandwidth jointly, and formalizing the load balancing problem of the edge network into a nonlinear programming problem with linear constraint; based on the characteristics of a tree network, specific limits are applied to unloading decision and transmission bandwidth allocation, the original load balancing problem is decomposed into a plurality of subproblems, and a delay estimation function is designed to solve the subproblems; and (4) integrating the solutions of all the subproblems and analyzing the solution of the original load balancing problem.

Description

Task load balancing method based on grouping and delay estimation in tree edge network

Technical Field

The invention belongs to the field of edge calculation, and particularly relates to a hierarchical balancing method for efficiently balancing task loads of a multi-edge server in a tree-shaped edge network.

Background

With the rapid popularization of emerging technologies such as internet of things, artificial intelligence and virtual reality in daily life, terminal devices at the edge of a network are rapidly increasing compared with the terminal devices in the past, and according to the reckoning, various terminal devices which are globally connected to the internet by 2023 will reach 293 million, which is 3.6 times of the current population, while the number of terminals in 2018 is only 184 million, and the number of terminals in five years is increased by over 100 million. The cost of the new technology is also the higher convenience brought by the new technology, applications such as automatic driving, intelligent transportation, AR games and the like are highly dependent on analysis of high-density data such as videos, pictures, sounds and the like, and compared with the past applications which mainly depend on data such as texts, tables and the like, the requirements of emerging applications on data volume and calculation amount are remarkably increased, and the requirement on delay is stricter. Thus, the massive amount of terminals, massive amounts of data, and low latency are important features that service providers need to consider when providing computing services in a new era. In the past, cloud computing was the primary way to provide computing services for applications. In the traditional cloud computing, data from a terminal needs to be processed by a remote cloud server in a centralized mode, and in the new computing era, on one hand, as the data needs to be transmitted to the cloud server from the terminal in a long distance, intolerable long transmission delay is brought to emerging applications, and application response is slow; on the other hand, due to the explosion of the number of network edge terminal devices and the increase of data requirements of emerging applications, the bandwidth increase of a core network is far from keeping up with the increase speed of the data volume of the edge terminal devices, so that the current internet cannot realize a centralized computing mode of transmitting all terminal data to a cloud server for processing. To solve this problem, edge calculation is followed. Because the server is configured at the network edge closer to the terminal by the edge calculation, on one hand, the data transmission distance between the terminal and the server can be shortened, and thus, the low-delay calculation service can be provided for the terminal; on one hand, data transmission between the terminal and the edge server can complete task unloading of application without occupying core network bandwidth, so that the occupation of the core network bandwidth is greatly reduced, and the defects of cloud computing in a new application scene are overcome.

Therefore, in this new internet era, various computing-intensive and data-intensive terminal applications will be gradually created, popularized and popularized, thereby promoting the wide coverage of computing devices at the network edge and forming a wide edge computing environment. In the future, edge servers will be deployed, perhaps at each primary network access point, to provide low-latency computing services to end users in the vicinity. The edge network is formed by combining various access networks, and when the access points gather the flow from the terminal to the core network in various access networks such as 4G, 5G and family broadband, etc. which are mainstream at present, the interconnection among the devices is completed mainly by adopting a tree topology, so that edge servers which are densely arranged at the access points and all levels of switches which are connected with the edge servers form a tree network. In the future, tasks generated by various terminal applications will be offloaded and computing services obtained in such a tree network. Therefore, the research on the edge load balancing problem in the special network form and the design of a unique balancing strategy for the edge load balancing problem have important practical significance for reducing task delay and improving resource utilization rate.

In the existing related research aiming at the problem of edge load balancing, most of the work assumes that edge servers are interconnected through a network with a star topology or a mesh topology, and no related work is deeply researched by theoretical analysis and experiments for the problem of load balancing in a tree network. From the perspective of an application scenario, the star topology can reasonably describe the edge cloud constructed in a centralized manner or the interconnection among multiple servers in a single access network, and the mesh topology is suitable for the situations that the edge servers are sparsely deployed and have a wide distribution range. Both scenarios are for the current edge environment and do not substantially fully describe the tree join features that exist in future edge networks.

The problem of edge load balancing in a tree network is significantly different from the problem of load balancing in a star network or a mesh network. On one hand, compared with a star network with only two layers, the additional layers of the tree network not only increase the complexity of the offloading decision and path bandwidth allocation among servers, but also bring about the problem of load balancing among a plurality of star networks (i.e. access networks), namely the cross-domain offloading problem. Due to the fact that coupling exists between the star networks in transmission bandwidth distribution, the load balancing problem in the tree network cannot be converted into the load balancing problem in the star networks to be solved through a simple method, and the related research method cannot be applied to the method. On the other hand, although the mesh network is the most widely applicable, the load balancing problem defined based on the most general network structure is difficult to design an effective algorithm to solve quickly because of the lack of sufficient features.

Disclosure of Invention

The purpose of the invention is as follows: based on the defects, the invention provides a task load balancing method based on grouping and delay estimation in a tree-shaped edge network, which can effectively process the cross-domain unloading problem in the tree-shaped edge network, reduce the complexity of the overall load balancing problem of the network, compress the solution space of the problem and obviously improve the solving speed.

The technical scheme is as follows: in order to achieve the above purpose, the invention adopts the following technical scheme:

step S1, considering the constraint of two resources of calculation and bandwidth, formalizing the load balance problem of the edge network into a nonlinear programming problem with linear constraint, the objective of the problem is to minimize the calculation delay and the transmission delay in the edge network, and the solution of the problem is the unloading decision and the transmission bandwidth allocation between the edge servers;

step S2, based on the characteristics of the tree network, applying specific limits to the unloading decision and the transmission bandwidth allocation, decomposing the original load balancing problem into a plurality of subproblems, and designing a delay estimation function to solve the subproblems;

and step S3, the solutions of all the subproblems are integrated to analyze the solution of the original load balancing problem.

The step S2 includes:

step S21, according to the minimum principle of resource nonuniformity, sub-nodes owned by nodes with the degree greater than 2 in the tree edge network are regrouped, and the original tree network is converted into a binary tree;

and step S22, starting from the root node of the binary tree, sequentially constructing and solving the load balancing subproblems corresponding to each node according to the sequence of first root traversal.

The step S21 includes:

step S21a, selecting nodes with degree greater than 2 in the tree network in sequence, grouping the subnodes owned by the nodes according to the principle of minimizing the unevenness of resources, wherein the number of the subnodes owned by each group is not more than 2, creating a new Node for each group, and connecting the corresponding subnodes in the group to the new Node;

step S21b, if the new Node number is equal to 2, the newly created Node is directly connected to the father Node, if the new Node number is greater than 2, the newly created Node is continuously grouped and a new Node is created for the new group, and until the new Node number in the last grouping is equal to 2, the two nodes are connected to the father Node.

The step S22 includes:

step S22a, according to the resource information owned by the child node of the current node and the solution of the load balancing child problem corresponding to the parent node, generating the resource constraint of the child problem corresponding to the node;

step S22b, constructing an objective function based on delay estimation by using a delay estimation function;

s22c, solving a feasible solution with the minimum objective function value under the resource constraint condition by using a sequence least square planning algorithm;

and S22d, finding the next non-leaf node according to the sequence of the first root traversal, if so, returning to S22a, and if not, ending the step S22.

The step S3 includes:

step S31, for each subproblem, according to the quantitative relation between the solution of the subproblem and the solution of the original problem, constructing a linear equation set which takes partial solution of the original problem as variable and the solution of the subproblem as parameter;

and step S32, solving the linear equation system, and filling the obtained solution into the solution of the original problem.

Has the advantages that: on one hand, the invention utilizes the special property of the tree network to layer and group the edge tree and decomposes the original problem into a plurality of small-scale sub-problems, thereby obviously compressing the size of the solution space of the edge load balancing problem in the tree network and reducing the complexity of the problem solution; on the other hand, the method of delay estimation is used, so that the dependence of the upper-layer subproblem in the edge tree on the lower-layer subproblem is avoided, and the complexity of subproblem solving is further reduced. The invention effectively solves the problem that the layering balancing strategy effect is not obvious when the degree of the tree network is too large, and has positive application significance for cross-domain unloading of the tree edge network.

Drawings

FIG. 1 is an exemplary diagram of a tree edge network in an embodiment of the present invention;

FIG. 2 is a flow chart of a task load balancing method based on packet and delay estimation according to an embodiment of the present invention;

FIG. 3 is a diagram illustrating the relationship between domains and their subdomains in an embodiment of the present invention;

FIG. 4 is an exemplary diagram of sub-domain regrouping for a domain in an embodiment of the present invention;

FIG. 5 is a graph comparing the optimization results of the present method with other numerical methods;

FIG. 6 is a comparison between the time taken to solve the problem of the present method and other numerical methods.

Detailed Description

The technical scheme of the invention is further explained by combining the attached drawings.

The tree edge network is a network composed of at least three layers of switching equipment, computing equipment and connection, and is marked as G_treeThe method is called as an edge tree for short, wherein leaf nodes of the tree are all computing devices and are called as edge servers; the non-leaf nodes are all switching devices; the connections between nodes represent round-trip links between devices. An example of a tree edge network is shown in fig. 1. For the convenience of representation, the edge server or switching device is denoted s_i(i ═ 1, …, m), m is the number of edge servers and switching devices, and the amount of computing resources owned by each device is denoted c_i. When s is_iWhen it is a switching device, c _i0. Device s_iAnd s_jThe connection between is denoted by e_i,jThe bandwidth it has is denoted b_i,j。

Since each edge server is deployed at a network access point, the tasks uploaded by the user to the network access point will pass through the edge server. Invention falseSet end user transmission to edge server s_iThe task amount compliance parameter is alpha_iPoisson distribution, wherein_iThe physical meaning of (c) is the upload rate of the task. The calculation amount of the uploading task of the user and the data amount are random, the random variable has the expectation and the standard deviation or the variance, the calculation amount is expressed by the random variable h, and the expectation is beta^hStandard deviation is σ^h(ii) a The data volume of a task is represented by a random variable d whose expectation is beta^dStandard deviation is σ^d。

G_treeThe task load balancing problem in (1) is a nonlinear programming problem with linear constraint, and when a general nonlinear programming algorithm is adopted for solving, the time complexity of the problem is at least O (n) theoretically⁶) And the test shows an exponential grade.

The invention provides a hierarchical load balancing method based on grouping and delay estimation by utilizing the characteristics of a tree network, which is realized by combining G_treeThe whole load balancing problem in (1) is converted into a load balancing sub-problem in a plurality of domains, so that the load balancing efficiency is improved.

Referring to fig. 2, a hierarchical load balancing method based on packet and delay estimation in a tree edge network includes the following steps:

The following describes a specific implementation of the steps of the method.

Firstly, toG_treeThe domains and the relationships between the domains are defined as follows.

Definition 1: g_treeOne field g in_kIs G_treeA sub-tree corresponding to one of the nodes.

This definition also means that G is the root node when the node corresponding to the domain is the root node_treeWhich may be referred to as a domain, G_treeThe edge servers in as leaf nodes are also referred to as domains.

In addition, a subdomain of the domain needs to be defined.

Definition 2: field g_kAt G_treeThe sub-tree of the tree corresponding to (1) is called g_kIs determined. g_vIs g_kIs denoted as g_v∈g_k。

It should be noted that, in the case of no ambiguity, the present invention expresses an edge server s_jBelong to g_kWhen it comes to s_j∈g_kThe expression means s_jIs g_kThe leaf nodes of the corresponding subtrees.

G_treeEach domain or node in the set corresponds to a load balancing sub-problem, but the sub-problems corresponding to leaf nodes are trivial and do not need to be solved. So the number of sub-problems that need to be solved is G_treeNumber of non-leaf nodes in the leaf. The solution between sub-problems is now coupled. When g is_vIs g_kWhen the sub-field of (g)_kNeed to know g_vMinimum delay expectation; g_vIs required to be dependent on g_kAs a parameter. This coupling can lead to a complex solution. The present invention uses a method of delay estimation that eliminates the dependency of the solution of a domain on the solution of its subdomain. When field g_kNeeds to know its subfield g_vUsing the formula

Direct calculation without the need for recursive solution of individual nonlinear programming problems. The parameters in the formula are explained below.

Edge tree G_treeOne field g in_kIf it is not a leaf node, the external input task quantity is set to be gamma^kAnd the outward output task quantity is theta^kBandwidth occupied by external communication

The objective function of the delay-optimized oriented edge load balancing sub-problem in this domain can be expressed as:

in the expression, the summation on the left side is expected to calculate the delay, the summation on the right side is expected to transmit the delay, and the calculation modes of all symbols are respectively

Wherein gamma is_vIs g_vAmount of externally input task accepted, θ_vIs g_vThe amount of the task output to the outside,

and

are respectively g_kAnd (3) offloading decisions and transmission bandwidth allocation between sub-domains. FIG. 3 shows a field g₀An example of a numerical relationship with its parent and child domains. g₀Input task amount and input bandwidth Γ⁰,

And output workload and output bandwidth Θ⁰,

Are all determined by the resolution of its parent domain, while its child domain g₃Input task amount and input bandwidth Γ³,

And output workload and output bandwidth Θ³,

Are all g of it₀To solve, sub-field g₁And g₂The expectation of the task transmission delay between is also g₀The solution of (2).

In addition, the optimization problem satisfies some necessary constraints as long as g is satisfied_kNot the leaf node of the edge tree, then for all g_u,g_v∈g_kThe following constraints hold:

representing the external tasks received by all sub-domains within the domain and the total input tasks equal to the domain;

representing the output tasks of all sub-domains within the domain and the total output task equal to the domain;

the total amount of tasks required to be unloaded by each subdomain is equal to the total amount of tasks generated by the subdomain;

the total amount of tasks accepted by each subdomain cannot exceed the amount of tasks which can be processed per unit time;

the bandwidth of each path representing inter-domain unloading cannot be lower than the data quantity of a unit task multiplied by the task unloading rate;

indicating that the sum of the bandwidth of the paths carried on each connection in the domain cannot exceed the bandwidth of this path.

γ_v≥0

θ_v≥0

Indicating that all offload decisions cannot be negative. Wherein

The objective function in the above optimization problem is an abstract form, and needs to be according to the field g_kThe identity of which embodies it. If field g_kWhen there is a grandchild node, i.e., its subdomain is not an edge server, there are

Wherein

And

are the delay estimation functions defined in the present invention, and they are herein separately denoted

Wherein

m_vIs g_vNumber of middle edge servers, σ^hIs the standard deviation of the task computation h.

α_jIs an edge server s_jThe number of tasks generated in the unit time is increased;

is field g_kThe total load of the medium tasks, that is, the sum of the tasks received by all edge servers in a unit time domain:

wherein λ_jIs an edge server s_jThe sum of the task quantities to be processed received per unit time;

e is field g_kConnection of (b)_eIs the amount of bandwidth owned by connection e.

And if intermediate node g_kWithout grandchild node, then

At this point, its subdomains are all leaf nodes, representing edge servers.

The original problem can be decomposed into a plurality of sub-problems with smaller scale, thereby reducing the solving complexity of the problem. However, it is also noted during the study that the size of each sub-question is determined by the degree of its corresponding non-leaf node. When non-leaf nodes with large degrees exist in one edge tree, the hierarchical load balancing cannot well reduce the complexity of problem solving. To solve this drawback, it is necessary to decompose the non-leaf node with a large degree into a plurality of non-leaf nodes with a small degree, such as nodes with a degree of 2, so as to further decompose the problem, so that the size of each sub-problem of the edge tree converted after decomposing its nodes does not exceed 2, regardless of the structure of the edge tree.

The splitting operation of the nodes (otherwise known as a sub-domain grouping operation) is quite simple in nature. When the size of each subproblem after decomposition is required not to exceed r, all nodes with degrees larger than r, namely domains, are decomposed. When aiming at a domain g_kWhen decomposing, dividing the subdomain set of the system into r groups (the number of the last group is less than r), creating a virtual domain for each group, and connecting the domains in the group to the virtual domain; and if the number of the virtual domains is still larger than r, continuously grouping the virtual domains, recreating the virtual domains for the group formed by the virtual domains, and placing the virtual domains in the group under the newly-built virtual domain. Repeating the steps until the number of the top-level virtual domains does not exceed r, and connecting the top-level virtual domains to g_kBelow as its subdomain. Thus, the original edge tree is changed into a tree with more layers and smaller degrees. At this time, the hierarchical equalization method is used for the transformed edge tree, so that the complexity of the problem can be further reduced. Referring to the example of FIG. 4, by comparing g₁The 4 sub-domains owned are divided equally into two groups and two virtual sub-domains g are created for them₆、g₇Finally result in g₁Increases by 1 and decreases from 4 to 2.

The grouping needs to follow the principle of minimizing resource non-uniformity, which is defined as the resource non-uniformity of the edge server in a domain

Wherein m is_kIs domain g_kThe number of medium edge servers.

The cross-domain offload decision and the transmission bandwidth allocation obtained at this time are domain-level, and the original problem is solved by the offload decision and the transmission bandwidth allocation between the edge servers. The solution of the sub-problem needs to be converted into the solution of the original problem. Since the solution space of the subproblem is smaller than that of the original problem, one solution of the subproblem usually corresponds to multiple solutions of the original problem, and if the solution of the original problem is to be obtained, the obtained solution needs to be correspondingly converted. A possible transformation is proposed, which requires solving a linear system of equations.

If an offload decision between edge servers in two domains is specified, uniformly from the edge server in the origin domain, and uniformly offloaded to the edge server in the target domain, then the offload decision between the edge servers satisfies the equation set:

a_i,jrefers to a unit time slave edge server s_iOffloading to edge servers s_jIs measured. Solving the system of linear equations to obtain g_u,g_vOffload decisions between the middle edge servers.

For any two different domains g_u,g_vFor any s_i∈g_u、s_j∈g_vThe following constraints apply:

the corresponding transmission bandwidth allocation can be obtained.

In the edge network, the delay experienced between the issuance of a task from a terminal to the reception of the execution result from the edge server is mainly composed of its computation delay on the edge server and its transmission delay in the edge network. The present invention is concerned with the desire to minimize the total delay of all tasks in the edge network. Since the computation delay expectation of the task on one edge server is only related to the task load of the server, the method has no relation to the transmission bandwidth allocation and the unloading decision on other edge servers. In a domain (access network) composed of a plurality of edge servers, the expectation of the total task computation delay is only related to the task load of the plurality of edge servers. This means that when the total load of a task in a domain is determined, the optimal offloading decision in the domain has been completely determined from the point of view of computational delay optimization alone. Therefore, in the tree network, the optimization of the computation delay in different domains, i.e. different subtrees, is irrelevant, so that the optimal offloading decision for the computation delay optimization of the whole tree network can be formed by the optimal offloading decision for the computation delay optimization in each domain. Therefore, from the perspective of algorithm design, it is considered that the optimization problem of computation delay in the tree network has an "optimal substructure".

On the other hand, although optimization for computational delay has an optimal substructure, optimization for propagation delay does not have such good properties. This is because the impact of the allocation of transmission bandwidth is cross-domain. When a path is allocated a certain bandwidth, it means that all connections on the path are allocated this bandwidth to the path. When optimizing the transmission delay of a task in a domain, the transmission bandwidth of a path connected in the domain needs to be adjusted, so that the influence of transmission bandwidth allocation can be propagated to another domain through a path across domains, resulting in a change in available bandwidth of connections in other domains, and thus the optimization of the transmission delay does not have the optimal substructure for calculating the delay optimization. However, since in the tree network, the total communication bandwidth between any two domains is equal to the communication bandwidth between the switching nodes in the two domains, this means that as long as the communication bandwidth on the path between the switching nodes in the two domains is fixed, the total communication bandwidth between the two domains is fixed; and then the cross-domain path between the two domains can be further defined to halve the total communication bandwidth according to the data volume of the tasks carried by the two domains. At this time, the bandwidth on the cross-domain offload path is fixed, and the optimization of the transmission delay in different domains is independent of each other. Under the condition, the original problem can be decomposed into a subproblem, and a corresponding delay estimation function is designed, so that the complexity of solving the load balancing problem is reduced.

The performance of the process of the invention is verified by tests below. The hierarchical load balancing algorithm of the invention is denoted as the HIER algorithm. The two histograms of fig. 5 are a comparison of the performance of the HIER algorithm with the numerical algorithms slsrqp, PVI and SEP, where the mean and minimum values of the solutions of each algorithm are obtained statistically after 20 repeated solutions to the problem. Wherein the SLQP algorithm is a sequential least squares programming method; the PVI algorithm is a partial variable iteration algorithm that optimizes the total latency expectation of the task by updating the offload decisions first, then updating the transmission bandwidth allocation, then continuing to update the offload decisions, … so that the update is repeated; the SEP algorithm is a split iteration algorithm, which is an algorithm that optimizes separately the expectation of the total transmission delay of the task from the expectation of the computation delay. From the comparison result of the minimum value of the solution, it can be seen that the HIER algorithm has a slightly worse solving effect than slslqp and PVI when the problem scale is 4, but the best solving effect is not lower than the effects of slqp and PVI when the problem scale is larger. From the comparison result of the mean values of the solutions, the SLSLQP algorithm has the best mean effect when the problem scale is small, and as the problem scale is large, the mean value of the solution is obviously higher than that of PVI and HIER due to unstable solution, and the mean values of the solutions of PVI and HIER are very close to each other under various scales, so that the solutions have the same stability and mean solution effect.

FIG. 6 illustrates a comparison of the solution time consumption of the HIER, PVI and SLSLSLQP algorithms. Wherein the abscissa of the coordinate system is the number of servers, i.e., the scale of the problem; each point in the graph is the time consumption of solving the corresponding algorithm under the specified problem scale, and the lines are regression approximations for displaying the increasing trend of the corresponding algorithm. As can be seen from the figure, the time consumption of the SLSLSLQP and PVI algorithms increases exponentially with the problem size, while the time consumption of the HIER increases linearly. At problem size 2, the solution time of all three algorithms is in the range of tens to hundreds of milliseconds, while when the problem size increases to 13, the solution time of the slslsrp and PVI algorithms already reaches the order of one thousand seconds, while the time of one HIER solution is still between 1 and 10 seconds. HIER exhibits a very significant efficiency advantage. On one hand, the original problem can be decomposed into numerous small-scale sub-problems only by layering and grouping the edge trees by utilizing the special properties of the tree network, so that the size of the learning space is obviously compressed, and the complexity of problem solving is reduced; on the other hand, the delay estimation method is used, so that the dependence of the upper-layer subproblem in the edge tree on the lower-layer subproblem is avoided, and the complexity of solving the subproblem is further reduced.

Claims

1. A task load balancing method based on grouping and delay estimation in a tree edge network is characterized by comprising the following steps:

step S1, considering the constraint of two resources of calculation and bandwidth, formalizing the load balance problem of the edge network into a nonlinear programming problem with linear constraint, the objective of the problem is to minimize the calculation delay and the transmission delay in the edge network, and the solution of the problem is the unloading decision and the transmission bandwidth allocation between the nodes in the edge network;

2. The method for task load balancing based on packet and delay estimation in tree edge network according to claim 1, wherein said step S2 includes:

3. The method for task load balancing based on packet and delay estimation in tree edge network according to claim 2, wherein said step S21 includes:

4. The method for task load balancing based on grouping and delay estimation in tree edge network as claimed in claim 3, wherein the resource non-uniformity is defined as:

wherein s is_jRepresenting the jth node in the tree edge network; let G be the edge tree containing nodes and connections between nodes_tree，g_kRepresents G_treeThe domain is a subtree corresponding to a node in the edge tree; s_j∈g_kRepresentation node s_jBelongs to the domain g_k；

Wherein the content of the first and second substances,

c_jis node s_jAmount of computing resources owned, β^hRepresenting the expectation of the calculated amount h of the task,

is field g_kSum of the amount of tasks that can be processed per unit time, m, for all nodes in the system_vIs the sub-field g_vNumber of nodes, g_vIs g_kA sub-field of (1), the sub-field being field g_kAt the edge tree G_treeSub-tree of the corresponding tree in (1), m_kRepresents the field g_kThe number of nodal points.

5. The method for task load balancing based on packet and delay estimation in tree edge network according to claim 2, wherein said step S22 includes:

6. The method for task load balancing based on grouping and delay estimation in tree edge network as claimed in claim 5, wherein the objective function of the sub-problem of load balancing is:

in the formula, s_jRepresenting the jth node in the tree edge network; let G be the edge tree containing nodes and connections between nodes_tree，g_kRepresents G_treeThe domain is a subtree corresponding to a node in the edge tree; g_vIs g_kA sub-field of (1), the sub-field being field g_kAt the edge tree G_treeA sub-tree of the corresponding tree in (1); s_j∈g_vRepresentation node s_jBelonging to sub-domain g_v；

Wherein the content of the first and second substances,

cj is node s_jAmount of computing resources owned, β^hRepresenting the expectation of the calculated amount h of the task, beta^dExpectation of data quantity d representing task, m_vIs g_vThe number of the nodes is counted,

and

each represents g_vInput bandwidth and output bandwidth of;

Ψ is a shorthand for the following equation:

wherein sigma^hRepresenting the standard deviation of the task calculation amount h;

α_jis node s_jThe number of tasks generated in the unit time is increased;

is field g_kThe total load of the medium tasks, namely the sum of the task quantities received by all nodes in a unit time domain:

wherein λ_jIs node s_jThe sum of the task quantities to be processed received per unit time;

7. The method for task load balancing based on packet and delay estimation in tree edge network according to claim 1, wherein said step S3 includes:

8. The method for task load balancing based on grouping and delay estimation in tree edge network as claimed in claim 7, wherein the linear equation set in the step S31 includes:

the system of offload decision equations between nodes:

s_jrepresenting the jth node in the tree edge network; let G be the edge tree containing nodes and connections between nodes_tree，g_kRepresents G_treeThe domain is a subtree corresponding to a node in the edge tree; g_v、g_uIs g_kA sub-field of (1), the sub-field being field g_kAt the edge tree G_treeA sub-tree of the corresponding tree in (1); s_i∈g_uRepresentation node s_iBelonging to sub-domain g_u，s_j∈g_vRepresentation node s_jBelonging to sub-domain g_v；a_i，jRefers to a slave node s per unit time_iOffloading to node s_jThe average of the task amount of (a);

γ_jis the sub-field g_vAccepted external input task variables, Γ^vIs the sub-field g_vThe amount of the input task of (a),

representing the slave sub-field g_uTo subfield g_vAn offload decision; theta_iIs g_uThe amount of task, Θ, output outward^uIs the sub-field g_uThe output task amount of (1);

system of transmission bandwidth equations between nodes:

is a slave node s_iTo node s_jThe transmission bandwidth of (a) is,

is a slave node s_jTo node s_iThe transmission bandwidth of (a) is,

is from subfield g_vTo subfield g_uThe decision to unload(s) of (c),

is the sub-field g_uTo subfield g_vThe allocation of the transmission bandwidth of (a),

is the sub-field g_vTo subfield g_uThe transmission bandwidth allocation of (1).