CN113791895A - Edge calculation and resource optimization method based on federal learning - Google Patents



Publication number
CN113791895A
Authority
CN
China
Prior art keywords
user
edge
model
users
training
Prior art date
Legal status
Pending
Application number
CN202110958468.1A
Other languages
Chinese (zh)
Inventor
孙艳华
乔兰
张延华
孙恩昌
杨睿哲
李萌
Current Assignee
Beijing University of Technology
Original Assignee
Beijing University of Technology
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202110958468.1A
Publication of CN113791895A
Legal status: Pending

Classifications

    • G06F 9/5072: Grid computing (allocation of resources, e.g. of the central processing unit [CPU], G06F 9/50)
    • G06F 17/16: Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06F 18/214: Generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06N 20/20: Ensemble learning (machine learning, G06N 20/00)
    • Y02D 10/00: Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention discloses an edge computing and resource optimization method based on federated learning. An antenna array is deployed at a small base station; the channel information of the downlink and the channel information processed by a precoding technique are obtained, and channel/precoding input-output pairs are formed as training data. Federated learning is carried out on these data, i.e. the model is trained at the node side, so that, finally, inputting channel information yields the corresponding precoding information. In this process, to obtain a stable learning federation and keep the system energy consumption at its lowest, user selection is performed: users with stable computing and communication capability, judged by the physical characteristics of each node, are chosen to participate in training. To encourage users to participate actively, a contract mechanism is introduced to reward the participating users; each user's income and training cost are computed to obtain a utility function, and resources are allocated to the users so that the utility of the whole system is maximized.

Description

Edge calculation and resource optimization method based on federal learning
Technical Field
The invention belongs to the fields of communication and federated learning. It combines MIMO precoding with machine learning, applies the combination to edge computing, and makes the federated learning structure more stable and efficient by means of user selection and resource optimization.
Background
Massive Multiple-Input Multiple-Output (MIMO) deploys a large-scale antenna array at the base station; compared with traditional MIMO, it can effectively resist interference between different users and significantly improve system capacity. In a massive MIMO system, precoding is a crucial signal-processing technique in the downlink: using the Channel State Information (CSI) available at the transmitter, it converts the modulated symbol stream into a data stream adapted to the current channel, concentrating signal energy near the target user, effectively combating attenuation and loss, and improving system performance. Concretely, the downlink transmitter uses the CSI to preprocess the transmitted signal so that interference among different users and antennas is minimized and signal energy is concentrated near the target user; the receiver then obtains a better signal-to-noise ratio (SNR) and the channel capacity of the system is improved [1].
Under a machine-learning workload, the traditional centralized approach sends the data collected at the edge nodes to a server for processing, and the server then distributes data to the nodes for training, which greatly reduces data privacy. To protect the data, federated learning was proposed and has been applied successfully in intelligent applications: the nodes do not share their data and train only locally. To make this feasible, each node trains a weight vector on its own data and sends it to the server; the server acts as a collector, gathers the weights trained by all nodes, processes them accordingly, and broadcasts the processed weights back to every node. After receiving the latest weights, each node updates its model and uses them in the next iteration; this loops until the model is trained to its best.
A traditional system architecture consists of a mobile-node side and a base-station side [2]. The limited computing and communication capabilities of mobile devices create two major bottlenecks for machine learning: communication overhead and energy overhead. Mobile Edge Computing (MEC) is considered a promising distributed computing paradigm that supports many emerging intelligent applications of the 5G era, such as video streaming, smart cities, and augmented reality. Following the idea of offloading delay-sensitive and compute-intensive tasks from distributed mobile devices to nearby edge servers, providing real-time response and high energy efficiency, a hierarchical federated edge learning framework is introduced, in which edge servers, typically deployed together with base stations, act as intermediaries between mobile devices and the cloud and can aggregate at the edge the local models transmitted from nearby devices. Each local device reaches a given learning accuracy, and global aggregation is carried out over the edge links to update the model. A reduction in communication and energy costs is achieved in this manner [3], leading to a three-tier communication system architecture.
[1] Zhang Yu, Zhao Xiongwen. Precoding techniques in millimeter-wave massive MIMO systems [J]. ZTE Technology Journal, 2018, 24(3): 26-31.
[2] Elbir A M, Coleri S. Federated Learning for Hybrid Beamforming in mm-Wave Massive MIMO [J]. IEEE Communications Letters, 2020, PP(99): 1-1.
[3] S. Luo, X. Chen, Q. Wu, Z. Zhou and S. Yu, "HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning," IEEE Transactions on Wireless Communications, vol. 19, no. 10, pp. 6535-6548, Oct. 2020, doi: 10.1109/TWC.2020.3003744.
Disclosure of Invention
The invention aims to introduce MEC into a communication architecture for federated learning, forming a three-tier structure of macro base station (cloud server), small base stations (edge servers), and node users, and to allocate communication resources to the users in this federated-learning setting so that the whole system achieves maximum utility. At the same time, MIMO precoding is linked with the federated-learning training data so that the corresponding precoding can be predicted when the channel data are known. First, an antenna array is deployed at the small base station; the downlink channel information and the channel information processed by precoding are acquired, and channel/precoding input-output pairs are formed as training data. Federated learning is then performed on these data, i.e. the model is trained at the node side, so that inputting channel information finally yields the corresponding precoding information. In this process, to obtain a stable learning federation and keep the system energy consumption at its lowest, user selection is performed: users with stable computing and communication capability, judged by the physical characteristics of each node, are chosen to participate in training. Meanwhile, to encourage users to participate actively, a contract mechanism is introduced to reward the participating users: each user's income and training cost are computed to obtain a utility function (utility = income - cost), and resources are allocated to the users so that the utility of the whole system is maximized.
In order to solve the above problems, the invention adopts the following technical scheme. The federated-learning-based edge computing and resource optimization method comprises the following steps:
step 1: establishing a system model and deploying arrays according to users, edge servers and base stations
The invention relates to a multi-user millimeter wave MIMO three-layer structure system structure, wherein the first layer is a user layer and consists of N single-antenna user devices. The second layer is the ES layer (edge server layer) containing K layers with STAn edge server for the antenna. The third layer is a CS layer (cloud server layer), and has a cloud server S with a single antenna. Wherein, the user layer and the ES layer communicate through a wireless channel, and the ES layer and the CS layer are connected through a backhaul link, having a sufficient capacity.
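The three-tier topology of step 1 can be sketched as plain data structures. This is only an illustrative sketch; the class and field names (`User`, `EdgeServer`, `CloudServer`, `d_k`) are assumptions for exposition, not part of the invention.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class User:
    """Tier 1: single-antenna user device."""
    uid: int
    x: float  # position in metres
    y: float

@dataclass
class EdgeServer:
    """Tier 2: edge server with S_T antennas, serving a disc of radius d_k."""
    sid: int
    x: float
    y: float
    d_k: float                                   # communication radius
    users: List[User] = field(default_factory=list)

@dataclass
class CloudServer:
    """Tier 3: single-antenna cloud server, reached over the backhaul link."""
    edges: List[EdgeServer] = field(default_factory=list)
```

A user is attached to at most one edge server, reflecting the assumption below that the servers' circular communication areas do not overlap.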
Step 2: first select, according to distance, the set of users communicating with each edge server, denoted $\mathcal{N}_1 \subseteq \{1, 2, \ldots, N\}$, where N is the total number of users and $|\mathcal{N}_1|$ is the number of users retained after the distance-based selection.
Specifically, the user devices are assumed to follow a Poisson distribution, the communication range of each edge server is a circle of radius $d_k$, and in this three-tier structure the circular communication areas of the edge servers are assumed not to overlap. The users at shorter distance are then selected according to the distance from each user to the edge server: the communication quality between a user and a server within communication range is better, and stable communication is a necessary condition for successful federated learning.
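The distance rule of step 2 can be sketched as follows (a minimal sketch; the function name and tuple layout are assumptions). In the patent's setting the user positions would come from a Poisson point process; any point set works here.

```python
import math

def select_by_distance(users, server_xy, d_k):
    """Keep the users within communication radius d_k of the edge server.

    `users` is a list of (uid, x, y) tuples; `server_xy` is the server's
    (x, y) position. Stable links inside the radius are the precondition
    imposed before federated training starts.
    """
    sx, sy = server_xy
    return [uid for uid, x, y in users if math.hypot(x - sx, y - sy) <= d_k]
```

The retained ids form the first-selection set $\mathcal{N}_1$ for that server.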
Step 3: acquire the training data.
The channel model is a line-of-sight (LOS) model with L paths, and the millimeter-wave channel vector of the n-th user is expressed as

$$h_n = \sqrt{\frac{S_T}{L}} \sum_{l=1}^{L} \alpha_{n,l}\, a(\theta_{n,l}),$$

where $\alpha_{n,l}$ is the complex channel gain of the l-th path, $\theta_{n,l}$ is the direction angle of the downlink transmission path, and $a(\theta_{n,l})$ is the $S_T \times 1$ steering vector, whose m-th element is

$$[a(\theta_{n,l})]_m = e^{j\frac{2\pi}{\lambda}(m-1)\sigma \sin\theta_{n,l}}, \quad m = 1, \ldots, S_T.$$

Here λ is the wavelength and σ is the spacing between two adjacent antennas, with σ = λ/2.
First, the transmission direction angle $\theta_n$ of the user is generated within the range $[-\pi/2, \pi/2]$; the direction angles of the L paths are then generated randomly within a fluctuation range of ±5° around $\theta_n$, and $h_n$ is computed from the channel matrix formula. The angle interval $[-\pi/2, \pi/2]$ is divided equally into Q non-overlapping parts, each subinterval of width $\pi/Q$, and the subintervals are labeled starting from 1. The label of the subinterval into which the user's transmission direction angle falls is denoted q, thereby producing the $(h_n, q)$ data pairs. The final radio-frequency precoding $f_{RF,n}$ can be expressed as $f_{RF,n} = a(\bar{\theta}_q)$, where $\bar{\theta}_q$ is the midpoint of subinterval q.
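The data-generation recipe of step 3 can be sketched as below, assuming the standard uniform-linear-array steering vector with σ = λ/2 (the m-th element then reduces to exp(jπ(m-1)sinθ)). Function names, default sizes, and the unit-variance complex Gaussian path gains are illustrative assumptions.

```python
import cmath
import math
import random

def steering_vector(theta, S_T):
    """S_T x 1 steering vector; with sigma = lambda/2 the (m+1)-th element
    is exp(j * pi * m * sin(theta)) for m = 0..S_T-1."""
    return [cmath.exp(1j * math.pi * m * math.sin(theta)) for m in range(S_T)]

def make_pair(S_T=16, L=3, Q=32, spread=math.radians(5), rng=random):
    """Generate one (h_n, q) training pair as in step 3 (a sketch).

    theta_n is drawn from [-pi/2, pi/2]; the L path angles fluctuate
    within +/- 5 degrees around theta_n; q is the 1-based index of the
    subinterval of width pi/Q that contains theta_n.
    """
    theta_n = rng.uniform(-math.pi / 2, math.pi / 2)
    paths = [theta_n + rng.uniform(-spread, spread) for _ in range(L)]
    scale = math.sqrt(S_T / L)
    h = [0j] * S_T
    for th in paths:
        a = steering_vector(th, S_T)
        # unit-variance complex Gaussian path gain (an assumption)
        alpha = complex(rng.gauss(0, 1), rng.gauss(0, 1)) / math.sqrt(2)
        for m in range(S_T):
            h[m] += scale * alpha * a[m]
    q = min(Q, int((theta_n + math.pi / 2) // (math.pi / Q)) + 1)  # 1-based label
    return h, q
```

Each call yields one channel/label pair; repeated calls build the local training set used in step 4.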
Step 4: perform training with the selected users and the generated MIMO data.
The training process is divided into edge aggregation and cloud aggregation.
1) Edge aggregation: edge aggregation is divided into local model computation, local model transmission, and edge model aggregation, as follows.
Local model computation: after generating the MIMO data locally, user n uses the local data to train a linear-regression model by gradient descent until the local model accuracy $\theta_n \in [0, 1]$ is reached, i.e. the number of local iterations is $L(\theta_n) = \mu \log(1/\theta_n)$; the model parameters $w_t$ are then produced and the model's loss function $f_i(w_t)$ is computed. This process incurs a computation delay $t_n^{cmp}$ and computation energy $e_n^{cmp}$, which can be derived from the amount of data the user trains on and the frequency at which the data are processed.
Local model transmission: after the local computation ends, user n uploads the model parameters $w_t$ to the edge server k it communicates with; the transmission uses an OFDMA (orthogonal frequency-division multiple access) protocol and incurs a transmission delay $t_{k,n}^{com}$ and transmission energy $e_{k,n}^{com}$, which can be calculated from the Shannon formula using the transmission rate between user n and edge server k and the bandwidth allocated to user n by server k.
Edge model aggregation: the edge server aggregates the model parameters uploaded by each user and averages them to update the model parameters (i.e. the federated averaging algorithm). After aggregation and averaging, the edge server broadcasts the updated model parameters to every user (the delay and energy of the broadcast are neglected); upon receiving the parameters, each user starts a new round of training with them. This loop repeats until the edge server reaches the model accuracy value ε, and the number of global aggregations is expressed as
$$I(\varepsilon, \theta_n) = \frac{\delta \log(1/\varepsilon)}{1 - \theta_n},$$

where δ is a learning-task parameter. The total delay/energy consumption generated after $I(\varepsilon, \theta_n)$ global aggregations is the sum of the computation delay/energy and the transmission delay/energy.
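The edge-aggregation loop (local gradient-descent regression, upload, federated averaging, re-broadcast) can be sketched as below. For simplicity this sketch replaces the local stopping rule $L(\theta_n) = \mu \log(1/\theta_n)$ with a fixed step budget and uses a toy one-feature linear model; all names are assumptions.

```python
def local_train(w, data, lr=0.05, steps=20):
    """Gradient-descent linear regression on local (x, y) pairs.
    Model: y_hat = w[0] + w[1] * x; returns updated parameters."""
    w = list(w)
    n = len(data)
    for _ in range(steps):
        g0 = g1 = 0.0
        for x, y in data:
            err = w[0] + w[1] * x - y
            g0 += err
            g1 += err * x
        w[0] -= lr * g0 / n
        w[1] -= lr * g1 / n
    return w

def fed_avg(weights):
    """Edge-server step: element-wise average of the uploaded parameters."""
    return [sum(ws) / len(weights) for ws in zip(*weights)]

def edge_training(datasets, rounds=50):
    """Broadcast w, train locally on every client, average, repeat."""
    w = [0.0, 0.0]
    for _ in range(rounds):
        w = fed_avg([local_train(w, d) for d in datasets])
    return w
```

With identical clients the averaging is the identity and the loop reduces to plain gradient descent, which is a useful sanity check.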
2) Cloud aggregation: cloud aggregation comprises two steps, edge model uploading and cloud model aggregation.
Edge model uploading: after edge server k reaches the edge accuracy, it uploads the edge model parameters to the cloud server; the delay $t_k^{edge}$ and energy consumption $e_k^{edge}$ generated in this process can be calculated from the transmission rate and the transmission power.
Cloud model aggregation: the cloud aggregation process is similar to edge aggregation, i.e. the cloud server receives the model parameters uploaded by the edge servers and averages them.
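Cloud aggregation applies the same averaging rule one tier up. The sketch below (illustrative names; `train_fn` stands in for a client's local training) averages each edge server's client updates, then averages the resulting edge models at the cloud.

```python
def average(vectors):
    """Element-wise mean of equal-length parameter vectors."""
    return [sum(vs) / len(vectors) for vs in zip(*vectors)]

def hierarchical_round(edge_groups, w, train_fn):
    """One cloud round: each edge server averages its own clients'
    updates (edge aggregation), then the cloud averages the edge models
    (cloud aggregation) and the result is re-broadcast."""
    edge_models = [average([train_fn(w, client) for client in clients])
                   for clients in edge_groups]
    return average(edge_models)
```

Passing `train_fn = lambda w, c: c` makes each client return a constant vector, which exposes the two-level averaging directly.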
The key points of the invention are as follows.
Step 1: allocate resources for the users.
Secondary selection: in order to obtain a still more stable and efficient learning federation, a second user selection is carried out before resources are allocated to the users. The selection is based on each user's previous training record: before the selection, the user must run local iterations and reach the corresponding accuracy; the gradient value is then computed from the locally calculated loss function and differenced against the gradient aggregated at the edge server, and the users with small gradient differences are selected to form the final learning federation $\mathcal{N}_2$, where $\mathcal{N}_1$ is the set of users after the first selection and $\mathcal{N}_2 \subseteq \mathcal{N}_1$ is the set of users after both selections.
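The secondary-selection rule can be sketched directly (names and the tolerance threshold are assumptions): keep the users whose previous-round local gradient is close, in Euclidean distance, to the edge-aggregated gradient.

```python
import math

def second_selection(local_grads, edge_grad, tol):
    """Secondary selection: keep users whose local gradient lies within
    Euclidean distance `tol` of the edge-aggregated gradient.

    `local_grads` maps user id -> gradient vector from the previous
    round; the survivors form the final learning federation N_2.
    """
    keep = []
    for uid, g in sorted(local_grads.items()):
        if math.dist(g, edge_grad) < tol:
            keep.append(uid)
    return keep
```

A small gradient difference signals a user whose local data and compute behaviour track the federation, which is what makes the resulting alliance stable.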
Resource allocation: after the second selection the federation is successfully established, and resources are allocated to the users who finally participate in the training. In this process, contract theory is introduced to incentivize and reward the participating users. The users are first classified and sorted according to their local accuracy $\theta_n$ (the smaller the better). At the same time, an evaluation function is set for each user, from which the user's computational utility function is obtained:

$$u'_n = g(R_n) - c_n,$$

where $R_n$ is the reward for a user participating in the federation learning, $g(R_n)$ is a profit function that increases monotonically with $R_n$, and $c_n$ is the user's training cost. The edge server provides the contract items $(R_n, \theta_n)$, and each user selects the appropriate item according to its own actual situation. The constraint conditions of contract theory are IR (individual rationality) and IC (incentive compatibility):
IR: the user's income minus its cost is at worst zero, i.e. $u'_n \ge 0$; if $u'_n < 0$, the user cannot participate in the training and is discarded.
IC: a user must select the contract item matching its own type, i.e. for every type n and every other item m, $u'_n(R_n, \theta_n) \ge u'_n(R_m, \theta_m)$.
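The IR and IC conditions can be checked mechanically for a candidate contract menu. In this sketch (names and the cost model are assumptions) an item is a pair (R_n, θ_n) and a type is described by its cost of reaching a given local accuracy; the classic observation that the efficient type needs an extra reward ("information rent") to stay incentive-compatible shows up in the second test case.

```python
def feasible(menu, cost_of, profit):
    """Check the contract constraints IR and IC for a menu of items.

    `menu[n] = (R_n, theta_n)`: reward and required local accuracy
    offered to type n.  `cost_of[n](theta)` is type n's cost of reaching
    accuracy theta.  IR: every type's utility from its own item is
    non-negative.  IC: no type gains by taking another type's item.
    """
    for n, (R_n, th_n) in menu.items():
        u_own = profit(R_n) - cost_of[n](th_n)
        if u_own < 0:
            return False  # IR violated
        for R_m, th_m in menu.values():
            if profit(R_m) - cost_of[n](th_m) > u_own + 1e-12:
                return False  # IC violated
    return True
```

Only menus passing both checks are admissible points of the optimization problem below.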
From the users' utility functions $u'_n$ and the utility function $U_k$ of each edge server (the difference between its income function and its cost function; the utility function of the cloud server is neglected), the utility of the whole system can be obtained:

$$U = \sum_{k \in \mathcal{K}} U_k,$$

where $\mathcal{K}$ is the set of edge servers and each $U_k$ depends on the users $n \in \mathcal{N}_2$, the set of users after the two selections, served by server k.
The purpose of resource allocation is to minimize the system communication overhead and maximize the system utility, so the following optimization problem can be posed:

max U,
subject to
C1: $\sum_{n \in \mathcal{N}_2} \beta_{k,n} \le B_k, \forall k$,
C2: $f_n^{min} \le f_n \le f_n^{max}, \forall n$,
C3: (formula given in the original figure),
C4: $u'_n \ge 0, \forall n$ (IR),
C5: $u'_n(R_n, \theta_n) \ge u'_n(R_m, \theta_m), \forall n, m$ (IC),
C6: $n \in \mathcal{N}_2$,

where constraints C1 and C2 respectively represent the uplink communication resource constraint and the computation capability constraint, $\beta_{k,n}$ is the bandwidth allocated to user n by edge server k, $B_k$ is the total uplink bandwidth of edge server k, $f_n$ is the computation frequency of user n, and $f_n^{min}$ and $f_n^{max}$ are the minimum and maximum computation frequencies. C4 and C5 ensure the stability and efficiency of the devices participating in model training, and C6 ensures that the users participating in the training are within the service range of the edge server.
However, the optimization problem is difficult to solve because of constraints C4 and C5, so these two constraints are simplified below.
1. For any feasible contract items $(R_n, \theta_n)$ and $(R_m, \theta_m)$, $R_n > R_m$ if and only if $\theta_n < \theta_m$.
2. For any feasible contract items, $R_n = R_m$ if and only if $\theta_n = \theta_m$.
3. For any feasible contract, the computational utility function of each type of user must satisfy
$0 \le u'_1 < \cdots < u'_m < \cdots < u'_n$.
4. Based on 1 and 2, constraint C5 (IC) can be reduced to the Local Downward IC (LDIC), i.e. $u'_n(R_n, \theta_n) \ge u'_n(R_{n-1}, \theta_{n-1})$, and the Local Upward IC (LUIC), i.e. $u'_n(R_n, \theta_n) \ge u'_n(R_{n+1}, \theta_{n+1})$.
According to 1, 2, 3, and 4, constraints C4 and C5 can be reduced to
$u'_1 \ge 0$ and $u'_n(R_n, \theta_n) \ge u'_n(R_{n-1}, \theta_{n-1}), \forall n$.
Then the constraints of the optimization problem reduce to:
C1: $\sum_{n \in \mathcal{N}_2} \beta_{k,n} \le B_k, \forall k$,
C2: $f_n^{min} \le f_n \le f_n^{max}, \forall n$.
Under a further technical assumption (given by a formula in the original figures) and with $\varepsilon \neq 1$, the optimal bandwidth allocation $\beta_{k,n}$ can be solved from constraint C1, and the optimal computation frequency $f_n$ can be solved from constraint C2; combining the two gives the final result. This is a nonlinear programming problem with constraints, and the values of $f_n$ and $\beta_{k,n}$ that maximize the utility function, i.e. the optimal values, can be solved, thereby completing the resource allocation.
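The reduced problem, maximize U over the computation frequencies $f_n \in [f_{min}, f_{max}]$ and bandwidth shares $\beta$ with $\sum_n \beta_n \le 1$, is a small constrained nonlinear program. A brute-force grid search already exposes its structure; this is a sketch under assumed names, with the patent's contract-based system utility U replaced in the test by an illustrative toy objective.

```python
import itertools

def solve_allocation(utility, f_min, f_max, n_users, steps=10):
    """Grid-search max U(f, beta) s.t. f_min <= f_n <= f_max and
    sum_n beta_n <= 1 (bandwidth treated as normalized shares).

    `utility(fs, bs)` maps the frequency and bandwidth tuples to a
    scalar; returns the best (fs, bs) pair and its utility value.
    """
    f_vals = [f_min + (f_max - f_min) * i / (steps - 1) for i in range(steps)]
    b_vals = [i / steps for i in range(steps + 1)]
    best, best_u = None, float("-inf")
    for fs in itertools.product(f_vals, repeat=n_users):
        for bs in itertools.product(b_vals, repeat=n_users):
            if sum(bs) > 1.0 + 1e-9:
                continue  # bandwidth budget violated
            u = utility(fs, bs)
            if u > best_u:
                best, best_u = (fs, bs), u
    return best, best_u
```

A gradient-based nonlinear-programming solver would replace the grid in practice; the exhaustive search is only viable for a handful of users but is easy to verify.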
Traditional federated learning generally has a two-tier structure, a user layer and a server layer, but the distance between the two is usually long, so transmission delay and transmission energy consumption are severe and the communication overhead increases greatly.
In the invention, to strengthen the stability of the training federation, a user-selection stage is added: users with strong computing and communication capabilities are selected to join the training, which greatly reduces the computation delay and computation energy consumption and further lowers the communication overhead.
Drawings
Fig. 1 shows the loss function of the linear-regression model, computed after the local accuracy is reached, when a user trains on the local MIMO channel data pairs with the gradient-descent algorithm. It can be clearly seen that when the number of iterations reaches 400-500, the loss function converges to a fixed minimum, indicating that the user's model has been trained to the optimum.
Figs. 2 and 3 show the resource-allocation result after the second user selection. As seen from Fig. 3, when the step value is 2 the inverse value F of the system utility function reaches its minimum (i.e. the utility function reaches its maximum), and the frequency and bandwidth resources then take their optimal values, as can be clearly seen from Fig. 2.
Figs. 4 and 5 show the delay and energy consumption of traditional two-tier federated learning (U-M) and of three-tier federated learning with an edge server introduced (U-S-M). With the same user-model accuracy and the number of users ranging from 15 to 20, it can be seen that the delay and energy consumption of the three-tier structure are clearly lower than those of the two-tier structure.
Fig. 6 shows the system delay and energy consumption with and without user selection and resource allocation. Under the same conditions, the delay and energy consumption after selection are much smaller than before selection, those after optimization are smaller than before optimization, and the system with both selection and optimization is in the best state, i.e. has the minimum delay and energy consumption.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings and examples.
The edge calculation and resource optimization method based on the federal learning comprises the following steps:
step 1: establishing a system model and deploying arrays according to users, edge servers and base stations
The invention relates to a multi-user millimeter wave MIMO three-layer structure system structure, wherein the first layer is a user layer and consists of N single-antenna user devices. The second layer is the ES layer (edge server layer) containing K layers with STAn edge server for the antenna. The third layer is a CS layer (cloud server layer), and has a cloud server S with a single antenna. Wherein, the user layer and the ES layer communicate through a wireless channel, and the ES layer and the CS layer are connected through a backhaul link, having a sufficient capacity.
Step 2: a set of users communicating with the edge server is first selected according to distance and represented as
Figure BDA0003221236870000081
Wherein N is the number of users,
Figure BDA0003221236870000082
the number of users selected after the distance selection is performed is shown.
That is, assuming that the user equipment obeys a poisson distribution, the communication range of each edge server is a radius dkAnd in this three-level structure, no overlap between the circular communication areas of the edge servers is assumed. And then selecting the users with the shorter distance according to the distance from each user to the edge server, wherein the communication quality between the users and the server in the communication range is better, and stable communication is a necessary condition for ensuring the successful federal learning.
And step 3: and acquiring training data.
The channel model is an LOS (line of sight transmission) receiving route of an L path, and the millimeter wave channel matrix of the nth user is expressed as follows:
Figure BDA0003221236870000083
wherein the content of the first and second substances,
Figure BDA0003221236870000084
αn,lis the complex channel gain of the l-th path,
Figure BDA0003221236870000085
for the directional angle of the downlink transmission path,
Figure BDA0003221236870000086
is STX 1, whose mth element is represented as:
Figure BDA0003221236870000087
λ is the wavelength, σ is the distance between the two antennas, and let σ equal λ/2.
Firstly, firstly
Figure BDA0003221236870000088
Within the range of which the transmission direction angle of the subscriber is generated
Figure BDA0003221236870000089
Randomly generating L-path channel transmission direction angle in a fluctuation range of (5 degrees), and then solving for h according to a channel matrix formulanAngle interval of
Figure BDA00032212368700000810
Figure BDA0003221236870000091
Divided equally into non-overlapping Q parts, each subinterval
Figure BDA0003221236870000092
And marks each section range starting from 1, a section where the user transmission direction angle falls is marked as q, and the q value is a section mark value, thereby generating (h)nAnd q) data pairs. And the final radio frequency precoding fRF,nCan be expressed as
Figure BDA0003221236870000093
Wherein
Figure BDA0003221236870000094
Is the midpoint of the interval.
And 4, step 4: training is performed according to the selected users and the generated MIMO data.
The training process can be divided into edge aggregation and cloud aggregation
2) Edge polymerization: edge clustering is divided into local model computation, local model transmission and edge model aggregation. The specific contents are as follows:
local model calculation: after locally generating MIMO data, user n uses the local data to perform linear regression model training based on gradient descent until local model accuracy theta is reachedn∈[0,1]I.e. iterationThe number of times is L (theta)n)=μlog(1/θn) Then, model parameters w are generatedtAnd calculating the loss function f of the modeli(wt). In the process, the calculation time delay is generated
Figure BDA0003221236870000095
And calculating energy
Figure BDA0003221236870000096
The computation time delay and the computation energy consumed can be derived from the amount of data trained by the user and the frequency with which the data is processed.
Local model transmission: after the local computation is over, the user n will model the parameters wtUploading to an edge server k in communication with the edge server k, and adopting an OFDMA (orthogonal frequency division multiple access) protocol in the transmission process, wherein the transmission time delay is generated
Figure BDA0003221236870000097
And transferring energy
Figure BDA0003221236870000098
The transmission delay and the transmission energy can be calculated by a shannon formula according to the transmission rate between the user n and the edge server k and the bandwidth allocated to the user n by the server k.
Edge polymerization: in the process, the edge server aggregates the model parameters uploaded by each user and then performs an averaging process to update the model parameters (i.e. a federal averaging algorithm). After the edge server carries out model parameter aggregation and an average algorithm, updated model parameters are sent to each user in a broadcasting mode (time delay and energy consumption in the broadcasting process are ignored), after the user receives the model parameters sent by the server, the user uses the parameters to carry out a new round of data training, and the operation is repeated in a circulating mode until the edge server reaches a model precision value epsilon, and the times of global aggregation are expressed as
Figure BDA0003221236870000099
where δ is a learning-task parameter. After I(ε, θ_n) rounds of global aggregation, the total delay/energy consumption is the sum of the computation delay/energy and the transmission delay/energy.
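The edge aggregation loop above is the federated averaging algorithm; a minimal sketch follows, where weighting each user's parameters by its local data size is the standard FedAvg choice and an assumption here:

```python
def fed_avg(user_params, user_sizes):
    """Weighted average of per-user parameter vectors (federated averaging).

    user_params: list of parameter vectors (one per user)
    user_sizes:  local data-set size of each user (assumed weighting)
    """
    total = sum(user_sizes)
    dim = len(user_params[0])
    return [sum(w[i] * n for w, n in zip(user_params, user_sizes)) / total
            for i in range(dim)]

# Two users: 100 and 300 local samples respectively
params = [[1.0, 2.0], [3.0, 4.0]]
sizes = [100, 300]
w_global = fed_avg(params, sizes)   # -> [2.5, 3.5]
```

The server would broadcast `w_global` back to the users, who run another round of local training with it.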
2) Cloud aggregation: cloud aggregation comprises two steps, edge model uploading and cloud model aggregation.
Edge model uploading: once edge server k reaches the edge accuracy, it uploads the edge model parameters to the cloud server; this process generates a delay
Figure BDA0003221236870000101
And energy consumption
Figure BDA0003221236870000102
which can be calculated from the transmission rate and the transmission power.
Cloud model aggregation: the cloud server's aggregation process is similar to the edge aggregation, i.e. the cloud server receives the model parameters uploaded by the edge servers and then averages them.
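A minimal sketch of the cloud-level step, assuming the cloud averages the K edge models with uniform weights (a data-size-weighted average would be an equally plausible reading of the text):

```python
def cloud_aggregate(edge_models):
    """Element-wise uniform average of the K edge models (assumed weighting)."""
    K = len(edge_models)
    dim = len(edge_models[0])
    return [sum(m[i] for m in edge_models) / K for i in range(dim)]

# Two edge servers upload their aggregated models
w_cloud = cloud_aggregate([[1.0, 1.0], [3.0, 5.0]])   # -> [2.0, 3.0]
```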
The key points of the invention are as follows:
step 1: allocating resources for users
Secondary selection: to obtain a stable and efficient learning federation, a second user selection is performed before resources are allocated to the users. This selection is based on each user's previous training record: before the selection, the user must perform local iterations and reach the corresponding accuracy; the gradient value is then computed from the locally calculated loss function and compared with the gradient aggregated at the edge server, and the users with small gradient differences are selected to construct the final learning federation
Figure BDA0003221236870000103
Wherein
Figure BDA0003221236870000104
Is the set of users after the first selection,
Figure BDA0003221236870000105
is the set of users after two selections.
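The secondary selection above can be sketched as follows; the Euclidean distance metric and the threshold are illustrative assumptions, since the text only says "users with small gradient differences" are kept:

```python
def secondary_select(local_grads, global_grad, threshold):
    """Return indices of users whose local gradient is close to the edge-aggregated one.

    local_grads: list of per-user gradient vectors
    global_grad: gradient aggregated at the edge server
    threshold:   maximum allowed Euclidean distance (assumed criterion)
    """
    def dist(g):
        return sum((a - b) ** 2 for a, b in zip(g, global_grad)) ** 0.5
    return [n for n, g in enumerate(local_grads) if dist(g) <= threshold]

grads = [[1.0, 1.0], [1.1, 0.9], [5.0, -3.0]]   # user 2 is a clear outlier
agg = [1.0, 1.0]
federation = secondary_select(grads, agg, threshold=0.5)   # -> [0, 1]
```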
Resource allocation: after the second selection the federation is established, and resources are allocated to the users who finally participate in the training. Contract theory is introduced in this process to incentivize and reward the participating users. The users are first classified according to their local accuracy (the smaller the better), i.e.
Figure BDA0003221236870000106
and then sorted; at the same time, a user evaluation function is set
Figure BDA0003221236870000107
Wherein
Figure BDA0003221236870000108
is the reward for a user participating in the federated learning,
Figure BDA0003221236870000109
is the profit function and is a monotonically increasing function of
Figure BDA00032212368700001010
Thus, the computational utility function of the user can be obtained, i.e.
Figure BDA00032212368700001011
The edge server provides the contract terms
Figure BDA00032212368700001012
Each user selects the appropriate term according to its actual situation. The constraint conditions of contract theory are IR (individual rationality) and IC (incentive compatibility) respectively.
IR: the user's profit net of its cost loss must not be negative, i.e.
Figure BDA00032212368700001013
If u'_n < 0, the user cannot participate in the training and is dropped.
IC: a user must select the contract term that matches its own type, i.e.
Figure BDA00032212368700001014
From the above, the user's utility function u_n and the edge server's utility function U_k (the difference between its income function and its cost function) can be obtained; the utility function of the cloud server is ignored. In summary, the utility function of the whole system can be obtained as
Figure BDA0003221236870000111
Where K is a set of edge servers,
Figure BDA0003221236870000112
are the set of users after two selections.
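A minimal sketch of checking the IR and IC constraints for a set of contract terms; the logarithmic profit function, the (reward, effort) form of a term, and the per-type effort cost κ_m are illustrative assumptions, chosen to be monotonically increasing as required above:

```python
import math

def type_utility(kappa_m, term):
    """Utility of a type with effort cost kappa_m choosing term = (reward, effort).

    log(1 + reward) is an assumed, monotonically increasing profit function.
    """
    reward, effort = term
    return math.log(1.0 + reward) - kappa_m * effort

def ir_ic_satisfied(terms, kappas):
    """IR: every type gets non-negative utility from its own term.
    IC: no type prefers another type's term to its own."""
    for m, kappa in enumerate(kappas):
        u_own = type_utility(kappa, terms[m])
        if u_own < 0:
            return False                      # IR violated: user is dropped
        if any(type_utility(kappa, t) > u_own for t in terms):
            return False                      # IC violated
    return True

terms = [(1.0, 0.5), (3.0, 1.5)]   # (reward, effort) per user type, assumed values
kappas = [1.0, 0.5]                # effort cost per type (lower = more capable)
ok = ir_ic_satisfied(terms, kappas)
```

Here the more capable type (κ = 0.5) takes the high-reward/high-effort term, and neither type gains by mimicking the other.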
The purpose of resource allocation is to minimize the system communication overhead and maximize the system's utility function, so an optimization problem can be formulated, namely
max U,
subject to,
C1:
Figure BDA0003221236870000113
C2:
Figure BDA0003221236870000114
C3:
Figure BDA0003221236870000115
C4:
Figure BDA0003221236870000116
C5:
Figure BDA0003221236870000117
C6:
Figure BDA0003221236870000118
where constraints C1 and C2 represent the uplink communication-resource constraint and the computation-capability constraint respectively; β_{k,n} is the bandwidth allocated to user n by edge server k, f_n is the computation frequency of user n, and
Figure BDA0003221236870000119
Figure BDA00032212368700001110
and
Figure BDA00032212368700001111
are the minimum and maximum computation frequencies. C4 and C5 ensure the stability and efficiency of the devices participating in the model training, and C6 ensures that the users participating in the training lie within the service range of the edge server.
However, the optimization problem is difficult to solve because of constraints C4 and C5, so these two constraints are simplified below.
1. For any feasible contract term
Figure BDA00032212368700001112
If and only if
Figure BDA00032212368700001113
then
Figure BDA00032212368700001114
2. For any feasible contract term
Figure BDA00032212368700001115
If and only if
Figure BDA00032212368700001116
then
Figure BDA00032212368700001117
3. For any feasible contract term
Figure BDA00032212368700001118
the computational utility function of each type of user must satisfy
0 ≤ u'_1 < … < u'_m < … < u'_n
4. Based on 1 and 2, constraint C5 (IC) can be reduced to the Local Downward IC (LDIC), i.e.
Figure BDA00032212368700001119
and the Local Upward IC (LUIC), i.e.
Figure BDA00032212368700001120
According to 1, 2, 3 and 4, constraints C4 and C5 can be reduced to:
Figure BDA0003221236870000121
Figure BDA0003221236870000122
The constraints of the optimization problem are then reduced to:
C1:
Figure BDA0003221236870000123
C2:
Figure BDA0003221236870000124
suppose that
Figure BDA0003221236870000125
and ε ≠ 1; then from constraint C1 one can solve for
Figure BDA0003221236870000126
and from constraint C2 one can solve for
Figure BDA0003221236870000127
In summary, we obtain
Figure BDA0003221236870000128
This is a constrained nonlinear programming problem; the values of f_n and β_{k,n} that maximize the utility function can be solved for, i.e. the optimal values, thereby completing the resource allocation.
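A minimal sketch of solving such a reduced problem; the toy utility function and the coarse grid search stand in for a proper constrained-NLP solver, and the frequency bounds and bandwidth budget are illustrative assumptions:

```python
import itertools
import math

def system_utility(f, beta):
    """Toy concave utility: reward grows with allocated bandwidth and CPU
    frequency, minus an energy-like f^3 penalty (assumed form)."""
    return sum(math.log(1.0 + b) + math.log(1.0 + fi) - 1e-28 * fi ** 3
               for fi, b in zip(f, beta))

def grid_search(n_users, f_min=1e8, f_max=1e9, steps=5):
    """Exhaustively search a coarse grid of (f_n, beta_{k,n}) assignments."""
    f_grid = [f_min + i * (f_max - f_min) / (steps - 1) for i in range(steps)]
    b_grid = [i / steps for i in range(1, steps + 1)]
    best, best_u = None, -math.inf
    for f in itertools.product(f_grid, repeat=n_users):
        for beta in itertools.product(b_grid, repeat=n_users):
            if sum(beta) > 1.0 + 1e-9:        # C1: total bandwidth budget
                continue
            u = system_utility(f, beta)
            if u > best_u:
                best, best_u = (f, beta), u
    return best, best_u

(opt_f, opt_beta), u_star = grid_search(n_users=2)
```

In practice a solver such as SciPy's `minimize` with bounds and constraints would replace the grid search; the sketch only illustrates that the search respects C1 while maximizing U.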

Claims (5)

1. An edge computing and resource optimization method based on federated learning, characterized in that the method comprises the following steps:
step 1: establishing a system model and deploying arrays according to users, edge servers and base stations
the multi-user millimeter-wave MIMO system has a three-layer structure: the first layer is the user layer, consisting of N single-antenna user devices; the second layer is the ES layer, i.e. the edge server layer, containing K edge servers each equipped with S_T antennas; the third layer is the CS layer, i.e. the cloud server layer, with a single-antenna cloud server S; the user layer and the ES layer communicate through wireless channels, and the ES layer and the CS layer are connected through a backhaul link with sufficient capacity;
step 2: a set of users communicating with the edge server is first selected according to distance and represented as
Figure FDA0003221236860000011
Figure FDA0003221236860000012
Wherein N is the number of users,
Figure FDA0003221236860000013
denotes the number of users selected after the distance-based selection; that is, assuming the user devices obey a Poisson distribution, the communication range of each edge server is a circle of radius d_k, and in this three-layer structure the circular communication areas of the edge servers are assumed not to overlap; the users closest to each edge server are then selected according to their distances;
and step 3: acquiring training data;
the channel model is an LOS receiving route of an L path, and the millimeter wave channel matrix of the nth user is expressed as:
Figure FDA0003221236860000014
wherein
Figure FDA0003221236860000015
is the complex channel gain of the l-th path,
Figure FDA0003221236860000016
is the direction angle of the downlink transmission path,
Figure FDA0003221236860000017
is an S_T × 1 steering vector, whose m-th element is represented as:
Figure FDA0003221236860000018
λ is the wavelength, σ is the spacing between two adjacent antennas, and σ is set to λ/2;
first, within the range
Figure FDA0003221236860000019
the user's transmission direction angle is generated, with a fluctuation range of
Figure FDA00032212368600000110
L path channel transmission direction angles are randomly generated within this fluctuation range, and h_n is then solved from the channel matrix formula; the angle interval
Figure FDA00032212368600000111
is divided equally into Q non-overlapping parts, each subinterval being
Figure FDA00032212368600000112
and each subinterval is labeled starting from 1; the subinterval into which the user's transmission direction angle falls is labeled q, and this q value is the interval label, thereby generating the (h_n, q) data pairs; the final radio-frequency precoding f_RF,n is expressed as
Figure FDA00032212368600000113
Wherein
Figure FDA00032212368600000114
is the midpoint of the subinterval;
step 4: train according to the selected users and the generated MIMO data; the training process is divided into edge aggregation and cloud aggregation.
2. The federated learning-based edge computing and resource optimization method of claim 1, wherein the edge aggregation is divided into local model computation, local model transmission and edge model aggregation, specifically as follows:
local model computation: after locally generating the MIMO data, user n trains a gradient-descent-based linear regression model on its local data to reach the local model accuracy θ_n ∈ [0, 1], i.e. the number of iterations is L(θ_n) = μ log(1/θ_n); the model parameters w_t are then generated and the model's loss function f_i(w_t) is computed; this process incurs a computation delay
Figure FDA0003221236860000021
and computation energy
Figure FDA0003221236860000022
the computation delay and the consumed computation energy can be obtained from the amount of data the user trains on and the frequency at which the data is processed;
local model transmission: after the local computation ends, user n uploads the model parameters w_t to the edge server k with which it communicates; the transmission adopts the OFDMA (orthogonal frequency division multiple access) protocol and generates a transmission delay
Figure FDA0003221236860000023
and transmission energy
Figure FDA0003221236860000024
the transmission delay and the transmission energy can be calculated with the Shannon formula from the transmission rate between user n and edge server k and the bandwidth allocated to user n by server k;
edge model aggregation: in this step the edge server aggregates the model parameters uploaded by each user and then averages them to update the model parameters (i.e. the federated averaging algorithm); after the aggregation and averaging, the edge server broadcasts the updated model parameters to every user (the delay and energy consumption of the broadcast are ignored); upon receiving the parameters sent by the server, each user starts a new round of data training with them, and this cycle repeats until the edge server reaches the model accuracy value ε; the number of global aggregations is expressed as
Figure FDA0003221236860000025
where δ is a learning-task parameter; after I(ε, θ_n) rounds of global aggregation, the total delay/energy consumption is the sum of the computation delay/energy and the transmission delay/energy.
3. The federated learning-based edge computing and resource optimization method of claim 1, wherein the cloud aggregation comprises two steps: edge model uploading and cloud model aggregation;
uploading the edge model: after the edge server k reaches the edge precision, the edge model parameters are uploaded to the cloud server, and time delay is generated in the process
Figure FDA0003221236860000026
And energy consumption
Figure FDA0003221236860000027
which can be calculated from the transmission rate and the transmission power;
cloud model aggregation: the cloud server's aggregation process is similar to the edge aggregation, i.e. the cloud server receives the model parameters uploaded by the edge servers and then averages them.
4. The federated learning-based edge computing and resource optimization method of claim 1, wherein before the user selection, the user must perform local iterations to reach the corresponding accuracy; the gradient value is then computed from the locally calculated loss function and compared with the gradient aggregated at the edge server, and the users with small gradient differences are selected to construct the final learning federation
Figure FDA0003221236860000031
Wherein
Figure FDA0003221236860000032
Is the set of users after the first selection,
Figure FDA0003221236860000033
is the set of users after two selections.
5. The federated learning-based edge computing and resource optimization method of claim 1, wherein after the second selection the federation is established and resources are allocated to the users who finally participate in the training; contract theory is introduced in this process to incentivize and reward the participating users; the users are first classified according to their local accuracy, i.e.
Figure FDA0003221236860000034
and then sorted; at the same time, a user evaluation function is set
Figure FDA0003221236860000035
Wherein
Figure FDA0003221236860000036
is the reward for a user participating in the federated learning,
Figure FDA0003221236860000037
is the profit function and is a monotonically increasing function of
Figure FDA0003221236860000038
thus the computational utility function of the user is obtained, i.e.
Figure FDA0003221236860000039
the edge server provides the contract terms
Figure FDA00032212368600000310
each user selects the appropriate term according to its actual situation; the constraint conditions of contract theory are IR and IC respectively;
IR: the user's profit net of its cost loss must not be negative, i.e.
Figure FDA00032212368600000311
if u'_n < 0, the user cannot participate in the training and is dropped;
IC: a user must select the contract term that matches its own type, i.e.
Figure FDA00032212368600000312
from the above, the user's utility function u_n and the edge server's utility function U_k (the difference between its income function and its cost function) are obtained; the utility function of the cloud server is ignored; in summary, the utility function of the whole system is obtained as
Figure FDA00032212368600000313
Where K is a set of edge servers,
Figure FDA00032212368600000314
is a user set after two selections;
the purpose of resource allocation is to minimize the system communication overhead and maximize the system's utility function, so an optimization problem is formulated, namely
max U,
subject to,
C1
Figure FDA0003221236860000041
C2
Figure FDA0003221236860000042
C3
Figure FDA0003221236860000043
C4
Figure FDA0003221236860000044
C5
Figure FDA0003221236860000045
C6
Figure FDA0003221236860000046
wherein constraints C1 and C2 represent the uplink communication-resource constraint and the computation-capability constraint respectively; β_{k,n} is the bandwidth allocated to user n by edge server k, f_n is the computation frequency of user n, and
Figure FDA0003221236860000047
Figure FDA0003221236860000048
and
Figure FDA0003221236860000049
are the minimum and maximum computation frequencies; C4 and C5 ensure the stability and efficiency of the devices participating in the model training, and C6 ensures that the users participating in the training lie within the service range of the edge server.
CN202110958468.1A 2021-08-20 2021-08-20 Edge calculation and resource optimization method based on federal learning Pending CN113791895A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110958468.1A CN113791895A (en) 2021-08-20 2021-08-20 Edge calculation and resource optimization method based on federal learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110958468.1A CN113791895A (en) 2021-08-20 2021-08-20 Edge calculation and resource optimization method based on federal learning

Publications (1)

Publication Number Publication Date
CN113791895A true CN113791895A (en) 2021-12-14

Family

ID=79182029

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110958468.1A Pending CN113791895A (en) 2021-08-20 2021-08-20 Edge calculation and resource optimization method based on federal learning

Country Status (1)

Country Link
CN (1) CN113791895A (en)



Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107425895A (en) * 2017-06-21 2017-12-01 西安电子科技大学 A kind of 3D MIMO statistical channel modeling methods based on actual measurement
US20210073639A1 (en) * 2018-12-04 2021-03-11 Google Llc Federated Learning with Adaptive Optimization
CN111860581A (en) * 2020-06-11 2020-10-30 南京邮电大学 Federal learning training method based on model dispersion
CN112351503A (en) * 2020-11-05 2021-02-09 大连理工大学 Task prediction-based multi-unmanned-aerial-vehicle-assisted edge computing resource allocation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
SIQI LUO et al.: "HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning", IEEE *
CAI Xiaoran; MO Xiaopeng; XU Jie: "D2D computation task offloading for federated learning", Chinese Journal on Internet of Things, no. 04 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114465900A (en) * 2022-03-01 2022-05-10 北京邮电大学 Data sharing delay optimization method and device based on federal edge learning
CN115174412A (en) * 2022-08-22 2022-10-11 深圳市人工智能与机器人研究院 Dynamic bandwidth allocation method for heterogeneous federated learning system and related equipment
CN115174412B (en) * 2022-08-22 2024-04-12 深圳市人工智能与机器人研究院 Dynamic bandwidth allocation method for heterogeneous federal learning system and related equipment
CN115983390B (en) * 2022-12-02 2023-09-26 上海科技大学 Edge intelligent reasoning method and system based on multi-antenna aerial calculation
CN116094993A (en) * 2022-12-22 2023-05-09 电子科技大学 Federal learning security aggregation method suitable for edge computing scene
CN116094993B (en) * 2022-12-22 2024-05-31 电子科技大学 Federal learning security aggregation method suitable for edge computing scene
CN116827393A (en) * 2023-06-30 2023-09-29 南京邮电大学 Honeycomb-free large-scale MIMO uplink receiving method and system based on federal learning
CN116827393B (en) * 2023-06-30 2024-05-28 南京邮电大学 Honeycomb-free large-scale MIMO receiving method and system based on federal learning


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination